Create a summary table that describes sample

Reference no: EM132352299

Final Project Instructions

Evaluate the set of maternal personal, demographic, medical history, and health care related variables as risk factors for low birthweight among infants born at the Baystate Medical Center in Springfield, MA. Your dependent variable is the continuous response variable bwt. You will use multivariable linear regression to identify the potential factors associated with birthweight in this sample of infants, paying particular attention to factors associated with lower birthweight. You will use the A-L dataset if your last name falls in the [A-L] range, and you will use the M-Z dataset if your last name falls in the [M-Z] range.

We expect that you will complete the project by yourself. In other words, you must work on your own on this project and you are not allowed to share your project, results or electronic files or documentation with others. You are expected to erase the dataset provided to you after the final grade has been posted for this project.

All analyses are to be completed using Stata. The dataset you are assigned is a comma-delimited text file (.csv). You will need to import this file into Stata using the import menu: File > Import > Text data (delimited, *.csv, ...). The dataset may be different from the original data.
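The menu path above has a command-line equivalent that can be kept in a do-file so the import is reproducible. The lines below are a minimal sketch only: the file name birthweight_AL.csv is hypothetical and must be replaced with the path to the file you were assigned, and the check on bwt assumes the variable name given in these instructions.

```stata
* Hypothetical file name: substitute the path to the .csv you were assigned.
* varnames(1) treats the first row as variable names; clear drops data in memory.
import delimited "birthweight_AL.csv", varnames(1) clear

* Confirm that the variables arrived with the expected names and storage types.
describe

* Quick look at the response variable named in the instructions.
summarize bwt, detail
```

Running describe right after the import is a cheap way to catch columns that were read as strings instead of numbers.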

Your report will have four sections: Introduction, Methods, Results, and Discussion. Keep the report to a maximum of 8 pages (double spaced).

Introduction: This section is expected to answer, "What is the rationale for the scientific question asked?" The rationale needs to be based on a significant public health issue. Describe the relationships of interest and the purpose of the analysis.

Please conduct a small literature search (1-3 references) to understand the scientific question asked in the project and provide a brief summary in this section. (This section should be brief and amount to a few paragraphs. Limit it to about 1 page double spaced.)

Methods: This section should describe the steps and statistical methods you used to analyze the data and how you applied them to answer the questions asked. For each statistical method, state its rationale or purpose. Describe any methods used to test the assumptions of a test or model, where needed. If you created new variables for your analyses, provide the rationale for creating each variable and describe how you created it. Add a sentence referencing the software you used for all your analyses (in this case, Stata), just as you would be expected to do for any peer-reviewed publication.

Results: The results section needs to mimic a peer-reviewed publication, so it needs to include the following elements:

  • Identify the variables used in the comparison and create a summary table that describes your sample. These descriptive statistics are based on the original data, rather than any new variables you create for your analyses.
  • Use the Table 1 Template in Appendix A to present your descriptive statistics.
  • For each set of variables, compute the appropriate test statistic to assess the simple association between each independent variable and birthweight group (low versus normal birthweight infants). Describe the statistics you used in the Methods section and report the P value in the table.
  • Summarize your findings based on the initial descriptive statistics in a brief paragraph.
  • You will be using multi-sample tests and/or multivariable models to address the primary question(s), so you need to provide analyses that confirm that the model assumptions are met.
  • Present your initial exploratory analyses on the original data that you used to make a preliminary assessment on the presence of potential outliers and distributional characteristics relevant to the statistical model needed to address the primary hypotheses you are asked to evaluate.
  • Describe how you dealt with violations of the assumptions (selecting an appropriate transformation, if applicable).
  • Fit the initial model with all the independent variables.
  • Provide a detailed analysis of model fit based on residuals. At a minimum, you need to include a quantile-normal plot to check the distribution of the residuals, a residual-versus-fitted plot, and partial residual plots (component-plus-residual plots) for each continuous predictor to check linearity.
  • Describe the remedial steps you took to address issues identified in your analysis of the residuals, including how you dealt with non-linearity and with observations that appear to be potential outliers.
  • Fit the final model based on the remedial steps you took to resolve issues identified in your analysis of model fit.
  • Copy and paste the regression results for your final model into your report and label them as Table 2.
  • Specify how the independent variables that appear in your final model were selected.
  • Summarize the key results from your final multivariable regression model in the text, and include a table with all the regression results in the body of the paper.
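The Results steps above (group comparisons for Table 1, the initial model, and the residual diagnostics) can be sketched in a do-file as follows. This is an illustration under stated assumptions, not the required solution: it uses Stata's example dataset lbw (loaded with webuse, which needs an internet connection) as a stand-in for the assigned file, and every variable name other than bwt and low (age, lwt, race, smoke, ptl, ht, ui, ftv, id) comes from that example dataset and may not match your data. The choice of test for each variable and the cut-off used to flag residuals are also placeholders for your own judgment.

```stata
* Stand-in data: Stata's example low birthweight dataset.
* ASSUMPTION: the assigned .csv has similar variables; names may differ.
webuse lbw, clear

* --- Table 1: simple associations with low versus normal birthweight ---
ttest age, by(low)                // continuous predictor, two-sample t test
ranksum lwt, by(low)              // rank-based alternative for a skewed predictor
tabulate smoke low, chi2 column   // categorical predictor, Pearson chi-squared
tabulate ht low, exact column     // Fisher's exact test when cells are sparse

* --- Initial model with all independent variables (low is left out) ---
* Indicator variables for race are built explicitly (race_1 is the reference).
tabulate race, generate(race_)
regress bwt age lwt race_2 race_3 smoke ptl ht ui ftv

* --- Residual diagnostics requested in the instructions ---
predict double resid, residuals
qnorm resid                       // quantile-normal plot of the residuals
rvfplot, yline(0)                 // residual-versus-fitted plot
cprplot age, lowess               // component-plus-residual plot for age
cprplot lwt, lowess               // component-plus-residual plot for lwt

* --- Screening for potential outliers and influential observations ---
predict double rstu, rstudent     // studentized residuals
predict double cooksd, cooksd     // Cook's distance
* The threshold of 3 is an illustrative choice, not a rule from the assignment.
list id bwt rstu cooksd if abs(rstu) > 3 & !missing(rstu)
```

The diagnostic commands (predict, rvfplot, cprplot) use the most recent regress results, so they need to be rerun after each refit of the model.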

Discussion: In this section, you need to describe what the results mean in the context of the scientific question, integrating all the questions asked for the project.

The bulk of your report should be the methods and the results. The discussion, like the introduction, should be kept brief. It is okay to turn in a report shorter than the maximum, as long as everything requested is included and adequately covered.

Direct any questions about the project to your instructor or the TAs assigned to your class.

Baystate Hospital Data Documentation.

The Baystate Hospital Study is a study designed to identify risk factors associated with giving birth to a low birthweight infant. At the time the study was conducted in 1986, low birthweight was defined as any newborn weighing less than 2500 grams. Your brief review of the literature may suggest other criteria based on current approaches to risk stratification for newborns. Since the original study had only 59 low birthweight infants in its sample, you may have difficulty applying newer risk definitions to the current data, but you are free to explore other definitions in logistic regression models and see how the results compare across the different definitions. Your primary analysis, however, should be based on a multivariable linear regression that uses birthweight in grams as the dependent variable.

Please note that the variable "low" is a dichotomization of the response variable bwt and, therefore, you must not use it as an explanatory variable in the regression model. You will, however, use it to fill out Table 1 in Appendix A.
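One way to see why "low" cannot sit on the right-hand side of the model is to rebuild it from bwt and compare the two. The sketch below again uses Stata's example dataset lbw as a stand-in for the assigned file, so all variable names other than bwt and low are assumptions, as are the new variable name low_check and the subset of predictors in the logistic model. The 2500 gram cut-off is the 1986 definition quoted above; changing the local macro is one way to explore other definitions.

```stata
* Stand-in data; ASSUMPTION: the assigned file has the same bwt and low variables.
webuse lbw, clear

* Cut-off in grams. 2500 is the definition given in the data documentation;
* edit this value to explore an alternative definition.
local cutoff 2500

* Rebuild the indicator from bwt (left missing where bwt is missing).
generate byte low_check = (bwt < `cutoff') if !missing(bwt)

* If low is just a dichotomized bwt, all observations fall on the diagonal.
tabulate low low_check, missing

* Secondary analysis only: logistic regression on the dichotomized outcome.
* The predictors listed here are an illustrative subset, not a selected model.
logistic low_check age lwt smoke ht ui
```

Run the block as a whole from a do-file, since the local macro cutoff exists only within a single run.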

