Plot an appropriately labelled graph with age

Assignment Help Applied Statistics
Reference no: EM132367541

Assignment - STATA Questions

Please provide a full record of your software code and software output in an appendix.

The attached dataset is a dataset which contains survey responses from 2500 women aged over 70.

This dataset has been created in order to assess selected risk factors for depression. A summary of the dataset has been provided in Table 1.

Table 1: Depression dataset

Variable Name

Description

Key

studyno

Unique identifier

 

Age

Age in years at the time of survey completion

 

social_support_tertiles

Tertiles of the social support scale

1=In the lowest tertile of social support

2=In the middle tertile of social support

3=In the highest tertile of social support

e.g. those with social_support_tertile=3 are in the third who have the highest level of social support

depression

In the last 3 years have you been told by a doctor that you have Depression

0=No

1=Yes

Q1. Using a chi-squared test and a t-test, assess the association between age and depression, and social support and depression. Present your results in a table which would be suitable for inclusion in a scientific paper. Under the table describe and interpret these results.

[Note. For this question do not fit a statistical model, and look at each exposure variable one by one].

Q2a. Create a 'collapsed' dataset which records the number of depression records in each category of social support. [For this task you can temporally ignore the age variable. I am not assessing the procedure you used to create the data here, as long as the numbers are correct].

Using this grouped version of the data use software to run a logistic regression model which assesses the association between social support [as a categorical variable] and depression. Carefully interpret your results.

Q2b. Use software to run 'the same model' on the individual (non-collapsed) data. Present the software output and highlight that this model gives us the same estimates of association between social support and depression as the model in 2a.

Q3a. Using software fit a logistic regression model to assess whether there is an association between age and depression in this sample [including only age and depression]. Interpret the estimated age coefficient (and confidence interval and p-value).

Q3b. Use a Wald test to test whether the log(OR) associated with a 1-unit increase in age is greater than In(1.1).

Q3c. Using the model from part 3a, plot an appropriately labelled graph with age on the x-axis and the predicted log odds of depression on the y-axis.

Q3d. Detail how the value of the log-likelihood presented in your software output in 3a was calculated.

Q4a. Using software fit a single logistic regression model which assesses the association between the exposures social support [as a categorical variable] and age, and the outcome depression. Interpret the coefficients produced from this model.

Using software run a likelihood ratio test to assess the statistical significance of adding social support (as a categorical variable) to a more basic model which just includes the exposure age.

Q4b. What is the null and alternative hypothesis for this likelihood ratio test?

Q4c. How do you interpret the results of the likelihood ratio test?

Q4d. Using the model output from the relevant separate models (i.e. the log likelihood values) calculate the chi-squared statistic for this likelihood ratio test by hand.

Q5. Use statistical software and the Hosmer-Lemeshow method to assess how your model from Q4a (that includes age and social support) fits the data. Interpret the output produced. Briefly comment on possible limitations of the Hosmer-Lemeshow technique.

Q6a. Fit a logistic regression model with depression as the outcome, which includes age and social support as independent variables. This time include social support as a linear (trend) term as opposed to a categorical variable.

Interpret the results from your model. Explain whether you would you prefer to present the results of the model from Q6a or Q4a?

Explain why we would not use the Likelihood Ratio test compare the models form Q6 and Q4a.

Q6b. From this model in Q6a what is the predicted probability of depression for someone aged 75.25 and in the highest social support tertile?

Note - Attached the data file to be used to solve the above questions. The questions should be solved using STATA Software.

Attachment:- Data File.rar

Reference no: EM132367541

Questions Cloud

Pioneer in the study of personality types : Isabel Briggs Myers was a pioneer in the study of personality types. The personality types are broadly defined according to four main preferences.
What else might be going on to make up this relationship : What can we conclude about the relationship between these two variables? What else might be going on to make up this relationship?
Determine the? upper-tail critical value of test statistic : When performing a ?2 test for independence in a contingency table with r rows and c? columns, determine the? upper-tail critical value of the test statistic
What is the probability that roastbeef sandwich : If a sandwich is selected at random, what is the probability that it's a roastbeef sandwich?
Plot an appropriately labelled graph with age : Using the model from part 3a, plot an appropriately labelled graph with age on the x-axis and the predicted log odds of depression on the y-axis
State the null and alternative hypotheses : State the null and alternative hypotheses and explain how you develop these two (2) hypotheses.
5 steps of decision tree analysis : Given the 5 steps of decision tree analysis, which of these three conditions yields the most possible outcomes and alternatives and why?
What is the probability that the reporter : What is the probability that the reporter made no typographical errors for the article? Use the Poisson distribution and round answer to 4 decimal places.
Describe what process you would go through : Describe what process you would go through to determine student's GPA in your school using a stratified sample.

Reviews

Write a Review

Applied Statistics Questions & Answers

  How many total people does the credit union employ

How many total people does the credit union employ? If you work for the credit union, and one employee is randomly selected to go to a convention, what is the probability that you will be chosen?

  Comment on the production control situations

Comment on the production control situations depicted by the four control charts shown on the following page, and state what action, if any, would be necessary in each case.

  Q1in the work represented in the image below describe how

q1.in the work represented in the image below describe how each of the following manifest if at alla.nbsp awkward

  Examine the efficacy of studying in groups

A research study was conducted to examine the efficacy of studying in groups. Students were randomly assigned to one of three groups

  1 unemploymenta suppose that 98 million people work and 5

1. unemploymenta. suppose that 98 million people work and 5 million seek work. what is the unemployment rate?b. now

  Statistical questions for practice

Your paper should reflect scholarly writing and APA Referencing standard in case of using references to this activity - Statistical Questions for Practice

  What is the probability that the stock on that day closed

If a person bought 1 share of Google stock within the last year, what is the probability that the stock on that day closed within $45 of the mean for that year?

  University number of students

University Number of Students

  Find a confidence interval for the true mean time

GB513 - Sampling distributions & Estimation - Given these confidence intervals, would it seem very unusual if another sample of this size were to have a mean of 350.0 megabytes per month?

  What are the uses of the chi-square test

What are the uses of the chi-square test in terms of data analysis for quality, safety, and patient safety? Provide two specific examples

  Calculate the degrees of freedom

Use the following data to conduct a difference of means test: Calculate the degrees of freedom. Compare the t-statistic to the critical value for p = 0.05 for the corresponding degrees of freedom

  Compute the power of the acceptance region

Compute the power of the acceptance region - A randomly selected woman is 1 inch taller than the average woman in the sample. Would you predict her earnings to be higher or lower than the average earnings for women in the sample? By how much?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd