Reference no: EM132390153
BSTAT 3322. BUSINESS STATISTICS II
Assignment
Firstly create your own dataset. You have been assigned a random number seed (it appears in Canvas). Using random sample procedures for R that I have demonstrated, from Fall 2019 Assignment 2.xlsx (also available on Canvas), take a random sample of 100 observations. Use the first 80 of these for Parts 1, 2 and 3. Use the last 20 for Part 4.
These data result from a study of patient satisfaction at a network of dermatology practices. The variables are: "Patient" (once you've created your dataset you can ignore this one); "Sex"; "Satisfaction"; "Effectiveness" (the patient's self-reported view of the effectiveness of the procedure (1 to 80); and "Pain" (the patient's self-reported post- procedure pain (1 to 80).
Part 1 . What is a 90% confidence interval for the correlation between Satisfaction and Effectiveness?
Part 2 . Conduct a simple regression analysis with Satisfaction as your dependent variable and Pain as your independent variable. Discuss what you see/observe.
Part 3 . Conduct a multiple regression analysis with both Effectiveness and Pain as your predictors. Discuss what you see/observe. Having done this, evaluate (and discuss) whether the incorporation of Sex into your model would be useful.
Part 4 . Using your "hold-out" sample, cross-validate the model you fit in Part 3. Discuss what you see/observe.
Attachment:- Assignment Data.rar