What are significant predictors of chd

Assignment Help Basic Statistics
Reference no: EM13246805

A retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case of CHD. Many of the CHD positive men have undergone blood pressure reduction treatment and other programs to reduce their risk factors after their CHD event. In some cases the measurements were made after these treatments. These data are taken from a larger dataset, described in Rousseauw et al, 1983, South African Medical Journal.

There are 463 observations in the dataset. The variables in the dataset are :

sbp - systolic blood pressure
tobacco - cumulative tobacco (kg)
ldl - low density lipoprotein cholesterol adiposity
famhist - family history of heart disease (Present, Absent)
typea - type-A-behavior obesity
alcohol - current alcohol consumption
age - age at onset
chd - response, coronary heart diseease

If you would prefer to analyze this data in using some other statistical package, you will need to export the data from R using something like a write.table command (or some variation thereof).

The following questions are of practical interest:

1. What are significant predictors of CHD? What would a final model look like and can you provide an estimate of its predictive accuracy (i.e. do model selection and then evaluate predictive accuracy) ? What functional forms are most appropriate for the various predictors in your final model ?

2. Since high ldl often precedes a diagnosis of CHD, will a two stage model which first uses ldl as a response in stage 1 and then CHD as a response in stage 2, provide more accurate predictions of CHD than the model built question 1 above ?

3. There are often situations where finding just one obviously best submodel is dicult. There may be many good competing sub-models.

However, you might decide to bring together multiple models to improve predictive performance. Develop a strategy for doing this on this dataset, being careful to clearly compare and contrast (to the single model approach) predictive performance. Also, make sure to clearly motivate your strategy giving enough intuition so that I can follow things easily.

Please provide complete justifications for why you chose a particular modeling strategy including the underlying assumptions you are making. Analyze the data and provide some overall inferences with regards to the questions being posed. Write a report that details your analysis.

Reference no: EM13246805

Questions Cloud

What is the electric potential at given point : Point charges q1=+2.00?C and q2=?2.00?C are placed at adjacent corners of a square for which the length of each side is 4.50cm, What is the electric potential at point b
Describe what is the amat : Assume a block size of 256 bytes, a clock rate of 1GHz, an L1 miss rate of 2%, and that main memory takes 100ns of overload and then delivers 16 bytes per clock cycle. What is the AMAT
Find the largest electrical output : a river with a water temperature T(L)=20 degree C is to be used as the low temperature reservoir of a large power plant, what is the largest electrical output that the plant can deliver to its customers
How much average power is being wasted due to switching : A MOSFET transistor is being used as a converter switch in a 100v system. It is switching at 50 KHz and has a linear transition time of 3us. The full load current is 40 amps. - How much average power is being wasted due to switching
What are significant predictors of chd : What are significant predictors of CHD and what would a final model look like and can you provide an estimate of its predictive accuracy
Explain what is the concentration of cu2+ cell : What is the concentration of Cu2+ in the following cell at 25 degrees C if the cell voltage is 0.955V?
What is the mans speed at the instant : An 80.0-kg man jumps from a height of 2.50 m onto a platform mounted on springs, What is the man's speed at the instant he depresses the platform 0.120 m
Explain how many molecules of acetylene react with oxygen : How many molecules of acetylene (HCCH) react with 131 molecules of oxygen to produce carbon dioxide and water
Determine the surface charge density : An air-filled capacitor consists of two parallel plates, each with an area of 7.60 cm2, separated by a distance of 1.60 mm. An air-filled capacitor consists of two parallel plates, each with an area of 7.60 cm2, separated by a distance of 1.60 mm.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Lower and upper limits bounding

Suppose the weight of a product is normally distributed with a mean of 1.5 and a variance of 0.2. What percentage of products will have weights within +/- 3 standard deviations? Critically discuss the lower and upper limits bounding 50% of product we..

  Find how large should the sample sizes be

Suppose the pollster has no prior information about the proportions. If equal numbers of men and women are to be polled, how large should the sample sizes be?

  Information about confidence interval estimate

When estimating a population mean with a confidence interval estimate, then E is:

  Conclusions from hypothesis test for two sample proportion

The Wall Street Journal recently ran an article indicating differences in perception of sexual harrassment on the job between men and women.

  Estimating population proportion-sample size

Determine the sample size needed in order to be 99% confident that the sample proportion of the current customer accounts is within .03 of the true proportion of all current accounts for this company.

  When to say that claim is true even without a formal test

If the claim says that the population mean is greater than 200 and the sample mean is 215, we can say that the claim is true even without a formal test.

  Hypothesis testing for motor vehicle department

Data from the Motor Vehicle Department indicate that 80% of all licensed drivers are older than age 25. In a sample of n=60 people who recently received speeding tickets, 38 were older than 25 years and the other 22 were age 25 or younger.

  Is it significant to accept or reject null hypothesis

If the researcher computes the t-statistic to be 4.2 and t-value found in t table for df=60 and level of significance of 0.5, is 2.0 researcher would accept or reject null hypothesis.

  What to conclude at the significance level

Seven employees were included from Area A, 9 from Area B and 12 from Area C. The test statistic was computed to be 4.91. What can we conclude at the 0.05 level?

  Given the linear correlation coefficient r and the sample

Given the linear correlation coefficient r and the sample size n, determine the critical values of r and use your finding to state whether or not the given r represents a significant linear correlation

  Word problem for probability ratio

Suppose Napoleon were using Bayes' theorm to revise his information. To do so, he would have had to make some judgements about P(Prussian and English Join forces |Napoleon Wins) and P(Prussian and English Join forces |Napoleon loses).

  Estimate mean weight loss to within two pounds

The standard deviation of the population weight losses is about 10 pounds. How large a sample should he take to estimate the mean weight loss to within 2 pounds, with 95% confidence?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd