Write down the estimated regression model

Reference no: EM131020924

Assignment

This assignment consists of two sections: 1) a quiz with fill-in-the-blank questions; and 2) an SPSS data project.

SECTION 1: QUIZ

Regression with a Dummy Independent Variable:

1. Consider data on personal income (PI) of married and unmarried women. Suppose you find that the average PI is $50,000 for married women and $40,000 for unmarried women. Let Y = PI and X = a dummy for married (that is, 1 = married, 0 = unmarried).

1) How much more do married women make compared to unmarried women, on average?__________________

2) Write down the estimated regression model Y = a + b*X (all info needed is given):

3) Interpret the intercept term: ____________________________________________

4) Interpret the slope term: _______________________________________________
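
The relationship between group means and the coefficients of a regression on a single dummy variable can be checked numerically. The sketch below is illustrative only: the data are hypothetical toy values (not the quiz's figures), and `simple_ols` is a helper name introduced here, not something from the lecture. It fits Y = a + b*X by ordinary least squares and compares the result with the two group means.

```python
from statistics import mean

# Hypothetical toy data (NOT the quiz's numbers): x is a 0/1 dummy, y is the outcome.
x = [0, 0, 0, 1, 1, 1]
y = [10.0, 12.0, 14.0, 20.0, 22.0, 24.0]

def simple_ols(x, y):
    """Return (a, b) for the least-squares line y = a + b*x."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b

a, b = simple_ols(x, y)
mean_group0 = mean(yi for xi, yi in zip(x, y) if xi == 0)
mean_group1 = mean(yi for xi, yi in zip(x, y) if xi == 1)

# The intercept matches the mean of the x = 0 group,
# and the slope matches the difference between the two group means.
print("intercept:", a, "mean of group 0:", mean_group0)
print("slope:", b, "difference in means:", mean_group1 - mean_group0)
```

With a single 0/1 regressor, the fitted line passes through both group means, which is why the two pairs of printed values agree.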

Multiple Linear Regression:

2. Consider the following model of income with three independent variables from the lecture:
Y = a + b1*X1 + b2*X2 + b3*X3 = -9,239 - 4,195*X1 + 141*X2 + 3,020*X3

where X1 is a dummy variable, 0=male, 1=female

and X2 is years of work experience

and X3 is years of education

1) How much more do men earn compared to women on average?

2) How much do women with 10 years of work experience and 16 years of education earn on average?

3) How much do men with 10 years of work experience and 12 years of education earn on average?

4) What variables that could (further) mediate the effect of gender on earnings are omitted here?
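
Predictions from the model above are a matter of plugging values into the equation. The following sketch does that arithmetic in Python; the coefficients are copied from the equation in the text, while the function name `predicted_income` and the constant names are introduced here for illustration.

```python
# Coefficients copied from the estimated model in the text.
INTERCEPT = -9239
B_FEMALE = -4195      # X1: 0 = male, 1 = female
B_EXPERIENCE = 141    # X2: years of work experience
B_EDUCATION = 3020    # X3: years of education

def predicted_income(female, experience, education):
    """Evaluate Y = a + b1*X1 + b2*X2 + b3*X3 for one set of values."""
    return (INTERCEPT
            + B_FEMALE * female
            + B_EXPERIENCE * experience
            + B_EDUCATION * education)

# Women with 10 years of experience and 16 years of education.
women = predicted_income(female=1, experience=10, education=16)

# Men with 10 years of experience and 12 years of education.
men = predicted_income(female=0, experience=10, education=12)

# Gap between men and women with the same experience and education.
gap = predicted_income(0, 10, 16) - predicted_income(1, 10, 16)

print(women, men, gap)
```

Because X1 enters the model only through its own coefficient, the gap between men and women is the same for any fixed values of X2 and X3.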

Two-Way Table: Marginal Distribution and Conditional Distribution

3. Consider the two-way table below, based on a four-year study of the relationship between anger and heart disease among a random sample of individuals. The subjects (i.e. participants in the study) were free of heart disease at the beginning of the study, when they took a test that measured how prone they were to sudden anger. Their heart health was monitored over a four-year period, and it was recorded whether they developed Coronary Heart Disease (CHD). In short, the study examines whether anger levels are associated with the likelihood of developing coronary heart disease. Now please answer questions 1) to 3):

1) In the two-way table below, report the marginal distributions and the total sample size, in counts and percent.

2) In the two-way table below, report the conditional distributions of Coronary Heart Disease in percent. Note that the conditional distribution of Coronary Heart Disease refers to the distribution of Coronary Heart Disease given a certain Anger Level.

3) With reference to your calculations above, discuss whether there is potential association between anger and Coronary Heart Disease.

#Individuals        Coronary Heart Disease    NO Coronary Heart Disease

Low Anger                    530                      3,057
Moderate Anger             1,100                      4,621
High Anger                   270                        606
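
The marginal and conditional distributions asked for above follow mechanically from the cell counts. A minimal sketch in Python is given here, assuming the counts exactly as printed in the table; the dictionary layout and variable names are illustrative choices, not part of the assignment.

```python
# Counts copied from the two-way table in the text.
counts = {
    "Low Anger":      {"CHD": 530,  "No CHD": 3057},
    "Moderate Anger": {"CHD": 1100, "No CHD": 4621},
    "High Anger":     {"CHD": 270,  "No CHD": 606},
}

# Marginal totals: sum across each row and down each column.
row_totals = {level: sum(cells.values()) for level, cells in counts.items()}
col_totals = {
    outcome: sum(cells[outcome] for cells in counts.values())
    for outcome in ("CHD", "No CHD")
}
grand_total = sum(row_totals.values())

# Marginal distributions in percent of the whole sample.
row_pct = {level: 100 * n / grand_total for level, n in row_totals.items()}
col_pct = {outcome: 100 * n / grand_total for outcome, n in col_totals.items()}

# Conditional distribution of CHD given anger level: divide by the row total.
conditional_pct = {
    level: {outcome: 100 * n / row_totals[level] for outcome, n in cells.items()}
    for level, cells in counts.items()
}

for level in counts:
    print(level, row_totals[level], round(conditional_pct[level]["CHD"], 1))
```

Comparing the conditional percentages of CHD across the three anger levels is the basis for question 3): if they differ noticeably, the table suggests an association.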

 

 

 

 

 

SECTION 2: SPSS PROJECT

1. Regression with One Independent Variable vs. Regression with Multiple Independent Variables

Use the dataset from Assignment#2 (StateData_hw2.sav) to estimate the following two models, and then answer questions 1) to 4):

Model 1: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers. [You may have done this already in Assignment#2. If so, just repeat the estimation.]

Model 2: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers (X1) and state median household income (X2).

[Hint: Multiple regression was covered in Lecture Note #6.2. To estimate the second model with two variables, X1 and X2, the SPSS procedure is: Analyze > Regression > Linear; select the Dependent variable, select both X1 and X2 as Independent variables, and click "OK".]

1) Provide the regression equations for both models and the corresponding values for R².

2) For the second model, provide interpretations of the constant term and the two slopes.

3) Explain intuitively why the effect of % smoking changed the way it did when the median income was accounted for.
[No loss of points for this question. Just give it a try. I hope to encourage you to think harder about the effect of each independent variable, as well as the interaction of the effects, in the multiple regression model. Formal discussion about such problems may come in 9172.]

4) Use the two models to predict the HDDR (heart disease death rate) for New York State, and then compare the two predicted values to the actual value of HDDR for New York State.
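
For readers without SPSS at hand, the effect of adding a second, correlated predictor can be illustrated with a small Python sketch. Everything here is an assumption for illustration: the numbers are hypothetical toy values (not StateData_hw2.sav), the names `smoke`, `income`, and `death_rate` are stand-ins, and the closed-form two-predictor formulas replace the SPSS procedure described in the hint.

```python
from statistics import mean

# Hypothetical toy data (NOT StateData_hw2.sav). The outcome is built exactly as
# death_rate = 100 + 2*smoke - 3*income, and smoke and income are negatively correlated.
smoke = [1.0, 2.0, 3.0, 4.0, 5.0]
income = [5.0, 3.0, 4.0, 1.0, 2.0]
death_rate = [100 + 2 * s - 3 * i for s, i in zip(smoke, income)]

def cross(u, v):
    """Sum of cross-products of deviations from the means."""
    u_bar, v_bar = mean(u), mean(v)
    return sum((ui - u_bar) * (vi - v_bar) for ui, vi in zip(u, v))

# Model 1: one predictor, y = a + b*x1.
b_simple = cross(smoke, death_rate) / cross(smoke, smoke)
a_simple = mean(death_rate) - b_simple * mean(smoke)

# Model 2: two predictors, y = a + b1*x1 + b2*x2 (closed-form normal equations).
s11, s22, s12 = cross(smoke, smoke), cross(income, income), cross(smoke, income)
s1y, s2y = cross(smoke, death_rate), cross(income, death_rate)
det = s11 * s22 - s12 ** 2
b1 = (s1y * s22 - s2y * s12) / det
b2 = (s2y * s11 - s1y * s12) / det
a = mean(death_rate) - b1 * mean(smoke) - b2 * mean(income)

print("Model 1 slope on smoke:", b_simple)
print("Model 2 slopes on smoke and income:", b1, b2)
```

In this toy setup, the one-predictor slope on `smoke` is larger than the two-predictor slope because `smoke` also picks up part of the effect of the omitted, correlated `income` variable. Whether the same happens in the assignment's data has to be checked from the SPSS output.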
