Write the associated linear model

Assignment Help Advanced Statistics
Reference no: EM132579041

Data description

We consider a dataset from 1994 relating hourly wage to education level, experience, gender, and physical attractiveness. More precisely, for each person, the following variables have been collected:
• Wage - hourly wage (in dollars);
• Educ - years of schooling (in years);
• Exper - working experience (in years);
• Female - Dummy variable (0. Male, 1. Female);

• Looks - Score ranging from 1 (not attractive) to 5 (attractive) (No unit);
Of particular interest is whether education has a statistical impact on people's salary or not. Moreover, this dataset will also enable us to investigate whether physical appearance plays a statistical role in the wage of a person.

Part I - Study on a small sample

We first pick at random from the dataset a small sample of n = 6 people and consider only the variables Educ (x) and Wage (y) for simplicity. The related data can be found in Table 1 and Table 2.

1. Use Table 1 and Table 2 to calculate the sample covariance cxy and the sample correlation rxy. What kind of relationship does it reveal?

2. Write the linear model associated to the linear regression of Wage on Educ. Use Table 2 and your result from the previous question to calculate the LSE (ˆb0, ˆb1).

3.Use Table 2 to calculate the coefficient of determination R2 of the regression.

Part II - Study on a large sample

Therefore, we now focus on a large sample of n = 706 people. Since the sample from Part I is very small, the numerical values are not reliable at all in practice.

1. We have reported in Figure 1 the scatter plot of Wage versus Educ with the line of best fit in red. In a few words, discuss the relationship between Wage and Educ according to the scatter plot. Is your conclusion the same as for Question I.1?

2. We consider the simple linear regression of Wage on Educ.
(a) Write the associated linear model. How many coefficients (including the intercept) do we have to estimate?
(b) Based on the results of the regression in Table 3, interpret in words the regression coefficient related to Educ and explain how hourly wage is affected by education. According to the model, what is the average impact of an additional year of education on hourly wage?
(c) Calculate a 95% confidence interval for b1. Is the relationship between Wage and Educ statistically significant? Explain.

(d) Do you find the estimated value for the intercept in Table 3 surprising? How would you interpret it? Use the reported p-value to assess the statistical significance of the intercept. Conclusion?

3. We consider the global regression of Wage on all the other variables. The results are documented in Table 4.

(a) Write the associated linear model. How many coefficients (including the intercept) do we have to estimate?

(b) Based on the results of the regression in Table 4, briefly interpret in words the regression coefficients related to each variable. Explain what is the meaning of the coefficient for the qualitative variable Female.

(c) Calculate the missing t-values in Table 4, and rank the variables according to their signifi- cances in the regression.

(d) Calculate the p-value for the variable Female (the p-value for the two-sided significance test). What are the significant variables in the regression? Explain. In particular, how does beauty affect wage? How do you explain this impact?

(e) Compare the R2 of the simple and the multiple model. Did we improve much the fit by adding Exper, Looks and Female in the regression? Explain.

(f) Predict the average hourly wage for a man who is such that: Educ = 12, Exper = 11, and Looks = 3.

(g) All things being equal, how many dollars per hour does a woman earn more/less than a man? Explain.

Part III - Tests

1. A study claims that the average hourly is $ 6.7. In the large sample of n = 706 people from Part II, we measured a sample average of $6.3 minutes, for a sample standard deviation of $4.7. Define the pair of hypotheses for the two-sided test of means, calculate the test statistic, its p-value, and finally run the test at significance level α = 5%. Do you agree with the study?

2. A study claims that exactly 50% of the population from which the dataset from Part II has been sampled are men. We measured a sample proportion of men of 55%. We consider the one-sided set of hypotheses H0 : p = 50% against H1 : p > 50% where p is the proportion of men. Calculate the related test statistic, its p-value, and finally run the test at significance level α = 1%. Do you agree with the study?

Attachment:- Statistics Final.rar

Reference no: EM132579041

Questions Cloud

Social media can enhance employee engagement : A proposal is a formal document prepared to offer a suggestion or recommendation. Social media can enhance employee engagement.
Essay Assignment Topic - Fire and Life Safety : Essay Assignment Topic - Fire and Life Safety. Description - Instructions - Fire and Life Safety Education Program Plan
What is the fundamental purpose of a prototype : How might a business learn about a proposed process improvement through a prototyping activity? What is the fundamental purpose of a prototype?
Currency risks associated with export strategy : Foreign Direct Investment from United States to Egypt for Apple Watch. The questions are What are the currency risks associated with an export strategy?
Write the associated linear model : Write the associated linear model. How many coefficients (including the intercept) do we have to estimate and Calculate the missing t-values in Table 4
American History Essay Assignment : American History Essay Assignment - Topic - Discuss what you consider to be the major causes of the civil war. Description - Use c-span video
Logistics tasks involved in one service supply chain : Explain logistics tasks involved in one service supply chain, say, involving a hospital or restaurant.
Identify the pros and cons of driverless vehicles : This chapter discusses autonomous vehicles or driverless cars. Identify the pros and cons of driverless vehicles.
Which size fragments will be closest to the wells : Which size fragments will be closest to the wells? Large or small?

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Marketing budget and control

Identify quantifiable elements that can be used to evaluate, monitor, and control the effectiveness of your marketing plan. The phones will be marketed globally thru AT&T. The budget is US dollars.

  What is the probability that there is no storm in january

What is the probability that there is no storm in january and what is the probability that there is no damage-inducing storm in january

  Question 1a corporation produces packages of paper clips

question 1a corporation produces packages of paper clips. the number of clips per package varies as indicated below for

  Frequency distribution of a variable and bar graph

Descriptives of a continuous : mean, median, mode, skewness, kurtosis, standard deviation and cross tabulation of two variables

  1 sam lucarelli owner of lucarelli products is evaluating

1. sam lucarelli owner of lucarelli products is evaluating whether to produce a new product line. after thinking

  What role do individuals and management play

What role do individuals and management play in ensuring the appropriate business model is chosen, used, and evaluated for effectiveness

  Collecting data in statistics

You anticipate that your presentation with PiggyBank will go well, and want to get ideas for collecting data. Go to the Discussion Board and discuss data collection methods with your peer/mentor group.

  1 jean siskel is an entertainment analyst for west

1. jean siskel is an entertainment analyst for west coast securities. he is trying to develop a model to

  Investigate the relationship between tumour size

It gives the body fat percentage, age and gender for 18 normal adults between the ages of 23 and 61 years. Are age and % fat related, and if so, in what way? There is correlation.

  Determine the number of degrees of freedom

derive MLE?s of probabilities A, B, and C, and determine the number of degrees of freedom for the LR goodness-of fit statistic

  Discuss issue of statistical power in non-parametric tests

Discuss the issue of statistical power in non-parametric tests (as compared to their parametric counterparts). Which type tends to be more powerful? Why?

  Compute the mean and standard deviation

Form a frequency distribution having 9 class intervals and form a percentage distribution from the frequency distribution (from part a) - Compute the mean, standard deviation and Coefficient of variation

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd