Test the global significance of full model applied to data

Assignment Help Applied Statistics
Reference no: EM132352704

Assignment - Answer all questions.

Instructions: You must use SAS to obtain appropriate analyses of the data where required and use the SAS output obtained in answering the questions that follow the problem statement. Use a 5% level of significance (α = 5%) where appropriate unless otherwise specified. (Use 3 decimal places in your calculations and answers). You must also submit your SAS program file.

The Problem - Air Pollution

A climatologist is interested in predicting air quality in cities in the USA. Air quality is measured by the mean concentration of sulphur dioxide (SO2) in the air. Information pertaining to 7 (possible) explanatory variables was gathered over a 3-year period. Full data was collected for 41 US cities.

Data collected:

Sulphur dioxide (SO2)

Average sulphur dioxide content of the air in micrograms per cubic metre.

Temp

Average annual temperature in degrees Fahrenheit

Factory

Number of manufacturing enterprises employing 20 or more workers

Pop

Population in thousands in 1999

Wind

Average annual wind speed

Precip

Average annual precipitation in inches

Dayrain

Average number of days with precipitation per year

Dust

Average concentration of dust particles in ppm (parts per million)

The data is available on Canvas as 'Pollution.txt'.

Use SAS to run a full multiple linear regression analysis on the data, in order to answer the questions that follow.

Questions -

(i) Copy the ANOVA table for the full model into the table below. Explain how the degrees of freedom for each component of variation is calculated.

Source

df

Sum of squares

Mean square

F

Model





Error





Total





(ii) Use an F test to test the global significance of the full model applied to the data, include your hypothesis, test statistics (i.e. F and critical F) and your full conclusion.

(iii) State the value of the coefficient of determination for the full model and explain how it relates to the SSError.

(iv) Examine the SAS output and test the significance of each of the individual explanatory variables. You should include your hypotheses, and explain how you have drawn your conclusions.

(v) Use a stepwise procedure to obtain the optimal (best fit) model. State the least squares estimate of this model, explaining each of the terms in your model.

(vi) State and describe how you would interpret the R2, adj R2 and Cp statistics to identify the optimal model obtained in part (iv).

(vii) Use the optimal model to predict SO2 for a city where the average temperature is 60.7oF, there are 350 enterprises with 20 or more workers, a population of 580000, average wind speed of 9.5, average precipitation of 30.0 inches, 150 days of rain on average and a dust concentration of 7.0 ppm. State and interpret the 95% prediction interval for this city.

(viii) Explain how you would calculate residuals for the optimal model. You may obtain and use residuals from the SAS output to aid your explanation.

(ix) Use the residual plots provided in the SAS output to assess the validity of assumptions that underlie the model. Refer specifically to the distribution of the residuals, the mean of the residuals and homoscedasticity.

Reference no: EM132352704

Questions Cloud

Diverse microorganisms that cause no harm : The skin is the human body's largest organ. It is colonized by diverse microorganisms that cause no harm to their host
Ratio analysis on the financial statements : Now that you have their financial information I would like you to perform a ratio analysis on the financial statements. Focus on financial statement analysis.
What quantity would maximize profits for the firm : What quantity would maximize profits for this firm? (Hint: Recall that profit maximizing is where MR = MC). At what price should this firm sell its product.
Explain how oxidation of a substrate proceeds without oxygen : Explain how oxidation of a substrate proceeds without oxygen? Does this involve glycolysis?
Test the global significance of full model applied to data : Use an F test to test the global significance of the full model applied to the data, include your hypothesis, test statistics (i.e. F and critical F)
What characteristics do you observe in the industry : What characteristics do you observe in the industry that support this identification? Provide evidence. What does the market type suggest about pricing.
Briefly discuss pros and cons of the merger : Briefly discuss pros and cons of the merger (effect on the industry, effect on consumers) based on the following: market concentration.
Composed of charged colored ions : Most stains used in microbiology are usually salts, or sometime acids or bases, composed of charged colored ions. The colored ions are called chromophores
Discuss non-price competition within the industry : Discuss non-price competition within the industry. Describe the bargaining power of the company's suppliers and buyers. Describe the long-run goals of company.

Reviews

len2352704

8/5/2019 11:44:01 PM

You are required to answer all questions on the test paper provided, this should be submitted via Canvas. Instructions: You must use SAS to obtain appropriate analyses of the data where required and use the SAS output obtained in answering the questions that follow the problem statement. Use a 5% level of significance where appropriate unless otherwise specified. (Use 3 decimal places in your calculations and answers). Show all calculations. Answer all questions in the spaces provided. You must also submit your SAS program file on Canvas.

Write a Review

Applied Statistics Questions & Answers

  Assume that the probability of conception in any given month

Assume that the probability of conception in any given month among sexually active couples not practicing birth control is constant at 0.20 per month, independent of the number of months the couple has been active. what is the expected waiting time t..

  Design two separate pivot tables for the training data

Evaluate and comment on the Results. Should the data be normalized? Discuss what characterizes the components.

  Plot the probability function of exact sampling distribution

Plot the probability function of the exact sampling distribution for P, assuming a sample size of n = 10 and population prevalence of 0.5

  Part-1- first reset the lower limit to zero and the upper

part-1- first reset the lower limit to zero and the upper limit to 1000 and then click update.- now put 6 points

  Compute expected frequencies for each of the cells

Compute expected frequencies for each of the cells in the table in part c). Do you feel that the exponential distribution provides an adequate description of these data?

  Use the compromise function to compute alpha and beta

Assume that the result is a sample size beyond what you can obtain. Use the compromise function to compute alpha and beta for a sample half the size. Indicate the resulting alpha and beta. Present an argument that your study is worth doing with the s..

  Plot a scatter graph of the data

Plot a scatter graph of the data. Does the scatter graph indicate that a linear relationship exists for all or part of the range of the data

  Calculate the sample size for the mean or sample

Calculate the sample size for the mean or sample or sample size for the proportion, using a 95% confidence level, estimated population standard deviation or estimate of the true population proportion, and a 5% margin of error.

  Develop two separate research questions

SOC 207 Assignment - Independent Data Analysis. Develop two separate research questions using the independent variables and dependent variables

  Exploration and analysis of data

Write a meaningful essay that includes the following terms: sample, population, confidence level, estimate, mean, margin of error. Discuss the ethical impli

  Prepare a numerical summary report about the data

HOLMES INSTITUTE, Australia - HI6007 Statistics and Research Methods for Business Decision Making Assignment. Prepare a numerical summary report about the data

  Discrete probability concepts to determine a course of actio

Given a business situation word problem or case study, such as defective items or waiting lines, use discrete probability concepts to determine a course of action

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd