1 in this exercise we will be building regression models

Assignment Help Basic Statistics
Reference no: EM13370439

1. In this exercise we will be building regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.

The variables contained in the dataset are:

Y: Price. It is measured in thousands of dollars.

X1: Area. It is the floor area of the house measured in thousands of square feet.
X2: Bed. The number of bedrooms of the house.
X3: Bath. The number of bathrooms of the house.
X4: Pool. Indicates whether the house has a swimming pool.

Questions:
(a) Exploratory part.

i) Plot each of the predictors against the response. Plot the predictors against each other. The purpose here is to get a graphical idea of the relationships in the data. Do not include these plots in your report, just provide a brief summary of what you observed.

(b) Simple linear regression.

i) Fit 3 simple linear regression models with area, bed, and bath as the only predictor in each. Report the estimated parameters from the model that you consider to be the most useful in predicting house prices, along with an explanation why you consider that model to be the most useful one.

ii) Assuming that the best single predictor model is area, provide a 99% confidence interval for the mean price for a house area = 2500 square feet.

iii) Assume your neighbors own a house with area = 2500 square feet. Obtain a 99% prediction interval for the selling price of the house if they decided to sell it.

(c) Multiple linear regression.

i) Fit a regression model using all 4 predictor variables. Report the estimated i parameters and interpret the coefficient for the variable Pool.

ii) Suppose your neighbors house actually has area = 2500 square feet, 3 bedrooms, 3 bathrooms, and a pool. What is the predicted selling price for this house? Obtain a 95% prediction interval.

iii) Conduct an ANOVA P-test and interpret the results. Conduct a test to sec if the number of bedrooms a house has is a useful predictor of its price. Interpret the results. Should we include number of bedrooms in a model with the other 3 variables in it?

iv) Return to the model in (1) and use that as the full model. Fit a model without the variables pool and bath and use that as your reduced model. Conduct the F-test to see whether or not pool and bath are useful predictors using the full and reduced model. Interpret.

2. Let X1,. .. , Xn. denote a random sample from a normal distribution with mean μ and variance σ2. The probability density function (pdf) of Xi, i = 1, .. . , n, is given by

1097_Simple linear regression.png

(a) Derive Derive the likelihood and log-likelihood functions.

(b) Show that the arithmetic mean, X', is the maximum likelihood estimator of the unknown mean μ.

(c) Show that the arithmetic mean, X', is a sufficient statistic for the unknown mean μ.
(d) Show that the sufficient statistic from part 2c is distributed as X'~N(μ,σ2/n).

(e) Use the pdf from part 2d to show that the arithmetic mean, X' is the maximum likelihood estimator of the unknown mean μ.

Reference no: EM13370439

Questions Cloud

Process of performing financial analysis of a public : process of performing financial analysis of a public companygeneral component---no more than one paragraph describing
Term paper for management of strategic operationyou are : term paper for management of strategic operationyou are required to complete a course project that reveals mastery in
Design and synthesis of continuous time : design and synthesis of continuous time controllers.2.learning outcomes covered 1 use matlab and simulink to model and
Taskdesign and implement a c windows phone 8 application : taskdesign and implement a c windows phone 8 application based on the soundboard app in the windows phone 8 development
1 in this exercise we will be building regression models : 1. in this exercise we will be building regression models for predicting house prices. we will be using data collected
Poster presentation - component within a health : poster presentation - component within a health systemquestionselect a health system component to present in the
Variable costing net operating income last : variable costing net operating income last year....55800increase in ending inventory last year....3600variable costing
The bank statement showed a service charge of 56bull acorn : the bank statement showed a service charge of 56.bull acorn made a deposit on 31st may but this deposit did not appear
1 you are an auditor working for 15 million sales per year : 1 you are an auditor working for 15 million sales per year specialty chocolate candy manufacturer. the company is

Reviews

Write a Review

Basic Statistics Questions & Answers

  How many calories should it have using model

Build a predictive model for number of calories using fat grams. If a pizza has 15 grams of fat, how many calories should it have, using your model?

  Would the conclusion remain the same if the two confidence

Would the conclusion remain the same if the two confidence intervals had instead been calculated at 90% confidence? Explain.

  Advantage of using a cluster sample

Which of the following is not an advantage of using a cluster sample instead of other types of samples?

  Critical value for different types of sample size

Find the critical value for different types of sample size and level of significance:

  Conclude that the mean waiting time is less than minutes

At the 0.05 significance level, can we conclude that the mean waiting time at the Warren Road MacBurger is less than 3 minutes?

  Hypotheses to test for distribution of birthdays

Set up hypotheses to test if distribution of birthdays can be considered coming from uniform distribution.

  Mad-mse and mape for forecasting

Calculate (a) MAD, (b) MSE, and (c) MAPE for the following forecast versus actual sales figures.

  P value-critical value and state the final conclusion

Assume that a simple random sample has been selected from a normally distributed population. Find the test statistic, P-value, critical value(s), and state the final conclusion.

  Determining algebra and graphing

A person is planning on saving money according to a rigid savings schedule. Saving plan A is to make an initial deposit of $400 and then deposit $20 per month into the account.

  What ststistical test should be used to analyze data

Identify Ho and Ha for this study, conduct the appropriate analysis, and should Ho be rejected? what should the researcher conclude?

  Determine whether a marketing campaign to increase spending

Determine whether a marketing campaign to increase spending at a direct marketing retailer has resulted in incremental spend. For this campaign, I have two groups: Test and Control. I mail 75,000 customers (Test group) offering them rewards if they s..

  Statistics-using chi-square test

For many years TV executives used the guideline that 30 percent of the audience were watching each of the prime-time networks and 10 percent were watching cable stations on a weekday night.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd