Build regression models for predicting house

Assignment Help Basic Statistics
Reference no: EM1374968

1. In this exercise we will be building regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.

The variables contained in the dataset are:

Y: Price. It is measured in thousands of dollars.

X1: Area. It is the floor area of the house measured in thousands of square feet.
X2: Bed. The number of bedrooms of the house.
X3: Bath. The number of bathrooms of the house.
X4: Pool. Indicates whether the house has a swimming pool.

Questions:
(a) Exploratory part.

i) Plot each of the predictors against the response. Plot the predictors against each other. The purpose here is to get a graphical idea of the relationships in the data. Do not include these plots in your report, just provide a brief summary of what you observed.

(b) Simple linear regression.

i) Fit 3 simple linear regression models with area, bed, and bath as the only predictor in each. Report the estimated parameters from the model that you consider to be the most useful in predicting house prices, along with an explanation why you consider that model to be the most useful one.

ii) Assuming that the best single predictor model is area, provide a 99% confidence interval for the mean price for a house area = 2500 square feet.

iii) Assume your neighbors own a house with area = 2500 square feet. Obtain a 99% prediction interval for the selling price of the house if they decided to sell it.

(c) Multiple linear regression.

i) Fit a regression model using all 4 predictor variables. Report the estimated i parameters and interpret the coefficient for the variable Pool.

ii) Suppose your neighbors house actually has area = 2500 square feet, 3 bedrooms, 3 bathrooms, and a pool. What is the predicted selling price for this house? Obtain a 95% prediction interval.

iii) Conduct an ANOVA P-test and interpret the results. Conduct a test to sec if the number of bedrooms a house has is a useful predictor of its price. Interpret the results. Should we include number of bedrooms in a model with the other 3 variables in it?

iv) Return to the model in (1) and use that as the full model. Fit a model without the variables pool and bath and use that as your reduced model. Conduct the F-test to see whether or not pool and bath are useful predictors using the full and reduced model. Interpret.

2. Let X1,. .. , Xn. denote a random sample from a normal distribution with mean μ and variance σ2. The probability density function (pdf) of Xi, i = 1, .. . , n, is given by

1097_Simple linear regression.png

(a) Derive Derive the likelihood and log-likelihood functions.

(b) Show that the arithmetic mean, X', is the maximum likelihood estimator of the unknown mean μ.

(c) Show that the arithmetic mean, X', is a sufficient statistic for the unknown mean μ.
(d) Show that the sufficient statistic from part 2c is distributed as X'~N(μ,σ2/n).

(e) Use the pdf from part 2d to show that the arithmetic mean, X' is the maximum likelihood estimator of the unknown mean μ.

Reference no: EM1374968

Questions Cloud

Term paper for management of strategic operation : You are required to complete a course project that reveals mastery in application of the operations management concepts emphasised in the course. This involves reporting on a specific operation or process for an organisations and the operations ma..
Design and synthesis of continuous time controllers : Design and synthesis of continuous time controllers - A graphical user interface (GUI) is a human-computer interface that uses menus, dialogue box, and button which can be manipulated by a mouse
Design a c# windows phone 8 application : Design and implement a C# Windows Phone 8 application based on the SoundBoard app in the Windows Phone 8 Development for Absolute Beginners textbook.
What is density of this mixture of gases : A mixture of CO2 and N2 gases has an experiment density of 21922 mol/m3 at T=225K and Pressure= 68.759 MPa and Z=1.67661. What is density of this mixture of gases in mol/m3 by Peng-Robinson equation of state? (Mole fraction of Co2= 10.56%)
Build regression models for predicting house : Build regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.
Illustrates what is maximum number of application per hours : Illustrates what is maximum number of application per hours that can be handled by present configuration of process.
Discuss competitive supply and profit maximization : A number of stores offer film expanding as a service to their consumers. Assume that each store that offers this service has a cost function C(q)=50+0.5q+0.08q2 .
Illustrate what is average inventory if costs are minimized : Illustrate what is average inventory if costs are minimized. Suppose that ordering cost is not $20.68 and Cotteler has been ordering 175 units each time an order is placed. For this order policy.
Discuss transportation and logistics management policy : Discuss transportation and logistics management policy. Illustrate what impact does policy (local, state, and/or federal) have on transportation.

Reviews

Write a Review

Basic Statistics Questions & Answers

  What is the mean of the sample means

A state-wide sample survey is to be made. First, the state is subdivided into counties. Seven counties are selected at random and further sampling is concentrated on these seven counties.

  Estimating probability values using tree diagram

What is the probability that a randomly chosen employee is an Administrator if the employee has the opportunity of gaining promotion next year?

  Finding the level of satisfaction with the given data

Do the data indicate that the sleeping pill is effective at a 1% level of significance?

  Confidence interval for average number of credit card

Use this information to compute a 95% confidence interval for average number of credit cards carried by students.

  Using anova instead of regression to solve problem

What can be done to remedy multicollinearity? Use ANOVA instead of Regression.

  Fundamental difference in the findings

Create a hypothesis and perform a series of tests to determine if there is any fundamental difference in the findings.

  Least square regression line

Based on a scatter diagram, would you estimate the linear correlation coefficient to be:

  Develop and complete the total cost for each plan

Expand and aggregate plan using each of the following guidelines and complete the total cost for each plan. Which plan has the lowest total cost.

  Permutations rule to determine number of different ways

Use the permutations rule to determine the total number of different ways the winners can be drawn.

  What is the likelihood at least one of selected flights

Assume the likelihood that any flight on Northwest Airlines arrives within 15 minutes of the scheduled time is 0.90. We choose four flights from yesterday for study.

  The probability she did not receive any email

An internal study by the Technology Services department at Lahey Electronics revealed company employees receive an average of two emails per hour. Suppose the arrival of these emails is approximated by the Poisson distribution.

  Nick uses the equally likely criterion

Assume Nick doesn't know how his received he suppose that all three states of nature are equally likely to occur. If Nick uses the equally likely criterion illustrate what decision would he make.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd