Build regression models for predicting house

Assignment Help Basic Statistics
Reference no: EM1374968

1. In this exercise we will be building regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.

The variables contained in the dataset are:

Y: Price. It is measured in thousands of dollars.

X1: Area. It is the floor area of the house measured in thousands of square feet.
X2: Bed. The number of bedrooms of the house.
X3: Bath. The number of bathrooms of the house.
X4: Pool. Indicates whether the house has a swimming pool.

Questions:
(a) Exploratory part.

i) Plot each of the predictors against the response. Plot the predictors against each other. The purpose here is to get a graphical idea of the relationships in the data. Do not include these plots in your report, just provide a brief summary of what you observed.

(b) Simple linear regression.

i) Fit 3 simple linear regression models with area, bed, and bath as the only predictor in each. Report the estimated parameters from the model that you consider to be the most useful in predicting house prices, along with an explanation why you consider that model to be the most useful one.

ii) Assuming that the best single predictor model is area, provide a 99% confidence interval for the mean price for a house area = 2500 square feet.

iii) Assume your neighbors own a house with area = 2500 square feet. Obtain a 99% prediction interval for the selling price of the house if they decided to sell it.

(c) Multiple linear regression.

i) Fit a regression model using all 4 predictor variables. Report the estimated i parameters and interpret the coefficient for the variable Pool.

ii) Suppose your neighbors house actually has area = 2500 square feet, 3 bedrooms, 3 bathrooms, and a pool. What is the predicted selling price for this house? Obtain a 95% prediction interval.

iii) Conduct an ANOVA P-test and interpret the results. Conduct a test to sec if the number of bedrooms a house has is a useful predictor of its price. Interpret the results. Should we include number of bedrooms in a model with the other 3 variables in it?

iv) Return to the model in (1) and use that as the full model. Fit a model without the variables pool and bath and use that as your reduced model. Conduct the F-test to see whether or not pool and bath are useful predictors using the full and reduced model. Interpret.

2. Let X1,. .. , Xn. denote a random sample from a normal distribution with mean μ and variance σ2. The probability density function (pdf) of Xi, i = 1, .. . , n, is given by

1097_Simple linear regression.png

(a) Derive Derive the likelihood and log-likelihood functions.

(b) Show that the arithmetic mean, X', is the maximum likelihood estimator of the unknown mean μ.

(c) Show that the arithmetic mean, X', is a sufficient statistic for the unknown mean μ.
(d) Show that the sufficient statistic from part 2c is distributed as X'~N(μ,σ2/n).

(e) Use the pdf from part 2d to show that the arithmetic mean, X' is the maximum likelihood estimator of the unknown mean μ.

Reference no: EM1374968

Questions Cloud

Term paper for management of strategic operation : You are required to complete a course project that reveals mastery in application of the operations management concepts emphasised in the course. This involves reporting on a specific operation or process for an organisations and the operations ma..
Design and synthesis of continuous time controllers : Design and synthesis of continuous time controllers - A graphical user interface (GUI) is a human-computer interface that uses menus, dialogue box, and button which can be manipulated by a mouse
Design a c# windows phone 8 application : Design and implement a C# Windows Phone 8 application based on the SoundBoard app in the Windows Phone 8 Development for Absolute Beginners textbook.
What is density of this mixture of gases : A mixture of CO2 and N2 gases has an experiment density of 21922 mol/m3 at T=225K and Pressure= 68.759 MPa and Z=1.67661. What is density of this mixture of gases in mol/m3 by Peng-Robinson equation of state? (Mole fraction of Co2= 10.56%)
Build regression models for predicting house : Build regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.
Illustrates what is maximum number of application per hours : Illustrates what is maximum number of application per hours that can be handled by present configuration of process.
Discuss competitive supply and profit maximization : A number of stores offer film expanding as a service to their consumers. Assume that each store that offers this service has a cost function C(q)=50+0.5q+0.08q2 .
Illustrate what is average inventory if costs are minimized : Illustrate what is average inventory if costs are minimized. Suppose that ordering cost is not $20.68 and Cotteler has been ordering 175 units each time an order is placed. For this order policy.
Discuss transportation and logistics management policy : Discuss transportation and logistics management policy. Illustrate what impact does policy (local, state, and/or federal) have on transportation.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd