Write the equation for the regression line

Assignment Help Applied Statistics
Reference no: EM132536256

How Much is a Fireplace Worth?

We are going to step away from Fast data for this assignment. We are interested in being able to predict the price of a home using regression on available data. More specifically, we are interested in knowing how much value having a fireplace adds to the price of a home.

Download the Excel workbook M5A1 Home Price Data. Start Excel. Open the workbook M5A1 Home Price Data and immediately save the workbook with a new name. Use your name and include the assignment name, e.g. Wright_Dawn_M5A1. This will ensure you have a good copy in case you make mistakes. M5A1 also requires you to submit a Word file containing your written report on your findings. The Word file should be named in the same fashion. It will also make your instructor happy when grading your work, which is a good thing.

Background: The website Zillow recently sponsored a contest with a prize over a $1 million to whoever could come up with a method that would reliably predict the selling price of a home in the United States. Zillow was using a method based on regression in which many variables were used to develop an equation predicting the actual selling price of homes listed on the site. The old method was producing median estimates that were consistently within 5% of the actual selling price, but Zillow customers were not satisfied. Get more information on the "Zestimates" here.

In this assignment, we are going to simplify the Zillow problem by focusing on a relatively small geographic area in Saratoga County, New York and on a small group of variables. The data in the Home Price Data file is actual data gathered from public records a few years ago.

In 2007, the National Association of Realtors said their survey showed that home buyers were willing pay about $1220 more for a house with a fireplace but having a fireplace could boost the value of a home by $12,000 on some locations. In 2016, an Angie's list survey found that having a fireplace could add about 12% to the selling price of a home.

1. Using the method you used in M4A1 and all the data, run a two-sample t-test for means to determine if the difference in mean Home Price for homes with a fireplace and for those without a fireplace is statistically significant. Do not try to test to see if the difference is the same as the Angie's List estimate or the NAR estimate.

Using that information from your t-test, how much does having a fireplace change the value of a home? How does your estimate of the value of a fireplace from the t-test compare to the surveys discussed above? What could explain why your "fireplace value" is different than the values reported in the surveys?

2a. Using all the data for all homes, make a scatter plot of Home Price, the response variable (y), and Living Area, the predictor variable (x).

How to do it:
• Excel Scatter Plot
• Excel Simple Linear Regression

2b. Add a trend line with equation and R2 to the scatter chart.

How to do it: Excel Trendline & Equation

3a. Run a simple linear regression between Home Price, the response variable (y), and Living Area, the predictor variable (x). Use all records in the data set. Is the regression statistically significant? Are the coefficients statistically significant? What is the R2 and how do you interpret it?

3b. Write the equation for the regression line using the information from the regression output. Using that equation, predict the value of a home of 1000 SF, 2500 SF, and 6000 SF.

How to do it: Excel Response Variable Predictions

3c. Find confidence and prediction intervals around the values of the three size homes in 3b. Report the home price point estimates as well as the CIs and PIs for each of the three. In your report, state why the two intervals are different, and which should be used to predict a specific size house price.

How to do it: Excel CIs and PIs of Regression Predictions

Note: Confidence and Prediction Interval Excel Calculator is in your M5A1 Data file.

4a. Run a multiple regression on the entire data set using Home Price as the response variable (y) and Living Area and Fireplace as independent variables. This means one of your independent variables will be categorical. Is the regression statistically significant? Are the coefficients statistically significant? What is the adjusted R2 and how does it compare to the R2 you found in the simple linear regression?

How to do it:
• Multiple Regression with Categorical Data
• Plot Two Datasets on One Graph

4b. Use the regression equation in 4a to find the prices of 1000, 2500, 6000 SF homes with and without a fireplace. Estimate the value of having a fireplace on the price of a home with this information. How does this estimate differ from the ones you found earlier?

5. Separate the data into two subsets of homes with and without a fireplace.
a. Create a new scatter graph with the data of both "with fireplace" and "without fireplace" plotted on the same graph. This means you will have two sets of data and trend lines with equations and R2s on the same graph. Format and label the graph to make it communicate clearly the two data sets.

b. Analyze the graph with the two datasets. Are the trend lines approximately parallel or do they intersect? Hint: if the slopes are not equal, the lines will intersect eventually. If they intersect, what does that imply about the effect of the size of a home on price and having a fireplace or not?

c. Using the two equations for the two trend lines, find the predicted values of the 1000, 2500 and 6000 SF homes with and without a fireplace. Create a table similar in Excel to this which you will insert a copy in your written report:

6. In question 4, you ran a multiple regression using a dummy variable for the categorical variable Fireplace. You will recall that the resulting regression equation used to predict prices for homes with and without a fireplace gave a constant value regardless of size of the home.

In question 5, you made a scatter plot of the with and without fireplace data and possibly saw the two regression lines were not parallel and likely intersected. The latter suggests that the rate of increase in value is depended upon more than just the presence or absence of a fireplace.

The following image is of the MBA/No MBA analysis in the video. You can see the intersection at about a $10,000 salary on the x-axis.

This is known as an interaction between variables, here between the dummy variable for a fireplace and the area of a home.

6a. Conduct a multiple regression incorporating living area, the dummy variable for a fireplace, and an interaction term for living area of a home with the dummy variable for a fireplace. Is the regression statistically significant? Are the coefficients statistically significant? Is the final Adjusted R2 different from previous regressions and the trend lines?

6b. Using the new regression equation including the interaction term, calculate the prices of homes of 1000, 2500, and 6000 SF with and without a fireplace. Produce a table like this:

7. Write a short business report using Word to your instructor with your findings and conclusions. Include

• Key information on each of the 6 questions

• Include appropriate graphs, tables, and references.
• Summary table of the various "values of a fireplace" for the methods you used.

Be sure to answer these questions in your Summary and Conclusions:

• Why are the various estimates of the impact of having or not having a fireplace on the price of a house different?
• What does that comparison indicate about the ability of the regressions to explain the variability of home prices?

Attachment:- Student Instructions.rar

Reference no: EM132536256

Questions Cloud

Find unit cost for each product use activity-based costing : Hakara Company,Use activity-based costing to determine a unit cost for each product. (Round your final answers to 2 decimal places.)
Create a database to keep track of all the courses : The chair for the information technology (IT) department at the University of Denver, needs to create a database to keep track of all the courses
Write a paragraph or two about his biography : Do a little research and write a paragraph or two about his biography and why he is an important Jewish writer. The heart of the paper is a discussion.
What are principles you have personally learned : What are the principles you have personally learned in course that have impacted or will impact your leadership in your workplace, home, church, and community
Write the equation for the regression line : Write the equation for the regression line using the information from the regression output. Using that equation, predict the value of a home
Why company use process cost vs project cost accounting : Why would a company use process cost vs. project cost accounting? How do the two methods differ in terms of accounting for the flow of costs?
Analyze various issues affecting the media business : Analyze various issues affecting the media business. Evaluate the effects of the digital information expansion / explosion on society.
What are the activity-based rates for each area : Locke recorded 50,000 hours of data analysis and 150,000 hours of data entry. What are the activity-based rates for each area of direct labor?
Vertical component of jim velocity : Jim ran up a hill at 7.0 m/s, and the horizontal component of his velocity vector was 5.2 m/s. What was the vertical component of Jim's velocity, in m/s?

Reviews

Write a Review

Applied Statistics Questions & Answers

  Random samples of 25 female and 22 male customers

Q-mart is interested in comparing its male and female customers. Q-mart would like to know if the amount of money spent by its female charge customers differs, on average, from the amount spent by its male charge customers. To answer this question, a..

  Need SPSS analysis for my diploma

Given an online questionnaire done online, results are available as excel sheet. Need below points to be done: Need SPSS analysis for my diploma

  Design and sketch an average control chart

Design and sketch an average control chart, showing upper and lower design limits, upper and lower action limits, and nominal value. Calculate and state the range control limit value.

  Exploration and analysis of data

Write a meaningful essay that includes the following terms: sample, population, confidence level, estimate, mean, margin of error. Discuss the ethical impli

  Calculate descriptive statistics for the unit prices

Visually present data for the unit prices of Zara's products for the three different styles. Calculate descriptive statistics for the unit prices of Zara's products for the three different styles. Comments on the location, shape and variability of..

  Define opportunity loss

Define opportunity loss. What decision-making criteria are used with an opportunity loss table? Explain how a scatter diagram can be used to identify the type of regression to use.

  Perform logistic regression and assess the error rate

STAT 601 Assignment - According to this, create a training set and testing set. Perform logistic regression and assess the error rate

  Data were collected in a clinical trial to compare a new dru

Data were collected in a clinical trial to compare a new drug

  Describe the relationship between probability and odd ratios

Describe the relationship between probability, odds, and odds ratios. Make sure that you give an example (make one up) that illustrates how you would interpret

  Clearly state your hypothesis and conclusions

Clearly state your hypothesis and conclusions.

  What is the new project completion time

What is the new project completion time and what is the new total project cost - Identify what are the time and the path of this minimum cost schedule.

  Determine the appropriate sample size and collect the data

Assignment 1 - Correlation Project. Consider a possible linear relationship between two variables - Determine the appropriate sample size and collect the data

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd