Examine the various predictor variables

Reference no: EM132343841

Assignment -

Answer the following questions. Upload your answers in a neat, easy-to-read Word document. Move all graphs, charts, and tables into that single document; do not upload spreadsheets. Be sure to read this week's written lecture for links and other helpful information. The necessary datasets are linked. These questions ask you to explain, describe, or outline something in addition to the program output. The essay parts count for about 50 percent of the points in your answer, so be sure to include well-considered, detailed explanation and discussion in your own words. Use APA-style references and citations if needed. Copying and pasting, or similar plagiarism or cheating, will result in zero points on the entire assignment. These questions are from Chapter 6 of Shmueli, Bruce, and Patel.

1. The file BostonHousing (see attached) contains census information concerning housing in Boston, MA. The dataset has information on 506 housing tracts. It contains 12 predictor variables and one outcome variable, MEDV, the median house price. See the text for a table of variable descriptions, or run Analytic Solver (XLMiner) or similar software to view the data description. The following questions refer to this dataset.

2. Why is the data partitioned into training and validation sets as part of the data mining process? What is the purpose of each?
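
As a rough illustration of what partitioning means in practice for question 2, the Python sketch below shuffles the record indices and splits them into a training set (used to fit the model) and a validation set (held back to check how well the fitted model predicts unseen records). The 60/40 split, the seed, and the function name `partition` are assumptions made for this example; the assignment does not specify them. Placeholder row numbers stand in for the 506 BostonHousing records.

```python
import random


def partition(rows, train_fraction=0.6, seed=1):
    """Randomly split rows into a training list and a validation list."""
    indices = list(range(len(rows)))
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(indices)
    cut = int(round(train_fraction * len(rows)))
    train = [rows[i] for i in indices[:cut]]
    valid = [rows[i] for i in indices[cut:]]
    return train, valid


# Placeholder records: row numbers 0..505 stand in for the 506 housing tracts.
records = list(range(506))
train, valid = partition(records)
print(len(train), len(valid))
```

Every record lands in exactly one of the two sets, so the validation records play no part in estimating the model.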

3. Fit a multiple linear regression model to the median house price as a function of CRIM, CHAS, and RM using Analytic Solver or SPSS Modeler. Use the coefficient table in the output to write the linear equation predicting the median house price.
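
Question 3 names specific software, so the Python sketch below is only an illustration of how a coefficient table turns into a prediction equation. It assumes NumPy is installed and fits ordinary least squares to synthetic data, because the attached file is not reproduced here. The "true" coefficients in the code are invented for the example, so the printed equation is not the answer for the real BostonHousing data.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 506  # same number of rows as the dataset, but the values are synthetic

# Synthetic stand-ins for the three predictors (assumed distributions).
crim = rng.exponential(3.0, size=n)
chas = rng.integers(0, 2, size=n)  # 0/1 dummy variable
rm = rng.normal(6.3, 0.7, size=n)

# Invented coefficients used only to generate an outcome for the demo.
true_coef = np.array([-30.0, -0.25, 4.0, 8.5])
medv = (true_coef[0] + true_coef[1] * crim + true_coef[2] * chas
        + true_coef[3] * rm + rng.normal(0, 0.5, size=n))

# Design matrix with a leading column of ones for the intercept.
X = np.column_stack([np.ones(n), crim, chas, rm])
coef, *_ = np.linalg.lstsq(X, medv, rcond=None)
b0, b_crim, b_chas, b_rm = coef

# Each estimated coefficient becomes one term of the linear equation.
equation = (f"MEDV = {b0:.2f} + ({b_crim:.2f})*CRIM"
            f" + ({b_chas:.2f})*CHAS + ({b_rm:.2f})*RM")
print(equation)
```

The same pattern applies to the software output: the intercept is the constant term, and each predictor's coefficient multiplies that predictor.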

4. Examine the various predictor variables. Which predictors are likely to be measuring the same thing? Discuss the relationships among INDUS, NOX, and TAX.

5. Compute the correlation table for the numerical predictors and look for highly correlated pairs. These could cause multicollinearity. Which ones should be removed?
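
One way to scan a correlation table for highly correlated pairs, as question 5 asks, is sketched below in Python. It assumes NumPy is installed. The data are synthetic: three columns are deliberately built from a shared hidden factor so that they correlate strongly, and one column is independent. The column labels are borrowed from the dataset for readability only, and the 0.7 threshold and the helper name `high_correlation_pairs` are assumptions for this example, not values taken from the text.

```python
import numpy as np


def high_correlation_pairs(data, names, threshold=0.7):
    """Return the correlation matrix and the pairs with |r| >= threshold."""
    corr = np.corrcoef(data, rowvar=False)  # columns are variables
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only
            if abs(corr[i, j]) >= threshold:
                pairs.append((names[i], names[j], float(corr[i, j])))
    return corr, pairs


rng = np.random.default_rng(0)
n = 506
latent = rng.normal(size=n)  # hidden factor shared by three columns

# Synthetic columns; the labels are illustrative, not real data.
names = ["INDUS", "NOX", "TAX", "RM"]
data = np.column_stack([
    latent + 0.3 * rng.normal(size=n),
    latent + 0.3 * rng.normal(size=n),
    latent + 0.3 * rng.normal(size=n),
    rng.normal(size=n),  # independent of the others
])

corr, pairs = high_correlation_pairs(data, names)
for a, b, r in pairs:
    print(f"{a} and {b}: r = {r:.2f}")
```

When several predictors form a highly correlated group like this, a common approach is to keep one of them and drop the rest, choosing the one that is easiest to interpret or measure.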

6. Use exhaustive search to reduce the remaining predictors. Choose the top three models. Run each on the training set and compare their accuracy for the validation set. Compare RMSE, average error, and lift charts. Describe the best model.
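
The workflow in question 6 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the procedure of any particular software package: it assumes NumPy is installed, ranks every predictor subset by adjusted R-squared on the training set (one common criterion), and then reports RMSE and average error on the validation set for the top three. The data are synthetic, the predictor names X1 to X5 are hypothetical, and lift charts are omitted.

```python
from itertools import combinations

import numpy as np


def fit_ols(X, y):
    """Least-squares coefficients; the first entry is the intercept."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict(coef, X):
    return coef[0] + X @ coef[1:]


def adjusted_r2(y, y_hat, n_predictors):
    n = len(y)
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1 - (ss_res / (n - n_predictors - 1)) / (ss_tot / (n - 1))


def exhaustive_search(X_train, y_train):
    """Fit every non-empty subset of columns, best adjusted R-squared first."""
    results = []
    n_cols = X_train.shape[1]
    for k in range(1, n_cols + 1):
        for subset in combinations(range(n_cols), k):
            cols = list(subset)
            coef = fit_ols(X_train[:, cols], y_train)
            fitted = predict(coef, X_train[:, cols])
            results.append((adjusted_r2(y_train, fitted, k), cols, coef))
    results.sort(key=lambda r: r[0], reverse=True)
    return results


# Synthetic data: only X1 and X3 actually drive the outcome.
rng = np.random.default_rng(0)
names = ["X1", "X2", "X3", "X4", "X5"]  # hypothetical predictor names
X = rng.normal(size=(506, 5))
y = 3 + 2 * X[:, 0] - 1.5 * X[:, 2] + rng.normal(0, 1, size=506)

# Simple 304/202 split; the rows are already in random order.
X_train, y_train = X[:304], y[:304]
X_valid, y_valid = X[304:], y[304:]

top_models = []
for score, cols, coef in exhaustive_search(X_train, y_train)[:3]:
    errors = y_valid - predict(coef, X_valid[:, cols])
    rmse = float(np.sqrt(np.mean(errors ** 2)))
    avg_error = float(np.mean(errors))
    top_models.append(([names[c] for c in cols], rmse, avg_error))
    print([names[c] for c in cols],
          f"adj R2={score:.3f} RMSE={rmse:.3f} avg error={avg_error:.3f}")
```

A lower validation RMSE indicates more accurate predictions, and an average error near zero suggests the model does not systematically over- or under-predict. When candidates are close on both measures, the model with fewer predictors is usually preferred.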

Attachment:- Assignment & Data Files.rar






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd