Determine the best model to predict the price of a used car

Reference no: EM132262778

Written Answer Part A - Components of a longer report

Introduction

Data
Introduce your data, in one or two paragraphs. In particular, describe the population and sample.

Preliminary Results
A.1 Price Of Two and Three Year Old Cars

Excel Output
Include a copy of your Excel output (that is, your histogram/frequency polygon and table of descriptive statistics) here.

Interpretation - two or three paragraphs

Write a brief interpretation of your graph and descriptive statistics, in particular:
– Discuss the shape, centre and dispersion of your histogram/frequency polygon. What conclusions can you reach about the distribution of prices of two and three year old used cars?
– Discuss and interpret the values of the descriptive statistics. In particular, present conclusions about the centre and variability of prices. Are these values consistent with your histogram/frequency polygon?
– Compare and discuss any difference in the measures of central tendency. Which would you use to represent the average price of a two or three year old used car? Justify this.

A.2 Difference in Price between Cars for Sale Privately and Those for Sale by a Used Car Dealer.
Excel Output
Include a copy of your Excel output (that is, your boxplots and tables of descriptive statistics) here.
Interpretation - one or two paragraphs
Write a brief interpretation of your boxplots and statistics, in particular:
– Discuss the shape, centre and dispersion of your boxplots. What conclusions can you reach about the difference between the distribution of prices of cars sold privately and those sold through a used car dealer?
– Discuss and interpret the values of the descriptive statistics. In particular, present conclusions about the difference in the centre and variability of prices of cars sold privately and those sold through a used car dealer. Are these values consistent with your boxplots?
– Does there appear to be a difference in price between cars sold privately and those sold through a used car dealer? Justify this.

A.3 Relationship between Price and Age, and between Price and Odometer Reading
Excel Output
Include a copy of your Excel output (that is, your scatterplots and correlation coefficients) here.
Interpretation - two or three paragraphs
Write a brief interpretation of your scatter plots and correlation coefficients, in particular:
– Discuss any apparent relationship between age and price. Comment on the strength, shape and sign of the relationship.
– Discuss any apparent relationship between odometer reading and price. Comment on the strength, shape and sign of the relationship.
– Discuss what the values of the correlation coefficients tell you about the strength of the relationships between age and price, and between odometer reading and price. Which is the stronger relationship? Are these values consistent with the scatter plots?

To obtain your data
(1) Click on the Project Data file. This will download an Excel file.
(2) Select the 7 columns (Year to Price) of data for the sample specified by the last digit of your student ID number.
(3) Copy this into a new Excel file.

There are 10 sample data sets, each of 7 columns (Year to Price).
Your sample number matches the last digit of your SCU student ID number. For example, if your student ID number ends in 2, your sample is Sample 2 and you will be analysing used car data for Toyota Corolla cars for sale in Western Australia in columns Q2:W120.

Project Situation

An online consumer group Oz-Price-Watch regularly analyses used car prices in various Australian states.

As a research assistant for Oz-Price-Watch, you are analysing the data for the car and state specified by your sample. For example, if your student ID number ends in 0, your sample is Sample 0 and you will be analysing prices of used Toyota RAV4 (4 cylinder) cars in New South Wales.
You are required to analyse your sample data in response to the given questions and provide a written answer. You can assume that your written answers are components of a longer report on used car prices.

Project Preparation

You are expected to use Excel when completing the project.

Your written answers presenting your findings and conclusions should be considered as a part of a larger report on used car prices. Each written answer should be a Word document into which your Excel output has been copied.

In addition, your statistical workings for Part B should appear as appendices to your written answer. This should include all necessary steps and appropriate Excel output.
Each part of the project should be submitted as a SINGLE Word document, with appropriate Excel output added.

PROJECT - PART A

Part A Preliminary Analysis of Sample Data

Oz-Price-Watch has asked you for a preliminary analysis of your sample data. Your calculations and conclusions from this analysis may be incorporated in your answer for Part B.

Tasks - Part A
Complete the following
1) Download and save your data.
2) Download the Project Part A cover sheets, name and save this file as
"FamilyName_FirstName_Part_A_Campus"
3) Enter your Sample Number on page 2 of the Part A coversheets.
4) Statistical Answers: For used cars of the make and model for sale in the state specified by your sample, perform the following:
a) Price of two and three year old cars
Using Price (7th column of data), explore prices of 2016 and 2017 used cars by using Excel to:
– Construct a frequency histogram or polygon for the price of two and three year old cars.
– Calculate descriptive statistics for the price of two and three year old cars.
Note: The required data for 2016 and 2017 used cars is in the first rows of your sample.

b) Difference in price between cars for sale privately and those for sale by a used car dealer.
Use Price (7th column of data) and Seller (5th column of data), where Private indicates a private sale and Dealer a sale through a used car dealer, for all 115 cars in your sample to explore whether there is a difference in price between the two samples by using Excel to:
– Construct separate boxplots, on the same plot or separately, for private sale prices and for used car dealer prices.
– Calculate descriptive statistics for private sale prices and for used car dealer prices.
Hint: Sort data on Seller to obtain two samples. That is, price of used cars sold privately and price of used cars sold through a used car dealer.

c) Relationship between price and age and between price and odometer reading
Explore the relationship between the price of a used car and its age, and between the price of a used car and its odometer reading, by using Age (2nd column of data) and Odometer (3rd column of data) as independent variables with Price (7th column of data) as the dependent variable for all 115 cars in your sample, by using Excel to:
– Construct scatter plots for Age and Price and for Odometer and Price
– Calculate the correlation coefficient for Age and Price and for Odometer and Price.

5) Written Answer - Preliminary Analysis
Using the instructions given on pages 4 and 5 of the Part A coversheets, introduce your data and the results of your preliminary investigation of the price of used cars of the make and model in the state specified by your sample.
This should be three to five pages and 400 to 800 words.
Use an appropriate style, without statistical jargon and equations, to clearly communicate your results.
6) Complete Coversheets 1 and 2, save and submit Part A of the project online using the Project Part A link in Submit Project by the due date, Tuesday 26 March 2019.
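
The project requires Excel, so the following is only an optional cross-check of the Part A calculations in tasks 4a to 4c, not a substitute for the Excel output. It is a minimal Python sketch using the standard library; the function names (`describe`, `pearson_r`) and all data values are hypothetical and are not taken from the project data file.

```python
import math
import statistics as st

# Hypothetical prices (AUD) for illustration only; NOT from the project data.
prices = [18500, 19990, 17250, 21000, 18990, 20500, 16800, 19500]


def describe(values):
    """Return centre, spread and quartiles for one sample of prices."""
    # method="inclusive" interpolates between the sample minimum and maximum.
    q1, _, q3 = st.quantiles(values, n=4, method="inclusive")
    return {
        "n": len(values),
        "mean": st.mean(values),
        "median": st.median(values),
        "stdev": st.stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "q1": q1,
        "q3": q3,
        "max": max(values),
    }


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = st.mean(x), st.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ages (years) and matching prices for the correlation task.
ages = [1, 2, 3, 4, 5]
age_prices = [30000, 27000, 24500, 21000, 18500]

for name, value in describe(prices).items():
    print(f"{name}: {value}")
print("r(age, price) =", round(pearson_r(ages, age_prices), 3))
```

For task 4b, `describe` can be called once on the private sale prices and once on the dealer prices, giving the five-number summaries that the boxplots display.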

PROJECT - PART B

Purpose: To apply your knowledge of statistical inference and regression to answer questions about used cars for sale by analysing the data and communicating the results.
Part B Submission
You should submit a single Word document consisting of:
– Part B coversheets
– Written answer as components of a report. This should follow the format given on pages 4 and 5 of Part B coversheets
– Appendices for Part B which contain full statistical working for the required statistical tasks.

Part B Preparation

The graphs, plots and interpretations in Part A may be required in the statistical and written answers in Part B. Therefore, check these and make any required corrections.

While the submission date for Part B is Sunday 19 May 2019, you should be working on Part B during Weeks 6 to 11.

Task 1 Part B - Appendices Statistical Inference and Regression and Correlation Tasks

The following statistical tasks should appear as appendices to your written answers. These should include all necessary steps and appropriate Excel output.
These appendices should come after your written answer within your single Word document for Part B.

Statistical Inference
Choose a level of significance for any hypothesis tests and a level of confidence for any confidence intervals. Enter these values on page 2 of the Part B coversheets along with the sample number from Part A.
For used cars of the make and model for sale in the state specified by your sample answer the following questions using appropriate statistical inference and regression techniques.

Question 1 - Topic 5
Since many buyers wish to purchase a two or three year old used car, Oz-Price-Watch has asked you to provide information on the average price of 2016 and 2017 cars of the make and model for sale in the state specified by your sample.
To enable you to answer this, use Price (7th column of your data) for 2016 and 2017 cars only, your output from Part A and an appropriate statistical inference technique to:
Estimate the population mean price of two and three year old used cars of the make and model for sale in the state specified by your sample.
Note: The required data for 2016 and 2017 cars is in the first rows of your sample.
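
One way to check an Excel answer to Question 1 is a t-based confidence interval for the mean. The sketch below assumes that technique is the one intended, which the brief does not state. The function name `mean_confidence_interval` and the prices are hypothetical. Python's standard library has no t-distribution, so the critical value is passed in from a t-table or from Excel.

```python
import math
import statistics as st


def mean_confidence_interval(values, t_crit):
    """Return (lower, upper) for mean +/- t_crit * s / sqrt(n)."""
    n = len(values)
    mean = st.mean(values)
    margin = t_crit * st.stdev(values) / math.sqrt(n)
    return mean - margin, mean + margin


# Hypothetical prices (AUD) for illustration only; NOT from the project data.
prices = [18500, 19990, 17250, 21000, 18990, 20500, 16800, 19500]

# Two-tailed 95% critical value of t with n - 1 = 7 degrees of freedom,
# read from a standard t-table.
T_CRIT_95_DF7 = 2.365

low, high = mean_confidence_interval(prices, T_CRIT_95_DF7)
print(f"95% confidence interval for mean price: {low:.0f} to {high:.0f}")
```

The critical value must be looked up again for the real sample, because it depends on both the chosen confidence level and the number of 2016 and 2017 cars.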

Question 2 - Topic 6
Many buyers believe that white cars are safer since they are more visible. Therefore, they wish to purchase a white car. Oz-Price-Watch has asked you to explore if restricting a purchase to white cars will limit a buyer's choice. Past research by Oz-Price-Watch has shown that if a search is restricted to a feature (for example, colour or transmission) that at most 30% of cars for sale have, then buyer choice is limited.
To provide a justified answer to the question, use White (6th column of data, where Yes = car for sale is white and No = car for sale is not white) for ALL 115 cars in your sample and an appropriate statistical inference technique to answer the following question:
Are more than 30% of used cars of the make and model for sale in the state specified by your sample white?
Hint: Sort data on White to enable you to easily count the number of white cars in your sample.
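
The question "are more than 30%... white?" can be framed as an upper-tailed one-sample z-test for a proportion, with null hypothesis p = 0.30. The sketch below assumes that is the intended technique. The function name `proportion_z_test` and the count of 45 white cars are hypothetical; only the sample size of 115 and the 30% threshold come from the brief.

```python
import math
from statistics import NormalDist


def proportion_z_test(successes, n, p0):
    """Upper-tailed z-test of H0: p = p0 against H1: p > p0.

    Returns (sample proportion, z statistic, p-value).
    """
    p_hat = successes / n
    std_error = math.sqrt(p0 * (1 - p0) / n)  # standard error under H0
    z = (p_hat - p0) / std_error
    p_value = 1 - NormalDist().cdf(z)
    return p_hat, z, p_value


# Hypothetical count: suppose 45 of the 115 cars in the sample are white.
p_hat, z, p_value = proportion_z_test(successes=45, n=115, p0=0.30)
print(f"sample proportion = {p_hat:.3f}, z = {z:.2f}, p-value = {p_value:.4f}")
```

The p-value is then compared with the level of significance entered on page 2 of the Part B coversheets.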

Question 3 - Topic 7
Oz-Price-Watch wishes to know if there is a difference in price between cars for sale privately and those for sale by a used car dealer.
To provide a justified answer to this question, use Price (7th column of data) and Seller (5th column of data) for all 115 cars in your sample, your output from Part A and an appropriate statistical inference technique to answer the following question:
Is there a difference in the average price of cars, of the specified make and model for sale in the specified state, for sale privately and by a used car dealer?
Hint: Sort data on Seller to easily obtain two samples - Prices for private sellers and for used car dealers.
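
A difference in two average prices is commonly tested with a two-sample t-test. The sketch below uses the unequal-variances (Welch) form, which is an assumption; the course may instead expect the pooled, equal-variances form. The function name `welch_t` and both price lists are hypothetical. The function returns only the test statistic and degrees of freedom, leaving the p-value or critical value to a t-table or Excel.

```python
import math
import statistics as st


def welch_t(sample_a, sample_b):
    """Welch two-sample t statistic and its approximate degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    # Each term is a sample variance divided by its sample size.
    term_a = st.variance(sample_a) / na
    term_b = st.variance(sample_b) / nb
    t = (st.mean(sample_a) - st.mean(sample_b)) / math.sqrt(term_a + term_b)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (term_a + term_b) ** 2 / (
        term_a ** 2 / (na - 1) + term_b ** 2 / (nb - 1)
    )
    return t, df


# Hypothetical prices (AUD) for illustration only; NOT from the project data.
private_prices = [17500, 18200, 16900, 19400, 18000, 17100]
dealer_prices = [19900, 21000, 18800, 20500, 19500, 22000]

t_stat, df = welch_t(private_prices, dealer_prices)
print(f"t = {t_stat:.2f} with about {df:.1f} degrees of freedom")
```

Because the question asks whether there is a difference in either direction, the statistic would be compared with a two-tailed critical value.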

Questions 4 and 5 Simple and Multiple Linear Regression
Oz-Price-Watch asks you how the value of a used car, of the specified make and model, depreciates.
To answer this you develop a simple linear regression model to predict price from age or odometer reading and a multiple linear regression model to predict price from age, odometer reading and transmission type. Then, to provide a justified answer to Oz-Price-Watch, choose and interpret the linear model that best fits your data.

Question 4 Simple Linear Regression Model Topic 8
From your results in Part A choose either Age or Odometer as an independent variable, to predict Price.
To explore the relationship between the age or odometer reading of a used car and its price, use your output from Part A and Age or Odometer (2nd or 3rd column of data) as an independent variable with Price (7th column of data) as the dependent variable, for all 115 cars in your sample, to develop and then explore a simple linear relationship between the two variables by:
– Calculating the least squares regression line, correlation coefficient and coefficient of determination.
– Interpreting the gradient and vertical intercept of the simple linear regression equation.
– Interpreting the correlation coefficient and coefficient of determination. Are these values consistent with your scatter plot?
Note: You can choose either Age or Odometer as the independent variable in this model.
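
The least squares line, correlation coefficient and coefficient of determination asked for in Question 4 can be cross-checked outside Excel. The sketch below applies the usual least squares formulas; the function name `simple_linear_regression` and the age and price values are hypothetical.

```python
import math
import statistics as st


def simple_linear_regression(x, y):
    """Least squares fit of y = intercept + slope * x.

    Returns (intercept, slope, r, r_squared).
    """
    mx, my = st.mean(x), st.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / math.sqrt(sxx * syy)
    return intercept, slope, r, r * r


# Hypothetical ages (years) and prices (AUD); NOT from the project data.
ages = [1, 2, 3, 4, 5, 6]
prices = [29500, 27200, 24100, 22300, 19000, 17400]

intercept, slope, r, r2 = simple_linear_regression(ages, prices)
print(f"Price = {intercept:.0f} + ({slope:.0f}) x Age")
print(f"r = {r:.3f}, r squared = {r2:.3f}")
```

With Age as the independent variable, the slope is the estimated change in price for each additional year of age, and the intercept is the predicted price at age zero.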

Question 5 Multiple Linear Regression Model Topic 9
To explore what other factors may have an influence on the value of a used car, use your output from Part A and Age, Odometer and Transmission (2nd, 3rd and 4th columns of data) as three independent variables with Price (7th column of data) as the dependent variable for all 115 cars in your sample, to develop and then explore the relationship between these four variables by:
– Calculating the multiple regression equation, multiple correlation coefficient, and coefficient of multiple determination.
– Interpreting the values of the multiple regression coefficients.
– Interpreting the values of the multiple correlation coefficient and coefficient of multiple determination. Compare these values with the corresponding values for the simple linear regression model.
Then determine the best model to predict the price of a used car by:
– Using appropriate tests to determine which independent variables make a significant contribution to the regression model.
– Giving or calculating the simple or multiple regression equation which best fits the data.
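
The multiple regression equation and coefficient of multiple determination can also be cross-checked outside Excel. The sketch below solves the normal equations by Gaussian elimination using only the standard library. The function names (`solve`, `fit_ols`), the data, the choice to measure odometer in thousands of kilometres, and the coding of Transmission as 1 for automatic and 0 for manual are all assumptions for illustration; the brief does not say how Transmission is recorded. The sketch does not compute the significance tests for individual coefficients.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        known = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - known) / m[i][i]
    return x


def fit_ols(rows, y):
    """Fit y = b0 + b1*x1 + ... by least squares; return (coefficients, R^2)."""
    design = [[1.0] + list(r) for r in rows]  # leading 1.0 gives the intercept
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    coef = solve(xtx, xty)
    fitted = [sum(c * v for c, v in zip(coef, d)) for d in design]
    y_bar = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return coef, 1 - ss_res / ss_tot


# Hypothetical rows of (age in years, odometer in '000 km, automatic = 1).
rows = [(1, 12, 1), (2, 30, 0), (3, 41, 1), (4, 65, 1),
        (5, 70, 0), (6, 95, 0), (2, 22, 1), (4, 50, 0)]
# Hypothetical prices (AUD); NOT from the project data.
prices = [28900, 24300, 23600, 20100, 16700, 13400, 26500, 19800]

coef, r_squared = fit_ols(rows, prices)
print("intercept, age, odometer, transmission:", [round(c, 1) for c in coef])
print("R squared:", round(r_squared, 4))
```

Comparing this R squared with the simple regression value shows how much extra variation in price the additional variables explain, although the choice of best model still rests on the significance tests described above.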

Task 2 - Written Answer - Components of a report

For Questions 1, 2 and 3, and for Questions 4 and 5 combined, present the results of your calculations, with your interpretation and conclusions, as components of a longer report on used car prices.
Use the instructions given on pages 4 and 5 of the Part B coversheets.
This should be 500 to 1100 words and three to seven pages.
It should be submitted as a Word file with Excel output included.
Make sure you:
– Introduce each question and put it in context
– Answer each question in non-statistical language.
– Present the result of your calculations and tests without unnecessary statistical jargon
– Include a conclusion which answers the given question.
In particular, for Questions 4 and 5
– Mention or explain your choice of independent and dependent variables
– Include and justify the best model.
– Discuss and interpret the values of the regression and correlation coefficients of the best model.

Attachment:- Statistical Analysis Project.zip


Reviews

• If the page setup of your submitted Word file is not as required (that is, A4 Portrait, with appropriate format so that it is easily readable if printed, and at least 1.5 line spacing) or your project is not submitted as a single Word document, a maximum of two marks may be deducted, as this causes the marker extra work and frustration.
• If your submitted file is not a Word file, for example it is a pdf or a zip file, a maximum of two marks may be deducted, as this causes the marker extra work and frustration.
• In addition, if your file is not named as requested or the required cover sheets are not included or correctly completed, a maximum of two marks may also be deducted, as this can cause the marker extra work and frustration.

Incorrect Sample
• If you use a sample that does not correspond to the last digit of your student ID number, to be entered on the cover sheet, a maximum of two marks may be deducted, as this causes the marker extra work and frustration.

• Each part of the project should be a SINGLE Word file with Excel output included.
• The given cover sheets should be the first pages of your submitted project and are not part of the page limit.
• DO NOT submit your appendices, which are not part of the page or word limit, for Part B as a separate file.
• Ensure that the page setup of your submitted document is A4 Portrait, with an appropriate format so that it is easily readable if printed.
• Use line spacing of at least 1.5.

Notes
• You should not need to read beyond the study guide and textbook to complete the project.
Referencing
You are not required to reference. However, as your written answers are formatted as components of a longer report, it may be appropriate to reference. In this case, use any consistent referencing style. Furthermore, you are not required to use real references; that is, any reference can be fictitious. You are not required to reference any output or text from Part A that you reuse in Part B.

This project leads you through a statistical analysis of used car data. The data for this project was obtained from the car sales
