Problem of interest of buying a used car

Reference no: EM131973448

The Project -

This project concerns a problem of practical interest: buying a used car. The calling price of a used car varies with the year of production, the mileage, and the specific kind of car (brand and body type, e.g. Honda Civic Sedan), as well as with some random factors.

The purpose of this project is to examine the relationship between the mean calling price E(y) (the price asked for by the owner) of a specific kind of car and the following independent variables:

1. X1 (quantitative): The number of years since production; e.g. if a car was produced in 2010, then X1 = 2017 - 2010 = 7.

2. X2 (quantitative): The original price of the car when it was brand new.

3. X3 (quantitative): The current mileage of the car.

4. X4 (qualitative): Title (Clean vs not clean).

Choose a specific type of car (for example, Honda Civic sedan or Chevrolet Malibu) and collect your sample data with sample size n ≥ 30 (you can decide how many observations to include, but the number must be greater than or equal to 30). Make sure your data contains the quantitative and qualitative variables above.
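One possible way to lay out the collected sample in R is sketched here. The column names (price, age, orig_price, mileage, title) are assumptions chosen for illustration, and every value is simulated as a placeholder; real work would replace the simulation with the listings you actually collect.

```r
# Placeholder data only: all values below are simulated, not real listings.
set.seed(1)
n <- 30                                            # the brief requires n >= 30

age        <- sample(1:12, n, replace = TRUE)      # X1: 2017 minus production year
orig_price <- round(runif(n, 18000, 24000))        # X2: price when brand new
mileage    <- round(pmax(1000, age * 11000 + rnorm(n, 0, 8000)))  # X3: current mileage
title      <- factor(rep(c("clean", "clean", "clean", "not clean"),
                         length.out = n))          # X4: qualitative, two levels

# Simulated calling price (y); the generating formula is arbitrary.
price <- round(orig_price * exp(-0.08 * age - 2e-6 * mileage -
                                0.15 * (title == "not clean") +
                                rnorm(n, 0, 0.08)))

cars <- data.frame(price, age, orig_price, mileage, title)
str(cars)                                          # confirm types: title must be a factor
```

Storing the title as a factor lets lm() create the dummy variable for the qualitative predictor automatically.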

The objectives of this project are as follows:

1. Hypothesize a model for the calling price and the predictors (if necessary, consider interaction effects).

2. Run a variable selection procedure to choose the most important x's (stepwise regression, or the all-possible-regressions selection procedure).

3. For the x's selected in step 2, fit the regression model you proposed in step 1. Conduct t-tests on the important β's; compare adjusted R² values; compare 2s values.

4. Propose and fit other candidate models. Determine the best model for E(y) using the nested-model F-test (hint: use the anova() function in R for the nested F-test).

5. Based on the best model selected in step 4, perform residual analysis to check the assumptions on ε (whether or not the ε's are independent draws from N(0, σ²)). (Hint: for the normality assumption, use both a Q-Q plot and residual plots; code will be provided in later chapters.)

6. Remedy your model if you detect a violation of the assumptions on ε, then redo steps 1 through 5.

7. Assess the adequacy of the best model by checking that the global F-test is significant, the adjusted R² is high, and the 2s value is small.
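The objectives above can be sketched in base R roughly as follows. This is only an illustration under stated assumptions: the data are simulated, the column names are hypothetical, and the single interaction term (age:mileage) is an arbitrary choice. Note also that step() selects by AIC, which may differ from a p-value-based stepwise procedure taught in class; an all-possible-regressions search would need something beyond this sketch (for example the leaps package).

```r
# Simulated placeholder data (assumed column names; not real listings).
set.seed(42)
n <- 40
age        <- sample(1:12, n, replace = TRUE)
orig_price <- runif(n, 18000, 24000)
mileage    <- pmax(1000, age * 11000 + rnorm(n, 0, 8000))
title      <- factor(rep(c("clean", "clean", "clean", "not clean"), length.out = n))
price      <- orig_price * exp(-0.08 * age - 2e-6 * mileage -
                               0.15 * (title == "not clean") + rnorm(n, 0, 0.08))
cars <- data.frame(price, age, orig_price, mileage, title)

# Objective 1: a hypothesized model with one interaction term.
full <- lm(price ~ age + orig_price + mileage + title + age:mileage, data = cars)

# Objective 2: stepwise selection (AIC-based) between the null and full models.
null     <- lm(price ~ 1, data = cars)
selected <- step(null, scope = formula(full), direction = "both", trace = 0)

# Objective 3: t-tests on the betas, adjusted R^2, and 2s.
sel_sum <- summary(selected)
print(sel_sum$coefficients)          # estimate, std. error, t value, p-value
print(sel_sum$adj.r.squared)
print(2 * sel_sum$sigma)             # 2s, where s is the residual standard error

# Objective 4: nested-model F-test, main effects only vs. with interaction.
main   <- lm(price ~ age + orig_price + mileage + title, data = cars)
nested <- anova(main, full)
print(nested)
p_nested <- nested$`Pr(>F)`[2]
best <- if (p_nested < 0.05) full else main   # keep the interaction only if it helps

# Objective 5: residual analysis on the chosen model.
plot(fitted(best), resid(best), xlab = "Fitted values", ylab = "Residuals")
abline(h = 0, lty = 2)
qqnorm(resid(best))
qqline(resid(best))

# Objective 7: global F-test, adjusted R^2, and 2s for the chosen model.
best_sum <- summary(best)
fstat    <- best_sum$fstatistic
global_p <- unname(pf(fstat[1], fstat[2], fstat[3], lower.tail = FALSE))
two_s    <- 2 * best_sum$sigma
```

The 0.05 cutoff in the nested F-test is a conventional choice, not something the brief specifies.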

Format of Your Work -

Your work should be clear and easy to understand. Use the following format:

1. Statement of The Problem: You need to state your research question here. That is, tell us what your study is about and your purpose of the study (around 100 words).

2. The Data: Specify how you collected the data and summarize your sample data using the methods we learned in descriptive statistics.

(1) The following table must be included.

(2) Scatter plots: X1 versus Y; X2 versus Y; X3 versus Y.

Histogram: the histogram of the calling price Y.

3. The Models: Specify the hypothesized models you want to apply. In this part, you are expected to complete the first four objectives stated above. Hint: the first model you fit might not be ideal; you might need to improve it by selecting variables, changing the order of the model, considering interaction effects, etc. Compare all the models you fitted and explain why your chosen model is the best by checking the nested-model F-test, the t-tests on the important β's, the adjusted R² values, and the 2s values.

4. Assumption check: In this part, carry out objective 5 stated above and write down your conclusion.

5. Model Remedy: In this part, transform y or x so that the model assumptions are satisfied, and write down the new model and conclusion.

6. Model adequacy: In this part, you are expected to complete the last objective stated above.

7. Conclusion: Give a brief summary of your study.
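For the data summary (part 2) and the remedy (part 5), a minimal R sketch might look like this. It again uses simulated placeholder data with assumed column names, and the log transformation of y is just one common remedy, not necessarily the one your residual plots will call for.

```r
# Simulated placeholder data (assumed column names; not real listings).
set.seed(7)
n <- 40
age        <- sample(1:12, n, replace = TRUE)
orig_price <- runif(n, 18000, 24000)
mileage    <- pmax(1000, age * 11000 + rnorm(n, 0, 8000))
title      <- factor(rep(c("clean", "clean", "clean", "not clean"), length.out = n))
price      <- orig_price * exp(-0.08 * age - 2e-6 * mileage -
                               0.15 * (title == "not clean") + rnorm(n, 0, 0.08))
cars <- data.frame(price, age, orig_price, mileage, title)

# Part 2: descriptive summary, scatter plots of each X against Y, histogram of Y.
print(summary(cars))
op <- par(mfrow = c(2, 2))
plot(price ~ age,        data = cars, xlab = "X1: years since production", ylab = "Calling price")
plot(price ~ orig_price, data = cars, xlab = "X2: original price",         ylab = "Calling price")
plot(price ~ mileage,    data = cars, xlab = "X3: current mileage",        ylab = "Calling price")
hist(cars$price, main = "Calling price", xlab = "Calling price")
par(op)

# Part 5: one possible remedy, refitting with log(y) as the response.
fit_raw <- lm(price ~ age + orig_price + mileage + title, data = cars)
fit_log <- lm(log(price) ~ age + orig_price + mileage + title, data = cars)

# Compare the residual plots of the two fits side by side.
op <- par(mfrow = c(1, 2))
plot(fitted(fit_raw), resid(fit_raw), main = "Raw y",  xlab = "Fitted", ylab = "Residuals")
abline(h = 0, lty = 2)
plot(fitted(fit_log), resid(fit_log), main = "log(y)", xlab = "Fitted", ylab = "Residuals")
abline(h = 0, lty = 2)
par(op)
```

After transforming y, the adjusted R² and s of the two fits are on different scales, so they should not be compared directly; judge the remedy by the residual and Q-Q plots instead.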

Others -

This project is composed of 7 parts (see Format of Your Work).

All the analysis should be done in R. You should collect your data and write your code yourself. A project report that does not include the analysis and R code will not be graded. Your work should be a PDF file that contains your analysis and R code.

Attachment:- Assignment File.rar


Reviews

len1973448

5/7/2018 3:46:26 AM

There are 7 questions and all of them need to be answered, please. When you start working on it and collecting the data from the sources attached in the project document, you will need to check the dealer's website for the original price of each car. Everything else, including what you need to do and how to complete it, is in the attached document. Basically, the project requires choosing one car, such as a Honda Civic (you need to be specific about the model), and collecting data on it. Looking forward to hearing back from you about the quote price. The work is urgent, please. Thank you.

