What is the danger of using the best predictive model

Reference no: EM131926047

Problem

Competitive Auctions on eBay. The file eBayAuctions.csv contains information on 1972 auctions transacted on eBay during May-June 2004. The goal is to use these data to build a model that will distinguish competitive auctions from noncompetitive ones. A competitive auction is defined as an auction with at least two bids placed on the item being auctioned. The data include variables that describe the item (auction category), the seller (his or her eBay rating), and the auction terms that the seller selected (auction duration, opening price, currency, day of week of auction close). In addition, we have the price at which the auction closed. The goal is to predict whether or not an auction of interest will be competitive.

Data preprocessing. Create dummy variables for the categorical predictors. These include Category (18 categories), Currency (USD, GBP, Euro), End Day (Monday-Sunday), and Duration (1, 3, 5, 7, or 10 days).

a. Create pivot tables for the mean of the binary outcome (Competitive?) as a function of the various categorical variables (use the original variables, not the dummies). Use the information in the tables to reduce the number of dummies that will be used in the model. For example, categories that appear most similar with respect to the distribution of competitive auctions could be combined.
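
The pivot-table step in part (a) can be sketched as follows. This is only an illustration in plain Python rather than R; the rows are made-up toy records, not values from eBayAuctions.csv, and the column names `category` and `competitive` are assumptions.

```python
from collections import defaultdict

def mean_outcome_by_level(rows, factor, outcome):
    """Return {level: mean of the 0/1 outcome} for one categorical column."""
    totals = defaultdict(lambda: [0, 0])  # level -> [sum of outcome, row count]
    for row in rows:
        acc = totals[row[factor]]
        acc[0] += row[outcome]
        acc[1] += 1
    return {level: s / n for level, (s, n) in totals.items()}

# Toy records (hypothetical, for illustration only).
toy_rows = [
    {"category": "Books", "competitive": 1},
    {"category": "Books", "competitive": 0},
    {"category": "Toys", "competitive": 1},
    {"category": "Toys", "competitive": 1},
    {"category": "Music", "competitive": 0},
    {"category": "Music", "competitive": 1},
]

rates = mean_outcome_by_level(toy_rows, "category", "competitive")
# Print levels sorted by rate so that similar levels sit next to each other.
for level, rate in sorted(rates.items(), key=lambda kv: kv[1]):
    print(f"{level}: {rate:.2f}")
```

Levels whose rates are close (here the toy "Books" and "Music" levels) are the natural candidates for sharing one dummy variable.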

b. Split the data into training (60%) and validation (40%) datasets. Run a logistic model with all predictors with a cutoff of 0.5.
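
A minimal sketch of the 60/40 partition and the 0.5 cutoff from part (b), again in plain Python for illustration. The seed value, the function names, and the convention that a probability exactly equal to the cutoff counts as competitive are all assumptions; the integers stand in for the 1972 auction records.

```python
import random

def split_train_valid(rows, train_frac=0.6, seed=1):
    """Shuffle a copy of the rows and split it into training and validation parts."""
    rng = random.Random(seed)  # fixed seed (an assumption) so the split is reproducible
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_train = round(train_frac * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]

def classify(probabilities, cutoff=0.5):
    """Turn predicted probabilities into 0/1 classes; p >= cutoff maps to 1."""
    return [1 if p >= cutoff else 0 for p in probabilities]

records = list(range(1972))  # placeholders for the 1972 auctions
train, valid = split_train_valid(records)
print(len(train), len(valid))  # 1183 789
```

The probabilities passed to `classify` would come from whatever logistic model is fitted on the training part; the fitting itself is not shown here.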

c. If we want to predict at the start of an auction whether it will be competitive, we cannot use the information on the closing price. Run a logistic model with all predictors as above, excluding price. How does this model compare to the full model with respect to predictive accuracy?

d. Interpret the meaning of the coefficient for closing price. Does closing price have a practical significance? Is it statistically significant for predicting competitiveness of auctions? (Use a 10% significance level.)
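
As a reminder of how a logistic coefficient is read in part (d), the relationship can be written out as below. The symbols $p$, $\beta_0$, and $\beta_{\text{price}}$ are notation assumed here, not taken from the problem statement, and no fitted values are implied.

```latex
% p is the probability that an auction is competitive.
\[
  \log\frac{p}{1-p} \;=\; \beta_0 + \beta_{\text{price}}\,\text{ClosePrice} + \cdots
\]
% Holding the other predictors fixed, a one-unit increase in ClosePrice
% multiplies the odds of a competitive auction by e^{\beta_{\text{price}}}.
\[
  \frac{\text{odds}(\text{ClosePrice}+1)}{\text{odds}(\text{ClosePrice})}
  \;=\; e^{\beta_{\text{price}}}
\]
```

Statistical significance is judged from the coefficient's p-value against the 10% level, while practical significance depends on how large the odds multiplier is over a realistic change in price; the two need not agree.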

e. Use stepwise selection (function step() in the stats package or function stepAIC() in the MASS package) and an exhaustive search (function glmulti() in the glmulti package) to find the model with the best fit to the training data. Which predictors are used?

f. Use stepwise selection and an exhaustive search to find the model with the lowest predictive error rate (use the validation data). Which predictors are used?
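
To make the idea of an exhaustive search in parts (e) and (f) concrete, here is a small Python sketch, not the glmulti() procedure itself. The predictor names are illustrative, and `toy_error` is a made-up placeholder score; in practice it would be replaced by the validation error rate of a model fitted on each subset.

```python
from itertools import combinations

def all_subsets(predictors):
    """Yield every non-empty subset of the predictors as a tuple."""
    for k in range(1, len(predictors) + 1):
        for combo in combinations(predictors, k):
            yield combo

def best_subset(predictors, score):
    """Return the subset with the lowest score (e.g. validation error)."""
    return min(all_subsets(predictors), key=score)

predictors = ["Duration", "OpenPrice", "EndDay", "Currency"]  # illustrative names

def toy_error(subset):
    # Placeholder score (an assumption): rewards including OpenPrice,
    # penalises larger subsets. It does not come from any fitted model.
    return 0.30 - 0.05 * ("OpenPrice" in subset) + 0.01 * len(subset)

n_subsets = len(list(all_subsets(predictors)))
best = best_subset(predictors, toy_error)
print(n_subsets, best)
```

With p predictors there are 2^p - 1 non-empty subsets, which is why exhaustive search becomes expensive quickly and stepwise selection is often used as a cheaper approximation.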

g. What is the danger of using the best predictive model that you found?

h. Explain why the best-fitting model and the best predictive model are the same or different.

i. If the major objective is accurate classification, what cutoff value should be used?
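
One way to approach part (i) is to scan candidate cutoffs and compare the misclassification rate of each on the validation data. The sketch below uses plain Python and made-up toy probabilities, not output from any model fitted to the eBay data; the function names and the `>=` convention are assumptions.

```python
def error_rate(actual, probabilities, cutoff):
    """Fraction of records misclassified when p >= cutoff is labelled 1."""
    predicted = [1 if p >= cutoff else 0 for p in probabilities]
    wrong = sum(1 for a, y in zip(actual, predicted) if a != y)
    return wrong / len(actual)

def best_cutoff(actual, probabilities, candidates):
    """Return the candidate cutoff with the lowest error rate (first one on ties)."""
    return min(candidates, key=lambda c: error_rate(actual, probabilities, c))

# Toy validation outcomes and predicted probabilities (hypothetical).
toy_actual = [0, 0, 1, 0, 1, 1]
toy_probs = [0.12, 0.35, 0.42, 0.55, 0.72, 0.91]
candidates = [i / 10 for i in range(1, 10)]  # 0.1, 0.2, ..., 0.9

for c in candidates:
    print(f"cutoff {c:.1f}: error {error_rate(toy_actual, toy_probs, c):.3f}")
chosen = best_cutoff(toy_actual, toy_probs, candidates)
```

Overall error rate treats both kinds of mistake as equally costly, which matches the stated objective of accurate classification; a different objective would call for a different criterion.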

j. Based on these data, what auction settings set by the seller (duration, opening price, ending day, currency) would you recommend as being most likely to lead to a competitive auction?

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd