How is the pruned tree used for classification

Assignment Help Computer Engineering
Reference no: EM131926076

Problem

Predicting Delayed Flights. The file FlightDelays.csv contains information on all commercial flights departing the Washington, DC area and arriving at New York during January 2004. For each flight, there is information on the departure and arrival airports, the distance of the route, the scheduled time and date of the flight, and so on. The variable that we are trying to predict is whether or not a flight is delayed. A delay is defined as an arrival that is at least 15 minutes later than scheduled.

Data Preprocessing. Transform variable day of week (DAY_WEEK) info a categorical variable. Bin the scheduled departure time into eight bins (in R use function cut()). Use these and all other columns as predictors (excluding DAY_OF_MONTH). Partition the data into training and validation sets.

a. Fit a classification tree to the flight delay variable using all the relevant predictors. Do not include DEP_TIME (actual departure time) in the model because it is unknown at the time of prediction (unless we are generating our predictions of delays after the plane takes off, which is unlikely). Use a pruned tree with maximum of 8 levels, setting = 0.001. Express the resulting tree as a set of rules.

b. If you needed to fly between DCA and EWR on a Monday at 7:00 AM, would you be able to use this tree? What other information would you need? Is it available in practice? What information is redundant?

c. Fit the same tree as in (a), this time excluding the Weather predictor. Display both the pruned and unpruned tree. You will find that the pruned tree contains a single terminal node.

i. How is the pruned tree used for classification? (What is the rule for classifying?)

ii. To what is this rule equivalent?

iii. Examine the unpruned tree. What are the top three predictors according to this tree?

iv. Why, technically, does the pruned tree result in a single node?

v. What is the disadvantage of using the top levels of the unpruned tree as opposed to the pruned tree?

vi. Compare this general result to that from logistic regression in the example in Chapter 10. What are possible reasons for the classification tree's failure to find a good predictive model?

Reference no: EM131926076

Questions Cloud

Describe the rise of nazism in germany : Describe the rise of Nazism in Germany. Indicate the conditions present in Germany that made it possible for Hitler to come to power.
Calculate basic descriptive statistics of selected variables : Calculate the basic descriptive statistics of the selected variables. Draw the distributions and box-plots of every variable you have
How can a cyclic object graph be represented using the data : How can a cyclic object graph be represented using the data types described in this chapter? In what ways is a .NET array different from a Sequence?
What is the fee to run the fund : You invested $1,250,000 with a market-neutral hedge fund manager. The fee structure is 2/20, and the fund has a high-water-mark provision.
How is the pruned tree used for classification : How is the pruned tree used for classification? Examine the unpruned tree. What are the top three predictors according to this tree?
How will you decide which department should get equipment : Imagine that you are the administrator of a nonprofit community hospital. You have been approached by medical staff from two different departments.
What role does healthcare leadership play : Prepare an analysis based on what your research can discover, on the role of HIT, including artificial intelligence and supercomputing
Official discourse in the dominican republic : 1. What was the result of the Haitian genocide for the official discourse in the Dominican Republic?
Describe the interesting and uninteresting information : Describe the interesting and uninteresting information that these rules provide. Is this model practical for predicting the outcome of a new auction?

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd