Write a modified classification algorithm for decision trees

Assignment Help Basic Computer Science
Reference no: EM131678153

Question: The standard DECISION-TREE-LEARNING algorithm described in the chapter does not handle cases in which some examples have missing attribute values.

a. First, we need to find a way to classify such examples, given a decision tree thal. includes tests on the attributes for which values can be missing. Suppose that an example X has a missing value for attribute A and that the decision tree tests for A at a node that X reaches. One way to handle this case is to pretend that the example has all possible values for the attribute, but to weight each value according to its frequency among all of the examples that reach that node in the decision tree. The classification algorithm should follow all branches at any node for which a value is missing and should multiply the weights along each path. Write a modified classification algorithm for decision trees that has this behavior.

b. Now modify the information gain calculation so .that in arry given collection of examples C at a given node in the tree during the construction process, the examples with missing values for any of the remaining attributes are given "as-if" values according to the frequencies of those values in the set C.

Reference no: EM131678153

Questions Cloud

How technology has enhanced the social roles of the female : You have just conducted an interview with new client. Develop a solid stand on the issue. How technology has enhanced the social roles of the female population.
Discuss about your role in the global society : How do you think your role in the global society will continue to evolve in the future? This could be a personal reflection about your place of business.
Technique give an age that is much too young : Why does this technique give an age that is much too young? (Note: there are at least 2 reasons that we discussed in class)
Residence time of water in the ocean : what is the residence time of water in the ocean in years? show all work
Write a modified classification algorithm for decision trees : First, we need to find a way to classify such examples, given a decision tree thal. includes tests on the attributes for which values can be missing.
What would you define as safe and unsafe : What do students learn about their digital footprint, How does the teacher address misconceptions of online safety
How about the second statistician who use maximum hypothesis : Two statisticians go to the doctor and are both given the same prognosis: A 40% chance that the problem is the deadly disease A, and a 60% chance of the fatal.
Determine the age of the earth : Many different techniques were used during the 1800's to determine the age of the Earth. One technique was to determine the volume of the ocean basins
How many drapes would the firm have to clean to break even : Given these data, if Everclean's variable costs were reduced to $50 per drape, how many drapes would the firm have to clean to break even?

Reviews

Write a Review

Basic Computer Science Questions & Answers

  What is a downside of using bagging

How does bagging contribute to a reduction in the prediction error?

  Appendix a for the grading rubric

The key to this assignment is to demonstrate your understanding of the topics, not to re-word the text or reference material. Please see Appendix A for the grading rubric on all written assignments.Please complete the scenario below following these g..

  Two strings and print a statement

Compare the two strings and print a statement to the console stating whether the two strings are equal.

  Five to seven elements at the same time

Psychologists note that people have a hard time processing more than five to seven elements at the same time unless the elements are broken into categories.

  Calculates and displays the body mass index

Write a Java application that calculates and displays the body mass index (BMI) for N people. N should be declared as a constant and should be equal to the largest digit of your student id number

  Local and stochastic volatility

What is a volatility surface and how does it point in general to the limitations of the Black-Scholes model? Discuss.

  Techniques for reading or writing files in php

1. Describe two techniques for reading or writing files in PHP. 2. Why we need to always sanitize user inputs before using them in your queries?

  Determine the number of sub strings

Determine the number of sub strings that start with the character 'I' and end with 'E' in the word below. Show all the work. I N T E L L I G E N C E

  List the binary values in register a and the carry flip-flop

The carry flip-flop is initially reset to 0. List the binary values in register A and the carry flip-flop after each shift.

  Can you come up with a method to overcome this issue

To understand why angle-based outlier detection is a heuristic method, give an example where it does not work well. Can you come up with a method to overcome this issue?

  Modelled by the logistic equation

Suppose a population is modelled by the logistic equation such that the population after t years is given by 5000 p ( t )= for some constant k. If the initial population is doubled after 1+3 e-kt one year, then what is the population after two yea..

  Explaining basic forensic procedures

Write a 1-page summary explaining basic forensic procedures and how they can be applied to your future IT career.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd