Build a decision tree by applying hunts algorithm

Assignment Help Basic Computer Science
Reference no: EM13849372

1. In this part, we are going to build a decision tree classifier in MATLAB to predict the acceptability of cars. The dataset can be found in the CSV file car.csv. The first six columns are the attributes of the cars and the last column is class label denoting the evaluated car acceptability. The attribute information is shown in the following table:

Attribute

Description

Possible values

buying

Buying Price

Vhigh,high,med,low

Maint

Maintenance Price

Vhigh,high,med,low

Doors

No.of doors

2,3,4,5 more

Persons

Passenger capacity

2,4 more

Lug boot

Size of luggage boot

Small big med

Safety

Estimated safety of the car

Low ,med, high

Accept (class label)

Acceptability of the car

Unacc,acc,good,vgood

a. All the six attributes (except the class label) require encoding (i.e. transforms them into integral values). Propose an encoding scheme, and implement it into a MATLAB function to perform the encoding. The answer should include the followings:

i. Encoding scheme for each attribute (preferably presented as an encoding table)

ii. MATLAB function source codes implementing the encoding scheme in (i)

b. By using the first 75% of the tuples for training, and the remaining 25% for testing, build a MATLAB decision tree classifier to predict the acceptability of the cars in the testing dataset.

The answer should include the followings:

 i. MATLAB source codes building classifier, and car acceptability prediction using the testing dataset. Clear instructions must be given for the execution of your source codes, or marks will be deducted otherwise

ii. The decision tree built, accuracy, sensitivity, and specificity (i)) Submit your MATLAB source codes in separate MATLAB files. c. Why is the accuracy in

(b.ii) much lower than sensitivity and specificity?

2. Name two techniques in data pre-processing. Discuss their basic principles and illustrate one example for each technique discussed.

3. The following table presents the survey results about the preference of the mobile phone game apps.

Age

Gender

Occupation

Preferred type of game apps

29

F

Manager

Puzzle

25

M

Manager

Action

27

M

Student

Sports

17

M

Student

Action

23

M

Clerk

Puzzle

30

F

Clerk

Sports

14

M

Student

Puzzle

28

F

Clerk

Sports

22

M

Clerk

Action

36

M

manager

puzzle

Using the above table as the training dataset, build a decision tree by applying Hunt's algorithm. The class label is the preferred type of game apps. Age, gender, and occupation are the attributes. Split the attributes using multi-way split with GINI index. Show clearly your steps and calculations.

Reference no: EM13849372

Questions Cloud

Describe how performance and quality measures : Describe how performance and quality measures are aligned to an organization's mission, vision, and strategic plan in general. Then, describe how the measures are aligned with the mission, vision, and strategic plan of your organization
Network the critical path is the longest path : In a __________ network the critical path is the longest path through the network.
Budget-constrained and profit-conscious style of evaluation : Understand how MAS can be are used to inform performance evaluation in organisations - evaluate the effectiveness of performance evaluation systems in organisations, both from a technical and behavioural perspective
Working in a business to learn the ropes : You have a two children, A and B. Child A is not going to college but is working in a business to learn the ropes. Child A plans on opening a business someday. Child B is attending college. You put a certain amount of money into an account.
Build a decision tree by applying hunts algorithm : build a decision tree by applying Hunt's algorithm
Analysis of the ratios calculated : Assess Wal-Mart, Stores Inc. concerning liquidity, solvency, profitability, and stock performance as of January 31, 2015. For each area, you should calculate the ratios and provide an analysis of the ratios calculated. Include historical stock pri..
What information could you share with this parent : What information could you share with this parent that would educate her about the importance and the safety of vaccinations? How would you provide this information
What price should newnovatel sell bonds : Newnovetel cellular is issuing bonds pay no interest but can be converted into $9K at maturity in 17 years. To price them competitively, it was determined they should yield eight percent compounded annually. At what price should Newnovatel sell these..
An investment has an expected return : An investment has an expected return of 11 percent per year with a standard deviation of 24 percent. Assuming that the returns on this investment are at least roughly normally distributed, how frequently do you expect to earn between -13 percent and ..

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Identifies the cost of computer

identifies the cost of computer components to configure a computer system (including all peripheral devices where needed) for use in one of the following four situations:

  Input devices

Compare how the gestures data is generated and represented for interpretation in each of the following input devices. In your comparison, consider the data formats (radio waves, electrical signal, sound, etc.), device drivers, operating systems suppo..

  Cores on computer systems

Assignment : Cores on Computer Systems:  Differentiate between multiprocessor systems and many-core systems in terms of power efficiency, cost benefit analysis, instructions processing efficiency, and packaging form factors.

  Prepare an annual budget in an excel spreadsheet

Prepare working solutions in Excel that will manage the annual budget

  Write a research paper in relation to a software design

Research paper in relation to a Software Design related topic

  Describe the forest, domain, ou, and trust configuration

Describe the forest, domain, OU, and trust configuration for Bluesky. Include a chart or diagram of the current configuration. Currently Bluesky has a single domain and default OU structure.

  Construct a truth table for the boolean expression

Construct a truth table for the Boolean expressions ABC + A'B'C' ABC + AB'C' + A'B'C' A(BC' + B'C)

  Evaluate the cost of materials

Evaluate the cost of materials

  The marie simulator

Depending on how comfortable you are with using the MARIE simulator after reading

  What is the main advantage of using master pages

What is the main advantage of using master pages. Explain the purpose and advantage of using styles.

  Describe the three fundamental models of distributed systems

Explain the two approaches to packet delivery by the network layer in Distributed Systems. Describe the three fundamental models of Distributed Systems

  Distinguish between caching and buffering

Distinguish between caching and buffering The failure model defines the ways in which failure may occur in order to provide an understanding of the effects of failure. Give one type of failure with a brief description of the failure

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd