Build a decision tree by applying Hunt's algorithm

Reference no: EM13849372

1. In this part, we are going to build a decision tree classifier in MATLAB to predict the acceptability of cars. The dataset is in the CSV file car.csv. The first six columns are the attributes of the cars, and the last column is the class label denoting the evaluated car acceptability. The attribute information is shown in the following table:

Attribute             Description                   Possible values
--------------------  ----------------------------  -----------------------
Buying                Buying price                  vhigh, high, med, low
Maint                 Maintenance price             vhigh, high, med, low
Doors                 Number of doors               2, 3, 4, 5more
Persons               Passenger capacity            2, 4, more
Lug boot              Size of luggage boot          small, med, big
Safety                Estimated safety of the car   low, med, high
Accept (class label)  Acceptability of the car      unacc, acc, good, vgood

a. All six attributes (excluding the class label) require encoding, i.e. transforming their values into integers. Propose an encoding scheme and implement it as a MATLAB function that performs the encoding. The answer should include the following:

i. The encoding scheme for each attribute (preferably presented as an encoding table)

ii. MATLAB source code for the function implementing the encoding scheme in (i)
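
One possible shape for such an encoding function is sketched below. It is an illustration only, not the required answer: the function name encodeCars, the ordinal scheme (position in each level list becomes the integer code), and the literal spellings such as '5more' and 'more' are assumptions and should be adjusted to match the values actually present in car.csv.

```matlab
function X = encodeCars(raw)
% ENCODECARS  Ordinal-encode the six car attributes (hypothetical helper).
%   raw : N-by-6 cell array of char vectors, columns in the order
%         buying, maint, doors, persons, lug boot, safety.
%   X   : N-by-6 matrix of integer codes (1 = first listed level).

    % Assumed level order per attribute; position in the list is the code.
    levels = { ...
        {'low', 'med', 'high', 'vhigh'}, ...   % buying
        {'low', 'med', 'high', 'vhigh'}, ...   % maint
        {'2', '3', '4', '5more'}, ...          % doors
        {'2', '4', 'more'}, ...                % persons
        {'small', 'med', 'big'}, ...           % lug boot
        {'low', 'med', 'high'}};               % safety

    X = zeros(size(raw, 1), numel(levels));
    for j = 1:numel(levels)
        % ismember returns, for each value, its index in the level list
        [found, code] = ismember(lower(strtrim(raw(:, j))), levels{j});
        if ~all(found)
            error('encodeCars:unknownValue', ...
                  'Column %d contains a value outside the encoding table.', j);
        end
        X(:, j) = code;
    end
end
```

Keeping the level lists in one place means the encoding table in (i) and the code in (ii) cannot drift apart, and unknown values raise an error instead of being silently coded as 0.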

b. Using the first 75% of the tuples for training and the remaining 25% for testing, build a MATLAB decision tree classifier to predict the acceptability of the cars in the testing dataset.

The answer should include the following:

i. MATLAB source code that builds the classifier and predicts car acceptability on the testing dataset. Give clear instructions for running your source code; marks will be deducted otherwise.

ii. The decision tree built, together with its accuracy, sensitivity, and specificity.

Submit your MATLAB source code in separate MATLAB files.

c. Why is the accuracy in (b.ii) much lower than the sensitivity and specificity?
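
The three measures in (b.ii) can be computed from a confusion matrix, as in the sketch below. The function name classMetrics is hypothetical, and because the class label has four values, the sketch assumes one-vs-rest definitions of sensitivity and specificity (one value per class); the assignment does not state which definition is expected.

```matlab
function [acc, sens, spec] = classMetrics(C)
% CLASSMETRICS  Accuracy and per-class sensitivity/specificity (sketch).
%   C    : K-by-K confusion matrix, rows = true class, columns = predicted
%   acc  : overall accuracy (scalar)
%   sens : K-by-1 vector, TP / (TP + FN) for each class (one-vs-rest)
%   spec : K-by-1 vector, TN / (TN + FP) for each class (one-vs-rest)

    total = sum(C(:));
    TP = diag(C);                 % correctly predicted records per class
    FN = sum(C, 2) - TP;          % true class k, predicted as another class
    FP = sum(C, 1)' - TP;         % predicted class k, truly another class
    TN = total - TP - FN - FP;    % everything that does not involve class k

    acc  = sum(TP) / total;
    sens = TP ./ (TP + FN);
    spec = TN ./ (TN + FP);
end
```

Assuming the Statistics and Machine Learning Toolbox is available, a matrix of this form could be obtained with `confusionmat(yTest, predict(fitctree(XTrain, yTrain), XTest))`, where the first `floor(0.75 * N)` rows form the training set and the remaining rows the test set (the variable names are placeholders).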

2. Name two data pre-processing techniques. Discuss their basic principles and give one example of each technique discussed.
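
The question leaves the choice of techniques open. As one possible illustration, the sketch below applies min-max normalization and equal-width binning (discretization) to a numeric vector. The function name preprocessAges and the choice of these two techniques are assumptions made for the example.

```matlab
function [scaled, bins] = preprocessAges(ages, k)
% PREPROCESSAGES  Two simple pre-processing steps (hypothetical helper).
%   scaled : min-max normalization of ages onto [0, 1]
%   bins   : equal-width discretization of ages into k bins, coded 1..k
%   Assumes ages is numeric with at least two distinct values.

    lo = min(ages);
    hi = max(ages);

    % Normalization: (x - min) / (max - min)
    scaled = (ages - lo) / (hi - lo);

    % Discretization: bin width is the range divided by k; the maximum
    % value would land in bin k+1, so it is clamped back to bin k
    width = (hi - lo) / k;
    bins = min(floor((ages - lo) / width) + 1, k);
end
```

For example, with the ages from the survey table in question 3 and k = 3, ages 14 and 17 fall in bin 1 and ages 29, 30, and 36 fall in bin 3.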

3. The following table presents survey results on preferred types of mobile phone game apps.

Age  Gender  Occupation  Preferred type of game apps
---  ------  ----------  ---------------------------
29   F       Manager     Puzzle
25   M       Manager     Action
27   M       Student     Sports
17   M       Student     Action
23   M       Clerk       Puzzle
30   F       Clerk       Sports
14   M       Student     Puzzle
28   F       Clerk       Sports
22   M       Clerk       Action
36   M       Manager     Puzzle

Using the above table as the training dataset, build a decision tree by applying Hunt's algorithm. The class label is the preferred type of game apps; age, gender, and occupation are the attributes. Choose each split as a multi-way split evaluated with the Gini index. Show your steps and calculations clearly.
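
The hand calculations for a multi-way split can be checked with a small helper like the one below. It is a sketch under stated assumptions: the function name giniSplit is hypothetical, each distinct attribute value gets its own branch, and values are compared case-insensitively. It computes the weighted Gini index of one candidate split; it does not build the whole tree.

```matlab
function g = giniSplit(attr, labels)
% GINISPLIT  Weighted Gini index of a multi-way split (illustrative helper).
%   attr   : N-by-1 cell array of char vectors, one attribute value per record
%   labels : N-by-1 cell array of char vectors, the class labels
%   g      : sum over attribute values v of (n_v / N) * Gini(partition v),
%            where Gini(t) = 1 - sum over classes of p(class | t)^2

    % Map strings to integer ids; lower() makes 'manager' match 'Manager'
    [~, ~, a] = unique(lower(attr));
    [~, ~, c] = unique(lower(labels));
    a = a(:);
    c = c(:);

    n = numel(a);
    g = 0;
    for v = 1:max(a)
        inPart = (a == v);                    % records with attribute value v
        counts = accumarray(c(inPart), 1);    % class counts in this partition
        p = counts / sum(counts);             % class proportions
        g = g + (sum(inPart) / n) * (1 - sum(p .^ 2));
    end
end
```

At each node, Hunt's algorithm would evaluate every remaining attribute this way, split on the one with the lowest weighted Gini index, and recurse on each partition that still contains more than one class. Age is numeric, so a multi-way split on it needs either one branch per distinct value or a prior grouping into ranges; the helper does not make that choice.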


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd