What is the percent accuracy of this tree on training set

Assignment Help Management Information Sys
Reference no: EM131450571

Smart Home Health Analytics Homework

For this assignment you will compare the performance of the naive Bayes, nearest neighbor, and decision tree learners. You will also learn more about the decision tree algorithm.

1. Recall the loan.arff dataset that provides 18 training examples of whether or not a loan is approved for an applicant based on their income, debt and education. Using this data, compute the entropy for the entire dataset and the information gain for each of the three features (Income, Debt, Education) as the top-level feature in a decision tree. Also indicate which of the three features is the best choice for the top-level split feature. Show all your work.

2. Use WEKA to run the J48 decision-tree classifier on the loan.arff dataset. Use the default parameter settings for J48, and use the training set as the test option.

a. Include in your report the printed results (tree and statistics) from WEKA.

b. Draw graphically the decision tree classifier learned by J48.

c. What is the percent accuracy of this tree on the training set?

3. WEKA's default parameter settings for J48 are -C 0.25 -M 2.

a. Explain in your own words what these parameters mean.

b. Find a setting for the -C and -M parameters so that the learned tree achieves 100% accuracy on the training set. Describe the difference between this tree and the one learned in problem 2.

4. Perform the same experiment as in Homework 2 Problem 4, except you should use the five classifiers: NaiveBayes, J48, and IBk (with k=1, k=3 and k=5). IBk is the nearestneighbor classifier. Use default parameters for each (except the K parameter for IBk). As before, include in your report a table giving the percent correctly classified instances in the test split for the five classifiers on each dataset.

5. Compare the performance of the five classifiers based on the results from the previous problem. Specifically, which classifier performs better on which datasets and why. The "why" part should consider the characteristics of the data, the hypothesis space, and the learning algorithm.

6. Turn in your nicely-formatted report (PDF preferred) containing your responses to the above problems in class.

Attachment:- Assignment File.rar

Reference no: EM131450571

Questions Cloud

Why each of the ratio has changed over the three-year period : Describe how and why each of the ratios has changed over the three-year period. For example, did the current ratio increase or decrease? Why?
Describe two issues that undermine the rights of client : Describe two issues that undermine the rights of clients in genetic- and genomic-related decision making and action.
What are three key points of argument : You will need to draft a memorandum to your chief executive identifying the value of a triple bottom line approach, which would represent an enormous shift.
Mutual funds impacted the risk faced by most investors : How have mutual funds impacted the risk faced by most investors in retirement accounts?What risks are or are not eliminated with mutual funds?
What is the percent accuracy of this tree on training set : IS 698/800: Smart Home Health Analytics Homework. For this assignment you will compare the performance of the naive Bayes, nearest neighbor, and decision tree
Identify a historical change or event of nursing practice : Identify a historical change or event that had significant impact on the development of nursing theory. Discuss the effect of the change/event on nursing.
Find bank profit-the return on equity and return on assets : Find bank profit, the return on equity, and return on assets.
Define valuable long-term sustainability for the firm : Now that you have an understanding of corporate culture and the variables that impact it, how would you characterize an ethically effective culture.
Credibility risk premium to the required return : She decides to add an extra 1% “credibility” risk premium to the required return as part of her valuation analysis.

Reviews

len1450571

4/4/2017 6:28:37 AM

Assignment Work with following details. Subject : Information System. General Instructions: Use printer paper for your answer sheets. Use blue or black ink. Number each page and write down the total number of pages on the upper right-hand corner of the first page. Thanks.

Write a Review

Management Information Sys Questions & Answers

  Describe in detail how this organization manages components

What personal knowledge management tools does this organization utilize? What steps has this organization taken in securing their information and knowledge? What has this organization done to gain and sustain an advantage over their competitors?

  Write a description of the issues in thecase

This is a group work, please find the attached and answer section a and b in two pages with references. no introduction no conclusion.a. Description of the issues in thecase.b. Overview of the impacts on the company - as much finan..

  What would the syntax look like

Thinking about repetition loops and things we do more than once can help identify something you would store in an array. For instance, if we were to define an array named that contained temperatures for the past 19 days, what would the syntax loo..

  Explain concepts of server virtualization to management

Determine the strategy you would use to explain the concepts of server virtualization to senior management so that they understand the concepts and can form an opinion on the solution. Provide a rationale for using your chosen strategy.

  Social media application edmodo

About social media application Edmodo, What are its major features? How are people using these applications

  Compare various vendors costs and other charges

Create a scenario for a fictional midsized company. Work on your proposed technical solution by describing the network topology required to address all the requirements of the scenario. Prepare a table in which you identify the vendor equipment, c..

  How does push technology differ from spam

How will software-as-a-service (Saas) make use of a personal application service provider?- How does push technology differ from spam?

  Select an information technology or services company

you will select an Information Technology or Services company. Alternately, you may select any other type of company, however, your strategic plan should be limited to the Information Technology or Services department within the company. You may f..

  How do dss facilitate use of analytics

What are analytics? How do DSS facilitate use of analytics? How does bounded rationality impact your decisions each day?

  Provides comprehensive reflection of the learning objectives

Write a 2 - 3 page paper (not including the title and reference pages) which provides a comprehensive reflection of the learning objectives and concepts addressed in the course so far.

  Amortization of intangible assets

During the year, samuels reported net income of $300,000, including amortization of intangible assets of $66,000, depreciation of plant assets of $132,000, and amortization of premium on investment in bonds of $20,000.

  Mock disaster response plan

Mock Disaster Response Plan

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd