Determine the accuracy of each according to the testing set

Assignment Help Other Subject
Reference no: EM132484547

Supervised Learning - OneR models

Part 1. Because we all know that more data is better, I have merged your survey data with that of the previous DM_2018 cohort.
Using the HW5_survey1820.xls dataset:
? Draw a 1R tree for each student Q-attribute (i.e. Q2 - Q8) to predict their rating for the 'ethical' descriptor to the Wired article's subject. What is each tree's error-rating?
? Draw a 1R tree to predict each student Q-attribute from answers for the descriptor 'deceitful'. What is each tree's error-rating?
? In each approach a and b above, which tree(s) seem to give the "best" (i.e. most trustworthy) results? What is the second-best tree for each? Why might you tend to prefer the 2nd-best tree's results to the "best" tree's results?

? Work up and describe the general profile of the person who rates the 'deceitful' descriptor as a 1. As a 2. As a 3. How confident are you of these resulting profiles' accuracy, as applied to CS majors generally?
? Which do you feel to be the most accurate, best-predictive trees?

Part 2. Using the Iris data in HW3_train.csv, discretize each of the numeric ranges for attributes A through D. Draw the resulting 1-R trees and their accuracy ratings according to your training data.
? Use 6 as your minimum-majority value -- that is the minimum size of your majority class in each discrete subset of your numeric data
? Determine the accuracy of each according to the testing set HW3_test.csv.
? Incorporate these results into a table including your accuracy-results from HW3 (K-Means and Fuzzy Classification models). Discuss these results and how they compare.

Part 3. Download the WEKA application to your computer. Use its Explorer module to select the full Iris dataset (provided by WEKA as an ARFF file). Use the entire dataset as a training-set; do not worry about using a test-file. Use WEKA's OneR classifier to determine the best single-attribute predictor for irises.

Add this result to your model accuracy-comparison table from #2 above, and discuss its placement among the previous three.

Survey Questions

For your reference, these were the survey questions in the Qualtrics survey you took earlier this semester:

Question 1 Rate the following attributes on a scale of 1 (least applicable) to 3 (most applicable)
:
:

Question 2 Are you currently (or have you been) in a long-term relationship?

Question 3 What is your gender?

Question 4 Are you a CS major?

Question 5 Are you 22 years old (or older)?

Question 6 Is/Was your hometown community of population 20,000 or less?

Question 7 Is your most recent cumulative GPA 3.0 or above?

Question 8 Will you have graduated by Summer '20?

Attachment:- Supervised Learning.rar

Reference no: EM132484547

Questions Cloud

Difference between incremental cost and marginal cost : What is the difference between incremental cost and marginal cost
What is the adjusting journal entry : On July 31, a physical count of supplies revealed that there was $2,200 on hand. What is the adjusting journal entry that On-Time Truckers should make
What are the equilibrium price and equilibrium quantity : What are the equilibrium price and equilibrium quantity in the ice cream market? Confirm your answer by graphing the demand and supply curves.
Prepare the journal entry for the purchase on december : The computer is expected to have a 5-year life and a $70,000 residual value. Prepare the journal entry for the purchase on December
Determine the accuracy of each according to the testing set : Determine the accuracy of each according to the testing set and Draw a 1R tree to predict each student Q-attribute from answers for the descriptor
Determine the effect of the tariff : If Canada and China are allowed to trade with one another, what will be the price and the volume of trade?
Question - Prepare Journal Entry : Question - Prepare Journal Entry. Dec. 12 Acquired additional equipment worth $24,000 by paying $500 cash and giving a long-term note payable for the balance
Calculate the nonlinear and linear regressions : Calculated the nonlinear and linear regressions. Given that the Standard Error of the Estimate (SEE) of the Nonlinear equation is 12.04
Identify the type of market failure : Identify the type of market failure. Is it a problem of negative externalities, positive externalities, public goods, or common resources?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd