Explaining accuracy when using training set data

Assignment Help Basic Computer Science
Reference no: EM1347446

Q1) Use the following learning schemes to analyze the attached files adult datasets (here is the description of the data). The files were converted to Weka file adult-Train.arff and adult-Test.arff.

ZeroR (majority class)

Naive Bayes Simple

J4.8

RandomForest

Target field is CLASS (i.e., income). For test options, first choose "Use training set", and then choose "Percentage Split" using default 66% percentage split.

1. Report each classification model, percent error rate, and accuracy. Because each learning schemes will be modeled two different test options, in total you should include 8 different classification models in your reports. For example, for ZeroR learning schema, two models will be generated one with "Use training set" and the other with "Percentage Split". Below are the steps to construct the classification model for ZeroR algorithm with test option - "Use training set".

a. Select a Classify learning schema: ZeroR
b. Training adult-Train by specifying "Use training set" as the test option. Report what classifier you get and its accuracy
c. Next, specify adult-Test as the supplied test set. Report what classifier you get and how its accuracy compare to the one in the previous step
d. Now using the other test option - "Percentage Split". Report what classifier you get and its accuracy.
e. Repeat step c. What do you observe? Does the accuracy on the test set improve and if so, why do you think it does?
f. Following the step a to e for other learning schemas: Naïve Bayes Simple, J4.8, and RandomForest.

2. Once you have built 8 models, which of these classifiers are you more likely to trust when determining whether the income is equal or grater than $50,000? Why?

3. Explain what can you say about accuracy when using training set data and when using separate percentage to train?

Reference no: EM1347446

Questions Cloud

Balance sheet under long term liabilities : ACME Corporation fiscal year ends on December 31st. At the end of 1st quarter on March 31, ACME owes $40,000 on a vehicle loan that matures in three (3) years.
Question regarding the cvp ananlysis : Suppose a fixed cost of $900, a variable cost of $4.50, and selling price of $5.50. Find out the break even point? How many units should be sold to make a profit of $500.? How many units should be sold to average $.0.25 profit per unit? .50 per un..
Find would the firms operating leverage : Would the firm's operating leverage increase or decrease if it made the change and would the new situation expose the firm to more or less business risk than the old one
What percentage of the molecules escape : A 0.410kg block is attached to a horizontal spring that is at its equilibrium length, and whose force constant is 22.0N/m . The block rests on a frictionless surface. A 6.00×10^2kg wad of putty is thrown horizontally at  block, hitting it with a s..
Explaining accuracy when using training set data : Explain what can you say about accuracy when using training set data and when using separate percentage to train?
Calculating-cost-volume-profit analysis : Sure Corporation has gathered the following information after its first year of sales. Net sales were $1,600,000 on 100,000 units; selling expenses $240,000 (40% variable and 60% fixed); direct materials $511,000; direct labor $285,000; administra..
Difference between operating and financial leverages : Show the difference between operating and financial leverage. Can there be too much financial leverage in a firm?
Construct a price weighted index : Three (3) stocks have share values of $12, $75, and $30 with total market rates of $400 million, $350 million & $150 million respectively.
Expalin performance and motivation theory : Expalin Performance and Motivation Theory - How does a leader leverage them as he or she focuses on improving business results

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Executing critical section in mutual exclusion protocol

In Lamport's mutual exclusion protocol, if process i is implementing critical section.

  Describe the forest, domain, ou, and trust configuration

Describe the forest, domain, OU, and trust configuration for Bluesky. Include a chart or diagram of the current configuration. Currently Bluesky has a single domain and default OU structure.

  Actions against company security camera

Joe the janitor is recorded on the company security camera one night taking pictures with his cell phone of the office of the CEO after he is done cleaning it. What will you do and what is your justification for your actions?

  Potential vulnerabilities in making purchase with debit card

Recognoze any potential vulnerabilities in making purchase with debit card, and which area of CIA triad they apply to.

  Examine about direct cash-payment method

Examine about Direct Cash-Payment method

  Finding content of ac and memory word at specified address

What are the content of the AC and the memory word at address 103 when the computer halts.

  Factoring is the problem of computing

Consider the one time pad encryption scheme to encrypt a 1-bit message. Replace the XOR operation with another operation X. For which X does the resulting scheme satisfy perfect secrecy?

  Explaining project manager-s role in project management

Describe in scholarly detail project manager's role in project management and job responsibilities related with position.

  Truth table validity of demorgan-s theorem for variables

Find out by means of truth table validity of DeMorgan's theorem for three variables: (ABC)' = A' + B' + C'. Simplify given expressions by using Boolean algebra.

  Explaining dns zone in secure dynamic updates

If a DNS zone accepts only secure dynamic updates and the DHCP server is a member of the DnsUpdateProxy security group.

  Explaining responsibility ofconfidentiality to employer

Describe what you must do in such a situation. You know that cost to your present employer will increase if ambiguities are not resolved. Though, you also have a responsibility of confidentiality to previous employer.

  Creating flowchart of data found on employee time cards

Create a flowchart depicting the following situations: The data found on employee time cards are keyed onto a hard disk before they are processed by a computer.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd