Describe the classification rule method

Assignment Help Advanced Statistics
Reference no: EM132515484

Data Mining: Basic Methods and Techniques

Laboratory Assignment:

Part 1. Describe the Classification Rule method.

Part  2. Use the Classification rule production method (Classify Tab-Rules Folder-JRip)on the Weather.nominal data set. How many rules did it produce? Compare this to the Decision tree produced on the same data. What is the difference between the two models?

Part  3. Describe the K-nearest neighbor method.

Part  4. Produce a K-NN model (classifiers.lazy.IBk) for Weather.numeric data set.
The standard K-nearest neighbor method can be found in the ‘lazy' submenu of the list presented when you click ‘Choose' in Explorer's Classify window. It is called ‘IBk'. Select this and then click on IBk so you can modify the parameters. The default value of k is 1. Set it to 3 (or other value of your preference) and then click Start to run the programs.

What is the output? How many instances did it classify correctly and how many incorrectly?
• Try changing the parameter K - the number of neighbors. Did that influence the model's performance?
• Try using different weighting schemes. Did does this change influence the model's performance?

Part  5. Upload the soybean.arff data set. Before running Weka, it is worth having a brief look at the data file under the Preprocess tab click Edit button. Alternatively, you can take a look at the data file using a text editor (Notepad or WordPad would work). Lines beginning with % are comments. Typically the beginning of the file provides background information on the data set. This includes details of the data itself and references to previous work using the data. The Soybean file contains 683 examples, each of which has 35 attributes plus the class attribute. The task is to assign examples to one of 19 disease classes. Apply the k-nearest neighbor classifier to the soybean data set.

What % of examples are correctly classified?Compare the result to the same result of the unpruned decision tree procedure. Try investigating the effect of repeating the run with different values for k. Compare and contrast the 2 methods and their outputs.

Reference no: EM132515484

Questions Cloud

Prepare a journal to record the exchange : At the time of this exchange, the market price of the engine was Rp5,500,000. Prepare a journal to record the exchange, the estimated age of the machine
ME606 Digital Signal Processing Assignment : ME606 Digital Signal Processing Assignment Help and Solution, Melbourne Institute of Technology - Assessment Writing Service
HC1072 Economics and International Trade Assignment : HC1072 Economics and International Trade Assignment Help and Solution, Holmes Institute - Assessment Writing Service - Develop a broad understanding
300976 Technologies for Mobile Applications Assignment : 300976 Technologies for Mobile Applications Assignment Help and Solution, Western Sydney University - Assessment Writing Service
Describe the classification rule method : Describe the Classification Rule method and Describe the K-nearest neighbor method - Produce a K-NN model (classifiers.lazy.IBk) for Weather.numeric data set
Analyse system functionality : Analyse system functionality and Review and update technical and user documentation for at least TWO systems or occasions
Explain what nutrition is and why it is important : Explain what nutrition is and why it is important and Describe the characteristics of a healthy diet and provide supporting examples
Differences between the three types of intervention : Explain the differences between the three types of intervention in group work: Interpersonal. Intrapersonal. Environmental and Cognitive Restructuring
Demonstrating the principles of data merging : Demonstrating the principles of data merging, RESTful Web Services and Mashups - explaining the principles of data merging, RESTful Web Services and Mashups

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Relationship between speed, flow and geometry

Write a project proposal on relationship between speed, flow and geometry on single carriageway roads.

  Logistic regression model

Compute the log-odds ratio for each group in Logistic regression model.

  Logistic regression

Foundations of Logistic Regression

  Probability and statistics

The tubes produced by a machine are defective. If six tubes are inspected at random , determine the probability that.

  Solve the linear model

o This is a linear model. If your model needs a different engine, then you need to rethink your approach to the model. Remember, there are no IF, Max, or MIN statements in linear models.

  Plan the analysis

Plan the analysis

  Quantitative analysis

State the hypotheses that you are going to test.

  Modelise as a markov chain

modelise as a markov chain

  Correlation and regression

What are the degrees of freedom for regression

  Construct a frequency distribution for payment method

Construct a frequency distribution for Payment method

  Perform simple linear regression

Perform simple linear regression

  Quality control analysis

Determining the root causes

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd