Provide a summary of the comparison

Assignment Help Computer Engineering
Reference no: EM131474851

Data Mining - Practical: Using Naïve Bayes Classifier for Classification

This is a combined practical and exercise session, in which we have two tasks. First, we use Weka to run two different classifiers (Naïve Bayes and J48) and compare their performances. Second, we use Weka to help us to calculate and decide whether someone is likely to get sunburned or not as Naïve Bayes Classifier does. This will help us to familiarise ourselves with the theories on Naïve Bayes Classifier we have learned in today's lecture.

Step 1:Launching Weka and Loading Training Data

Launch Weka by clicking on: RunWeka.bat

Select ‘Explorer' from the list of Applications.

Select the ‘Preprocess' tab and click on ‘Open File'. Choose the file ‘sunburn.arff' which contains the training data set.

Step 2: Exploring Naïve Bayes Classifier

Select the ‘Classify' tab and make sure that "NaiveBayes" is chosen from the classifier list and "Use training set" is clicked as the test option. Note that attribute "burned" is chosen by default as the class attribute but the class attribute can be changed if needed.

Click ‘Start'will create a classification model/classifier from the training dataset. The classifier is listed in the Result list while the details about the classifier are displayed in the ‘Classifier output' window.

Step 3: Comparing Naïve Bayes Classifier with Decision Tree-based Classifier J48 Using 10-fold Cross-validation

First, you can easily run Naïve Bayes Classifier again with a different evaluation method. Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well the learned model performs on the cross-validation.

Second, you can also easily run J48 with 10-fold cross-validation. Select the ‘Classify' tab and make sure that "J48" is chosen from the classifier list and Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well J48 performs on the cross-validation.

Finally, compare the two sets of evaluation results.

Q1. Provide a summary of the comparison between the classifier outputs by Naïve Bayes Classifier and J48.

Step 4: Discovering Naïve Bayes Classifier

We now focus on discovering how the Naïve Bayes Classifier produces the output. We have just covered the Naïve Bayes Classifier in today's lecture. We now want to familiarise ourselves with the theories that we have learned in today's lecture by working through an example using Weka.

Select the ‘Preprocess' tab and go back to examine the training data set. Click on attribute burned in the attributes window on the left and you will see that the class attribute has two class labels "burned" and ‘none".

Q2. Calculate the probability estimates of these two class labels, i.e., P(burned = "burned") and P(burned = "none") based on their class distributions. Click on attribute burned in the attributes window will display the class distribution, i.e., 9/16 with class label "burned" while 7/16 with class label "none"

P(burned = "burned") =
P(burned = "none") =

Click "Edit" to view the sunburn relation table that shows the value distribution of each attribute given a class label. Click on attribute burned will sort the instances in the table in alphabetic order. For example, among the 9 instances with class attribute value "burned", 2/9 has hair = "red".

Q3. Calculate the conditional probability of each attribute on each class, e.g., P(hair = "blonde"|burned = "burned"). Note that you would have 6 + 6 + 6 + 4 = 22 conditional probabilities to calculate.

P(hair = "blonde"|burned = "burned") =
P(hair = "brown"|burned = "burned") =
P(hair = "red"|burned = "burned") =
P(hair = "blonde"|burned = "none") =
P(hair = "brown"|burned = "none") =
P(hair = "red"|burned = "none") =
...

Q4. Calculate the classification probability of each class, given hair = "blonde", height = "short", weight = "average", lotion = "none".

P(burned = "burned"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

P(burned = "none"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

Q5. Is the given person likely to get sunburned based on the prediction by Naïve Bayes Classifier?

Attachment:- Practical2.rar

Reference no: EM131474851

Questions Cloud

Write a high-level program for this computation using a fork : Consider the following computation: ci = ai*bi + ci*di where, i = 1 to N. Write a high-level program for this computation using a fork/join.
Employee referrals for organizational recruitment : Employee referrals for organizational recruitment, paid or unpaid, can be an effective tool for recruitment.
Operate in its general and specific environments : identify the factors that operate in its general and specific environments.
What is distributional interest : What is distributional interest?
Provide a summary of the comparison : Provide a summary of the comparison between the classifier outputs by Naïve Bayes Classifier and J48 - Calculate the probability estimates of these two class
Marginal revenue curves of a competitive firm identical : Why does the firm in pure competition operate in the rising portion of its marginal cost curve?
Explain evaluation of alternatives and strategic choice : Identify that how Channel power & leadership can be an investment in profitability. Explain evaluation of alternatives and strategic choice?
Estimate the computation time for the program : Estimate the computation time for the program in the above problem assuming the following time characteristics.
Competitor advantages take place : How is game theory applied when competitor advantages take place? In other words, how do you implement game theory into the equation of competition?

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd