Calculate the classification accuracy for the decision tree

Assignment Help Other Subject
Reference no: EM133746309

Artificial Intelligence and Machine Learning

Decision Tree Training
Using the Orange Data Mining software, implement a decision tree to predict whether Sara will go sailing given weather, company, and boat size. Import the dataset named ‘Sailing' from Orange and specify the features and prediction target. Which column(s) correspond to features and which column(s) to the prediction target? Perform stratified, replicable train-test splitting on the dataset using the split ratio specified by your class facilitator. How many examples are there in the resulting training and test sets? Now, train a binary decision tree, with at least 2 instances in leaves, not splitting subsets smaller than 2, limiting the maximal tree depth to 3, and stopping splitting when the majority class reaches 90%. Visualize the resulting tree model as a tree graph.

Prediction and Testing
Using the tree graph, manually predict whether Sara will go sailing for the weather, company and boat size given in the first two test examples. Construct a confusion matrix for your tree model on the test set. Calculate the classification accuracy, precision, recall, and F1 score for each outcome. You may check your answers against Orange's Test and Score output.

Logistic Regression
Using the Orange Data Mining software, build a workflow to classify Iris flowers given petal length and width, and sepal length and width. Import the dataset named ‘Iris' from Orange and specify the features and prediction target. Prepare a scatter plot of petal width against petal length. Roughly speaking, how do the probabilities of different Iris species given petal width and petal length change as the two measurements change? Now, perform a stratified, replicable train-test splitting on the data using the split ratio specified by your class facilitator. How many examples are there in the training and testing sets? Train a logistic regression model with L2 regularization (using regularization strength of 1). Write down the resulting logistic regression model equations for the probabilities of Iris Setosa, Iris Virginica and Iris Versicolor given the flower measurements. Hint: the probability of Setosa is given by

Prediction and Testing
Using the logistic regression model equations from 1.3, compute the probabilities of Iris Setosa, Iris Virginica and Iris Versicolor for the measurements given in the first four test examples. Given the probabilities, what is the logistic regression model's prediction for each example?

After Class
The following problems are to be solved individually after class and submitted as part of the report by the assessment submission deadline.

Confusion Matrix
Table 1 lists test examples used for a Wine Type classification task, and predictions made by decision tree and logistic regression models. Complete the confusion matrices for the two models by hand. The use of the Orange Data Mining software is not required.

Classification Accuracy, Precision and Recall
Calculate the classification accuracy for the decision tree and logistic regression models used in 2.1. Additionally, calculate the precision, recall and F1 score per wine type for each model. Show all working. Suppose the decision tree predicts that a given wine is of Type 2. How likely (in %) is this wine actually to be Type 1? Also, what % of Type 3 wines can the logistic regression model correctly classify as Type 3?

Report
In no more than 1000 words, and in no more than 2 pages, summarize your work, answers, and other findings from Sections 1-2 above. Include screen shots, tables, and other figures to help illustrate your understanding of the prediction workflow and machine learning concepts.

Reference no: EM133746309

Questions Cloud

Dimensions of the ethical dilemma presented : Dimensions of the ethical dilemma presented in the scenario Dr. Will Anderson, a seasoned physician working at a well-renowned hospital,
How can social learning theory be used to understand crime : How can social learning theory be used to understand crime?
Went live with intake model for breast-non-breast patients : Went live with Intake model for Breast and Non-Breast patients Background: Intake workflow for breast patients followed Abbotts structure and workflows.
How diversity awareness impacts communication-collaboration : Reflect on how you give and receive feedback and how diversity awareness impacts communication and collaboration.
Calculate the classification accuracy for the decision tree : Calculate the classification accuracy for the decision tree and logistic regression models used in 2.1. Additionally, calculate the precision, recall
Acetazolamide for glaucoma : The client's medications include 250 mg acetazolamide for glaucoma. The nurse should take which action for this client?
What did you learn by using this strategy : What will you do differently next time you see an opportunity to advocate for early learning issues?
Describes headache dull and throbbing : She describes the headache a dull and throbbing coming and going throughout the day, 3-4 days a week. She denies recent falls or injuries to head.
Create machine learning models : Case Study and Individual Exercise - create machine learning models that can make predictions based on the data and report on the characteristics of the models

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd