Construct a confusion matrix for your tree model

Assignment Help Other Subject
Reference no: EM133746262

Artificial Intelligence and Machine Learning

Decision Tree Training
Using the Orange Data Mining software, implement a decision tree to predict whether Sara will go sailing given weather, company, and boat size. Import the dataset named ‘Sailing' from Orange and specify the features and prediction target. Which column(s) correspond to features and which column(s) to the prediction target? Perform stratified, replicable train-test splitting on the dataset using the split ratio specified by your class facilitator. How many examples are there in the resulting training and test sets? Now, train a binary decision tree, with at least 2 instances in leaves, not splitting subsets smaller than 2, limiting the maximal tree depth to 3, and stopping splitting when the majority class reaches 90%. Visualize the resulting tree model as a tree graph.

Prediction and Testing
Using the tree graph, manually predict whether Sara will go sailing for the weather, company and boat size given in the first two test examples. Construct a confusion matrix for your tree model on the test set. Calculate the classification accuracy, precision, recall, and F1 score for each outcome. You may check your answers against Orange's Test and Score output.

Logistic Regression
Using the Orange Data Mining software, build a workflow to classify Iris flowers given petal length and width, and sepal length and width. Import the dataset named ‘Iris' from Orange and specify the features and prediction target. Prepare a scatter plot of petal width against petal length. Roughly speaking, how do the probabilities of different Iris species given petal width and petal length change as the two measurements change? Now, perform a stratified, replicable train-test splitting on the data using the split ratio specified by your class facilitator. How many examples are there in the training and testing sets? Train a logistic regression model with L2 regularization (using regularization strength of 1). Write down the resulting logistic regression model equations for the probabilities of Iris Setosa, Iris Virginica and Iris Versicolor given the flower measurements. Hint: the probability of Setosa is given by

Prediction and Testing
Using the logistic regression model equations from 1.3, compute the probabilities of Iris Setosa, Iris Virginica and Iris Versicolor for the measurements given in the first four test examples. Given the probabilities, what is the logistic regression model's prediction for each example?

After Class
The following problems are to be solved individually after class and submitted as part of the report by the assessment submission deadline.

Confusion Matrix
Table 1 lists test examples used for a Wine Type classification task, and predictions made by decision tree and logistic regression models. Complete the confusion matrices for the two models by hand. The use of the Orange Data Mining software is not required.

Classification Accuracy, Precision and Recall
Calculate the classification accuracy for the decision tree and logistic regression models used in 2.1. Additionally, calculate the precision, recall and F1 score per wine type for each model. Show all working. Suppose the decision tree predicts that a given wine is of Type 2. How likely (in %) is this wine actually to be Type 1? Also, what % of Type 3 wines can the logistic regression model correctly classify as Type 3?

Report
In no more than 1000 words, and in no more than 2 pages, summarize your work, answers, and other findings from Sections 1-2 above. Include screen shots, tables, and other figures to help illustrate your understanding of the prediction workflow and machine learning concepts.

Reference no: EM133746262

Questions Cloud

Do you agree protection based on a contemporary social issue : Environmental deterioration and protection based on a contemporary social issue. Do you agree or disagree? Justify your answer.
Define comparative education in relation to a global society : How do you define comparative education in relation to a global society?
Client who requires treatment for infective endocarditis : The nurse is preparing discharge plans for a client who requires treatment for infective endocarditis.
What impact it may have on the american public : Explain why you find the spending wasteful, and if eliminated, what impact it may have on the American public.
Construct a confusion matrix for your tree model : Construct a confusion matrix for your tree model on the test set. Calculate the classification accuracy, precision, recall, and F1 score for each outcome
Physician ordered blood to be taken to run hormone panel : The physician ordered blood to be taken to run a hormone panel. What is the possible diagnosis of a patient in this visit?
Why media justice is important to transformative organizing : Why media justice is important to transformative organizing?
Address strategic failures based on a situation : Address strategic failures based on a situation in which one of their vessels was involved in a maritime accident involving multiple fatalities.
Barbiturates indications for tricyclic antidepressants : Adverse effects of all antidepressants Adverse effects of barbiturates Indications for tricyclic antidepressants (TCA's) Side effects of TCA's

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd