Describe the classification problem and data preprocessing

Reference no: EM132134635

Task description: Data Engineering and Mining

The data set comes from the Kaggle Digit Recognizer competition. The goal is to recognize the digits 0 to 9 in images of handwriting. Because the original data set is large, I have systematically sampled 10% of the data by selecting every 10th example (the 10th, the 20th, and so on).
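The systematic sampling described above can be sketched as follows. This is a minimal illustration, not the script actually used to produce the sample; the function name `systematic_sample` and the toy row list are hypothetical.

```python
def systematic_sample(rows, step=10):
    """Return every `step`-th row: the 10th, 20th, ... for step=10."""
    # Starting the slice at index step-1 picks the 1-based positions
    # step, 2*step, 3*step, and so on.
    return rows[step - 1::step]


# Stand-in for 100 examples, numbered from 1 so positions are easy to read.
rows = list(range(1, 101))
sample = systematic_sample(rows, step=10)
print(sample)  # [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
```

With a step of 10 the sample holds 10% of the rows, which matches the sampling rate stated in the task.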

You will use the sampled data to build prediction models with the machine learning algorithms we have covered recently: naïve Bayes, kNN and SVM. Tune their parameters to get the best model, as measured by cross-validation, and compare which algorithms provide a better model for this task.
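One way to measure a model by cross-validation is sketched below using only the standard library. The helper names (`k_fold_indices`, `cross_val_accuracy`, `majority_baseline`) and the toy data are assumptions made for illustration; the task does not prescribe a fold count or a tool.

```python
import random


def k_fold_indices(n, k, seed=0):
    """Split the indices 0..n-1 into k shuffled, roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_val_accuracy(fit_predict, X, y, k=5, seed=0):
    """Mean accuracy over k folds.

    fit_predict(train_X, train_y, test_X) must return predicted labels.
    """
    folds = k_fold_indices(len(X), k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except the i-th is used for training.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        preds = fit_predict([X[j] for j in train_idx],
                            [y[j] for j in train_idx],
                            [X[j] for j in test_idx])
        correct = sum(p == y[j] for p, j in zip(preds, test_idx))
        scores.append(correct / len(test_idx))
    return sum(scores) / k


def majority_baseline(train_X, train_y, test_X):
    """Hypothetical stand-in model: always predict the most frequent label."""
    most_common = max(set(train_y), key=train_y.count)
    return [most_common] * len(test_X)


X = [[i] for i in range(20)]
y = [0] * 15 + [1] * 5
score = cross_val_accuracy(majority_baseline, X, y, k=5)
print(score)
```

To tune a parameter, the same function can be called once per candidate value, keeping the value with the highest mean accuracy.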

Report structure:

Section 1: Introduction

Briefly describe the classification problem and general data preprocessing.

Note that some data preprocessing steps may be specific to a particular algorithm. Report those steps under the corresponding algorithm section.
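A possible general preprocessing step is sketched below. It assumes a CSV layout with a header row, a `label` column first, and grey-level pixel columns in the range 0 to 255; that layout is an assumption here and should be checked against the sampled file. The function names and the toy CSV are hypothetical.

```python
import csv
import io


def load_digits(csv_text):
    """Parse rows of the assumed form label,pixel0,pixel1,... into (X, y)."""
    reader = csv.reader(io.StringIO(csv_text))
    next(reader)  # skip the assumed header row
    X, y = [], []
    for row in reader:
        y.append(int(row[0]))
        # Scale the assumed 0-255 grey levels to the range [0, 1].
        X.append([int(v) / 255.0 for v in row[1:]])
    return X, y


def drop_constant_columns(X):
    """Remove pixels that take the same value in every example."""
    keep = [c for c in range(len(X[0])) if len({row[c] for row in X}) > 1]
    return [[row[c] for c in keep] for row in X], keep


toy_csv = "label,pixel0,pixel1,pixel2\n3,0,255,0\n7,0,51,255\n"
X, y = load_digits(toy_csv)
X_reduced, kept = drop_constant_columns(X)
print(y, kept, X_reduced)
```

The list of kept columns should be computed on the training data only and then applied unchanged to any held-out data, so that both share the same features.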

Section 2: Naïve Bayes

Build a naïve Bayes model. Tune the parameters, such as the discretization options, and compare the results.
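As one example of a discretization option, the sketch below binarizes each pixel around a threshold and fits a Bernoulli naïve Bayes model with Laplace smoothing. It is a from-scratch illustration under those assumptions, not the required implementation; the threshold of 127, the smoothing value `alpha` and the toy data are all hypothetical choices.

```python
import math
from collections import defaultdict


def binarize(X, threshold):
    """Discretize each pixel into 0 or 1 around a threshold."""
    return [[1 if v > threshold else 0 for v in row] for row in X]


def train_bernoulli_nb(X, y, alpha=1.0):
    """Estimate the log prior and per-pixel 'on' probability for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        # Laplace smoothing keeps the probabilities away from 0 and 1.
        p_on = [(sum(col) + alpha) / (n + 2 * alpha) for col in zip(*rows)]
        model[label] = (math.log(n / len(X)), p_on)
    return model


def predict_bernoulli_nb(model, X):
    """Pick the class with the highest log posterior for each row."""
    preds = []
    for row in X:
        best_label, best_score = None, -math.inf
        for label, (log_prior, p_on) in model.items():
            score = log_prior + sum(
                math.log(p if v else 1.0 - p) for v, p in zip(row, p_on))
            if score > best_score:
                best_label, best_score = label, score
        preds.append(best_label)
    return preds


X_raw = [[200, 10, 0], [220, 30, 5], [0, 15, 240], [10, 0, 250]]
y = [0, 0, 1, 1]
model = train_bernoulli_nb(binarize(X_raw, threshold=127), y)
preds = predict_bernoulli_nb(model, binarize([[210, 0, 0], [5, 5, 255]], 127))
print(preds)  # [0, 1]
```

Different thresholds, or more than two bins per pixel, are the kind of discretization settings that can be compared by cross-validated accuracy.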

Section 3: K-Nearest Neighbor method

Section 4: Support Vector Machine (SVM)
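For the k-nearest neighbor method, a minimal from-scratch sketch is given below, assuming squared Euclidean distance and a simple majority vote. The function names and the two-feature toy data are hypothetical; the number of neighbors `k` is the main parameter to tune. No SVM sketch is included here, since an SVM would normally come from a library.

```python
from collections import Counter


def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_predict(train_X, train_y, test_X, k=3):
    """Label each query by majority vote among its k nearest neighbors."""
    preds = []
    for query in test_X:
        # Rank the training examples by distance and keep the k closest.
        nearest = sorted(
            range(len(train_X)),
            key=lambda i: squared_distance(train_X[i], query))[:k]
        votes = Counter(train_y[i] for i in nearest)
        preds.append(votes.most_common(1)[0][0])
    return preds


train_X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
           [1.0, 1.0], [0.9, 1.1], [1.1, 0.9]]
train_y = [0, 0, 0, 1, 1, 1]
preds = knn_predict(train_X, train_y, [[0.05, 0.05], [0.95, 1.0]], k=3)
print(preds)  # [0, 1]
```

Because every query is compared with every training example, prediction time grows with the size of the training set, which is worth keeping in mind when timing the algorithms.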

Section 5: Algorithm performance comparison

Compare the results from the three algorithms. Which one reached the highest accuracy? Which one runs fastest? Can you explain why?
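Accuracy and run time can be recorded side by side with a small harness such as the one below. It is only a sketch: `always_zero` and `parity_rule` are hypothetical stand-ins for fitted models, and the toy data has nothing to do with the digit images.

```python
import time


def evaluate(name, predict_fn, test_X, test_y):
    """Return (name, accuracy, seconds) for one prediction function."""
    start = time.perf_counter()
    preds = predict_fn(test_X)
    elapsed = time.perf_counter() - start
    accuracy = sum(p == t for p, t in zip(preds, test_y)) / len(test_y)
    return name, accuracy, elapsed


def always_zero(X):
    """Hypothetical stand-in model that predicts 0 for every row."""
    return [0 for _ in X]


def parity_rule(X):
    """Hypothetical stand-in model that predicts the parity of the feature."""
    return [row[0] % 2 for row in X]


test_X = [[1], [2], [3], [4]]
test_y = [1, 0, 1, 0]
results = [
    evaluate("always_zero", always_zero, test_X, test_y),
    evaluate("parity_rule", parity_rule, test_X, test_y),
]
# List the most accurate model first.
for name, acc, secs in sorted(results, key=lambda r: -r[1]):
    print(f"{name:12s} accuracy={acc:.2f} time={secs:.6f}s")
```

When comparing speed, it helps to time training and prediction separately, because some algorithms do most of their work at prediction time rather than during training.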

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd