What is the best model to classify the data

Assignment Help Computer Engineering
Reference no: EM133371018

The spam datafile contains 4601 emails, 1813 of which are spam. The file has 57 features that include indicators for the presence of 54 keywords (e.g. free, deal, ! etc), counts for capitalized characters etc., and a numeric spam variable for whether each email is tagged as spam by a human reader (spam column is 1 for spam, 0 for important emails).

You have to predict the probability that a message is spam or not.

1) Partition the data into a training set (with 70% of the observations), and testing set (with 30% of the observations) using the random state of 12345 for cross validation.

2) On the partitioned data, build the best KNN model. Show the accuracy numbers. (Hint: What is the best value of k? How do you decide the 'best k'?)

3) On the partitioned data, build the best logistic regression model. Show the accuracy numbers.

4) Based on the results of k-nearest neighbor, and logistic regression, what is the best model to classify the data? Provide explanation to support your argument

Reference no: EM133371018

Questions Cloud

How would software configuration management vary : How would software configuration management vary between organizations, depending on project complexity, software process (agile vs waterfall), and degree
Who is expected to comply with the gcps : Who is expected to comply with the GCPs? Provide examples of the specific roles and their responsibility within clinical research. What happens if a member
Discuss your own moral decision making and how it relates : Discuss your own moral decision making and how it relates to Kohlberg's stages. Do you make moral decisions at a different stage now than you did earlier
Electronic immigration system : Describe four ways United States Citizenship and Immigration Services failed during the modernization of ELIS (Electronic Immigration System).
What is the best model to classify the data : what is the best model to classify the data? Provide explanation to support your argument - On the partitioned data, build the best KNN model. Show the accuracy
Differences between each of purposes of punishment : Explain the differences between each of the purposes of punishment.
Describe how money obtained through tax is used in funding : Describe how money obtained through tax is used in funding healthcare services . this should include an introduction, how the money is generated
Write a thorough explanation of what it is : Write a thorough explanation of what it is, in plain English - approach to pseudo-code on the videos above (from section 2 "A Procedural View of the World")
Describe one of the tools or best practices of strategic : Briefly describe one of the tools or best practices of strategic planning or execution (implementation) (SWOT, Service-Value Chain, Appreciative Inquiry, etc.).

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd