Change in the cutoff value of each type of error

Reference no: EM131745694

Question: More spam. Consider again the points-based spam filter described in the Exercise below. When the points assigned to various components of an e-mail exceed the cutoff value you've set, the filter rejects its null hypothesis (that the message is real) and diverts that e-mail to a junk mailbox.

a) In this context, explain what is meant by the power of the test.

b) What could you do to increase the filter's power?

c) What's the disadvantage of doing that?
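
For reference, the general definition of power can be written for this setting as follows. The symbols S (an e-mail's point total), c (the cutoff), and β (the probability of a Type II error) are notation introduced here for illustration; they do not appear in the exercise.

```latex
\text{power}
  = P(\text{reject } H_0 \mid H_0 \text{ is false})
  = P(S > c \mid \text{message is spam})
  = 1 - \beta,
\qquad
\beta = P(S \le c \mid \text{message is spam})
```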

Exercise: Spam. Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through to your inbox, and the rest, suspected to be spam, are diverted to the junk mailbox.
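
The point system described above can be sketched in a few lines of Python. This is only an illustration: the point tables, the keyword list, and the function names `score_email` and `classify` are hypothetical, since the exercise gives no actual point values. The only details taken from the text are that points are added up across components and that a total below the cutoff goes to the inbox.

```python
# Hypothetical point tables; the exercise does not specify any values.
SENDER_POINTS = {"unknown": 20, "known": 0}
KEYWORD_POINTS = {"free": 15, "winner": 25, "urgent": 10}


def score_email(sender_status, subject, body):
    """Add up points for the sender and for each flagged keyword found."""
    points = SENDER_POINTS.get(sender_status, 0)
    text = f"{subject} {body}".lower()
    for word, value in KEYWORD_POINTS.items():
        if word in text:  # simple substring match, for illustration only
            points += value
    return points


def classify(points, cutoff=50):
    """Totals below the cutoff pass to the inbox; the rest go to junk."""
    return "inbox" if points < cutoff else "junk"


total = score_email("unknown", "You are a WINNER", "Claim your free prize")
print(total, classify(total), classify(total, cutoff=70))
```

Raising `cutoff` makes the same message harder to reject, which is the adjustment the later parts of the exercise ask about.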

We can think of the filter's decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total provides evidence that the message may be spam; when there's sufficient evidence, the filter rejects the null, classifying the message as junk. This usually works pretty well, but, of course, sometimes the filter makes a mistake.

a) When the filter allows spam to slip through into your inbox, which kind of error is that?

b) Which kind of error is it when a real message gets classified as junk?

c) Some filters allow the user (that's you) to adjust the cutoff. Suppose your filter has a default cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of α for a hypothesis test? Explain.

d) What impact does this change in the cutoff value have on the chance of each type of error?
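
One way to explore how the cutoff affects the two error rates is a small numerical sketch. Everything about the score distributions here is an assumption made for illustration: the exercise says nothing about how point totals are distributed, so the normal models and their parameters (`REAL`, `SPAM`) are invented, as is the helper name `error_rates`. It uses the rule from the exercise that a total below the cutoff goes to the inbox.

```python
from statistics import NormalDist

# Assumed score distributions, not taken from the exercise.
REAL = NormalDist(mu=30, sigma=10)  # point totals of real messages
SPAM = NormalDist(mu=70, sigma=15)  # point totals of spam messages


def error_rates(cutoff):
    """Return the error probabilities of the filter for a given cutoff."""
    # Null hypothesis true (real message) but rejected: sent to junk.
    type_i = 1 - REAL.cdf(cutoff)
    # Null hypothesis false (spam) but not rejected: reaches the inbox.
    type_ii = SPAM.cdf(cutoff)
    return {"type_i": type_i, "type_ii": type_ii, "power": 1 - type_ii}


for cutoff in (50, 60):
    rates = error_rates(cutoff)
    print(
        f"cutoff={cutoff}: "
        f"P(Type I)={rates['type_i']:.4f}, "
        f"P(Type II)={rates['type_ii']:.4f}, "
        f"power={rates['power']:.4f}"
    )
```

Under these assumed distributions, comparing the two printed lines shows the trade-off between the two error types as the cutoff moves; the specific numbers depend entirely on the invented parameters.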






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd