Type of error when a real message gets classified as junk

Reference no: EM13110173

Spam filters attempt to sort e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through to the inbox, and the rest, suspected to be spam, are diverted to the junk mailbox.
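As a rough illustration of the point system described above, here is a minimal Python sketch. The point values, the keyword list, the all-caps subject rule, and the names `score_message` and `route` are all hypothetical; the problem does not specify how a real filter assigns points. Only the routing rule (a total lower than the cutoff goes to the inbox, anything else goes to junk) comes from the problem statement.

```python
# Hypothetical point values; the problem does not give any actual weights.
SENDER_POINTS = {"unknown": 20, "known": 0}
KEYWORD_POINTS = {"free": 15, "winner": 25, "prize": 20}
SUBJECT_ALL_CAPS_POINTS = 10


def score_message(sender_status, subject, body):
    """Add up points for the sender, the subject, and key words in the body."""
    points = SENDER_POINTS.get(sender_status, 0)
    if subject.isupper():
        # Assumed rule: an all-capitals subject line earns extra points.
        points += SUBJECT_ALL_CAPS_POINTS
    words = body.lower().split()
    for keyword, value in KEYWORD_POINTS.items():
        if keyword in words:
            points += value
    return points


def route(points, cutoff=50):
    """Messages rated lower than the cutoff pass to the inbox; the rest go to junk."""
    return "inbox" if points < cutoff else "junk"


if __name__ == "__main__":
    total = score_message("unknown", "YOU WON", "You are a winner of a free prize")
    print(total, route(total))
```

With these made-up weights, a message from a known sender that merely contains the word "free" scores well below the default cutoff and reaches the inbox.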

We can think of the filter's decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total gives evidence that the message may be spam; when there's sufficient evidence, the filter rejects the null, classifying the message as junk. This generally works pretty well, but, of course, sometimes the filter makes a mistake.

a) When the filter permits spam to slip through into your inbox, which type of error is that?
b) Which type of error is it when a real message gets classified as junk?
c) Some filters permit the user (that's you) to adjust the cutoff. Assume your filter has a default cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of α for a hypothesis test? Explain.
d) What impact does this change in cutoff value have on the chance of each type of error?
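One way to explore parts c) and d) is to count both kinds of mistake on a small set of scored messages at each cutoff. The sketch below assumes a made-up list of (points, is_spam) pairs; the scores and the name `count_errors` are hypothetical and are not data from the problem. Only the cutoffs of 50 and 60 come from the question.

```python
# Hypothetical (points, is_spam) pairs, invented purely for illustration.
MESSAGES = [
    (10, False), (35, False), (52, False), (58, False), (65, False),
    (45, True), (55, True), (57, True), (70, True), (90, True),
]


def count_errors(messages, cutoff):
    """Return (real messages sent to junk, spam messages passed to inbox)."""
    real_sent_to_junk = sum(
        1 for points, is_spam in messages if not is_spam and points >= cutoff
    )
    spam_sent_to_inbox = sum(
        1 for points, is_spam in messages if is_spam and points < cutoff
    )
    return real_sent_to_junk, spam_sent_to_inbox


if __name__ == "__main__":
    for cutoff in (50, 60):
        junked_real, passed_spam = count_errors(MESSAGES, cutoff)
        print(f"cutoff={cutoff}: real sent to junk={junked_real}, "
              f"spam passed to inbox={passed_spam}")
```

On this invented sample, comparing the two printed lines shows how one kind of mistake becomes rarer and the other more common as the cutoff rises, which is the trade-off the question asks you to relate to α.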
