Make the classification matrix for the sample

Assignment Help Applied Statistics
Reference no: EM132332264

Assignment -

Answer the following questions. Move all graphs, charts, and tables to the single document. Use APA style references and citations if needed. These questions are from Chapter 5 of Shmueli, Bruce, and Patel.

1. A data mining procedure has classified 88 records as fraudulent (30 correctly so) and 952 as non-fraudulent (920 correctly so). Build the classification matrix and calculate the error rate. Explain how the error rate is calculated. Explain in your own words how each cell in the confusion matrix is calculated.

2. Suppose there is an adjustable cutoff value used to alter the proportion of records classified as fraudulent. Describe in your own words how moving the cutoff up or down would affect:

a. The classification error rate for records that are truly fraudulent.

b. The classification error rate for records that are truly non-fraudulent.

Be sure to write in complete, well-considered sentences and avoid ambiguity in your answers.

3. A large number of insurance records are to be examined to develop a model for predicting fraudulent claims. Of the claims in the historical database, 1 percent were fraudulent. A sample is used to develop a model, and oversampling is used to provide a balanced sample in light of the low response rate. When applied to this sample, (800 records), the model correctly classifies 310 frauds and 270 non-frauds. It missed 90 frauds and classified 130 records incorrectly as frauds when they were not.

a. Make the classification matrix for the sample.

b. Find the adjusted misclassification rate (adjusting for oversampling.)

c. What percentage of new records would you expect to be classified as fraudulent?

References - Shmueli, G., Bruce, P., Patel, N., (2016). Data Mining for Business Analytics, Concepts, Techniques, and Applications with XLMiner. Hoboken, NJ: John Wiley & Sons, Inc.

Reference no: EM132332264

Questions Cloud

How to assess risk if you moved your work network : Discuss in 500 words or more how to assess risk if you moved your personal or work network to the cloud using these categories: asset, threat, vulnerabilities.
Physical activity goals that are personally meaningful : Identify 1-2 long term health/physical activity goals that are personally meaningful and inspirational to you. Why are these long terms goals meaningful to you?
Why institutions are reluctant to move their it to the cloud : Discuss in 500 words, why institutions are reluctant to move their IT to the cloud. Consider specific industries like education, medicine, military, etc.
Retains a small ownership stake : One of the co-founders of Project Repat is no longer with the company, although he retains a small ownership stake.
Make the classification matrix for the sample : A large number of insurance records are to be examined to develop a model for predicting fraudulent claims. Make the classification matrix for the sample
Unusual nature of this influenza virus is attributed : The unusual nature of this influenza virus is attributed to a surface protein, hemagglutinin (HA) which resembled avian influenza HA receptors.
Main consumer buying behavior for product : What is the target market for each ad? What do you think that Anheuser-Busch has identified as its main consumer's buying behavior for each product?
Define what behaviors support or detract from your health : In a 750-1,000 word paper, discuss the relevance of the continuum to patient care and present a perspective of your current state of health in relation to the.
What would help brian with his goal of weight gain : What would help Brian with his goal of weight gain?

Reviews

len2332264

7/2/2019 10:40:04 PM

Answer the above questions. Answers should be uploaded in a neat, easy-to-read Word document. Move all graphs, charts, and tables to the single document. Do not upload spreadsheets. Be sure to read this week's written lecture for links and other helpful information. Answers should be your own work and in your own words. Use APA style references and citations if needed. These questions are from Chapter 5 of Shmueli, Bruce, and Patel.

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd