Reference no: EM132332264
Assignment -
Answer the following questions. Move all graphs, charts, and tables to the single document. Use APA style references and citations if needed. These questions are from Chapter 5 of Shmueli, Bruce, and Patel.
1. A data mining procedure has classified 88 records as fraudulent (30 correctly so) and 952 as non-fraudulent (920 correctly so). Build the classification matrix and calculate the error rate. Explain how the error rate is calculated. Explain in your own words how each cell in the confusion matrix is calculated.
2. Suppose there is an adjustable cutoff value used to alter the proportion of records classified as fraudulent. Describe in your own words how moving the cutoff up or down would affect:
a. The classification error rate for records that are truly fraudulent.
b. The classification error rate for records that are truly non-fraudulent.
Be sure to write in complete, well-considered sentences and avoid ambiguity in your answers.
3. A large number of insurance records are to be examined to develop a model for predicting fraudulent claims. Of the claims in the historical database, 1 percent were fraudulent. A sample is used to develop a model, and oversampling is used to provide a balanced sample in light of the low response rate. When applied to this sample, (800 records), the model correctly classifies 310 frauds and 270 non-frauds. It missed 90 frauds and classified 130 records incorrectly as frauds when they were not.
a. Make the classification matrix for the sample.
b. Find the adjusted misclassification rate (adjusting for oversampling.)
c. What percentage of new records would you expect to be classified as fraudulent?
References - Shmueli, G., Bruce, P., Patel, N., (2016). Data Mining for Business Analytics, Concepts, Techniques, and Applications with XLMiner. Hoboken, NJ: John Wiley & Sons, Inc.
How to assess risk if you moved your work network
: Discuss in 500 words or more how to assess risk if you moved your personal or work network to the cloud using these categories: asset, threat, vulnerabilities.
|
Physical activity goals that are personally meaningful
: Identify 1-2 long term health/physical activity goals that are personally meaningful and inspirational to you. Why are these long terms goals meaningful to you?
|
Why institutions are reluctant to move their it to the cloud
: Discuss in 500 words, why institutions are reluctant to move their IT to the cloud. Consider specific industries like education, medicine, military, etc.
|
Retains a small ownership stake
: One of the co-founders of Project Repat is no longer with the company, although he retains a small ownership stake.
|
Make the classification matrix for the sample
: A large number of insurance records are to be examined to develop a model for predicting fraudulent claims. Make the classification matrix for the sample
|
Unusual nature of this influenza virus is attributed
: The unusual nature of this influenza virus is attributed to a surface protein, hemagglutinin (HA) which resembled avian influenza HA receptors.
|
Main consumer buying behavior for product
: What is the target market for each ad? What do you think that Anheuser-Busch has identified as its main consumer's buying behavior for each product?
|
Define what behaviors support or detract from your health
: In a 750-1,000 word paper, discuss the relevance of the continuum to patient care and present a perspective of your current state of health in relation to the.
|
What would help brian with his goal of weight gain
: What would help Brian with his goal of weight gain?
|