Create a pivot table for the training data

Assignment Help Applied Statistics
Reference no: EM13758259 , Length: 4

Question 1:

Create a pivot table for the training data with Online as a column variable, CC as a row variable, and Loan as a secondary row variable.

The values inside the cells should convey the count (number of records).

Complete the numbers in the table below:

 

 

online=0

online=1

CC=0

Loan=0



CC=0

Loan=1



CC=1

Loan=0



CC=1

Loan=1



Question 2

Consider the task of classifying a customer who owns a bank credit card and is actively using online banking services. Looking at the pivot table that you created, what is the probability that this customer will accept the loan offer?

Question 3

Create two separate pivot tables for the training data. One will have Loan (rows) as a function of Online (columns) and the other will have Loan (rows) as a function of CC.

Compute the probabilities below (report three decimals).

Note: P(A|B) means "the probability of A given B".

1. P(CC = 1|Loan = 1) = the proportion of credit card holders among the loan acceptors = 

2. P(Online = 1|Loan = 1) = 

3. P(Loan = 1) = the proportion of loan acceptors = 

4. P(CC = 1|Loan = 0) = 

5. P(Online = 1|Loan = 0) = 

6. P(Loan = 0) = 

Question 4

Compute the naive Bayes probability P(Loan = 1|CC = 1, Online = 1).

Note: Use the quantities that you computed in the previous question.

Question 5

Of the two values that you computed earlier, which is a more accurate estimate of P(Loan=1|CC=1, Online=1)?

Select one:

The value based on the separate pivot tables (one with CC and Loan, and one with Online and Loan)

The value based on the complete crossed pivot table (with Online, CC, Loan)

Question 6

In XLMiner, run naive Bayes on the data and request Detail Report for the training data. Examine the "Conditional probabilities" table. Which of the entries in this table are needed for computing P(Loan = 1|CC = 1, Online = 1)? Mark all that apply (you may get slightly different but very close probabilities due to software upgrade, use the closest ones for selecting your  s.)

Select one or more:
0.301
0.402
0.374
0.288
0.712
0.699
0.598
0.626

Question 7

In the XLMiner Naive Bayes output, locate the predicted probability for P(Loan=1 | Online = 1, CC = 1). The 4-decimal value is given by...

Reference no: EM13758259

Questions Cloud

Retaining the value of position : Provide analysis showing the net profit from (i) the covered call and (ii) the protective put on the expiration date assuming the stock price has fallen 20%. Which strategy is more effective at retaining the value of your position?
Program implements the functionality of a deck of cards : Write a complete program using "ECLIPS" that implements the functionality of a deck of cards. In writing your program, use the provided DeckDriver and Card classes shown below. Write your own Deck class so that it works in conjunction with the two..
The outpatient center regarding possible bariatric surgery : Previous medical evaluations have not indicated any metabolic diseases, but he says he has high blood pressure, which he tries to control with sodium restriction and sleep apnea. He current works at a catalog telephone center.
Assuming-size of fish population satisfies logistic equation : A biologist stocked a lake with 45 fish and estimated the carrying capacity (the maximal population for the fish of that species in that lake) to be 7,000. The number of fish tripled in the first year. Assuming that the size of the fish population sa..
Create a pivot table for the training data : Create a pivot table for the training data with Online as a column variable, CC as a row variable, and Loan as a secondary row variable - Create two separate pivot tables for the training data.
What forest/domain model should shiv llc implement : What forest/domain model should Shiv LLC implement? What is the domain name? Where should the domain controllers be place? Should RODC be part of the consideration
Writing your own educational philosophy : Writing your own Educational Philosophy. Why do you want to teach? Whom are you going to teach? How and what are you going to teach?
restore credibility and generate positive press reporting : Create a public relations campaign for a financial institution that has recently received negative exposure in the media pertaining to its lack of responsiveness to those wishing to modify existing home loans. The goal of your campaign is to influenc..
Provide the owner with a reasonable rate of return : A business should provide the owner with a reasonable rate of return based upon:

Reviews

Write a Review

Applied Statistics Questions & Answers

  What is the overall accuracy of the test

A screening test for a newly discovered disease is being evaluated. In order to determine the effectiveness of the new test, it was administered to 900 workers; 150 of the individuals diagnosed with the disease tested positive.

  Calculate the mean, median and mode

State the statistical assumptions of this test and using the data set and variables you have selected, use SPSS to calculate the Mean and Median.

  A group of 9 workers decide to send a delegation of 3

A group of 9 workers decide to send a delegation of 3 to their supervisor to discuss their grievances.  If there are 4 women and 5 men in the group, how many delegations would include at least 1 woman?

  Participants select three numbers between 0 and 9

For the daily lottery game in Illinois, participants select three numbers between 0 and 9. A number cannot be selected more than once, so a winning ticket could be, say, 307 but not 337. Purchasing one ticket allows you to select one set of num..

  1 hint let pi 002 as shown in the table for nofree premium

1. hint let pi 0.02 as shown in the table for nofree premium channels.a. pxlt 3 b. px 0 px 1 c. px gt 4 1 - px le4

  What steps would you take to set up an analytical analysis

What steps would you take to set up an analytical analysis?

  Question an investigator wants to assess whether smoking is

an investigator wants to assess whether smoking is a risk factor for pancreatic cancer. electronic medical records at a

  Single sample hypothesis testing - z-tests

Compute the single-sample z-test to see if caffeine reduces dreamtime. Test the null hypothesis at the .05 level of significance.

  Does this correlation imply a causal relationship

Examine pairs of variables, using the method of linear regression, to determine if there is any correlation between the variables. Afterwards, you will postulate whether this correlation reveals a causal relationship.

  A more general exponential reliability model may be defined

A more general exponential reliability model may be defined by R(t)=a^(-bt) where a>1, b>0 and a and b are parameters to be determined. Find the hazard rate function, and show how this model is equivalent to R(t)=e^-(lambda*t).

  The cije and rie parts of the eric system

What is the difference between the CIJE and RIE parts of the ERIC system?

  Calculate the sample size needed given these factors

Calculate the sample size needed given these factors

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd