Reference no: EM131408471
1) Consider again the churn dataset. Create two learning curves (using WEKA) of the out of sample AUC on the test set (churn_test.arfff) using both logistic regression and the decision tree J48 (just go with the default settings). In particular, starting from the full training set, after each iteration, reduce the training set to half until you reach less than 100 examples. Provide a plot with both curves (copy the data into EXCEL and create the charts) .
• You can cut the dataset in half easily in Weka. In the Preprocess tab, in the box marked Filter, click on Choose. Under weka->filters->unsupervised->instance you will see RemovePercentage. (Normally, it is a good idea first to run the filter Randomize, to make sure that you are removing the data randomly; real data often will be sorted based on some attribute, which can result in throwing away many data items with similar values. Don't Randomize for this assignment; the data for this assignment already will be randomized.)
• The Undo button on the preprocess tab will undo the preprocessing (like Randomizing, RemovePercentage, etc.). Keep an eye on the data statistics (like the number of instances) in the preprocess tab to verify.
2) Create a fitting curve of the generalization AUC for decision trees as a function of the MinNumObj parameter. First change the option ‘unpruned' to ‘true'. Provide a plot of the parameter and the resulting out of sample performance using either cross validation or a training/test split. What does the parameter do? What is the optimal selection for the parameter?
3) Repeat the same experiment as in step 1, but setting minnumObj=100 and unpruned=TRUE. How does the learning curve of the decision tree change? What do you infer from this result?
Attachment:- Assignment.rar
What is included in nike''s balance sheet cash account
: Compute the change in NIKE's current ratio and working capital from 2008 to 2009. Which accounts are the most important in explaining that change?
|
Implement to insure compliance
: 1. Outline and briefly explain three action items you would recommend the company implement to insure compliance with both the Ontario Human Rights Code and the Employment Equity Act.
|
Problem regarding the cost of living
: The City of St. Albans has a unionized police force that is coming up for a contract renewal. The police have one issue: the cost of living increases. During the past 10 years, police officers have received minimal cost of living increases, and th..
|
What trends have affected malls
: This video describes the problems of suburban regional and superregional shopping centers. while malls were attractive for 50 years, they have fallen out of favor with many shoppers, leaving shopping center developers with significant challeng..
|
Create two learning curves of the out of sample auc
: Create two learning curves of the out of sample AUC on the test set using both logistic regression and the decision tree J48 (just go with the default settings). In particular, starting from the full training set, after each iteration, reduce t..
|
Discuss the factors that influence internal pay structures
: 1) Discuss the factors that influence internal pay structures. Based on your own experiences, which ones do you think are the most important? Why?
|
What would be ge’s 2008 inventory balance
: What would be GE's 2008 inventory balance if it used the FIFO assumption instead? Why is the disclosure of the LIFO reserve useful to financial statement users?
|
Public perception of an unethical organization
: How do you think unethical behavior affects employee productivity and morale? How about public perception of an unethical organization?
|
Compute the inventory purchases made by hp
: In its 2008 annual report, Hewlett-Packard reported beginning inventory of $8.0 billion, ending inventory of $7.9 billion on the balance sheet, and cost of goods sold of $69.3 billion on the income statement. Compute the inventory purchases made b..
|