Reference no: EM131126188
General assignment: Predictive and Prescriptive data analytics. You should develop and validate predictive models (regression, classification, clustering - using one or more of the methods covered in class to date or one of your choosing) for two of the five datasets below and apply them for decision purposes. Please use the section numbering below for your written submission for this assignment. References - websites, papers, packages, data refs, etc,
https://archive.ics.uci.edu/ml/datasets/Bank+Marketing,
https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29,
https://archive.ics.uci.edu/ml/datasets/Wine+Quality,
https://archive.ics.uci.edu/ml/datasets/Communities+and+Crime.
1. Exploratory Data Analysis
Explore the statistical aspects of both datasets. Analyze the distributions and provide summaries of the relevant statistics. Perform any cleaning, transformations, interpolations, smoothing, outlier detection/ removal, etc. required on the data. Include figures and descriptions of this exploration and a short description of what you concluded (e.g. nature of distribution, indication of suitable model approaches you would try, etc.). Min. 3/4 page text + graphics (required).
2. Model Development, Validation, Optimization and Tuning
Choose one or more models. Explain why you chose them. Construct the models, test and validate them. Explain the validation approach. You can use any method(s) covered in the course. Compare model results if applicable. Report the results of the model fits (coefficients, graphs, trees, etc.), predictors, and statistics. Min. 3 page text + graphics (required).
3. Decisions
Describe your conclusions in regard to the model fit, prediction and how well (or not) it could be used for decisions and why. Min. 3/4 page text + graphics.
Show the structures and names of all reactants and products
: Draw out the sequence of reactions showing how yeast cells synthesize the net production of L-malate from pyruvate under anaerobic conditions without any net NADH production.
|
A summary of cash flows for impeccable travel service
: Prepare a statement of cash flows for Impeccable Travel Service for the year ended November 30, 2010.
|
What is the nature of your business
: An example of a non-financial goal: "Starbucks will diversify its product lines to achieve 30 percent of sales revenue in latte products in the next three years."
|
Chymotrypsin mutated able to increase the rate of peptide
: why chymotrypsin mutated for all three amino acids in the active site catalytic triad is still able to increase the rate of peptide bond hydrolysis by approximately 50,000-fold.
|
Explore the statistical aspects of both datasets
: Choose one or more models. Explain why you chose them. Construct the models, test and validate them. Explain the validation approach. You can use any method(s) covered in the course. Compare model results if applicable.
|
Prepare the balance sheet as of 30 november 2010
: Using the data for Impeccable Travel Service shown in Practice Exercises 1-4A and 1-5A, prepare the balance sheet as of November 30, 2010.
|
Conaway company purchased a machine for cash
: Conaway Company purchased a machine for cash on January 1, 1998. The price was $31,000. In addition, Conaway incurred costs of $200 in transporting the machine to the factory site and a further $800 in installing the machine.
|
Using the data for express travel service shown in practice
: Janis Paisley invested an additional $30,000 in the business during the year and withdrew cash of $18,000 for personal use.
|
Did the pahler court use the same reasoning
: Recall the difference between a crime and a tort. Based on these two cases, analyze and discuss whether artists should be held liable for the actions of their fans.
|