Reference no: EM132440199
Data Mining Text book: Introduction to data mining 2nd ed. Boston: Pearson, 2019: pang-ning tan, michael steinbach, vipin kumar
1. Discuss the importance of preprocessing the datasets to ensure better data quality for data mining techniques. Give an example from your own personal experience.
2. Discuss the advantages and disadvantages of using sampling to reduce the number of data objects that need to be displayed. Would simple random sampling (without replacement) be a good approach to sampling? Why or why not?
3. Discuss the major issues in classification model overfitting. Give some examples to illustrate your points.
Organization Leadership & Decision-Making Text book: James D. McKeen, Heather A. Smith, IT Strategy: Issues and Practices, Third Edition. Pearson, 2015, ISBN-13 978-0-13-354424-4.
4. Read the RR Communications Case Study in the textbook.
5. Read the Nationstate Case Study on pages in the textbook.