How the evolution of database technology led to data mining

Reference no: EM131888930

Assignment

All responses should be of sufficient depth and detail. Answer the questions succinctly and clearly, and explain your answers. Use references, but do not quote anyone else; use your own words.

Problem I

What is data mining? In your answer, address the following:

- Is it another fad?

- Of the three prerequisite data science skills (database management, statistics, and machine learning), which are the most important to master?

- Explain how the evolution of database technology led to data mining.

- Describe the steps involved in data mining when viewed as a process of knowledge discovery.

Problem II

Robust data loading poses a challenge in database systems because the input data are often dirty. In many cases, an input record may have several missing values and some records could be contaminated (i.e., with some data values out of range or of a different data type than expected). Work out a step-by-step data cleaning and loading procedure so that the erroneous data will be marked and contaminated data will not be mistakenly inserted into the database during data loading.
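One possible shape for the validation step is sketched below. It is a minimal illustration, not a prescribed solution: the `SCHEMA` table, its field names and ranges, and the `validate` and `split_records` helpers are all hypothetical, and it assumes each raw record arrives as a dictionary of strings. Treating a missing value as grounds for rejection is also an assumption; a real procedure might instead mark the value and load the rest of the record.

```python
# Hypothetical schema: field name -> (expected type, (low, high) bounds or None).
SCHEMA = {
    "age": (int, (0, 120)),
    "income": (float, (0.0, None)),
    "name": (str, None),
}


def validate(record):
    """Return a list of problems found in one raw record (a dict of strings)."""
    problems = []
    for name, (typ, bounds) in SCHEMA.items():
        raw = record.get(name)
        if raw is None or raw.strip() == "":
            problems.append(f"{name}: missing")
            continue
        try:
            value = typ(raw)  # type check: conversion fails on a wrong data type
        except ValueError:
            problems.append(f"{name}: expected {typ.__name__}")
            continue
        if bounds is not None:
            low, high = bounds
            too_low = low is not None and value < low
            too_high = high is not None and value > high
            if too_low or too_high:
                problems.append(f"{name}: out of range")
    return problems


def split_records(records):
    """Separate loadable records from marked (rejected) ones."""
    clean, rejected = [], []
    for rec in records:
        problems = validate(rec)
        if problems:
            rejected.append((rec, problems))  # keep the reasons with the record
        else:
            clean.append(rec)
    return clean, rejected
```

Only the `clean` list would be passed on to the database insert; the `rejected` list keeps each erroneous record together with the reasons it was marked, so it can be reviewed or repaired later.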

Problem III

Outline the major steps of decision tree classification.
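The attribute-selection step at the heart of decision tree induction can be sketched as follows. This is a minimal illustration that assumes categorical attributes and uses information gain as the splitting criterion (other criteria, such as the Gini index or gain ratio, are equally valid). The function names and the row format (a list of dictionaries) are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Reduction in entropy obtained by splitting on one categorical attribute."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels, attrs):
    """Pick the attribute with the highest information gain for the next split."""
    return max(attrs, key=lambda a: information_gain(rows, labels, a))
```

A full tree builder would call `best_attribute` recursively on each partition until the labels are pure or no attributes remain, and would usually add a pruning step afterwards.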

Problem IV

a. Compare the advantages and disadvantages of eager classification (e.g., decision tree, Bayesian, neural network) versus lazy classification (e.g., k-nearest neighbor, case-based reasoning).

b. Create a hypothetical example for one of the classifiers discussed in part a.
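To make the lazy side of the comparison concrete, here is a minimal k-nearest-neighbor sketch. It assumes numeric feature tuples, Euclidean distance, and a simple majority vote. The `knn_predict` name and the training-data layout are hypothetical, and `math.dist` requires Python 3.8 or later.

```python
import math
from collections import Counter


def knn_predict(training, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `training` is a list of (feature_tuple, label) pairs. No model is built
    ahead of time: all the work happens here, at query time.
    """
    nearest = sorted(training, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

The sketch shows the trade-off in miniature: there is no training cost at all, but every prediction scans and sorts the entire training set, whereas an eager classifier pays the cost once up front and then predicts cheaply.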

Problem V

Association rule mining often generates a large number of rules. Name at least one effective method that can be used to reduce the number of rules generated while still preserving most of the interesting rules.
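As one illustration of rule reduction, the sketch below filters candidate rules with an interestingness measure (lift) on top of a minimum confidence. It is only one of several possible methods (mining closed or maximal itemsets is another), and the function names, thresholds, and data layout (transactions as sets of items, rules as antecedent/consequent pairs) are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def filter_rules(transactions, rules, min_conf=0.6, min_lift=1.0):
    """Keep rules that meet min_conf and whose lift is strictly above min_lift."""
    kept = []
    for antecedent, consequent in rules:
        sup_a = support(transactions, antecedent)
        sup_c = support(transactions, consequent)
        if sup_a == 0 or sup_c == 0:
            continue  # rule cannot be evaluated on this data
        sup_both = support(transactions, set(antecedent) | set(consequent))
        conf = sup_both / sup_a
        lift = conf / sup_c  # > 1 means a positive correlation
        if conf >= min_conf and lift > min_lift:
            kept.append((antecedent, consequent, conf, lift))
    return kept
```

A rule such as "milk implies bread" can have perfect confidence simply because bread appears in almost every transaction; requiring lift above 1 discards such rules while keeping those whose antecedent raises the likelihood of the consequent.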

Problem VI

You are a consultant working for the company "Data Mining R Us." Your client is a major luxury automobile manufacturer, Lexcedes. The company has developed a brand-new model called the "Chimera" and wants to target it at young, filthy rich individuals. Besides having its own company databases, Lexcedes has purchased a large collection of databases containing historical information about people, their attributes, and what they buy. The company wants to use data mining to help sell its new model.

Describe in detail a comprehensive step-by-step data mining procedure you would follow if you were given this task. Make sure that your answer reflects the situation stated above (in other words, do not give a generic answer). State your assumptions.






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd