Describe the steps involved in data mining

Reference no: EM132145547

Problem I (Answer each part in 75-150 words, citing references but without direct quotation)

What is data mining? In your answer, address the following:

- Is it another fad?

- Of the three prerequisite data science skills (database management, statistics, and machine learning), which one(s) are most important to master?

- Explain how the evolution of database technology led to data mining.

- Describe the steps involved in data mining when viewed as a process of knowledge discovery.

Problem II

Robust data loading poses a challenge in database systems because the input data are often dirty. In many cases, an input record may have several missing values and some records could be contaminated (i.e., with some data values out of range or of a different data type than expected).

Work out a step-by-step data cleaning and loading procedure so that the erroneous data will be marked and contaminated data will not be mistakenly inserted into the database during data loading.
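
As a starting point, the checks named above (missing values, out-of-range values, wrong data types) can be sketched as a validation pass that runs before any insert. This is a minimal sketch, not a complete loading procedure: the `SCHEMA` fields and limits are hypothetical, and it assumes that any record with a problem is quarantined together with its error marks rather than loaded.

```python
# Hypothetical schema (not from the assignment): field -> (type, min, max).
SCHEMA = {
    "age": (int, 0, 120),
    "income": (float, 0.0, 1e9),
}

def validate(record):
    """Return a list of problems found in one raw record (dict of strings)."""
    problems = []
    for name, (typ, lo, hi) in SCHEMA.items():
        raw = record.get(name)
        if raw is None or str(raw).strip() == "":
            problems.append(f"{name}: missing")
            continue
        try:
            value = typ(raw)  # conversion failure means wrong data type
        except (TypeError, ValueError):
            problems.append(f"{name}: wrong type")
            continue
        if not (lo <= value <= hi):
            problems.append(f"{name}: out of range")
    return problems

def split_records(records):
    """Separate loadable records from marked (rejected) ones."""
    clean, rejected = [], []
    for rec in records:
        problems = validate(rec)
        if problems:
            rejected.append((rec, problems))  # keep the marks for review
        else:
            clean.append(rec)
    return clean, rejected
```

Only the `clean` list would be passed on to the database insert; the `rejected` list keeps each bad record next to the reasons it was marked, so it can be corrected and reloaded later.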

Problem III (Answer in 75-100 words, citing references but without direct quotation)

Outline the major steps of decision tree classification.
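
One central step in decision tree induction is choosing which attribute to split on at each node. The sketch below illustrates that single step using information gain, which is one common selection measure (gain ratio and the Gini index are alternatives). The function names and the dict-based row layout are assumptions made for illustration.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting on attribute `attr`."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    # Weighted entropy of the partitions produced by the split.
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def best_attribute(rows, labels, attrs):
    """Pick the attribute with the highest information gain."""
    return max(attrs, key=lambda a: information_gain(rows, labels, a))
```

A full tree builder would apply `best_attribute` recursively to each partition and stop when a partition is pure or no attributes remain.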

Problem IV (Answer each part in 75-100 words, citing references but without direct quotation)

a. Compare the advantages and disadvantages of eager classification (e.g., decision tree, Bayesian, neural network) versus lazy classification (e.g., k-nearest neighbor, case-based reasoning).

b. Create a hypothetical example for one of the classifiers discussed in part a.
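
To make the eager/lazy distinction concrete, here is a minimal sketch of a lazy classifier, k-nearest neighbor. Nothing is learned up front: the training data is simply stored, and all the work happens at query time. The function name, the tuple-based data layout, and the default `k=3` are assumptions for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    train: list of (feature_tuple, label) pairs.
    """
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

Each call scans the whole training set, which shows the typical trade-off: no training cost, but slower and more memory-hungry prediction than an eager model that has already been built.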

Problem V (Answer each part in 75-100 words, citing references but without direct quotation)

Association rule mining often generates a large number of rules. Name at least one effective method that can be used to reduce the number of rules generated while still preserving most of the interesting rules.
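
One commonly cited option is to generate rules only from closed frequent itemsets, that is, itemsets with no proper superset of the same support. The sketch below shows that filter, assuming the frequent itemsets and their support counts have already been mined; the function name and the dict layout are hypothetical.

```python
def closed_itemsets(supports):
    """Keep only closed itemsets.

    supports: dict mapping frozenset(items) -> support count.
    An itemset is dropped if some proper superset has equal support.
    """
    return {
        items: count
        for items, count in supports.items()
        if not any(
            items < other and count == other_count
            for other, other_count in supports.items()
        )
    }
```

Because the support of every dropped itemset can be recovered from a closed superset, this reduces the output without losing support information. Raising the minimum support or confidence, or filtering on a measure such as lift, are other ways to cut the rule count.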

Problem VI

You are a consultant working for the company "Data Mining R Us." Your client is a major luxury automobile manufacturer, Lexcedes. They have come up with a brand-new model called the "Chimera" and they want to target the car at young, filthy rich individuals.

Besides having their own company databases, Lexcedes purchased a large collection of databases containing historic information about people, their attributes, and what they buy. They want to use data mining to help sell their new model.

Describe in detail a comprehensive step-by-step data mining procedure you would follow if you were given this task. Make sure that your answer reflects the situation stated above (in other words, do not give a generic answer). State your assumptions.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd