Perform Association Rule discovery on the dataset using R

Assignment Help Advanced Statistics
Reference no: EM132296957

Data Engineering and Mining Assignment -

Part I: For this part, you need to explore the bank data (bankdata_csv_all.csv) and an accompanying description (bankdataDescription.doc) of the attributes and their values. The dataset contains attributes on each person's demographics and banking information in order to determine they will want to obtain the new PEP (Personal Equity Plan).

Your goal is to perform Association Rule discovery on the dataset using R.

First perform the necessary preprocessing steps required for association rule mining, specifically the id field needs to be removed and a number of numeric fields need discretization or otherwise converted to nominal.

Next, set PEP as the right hand side of the rules, and see what rules are generated.

Select the top 5 most "interesting" rules and for each specify the following:

  • Support, Confidence and Lift values
  • An explanation of the pattern and why you believe it is interesting based on the business objectives of the company.
  • Any recommendations based on the discovered rule that might help the company to better understand behavior of its customers or to develop a business opportunity.

Note that the top 5 most interesting rules are most likely not the top 5 in the strong rules. They are rules, that in addition to having high lift and confidence, also provide some non-trivial, actionable knowledge based on underlying business objectives.

To complete this assignment, write a short report describing your association rule mining process and the resulting 5 interesting rules, each with their three items of explanation and recommendations. For at least one of the rules, discuss the support, confidence and lift values and how they are interpreted in this data set.

You should write your answers as if you are working for a client who knows little about data mining. Your report should give your client some insightful and reliable suggestions on what kinds of potential buyers your client should contact, and convince your client that your suggestions are reliable based on the evidence gathered from your experiment results.

In more detail, your answers should include:

  • Description of preprocessing steps
  • Description of parameters and experiments in order to obtain strong rules
  • Give the top 5 most interesting rules and the 3 items listed above for each rule.

Attachment:- Assignment Files.rar

Reference no: EM132296957

Questions Cloud

Impact of implementing a change management system : After reviewing the material your group has prepared so far, the management team has returned with a list of five specific concerns.
Create a porters five forces analysis : MGT 5170 Applying Strategy for Managers - Nova Southeastern University - Case: Southwest Airlines strategy analysis and formulation
Considering aggregation of inbound shipments to lower costs : Wilson Industries sources from multiple suppliers and is considering the aggregation of inbound shipments to lower its costs.
How will emerging technologies impact the given industries : For the selected industry, please provide comprehensive responses to the following items: What changes will these industries have to make regarding.
Perform Association Rule discovery on the dataset using R : CISC520 Data Engineering and Mining Assignment - Your goal is to perform Association Rule discovery on the dataset using R
What type of corporate strategy was disney pursuing : Before announcing its streaming services, what type of corporate strategy was Disney pursuing? Which core competencies are shared and how?
Models of organization behavior : Imagine you have your own business and you have to make a marketing decision. Write all about “models of organization behavior”
Explain the three types of flexible budget variance : Explain the three types of flexible budget variance and why it is important for every manager to understand these variables.
Special considerations aid each other in law enforcement : Explain how planning and special considerations aid each other in law enforcement.

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Relationship between speed, flow and geometry

Write a project proposal on relationship between speed, flow and geometry on single carriageway roads.

  Logistic regression model

Compute the log-odds ratio for each group in Logistic regression model.

  Logistic regression

Foundations of Logistic Regression

  Probability and statistics

The tubes produced by a machine are defective. If six tubes are inspected at random , determine the probability that.

  Solve the linear model

o This is a linear model. If your model needs a different engine, then you need to rethink your approach to the model. Remember, there are no IF, Max, or MIN statements in linear models.

  Plan the analysis

Plan the analysis

  Quantitative analysis

State the hypotheses that you are going to test.

  Modelise as a markov chain

modelise as a markov chain

  Correlation and regression

What are the degrees of freedom for regression

  Construct a frequency distribution for payment method

Construct a frequency distribution for Payment method

  Perform simple linear regression

Perform simple linear regression

  Quality control analysis

Determining the root causes

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd