Reference no: EM13342401
Your team has been assigned to work on a supermarket (fictitiously named as SIM_SuperMart) data to look at the transactional patterns of existing customers who are members of their customer loyalty programme. You have been given a dataset that contains purchases of different products when customers shop at SIM_SuperMart. The data is record in a CSV (Comma Separated Values) file named Jul14_ANL305_SIM_SuperMart.csv. Each column in a record shows whether a product category is purchased by a customer or not, during a shopping trip. A flag variable is being used to denote whether the product category is being purchased (denoted as 't') or not (denoted as 'f').
a) This assignment requires you and your team to analyse the given dataset using the IBM SPSS Modeller. You are required to prepare a report that addresses the following tasks and questions:
(a) Identify a potential business problem that may be addressed by applying association analysis on the given dataset. Discuss possible insights that may be derived from the analysis and how that may help address the business problem.
b) Using a Var. File node, read the Jul14_ANL305_SIM_SuperMart.csv data file into the IBM SPSS Modeler. Include the following table in you report, and use it to justify your settings for the Measurement and Role of each field for association analysis.
Assumption: Each variable attribute may be an antecedent in one rule and maybe a consequent in another rule.
(c) Analyse the properties of this dataset using the Data Audit node. In this dataset, five of the product categories contain only one unique value throughout all records. Discuss how these five product categories can be identified using the Data Audit node. Do not attempt to filter out these attributes yet.
Attachment:- SuperMart.csv