ITNPBD6 Data Analytics Assignment

Assignment Help Other Subject
Reference no: EM132850065 , Length: word count:3000

ITNPBD6 Data Analytics - University of Stirling

World Of Bargains Store Performance

Project Objective: Develop logistic regression, decision tree and neural network models that will identify whether stores will perform well or poorly.

Context:
Ivor Buquetlowd, the owner of a chain of over 100 shops in the UK, would like you to help grow his business effectively. The shops are all similar, but they make amounts of revenue varying from £1 million to nearly £5 million per year. He would like a computerised system to analyse the performance of his existing shops to help choose new locations for shops.

Your Task:

You must develop logistic regression, decision tree and neural network models that will identify whether stores will perform well or poorly. You can use Orange, Python, R, or any data mining package of your choice. The data for the assignment is in a file storedata.csv included with this document. The datasets contains the details of 136 stores. The data describes the following aspects of each store:
• Town
• Store ID
• Manager name
• Staff numbers
• Floor Space and Window Space
• Car park (yes or no)
• Demographic score
• Location (Shopping Centre, High Street, Retail Park)
• 40 min, 30 min, 20 min and 10 min drive time population size
• Store age
• Clearance space in store
• Competition number (how many competing stores are near ours)
• Competition score (from how good the competing stores are)
• Performance: Whether Ivor regards the store's profitability as "Good" or "Bad"

Feature 'Performance' is the response variable. You must construct a model that can accurately predict ‘Performance' from the other features.

Submission Requirement:
You should hand in a report describing the modelling process you followed and your results. You should attempt to frame the problem in the form of CRISP-DM framework to better facilitate the discussion. Refer to the relevant CRISP-DM stages at reach stage of your report. You do not need to submit code or data. The report is worth 100 marks in total and must cover the following:

Introduction
Describe the task you were given: is it classification or regression?; describe the data you received and the requirements of the finished system, including why data mining is suitable for this task. Define any terminology that you will use in the report (for example, model, variable, task, etc.).

Data Summary
List the variables that you found in the file provided by the company. For each one, say whether it should be treated as categorical or numeric; nominal, ordinal, continuous or discrete; and whether or not it is likely to be of use in building the solution. Explain your decisions: if you rule out any variables at this stage, you can justify your choice using summary statistics, or a histogram plot of its distribution.

Data Preparation
Describe what you did with the data prior to the modelling process. Show histograms of the data before and after any pre-processing that you carried out. (you do not need to give histograms of all variables, just the ones that need some cleaning) If you corrected any mis-typed or corrupted entries in the data, report what you changed, such as any rules you used, or examples of specific data points that were cleaned.

Modelling
You must use three different techniques and build models with each: these should include one tree-based model, one based on logistic regression, and one based on neural networks. Try to make each model perform as well as it can: if you varied the hyperparameters of a model, show which hyperparameters you varied and how this impacted on the results. Describe how you split the data for training, validation and testing purposes. Be methodical and record each result. This stage is a little like scientific research - you are carrying out experiments in your search for the best solution. Once you have a solution, show how you verified its robustness. For the three different techniques report on their comparative ability to predict store performance, but only select a single model for the final test.

Don't try to find a perfect or extremely accurate model - one does not exist! We are interested in the procedure you followed and the justification you give for choosing particular model types/parameters/features.

Results and Errors
Analyse and describe the level of accuracy the model achieves and the errors your model makes. Show a confusion matrix for each model. Are there any areas of the data where it performs worse than in others, and are there any types of error that World of Bargins would want to avoid more than others? Show a lift curve or a ROC curve for the decision as to whether or not a shop might be profitable and explain what it tells you.

Attachment:- Data Analytics Assignment.rar

Reference no: EM132850065

Questions Cloud

How much will each annual payment be : The savings account pays 4.76 percent per year, compounded annually. How much will each annual payment be
What should have been the exchange rate in January : The Argentine peso was fixed through a currency board at Ps1.00?/$ throughout the 1990s. What should have been the exchange rate in January
How much will the preferred and common shareholders receive : How much will the preferred and common shareholders receive under each of the following independent assumptions
Prepare multiple-step income statement for the month ended : Jan. 28 Collected the amounts due from customers for the January 12 transaction. Prepare multiple-step income statement for the month ended
ITNPBD6 Data Analytics Assignment : ITNPBD6 Data Analytics Assignment Help and Solution, University of Stirling - Assessment Writing Service - Develop logistic regression, decision tree
Describe the contract and how it was created : This could be anything - your cell phone contract, credit card agreement, Instagram account, etc. Describe the contract and how it was created
Journalize the transaction on the books of both companies : The cost of the goods is $470. Both companies use perpetual inventory systems. Journalize the transaction on the books of both companies
Analyze asymmetric and symmetric encryption : you will analyze asymmetric and symmetric encryption.
How many glasses of beer does he have to sell : If Terry sells only beer, how many glasses of beer does he have to sell each month to make a monthly profit of $500

Reviews

Write a Review

Other Subject Questions & Answers

  Develop a pecha kucha presentation based on the theme

Develop a Pecha Kucha presentation based on the chosen theme and issue(s) discussed in their research paper.

  How has this company handled the ethical implications

How has this company handled the ethical implications of its product with a focus on social responsibility, integrity and business ethics?

  What are the steps to using the method explain in detail

What are the steps to using this method? Explain in detail. Provide an example of one study where it has been used. Summarize the study and how it used your.

  Identify the article that best supports nursing intervention

Identify the article that best supports nursing interventions for your topic. Explain why this article best supports your topic as you compare the.

  How does the hendricks case relate to ochberg article

In his article "Quarantine Them beyond Their Jail Terms," Frank M. Ochberg suggested that some violent offenders are incurable and should be confined for life.

  Do you use nominalizations often

Write a few paragraphs on what you notice about your own writing style. Do you use nominalizations often? Is your writing overly wordy and complex?

  Social pressures and intergroup relations

Social Pressures and Intergroup Relations

  Reflect on the background and relevant facts of the case

Reflect on the background and relevant facts of the case for which the statement was prepared. In a minimum of 600 words, briefly describe the background.

  Explain the importance of professional commitment

Explain the importance of professional commitment in developing patient education as a clinical skill.

  Determining literature on group work and group therapy

Psychiatric mental health nursing practice is one of the newest disciplines to be licensed to provide psychotherapy As such, the majority of psychotherapy.

  Describe the procedures used

Describe the procedures used for each of the following response classes: tantrums, bedtime problems, wearing glasses, throwing glasses and verbal behavior.

  Carefully checked all the facts and your attitudes

When you've carefully checked all the facts and your attitudes and still find that there's "just something" about your supervisor that's causing a problem in your relationship, you should suspect that

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd