Define the objectives or goals of the data analysis

Reference no: EM132320652

Instructions:

The following assignment needs to be done in either R or Python.

INTRODUCTION

One of the most critical factors in customer relationship management that directly impacts a company's long-term profitability is customer attrition. When a company can better predict if a customer is likely to cut ties, it can take a more targeted approach to mitigate customer turnover.

In this task, you will use Python, SAS, or R to analyze data for a telecommunications company (see "Customer Data" web link) and create a data mining report in a word processor (e.g., Microsoft Word). You will create visual representations throughout the submission to show each step of your work and to visually represent the findings of your data analysis.

All algorithms and visual representations used need to be captured (either in tables within the Word document or with screen shots added to the Word document) and should be submitted as part of your document for final submission.

A separate Excel (.xls or .xlsx) document of the cleaned data should be submitted along with the written aspects of the data mining report.

SCENARIO

You are an analyst for a telecommunications company that is concerned about the number of customers leaving its landline business for cable competitors. The company needs to know which customers are leaving so that it can attempt to mitigate continued customer loss. You have been asked to analyze customer data to identify why customers are leaving, along with potential indicators that explain their departure, so the company can make an informed plan to mitigate further loss.

REQUIREMENTS

Your submission must be your original work. No more than a combined total of 30% of the submission and no more than a 10% match to any one individual source can be directly quoted or closely paraphrased from sources, even if cited correctly. An originality report is provided when you submit your task that can be used as a guide.

You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.

I: Tool Selection

Execute data extraction from the "Customer Data" web link using data mining software (Python, R, or SAS). Provide a screen shot of the code you have written and its successful application with a copy of all the extracted data.
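As a rough illustration of the extraction step, the sketch below reads customer records from a CSV source using only the Python standard library. The column names (`customerID`, `tenure`, `MonthlyCharges`, `Churn`) and the sample rows are assumptions made for illustration; the actual layout of the "Customer Data" file should be checked after unpacking the attachment.

```python
import csv
import io


def load_customers(source):
    """Read customer records from an open text source into a list of dicts."""
    reader = csv.DictReader(source)
    return [dict(row) for row in reader]


# Tiny in-memory stand-in for the real file (hypothetical rows and columns).
sample = io.StringIO(
    "customerID,tenure,MonthlyCharges,Churn\n"
    "0001-A,1,29.85,No\n"
    "0002-B,34,56.95,Yes\n"
)

rows = load_customers(sample)
print(len(rows), "rows; columns:", list(rows[0].keys()))
```

With the real data, the same function could be called on a file opened with `open(path, newline="")`, where `path` points to the CSV extracted from the archive.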

Describe the benefits of using the tool you have chosen (Python, R, or SAS) for extracting data in this scenario.

Define the objectives or goals of the data analysis. Ensure that your objectives or goals are reasonable within the scope of the scenario and are represented in the available data.

Select a descriptive method and a nondescriptive method (i.e., predictive, classification, or probabilistic techniques) you will use to analyze the data, and explain how the methods you have selected are appropriate for the objectives or goals you have defined.

II: Data Exploration and Preparation

Clean the data you have extracted and save it in .xls or .xlsx format for submission. Be sure to address all necessary formatting, type conversion, and missing data.

Describe the target variable in the data and indicate the specific type of data the target variable is using, including examples that support your claims.

Describe an independent predictor variable in the data and indicate the specific type of data being described. Use examples from the data set that support your claims.

State the goal of manipulating the data and define your data preparation aims.

Define the statistical identity of the data, including the essential criteria and phenomenon to be predicted.

Explain the steps used to clean the data and how you addressed any anomalies or missing data.
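The cleaning steps described in this section might look something like the following sketch. It is one possible approach, not a prescribed one: the column names, the rule for imputing a blank `TotalCharges` value as `tenure * MonthlyCharges`, and the decision to drop duplicate or unparseable rows are all assumptions that would need to be justified in the report. Plain Python is used to keep the sketch dependency-free; writing the result to .xlsx requires a spreadsheet library and is not shown.

```python
def clean_record(raw):
    """Return a cleaned copy of one record, or None if it cannot be repaired."""
    # Strip stray whitespace; treat missing values as empty strings.
    rec = {key: (value or "").strip() for key, value in raw.items()}
    if not rec.get("customerID"):
        return None  # no identifier, so the row cannot be tracked
    try:
        rec["tenure"] = int(rec["tenure"])
        rec["MonthlyCharges"] = float(rec["MonthlyCharges"])
        if rec["TotalCharges"] == "":
            # Assumed imputation rule for a blank total (hypothetical).
            rec["TotalCharges"] = round(rec["tenure"] * rec["MonthlyCharges"], 2)
        else:
            rec["TotalCharges"] = float(rec["TotalCharges"])
    except ValueError:
        return None  # a numeric field holds non-numeric text
    churn = rec["Churn"].lower()
    if churn not in ("yes", "no"):
        return None  # target variable must be one of two known labels
    rec["Churn"] = 1 if churn == "yes" else 0  # encode the binary target
    return rec


def clean_records(raw_rows):
    """Clean all rows, dropping unrepairable rows and duplicate customer IDs."""
    seen, cleaned, dropped = set(), [], 0
    for raw in raw_rows:
        rec = clean_record(raw)
        if rec is None or rec["customerID"] in seen:
            dropped += 1
            continue
        seen.add(rec["customerID"])
        cleaned.append(rec)
    return cleaned, dropped


# Hypothetical raw rows showing each anomaly the sketch handles.
raw_rows = [
    {"customerID": " 0001-A ", "tenure": "1", "MonthlyCharges": "29.85",
     "TotalCharges": "29.85", "Churn": "No"},
    {"customerID": "0002-B", "tenure": "0", "MonthlyCharges": "52.55",
     "TotalCharges": " ", "Churn": "No"},
    {"customerID": "0003-C", "tenure": "abc", "MonthlyCharges": "70.00",
     "TotalCharges": "70.00", "Churn": "Yes"},
    {"customerID": "0001-A", "tenure": "1", "MonthlyCharges": "29.85",
     "TotalCharges": "29.85", "Churn": "No"},
]

cleaned, dropped = clean_records(raw_rows)
print(len(cleaned), "kept;", dropped, "dropped")
```

Reporting the number of dropped rows alongside the reason for each rule makes it easier to explain how anomalies and missing data were addressed.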

III: Data Analysis

For each of the following steps, be sure to clearly indicate each step within your data sheet with a screen shot and annotations in your final submission. All algorithms used need to be clearly identified in the screen shot and submission.

Identify the distribution of variables using univariate statistics from your cleaned and prepared data. Represent your findings visually as part of your submission.
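A minimal sketch of univariate statistics follows, assuming one numeric variable (`tenure`) and one categorical variable (`contract`); both names and all values are hypothetical. It uses the standard `statistics` and `collections` modules. A histogram or bar chart of the same summaries would still need to be produced with a plotting tool for the visual part of the submission.

```python
import statistics
from collections import Counter


def describe_numeric(values):
    """Summarize a numeric variable with common univariate statistics."""
    ordered = sorted(values)
    return {
        "n": len(ordered),
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "stdev": statistics.stdev(ordered),  # sample standard deviation
        "min": ordered[0],
        "max": ordered[-1],
    }


def frequency_table(values):
    """Map each category level to (count, proportion), most common first."""
    counts = Counter(values)
    total = sum(counts.values())
    return {level: (n, n / total) for level, n in counts.most_common()}


# Hypothetical values standing in for columns of the cleaned data.
tenure = [1, 34, 2, 45, 8]
contract = ["Month-to-month", "Month-to-month", "One year",
            "Two year", "Month-to-month"]

summary = describe_numeric(tenure)
freq = frequency_table(contract)
print(summary)
print(freq)
```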

Identify the distribution of variables using bivariate statistics from your cleaned and prepared data. Represent your findings visually as part of your submission.
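One simple bivariate view is the churn rate within each level of a categorical predictor. The sketch below assumes the target has already been encoded as 0/1 in a field named `Churn` and uses a hypothetical `Contract` field; the records are made up for illustration.

```python
from collections import defaultdict


def churn_rate_by(records, field):
    """Return the proportion of churned customers for each level of `field`."""
    totals = defaultdict(lambda: [0, 0])  # level -> [churned, total]
    for rec in records:
        bucket = totals[rec[field]]
        bucket[0] += rec["Churn"]
        bucket[1] += 1
    return {level: churned / total
            for level, (churned, total) in totals.items()}


# Hypothetical cleaned records (Churn: 1 = left, 0 = stayed).
records = [
    {"Contract": "Month-to-month", "Churn": 1},
    {"Contract": "Month-to-month", "Churn": 0},
    {"Contract": "Two year", "Churn": 0},
    {"Contract": "Month-to-month", "Churn": 1},
]

rates = churn_rate_by(records, "Contract")
for level, rate in rates.items():
    print(f"{level}: {rate:.0%} churned")
```

A grouped bar chart of these rates is one way to represent the relationship visually.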

Apply an analytic method and an evaluative method. Annotate the data showing both methods and your findings.
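For the evaluative method, one common choice for a binary churn prediction is a confusion matrix with the measures derived from it. The sketch below assumes the analytic method has already produced 0/1 predictions; the `actual` and `predicted` lists are hypothetical and do not come from the customer data.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for 0/1 labels."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    return tp, fp, fn, tn


def evaluation_measures(actual, predicted):
    """Compute accuracy, precision, recall, and F1 from the confusion counts."""
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    total = tp + fp + fn + tn
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical labels: 1 = churned, 0 = stayed.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

measures = evaluation_measures(actual, predicted)
print(confusion_counts(actual, predicted))
print(measures)
```

Recall is often the measure of most interest in a churn setting, since a missed churner (false negative) is a customer the company had no chance to retain.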

Justify the methods you have chosen to analyze your data. Be sure to include details about how the methods you have chosen better represent your findings than other methods.

Justify the methods you have chosen to visually present your data. Be sure to include details about how the presentation methods you chose better represent your findings than other presentation methods.

IV: Data Summary

Summarize the findings of your data evaluation. Provide the final findings dataset, including evaluation measures.

Explain how your data show whether the analysis was discriminating and whether the phenomenon you wanted to detect was present in your findings. Provide specific examples from the data to support your claims.

Describe the methods you used for detecting interactions and for selecting the most important predictor variables. Include the specific interactions you detected and the most important predictor variables that you found.

Acknowledge sources, using in-text citations and references, for content that is quoted, paraphrased, or summarized.

Attachment:- Telco-Customer-Churn.rar
