Define the objectives or goals of the data analysis

Reference no: EM132410646

Data Mining Assignment -

SCENARIO - You are an analyst for a telecommunications company that is concerned about the number of customers leaving its landline business for cable competitors. The company needs to know which customers are leaving so that it can mitigate continued customer loss. You have been asked to analyze customer data to identify why customers are leaving, and which indicators may explain their departure, so the company can make an informed plan to mitigate further loss.

REQUIREMENTS -

I: Tool Selection

Execute data extraction from the "Customer Data" web link using data mining software (Python, R, or SAS). Provide a screenshot of the code you have written and of its successful execution, along with a copy of all the extracted data.
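As an illustration of the extraction step, the following is a minimal Python sketch using only the standard library. It assumes the "Customer Data" link serves a CSV file; the URL, the column names, and the inline sample are hypothetical placeholders, not the actual data set.

```python
import csv
import io
from urllib.request import urlopen

# Hypothetical placeholder: the real "Customer Data" link is not given here.
CUSTOMER_DATA_URL = "https://example.com/customer_data.csv"


def fetch_text(url: str) -> str:
    """Download the raw CSV text from the given URL."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def parse_customers(csv_text: str) -> list[dict[str, str]]:
    """Parse CSV text into one dict per customer, keyed by the header row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


# Small inline sample (assumed columns) so parsing can be checked offline.
SAMPLE = (
    "customer_id,tenure_months,monthly_charge,churn\n"
    "1,12,29.85,No\n"
    "2,3,70.70,Yes\n"
)
sample_rows = parse_customers(SAMPLE)
```

With the real link in place, `parse_customers(fetch_text(CUSTOMER_DATA_URL))` would return the full extract. All values arrive as strings and still need type conversion during cleaning.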

A. Describe the benefits of using the tool you have chosen (Python, R, or SAS) for extracting data in this scenario.

B. Define the objectives or goals of the data analysis. Ensure that your objectives or goals are reasonable within the scope of the scenario and are represented in the available data.

C. Select a descriptive method and a nondescriptive method (i.e., predictive, classification, or probabilistic techniques) you will use to analyze the data, and explain how the methods you have selected are appropriate for the objectives or goals you have defined.
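One way to make an objective concrete is to state it as a measurable quantity, such as the churn rate overall and within customer segments. The sketch below is a descriptive summary only; the `contract` and `churn` fields and the sample records are assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical records; the field names are assumptions, not the real schema.
customers = [
    {"contract": "month-to-month", "churn": "Yes"},
    {"contract": "month-to-month", "churn": "No"},
    {"contract": "month-to-month", "churn": "Yes"},
    {"contract": "two-year", "churn": "No"},
    {"contract": "two-year", "churn": "No"},
]


def churn_rate(rows):
    """Fraction of rows whose churn flag is 'Yes' (0.0 for an empty list)."""
    if not rows:
        return 0.0
    return sum(r["churn"] == "Yes" for r in rows) / len(rows)


def churn_rate_by(rows, key):
    """Churn rate computed separately for each distinct value of `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return {value: churn_rate(group) for value, group in groups.items()}


overall = churn_rate(customers)
by_contract = churn_rate_by(customers, "contract")
```

A large gap between segment rates suggests that the segmenting variable is worth carrying into a nondescriptive (for example, classification) method.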

II: Data Exploration and Preparation

Clean the data you have extracted and save it in .xls or .xlsx format for submission. Be sure to address all necessary formatting, type conversion, and missing data.

D. Describe the target variable in the data and indicate the specific type of data the target variable is using, including examples that support your claims.

E. Describe an independent predictor variable in the data and indicate the specific type of data being described. Use examples from the data set that support your claims.

F. State the goal of manipulating the data and define your data preparation aims.

G. Define the statistical identity of the data, including the essential criteria and phenomenon to be predicted.

H. Explain the steps used to clean the data and how you addressed any anomalies or missing data.
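The cleaning steps described above might look like the following minimal sketch. It assumes a numeric `monthly_charge` column with blanks and a `churn` column with inconsistent labels; both column names, the sample rows, and the choice of median imputation are illustrative assumptions rather than requirements of the assignment.

```python
from statistics import median

# Hypothetical raw rows as they might arrive from a CSV extract.
raw_rows = [
    {"customer_id": "1", "monthly_charge": " 29.85", "churn": "no"},
    {"customer_id": "2", "monthly_charge": "", "churn": "YES "},
    {"customer_id": "3", "monthly_charge": "70.15", "churn": "No"},
]


def to_float(value):
    """Convert a text field to float; blank fields become None."""
    value = value.strip()
    return float(value) if value else None


def clean(rows):
    """Convert types, normalize the target label, and fill missing charges."""
    cleaned = [
        {
            "customer_id": int(row["customer_id"]),
            "monthly_charge": to_float(row["monthly_charge"]),
            # Normalize 'no', 'YES ', 'No' and similar labels to a boolean.
            "churn": row["churn"].strip().lower() == "yes",
        }
        for row in rows
    ]
    # Median imputation is one assumed strategy; others may fit better.
    known = [r["monthly_charge"] for r in cleaned if r["monthly_charge"] is not None]
    fill_value = median(known)
    for r in cleaned:
        if r["monthly_charge"] is None:
            r["monthly_charge"] = fill_value
    return cleaned


cleaned_rows = clean(raw_rows)
```

Writing the result to .xlsx is not shown; it would need a third-party library such as openpyxl or an export from the chosen tool.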

III: Data Analysis

For each of the following steps, clearly indicate the step within your data sheet, with a screenshot and annotations, in your final submission. All algorithms used must be clearly identified in the screenshot and the submission.

I. Identify the distribution of variables using univariate statistics from your cleaned and prepared data. Represent your findings visually as part of your submission.

J. Identify the distribution of variables using bivariate statistics from your cleaned and prepared data. Represent your findings visually as part of your submission.

K. Apply an analytic method and an evaluative method. Annotate the data showing both methods and your findings.

L. Justify the methods you have chosen to analyze your data. Be sure to include details about how the methods you have chosen represent your findings better than other methods.

M. Justify the methods you have chosen to visually present your data. Be sure to include details about how the presentation methods you chose represent your findings better than other presentation methods.
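The univariate and bivariate steps above can be sketched with the standard library alone. The rows and the `tenure_months`, `contract`, and `churn` fields are hypothetical; plotting is omitted because it depends on the tool chosen.

```python
from collections import Counter
from statistics import mean, median, stdev

# Hypothetical cleaned rows; field names are assumptions for illustration.
rows = [
    {"tenure_months": 2, "contract": "monthly", "churn": True},
    {"tenure_months": 5, "contract": "monthly", "churn": True},
    {"tenure_months": 30, "contract": "annual", "churn": False},
    {"tenure_months": 48, "contract": "annual", "churn": False},
    {"tenure_months": 10, "contract": "monthly", "churn": False},
]

# Univariate: summarize one variable at a time.
tenure = [r["tenure_months"] for r in rows]
univariate = {"mean": mean(tenure), "median": median(tenure), "stdev": stdev(tenure)}
contract_counts = Counter(r["contract"] for r in rows)

# Bivariate: relate a predictor to the target.
# Cross-tabulation of a categorical predictor against churn.
crosstab = Counter((r["contract"], r["churn"]) for r in rows)
# Mean of a numeric predictor within each churn group.
mean_tenure_by_churn = {
    flag: mean(r["tenure_months"] for r in rows if r["churn"] is flag)
    for flag in (True, False)
}
```

Counts such as `contract_counts` map naturally to a bar chart and `tenure` to a histogram, while `crosstab` maps to a grouped bar chart; these pairings are common conventions, not requirements.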

IV: Data Summary

Summarize the findings of your data evaluation. Provide the final findings dataset, including evaluation measures.
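For a classification-style analysis, the evaluation measures mentioned here are often derived from a confusion matrix. The sketch below assumes boolean churn labels and uses made-up actual and predicted values; it shows one common set of measures, not the measures the assignment necessarily expects.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for boolean labels."""
    pairs = list(zip(actual, predicted))
    return {
        "tp": sum(a and p for a, p in pairs),
        "fp": sum((not a) and p for a, p in pairs),
        "fn": sum(a and (not p) for a, p in pairs),
        "tn": sum((not a) and (not p) for a, p in pairs),
    }


def evaluation_measures(counts):
    """Accuracy, precision, and recall; ratios with a zero denominator are 0.0."""
    tp, fp, fn, tn = counts["tp"], counts["fp"], counts["fn"], counts["tn"]
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical labels: True means the customer churned.
actual = [True, True, False, False, True, False]
predicted = [True, False, False, True, True, False]

counts = confusion_counts(actual, predicted)
measures = evaluation_measures(counts)
```

In a churn setting, recall is often of particular interest because a missed churner (a false negative) is a customer the company had no chance to retain.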

N. Explain how your data shows whether or not it was discriminating, and whether the phenomenon you wanted to detect was present in your findings. Provide specific examples from the data to support your claims.

O. Describe the methods you used for detecting interactions and for selecting the most important predictor variables. Include the specific interactions you detected and the most important predictor variables that you found.

P. Acknowledge sources, using in-text citations and references, for content that is quoted, paraphrased, or summarized.



Reviews

len2410646

12/1/2019 9:11:51 PM

This is a completed assignment but needs a small tweak to get the grade. The files attached are: 1. Data mining Req (the requirement document). 2. ACE3 - Data summary (solution document submitted). 3. Data Mining Algorithms (solution document submitted). 4. Cleaned Dataset (solution document submitted). 5. Evaluation report for revision (needs work based on this information). Your submission must be your original work. No more than a combined total of 30% of the submission and no more than a 10% match to any one individual source can be directly quoted or closely paraphrased from sources, even if cited correctly. An originality report is provided when you submit your task that can be used as a guide. You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd