How and why developing and running experiments using cluster

Assignment Help Data Structure & Algorithms
Reference no: EM132358783 , Length: 7 Pages

Assignment - Descriptive Data Analytics: A Review

For the assignment, you will use the Data Analytics spreadsheet. There are two parts to this assignment.

Time for some exploring: You are a data analyst tasked to provide analysis of the data provided by the data warehouse team. You decide to use two unsupervised methods: clustering ( k-means) and association analysis (market basket analysis). This analysis will help the marketing team to group each widget into three or four different levels of products at different pricing. The market basket analysis will help operations and sales teams understand which products tend to be purchased together for potential opportunities to maximize shelf space or up-sell during the time of purchase.

Part 1 (k-means) of the assignment: Use the k-means algorithm with either Microsoft Excel or SAS on the data contained on the worksheet titled "K." Suppose the centers of each cluster are C, F, I, and L. Run the k-means algorithm for one epoch. What was the outcome of your experiment? Describe the clusters created with the experiment? What belongs to which cluster and what are the centers of the new clusters themselves?

Part 2 (basket analysis) of the assignment: You will work with the worksheet titled "Basket." Run experiments using with either Microsoft Excel or SAS, and run an association algorithm to obtain a number of combinations (itemsets) based off the data within the worksheet. Perform your analysis on the output of your association algorithm. What are the most common itemsets? What is the number of items within the top five itemsets? What is the average value of the top five itemsets? What is the total value of the tickets that contain items in the top five itemsets? What can a business analyst do with this information? Why is this information interesting? Is this information useful to solving a business problem?

Write a paper (5-7 pages for the body section). Use APA (6th edition) style and format; include your analysis and answers to the exercises in parts 1 and 2 above, with a minimum of five references; and cover the following topics:

1. Explain how and why developing and running experiments using cluster and association algorithms can help organizations solve business problems and improve data and information accuracy.

2. Explain why you selected and used the analytical software tool for your experiments and how the tool is useful for data analytics.

3. Demonstrate how analytical and statistical methods, processes, and tools are used to help decision makers make better decisions.

4. Demonstrate how analytical and statistical tools are used to aggregate data into information and knowledge with analysis and experimentation.

Assignment Requirements - Written communication is free of errors that detract from the overall message. Resources and citations are formatted according to APA (6th edition) style and formatting. Total 5-7 pages, excluding the references page.

Note - This assignment analysis can be done in either excel or SAS.

Attachment:- Descriptive Data Analytics Assignment Files.rar

Reference no: EM132358783

Questions Cloud

Professionals and investigators use digital forensic methods : Law enforcement professionals and investigators use digital forensic methods to solve crimes every day. Locate one current news article
Understand computer architecture and networking : A digital forensics professional must know basic IT skills, understand computer architecture and networking, and have analytical and investigative skills,
Explain human factors in achieving technical goals : Melbourne Institute of Technology- MN503 Overview of Internetworking - Network Requirement Analysis and Plan. Explain human factors in achieving technical goals
Identify trend in information systems-technology supported : Each student will identify a trend in Information Systems and Technology supported by three pieces of research to support why you think it is a trend
How and why developing and running experiments using cluster : Assignment - Descriptive Data Analytics: A Review - Explain how and why developing and running experiments using cluster
The power is disrupted again in the future due to hurricane : Create a short guide to keep business going if the power is disrupted again in the future due to hurricane. What procedures would you take to fulfill the order?
About benefits of cloud computing applications : Even with this great news about benefits of cloud computing applications, authors have warned business user community regarding dangers associated with cloud
Examine nursing theories related to advanced nursing roles : Question - Examine nursing theories related to advanced nursing roles
The inevitable tendency to shortcut the procedure : Is this a reasonable burden to place on a busy, competitive company? How would you argue against the inevitable tendency to shortcut the procedure?

Reviews

len2358783

8/19/2019 10:38:14 PM

Assignment Requirements - Written communication: Written communication is free of errors that detract from the overall message. APA formatting: Resources and citations are formatted according to APA (6th edition) style and formatting. Length of paper: 5-7 pages, excluding the references page. Font and font size: Times New Roman, 12 point. Please review the uploaded files for instructions and guidelines. THIS ASSIGNMENT ANALYSIS CAN BE DONE IN EITHER EXCEL OR SAS. IF THE TUTOR CHOOSES SAS PLEASE LET ME KNOW AND I WILL PROVIDE ACCESS TO THE SAS PROGRAM. The word document has the instructions there is also an excel document that needs to be used for the analysis part 1 is for the first TAB and Part 2 is for the second tab in excel. The PDF file is for grading guidelines also at least 5 references are needed with the assignment.

len2358783

8/19/2019 10:38:08 PM

Scoring Guide – Explains comprehensively how and why developing and running experiments using cluster and association algorithms can help organizations solve business problems and improve data and information accuracy. Comprehensively demonstrates how analytical and statistical tools are used to aggregate data into information and knowledge with analysis and experimentation. Comprehensively explains rationale for selecting and using the analytical software tool for the experiments and how the tool is useful for data analytics. Comprehensively demonstrates how analytical and statistical methods, processes, and tools are used to help decision makers make better decisions. Exhibits high level of proficiency in writing, critical thinking, and use of APA (6th edition) formatting of references and citations.

Write a Review

Data Structure & Algorithms Questions & Answers

  Develop a program that can cross-reference welfare

Develop a program that can cross-reference welfare and tax records. Unfortunately, this data is in separate databases, with the welfare data sorted by name, and the tax data sorted by tax file number.

  Discuss the two-way partitioning algorithm

Give an algorithm that performs a three-way in-place partition of an N element subarray using only N- I three-way comparisons.

  Maintain the set of campers enrolled in camp posanivee

Campers are enrolling and withdrawing from camp faster than her primitive filing system can handle, and she has turned to you. You have been offered free meals at the mess hall in return for a program that will help her keep track of who is enroll..

  Create an entity relationship diagram

you need to create an Entity Relationship (ER) diagram relevant to the above case study and perform logical design to produce appropriate 3NF Relations

  How your bucket should look like after you finished the step

Here is how your bucket should look like after you finished the step. Notice, you have left border, right border, and bottom border, and everything inside is empty string.

  Write algorithm to reverse elemens in queue

Using basic queue and stack operationns, write algorithm to reverse elemens in the queue. Suppose that 'Stack' is class described in section with 'StackType' set to int and STACK_CAPACITY

  How is a pert chart useful?

How is a Pert chart useful? How is a Gantt chart useful? What are the differences and similarities between both?

  Find min returns the minimum key in the search tree

Find min returns the minimum key in the search tree, find min obj returns the object belonging to the minimum key,

  Describe a binary tree as an empty tree

A binary tree is a special kind of rooted tree that has some additional structure that makes it tremendously useful as a data structure.

  Design an algorithm that switches two arbitrary trucks

Design an algorithm that switches two arbitrary trucks (not necessarily at distance at most k) in O(n/k) truck swaps.

  What is the time complexity of running the below bubblesort

Show a simple modification that can be made to the below bubblesort that significantly improves the time complexity for an array of sequential integers.

  Describe np-complete problem is vertex colourability problem

Another NP-complete problem is the 3-vertex colourability problem. In order to obtain a proper coloring of a graph G = (V,E), three colors are assigned.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd