Determine the ideal number of clusters

Assignment Help Basic Statistics
Reference no: EM131447692

Assignemnt

Included with this assignment is an Excel spreadsheet that contains data with two dimension values.

The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et. al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.

2. Determine the ideal number of clusters.

3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)

4. Using a standard distance formula measure the distance from each data point to each center point.

5. Assign each data point to an initial cluster region based on closeness.

6. For each cluster calculate new center points.

7. Repeat steps 4 through 6.

You will use Excel to help with calculations, but only standard functions should be used (i.e. don't use a plug-in to perform the analysis for you.) You need to show your work doing this analysis the long way. If you were to repeat steps 4 through 6, what will likely happen with the cluster centroids? The rubric for this assignment can be viewed when clicking on the assignment link.

Here is a link to an example spreadsheet using a smaller data set. It contains two tabs. The first tab is the raw data. The second tab contains the analysis that was performed. Make sure that you use a different starting center points from the example.

Attachment:- Cluster_Data.xlsx

Reference no: EM131447692

Questions Cloud

Describe best way to present each person employment history : Planning a Résumé, If you haven't begun your professional career yet or you are pursuing a career change, the employment history section on your résumé can sometimes be a challenge to write. A brainstorming session with your wise and creative clas..
Develop the rest of your presentation : You are now ready to develop the rest of your presentation. Use research to explain your selected topic, and develop the content for the presentation with a strong conclusion.
Develop the skills to master practices of marketing : Develop the skills to master the following course competencies:Apply theories, models, and practices of marketing.Integrate marketing analyses into general business management planning and decision making.Communicate in a manner that is professional ..
How different would this calculation look for a worker : How different would this calculation look for a worker who earned $500,000 and lived in Vermont? This worker would face a state income tax rate of 9.5 percent and a federal income tax rate of 35 percent.
Determine the ideal number of clusters : Determine the ideal number of clusters. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
Before and after marriage : A woman marries her butler. She paid him $60,000 a year. They get married. She earns one million per year, both before and after marriage.
Develop your main idea with adequate and relevant support : Write an essay that defines what it is to be a man or woman. Use examples from your experience to help define the concept.
Places on each bottle of water : It is hot day, and Bert is thirsty. Here is the value he places on each bottle of water:
Why are businesses interested in bop markets : Define BOP markets.  Why are businesses interested in BOP markets?  What are some examples of products developed to profitable serve BOP markets?  Identify and explain 4 challenges of serving BOP markets

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd