Review the k-means clustering algorithm

Reference no: EM131290697

Data Assignment

Included with this assignment is an Excel spreadsheet that contains data points, each with two dimension values.

The purpose of this assignment is to demonstrate the steps performed in a k-means cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.
2. Determine the ideal number of clusters.
3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
4. Using a standard distance formula, measure the distance from each data point to each center point.
5. Assign each data point to an initial cluster region based on its closest center point.
6. For each cluster, calculate a new center point.
7. Repeat steps 4 through 6.
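
For orientation only, one pass of steps 4 through 6 can be sketched in Python. This is a minimal illustration, not the required Excel solution: the data points, starting centroids, and function names are made up for the example, and Euclidean distance is assumed as the "standard distance formula". In a spreadsheet, the same calculations would typically be built from standard functions such as SQRT and AVERAGEIF.

```python
import math

def euclidean(p, q):
    """Step 4: straight-line distance between two 2-D points."""
    return math.sqrt((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2)

def assign_clusters(points, centroids):
    """Step 5: for each point, the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda k: euclidean(p, centroids[k]))
        for p in points
    ]

def update_centroids(points, labels, centroids):
    """Step 6: each new centroid is the mean of the points assigned to it.
    A cluster with no points keeps its old centroid."""
    updated = []
    for k, old in enumerate(centroids):
        members = [p for p, label in zip(points, labels) if label == k]
        if members:
            mean_x = sum(x for x, _ in members) / len(members)
            mean_y = sum(y for _, y in members) / len(members)
            updated.append((mean_x, mean_y))
        else:
            updated.append(old)
    return updated

# Hypothetical data and starting centroids (not taken from Cluster_Data.xlsx).
points = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centroids = [(0.0, 0.0), (10.0, 10.0)]

labels = assign_clusters(points, centroids)
centroids = update_centroids(points, labels, centroids)
print(labels)
print(centroids)
```

Each function corresponds to one column block in a spreadsheet layout: one distance column per centroid, one column holding the assigned cluster, and a small table of per-cluster averages.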

You will use Excel to help with calculations, but only standard functions should be used (i.e., do not use a plug-in to perform the analysis for you). You need to show your work by doing this analysis the long way. If you were to repeat steps 4 through 6, what would likely happen to the cluster centroids? The rubric for this assignment can be viewed by clicking on the assignment link.
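
One way to explore the question about repeating steps 4 through 6 is to record the centroids after every pass. The Python sketch below does this under stated assumptions: the points and the deliberately poor starting centroids are hypothetical, Euclidean distance is assumed, and `kmeans` is an illustrative name rather than a library function. It stops when a pass leaves the centroids unchanged.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Repeat steps 4-6 and return the list of centroids after each pass."""
    centroids = list(centroids)
    history = [centroids]
    for _ in range(max_iter):
        # Steps 4 and 5: assign every point to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Step 6: recompute each centroid as the mean of its members.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, label in zip(points, labels) if label == k]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(old)  # empty cluster keeps its centroid
        history.append(new_centroids)
        if new_centroids == centroids:  # nothing moved, so stop repeating
            break
        centroids = new_centroids
    return history

# Hypothetical data; both starting centroids sit inside the same group.
points = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
history = kmeans(points, [(1.0, 1.0), (2.0, 1.0)])
for step, cents in enumerate(history):
    print(step, cents)
```

In this toy run the centroids shift noticeably on the first pass, less on the second, and not at all on the third. A different data set or different starting points may need more passes or settle on different final positions.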

Here is a link to an example spreadsheet using a smaller data set. It contains two tabs. The first tab is the raw data. The second tab contains the analysis that was performed. Make sure that you use different starting center points from those in the example.

Attachment:- Cluster_Data.xlsx

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd