Plot the data on a scatter plot

Assignment Help Database Management System
Reference no: EM131450351

Assignment: Data Analysis (Cluster Analysis)

1. Included with this assignment is an Excel spreadsheet that contains data with two dimension values.

The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et. al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.
2. Determine the ideal number of clusters.
3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
4. Using a standard distance formula measure the distance from each data point to each center point.
5. Assign each data point to an initial cluster region based on closeness.
6. For each cluster calculate new center points.
7. Repeat steps 4 through 6.

You will use Excel to help with calculations, but only standard functions should be used (i.e. don't use a plug-in to perform the analysis for you.) You need to show your work doing this analysis the long way. If you were to repeat steps 4 through 6, what will likely happen with the cluster centroids? The rubric for this assignment can be viewed when clicking on the assignment link.

Here is a link to an example spreadsheet using a smaller data set. It contains two tabs. The first tab is the raw data. The second tab contains the analysis that was performed. Make sure that you use a different starting center points from the example.

Attachment:- cluster_analysis_example.xlsx

Reference no: EM131450351

Questions Cloud

Explain uses and limitations-reliability modelling technique : Developing a preventive maintenance program involves following various processes. You need to understand the objectives of the processes.
What is the first-order approximation of the probability : Economics 5120, Spring 2017 Assignment. During a small time interval ?t, what is the first-order approximation of the probability of an unemployed worker
Customer segment research or go directly to market : Should the firm conduct customer segment research or go directly to market?
Critically analyses the concept of employee engagement : SHR012-6 Leading and Managing People (SHR012-6) - Demonstrate critical knowledge and understanding around key and contemporary debates about theory and practice in the specific field of employee engagement.
Plot the data on a scatter plot : Determine the ideal number of clusters. Choose random center points (centroids) for each cluster. Plot the data on a scatter plot.
Describe what you feel will be one or two key economic : Describe what you feel will be one or two key economic and social issues to be debated at the 2016 Presidential elections in the United States
Which legal structure-sole proprietorship and partnership : Which legal structure, sole proprietorship, partnership, corporation or a form of partnership or corporation do you think is best for a new business and why.
Write a paper on the file system of your choice : Write a 3 pages (maximum) paper on the file system of your choice that is not FAT32. It must also follow all rules for grammar and spelling.
What is revenue for the firms and what is gross margin : What is revenue for the firms? What is gross margin?

Reviews

Write a Review

Database Management System Questions & Answers

  Knowledge and data warehousing

Design a dimensional model for analysing Purchases for Adventure Works Cycles and implement it as cubes using SQL Server Analysis Services. The AdventureWorks OLTP sample database is the data source for you BI analysis.

  Design a database schema

Design a Database schema

  Entity-relationship diagram

Create an entity-relationship diagram and design accompanying table layout using sound relational modeling practices and concepts.

  Implement a database of courses and students for a school

Implement a database of courses and students for a school.

  Prepare the e-r diagram for the movie database

Energy in the home, personal energy use and home energy efficiency and Efficient use of ‘waste' heat and renewable heat sources

  Design relation schemas for the entire database

Design relation schemas for the entire database.

  Prepare the relational schema for database

Prepare the relational schema for database

  Data modeling and normalization

Data Modeling and Normalization

  Use cases perform a requirements analysis for the case study

Use Cases Perform a requirements analysis for the Case Study

  Knowledge and data warehousing

Knowledge and Data Warehousing

  Stack and queue data structure

Identify and explain the differences between a stack and a queue data structure

  Practice on topic of normalization

Practice on topic of Normalization

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd