Implement a simple k-means method, Applied Statistics

Assignment Help:

An unclassified data set contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and identifies groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. Your program must also support the Euclidean, City Block, Euclidean Squared, and Chebyshev distances. You are free to use any kind of weights (for features or data instances) in the program if necessary.

2. The archive contains the unlabeled data set test.txt and the initial centroids data set centroids.txt. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.
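The two tasks above could be approached along the following lines. This is only a minimal sketch in plain Python, not a reference solution. The names `DISTANCES`, `kmeans`, and `parse_rows` are hypothetical. The parser assumes one sample per line with whitespace-separated values and optional surrounding square brackets, which is a guess about the file layout. Centroids are updated as the cluster mean for every metric, which is the standard K-means step but is only the exact minimizer for Euclidean Squared distance. No weights are used.

```python
import math

def euclidean_squared(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(euclidean_squared(a, b))

def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# Hypothetical lookup table so the metric can be chosen by name.
DISTANCES = {
    "euclidean": euclidean,
    "euclidean_squared": euclidean_squared,
    "city_block": city_block,
    "chebyshev": chebyshev,
}

def kmeans(points, centroids, metric="euclidean", max_iter=100):
    """Return (labels, centroids) after iterating until assignments stop changing."""
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]  # copy, so the caller's data is untouched
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for each point.
        new_labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: mean of the members; an empty cluster keeps its old centroid.
        for k in range(len(centroids)):
            members = [p for p, label in zip(points, labels) if label == k]
            if members:
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def parse_rows(text, n_attributes=90):
    """Parse one sample per line; the layout is an assumption (see lead-in)."""
    rows = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        line = line.strip().strip("[]")
        if not line:
            continue  # skip blank lines
        values = [float(tok) for tok in line.split()]
        if len(values) != n_attributes:
            raise ValueError(
                f"line {line_no}: expected {n_attributes} values, got {len(values)}"
            )
        rows.append(values)
    return rows
```

With the files from the archive, one might read each with `parse_rows(open("test.txt").read())` and `parse_rows(open("centroids.txt").read())`, check that the results have 350 and 15 rows respectively, and then call `kmeans(samples, initial_centroids, metric="chebyshev")` once per distance to compare the resulting groupings.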




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd