Implement a simple k-means method, Applied Statistics

Assignment Help:

You are given an unclassified data set that contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and identifies groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. Your program must also support the Euclidean, City Block (Manhattan), Squared Euclidean and Chebyshev distances. You are free to use any kind of weights (for features or data instances) in the program if necessary.

2. The archive contains the unlabeled data set test.txt and the initial centroids data set centroids.txt. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set contains 350 samples and the initial centroids set contains 15 samples. Each data instance in both files has 90 attributes.
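
As a starting point, the two tasks above could be sketched in Python with NumPy as follows. This is a minimal sketch under stated assumptions, not a reference solution: it assumes each line of test.txt and centroids.txt holds 90 whitespace-separated numbers with no literal brackets, and the function names (load_matrix, pairwise_distances, kmeans) and the max_iter parameter are hypothetical. It uses no weights, and it updates each centroid as the mean of its members for every distance; the mean minimizes the squared Euclidean criterion, so for City Block a per-attribute median may be the better update.

```python
import numpy as np


def load_matrix(path, n_attributes=90):
    """Load a whitespace-separated numeric file (assumed format) as a 2-D array."""
    data = np.loadtxt(path, ndmin=2)
    if data.shape[1] != n_attributes:
        raise ValueError(f"expected {n_attributes} attributes, got {data.shape[1]}")
    return data


def pairwise_distances(X, C, metric="euclidean"):
    """Distance from every row of X (n, d) to every row of C (k, d); returns (n, k)."""
    X = np.asarray(X, dtype=float)
    C = np.asarray(C, dtype=float)
    diff = X[:, None, :] - C[None, :, :]  # shape (n, k, d)
    if metric == "euclidean":
        return np.sqrt((diff ** 2).sum(axis=2))
    if metric == "sqeuclidean":
        return (diff ** 2).sum(axis=2)
    if metric == "cityblock":
        return np.abs(diff).sum(axis=2)
    if metric == "chebyshev":
        return np.abs(diff).max(axis=2)
    raise ValueError(f"unknown metric: {metric}")


def kmeans(X, init_centroids, metric="euclidean", max_iter=100):
    """Plain K-means from given initial centroids; returns (labels, centroids)."""
    X = np.asarray(X, dtype=float)
    C = np.asarray(init_centroids, dtype=float).copy()
    labels = None
    for _ in range(max_iter):
        # Assignment step: each sample goes to its nearest centroid.
        new_labels = pairwise_distances(X, C, metric).argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(C.shape[0]):
            members = X[labels == j]
            if len(members) > 0:  # an empty cluster keeps its old centroid
                C[j] = members.mean(axis=0)
    return labels, C


# Intended use with the assignment files (assumed to be in the working directory):
#   X = load_matrix("test.txt")        # expected shape (350, 90)
#   C0 = load_matrix("centroids.txt")  # expected shape (15, 90)
#   labels, C = kmeans(X, C0, metric="chebyshev")
```

Running kmeans once per metric from the same initial centroids makes it easy to compare how the choice of distance changes the resulting groups.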



