Implement a simple k-means method, Applied Statistics

Assignment Help:

You are given an unclassified data set that contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and the groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. Your program must also support the Euclidean, City Block, Squared Euclidean and Chebyshev distances. You are free to use any kind of weights (for features or data instances) in the program if necessary.

2. Find the unlabeled data set test.txt and the initial centroids data set centroids.txt in the archive. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set contains 350 samples and the initial centroids set contains 15 samples. Data instances in both files have 90 attributes.
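For step 1, the four distances and the clustering loop might be sketched as follows. This is a minimal illustration in plain Python, not a reference solution: the function names, the `DISTANCES` table, the `metric` and `max_iter` parameters, and the choice to leave an empty cluster's centroid unchanged are all assumptions made here. Note also that the centroid update uses the arithmetic mean for every metric, which is the exact minimizer only for the squared Euclidean distance; with the other distances it is a common simplification. Weights are not used.

```python
import math

# The four distances named in the assignment. Each takes two
# equal-length sequences of real numbers.
def euclidean_squared(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(euclidean_squared(a, b))

def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# Hypothetical lookup table so the metric can be chosen by name.
DISTANCES = {
    "euclidean": euclidean,
    "euclidean_squared": euclidean_squared,
    "city_block": city_block,
    "chebyshev": chebyshev,
}

def kmeans(points, centroids, metric="euclidean", max_iter=100):
    """Simple k-means; returns (final_centroids, labels)."""
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]  # copy, do not mutate the input
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for each point.
        new_labels = [
            min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments did not change, so the loop has converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        # Assumption: an empty cluster keeps its previous centroid.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels
```

A call such as `kmeans(data, initial_centroids, metric="chebyshev")` would then run the same loop with a different distance, where `data` and `initial_centroids` are lists of 90-value rows.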
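For step 2, reading the two files could look like the sketch below. It assumes that each line holds one data instance with whitespace-separated values, and that the square brackets in the format description may or may not appear literally in the file, so they are stripped if present. The helper names `parse_rows` and `load_rows` are hypothetical, and the actual files may need a different separator.

```python
def parse_rows(lines, n_attributes=90):
    """Turn text lines into lists of floats, one list per data instance."""
    rows = []
    for line in lines:
        # Drop surrounding whitespace and optional literal brackets.
        cleaned = line.strip().strip("[]")
        if not cleaned:
            continue  # skip blank lines
        values = [float(token) for token in cleaned.split()]
        if len(values) != n_attributes:
            raise ValueError(
                f"expected {n_attributes} attributes, got {len(values)}"
            )
        rows.append(values)
    return rows

def load_rows(path, n_attributes=90):
    """Read a file such as test.txt or centroids.txt into a list of rows."""
    with open(path) as handle:
        return parse_rows(handle, n_attributes)
```

After loading, checking that `len(load_rows("test.txt"))` is 350 and `len(load_rows("centroids.txt"))` is 15 is a quick way to confirm the files match the description.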


