Implement a simple k-means method, Applied Statistics

Assignment Help:

You are given an unclassified data set that contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and identifies groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. The program must also support the Euclidean, City Block, Squared Euclidean and Chebyshev distances. You are free to use any kind of weights (per feature or per data instance) in the program if necessary.

2. The archive contains the unlabeled data set test.txt and the initial centroids data set centroids.txt. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.
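
The two tasks above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not a reference solution: the names load_points, kmeans and DISTANCES are hypothetical, the files are assumed to hold one instance per line with whitespace-separated values (optionally wrapped in square brackets), and the centroid update uses the attribute-wise mean for every distance, which is the usual choice for Squared Euclidean but only a simplification for the other three.

```python
import math

# The four distances named in task 1; each takes two equal-length sequences.
def euclidean_squared(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(euclidean_squared(a, b))

def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

DISTANCES = {
    "euclidean": euclidean,
    "euclidean_squared": euclidean_squared,
    "city_block": city_block,
    "chebyshev": chebyshev,
}

def load_points(path):
    """Read one instance per line (assumed layout: whitespace-separated
    values, brackets and commas tolerated) into a list of float lists."""
    points = []
    with open(path) as fh:
        for line in fh:
            cleaned = line.replace("[", " ").replace("]", " ").replace(",", " ")
            fields = cleaned.split()
            if fields:  # skip blank lines
                points.append([float(v) for v in fields])
    return points

def kmeans(points, centroids, metric="euclidean", max_iter=100):
    """Assign each point to its nearest centroid, then move each centroid
    to the mean of its members; stop when assignments no longer change."""
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]  # do not mutate the caller's data
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

With the files from the archive in the working directory, a call such as kmeans(load_points("test.txt"), load_points("centroids.txt"), metric="chebyshev") would return one cluster label per sample and the 15 final centroids. For the City Block distance, replacing the mean with the attribute-wise median is a common variant, since the median minimizes the sum of absolute deviations.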




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd