Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Demand, A monopolist firm''s demand curve is given by P:100-2q. (a) Find it...

A monopolist firm''s demand curve is given by P:100-2q. (a) Find its marginal revenue function.

Business statistics, Betting on sporting events is big business both in the...

Betting on sporting events is big business both in the US and abroad. Consider, for instance, next winter’s American football tournament known as the Superbowl. Billions of dollars

Compute the output of correlation, Q. Compute the output of correlation? ...

Q. Compute the output of correlation? The following figure shows (a) a 3-bit image of size 5-by-5 image in the square, with x and y coordinates specified, (b) a Laplacian

Draw a network diagram for this problem, The project of building a backyard...

The project of building a backyard swimming pool consists of eight major activities and has to be completed within 19 weeks. The activities and related data are given in the follow

Limitations of arithmetic mean, The calculations of arithmetic mean m...

The calculations of arithmetic mean may be simple and foolproof, but the application of the result may not be so foolproof. An arithmetic mean may not merely lack

Eliminate all of the insignificant variables, The file Midterm Data.xls ha...

The file Midterm Data.xls has a tab labeled "Many vs. S&P" which presents historical price data for several assets, a volatility condition (VIDX = 1 if the NYSE volatility is grea

Compute the roughness of several parametric densities, An approximation to ...

An approximation to the error of a Riemannian sum: where V g (a; b) is the total variation of g on [a, b] de ned by the sup over all partitions on [a, b], including (a; b

Factor analysis, Factor analysis (FA) explains variability among observed r...

Factor analysis (FA) explains variability among observed random variables in terms of fewer unobserved random variables called factors. The observed variables are expressed in

Artificial neural network, Normal 0 false false false E...

Normal 0 false false false EN-US X-NONE X-NONE

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd