Implement a simple k-means method, Applied Statistics

Assignment Help:

An unclassified data set contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis to reveal that structure and identify groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. The program must also support the Euclidean, City Block, Squared Euclidean and Chebyshev distances. You are free to use weights (for features or data instances) in the program if necessary.

2. The archive contains the unlabeled data set test.txt and the initial centroids data set centroids.txt. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.
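The two items above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not a reference solution: the names `parse_rows`, `kmeans` and `DISTANCES` are hypothetical, the rows are assumed to be whitespace-separated with optional surrounding brackets, centroids are always updated with the arithmetic mean regardless of the chosen distance, a cluster that loses all its members keeps its previous centroid, and no weights are applied.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Vector = Sequence[float]


def euclidean(a: Vector, b: Vector) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def squared_euclidean(a: Vector, b: Vector) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def city_block(a: Vector, b: Vector) -> float:
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a: Vector, b: Vector) -> float:
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical lookup table so the metric can be chosen by name.
DISTANCES: Dict[str, Callable[[Vector, Vector], float]] = {
    "euclidean": euclidean,
    "squared_euclidean": squared_euclidean,
    "city_block": city_block,
    "chebyshev": chebyshev,
}


def parse_rows(text: str, n_attributes: int = 90) -> List[List[float]]:
    """Parse one sample per line. Assumes whitespace-separated values;
    surrounding square brackets are tolerated in case they are literal."""
    rows: List[List[float]] = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        cleaned = line.strip().strip("[]")
        if not cleaned:
            continue  # skip blank lines
        values = [float(token) for token in cleaned.split()]
        if len(values) != n_attributes:
            raise ValueError(
                f"line {line_no}: expected {n_attributes} values, got {len(values)}"
            )
        rows.append(values)
    return rows


def kmeans(
    points: Sequence[Vector],
    centroids: Sequence[Vector],
    metric: str = "euclidean",
    max_iter: int = 100,
) -> Tuple[List[int], List[List[float]]]:
    """Alternate assignment and mean-update steps until labels stop changing."""
    dist = DISTANCES[metric]
    centers = [list(c) for c in centroids]
    labels: List[int] = []
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid under the chosen metric.
        new_labels = [
            min(range(len(centers)), key=lambda j: dist(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: per-attribute mean of each cluster's members.
        for j in range(len(centers)):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:  # assumption: an empty cluster keeps its old centroid
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers
```

With the assignment files, one would read each file's text, pass it to `parse_rows`, and check that test.txt yields 350 rows and centroids.txt yields 15 rows before calling `kmeans(points, centroids, metric="chebyshev")` or any of the other three metric names. Note that the mean update strictly minimizes only the squared Euclidean objective; with the other distances this remains a simple heuristic variant.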




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd