Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Sequential sampling, Sequential Sampling Under this method, a number of...

Sequential Sampling Under this method, a number of sample lots are drawn one after another from a universe depending on the results of the earlier samples. Such sampling is gen

Weighted arithmetic mean, Weighted Arithmetic Mean Another aspect...

Weighted Arithmetic Mean Another aspect to be considered is the importance we assign to each observation. The arithmetic mean as we calculated it so far gives equal

Correlation coefficient, Consider three stocks A, B and C costing $100 each...

Consider three stocks A, B and C costing $100 each. The annual returns on the three stocks have mean $5 and variance $10. a. Suppose that the returns on the three stocks are i.i

Initial centroids data set, Find unlabeled data set test.txt and initial ...

Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ...

Econometrics, Ask question From the household budget survey of 1980 of the...

Ask question From the household budget survey of 1980 of the Dutch Central Bureau of Statistics, J. S. Cramer obtained the following logit model based on a sample of 2820 househol

Quantitative Models, Consider the following new business venture. An agent ...

Consider the following new business venture. An agent is considering investment in one of three real estate parcels: • Option 1: multiunit rentals • Option 2: commercial building

Systematic random sampling, Systematic Random Sampling This method  is ...

Systematic Random Sampling This method  is generally used in such cases where a complete list of the population is available from which sample has to be selected. Under this

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd