Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Evaluate central tendency and variability, Why are graphs and tables useful...

Why are graphs and tables useful when examining data? A researcher is comparing two middle school 7th grade classes. One class at one school has participated in an arts program

Factor loadings matrix, As we stated above, we start factor analysis with p...

As we stated above, we start factor analysis with principal component analysis, but we quickly diverge as we apply the a priori knowledge we brought to the problem. This knowled

Quantitative and qualitative methods for forecasting sales, OmegaPlus Pty...

OmegaPlus Pty.Ltd. is a chain of Health Food stores operating in Australia: with 12 stores across Sydney, Melbourne and Brisbane. OmegaPlus has recently appointed a new CEO: San

Expected average time, Question: A car was machine washes each car in 5 min...

Question: A car was machine washes each car in 5 minutes exactly. It has been estimated that customers will arrive according to a Poisson distribution at an average of 8 per hour.

#vital statistics, # I have to make assignment on vital statistics so kindl...

# I have to make assignment on vital statistics so kindly guide me how to make and get good marks

Calculate the line of best fit, The manager of Pizza Hut provides a deliver...

The manager of Pizza Hut provides a delivery service for customers who telephone in an order. The manager would like to give callers an idea of the time it will take to deliver an

Weibull distribution, slope parameter of 1.4 and scale parameter of 550.cal...

slope parameter of 1.4 and scale parameter of 550.calculate Reliability, MTTF, Variance, Design life for R of 95%

Sample, types of sampling method

types of sampling method

Determine the optimal order size, The Truly Canadian Restaurant stocks a pr...

The Truly Canadian Restaurant stocks a private red table wine that it purchases from a local winery in the Niagara Falls region. The daily demand for the wine at the restaurant is

Angle count method, Angle Count method The method for estimating the pr...

Angle Count method The method for estimating the proportion of the area of a forest which is in fact covered by the bases of trees. An observer goes to each of the number of po

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd