K-means cluster analysis, Advanced Statistics

K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation is examined in turn and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B and T respectively, where T = W + B), which can be defined for every partition of the observations into a given number of groups. The two most common clustering criteria derived from these matrices are:

minimization of the trace of W

minimization of the determinant of W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.
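The relocation procedure and the two criteria can be illustrated with a short sketch. This is a minimal illustration rather than a reference implementation: the function names `scatter_matrices` and `relocate`, the toy data and the deliberately poor starting partition are all assumptions made for the example, and the criterion is recomputed from scratch after every trial move, which is simple but slow.

```python
import numpy as np


def scatter_matrices(X, labels):
    """Return (W, B, T): within-groups, between-groups and total
    sums-of-squares-and-cross-products matrices for a partition."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centered = X - grand_mean
    T = centered.T @ centered
    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for g in np.unique(labels):
        Xg = X[labels == g]
        mean_g = Xg.mean(axis=0)
        dev = Xg - mean_g
        W += dev.T @ dev
        d = (mean_g - grand_mean)[:, None]
        B += len(Xg) * (d @ d.T)
    return W, B, T


def relocate(X, labels, criterion=np.trace, max_passes=20):
    """Visit each observation in turn and move it to another cluster
    whenever that lowers criterion(W). Hypothetical helper for illustration."""
    labels = np.asarray(labels).copy()
    groups = np.unique(labels)
    best = criterion(scatter_matrices(X, labels)[0])
    for _ in range(max_passes):
        moved = False
        for i in range(len(labels)):
            current = labels[i]
            if np.sum(labels == current) == 1:
                continue  # do not empty a cluster
            for g in groups:
                if g == current:
                    continue
                labels[i] = g  # trial move
                value = criterion(scatter_matrices(X, labels)[0])
                if value < best - 1e-12:
                    best, current, moved = value, g, True
                labels[i] = current  # keep the best assignment so far
        if not moved:
            break  # a full pass with no reassignment: stop
    return labels, best


# Toy data (made up for illustration): two compact groups of four points.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [8, 8], [8, 9], [9, 8], [9, 9]], dtype=float)
initial = np.array([0, 1, 0, 1, 0, 1, 0, 1])  # deliberately poor start

final_labels, final_trace = relocate(X, initial, criterion=np.trace)
W, B, T = scatter_matrices(X, final_labels)
print(final_labels, final_trace)
```

Passing `criterion=np.linalg.det` instead of `np.trace` switches the sketch to the determinant criterion. That choice assumes W stays non-singular for the partitions visited, which a practical implementation would need to check.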

 



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd