K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize some predefined numerical criterion that measures, in some sense, the 'quality' of the cluster solution. Many such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria arising from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape is not necessarily spherical.
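As an illustration only, the reassignment idea and the two criteria can be sketched in Python with NumPy. The function names `scatter_matrices` and `reassign_pass`, the toy data and the starting partition are assumptions made for this example and do not come from any particular library. The sketch recomputes W from scratch for every trial move, which is simple but slow; practical implementations update cluster means incrementally.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return the within-groups (W), between-groups (B) and total (T)
    matrices of sums of squares and cross products for a partition."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centred = X - grand_mean
    T = centred.T @ centred
    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for k in np.unique(labels):
        members = X[labels == k]
        mean_k = members.mean(axis=0)
        dev = members - mean_k
        W += dev.T @ dev
        diff = (mean_k - grand_mean)[:, None]
        B += len(members) * (diff @ diff.T)
    return W, B, T

def reassign_pass(X, labels, criterion):
    """One sweep over the observations: move each one to the cluster
    that gives the smallest value of criterion(W), if that is an
    improvement. Observations alone in their cluster are left in place
    so that no cluster becomes empty."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels).copy()
    clusters = np.unique(labels)
    for i in range(len(X)):
        if np.sum(labels == labels[i]) == 1:
            continue
        best_k = labels[i]
        best_val = criterion(scatter_matrices(X, labels)[0])
        for k in clusters:
            if k == labels[i]:
                continue
            trial = labels.copy()
            trial[i] = k
            val = criterion(scatter_matrices(X, trial)[0])
            if val < best_val:
                best_k, best_val = k, val
        labels[i] = best_k
    return labels

# Toy data (assumed): two unit squares, with the point (1, 1)
# deliberately placed in the wrong cluster to start with.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [5, 5], [5, 6], [6, 5], [6, 6]], dtype=float)
initial = np.array([0, 0, 0, 1, 1, 1, 1, 1])

new_labels = reassign_pass(X, initial, np.trace)  # minimize trace W
W, B, T = scatter_matrices(X, new_labels)

print(new_labels)            # partition after one sweep
print(np.trace(W))           # value of the trace criterion
print(np.allclose(T, W + B)) # the identity T = W + B
```

Passing `np.linalg.det` instead of `np.trace` as `criterion` switches the sweep to the determinant criterion. In practice the sweep would be repeated until no observation changes cluster.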

 


