K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Expected monetary value, Ask quesoil company is considering whether or not ...

Ask quesoil company is considering whether or not to bid for an offshore drilling contract. If they bid, the value would be $600m with a 65% chance of gaining the contract. The com

Product-limit estimator, Product-limit estimator is a method for estimatin...

Product-limit estimator is a method for estimating the survival functions for the set of survival times, some of which might be censored observations. The logic behind the procedu

Curse of dimensionality, The phrase first spoken by one of the witches in M...

The phrase first spoken by one of the witches in Macbeth. Now this is used to describe the exponential rise in the number of possible locations in the multivariate space as dimensi

The f-wald test, Primary Model Below is a regression analysis without ...

Primary Model Below is a regression analysis without 17 outliers that have been removed Regression Analysis: wfood versus totexp, income, age, nk The regression equat

Behrens fisher problem, Behrens Fisher problem : The difficulty of testing ...

Behrens Fisher problem : The difficulty of testing for the equality of the means of the two normal distributions which do not have the equal variance. Various test statistics have

Cauchy distribution, Cauchy distribution : The probability distribution, f ...

Cauchy distribution : The probability distribution, f (x), can be given as follows   where α is the position of the parameter (median) and the beta β a scale parameter. Moments

Relative risk, Relative risk is the measure of the association between the...

Relative risk is the measure of the association between the exposure to a particular factor and the risk or probability of a convinced outcome, calculated as follows     therefor

Leaps-and-bounds algorithm, Leaps-and-bounds algorithm is an algorithm whi...

Leaps-and-bounds algorithm is an algorithm which is used to ?nd the optimal solution in problems which might have a large number of possible solutions. Begins by dividing the poss

Parks test, The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedastici...

The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedasticity errors and no heteroscedasticity exists The Alternative Hypothesis - H1: β 1 ≠ 0 i.e. there is no homoscedasti

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd