K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and, if appropriate, reassigned to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B and T, respectively), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, though that shape need not be spherical.
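The reassignment procedure and the two criteria can be illustrated with a short sketch. It is a minimal illustration, not a reference implementation: it assumes NumPy, uses the standard definitions of W (sum over clusters of the cross products of deviations from the cluster mean), B (cluster sizes times the cross products of cluster-mean deviations from the overall mean) and T = W + B, and the function names `scatter_matrices`, `criterion` and `reassign_pass` are made up for this example. Recomputing the criterion from scratch for every trial move is deliberately naive and only practical for small data sets.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return the within-groups (W), between-groups (B) and total (T) matrices."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for g in np.unique(labels):
        Xg = X[labels == g]
        mean_g = Xg.mean(axis=0)
        D = Xg - mean_g                      # deviations from the cluster mean
        W += D.T @ D
        d = (mean_g - grand_mean)[:, None]   # column vector
        B += len(Xg) * (d @ d.T)
    Xc = X - grand_mean
    T = Xc.T @ Xc
    return W, B, T

def criterion(X, labels, kind="trace"):
    """Value of trace(W) or det(W) for a given partition."""
    W, _, _ = scatter_matrices(X, labels)
    # det(W) is only informative when W is nonsingular
    # (roughly: more observations than clusters plus variables).
    return np.trace(W) if kind == "trace" else np.linalg.det(W)

def reassign_pass(X, labels, k, kind="trace"):
    """One pass over the data: move each observation in turn to the
    cluster that gives the lowest criterion value, if that is an improvement."""
    labels = np.asarray(labels).copy()
    for i in range(len(labels)):
        current = labels[i]
        if np.sum(labels == current) == 1:
            continue                         # never empty a cluster
        best_g, best_val = current, criterion(X, labels, kind)
        for g in range(k):
            if g == current:
                continue
            labels[i] = g                    # trial move
            val = criterion(X, labels, kind)
            if val < best_val:
                best_g, best_val = g, val
        labels[i] = best_g                   # keep the best assignment found
    return labels

# Illustrative data: two well-separated groups and a poor initial partition.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
start = np.array([0, 0, 1, 1, 1, 0])
result = reassign_pass(X, start, k=2, kind="trace")
print(result, criterion(X, start), criterion(X, result))
```

In practice the pass would be repeated until no observation changes cluster. Passing `kind="det"` switches the same loop to the determinant criterion.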

 


