K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Gaussian markov random field, It is the multivariate normal random vector w...

It is the multivariate normal random vector which satisfies certain conditional independence suppositions. This can be viewed as a model framework which contains a wide range of st

Matching, Matching is the method of making a study group and a comparison ...

Matching is the method of making a study group and a comparison group comparable with respect to the extraneous factors. Generally used in the retrospective studies when selecting

Maximum likelihood estimation, Maximum likelihood estimation is an estimat...

Maximum likelihood estimation is an estimation procedure involving maximization of the likelihood or the log-likelihood with respect to the parameters. Such type of estimators is

Define misspecification, Misspecification  is the term is applied to descri...

Misspecification  is the term is applied to describe the assumed statistical models which are incorrect for one of the several of reasons, for instance, using the wrong probability

Dirichlet process mixture models, The nonparametric Bayesian inference appr...

The nonparametric Bayesian inference approach to using the finite mixture distributions for modelling data suspected of the containing distinct groups of observations; this approac

Explain interim analyses, Interim analyses : An analysis made before the pl...

Interim analyses : An analysis made before the planned end of a clinical trial, typically with the aim of detecting the treatment differences at the early stage and thus preventing

Explain kolmogorov smirnov two-sample method, Kolmogorov Smirnov two-sample...

Kolmogorov Smirnov two-sample method is a distribution free technique which tests for any difference between the two populations probability distributions. The test is relied on t

Develop an algebraic linear programming model, Duck Lovers Unlimited (DLU) ...

Duck Lovers Unlimited (DLU) Inc. assembles specially configured light jet aircrafts for airborne duck hunting. The quarterly demand forecasts for the upcoming fiscal year are:

Protopathic bias, Protopathic bias is the type of bias (also called as rev...

Protopathic bias is the type of bias (also called as reverse-causality) that is a consequence of differential misclassification of the exposure related to timing of occurrence. It

Continuous variable, Continuous variable : The measurement which is not res...

Continuous variable : The measurement which is not restricted to the particular values except in so far as this is constrained by the accuracy of measuring instrument. General exam

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd