K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering the properties of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a given number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape is not necessarily spherical.
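The reassignment procedure and the two criteria can be illustrated with a minimal sketch in plain Python. It is only one possible reading of the description above, not a reference implementation: the function names (within_matrix, reassign, det2), the toy data and the starting partition are all assumptions made for illustration. W is computed as the sum over clusters of the sums of squares and cross products about each cluster mean, and the criterion is recomputed from scratch for every trial move, which is simple but slow. The determinant helper only handles two variables.

```python
def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return [sum(p[j] for p in points) / n for j in range(len(points[0]))]


def scatter(points, centre):
    """Sums of squares and cross products of `points` about `centre`."""
    d = len(centre)
    s = [[0.0] * d for _ in range(d)]
    for p in points:
        dev = [p[j] - centre[j] for j in range(d)]
        for a in range(d):
            for b in range(d):
                s[a][b] += dev[a] * dev[b]
    return s


def within_matrix(data, labels, k):
    """Within-groups matrix W: per-cluster scatter matrices added together."""
    d = len(data[0])
    w = [[0.0] * d for _ in range(d)]
    for g in range(k):
        members = [x for x, lab in zip(data, labels) if lab == g]
        if not members:
            continue
        s = scatter(members, mean(members))
        for a in range(d):
            for b in range(d):
                w[a][b] += s[a][b]
    return w


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def det2(m):
    """Determinant of a 2x2 matrix (two-variable data only)."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def reassign(data, labels, k, criterion=trace):
    """Visit each observation in turn and move it to another cluster
    whenever that strictly lowers criterion(W). Stops after a full pass
    with no move. A cluster is never emptied."""
    labels = list(labels)
    improved = True
    while improved:
        improved = False
        for i in range(len(data)):
            current = labels[i]
            if labels.count(current) == 1:
                continue  # moving the only member would empty the cluster
            best = criterion(within_matrix(data, labels, k))
            for g in range(k):
                if g == current:
                    continue
                trial = labels[:i] + [g] + labels[i + 1:]
                value = criterion(within_matrix(data, trial, k))
                if value < best - 1e-12:
                    best, labels, improved = value, trial, True
    return labels


# Hypothetical toy data: two visibly separated groups in two variables,
# with a deliberately poor initial partition into K = 2 clusters.
data = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
initial = [0, 1, 0, 1, 0, 1]

final = reassign(data, initial, 2)
print("final labels:", final)
print("trace W before:", trace(within_matrix(data, initial, 2)))
print("trace W after: ", trace(within_matrix(data, final, 2)))
```

Passing criterion=det2 instead of the default switches the sketch to the determinant criterion for two-variable data. Because T = W + B and T does not depend on the partition, minimizing trace W is equivalent to maximizing trace B.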

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd