K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize some predefined numerical criterion that measures, in some sense, the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering the features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape is not necessarily spherical.
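The procedure described above can be illustrated with a small sketch. It is not a reference implementation: the function names (scatter_matrices, trace_w, det_w, reassign_pass) are hypothetical, NumPy is assumed to be available, and the criterion is naively recomputed from scratch for every trial move, which is only practical for small data sets. The sketch computes W, B and T for a given partition and then makes one sweep over the observations, moving each in turn to whichever cluster lowers the chosen criterion.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return (W, B, T) for the partition of the rows of X given by labels."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centered = X - grand_mean
    T = centered.T @ centered                 # total sums of squares and cross products
    p = X.shape[1]
    W = np.zeros((p, p))                      # within-groups matrix
    B = np.zeros((p, p))                      # between-groups matrix
    for k in np.unique(labels):
        members = X[labels == k]
        group_mean = members.mean(axis=0)
        dev = members - group_mean
        W += dev.T @ dev
        shift = (group_mean - grand_mean).reshape(-1, 1)
        B += len(members) * (shift @ shift.T)
    return W, B, T

def trace_w(X, labels):
    """Criterion 1: trace of W (total within-cluster sum of squares)."""
    return np.trace(scatter_matrices(X, labels)[0])

def det_w(X, labels):
    """Criterion 2: determinant of W."""
    return np.linalg.det(scatter_matrices(X, labels)[0])

def reassign_pass(X, labels, criterion=trace_w):
    """One sweep: move each observation to the cluster that most lowers the criterion."""
    labels = np.asarray(labels).copy()
    clusters = np.unique(labels)
    for i in range(len(labels)):
        # Assumption of this sketch: never empty a cluster.
        if np.sum(labels == labels[i]) == 1:
            continue
        best_k, best_val = labels[i], criterion(X, labels)
        for k in clusters:
            if k == labels[i]:
                continue
            trial = labels.copy()
            trial[i] = k
            val = criterion(X, trial)
            if val < best_val:
                best_k, best_val = k, val
        labels[i] = best_k
    return labels

# Illustrative data: two well-separated groups, with the third point
# deliberately placed in the wrong cluster in the initial partition.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
initial = np.array([0, 0, 1, 1, 1, 1])
improved = reassign_pass(X, initial, criterion=trace_w)
print(improved, trace_w(X, initial), trace_w(X, improved))
```

In practice the sweep would be repeated until no observation changes cluster. Passing criterion=det_w switches to the second criterion; note that the determinant of W can be zero when clusters are very small relative to the number of variables, so that choice needs more observations per cluster.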

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd