K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and, if appropriate, reassigned to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering the properties of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B and T, where T = W + B), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape is not necessarily spherical.
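The relocation procedure and the two criteria can be illustrated with a minimal pure-Python sketch. It is only an illustration under stated assumptions, not a reference implementation: the function names (within_scatter, trace, det, relocate), the tolerance values, the rule that a cluster is never emptied, and the six-point toy data set are all invented here for the example.

```python
from typing import Callable, List, Sequence, Tuple

Matrix = List[List[float]]
Point = Sequence[float]


def within_scatter(points: Sequence[Point], labels: Sequence[int], k: int) -> Matrix:
    """W: pooled within-groups matrix of sums of squares and cross products."""
    p = len(points[0])
    W = [[0.0] * p for _ in range(p)]
    for g in range(k):
        members = [x for x, lab in zip(points, labels) if lab == g]
        if not members:
            continue
        mean = [sum(col) / len(members) for col in zip(*members)]
        for x in members:
            d = [xi - mi for xi, mi in zip(x, mean)]
            for i in range(p):
                for j in range(p):
                    W[i][j] += d[i] * d[j]
    return W


def trace(M: Matrix) -> float:
    """Sum of the diagonal elements (first criterion: trace W)."""
    return sum(M[i][i] for i in range(len(M)))


def det(M: Matrix) -> float:
    """Determinant by Gaussian elimination with partial pivoting
    (second criterion: determinant W)."""
    A = [row[:] for row in M]
    n = len(A)
    result = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(A[r][c]))
        if abs(A[pivot][c]) < 1e-12:
            return 0.0
        if pivot != c:
            A[c], A[pivot] = A[pivot], A[c]
            result = -result
        result *= A[c][c]
        for r in range(c + 1, n):
            f = A[r][c] / A[c][c]
            for j in range(c, n):
                A[r][j] -= f * A[c][j]
    return result


def relocate(points: Sequence[Point], labels: Sequence[int], k: int,
             criterion: Callable[[Matrix], float],
             max_sweeps: int = 100) -> Tuple[List[int], float]:
    """Visit each observation in turn and move it to another cluster
    whenever that lowers the criterion; stop when a full sweep moves nothing."""
    labels = list(labels)
    best = criterion(within_scatter(points, labels, k))
    for _ in range(max_sweeps):
        moved = False
        for i in range(len(points)):
            home = labels[i]
            if labels.count(home) == 1:
                continue  # assumption of this sketch: never empty a cluster
            for g in range(k):
                if g == home:
                    continue
                labels[i] = g  # tentative move
                value = criterion(within_scatter(points, labels, k))
                if value < best - 1e-12:
                    best, home, moved = value, g, True
            labels[i] = home  # keep the best cluster found for observation i
        if not moved:
            break
    return labels, best


# Hypothetical toy data: two well-separated groups, deliberately mixed up
# in the initial partition.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
initial = [0, 1, 0, 1, 0, 1]
final_labels, final_value = relocate(points, initial, k=2, criterion=trace)
print(final_labels, final_value)
```

Passing criterion=det instead of criterion=trace switches to the second criterion. In this sketch the total matrix T can be obtained by treating all observations as a single group, within_scatter(points, [0] * len(points), 1), and B then follows as T minus W.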

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd