K-means cluster analysis, Advanced Statistics

K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross-products (W, B, T), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria arising from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.
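The reassignment procedure described above can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not a reference implementation: the function names (`scatter_matrices`, `reassign_pass`, `exchange_kmeans`), the eight-point data set and the alternating initial partition are all invented for the example, and the criterion is recomputed from scratch for every candidate move, which is simple but slow.

```python
import numpy as np

def scatter_matrices(X, labels, k):
    """Return (W, B, T): within-groups, between-groups and total
    matrices of sums of squares and cross-products for a partition."""
    centered = X - X.mean(axis=0)
    T = centered.T @ centered
    W = np.zeros_like(T)
    for g in range(k):
        Xg = X[labels == g]
        if len(Xg):  # an empty group contributes nothing to W
            d = Xg - Xg.mean(axis=0)
            W += d.T @ d
    return W, T - W, T  # B follows from T = W + B

def reassign_pass(X, labels, k, criterion):
    """Visit each observation in turn and move it to another cluster
    only if that strictly lowers the criterion evaluated on W."""
    labels = labels.copy()
    changed = False
    for i in range(len(X)):
        current = labels[i]
        best_g = current
        best_val = criterion(scatter_matrices(X, labels, k)[0])
        for g in range(k):
            if g == current:
                continue
            labels[i] = g  # tentative move
            val = criterion(scatter_matrices(X, labels, k)[0])
            if val < best_val - 1e-12:
                best_g, best_val = g, val
        labels[i] = best_g  # keep the best move, or restore the original
        changed = changed or best_g != current
    return labels, changed

def exchange_kmeans(X, init_labels, k, criterion=np.trace, max_passes=50):
    """Repeat reassignment passes until no observation moves."""
    labels = np.asarray(init_labels).copy()
    for _ in range(max_passes):
        labels, changed = reassign_pass(X, labels, k, criterion)
        if not changed:
            break
    return labels

# Hypothetical data: two unit squares, far apart, with a poor initial partition.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [8, 8], [8, 9], [9, 8], [9, 9]], dtype=float)
init = np.array([0, 1, 0, 1, 0, 1, 0, 1])

labels = exchange_kmeans(X, init, k=2)            # minimizes trace(W)
W, B, T = scatter_matrices(X, labels, 2)
print(labels, np.trace(W))
```

Passing `criterion=np.linalg.det` instead of the default `np.trace` switches the sketch to the second criterion, minimization of the determinant of W.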

 


Related Discussions:- K-means cluster analysis

Morbidity, Morbidity is the term used in epidemiological studies to de...

Morbidity is the term used in epidemiological studies to describe sickness in human populations. The WHO Expert Committee on Health Statistics noted in its sixth repor

Explain longitudinal data, Longitudinal data : The data arising when each o...

Longitudinal data : The data arising when each of a number of subjects or patients gives rise to a vector of measurements representing the same variable observed at a number of di

Cohort component method, Cohort component method : A broadly used method or...

Cohort component method : A widely used method or technique of forecasting the age- and sex-specific population in upcoming years, in which the initial population is stratified

Doane's rule, A rule for computing the number of classes to use while cons...

A rule for computing the number of classes to use when constructing a histogram, given by a formula in which n is the sample size and γ̂ is the estimate of kurtosis.

Directed acyclic graph, Formal graphical representation of the "causal diag...

Formal graphical representation of the "causal diagrams" or the "path diagrams" where the relationships are directed but acyclic (that is, no feedback relations are allowed). Plays an

Decision tree analysis, The finance manager of ‘Softy’ baby soap...

The finance manager of ‘Softy’ baby soap manufacturing company, having been successful in the first two years of the company’s operations, is considering setting up another plan

Particle filters, Particle filters are a simulation method for tracking movin...

Particle filters are a simulation method for tracking moving target distributions and for reducing the computational burden of dynamic Bayesian analysis. The method uses a Markov ch

Quitting ill effect, Quitting ill effect is a problem which occurs most fre...

Quitting ill effect is a problem which occurs most frequently in studies of smoking cessation, where smokers frequently quit smoking following the onset of the disease symptoms

Negative binomial distribution, Negative binomial distribution is the prob...

Negative binomial distribution is the probability distribution of the number of failures, X, before the kth success in a sequence of Bernoulli trials where the probability of succes

Completeness, Completeness : A term applied to a statistic t when there is ...

Completeness : A term applied to a statistic t when there is only one function of that statistic which can have a given expected value. If, for instance, the one function of


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd