K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Define least significant difference test, Least significant difference test...

Least significant difference test is an approach to comparing a set of means which controls the family wise error rate at some specific level, let's assume it to be α. The hypothe

Bonferroni correction, Bonferroni correction : A procedure for guarding aga...

Bonferroni correction : A procedure for guarding against the rise in the probability of a type I error when performing the multiple signi?cance tests. To maintain probability of a

Coplot, This is the powerful visualization tool for studying how the respon...

This is the powerful visualization tool for studying how the response relies on an explanatory variable given the values of other explanatory variables. The plot comprises of a num

Prognostic scoring system, Prognostic scoring system is a technique of com...

Prognostic scoring system is a technique of combining the prognostic information contained in the number of threat factors, in a manner which best predicts each patient's risk of

Describe hurdle model, Hurdle Model:  The model for count data which postul...

Hurdle Model:  The model for count data which postulates two processes, one generating the zeros in the data and one generating positive values. The binomial model decides the bina

Integrated Economic Statistics, Advantages and disadvantages of Integrated ...

Advantages and disadvantages of Integrated Economic Statistics

Linear regression assignment help, Using World Bank (2004) World Developmen...

Using World Bank (2004) World Development Indicators; Washington: International Bank for Reconstruction & Development/ The World Bank, located in the reference section of the Learn

Mosaic displays, Mosaic displays  is the graphical display of the standardi...

Mosaic displays  is the graphical display of the standardized residuals from the fitting a log-linear model to a contingency table in which the colour and outline of the mosaic's '

Falsediscoveryrate (fdr), The approach of controlling the error rate in an ...

The approach of controlling the error rate in an exploratory analysis where number of hypotheses are tested, but where the strict control which is provided by multiple comparison p

Exponential order statistics model, The model which arises in the context o...

The model which arises in the context of estimating the size of the closed population where individuals within the population could be identified only during some of the observatio

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd