K-means cluster analysis, Advanced Statistics

K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Many such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a particular number of groups; for any partition, T = W + B. The two most common clustering criteria arising from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.
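The reassignment procedure described above can be illustrated with a minimal sketch. It assumes NumPy is available and takes W to be the sum, over clusters, of the cross products of the observations centred on their own cluster mean. The function names (within_scatter, trace_w, det_w, reassign) and the toy data are hypothetical, chosen only for illustration. The criterion is recomputed from scratch for every trial move, which is simple but slow, and the search is greedy, so it may stop at a local optimum rather than the best partition. This is not the implementation used by any particular statistics package.

```python
import numpy as np


def within_scatter(X, labels, k):
    """Within-groups sums of squares and cross products matrix W."""
    p = X.shape[1]
    W = np.zeros((p, p))
    for g in range(k):
        members = X[labels == g]
        if len(members) == 0:
            continue
        centered = members - members.mean(axis=0)
        W += centered.T @ centered
    return W


def trace_w(X, labels, k):
    """Criterion 1: trace of W (total within-cluster sum of squares)."""
    return np.trace(within_scatter(X, labels, k))


def det_w(X, labels, k):
    """Criterion 2: determinant of W."""
    return np.linalg.det(within_scatter(X, labels, k))


def reassign(X, labels, k, criterion=trace_w, max_sweeps=100):
    """Examine each observation in turn and move it to another cluster
    whenever that lowers the criterion; stop after a sweep with no moves."""
    labels = np.array(labels, dtype=int)  # work on a copy
    best = criterion(X, labels, k)
    for _ in range(max_sweeps):
        moved = False
        for i in range(len(X)):
            home = labels[i]
            if np.sum(labels == home) == 1:
                continue  # never empty a cluster
            for g in range(k):
                if g == home:
                    continue
                labels[i] = g  # trial move
                value = criterion(X, labels, k)
                if value < best - 1e-12:
                    best, home, moved = value, g, True
            labels[i] = home  # keep the best cluster found for i
        if not moved:
            break
    return labels, best


# Toy data (hypothetical): two tight groups and a deliberately poor start.
X = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
              [5.0, 5.0], [5.1, 5.2], [5.2, 5.1]])
initial = [0, 1, 0, 1, 0, 1]
labels, value = reassign(X, initial, k=2, criterion=trace_w)
print(labels, value)
```

Passing criterion=det_w instead switches the same loop to the determinant criterion; nothing else in the sketch needs to change, which mirrors the way both criteria are derived from the same matrix W.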

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd