K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

General location model, The model for data containing continuous and catego...

The model for data containing continuous and categorical variables both.The categorical data are summarized by the contingency table and their marginal distribution, 182by the mult

Average age at death, Average age at death : A ?awed statistic summarizing ...

Average age at death : A ?awed statistic summarizing expectancy of the life and other aspects of the mortality. For instance, a study comparing average age at the death for male sy

Factor, The term used in a variety of methods in statistics, but mostly to ...

The term used in a variety of methods in statistics, but mostly to refer to the categorical variable, with a less number of levels, under examination in an experiment as a possible

Fiducial inference, The problematic and enigmatic theory of an inference in...

The problematic and enigmatic theory of an inference introduced by the Fisher, which extracts a probability distribution for the parameter on the basis of the data without having f

Exponential order statistics model, The model which arises in the context o...

The model which arises in the context of estimating the size of the closed population where individuals within the population could be identified only during some of the observatio

Alternative hypotheses and spss calculation, 1) Question on the first day q...

1) Question on the first day questionnaire asked students to rate their response to the question Are you deeply moved by the arts or music? Assume the population that is sampled

Accelerated life testing, Normal 0 false false false EN...

Normal 0 false false false EN-US X-NONE X-NONE

Ordinal variable, Ordinal variable is a measurement which allows a sample ...

Ordinal variable is a measurement which allows a sample of the individuals to be ranked with respect to some characteristic but where differences at different points of the scale

Regression to the mean, Regression to the mean is the procedure first note...

Regression to the mean is the procedure first noted by Sir Francis Galton that 'each peculiarity in man is shared by his kinsmen, but on average to the less degree.' Hence the ten

Cure models, Models for the analysis of the survival times, or the time to ...

Models for the analysis of the survival times, or the time to event, data in which it is expected that a fraction of the subjects will not experience the event of interest. In a cl

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd