K-means cluster analysis, Advanced Statistics

K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering the within-groups, between-groups and total matrices of sums of squares and cross-products (W, B, T), which can be defined for every partition of the observations into a given number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.
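To make the two criteria concrete, the following is a minimal sketch in Python with NumPy, not a full K-means implementation. It computes W, B and T for a given partition and compares trace W and determinant W for two candidate partitions of the same data. The function name `scatter_matrices`, the toy data and both label vectors are hypothetical and chosen only for illustration; the sketch also relies on the standard identity T = W + B.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return (W, B, T) for data X (n rows, p columns) and cluster labels.

    W: within-groups, B: between-groups, T: total matrices of
    sums of squares and cross-products.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)

    # Total matrix: deviations of every observation from the grand mean.
    centered = X - grand_mean
    T = centered.T @ centered

    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for k in np.unique(labels):
        Xk = X[labels == k]
        mean_k = Xk.mean(axis=0)
        # Within-groups part: deviations from the cluster's own mean.
        dev = Xk - mean_k
        W += dev.T @ dev
        # Between-groups part: cluster mean versus grand mean, weighted by size.
        d = (mean_k - grand_mean)[:, None]
        B += len(Xk) * (d @ d.T)
    return W, B, T

# Hypothetical toy data: two visibly separated groups of three points.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]])
good = np.array([0, 0, 0, 1, 1, 1])  # partition that follows the groups
bad = np.array([0, 1, 0, 1, 0, 1])   # partition that mixes the groups

W_good, B_good, T = scatter_matrices(X, good)
W_bad, B_bad, _ = scatter_matrices(X, bad)

print("trace W (good, bad):", np.trace(W_good), np.trace(W_bad))
print("det W   (good, bad):", np.linalg.det(W_good), np.linalg.det(W_bad))
```

On this toy data both criteria are smaller for the partition that follows the two groups. A reassignment procedure of the kind described above would move an observation to another cluster whenever doing so lowers the chosen criterion.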

 


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd