K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize some predefined numerical criterion that measures, in some sense, the 'quality' of the cluster solution. Many such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a particular number of groups and which satisfy T = W + B. The two most common clustering criteria arising from these matrices are as follows:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.
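To make the two criteria concrete, here is a minimal Python sketch using NumPy. It computes W, B and T for a given partition and then performs one reassignment sweep that moves an observation only when the move lowers the chosen criterion. The function names (scatter_matrices, reassign_pass), the toy data and the rule of never emptying a cluster are assumptions made for illustration; they are not taken from the text above, and this naive sweep recomputes W from scratch rather than using any efficient update.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return (W, B, T): within-groups, between-groups and total
    sums-of-squares-and-cross-products matrices for a partition."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centered = X - grand_mean
    T = centered.T @ centered
    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for k in np.unique(labels):
        group = X[labels == k]
        group_mean = group.mean(axis=0)
        dev = group - group_mean
        W += dev.T @ dev                      # scatter around the group mean
        d = (group_mean - grand_mean)[:, None]
        B += len(group) * (d @ d.T)           # scatter of group means
    return W, B, T

def reassign_pass(X, labels, criterion):
    """One sweep over the observations (hypothetical helper): move each
    one to the cluster that most lowers criterion(W), if any does."""
    X = np.asarray(X, dtype=float)
    labels = np.array(labels)                 # work on a copy
    clusters = np.unique(labels)
    for i in range(len(X)):
        if np.sum(labels == labels[i]) == 1:
            continue                          # assumption: never empty a cluster
        best_k = labels[i]
        best_val = criterion(scatter_matrices(X, labels)[0])
        for k in clusters:
            if k == labels[i]:
                continue
            trial = labels.copy()
            trial[i] = k
            val = criterion(scatter_matrices(X, trial)[0])
            if val < best_val:
                best_k, best_val = k, val
        labels[i] = best_k
    return labels

# Toy data (made up): two groups of four points, with two points
# deliberately placed in the wrong cluster of the initial partition.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [8, 8], [8, 9], [9, 8], [9, 9]], dtype=float)
initial = [0, 0, 0, 1, 1, 1, 1, 0]

W, B, T = scatter_matrices(X, initial)
new_labels = reassign_pass(X, initial, np.trace)   # minimize trace(W)
W_new = scatter_matrices(X, new_labels)[0]
print("trace(W) before:", np.trace(W), "after:", np.trace(W_new))
```

Passing np.linalg.det instead of np.trace as the criterion switches the sweep to minimization of the determinant of W. In practice the sweep would be repeated until no observation changes cluster.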

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd