K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering the properties of the within-group, between-group and total matrices of sums of squares and cross products (W, B and T respectively), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria arising from these matrices are as follows:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape need not be spherical.

 




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd