K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Mortality odds ratio, Mortality odds ratio  is the ratio equivalent to the ...

Mortality odds ratio  is the ratio equivalent to the odds ratio used in case-control studies where the equivalent of the cases are deaths from the cause of interest and the equival

Describe monty hall problem, Monty Hall problem : A apparently counter-intu...

Monty Hall problem : A apparently counter-intuitive problem in the probability which gets its name from the TV game show, 'Let's Make a Deal' hosted by the Monty Hall. On show a pa

Conjoint analysis, Conjoint analysis : The method used basically in market ...

Conjoint analysis : The method used basically in market research which is similar in many respects to the various dimensional scaling. The method attempts to assign values to the l

Parks test, The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedastici...

The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedasticity errors and no heteroscedasticity exists The Alternative Hypothesis - H1: β 1 ≠ 0 i.e. there is no homoscedasti

Correlation matrix, Correlation matrix : A square, symmetric matrix with th...

Correlation matrix : A square, symmetric matrix with the rows and columns corresponding to the variables, in which the non diagonal elements are correlations between the pairs of t

Describe prior distribution, Prior distributions : The probability distribu...

Prior distributions : The probability distributions which summarize the information about a random variable or parameter known or supposed at a given time instant, prior to attaini

Outliers - reasons for screening data, Outliers - Reasons for Screening Dat...

Outliers - Reasons for Screening Data Outliers are due to data entry errors, subject is not a member of the population that the sample is trying to represent, or the subject i

Chi-squared distribution, Chi-squared distribution : It is the probability ...

Chi-squared distribution : It is the probability distribution, f (x), of the random variable de?ned as the sum of squares of the number (v) of independent standard normal variables

Generalized estimating equations (gee), Technically the multivariate analog...

Technically the multivariate analogue of the quasi-likelihood with the same feature that it leads to consistent inferences about the mean responses without needing specific supposi

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd