K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Latin square, Latin square  is an experimental design targeted at removing ...

Latin square  is an experimental design targeted at removing from the experimental error the variation from two extraneous sources so that a more sensitive test of the treatment ef

Negative binomial distribution, Negative binomial distribution is the prob...

Negative binomial distribution is the probability distribution of number of failures, X, before the kth success in the sequence of Bernoulli trials where the probability of succes

Multivariate analysis of variance, Multivariate analysis of variance is th...

Multivariate analysis of variance is the procedure for testing equality of the mean vectors of more than two populations for the multivariate response variable. The method is dire

#title.Statistics for management, The growth in bad debt expense for Johnst...

The growth in bad debt expense for Johnston office supply Company over this time period.If this rate continues,estimate the percentage increase in bad debts for 1997,relative to 19

Degenerate distributions, The special cases of the probability distribution...

The special cases of the probability distributions in which the random variable's distribution is concentrated at one point only. For instance, a discrete uniform distribution when

Fisher''s transformation, The transformation of the Pearson's product momen...

The transformation of the Pearson's product moment correlation coefficient, r, can be given by   The statistic z has the normal distribution with mean   here ρ is the pop

Network sampling, Network sampling is a sampling design in which the simpl...

Network sampling is a sampling design in which the simple random sample or strati?ed sample of the sampling units is made and all observational units which are linked to any of th

L''abbe ´ plot, L'Abbe ´ plot is often used in the meta-analysis of the cl...

L'Abbe ´ plot is often used in the meta-analysis of the clinical trials where the result is the binary response of it. The event risk (number of events/number of the patients in a

Describe multiple imputation, Multiple imputation : The Monte Carlo techniq...

Multiple imputation : The Monte Carlo technique in which missing values in the data set are replaced by m> 1 simulated versions, where m is usually small (say 3-10). Each of simula

Cellular proliferation models, Cellular proliferation models : Models are u...

Cellular proliferation models : Models are used to describe the growth of the  cell populations. One of the example is the deterministic model   where N(t) is the number of cel

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd