K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Assignment, Different approaches to the study of early indian history

Different approaches to the study of early indian history

Describe indirect least squares, Indirect least squares: An estimation tech...

Indirect least squares: An estimation technique used in the fitting of structural equation models. Commonly least squares are first used to estimate reduced form parameters. Usi

Conjugate prior, Conjugate prior : The distribution for samples from the pa...

Conjugate prior : The distribution for samples from the particular probability distribution such that the posterior distribution at each stage of the sampling is of the identical f

Correlated failure times, Data which occur when failure period is recorded ...

Data which occur when failure period is recorded which are dependent. Such type of data can arise in number contexts, for instance, in epidemiological cohort studies in which th

Multi co linearity, Multi co linearity is the term used in the regression ...

Multi co linearity is the term used in the regression analysis to indicate situations where the explanatory variables are related by a linear function, making the inference of the

Exponential order statistics model, The model which arises in the context o...

The model which arises in the context of estimating the size of the closed population where individuals within the population could be identified only during some of the observatio

What is the expectation of the number of tosses required, Question 1 A box...

Question 1 A box contains 20 fuses of which 5 are defective If 2 fuses are chosen together at random what is the probability that both the fuses are defective? Question 2 A c

Profile plots, Profile plots  is a technique of representing the multivaria...

Profile plots  is a technique of representing the multivariate data graphically. Each of the observation is represented by a diagram comprising of a sequence of equispaced vertical

Non-randomized clinical trial, Non-randomized clinical trial is the clinic...

Non-randomized clinical trial is the clinical trial in which the series of consecutive patients receive a new treatment and those which respond (according to some of the pre-defin

Game theory, This is the branch of mathematics which deals with the theory ...

This is the branch of mathematics which deals with the theory of contests between two or more players under the specified sets of rules. The subject supposes a statistical aspect w

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd