K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Factor scores, The values assigned to factors for the individual sample uni...

The values assigned to factors for the individual sample units in a factor analysis. The most common approach is "regression method". When the factors are seen as the random variab

Relative poverty statistics, Relative poverty statistics is the statistics...

Relative poverty statistics is the statistics on the properties of populations falling below given fractions of average income which play a central role in debate of poverty. The

Chebyshev''s inequality, Chebyshev's inequality: A statement about the pro...

Chebyshev's inequality: A statement about the proportion of the observations which fall within some number of the standard deviations of the mean for any of the probability distri

Data mining, The non-trivial extraction of implicit, earlier unknown and po...

The non-trivial extraction of implicit, earlier unknown and potentially useful information from data, specifically high-dimensional data, using pattern recognition, artificial inte

Baddeley''smetric, Baddeley'smetric : A manner of measuring the 'error' in ...

Baddeley'smetric : A manner of measuring the 'error' in the image processing technique or method. The metric is derived using the fundamental theory from the stochastic geometry an

Centile reference charts, Centile reference charts : Charts which are used ...

Centile reference charts : Charts which are used inmedicine to observe the clinical measurements on individual patients in the context of the population values. If the population i

Graph theory, Why Graph theory? It is the branch of mathematics concerned w...

Why Graph theory? It is the branch of mathematics concerned with the properties of sets of points (vertices or nodes) some of which are connected by the lines known as the edges. A

Window estimates, Window estimates is a term which occurs in the context o...

Window estimates is a term which occurs in the context of the both frequency domain and time domain estimation for the time series. In the previous it generally applies to weights

Canonical correlation analysis, Canonical correlation analysis : A process ...

Canonical correlation analysis : A process of analysis for investigating the relationship between the two groups of variables, by ?nding the linear functions of one of the sets of

Blinding, Blinding : A procedure used in clinical trials to get rid of the ...

Blinding : A procedure used in clinical trials to get rid of the possible bias which might be introduced if the patient and/or the doctor knew which treatment the patient is receiv

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd