K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Frequency polygon, It is the diagram used to display the values graphically...

It is the diagram used to display the values graphically in a frequency distribution. The frequencies are graphed as an ordinate against the class mid-points as abscissae. The p

Weighted least squares, Weighted least squares  is the method of estimation...

Weighted least squares  is the method of estimation in which the estimates arise from minimizing the weighted sum of squares of the differences between response variable and its pr

Random success probability, a psychic claims to be able to "feel colors" th...

a psychic claims to be able to "feel colors" there are three pieces of colored paper(red, blue,green) he will place his hand on radomly selected pieces while blindfolded. you perfo

Matching distribution, Matching distribution is  a probability distributi...

Matching distribution is  a probability distribution which arises in the following manner. Suppose that the set of n subjects, numbered 1; . . . ; n respectively, are arranged in

Parks test, The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedastici...

The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedasticity errors and no heteroscedasticity exists The Alternative Hypothesis - H1: β 1 ≠ 0 i.e. there is no homoscedasti

Double sampling, The procedure in which initially the sample of subjects is...

The procedure in which initially the sample of subjects is selected for generating the auxillary information only, and then the second sample is selected in which the variable of i

Histogram, Histogram is the graphical representation of the set of observat...

Histogram is the graphical representation of the set of observations in which class frequencies are represented by the regions of rectangles centred on the class interval. If the f

Spreading function and scattering function, 1)  Consider an antenna with a ...

1)  Consider an antenna with a pattern: G(θ,φ) = sinn(θ/θ0) cos(θ/θ0)   where θ0 = Π/1.5 (a) What is the 3-dB bandwidth? (b) What is the 10-dB beam width? (c) What is t

Complier average causal effect (cace), Complier average causal effect (CACE...

Complier average causal effect (CACE): The treatment effect amid true compliers in the clinical trial. For the suitable response variable, the CACE is given by the difference in o

Linearity - reasons for screening data, Linearity - Reasons for Screening D...

Linearity - Reasons for Screening Data Many of the technics of standard statistical analysis are based on the assumption that the relationship, if any, between variables is li

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd