K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Analysis of variance, Thomas Economic Forecasting, Inc. and Harmon Economet...

Thomas Economic Forecasting, Inc. and Harmon Econometrics have the same mean error in forecasting the stock market over the last ten years. However, the standard deviation for Thom

Buffon''s needle problem, Buffon's needle problem : A problem proposed and ...

Buffon's needle problem : A problem proposed and solved by the scientist Comte de Buffon in 1777 which includes determining the probability, p, which a needle of length l will inte

Regression, calculate the mean yearly value using the average unemployment ...

calculate the mean yearly value using the average unemployment rate by month

Component bar chart, Component bar chart : A bar chart which shows the comp...

Component bar chart : A bar chart which shows the component parts of the aggregate represented by the whole length of the bar. The component parts are shown as the sectors of bar w

Ecological fallacy, The term used when the aggregated data (for instance, a...

The term used when the aggregated data (for instance, aggregated over different areas) are analysed and the results supposed to apply to the relationships at the individual level.

Censored observations, Censored observations : An observation xi on some va...

Censored observations : An observation xi on some variable of interest is consired to be censored if it is known that xi Li (left-censored)or xi Ui (right-censored) where Li and Ui

Chapter 7&8, Chapter 7 2. Describe the distribution of sample means (shape...

Chapter 7 2. Describe the distribution of sample means (shape, expected value, and standard error) for samples of n =36 selected from a population with a mean of µ = 100 and a sta

Identifying the necessary and sufficient conditions, You have probably noti...

You have probably noticed by now that some of the statements of necessary and sufficient conditions sound more natural than others. For example it seems more natural to express "We

Two-phase sampling, Two-phase sampling is the sampling scheme including tw...

Two-phase sampling is the sampling scheme including two distinct phases, in the first of which the information about the particular variables of interest is collected on all the m

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd