K-means cluster analysis, Advanced Statistics

K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation in turn is examined and reassigned, if appropriate, to a different cluster in an attempt to optimize a predefined numerical criterion that measures the 'quality' of the cluster solution. Several such clustering criteria have been suggested, but the most commonly used arise from considering properties of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B and T, with T = W + B), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria derived from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape is not necessarily spherical.
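The matrices and the two criteria can be illustrated with a small sketch. It is only an illustration under stated assumptions, not a reference implementation: the function names `scatter_matrices` and `reassign_pass` and the toy data are hypothetical, and the reassignment sweep recomputes W from scratch for every candidate move, which is simple to read but far slower than the update formulas a practical program would use.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return the within-groups (W), between-groups (B) and total (T) matrices."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centered = X - grand_mean
    T = centered.T @ centered
    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for k in np.unique(labels):
        members = X[labels == k]
        mean_k = members.mean(axis=0)
        dev = members - mean_k
        W += dev.T @ dev                       # spread around the cluster mean
        shift = (mean_k - grand_mean)[:, None]
        B += len(members) * (shift @ shift.T)  # spread of cluster means
    return W, B, T

def reassign_pass(X, labels, criterion):
    """One sweep over the observations: move each one to the cluster
    that gives the smallest value of criterion(W), if that is an improvement."""
    X = np.asarray(X, dtype=float)
    labels = np.array(labels)
    clusters = np.unique(labels)
    for i in range(len(X)):
        current = labels[i]
        if np.sum(labels == current) == 1:
            continue  # assumption: never empty a cluster
        best_k = current
        best_val = criterion(scatter_matrices(X, labels)[0])
        for k in clusters:
            if k == current:
                continue
            labels[i] = k  # try the move
            val = criterion(scatter_matrices(X, labels)[0])
            if val < best_val:
                best_k, best_val = k, val
        labels[i] = best_k  # keep the best assignment found
    return labels

# Toy data: two tight groups, with the third point placed in the wrong cluster.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
labels = np.array([0, 0, 1, 1, 1, 1])

new_labels = reassign_pass(X, labels, criterion=np.trace)
W, B, T = scatter_matrices(X, new_labels)

print("labels after one sweep:", new_labels)
print("trace W:", np.trace(W))
print("det W:", np.linalg.det(W))
print("T equals W + B:", np.allclose(T, W + B))
```

Passing `np.linalg.det` instead of `np.trace` as `criterion` switches the sweep to the determinant criterion. In practice the sweep would be repeated until no observation changes cluster.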

 



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd