K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Daycare, facts and statistics about daycare

facts and statistics about daycare

Back-projection, Back-projection: A term most often applied to the procedu...

Back-projection: A term most often applied to the procedure for reconstructing plausible HIV incidence curves from the AIDS incidence data. The method or technique assumes that th

Confidence interval estimation, An auditor for a government agency needs to...

An auditor for a government agency needs to evaluate payments for doctors' office visits paid by Medicare in a small regional town during the month of June. A total of 25,056 visit

Extreme values, The biggest and smallest variate values among the sample of...

The biggest and smallest variate values among the sample of observations. Significant in various regions, for instance flood levels of the river, speed of wind and snowfall.

Forest plot, A name sometimes given to the type of diagram generally used i...

A name sometimes given to the type of diagram generally used in meta-analysis, in which point estimates and confidence intervals are displayed for all the studies included in the a

Explain remedian, Remedian: The robust estimator of location which is comp...

Remedian: The robust estimator of location which is computed by an iterative process. By assuming that the sample size n can be written as bk where b and k are the integers, the s

Statistics HW, we are testing : Ho: µ=40 versus Ha: µ>40 (a= 0.01) Suppose...

we are testing : Ho: µ=40 versus Ha: µ>40 (a= 0.01) Suppose that the test statistic is z0=2.75 based on a sample size of n=25. Assume that data are normal with mean mu and standa

Partial autocorrelation function, The graph for Partial Autocorrelation Fun...

The graph for Partial Autocorrelation Function for RES1 shows that there is no autocorrelation even though there are alternating spikes because they fall inside the 5% significance

Bioinformatics, Bioinformatics : Essentially the application of the informa...

Bioinformatics : Essentially the application of the information theory to biology to deal with the deluge of the information resulting from the advances in molecular biology. The m

frequentist inference, The approach to statistics based on a frequency vie...

The approach to statistics based on a frequency view of probability in which it is supposed that it is possible to consider an in?nite sequence of the independent repetitions of th

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd