K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Explain interim analyses, Interim analyses : An analysis made before the pl...

Interim analyses : An analysis made before the planned end of a clinical trial, typically with the aim of detecting the treatment differences at the early stage and thus preventing

Define lagging indicators, Lagging indicators: The part of a collection of...

Lagging indicators: The part of a collection of the economic time series designed to give information about the broad swings in measures of the aggregate economic activity known a

Dot plot, The more effective display than a number of other methods or tech...

The more effective display than a number of other methods or techniques, for instance, pie charts and bar charts, for displaying the quantitative data which are labeled. An instanc

Epidemic, The rapid development or growth of the disease in a community or ...

The rapid development or growth of the disease in a community or region. Statistical thinking has made very much significant contributions to the understanding of such type of phen

Develop the equations to calculate the flow rates, A two-step distillation ...

A two-step distillation and mixing process is shown in the figure. The system operates at steady-state conditions and there are no chemical reactions. The known flow rates and comp

Quasi-experiment, Quasi-experiment is a term taken in use for studies whic...

Quasi-experiment is a term taken in use for studies which resemble experiments but are weak on some of the characteristics, particularly that allocation of the subjects to groups

Generalized linear models, Introduction to Generalized Linear Models (GLM) ...

Introduction to Generalized Linear Models (GLM) We introduce the notion of GLM as an extension of the traditional normal-theory-based linear regression models. This will be very

Decision Models., An oil company thinks that there is a 60% chance that the...

An oil company thinks that there is a 60% chance that there is oil in the land they own. Before drilling they run a soil test. When there is oil in the ground, the soil test comes

What is statistical inference, What is statistical inference?   Statis...

What is statistical inference?   Statistical inference can be defined as the  method of drawing conclusions from data which are subject to random variations. This is based o

Explain lancaster models., Lancaster models : The means of representing the...

Lancaster models : The means of representing the joint distribution of the set of variables in terms of the marginal distributions, supposing all the interactions higher than a par

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd