K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Fuzzy set theory, A radically different approach of dealing with the uncert...

A radically different approach of dealing with the uncertainty than the traditional probabilistic and the statistical methods. The necessary feature of the fuzzy set is a membershi

Catastrophe theory, Catastrophe theory : A theory of how little is the cont...

Catastrophe theory : A theory of how little is the continuous changes in the independent variables which can have unexpected, discontinuous effects on the dependent variables. Exam

Quantitative Methods, After graduating from Tech Julia was unable to find r...

After graduating from Tech Julia was unable to find regular employment and approached the Director of Athletics at Tech to request that she remain a vendor of the following year.

Statistcal computing flow charts for sums, 1. define statistical algorithms...

1. define statistical algorithms 2. write the flow charts for statistical algorithms for sums, squares and products. 3. write flow charts for statistical algorithms to generates ra

Define interval-censored observations, Interval-censored observations ar...

Interval-censored observations are the  observations which often occur in the context of studies of time elapsed to the particular event when subjects are not monitored regularl

Naor''s distribution, Naor's distribution is the discrete probability dist...

Naor's distribution is the discrete probability distribution which arises from the following model; Assume an urn contains n balls of which one is red and the remainder is whit

Point scoring, Point scoring is an easy distribution free method which can...

Point scoring is an easy distribution free method which can be used for the prediction of a response which is a binary variable from the observations on several explanatory variab

Hypotheses, a company suppliers specialized, high tensile Pins to customers...

a company suppliers specialized, high tensile Pins to customers. It uses an automatic lathe to produce the pins. Due to the factors such as vibration, temperature and wear and tear

Sequencing of 4 machines, how to resolve sequencing problem if jobs 6 given...

how to resolve sequencing problem if jobs 6 given and 4 machines given. how to apply johnson rule for making to machines under this conditions. please give solution as soon as poss

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd