K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Morbidity, Morbidity is the term used in the epidemiological studies to de...

Morbidity is the term used in the epidemiological studies to describe sickness in the human populations. The WHO Expert Committee on the Health Statistics noted in its sixth repor

Homoscedasticity - reasons for screening data, Homoscedasticity - Reasons f...

Homoscedasticity - Reasons for Screening Data Homoscedasticity is the assumption that the variability in scores for a continuous variable is roughly the same at all values of

Regression, regression line drawn as Y=C+1075x, when x was 2, and y was 239...

regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual

Tree, Tree is the term from the branch of the mathematics which known as t...

Tree is the term from the branch of the mathematics which known as the graph theory, used to describe any set of the straight-line segments joining the pairs of points in some pro

Excel, Software which started out as the spreadsheet targeting at manipulat...

Software which started out as the spreadsheet targeting at manipulating the tables of number for financial analysis, which has now developed into a more flexible package for workin

Higher criticism, Higher criticism is a multiple-comparison test concept a...

Higher criticism is a multiple-comparison test concept arising from the situation where there are number of independent tests of significance and interest lies in the rejecting jo

Cure models, Models for the analysis of the survival times, or the time to ...

Models for the analysis of the survival times, or the time to event, data in which it is expected that a fraction of the subjects will not experience the event of interest. In a cl

Explain historical controls, Historical controls : The group of patients tr...

Historical controls : The group of patients treated in the past with the standard therapy, taken in use as the control group for evaluating the new treatment on the present patient

Statistically modeling, A comprehensive regression analysis of the case stu...

A comprehensive regression analysis of the case study London has been carried out to test the 4 assumptions of regression: 1. Variables are normally distributed 2. Linear rel

Correlated failure times, Data which occur when failure period is recorded ...

Data which occur when failure period is recorded which are dependent. Such type of data can arise in number contexts, for instance, in epidemiological cohort studies in which th

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd