K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Explain median absolute deviation (mad), Median absolute deviation (MAD) : ...

Median absolute deviation (MAD) : It is the very robust estimator of the scale given by the following equation   or, in other words we can say that, the median of the absolute

Non central distributions, Non central distributions is the series of prob...

Non central distributions is the series of probability distributions each of which is the adaptation of one of the standard sampling distributions like the chi-squared distributio

Tree, Tree is the term from the branch of the mathematics which known as t...

Tree is the term from the branch of the mathematics which known as the graph theory, used to describe any set of the straight-line segments joining the pairs of points in some pro

Bootstrap, Bootstrap : The data-based simulation method/technique for the s...

Bootstrap : The data-based simulation method/technique for the statistical inference which can be used to study the variability of the estimated characteristics of the probability

Define matching coefficient, Matching coefficient is a similarity coeffici...

Matching coefficient is a similarity coefficient for data consisting of the number of binary variables which is often used in cluster analysis. It can be given as follows    he

Assignment, Different approaches to the study of early indian history

Different approaches to the study of early indian history

Chains of infection, Chains of infection : The description of the course of...

Chains of infection : The description of the course of infection among the group of individuals. The susceptibles infected by the direct contact with the introductory cases are sai

Hypotheses, a company suppliers specialized, high tensile Pins to customers...

a company suppliers specialized, high tensile Pins to customers. It uses an automatic lathe to produce the pins. Due to the factors such as vibration, temperature and wear and tear

Explain Grade of membership model, Grade of membership model: This is the ...

Grade of membership model: This is the general distribution free method for the clustering of the multivariate data in which only categorical variables are included. The model ass

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd