K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Explain remedian, Remedian: The robust estimator of location which is comp...

Remedian: The robust estimator of location which is computed by an iterative process. By assuming that the sample size n can be written as bk where b and k are the integers, the s

Greenhouse geissercorrection, Greenhouse geissercorrection is the method o...

Greenhouse geissercorrection is the method of adjusting the degrees of freedom of the within- subject F-tests in the analysis of the variance of longitudinal data so as to allow t

Experimental design, i have an assignment for experimental design which is ...

i have an assignment for experimental design which is must done by SAS program can you help me also i need to hand in the assignment till thursday shall i send it for you ?

Chains of infection, Chains of infection : The description of the course of...

Chains of infection : The description of the course of infection among the group of individuals. The susceptibles infected by the direct contact with the introductory cases are sai

QUANTITATIVE METHOD., an oil company is considering whether or not to bid f...

an oil company is considering whether or not to bid for an offshore drilling contract. The bid would cost $60 with a 65% chance of gaining the contract. Outcome success Probability

Fractional factorial design, Designs in which the information on main effec...

Designs in which the information on main effects and low-order inter- actions are attained by running only the fraction of the complete factorial experiment and supposing that part

Poisson regression, Poisson regression In case of Poisson regression w...

Poisson regression In case of Poisson regression we use ηi = g(µi) = log(µi) and a variance V ar(Yi) = φµi. The case φ = 1 corresponds to standard Poisson model. Poisson regre

Range, Range is the difference between the largest and smallest observatio...

Range is the difference between the largest and smallest observations in the data set. Commonly used as an easy-to-calculate measure of the dispersion in the set of observations b

Probability., 5. Packages from a machine a normally distributed with a mean...

5. Packages from a machine a normally distributed with a mean 200g and its standard deviation 2grams. Find the probability that a package from the machine weighs a) Less than

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd