K-means cluster analysis, Advanced Statistics

Assignment Help:

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 


Related Discussions:- K-means cluster analysis

Chains of infection, Chains of infection : The description of the course of...

Chains of infection : The description of the course of infection among the group of individuals. The susceptibles infected by the direct contact with the introductory cases are sai

Asymmetric proximity matrices, Asymmetric proximity matrices : Proximity ma...

Asymmetric proximity matrices : Proximity matrices in which the non-diagonal elements, in the ith row and jth column and the jth row and ith column, are not essentially equal. Exam

Multilevel models, Multilevel models are the regression models for the mul...

Multilevel models are the regression models for the multilevel or clustered data where units i are nested in the clusters j, for example a cross-sectional study where students are

Explain initial data analysis (ida), Initial data analysis (IDA): The firs...

Initial data analysis (IDA): The first phase in the examination of the data set which comprises  number of informal steps including the following steps * checking the quality o

Homework and Assignment assistance for RES610 Course, Interested in 10 hour...

Interested in 10 hour program with twice a week tutoring for 1 hour each. Need tutor to assist with answering the assignment questions for the next 5 weeks.

Weighted least squares, Weighted least squares  is the method of estimation...

Weighted least squares  is the method of estimation in which the estimates arise from minimizing the weighted sum of squares of the differences between response variable and its pr

Estimation, The process of providing the numerical value for the population...

The process of providing the numerical value for the population parameter on the basis of information gathered from a sample. If a single ?gure is computed for the unknown paramete

Analysis of variance, Thomas Economic Forecasting, Inc. and Harmon Economet...

Thomas Economic Forecasting, Inc. and Harmon Econometrics have the same mean error in forecasting the stock market over the last ten years. However, the standard deviation for Thom

Balanced incomplete block design, Balanced incomplete block design : A desi...

Balanced incomplete block design : A design in which all the treatments are not used in all blocks. Such designs have the below stated properties: * each block comprises the

Estimating functions, The functions of the data and the parameters of inter...

The functions of the data and the parameters of interest which can be brought in use to conduct inference about the parameters when full distribution of the observations is unknown

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd