Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Explain kleiner hartigan trees, Kleiner Hartigan trees is a technique for ...

Kleiner Hartigan trees is a technique for displaying the multivariate data graphically as the 'trees' in which the values of the variables are coded into length of the terminal br

Data squashing, An approach to decrease the size of very large data sets in...

An approach to decrease the size of very large data sets in which the data are first 'binned' and then statistics such as the mean and variance/covariance are calculated on each bi

Multivariate data, Multivariate data is the data for which each observatio...

Multivariate data is the data for which each observation consists of the values for more than one random variable. For instance, measurements on the blood pressure, temperature an

Line-intersect sampling, Line-intersect sampling is a technique of unequal...

Line-intersect sampling is a technique of unequal probability sampling for selecting the sampling units in the geographical area. A sample of lines is drawn in a study area and, w

General location model, The model for data containing continuous and catego...

The model for data containing continuous and categorical variables both.The categorical data are summarized by the contingency table and their marginal distribution, 182by the mult

Tests for heteroscedasticity, The Null Hypothesis - H0: There is no heteros...

The Null Hypothesis - H0: There is no heteroscedasticity i.e. β 1 = 0 The Alternative Hypothesis - H1:  There is heteroscedasticity i.e. β 1 0 Reject H0 if nR2 > MTB >

January 2015 Take-Home Assignment, 3. a. A researcher in Hong Kong computes...

3. a. A researcher in Hong Kong computes the correlation between the percentage of employee turnover and the local unemployment rate (also expressed as a percentage) over a 20-mont

Efficiency, This term applied in the context of comparing the different met...

This term applied in the context of comparing the different methods and techniques of estimating the same parameter; the estimate with the lowest variance being regarded as the mos

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd