Error rate estimation, Advanced Statistics

Error rate estimation is the estimation of the misclassification rate in discriminant analysis. A number of techniques have been proposed for the two-group situation, but the multiple-group situation has rarely been addressed. The simplest procedure is the resubstitution method, in which the training data are classified using the estimated classification rule and the proportion incorrectly placed is used as the estimate of the misclassification rate. This method is known to have a large optimistic bias, because the same observations are used both to build the rule and to test it, but it has the advantage that it can be applied to multigroup problems with no modification. An alternative is the leave-one-out estimator, in which each observation in turn is removed from the data, the classification rule is recomputed using the remaining data, and the removed observation is then classified with that rule. The proportion misclassified by this procedure has reduced bias compared with the resubstitution method. This method can also be applied to the multigroup problem with no modification, but it has a large variance.
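The two estimators described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the classification rule used here is a nearest-centroid rule (assign each observation to the group whose mean is closest), chosen for brevity and not taken from the text, and the function names and toy data are hypothetical.

```python
from collections import defaultdict
from math import dist


def fit_centroids(X, y):
    """Assumed classification rule: store one mean vector per group."""
    groups = defaultdict(list)
    for row, label in zip(X, y):
        groups[label].append(row)
    return {
        g: tuple(sum(col) / len(rows) for col in zip(*rows))
        for g, rows in groups.items()
    }


def classify(row, centroids):
    """Assign the observation to the group with the nearest centroid."""
    return min(centroids, key=lambda g: dist(row, centroids[g]))


def resubstitution_error(X, y):
    """Fit the rule on all data, then classify the same data."""
    centroids = fit_centroids(X, y)
    wrong = sum(classify(row, centroids) != label for row, label in zip(X, y))
    return wrong / len(y)


def leave_one_out_error(X, y):
    """Refit the rule n times, each time classifying the held-out observation."""
    wrong = 0
    for i in range(len(y)):
        X_train = X[:i] + X[i + 1:]
        y_train = y[:i] + y[i + 1:]
        centroids = fit_centroids(X_train, y_train)
        wrong += classify(X[i], centroids) != y[i]
    return wrong / len(y)


# Hypothetical toy data: one feature, two groups.
X = [(0.0,), (1.0,), (4.0,), (5.0,), (6.0,), (10.0,)]
y = ["a", "a", "a", "b", "b", "b"]

print("resubstitution:", resubstitution_error(X, y))
print("leave-one-out: ", leave_one_out_error(X, y))
```

On this toy data the resubstitution estimate is 0, while the leave-one-out estimate is 1/6: the observation at 4 is classified correctly only when it helps to define its own group mean. Neither function depends on the number of groups, which matches the remark that both methods extend to the multigroup problem without modification.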




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd