Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Odds ratio, Odds ratio is the ratio of the odds for the binary variable in...

Odds ratio is the ratio of the odds for the binary variable in two groups of the subjects, such as, males and females. If the two possible states of variable are labeled as 'succe

Falsediscoveryrate (fdr), The approach of controlling the error rate in an ...

The approach of controlling the error rate in an exploratory analysis where number of hypotheses are tested, but where the strict control which is provided by multiple comparison p

Ordination, Ordination is the procedure of reducing the dimensionality (th...

Ordination is the procedure of reducing the dimensionality (that is the number of variables) of multivariate data by deriving the small number of new variables which contain much

Bivariate boxplot, Bivariate boxplot : A bivariate analogue of boxplot in w...

Bivariate boxplot : A bivariate analogue of boxplot in which the inner area contains 50%of the data, and a 'fence' helps to identify the potential outliers. Robust methods or techn

Individual differences, Individual differences scaling is a form of multid...

Individual differences scaling is a form of multidimensional scaling applicable to the data comprising of a number of proximity matrices from the different sources that is differe

Petersen''s factor theorem, Suppose the graph G is n-connected, regular of ...

Suppose the graph G is n-connected, regular of degree n, and has an even number of vertices. Prove that G has a one-factor. Petersen's 2-factor theorem (Theorem 5.40 in the note

Banach''s match-box problem, Banach's match-box problem : The person carrie...

Banach's match-box problem : The person carries two boxes of matches, one in his left and one in his right pocket. At first they comprise N number of matches each. When the person

Define recurrence risk, Recurrence risk : Usually the probability that an i...

Recurrence risk : Usually the probability that an individual experiences an event of interest given previous experience(s) of the event; for example, the probability of recurrence

Last observation carried forward, Last observation carried forward is a te...

Last observation carried forward is a technique for replacing the observations of the patients who drop out of the clinical trial carried out over a time period. It consists of su

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd