Error rate estimation, Advanced Statistics

Error rate estimation is the term used for estimating the misclassification rate in discriminant analysis. A number of techniques have been proposed for the two-group situation, but the multiple-group situation has rarely been addressed. The simplest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and the proportion incorrectly placed is used as the estimate of the misclassification rate. This technique is known to have a large optimistic bias, because the same observations are used both to build the rule and to assess it, but it has the benefit that it can be applied to multigroup problems with no modification. An alternative is the leave-one-out estimator, in which each observation in turn is removed from the data, the classification rule is recomputed from the remaining data, and the removed observation is then classified with that rule. The proportion misclassified by this procedure has reduced bias compared with the resubstitution technique. This method can also be applied to the multigroup problem with no modification, but it has a large variance.
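The two estimators described above can be compared with a minimal sketch in Python. Several things here are assumptions made only for illustration: a nearest-group-mean rule stands in for the discriminant rule (the text does not specify one), the three-group data are simulated, and the function names `fit_centroids`, `predict`, `resubstitution_error`, `leave_one_out_error` and `make_data` are hypothetical.

```python
import random


def fit_centroids(X, y):
    """Estimate the classification rule: one mean vector per group.

    Assumption: a nearest-mean rule is used as a simple stand-in
    for the discriminant rule.
    """
    sums, counts = {}, {}
    for xi, yi in zip(X, y):
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for j, v in enumerate(xi):
            acc[j] += v
        counts[yi] = counts.get(yi, 0) + 1
    return {g: [s / counts[g] for s in sums[g]] for g in sums}


def predict(centroids, x):
    """Assign x to the group whose mean is nearest (squared Euclidean distance)."""
    return min(
        centroids,
        key=lambda g: sum((a - b) ** 2 for a, b in zip(x, centroids[g])),
    )


def resubstitution_error(X, y):
    """Classify the training data with the rule fitted on those same data."""
    centroids = fit_centroids(X, y)
    wrong = sum(predict(centroids, xi) != yi for xi, yi in zip(X, y))
    return wrong / len(X)


def leave_one_out_error(X, y):
    """Remove each observation in turn, refit the rule, classify the removed one."""
    wrong = 0
    for i in range(len(X)):
        X_train = X[:i] + X[i + 1:]
        y_train = y[:i] + y[i + 1:]
        centroids = fit_centroids(X_train, y_train)
        wrong += predict(centroids, X[i]) != y[i]
    return wrong / len(X)


def make_data(seed=0, n_per_group=15):
    """Simulate three overlapping two-dimensional groups (illustrative data only)."""
    rng = random.Random(seed)
    means = {0: (0.0, 0.0), 1: (1.5, 0.0), 2: (0.0, 1.5)}
    X, y = [], []
    for g, (mx, my) in means.items():
        for _ in range(n_per_group):
            X.append([rng.gauss(mx, 1.0), rng.gauss(my, 1.0)])
            y.append(g)
    return X, y


X, y = make_data()
print("resubstitution estimate:", resubstitution_error(X, y))
print("leave-one-out estimate: ", leave_one_out_error(X, y))
```

Both functions accept any number of group labels, which reflects the point that neither estimator needs modification for the multigroup problem. For this particular nearest-mean rule the leave-one-out estimate is never smaller than the resubstitution estimate, since removing an observation moves its own group mean away from it and leaves the other group means unchanged.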


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd