Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Anova, a. Explain the meaning of the word non-orthogonal. b. What conditio...

a. Explain the meaning of the word non-orthogonal. b. What condition(s) must exist for non-orthogonality to occur? Be specific.

Calibration, Calibration : A procedure which enables a series of simply obt...

Calibration : A procedure which enables a series of simply obtainable but inaccurate measurements of some quantity of interest to be used to provide more precise estimates of the r

Reliability theory, Reliability theory is the theory which attempts to det...

Reliability theory is the theory which attempts to determine the reliability of the complex system from knowledge of the reliabilities of the components. Interest might centre on

Data mining, The non-trivial extraction of implicit, earlier unknown and po...

The non-trivial extraction of implicit, earlier unknown and potentially useful information from data, specifically high-dimensional data, using pattern recognition, artificial inte

Balanced incomplete repeated measures design (birmd), Balanced incomplete r...

Balanced incomplete repeated measures design (BIRMD): An arrangement of the N randomly selected experimental units and k treatments in which each and every unit receives k1 treatm

Variance inflation factor, VIF is the abbreviation of variance inflation fa...

VIF is the abbreviation of variance inflation factor which is a measure of the amount of multicollinearity that exists in a set of multiple regression variables. *The VIF value

Bioinformatics, Bioinformatics : Essentially the application of the informa...

Bioinformatics : Essentially the application of the information theory to biology to deal with the deluge of the information resulting from the advances in molecular biology. The m

Disclosure risk, The risk of being able to recognize the respondent's confi...

The risk of being able to recognize the respondent's confidential information in the data set. Number of approaches has been proposed to measure the disclosure risk some of which c

Define matching coefficient, Matching coefficient is a similarity coeffici...

Matching coefficient is a similarity coefficient for data consisting of the number of binary variables which is often used in cluster analysis. It can be given as follows    he

Network sampling, Network sampling is a sampling design in which the simpl...

Network sampling is a sampling design in which the simple random sample or strati?ed sample of the sampling units is made and all observational units which are linked to any of th

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd