Error rate estimation, Advanced Statistics


Error rate estimation is the term used for estimating the misclassification rate in discriminant analysis. A number of techniques have been proposed for the two-group situation, but the multiple-group situation has rarely been addressed. The simplest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and the proportion incorrectly placed is used as the estimate of the misclassification rate. This technique is known to have a large optimistic bias, because the same observations are used both to build the rule and to assess it, but it has the benefit that it can be applied to multigroup problems with no modification. An alternative is the leave-one-out estimator, in which each observation in turn is removed from the data, the classification rule is recomputed from the remaining data, and the removed observation is then classified. The proportion misclassified by this procedure has reduced bias compared with the resubstitution technique. This method can also be applied to the multigroup problem with no modification, but it has a large variance.
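The two estimators can be compared on a small example. The sketch below is only illustrative: it assumes a nearest-group-mean rule as a stand-in for the discriminant rule, and the function names (`fit_means`, `predict`, `resubstitution_error`, `leave_one_out_error`) and the toy data are hypothetical, not taken from any particular library. The same code handles two groups or many groups without change.

```python
from math import dist

def fit_means(X, y):
    """Estimate the classification rule: one mean vector per group."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    return {g: [s / counts[g] for s in sums[g]] for g in sums}

def predict(means, row):
    """Assign the observation to the group with the nearest mean."""
    return min(means, key=lambda g: dist(means[g], row))

def resubstitution_error(X, y):
    """Classify the training data with the rule fitted on that same data."""
    means = fit_means(X, y)
    wrong = sum(predict(means, row) != label for row, label in zip(X, y))
    return wrong / len(X)

def leave_one_out_error(X, y):
    """Refit the rule n times, each time classifying the held-out observation."""
    wrong = 0
    for i in range(len(X)):
        X_train = X[:i] + X[i + 1:]
        y_train = y[:i] + y[i + 1:]
        means = fit_means(X_train, y_train)
        wrong += predict(means, X[i]) != y[i]
    return wrong / len(X)

# Hypothetical three-group toy data (one feature per observation).
X = [[0.0], [1.0], [3.4], [4.5], [6.0], [7.0], [10.0], [11.0], [12.0]]
y = ["A", "A", "A", "B", "B", "B", "C", "C", "C"]

print("resubstitution:", resubstitution_error(X, y))
print("leave-one-out: ", leave_one_out_error(X, y))
```

On this toy data the resubstitution estimate is 0, while the leave-one-out estimate is 1/9: the observation at 3.4 is classified correctly only when it helps to position its own group mean, which illustrates the optimistic bias of resubstitution.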




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd