Error rate estimation, Advanced Statistics


Error rate estimation is the term used for estimating the misclassification rate in discriminant analysis. A number of techniques have been proposed for the two-group situation, but the multiple-group situation has rarely been addressed. The simplest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and the proportion incorrectly placed is used as the estimate of the misclassification rate. This technique is known to have a large optimistic bias, because the same observations are used both to estimate the rule and to evaluate it, but it has the benefit that it can be applied to multigroup problems with no modification. An alternative technique is the leave-one-out estimator, in which each observation in turn is removed from the data, the classification rule is recomputed using the remaining data, and the removed observation is then classified by that rule. The proportion misclassified by this procedure has reduced bias compared with the resubstitution technique. This method can also be applied to the multigroup problem with no modification, but it has a large variance.
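The two estimators can be compared on a small example. The sketch below is illustrative only: it assumes NumPy is available, uses simulated three-group data, and stands in a nearest-group-mean rule for the discriminant rule (a simplification chosen for brevity, not a method named in the text). The function names are hypothetical.

```python
import numpy as np

def fit_group_means(X, y):
    """Estimate the (assumed) classification rule: one mean vector per group."""
    labels = np.unique(y)
    means = np.array([X[y == g].mean(axis=0) for g in labels])
    return labels, means

def classify(X, labels, means):
    """Assign each row of X to the group whose mean is closest (squared Euclidean)."""
    dist = ((X[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
    return labels[dist.argmin(axis=1)]

def resubstitution_error(X, y):
    """Classify the training data with the rule estimated from that same data."""
    labels, means = fit_group_means(X, y)
    return float(np.mean(classify(X, labels, means) != y))

def leave_one_out_error(X, y):
    """Remove each observation in turn, re-estimate the rule, classify the removed one."""
    n = len(y)
    wrong = 0
    for i in range(n):
        keep = np.arange(n) != i
        # Assumes every group still has at least one observation after removal.
        labels, means = fit_group_means(X[keep], y[keep])
        pred = classify(X[i:i + 1], labels, means)[0]
        wrong += int(pred != y[i])
    return wrong / n

# Simulated data (an assumption for illustration): three overlapping groups in 2-D.
rng = np.random.default_rng(0)
centres = ([0.0, 0.0], [1.5, 0.0], [0.0, 1.5])
X = np.vstack([rng.normal(loc=c, scale=1.0, size=(15, 2)) for c in centres])
y = np.repeat([0, 1, 2], 15)

resub = resubstitution_error(X, y)
loo = leave_one_out_error(X, y)
print(f"resubstitution estimate: {resub:.3f}")
print(f"leave-one-out estimate:  {loo:.3f}")
```

Neither function depends on the number of groups, which reflects the point that both techniques carry over to the multigroup problem without modification. For this particular nearest-mean rule the leave-one-out estimate can never be smaller than the resubstitution estimate, since removing an observation only moves its own group mean further away from it.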




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd