Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Explain prospective studies, Prospective study : The studies in which indiv...

Prospective study : The studies in which individuals are followed-up over the period of time. A general example of this type of investigation is where the samples of individuals ar

Explain lancaster models., Lancaster models : The means of representing the...

Lancaster models : The means of representing the joint distribution of the set of variables in terms of the marginal distributions, supposing all the interactions higher than a par

Best subsets regression, In the time series plot and scatter graphs there w...

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and

Statistics HW, we are testing : Ho: µ=40 versus Ha: µ>40 (a= 0.01) Suppose...

we are testing : Ho: µ=40 versus Ha: µ>40 (a= 0.01) Suppose that the test statistic is z0=2.75 based on a sample size of n=25. Assume that data are normal with mean mu and standa

Explain median absolute deviation (mad), Median absolute deviation (MAD) : ...

Median absolute deviation (MAD) : It is the very robust estimator of the scale given by the following equation   or, in other words we can say that, the median of the absolute

Balanced incomplete block design, Balanced incomplete block design : A desi...

Balanced incomplete block design : A design in which all the treatments are not used in all blocks. Such designs have the below stated properties: * each block comprises the

Probability weighting, Probability weighting is the procedure of attaching...

Probability weighting is the procedure of attaching weights equal to inverse of the probability of being selected, to each respondent's record in the sample survey. These weights

Linear regression, regression line drawn as Y=C+1075x, when x was 2, and y ...

regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual

Network sampling, Network sampling is a sampling design in which the simpl...

Network sampling is a sampling design in which the simple random sample or strati?ed sample of the sampling units is made and all observational units which are linked to any of th

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd