Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Mixture experiment, Mixture experiment is an experiment in which the two o...

Mixture experiment is an experiment in which the two or more ingredients are blended together to form an end product. The measurements are taken on the several blends of the ingre

Frailty, A term usually used for unobserved individual heterogeneity. Such ...

A term usually used for unobserved individual heterogeneity. Such variation is of main concern in the medical statistics particularly in the analysis of the survival times where ha

Regression analysis, with the help of regression analysis create a model th...

with the help of regression analysis create a model that best describes the situation. Indicate clearly the effect that each factors given in the attached file and other factors ma

Locally weighted regression, Locally weighted regression  is the method of ...

Locally weighted regression  is the method of regression analysis in which the polynomials of degree one (linear) or two (quadratic) are used to approximate regression function in

Explain initial data analysis (ida), Initial data analysis (IDA): The firs...

Initial data analysis (IDA): The first phase in the examination of the data set which comprises  number of informal steps including the following steps * checking the quality o

Chi-squared distribution, Chi-squared distribution : It is the probability ...

Chi-squared distribution : It is the probability distribution, f (x), of the random variable de?ned as the sum of squares of the number (v) of independent standard normal variables

Population averaged models, Population averaged models are the models for ...

Population averaged models are the models for kind of clustered data in which the marginal expectation of response variable is the main focus of interest. An alternative approach

Disclosure risk, The risk of being able to recognize the respondent's confi...

The risk of being able to recognize the respondent's confidential information in the data set. Number of approaches has been proposed to measure the disclosure risk some of which c

Whites general heteroscedasticity test, The Null Hypothesis - H0:  γ 1 = γ...

The Null Hypothesis - H0:  γ 1 = γ 2 = ...  =  0  i.e.  there is no heteroscedasticity in the model The Alternative Hypothesis - H1:  at least one of the γ i 's are not equal

Construct a stem-and-leaf diagram, The number of employees absent from work...

The number of employees absent from work at a large electronics manufacturing plant over aperiod of 106 days is given in the table below. 146 141 139 140 145 141 142 131 142 140

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd