Classification and regression tree technique (cart), Advanced Statistics

Assignment Help:

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory variables most significant for prediction of the response variable. Rather than ?tting the model to the sample data, a tree structure is obtained by dividing the sample recursively into the various of sets, each division being chosen so as to maximize some measure of difference in the response variable in the resulting two sets. The resulting structure often gives us the easier interpretation than a regression equation, as those variables most significant for the prediction can be quickly identi?ed. In addition this approach does not need distributional assumptions and is also more resistant to the effects of the outliers. At each stage the sample is divided on the basis of a variable, xi, according to answers to such questions as 'Is xi c' (univariate split), is ' Paixi c' (which is linear function split) and 'does xi A' (if xi is the categorical variable).
1423_regression.png
A design of the application of this method or technique is shown in the figure 35.


Related Discussions:- Classification and regression tree technique (cart)

Describe respondent-driven sampling (rds), Respondent-driven sampling (RDS ...

Respondent-driven sampling (RDS ): The form of snowball sampling which starts with the recruitment of the small number of people in the target population to serve as the seeds. Aft

Cohort study, Cohort study : An investigation in which the group of individ...

Cohort study : An investigation in which the group of individuals (or the cohort) is identi?ed and followed prospectively, possibly for many years, and their subsequent medical his

Expectaton, sales per day for a product are as follows: x= 10, 11, 12, 13 (...

sales per day for a product are as follows: x= 10, 11, 12, 13 (p)= 0.2, 0.4, 0.3, 0.1 obtain mean and variance of daily sale. if the profit is described by the following equation p

Machine learning, Machine learning  is a term which literally means the ab...

Machine learning  is a term which literally means the ability of a machine to recognize patterns which have occurred repetitively and to improve its performance based on the past

Quittingill effect, Quittingill effect is a  problem which occurs most fre...

Quittingill effect is a  problem which occurs most frequently in studies of the smoker cessation where smokers frequently quit smoking following the onset of the disease symptoms

Odds ratio, Odds ratio is the ratio of the odds for the binary variable in...

Odds ratio is the ratio of the odds for the binary variable in two groups of the subjects, such as, males and females. If the two possible states of variable are labeled as 'succe

Binomial distribution with continuity correction, Records on the computer m...

Records on the computer manufacturing process at Pratt-Zungia Limited show that the percentage of defective computers sent to  customers has been 5% over the last few years. Shipme

Bartlett''s test for variances, Bartlett's test for variances : A test for ...

Bartlett's test for variances : A test for equality of the variances of the number (k)of the populations. The test statistic can be given as follows   where s square is an

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd