Best subsets regression, Advanced Statistics

The time series plot and the scatter plots showed many clearly visible outliers. These were removed to check whether they were influential or had high leverage, and to see whether the assumptions of the multiple regression model are met.

Below are the row numbers of the outliers that I removed from the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309
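
As a rough illustration of this step, the sketch below drops the listed rows from a data set of 1519 records. It assumes the row numbers are 1-based (as in a Minitab worksheet); the `records` list is placeholder data and the `drop_rows` helper is a hypothetical name, neither comes from the original analysis.

```python
# Row numbers listed above (assumed to be 1-based worksheet rows).
OUTLIER_ROWS = [77, 674, 448, 757, 317, 549, 1187, 1198, 26,
                456, 405, 307, 1205, 1348, 611, 368, 309]


def drop_rows(records, rows_to_drop):
    """Return a new list without the given 1-based row numbers."""
    drop = set(rows_to_drop)
    return [rec for i, rec in enumerate(records, start=1) if i not in drop]


# Placeholder data: one dummy record per observation (not the real data set).
records = [{"row": i} for i in range(1, 1520)]
cleaned = drop_rows(records, OUTLIER_ROWS)

print(len(records), len(cleaned))  # prints: 1519 1502
```

With all 17 rows removed, 1502 observations remain for the regression, provided none of them has missing values.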

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

Vars  R-Sq  R-Sq(adj)  Mallows Cp         S  totexp  income  age  nk
   1  22.9       22.9        67.4  0.092326       X
   1   5.5        5.4       424.9   0.10222               X
   2  24.8       24.7        31.3  0.091236       X                X
   2  24.2       24.1        42.7  0.091572       X            X
   3  26.1       26.0         6.1  0.090461       X            X   X
   3  24.8       24.7        32.3  0.091239       X       X        X
   4  26.3       26.1         5.0  0.090397       X       X    X   X
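
For reference, the summary columns in this kind of output are usually defined as below. These are the standard textbook definitions, not something stated in the output itself. Here p is the number of estimated coefficients including the intercept, SSE_p is the residual sum of squares of the subset model, and n = 1519 - 17 = 1502 is an assumption (all 17 listed rows removed, no missing values).

```latex
\[
R^2_{\mathrm{adj}} = 1 - (1 - R^2)\,\frac{n-1}{n-p},
\qquad
C_p = \frac{SSE_p}{MSE_{\mathrm{full}}} - (n - 2p),
\qquad
S = \sqrt{\frac{SSE_p}{n-p}}
\]

% Check against the four-predictor row (p = 5, R^2 = 0.263, n = 1502):
\[
R^2_{\mathrm{adj}} = 1 - 0.737 \cdot \frac{1501}{1497} \approx 0.261,
\qquad
C_p = \frac{SSE_{\mathrm{full}}}{SSE_{\mathrm{full}}/(n-5)} - (n - 10) = 5
\]
```

Both values agree with the last row of the output (R-Sq(adj) 26.1, Cp 5.0). For the full model Cp always equals p, so its Cp of 5.0 says nothing about fit on its own; the Cp values of the smaller models are the informative ones.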

Best subsets regression is a way of identifying which of the independent variables (totexp, income, age and nk) are best suited to the regression model. In the results above, the model with income as the only predictor has the highest Mallows Cp (424.9) and the lowest R-squared (5.5%), and adding income to totexp, age and nk raises R-squared only from 26.1% to 26.3%. Income will therefore be the variable that is dropped, to see whether the data still fit the model.
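
The procedure can be sketched in a few lines of Python. This is a minimal illustration only, not the routine Minitab uses: it fits every subset by ordinary least squares using the standard library, and it runs on synthetic data because the original data set is not available. The coefficients used to generate `y`, the sample size and the helper names (`solve`, `sse`, `best_subsets`) are all assumptions made for the example.

```python
import itertools
import random


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x


def sse(X, y):
    """Residual sum of squares of an OLS fit with an intercept."""
    rows = [[1.0] + list(r) for r in X]
    p = len(rows[0])
    XtX = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    Xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    beta = solve(XtX, Xty)
    return sum((yi - sum(b * v for b, v in zip(beta, r))) ** 2
               for r, yi in zip(rows, y))


def best_subsets(data, y, names):
    """Fit every non-empty subset of predictors and report summary statistics."""
    n = len(y)
    ybar = sum(y) / n
    sst = sum((v - ybar) ** 2 for v in y)
    full_X = [[data[nm][i] for nm in names] for i in range(n)]
    mse_full = sse(full_X, y) / (n - (len(names) + 1))
    results = []
    for k in range(1, len(names) + 1):
        for subset in itertools.combinations(names, k):
            X = [[data[nm][i] for nm in subset] for i in range(n)]
            e = sse(X, y)
            p = k + 1  # coefficients including the intercept
            results.append({
                "vars": subset,
                "r2": 100 * (1 - e / sst),
                "r2_adj": 100 * (1 - (e / (n - p)) / (sst / (n - 1))),
                "cp": e / mse_full - (n - 2 * p),
                "s": (e / (n - p)) ** 0.5,
            })
    return results


# Synthetic stand-in data (NOT the original data set); coefficients are made up.
random.seed(0)
n_obs = 200
names = ["totexp", "income", "age", "nk"]
data = {nm: [random.gauss(0, 1) for _ in range(n_obs)] for nm in names}
y = [0.5 - 0.10 * data["totexp"][i] + 0.03 * data["age"][i]
     + 0.02 * data["nk"][i] + random.gauss(0, 0.09) for i in range(n_obs)]

results = best_subsets(data, y, names)
for r in results:
    print(f"{len(r['vars'])}  {r['r2']:5.1f}  {r['r2_adj']:5.1f}  "
          f"{r['cp']:7.1f}  {r['s']:.5f}  {', '.join(r['vars'])}")
```

Unlike the output above, which lists only the two best models of each size, this sketch prints all 15 subsets. A predictor is a candidate for dropping when leaving it out barely changes R-Sq(adj) and S and keeps Cp close to the number of coefficients in the model.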




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd