Best subsets regression, Advanced Statistics

Assignment Help:

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.


Related Discussions:- Best subsets regression

Interior analysis, Interior analysis is the  term now and again applied to...

Interior analysis is the  term now and again applied to analysis carried out on the fitted model in regression problem. The basic target of such analyses is the identification of

Odds ratio, Odds ratio is the ratio of the odds for the binary variable in...

Odds ratio is the ratio of the odds for the binary variable in two groups of the subjects, such as, males and females. If the two possible states of variable are labeled as 'succe

Determine the probablity, Dr. Stallter has been teaching basic statistics f...

Dr. Stallter has been teaching basic statistics for many years. She knows that 80% of the students will complete the assigned problems. She has also determined that among those who

Collector''s problem, Collector's problem : A problem which derives from th...

Collector's problem : A problem which derives from the schemes in which packets of a particular brand of coffe, cereal etc., are sold with coupons, cards, or other tokens. There ar

Multivariate data, Multivariate data is the data for which each observatio...

Multivariate data is the data for which each observation consists of the values for more than one random variable. For instance, measurements on the blood pressure, temperature an

Complier average causal effect (cace), Complier average causal effect (CACE...

Complier average causal effect (CACE): The treatment effect amid true compliers in the clinical trial. For the suitable response variable, the CACE is given by the difference in o

Historigram, difference between histogram and historigram

difference between histogram and historigram

Mann whitney test, Mann Whitney test is a distribution free test which is ...

Mann Whitney test is a distribution free test which is used as an alternative to the Student's t-test for assessing that whether the two populations have the same median. The test

Statistically modeling, A comprehensive regression analysis of the case stu...

A comprehensive regression analysis of the case study London has been carried out to test the 4 assumptions of regression: 1. Variables are normally distributed 2. Linear rel

Explain personal probabilities, Personal probabilities : A radically specia...

Personal probabilities : A radically special approach for allocating probabilities to events than, for instance, the commonly used long-term relative frequency approach. In this ty

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd