Best subsets regression, Advanced Statistics

The time series plot and the scatter plots showed many clearly visible outliers. These were removed to check whether they were influential or had high leverage, and to see whether the multiple regression assumptions are met once they are excluded.
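As an illustration of how leverage and influence can be checked numerically, here is a minimal sketch that computes hat values and Cook's distance for an ordinary least squares fit. The data are simulated, the function name `leverage_and_cooks` is hypothetical, and the cut-off `2p/n` is only a common rule of thumb, not something taken from the original analysis.

```python
import numpy as np

def leverage_and_cooks(X, y):
    """Hat values (leverage) and Cook's distance for an OLS fit with intercept."""
    n = X.shape[0]
    A = np.column_stack([np.ones(n), X])      # design matrix with intercept
    p = A.shape[1]                            # number of estimated coefficients
    Q, _ = np.linalg.qr(A)                    # thin QR, so the hat matrix is Q Q^T
    h = np.sum(Q ** 2, axis=1)                # diagonal of the hat matrix
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    mse = resid @ resid / (n - p)
    cooks = (resid ** 2 / (p * mse)) * h / (1 - h) ** 2
    return h, cooks

# Simulated data only (assumption): 200 points with one planted outlier in row 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 1 + 2 * X[:, 0] - X[:, 1] + 0.5 * rng.normal(size=200)
X[0] = [10.0, 10.0]                           # far from the other x values: high leverage
y[0] = -30.0                                  # far from the fitted plane: influential

h, cooks = leverage_and_cooks(X, y)
p = X.shape[1] + 1
high_leverage = np.where(h > 2 * p / len(y))[0]   # rule-of-thumb cut-off
print("high leverage rows:", high_leverage)
print("largest Cook's distance at row:", int(np.argmax(cooks)))
```

A point can have high leverage without being influential, so both quantities are worth inspecting before a row is removed.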

Below are the row numbers of the outliers that I removed from the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309
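A minimal sketch of removing these rows in one pass is given here. It assumes the numbers are 1-based row numbers in the original worksheet (so they must all be dropped together, not one at a time, or the later numbers would shift). The `records` list is a hypothetical stand-in for the real data.

```python
# Row numbers listed in the text, assumed to be 1-based positions in the original data.
OUTLIER_ROWS = [77, 674, 448, 757, 317, 549, 1187, 1198, 26,
                456, 405, 307, 1205, 1348, 611, 368, 309]

def drop_rows(records, rows_to_drop):
    """Return the records whose 1-based row number is not in rows_to_drop."""
    drop = set(rows_to_drop)
    return [rec for i, rec in enumerate(records, start=1) if i not in drop]

# Hypothetical stand-in for the 1519 observations: each record is its own row number.
records = list(range(1, 1520))
kept = drop_rows(records, OUTLIER_ROWS)
print(len(records), len(OUTLIER_ROWS), len(kept))  # prints: 1519 17 1502
```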

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

Vars  R-Sq  R-Sq(adj)  Mallows Cp         S  totexp  income  age  nk
   1  22.9       22.9        67.4  0.092326    X
   1   5.5        5.4       424.9   0.10222            X
   2  24.8       24.7        31.3  0.091236    X                  X
   2  24.2       24.1        42.7  0.091572    X              X
   3  26.1       26.0         6.1  0.090461    X              X   X
   3  24.8       24.7        32.3  0.091239    X       X          X
   4  26.3       26.1         5.0  0.090397    X       X      X   X

Best subsets regression is a way of identifying which of the independent variables (totexp, income, age and nk) are best suited to the regression model. According to the results above, the model with income as the only predictor has the highest Mallows' Cp (424.9) and the lowest R-squared (5.5%), so income is the variable that will be dropped, and the model refitted, to see whether the fit is affected. The four-variable model improves R-Sq(adj) only slightly over the best three-variable model (26.1 versus 26.0).
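The procedure can be sketched in a few lines: fit every subset of the predictors, then report the best models of each size by R-squared. This is a rough sketch on simulated data, not a reproduction of the Minitab run above; the variable names come from the text but the values do not. It assumes the usual definition of Mallows' Cp, SSE_p / MSE_full - (n - 2p), where p counts the coefficients including the intercept, and the function names are hypothetical.

```python
from itertools import combinations
import numpy as np

def sse_of_fit(A, y):
    """Residual sum of squares from an ordinary least squares fit of y on A."""
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    return float(resid @ resid)

def best_subsets(X, y, names, keep=2):
    """Fit every subset of predictors and keep the `keep` best per size by R-Sq."""
    n, k = X.shape
    ones = np.ones((n, 1))
    sst = float(np.sum((y - y.mean()) ** 2))
    mse_full = sse_of_fit(np.hstack([ones, X]), y) / (n - k - 1)
    table = []
    for size in range(1, k + 1):
        fits = []
        for cols in combinations(range(k), size):
            p = size + 1  # number of coefficients, including the intercept
            sse = sse_of_fit(np.hstack([ones, X[:, list(cols)]]), y)
            fits.append({
                "vars": [names[c] for c in cols],
                "r_sq": 100 * (1 - sse / sst),
                "r_sq_adj": 100 * (1 - (sse / (n - p)) / (sst / (n - 1))),
                "cp": sse / mse_full - (n - 2 * p),
                "s": (sse / (n - p)) ** 0.5,
            })
        fits.sort(key=lambda f: f["r_sq"], reverse=True)
        table.extend(fits[:keep])
    return table

# Simulated data only (assumption): income is correlated with totexp but adds nothing itself.
rng = np.random.default_rng(0)
n = 500
totexp = rng.normal(size=n)
income = 0.8 * totexp + 0.6 * rng.normal(size=n)
age = rng.normal(size=n)
nk = rng.normal(size=n)
wfood = -0.05 * totexp + 0.015 * age + 0.02 * nk + 0.09 * rng.normal(size=n)

names = ["totexp", "income", "age", "nk"]
X = np.column_stack([totexp, income, age, nk])
table = best_subsets(X, wfood, names)
for row in table:
    print(f"{len(row['vars'])}  {row['r_sq']:5.1f}  {row['r_sq_adj']:5.1f}  "
          f"{row['cp']:7.1f}  {row['s']:.6f}  {', '.join(row['vars'])}")
```

With this definition the full model always has Cp equal to its number of coefficients, which matches the value of 5.0 reported for the four-variable model. A smaller model is usually considered adequate when its Cp is close to its own p.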

