Select a subset of predictors for a model

Assignment Help Basic Statistics
Reference no: EM132096048

Two data scientists are discussing a strategy to select a subset of predictors for a model with n = 5,000 observations and p = 400 predictors. The ?rst suggests that they perform a forward stepwise selection procedure starting with a null model. Of these resulting models, they would ?nally choose the one with the smallest RSS. The second objects, saying that forward stepwise selection is a greedy algorithm and is unlikely to ?nd the true optimal model. Therefore, they should instead use a best subset algorithm. Do you agree with either of the data scientists? Explain your answer.

Reference no: EM132096048

Questions Cloud

How well would bidirectional search work on this problem : How well would bidirectional search work on this problem? What is the branching factor in each direction of the bidirectional search?
What could we do instead : Why would an SVM be a bad choice for this task? What could we do instead?
The difference between the terms flaming and shouting : Explain the difference between the terms "flaming" and "shouting" in relation to netiquette.
Complete a loan application form : Complete the Fact Find provided in Appendix 14 or your own company's Fact Find document - Complete the loan servicing calculation (NSR) by completing the form
Select a subset of predictors for a model : Two data scientists are discussing a strategy to select a subset of predictors for a model with n = 5,000 observations
Paper for a machine learning conference : Jacob is writing a paper for a machine learning conference. He has invented a "new" version of random forests:
Develop a method called min that takes a parameter : Develop a method called min that takes a parameter of an integer array and returns the smallest value stored in the parameter.
Estimating some parameter : Your friend who works in ?nancial investment company comes to you with a problem. She is interested in estimating some parameter a from her data
Add code to the onload event that calls the plugin : In the jquery.altrow.js file, code a plugin that uses the getElementsByTagName method to get all of an element's "tr" child elements.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Example on logistic regression model

Could mud-wrestling be the cause of a rash contracted by University of Washington students in the spring of 1992? Two physicians at the University of Washington health center wondered this when one male and six female students complained of rashes..

  Correlation or relationship between two variables

We say that two things have a positive relationship if they move in the same direction, or as one thing increases, so does the other. Or if one thing decreases, so does the other.

  Conduct an f test for a significant repression

Find the estimated regression line for age (x) and repair cost (y). Conduct an F test for a significant repression, and find the bounds on the p value for this test.

  Evidence to conclude that the average score

If the national average was 600, is there enough evidence to conclude that the average score has decreased? Use a p-value and 5% significance.

  Plot the random effects estimates for models

Plot the random effects estimates (forest plot) for models (3c) and (3d) and provide with an appropriate caption

  Estimate the two parameters of the beta distribution

a) Use the method of moments to estimate the two parameters of the beta distribution. b) Use a quantile-quantile plot to assess the goodness of fit.

  Whether p is the correct statistical notation

Explain whether p^ or p is the correct statistical notation for each proportion described:- The proportion that smokes in a randomly selected sample of n = 300 students in the eleventh and twelfth grades.

  Scatter diagram for best small companies

A recent article in Business Week listed the "Best Small Companies." We are interested in the current results of the companies' sales and earnings.

  Formulating suitable null and alternative hypothesis

Formulate a suitable null and alternative hypothesis to test the students' claim that they followed their instructor's recommendation at a 5% level of significance and interpret your findings.

  Provide an estimate of given difference

Provide an estimate of this difference.- Explain why it is incorrect to use the two-sample t test to see if the means differ.

  Testing the heights of baseball bounces

In previous tests, baseballs were dropped 24ft onto a concrete surface, and they bounced an average of 92.84% in. In a test of a sample of 40 new balls, the bounce had a mean of 92.67 in.

  Probability that randomly chosen manager will achieve score

Scores on a management aptitude examination are believed to be normally distributed with mean 650 (out of a total of 800 possible points).

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd