Subset of predictors for a model

Assignment Help Basic Statistics
Reference no: EM132090937

Two data scientists are discussing a strategy to select a subset of predictors for a model with n = 5,000 observations and p = 400 predictors. The ?rst suggests that they perform a forward stepwise selection procedure starting with a null model. Of these resulting models, they would ?nally choose the one with the smallest RSS. The second objects, saying that forward stepwise selection is a greedy algorithm and is unlikely to ?nd the true optimal model. Therefore, they should instead use a best subset algorithm. Do you agree with either of the data scientists? Explain your answer.

Reference no: EM132090937

Questions Cloud

Equivalent to ?tting the decision tree without bagging : Suppose we ?t a decision tree using bagging with B bootstrapped training sets. For each of the following
Simplicity of deployment : A data scientist wishes to ?t a linear model and has 100 predictors on hand. For simplicity of deployment, she wants no more than 10 of the predictors
Why would an svm be a bad choice for task : Why would an SVM be a bad choice for this task? What could we do instead?
Discuss the strategies you will use to mitigate problems : An important step in developing a projected (pro forma) income statement is to create a sales forecast and calculate anticipated revenue for the business.
Subset of predictors for a model : Two data scientists are discussing a strategy to select a subset of predictors for a model with n = 5,000 observations
Development is affected by administrators or school leaders : Discuss how the teacher's role in curriculum development is affected by administrators or school leaders.
Define the current problem and set of business requirements : Define the current problem and the set of business requirements for the problem you need to solve from a local and global perspective.
What responsibilities do businesses have toward a society : How would you justify Starbucks decision to not save money in the case of the polystyrene cups?
Identify personnel security functions based on risks : Determine three reasons why an organization should define the boundaries of control, identify personnel security functions based on risks, and manage change.

Reviews

Write a Review

Basic Statistics Questions & Answers

  We have b 18 times 109 bits that need to be sent over a

we have b 18 times 109 bits that need to be sent over a channel which has a constant but random rate r bitssecond with

  Probability that students support military spendings

30% of college freshmen support increased military spending. If 12 college freshmen are randomly selected, find the probability that more than 5 selected students support increased military spending.

  Probability that less than 64 patients will not be satisfied

According to a hospital administrator, historical records over the past 10 years have shown that 20 percent of major surgery patients are dissatisfied with after-surgery care in the hospital. A scientific poll based on 400 hospital patients has ju..

  Find the point that decreases the correlation

Make the plot such that if the outlier were removed, the correlation for the remaining points would be greater than r = 0.7.

  Proportion of viewers watching the channels

At the .05 significance level, is there a difference in the proportion of viewers watching the three channels?

  Find the population parameter of interest

What did they do? For the following reports about statistical studies, identify the following items (if possible).

  Stratified sampling-systematic sampling or cluster sampling

What kind of sampling have you done (simple random sampling, stratified sampling, systematic sampling or cluster sampling)?

  Procedure of hypothesis testing

Suppose the sample variance for 3 parts turns out to be S2 = .0005. Use a=.05 to test whether the population variance specification is being violated.

  A survey of 36 randomly selected iphone owners showed that

a survey of 36 randomly selected iphone owners showed that the purchase price has a mean of 392 with a sample standard

  Calculate the mean median and mode for the given data

This problem relates to Basic Statistics and it is about calculate the mean, median and mode for the given data

  How many turns might you expect given to take

On each turn, you will roll two dice. To win, you must roll a total of 3 or roll a 3 on one of the dice. How many turns might you expect this to take?

  Average salary for graduates entering

If the salaries are normally distributed with a standard deviation of $5000 (s = 5000), find the probability that:

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd