Reference no: EM13580479
Problem:
Provide me a half page response to each of the following two discussion posts. Tell me what you agree with and/or disagree with and if there is anything that you disagree with then provide any alternatives. Once again, pay close attention to detail in each of the following two discussions and label each discussion with the response to each.
Discussion 1:
When developing a multiple regression model choosing suitable predictor variables is important because if they are too closely correlated then the forecast will not be improved. One needs to figure out how many predictor values to use that might add accuracy to the forecast, however cost needs to be taken into consideration at the same time. This is where judgment comes in because finding a suitable number of predictors for your equation while keeping it simple and cost effective is the goal. So when building a multiple regression model you need to be able to use your judgment in the selection process.
Some of the issues that may be encountered are measuring the strength between the independent variables and the dependent variable. This analysis can lead to wasted computer time and physical time deciphering the outcomes. Also, if there are a lot of variables for the system to analyze it might choose to use some that are insignificant to the accuracy of the forecast. If the independent variables are too correlated then multicollinearity can occur which may cause computational difficulties and large stand errors. Overall it is a complex process from start to finish.
Based on previous experience multiple regression could be used by a car company to predict how well a particular new model will sell. Independent variables could be family size, age, income, gender, or geographic location. Dummy variable should also be introduced to the equation to determine if there are relationships between independent variables and the dependent variable. If there is multicollinearity present then it needs to be determined which independent variable should be dropped and judgment would come into play. The goal is to create a way to understand the present in hopes to better predict the future.
Discussion 2:
When using a multiple regression model, the first decision (besides what you would like to predict) is the selection of independent variables, or predictor variables. The best outcome is basically to achieve the highest R-squared value possible, but to do so with the least number of variables possible, specifically those variables that significantly contribute to an R-square increase. A large degree of judgement and experience on the forecaster's part is important here. Predictor variables should not be significantly correlated with each other (multicollinearity). Deletion of seemingly insignificant variables is an exercise of trial-and-error (t-tests). Removal of one variable affects the remaining ones and changes their statistical significance (t-values, or P-values in Minitab).
For example, if I wanted to predict the amount of commercial small business loan defaults for next quarter, I could use possible determinants: delinquency rates, debt service coverage ratios, unemployment rates, credit scores of principals, debt-to-income ratios of principals, instances of past-due property taxes. It is possible that two variables like the owners' credit scores and debt-to-income ratios may be intercorrelated and we would need to use only one to avoid multicollinearity which creates unstable forecast values. Delinquency rates may present some of the typical problems posed with time-series data in regression analysis. While this data is not usually affected by the season or time of year, it does exhibit behavior similar to seasonality because in the "short" months of February, April, June, September, and November there is a decline in number of delinquencies that hit the 30-day, 60-day, or 90-day status. This data would probably need to be adjusted before it was fit to use for regression. Again, it would take a good degree of judgement by the forecaster to determine which of the independent variables were actually helpful in producing a good forecast and which ones were not significant, and should be removed.