Reference no: EM132584368
Tasks
Comparing Two Samples:
1. Apply the function "plot" to the formula that relates the response "frequency" to the explanatory variable "march2007" in order to produce the two box-plots of the response. Redo the plotting with "frequency" replaced by "log(frequency)". The distribution of the variable "log(frequency)" is:
__ More symmetric, __ Less symmetric compared to the distribution of the variable "frequency".
Mark the most appropriate option and attach the R code that produces the two plots:
2. Mark the null hypotheses that you reject with a significance level of 5% and those that you do not reject:
(Reject/Don't Reject) H0: The expectation of "frequency" is the same in the two subsets,
(Reject/Don't Reject) H0: The expectation of "log(frequency)" is the same in the two subsets.
Explain your answer:
3. Mark the null hypotheses that you reject with a significance level of 5% and those that you do not reject:
(Reject/Don't Reject) H0: The variance of "frequency" is the same in the two subsets,
(Reject/Don't Reject) H0: The variance of "log(frequency)" is the same in the two subsets.
Explain your answer:
Linear Regression:
4. Apply the function "plot" to the formula that relates the response "frequency" to the explanatory variable "time" in order to produce the scatter plot. Add the regression line to the plot. The variability of the variable "frequency, for larger values of the explanatory variable, is:
__ Smaller, __ Larger, __ Constant.
Mark the most appropriate option and attach the R code that produces the two plots:
5. Mark the null hypotheses that you reject with a significance level of 5% and those that you do not reject:
(Reject/Don't Reject) H0: The slope of "time" in the regression line of the response "frequency" is equal to zero,
(Reject/Don't Reject) H0: The slope of "time" in the regression line of the response "log(frequency)" is equal to zero.
Explain your answer:
6. The 95%-confidence interval of slope of "time" in the regression line of the response "log(frequency)" is:
Lower end = ____, Upper end = ____.
Attach the R code that produces the confidence interval:
7. The regression line between "time" as an explanatory variable and "log(frequency)" as a response is:
__ Increasing, __ Decreasing, __ Constant.
Mark the most appropriate option and explain your answer:
The Relation Between Two Variables:
8. Apply the function "plot" to the formula that relates the response "frequency" to the explanatory variable "monetary" in order to produce the scatter plot. Add the regression line to the plot. The points in the scatter plot are:
__ All on the same line, __ Show a linear trend but are not on the same line, __ Don't show a linear trend.
Mark the most appropriate option and attach the R code that produces the plot:
Attachment:- Linear Regression Assignment.rar