Reference no: EM133228865
Econometrics of Data Science Assignment -
Suppose our goal is to quantify the impact of the repeal of the national maximum speed limit on average speeds on U.S. roads. Representative speed data are collected for particular time periods before and after the repeal. The spreadsheet "hw1_speed_data.csv" contains vehicular speed data before and after the repeal of a speed limit law. Use R to do the solve the following questions:
1. Generate summary statistics for vehicular speeds data and compare the results.
2. Generate and interpret box plots for vehicular speeds data (make them look nice).
3. Generate and interpret histograms for vehicular speeds data (make them look nice).
4. Find out the mean and median values of the after-speed data for those particular vehicles whose speeds before the repeal were greater than 60 mph.
5. Find out the frequency distribution of vehicular after-speed data and interpret results.
6. Generate 99% confidence intervals for mean vehicular after-speed data assuming the population variance is unknown. Explain each step and interpret the results.
7. Generate 95% confidence intervals for the variance of before-speed data. Explain each step and interpret the results.
8. Test whether the mean speed is 55 mph before and 60 mph after at the α=5% significance level. Explain each step and interpret the results.
9. Test whether the variance of after-speed data is less than 19 mph2 at the α=5% significance level. Explain each step and interpret the results.
10. Test that the mean vehicular speeds before and after are equal at the α=10% significance level. Explain each step and interpret the results.
11. Test that the vehicular speed variances before and after are equal at the α=5% significance level. Explain each step and interpret the results.
12. Use a Mann-Whitney-Wilcoxon test to assess whether the distributions of speeds before and after are equal. Also draw density plots using before and after speeds data. Interpret the results based on the test and drawing.
In each case, clearly explain what relationships you used, give each step of your reasoning as appropriate, and state your conclusions.
Notes: for questions 1, 2, 3 provide solutions for both of the vehicular before and after speeds data. Consider before and after speeds data as independent samples.
Attachment:- Speed_data.rar