Reference no: EM132372306
Question 1
Mauritius is one of the island where obesity and cardiovascular diseases are at its peak. According to Victoria Hospital most of the people suffering from obesity are from Plaines Willems and those suffering from cardiovascular disease are from black river. To test this hypothesis we need to know if really a relationship exist between obesity and cardiovascular. Moreover the two factors associated with obesity and cardiovascular are heights and weights. Therefore a survey was carried out by the NPCC.
The data for heights are as follows
151, 174, 138, 186, 128, 136, 179, 163, 152, 131
The data for weight are as follows 63, 81, 56, 91, 47, 57, 76, 72, 62, 48
Using the data above
• Create a relationship model
• Find the coefficients from the model created and create the mathematical equation.
• Get a summary of the relationship model to know the average error in prediction.
• Predict the weight of new persons.
• Plot the regression equation for the above data. In addition to that include "Your full name" as title and set the scatterplot Green. Label the y-axis and x -axis. Use abline function to include a line.
Note: You should explain all the steps in the code and in addition to that you are required to include the code in your assignment.
While including the code you should print screen the code from your PC including the date and time which shall appear on the right hand side .
R is a free statistical software which is available on goggle. Question 1 should be done completely on R .
Question 2
Running times (Y) and maximal aerobic capacity (X) for 14 female Runners. Data collected for running times and maximal aerobic capacity are listed below
X: 61.32 55.29 52.83 57.94 53.31 51.32 52.18 52.37 57.91 53.93 47.88 47.41 47.17 51.05
Y: 39.37 39.80 40.03 41.32 42.03 42.37 43.93 44.90 44.90 45.12 45.60 46.03 47.83 48.55
(a) Calculate the mean, median, MAD, MSD, and standard deviation for each variable. [Include all your steps and explain all the steps involved in details]
(b) Which of these statistics give a measure of the center of data and which give a measure of the spread of data? [Explain in your own words]
(c) Calculate the correlation of the two variables and pro-duce a scatterplot of Y against X. [Use excel for scatterplot, show all your computations concerning the correlation and explain all your steps]
(d) Why is it inappropriate to calculate the autocorrelation of these data? [Explain in your own words] Note: It should be typed on Microsoft word including all the formula used.
Question 3
Consider the quarterly electricity production for years 1-4:
Year
|
1
|
2
|
3
|
4
|
Q1
|
99
|
120
|
139
|
160
|
Q2
|
88
|
108
|
127
|
148
|
Q3
|
93
|
111
|
131
|
150
|
Q4
|
111
|
130
|
152
|
170
|
(a) Estimate the trend using a centered moving average.
(b) Using a classical additive decomposition, calculate the seasonal component.
(c) Explain how you handled the end points.
Note: Explain all the steps and computations in your own words and it should be typed on Microsoft word.