Find the correlation coefficient between the two variables

Assignment Help Basic Statistics
Reference no: EM131467543

Question 1

In an Australian city last year, the most popular organised sports for children were swimming and outdoor soccer. There were 19% of children who participated in swimming and 13% who participated in outdoor soccer. Suppose the two sports (swimming and outdoor soccer) were organised independently.

[a] What was the proportion of children who did not participate in swimming?

[b] What was the proportion of children who neither participated in swimming nor outdoor soccer?

[c] What was the proportion of children who participated in swimming only or outdoor soccer only but not both swimming and outdoor soccer?

Question 2

We have data on the lean body mass and resting metabolic rate for 14 women who are subjects in a dieting study. Lean body mass, given in kilograms, is a person's weight leaving out all fat. Metabolic rate, given in calories burned per 24 hours, is the rate at which the body consumes energy. The dataset is in the Excel file called ‘Metabolic' on Moodle.

[a] Construct a histogram to study the metabolic rate variable, and provide some brief comments describing the histogram.

[b] Find the correlation coefficient between the two variables. Provide some brief comments.

[c] Create a scatterplot that shows how metabolic rate depends on body mass. Comment on this scatterplot briefly.

[d] Find the least-squares regression line for predicting metabolic rate from body mass. Add this line to your scatterplot. Comment on the regression briefly.

[e] Identify one outlier in the Y direction and one outlier in the X direction. Remove them from the dataset. After removing these two outliers, find the new correlation between the two variables. Draw a new scatterplot for metabolic rate on body mass. Comment on how the correlation and the scatterplot changed after the two outliers were removed, and why.

[f] Fit a new regression for predicting metabolic rate from body mass using the dataset from [e], where the two outliers were removed. Add this regression line onto the scatterplot you created in [e]. Which subject has a particularly high metabolic rate value and which subject has a particularly low metabolic rate value relative to the pattern for the remaining subjects after the two outliers have been removed?

[g] Check the regression assumptions for the regression you fit in [f].

[h] Compare the two regression equations using their R-squared values, and identify which one is better. Why?

[i] Using the better regression equation which you identified in [h], predict the metabolic rate for a woman with a lean body mass of 45 kilograms.

Verified Expert

This document is prepared in word with the help of excel and stata software, it is based on fitting the data for regression analysis and finding the best fit regression line for prediction. This is completely original work and there is no plagiarism.

Reference no: EM131467543

Questions Cloud

European history was most impactful from the renaissance : What event in Western European history was most impactful from the Renaissance to the modern era?
Assumption of basic fixed-order-quantity inventory model : Which of the following is an assumption of the basic fixed-order-quantity inventory model?
Discuss the methods and data analysis section of the thesis : discuss the methods (including data collection) and data analysis section of the thesis.
Current legislation driving the quality reform in healthcare : Healthcare reform is not necessarily new on the healthcare scene. Current legislation driving the quality reform in healthcare.
Find the correlation coefficient between the two variables : What was the proportion of children who did not participate in swimming and What was the proportion of children who neither participated in swimming
Manufacturing best suited to the application of robotics : Give three (3) characteristics of robots and provide three (3) conditions in manufacturing best suited to the application of robotics.
Explain two styles of threats to validity : Describe the various components of an experimental method plan, as envisioned by the author.
Describe the spread of industry : Briefly describe the spread of industry throughout Europe and into America.
Define cumulative distribution function : Cumulative distribution function (CDF) Suppose that discrete random variable D take values {1, 2, 3,...,i,...} with probability 1/2i. What is its CDF?

Reviews

len1467543

4/19/2017 5:56:58 AM

[f] A soft copy of your assignment is required to be submitted onto the Moodle site before 11:59pm Friday ; email submission is NOT acceptable. After the submission of your assignment, there will be a compulsory validation quiz available on Moodle based on your assignment, which MUST be answered before 11:59pm Tuesday. The mark you receive for your validation quiz will be the mark you receive for your assignment.

len1467543

4/19/2017 5:56:53 AM

[a] Your assignment should be done individually. [b] Your student ID must be placed on the top right-hand corner of every page. [c] All the questions must be attempted, using Excel where required. [d] All graphs should be presented within the body of the assignment under the relevant questions, NOT at the end of the assignment in an appendix. [e] Your assignment is expected to contain no more than 7 single-sided pages.

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd