Create a multiple regression model

Assignment Help Basic Statistics
Reference no: EM13919933

The games of the 28th Olympiad were held in Athens, Greece, in the summer of 2004. The data file contains information about 131 of the participating nations, including the following variables.

Medals (Won at Athens Olympics)
Gross Domestic Product ($ Billion)
Population (Million)
Area (Million Sq. Km.)
Infant Deaths Per 1,000 Births
Inflation Rate
Fertility Rate
GDP Growth Rate
Telephones (Million)

The final medal count is perhaps somewhat misleading as a measure of national athletic excellence. For example, the United States won the most medals in absolute terms (Exhibit 1), but isn't even among the top ten in terms of medals per capita (Exhibit 2).

Country              Gold        Silver       Bronze         Total
United States       35            39             29             103
Russia                 27            27             38              92
China                  32            17             14              63
Australia              17            16             16              49
Germany             14            16             18              48
Japan                  16             9              12              37
France                 11             9              13              33
Italy                    10            11             11              32
South Korea          9            12              9               30
United Kingdom     9              9              12              30

Exhibit 1: Top Medal Counts, 2004 Athens Olympics

Country Medals Population Medals/Million Citizens
Bahamas 2 297,477 6.72
Australia 49 19,731,984 2.48
Cuba 27 11,263,429 2.40
Estonia 3 1,408,556 2.13
Slovenia 4 1,935,677 2.07
Jamaica 5 2,695,867 1.85
Latvia 4 2,348,784 1.70
Hungary 17 10,045,407 1.69
Bulgaria 12 7,537,929 1.59
Greece 16 10,665,989 1.50

Exhibit 2: Top Medal Counts per Capita, 2004 Athens Olympics

Add a dummy variable to the data set, representing whether the nation was the host of the Olympics. (Hint: The 2004 Athens Olympic Games were hosted by Greece.) Perform a correlation analysis on all of the variables in the data set, and show the results here in descending order of importance.

Some interesting countries, such as Russia and Jamaica, are excluded from the data set because information was not available for all variables.

1. Add a dummy variable to the data set, representing whether the nation was the host of the Olympics. (Hint: The 2004 Athens Olympic Games were hosted by Greece.) Perform a correlation analysis on all of the variables in the data set, and show the results here in descending order of importance. Explain your results.

2. Show scatter diagrams of the two most important predictors, showing their relationship to "Medals". Use labels to indicate interesting outliers, show a linear trend line, and write something intelligent about your graphs.

3. Create a multiple regression model, based on your analysis above. Try to find the model that has the highest adjusted R-square value for predicting the number of Olympic medals won by a country.

4. Using your model from above, perform a hypothesis test to see whether the true effect of "Area" is less than one medal for every 1,000,000 square kilometers at the 5% level of significance, all other factors taken into account. Find the p-value of your test and explain its meaning.

5. Using your model from above, and assuming the same test, alpha, and variance as in Part 10, what would be the risk of a Type II error if the true effect of "Area" were in fact known to be 0.5 medals per 1,000,000 square kilometers?

6. Using your model from Part 9 above, calculate the residual error in "Medals" for each country. Show the residual errors for the ten countries who most "overperformed" in the Athens Olympics (i.e. won more medals than your model predicted they would) and the ten countries who most "underperformed". Explain your results.

7. Discuss the residual errors in this model. Use charts as appropriate.

8. Do these data provide evidence of a "home field advantage" in the Olympics? In other words, can we conclude that Greece won medals above and beyond what would otherwise been expected because it was the host country?

Reference no: EM13919933

Questions Cloud

Discuss effects on emc''s planning and operating processes : Discuss the effects on EMC's planning and operating processes if EMC implements a "demand pull" production system. Identify and describe at least five benefits to EMC that should result form the demand pull" production operating approach
Dividends per share : JMH has the following stock outstanding-Preferred, 5%, $100 par value, 2,000 shares-Common, $60 par value, 5,000 shares-Compute the amount of dividends per share payable to each class of stock for each independent case.
Strategic viewpoint of game theory : The "Prisoner's Dilemma" was the gateway to the strategic viewpoint of game theory. In this assignment, you will explore the applications of game theory to economic business decisions.
How do we appear from the view of the shareholder : How do we appear from the view of the shareholder? Customer perspective: How can we best serve our customers? How do we position ourselves on the market?
Create a multiple regression model : Create a multiple regression model, based on your analysis above. Try to find the model that has the highest adjusted R-square value for predicting the number of Olympic medals won by a country.
Define validaty and reasonableness of letgo sales projection : What should Letsgo use to measure performance for each of the managers in the case? What bonus system would you suggest that incorporates these measures and also encourages the managers to work as a team?
Business intelligence and data warehouses : Businesses today are extremely reliant on large amounts of data for making intelligent business decisions. Likewise, the data warehouses are often structured in a manner that optimizes processing large amounts of data.
How can we sustain improve strength and innovative potential : How can we sustain and continuously improve our strengths and innovative potential? How can we organize ourselves in such a way so as to guarantee long-term, efficient performance?
Identify two types of staffing models : With more businesses projected to move to the area and the building of new housing developments, it is projected that this growth could be long term.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd