Case study: are the fish safe to eat?

Assignment Help Basic Statistics
Reference no: EM138501

CASE STUDY: Are the Fish Safe to Eat?

Assessing which Factors affect Mercury Levels in Fish in Maine Lakes.

Mercury is a heavy metal that occurs naturally in the environment through several forms (Elemental, organic, inorganic) and through human activity (burning of fossil fuels and Incineration of waste). In particular, large amounts of mercury are found in some fish.

Mercury enters fish in two ways: (i) direct absorption of mercury from water and (ii) by eating other organisms which contain mercury. Human consumption of mercury is known to lead to neurological and physical disorders. A study was carried out in Maine to investigate which characteristics of lakes are associated with higher levels of mercury in the fish in the lakes. Data were collected on 120 lakes.

The data is posted on the course website. The variables in the dataset are:

_ Lake: name of the lake.

_ Mercury: the mercury level in the Fish in parts per million (ppm).

_ Dam: whether a dam is present or not - an indicator variable which is 0 if there is no functional dam present (so all water flow is natural) and is 1 if there is a man-made dam in the drainage area of the lake.

_ Type: each lake is classified as 1. 'oligotrophic' (sustains fish based on its vege-tation and oxygen), 2. 'eutrophic' (has few fish), or 3. 'mesotrophic' (in between oligotrophic and eutrophic).

Some questions of interest are:

A. Most states consider a mercury level of more than 0.5 ppm to be high enough to take action (issuing a health advisory, clean-up methods, etc.). Are mercury levels in Maine high enough to be of concern?

B. The industries that benefit from dams are concerned that environmentalists will claim that high levels of mercury in fish are related to the presence of dams. Does the data support this claim?

C. Do mercury levels vary by lake type? If so, specifically how?

D. Do mercury levels vary by lake type differently for lakes with dams than for lakes without dams.

Assignment Questions:

It would be helpful to examine the data and do some preliminary analysis / descriptive statistics before answering these questions. (You do not have to hand in this preliminary work but you may use it when commenting on justification of methods). Treat your work as an exploratory analysis, in which case Type II errors are typically of more concern than Type I errors. So use a significance level of 0:1. For each of the following questions, include the necessary output/plots/numbers to answer the questions and give a complete commentary on the results:

1.    Answer Question of Interest A. Name the statistical method(s) you are using and Justify. Specify the parameter or test of interest and define your notation. Display a relevant confidence interval and/or test and then make a conclusion in practical terms.

2.       Create a new variable called \DamType" that combines the type of lake and whether or not a dam is present. i.e. create 6 categories, one for each  combination of lake type and dam status. Create three sets of side-by-side box plots to compare mercury levels: (1) between lakes that have a dam and do not have a dam, (2) between different types of lakes, (3) between the 6 combinations of type and dam status. Do there appear to be differences?  Elaborate. Are there any outlier lakes? If so, name them.

(To code \DamType" in R, create a new vector that will hold category names for the combined type and dam status. Then loop through the vectors for Dam and Type, use and if statement to check for each combination and then assign a category name for each combination. Use categories names that will easily identify the combinations. Similar code was used in the Spock Conspiracy Trial to create the variable \judge1").

3.    Use a two-sample t-test to investigate Question of Interest B. If you found out-lier(s) in Q2, then do the analysis both with and without the outlier(s) and give the conclusions. Are the outlier(s) influential?

4.    Use a One-Way ANOVA to investigate Question of Interest B. What relationship should exist between the test statistic from this question and from Q3? If you found outlier(s) in Q2, then do the analysis both with and without the outlier(s) and give the conclusions. Are the outlier(s) influential?

5.    Use a One-Way ANOVA to investigate Question of Interest C. If you found outlier(s) in Q2, then do the analysis both with and without the outlier(s) and give the conclusions. Are the outlier(s) influential? If you conclude that there are differences in the mean mercury levels between the types of lakes, conduct an appropriate post-hoc analysis to determine where the differences lie. Give a practical conclusion.

6.    Use a One-Way ANOVA to investigate Question of Interest D. If you found out-lier(s) in Q2, then do the analysis both with and without the outlier(s) and give the conclusions. Are the outlier(s) influential? If you conclude that there are differences in the mean mercury levels between the six categories of combined type and dam status, conduct an appropriate post-hoc analysis to determine where the differences lie. Give a practical conclusion.

7.    Assess the validity of the model assumptions required for Q6 above. Display the relevant diagnostic plots / numerical values and comment with reference to them.

8.    8. Suppose you were to answer Question of Interest D. using a Two-Way ANOVA instead. Without carrying out the analysis in R, what conclusion would you expect and why? (Do not do the analysis or include output.)

9. The mercury level in each lake was found by combining flesh from a few fish caught in the lake and then analyzing the mixed sample. The number of fish per lake ranged from 2 to 5 and hence different lakes had different numbers of fish sampled. Is this a  concern? If so, explain specifically why and suggest an alternative approach for doing the analysis (but do not actually redo the analysis). If not, explain why.

Attachment: data_mercury.xlsx

Reference no: EM138501

Questions Cloud

Discussion and thoughts : One of the key decisions a company must make on contemplating a new site, or a new operation at an existing site, is its capacity. What are your thoughts?
Consider the following lpp : Consider the following LPP  Without using artificial variable(s), solve the given LPP (do not solve the dual problem). Write the dual of the above problem.
Analyze key parameters of a company''s supply chain operation : Topic:  Analyze key parameters of a company's supply chain operations, suggest improvements, and predict the operational and economic impact of the changes.
Explain why only certain cell types are permissive : Explain why only certain cell types are permissive for transcription of HIV proviral DNA. What cellular factors are involved and why?
Case study: are the fish safe to eat? : CASE STUDY: Are the Fish Safe to Eat?  Assessing which Factors affect Mercury Levels in Fish in Maine Lakes.
What is the producer tax burden : Demonstrate using a consumer's diagram (budget constraints and indifference curves) that a tax on barley that leaves each consumer buying the same amount of barley as before can still create an excess burden.
What is the broad sense heritability : A) What is the broad sense heritability of seed count in this population of bluebonnets? B) What might cause this estimate of heritability to be inaccurate?
Novel just in time for holiday giving : Your publishing house is about ready to release John Grisham's newest novel just in time for Holiday giving.
Direct competition with the oligopoly : Assume that a very competitive start-up enters the market in direct competition with the oligopoly you described in the e-Activity.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Two histograms represent the distribution of acceptance rate

The accompanying two histograms represent the distribution of acceptance rates (percent accepted) among 25 business schools in 1995. The histograms use different class intervals, but are based on the same data. In each class interval, the left endpoi..

  Evaluate the random variable with standard deviation

The weight of the oranges, in pounds (1 pound = 16 ounces) is a random variable with standard deviation

  Partial anova table

Assume the data for the three groups are independent and the data are approximately Normal. The P-value is less than 0.01. The correct conclusion is that

  Evaluate p-value for the f test for main effect of smoking

A study of cardiovascular risk factors compared heavy smokers, light smokers, and nonsmokers. Both men and women were included in the study and gender differences were also to be investigated. One of the variables measured was the heart rate after..

  Kruskal-wallis rank test

The Kruskal-Wallis rank test is a nonparametric replacement - The Wilcoxon rank sum test tests a null hypothesis

  Margin of error with confidence

An SRS of 100 of a certain popular model car in 1993 found that 20 had a certain minor defect in the brakes. An SRS of 400 of this model car in 1994 found that 50 had the minor defect in the brakes. Let p1 and p2 be the proportion of all cars of t..

  Find the probability of getting a head and rolling a six

Find the probability of getting a head and rolling a six

  Graphically represent the relationship

A major study examined the relationship between cause of death (heart attack, cancer, stroke, accident, etc.) and age. A good way to graphically represent the relationship is with

  Evaluate the confidence interval

The 50 members of the placebo group had blood pressure lowered by an average of 15 points with standard deviation 8 points. To analyze this information we will use a, When testing whether or not the increases in spending are different, the test is ..

  Difference between the t and z distributions

The correct set of hypotheses to test the effect of the bonus plan, the difference between the t and z distributions

  Estimate the population mean

Estimate the population mean

  What conclusion can be made from this study

It has been claimed that women live longer than men; however, men tend to be older than their wives. Ages of sixteen husbands and wives from England were obtained. The null hypothesis of equality of means is rejected. What conclusion can be made f..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd