Obtain the pearson correlation coefficient

Assignment Help Applied Statistics
Reference no: EM132143371

Quantitative Methods in Health Statistical Analysis Report -

Question 1 -

Are you a Lark or an Owl? Studies indicate that about 10% of us are morning people (Larks) while 20% are evening people (Owls) and the rest are not specifically classified as either. Studies also indicate that this circadian preference may not be settled until the age of 22 or later.

(a) Is there evidence that the owl/lark preferences for university students differ from the claimed proportions? Formulate and perform an appropriate hypothesis test at 5% significance level using the data shown in Table 2. Use Minitab to obtain the test statistic and the P-value. For full marks, include appropriate Minitab output. Use the STATE-FORMULATE-SOLVE-CONCLUDE procedure and perform follow-up analysis if appropriate.

Type

Count

Claimed proportion

Lark

41

0.1

Neither

163

0.7

Owl

49

0.2

Table - Circadian preference: Summary of survey responses and claimed proportions

Additional Minitab instructions: In order to complete this question, enter the data from Table above into Minitab and perform a 'Chi-Square Goodness-of-Fit (One Variable)' test using the option to 'Test specific proportions'.

(b) Are stress levels of students affected by circadian preference? Use Minitab to obtain a 100% stacked column chart that shows the conditional distribution of stress level given circadian preferences. Make two observations based on your chart.

(c) Test to see if there is a statistically significant relationship between the variables LarkOwl and Stress (see descriptions in Table 1- attached). Formulate and perform an appropriate hypothesis test at 5% significance level. Use Minitab to obtain the test statistic and the P-value. For full marks, include appropriate Minitab output. Use the STATE-FORMULATESOLVE-CONCLUDE procedure and perform follow-up analysis if appropriate.

Question 2 -

Are cognitive skills and alcohol use related? In order to address this question, you are going to work with variables CognitionZscore and AlcoholUse from the SleepStudy.xlsx data file. Descriptions of these variables are given in Table 1.

(a) Use Minitab to produce boxplots of cognitive scores by alcohol use shown horizontally within the same graph. Comment briefly on how cognitive scores compare across groups and whether you expect to find any statistically significant differences.

(b) Is there a significant difference in mean cognitive skills based on the level of alcohol use as reported by students? Formulate and perform an appropriate hypothesis test at 5% significance level. Use Minitab to obtain the test statistic and the P-value. For full marks, include appropriate Minitab output. Use the STATE-FORMULATE-SOLVECONCLUDE procedure.

(c) Is it appropriate to argue cause and effect, in either direction, based on these results? Why or why not? Explain briefly. Hint: What type of study is this?

Question 3 -

Which attitudes and habits might influence academic performance? In order to answer this question, you are going to investigate the correlation between GPA and each of the following variables: the number of early classes, the number of missed classes and the average hours of sleep. Answer the questions that follow. Variable descriptions are shown in Table 1.

(a) Obtain the Pearson correlation coefficient and the corresponding P-value for GPA and the number of early classes (NumEarlyClass). What does a positive correlation mean in this case? Does the sample correlation provide sufficient evidence of an association between those two variables? Explain briefly.

(b) Now obtain the Pearson correlation coefficient and the corresponding P-value for GPA and the number of missed classes (ClassesMissed). Does the sample correlation provide sufficient evidence of an association between those two variables? Is it positive or negative? What does it mean in practical terms? Explain briefly.

(c) Finally obtain the Pearson correlation coefficient and the corresponding P-value for GPA and the average hours of sleep (AverageSleep). What does a positive correlation mean in this case? Does the sample correlation provide sufficient evidence of an association between those two variables? Explain briefly.

Question 4 -

Sleep Quality and DAS score. In the study students were rated on sleep quality (PoorSleepQuality) as well as on Depression, Anxiety and Stress scales, with the DAS score (DASScore) giving a composite of the three scores. How well does the DAS score predict sleep quality? Answer the questions that follow. Variable descriptions are given in Table 1.

(a) Use Minitab to obtain a scatterplot with DASScore as the independent variable (x) and PoorSleepQuality as the dependent variable (y). Does it make sense to fit a linear regression model in this case? Justify your answer briefly.

(b) Use Minitab to fit a simple linear regression model including residual plots. Are conditions for linear regression satisfied? Answer in terms of Linearity, Independence, Normality and Population standard deviations.

(c) Comment on the strength of the relationship between sleep quality and DAS score using the coefficient of determination. What is its value? What precisely does it measure in this scenario?

(d) What is the value of the slope? What does it measure in this scenario?

(e) Is the relationship between sleep quality and DAS score statistically significant? In other words, is the slope estimate statistically significant at 5% level? How do you know? Explain briefly.

(f) Suppose that one of the student at this university has a fairly high DAS score of 40. Use Minitab to obtain a prediction of sleep quality for this student, including an appropriate interval for that prediction. Discuss the accuracy of that prediction as shown in Week 9 workshop.

Statistical Analysis Report -

Your report should consist of sections described below.

Introduction - Provide the context and rationale for the study. Use your own words! There is no word limit, just ensure you have explained what the report will contain. As a guideline, one paragraph will be sufficient.

Methods - Include the following:

  • A brief description, in your own words, of how the data was collected.
  • What type of study was conducted? Name the study design.
  • A description of the sample (including the sample size and any demographic information).
  • A brief description of variables that you have analysed.
  • A list of statistical procedures that you have used.

Results & Discussion - Summarise and discuss the main results of your analyses from Questions 1 to 4. You may use subsections, tables etc. as you see fit. Present and discuss results in a clear and simple way:

  • Present findings of statistical analyses in a logical sequence. Descriptive statistics about variables of interest are usually presented first, followed by the results of further statistical analyses.
  • Include copies of key diagrams from Questions 1 to 4 as relevant to your presentation of results.
  • State each result and the corresponding statistical procedure, and report P-values to three decimal places. However, do not include numerical calculations or full details of statistical procedures and condition checking (e.g. full Minitab output).
  • Interpret your statistical findings and discuss their practical significance. In particular, use your results to answer the questions that prompted this study. Are any of the results surprising in any way?
  • Indicate shortcomings, if any, of the analyses that were performed. Indicate in particular whether there are any issues with internal and external validity of this study.

There is no word limit. As a guideline, two pages (two and a half at most) will be sufficient, including any tables and graphs. Remember, marks will be awarded for quality not quantity!

Conclusion - What can you conclude from your analysis about sleep, circadian preference and academic performance? Which other factors appear to be important? Explain briefly. There is no word limit. As guideline, one short paragraph will be sufficient. Do not introduce any new information in this section!

Attachment:- Assignment File.rar

Reference no: EM132143371

Questions Cloud

What is the minimum value for n and m : Assume that there are n columns with default values and there are m columns with NULL values.
Devise and analyze an efficient algorithm for finding : Devise and analyze an efficient algorithm for finding the median. Do the same for n arrays, each with n elements.
Write an algorithm to delete the node with largest integer : An unique integer is stored in each node of a doubly-linked list which has a reference, start, that points to the first node of the list.
Write a pseudocode which will take a matrix as input : The second function takes a matrix as input and returns a row echelon form for the input matrix.
Obtain the pearson correlation coefficient : MATH 1065 - Quantitative Methods in Health Statistical Analysis Report. Obtain the Pearson correlation coefficient and the corresponding P-value for GPA
What is the data rate of this transmission : Suppose you transferred three packets each containing 1000 bytes of data from one system to another. The entire process took 1 minute and 20 seconds.
Determine the possible total amounts you can form : Determine the possible total amounts you can form using these gift certificates. Prove your answer using strong induction.
What is the one-way propagation delay between a and b : Suppose that each switch has a 20-bit processing delay in addition to a store-and-forward delay. At what time, in seconds, is A's packet delivered at B?
Find the probability that there are more than 20 users : Find the probability that there are more than 20 users transmitting simultaneously. (You need to calculate the exact value, not just the formula.)

Reviews

len2143371

10/17/2018 5:04:44 AM

This assignment is worth 20% of your final mark. It is due no later than 11 pm on Friday 26 October in Week 12. You will need to submit your assignment via Gradebook. Marked assignments will be returned to you electronically. The file you submit needs to be in a pdf format and prepared using the template provided. Relevant MINITAB output should be copied and pasted into Word as a picture (Windows) or pdf (Mac). Do not use Print Screen as the quality may be compromised. For full marks, ensure that appropriate axis labels, meaningful titles and legends are included with all graphical displays. Failure to follow the template, poor communication or a messy layout will attract a penalty of up to 10 marks (10% of maximum marks available). Any late submission will attract a penalty of 10 marks (10% of maximum marks available) per working day, or part thereof, the assignment is late. The cut-off time is 11pm each day. An example of a ‘good’ and ‘bad’ statistical analysis report is available from the course website.

len2143371

10/17/2018 5:04:37 AM

Suggested schedule to work through this assignment: Suggested tasks to be completed - Download the assignment instructions and the submission template; read through them to get an idea of what is required. Download the data file and make sure that it can be opened from the computer you will use to work on this assignment. Start working on Questions 1 and 2. Work on Questions 3 and 4. Write and check the report – it should be consistent with results from Questions 1 to 4. Check the entire submission for completeness and adherence to the template, and submit. Note: You are not required to include additional sources (e.g. internet articles or scientific papers) but if you do, ensure you include a reference list and cite them in text appropriately.

Write a Review

Applied Statistics Questions & Answers

  How many years would it take to pay off this debt

As of october 2009, the united states federal dept was $11,963,668,027,500. If each of the 307,000,000 people in the United States paid an extra $1000 in taxes each year, every year, how many years would it take to pay off this debt?

  Hypotheses are created before the data is examined

In statistics, usually hypotheses are created before the data is examined. However, in today's high data output environment, many findings may be examined further after the data is analyzed. Do you think that it is a good approach to generate such hy..

  A marketing analyst in a large grocery store chain

A marketing analyst in a large grocery store chain

  Describe and interpret the shape of distribution of ratings

Describe where the satisfaction ratings seem to be concentrated. Describe and interpret the shape of the distribution of ratings. Write out the eight classes used to construct this histogram.

  Problem1 the data below shows the number of absences x and

problem1 the data below shows the number of absences x and the final grade y of seven students in the statistics

  Find tenth percentile of distribution of individual forecast

What percentage of individual forecasts are at or below the 10th percentile of the distribution of forecasts? What percentage are at or above the 10th percentile? Find the 10th percentile of the distribution of individual forecasts.

  A high school principal studied the amount of time

4. A high school principal studied the amount of time her students devote to working at an after-school job. She randomly selected 15 students, obtained their working hours, and computed the sample mean to be 12.3 hours and the sample standard deviat..

  How many of acme''s employees use drugs?

Acme Manufacturing Company requires all of its 5000 employees to take a drug test. Suppose 2% of the employees actually use drugs (although the company does not know this number). The drug test is 95% accurate.

  Difference in gas prices between ontario and quebec

Determine if there is a difference in gas prices between Ontario and Quebec.

  A quality control engineer at a potato chip company tests

A quality control engineer at a potato chip company tests the bag-filling machine by weighing bags of potato chips. Not every bag contains exactly the same weight. But if more than 15% of bags are overfilled, then they stop production to fix the mach..

  Is there a difference between married and single officers

Is there a difference between married and single officers on perceptions that their job was stressful - It is hypothesized that married officers would perceive the job as more stressful.

  Determine shape of the distribution based on sample data

Compute the mean and median. Determine the shape of the distribution based on the sample data? Explain your conclusion.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd