Application of data analysis software

Assignment Help Other Subject
Reference no: EM133049928

CIS4066-N Statistical Methods for Data Analytics - Teesside University

Statistical Methods for Data Analytics ICA

Learning outcome 1: Use own judgement to select a valid statistical method in the context of the research project.
Learning outcome 2: Manage complex data within the application of data analysis software in order to run tests.

Research, Knowledge and Cognitive Skills

Learning outcome 3: Analyse complex related and unrelated data sets using a range of methods.
Learning outcome 4: Critically analyse and interpret outcomes of statistical tests in order to identify patterns and significance levels.
Learning outcome 5: Critically appraise the validity and reliability of the methods available.

Professional Skills
Learning outcome 6: Demonstrate an ethical understanding of data analysis and its effect in a wider social context.

Qualitative statistics: thematic analysis

Using the Guardian website select an article with at least 20 comments. Select a subset of comments (around 20) for the following analysis.

1. Provide a link to the article you have selected. List the comments you have chosen, the key themes you identify and the number of times they occur from the comments.

2. Describe the themes you identified and the key features that make the theme recognisable (words, metaphors, emotional language, etc.). Discuss your findings
- what conclusions can you make?

Probability and statistics fundamentals

3. Given the event E: "The Covid-19 emergency will finish in 2022", define the following events F1 and F2, both theoretically and giving concrete examples of what they might be:

- an event F1 such that E and F1 are independent
- an event F2 such that E and F2 are dependent

4. Derive with mathematical steps the final formula of Bayes' theorem.
Describe in general terms in what cases the theorem is useful, and propose a concrete scenario where it can be used.

5. Discuss the different scales of measurements. Provide two examples for each scale.

6. There are four medals (Gold, Silver, Bronze and Wood) on a table, but they are all wrapped with dark wrapping paper, such that it is impossible to distinguish them. You would like to find the gold medal.

The game starts as follows. You pick one medal without unwrapping it, and then the game host unwraps one of the remaining medals and reveals that it is a silver medal. (Assume here that the host unwraps a medal with equal probability, but knowing where the gold medal was and avoiding unwrapping the gold medal if still on the table, to keep the game interesting to watch until the end.)

You have now three medals left to unwrap (one in your hand, two on the table). At this point, the host gives you the option to change your mind and swap your medal for one of the two left on the table.

What would you do at this point? Would you keep your medal, or swap it with one of the two medals left on the table? If so, which one?

Hints: Find the solution by using Bayes' theorem, calculating all the conditional probabilities involved. Start calculating the probability of having Gold in our hands

given that we know that the host unwraps Silver, P(G|Hs) = . . ..
Then compare with the probability of having Bronze or Wood in our hands given that we know that the host unwraps Silver, P(B|Hs) = . . .., P(W|Hs) = . . ..

7. In June 2021, during the vaccine rollout for the Covid-19 emergency, it was estimated that 90% of the population over 50 years old were fully vaccinated, while only 6% were completely unvaccinated. (The remaining 4% had only one dose or had an unknown vaccination status, and therefore will not be considered here.)

A Public Health England report on cases and hospitalisation from the "delta" variant (originally sequenced in India) was published at the end of June 2021. The report showed that, between February and June 2021, among the 418 people admitted to the hospital with the "delta" variant:

- 163 were fully vaccinated
- 136 were not vaccinated
- The remaining people had only one dose or an unknown vaccination status and will not be considered here

One may therefore wrongly conclude that a fully vaccinated person is surprisingly more likely to be hospitalised than an unvaccinated person. Using Bayes' theorem to calculate the relevant probabilities from the data above, prove that this claim is wrong. Show that this data actually proves that vaccines are extremely effective at reducing the risk of hospitalisation after contracting the "delta" variant.

Central tendency and variability

8. The 2010 salaries of the White House staff are provided in the table "2010_White_House_Staff.xlsx"

Perform a pipeline of descriptive statistical methods in R, including central tendency and variability measures, to describe, interpret and discuss the dataset.

Statistical tests

9. A variable X follows a normal distribution with mean 1.5 and standard deviation 2. Calculate the probability P(X < 0).

10. You would like to test whether a herb works for the treatment of insomnia. 100 people volunteered to take part in the study.

- Design how you would carry out the experiment, what tasks the participants should perform, and define what could be a null and alternative hypothesis in this case.
- Describe what could be an error of type I or type II, how they are defined theoretically and what they represent in this case.

11. A company has produced a batch of 1000 CPUs whose clock speeds follow a normal distribution centred around 2.1 GHz, with a standard deviation of 0.4 GHz. The company is trying new approaches to manufacture CPUs, therefore 20 of these CPUs were produced with an additional new experimental feature. The clock speed of these 20 experimental CPUs is as follows (in GHz):

2.6, 1.9, 2.9, 2.3, 1.5,
1.9, 1.9, 1.8, 2.5, 2.1,
2.3, 1.7, 1.8, 2.4, 2.2,
1.9, 2.9, 3.3, 1.8, 2.1.

Design and perform a statistical test to check if the difference in clock speed for this new experimental technology is statistically significant, or the difference is just due to chance.

Hints: use a one-sample t-test, specifying the assumptions and finding the p- value. What additional assumption would be needed to use the z-test instead?

Regression

12. Using the file boxOffice.csv, perform a logistic regression analysis to find out whether the budget spent on a movie affects its chances of winning an Oscar. Discuss the general role of logistic regression, the logit scores, z-values and p- values from the summary of the model output. Can we reject the null hypothesis?

13. What needs to change in the overall problem if one wants to use linear regression?

14. What other methods could one try if linear regression does not perform well?

Attachment:- Statistical Methods for Data Analytics ICA.rar

Reference no: EM133049928

Questions Cloud

What is a book value per share and how is it computed : Differentiate basic earnings per share from diluted earnings per share. What is a book value per share? How is it computed
Compute the cost per equivalent unit for materials : Costs added to production during the month: Materials $29,949. Compute the Cost Per Equivalent Unit for Materials and Conversion
Mse 7000 advance topics in engineering management : Course Name: MSE 7000 Advance Topics in Engineering Management A New Era for Global Leadership Development
Evaluate strategies for building competitive advantages : What are ways that an organization can differentiate and distance itself from competitors?
Application of data analysis software : Statistical Methods for Data Analytics ICA - Manage complex data within the application of data analysis software in order to run tests
What is the impact on operating income : Slippy Corporation, the worldwide leader in slipper manufacture, manufactures slippers and sells them for 9$ a pair. What is the impact on operating income
Describe the role of the federal reserve board : Describe the role of the Federal Reserve Board (The Fed) and list the four economic goals it must try to achieve with its monetary policy
Does the company zoom organization culture fit : Question 1: Does the company zoom organization culture fit with its strategy Question 2: Compare the zoom company culture and strategy
Qualities that employers look for in a job-seeker : Question 1: Describe at least three qualities that employers look for in a job-seeker? Why are these qualities important to an employer?

Reviews

len3049928

12/18/2021 12:58:16 AM

statistical methods ICA work AI foundations ICA work 8 and 12 questions professor gave some material Using that material we have to solve the questions

Write a Review

Other Subject Questions & Answers

  How and where the organization sources candidates

There is no best way to hire new employees - a strategy that works for one organization may not be best suited to another. Hiring practices and strategies vary.

  Define what needs to be kept in consideration

For this assignment you will research the performance specifications of commonly used disks as well as the impact of configurations such as RAID etc.

  What are the major challenges in addressing trafficking

What surprised you about the statistics related to human trafficking

  Compare watching movie at home with watching it in a theater

Some people prefer to go the theater to watch a movie while some others prefer to watch the movie at home. compare or contrast watching a movie at home.

  Sensors onboard earth-orbiting satellites

What natural phenomena in the ocean are regularly monitored by sensors onboard Earth-orbiting satellites?

  Characteristics on crime-reporting behavior

"Examining the Effect of Selected Demographic Characteristics on Crime-Reporting Behavior."

  Explain the quality issues related to reporting revenue

Referencing this week's readings and lecture, describe the quality issues related to reporting revenue. What is the importance of understanding various.

  The goals of punishment changed throughout history

Has the perspective of using penology theory as a framework to achieve the goals of punishment changed throughout history?

  Describe how public opinion is measured

Describe how public opinion is measured in the Untied States and the problems that ca arise as a result of this measurement.

  What is project-based learning

What do you think are the three most important components in a lesson plan and why did you select them? What is project-based learning

  Provide a summary introduction that succinctly identifies

Provide a summary introduction that succinctly identifies and explains chosen issue, including key terms. This will include identifying premises and conclusion.

  How is personal identification presented in the world today

How do you call a person who always feels guilty for the bad things that happen even when it's not their fault? How would you describe this type of person?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd