Application of data analysis software

Assignment Help Other Subject
Reference no: EM133049928

CIS4066-N Statistical Methods for Data Analytics - Teesside University

Statistical Methods for Data Analytics ICA

Learning outcome 1: Use own judgement to select a valid statistical method in the context of the research project.
Learning outcome 2: Manage complex data within the application of data analysis software in order to run tests.

Research, Knowledge and Cognitive Skills

Learning outcome 3: Analyse complex related and unrelated data sets using a range of methods.
Learning outcome 4: Critically analyse and interpret outcomes of statistical tests in order to identify patterns and significance levels.
Learning outcome 5: Critically appraise the validity and reliability of the methods available.

Professional Skills
Learning outcome 6: Demonstrate an ethical understanding of data analysis and its effect in a wider social context.

Qualitative statistics: thematic analysis

Using the Guardian website select an article with at least 20 comments. Select a subset of comments (around 20) for the following analysis.

1. Provide a link to the article you have selected. List the comments you have chosen, the key themes you identify and the number of times they occur from the comments.

2. Describe the themes you identified and the key features that make the theme recognisable (words, metaphors, emotional language, etc.). Discuss your findings
- what conclusions can you make?

Probability and statistics fundamentals

3. Given the event E: "The Covid-19 emergency will finish in 2022", define the following events F1 and F2, both theoretically and giving concrete examples of what they might be:

- an event F1 such that E and F1 are independent
- an event F2 such that E and F2 are dependent

4. Derive with mathematical steps the final formula of Bayes' theorem.
Describe in general terms in what cases the theorem is useful, and propose a concrete scenario where it can be used.

5. Discuss the different scales of measurements. Provide two examples for each scale.

6. There are four medals (Gold, Silver, Bronze and Wood) on a table, but they are all wrapped with dark wrapping paper, such that it is impossible to distinguish them. You would like to find the gold medal.

The game starts as follows. You pick one medal without unwrapping it, and then the game host unwraps one of the remaining medals and reveals that it is a silver medal. (Assume here that the host unwraps a medal with equal probability, but knowing where the gold medal was and avoiding unwrapping the gold medal if still on the table, to keep the game interesting to watch until the end.)

You have now three medals left to unwrap (one in your hand, two on the table). At this point, the host gives you the option to change your mind and swap your medal for one of the two left on the table.

What would you do at this point? Would you keep your medal, or swap it with one of the two medals left on the table? If so, which one?

Hints: Find the solution by using Bayes' theorem, calculating all the conditional probabilities involved. Start calculating the probability of having Gold in our hands

given that we know that the host unwraps Silver, P(G|Hs) = . . ..
Then compare with the probability of having Bronze or Wood in our hands given that we know that the host unwraps Silver, P(B|Hs) = . . .., P(W|Hs) = . . ..

7. In June 2021, during the vaccine rollout for the Covid-19 emergency, it was estimated that 90% of the population over 50 years old were fully vaccinated, while only 6% were completely unvaccinated. (The remaining 4% had only one dose or had an unknown vaccination status, and therefore will not be considered here.)

A Public Health England report on cases and hospitalisation from the "delta" variant (originally sequenced in India) was published at the end of June 2021. The report showed that, between February and June 2021, among the 418 people admitted to the hospital with the "delta" variant:

- 163 were fully vaccinated
- 136 were not vaccinated
- The remaining people had only one dose or an unknown vaccination status and will not be considered here

One may therefore wrongly conclude that a fully vaccinated person is surprisingly more likely to be hospitalised than an unvaccinated person. Using Bayes' theorem to calculate the relevant probabilities from the data above, prove that this claim is wrong. Show that this data actually proves that vaccines are extremely effective at reducing the risk of hospitalisation after contracting the "delta" variant.

Central tendency and variability

8. The 2010 salaries of the White House staff are provided in the table "2010_White_House_Staff.xlsx"

Perform a pipeline of descriptive statistical methods in R, including central tendency and variability measures, to describe, interpret and discuss the dataset.

Statistical tests

9. A variable X follows a normal distribution with mean 1.5 and standard deviation 2. Calculate the probability P(X < 0).

10. You would like to test whether a herb works for the treatment of insomnia. 100 people volunteered to take part in the study.

- Design how you would carry out the experiment, what tasks the participants should perform, and define what could be a null and alternative hypothesis in this case.
- Describe what could be an error of type I or type II, how they are defined theoretically and what they represent in this case.

11. A company has produced a batch of 1000 CPUs whose clock speeds follow a normal distribution centred around 2.1 GHz, with a standard deviation of 0.4 GHz. The company is trying new approaches to manufacture CPUs, therefore 20 of these CPUs were produced with an additional new experimental feature. The clock speed of these 20 experimental CPUs is as follows (in GHz):

2.6, 1.9, 2.9, 2.3, 1.5,
1.9, 1.9, 1.8, 2.5, 2.1,
2.3, 1.7, 1.8, 2.4, 2.2,
1.9, 2.9, 3.3, 1.8, 2.1.

Design and perform a statistical test to check if the difference in clock speed for this new experimental technology is statistically significant, or the difference is just due to chance.

Hints: use a one-sample t-test, specifying the assumptions and finding the p- value. What additional assumption would be needed to use the z-test instead?

Regression

12. Using the file boxOffice.csv, perform a logistic regression analysis to find out whether the budget spent on a movie affects its chances of winning an Oscar. Discuss the general role of logistic regression, the logit scores, z-values and p- values from the summary of the model output. Can we reject the null hypothesis?

13. What needs to change in the overall problem if one wants to use linear regression?

14. What other methods could one try if linear regression does not perform well?

Attachment:- Statistical Methods for Data Analytics ICA.rar

Reference no: EM133049928

Questions Cloud

What is a book value per share and how is it computed : Differentiate basic earnings per share from diluted earnings per share. What is a book value per share? How is it computed
Compute the cost per equivalent unit for materials : Costs added to production during the month: Materials $29,949. Compute the Cost Per Equivalent Unit for Materials and Conversion
Mse 7000 advance topics in engineering management : Course Name: MSE 7000 Advance Topics in Engineering Management A New Era for Global Leadership Development
Evaluate strategies for building competitive advantages : What are ways that an organization can differentiate and distance itself from competitors?
Application of data analysis software : Statistical Methods for Data Analytics ICA - Manage complex data within the application of data analysis software in order to run tests
What is the impact on operating income : Slippy Corporation, the worldwide leader in slipper manufacture, manufactures slippers and sells them for 9$ a pair. What is the impact on operating income
Describe the role of the federal reserve board : Describe the role of the Federal Reserve Board (The Fed) and list the four economic goals it must try to achieve with its monetary policy
Does the company zoom organization culture fit : Question 1: Does the company zoom organization culture fit with its strategy Question 2: Compare the zoom company culture and strategy
Qualities that employers look for in a job-seeker : Question 1: Describe at least three qualities that employers look for in a job-seeker? Why are these qualities important to an employer?

Reviews

len3049928

12/18/2021 12:58:16 AM

statistical methods ICA work AI foundations ICA work 8 and 12 questions professor gave some material Using that material we have to solve the questions

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd