Design how you would carry out the experiment

Assignment Help Other Subject
Reference no: EM133054176

CIS4066-N Statistical Methods for Data Analytics - Teesside University

Assignment - Statistical Methods for Data Analytics ICA

Learning outcome 1: Personal and Transferable Skills
1. Use own judgement to select a valid statistical method in the context of the research project.
2. Manage complex data within the application of data analysis software in order to run tests.

Research, Knowledge and Cognitive Skills
3. Analyse complex related and unrelated data sets using a range of methods.
4. Critically analyse and interpret outcomes of statistical tests in order to identify patterns and significance levels.
5. Critically appraise the validity and reliability of the methods available.

Professional Skills
6. Demonstrate an ethical understanding of data analysis and its effect in a wider social context.

Qualitative statistics: thematic analysis

Using the Guardian website

Select an article with at least 20 comments. Select a subset of comments (around 20) for the following analysis.

Question 1. Provide a link to the article you have selected. List the comments you have chosen, the key themes you identify and the number of times they occur from the comments.

Question 2. Describe the themes you identified and the key features that make the theme recognisable (words, metaphors, emotional language, etc.). Discuss your findings
- what conclusions can you make?
Probability and statistics fundamentals

Question 3. Given the event E: "The Covid-19 emergency will finish in 2022", define the following events F1 and F2, both theoretically and giving concrete examples of what they might be:

- an event F1 such that E and F1 are independent
- an event F2 such that E and F2 are dependent

Question 4. Derive with mathematical steps the final formula of Bayes' theorem.

Describe in general terms in what cases the theorem is useful, and propose a concrete scenario where it can be used.

Question 5. Discuss the different scales of measurements. Provide two examples for each scale.

Question 6. There are four medals (Gold, Silver, Bronze and Wood) on a table, but they are all wrapped with dark wrapping paper, such that it is impossible to distinguish them. You would like to find the gold medal.

The game starts as follows. You pick one medal without unwrapping it, and then the game host unwraps one of the remaining medals and reveals that it is a silver medal. (Assume here that the host unwraps a medal with equal probability, but knowing where the gold medal was and avoiding unwrapping the gold medal if still on the table, to keep the game interesting to watch until the end.)

You have now three medals left to unwrap (one in your hand, two on the table). At this point, the host gives you the option to change your mind and swap your medal for one of the two left on the table.

What would you do at this point? Would you keep your medal, or swap it with one of the two medals left on the table? If so, which one?

Hints: Find the solution by using Bayes' theorem, calculating all the conditional probabilities involved. Start calculating the probability of having Gold in our hands given that we know that the host unwraps Silver, P(G|Hs) = . . ..

Then compare with the probability of having Bronze or Wood in our hands given that we know that the host unwraps Silver, P(B|Hs) = . . .., P(W|Hs) = . . ..

Question 7. In June 2021, during the vaccine rollout for the Covid-19 emergency, it was estimated that 90% of the population over 50 years old were fully vaccinated, while only 6% were completely unvaccinated. (The remaining 4% had only one dose or had an unknown vaccination status, and therefore will not be considered here.)

A Public Health England report on cases and hospitalisation from the "delta" variant (originally sequenced in India) was published at the end of June 2021. The report showed that, between February and June 2021, among the 418 people admitted to the hospital with the "delta" variant:

- 163 were fully vaccinated
- 136 were not vaccinated
- The remaining people had only one dose or an unknown vaccination status and will not be considered here

One may therefore wrongly conclude that a fully vaccinated person is surprisingly more likely to be hospitalised than an unvaccinated person. Using Bayes' theorem to calculate the relevant probabilities from the data above, prove that this claim is wrong. Show that this data actually proves that vaccines are extremely effective at reducing the risk of hospitalisation after contracting the "delta" variant.

Central tendency and variability

Question 8. The 2010 salaries of the White House staff are provided in the table
"2010_White_House_Staff.xlsx"

Perform a pipeline of descriptive statistical methods in R, including central tendency and variability measures, to describe, interpret and discuss the dataset.

Statistical tests

Question 9. A variable X follows a normal distribution with mean 1.5 and standard deviation 2. Calculate the probability P(X < 0).

Question 10. You would like to test whether a herb works for the treatment of insomnia. 100 people volunteered to take part in the study.

- Design how you would carry out the experiment, what tasks the participants should perform, and define what could be a null and alternative hypothesis in this case.
- Describe what could be an error of type I or type II, how they are defined theoretically and what they represent in this case.

Question 11. A company has produced a batch of 1000 CPUs whose clock speeds follow a normal distribution centred around 2.1 GHz, with a standard deviation of 0.4 GHz. The company is trying new approaches to manufacture CPUs, therefore 20 of these CPUs were produced with an additional new experimental feature. The clock speed of these 20 experimental CPUs is as follows (in GHz):

2.6, 1.9, 2.9, 2.3, 1.5,
1.9, 1.9, 1.8, 2.5, 2.1,
2.3, 1.7, 1.8, 2.4, 2.2,
1.9, 2.9, 3.3, 1.8, 2.1.

Design and perform a statistical test to check if the difference in clock speed for this new experimental technology is statistically significant, or the difference is just due to chance.

Hints: use a one-sample t-test, specifying the assumptions and finding the p- value. What additional assumption would be needed to use the z-test instead?

Regression

Question 12. Using the file boxOffice.csv, perform a logistic regression analysis to find out whether the budget spent on a movie affects its chances of winning an Oscar. Discuss the general role of logistic regression, the logit scores, z-values and p- values from the summary of the model output. Can we reject the null hypothesis?

Question 13. What needs to change in the overall problem if one wants to use linear regression?

Question 14. What other methods could one try if linear regression does not perform well?

Verified Expert

This task provides a clear working example on probability distribution. Discrete probability distribution and continuous probability distribution was used to compute the probability values for the corresponding variables. Bayes theorem was used to compute the conditional probability values. Descriptive statistics, histogram and box plot was used to assess the distribution of whitehouse staff salary

Reference no: EM133054176

Questions Cloud

Business Impact Analysis and Risk Management worksheet : Security in the Cloud - Cloud Application Security - A business Impact Analysis and Risk Management worksheet has been attached to this assignment
Assignment on business analytics : You are a technology consultant and the client provides you with the following: I am providing a web-based payment service that Users can access using their mob
Agile approach to project management : As best practice, when should you consider using the Agile approach to project management? When should you use SDLC?
Assignment on business analytics : You are a managing consultant, and the client asks you what are the key considerations they should account for when building-out their application / IT infrastr
Design how you would carry out the experiment : What needs to change in the overall problem if one wants to use linear regression and What other methods could one try if linear regression does not perform
Describe four artifacts of an organization culture : Langton, Robbins, & Judge describe four artifacts of an organization's culture that can be used to "read" the culture: stories, rituals, material symbols, and l
Retrenchment strategy or a growth strategy : When might an event employ a retrenchment strategy or a growth strategy?
Explain the performance management changes : What performance management changes will employees undergo and who will most likely be affected?
Find a strategic plan to implement continuous improvements : Find a Strategic Plan to implement continuous improvements within a local gym (or industry/Centre of your choice). The intention of the improvements is to:

Reviews

len3054176

12/23/2021 11:46:59 PM

statistical methods ICA work AI foundations ICA work 8 and 12 questions professor gave some material Using that material we have to solve the questions

Write a Review

Other Subject Questions & Answers

  The difference between a dump and a sanitary landfill

What is the difference between a dump and a sanitary landfill?-  Describe how landfill gas and leachate can be successfully managed.-  What is legacy pollution?

  Identify priosn internal and external stakeholders

Identify priosn Internal and External Stakeholders. Discuss how internal or external stakeholders have influenced the situation in a positive or negative way

  Review the organizations mission statement

Review the organization's mission statement. Is it a good and effective mission statement? Explain why or why not. What might need to change, if anything?

  The area of data management

List at least three activities in which you might engage in your current practice setting that would increase your competencies in the area of data management.

  What lessons can be learn regarding project risk management

Within the Discussion Board area, write One page that respond to the following questions with your thoughts, ideas, and comments. This will be the foundation.

  Brief summary of the history of islam

Describe significant differences and similarities in how the branches of Islam (Sunni, Shiite, and Sufi) practice their traditions.

  Design a circuit that would be appropriate for a body weight

SISFFIT003 - Instruct fitness programs - Design a circuit that would be appropriate for a body weight exercise - Design a circuit that would be appropriate

  Case study1-evaluate harrogate borough councils approach to

case study1-evaluate harrogate borough councils approach to quality management with particular reference to the system

  Discuss individual health history and examination assignment

Complete a physical examination of the client using the Individual Health History and Examination Assignment resource

  Who is your industry partner

What specific problem will your research project explore and try to resolve? Why is it important to solve this problem?

  How you assess your competency level in management knowledge

How would you assess your competency level in Management Knowledge and Skills and Early Childhood Knowledge and Skills?

  Distances between different parts of the earth

These are (1) distances between different parts of the Earth, (2) distance between planets in our solar system, (3) distances between stars in our Galaxy, and (4) distances between clusters of galaxies.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd