What is the central limit theorem

Assignment Help Basic Statistics
Reference no: EM131044900

Business resource and Statistics practical exam

Question one-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if population mean is μ and standard deviation is σ, we know that the standard deviation of a sampling distribution is σ/√n If a random sample of size n = 49 is obtained from a population with µ = 75 and (population standard deviation) = 14

What can you say about the distribution of the sampling dataset in terms of?

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation

Quewstion two -

i. What is meant by the term interquartile range

ii. Describe the steps you need to take to calculate the interquartile range

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above

iv. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution

vi. For a binomial distribution we know that the standard deviation, σ = √np(1-P)

Where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n=4 and P=0.25

Question Three-

1. Explain the difference between the following terms. It would be useful to explain your answer with the diagram.

A. Data (or Raw Data)

B. Information (or records)

Question four-

Consider the following three data sets; A, B and C

A. = {9, 10, 11, 7, 13}

B. = {10, 10, 10, 10, 10}

C. = {{1, 1, 10, 19, 19}

I. Calculate the mean of each data set

II. Calculate the standard deviation of each data set, given the formula:

III. Which set has the largest standard deviation?

IV. Is it possible to answer question iii) without calculations of the standard deviation? Explain your answer (show the rough or working)

C. Statistics
D. Knowledge

2. List four basic ways of visualising data.

3. Briefly explain the difference between descriptive statistics and inferential statistics.

4. Univariate Analysis involves the examination across cases of one variable at a time. In analysing a variable, there are three major characteristics that we tend to look at:

Distribution
Central Tendency
Dispersion/Variability

Briefly explain each of these terms and give examples of each. You can use drawings to illustrate your answer

5. With regard to the standard deviation of data distributions, what is the Empirical Rule?

Question Five-

Fill in the missing words on the answer sheet provided.

1. A study is under way in Lough Key Forest Park to determine the adult height of oak Trees. Specifically, the study is attempting to determine what factors aid a tree in reaching heights greater than 15m tall. It is estimated that the park contains 25,000 adult oak trees. The study involves collecting heights from 250 randomly selected adult oak trees and analysing the results. Identify the population from which the study was sampled.

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All chestnut trees, of any age, in the park.

2. From the same study identify the variable of interest:

A. The age of an oak tree in Lough Key Forest Park.

B. The height of an oak tree in Lough Key Forest Park.

C. The number of oak trees in Lough Key Forest Park.

D. The species of trees in Lough Key Forest Park.

3. Again from the same study above what is the sample in the study?

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All oak trees, of any age, in the park.

4. To monitor campus security, the security superintendent is taking a survey of the number of students in a concourse each 30 minutes of a 24-hour period with the goal of determining when patrols of the concourse would best serve the most students. If X is the number of students in the concourse each period of time, then X is an example of

A. A categorical random variable.

B. A discrete random variable.

C. A continuous random variable.

D. A statistic.

Question Six-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if population mean is μ and standard deviation is σ, we know that the standard deviation of a sampling distribution is σ/√n If a random sample of size n = 49 is obtained from a population with µ = 75 and (population standard deviation) = 14

What can you say about the distribution of the sampling dataset in terms of?

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation?

Question seven

i. What is meant by the term interquartile range

ii. Describe the steps you need to take to calculate the interquartile range

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above

i. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution

ii. For a binomial distribution we know that the standard deviation, σ = √np(1-P) Where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n=4 and P=0.25

Question Eight

1. What is a "Normal Distribution" of data, and how would you recognise it if it was graphed on a histogram?

2. What are two branches of statistics in brief and explain statistical terms in detail.

2. Explain population parameters and sample statistics. Also define measures of central tendency.

3. Calculate mean, median, mode, variance and standard deviation of the following dataset:

45, 50, 55, 55, 55, 60, 40

. Explain percentiles, quartiles and Inter Quartiles. Describe population, sample proportions and various types of charts used in statistics

4. Explain the concept of probability, and detail calculations of population, variance and standard showing formulas.

5.What is data analytics? Explain data analytics at workplace and provide various methods of turning data to knowledge.

6. Explain data mining and business reporting. Give appropriate examples of business reporting tools.

7.Explain central limit theorem in detail by giving examples.

8. A quiz consists of four multiple-choice questions, each with four possible answer choices (A, B, C, or D). One of which is correct. Suppose that an unprepared student does not read the question, but simply makes a random guess for each question. Let the random variable X equal the number of correct guesses the student makes for the five questions. Is this binomial? And if so what is n and what is p? Calculate variance and standard deviation.

The level of economic activity in a region has been recorded over a period of four years and the data is presented below:

Year                                               Quarter                                                      Activity Level
1                                                           1                                                              105
                                                             2                                                              99
                                                             3                                                              90
                                                             4                                                              110
2                                                           1                                                              111
                                                             2                                                               104
                                                             3                                                               93
                                                             4                                                               119
3                                                           1                                                               118
                                                             2                                                               109
                                                             3                                                                 96
                                                             4                                                                 127
4                                                           1                                                                 126
                                                             2                                                                  115
                                                              3                                                                 100
                                                              4                                                                 135

a. Construct a graph of this data

b. Find a centred four-point moving average and place it on your graph

c. Calculate the corresponding seasonal components(*show the rough or working of it)

Question nine

Sales on article B (‘000 units)

                           Q1      Q2       Q3         Q4
2010                  24.8   36.3     38.1       47.5
2011                 31.2    42.0    43.4         55.9
2012                 40.0    48.8    54.0         69.1
2013                  54.7   57.8    60.3          68.9

a. Plot the time series of the sales figures

b. Find a centred four-point moving average and place it on your graph

c. Calculate the seasonal component for each quarter

Comment on the following:

‘The process of polling is often mysterious, particularly to those who don't see how the views of 1,000 people can represent those of hundreds of millions'.

Question 11

Discuss the relative advantages and disadvantages of the postal questionnaire and the personal interview as a means of collecting data.

b. Compare simple random sampling and quota sampling as methods of selecting a representative sample from a population.

Question 12

Sampling methods are frequently used for the collection of data. State which type of sampling method is being described in the following situations.

i. One school in an area is selected at random and then all pupils in that school are surveyed.

ii. The local authority has a list of all pupils in the area and the sample is selected in such a way that all pupils have an equal probability of selection

iii. An interviewer surveys pupils emerging from every school in the area, attempting to question them randomly but in line with specified numbers of boys and girls in the various age groups.

iv. The local authority has a list of all pupils in the selected area. The first pupil is selected randomly from the list and then after every 100th pupil thereafter is selected for the survey.

B. List the advantages and disadvantages of Quota Sampling.

Question 13

The following transactions have been recorded on an automatic cash dispenser:

Value of transactions

(€):                    Number:
10                         46
20                         57
30                         68
50                         56
100                       47
150                       39
200                       34

Required:

a. Determine the mean

b. What value has the mode

c. Draw an appropriate graph for the above data

Question 14

A. Give two examples of variables which might be correlated

B. Distinguish between positive and negative correlation

C. What range of values can the product moment correlation coefficient use?

D. When should Spearman's rank correlation coefficient be used?

E. What are the advantages and disadvantages of the least squares method of linear regression?

Reference no: EM131044900

Questions Cloud

Company promotional literature : A sample of 900 computer chips revealed that 76% of the chips do not fail in the first 1000 hours of their use. The company's promotional literature states that 78% of the chips do not fail in the first 1000 hours of their use.
Income statement and balance sheet : The tax effects of temporary differences that give rise to deferred tax assets and liabilities are as follows ($ thousands):
What is the probability that he will select : An urn contains 9 white balls and 8 green balls. If Juan choose 9 balls at random from the urn, what is the probability that he will select 5 white ballight and 4 green balls? Round your answer to 3 decimal places.
Develop a bar chart to help you manage this project : A local municipality has issued a request for proposals to conduct a feasibility study to design, implement, and sustain a recycling program in a small town. As PM, you need to develop a bar chart to assign responsibility and maintain a project sc..
What is the central limit theorem : What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.
Probability that at least one of cards : 5 cards are drawn from a standard deck without replacement. What is the probability that at least one of the cards drawn is a heart? Express your answer as a fraction or a decimal number rounded to four decimal places.
Measure of central tendency : QUESTION 1: To identify the point in a distribution at which 50% of scores fall above and 50% fall below a given score, which measure of central tendency would you report?
Discuss proposal planning and execution of a hcit project : Discuss the proposal, planning and execution of a HCIT project and the significance of these elements. Discuss the significance of proper planning in at least five project knowledge areas.
Find the schedule and cost variances for a project : Find the schedule and cost variances for a project that has an actual cost at month 16 of $540,000, a scheduled cost of $523,000, and an earned value of $535,000

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd