What is the central limit theorem

Reference no: EM131044900

Business resource and Statistics practical exam

Question one-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if the population mean is μ and the population standard deviation is σ, the standard deviation of the sampling distribution of the sample mean is σ/√n. Suppose a random sample of size n = 49 is obtained from a population with μ = 75 and σ = 14.

What can you say about the sampling distribution in terms of:

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation?
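As a rough numerical check for part 2, the formula σ/√n quoted in the question can be evaluated directly. This is only a sketch: the variable names are illustrative, and the approximately normal shape asked about in part i rests on the Central Limit Theorem being applicable at n = 49.

```python
import math

# Values taken from the question; the variable names are illustrative.
mu = 75      # population mean
sigma = 14   # population standard deviation
n = 49       # sample size

# The mean of the sampling distribution of the sample mean equals mu.
sampling_mean = mu

# Its standard deviation (the standard error) is sigma / sqrt(n).
standard_error = sigma / math.sqrt(n)

print(sampling_mean, standard_error)
```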

Question two-

i. What is meant by the term interquartile range?

ii. Describe the steps you need to take to calculate the interquartile range.

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above

v. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution.

vi. For a binomial distribution we know that the standard deviation is σ = √(np(1 - p)),

where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n = 4 and p = 0.25.
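The binomial formula above can be checked with a few lines of arithmetic. This is a minimal sketch using only the values given in the question; the variable names are illustrative.

```python
import math

n = 4      # number of trials, from the question
p = 0.25   # probability of success, from the question

# Variance of a binomial distribution is n * p * (1 - p);
# the standard deviation is its square root.
variance = n * p * (1 - p)
std_dev = math.sqrt(variance)

print(variance, round(std_dev, 4))
```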

Question Three-

1. Explain the difference between the following terms. It would be useful to illustrate your answer with a diagram.

A. Data (or Raw Data)

B. Information (or records)

Question four-

Consider the following three data sets: A, B and C

A = {9, 10, 11, 7, 13}

B = {10, 10, 10, 10, 10}

C = {1, 1, 10, 19, 19}

I. Calculate the mean of each data set

II. Calculate the standard deviation of each data set, given the formula:

III. Which set has the largest standard deviation?

IV. Is it possible to answer part III without calculating the standard deviations? Explain your answer (show your rough working).
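The means and standard deviations for the three sets can be sketched with Python's standard `statistics` module. The formula referred to in part II is not reproduced in the text, so it is unclear whether the population form (divide by n) or the sample form (divide by n - 1) is intended; the sketch below computes both as an assumption.

```python
import statistics

# Data sets as given in the question.
data_sets = {
    "A": [9, 10, 11, 7, 13],
    "B": [10, 10, 10, 10, 10],
    "C": [1, 1, 10, 19, 19],
}

results = {}
for name, values in data_sets.items():
    results[name] = {
        "mean": statistics.mean(values),
        # Population standard deviation (divides by n).
        "pstdev": statistics.pstdev(values),
        # Sample standard deviation (divides by n - 1).
        "stdev": statistics.stdev(values),
    }

for name, r in results.items():
    print(name, r["mean"], round(r["pstdev"], 3), round(r["stdev"], 3))

# The set with the largest spread is the same under either convention.
largest = max(results, key=lambda k: results[k]["pstdev"])
print("Largest standard deviation:", largest)
```

All three sets share the same mean, so the ranking in part III can also be read off from how far the values sit from 10.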

C. Statistics
D. Knowledge

2. List four basic ways of visualising data.

3. Briefly explain the difference between descriptive statistics and inferential statistics.

4. Univariate Analysis involves the examination across cases of one variable at a time. In analysing a variable, there are three major characteristics that we tend to look at:

Distribution
Central Tendency
Dispersion/Variability

Briefly explain each of these terms and give examples of each. You can use drawings to illustrate your answer

5. With regard to the standard deviation of data distributions, what is the Empirical Rule?

Question Five-

Fill in the missing words on the answer sheet provided.

1. A study is under way in Lough Key Forest Park to determine the adult height of oak trees. Specifically, the study is attempting to determine what factors aid a tree in reaching heights greater than 15 m. It is estimated that the park contains 25,000 adult oak trees. The study involves collecting heights from 250 randomly selected adult oak trees and analysing the results. Identify the population from which the study was sampled.

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All chestnut trees, of any age, in the park.

2. From the same study identify the variable of interest:

A. The age of an oak tree in Lough Key Forest Park.

B. The height of an oak tree in Lough Key Forest Park.

C. The number of oak trees in Lough Key Forest Park.

D. The species of trees in Lough Key Forest Park.

3. Again from the same study, what is the sample?

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All oak trees, of any age, in the park.

4. To monitor campus security, the security superintendent is surveying the number of students in a concourse every 30 minutes over a 24-hour period, with the goal of determining when patrols of the concourse would best serve the most students. If X is the number of students in the concourse in each period, then X is an example of:

A. A categorical random variable.

B. A discrete random variable.

C. A continuous random variable.

D. A statistic.

Question Six-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if the population mean is μ and the population standard deviation is σ, the standard deviation of the sampling distribution of the sample mean is σ/√n. Suppose a random sample of size n = 49 is obtained from a population with μ = 75 and σ = 14.

What can you say about the sampling distribution in terms of:

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation?

Question seven

i. What is meant by the term interquartile range?

ii. Describe the steps you need to take to calculate the interquartile range.

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above
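One way to obtain the five-number summary for the data set in part iii is sketched below. Textbooks differ on how quartiles are defined; this sketch assumes the "median of each half" convention, excluding the overall median when the number of values is odd. Other conventions, such as linear interpolation, can give different quartiles for the same data.

```python
import statistics

# Data set from part iii.
data = [18, 20, 23, 20, 23, 27, 24, 23, 29]

ordered = sorted(data)
n = len(ordered)
half = n // 2

median = statistics.median(ordered)

# Split into lower and upper halves; with an odd count the median
# itself is left out of both halves (an assumed convention).
lower_half = ordered[:half]
upper_half = ordered[half + 1:] if n % 2 else ordered[half:]

q1 = statistics.median(lower_half)   # lower quartile
q3 = statistics.median(upper_half)   # upper quartile
iqr = q3 - q1                        # interquartile range

# Minimum, Q1, median, Q3, maximum: the values a box plot displays.
five_number_summary = (ordered[0], q1, median, q3, ordered[-1])
print(five_number_summary, iqr)
```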

i. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution

ii. For a binomial distribution we know that the standard deviation is σ = √(np(1 - p)), where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n = 4 and p = 0.25.

Question Eight

1. What is a "Normal Distribution" of data, and how would you recognise it if it was graphed on a histogram?

2. Briefly describe the two branches of statistics, and explain statistical terms in detail.

2. Explain population parameters and sample statistics. Also define measures of central tendency.

3. Calculate mean, median, mode, variance and standard deviation of the following dataset:

45, 50, 55, 55, 55, 60, 40

Explain percentiles, quartiles and interquartile ranges. Describe population and sample proportions, and the various types of charts used in statistics.

4. Explain the concept of probability, and detail the calculation of population variance and standard deviation, showing formulas.

5. What is data analytics? Explain data analytics in the workplace and provide various methods of turning data into knowledge.

6. Explain data mining and business reporting. Give appropriate examples of business reporting tools.

7. Explain the central limit theorem in detail, giving examples.

8. A quiz consists of four multiple-choice questions, each with four possible answer choices (A, B, C, or D), one of which is correct. Suppose that an unprepared student does not read the questions, but simply makes a random guess for each one. Let the random variable X equal the number of correct guesses the student makes for the four questions. Is this binomial? If so, what are n and p? Calculate the variance and standard deviation.

The level of economic activity in a region has been recorded over a period of four years and the data is presented below:

Year    Quarter    Activity Level
1       1          105
        2          99
        3          90
        4          110
2       1          111
        2          104
        3          93
        4          119
3       1          118
        2          109
        3          96
        4          127
4       1          126
        2          115
        3          100
        4          135

a. Construct a graph of this data

b. Find a centred four-point moving average and place it on your graph

c. Calculate the corresponding seasonal components (show your rough working).
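Parts b and c can be sketched as follows. The centred four-point moving average takes the mean of each run of four quarters and then averages adjacent pairs so that each trend value lines up with an actual quarter. The question does not say whether an additive or multiplicative model is intended; the sketch assumes an additive model (actual minus trend) and leaves the averaged deviations unadjusted.

```python
# Quarterly activity levels from the table, year 1 Q1 through year 4 Q4.
activity = [105, 99, 90, 110, 111, 104, 93, 119,
            118, 109, 96, 127, 126, 115, 100, 135]

# Step 1: plain four-point moving averages (13 values).
four_point = [sum(activity[i:i + 4]) / 4 for i in range(len(activity) - 3)]

# Step 2: centre them by averaging adjacent pairs (12 values).
# centred[k] lines up with activity[k + 2], i.e. it starts at year 1 Q3.
centred = [(a + b) / 2 for a, b in zip(four_point, four_point[1:])]

# Step 3 (assumed additive model): deviation = actual - trend.
deviations = [activity[k + 2] - centred[k] for k in range(len(centred))]

# Step 4: average the deviations for each quarter.
by_quarter = {1: [], 2: [], 3: [], 4: []}
for k, d in enumerate(deviations):
    quarter = (k + 2) % 4 + 1
    by_quarter[quarter].append(d)

seasonal = {q: sum(v) / len(v) for q, v in by_quarter.items()}

for q in sorted(seasonal):
    print(f"Q{q}: {seasonal[q]:+.2f}")
```

In an additive model the four seasonal components are often adjusted afterwards so that they sum to zero; that step is omitted here.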

Question nine

Sales of article B ('000 units)

        Q1      Q2      Q3      Q4
2010    24.8    36.3    38.1    47.5
2011    31.2    42.0    43.4    55.9
2012    40.0    48.8    54.0    69.1
2013    54.7    57.8    60.3    68.9

a. Plot the time series of the sales figures

b. Find a centred four-point moving average and place it on your graph

c. Calculate the seasonal component for each quarter

Question 10

Comment on the following:

‘The process of polling is often mysterious, particularly to those who don't see how the views of 1,000 people can represent those of hundreds of millions.’

Question 11

a. Discuss the relative advantages and disadvantages of the postal questionnaire and the personal interview as means of collecting data.

b. Compare simple random sampling and quota sampling as methods of selecting a representative sample from a population.

Question 12

Sampling methods are frequently used for the collection of data. State which type of sampling method is being described in the following situations.

i. One school in an area is selected at random and then all pupils in that school are surveyed.

ii. The local authority has a list of all pupils in the area and the sample is selected in such a way that all pupils have an equal probability of selection.

iii. An interviewer surveys pupils emerging from every school in the area, attempting to question them randomly but in line with specified numbers of boys and girls in the various age groups.

iv. The local authority has a list of all pupils in the selected area. The first pupil is selected randomly from the list, and every 100th pupil thereafter is selected for the survey.

B. List the advantages and disadvantages of Quota Sampling.

Question 13

The following transactions have been recorded on an automatic cash dispenser:

Value of transactions (€)    Number
10                           46
20                           57
30                           68
50                           56
100                          47
150                          39
200                          34

Required:

a. Determine the mean

b. What is the value of the mode?

c. Draw an appropriate graph for the above data
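Parts a and b can be sketched as a frequency-weighted calculation: the mean is the sum of value × count divided by the total count, and the mode is the value with the highest count. The variable names below are illustrative.

```python
# Transaction values (€) and the number of transactions at each value,
# taken from the table in the question.
values = [10, 20, 30, 50, 100, 150, 200]
counts = [46, 57, 68, 56, 47, 39, 34]

total_count = sum(counts)

# Frequency-weighted mean: sum(value * count) / sum(count).
weighted_sum = sum(v * c for v, c in zip(values, counts))
mean = weighted_sum / total_count

# Mode: the transaction value that occurs most often.
mode = max(zip(values, counts), key=lambda pair: pair[1])[0]

print(total_count, round(mean, 2), mode)
```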

Question 14

A. Give two examples of variables which might be correlated

B. Distinguish between positive and negative correlation

C. What range of values can the product moment correlation coefficient take?

D. When should Spearman's rank correlation coefficient be used?

E. What are the advantages and disadvantages of the least squares method of linear regression?
