Compute mean of the sample means and standard deviation

Assignment Help Basic Computer Science
Reference no: EM132411335

Assignment

Part 1) Central Limit Theorem

The input data consists of the sequence from 11 to 20 (11:20). Show the following three plots in a single row.

a) Show the histogram of the densities of this distribution.
b) Using all samples of this data of size 2, show the histogram of the densities of the sample means.
c) Using all samples of this data of size 5, show the histogram of the densities of the sample means.
d) Compare of means and standard deviations of the above three distributions.

Part 2) Central Limit Theorem

The data in the file queries.csv contains the number of queries Google has had each day for a one year period (365 days).

a) Show the histogram of the distribution of the number of queries. Compute the mean and standard deviation of the number of queries Google has had per day.

b) Draw 1000 samples of this data of size 5, show the histogram of the densities of the sample means. Compute the mean of the sample means and the standard deviation of the sample means.

c) Draw 1000 samples of this data of size 20, show the histogram of the densities of the sample means. Compute the mean of the sample means and the standard deviation of the sample means.

d) Compare of means and standard deviations of the above three distributions.

Part 3) Central Limit Theorem - Negative Binomial distribution

Suppose the input data follows the negative binomial distribution with the parameters size = 5 and prob = 0.5.

a) Generate 1000 random numbers from this distribution. Show the barplot with the proportions of the distinct values of this distribution.

b) With samples sizes of 10, 20, 30, and 40, generate the data for 5000 samples using the same distribution. Show the histograms of the densities of the sample means. Use a 2 x 2 layout.

c) Compare of means and standard deviations of the data from a) with the four sequences generated in b).

Part 4) Sampling

Use the MU284 dataset from the sampling package. Use a sample size of 20 for each of the following.

a) Show the sample drawn using simple random sampling without replacement. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

b) Show the sample drawn using systematic sampling. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

c) Calculate the inclusion probabilities using the S82 variable. Using these values, show the sample drawn using systematic sampling. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

d) Order the data using the REG variable. Draw a stratified sample using proportional sizes based on the REG variable. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

e) Compare the means of RMT85 variable for these four samples with the entire data.

Attachment:- Central Limit Theorem.rar

Verified Expert

This paper demonstrates the Central Limit Theorem( CLT for short) applications in a real life scenario and how it help us to get around the problem to perform predictive modeling of large data set where the population is not normal.

Reference no: EM132411335

Questions Cloud

How do the practices impact consumers and the economy : How do these practices impact consumers, businesses (other than banks), and the economy. Give both a short run and long run answer.
What is statistical multiplexing : What are some similarities between neighborhood roads and LANs? What is statistical multiplexing? How is statistical multiplexing useful in WANs
What percentage of the population has been diagnosed : What percentage of the population has been diagnosed with this condition? What education can be provided to remove the stigma(s)?
Describe the growth you observed within your mentee : Describe the growth you observed within your mentee. Your mentee improve both personally, professionally, and toward to achievement of the mentee's goals?
Compute mean of the sample means and standard deviation : Compare of means and standard deviations of the data from with the four sequences - Calculate the inclusion probabilities using the S82 variable
Do you think the label was used inappropriately : Find a study published in a nursing journal in 2010 or earlier that is described a s a pilot study. Do you think the study really is a pilot study.
Compare difference between theory and practice in nursing : Compare the difference between theory, research, and practice in nursing. Choose a theory that best correlates with the EBP practice change that you would like.
Discussions frequently revolve around talk of sides : Why do discussions frequently revolve around talk of "sides"? Can attention be returned to serving the patients? How?
Which stage of the policy model does the scenario represent : Jeanne Blum, RN, is a nurse on a LDRP unit. Recently, the policy and procedures manual for Jeanne's unit included the premature rupturing of membranes.

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Design a main program that calls the procedure quick search

Design a main program that calls the procedure quick search. The main program should be able to read and search successive blocks of text.

  Describe the purpose and activities

Describe the purpose and activities

  Find the pep of the transmitter output

The output of the transmitter is connected to a 50-? dummy load that has a calibrated average reading wattmeter. The wattmeter reads 6.9 kW. Find the PEP of the transmitter output.

  Facilitate collaboration between virtual team members

Discuss additional rules that could be added to facilitate collaboration between virtual team members.

  Problems in the following design of class a

Write statements needed to do the following in order; one statement is needed for each point:

  Change your ip address in linux

1. Which of the following will allow you to change your IP address in Linux?

  Build an array of customer structures

Build an array of customer structures. Assume that there are no more than 200 loyal customers.

  Explain what sql is and its functions

Explain what SQL is and its functions. What do you enjoy the most about learning SQL? What you find the most difficult?

  Allowing employees of an organization

State two advantages and two disadvantages of allowing employees of an organization, other than systems administrators and security personnel.

  Difference between relative and absolute reference

Explain the difference between relative and absolute reference. Provide an example.

  Calcpay for a financial company

Write a C++ application called calcPay for a financial company. The goal of this program is to determine gross pay for a 4-week pay period based on an hourly rate and the number of hours worked in each week of the pay period. Anyone working over 4..

  What is the standard error of the mean

Employment An employment agency requires all clients to take an aptitude test. They randomly selected 150 clients and recorded the amount of time each client took to complete the test

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd