Plot the histogram of the samples

Assignment Help Other Subject
Reference no: EM132332201

Part 1: Law of Large Number

The first part of the empirical analysis serve the purpose to get you familiarize the LLN. LLN characterize the following fact:

Suppose {Xn} is a sequence of random variables draw from an underlying population space, then the sample mean of {Xn} defined as X-n = 1/n i=1n Xi, converge in Probability or Almost Surely to the population mean of that underlying population space.

(a) Randomly draw 10000000 samples from N(4, 16), plot the histogram of the samples and underlying population distribution together. What is the theoretical expectation for the population space? Variance? And Standard Deviation?

Let sample size n equal to 1, 2, 4, 8, 16, ··· , 222 = 4194304, 223 = 8388608 respectively. Randomly choose n samples from the original 10000000 sample space. (This is for speed consideration, you could draw n samples from N(4, 16) directly each time, but that would cost a lot of time), compute the sample mean X-n for each n.

Plot the sample mean with each sample size together with the population mean. What is your conclusion from the graph.

Change the parameters of the underlying distribution N(4, 16) to whatever you like, follow the same procedure again. Can you draw the same conclusion?

(b) Do the same thing as question (1) asked for Binomial Distribution B(50, 0.4) as the underlying population distribution.

(c) Do the same thing as question (1) asked for t-Distribution t(10) as the underlying population distribution. When you change your own parameters as the last question asked, make sure the degree of freedom be greater than 2, otherwise the sample mean may not converge, think about why?

(d) Do the same thing as question (1) asked for F-Distribution F(9,7) as the underlying population distribution. When you change your own parameters as the last question asked, make sure the degree of freedom df2 be greater than 4, otherwise the sample mean may not converge, think about why?

Part 2: Central Limit Theorem

The second part of the empirical analysis serve the purpose to get you familiarize the CLT. CLT characterize the following fact:

Suppose {Xn} is a sequence of random variables draw from an underlying population space, then the sample mean of {Xn} defined as X-n = 1/n i=1nXi, converge in distribution to Normal Distribution N(µ, σ2), where µ = E[Xi], σ2 = Var[Xi]/n.

(a) Randomly draw 10000000 samples from N(20,25), plot the histogram of the samples and underlying population distribution together.

What is the theoretical expectation for the population space? Variance? And Standard Deviation?

Let sample size n equal to 10 and 100000 respectively. We want to conduct the following experiment on both large sample cases (n = 100000) and small sample cases (n = 10).

Repeat the following process for 2000 times: Randomly choose n samples from the original 10000000 sample space. (This is for speed consideration, you could draw n samples from N(20,25) directly each time, but that would cost a lot of time), compute the sample mean X-n for each n.

As a result, we will have 2000 X-10 and 2000 X-100000, write down the first and last 10 of them respectively.

Plot the histogram of 2000 sample mean X-10 together with N(20, 25/10 ). What is your conclusion from the graph.

Plot the histogram of 2000 sample mean X-100000 together with N(20, 25/100000 ). What is your conclusion from the graph.

Plot the histogram of 2000 normalized sample mean X-10-20/√(25/10) together with N(0, 1). What is your conclusion from the graph?

Plot the histogram of 2000 normalized sample mean X-100000-20/√(25/100000) together with N(0, 1). What is your conclusion from the graph?

Change the parameters of the underlying distribution N(20, 25) to whatever you like, follow the same procedure again. Can you draw the same conclusion?

(b) Do the same thing as question (a) asked if Binomial Distribution B(40, 0.2) is the underlying population distribution.

(c) Do the same thing as question (a) asked if t-Distribution t(10) is the underlying population distribution. When you change your own parameters as the last question asked, make sure the degree of freedom be greater than 2, otherwise the sample mean may not converge in Normal Distribution, think about why?

(d) Do the same thing as question (a) asked if F-Distribution F(8,6) is the underlying population distribution. When you change your own parameters as the last question asked, make sure the degree of freedom df2 be greater than 4, otherwise the sample mean may not converge in Normal Distribution, think about why?

Part 3: Extra Credit

Find a new distribution, draw samples from that distribution and follow the same procedure in the previous two parts. See if LLN and CLT still hold.

Requirement: Please submit your Rcode and Final Report to sakai. Final Report must be converted into PDF format. You are encouraged to collaborate with each other but you have to write your own code and report.

Attachment:- Assignment Files - Central Limit Theorem.rar

Reference no: EM132332201

Questions Cloud

Analyse commercial aspects of a construction project : QSP7BEC - Building Economics - University College of Estate Management - explains how the developer's profit on the project would be established.
Elements from the course including the following sequence : Utilize elements from the course including the following sequence: resources, capabilities, core competencies, and competitive advantages.
Transformational leadership model : Explain the four transformational factors associated with the transformational leadership model.
Different learning and development methods : Analyse the advantages and limitations of different learning and development methods
Plot the histogram of the samples : Randomly draw 10000000 samples from N(4, 16), plot the histogram of the samples and underlying population distribution together
Effectiveness of different learning and development methods : Evaluate the effectiveness of different learning and development methods
Development needs from a range of information sources : Support individuals in identifying their current and likely future learning and development needs from a range of information sources
Explain the structure and operation of a key-value store : Your company has decided to adopt a NoSQL database management system for storage and processing of big data. You have been asked to investigate alternative NoSQ
Describe the length-tension relationship of muscle : Discuss the processes leading to voluntary movement in the brain. Describe the length-tension relationship of muscle? What does this mean in practical terms?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd