Find the posterior distribution

Assignment Help Advanced Statistics
Reference no: EM132880854

Question 1: Suppose that your are the CEO of a chain store called "729" . It consists of o convenience stores. Let yi be the revenue made by the ith store, and z, be the floor area of the ith store for i = 1, 2, ... , o. Assume that x1, ... , xn are known and deterministic unless otherwise specified. We are interested in knowing the association between floor area and revenue. Consider the following model

[yi |xi,θ] ~ N(θxi, h(xi)),            i = 1,2,....n, 

θ ~ N(μ, τ2)

for some μ ∈ R and r ∈ R+, where h(.) is a known function. Equation (1.1) allows Var(yi, | xi, θ) to vary with xi. It is called heteroscedasticity, which generalizes the standard constant noise variance assumption, known as homoscedasticity, in classical regression. Indeed, it is a very realistic assumption as the variability of revenue is likely to be increasing with the size of the store. See the graphical visualization below. This model is also useful for handling observations having different qualities

The aim of this question is to learn the relationship in terms of θ.
1. Find the posterior distribution [θ | x1:n y1:n] .
2. Construct an appropriate Bayesian estimator ^θB and a 95% credible interval of θ.
3. Give an empirical Bayes est etc ^θEB of θ.
4. Because of some operational reasons, the CEO fails to obtain the exact floor areas of some of the stores. Denote

Ii = 1(the floor area of the ith superstore is known),       i = 1, ......n.

5. Comment the following statement:

"The empirical Bayes estimator ^θEB emprically bases on the data in choosing be hpperporameters, whereas the Bayes estimator θo is less data-driuen. So, ^θEB must 6e better."

In this part, you may discuss whatever you think relevant and insightful to understand and interpret the statement. For example, you may (i) comment on the correctness and validity of the statement, (ii) propose better ways to state the assertion, (iii) use different statistical principles and philosophies to interpret the statement, (iv) perform simulation experiments to verify your claim(s), (v) visualize the results, etc. Your answer must be within 2 pages.

Question 2: The total number of enrollments in Stat 4010, Stat 4003 (Spring 2020), and Stat 3005 (Fall 2020 is n1 = 231 . Let (Si, Gi) ∈ {0, 100] x {A, A-, B+, B, B-, C+ , C , C -, D, D+, F } be the mid-term score and the coiavsponding grade of the ith student for i = 1, . . . , n1. All values of si's are observed. Unfortunately, owing to the possibility of pass/fail grading option (P/F option) in the previous three semesters, some of the Gi's are not fully observed.

1. We may regard Hi as a coarsened version of Gi . A "coarsening mechanism" is a generalization of a "missing mechanism* . Similar to Definition 9.2 in the lecture note, we define
» coarsening completely at random (CCAR) if P(Ii = r|Si = s, Gi = g) = Φ(r) for all r, s,g;
» coarsening at random (CAR) if P(Ii = r | Si = s,Gi = g) = Φ(r, s,gr + h (1-r)) for all r,s,g and
» coarsening not at random (CNAR) if otherwise,

where Φ is some function, and h = 1(g = 1). Which coarsening mechanism do you prefer to assume? Why? (Use no more than about 100 words).

2. Exam now on, assume that

P(Ii = 1|Si, Gi, Φ0, Φ1 = pnorm(Φ0, + Φ1Si), i = 1,....n1

for some free parameters Φ0, Φ1, v R. Propose a prior Π(μ, σ, θ1,...., θ10, β, Φ0, Φ1). Explain your choice.

3. Use an appropriate MCMC sampler to draw sample from the posterior

[μ, σ, θ1,......θ10, β,GEi|S1n1, GE0, HEi, I1:n1 ]

Briefly diggnne your MCMC sample.

4. Use an appropriate Bayesian method to recover the missing grade Gi for i ∈ E1

5. Predict the grades of the students this year, i.e., Gn1 +1, .. . , Gn1+n2.

6. In this part, you may do whatever you think relevant and insightful for under- standing Question 2. T6 earn the full credit, you must use the knowledge you learned in Stat 4010. Fbr example, you may
» conduct a simulation experiment (by simulating an artificial dataset having the same structure as coarseninggrade.csv) to check the accuracy, and precision of your ntimators and predictors,
» discuss the pre and oons of the models and (RB) from a Bayesian point of view,
» propose improved or alternative Bayesian-related models,
» use a hierarchical model to handle the covariate major, year, course and ID,
» perform additional statistical inference,
» compute empirical Bays estimates,
» conduct detailed and intensive oonvergence diagnosis on the MCMC sample,
» derive some theoretical results.

Attachment:- mock test.rar

Reference no: EM132880854

Questions Cloud

Who was the intended audience : When was the image produced? Who was the intended audience? What is the purpose of the image? What messages does it convey?
Define the research problem or challenge : Conduct your primary market research and document using the Market Research Template. Define the research problem or challenge
Why think film won an oscar for screenplay : Why you think this film won an Oscar for its screenplay (Do not chose a film that you have already posted on). Reference applicable concepts and vocabulary
Describe the rise of the democratic party : Describe the rise of the Democratic Party. What happened to the old Jeffersonian/Democractic-Republicans (the party of Jefferson, Madison, and Monroe)?
Find the posterior distribution : Find the posterior distribution - Construct an appropriate Bayesian estimator ^?B and a 95% credible interval of ?.
Should performance be tied to compensation : Should performance be tied to compensation? Support your response with what you have read and personal experience in the workplace.
Define human service model of employee supervision : Describe the ideas that define a human service model of employee supervision?
Write essay outlining a major turning point in world history : Write an essay outlining a major turning point in world history from the 16th century to present day. You will choose a turning point that, in your opinion
Describe to walter herbs manager suitable factors : Walter Herbs has been operating in Jamaica as a major manufacturer of herbal products. The materials and staff are genuinely Jamaican.

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Apply holt-winters double exponential smoothing

Apply Holt-Winters Double exponential smoothing (without seasonality) to forecast the values of sales X for periods 12 and 13, using α = 0.2 and β = 0.3. Assume 12 months of sales data.

  Do you think you have eating genetically modified foods

Do you think you have eating genetically modified foods

  Part 1identify the role of the third variablefor questions

part 1identify the role of the third variablefor questions 1-5 choose which of the following best describes the third

  Find the expected time that a customer waits

Find the expected time that a customer waits, starting from its arrival at the ferry terminal and ending at the departure of its ferry.

  Differences between dominant and recessive alleles

Briefly describe what an allele is, then discuss the differences between dominant and recessive alleles. Provide an example of each type of allele

  Application of statistical process controls

For Linear Programming, you are to make up (or collect actual data) so as to be able to apply a complete linear programming optimization. Make sure to show the data to be used, show the linear program you create, and the solutio..

  Legal and ethical implications of media giant mergers

Discuss the legal and ethical implications of the ongoing debate and take a position as to whether the changes should be allowed and would they benefit DWI.

  Discuss the importance of countercurrent flow

Discuss the importance of countercurrent flow to the functioning of the nephrons

  Consensual relationship agreements

Create a counter argument against the use of CRAs in your current (or future) workplace. Discuss the ethical principles involved in the use of CRAs. Create at least one (1) other option besides CRAs that would address workplace romances.

  Analyse the data from datasets

Analyse the data from 4 datasets. Analyse this data and find some basic outputs. need someone with R knowledge. Integrate multiple datasets

  How are students distributed across classes of freshman

What is the proportion of females at StatCrunchU and determine a range of plausible values for this proportion. Is this proportion significantly larger than 0.5?

  When do you conduct hypothesis testing

If a statistical test result is not significant at the 0.05 level,

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd