IST 557 Data Mining - Techniques and Applications Assignment

Assignment Help Advanced Statistics
Reference no: EM133030241

IST 557 Data Mining - Techniques and Applications - Pennsylvania State University

Homework

Consider a study of M families, the m-th family containing Dm individuals. For each individual, in each family, we measure their height (in centimeters). Your friend, who is working with you to analyze this data suggests the following Bayesian model for these data:
yi ~ N (αz , σ2) (3)
αm ~ N (µ, τ2) (4)

where i refers to the i-th individual in the study and zi refers to the family (zi ∈ {1, . . . , M}) to which individual i belongs, and α = {α1, . . . , αM }.

Problem 1

1. Briefly, describe an interpretation of the parameter σ2 (1-2 sentences). If you were to set this parameter yourself, what value would you choose and why (1-2 sentences)? Hint: remember y is measured in centimeters.

2. Briefly, describe an interpretation of the parameter τ2.

3. Briefly, describe an interpretation of the parameter µ2. If you were to set this parameter yourself, what value would you choose and why?

4. Briefly, describe an interpretation of the parameter αzi.

5. Briefly describe one biological reason why this model may be inappropriate or sub-standard.

Problem 2

Part A:

You have recently learned about Gaussian processes. You think they are the coolest things ever. So you decide to represent the above model as a Gaussian process. What is the mean function m(.) and a kernel function K(.,.) that represents the above model as a Gaussian process of the form y GP(m(.), K(.,.)). In other words, find m and K such that the GP model matches the marginal distribution of your friends proposed Bayesian model.

Part B:
You realize that in your haste the above GP model ignored the parameters α that were the most important part of your analysis: you marginalized over them (oops!). Still, you think GPs are the best so you decide to instead use Gaussian Process Regression. Find m and K that represents your friends model as the following Gaussian Process Regression model

yi ∼ N (α(zi), σ2)
α ∼ GP(m(.), K(., .))

Attachment:- Data Mining.rar

Reference no: EM133030241

Questions Cloud

Assignment on organization development : Organization Development: Why is it important to conduct a thorough diagnosis/discovery prior to beginning any intervention activity with a client
What is the equivalent annual cost of the furnace : A new furnace for your small factory is being installed right now, will cost $35,000, and will be completed in one year. What is the equivalent annual cost
Discuss the process that nicole should follow : Based on the review of the store, Nicole, the general manager concluded that one of the first things she has to attend involves developing the job description o
Creating a high-performance compensation plan : 1. Why is the right balance between salaries and incentives critical to creating a high-performance compensation plan?
IST 557 Data Mining - Techniques and Applications Assignment : IST 557 Data Mining: Techniques and Applications Assignment Help and Solution, Pennsylvania State University - Assessment Writing Service
What is the most the company should pay : The projects internal rate of return is 5%. The project will generate annual operating cash inflows of $20,000. What is the most the company should pay
What to do before the job interview : 1. What are the basic requirements needed in applying for a job?
What is the present value of the future cash flows : What is the present value of the future cash flows, if you also could earn $290,000 per year rent on the property? The rent is paid at the end of each year
Describe the five product mix pricing decisions : Compare and contrast market-skimming and market-penetration pricing strategies.

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Describe difference between internal and external validity

For what types of data would you use nonparametric versus parametric statistics? Briefly describe the difference between internal and external validity

  Financial health of a business enterprise

What is captial budgeting? Why are capital budgeting decisions crucial to the long run financial health of a business enterprise?

  Find the time average of the given quantity

Sketch the lower bound E [N(t)] /t ≥ 1/E [X] - 1/t on the same graph with (c). Sketch E rSN(t)+1 - tl as a function of t and find the time average of this quantity.

  Performance and dating policies

What are incentive plans? How can they help the organization to achieve their objectives? Do incentive plans really improve performance and are they cost effective? Why or why not?

  Product learning process

A new product learning cycle indicates companies learning from product failure or mistake they made. In business, especially technology, it is important to learn from failure and continue improving.

  Show that the pair of variables is statistically independent

Find Pr{Xn+1 = i, Dn+1 = j | Dn} and show that the pair of variables (Xn+1, Dn+1) is statistically independent of Dn. What do your results mean relative to Burke's theorem.

  What is the probability of selecting a queen

What is the probability (represented in percent) of selecting a Queen from the deck of cards after a person has already selected a Queen and did not put it back in the deck?

  Nucleotide composition of strain

You are studying the variation among natural bacterial isolates from thermophilic vents, where temperatures routinely reach 55°C (131°F).

  What are the measures of center and why are they important

What are the measures of center and why are they important? Describe the level of measurement for each variable included in your data set.

  Compare the action potentials of contractile cardiac muscle

Compare the action potentials of contractile cardiac muscle, autorhythmic cardiac muscle and skeletal muscle.

  Is energy required for the method to occur

Choose one method of transport that the cell uses to move materials across its membrane.

  DSC-510 Advanced Probability and Statistics Assignment

DSC-510 Advanced Probability and Statistics Assignment Help and Solution, Grand Canyon University - Assessment Writing Service

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd