IST 557 Data Mining - Techniques and Applications Assignment

Reference no: EM133030241

IST 557 Data Mining - Techniques and Applications - Pennsylvania State University

Homework

Consider a study of $M$ families, the $m$-th family containing $D_m$ individuals. For each individual in each family, we measure their height (in centimeters). Your friend, who is working with you to analyze these data, suggests the following Bayesian model:
$$y_i \sim \mathcal{N}(\alpha_{z_i}, \sigma^2) \tag{3}$$
$$\alpha_m \sim \mathcal{N}(\mu, \tau^2) \tag{4}$$

where $i$ indexes the individuals in the study, $z_i \in \{1, \dots, M\}$ is the family to which individual $i$ belongs, and $\alpha = \{\alpha_1, \dots, \alpha_M\}$.
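To make the generative structure concrete, the following is a minimal sketch that simulates data from the two-level model above. The function name `simulate_heights`, the family sizes, and the numeric values of `mu`, `tau`, and `sigma` are arbitrary illustrative assumptions and are not part of the assignment. Families are indexed from 0 in the code rather than from 1.

```python
import numpy as np

def simulate_heights(family_sizes, mu, tau, sigma, seed=0):
    """Draw alpha_m ~ N(mu, tau^2), then y_i ~ N(alpha_{z_i}, sigma^2)."""
    rng = np.random.default_rng(seed)
    M = len(family_sizes)
    # One latent mean height per family.
    alpha = rng.normal(mu, tau, size=M)
    # z[i] is the 0-based family index of individual i.
    z = np.repeat(np.arange(M), family_sizes)
    # Each individual's height is centered on their family's mean.
    y = rng.normal(alpha[z], sigma)
    return y, z, alpha

# Hypothetical study: three families with 3, 2, and 4 members.
y, z, alpha = simulate_heights([3, 2, 4], mu=170.0, tau=5.0, sigma=7.0)
print(z)
print(np.round(y, 1))
```

Here `tau` and `sigma` are standard deviations, so the variances in the model are their squares.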

Problem 1

1. Briefly describe an interpretation of the parameter $\sigma^2$ (1-2 sentences). If you were to set this parameter yourself, what value would you choose and why (1-2 sentences)? Hint: remember that $y$ is measured in centimeters.

2. Briefly describe an interpretation of the parameter $\tau^2$.

3. Briefly describe an interpretation of the parameter $\mu$. If you were to set this parameter yourself, what value would you choose and why?

4. Briefly describe an interpretation of the parameter $\alpha_{z_i}$.

5. Briefly describe one biological reason why this model may be inappropriate or sub-standard.

Problem 2

Part A:

You have recently learned about Gaussian processes. You think they are the coolest things ever, so you decide to represent the above model as a Gaussian process. What are the mean function $m(\cdot)$ and kernel function $K(\cdot,\cdot)$ that represent the above model as a Gaussian process of the form $y \sim \mathcal{GP}(m(\cdot), K(\cdot,\cdot))$? In other words, find $m$ and $K$ such that the GP model matches the marginal distribution of $y$ under your friend's proposed Bayesian model.
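One way to sanity-check a candidate $m$ and $K$ is to compare them against a Monte Carlo estimate of the marginal mean and covariance of $y$, obtained by redrawing the family means on every replicate so that they are integrated out. The sketch below does only that estimation step. The helper name `empirical_marginal`, the toy family assignment, and the parameter values are illustrative assumptions, not part of the assignment.

```python
import numpy as np

def empirical_marginal(z, mu, tau, sigma, n_draws=200_000, seed=0):
    """Monte Carlo estimate of E[y] and Cov[y] with alpha integrated out."""
    rng = np.random.default_rng(seed)
    z = np.asarray(z)
    M = z.max() + 1
    # Fresh family means for every replicate: shape (n_draws, M).
    alpha = rng.normal(mu, tau, size=(n_draws, M))
    # Heights for every individual in every replicate: shape (n_draws, N).
    y = rng.normal(alpha[:, z], sigma)
    return y.mean(axis=0), np.cov(y, rowvar=False)

# Toy design: individuals 0 and 1 share a family, individual 2 does not.
z = np.array([0, 0, 1])
mean_hat, cov_hat = empirical_marginal(z, mu=170.0, tau=5.0, sigma=7.0)
print(np.round(mean_hat, 1))
print(np.round(cov_hat, 1))
```

Evaluating a proposed mean function and kernel on the same three individuals should reproduce these estimates up to Monte Carlo error. In particular, compare the entry for the pair in the same family with the entry for a pair in different families.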

Part B:
You realize that, in your haste, the above GP model ignored the parameters $\alpha$ that were the most important part of your analysis: you marginalized over them (oops!). Still, you think GPs are the best, so you decide to use Gaussian process regression instead. Find $m$ and $K$ that represent your friend's model as the following Gaussian process regression model:

$$y_i \sim \mathcal{N}(\alpha(z_i), \sigma^2)$$
$$\alpha \sim \mathcal{GP}(m(\cdot), K(\cdot,\cdot))$$







All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd