The mock data file consists of 3 columns each containing

Assignment Help Basic Statistics
Reference no: EM13371252

The mock data file consists of 3 columns, each containing 1000 numbers:

1. a flag identifying the data row

2. the sampled x value (1 - 1000)

3. the corresponding sampled y value (1 - 1000)

The challenge is essentially to determine the linear relationship between x and y using these 1000 data pairs. It divides up into three steps, of increasing complexity.

Step 1: Use ordinary least squares to fit the linear model  y = a + bx  to the mock data

          (a) compute LS estimators of a and b,

          (b) estimate the variance of the (assumed Gaussian) noise which has been added to the mock y values

          (c) estimate errors on a_LS and b_LS, and their covariance
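One possible sketch of Step 1 in Python (standard library only) is given below. The mock data file is not reproduced here, so the example generates synthetic data instead; the true values a = 5, b = 2, the noise level sigma = 50 and the function name `ols_fit` are assumptions made purely for illustration. The formulas are the standard closed-form results for a straight-line fit with equal, unknown Gaussian noise on each y value.

```python
import math
import random


def ols_fit(x, y):
    """Least-squares fit of y = a + b*x, assuming equal Gaussian noise on every y."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))

    # (a) least-squares estimators of slope and intercept
    b = sxy / sxx
    a = y_mean - b * x_mean

    # (b) unbiased noise variance: residual sum of squares over (n - 2)
    rss = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y))
    sigma2 = rss / (n - 2)

    # (c) variances of the estimators and their covariance
    var_a = sigma2 * (1.0 / n + x_mean ** 2 / sxx)
    var_b = sigma2 / sxx
    cov_ab = -sigma2 * x_mean / sxx
    return {"a": a, "b": b, "sigma2": sigma2,
            "var_a": var_a, "var_b": var_b, "cov_ab": cov_ab}


# Synthetic stand-in for the mock data (hypothetical values, NOT the real file).
rng = random.Random(0)
x = [rng.uniform(1, 1000) for _ in range(1000)]
y = [5.0 + 2.0 * xi + rng.gauss(0.0, 50.0) for xi in x]

fit = ols_fit(x, y)
print("a_LS = %.3f +/- %.3f" % (fit["a"], math.sqrt(fit["var_a"])))
print("b_LS = %.5f +/- %.5f" % (fit["b"], math.sqrt(fit["var_b"])))
print("noise sigma estimate = %.2f" % math.sqrt(fit["sigma2"]))
print("cov(a_LS, b_LS) = %.5f" % fit["cov_ab"])
```

With the real data, the two lists `x` and `y` would be read from columns 2 and 3 of the file. The covariance comes out negative whenever the mean of x is positive, which is why the credible regions in Steps 2 and 3 are tilted ellipses.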

Step 2: By casting the data analysis challenge not as a least squares problem, but as a maximum likelihood problem, form an appropriate likelihood function for the mock data, which depends on the parameters (a,b).

          Then compute the log likelihood on a rectangular grid of values of a and b (you need to think carefully about the range of a and b values you should consider, and the spacing between them). Compute the value of chi-squared for each (a,b) pair on your grid and find the minimum value of chi-squared. You should then turn your grid of values into a rectangular array of Delta chi-squared values. Finally, using the information in the table in Section 6, you should compute and plot Bayesian credible regions for the parameters at e.g. 68.3%, 95.4% and 99.73%. (Carrying out the calculations and making a contour plot from the results is straightforward in e.g. MATLAB, although you are welcome to use any programming language you wish.)
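The grid calculation in Step 2 could be sketched as follows in Python (standard library only). For independent Gaussian noise of standard deviation sigma, the log likelihood equals a constant minus chi-squared / 2, so the grid only needs chi-squared. Everything specific here is an assumption: the data are synthetic (hypothetical true values a = 5, b = 2, sigma = 50), sigma is treated as known, and the grid ranges are chosen by hand around those values. The table in Section 6 is not reproduced here, so the Delta chi-squared levels are computed from the chi-squared distribution with two degrees of freedom, which gives about 2.30, 6.16 and 11.83 for the three probabilities quoted.

```python
import math
import random


def chi_squared(a, b, x, y, sigma):
    """Chi-squared of the line y = a + b*x for Gaussian noise of std sigma."""
    return sum(((yi - a - b * xi) / sigma) ** 2 for xi, yi in zip(x, y))


def linspace(lo, hi, n):
    """n evenly spaced values from lo to hi inclusive."""
    step = (hi - lo) / (n - 1)
    return [lo + i * step for i in range(n)]


def delta_chi2_grid(a_vals, b_vals, x, y, sigma):
    """Return (array of chi2 - chi2_min, chi2_min, (a, b) at the grid minimum)."""
    best = None
    grid = []
    for a in a_vals:
        row = []
        for b in b_vals:
            c = chi_squared(a, b, x, y, sigma)
            row.append(c)
            if best is None or c < best[0]:
                best = (c, a, b)
        grid.append(row)
    chi2_min = best[0]
    delta = [[c - chi2_min for c in row] for row in grid]
    return delta, chi2_min, (best[1], best[2])


def delta_chi2_level(prob):
    """Delta chi-squared enclosing probability `prob` for TWO fitted parameters.

    Inverts the chi-squared CDF with 2 degrees of freedom, 1 - exp(-d / 2).
    """
    return -2.0 * math.log(1.0 - prob)


# Synthetic stand-in for the mock data (hypothetical values, NOT the real file).
rng = random.Random(0)
x = [rng.uniform(1, 1000) for _ in range(1000)]
y = [5.0 + 2.0 * xi + rng.gauss(0.0, 50.0) for xi in x]
sigma = 50.0  # assumed known here; Step 1(b) would supply an estimate

# Hand-picked grid, a few standard errors either side of the expected values.
a_vals = linspace(-10.0, 20.0, 41)
b_vals = linspace(1.975, 2.025, 41)
delta, chi2_min, (a_best, b_best) = delta_chi2_grid(a_vals, b_vals, x, y, sigma)

levels = {p: delta_chi2_level(p) for p in (0.683, 0.954, 0.9973)}
counts = {p: sum(1 for row in delta for d in row if d <= lev)
          for p, lev in levels.items()}
print("grid minimum at a = %.2f, b = %.4f" % (a_best, b_best))
for p in sorted(levels):
    print("%.2f%% region: Delta chi2 <= %.2f, %d grid cells"
          % (100 * p, levels[p], counts[p]))
```

The array `delta` is indexed as `delta[i][j]` for `a_vals[i]` and `b_vals[j]`; passing it to a contour-plotting routine with the three levels draws the credible regions. If a region touches the edge of the grid, the range is too narrow; if it covers only a handful of cells, the spacing is too coarse.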

Step 3: Finally, using the Metropolis algorithm, and assuming a Gaussian likelihood function for the model parameters a and b, write an MCMC code to generate a sample from the likelihood function, thinking carefully about your choices of proposal density and prior range for a and b. Use this sample to estimate the mean values, errors and covariance of the parameters a and b from their sampled marginal distributions. Devise a method for estimating and plotting Bayesian credible regions for the parameters, using your MCMC sample.
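A minimal Metropolis sketch for Step 3 is shown below in Python (standard library only). The data are again synthetic (hypothetical true values a = 5, b = 2, sigma = 50), and the proposal widths, flat prior range, chain length and burn-in are illustrative guesses that would need tuning on the real data. The credible-region method used at the end is only one possible choice, not a prescribed answer: with a flat prior the posterior is proportional to the likelihood, so the samples are ranked by likelihood and the most likely 68.3% (or 95.4%) of them are taken to fill the corresponding region.

```python
import math
import random


def log_likelihood(a, b, x, y, sigma):
    """Gaussian log likelihood up to an additive constant: -chi2 / 2."""
    return -0.5 * sum(((yi - a - b * xi) / sigma) ** 2 for xi, yi in zip(x, y))


def metropolis(x, y, sigma, start, step, bounds, n_steps, rng):
    """Metropolis sampler with a symmetric Gaussian proposal and a flat prior.

    step   = (step_a, step_b): proposal standard deviations
    bounds = ((a_lo, a_hi), (b_lo, b_hi)): flat prior range
    Returns the chain as a list of (a, b, logL) and the acceptance rate.
    """
    a, b = start
    logl = log_likelihood(a, b, x, y, sigma)
    chain, accepted = [], 0
    for _ in range(n_steps):
        a_new = a + rng.gauss(0.0, step[0])
        b_new = b + rng.gauss(0.0, step[1])
        inside = (bounds[0][0] <= a_new <= bounds[0][1]
                  and bounds[1][0] <= b_new <= bounds[1][1])
        if inside:  # proposals outside the prior range are always rejected
            logl_new = log_likelihood(a_new, b_new, x, y, sigma)
            diff = logl_new - logl
            # accept with probability min(1, L_new / L_old)
            if diff >= 0.0 or rng.random() < math.exp(diff):
                a, b, logl = a_new, b_new, logl_new
                accepted += 1
        chain.append((a, b, logl))  # a rejected move repeats the current point
    return chain, accepted / n_steps


def summarize(chain):
    """Sample means, variances and covariance of a and b."""
    n = len(chain)
    mean_a = sum(s[0] for s in chain) / n
    mean_b = sum(s[1] for s in chain) / n
    var_a = sum((s[0] - mean_a) ** 2 for s in chain) / (n - 1)
    var_b = sum((s[1] - mean_b) ** 2 for s in chain) / (n - 1)
    cov_ab = sum((s[0] - mean_a) * (s[1] - mean_b) for s in chain) / (n - 1)
    return mean_a, mean_b, var_a, var_b, cov_ab


def logl_threshold(chain, prob):
    """logL value such that a fraction `prob` of the samples lie at or above it."""
    logls = sorted((s[2] for s in chain), reverse=True)
    return logls[max(0, int(prob * len(logls)) - 1)]


# Synthetic stand-in for the mock data (hypothetical values, NOT the real file).
rng = random.Random(0)
x = [rng.uniform(1, 1000) for _ in range(1000)]
y = [5.0 + 2.0 * xi + rng.gauss(0.0, 50.0) for xi in x]

chain, acc_rate = metropolis(
    x, y, sigma=50.0, start=(5.0, 2.0), step=(1.5, 0.003),
    bounds=((-50.0, 50.0), (1.5, 2.5)), n_steps=4000, rng=rng)
kept = chain[1000:]  # discard an assumed burn-in of 1000 steps

mean_a, mean_b, var_a, var_b, cov_ab = summarize(kept)
thr68 = logl_threshold(kept, 0.683)
thr95 = logl_threshold(kept, 0.954)
print("acceptance rate = %.2f" % acc_rate)
print("a = %.3f +/- %.3f" % (mean_a, math.sqrt(var_a)))
print("b = %.5f +/- %.5f" % (mean_b, math.sqrt(var_b)))
print("cov(a, b) = %.5f" % cov_ab)
```

Plotting the samples whose log likelihood is at least `thr68` (and, in another colour, those at least `thr95`) gives a scatter-plot estimate of the credible regions. Because a and b are strongly anticorrelated, independent proposal steps in a and b mix slowly along the long axis of the ellipse, so the acceptance rate and a trace plot of the chain are worth checking before trusting the summary numbers.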

Steps 2 and 3 both involve more sophisticated methods and will require you to write some simple computer code (e.g. in MATLAB).

Basic Statistics Questions & Answers

  Probability that sample proportion is in plus-minus range

Assume the population proportion is p = .25. What is the probability that the sample proportion will be within +/- .03 of the population proportion if a sample of size 1,000 is selected (to 4 decimals)?

  Testing research hypotheses

In playing the Lemonade Stand Game, Bob decreased his price per cup by two cents, from $.27 per cup to $.25 per cup. At the .05 level of significance, did net revenue increase?

  Probability that a vice-president is not invited to any games

One of the three vice-presidents has not been invited to attend any of the last four games. What is the probability that this could happen?

  Research question and statistical null and alternative hypotheses

State the research question and statistical null and alternative hypotheses; and Explain why this test is appropriate and why the correlation, t-test, regression, or ANOVA is not appropriate.

  Find students who weigh less than one hundred twenty-eight pounds

a) How many students weigh less than 128 pounds? b) How many students weigh more than 165 pounds? c) How many students weigh between 135 and 165 pounds?

  Academic approach to confidence interval for mean

Find a point estimate of the population mean. Find the 95% confidence interval of the true mean. Assume the population standard deviation was 0.8.

  Find probability that mouse will live thirty-two months

Find the probability that a given mouse will live: a. more than 32 months, b. less than 28 months, c. between 37 and 49 months.

  Explaining parameter and statistics

Explain the terms parameter and statistics. Make sure the concepts of population and sample are included in the definitions.

  Describing significance of random sampling

Kindly explain to me the importance of random sampling. What problems or limitations could prevent truly random sampling, and how can they be prevented?

  Calculate the mean of the 20 samples

Calculate the mean of the 20 samples and draw a histogram showing the 20 sample means and describe the distribution of the x-bars that you see in part b (shape of distribution, center, and the amount of dispersion).

  Expected number of televisions in a home

Based on a survey of over 10,000 households, the number of televisions in the home was recorded as given in the table below. Based on the data, find the expected number of televisions in a home.

  Probability-uniform distribution

What is the probability that a randomly chosen eight-week old baby smiles between 2 and 18 seconds?

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd