Write distribution in the canonical form of linear model

Assignment Help Basic Statistics
Reference no: EM131748799

Statistics General Linear Model Midterm Exam

Q1. There are a series of papers by de Souza et al. on the generalized linear models in astronomy. These articles can be found either from the links provided below, or from the Blackboard where three pdf files were uploaded. In this question, we focus on the second paper. You may read other papers if you are interested in.

  • R.S. de Souza, E. Cameron, M. Killedar, J. Hilbe, R. Vilalta, U. Maio, V. Biffi, B. Ciardi, J.D. Riggs (2015). The overlooked potential of generalized linear models in astronomy, I: binomial regression, Astronomy and Computing, 12, 21-32.
  • J. Elliott, R.S. de Souza, A. Krone-Martins, E. Cameron, E.E.O. Ishida, J. Hilbe (2015). The overlooked potential of generalized linear models in astronomy, II: gamma regression and photometric redshifts. Astronomy and Computing, 10, 61-72.
  • R. S. de Souza, J. M. Hilbe, B. Buelens, J. D. Riggs, E. Cameron, E. E. O. Ishida, A. L. Chies-Santos, M. Killedar (2015). The overlooked potential of generalized linear models in astronomy, III: Bayesian negative binomial regression and globular cluster populations. Monthly Notices of the Royal Astronomical Society, 453, 1928-1940.

(a) Read the second paper by Elliott et al. on gamma regression and photometric redshifts. Write a brief summary of section 2 overview of regression methods, page 62-64.

(b) Appendix A (page 68-69) provides instructions to perform the photometric redshift estimation using the R package. Run these R codes line by line, and explain the purpose and output of each command line. Elliott et al. also provide python codes in Appendix B. If you prefer python, you can run and explain the python codes. Note, you only need to choose either R or python.

Q2. Consider the data from All Time World Rankings. We use man's 100 meter dash records and woman's 100 meter dash records.

First, summarize these records by using a table with columns Time Record (second), Age (year), and Gender (Female or Male).

For the time record in each age and gender group, you should use the fastest times without wind assistance. Based on Rule 260.14(c) of IAAF Competition Rules 2016-2017, if a tail wind exceeds 2 meters per second the result cannot be registered as a record on any level. So you should use the fastest times among the wind speed less than or equal to +2 m/s.

For age, use the lower bound of each age group. For instance, the age for age group M35-39 is 35, the age for age group W90-94 is 90.

(a) Summarize the record of each age and gender group and form an R-readable table. For example, the first several rows of the table may be

Gender

Age

Time

M

35

9.97

M

40

10.29

. . . . . .

W

35

10.74

W

40

10.99

. . . . . .

(b) Consider time as the response variable (y) and age as the explanatory variable (x). For female students, use woman's record; for male students, use man's record. Fit the models

y = β10 + β11x

and

y = β20 + β21x + β22x2.

Include your R codes and report your estimates. Does the extra quadratic term appear necessary?

(c) Denote the estimates in part (a) of the intercept of model y = β10 + β20x as b0F in woman's record model, and as b0M in man's record model.

Include gender as an additional explanatory variable (v), and v = 1 corresponds to woman's record, and v = 0 corresponds to man's record. Consider the model

y = β30 + β31x + β32v.

Include your R codes and report your estimates. How does gender appear to affect the records?

(d) For female students, compare βˆ30 + βˆ32 and b0F. For male students, compare βˆ30 and b0M. Explain the difference.

(e) For female students, use woman's record; for male students, use man's record. Using the data fit a Gamma generalized linear model. Interpret your findings and compare with part (b). Include your R codes, and write down the link function you choose, and the equation of your fitted model.

(f) Show that the density of inverse Gaussian distribution lies in the exponential family, and write the distribution in the canonical form of a generalized linear model. Then repeat part (e) using an inverse Gaussian generalized linear model.

Q3. Two items A and B are weighed on a balance, first separately and then together, to yield observations y1, y2, and y3. Say, suppose the true weights of A and B are αA and αB, we have

y1 = αA + ε1

y2 = αB + ε2

y3 = αA + αB + ε3

(a) If εi ∼ N(0, σ2ε), i = 1, 2, 3, find the reasonable estimates of αA and αB. Show your work.

(b) If εi ∼ N(0, σ2ε) for i = 1, 2, and ε3 ∼ N(0, k2σ2ε), where constant k > 1, find the reasonable estimates of αA and αB. Show your work.

(c) Let y1 = 41, y2 = 53, y3 = 97, k = 1.2. Choose a suitable function in R, and find the estimates of αA and αB in (a) and (b). Include your R codes, and highlight the key R function you use. Compare the estimates of αA and αB in (a) and (b) and explain the differences.

Attachment:- Assignment File.rar

Reference no: EM131748799

Questions Cloud

Discuss the advantage of disciplinary approach to curriculum : Discuss the advantage of the disciplinary approach to curriculum and instruction over the more traditional subject centered approach.
Discuss how would you change the social security system : Do you feel that Social Security, as a means of ensuring that Americans have some sort of income in their later
Compute the quantization error sequences : Quantization Experiments Use fxquant ( ) to quantize the 8000 input speech samples from the file S5 . MAT. Using rounding and saturation.
Implement a connect four game using min max algorithm : CS 5346 - Advanced Artificial Intelligence - Artificial intelligence to implement a connect four game using min max algorithm and writing an evaluation function
Write distribution in the canonical form of linear model : Statistics 706 General Linear Model Midterm Exam. write the distribution in the canonical form of a generalized linear model
Discuss the medicare system along with the problems : Discuss the Medicare system along with the problems that currently exist in the program
Discuss the various stages or phases of group development : The icebreaker that really caught my attention was the yarn web exercise. This is when you go around the group and someone starts out by listing some facts
Write your own philosophy of education to keep in portfolio : Write your own philosophy of education to keep in your portfolio. Include your definition of a successful teacher, what led you to the teaching field.
How would you implement a special-purpose fft : Improvements for a Single Length If one wants to execute a single OFT repeatedly on different data, a special-purpose program can be written.

Reviews

len1748799

12/4/2017 12:36:00 AM

There are 3 questions, each question is between 3-6 points. A perfect score is 10 points. Show all work. You will receive partial credit for partially completed problems. You may use any references, any texts and any online media. Discussion between classmates in Stat 706 is encouraged. But it is not allowed to directly copy solutions from other students. If you have a group discussion (on-line or face-to-face), please mention it in your solution (including the names of participants in your discussion group). Mentioning the general discussion will not influence your score.

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd