Problem set for logistic regression, Applied Statistics

Assignment Help:

(1) What values can the response variable Y take in logistic regression, and hence what statistical distribution does Y follow? The response variable can take the value of either a 1 or a 0, and follows a binomial distribution.

(2) How are the parameters estimated in logistic regression?  Is this different from how the parameters are estimated in Ordinary Least Squares (OLS) regression? Logistic regressionparameters are estimated utilizing the maximum likelihood method, which is the same underlying method for OLS regression.  However, with logistic regression, an iterative method conducted via software because it is more complicated to estimate nonlinear parameters β0 and β1.  This differs from OLS, because the OLS method is by differentiating the sum of squared deviations.  This is an easier method because those deviations are linear in relation to β.

Coefficient estimates in logistic regression can also be found by utilizing the following methods

- noniterative weighted least squares

- discriminant function analysis

(3) How do we define a "residual" in logistic regression, and how is it computed?

In Logistic Regression, the Deviance fills the same role as the residual sumo f squares in linear regression. 

This is computed by calculating what is known as the likelihood-ratio test, Illustrated below:

D=-2ln ( likelihood of the fitted model / likelihood of the saturated model)

 Model 1:  Let's consider the logistic regression model, which we will refer to as Model 1, given by

                                log(pi / [1-pi]) = 0.25 + 0.32*X1 + 0.70*X2 + 0.50*X3                         (M1),

where X3 is an indicator variable with X3=0 if the observation is from Group A and X3=1 if the observation is from Group B.  The likelihood value for this fitted model on 100 observations is 0.0850.

(4)    (6 points) For X1=2 and X2=1 compute the log-odds for each group, i.e. X3=0 and X3=1.

Group A (X3=0);

Group B (X3=1);

(5) For X1=2 and X2=1 compute the odds for each group, i.e. X3=0 and X3=1. 

(6) For X1=2 and X2=1 compute the probability of an event for each group, i.e. X3=0 and X3=1. 

(7) Using the equation for M1, compute the relative odds associated with X3, i.e. the relative odds of Group B compared to Group A. 

(8) Use the odds for each group to compute the relative odds of Group B to Group A. How does this number compare to the result in Question #7.  Does this make sense?

Model 2:  Now let's consider an alternate logistic regression model, which we will refer to as Model 2, given by

                                log(pi / [1-pi]) = 0.25 + 0.32*X1 + 0.70*X2 + 0.50*X3 + 0.1*X4       (M2),

where X3 is an indicator variable with X3=0 if the observation is from Group A and X3=1 if the observation is from Group B.  The likelihood value from fitting this model to the same 100 observations as M1 is 0.0910.

(9) Use the G statistic to perform a likelihood ratio test of nested models for M1 and M2.  State the hypothesis that is being tested, compute the test statistic, and test the statistical significance using a critical value for alpha=0.05 from Table A.3 on page 375 in Regression Analysis By Example.  From these results should we prefer M1 or M2?


Related Discussions:- Problem set for logistic regression

What are the coefficients of the linear combination, For the following ques...

For the following questions we are interested in a comparison of the 16 years education vs. > 16 years. (Recall we did the analysis on the log scale, so these are actual means on t

Time series, Year Production 2006 8 2007 6 2008 10 2009 12 2010 11 2011 15 ...

Year Production 2006 8 2007 6 2008 10 2009 12 2010 11 2011 15 2012 14 2013 16 Determine the trend from data given above?

Evaluate central tendency and variability, Why are graphs and tables useful...

Why are graphs and tables useful when examining data? A researcher is comparing two middle school 7th grade classes. One class at one school has participated in an arts program

Factor loadings matrix, As we stated above, we start factor analysis with p...

As we stated above, we start factor analysis with principal component analysis, but we quickly diverge as we apply the a priori knowledge we brought to the problem. This knowled

Stratified random sampling, Stratified Random Sampling: This method of ...

Stratified Random Sampling: This method of sampling is used when the population is comprised of natural subdivision of units, The method consist in classifying the population u

#title., 1 Se toma una muestra de 81 observaciones con una desviación están...

1 Se toma una muestra de 81 observaciones con una desviación estándar de 5. La media de la muestra es de 40. Determine el intervalo de de confianza de 99% para la media

Determine the matrix of the transformation, Consider the linear transformat...

Consider the linear transformation (a) Find the image of (3 , -2 , 2) under T. (b) Does the vector (5, 3) belong to the range of T? (c) Determine the matrix of the transf

Luxury goods higher for men than for women, According to a recent study, wh...

According to a recent study, when shopping online for luxury goods, men spend a mean of $2,401, whereas women spend a mean of $1,527. Suppose that the study was based on a sample o

Decision making ., it is said that management is equivalent to decision mak...

it is said that management is equivalent to decision making? do you agree? explain

Gcnnv, Ask questiovdgngddndgdngngngngn #Minimum 100 words accepted#

Ask questiovdgngddndgdngngngngn #Minimum 100 words accepted#

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd