Calculate the probability for smoking for low edcuation

Assignment Help Microeconomics
Reference no: EM131024693

The following two problems will require a lot of calculations in STATA. It will generate many pages of output. Here is how your should organize it. The first pages should contain your answers to all the questions, along with showing any key algebraic equations or explanations you need to use along the way. After that, include a printout of the output from the regressions you executed in support of your answers. Highlight any numbers in this output that you used in the first section. (You are encouraged to save paper here, you may print this section with a small font, double-sided and/or with 2-up format.) Last, include a copy of the DO file that contains the commands you asked STATA to execute. Be sure you organize these in a way that will be clear to the reader.

Problem 1

In the dataset Smoke1, there is information on 1186 males from the United States. Data from this sample includes the variables:
smoke= 1 for smokers, and 0 for nonsmokers
age=age in years
educ= number of years of schooling
income= family income
pcigs= an index of the average price of cigarettes in the individual's state

Part 1)

a) Create a dummy variable "hi_ed" that is a 1 if a person has =17 years of education, zero otherwise.

b) Estimate a linear regression (which in this context is called a linear probability model (LPM)) for the binary variable smoke on the independent variable hi_ed. Report the beta coefficient on the dummy variable and its p-value. In words, express what the beta coefficient means in this case.

c) Create a frequency table for the smoke and hi_ed variables. The command in STATA is tabulate smoke hi_ed.

d) Calculate the probability that a person smokes if high education. Calculate the probability for smoking for low edcuation.

e) What is the relationship between the results for parts b and d?

f) Calculate the odds that a low education person smokes. Calculate the odds a high education person smokes. Calculate the odds ratio.

g) Estimate the logistic regression (logistic command) for smoke and hi_ed. Confirm that this equals the odds ratio. In words, express what the odds ratio coefficient means in this case.

Part 2)

a) Estimate the logistic regression for smoke on pcigs when hi_ed=0. Then, calculate the predicted values for Y. The command to do this is "predict yhat_0." Repeat for when hi_ed=1, and also create a predicted value variable yhat_1.

b) Compare the coefficients for the two models. In words, explain what the model is saying about the impact of pcigs for the two different education groups.

c) Create a graph of the predicted values for the two versions. Use the following syntax: twoway (line yhat_0 pcigs, sort) (line yhat_1 pcigs, sort), legend(label(1 "low ed") label(2 "hi ed"))

d) For a low education person, a 1 unit change of the price index of cigarettes will change the odds of smoking by how much?

Part 3)

a) Run a logit regression (stata command logit) for smoke on the independent variables: age, educ, pcigs, income. Report the coefficients and p-values. This Model 1. Remember, for logit, (rather than logistic), the coefficients have not been exponentiated.

b) Create an interaction variable of educ and income. Run another logit regression, adding this interaction to Model 1. Report the coefficients and p-values. This is Model 2.

c) What differences strike you about Model 1 and Model 2? In particular, note how the signs and significance levels of the variables of educ and income have changed now that the interaction is included. In a clearly articulated paragraph (or two) give a thoughtful answer as to what you think must be going on. (This is not easy. Take your time and think hard about it.Your answer should contain two parts. First, talk about what the coefficients of the Model 2 regression are implying. Second, try and come up with an intuitive/economic hypothesis for why we are observing these results.)

Problem 2

The file Divorce.dta contains a four-period panel data set including various socioeconomic factors for all 50 US States. The panels were taken in 1965, 1975, 1985, 1995. The variables we're particularly interested in are:

divorce: number of divorces and annulments per state per one thousand population.
birth: number of live births per state per one thousand population.
marriage: number of marriages per state per one thousand population.
unemploy: total unemployment rate as a percentage of the total work force.
crime: total # of criminal offenses known to police per one hundred thousand population.

AFDC: average monthly AFDC payments per family. (AFDC is commonly known as Welfare, an income support payment by the government to certain low income groups, particularly unmarried women with children.)

Part 1)

a) Tell STATA this is a panel data set, with panels defined by entities "state" across periods "time." The command for this is: xtset state time.

b) Divorce will be our dependent variable throughout. Hypothesize the sign of the coefficients (or zero) that each of the independent variables in the dataset will have on divorce rates. Briefly provide an explanation of your thinking.

c) Run a regression of divorce on birth, marriage unemployment, crime, afdc using the entire pooled dataset. I'll call this the "naïve" regression. Report the coefficients and their p-values. Which, if any of the variables have a sign different from what you hypothesized?

Part 2)

a) Run a fixed effects regression using dummy variables for each state. Use the same variables as part 1c. The command for this is: regress variables i.state. Report your coefficients and p-values for all variables EXCEPT the state dummies. Are there any substantial changes in the fixed effect regression vs the "naïve" pooled regression?

b) All else equal, what is the difference in the divorce rate for Alabama vs. Oregon.

c) If AFDC payments increase by 300, what is the predicted increase in the divorce rate.

Part 3)

a) Use the built-in fixed effects regression command: xtreg variables, fe. Note that the coefficients resulting from this are identical to part 2a. (Don't forget the fe part or you won't get a fixed effects regression.)

b) Add to the regression of 3a a series of dummy variables to account for the time period. That is, estimate xtreg variables i.time, fe.This essentially adds fixed-effects terms for time, in addition to the state specific fixed effects.Report your coefficients and p-values.

c) From the results of 3b, how much higher is the divorce rate in 1995 compared to 1965?

d) Once we have controlled for time, how has the effect of AFDC payments on divorce rates changed compared to in 2a? Write a brief paragraph where you hypothesize why this change has happened.

Reference no: EM131024693

Questions Cloud

Receivables from the slow-paying clients : He explains to the board that removing the $80,000 increase in accounts receivable from current assets will increase net cash provided by operations. This approach may help Moss get the loan.
Find the slope of the line that goes through the given point : Evaluate the function at the given value of the independent variable and simplify.
Individual task of developing a summary : As a member of the staff (CLC) team assigned to work with Dave, you are assigned the individual task of developing a summary of the legal restrictions placed on the client, the areas of tax planning that you recommend for the taxpayer, and what so..
Record the realization of other assets : Prepare the journal entry for Donald, Chief, & Berry LLP on August 1, 2011 to record the realization of Other Assets. Prepare the journal entry for Donald, Chief, & Berry LLP on August 1, 2011 to record payment of liabilities.
Calculate the probability for smoking for low edcuation : Estimate a linear regression (which in this context is called a linear probability model (LPM)) for the binary variable smoke on the independent variable hi_ed. Report the beta coefficient on the dummy variable and its p-value. In words, express w..
T-account for short term investment : How will Waverly classify the investment? What will Waverly report on its 20X6 income statement? Show T-account for short term investment.
What is an inelastic vs. an elastic product : List at least five products that you personally consider inelastic in your life and at least five products that you personally consider elastic in your life. Explain why you consider these products elastic or inelastic from your perspective.
Prove that the reduced qr factorization a = qr : If the eigenvectors w1 and w2 of A^H*A are orthogonal, then Awi and Aw2 are orthogonal.
What are fixed costs and variable costs of this company : What are fixed costs and variable costs of this company? Discuss the relationship between marginal physical product, average total costs and marginal costs using the specific information given about the company.

Reviews

Write a Review

Microeconomics Questions & Answers

  The free rider problem

Question: Explain why the free rider problem makes it difficult for perfectly competitive markets to provide the Pareto efficient level of a public good.

  Failure of the super committee is good thing for economy

Some commentators have argued that the failure of the “Super committee” is good thing for the economy?  Do you agree?

  Case study analysis about optimum resource allocation

Case study analysis about optimum resource allocation: -  Why might you suspect (even without evidence) that the economy might not be able to produce all the schools and clinics the Ministers want? What constraints are there on an economy's productio..

  Fixed cost and vairiable cost

Questions:  :   Which of the following are likely to be fixed costs and which variable costs for a chocolate factory over the course of a month?  Explain your choice.

  Problem - total cost, average cost, marginal cost

Problem - Total Cost, Average Cost, Marginal Cost: -  Complete the following table of costs for a firm.  (Note: enter the figures in the  MC   column  between  outputs of  0 and 1, 1 and 2, 2 and 3, etc.)

  Oligopoly and demand curve problem

Problem based on Oligopoly and demand curve,  Draw and explain the demand curve facing each firm, and given this demand curve, does this mean that firms in the jeans industry do or do not compete against one another?

  Impact of external costs on resource allocation

Explain the impact of external costs and external benefits on resource allocation;  Why are public goods not produced in sufficient quantities by private markets?  Which of the following are examples of public goods (or services)? Delete the incorrec..

  Shifts in demand and movements along the demand curve

Describe the differences between shifts in demand and movements along the demand curve. What are the main factors which can shift the demand curve? Explain why they cause the demand curve to shift. Use examples and draw graphs to support your discuss..

  Article review question

Article Review Question: Read the following excerpts from the article "Fruit, veg costs surge' by Todd, Dagwell, published in the Herald on January 25th 2011 and answer questions below:

  Long-term growth, international trade & globalization

Long-term Growth, International Trade & Globalization:- This question deals with concepts such as long-term growth, international trade and globalization. Questions related to trade deficit, trade surplus, gains from trade, an international trade sce..

  European monetary union (emu) in crisis

"Does the economic bailout of Spain and Greece spell the beginning of the end for the European Monetary Union (EMU)?"

  Development game “settlers of catan”

Read the rules of the game, the overview and the almanac for the Development Game "Settlers of Catan"

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd