Reference no: EM13371079

Description of the dataset

We will be using a dataset on neighborhood effects for this assignment. These data are drawn from the Project on Human Development in Chicago Neighborhoods (PHDCN), a probability sample of individuals nested within 342 Chicago neighborhoods. Each individual was interviewed about conditions, events, and relationships within the local area defined as the neighborhood. The individual-level variables are derived from the interviews, and the neighborhood-level variables are derived from the 1990 census. Rob Sampson, a sociologist at Harvard, was interested in assessing whether informal social control mediates the relationship between neighborhood social composition and perceived violence.

1. Read the Sampson, Raudenbush, & Earls article in preparation for this assignment. It will provide the necessary background on the study and the variables of interest.

2. The data set you will use is in a folder called Sampson on the class website. In the folder, there is a data dictionary describing the variables and the level-1 (individuals) and level-2 (neighborhoods) files. Download the dictionary and the data.

3. Based on your reading of the article and the variable list, create one or two research questions that can be answered using an HLM model. The outcome variable is perceived violence, measured at the individual level. You need to consider what combination of individual and neighborhood characteristics best explains variation in perceived violence.

4. In SPSS, check for missing data in the level-1 file prior to reading the files into HLM. Review the distributions of each variable in both the level-1 and level-2 files by creating plots in SPSS. Comment on the variable distributions: are any cause for concern due to unusual skewness?
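The assignment asks for these checks in SPSS. As a language-neutral illustration of what the two checks compute, here is a minimal Python sketch. The toy rows, the `age` column, and the convention that `None` or an empty string marks a missing value are all assumptions for illustration, not part of the Sampson files.

```python
def count_missing(rows, columns, missing_codes=(None, "")):
    """Count missing entries per column.

    missing_codes is an assumed convention; check the data dictionary
    for the codes actually used in the level-1 file.
    """
    counts = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            if row.get(col) in missing_codes:
                counts[col] += 1
    return counts


def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (0 for a symmetric sample)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Toy level-1 rows (hypothetical values, not real PHDCN data).
rows = [
    {"violence": 1.0, "age": 34},
    {"violence": 2.0, "age": None},
    {"violence": 2.0, "age": 51},
    {"violence": 9.0, "age": 29},
]

missing = count_missing(rows, ["violence", "age"])
violence_skew = skewness([r["violence"] for r in rows])
print(missing, round(violence_skew, 3))
```

A clearly positive value signals a long right tail, which is the kind of distribution the step asks you to comment on.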

5. Read the two data files into HLM and create the MDM file.

6. In the basic specifications menu, check that you want a level-1 and level-2 residual file. Use full maximum likelihood for estimation.

7. Choose VIOLENCE as the outcome variable. Fit an unconditional (intercept-only) model to the Sampson data. Interpret in words the coefficient for the fixed effect and the variance components at both levels. Report the confidence interval and the plausible values interval for the fixed effect. What do these intervals tell you? How are they different?
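The two intervals in step 7 differ only in which spread they use. A minimal sketch, assuming the usual large-sample 95% multiplier of 1.96 and made-up estimates (the numbers below are not results from the Sampson data):

```python
import math


def confidence_interval(gamma00, se, z=1.96):
    """Interval for the grand mean itself: estimate +/- z * standard error."""
    return (gamma00 - z * se, gamma00 + z * se)


def plausible_values_interval(gamma00, tau00, z=1.96):
    """Range expected to contain about 95% of neighborhood means:
    estimate +/- z * sqrt(level-2 intercept variance)."""
    half_width = z * math.sqrt(tau00)
    return (gamma00 - half_width, gamma00 + half_width)


# Hypothetical estimates for illustration only.
gamma00, se, tau00 = 2.0, 0.05, 0.25

ci = confidence_interval(gamma00, se)
pv = plausible_values_interval(gamma00, tau00)
print(ci, pv)
```

The confidence interval describes uncertainty about the average neighborhood mean, while the plausible values interval describes how much individual neighborhood means are expected to vary around it, so the second is typically much wider.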

8. Decompose the variance of perceived violence into the percent attributable to individuals and the percent attributable to neighborhoods. This is the intraclass correlation or ICC. Show your computation. How does this compare (in magnitude) to the ICC for the High School and Beyond data we used in class?
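For reference, the unconditional model and the ICC can be written as follows. This uses the standard two-level notation, with person i in neighborhood j; the symbols are conventional rather than taken from the assignment itself.

```latex
\begin{align*}
\text{Level 1:}\quad & Y_{ij} = \beta_{0j} + r_{ij}, & r_{ij} &\sim N(0, \sigma^2) \\
\text{Level 2:}\quad & \beta_{0j} = \gamma_{00} + u_{0j}, & u_{0j} &\sim N(0, \tau_{00}) \\
\text{ICC:}\quad & \rho = \frac{\tau_{00}}{\tau_{00} + \sigma^2}
\end{align*}
```

Here \(\rho\) is the share of total variance lying between neighborhoods, and \(1 - \rho\) is the share lying between individuals within neighborhoods.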

9. Fit two conditional models to the Sampson data.

Model 1: For the first model, use an intercept-only model at level-1 but add one (or more) predictors at level-2. This is similar in form to the means-as-outcomes model on your scorecard that we discussed in class.

• How will you center each level-2 variable? Justify all decisions.

• Interpret both the fixed effects and the variance components.

• Using proportional reduction in variance calculations (comparing this model to the unconditional model), describe how much variation in the intercept is attributable to the predictor(s) you choose.
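The proportional reduction in variance compares the level-2 intercept variance before and after adding predictors. A minimal sketch with made-up variance estimates (the function name and the numbers are illustrative assumptions):

```python
def proportion_variance_explained(tau00_unconditional, tau00_conditional):
    """(tau00 from the unconditional model - tau00 from the conditional model)
    divided by tau00 from the unconditional model."""
    if tau00_unconditional <= 0:
        raise ValueError("unconditional variance must be positive")
    return (tau00_unconditional - tau00_conditional) / tau00_unconditional


# Hypothetical variance components for illustration only.
prv = proportion_variance_explained(0.25, 0.10)
print(f"{prv:.0%} of the between-neighborhood variance is accounted for")
```

The result can be negative if the conditional estimate is larger than the unconditional one, which occasionally happens and is worth noting rather than hiding.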

Model 2: For the second model, add a predictor or set of predictors at level-1 but keep the same predictors at level-2 that you chose for the first conditional model.

• How does this change the model from Model 1? What variables do you want to add to level-1? How will you center each variable? Justify all decisions.

• Do all level-1 coefficients vary across neighborhoods at level-2, or will some be fixed?

• Interpret all parameters in your second model, both the fixed effects and all the variance components, including the covariances. Describe the meaning of the gammas and the elements of TAU in words.

• For the second model, provide both a table of your estimates (fixed effects, variance components, and the deviance statistic) and a plot of your final model. You can consult papers on the class website or the HLM text for good examples of multilevel tables. The Garner paper on SPARK is a useful model, but you may find other examples in your own disciplines.
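The centering decision for level-1 predictors in Model 2 comes down to which mean is subtracted. A minimal sketch of the two common options, using a hypothetical `ages` predictor and made-up neighborhood labels:

```python
from collections import defaultdict


def grand_mean_center(values):
    """Subtract the overall sample mean from every value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def group_mean_center(values, groups):
    """Subtract each observation's own group (neighborhood) mean."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for value, group in zip(values, groups):
        totals[group] += value
        counts[group] += 1
    return [v - totals[g] / counts[g] for v, g in zip(values, groups)]


# Hypothetical level-1 predictor and neighborhood IDs.
ages = [20.0, 30.0, 40.0, 60.0]
neighborhoods = ["A", "A", "B", "B"]

grand = grand_mean_center(ages)
group = group_mean_center(ages, neighborhoods)
print(grand)
print(group)
```

With grand-mean centering the intercept is an adjusted neighborhood mean; with group-mean centering it is the unadjusted neighborhood mean and the slope reflects only within-neighborhood differences.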

10. Output the level-1 and level-2 residual files from your final model.

• Examine the residual files and produce appropriate residual plots to examine the assumptions. Are the Level-1 residuals normally distributed? Are the level-2 residuals multivariate normal? How do you know?

• Sort in ascending order the level-2 empirical Bayes coefficients that represent the mean violence for each neighborhood (ECINTRCPT1). Depending on how you centered the level-1 predictors in your model, these may be adjusted means (if you use grand-mean centering, you are adjusting for differences across people in that neighborhood on that predictor). Identify the 5 best and 5 worst neighborhoods. Then examine some descriptive information about these neighborhoods from the level-2 SPSS file. In terms of perceived violence, what characterizes neighborhoods that are doing the best? The worst?
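The ranking step can be sketched as a simple sort on the empirical Bayes intercepts. The neighborhood IDs and values below are made up, and the sketch assumes that lower perceived violence counts as "better"; check the coding of VIOLENCE in the data dictionary before relying on that.

```python
def rank_neighborhoods(eb_intercepts, k=5):
    """Return (k lowest, k highest) neighborhood IDs by EB intercept.

    eb_intercepts maps a neighborhood ID to its empirical Bayes
    intercept (the ECINTRCPT1 column of the level-2 residual file).
    """
    ordered = sorted(eb_intercepts.items(), key=lambda item: item[1])
    lowest = [nid for nid, _ in ordered[:k]]
    highest = [nid for nid, _ in ordered[-k:][::-1]]
    return lowest, highest


# Hypothetical EB intercepts for six neighborhoods.
eb = {101: 1.8, 102: 2.4, 103: 1.2, 104: 3.1, 105: 2.0, 106: 2.9}

lowest, highest = rank_neighborhoods(eb, k=2)
print("lowest perceived violence:", lowest)
print("highest perceived violence:", highest)
```

The returned IDs can then be used to look up the matching rows in the level-2 file and compare their neighborhood characteristics.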
