Missing data - reasons for screening data, Advanced Statistics

Assignment Help:

Missing Data - Reasons for screening data

In case of any missing data, the researcher needs to conduct tests to ascertain that the pattern of these missing cases is random.

Create dichotomous variable - non-missing vs missing for a specific variable. Run a simple independent samples t-test on a different variable in the collected sample to see if there are any significant differences.

Handling missing values:

1. Delete missing data (good idea if there are only a few missing cases)

2. Delete variables containing missing values (good idea if most of the missing values are concentrated to only a couple of variables. Still problematic if they are important to the ultimate goal of the research)

3. Estimate missing values

4. Prior knowledge

5. Replace missing values with the mean (main concern: lowers the calculated variance as compared to the unknown actual variance)
One variation involves using group means for missing values for cases involving group comparison analysis

6. Regression approach: use several IVs to explain the DV (that includes several missing values). Predict missing values using IV values.

7. Concerns include finding proper IVs that explain DV, estimates obtained from prediction more consistent with the scores used to predict them compared to the real values.

8. When we use any of the techniques described above, as a researcher we have to ascertain that our solution hasn't changed the results of the analysis (run the tests, with and without the treatment).


Related Discussions:- Missing data - reasons for screening data

Ascertainment bias, Ascertainment bias : A feasible form of bias, particula...

Ascertainment bias : A feasible form of bias, particularly in the retrospective studies, which arises from the relationship between the exposure to the risk factor and the probabil

correlation, i will like to submit my project for you to do on chi-square,...

i will like to submit my project for you to do on chi-square, ANOVA, and correlation and simple regression. how can we do this?

Complier average causal effect (cace), Complier average causal effect (CACE...

Complier average causal effect (CACE): The treatment effect amid true compliers in the clinical trial. For the suitable response variable, the CACE is given by the difference in o

Explain remedian, Remedian: The robust estimator of location which is comp...

Remedian: The robust estimator of location which is computed by an iterative process. By assuming that the sample size n can be written as bk where b and k are the integers, the s

Parks test, The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedastici...

The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedasticity errors and no heteroscedasticity exists The Alternative Hypothesis - H1: β 1 ≠ 0 i.e. there is no homoscedasti

Uncertainty analysis, Uncertainty analysis is the process for assessing th...

Uncertainty analysis is the process for assessing the variability in the outcome variable that is due to the uncertainty in estimating the values of input parameters. A sensitivit

Regression dilution, Regression dilution is the term which is applied when...

Regression dilution is the term which is applied when a covariate in the model cannot be measured directly and instead of that a related observed value must be used in analysis. I

Define non linear mapping (nlm), Non linear mapping (NLM ) is a technique f...

Non linear mapping (NLM ) is a technique for obtaining a low-dimensional representation of the set of multivariate data, which operates by minimizing a function of the differences

Variance inflation factor, VIF is the abbreviation of variance inflation fa...

VIF is the abbreviation of variance inflation factor which is a measure of the amount of multicollinearity that exists in a set of multiple regression variables. *The VIF value

Observation-driven model, Observation-driven model  is a term generally a...

Observation-driven model  is a term generally applied to models for the longitudinal data or time series which introduce within the unit correlation by specifying the conditional

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd