Missing data - reasons for screening data, Advanced Statistics


Missing Data - Reasons for screening data

When data are missing, the researcher needs to test whether the pattern of missing cases is random.

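Create a dichotomous indicator variable (missing vs. non-missing) for the variable in question. Then run a simple independent-samples t-test on a different variable in the collected sample, comparing the two indicator groups, to see whether they differ significantly. A significant difference suggests that the values are not missing at random.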
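
The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name missingness_t_test, the variables income and age, and the data are all hypothetical, and the sketch uses the pooled-variance form of the t-test with only the standard library. It returns the t statistic and degrees of freedom, not a p-value.

```python
from math import sqrt
from statistics import mean, variance


def missingness_t_test(target, other):
    """Compare `other` between cases where `target` is missing vs. present.

    Returns (t, df) for a pooled-variance independent-samples t-test.
    Missing values are represented as None (an assumption of this sketch).
    """
    # Dichotomous indicator: True where the target variable is missing.
    is_missing = [value is None for value in target]

    # Split the *other* variable by the indicator, skipping its own gaps.
    group_missing = [y for y, m in zip(other, is_missing) if m and y is not None]
    group_present = [y for y, m in zip(other, is_missing) if not m and y is not None]

    n1, n2 = len(group_missing), len(group_present)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two observations")

    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * variance(group_missing)
                  + (n2 - 1) * variance(group_present)) / df
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (mean(group_missing) - mean(group_present)) / standard_error
    return t, df


# Hypothetical data: income is missing for some respondents, age is complete.
income = [52, None, 48, None, 61, 55, None, 47, 58, None]
age = [34, 61, 29, 58, 41, 38, 65, 31, 36, 59]

t_stat, dof = missingness_t_test(income, age)
print(f"t = {t_stat:.2f}, df = {dof}")
```

The absolute value of t is then compared with the critical value for the given degrees of freedom (2.306 for df = 8 at the two-tailed 5% level). In this made-up sample the respondents with missing income are clearly older, so the missingness would not look random.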

Handling missing values:

1. Delete the cases with missing data (a good idea if there are only a few missing cases).

2. Delete the variables containing missing values (a good idea if most of the missing values are concentrated in only a couple of variables, but still problematic if those variables are important to the ultimate goal of the research).

3. Estimate the missing values, using one of the following:

a. Prior knowledge: fill in a value based on what the researcher already knows about the case or the population.

b. Mean replacement: replace missing values with the mean of the variable. The main concern is that this lowers the calculated variance compared with the unknown actual variance. One variation uses group means for the missing values when the analysis involves group comparisons.

c. Regression approach: use several independent variables (IVs) to explain the dependent variable (DV) that contains the missing values, then predict the missing values from the IV values. Concerns include finding proper IVs that explain the DV, and the fact that estimates obtained from prediction are more consistent with the scores used to predict them than the real values would be.

Whichever of these techniques is used, the researcher has to check that the solution has not changed the results of the analysis (run the tests with and without the treatment).
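
To make the variance concern with mean replacement concrete, here is a minimal sketch using only the Python standard library. The helper names impute_mean and impute_group_mean and the data are hypothetical, and missing values are assumed to be stored as None. It compares the sample variance of the observed cases with the variance after each kind of fill, which is one simple way of looking at results with and without the treatment.

```python
from statistics import mean, variance


def impute_mean(values):
    """Replace None with the mean of the observed values."""
    fill = mean(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


def impute_group_mean(values, groups):
    """Replace None with the mean of the observed values in the same group."""
    observed = {}
    for v, g in zip(values, groups):
        if v is not None:
            observed.setdefault(g, []).append(v)
    # A group with no observed values would raise KeyError below;
    # this sketch does not handle that case.
    group_means = {g: mean(vs) for g, vs in observed.items()}
    return [group_means[g] if v is None else v for v, g in zip(values, groups)]


# Hypothetical scores for two groups, with one missing value in each.
scores = [10, 12, None, 14, 20, None, 22, 24]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

observed_only = [v for v in scores if v is not None]
overall = impute_mean(scores)
by_group = impute_group_mean(scores, groups)

print("variance, observed cases only:", round(variance(observed_only), 2))
print("variance, overall-mean fill:  ", round(variance(overall), 2))
print("variance, group-mean fill:    ", round(variance(by_group), 2))
```

Filling with the overall mean adds cases that sit exactly at the center of the distribution, so the sum of squared deviations stays the same while the number of cases grows, and the calculated variance drops. In this made-up sample the group-mean fill shrinks the variance less, because the filled values stay close to their own group.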

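The regression approach can be sketched in the same spirit. The text speaks of several IVs; to stay within the standard library, this illustration is simplified to a single IV fitted by ordinary least squares, and the names fit_line, impute_by_regression, hours, and score are hypothetical. Missing values are again assumed to be None.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def impute_by_regression(iv, dv):
    """Fill None entries in `dv` with predictions from a line fitted on `iv`."""
    # Fit only on the complete cases (those with an observed DV).
    complete = [(x, y) for x, y in zip(iv, dv) if y is not None]
    intercept, slope = fit_line([x for x, _ in complete],
                                [y for _, y in complete])
    # Observed values are kept; missing ones are replaced by predictions.
    return [intercept + slope * x if y is None else y for x, y in zip(iv, dv)]


# Hypothetical data: study hours (IV, complete) and test score (DV, with gaps).
hours = [1, 2, 3, 4, 5, 6]
score = [52, 55, None, 57, None, 63]

filled = impute_by_regression(hours, score)
print([round(v, 2) for v in filled])
```

Every imputed value falls exactly on the fitted line, whereas real scores would scatter around it. That is the concern noted in the list: the estimates are more consistent with the predictor scores than the true values would be.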


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd