Outliers - reasons for screening data, Advanced Statistics

Outliers - Reasons for Screening Data

Outliers arise from data entry errors, from a subject who is not a member of the population that the sample is meant to represent, or from a subject who really is different from the rest. Statistical tests are quite sensitive to outliers, so this problem should be addressed before the analysis is run.

Univariate outliers are easy to detect with z-scores, box plots, histograms, and similar tools. Standard scores beyond ±3 are treated as outliers (consider ±4 if n > 100, or ±2.5 if n < 10).
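The z-score rule can be sketched in a few lines of Python. This is a minimal illustration, not a definitive implementation: the function name `zscore_outliers`, the `cutoff` parameter, and the data are all hypothetical, and the sample standard deviation (n - 1 denominator) is assumed.

```python
from statistics import mean, stdev

def zscore_outliers(values, cutoff=3.0):
    """Return (value, z) pairs whose absolute standard score exceeds cutoff."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (n - 1 denominator), an assumption
    return [(x, (x - m) / s) for x in values if abs((x - m) / s) > cutoff]

# Made-up data: 19 typical scores plus one suspicious entry (for example a typo).
scores = [50, 52, 48, 51, 49, 50, 53, 47, 50, 52,
          48, 51, 49, 50, 52, 48, 51, 49, 50, 150]

flagged = zscore_outliers(scores)  # only the value 150 exceeds the cutoff
for value, z in flagged:
    print(f"value={value} z={z:.2f}")
```

When the sample standard deviation is used, no standard score can exceed (n - 1)/√n in absolute value. That bound is below 3 for n ≤ 10, which is one reason a lower cutoff is suggested for very small samples.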

Multivariate outliers are difficult to detect. Mahalanobis distance is one powerful technique to use in this case (discussed later). The squared distance is evaluated as a chi-square statistic with degrees of freedom equal to the number of variables in the analysis. A case whose chi-square value is significant at the p < 0.001 level is treated as an outlier.
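As a rough sketch of the Mahalanobis screen, the example below handles the two-variable case using only the standard library. The function name `mahalanobis_sq_2d` and the data are hypothetical. With 2 degrees of freedom the chi-square upper tail is exp(-x/2), so the p = 0.001 critical value can be computed directly as -2 ln(0.001), about 13.82.

```python
import math
from statistics import mean

def mahalanobis_sq_2d(points):
    """Squared Mahalanobis distance of each (x, y) point from the sample centroid."""
    n = len(points)
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    mx, my = mean(xs), mean(ys)
    # Sample variances and covariance (n - 1 denominator).
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # (dx, dy) S^-1 (dx, dy)^T with the 2x2 inverse written out by hand.
        distances.append((syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det)
    return distances

# Chi-square critical value for df = 2 (two variables) at p = 0.001.
CRITICAL_P001_DF2 = -2 * math.log(0.001)

# Made-up data: 24 cases where y tracks x closely, plus one case that breaks the pattern.
points = [(i, i + (1 if i % 2 == 0 else -1)) for i in range(24)]
points.append((3, 20))

d2 = mahalanobis_sq_2d(points)
flagged = [p for p, d in zip(points, d2) if d > CRITICAL_P001_DF2]
print(flagged)
```

The case (3, 20) is unremarkable on either variable alone (both values sit well inside the observed ranges), yet it is flagged because it contradicts the strong relationship between the two variables. For more than two variables the critical value depends on the degrees of freedom; if SciPy is available, `scipy.stats.chi2.ppf(0.999, df)` gives it.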

In most cases it is acceptable to drop the outlying value from the sample. If the researcher decides to keep such values in the analysis, steps can be taken to reduce their relative influence.
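The two options can be compared on a small example. The text does not name a specific method for reducing influence, so the second function below is an assumption chosen for illustration: it pulls each flagged value in to the most extreme value that was not flagged. The function names and data are hypothetical.

```python
from statistics import mean, stdev

def drop_outliers(values, cutoff=3.0):
    """Option 1: remove values whose absolute z-score exceeds cutoff."""
    m, s = mean(values), stdev(values)
    return [x for x in values if abs(x - m) / s <= cutoff]

def pull_in_outliers(values, cutoff=3.0):
    """Option 2 (assumed method): keep every case, but recode flagged values
    to the nearest value that was not flagged."""
    kept = drop_outliers(values, cutoff)
    low, high = min(kept), max(kept)
    return [min(max(x, low), high) for x in values]

# Made-up data: the same kind of sample as before, with one extreme score.
scores = [50, 52, 48, 51, 49, 50, 53, 47, 50, 52,
          48, 51, 49, 50, 52, 48, 51, 49, 50, 150]

dropped = drop_outliers(scores)      # 19 cases remain
adjusted = pull_in_outliers(scores)  # 20 cases remain, 150 recoded to 53

print(mean(scores), mean(dropped), mean(adjusted))
```

Dropping the case shrinks the sample, while recoding keeps the sample size and still records that the case was high. Either way, the decision and the rule used should be reported alongside the results.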


