Outliers - reasons for screening data, Advanced Statistics

Outliers - Reasons for Screening Data

Outliers typically arise for one of three reasons: a data entry error, a subject who is not a member of the population the sample is meant to represent, or a subject who genuinely differs from the rest of the sample. Many statistical tests are quite sensitive to outliers, so they should be identified and addressed before the analysis.

Univariate outliers are easy to detect with z-scores, box plots, histograms, and similar tools. As a rule of thumb, standard scores beyond +/-3 are treated as outliers; consider a cutoff of 4 if n>100 or 2.5 if n<10.
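The z-score rule can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed procedure: the function name `z_score_outliers`, the `cutoff` parameter, and the `scores` data are all made up for the example, and the sample standard deviation (n - 1 denominator) is an assumption, since the text does not say which one to use.

```python
from statistics import mean, stdev

def z_score_outliers(values, cutoff=3.0):
    """Return (index, value, z) for every point whose |z| exceeds cutoff."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (assumed; n - 1 denominator)
    flagged = []
    for i, x in enumerate(values):
        z = (x - m) / s
        if abs(z) > cutoff:
            flagged.append((i, x, z))
    return flagged

# Hypothetical data: 19 scores near 50 plus one suspicious entry of 120.
scores = [48, 52, 50, 49, 51, 47, 53, 50, 49, 51,
          50, 48, 52, 50, 49, 51, 50, 50, 49, 120]

flagged = z_score_outliers(scores)           # default cutoff of +/-3
for index, value, z in flagged:
    print(f"case {index}: value={value}, z={z:.2f}")
```

Here only the last case is flagged, with a z-score of roughly 4.2. Because the outlier itself inflates the mean and standard deviation, the largest z-score a sample can produce shrinks as n gets smaller, which is one reason a lower cutoff is suggested for very small samples.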

Multivariate outliers are more difficult to detect. Mahalanobis distance is one powerful technique to use in this case (discussed later). The squared distance is evaluated as a chi-square statistic with degrees of freedom equal to the number of variables in the analysis. A case whose chi-square value is significant beyond the p<0.001 level is treated as an outlier.
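The Mahalanobis check can be sketched for the simplest case of two variables, using only the Python standard library. The function names and the data are hypothetical. The closed form `exp(-d2 / 2)` for the chi-square tail probability holds only for 2 degrees of freedom; with more variables one would need a general chi-square survival function from a statistics library instead.

```python
import math

def mahalanobis_sq_2d(points):
    """Squared Mahalanobis distance of each (x, y) case from the sample centroid."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    # Sample covariance matrix entries (n - 1 denominator).
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # d' S^-1 d, written out with the explicit inverse of a 2x2 matrix.
        distances.append((syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det)
    return distances

def chi2_tail_df2(d2):
    """P(chi-square with 2 df > d2); this closed form is valid only for 2 df."""
    return math.exp(-d2 / 2)

# Hypothetical data: 24 cases where y tracks x closely, plus one case (20, 5)
# that is unremarkable on each variable alone but breaks the joint pattern.
points = [(i, i + (1 if i % 2 else -1)) for i in range(1, 25)] + [(20, 5)]

d2 = mahalanobis_sq_2d(points)
outliers = [i for i, d in enumerate(d2) if chi2_tail_df2(d) < 0.001]
print(outliers)
```

In this made-up sample the last case has a squared distance of about 20.7, well past the p<0.001 threshold, even though its z-score on each variable separately is only about 1. That is the sense in which multivariate outliers are hard to spot with univariate tools.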

In most cases it is acceptable to drop the outlying case from the sample. If the researcher decides to keep it in the analysis, steps can be taken to reduce its relative influence.
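The text does not name a specific way to reduce an outlier's influence. One common option, shown here purely as an assumption, is a winsorizing-style adjustment: each flagged value is pulled in to the most extreme value that was not flagged. The function name `pull_in_outliers` and the data are hypothetical.

```python
from statistics import mean, stdev

def pull_in_outliers(values, cutoff=3.0):
    """Replace values with |z| > cutoff by the nearest non-flagged extreme."""
    m, s = mean(values), stdev(values)
    inliers = [x for x in values if abs(x - m) / s <= cutoff]
    low, high = min(inliers), max(inliers)
    # Clamp every value into the range spanned by the non-flagged cases.
    return [min(max(x, low), high) for x in values]

# Hypothetical data: 19 scores near 50 plus one extreme entry of 120.
scores = [48, 52, 50, 49, 51, 47, 53, 50, 49, 51,
          50, 48, 52, 50, 49, 51, 50, 50, 49, 120]

adjusted = pull_in_outliers(scores)
print(adjusted[-1])  # the extreme case now sits at the largest ordinary score
```

The case stays in the sample, so n is unchanged, but it no longer dominates the mean and variance. Other choices, such as transforming the variable, would serve the same purpose.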


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd