Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Alternative hypothesis, The Null Hypothesis - H0: β0 = 0, H0: β 1 = 0, H...

The Null Hypothesis - H0: β0 = 0, H0: β 1 = 0, H0: β 2 = 0, Β i = 0 The Alternative Hypothesis - H1: β0 ≠ 0, H0: β 1 ≠ 0, H0: β 2 ≠ 0, Β i ≠ 0      i =0, 1, 2, 3

Orthogonal, Orthogonal is a term which occurs in several regions of the st...

Orthogonal is a term which occurs in several regions of the statistics with different meanings in each case. Most commonly the encountered in the relation to two variables or t

Paired availability design, Paired availability design  is a design which c...

Paired availability design  is a design which can lessen selection bias in the situations where it is not possible to use random allocation of the subjects to treatments. The desig

Lipstick Dilemma, For a career woman, wearing lipstick has become an integr...

For a career woman, wearing lipstick has become an integral part of her daily life. It is not unusual for a woman to look for a lipstick that will stay on her lips and not smudge

Empirical likelihood, An approach of using the likelihood as the basis of e...

An approach of using the likelihood as the basis of estimation without the requirement to specify a parametric family for data. Empirical likelihood can be viewed as the example of

Error rate estimation, The term used for the estimation of the misclassific...

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group

Bayes factor, Bayes factor : A summary of evidence for the modelM1 against ...

Bayes factor : A summary of evidence for the modelM1 against the another modelM0 provided by the set of data D, which can be used in the model selection. Given by the ratio of post

Oracle property, Oracle property is a name given to techniques for estimat...

Oracle property is a name given to techniques for estimating the regression parameters in the models fitted to high-dimensional data which have the property that they can correctl

Lexis diagram, Lexis diagram  is the diagram for displaying the simultaneou...

Lexis diagram  is the diagram for displaying the simultaneous effects of the two time scales (generally age and calendar time) on a rate. For instance, mortality rates from cancer

Likert scales, Likert scales is often used in the studies of attitudes in ...

Likert scales is often used in the studies of attitudes in which the raw scores are based on the graded alternative responses to each of a series of queries. For instance, the sub

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd