Calculating cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medloinc and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected: each group contains 32 cases (the df reported for each group in the Tests of Normality output). This is also reflected in the skewness statistic, which is below .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for
medloinc = below the median for low inc % 1993

 Frequency    Stem &  Leaf

     7.00       14 .  1223789
     9.00       15 .  234478888
     5.00       16 .  12788
     4.00       17 .  1378
     2.00       18 .  09
     1.00       19 .  6
     3.00       20 .  069
     1.00 Extremes    (>=22.5)

 Stem width:  1.0
 Each leaf:   1 case(s)
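One common way to set outlier cutoffs is Tukey's fences: 1.5 interquartile ranges beyond the first and third quartiles. The sketch below applies that rule to the values read off the stem-and-leaf plot for the below-median group. It is an assumption that this matches the rule SPSS applied; SPSS derives its quartiles from hinges, so its exact cutoffs may differ slightly. The single extreme case is entered at 22.5, the lower bound shown in the plot, because its exact value is not reported; as the largest value it does not affect the quartiles. The name tukey_fences is illustrative.

```python
from statistics import quantiles

# ACT means for the "below the median" group, read off the stem-and-leaf plot.
# The last value (22.5) is an assumption: the plot only reports "Extremes (>=22.5)".
act94_below = [
    14.1, 14.2, 14.2, 14.3, 14.7, 14.8, 14.9,
    15.2, 15.3, 15.4, 15.4, 15.7, 15.8, 15.8, 15.8, 15.8,
    16.1, 16.2, 16.7, 16.8, 16.8,
    17.1, 17.3, 17.7, 17.8,
    18.0, 18.9,
    19.6,
    20.0, 20.6, 20.9,
    22.5,
]


def tukey_fences(values, k=1.5):
    """Return (lower, upper) cutoffs: k * IQR beyond the first and third quartiles."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


lower, upper = tukey_fences(act94_below)
# Cases outside the fences are candidates for deletion.
outliers = [x for x in act94_below if x < lower or x > upper]
print(f"cutoffs: {lower:.2f} to {upper:.2f}; flagged: {outliers}")
```

Under these assumptions only the extreme case falls outside the cutoffs, which agrees with the single case the plot lists under Extremes.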

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality: average ACT score 1994

                                       Kolmogorov-Smirnov       Shapiro-Wilk
above or below median loinc            Statistic  df  Sig.      Statistic  df  Sig.
below the median for low inc % 1993    .162       32  .032      .903       32  .007
above the median for low inc % 1993    .166       32  .025      .921       32  .023

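According to the tests of normality, the distribution is not normal: every Sig. value for both the Kolmogorov-Smirnov and Shapiro-Wilk tests is below .05 in both groups, and the stem-and-leaf plot for the below-median group shows a positive skew. Therefore, for the transformation, we would select 'Square Root'.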
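The Sig. values in the Tests of Normality output are read against a significance level: a value below it indicates a significant departure from normality. A minimal sketch of that decision rule follows, assuming the conventional level of .05, which the output itself does not state. The helper name rejects_normality is illustrative.

```python
ALPHA = 0.05  # conventional significance level; an assumption, not stated in the output

# Sig. values copied from the Tests of Normality output for act94.
sig_values = {
    ("below median", "Kolmogorov-Smirnov"): 0.032,
    ("below median", "Shapiro-Wilk"): 0.007,
    ("above median", "Kolmogorov-Smirnov"): 0.025,
    ("above median", "Shapiro-Wilk"): 0.023,
}


def rejects_normality(p_value, alpha=ALPHA):
    """Return True when the Sig. value indicates a significant departure from normality."""
    return p_value < alpha


decisions = {key: rejects_normality(p) for key, p in sig_values.items()}
for (group, test), rejected in decisions.items():
    verdict = "departs from normal" if rejected else "no evidence against normality"
    print(f"{group:13s} {test:19s} {verdict}")
```

All four values fall below .05, so under this assumption both tests flag non-normality in both groups.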


f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

Levene's test was not significant, meaning the variances of the two groups do not differ significantly. We can therefore assume homogeneity of variance.
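For reference, Levene's statistic is a one-way ANOVA F computed on the absolute deviations of each score from its group mean. The sketch below computes the mean-centered version with the standard library only. The two groups are hypothetical values, not the assignment data, and the function name levene_w is illustrative. The resulting statistic would be compared with an F distribution on (k - 1, N - k) degrees of freedom to obtain the Sig. value.

```python
from statistics import mean


def levene_w(*groups):
    """Levene's W (mean-centered): ANOVA F on absolute deviations from group means."""
    # Absolute deviation of every score from its own group mean.
    z = []
    for g in groups:
        g_mean = mean(g)
        z.append([abs(x - g_mean) for x in g])

    k = len(z)
    n_total = sum(len(zi) for zi in z)
    z_group_means = [mean(zi) for zi in z]
    z_grand_mean = sum(sum(zi) for zi in z) / n_total

    # Between-group and within-group sums of squares of the deviations.
    between = sum(len(zi) * (m - z_grand_mean) ** 2 for zi, m in zip(z, z_group_means))
    within = sum((v - m) ** 2 for zi, m in zip(z, z_group_means) for v in zi)
    return (n_total - k) / (k - 1) * between / within


# Hypothetical ACT means for two groups of schools (not the assignment data).
group_a = [15.2, 16.1, 17.3, 18.0, 19.6]
group_b = [14.8, 16.5, 17.0, 18.4, 20.1]
w = levene_w(group_a, group_b)
print(f"Levene W = {w:.3f} on (1, {len(group_a) + len(group_b) - 2}) df")
```

A small W, with a Sig. value above the chosen level, gives no evidence that the group variances differ.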

2. Examination of the variable scienc93 indicates a substantial to severe positively skewed distribution. Transform this variable using the two most appropriate methods. After examining the distributions of the transformed variables, which transformation produced the best alteration?
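A base-10 logarithm is commonly recommended for substantial positive skew and the inverse (1/x) for severe positive skew; comparing the skewness of each transformed variable shows which one works better. The sketch below uses made-up positive scores, not scienc93, and the moment-based skewness formula, which differs slightly from the bias-adjusted value SPSS reports. Both transformations require strictly positive values, so a constant would have to be added first if the variable contained zeros.

```python
import math


def skewness(values):
    """Moment-based skewness m3 / m2**1.5 (SPSS reports a bias-adjusted variant)."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical, strictly positive, positively skewed scores (not the scienc93 data).
scores = [1, 1, 2, 2, 3, 4, 6, 10, 30, 100]

transformed = {
    "original": scores,
    "log10": [math.log10(x) for x in scores],
    "inverse": [1 / x for x in scores],
}
skews = {name: skewness(vals) for name, vals in transformed.items()}
for name, s in skews.items():
    print(f"{name:9s} skewness = {s:6.2f}")

# The transformation whose skewness is closest to zero is the better candidate.
best = min(("log10", "inverse"), key=lambda name: abs(skews[name]))
print(f"closest to symmetric: {best}")
```

The same comparison on the real variable would be backed up by inspecting the histogram and normal Q-Q plot of each transformed version.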



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd