Calculating cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medloinc and act94?

b. Is there a severe split in frequencies between groups?

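According to the descriptive analysis, no severe split is detected: the output lists 32 valid cases in each group, so the groups are evenly balanced. The skewness statistic for the grouping variable, which is below .5, is consistent with this.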
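The reasoning behind the split check can be sketched in a few lines. This is a minimal illustration, not the SPSS procedure: the group counts of 32 are taken from the df column of the normality output on this page, and the 90/10 threshold is an assumed rule of thumb for a "severe" split rather than something stated in the assignment. The function names are hypothetical.

```python
def split_proportions(counts):
    """Return each group's share of the total number of cases."""
    total = sum(counts.values())
    return {group: n / total for group, n in counts.items()}


def has_severe_split(counts, threshold=0.90):
    """Flag a severe split when one group holds at least `threshold` of cases.

    The 0.90 default is an assumed rule of thumb (a 90/10 split).
    """
    return max(split_proportions(counts).values()) >= threshold


# Valid cases per group, read from the df column of the normality table.
group_counts = {"below median": 32, "above median": 32}

print(split_proportions(group_counts))
print(has_severe_split(group_counts))
```

With equal group sizes each proportion is .50, far from the assumed 90/10 boundary.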

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for
medloinc = below the median for low inc % 1993

 Frequency    Stem &  Leaf

     7.00       14 .  1223789
     9.00       15 .  234478888
     5.00       16 .  12788
     4.00       17 .  1378
     2.00       18 .  09
     1.00       19 .  6
     3.00       20 .  069
     1.00 Extremes    (>=22.5)

 Stem width:  1.0
 Each leaf:   1 case(s)
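One way to reason about questions c and d for this group is to compute boxplot-style cutoffs from the values shown in the plot. The sketch below assumes Tukey's hinges with a 1.5 hinge-spread rule, which is a common convention and not necessarily the exact rule the software applied. The 31 listed scores are read from the stems and leaves (one decimal place, so they are approximate). The single extreme case is only known to be at least 22.5, so `PLACEHOLDER_EXTREME` is a stand-in value, not the real score; the hinges do not depend on its exact value as long as it is the largest case.

```python
from statistics import median


def tukey_hinges(values):
    """Return (lower hinge, upper hinge): medians of the lower and upper halves.

    When n is odd, the overall median is included in both halves.
    """
    xs = sorted(values)
    n = len(xs)
    half = (n + 1) // 2
    return median(xs[:half]), median(xs[n - half:])


def outlier_cutoffs(values, k=1.5):
    """Return (low cutoff, high cutoff) as hinge -/+ k * hinge spread."""
    lower_hinge, upper_hinge = tukey_hinges(values)
    spread = upper_hinge - lower_hinge
    return lower_hinge - k * spread, upper_hinge + k * spread


# Stand-in for the one extreme case; the plot only says it is >= 22.5.
PLACEHOLDER_EXTREME = 22.5

scores = [
    14.1, 14.2, 14.2, 14.3, 14.7, 14.8, 14.9,
    15.2, 15.3, 15.4, 15.4, 15.7, 15.8, 15.8, 15.8, 15.8,
    16.1, 16.2, 16.7, 16.8, 16.8,
    17.1, 17.3, 17.7, 17.8,
    18.0, 18.9,
    19.6,
    20.0, 20.6, 20.9,
    PLACEHOLDER_EXTREME,
]

low_cut, high_cut = outlier_cutoffs(scores)
flagged = [x for x in scores if x < low_cut or x > high_cut]
print(round(low_cut, 2), round(high_cut, 2), flagged)
```

Under these assumptions only the extreme case falls outside the cutoffs, which agrees with the single "Extremes" row in the plot. The same steps would need to be repeated for the above-median group, whose plot is not reproduced here.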

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality (average ACT score 1994, by above or below median loinc)

                                        Kolmogorov-Smirnov         Shapiro-Wilk
Group                                   Statistic   df   Sig.      Statistic   df   Sig.
below the median for low inc % 1993     .162        32   .032      .903        32   .007
above the median for low inc % 1993     .166        32   .025      .921        32   .023

 

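According to the tests of normality, both the Kolmogorov-Smirnov and Shapiro-Wilk statistics are significant (p < .05) in both groups, so the distributions depart from normality. The stem-and-leaf plot for the below-median group also shows a tail toward higher scores, which suggests positive skew. Therefore, for the transformation, we would select "Square Root", the usual first choice for moderate positive skew.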
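The decision rule for reading the Sig. values can be written out explicitly. This sketch uses the four significance values reported in the Tests of Normality output and assumes the conventional alpha of .05; the function name is illustrative. The null hypothesis of each test is that the scores are normally distributed, so a Sig. value below alpha counts as evidence against normality.

```python
ALPHA = 0.05  # assumed conventional significance level

# Sig. values copied from the Tests of Normality output.
normality_sig = {
    "below the median": {"Kolmogorov-Smirnov": 0.032, "Shapiro-Wilk": 0.007},
    "above the median": {"Kolmogorov-Smirnov": 0.025, "Shapiro-Wilk": 0.023},
}


def rejects_normality(p_value, alpha=ALPHA):
    """True when the test rejects the null hypothesis of normality."""
    return p_value < alpha


decisions = {
    group: {test: rejects_normality(p) for test, p in tests.items()}
    for group, tests in normality_sig.items()
}

for group, tests in decisions.items():
    for test, rejected in tests.items():
        verdict = "departs from normality" if rejected else "no evidence against normality"
        print(f"{group} / {test}: {verdict}")
```

All four values fall below .05, so both tests reject normality in both groups.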

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

Levene's test was not significant, meaning the variances of the two groups do not differ significantly. Therefore, we can assume homogeneity of variance.
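For readers who want to see what Levene's test measures, the statistic can be sketched by hand. This version uses absolute deviations from each group mean; the data in the usage lines are hypothetical and are not the assignment's ACT scores. The resulting W would be compared against an F distribution with (k - 1, N - k) degrees of freedom, a step left out here because it needs a statistics library.

```python
from statistics import mean


def levene_w(*groups):
    """Levene's W statistic using absolute deviations from group means.

    Assumes at least two groups and some within-group variation in the
    deviations (otherwise the denominator is zero).
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # Absolute deviation of every score from its own group mean.
    deviations = [[abs(x - mean(g)) for x in g] for g in groups]
    group_means = [mean(zs) for zs in deviations]
    grand_mean = mean([z for zs in deviations for z in zs])
    between = sum(
        len(zs) * (m - grand_mean) ** 2 for zs, m in zip(deviations, group_means)
    )
    within = sum(
        (z - m) ** 2 for zs, m in zip(deviations, group_means) for z in zs
    )
    return (n_total - k) / (k - 1) * between / within


# Hypothetical groups: same spread, then different spread.
print(levene_w([1, 2, 3], [11, 12, 13]))
print(levene_w([1, 2, 3], [10, 12, 14]))
```

A W near zero means the groups have similar spread. A large W with a small Sig. value would indicate unequal variances.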

2. Examination of the variable scienc93 indicates a substantial to severe positively skewed distribution. Transform this variable using the two most appropriate methods. After examining the distributions of the transformed variables, which transformation produced the best alteration?
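A comparison of this kind can be sketched as follows. The choice of a base-10 logarithm and an inverse (1/x) as the two candidates is an assumption based on common textbook advice for substantial and severe positive skew; the assignment does not name them. The `raw` values are hypothetical, not the scienc93 data, and the skewness function is the simple moment coefficient, which can differ slightly from the adjusted statistic a package reports. Both transformations need strictly positive values, and the inverse reverses the order of the scores.

```python
import math


def skewness(values):
    """Moment coefficient of skewness: m3 / m2**1.5."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((x - m) ** 2 for x in values) / n
    m3 = sum((x - m) ** 3 for x in values) / n
    return m3 / m2 ** 1.5


# Hypothetical positively skewed sample (all values > 0).
raw = [1, 2, 2, 3, 3, 3, 4, 5, 8, 20]

candidates = {
    "log10": [math.log10(x) for x in raw],
    "inverse": [1 / x for x in raw],
}

raw_skew = skewness(raw)
transformed_skew = {name: skewness(vals) for name, vals in candidates.items()}
# The "best" candidate here is the one whose skewness is closest to zero.
best = min(transformed_skew, key=lambda name: abs(transformed_skew[name]))

print(f"raw skewness: {raw_skew:.2f}")
for name, value in transformed_skew.items():
    print(f"{name} skewness: {value:.2f}")
print("closest to symmetric:", best)
```

In practice the skewness figures would be checked alongside histograms and normal Q-Q plots of each transformed variable before picking one.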


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd