Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Explain Geometric distribution, Geometric distribution: The probability di...

Geometric distribution: The probability distribution of the number of trials (N) before the first success in the sequence of Bernoulli trials. Specifically the distribution is can

General household survey, It is the survey which is carried out in Great Br...

It is the survey which is carried out in Great Britain on a continuous basis since 1971. About 100 000 households are included in this sample every year. The main goal of the surve

Explain multicentre study, Multicentre study : The clinical trial conducte...

Multicentre study : The clinical trial conducted simultaneously in the number of participating hospitals, with all centres following an agreed-upon study of the protocol and with

Double-dummy technique, It is the technique used in the clinical trials whe...

It is the technique used in the clinical trials when it is possible to make an acceptable place before an active treatment but not to make the two active treatments identical. In t

Continuous variable, Continuous variable : The measurement which is not res...

Continuous variable : The measurement which is not restricted to the particular values except in so far as this is constrained by the accuracy of measuring instrument. General exam

Hazard regression, Hazard regression is the procedure for modeling the haz...

Hazard regression is the procedure for modeling the hazard function which does not depend on the suppositions made in Cox's proportional hazards model, namely that the log-hazard

Double sampling, The procedure in which initially the sample of subjects is...

The procedure in which initially the sample of subjects is selected for generating the auxillary information only, and then the second sample is selected in which the variable of i

Intention-to-treat analysis, Intention-to-treat analysis is the process in...

Intention-to-treat analysis is the process in which all the patients randomly allocated to a treatment in the clinical trial are analyzed together as representing that particular

Decision tree, The graphic representation of the alternatives in a decision...

The graphic representation of the alternatives in a decision making problem which summarizes all the possibilities foreseen by the decision maker. For instance, suppose we are give

Outliers - reasons for screening data, Outliers - Reasons for Screening Dat...

Outliers - Reasons for Screening Data Outliers are due to data entry errors, subject is not a member of the population that the sample is trying to represent, or the subject i

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd