Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Regression to the mean, Regression to the mean is the procedure first note...

Regression to the mean is the procedure first noted by Sir Francis Galton that 'each peculiarity in man is shared by his kinsmen, but on average to the less degree.' Hence the ten

Generalized additive models, Models which make use of the smoothing techniq...

Models which make use of the smoothing techniques such as locally weighted regression to identify and represent the possible non-linear relationships between the explanatory and th

Business Statistic HW., Hello , I have a business statistic HW that is due ...

Hello , I have a business statistic HW that is due after 23 hours exactly for now . I need full and details answers please , plus they must be in a done and typed in a word or exce

Bivariate boxplot, Bivariate boxplot : A bivariate analogue of boxplot in w...

Bivariate boxplot : A bivariate analogue of boxplot in which the inner area contains 50%of the data, and a 'fence' helps to identify the potential outliers. Robust methods or techn

Fan-spread model, This term sometimes is applied to the model for explainin...

This term sometimes is applied to the model for explaining the differences found between naturally happening groups which are greater than those observed on some previous occasion;

Relative poverty statistics, Relative poverty statistics is the statistics...

Relative poverty statistics is the statistics on the properties of populations falling below given fractions of average income which play a central role in debate of poverty. The

Disease clusters, An unusual aggregation of the health events, real or perc...

An unusual aggregation of the health events, real or perceived. The events might be grouped in the particular region or in some short period of time, or they might happen among the

Efficiency, This term applied in the context of comparing the different met...

This term applied in the context of comparing the different methods and techniques of estimating the same parameter; the estimate with the lowest variance being regarded as the mos

Gauss markov theorem, This is the theorem which states that if the error te...

This is the theorem which states that if the error terms in a multiple regression have the same variance and are not corrected, then the estimators of the parameters in the model p

Collapsing categories, Collapsing categories : A procedure generally applie...

Collapsing categories : A procedure generally applied to contingency tables in which the two or more row or column categories are combined, in number of cases so as to yield the re

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd