Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Forecasting, Briefly explain the importance of forecasting for managers?

Briefly explain the importance of forecasting for managers?

Counter arguments, Standardise the following arguments, which involve count...

Standardise the following arguments, which involve counter-arguments Some educators have argued that the increasing use of the internet by children and teenagers will have a be

Gambling, It is the art of attempting to exchange something quite small and...

It is the art of attempting to exchange something quite small and certain, for something which are large and uncertain. Gambling is big business; in the US, for instance, it is at

Describe lorenz curve., Lorenz curve : Essentially the graphical representa...

Lorenz curve : Essentially the graphical representation of cumulative distribution of the variable, most often used for the income. If the risks of disease are not monotonically in

Define matching coefficient, Matching coefficient is a similarity coeffici...

Matching coefficient is a similarity coefficient for data consisting of the number of binary variables which is often used in cluster analysis. It can be given as follows    he

Explain regression through the origin, Regression through the origin : In s...

Regression through the origin : In some of the situations a relationship between the two variables estimated by the regression analysis is expected to pass by the origin because th

Quantile regression, Quantile regression is an extension of the classical ...

Quantile regression is an extension of the classical least squares from estimation of the conditional mean models to the estimation of the variety of models for many conditional q

Epidemic curve, The plot of the number of cases of the disease against the ...

The plot of the number of cases of the disease against the time period. A large and sudden increase corresponds to an epidemic. The example of this is shown in the figure drawn bel

Interior analysis, Interior analysis is the  term now and again applied to...

Interior analysis is the  term now and again applied to analysis carried out on the fitted model in regression problem. The basic target of such analyses is the identification of

Probability., 5. Packages from a machine a normally distributed with a mean...

5. Packages from a machine a normally distributed with a mean 200g and its standard deviation 2grams. Find the probability that a package from the machine weighs a) Less than

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd