Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Discriminant analysis, A term which covers the large number of techniques f...

A term which covers the large number of techniques for the analysis of the multivariate data which have in common the aim to assess whether or not the set of variables distinguish

Statistical & Quantitative Methods , Given: There are 4 jobs and 4 persons...

Given: There are 4 jobs and 4 persons. The cost incurred for each person and each job is as follows: Persons Job 1 Job 2 Job 3 Job 4 A 10 9 21 11 B 15 12 25 17 C 12 10 20 12 D 17

Probability distribution of the net present value, Suppose that $4 million ...

Suppose that $4 million is available for investment in three projects.  The probability distribution of the net present value earned from each project depends on how much is invest

Ecme algorithm, The Expectation/Conditional Maximization Either algorithm w...

The Expectation/Conditional Maximization Either algorithm which is the generalization of ECM algorithm attained by replacing some of the CM-steps of ECM which maximize the constrai

Probability, Modern hotels and certain establishments make use of an electr...

Modern hotels and certain establishments make use of an electronic door lock system. To open a door an electronic card is inserted into a slot. A green light indicates that the doo

Describe longini koopman model, Longini Koopman model : In epidemiology the...

Longini Koopman model : In epidemiology the model for primary and secondary infection, based on the classification of the extra-binomial variation in an infection rate which might

Comprehensive report writing assignment help, Hamilton County judges try th...

Hamilton County judges try thousands of cases per year. In an overwhelming majority of the cases disposed, the verdict stands as rendered. However, some cases are appeale

Finite mixture distribution, The probability distribution which is a linear...

The probability distribution which is a linear function of the number of component probability distributions. This type of distributions is used to model the populations thought to

Hazard plotting, Hazard plotting  is based on the hazard function of a dist...

Hazard plotting  is based on the hazard function of a distribution, this procedure gives estimates of distribution parameters, the proportion of units failing by the given time per

Explain time series, Time series : The values of a variable recorded, gener...

Time series : The values of a variable recorded, generally at a regular interval, over the long period of time. The observed movement and fluctuations of several such series are

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd