Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Time series, relevancy of time series in business management

relevancy of time series in business management

Ecm algorithm, This is extension of the EM algorithm which typically conver...

This is extension of the EM algorithm which typically converges more slowly than EM in terms of the iterations but can be much faster in the whole computer time. The general idea o

Explain Geometric distribution, Geometric distribution: The probability di...

Geometric distribution: The probability distribution of the number of trials (N) before the first success in the sequence of Bernoulli trials. Specifically the distribution is can

Cochrane collaboration, Cochrane collaboration : An international network o...

Cochrane collaboration : An international network of the individuals committed to preparing , maintaining and disseminating the systematic reviews of the effects of the health care

Codominance, Codominance : The relationship between genotype at the locus a...

Codominance : The relationship between genotype at the locus and a phenotype to which it in?uences. If an individuals with heterozygote (such as, AB) genotype is phenotypically dif

F-test, A test for equality of the variances of the two populations having ...

A test for equality of the variances of the two populations having normal distributions, based on the ratio of the variances of the sample of observations taken from each. Most fre

Classification and regression tree technique (cart), Classification and reg...

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory va

Cure models, Models for the analysis of the survival times, or the time to ...

Models for the analysis of the survival times, or the time to event, data in which it is expected that a fraction of the subjects will not experience the event of interest. In a cl

Math, A statewide survey of 1,706 California adults’ residents include the ...

A statewide survey of 1,706 California adults’ residents include the following question: would you favor or oppose providing a path to citizenship for illegal immigrants in the U.S

Direct edacyclic graph, Formal graphical representation of the "causal diag...

Formal graphical representation of the "causal diagrams" or the "path diagrams" where the  relationships are directed but acyclic (that is no feedback relations allowed). Plays an

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd