Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Intention-to-treat analysis, Intention-to-treat analysis is the process in...

Intention-to-treat analysis is the process in which all the patients randomly allocated to a treatment in the clinical trial are analyzed together as representing that particular

Cohort component method, Cohort component method : A broadly used method or...

Cohort component method : A broadly used method or technique of forecasting the age- and sex-speci?c population to the upcoming years, in which the initial population is strati?ed

Cluster sampling, Cluster sampling : A method or technique of sampling in w...

Cluster sampling : A method or technique of sampling in which the members of the population are arranged in groups (called as 'clusters'). A number of clusters are selected at the

Construct the de bruijn digraph, A directed graph is simple if each ordered...

A directed graph is simple if each ordered pair of vertices is the head and tail of at most one edge; one loop may be present at each vertex. For each n ≥ 1, prove or disprove the

Institutional surveys, Institutional surveys are the surveys in which the ...

Institutional surveys are the surveys in which the primary sampling units are the institutions, for instance, hospitals. Within each of the sampled institution, a sample of the pa

Data theory, Data theory is anxious with how observations are transformed i...

Data theory is anxious with how observations are transformed into data which can be analyzed. Data are thus viewed as the theory laden in the sense that the observations can be giv

Combine standard deviation, what is the combine standard deviation height f...

what is the combine standard deviation height from the follwing

Non parametric maximum likelihood (npml), Non parametric maximum likelihood...

Non parametric maximum likelihood (NPML) is a likelihood approach which does not need the specification of the full parametric family for the data. Usually, the non parametric max

Biplots, Biplots: It is the multivariate analogue of the scatter plots, wh...

Biplots: It is the multivariate analogue of the scatter plots, which estimates the multivariate distribution of the sample in a few dimensions, typically two and superimpose on th

Graph theory, Why Graph theory? It is the branch of mathematics concerned w...

Why Graph theory? It is the branch of mathematics concerned with the properties of sets of points (vertices or nodes) some of which are connected by the lines known as the edges. A

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd