Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Dirichlet process mixture models, The nonparametric Bayesian inference appr...

The nonparametric Bayesian inference approach to using the finite mixture distributions for modelling data suspected of the containing distinct groups of observations; this approac

Gene environment interaction, The interplay of the genes and environment on...

The interplay of the genes and environment on, for instance, the risk of disease. The term represents the step away from the argument as to whether the nature or nurture is the pre

Describe martingale, Martingale: In the gambling context the term at first...

Martingale: In the gambling context the term at first referred to a system for recouping losses by doubling the stake after each loss has occured. The modern mathematical concept

Dorfman scheme, An approach to investigations designed to recognize a parti...

An approach to investigations designed to recognize a particular medical condition in the large population, usually by means of a blood test, which might result in the considerable

Interior analysis, Interior analysis is the  term now and again applied to...

Interior analysis is the  term now and again applied to analysis carried out on the fitted model in regression problem. The basic target of such analyses is the identification of

What is harris and stevens forecasting, Harris and Stevens forecasting is ...

Harris and Stevens forecasting is the method of making short term forecasts in the time series which is subject to abrupt changes in pattern and the transient effects. Instances o

Mosaic displays, Mosaic displays  is the graphical display of the standardi...

Mosaic displays  is the graphical display of the standardized residuals from the fitting a log-linear model to a contingency table in which the colour and outline of the mosaic's '

Back-projection, Back-projection: A term most often applied to the procedu...

Back-projection: A term most often applied to the procedure for reconstructing plausible HIV incidence curves from the AIDS incidence data. The method or technique assumes that th

Doane''s rule, A rule for computing the number of classes to use while cons...

A rule for computing the number of classes to use while constructing a histogram and  can be given by   here n is the sample size and ^ γ is the estimate of kurtosis.

Non-randomized clinical trial, Non-randomized clinical trial is the clinic...

Non-randomized clinical trial is the clinical trial in which the series of consecutive patients receive a new treatment and those which respond (according to some of the pre-defin

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd