Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Network sampling, Network sampling is a sampling design in which the simpl...

Network sampling is a sampling design in which the simple random sample or strati?ed sample of the sampling units is made and all observational units which are linked to any of th

Command-line options, Command-Line options Compression: C++:  ./comp...

Command-Line options Compression: C++:  ./compress  -f  myfile.txt  [-o  myfile.hzip  -s Java:  sh  compress.sh  -f  myfile.txt  [-o  myfile.hzip  -s] Decompression:

Describe non linear model, Non linear model : A model which is non-linear i...

Non linear model : A model which is non-linear in the parameters, for instance are   Some such type of models can be converted into the linear models by linearization (the s

Locally weighted regression, Locally weighted regression  is the method of ...

Locally weighted regression  is the method of regression analysis in which the polynomials of degree one (linear) or two (quadratic) are used to approximate regression function in

Ecological fallacy, The term used when the aggregated data (for instance, a...

The term used when the aggregated data (for instance, aggregated over different areas) are analysed and the results supposed to apply to the relationships at the individual level.

Tests for heteroscedasticity, The Null Hypothesis - H0: There is no heteros...

The Null Hypothesis - H0: There is no heteroscedasticity i.e. β 1 = 0 The Alternative Hypothesis - H1:  There is heteroscedasticity i.e. β 1 0 Reject H0 if nR2 > MTB >

Explain knox''s test, Knox's tests: These tests designed to detect any ten...

Knox's tests: These tests designed to detect any tendency for the patients with a particular disease to form the disease cluster in time and space. The tests are relied on a two-b

frequentist inference, The approach to statistics based on a frequency vie...

The approach to statistics based on a frequency view of probability in which it is supposed that it is possible to consider an in?nite sequence of the independent repetitions of th

Baddeley''smetric, Baddeley'smetric : A manner of measuring the 'error' in ...

Baddeley'smetric : A manner of measuring the 'error' in the image processing technique or method. The metric is derived using the fundamental theory from the stochastic geometry an

Degenerate distributions, The special cases of the probability distribution...

The special cases of the probability distributions in which the random variable's distribution is concentrated at one point only. For instance, a discrete uniform distribution when

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd