Calculating cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medloinc and act94?

b. Is there a severe split in frequencies between groups?

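According to the descriptive analysis, no severe split is detected: each group contains 32 schools (the df reported for each group in the Tests of Normality output), an even 50/50 division. This is also reflected in the skewness of the grouping variable, which is below .5.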

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for medloinc = below the median for low inc % 1993

 Frequency    Stem &  Leaf
     7.00       14 .  1223789
     9.00       15 .  234478888
     5.00       16 .  12788
     4.00       17 .  1378
     2.00       18 .  09
     1.00       19 .  6
     3.00       20 .  069
     1.00 Extremes    (>=22.5)

 Stem width:  1.0
 Each leaf:   1 case(s)
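For question c, one common way to arrive at outlier cutoffs is Tukey's rule: 1.5 times the hinge spread beyond each hinge. The sketch below applies that rule to the leaf values read off the stem-and-leaf plot above. It is an illustration under stated assumptions, not the procedure the assignment necessarily expects: the leaves are rounded to one decimal, the single extreme case is entered at a placeholder of 22.5 because its exact value is not shown, and the helper name tukey_hinges is introduced here for illustration.

```python
import statistics

# Values read off the stem-and-leaf plot (stem width 1.0, one case per leaf).
# The plot keeps one decimal digit per case, so these are approximations.
known_scores = [
    14.1, 14.2, 14.2, 14.3, 14.7, 14.8, 14.9,
    15.2, 15.3, 15.4, 15.4, 15.7, 15.8, 15.8, 15.8, 15.8,
    16.1, 16.2, 16.7, 16.8, 16.8,
    17.1, 17.3, 17.7, 17.8,
    18.0, 18.9,
    19.6,
    20.0, 20.6, 20.9,
]

# ASSUMPTION: the single "Extremes (>=22.5)" case is entered at 22.5; its exact
# value is not shown in the plot. It is the largest case either way, so it does
# not change the hinges computed below.
EXTREME_PLACEHOLDER = 22.5
scores = sorted(known_scores + [EXTREME_PLACEHOLDER])


def tukey_hinges(sorted_values):
    """Return (lower, upper) hinges: medians of the lower and upper halves."""
    n = len(sorted_values)
    half = (n + 1) // 2  # both halves include the median when n is odd
    lower_half = sorted_values[:half]
    upper_half = sorted_values[n - half:]
    return statistics.median(lower_half), statistics.median(upper_half)


lower_hinge, upper_hinge = tukey_hinges(scores)
spread = upper_hinge - lower_hinge
lower_cutoff = lower_hinge - 1.5 * spread
upper_cutoff = upper_hinge + 1.5 * spread

# Cases falling outside the two cutoffs are candidates for deletion.
flagged = [x for x in scores if x < lower_cutoff or x > upper_cutoff]

print(f"hinges: {lower_hinge:.2f} and {upper_hinge:.2f}")
print(f"cutoffs: below {lower_cutoff:.2f} or above {upper_cutoff:.2f}")
print(f"flagged cases: {flagged}")
```

Under these assumptions the cutoffs come out near 11.5 and 21.5, and only the extreme case is flagged, which is consistent with the plot listing a single case at or above 22.5. The same steps would be repeated for the above-median group, whose plot is not reproduced here.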

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

                                                              Kolmogorov-Smirnov(a)       Shapiro-Wilk
                        above or below median loinc           Statistic   df   Sig.       Statistic   df   Sig.
average ACT score 1994  below the median for low inc % 1993   .162        32   .032       .903        32   .007
                        above the median for low inc % 1993   .166        32   .025       .921        32   .023

 

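According to the tests of normality, the distribution is not normal in either group: every Sig. value in the table is below .05 (Kolmogorov-Smirnov .032 and .025; Shapiro-Wilk .007 and .023). The stem-and-leaf plot for the below-median group also shows a positive skew. Therefore, for the transformation, we would select "Square Root."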
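As a check on how the Tests of Normality table is read, the decision rule can be sketched in a few lines. The Sig. values are copied from the table; the .05 significance level and the helper name rejects_normality are assumptions introduced here, since the source does not state an alpha.

```python
ALPHA = 0.05  # ASSUMPTION: conventional significance level, not stated in the source

# Sig. values copied from the Tests of Normality table (average ACT score 1994).
sig_values = {
    "below the median": {"Kolmogorov-Smirnov": 0.032, "Shapiro-Wilk": 0.007},
    "above the median": {"Kolmogorov-Smirnov": 0.025, "Shapiro-Wilk": 0.023},
}


def rejects_normality(p_value, alpha=ALPHA):
    """A Sig. value below alpha means the null hypothesis of normality is rejected."""
    return p_value < alpha


decisions = {
    group: {test: rejects_normality(p) for test, p in tests.items()}
    for group, tests in sig_values.items()
}

for group, tests in decisions.items():
    for test, rejected in tests.items():
        verdict = "departs from normality" if rejected else "no evidence against normality"
        print(f"{group:17s} {test:19s} {verdict}")
```

With these inputs every test rejects normality, so both groups are treated as non-normal before a transformation is considered.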

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

Levene's test was not significant, meaning the variances of the two groups do not differ significantly. Therefore, we can assume homogeneity of variance.

2. Examination of the variable scienc93 indicates a substantially to severely positively skewed distribution. Transform this variable using the two most appropriate methods. After examining the distribution of these transformed variables, which transformation produced the best alteration?
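The comparison asked for in question 2 can be sketched as follows. This assumes the common textbook guideline that a logarithmic transformation suits substantial positive skew and an inverse transformation suits severe positive skew. The scores in the sketch are hypothetical values invented for illustration and are not the scienc93 data, and the skewness helper uses the simple moment formula rather than the small-sample adjusted statistic that SPSS reports.

```python
import math


def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no small-sample correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5


# HYPOTHETICAL positively skewed scores; NOT the scienc93 data.
# All values are positive, which both transformations below require.
sample = [1, 2, 2, 3, 3, 3, 4, 5, 8, 15, 40]

transformed = {
    "log10": [math.log10(x) for x in sample],
    "inverse": [1 / x for x in sample],
}

results = {"raw": skewness(sample)}
results.update({name: skewness(vals) for name, vals in transformed.items()})

# The better transformation is the one whose skewness is closest to zero.
best = min(transformed, key=lambda name: abs(results[name]))

for name, value in results.items():
    print(f"{name:8s} skewness = {value:.3f}")
print(f"closest to symmetric: {best}")
```

For the real variable, the same comparison would be run on scienc93 itself and backed up with histograms and normal Q-Q plots of each transformed version, since skewness alone does not establish normality.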




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd