Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Ascertainment bias, Ascertainment bias : A feasible form of bias, particula...

Ascertainment bias : A feasible form of bias, particularly in the retrospective studies, which arises from the relationship between the exposure to the risk factor and the probabil

Sequencing problem, 2 jobs n machines,graphical method,how to determine wh...

2 jobs n machines,graphical method,how to determine which job should proceed first on each machine

Describe prior distribution, Prior distributions : The probability distribu...

Prior distributions : The probability distributions which summarize the information about a random variable or parameter known or supposed at a given time instant, prior to attaini

Logistic regression - computing log odds without probabiliti, Please help w...

Please help with following problem: : Let’s consider the logistic regression model, which we will refer to as Model 1, given by log(pi / [1-pi]) = 0.25 + 0.32*X1 + 0.70*X2 + 0.

Clustering, hello I have a dataset including both categorical & numerical v...

hello I have a dataset including both categorical & numerical variable for market segmentation.how can i cluster them via k-means in matlab? thank you

Bartlett''s test for variances, Bartlett's test for variances : A test for ...

Bartlett's test for variances : A test for equality of the variances of the number (k)of the populations. The test statistic can be given as follows   where s square is an

Distribution free methods, The statistical methods for estimation and infer...

The statistical methods for estimation and inference which are based on a function of sample observations, probability distribution of which does not rely upon a complete speci?cat

Chebyshev''s inequality, Chebyshev's inequality: A statement about the pro...

Chebyshev's inequality: A statement about the proportion of the observations which fall within some number of the standard deviations of the mean for any of the probability distri

Generalized method of moments (gmm), Generalized method of moments (gmm) is...

Generalized method of moments (gmm) is the estimation method popular in econometrics which generalizes the method of the moments estimator. Essentially same as what is known as the

Bioinformatics, Bioinformatics : Essentially the application of the informa...

Bioinformatics : Essentially the application of the information theory to biology to deal with the deluge of the information resulting from the advances in molecular biology. The m

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd