Diversity of data , Applied Statistics

Assignment Help:

The box plot displays the diversity of data for the totexp; the data ranges from 30 being the minimum value and 390 being the maximum value. The box plot is positively skewed at 1.87 as the whisker at the top of the box is longer than the whisker at the bottom of the box, thus showing that there are more values at the top end of the box. The lower quartile is basically the 25th percentile which is 70; the median is the 50th percentile which is 90 and the upper quartile is 75th percentile which is 120.

1388_box plot1.png

In relation to extreme data for totexp there are 47 outliers which have been identified on the box plot which deviate significantly from the rest of the distribution of the data:     

Outlier Value

Rows

200

457, 853, 1035, 1042, 1110, 1404, 1443

210

82, 565, 694, 1088, 1249, 1411, 1491

220

182, 657, 1451

230

97, 155, 476, 980

240

1145, 1300, 1435

250

386, 970, 995, 1285

260

680, 845, 1350

270

449, 1299, 1385, 1508

280

56, 1009

290

315, 1225

300

755

310

80

320

482, 1415

330

1198

360

77

390

1187


Related Discussions:- Diversity of data

#title., Features of index numbers

Features of index numbers

Bionomial, The Quality Manager of a battery manufacturing plant reviewed th...

The Quality Manager of a battery manufacturing plant reviewed the warranty records within his department and found that 4% of the low maintenance batteries produced at the plant ov

Optimal number of cluster, Try different numbers of clusters in your progra...

Try different numbers of clusters in your program (K=2...15) and build a plot that shows the dependency between number K and value of RSS function on the last iteration. What is th

Correlation, prove that coefficient of correlation lies between -1 and+1

prove that coefficient of correlation lies between -1 and+1

Sensitivity and Specificity tests, The prevalence of undetected diabetes in...

The prevalence of undetected diabetes in a population to be screened is approximately 1.5% and it is assumed that 10,000 persons will be screened. The screening test will measure

Geometric mean, Geometric Mean is defined as the n th root of the ...

Geometric Mean is defined as the n th root of the product of numbers to be averaged. The geometric mean of numbers X 1 , X 2 , X 3 .....X n is given as

Construct a cumulative percentage polygon, 1. For each of the following var...

1. For each of the following variables: major, graduate GPA, and height: a. Determine whether the variable is categorical or numerical. b. If the variable is numerical, deter

Multivariate analysis, Multivariate analysis involves a set of techniques t...

Multivariate analysis involves a set of techniques to analyse data sets on more than one variable. Many of these techniques are modern and often involve quite sophisticated use of

Standard cost method, Under the standard cost method which is also referred...

Under the standard cost method which is also referred as the standard cost method ,stock receipts are assigned a standard cost. Any variations between the actual cost and standard

Find the unbiased estimators for mean and variance matrix, Is the random ve...

Is the random vector (Trunk Space, Length, Turning diameter) of US car normally distributed? Why? If yes, find the unbiased estimators for the mean and variance matrix of (Trunk Sp

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd