Diversity of data , Applied Statistics

Assignment Help:

The box plot displays the diversity of data for the totexp; the data ranges from 30 being the minimum value and 390 being the maximum value. The box plot is positively skewed at 1.87 as the whisker at the top of the box is longer than the whisker at the bottom of the box, thus showing that there are more values at the top end of the box. The lower quartile is basically the 25th percentile which is 70; the median is the 50th percentile which is 90 and the upper quartile is 75th percentile which is 120.

1388_box plot1.png

In relation to extreme data for totexp there are 47 outliers which have been identified on the box plot which deviate significantly from the rest of the distribution of the data:     

Outlier Value

Rows

200

457, 853, 1035, 1042, 1110, 1404, 1443

210

82, 565, 694, 1088, 1249, 1411, 1491

220

182, 657, 1451

230

97, 155, 476, 980

240

1145, 1300, 1435

250

386, 970, 995, 1285

260

680, 845, 1350

270

449, 1299, 1385, 1508

280

56, 1009

290

315, 1225

300

755

310

80

320

482, 1415

330

1198

360

77

390

1187


Related Discussions:- Diversity of data

Standard gaussian random variable , You will recall the function pnorm() fr...

You will recall the function pnorm() from lectures. Using this, or otherwise, Dteremine the probability of a standard Gaussian random variable exceeding 1.3.  Using table(), or

Perform clustering of the unlabeled data set, Perform clustering of the unl...

Perform clustering of the unlabeled data set. You could use provided initial centroids set or generate your own. Also there could be considered next stopping criteria : - maxim

Simulation, Simulation When decisions are to be taken under conditions ...

Simulation When decisions are to be taken under conditions of uncertainty, simulation can be used. Simulation as a quantitative method requires the setting up of a mathematical

Write down the payoff matrix, Two individuals, player 1 and player 2, are  ...

Two individuals, player 1 and player 2, are  competing in an auction to obtain a valuable object. Each player bids in a sealed envelope, without knowing the bid of the other player

Simple linear regression, We are interested in assessing the effects of tem...

We are interested in assessing the effects of temperature (low, medium, and high) and technical configuration on the amount of waste output for a manufacturing plant. Suppose that

Median, The median, as the name suggests, is the middle value of a series a...

The median, as the name suggests, is the middle value of a series arranged in any of the orders of magnitude i.e. ascending or descending order. As distinct from the arithmetic

Find the distribution, The Elementary Teachers' Federation of Ontario make ...

The Elementary Teachers' Federation of Ontario make the following claim on their website as of February 13, 2013: For years, the Elementary Teachers' Federation of Ontario (ETFO

Canonical correlation analysis, Canonical correlation analysis (CC) allows ...

Canonical correlation analysis (CC) allows the investigation of the relationship between two ,sets of variables. For example, a sociologist may want to investigate the Relationship

Initial centroids data set, Find unlabeled data set test.txt and initial ...

Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ...

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd