Draw the box-plots for age and %fat

Reference no: EM131035621

Problem 1:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose that a hospital tested the age and body fat data for 18 randomly selected adults with the following results:

Age     23    23    27    27    39    41    47    49    50
%fat    9.5   26.5  7.8   17.8  31.4  25.9  27.4  27.2  31.2

Age     52    54    54    56    57    58    58    60    61
%fat    34.6  42.5  28.8  33.4  30.2  34.1  32.9  41.2  35.7

a. Draw the box-plots for age and %fat. Interpret the distribution of the data.
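
A box plot is drawn from an attribute's five-number summary (minimum, Q1, median, Q3, maximum). The sketch below computes that summary for both attributes. It assumes Python's standard `statistics` module with its "inclusive" quartile method; other quartile conventions give slightly different Q1 and Q3 values, and the helper name `five_number_summary` is purely illustrative.

```python
import statistics

age = [23, 23, 27, 27, 39, 41, 47, 49, 50, 52, 54, 54, 56, 57, 58, 58, 60, 61]
fat = [9.5, 26.5, 7.8, 17.8, 31.4, 25.9, 27.4, 27.2, 31.2,
       34.6, 42.5, 28.8, 33.4, 30.2, 34.1, 32.9, 41.2, 35.7]

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max).

    Assumption: quartiles use the 'inclusive' interpolation method.
    """
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)

age_summary = five_number_summary(age)
fat_summary = five_number_summary(fat)
print("age :", age_summary)
print("%fat:", fat_summary)
```

The box spans Q1 to Q3 with a line at the median; a common convention draws points lying more than 1.5 × IQR beyond the quartiles as individual outliers.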

b. Normalize the two attributes based on z-score normalization.
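
Z-score normalization replaces each value v with (v - mean) / std. A minimal sketch for the two attributes follows. It assumes the population standard deviation (dividing by n); using the sample standard deviation (n - 1) would shrink every score slightly. The function name `z_scores` is illustrative.

```python
import statistics

age = [23, 23, 27, 27, 39, 41, 47, 49, 50, 52, 54, 54, 56, 57, 58, 58, 60, 61]
fat = [9.5, 26.5, 7.8, 17.8, 31.4, 25.9, 27.4, 27.2, 31.2,
       34.6, 42.5, 28.8, 33.4, 30.2, 34.1, 32.9, 41.2, 35.7]

def z_scores(values):
    """Standardize values as (v - mean) / std."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # assumption: population std (divide by n)
    return [(v - mean) / std for v in values]

z_age = z_scores(age)
z_fat = z_scores(fat)

# Print the original pairs next to their standardized values.
for a, f, za, zf in zip(age, fat, z_age, z_fat):
    print(f"{a:>3} {f:>5} {za:>7.2f} {zf:>7.2f}")
```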

c. Regardless of the original ranges of the variables, normalization techniques transform the data into new ranges so that variables can be compared and used on the same scale. What are the value ranges of the following normalization methods? Explain your answer.

i. Min-max normalization

ii. Z-score normalization

iii. Normalization by decimal scaling.
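
To get a feel for the three methods in part (c), the sketch below applies each one to the age attribute and prints the resulting minimum and maximum. It is an illustration under stated assumptions, not a full answer: min-max targets [0, 1] by default, z-score uses the population standard deviation, and decimal scaling divides by the smallest power of ten that brings every absolute value below 1. All function names are illustrative.

```python
import statistics

age = [23, 23, 27, 27, 39, 41, 47, 49, 50, 52, 54, 54, 56, 57, 58, 58, 60, 61]

def min_max(values, new_min=0.0, new_max=1.0):
    """Map values linearly onto [new_min, new_max]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]

def z_score(values):
    """Center on the mean and scale by the population std (an assumption)."""
    mean, std = statistics.fmean(values), statistics.pstdev(values)
    return [(v - mean) / std for v in values]

def decimal_scaling(values):
    """Divide by 10**j, with j the smallest integer giving max |v'| < 1."""
    j = 0
    while max(abs(v) for v in values) / 10 ** j >= 1:
        j += 1
    return [v / 10 ** j for v in values]

results = {
    "min-max": min_max(age),
    "z-score": z_score(age),
    "decimal scaling": decimal_scaling(age),
}
for name, scaled in results.items():
    print(f"{name:>15}: min={min(scaled):.3f} max={max(scaled):.3f}")
```

Min-max output always lands exactly on the chosen target interval, decimal scaling keeps every absolute value below 1, and z-scores have no fixed bounds because they depend on how far the extreme values sit from the mean.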

d. Draw a scatter-plot based on the two variables and interpret the relationship between the two variables.

e. Calculate the correlation coefficient. Are these two attributes positively or negatively correlated? Compute the covariance matrix.
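
Part (e) can be checked numerically. The sketch below computes Pearson's correlation coefficient and a 2 × 2 covariance matrix for age and %fat. It assumes the sample covariance (dividing by n - 1); the correlation coefficient is the same under either convention because the divisor cancels. The function names are illustrative.

```python
import math
import statistics

age = [23, 23, 27, 27, 39, 41, 47, 49, 50, 52, 54, 54, 56, 57, 58, 58, 60, 61]
fat = [9.5, 26.5, 7.8, 17.8, 31.4, 25.9, 27.4, 27.2, 31.2,
       34.6, 42.5, 28.8, 33.4, 30.2, 34.1, 32.9, 41.2, 35.7]

def covariance(xs, ys):
    """Sample covariance (assumption: divide by n - 1)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)

def pearson_r(xs, ys):
    """Correlation = cov(x, y) / (std(x) * std(y))."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))

r = pearson_r(age, fat)
cov_matrix = [
    [covariance(age, age), covariance(age, fat)],
    [covariance(fat, age), covariance(fat, fat)],
]

print(f"r = {r:.3f}")
for row in cov_matrix:
    print([round(c, 2) for c in row])
```

A positive r means %fat tends to rise with age; the diagonal of the covariance matrix holds each attribute's variance and the off-diagonal entries hold their covariance.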

Problem 2:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose a group of 12 sales price records has been sorted as follows:

5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215

Partition them into bins by each of the following methods, smooth the data, and interpret the results:

a. equal-depth partitioning with 3 values per bin

b. equal-width partitioning with 3 bins
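
The two partitioning schemes can be sketched as follows. The problem does not say which smoothing technique to use, so this example assumes smoothing by bin means; smoothing by bin medians or bin boundaries would be equally valid choices. It also assumes equal-width bins are half-open on the right, except the last bin, which includes the maximum. All function names are illustrative.

```python
import statistics

prices = [5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215]

def equal_depth_bins(values, depth):
    """Split sorted values into consecutive bins of `depth` items each."""
    ordered = sorted(values)
    return [ordered[i:i + depth] for i in range(0, len(ordered), depth)]

def equal_width_bins(values, k):
    """Split the range [min, max] into k intervals of equal width."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    bins = [[] for _ in range(k)]
    for v in sorted(values):
        # The maximum would index bin k, so clamp it into the last bin.
        index = min(int((v - lo) / width), k - 1)
        bins[index].append(v)
    return bins

def smooth_by_means(bins):
    """Replace every value with the mean of its bin (assumed technique)."""
    return [[round(statistics.fmean(b), 2)] * len(b) for b in bins if b]

depth_bins = equal_depth_bins(prices, 3)
width_bins = equal_width_bins(prices, 3)

print("equal-depth:", depth_bins)
print("  smoothed :", smooth_by_means(depth_bins))
print("equal-width:", width_bins)
print("  smoothed :", smooth_by_means(width_bins))
```

With this data the equal-width interval is (215 - 5) / 3 = 70, so most records fall into the first bin, which is worth noting when interpreting the results.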

Problem 3 a) Figure 1 illustrates the plots for some data with respect to two variables: balance and employment status. If you have to select one of these two variables to classify the data into two classes (circle class and plus class), which one would you select? Is there any approach/criterion that you can use to support your selection? Explain your answer.

Figure 1: Data Plots for Problem 3.a.

b) For the data in Figure 2, with three variables and two classes, which variable would you choose to classify the data? Show all the steps of your calculations and interpret your answer.

Figure 2: Data for Problem 3.b
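
One criterion often used to compare candidate variables for classification is information gain, the reduction in class entropy after splitting on a variable. The problem does not name a criterion, so this choice is an assumption. Because the figures are not reproduced here, the class counts below are hypothetical and serve only to show the calculation; they are not the data from Figure 1 or Figure 2.

```python
import math

def entropy(counts):
    """Shannon entropy (in bits) of a list of class counts."""
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)

def information_gain(parent_counts, partitions):
    """Parent entropy minus the size-weighted entropy of the partitions."""
    total = sum(parent_counts)
    weighted = sum(sum(p) / total * entropy(p) for p in partitions)
    return entropy(parent_counts) - weighted

# HYPOTHETICAL counts as [circle, plus]; not taken from the figures.
parent = [5, 5]
split_on_variable_a = [[5, 0], [0, 5]]  # separates the classes perfectly
split_on_variable_b = [[3, 2], [2, 3]]  # leaves the classes mixed

gain_a = information_gain(parent, split_on_variable_a)
gain_b = information_gain(parent, split_on_variable_b)
print(f"gain(A) = {gain_a:.3f}, gain(B) = {gain_b:.3f}")
```

Under this criterion the variable with the larger gain would be preferred; the Gini index is a common alternative that is applied in the same way.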

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd