Draw the box-plots for age and and fat

Assignment Help Engineering Mathematics
Reference no: EM131035621

Problem 1:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose that a hospital tested the age and body fat data for 18 randomly selected adults with the following results:

Age

23

23

27

27

39

41

47

49

50

%fat

9.5

26.5

7.8

17.8

31.4

25.9

27.4

27.2

31.2

Age

52

54

54

56

57

58

58

60

61

%fat

34.6

42.5

28.8

33.4

30.2

34.1

32.9

41.2

35.7

a. Draw the box-plots for age and %fat.  Interpret the distribution of the data

b. Normalize the two attributes based on z-score normalization.

c. Regardless of the original ranges of the variables, normalization techniques transform the data into new ranges that allow to compare and use variables on the same scales. What are the values ranges of the following normalization methods? Explain your answer.

i. Min-max normalization

ii. Z-score normalization

iii. Normalization by decimal scaling.

d. Draw a scatter-plot based on the two variables and interpret the relationship between the two variables.

e. Calculate the correlation coefficient. Are these two attributes positively or negatively correlated? Compute the covariance matrix.

Problem 2:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose a group of 12 sales price records has been sorted as follows:

5, 10, 11, 13, 15, 35, 50,55,72,92,204,215

Partition them into bins by each of the following method, smooth the data and interpret the results:

a. equal-depth partitioning with 3 values per bin

b. equal-width partitioning with 3 bins

Problem 3 a) Figure 1 illustrates the plots for some data with respect to two variables: balance and employment status. If you have to select one of these two variables to classify the data into two classes (circle class and plus class), which one would you select? Is there any approach/criterion that you can use to support your selection? Explain your answer.

822_Figure.png

Figure 1: Data Plots for Problem 3.a.

b) For the data in Figure 2 with three variables and two classes: which variable you would choose to classify the data? Show all the steps of your calculations and interpret your answer.

139_Figure1.png

Figure 2: Data for Problem 3.b

Reference no: EM131035621

Questions Cloud

Recommend for the construction of this system : Which design strategy would you recommend for the construction of this system? Why?
Should the boom be fully retracted : The front wheels are free to roll. Do an equilibrium analysis to explain your answer.
Successively higher levels of debt : If a firm goes from zero debt to successively higher levels of debt, why would you expect its stock price to rise first, then hit a peak, and then begin to decline?
Has the researcher communicated clearly and fully : Did the article make an original contribution to the existing body of knowledge? Was the theoretical framework for the study adequate and appropriate?
Draw the box-plots for age and and fat : Draw the box-plots for age and %fat.  Interpret the distribution of the data and Normalize the two attributes based on z-score normalization
Dfs-files-directories and shares : From the first e-Activity, examine the key benefits afforded to an organization that utilizes Distributed File System (DFS) technologies.
Debt level that maximizes its stock price : Is the debt level that maximizes a firm's expected EPS the same as the debt level that maximizes its stock price? Explain.
Calculate profit margin and gross profit rate for company : Calculate the Profit Margin, and Gross profit rate for the company. Be sure to provide the formula you are using, show your calculations, and discuss your findings/results.
Analyzes the pros and cons of each : Your company has decided to open up a new comprehensive resort on a tropical island. Your manager (me) is working with corporate senior managers to determine how best to structure this new enterprise. analyzes the pros and cons of each, and describes..

Reviews

Write a Review

Engineering Mathematics Questions & Answers

  Prime number theorem

Dirichlet series

  Proof of bolzano-weierstrass to prove the intermediate value

Every convergent sequence contains either an increasing, or a decreasing subsequence.

  Antisymmetric relations

How many relations on A are both symmetric and antisymmetric?

  Distributed random variables

Daily Airlines fies from Amsterdam to London every day. The price of a ticket for this extremely popular flight route is $75. The aircraft has a passenger capacity of 150.

  Prepare a system of equations

How much money will Dave and Jane raise for charity

  Managing ashland multicomm services

This question is asking you to compare the likelihood of your getting 4 or more subscribers in a sample of 50 when the probability of a subscription has risen from 0.02 to 0.06.]  Talk about the comparison of probabilities in your explanation.

  Skew-symmetric matrices

Skew-symmetric matrices

  Type of taxes and rates in spokane wa

Describe the different type of taxes and their rates in Spokane WA.

  Stratified random sample

Suppose that in the four player game, the person who rolls the smallest number pays $5.00 to the person who rolls the largest number. Calculate each player's expected gain after one round.

  Find the probability density function

Find the probability density function.

  Develop a new linear programming for an aggregate production

Linear programming applied to Aggregate Production Planning of Flat Screen Monitor

  Discrete-time model for an economy

Discrete-time model for an economy

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd