Reference no: EM133665664
Assignment: Data Science
A. The elements in the data set are food items of various sizes, ranging from a teaspoon of cinnamon to an entire carrot cake.
1. Sort the data set by the saturated fat (saturated_fat) and produce a listing of the five food items highest in saturated fat.
2. Comment on the validity of comparing food items of different sizes.
B. Derive a new variable, saturated_fat_per_gram, by dividing the amount of saturated fat by the weight in grams.
1. Sort the data set by saturated_fat_per_gram and produce a listing of the five food items highest in saturated fat per gram.
2. Which food has the most saturated fat per gram?
C. Derive a new variable, cholesterol_per_gram.
1. Sort the data set by cholesterol_per_gram and produce a listing of the five food items highest in cholesterol fat per gram.
2. Which food has the most cholesterol fat per gram?
Solve the following problems D to F, work with the adult_ch3_training data set. The response is whether income exceeds $50,000. Use Python.
D. Add a record index field to the data set.
E. Determine whether any outliers exist for the education field.
F. Do the following for the age field.
1. Standardize the variable.
2. Identify how many outliers there are and identify the most extreme outlier.