Reference no: EM133622109
Assignment 1
1) use irs data set
The Iris dataset was introduced by R. A. Fisher as an example for discriminant analysis. This dataset comprises measurements of four features-sepal width, sepal length, petal width, and petal length-pertaining to three distinct species of Iris flowers.
The Iris dataset is available on Canvas.
a) What is the average sepal width for each species?
b) Normalize the sepal width. Copy and paste the first 5 rows.
c) What is the correlation between sepal width and petal width?
2-use dealership data
The Applewood Auto Group is an ownership group that includes four dealerships. The group sells a wide range of vehicles, including the inexpensive but popular Korean brands Kia and Hyundai, BMW and Volvo sedans and luxury SUVs, and a full line of Ford and Chevrolet cars and trucks.
Ms. Kathryn Ball is a member of the senior management team at Applewood Auto Group. She is responsible for tracking and analyzing vehicle sales and the profitability of those vehicles.
Every month, Ms. Ball collects data from each of the four dealerships and enters them into an Excel spreadsheet.
Last month the Applewood Auto Group sold 180 vehicles at the four dealerships (please see Dealership dataset).
The variables collected include:
Age-the age of the buyer at the time of the purchase.
Profit-the amount earned by the dealership on the sale of each vehicle.
Location-the dealership where the vehicle was purchased.
Vehicle type-SUV, sedan, compact, hybrid, or truck.
Previous-the number of vehicles previously purchased at any of the four Applewood dealerships by the consumer.
Question 2a: provide summary statistics of the data set (only numerical variables):
Question 2b: Create a histogram for profit:
Question 2c: Compare profit of each location using box plots. Summarize your findings (include the box plots here).
Question 2d: based on a scatter plot of Profit and Age, is there a relationship between age of buyer and profit? Include your scatter plot.
Attachment:- Assignment data set.rar