Reference no: EM133663807
Assignment
Use pandas to read and work with data from an Excel or CSV file. Use your approved dataset for this assignment.
Requirements I
Import the pandas library
Import the numpy library
Import the openpyxl library
Create a data frame and load the Excel or CSV file
Add the optional settings
pd.set_option("display.max_columns", None)
pd.set_option("display.max_rows", None)
pd.set_option("display.max_colwidth", None)
pd.set_option("display.expand_frame_repr", False)
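A minimal sketch of Requirements I, assuming a small made-up CSV in place of the approved dataset. The column names (name, price, qty) and the in-memory text buffer are hypothetical stand-ins; with a real file, pass its path to pd.read_csv or pd.read_excel instead.

```python
import io

import numpy as np
import pandas as pd

try:
    import openpyxl  # only needed when reading .xlsx files with pd.read_excel
except ImportError:
    openpyxl = None  # CSV loading below still works without it

# Hypothetical sample data standing in for the approved dataset.
csv_text = """name,price,qty
Ana,10.0,1
Ben,12.5,2
Cy,,3
"""

# With a real file this would be pd.read_csv("your_file.csv")
# or pd.read_excel("your_file.xlsx", engine="openpyxl").
df = pd.read_csv(io.StringIO(csv_text))

# Optional display settings: show every column and row, never truncate cells,
# and keep wide frames on one line instead of wrapping.
pd.set_option("display.max_columns", None)
pd.set_option("display.max_rows", None)
pd.set_option("display.max_colwidth", None)
pd.set_option("display.expand_frame_repr", False)

print(df)
```

The empty price in the last row is read as NaN, which is why numpy is commonly imported alongside pandas.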
Requirements II
Print all the columns in the data frame
Print a statistic summary of the data frame
Print the first 4 records in the data frame
Print the last 7 records in the data frame
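The four steps in Requirements II might look like the sketch below. The ten-row frame and its column names are invented for illustration; substitute the data frame loaded from your own file.

```python
import pandas as pd

# Hypothetical ten-row frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "item": [f"item_{i}" for i in range(10)],
        "price": [float(i) for i in range(10)],
    }
)

print(df.columns)     # all column labels
print(df.describe())  # count, mean, std, min, quartiles, max for numeric columns
print(df.head(4))     # first 4 records
print(df.tail(7))     # last 7 records
```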
Requirements III
Print the index in the data frame
Print the data types in the data frame
Selecting Data with brackets []
Print any column in the data frame
Print the 3rd data element using an index for your selected column
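One way to approach Requirements III is sketched here, again with invented data. Note the assumption in the last step: with the default integer index, label 2 is also the third position, so bracket indexing and .iloc agree.

```python
import pandas as pd

# Hypothetical frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "city": ["Austin", "Boston", "Chicago", "Denver"],
        "population": [10, 20, 30, 40],
    }
)

print(df.index)   # row labels (a RangeIndex by default)
print(df.dtypes)  # data type of each column

# Selecting a column with brackets returns a Series.
population = df["population"]
print(population)

# Third element: label 2 on the default index. Use .iloc[2] when the
# index is not the default 0, 1, 2, ... sequence.
third = df["population"][2]
print(third)
```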
Requirements IV
Create a new numeric column (default: zero)
Create a temp dataset with your main dataset
Using a for loop:
Using a function that takes a parameter:
Perform arithmetic with one of your numeric fields
Store the result in your new numeric column
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)
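The sketch below walks through Requirements IV under stated assumptions: the data, the price_with_tax column, the add_tax function, and its 10% rate are all hypothetical, and "YourName" in the file name is a placeholder for your own name.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "item": ["a", "b", "c", "d", "e", "f"],
        "price": [10.0, 20.0, 30.0, 40.0, 50.0, 60.0],
    }
)

# New numeric column, defaulting to zero.
df["price_with_tax"] = 0.0

# Temp dataset: a copy, so the main dataset is left unchanged.
temp_df = df.copy()


def add_tax(price, rate=0.10):
    """Return the price with tax added (the rate is an illustrative assumption)."""
    return round(price * (1 + rate), 2)


# For loop: apply the function row by row and store the result.
for i in temp_df.index:
    temp_df.loc[i, "price_with_tax"] = add_tax(temp_df.loc[i, "price"])

print(temp_df.head(5))

# Written to the system temp directory here; any folder works.
out_path = Path(tempfile.gettempdir()) / "YourName_requirements_4.csv"
temp_df.to_csv(out_path, index=False)
```

A vectorized expression such as temp_df["price"] * 1.10 would give the same numbers faster, but the explicit loop matches what the requirement asks for.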
Requirements V
Create a filter using two of your columns from your dataset
Create a temp dataset with your main dataset and filter
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)
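A filter on two columns, as Requirements V describes, can be sketched with a boolean mask. The columns and thresholds here are made up; "YourName" is a placeholder.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "item": ["a", "b", "c", "d", "e", "f"],
        "price": [5, 20, 30, 12, 50, 18],
        "qty": [1, 3, 1, 4, 2, 2],
    }
)

# Filter on two columns; each condition needs its own parentheses
# because & binds more tightly than the comparisons.
mask = (df["price"] > 15) & (df["qty"] >= 2)

# Temp dataset: the rows of the main dataset that pass the filter.
temp_df = df[mask].copy()

print(temp_df.head(5))

out_path = Path(tempfile.gettempdir()) / "YourName_requirements_5.csv"
temp_df.to_csv(out_path, index=False)
```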
Requirements VI
Create a new numeric column (default: zero)
Create another function that takes a parameter to perform arithmetic
Append a row to the main dataset; use the function to populate the numeric column created earlier
Save the data into a csv file (Make sure your name is in the file title)
Open the file and highlight the row you added
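A possible shape for Requirements VI follows. The apply_discount function, the discounted_price column, and the new row's values are hypothetical. DataFrame.append was removed in pandas 2.0, so the sketch uses pd.concat to add the row. Highlighting the row is a manual step in a spreadsheet application; the code only shows which row was added.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "item": ["apple", "kiwi", "plum"],
        "price": [2.0, 4.0, 6.0],
    }
)

# New numeric column, defaulting to zero.
df["discounted_price"] = 0.0


def apply_discount(price, percent):
    """Return the price reduced by the given percentage."""
    return price * (1 - percent / 100)


# Build the new row, using the function to fill the numeric column.
new_row = pd.DataFrame(
    [
        {
            "item": "pear",
            "price": 8.0,
            "discounted_price": apply_discount(8.0, 25),
        }
    ]
)

# Append the row to the main dataset.
df = pd.concat([df, new_row], ignore_index=True)

out_path = Path(tempfile.gettempdir()) / "YourName_requirements_6.csv"
df.to_csv(out_path, index=False)

# The added row is the last one; this is the row to highlight in the file.
print(df.tail(1))
```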
Requirements VII
Create a new numeric column (default: -1)
Create a temp dataset with your main dataset
Using a for loop:
Using if statements:
Populate categorical codes in the numeric column
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)
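Requirements VII could be sketched as below, assuming a hypothetical text column named size and an arbitrary code mapping (small = 0, medium = 1, large = 2). Values that match no category keep the default of -1.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical frame standing in for the approved dataset.
df = pd.DataFrame(
    {
        "item": ["a", "b", "c", "d", "e", "f"],
        "size": ["small", "medium", "large", "unknown", "small", "large"],
    }
)

# New numeric column, defaulting to -1 (meaning "no category assigned").
df["size_code"] = -1

# Temp dataset: a copy of the main dataset.
temp_df = df.copy()

# For loop with if statements: map each category to a numeric code.
for i in temp_df.index:
    size = temp_df.loc[i, "size"]
    if size == "small":
        code = 0
    elif size == "medium":
        code = 1
    elif size == "large":
        code = 2
    else:
        code = -1  # unrecognized category keeps the default
    temp_df.loc[i, "size_code"] = code

print(temp_df.head(5))

out_path = Path(tempfile.gettempdir()) / "YourName_requirements_7.csv"
temp_df.to_csv(out_path, index=False)
```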