You may work in a group of at most 3 students. Your submission to Canvas must be named HW_Name(s) and must contain your answers in a Word file, the R source code, and the data files used for the assignment. (Do NOT use any other extension, such as .7z or .tar, since I can't open them. If I cannot open your files to check your answers and your code, I cannot give you credit for the assignment.)
You may work in groups of at most two students, but you must include the names of the students you worked with in your assignment, and the file name must be HW_FirstStudentName_SecondStudentName.
N.B. Include in the Word document the results copied from the console and screenshots of the plots for the following:
Part A: Basic Data Analytics (50 points)
In this part of the assignment, you will use the US Census data file "PopulationData.csv".
Population data have the following columns:
• time: when the census data were collected
• Country: the country for the collected census data
• State: the state for the collected census data; could be empty ""
• MFRatio: the number of males divided by the number of females in a population
• below14: the number of people below 14 years old in a population
• between15_65: the number of people between 15 years old and 65 years old in a population
• above65: the number of people above 65 years old in a population
1. Load the population data from the CSV file into a data frame called countries.
2. From countries, subset the rows that have an empty value "" for State and put the result into a data frame called "countries1".
3. Compute a summary of the above65 column of countries.
4. Compute the standard deviation of the above65 column of countries.
5. Which country has the lowest MFRatio in countries?
6. Use table to check how many countries have more females than males (MFRatio < 1).
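The steps above can be sketched in base R as follows. This is a minimal illustration, not a reference solution: the rows in toy are made up and do not come from PopulationData.csv, the country names are placeholders, and the temporary file stands in for the real CSV so the sketch runs on its own. The variable names csv_path, toy, lowest_country, sd_above65, and more_females are assumptions introduced here.

```r
# Made-up rows for illustration only; NOT the contents of PopulationData.csv.
toy <- data.frame(
  time         = c(2010, 2010, 2010, 2010),
  Country      = c("CountryA", "CountryB", "CountryC", "CountryC"),
  State        = c("", "", "", "North"),
  MFRatio      = c(0.95, 1.04, 0.98, 1.01),
  below14      = c(120, 300, 210, 90),
  between15_65 = c(600, 900, 800, 350),
  above65      = c(150, 80, 110, 40),
  stringsAsFactors = FALSE
)

# Write the toy data to a temporary CSV that stands in for the real file.
csv_path <- tempfile(fileext = ".csv")
write.csv(toy, csv_path, row.names = FALSE)

# 1. Load the CSV into a data frame called countries.
countries <- read.csv(csv_path, stringsAsFactors = FALSE)

# 2. Keep rows whose State is empty. The is.na() check is a precaution in
#    case blank fields were read as NA rather than "".
countries1 <- countries[is.na(countries$State) | countries$State == "", ]

# 3. Five-number summary plus mean of above65.
print(summary(countries$above65))

# 4. Sample standard deviation of above65.
sd_above65 <- sd(countries$above65)
print(sd_above65)

# 5. Country on the row with the smallest MFRatio.
lowest_country <- countries$Country[which.min(countries$MFRatio)]
print(lowest_country)

# 6. Count rows with MFRatio < 1 (TRUE) versus not (FALSE).
more_females <- table(countries$MFRatio < 1)
print(more_females)
```

With the real file, the call in step 1 would presumably become read.csv("PopulationData.csv"), assuming the file sits in the working directory. Note that steps 3 to 6 operate on countries, as the task wording states; if the state-level rows should be excluded, the same calls can be applied to countries1 instead.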