Reshape the WHO data set into the form

Assignment Help Other Subject
Reference no: EM131971819

Assignment Tasks:

You will use WHO data set for Tasks 1- 5. Read the WHO data using an appropriate function and complete the tasks 1-5.

1- Tidy Task 1:

Use appropriate "tidyr" functions to reshape the WHO data set into the form given below:

2- Tidy Task 2:

The WHO data set is not in a tidy format yet. The "code" column still contains four different variables' information (see variable description section for the details). Separate the "code" column and form four new variables using appropriate "tidyr" functions. The final format of the WHO data set for this task should be in the form given below:

3- Tidy Task 3:

The WHO data set is not in a tidy format yet. The "rel", "ep", "sn", and "sp" keys need to be in their own columns as we will treat each of these as a separate variable. In this step, move the "rel", "ep", "sn", and "sp" keys into their own columns. The final format of the WHO data set for this task should be in the form given below:

4- Tidy Task 4:

There is one more step to tidy the WHO data set. We have two categorical variables "sex" and "age". Use "mutate()" to factorise sex and age. For "age" variable, you need to create labels and also order the variable. Labels would be: <15, 15-24, 25-34, 35-44, 45-54, 55-64, 65>=. The final tidy version of the WHO data set would look like this:

5- Task 5: Filter & Select

Drop the redundant columns "iso2" and "new", and filter any three countries from the tidy version of the WHO data set. Name this subset of the data frame as "WHO_subset".

You will use surveys and species data sets for Tasks 6 - 10. Read the species and surveys data sets using an appropriate function. Name these data frames as "species" and "surveys", respectively.

6- Task 6: Join

Combine "surveys" and "species" data frames using the key variable "species_id". For this task, you need to add the species information ("genus", "species", "taxa") to the "surveys" data. Rename the combined data frame as "surveys_combined".

7- Task 7: Calculate

Using the "surveys_combined" data frame, calculate the average weight and hindfoot length of one of the species observed in each month (irrespective of the year). Make sure to exclude missing values while calculating the average.

8- Task 8: Missing Values

Select one of the years in the "surveys_combined" dataframe, rename this data set as "surveys_combined_year". Using "surveys_combined_year" dataframe, find the total missing values in "weight" column grouped by species. Replace the missing values in "weight" column with the mean values of each species. Save this imputed data as "surveys_weight_imputed".

9- Task 9: Inconsistencies or Special Values

Inspect the "weight" column in "surveys_weight_imputed" dataframe for any further inconsistencies or special values (i.e., NaN, Inf, -Inf). Trace back and explain briefly why you got such a value.

10- Task 10: Outliers

Using the "surveys_combined" data frame, inspect the variable hindfoot length for possible univariate outliers. If you detect any outliers use any of the methods outlined in the Module 6 notes to deal with them. Explain briefly the actions that you take to handle outliers.

Attachment:- Assignment.zip

Reference no: EM131971819

Questions Cloud

Generate a program that prompts the user : Generate a program that prompts the user to enter a telephone number expressed in letters and outputs the corresponding telephone number in digits.
What is the equivalent annual cost for upgrading : In evaluating projects, Buford Engineers (BE) uses a discount rate of 15% for a before-tax analysis. One year ago, a robotic transfer machine was installed.
What is the average time to read a single sector : What is the average time to read a single sector?
Prepare the manufacturing overhead budget by quarters : For Roche Inc., variable manufacturing overhead costs are expected. Prepare the manufacturing overhead budget by quarters and in total for the year.
Reshape the WHO data set into the form : MATH2349 Data Preprocessing - Read the species and surveys data sets using an appropriate function. Name these data frames as "species" and "surveys"
Compute what is the npv of the project : We're looking at a new project. We plan to sell 7,600 units per year, $ 68 per unit, for the next 10 years. In other words, the annual cash flow.
Journalize the adjusting entry for the inventory shrinkage : Journalize the adjusting entry for inventory shrinkage for Rodriguez Company for year ended June 30, 2014. Assume that inventory shrinkage is a normal amount.
Warehouse operations manager for implementation : Recommend best practices for backup plans to a warehouse operations manager for implementation.
Determine what is the annual yield to maturity : Evans Emergency Response bonds have 4 years to maturity. Interest is paid semiannually. The bonds have a $1,400 par value and a coupon rate of 9 percent.

Reviews

len1971819

5/5/2018 6:31:41 AM

Criteria Not acceptable (0) Needs Improvement (1) Excellent (2) Task 1 (10%) Unable to tidy the data set properly. There was an attempt to tidy the data but it wasn’t in the required form A complete set of tasks were provided to tidy the data set in the required form. Task 2 (10%) Unable to tidy the data set properly. There was an attempt to tidy the data but it wasn’t in the required form A complete set of tasks were provided to tidy the data set in the required form. Task 3 (10%) Unable to tidy the data set properly. There was an attempt to tidy the data but it wasn’t in the required form A complete set of tasks were provided to tidy the data set in the required form.

Write a Review

Other Subject Questions & Answers

  Proactively respond to student misconduct

How can faculty cultivate an environment that positively contributes to learning and proactively responds to student misconduct?

  Develop initial interview question

Develop initial interview questions. Staffing services believes that a half-hour interview will be appropriate, with about 3 minutes per interview question. They would like 5 behavioral interview questions and 5 situational interview questions. Each ..

  What is the defendants mental diagnosis

What is the defendant being charged with (e.g., Felony Theft)?What is the defendant's mental diagnosis (e.g., Schizophrenia)?

  Weaknesses of michael eric dyson argument

Critically discuss some of the strengths and weaknesses of Michael Eric Dyson's argument concerning affirmative action programs to redress racist practices in the past?

  What methods of secure custody do

How does the prison environment influence the way you ensure security and custody in your prison - what methods of secure custody do you use in your prison?

  An analysis of the choreography rooms

An analysis of the choreography "Rooms" discussing what it is about and how the movement creates this narrative.

  Define biostatistics and why it is used in public health

In public health and biostatistics, it is important to know and understand terminology and concepts relating to the field

  Explain deviant cultural behavior or tradition

Cultural behavior or tradition which was acceptable 50 years ago in culture, but is now considered deviant today, and cultural behavior or tradition which was considered deviant 50 years ago.

  Explain phases of the moon phenomena

What causes the following phenomena: phases of the moon, solar eclipse, lunar eclipse. Include a description of the observed shapes of the moon during a gibbous phase and a lunar eclipse, and how they differ.

  The realms of africa

Perhaps no realm has been more thoroughly disrupted and transformed by the colonial experience than Africa. Slave-trading, European colonization, and the discovery of rich natural resources

  What is the responsibility of christians

What is the responsibility of Christians with regards to economic development, leadership within the community, and the mandates of the Gospel?

  What different styles of communication are most prominent

What different styles of communication are most prominent in the workplace in each culture. First slide on the culture of Asia.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd