Reshape the WHO data set into the form

Reference no: EM131971819

Assignment Tasks:

You will use the WHO data set for Tasks 1-5. Read the WHO data using an appropriate function, then complete Tasks 1-5.

1- Tidy Task 1:

Use appropriate "tidyr" functions to reshape the WHO data set into the form given below:
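The target layout is not reproduced here. As a rough sketch of the reshaping step only, the wide count columns can be stacked into a key/value pair with "gather()". The toy data frame, the count column names, and the "iso3" and "value" names are assumptions for illustration; the real WHO file may differ.

```r
library(tidyr)

# Toy stand-in for the WHO data (assumed column names, made-up counts).
who_toy <- data.frame(
  country = c("A", "B"),
  iso2 = c("AA", "BB"),
  iso3 = c("AAA", "BBB"),
  year = c(2000, 2000),
  new_sp_m014 = c(1, 2),
  new_sp_f014 = c(3, 4),
  stringsAsFactors = FALSE
)

# Stack every count column into a "code" (old column name) and "value" pair,
# keeping the identifier columns as they are.
who_long <- gather(who_toy, key = "code", value = "value",
                   -country, -iso2, -iso3, -year)

print(who_long)
```

Each original count column becomes a set of rows, so the two-row toy table turns into four rows.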

2- Tidy Task 2:

The WHO data set is not in a tidy format yet. The "code" column still contains four different variables' information (see variable description section for the details). Separate the "code" column and form four new variables using appropriate "tidyr" functions. The final format of the WHO data set for this task should be in the form given below:
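The target layout is not reproduced here. One possible way to split the "code" column is two calls to "separate()": first on the underscore, then by character position. This sketch assumes the codes look like "new_sp_m014" (prefix, variable, then sex and age group fused together); the intermediate names "var" and "sexage" are hypothetical.

```r
library(tidyr)

# Toy long-format data; the code format is an assumption.
who_long <- data.frame(
  country = "A",
  year = 2000,
  code = c("new_sp_m014", "new_rel_f65"),
  value = c(1, 2),
  stringsAsFactors = FALSE
)

# Step 1: split on "_" into three pieces.
who_sep <- separate(who_long, code, into = c("new", "var", "sexage"), sep = "_")

# Step 2: split the fused sex/age piece after its first character.
who_sep <- separate(who_sep, sexage, into = c("sex", "age"), sep = 1)

print(who_sep)
```

If the real codes do not use underscores consistently, the first split would need a different separator or a clean-up step beforehand.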

3- Tidy Task 3:

The WHO data set is not in a tidy format yet. The "rel", "ep", "sn", and "sp" keys need to be in their own columns as we will treat each of these as a separate variable. In this step, move the "rel", "ep", "sn", and "sp" keys into their own columns. The final format of the WHO data set for this task should be in the form given below:
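The target layout is not reproduced here. As a sketch of the idea, "spread()" turns the values of a key column into separate columns. The column names "var" and "value" and the toy counts are assumptions carried over for illustration.

```r
library(tidyr)

# Toy data: one country/year/sex/age combination with four keys.
who_sep <- data.frame(
  country = "A",
  year = 2000,
  sex = "m",
  age = "014",
  var = c("rel", "ep", "sn", "sp"),
  value = c(1, 2, 3, 4),
  stringsAsFactors = FALSE
)

# Each distinct value of "var" becomes its own column filled from "value".
who_wide <- spread(who_sep, key = var, value = value)

print(who_wide)
```

The four input rows collapse into a single row because they share the same country, year, sex, and age.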

4- Tidy Task 4:

There is one more step to tidy the WHO data set. We have two categorical variables, "sex" and "age". Use "mutate()" to factorise sex and age. For the "age" variable, you need to create labels and also order the variable. The labels should be: <15, 15-24, 25-34, 35-44, 45-54, 55-64, 65>=. The final tidy version of the WHO data set would look like this:
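The final layout is not reproduced here. A minimal sketch of the factorising step is shown below, assuming the raw age codes are "014", "1524", ..., "65" and the sex codes are "m" and "f"; these raw codes are assumptions, while the labels are the ones listed in the task.

```r
library(dplyr)

# Toy data with assumed raw codes.
who_wide <- data.frame(
  sex = c("m", "f", "m"),
  age = c("65", "014", "2534"),
  stringsAsFactors = FALSE
)

who_tidy <- mutate(
  who_wide,
  sex = factor(sex, levels = c("m", "f")),
  # levels fixes the order of the raw codes; labels renames them;
  # ordered = TRUE makes comparisons such as < meaningful.
  age = factor(
    age,
    levels = c("014", "1524", "2534", "3544", "4554", "5564", "65"),
    labels = c("<15", "15-24", "25-34", "35-44", "45-54", "55-64", "65>="),
    ordered = TRUE
  )
)

print(levels(who_tidy$age))
```

Any raw code that is not listed in "levels" becomes NA, so it is worth checking for unexpected NA values after this step.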

5- Task 5: Filter & Select

Drop the redundant columns "iso2" and "new", and filter any three countries from the tidy version of the WHO data set. Name this subset of the data frame as "WHO_subset".
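A short sketch of this step with "select()" and "filter()" follows. The toy data frame and the country names "A" to "D" are placeholders, not values from the WHO file.

```r
library(dplyr)

# Toy stand-in for the tidy WHO data.
who_tidy <- data.frame(
  country = c("A", "B", "C", "D"),
  iso2 = c("AA", "BB", "CC", "DD"),
  new = "new",
  year = 2000,
  sp = c(1, 2, 3, 4),
  stringsAsFactors = FALSE
)

WHO_subset <- who_tidy %>%
  select(-iso2, -new) %>%                 # drop the redundant columns
  filter(country %in% c("A", "B", "C"))   # keep any three countries

print(WHO_subset)
```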

You will use the surveys and species data sets for Tasks 6-10. Read the species and surveys data sets using an appropriate function. Name these data frames "species" and "surveys", respectively.

6- Task 6: Join

Combine the "surveys" and "species" data frames using the key variable "species_id". For this task, you need to add the species information ("genus", "species", "taxa") to the "surveys" data. Name the combined data frame "surveys_combined".
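Because the species information is being added to the survey records, a left join is one natural choice: it keeps every survey row even when no matching species exists. The sketch below uses toy rows; "record_id" and the values are assumptions for illustration.

```r
library(dplyr)

# Toy stand-ins for the two data frames.
surveys <- data.frame(
  record_id = 1:3,
  species_id = c("DM", "PF", "ZZ"),
  weight = c(40, 8, NA),
  stringsAsFactors = FALSE
)
species <- data.frame(
  species_id = c("DM", "PF"),
  genus = c("Dipodomys", "Perognathus"),
  species = c("merriami", "flavus"),
  taxa = "Rodent",
  stringsAsFactors = FALSE
)

# Keep all survey rows; unmatched species_id values get NA in the new columns.
surveys_combined <- left_join(surveys, species, by = "species_id")

print(surveys_combined)
```

Here the row with species_id "ZZ" is kept, with NA for "genus", "species", and "taxa".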

7- Task 7: Calculate

Using the "surveys_combined" data frame, calculate the average weight and hindfoot length of one of the species observed in each month (irrespective of the year). Make sure to exclude missing values while calculating the average.
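A possible shape for this calculation is filter, then group, then summarise. The column names "month" and "hindfoot_length", the species code "DM", and the toy values are assumptions.

```r
library(dplyr)

# Toy stand-in for surveys_combined.
surveys_combined <- data.frame(
  species_id = c("DM", "DM", "DM", "PF"),
  month = c(1, 1, 2, 1),
  weight = c(40, NA, 44, 8),
  hindfoot_length = c(36, 34, 37, 16),
  stringsAsFactors = FALSE
)

monthly_means <- surveys_combined %>%
  filter(species_id == "DM") %>%       # one species of your choice
  group_by(month) %>%                  # month only, so years are pooled
  summarise(
    mean_weight = mean(weight, na.rm = TRUE),            # na.rm drops NAs
    mean_hindfoot = mean(hindfoot_length, na.rm = TRUE)
  )

print(monthly_means)
```

Grouping by "month" alone is what makes the result irrespective of the year.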

8- Task 8: Missing Values

Select one of the years in the "surveys_combined" data frame and name this subset "surveys_combined_year". Using the "surveys_combined_year" data frame, find the total number of missing values in the "weight" column, grouped by species. Replace the missing values in the "weight" column with the mean value for each species. Save this imputed data as "surveys_weight_imputed".
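The counting and imputation steps can be sketched with grouped "summarise()" and "mutate()" calls. The toy rows below stand in for a single year that has already been filtered; the values are made up.

```r
library(dplyr)

# Toy stand-in for one year of surveys_combined.
surveys_combined_year <- data.frame(
  species_id = c("DM", "DM", "DM", "PF", "PF"),
  weight = c(40, NA, 44, NA, NA),
  stringsAsFactors = FALSE
)

# Count missing weights per species.
missing_by_species <- surveys_combined_year %>%
  group_by(species_id) %>%
  summarise(n_missing = sum(is.na(weight)))

# Replace each missing weight with the mean of its own species.
surveys_weight_imputed <- surveys_combined_year %>%
  group_by(species_id) %>%
  mutate(weight = ifelse(is.na(weight), mean(weight, na.rm = TRUE), weight)) %>%
  ungroup()

print(missing_by_species)
print(surveys_weight_imputed)
```

In this toy data the species "PF" has no observed weights at all, so its group mean cannot be computed from real values; that case is worth looking for in the imputed column.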

9- Task 9: Inconsistencies or Special Values

Inspect the "weight" column in the "surveys_weight_imputed" data frame for any further inconsistencies or special values (i.e., NaN, Inf, -Inf). Trace back and explain briefly why you got such a value.
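One common way a special value can arise after mean imputation is a group with no observed values: the mean of an empty set is 0/0, which R reports as NaN. The sketch below shows that behaviour and how special values can be counted, using made-up vectors; whether this is the cause in your data needs to be checked.

```r
# Mean of a group whose weights are all missing.
weights <- c(NA_real_, NA_real_)
group_mean <- mean(weights, na.rm = TRUE)
print(group_mean)          # NaN, because no values remain after removing NAs

# Counting special values in a toy vector.
x <- c(40, NaN, Inf, -Inf, NA)
n_nan <- sum(is.nan(x))          # NaN only; NA is not counted
n_inf <- sum(is.infinite(x))     # counts both Inf and -Inf
n_na  <- sum(is.na(x))           # is.na() is TRUE for both NA and NaN

print(c(n_nan = n_nan, n_inf = n_inf, n_na = n_na))
```

Note that "is.na()" also returns TRUE for NaN, so "is.nan()" is needed to tell the two apart.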

10- Task 10: Outliers

Using the "surveys_combined" data frame, inspect the variable hindfoot length for possible univariate outliers. If you detect any outliers, use any of the methods outlined in the Module 6 notes to deal with them. Briefly explain the actions you take to handle the outliers.
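The Module 6 notes are not reproduced here, so the sketch below uses one widely used univariate rule, the boxplot (1.5 x IQR) fences, as an assumed example rather than the method the module prescribes. The vector of hindfoot lengths is made up.

```r
# Toy hindfoot lengths with one extreme value and one missing value.
hindfoot <- c(30, 31, 32, 33, 34, 35, 36, 70, NA)

# Quartiles and interquartile range, ignoring missing values.
q <- unname(quantile(hindfoot, probs = c(0.25, 0.75), na.rm = TRUE))
iqr <- q[2] - q[1]

# Boxplot fences: values outside these are flagged as possible outliers.
lower <- q[1] - 1.5 * iqr
upper <- q[2] + 1.5 * iqr

# Missing values are never flagged.
is_outlier <- !is.na(hindfoot) & (hindfoot < lower | hindfoot > upper)

print(hindfoot[is_outlier])
```

Flagged values still need a decision (for example, checking for data entry errors before excluding, capping, or imputing them), and that decision should be explained.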

Attachment:- Assignment.zip

Course: MATH2349 Data Preprocessing

Reviews

len1971819

5/5/2018 6:31:41 AM

Criteria: Not acceptable (0), Needs Improvement (1), Excellent (2)

Task 1 (10%)
Not acceptable (0): Unable to tidy the data set properly.
Needs Improvement (1): There was an attempt to tidy the data but it wasn’t in the required form.
Excellent (2): A complete set of tasks were provided to tidy the data set in the required form.

Task 2 (10%)
Not acceptable (0): Unable to tidy the data set properly.
Needs Improvement (1): There was an attempt to tidy the data but it wasn’t in the required form.
Excellent (2): A complete set of tasks were provided to tidy the data set in the required form.

Task 3 (10%)
Not acceptable (0): Unable to tidy the data set properly.
Needs Improvement (1): There was an attempt to tidy the data but it wasn’t in the required form.
Excellent (2): A complete set of tasks were provided to tidy the data set in the required form.
