Make a chart that contains histograms of heights

Reference no: EM132373819

Assignment -

Part 1 - BBALL STUDY

We previously used a dataset called PlayerBBall.csv, which contained information about NBA basketball players. To finish that assignment, you had to manipulate the height column. Review the code you used to do that and see whether you can write more efficient code using regular expressions and/or the string functions from this unit.

  • Use regular expressions on the height column to create a TotalInches column that holds the total height in inches, stored as a numeric variable.
  • Use this variable to make a chart that contains histograms of heights for every position (color coded).
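The conversion to total inches can be sketched as follows. The assignment itself is to be done in R; this sketch uses only the Python standard library to show the logic, and the regular expression carries over unchanged. It assumes the height column stores feet and inches separated by a hyphen (for example "6-10"), which should be checked against the actual file. The names total_inches and heights_by_position and the sample rows are hypothetical.

```python
import re
from collections import Counter, defaultdict

# Assumed format: feet and inches separated by a hyphen, e.g. "6-10".
HEIGHT_RE = re.compile(r"^\s*(\d+)-(\d+)\s*$")


def total_inches(height):
    """Convert a 'feet-inches' string to total inches, or None if it does not match."""
    match = HEIGHT_RE.match(height)
    if match is None:
        return None
    feet, inches = (int(group) for group in match.groups())
    return feet * 12 + inches


def heights_by_position(rows):
    """Count players at each height (in inches), grouped by position."""
    counts = defaultdict(Counter)
    for position, height in rows:
        inches = total_inches(height)
        if inches is not None:  # skip heights that could not be parsed
            counts[position][inches] += 1
    return counts


# Made-up sample rows (position, height); not taken from PlayerBBall.csv.
sample = [("C", "7-0"), ("C", "6-11"), ("PG", "6-1"), ("PG", "6-1"), ("SF", "n/a")]
histograms = heights_by_position(sample)
```

The per-position counts are the data a color-coded histogram would plot, one color per position.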

Part 2 - FIFA STUDY

We previously used a dataset called FIFA Playersl.csv which contained information about Soccer players.

a. Use the string functions and regular expressions to assess the relationship between height and weight among soccer players. To do this, you will need to convert the height and weight columns into columns that hold numeric values. Tell your story using 2 - 4 PPT slides.

b. Next, assess this relationship between just the LB and LM positions. (1 slide should do it.)
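A minimal sketch of both steps, again in standard-library Python rather than R. It assumes heights are written like 5'7 (optionally with a trailing double quote) and weights like 159lbs; neither format is confirmed by the assignment text, so inspect the file first. The function names, the sample rows, and the choice of a Pearson correlation as the measure of the relationship are all illustrative assumptions.

```python
import re
from math import sqrt

# Assumed formats: height like 5'7 or 5'7", weight like 159lbs.
HEIGHT_RE = re.compile(r"^(\d+)'(\d+)\"?$")
WEIGHT_RE = re.compile(r"^(\d+(?:\.\d+)?)\s*lbs$")


def height_inches(text):
    """Return the height in inches, or None if the text does not match."""
    match = HEIGHT_RE.match(text.strip())
    return int(match.group(1)) * 12 + int(match.group(2)) if match else None


def weight_lbs(text):
    """Return the weight in pounds as a float, or None if the text does not match."""
    match = WEIGHT_RE.match(text.strip())
    return float(match.group(1)) if match else None


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def height_weight_correlation(players, positions=None):
    """Correlate height and weight, optionally restricted to some positions."""
    pairs = []
    for position, height, weight in players:
        if positions is not None and position not in positions:
            continue
        h, w = height_inches(height), weight_lbs(weight)
        if h is not None and w is not None:  # drop rows that failed to parse
            pairs.append((h, w))
    heights, weights = zip(*pairs)
    return pearson(heights, weights)


# Made-up sample rows (position, height, weight); not taken from the FIFA file.
sample = [
    ("LB", "5'7", "150lbs"),
    ("LM", "5'9", "160lbs"),
    ("ST", "6'2\"", "190lbs"),
    ("LB", "6'0", "175lbs"),
    ("GK", "6'4", "185lbs"),
]
overall = height_weight_correlation(sample)
lb_lm_only = height_weight_correlation(sample, positions={"LB", "LM"})
```

Passing positions={"LB", "LM"} restricts the calculation to the two positions asked about in part b.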

BBALL STUDY - We previously used a dataset called PlayerBBall.csv which contained information about NBA basketball players. To finish that assignment, you had to manipulate the height column. Review your code and see if there isn't a more efficient solution using regular expressions and / or the string functions from this unit. Tell your story on 1 or 2 PPT Slides.

Part 3 - BABY NAMES

Backstory: Your client is expecting a baby soon. However, he is not sure what to name the child. Being out of the loop, he hires you to help him figure out popular names. He provides you with raw data to help you make a decision.

The Most Popular Baby Names in The UK

Girls       Boys
Olivia      Oliver
Amelia      Harry
Emily       George
Isla        Jack
Ava         Jacob
Isabella    Noah
Lily        Charlie
Jessica     Muhammad
Ella        Thomas
Mia         Oscar

Baby Names: Question 1

1. Data Munging: Utilize yob2016.txt for this question. This file lists popular names given to children born in 2016 in the United States. It consists of three columns: a first name, a gender, and the number of children given that name. However, the data is raw and will need cleaning to make it tidy and usable.

a. First, import the .txt file into R so you can process it. Keep in mind this is not a CSV file. You might have to open the file to see what you're dealing with. Assign the resulting data frame to an object, df, that consists of three columns with human-readable column names for each.

b. Display the summary and structure of df

c. Your client tells you that there is a problem with the raw file. One name was entered twice and misspelled. The client cannot remember which name it is; there are thousands he saw! But he did mention he accidentally put three y's at the end of the name. Write an R command to figure out which name it is and display it.

d. Upon finding the misspelled name, please remove this particular observation, as the client says it's redundant. Save the remaining dataset as an object: y2016.
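Steps a through d can be sketched as below. The assignment asks for R; this is a standard-library Python illustration of the same logic. The semicolon delimiter is an assumption (the task only says the file is not a CSV), and read_names, the column names, and the sample rows are hypothetical. The pattern y{3}$ matches names ending in three y's and could equally be passed to an R function such as grepl.

```python
import csv
import io
import re

# Matches any name that ends in exactly the suffix "yyy".
TRIPLE_Y = re.compile(r"y{3}$")


def read_names(handle, delimiter):
    """Read name, gender, count records into dicts with readable column names."""
    rows = []
    for record in csv.reader(handle, delimiter=delimiter):
        if len(record) != 3:
            continue  # skip blank or malformed lines
        name, gender, count = record
        rows.append({"Name": name, "Gender": gender, "Count": int(count)})
    return rows


# Made-up sample standing in for yob2016.txt; the delimiter is an assumption.
raw = io.StringIO("Anna;F;120\nAnnayyy;F;12\nBen;M;95\n")
df = read_names(raw, delimiter=";")

# Step c: find the misspelled name.
misspelled = [row for row in df if TRIPLE_Y.search(row["Name"])]

# Step d: keep every observation except the misspelled one.
y2016 = [row for row in df if not TRIPLE_Y.search(row["Name"])]
```

With a real file, open("yob2016.txt", newline="") would replace the StringIO sample.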

Baby Names: Question 2

2. Data Merging: Utilize yob2015.txt for this question. This file is similar to yob2016, but contains names, gender, and total children given that name for the year 2015.

a. Like 1a, please import the .txt file into R. Look at the file before you do. You might have to change some options to import it properly. Again, please give the dataframe human-readable column names. Assign the dataframe to y2015.

b. Display the last ten rows in the dataframe. Describe something you find interesting about these 10 rows.

c. Merge y2016 and y2015 by your Name column; assign the result to final. The client only cares about names that have data for both 2016 and 2015; there should be no NA values in either of your number-of-children columns after merging.
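Keeping only names present in both years is an inner join. Below is a minimal standard-library Python sketch of that idea, not the R solution the assignment expects. The function inner_join_on_name, the output column names, and the sample rows are hypothetical. Note one assumption worth checking in the real data: a name recorded under both genders would produce one output row per matching pair when joining on Name alone.

```python
from collections import defaultdict


def inner_join_on_name(rows_2016, rows_2015):
    """Pair up rows that share a Name; names missing from either year are dropped."""
    by_name = defaultdict(list)
    for row in rows_2015:
        by_name[row["Name"]].append(row)

    merged = []
    for left in rows_2016:
        for right in by_name.get(left["Name"], []):
            merged.append({
                "Name": left["Name"],
                "Gender2016": left["Gender"],
                "Count2016": left["Count"],
                "Gender2015": right["Gender"],
                "Count2015": right["Count"],
            })
    return merged


# Made-up sample rows; not taken from the yob files.
y2016 = [
    {"Name": "Anna", "Gender": "F", "Count": 120},
    {"Name": "Ben", "Gender": "M", "Count": 95},
    {"Name": "Cleo", "Gender": "F", "Count": 30},
]
y2015 = [
    {"Name": "Anna", "Gender": "F", "Count": 110},
    {"Name": "Ben", "Gender": "M", "Count": 90},
    {"Name": "Dara", "Gender": "F", "Count": 20},
]

last_ten = y2015[-10:]  # step b: the last ten rows (fewer if the list is short)
final = inner_join_on_name(y2016, y2015)
```

Because unmatched names never reach the output, neither count column can contain a missing value.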

Baby Names: Question 3

3. Data Summary: Utilize your data frame object final for this part.

a. Create a new column called "Total" in final that adds the number of children in 2015 and 2016 together. In those two years combined, how many people were given popular names?

b. Sort the data by Total. What are the top 10 most popular names?

c. The client is expecting a girl! Omit boys and give the top 10 most popular girls' names.

d. Write these top 10 girl names and their Totals to a CSV file. Leave out the other columns entirely.
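The four summary steps can be sketched as follows in standard-library Python (the assignment itself calls for R). The row layout, with keys Name, Gender, Count2015 and Count2016, is a hypothetical stand-in for the merged data frame, and the helper names and sample values are made up for illustration.

```python
import csv
import io


def add_totals(rows):
    """Return copies of the rows with a Total column (2015 count plus 2016 count)."""
    return [dict(row, Total=row["Count2015"] + row["Count2016"]) for row in rows]


def top_n(rows, n=10, gender=None):
    """Return the n rows with the largest Total, optionally for one gender only."""
    if gender is not None:
        rows = [row for row in rows if row["Gender"] == gender]
    return sorted(rows, key=lambda row: row["Total"], reverse=True)[:n]


def write_name_totals(rows, handle):
    """Write only the Name and Total columns as CSV."""
    writer = csv.writer(handle, lineterminator="\n")
    writer.writerow(["Name", "Total"])
    for row in rows:
        writer.writerow([row["Name"], row["Total"]])


# Made-up merged rows; the column layout is an assumption.
final = add_totals([
    {"Name": "Anna", "Gender": "F", "Count2015": 110, "Count2016": 120},
    {"Name": "Ben", "Gender": "M", "Count2015": 90, "Count2016": 95},
    {"Name": "Cleo", "Gender": "F", "Count2015": 30, "Count2016": 40},
    {"Name": "Dara", "Gender": "F", "Count2015": 200, "Count2016": 50},
])

grand_total = sum(row["Total"] for row in final)  # step a
top_overall = top_n(final, n=10)                  # step b
top_girls = top_n(final, n=2, gender="F")         # step c (n=2 for the small sample)

buffer = io.StringIO()                            # step d: an open file works the same way
write_name_totals(top_girls, buffer)
```

Writing to an in-memory buffer keeps the sketch free of side effects; passing a handle from open("girls.csv", "w", newline="") (a hypothetical filename) would produce the file on disk.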

Baby Names: Question 4

4. Data Visualization: Create a well-labeled, visually appealing, and informative visualization summarizing some of the results of this study.

Attachment:- Data Files.rar
