Reference no: EM132314626
When filming on location, permits are required for the exclusive use of city property such as streets, parks and even footpaths. There are many film locations around the world however one of the most iconic is New York City. To film in New York City permission is required from
the Mayor's Office of Media and Entertainment.
Data on these filming permits is hosted by the open data platform of the City of New York. You don't need this website for the assignment but it's a great link to follow for lots of interesting data sets. The data for this question is in the file film-permits.csv (available from the Data page).
It contains 52,350 rows of data across 14 columns and is a friendly 20MB or so according to my file explorer. This is the data you will analyse. The file film permit codebook.pdf contains the data dictionary for each variable in the data set. For this assignment, you will need to produce a report summarising a collection of requested statistical analyses and visualisations of the data. See below for details.
Assignment tasks
For this assignment you will need to produce a report summarising a collection of requested statistical analyses and visualisations of the data. As a guideline, for the written report 2-3 pages of writing will be sufficient excluding tables/figures. I won't strictly count words so if you go over/under by a bit that's fine, but this is a good ballpark.
The report should contain:
1. An introduction outlining the analysis to follow/background information. The introduction can be up to 2 paragraphs. For the purposes of this assignment a paragraph is 6-8 sentences.
2. A statistical summary of the duration of filming. To determine filming duration you will need to use the variables StartDateTime and EndDateTime. The statistical summary should consist of a numerical analysis and visual representation of your calculated duration by Borough, by Category and then by Borough and Category together.
3. A numerical and visual analysis of: Event Type (the different types of permits requested and then Event Type broken down by Borough)
LeadTime: this is a variable you will need to calculate as the time duration between when the permit was submitted and when shooting commenced.
LeadTime should then be analysed by Borough and Category individually and then together.
Relationship between duration of filming and lead time.
Discussion of these analyses should be one paragraph per analysis (eg Event Type, then Event Type by Borough, etc).
4. Tables of Borough and Category by Subcategory. Discuss the trends you see in the tables.
5. What about zip-codes? What zip-codes are more popular for filming? Produce appropriate numerical and visual summaries. You can plot a map (scatterplot) by using package zipcode. You even can plot a real map using ggmap and but it is not free, so you don't need it for the assignment.
6. Conclusions (1-2 paragraphs is fine)
Attachment:- Film Permits.rar