Summarising a collection of requested statistical analyses

Assignment Help Applied Statistics
Reference no: EM132317752

Assignment - New York Movies Scene

Data and Background Information -

When filming on location, permits are required for the exclusive use of city property such as streets, parks and even footpaths. There are many film locations around the world however one of the most iconic is New York City. To film in New York City permission is required from the Mayor's Office of Media and Entertainment.

Data on these filming permits is hosted by the open data platform of the City of New York. You don't need this website for the assignment but it's a great link to follow for lots of interesting data sets.

The data for this question is in the file film-permits.csv (available from the Data page). It contains 52,350 rows of data across 14 columns and is a friendly 20MB or so according to my file explorer. This is the data you will analyse.

The file film permit codebook.pdf contains the data dictionary for each variable in the data set. For this assignment, you will need to produce a report summarising a collection of requested statistical analyses and visualisations of the data. See below for details.

Assignment tasks -

For this assignment you will need to produce a report summarising a collection of requested statistical analyses and visualisations of the data. As a guideline, for the written report 2-3 pages of writing will be sufficient excluding tables/figures. I won't strictly count words so if you go over/under by a bit that's fine, but this is a good ballpark.

The report should contain:

1. An introduction outlining the analysis to follow/background information. The introduction can be up to 2 paragraphs. For the purposes of this assignment a paragraph is 6-8 sentences.

2. A statistical summary of the duration of filming. To determine filming duration you will need to use the variables StartDateTime and EndDateTime. The statistical summary should consist of a numerical analysis and visual representation of your calculated duration by Borough, by Category and then by Borough and Category together.

3. A numerical and visual analysis of:

  • Event Type (the different types of permits requested and then Event Type broken down by Borough)
  • LeadTime: this is a variable you will need to calculate as the time duration between when the permit was submitted and when shooting commenced. LeadTime should then be analysed by Borough and Category individually and then together.
  • Relationship between duration of filming and lead time.

Discussion of these analyses should be one paragraph per analysis (eg Event Type, then Event Type by Borough, etc).

4. Tables of Borough and Category by Subcategory. Discuss the trends you see in the tables.

5. What about zip-codes? What zip-codes are more popular for filming? Produce appropriate numerical and visual summaries. You can plot a map (scatterplot) by using package zipcode. You even can plot a real map using ggmap and Google API but it is not free, so you don't need it for the assignment.

6. Conclusions (1-2 paragraphs is fine).

Attachment:- Assignment Files.rar

Reference no: EM132317752

Questions Cloud

Identify the comprehensive emergency management cycle : Describe Hazards and Disasters. Identify the Comprehensive Emergency Management Cycle. Threat Hazard Identification and Risk assessment (THIRA)
Definition of being employed part-time : Explain what would likely happen to the unemployment rate if the definition of being employed part-time was changed to require someone to have
What are the benefits of using gdp to measure an economy : What are the benefits of using GDP to measure an economy? Explain what you think about the GDP and why you believe it is the best measure for the economy.
Cash flow estimates profitability indices and cost of car : How does price discrimination by airlines affect NPV, IRR, Cash flow estimates Profitability indices and the cost of capital?
Summarising a collection of requested statistical analyses : For this assignment you will need to produce a report summarising a collection of requested statistical analyses and visualisations of the data
Explain the ways in which immigration can potentially : Explain the ways in which immigration can potentially decrease the earnings of native-born workers, and explain also how immigration
Develop components of the quality assurance : Develop components of the Quality Assurance documents discussed in lectures - Test Plans and Test Cases using a simulated industry case study
Define what factors contribute to the yearly incidence : The American Cancer Society (ACS) is a nationwide, community-based, voluntary health organization dedicated to eliminating cancer as a major health problem.
Program learning outcome for communication studies : The third program learning outcome (PLO) for the Communication Studies major at San Francisco State is that its majors can apply course content to personal,

Reviews

Write a Review

Applied Statistics Questions & Answers

  Re-draw the scatter plot and the least square line

STAT102: BUSINESS DATA ANALYSIS: FACTS FROM FIGURES Assignment, Australian Catholic University. Re-draw the scatter plot and the least square line without Year

  Find the mean, variance, and standard deviation

Find the mean, variance, and standard deviation of the binomial distribution with the given values of n and p n=127 p=0.58

  Develop a linear regression model

MBAC6031 Quantitative Methods Practice Final Exam. Develop a linear regression model that can be used to estimate the level of charitable contributions

  Standard deviation of the binomial distribution

Find the Mean, Variance, Standard Deviation of the binomial distribution with the given values of N and P

  What is the probability the service technician will have

A service call has just come in, but the type of malfunction is unknown. It is 3.00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine?

  Compute the value of the test statistic

Use Z or T test? And why? At α = 0.05, what is the rejection rule? Compute the value of the test statistic. What is the p-value. What is the hypothesis being tested in this problem? In the above ANOVA table, is the factor significant at the 5% level

  Basic question that underlies hypothesis testing

What is the basic question that underlies hypothesis testing - What is the new critical value you will use for this calculation?

  Specific motors manufactures three different car

Specific Motors manufactures three different car models, Model X, Model Y, and Model Z, withrespective net revenues of$1000, $3000, and $6000. Each Model X requires 40 labor-hours and 1 tonof steel to produce, each Model Y requires 65 labor-hours and..

  Create one observation for each year

Initialize each of the variables below to their current values, and use a DO LOOP to calculate their estimated values for the next ten years. For example, next year's wage expense will be this year's wage expense plus 6 percent of this year's amount;..

  Jessica will have won more games than susan

When Susan and Jessica play a card game, susan wins 60% of the time. if they play 9 games, what is the probability that jessica will have won more games than susan?

  Question 1nbspnbsp a large shipping company recorded the

question 1nbspnbsp a large shipping company recorded the number of tons shipped weekly across the pacific for 50

  Would you characterize the relationship as small

If the proportion of systematic variance to error variance is .08, would you characterize the relationship as small, medium, or large? What if the proportion were .72? .OO?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd