ST2195 Programming for data science Assignment

Reference no: EM133107806

ST2195 Programming for data science - University of London

Project:
The 2009 ASA Statistical Computing and Graphics Data Expo consisted of flight arrival and departure details for all commercial flights on major carriers within the USA, from October 1987 to April 2008. This is a large dataset: there are nearly 120 million records in total, and it takes up 1.6 gigabytes of space when compressed and 12 gigabytes when uncompressed.

Choose any subset of (at least two) consecutive years and any of the supplementary information provided by the Harvard Dataverse to answer the following questions using the principles and tools you have learned in this course:

1. When is the best time of day, day of the week, and time of year to fly to minimise delays?

2. Do older planes suffer more delays?

3. How does the number of people flying between different locations change over time?

4. Can you detect cascading failures as delays in one airport create delays in others?

5. Use the available variables to construct a model that predicts delays.
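As a starting point for question 1, the following is a minimal sketch of grouping flights by scheduled departure hour, day of week and month, then comparing mean arrival delays. It assumes the column names `Month`, `DayOfWeek`, `CRSDepTime` and `ArrDelay` and the `NA` missing-value marker used in the Data Expo files; the sample rows are made up for illustration and the helper `mean_delay_by` is hypothetical. A real analysis would read the yearly CSV files instead of the inline sample.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Made-up sample in the assumed shape of the Data Expo files (illustrative values only).
SAMPLE_CSV = """Year,Month,DayOfWeek,CRSDepTime,ArrDelay
2006,1,1,0630,-5
2006,1,1,0645,3
2006,1,5,1730,42
2006,7,5,1715,60
2006,7,3,1200,NA
2006,7,3,1210,10
"""


def mean_delay_by(rows, key_func):
    """Group rows by key_func(row) and return {key: mean ArrDelay}.

    Rows with a missing arrival delay ("" or "NA") are skipped.
    """
    groups = defaultdict(list)
    for row in rows:
        if row["ArrDelay"] in ("", "NA"):
            continue
        groups[key_func(row)].append(float(row["ArrDelay"]))
    return {key: mean(values) for key, values in sorted(groups.items())}


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# CRSDepTime is assumed to be in hhmm form, so integer division by 100 gives the hour.
by_hour = mean_delay_by(rows, lambda r: int(r["CRSDepTime"]) // 100)
by_weekday = mean_delay_by(rows, lambda r: int(r["DayOfWeek"]))
by_month = mean_delay_by(rows, lambda r: int(r["Month"]))

best_hour = min(by_hour, key=by_hour.get)
print("Mean arrival delay by hour:", by_hour)
print("Hour with the lowest mean delay:", best_hour)
```

The mean is only one possible summary; a median or the share of flights delayed beyond a threshold may be more robust to the long right tail that delay data usually has.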

All questions should be answered using both R and Python.

Your answers should be provided in a separate structured report of no more than 10 pages. The page limit excludes title, references and table of contents but includes graphics and tables. The report should be in PDF format and also contain adequate explanations for readers not familiar with programming. In addition to the report, you will also be asked to provide your R and Python code in RMarkdown and Jupyter notebooks respectively. All the relevant files will need to be submitted in the designated Atrio submission portal.

Each report should detail all steps you took starting from raw data up to the answer for each question. Any databases you set up, data wrangling/cleaning operations you carry out, and any modelling decisions you make should be clearly described in each structured report. Each report should also include any relevant graphics and tables as part of the answer.
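One way to make the database and cleaning steps easy to describe is to keep them in a short, explicit script. The sketch below loads a flights sample and a plane-data sample into an in-memory SQLite database, converts `NA` to `NULL`, and joins the two tables to relate plane age to mean arrival delay (relevant to question 2). The table names `ontime` and `planes`, the helper `load_table`, and the sample rows are hypothetical; the column names `TailNum`, `ArrDelay`, `tailnum` and `year` are assumed to match the Data Expo and supplementary files.

```python
import csv
import io
import sqlite3

# Made-up samples in the assumed shape of the flights and plane-data files.
FLIGHTS_CSV = """Year,TailNum,ArrDelay
2006,N100AA,12
2006,N100AA,NA
2006,N200BB,-3
2007,N200BB,25
"""
PLANES_CSV = """tailnum,year
N100AA,1990
N200BB,2004
"""


def load_table(conn, table, text, columns):
    """Insert CSV rows into an existing table, storing "" and "NA" as NULL."""
    placeholders = ", ".join("?" for _ in columns)
    rows = [
        tuple(None if rec[c] in ("", "NA") else rec[c] for c in columns)
        for rec in csv.DictReader(io.StringIO(text))
    ]
    conn.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ontime (Year INTEGER, TailNum TEXT, ArrDelay REAL)")
conn.execute("CREATE TABLE planes (tailnum TEXT, year INTEGER)")
load_table(conn, "ontime", FLIGHTS_CSV, ["Year", "TailNum", "ArrDelay"])
load_table(conn, "planes", PLANES_CSV, ["tailnum", "year"])

# Plane age is the flight year minus the manufacture year; AVG and COUNT ignore NULL delays.
query = """
SELECT o.Year - p.year AS plane_age,
       AVG(o.ArrDelay) AS mean_delay,
       COUNT(o.ArrDelay) AS n
FROM ontime AS o
JOIN planes AS p ON o.TailNum = p.tailnum
GROUP BY plane_age
ORDER BY plane_age
"""
age_rows = conn.execute(query).fetchall()
for plane_age, mean_delay, n in age_rows:
    print(plane_age, mean_delay, n)
```

With the full dataset, a file-backed database (a path instead of `":memory:"`) would avoid reloading the CSV files for every question, and the same queries could be run from R against the same file.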

If you are using elements (e.g. code, databases, graphics, etc) from your answer to a previous question to answer the current one, you will need to refer to those elements.

You should also supply the code you used to answer each question, in a way that can be used by someone else to replicate your analyses. You can do this either as separate scripts or separate RMarkdown/Jupyter notebooks per question, clearly indicating (both with comments and in the filename) which question each script refers to.

Attachment:- Programming for data science.rar






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd