CST4070 Applied Data Analytics Assignment

Reference no: EM132429943

CST4070 Applied Data Analytics - Tools, Practical Big Data Handling, Cloud Distribution - Middlesex University

The challenge

Santander Cycles (formerly Barclays Cycle Hire) is a public bicycle hire scheme in London. The scheme's bicycles are popularly known as Boris Bikes, after then-Mayor of London, Boris Johnson. The operation of the scheme has been contracted by Transport for London (TfL). The recent success of the scheme has led to its expansion into many areas of London and its rapid growth has led to real challenges balancing bike sharing supply with bike sharing demand.

Predicting bike sharing usage is critical to solving this problem. To this end, Transport for London (TfL) has released three datasets, named bike_journeys, bike_stations and London_census, whose structure is described in the next section.

As part of the challenge, you need to define and implement a data science method able to predict the total number of bikes rented at each bike station in each one-hour time slot, so as to help TfL balance bike sharing supply with bike sharing demand.

Special attention must be paid to the interpretation of the final adopted model, to understand which factors are associated with high or low demand for rented bikes in London, such as population composition from census data, weekends, peak hours, and so forth.
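As an illustration of the calendar factors mentioned above, the following minimal sketch derives weekend and peak-hour indicators for a single station-hour slot. The function name calendar_features and the PEAK_HOURS set are assumptions made for this example; the brief does not define which hours count as peak.

```python
from datetime import date

# Hypothetical peak windows (an assumption, not given in the brief).
PEAK_HOURS = {7, 8, 9, 16, 17, 18}


def calendar_features(year, month, day, hour):
    """Return simple calendar features for one station-hour slot."""
    weekday = date(year, month, day).weekday()  # Monday = 0, Sunday = 6
    return {
        "weekday": weekday,
        "is_weekend": weekday >= 5,
        "is_peak_hour": hour in PEAK_HOURS,
    }


# 5 August 2017 was a Saturday.
features = calendar_features(2017, 8, 5, 8)
```

Indicators of this kind can be fed to an interpretable model, where their coefficients or importances suggest how strongly weekends and peak hours are associated with demand.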

Data

Three datasets are available, with the following structure.

bike_journeys data:

Journey_Duration: duration of the bike journey in seconds.
Journey_ID: the id of the journey.
End_Date: a numeric field indicating the day of the month when the journey terminated (e.g., 1, 2, ..., 30, 31).
End_Month: a numeric field indicating the month when the journey terminated (e.g., 1, 2, ..., 11, 12).
End_Year: a numeric field indicating the year when the journey terminated (e.g., 2017).
End_Hour: a numeric field indicating the hour when the journey terminated (e.g., 1, 2, ..., 23, 24).
End_Minute: a numeric field indicating the minute when the journey terminated (e.g., 1, 2, ..., 59, 60).
EndStationID: the id of the station where the journey terminated.
Start_Date: a numeric field indicating the day of the month when the journey started (e.g., 1, 2, ..., 30, 31).
Start_Month: a numeric field indicating the month when the journey started (e.g., 1, 2, ..., 11, 12).
Start_Year: a numeric field indicating the year when the journey started (e.g., 2017).
Start_Hour: a numeric field indicating the hour when the journey started (e.g., 1, 2, ..., 23, 24).
Start_Minute: a numeric field indicating the minute when the journey started (e.g., 1, 2, ..., 59, 60).
StartStationID: the id of the station where the journey started.
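The prediction target can be built from these fields. The sketch below counts journeys per station and one-hour slot, assuming that a bike counts as "rented" at the station and hour where the journey started (the brief does not state this explicitly). The function name hourly_rentals and the sample rows are hypothetical, and the rows are assumed to be already parsed into dictionaries of integers.

```python
from collections import Counter


def hourly_rentals(journeys):
    """Count journeys started per (station, year, month, day, hour)."""
    counts = Counter()
    for j in journeys:
        key = (
            j["StartStationID"],
            j["Start_Year"],
            j["Start_Month"],
            j["Start_Date"],
            j["Start_Hour"],
        )
        counts[key] += 1
    return counts


# Made-up rows for illustration only; not taken from the real dataset.
sample = [
    {"StartStationID": 14, "Start_Year": 2017, "Start_Month": 8,
     "Start_Date": 1, "Start_Hour": 8},
    {"StartStationID": 14, "Start_Year": 2017, "Start_Month": 8,
     "Start_Date": 1, "Start_Hour": 8},
    {"StartStationID": 14, "Start_Year": 2017, "Start_Month": 8,
     "Start_Date": 1, "Start_Hour": 9},
    {"StartStationID": 27, "Start_Year": 2017, "Start_Month": 8,
     "Start_Date": 1, "Start_Hour": 8},
]

rentals = hourly_rentals(sample)
```

Station-hour slots with no journeys do not appear in the counter, so a full modelling table would also need explicit zero rows for those slots.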

bike_stations data:

Station_ID: the id of a bike station.
Capacity: a numeric value indicating the maximum capacity of bikes of the station.
Latitude: the latitude where the station is located.
Longitude: the longitude where the station is located.
Station_Name: a string indicating the name of the station (e.g., "River Street , Clerkenwell", "Phillimore Gardens, Kensington").

London Census data:

WardCode: geographical unit of analysis for the census data. It is a code corresponding to an electoral London area.
WardName: name of the corresponding electoral London area.
Borough: London borough to which the ward belongs.
NESW: whether the ward is located in the north, east, south or west part of London.
AreaSqKm: square kilometres associated with the corresponding ward.
lon, lat: coordinates (longitude, latitude) associated with the centre of the ward.
IncomeScor: proportion of the population experiencing deprivation relating to low income. The more deprived an area is, the higher the score.
LivingEnSc: quality of the local environment. The more deprived an area is, the higher the score.
NoEmployee: number of people having an occupation.
GrenSpace: percentage of green space associated with the ward.
PopDen: population divided by the surface of the ward area.
BornUK: total number of people who were born in the UK.
NotBornUK: total number of people who were not born in the UK.
NoCTFtoH: number of properties in council tax bands F to H (the bands with the highest median house price).
NoDwelling: number of properties in each ward.
NoFlats: number of flats in each ward.
NoHouses: number of houses in each ward.
NoOwndDwel: number of owned properties in each ward.
MedHPrice: median house price.
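The census data is keyed by ward, while journeys are keyed by station, so the two need to be linked before census variables can be used as predictors. One possible approach, sketched below, assigns each station to the ward whose centre (lat, lon) is closest by great-circle distance. This is an assumption rather than a method prescribed by the brief: the nearest ward centre is not guaranteed to be the ward that actually contains the station. The function names and the ward codes "W_A" and "W_B" are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_ward(station, wards):
    """Return the ward whose centre is closest to the station."""
    return min(
        wards,
        key=lambda w: haversine_km(
            station["Latitude"], station["Longitude"], w["lat"], w["lon"]
        ),
    )


# Made-up coordinates and ward codes for illustration only.
station = {"Station_ID": 1, "Latitude": 51.53, "Longitude": -0.11}
wards = [
    {"WardCode": "W_A", "lat": 51.53, "lon": -0.10},
    {"WardCode": "W_B", "lat": 51.50, "lon": -0.19},
]

match = nearest_ward(station, wards)
```

If ward boundary polygons are available, a point-in-polygon lookup would give an exact assignment instead of this approximation.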

Attachment:- Applied Data Analytics.rar





All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd