CST4070 Applied Data Analytics Assignment

Assignment Help Other Subject
Reference no: EM132429943

CST4070 Applied Data Analytics - Tools, Practical Big Data Handling, Cloud Distribution - Middlesex University

The challenge

Santander Cycles (formerly Barclays Cycle Hire) is a public bicycle hire scheme in London. The scheme's bicycles are popularly known as Boris Bikes, after then-Mayor of London, Boris Johnson. The operation of the scheme has been contracted by Transport for London (TfL). The recent success of the scheme has led to its expansion into many areas of London and its rapid growth has led to real challenges balancing bike sharing supply with bike sharing demand.

To provide a possible solution to this problem, bike sharing usage prediction is critical. To this purpose, the Transport for London (TfL), has released three dataset - named bike_journeys, bike_stations and London_census- whose structure is illustrated in the next section.

As part of the challenge, you need to define and implement a data science method able to predict the total number of bikes rented in each bike station with the temporal granularity of one hour time slot, so to help TfL to balance bike sharing supply with bike sharing demand.

Special attention must be paid to the interpretation of the final adopted model, to understand which factors are associated with an high/low demand of rented bikes in London, such as population composition from census data, weekends, peak hours, and so forth.

Data

You have three dataset available, which have the following structure.

bike_journeys data:

Journey_Duration: duration of the bike journey in seconds.
Journey_ID: the id of the journey.
End_Date: a numeric field indicating the day of the month when the journey terminated (e.g., 1, 2, ..., 30, 31).
End_Month: a numeric field indicating the month when the journey terminated (e.g., 1, 2, ..., 11, 12).
End_Year: a numeric field indicating the year when the journey terminated (e.g., 2017).
End_Hour: a numeric field indicating the hour when the journey terminated (e.g., 1, 2, ..., 23, 24). End_Minute: a numeric field indicating the minute when the journey terminated (e.g., 1, 2, ..., 59, 60). EndStationID: the id of the station where the journey terminated.
Start_Date: a numeric field indicating the day of the month when the journey started (e.g., 1, 2, ..., 30, 31).
Start_Month: a numeric field indicating the month when the journey stated (e.g., 1, 2, ..., 11, 12).
Start_Year: a numeric field indicating the year when the journey started (e.g., 2017).
Start_Hour: a numeric field indicating the hour when the journey started (e.g., 1, 2, ..., 23, 24). Start_Minute: a numeric field indicating the minute when the journey started (e.g., 1, 2, ..., 59, 60). StartStationID: the id of the station where the journey started.

bike_stations data:

Station_ID: the id of a bike station.
Capacity: a numeric value indicating the maximum capacity of bikes of the station.
Latitude: the latitude where the station is located.
Longitude: the longitude where the station is located.
Station_Name: a string indicating the name of the station (e.g., "River Street , Clerkenwell", "Phillimore Gardens, Kensington").

London Census data:

WardCode: geographical unit of analysis for the census data. It is a code corresponding to an electoral London area.
WardName: name of the corresponding electoral London area.
Borough: London Borough to which the ward corresponds to.
NESW: whether the ward is located in the north, south, west, east part of London.
AreaSqKm: square kilometres associated with the corresponding ward.
lon, lat: coordinates (longitude, latitude) associated with the centre of the ward.
IncomeScor: proportion of the population experiencing deprivation relating to low income. The more deprived is an area, the higher the score.

LivingEnSc: quality of the local environment. The more deprived is an area, the higher the score.
NoEmployee: number of people having an occupation. GrenSpace: percentage of green space associated with the ward. PopDen: population divided by the surface of the ward area.
BornUK: total number of people who were born in the UK.
NotBornUK: total number of people who were not born in the UK.
NoCTFtoH: number of properties in council tax band F-H (the highest median house price)
NoDwelling: number of properties in each ward.
NoFlats: number of flats in each ward.
NoHouses: number of houses in each ward. NoOwndDwel: number of owned properties in each ward. MedHPrice: median house price.

Attachment:- Applied Data Analytics.rar

Reference no: EM132429943

Questions Cloud

Health care and the constitution : A description of what medicine and health care consisted of in the decade the constitution was written 1780's-1790's
Discuss the support for the theory and critiques of theory : Discuss the support for the theory and critiques of the theory. Identify and explain any additional perspectives
Analyze the impact of each role on police administration : Write a brief description of two roles that a forensic psychology professional may have when working with police administrators. Then, analyze the impact.
The stigmas of mental illness and substance abuse assignment : The stigmas of mental illness and substance abuse Assignment help and solutions:- What can be done in the future from a national policy perspective
CST4070 Applied Data Analytics Assignment : CST4070 Applied Data Analytics Tools, Practical Big Data Handling, Cloud Distribution Assignment Help and Solution, Middlesex University - Assessment Writing
What purpose does performance appraisal serve : What purpose does a performance appraisal serve? What are some key ideas to remember when conducting a performance appraisal?
Taxing of four different types of organizations : Explain the differences in taxing of four different types of organizations
Define three ways of sociologically : Describe the factors that have caused you to view the world through that perspective, such as personal experience in our society, popular culture, media
Evaluate the impact of diversity training : Evaluate the impact of diversity training by forensic psychology professionals-s pecifically, respond to the difference it can make and evaluate its value.

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd