Reference no: EM133632130
LO1: Describe and discuss Cloud Computing concepts, service models, delivery models, security issues, representative applications, and business drivers influencing cloud adoption.
LO 2: Describe and discuss the practical and theoretical applications of big data and how it can be used to solve business problems, inform business decisions and/or create competitive advantage.
LO3: Identify different big data solution environments and compare their different features and benefits, and advantages and disadvantages.
LO4: Apply Map-Reduce techniques to a business problem involving big-data analytics.
LO5: Apply analytic techniques to a business problem involving high- dimensional big data.
The program language used in this assessment should be python. Data sets:
Traffic Crashes shows each crash that occured within city streets as reported in the electronic crash reporting system (E-Crash) at CPD. Citywide data are available starting September 2017.
Red Light Camera Violations reflects the daily number of red light camera violations recorded by the City of Chicago Red Light Program for each camera since 2014.
Speed Camera Violations reflects the daily number of speed camera violations recorded by each camera in Children's Safety Zones since 2014.
Historical Traffic Congestion Estimates estimates traffic congestion on Chicago's arterial streets in real-time by monitoring and analyzing GPS traces received from Chicago Transit Authority (CTA) buses.
Current Traffic Congestion Estimate shows current estimated speed for street segments covering 300 miles of arterial roads. Congestion estimates are produced every ten minutes.
The first four datasets can be exported in csv and bulk downloaded, and the last dataset gets real-time data through the Socrata Open Data API.
Research expectation:
Three layers of the Lambda Architecture are to be implemented for big data processing.
The Jupyter notebook is too be installed.
Necessary preprocessing of data should be applied
Adopt proper libraries for data vistualisation.
Detailed Submission Requirements
Your submission of data visualisation program should be in the form of .ipynb format.
A brief report (1500 words) with screenshots of data visualization should be submitted in .pdf format. Retain the formatting of the template (11 font, 2.5 margins, 1.5 spacing)
Use Harvard referencing including the reference list.