Reference no: EM132931148 
                                                                               
                                       
5011CEM Big Data Programming Project - Coventry University
Assignment Brief 
Learning Outcome 1: DATA SCIENCE:work with (potentially large) datasets; using appropriatestorage technology; applying statistical analysis to drawmeaningful conclusions; and using modern machinelearning tools to discover hidden patterns.
Learning Outcome 2: PROFESSIONAL PRACTICE:understand professional practices of the modern ITindustry which include those technical (e.g. versioncontrol / automated testing) but also social, ethical &legal responsibilities.
Learning Outcome 3: TRANSFERABLE SKILLS:apply a wide variety of degree level transferable skillsincluding time management, team working, written andverbal presentation to both experts and non-experts, andcritical reflection on own and others work.
VIVA TASK
The VIVA will take the form of a submission of a recorded presentation of your work.
The recording should be an informal, meeting-likepresentation and should be considered as an opportunity to showcase your work. The aim is for you to present your work clearly and effectively to your client.
You are allowed 5 minutes to deliver your main content.You will then answer the questions below where you are allowed up to1 minute per answer. Poor timing will affect your grade.
VIVA Questions
Following the presentation of your work, please verbally answer the following questions.Keep your answers brief and concise and take account of the timing indicated for each.
1. You have tested your code using ozone (o3). We have many chemical species to analyse, how would you need to adapt your code to work with carbon monoxide (CO) for example.
2. If we wanted to analyse multiple chemical species at the same time, how would that affect our HPC requirements, e.g. number of processors?
3. One of our measuring instruments uses different text entries for errors, e.g. "Instrument Error", "Communication Error" as an error code, not NaN. How might you adapt your code to check and report errors?
Attachment:- Big Data Programming Project.rar