Design and develop advanced big data applications

Reference no: EM132741330, Length: 2500 words

KF7032 Big Data and Cloud Computing - Northumbria University

Aims

The aim of this assignment is to introduce a practical application of Big Data and Cloud Computing using a realistic big data problem. Students will implement a solution using an industry leading Cloud computing provider together with the distributed processing environment Apache Spark. This will involve the selection of problem appropriate Machine Learning algorithms and methods.

Learning Outcome 1: Apply big data analytic algorithms, including those for visualization, and cloud computing techniques to multi-terabyte datasets.

Learning Outcome 2: Critically assess data analytic and machine learning algorithms to identify those that satisfy given big data problem requirements.

Learning Outcome 3: Critically evaluate and select appropriate big data analytic algorithms to solve a given problem, considering the processing time available and other aspects of the problem.

Learning Outcome 4: Design and develop advanced big data applications that integrate with third party cloud computing services.

Personal Values Attributes (Global / Cultural awareness, Ethics, Curiosity) (PVA):

Learning Outcome 5: Critically assess the relationship between knowledge and the ethical and social interpretation of primary research using big data.

Definitions

Portfolio Assignment: A collection of pieces of work

Individual Work: Work carried out by one person only

Group Work: Work carried out collaboratively seeking to improve each other's elements

Peer Review: Critical analysis and subsequent grading of a social equal's work

Semi-Formative: Training tasks assigned course credit to reward and ensure engagement.

Big Data Product: Weapons and Drugs

In the television documentary "Ross Kemp and the Armed Police" broadcast 6th September 2018 by ITV, multiple claims were made regarding violent crime in the UK.

These claims were:
1. Violent Crime is increasing
2. There are more firearms incidents per head in Birmingham than anywhere else in the UK
3. Crimes involving firearms are closely associated with drugs offences
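
Each of the three claims can be reduced to a quantity that the data either supports or does not. The following is a minimal Python sketch, not a prescribed method. It assumes that monthly incident counts have already been aggregated, and that a population figure per area is available from some source outside the datasets named in this brief. The function names (trend_slope, rate_per_100k, pearson) are illustrative only.

```python
from math import sqrt


def trend_slope(counts):
    """Least-squares slope of counts against their position (e.g. month index).

    A positive value suggests an upward trend (claim 1).
    """
    n = len(counts)
    mean_x = (n - 1) / 2
    mean_y = sum(counts) / n
    sxy = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(counts))
    sxx = sum((i - mean_x) ** 2 for i in range(n))
    return sxy / sxx


def rate_per_100k(incidents, population):
    """Incidents per 100,000 residents, for comparing areas per head (claim 2).

    The population figure is an assumed external input.
    """
    return 100_000 * incidents / population


def pearson(xs, ys):
    """Pearson correlation between two equal-length series (claim 3)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)
```

A positive slope alone does not show that an increase is statistically significant, and a high correlation does not show that two offence types are causally linked. Both points belong in the critical discussion of the methods.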

In this assignment you will investigate these claims using real, publicly available data sets that will be made available to you and placed in Amazon S3. These include, but are not limited to:

1. Street Level Crime Data published by the UK Home Office. This dataset contains 19 million rows, each giving a crime type together with its location as a latitude and longitude.

2. Land Registry Price Paid Data: This gives the postcode of a property, the property type from an enumeration of D (Detached), S (Semi-Detached), T (Terraced), F (Flats/Maisonettes) and the price paid.

3. Postcode Data: This data set is based on material provided by the Ordnance Survey. It gives a latitude and longitude for every postcode. This is useful because it links the postcodes in the Land Registry Price Paid dataset to the latitude/longitude locations in the crime dataset.
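
Because the crime data is located by latitude and longitude while the price paid data is located by postcode, one way to link the two is to assign each crime to its nearest postcode. The sketch below shows that idea in plain Python using the haversine great-circle distance. The function names and the (postcode, latitude, longitude) tuple layout are assumptions made for illustration, not the format of the supplied files.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = radians(lat1), radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = radians(lon2 - lon1)
    a = sin(d_phi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(d_lambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def nearest_postcode(lat, lon, postcodes):
    """Return the label of the postcode closest to (lat, lon).

    postcodes is an iterable of (postcode, latitude, longitude) tuples;
    this layout is hypothetical.
    """
    closest = min(postcodes, key=lambda p: haversine_km(lat, lon, p[1], p[2]))
    return closest[0]


# Made-up labels and coordinates, purely for illustration.
sample = [("PC_A", 52.48, -1.90), ("PC_B", 54.98, -1.61)]
print(nearest_postcode(52.50, -1.90, sample))
```

This brute-force search compares every crime with every postcode, which is unlikely to scale to 19 million rows. On the full data a coarser approach, such as joining on rounded coordinates or grid cells in Spark, would probably be needed, and the approximation it introduces is worth noting as a limitation.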

Specifics

1. Process the data prepared for you using Apache Spark.

2. Filter the dataset so that crimes refer to relevant events only.

3. Using appropriate visualization methods, statistics, and machine learning, determine whether the claims made by Ross Kemp were true, false, or could not be determined.

4. Explain the reasoning behind your code so that it is clear what each block is intended to achieve, and why.

5. Report critically on the advantages, disadvantages, and limitations of the methods used.

6. Your submission will be a Jupyter Notebook containing both code (typically Python) and explanatory text (Markdown) limited to 2500 words (plus references).
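
Steps 1 and 2 above might be sketched in PySpark as follows. This is an outline under stated assumptions rather than the required solution: the column names ("Crime type", "Month"), the category labels in RELEVANT_CRIME_TYPES, and the function names are all assumed and should be checked against the distinct values in the data actually supplied. The S3 location is passed in as a parameter because the brief does not give it.

```python
# Category labels are assumptions; verify them against the real data.
RELEVANT_CRIME_TYPES = [
    "Violence and sexual offences",
    "Possession of weapons",
    "Drugs",
]


def load_relevant_crimes(spark, path):
    """Read the street-level crime CSV files and keep only relevant crime types.

    spark is an existing SparkSession; path is the location of the CSV data
    (for example an S3 URI supplied with the assignment).
    """
    # Imported here so the module can be loaded and inspected
    # on a machine without a Spark installation.
    from pyspark.sql import functions as F

    crimes = spark.read.csv(path, header=True, inferSchema=True)
    return crimes.filter(F.col("Crime type").isin(RELEVANT_CRIME_TYPES))


def monthly_counts(crimes):
    """Count incidents per month and crime type, ordered by month."""
    return crimes.groupBy("Month", "Crime type").count().orderBy("Month")
```

In a notebook, the result of monthly_counts is small enough to convert with toPandas() for plotting, whereas the filtered row-level data is better kept in Spark. Which categories count as "relevant" is itself a modelling decision that should be justified in the explanatory text.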




