Discuss three phases of mapreduce framework

Assignment Help Other Subject
Reference no: EM133027768

MITS6005 Big Data - Victorian Institute of Technology

Question 1
An insurance company implementeda Big Data solution to store and process data and applications. The company collects structured and unstructured data about their customers and products.

a) Discuss three phases of MapReduce framework to determine the number of customer claims from database.

b) Mention at least two disadvantages of MapReduce and discuss an alternative solution to replaceMapReduce in Hadoop ecosystem for the above problem.

Question 2
Australia is currently leading the global mining section. One of the Australian mining companies is specialized in the extraction and processing of minerals. The company employs over 70000 workers worldwide. A HR and payroll system is used to store the data about the employees and manage their salaries and leaves. One analytical system is used to process satellite images about the mining lands. IoT sensors are used to monitor for any defect in the mining machineries to avoid downtime and operational hazards. You are hired to lead the IT team at the company and identify the appropriate IT solutions.

a) Discuss the database solution that is required for this company based on the data types? Justify your answer.

b) Based on different sources of the data, discuss one big data analytics opportunity from the case study and identify one technology from Hadoop ecosystem can be used to support this process.

Question 3
Big Data is defined by at least three dimensions known as 3Vs: volume, variety, and velocity. Hadoop cluster is a computational cluster designed for storing and analyzing huge amounts of unstructured data in distributed computing environment. A healthcare center is using Hadoop platform to store and process the data which are related to doctors, staff, and patients.

a) Discuss the advantages of using Hadoop cluster in health data applications. Provide examples in your explanation.

b) HDFS is a storage system in Hadoop platform. The healthcare center has a Hadoop cluster, and there is a file about appointments of size 812 MB stored in HDFS (Hadoop 2.x) using default block size configuration. Calculate the number of blocks that needs to be generated for the given size and find each block's size to be stored in the Hadoop.

Question 4

Australian Bank reported an increase in the number of fraudulent credit card transaction recently. Several big data solutions are implemented to store and process the data including data marts and data lakes. However, it is still challenging to detect the fraudulent activities at the right time.

a) Discuss the appropriate Big Data technology that can be used to process the data transactions without latencies?

b) Discuss one application with examples where real-time data processing is required.

Attachment:- Big Data.rar

Reference no: EM133027768

Questions Cloud

PSY 5993-41 Directed Research Assignment : PSY 5993-41 Directed Research Assignment Help and Solution, Evaluating Evidence in the Psychology of Culture, Race, and Ethnicity - University of Minnesota
Compute the adjusted cost : Compute the adjusted cost base of each component of the consideration that Ms. Bond has received from the corporation
What were the actual costs incurred : Your Company's flexible budget cost formula for indirect materials is $1,000 FC + $0.45 per unit of output. What were the actual costs incurred
What discount rate should peter pet foods use : Tim's Toys specializes in pet toys and has a WACC of 10.5 percent. What discount rate should Peter's Pet Foods use to evaluate this investment
Discuss three phases of mapreduce framework : Discuss three phases of MapReduce framework to determine the number of customer claims from database and discuss an alternative solution to replaceMapReduce
Find the price of molybdenum : If the discount rate is 16%, find the price of molybdenum above which it makes sense to do the investment, i.e. find the price at which the NPV is zero
What is the probability : What is the probability that none of the production facilities will be damaged by fire in any given year
What is the NPV of this project : One-year taxable bonds that have similar risk to the project are yielding 9%. Similar-risk municipal bonds are yielding 6%. What is the NPV of this project
How much would he have to repay : In 4 months he repaid $3,800 towards the loan, and in 7 months he repaid $6,600. How much would he have to repay his parents at the end of 13 months

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd