Evaluation of currently available big data platforms

Reference no: EM133253944

System requirement analysis, design and development

Description of the assessment

The first part is a report evaluating currently available big data platforms, together with the implementation of a data warehouse on one of those platforms.

The second part is a big data analytics task that applies big data processing technologies and machine learning algorithms.

Assessment Content

Processing big data poses many challenges, commonly summarised as the 4Vs (volume, velocity, variety and veracity).

Assessment 1 (CW1) requires you to investigate at least three cloud platforms that support big data, with a demo example of a data warehouse implementation. The Python-SQL-based data warehouse implementation and the data used for the demo will be explained in the practical sessions. You should report your evaluation results for the platforms against at least eight criteria, following the evaluation guide. After evaluating the cloud infrastructure, you should be able to carry out data analysis tasks by processing a big dataset using Python (ideally PySpark). The dataset will be provided.
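As a rough illustration of what a Python-SQL data warehouse demo might look like, the sketch below builds a tiny star schema with Python's built-in sqlite3 module. It is not the implementation from the practical sessions: the table names (dim_product, dim_date, fact_sales), the columns and the sample rows are all hypothetical, and SQLite stands in for whichever cloud platform is chosen.

```python
import sqlite3


def build_demo_warehouse(conn):
    """Create a tiny star schema: one fact table and two dimension tables.

    All table names, columns and rows are hypothetical placeholders.
    """
    conn.executescript(
        """
        CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
        CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
        CREATE TABLE fact_sales (
            sale_id INTEGER PRIMARY KEY,
            product_id INTEGER REFERENCES dim_product(product_id),
            date_id INTEGER REFERENCES dim_date(date_id),
            amount REAL
        );
        """
    )
    conn.executemany(
        "INSERT INTO dim_product VALUES (?, ?)", [(1, "books"), (2, "games")]
    )
    conn.executemany(
        "INSERT INTO dim_date VALUES (?, ?, ?)", [(1, 2022, 1), (2, 2022, 2)]
    )
    conn.executemany(
        "INSERT INTO fact_sales (product_id, date_id, amount) VALUES (?, ?, ?)",
        [(1, 1, 10.0), (1, 2, 15.0), (2, 1, 20.0), (2, 2, 5.0)],
    )
    conn.commit()


def sales_by_category(conn):
    """Join the fact table to a dimension and aggregate, as a typical warehouse query does."""
    return conn.execute(
        """
        SELECT p.category, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY p.category
        ORDER BY p.category
        """
    ).fetchall()


conn = sqlite3.connect(":memory:")  # in-memory database, nothing is written to disk
build_demo_warehouse(conn)
print(sales_by_category(conn))
```

On a cloud platform the same fact/dimension layout and the same SQL query shape would usually carry over, with only the connection code changing.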

The suggested platforms are:
BigQuery (Google)
Azure (Microsoft)
Keboola
Red Hat OpenShift
Deepnote

Deliverables:
A report on the evaluation of big data cloud platforms and the data warehouse demo implementation process. The report should follow the structure below.

1. Introduction
The purpose and scope of the report:
• how many platforms you will evaluate
• the criteria for evaluation
• investigation methodology

2. Platform investigation
• Detailed report of evaluation on each platform according to the defined criteria.
• The comparison results
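One possible way to turn per-criterion evaluations into a comparison result is a weighted scoring matrix. The sketch below is only an illustration of that idea: the platform names, criteria, weights and scores are made-up placeholders, not evaluation results, and only three criteria are shown for brevity where the brief asks for at least eight.

```python
def weighted_scores(scores, weights):
    """Return the weighted average score per platform.

    scores:  {platform: {criterion: score}}
    weights: {criterion: weight}
    """
    total_weight = sum(weights.values())
    result = {}
    for platform, by_criterion in scores.items():
        weighted = sum(weights[c] * by_criterion[c] for c in weights)
        result[platform] = weighted / total_weight
    return result


# Hypothetical weights and 1-5 scores, for illustration only.
weights = {"cost": 2, "scalability": 3, "ease_of_use": 1}
scores = {
    "platform_a": {"cost": 3, "scalability": 5, "ease_of_use": 4},
    "platform_b": {"cost": 5, "scalability": 3, "ease_of_use": 2},
}

# Sort platforms from highest to lowest weighted score.
ranking = sorted(
    weighted_scores(scores, weights).items(), key=lambda kv: kv[1], reverse=True
)
for platform, score in ranking:
    print(f"{platform}: {score:.2f}")
```

Making the weights explicit keeps the ranking reproducible and lets the report justify why some criteria matter more than others.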

3. Big Data processing and analysis implementation

• Work on a big dataset, applying NoSQL, PySpark or similar techniques to carry out data analysis on one of the cloud platforms, or simulate this on your own PC. The analysis should include exploratory data analysis (EDA), classification and price prediction.
• Provide explanations and screenshots to support this section, with a critical discussion of why particular algorithms were selected for the work.
• The dataset will be provided. It is relatively large for assessment purposes, and you can download it from the module Blackboard.
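To make the EDA and price prediction steps concrete, here is a minimal plain-Python sketch: summary statistics for a column, followed by an ordinary least-squares fit of price on a single feature. It is a small-scale stand-in, not the expected PySpark solution, and it does not cover classification. The feature name (sizes) and all values are invented, since the provided dataset is not described here.

```python
from statistics import mean, median


def summarise(values):
    """Basic EDA summary for one numeric column."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "median": median(values),
    }


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical data: the price is exactly 3 times the size.
sizes = [50, 60, 80, 100]
prices = [150, 180, 240, 300]

print(summarise(prices))
slope, intercept = fit_line(sizes, prices)
predicted = slope * 70 + intercept  # predicted price for an unseen size of 70
print(slope, intercept, predicted)
```

With a genuinely large dataset the same two steps would normally be done with distributed tooling such as PySpark rather than in-memory lists, and with more than one feature.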

4. Conclusion
• Summary
• Discussion of experience (what you have learnt from the assessment)
• Future work (what can be improved)

Attachment:- BIG DATA ASSIGNMENT.rar
