Reference no: EM133777054
Introduction to NoSQL and Survey of Big Data Analytics Software
Learning outcome 1: Critically analyze a business objective and design and implement a database solution using an established methodology.
Learning outcome 2: Extract meaningful information from data sets using appropriate tools and techniques.
Learning outcome 3: Apply data visualization techniques to enable insightful information analysis for business decision-making, taking account of appropriate legal, ethical, and professional issues related to data.
Task:
Part A
There is a common misconception that NoSQL databases cannot store relationships. However, this is not entirely true. NoSQL databases store relationships, but they do so differently from SQL databases. In fact, many people find it easier to model relationship data in NoSQL because related data doesn't have to be split between tables, unlike SQL databases. Answer the following questions:
Create a NoSQL database based on PSB Academy's organizational/administrative operations:
Include at least FIVE (5) collections in one database.
Create a relationship for the collections.
Part B
Hadoop is a popular framework for processing large volumes of data. It is widely used in the industry due to its ease of use. Hadoop offers simple and effective tools to deal with complex Big Data, making it a preferred choice for many users. Now, answer the following questions.
Choose Five (5) tools in Hadoop.
Explain their main functions and what are their usage (answer with diagrams).
Explain how they can be integrated into the Big Data frameworks.
Explain how these tools can make Big Data easier for computing applications (discuss the necessary user and system interface).
Explain with a historical review who the developers of the tool are.
Social media analytics is the process of collecting and analyzing data from social networks to understand trends and patterns. Now, answer the following questions.
Find Five (5) tools that could be used to perform social media analytics.
Explain their main functions and what are their usage (answer with diagrams).
Explain how they can help the selected industry to make more commercial profit.
Explain how these tools can make Big Data easier for computing applications (discuss the necessary user and system interface).
Explain with a historical review who the developers of the tool are.Coursework - Presentation Guidelines
The following is a suggestion for your presentation slide preparation. It would be best if you included the following slides in your presentation.
Title page.
CW2 project title, your name, student ID, date, etc.
Part A: NoSQL and MongoDB.
According to the CW2 brief, introduce the Five (5) collections and then explain how to establish their relationship using any aggregation framework.
Part B: Select 61 (Apache Hadoop) or B2 (Social Media Analytic Software) to be included in the presentation.
Start with an introduction, then explain each tool/software according to the CW2 brief. Also, B1 discusses how the tools are being integrated to build the Hadoop big data system, and B2 discusses how the tools help tech companies improve their revenue through data analytics.
Conclusion page.
State the project design experiences and technical achievements.
Referencing page.