CMM524 Advanced Data Management Assignment

Assignment Help Database Management System
Reference no: EM132715957

CMM524 Advanced Data Management - Robert Gordon University

Learning outcome 1: Identify, handle and manipulate structured and unstructured data using modern databases.
Learning outcome 2: Efficiently handle and manipulate large datasets.
Learning outcome 3: Identify and implement appropriate data management techniques.
Learning outcome 4: Apply analysis techniques to extract knowledge from data.

1 Aim

This coursework examines the student's ability to design a relational database, manipulate and analyse large datasets, and to interpret analysis results.

2 Coursework Contribution

This coursework contributes 100% to the final module grade. For the weightings of different parts, please see the separate marking grid document.

3 The Tasks

3.1 Part 1: Designing a Relational Database

Little Panda is a takeaway who wants to open up its business to accept online orders. To do this, it needs a database to store its food menu, customer data and orders.

Your task is to design a relational database that runs on MySQL. Here are some requirements:

• Customers must register before they can make orders. They must provide enough details for home delivery.
• Menu item prices may change. Customers are charged prices at the time of order.
• Little Panda needs to know the status of an order so that they can follow up.
e.g. either it is "waiting to be cooked", "cooked and to be delivered", or delivered, etc. You can assume all orders are paid before they enter the system.
• Order details must be stored for accounting purpose, even after they are completed.

3.2 Part 2: Analysing the "UN City Population" Dataset

You are given the "UN city population" dataset. Perform the following analysis using Pig:

Question 1: Find the number of countries in the dataset.
Question 2: List the countries together with the number of cities in each country1.
Question 3: List countries in ascending order of female-to-male ratio, throughout the years2.
Question 4: List the top 10 most populated cities according to the most recent data in the dataset3.
Question 5: List the top 10 cities which have the highest population change per year in percentage since the start of the survey4.

Notes:
• You must use Pig.
• Annotate your program code properly so that the marker can understand how it works. The annotation also contributes to the grade.
• State any assumption that you made.
• If you cannot complete a task, an incomplete solution may also bring you partial credit.

3.3 Part 3: Analysing Datasets of Your Choice

In this part of the coursework you need to:
• Find a dataset, or multiple datasets.
? Dataset(s) must be public domain and of a considerable size.
? A dataset cannot be too small. e.g. Just a few lines.
? There is no need to go for a GB or TB-sized dataset unless the dataset is very interesting.
? DO NOT choose a dataset similar to the one in part 2.
• Propose 3 analysis tasks that you will perform on the dataset(s).
? Your proposed analyses should be insightful. e.g. give useful information for decision making.
? You may combine multiple datasets for some interesting analyses.
? DO NOT propose tasks similar to those in part 2.
• Implement the 3 proposed analyses using Pig.
• For each analysis, interpret the result.

Attachment:- Advanced Data Management.rar

Reference no: EM132715957

Questions Cloud

Determine the energy needed to change : Determine the energy needed to change a 0.85 kg block of ice at 0 °C into water at 20°C. The heat of fusion for water at 0 °C is Lf = 3.35E5 J/kg.
What is the total mass of water vapor : What is the total mass (in kg) of water vapor in the air in the apartment?
Where do you think data systems will be in five years : Throughout the semester, you learned about different database types and even studied a variety of database platform. Where do you think data systems will be in.
Compute what is cost of goods sold for julia and company : Julia & Company, March 1 Beginning inventory = 29 units @ $5.90. What is the cost of goods sold for Julia & Company assuming it uses LIFO?
CMM524 Advanced Data Management Assignment : CMM524 Advanced Data Management Assignment Help and Solution, Robert Gordon University - Assessment Writing Service
Find the company inventory turnover ratio equals : Find the company's inventory turnover ratio equals. A company's sales equal $60,000 and cost of goods sold equals $20,000. Its beginning inventory was $1,600
Differentiate between lazy and eager update propagation : Briefly differentiate between lazy and (immediate) eager update propagation. How can they be compared to delayed-write and write-through policy in.
Determine the change in thermal energy of the iron : A 500-g aluminum container holds 300 g of water. The water and aluminum are initially at 40°C. A 200-g iron block at 0°C is added to the water
Compute what anthony average days in inventory is : Compute What Anthony's average days in inventory is? Average inventory 50,000. Anthony Corporation reported the amounts for the year

Reviews

Write a Review

Database Management System Questions & Answers

  Create a header in the header field labeled database

Create an automatic sort by applying a filter to each column so your data can be viewed in different ways.

  How the proposed er model could accommodate

Relational Database Systems (COMP 1005) Design the logical structure of a database using Entity-Relationship diagram and Apply normalization techniques to reduce redundancy in a database - Normalize the below given Patient Prescription Bill to First ..

  Explain how volvo car corporation transforms data

Judge how Volvo Car Corporation integrated the cloud infrastructure into its networks. Explain how Volvo Car Corporation transforms data into knowledge

  Why big data has become a hot topic in the business world

Read the article Big Data basic concepts and benefits explained. Focus on why this has become a hot topic in the business world. Describe why big data has become a hot topic in the business world.

  Explain what problems can occur in a distributed database

Explain what problems can occur in a distributed database that is partitioned but not replicated.

  Advantages and disadvantages of joins and nested queries

One side effect of normalization is that you often need more than one table to get meaningful results. For example, you may have a table that includes a list of parts and the ID number for the vendor of each part. T

  Each technician is an expert on one or more plane model

Each technician is an expert on one or more plane model(s), and his or her exper-tise may overlap with that of other technicians. This information about technicians must also be recorded.

  Entity relationship in a physical relational database design

How would you represent a one-to-many strong entity relationship in a physical relational database design?

  Explain and support the database schema

Explain and support the database schema with relevant arguments that support the rationale for the structure. Note: The minimum requirement for the schema.

  Database requirements for operational data

Outline the main differences between database requirements for operational data and for decision support data. Describe three (3) examples in which databases could be used to support decision making in a large organizational environment

  Design a high-level conceptual view of a data warehouse

Design a high-level conceptual view of a data warehouse using Microsoft Visio that shows the following: Integration layers and The data warehouse

  Describe what entity represents in an er mode

Describe what entity represents in an ER mode and provide examples of entities with a physical or conceptual existence. Describe how strong and weak entities differ and provide an example of each.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd