Set up a searchable database

Reference no: EM132619096

Option - Records Case Study

You were hired as a Big Data Analyst for a large, 50-year-old UMUC academic records system. Most of the academic records are stored in ASCII-based text files. The files are stored on high-volume hard disks, optical discs, CDs, and DVDs. Three sample records from a file are shown below:

Record 1
Program: Information Technology, Specialization: Database Systems, Course: DBST651 Grade: A, Course: ITEC630 Grade B, course: DBST667 Grade: A, instructor 1: James Smith (DBST651), instructor 2: Jennifer Lopez (DBST651), instructor 3: Jennifer Lopez (ITEC630), instructor 4: Catharine Murphy (DBST667), student name: Yelena Bytenskaya, EmplD: 123456 , User Name : ybytensk, instructor id: 234567, graduated: Yes

Record 2
Program: Data Analytics, Course: DATA610 Grade: B, Course: DATA620 Grade: A, Course: DATA630 Grade: C, Course: DATA630 Grade: A, Course: DATA640 Grade: B, Course: DATA650 Grade: A, Course: DATA670 Grade: B, instructor 1: Steve Knode (DATA610), instructor2: Caroline Beam (DATA620), instructor 3: BatiFirdu (DATA630), instructor 4: Elena Gortcheva (DATA650) , instructor 5: Ozan Ozcan (DATA650) instructor 7: Jon McKeeby (DATA670), instructor 8: Steve Knode (DATA670), instructor 9: Steve Knode (DATA640), instructor 10: TA Yelena Bytenskaya (DATA650), student name: Linesh Dave, EmlID:567890, user name: ldave, instructor id: 567907, graduated: Yes

Record 3
Program: Information Technology, Specialization: Database Systems, Specialization: Project Management, Specialization: Software Engineering, course: DBST651 grade: F, course: DBST651 grade: B, course: ITEC610 grade: B, course: ITEC620 grade: A , course PMAN634 grade: C, instructor 1: Brandon Morris (ITEC 610), instructor 2: Elena Gortcheva (ITEC620), Instructor 3: James Green (DBST651), Instructor4: TA Yelena Bytenskaya (DBST651), student name: Jeff Martin, emplID: 987654, user name: jmartin, graduated: No

You are given the following information about the data (metadata):
• A student is enrolled in a program.
• Some programs may offer specializations. A student enrolled in a program that offers specializations may choose one or more specializations.
• New specializations could be added to a program. If a new specialization is added to a program that the student is enrolled in, the student may choose that specialization.
• A student takes multiple classes and receives the final letter grade in each class.
• A class session may have multiple instructors. A student may take multiple classes with the same instructor.
• A student who graduated may become an instructor.
• A student could have multiple IDs (a 6-digit emplid, a username for accessing the online classroom and academic records, and a faculty ID if the student is hired as an instructor).
• If a student repeats a class, the grade received on the last attempt overwrites the grades received on prior attempts for GPA calculation and on the transcript. However, the system should track all attempts for academic advising purposes.
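
The repeat rule in the last bullet can be illustrated with a minimal Python sketch. It makes several assumptions the case study does not state: a 4.0 grade-point scale, equal credit for every course, and that the order of course entries in a record is the order of the attempts. The function names are hypothetical.

```python
from collections import defaultdict

# Assumed 4.0 grade-point scale; the case study does not specify one.
GRADE_POINTS = {"A": 4.0, "B": 3.0, "C": 2.0, "D": 1.0, "F": 0.0}


def attempts_by_course(attempts):
    """Group (course, grade) pairs by course, preserving attempt order.

    This is the full history that academic advising would need.
    """
    history = defaultdict(list)
    for course, grade in attempts:
        history[course].append(grade)
    return dict(history)


def transcript(attempts):
    """Keep only the last grade per course, as the transcript rule requires."""
    history = attempts_by_course(attempts)
    return {course: grades[-1] for course, grades in history.items()}


def gpa(attempts):
    """Unweighted GPA over last attempts (assumes equal credit per course)."""
    final = transcript(attempts)
    if not final:
        return None
    return sum(GRADE_POINTS[g] for g in final.values()) / len(final)


# Course entries of Record 3, in the order they appear in the file.
record3 = [("DBST651", "F"), ("DBST651", "B"), ("ITEC610", "B"),
           ("ITEC620", "A"), ("PMAN634", "C")]

print(attempts_by_course(record3)["DBST651"])  # both attempts are kept
print(transcript(record3)["DBST651"])          # only the last attempt counts
print(gpa(record3))
```

Storing every attempt and deriving the transcript view from it keeps both requirements satisfied without overwriting any data.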

Your task: Theoretically set up a searchable database that can flexibly accommodate all the above requirements and can contain records of several hundred million books in many languages.
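
One possible way to model the metadata is sketched below. This is not a prescribed solution: it uses Python's built-in `sqlite3` module with an in-memory database purely as a small stand-in for whatever Big Data store is chosen. All table names, column names, and helper functions are hypothetical, and "Cybersecurity" is an invented specialization used only for illustration. Students and instructors share one `person` table so that a graduate can later receive an instructor ID.

```python
import sqlite3

SCHEMA = """
CREATE TABLE program (
    program_id INTEGER PRIMARY KEY,
    name TEXT UNIQUE NOT NULL
);
CREATE TABLE specialization (
    spec_id INTEGER PRIMARY KEY,
    program_id INTEGER NOT NULL REFERENCES program(program_id),
    name TEXT NOT NULL,
    UNIQUE (program_id, name)
);
CREATE TABLE person (
    person_id INTEGER PRIMARY KEY,
    full_name TEXT NOT NULL,
    empl_id TEXT UNIQUE,        -- 6-digit emplid
    user_name TEXT UNIQUE,
    instructor_id TEXT UNIQUE,  -- set only if hired as an instructor
    program_id INTEGER REFERENCES program(program_id),
    graduated INTEGER
);
CREATE TABLE student_specialization (
    person_id INTEGER NOT NULL REFERENCES person(person_id),
    spec_id INTEGER NOT NULL REFERENCES specialization(spec_id),
    PRIMARY KEY (person_id, spec_id)
);
CREATE TABLE attempt (          -- one row per attempt, never overwritten
    attempt_id INTEGER PRIMARY KEY,
    person_id INTEGER NOT NULL REFERENCES person(person_id),
    course TEXT NOT NULL,
    attempt_no INTEGER NOT NULL,
    grade TEXT NOT NULL,
    UNIQUE (person_id, course, attempt_no)
);
CREATE TABLE attempt_instructor (  -- a class may have several instructors
    attempt_id INTEGER NOT NULL REFERENCES attempt(attempt_id),
    person_id INTEGER NOT NULL REFERENCES person(person_id),
    PRIMARY KEY (attempt_id, person_id)
);
"""


def add_specialization(conn, program, spec):
    """Offer a new specialization under an existing program."""
    conn.execute(
        "INSERT INTO specialization (program_id, name) "
        "SELECT program_id, ? FROM program WHERE name = ?",
        (spec, program),
    )


def choose_specialization(conn, empl_id, spec):
    """Link a student to a specialization offered by the student's own program."""
    conn.execute(
        "INSERT INTO student_specialization (person_id, spec_id) "
        "SELECT p.person_id, s.spec_id FROM person p "
        "JOIN specialization s ON s.program_id = p.program_id "
        "WHERE p.empl_id = ? AND s.name = ?",
        (empl_id, spec),
    )


def specializations_of(conn, empl_id):
    """Return the student's specializations in alphabetical order."""
    rows = conn.execute(
        "SELECT s.name FROM specialization s "
        "JOIN student_specialization ss ON ss.spec_id = s.spec_id "
        "JOIN person p ON p.person_id = ss.person_id "
        "WHERE p.empl_id = ? ORDER BY s.name",
        (empl_id,),
    )
    return [row[0] for row in rows]


def build_db():
    """Create the schema and load the program data of Record 3."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO program (name) VALUES ('Information Technology')")
    specs = ("Database Systems", "Project Management", "Software Engineering")
    for spec in specs:
        add_specialization(conn, "Information Technology", spec)
    conn.execute(
        "INSERT INTO person (full_name, empl_id, user_name, program_id, graduated) "
        "SELECT 'Jeff Martin', '987654', 'jmartin', program_id, 0 "
        "FROM program WHERE name = 'Information Technology'"
    )
    for spec in specs:
        choose_specialization(conn, "987654", spec)
    return conn


conn = build_db()
# Adding a new specialization needs two inserts and no schema change.
add_specialization(conn, "Information Technology", "Cybersecurity")
choose_specialization(conn, "987654", "Cybersecurity")
print(specializations_of(conn, "987654"))
```

Because specializations are rows rather than columns, a program can gain new ones at any time, and a student can hold any number of them through the link table.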

Your paper must include an Introduction, Problem Statement, Design, Implementation Methods, and a Conclusion with a discussion of the pros and cons of your design. The following are required:

1. Design showing the different Big Data Systems that you will use to solve this problem.

2. Pseudocode of a function that will read in each record, parse it, and transform it into database queries or input data for database ingestion

3. Database design showing a detailed schema

4. The student in Record 3 above is enrolled in the Information Technology program. How would you handle adding a new specialization to the Information Technology program and letting the student choose it as an additional specialization?

5. Discussion of the pros and cons of choosing ACID vs CAP systems for this problem.

6. Discussion of queries that database users would run.

7. Ideas for improving the speed of the query tool.
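
For requirement 2, the parsing step might look like the following Python sketch. It is only one possible approach and would normally be adapted to the chosen ingestion framework (for example, as the body of a map step). The sample records spell their keys inconsistently ("EmplD", "EmlID", "emplID", "instructor2", "course PMAN634", "ITEC 610"), so the sketch uses tolerant regular expressions instead of splitting on commas. The patterns, the output field names, and the `parse_record` function are all assumptions made for illustration, and the patterns are tuned only to the three sample records.

```python
import re

FLAGS = re.IGNORECASE

# "course: DBST651 grade: F", "Course: ITEC630 Grade B", "course PMAN634 grade: C"
COURSE_RE = re.compile(
    r"course:?\s*([A-Z]{4})\s*(\d{3})\s+grade:?\s*([A-F])\b", FLAGS)
# "instructor 1: James Smith (DBST651)", "Instructor4: TA Yelena Bytenskaya (DBST651)"
INSTRUCTOR_RE = re.compile(
    r"instructor\s*\d+\s*:\s*((?-i:TA)\s+)?([^,()]+?)\s*\(([A-Z]{4})\s*(\d{3})\)",
    FLAGS)
SPEC_RE = re.compile(r"specialization:\s*([^,]+)", FLAGS)

# Single-valued fields; each pattern captures the value in group 1.
SCALAR_PATTERNS = {
    "program": r"\bprogram:\s*([^,]+)",
    "student_name": r"student name:\s*([^,]+)",
    "empl_id": r"\bem\w*d\s*:\s*(\d{6})",   # EmplD, EmlID, emplID
    "user_name": r"user\s*name\s*:\s*(\w+)",
    "instructor_id": r"instructor\s+id\s*:\s*(\d+)",
    "graduated": r"graduated:\s*(yes|no)",
}


def parse_record(line):
    """Turn one raw text record into a dictionary ready for ingestion."""
    record = {}
    for key, pattern in SCALAR_PATTERNS.items():
        match = re.search(pattern, line, FLAGS)
        record[key] = match.group(1).strip() if match else None
    record["graduated"] = (record["graduated"] or "").lower() == "yes"
    record["specializations"] = [
        m.group(1).strip() for m in SPEC_RE.finditer(line)]

    # Number repeated courses so that every attempt is preserved.
    seen = {}
    attempts = []
    for m in COURSE_RE.finditer(line):
        course = (m.group(1) + m.group(2)).upper()
        seen[course] = seen.get(course, 0) + 1
        attempts.append({"course": course,
                         "attempt_no": seen[course],
                         "grade": m.group(3).upper()})
    record["attempts"] = attempts

    record["instructors"] = [
        {"name": m.group(2).strip(),
         "course": (m.group(3) + m.group(4)).upper(),
         "is_ta": m.group(1) is not None}
        for m in INSTRUCTOR_RE.finditer(line)]
    return record


# Record 3 from the sample data, copied verbatim.
RECORD_3 = (
    "Program: Information Technology, Specialization: Database Systems, "
    "Specialization: Project Management, Specialization: Software Engineering, "
    "course: DBST651 grade: F, course: DBST651 grade: B, "
    "course: ITEC610 grade: B, course: ITEC620 grade: A , "
    "course PMAN634 grade: C, instructor 1: Brandon Morris (ITEC 610), "
    "instructor 2: Elena Gortcheva (ITEC620), Instructor 3: James Green (DBST651), "
    "Instructor4: TA Yelena Bytenskaya (DBST651), student name: Jeff Martin, "
    "emplID: 987654, user name: jmartin, graduated: No"
)

parsed = parse_record(RECORD_3)
print(parsed["student_name"], parsed["empl_id"], parsed["specializations"])
for attempt in parsed["attempts"]:
    print(attempt)
```

Each dictionary can then be flattened into rows or key-value pairs for the target store. Records that fail to yield a program, student name, or ID would be better routed to a review queue than silently dropped.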


Reviews

len2619096

9/2/2020 4:22:17 AM

You do not have to run it in any software. Just create Hadoop pseudocode for it. Please attach the pseudocode with the solutions.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd