Reference no: EM133091147
ITECH1103 Big Data and Analytics
Review Questions
1. Fill in the following table as you discuss what data is collected by the different organisation types, and how this data is used regarding individuals and groups.
Organisation Type
|
Data collected for individual
|
What information can be gained from this data about individuals?
|
What information can be gained from this data about groups?
|
Government Departments
|
|
|
|
Banking / Finance
|
|
|
|
Retail
|
|
|
|
Education Sector
|
|
|
|
Search Engines
|
|
|
|
Social networking sites
|
|
|
|
2. What is data redundancy, and which characteristics of the file system can lead to it?
3. What is data independence, and why is it lacking in file systems.
4. What is a DBMS and what are its functions?
5. What is structural independence, and why is it important?
6. Explain the difference between data and information.
7. What is metadata in the context of a database system?
Portfolio Questions
1. Given the file structure shown in the Figure 1, answer the followingquestions:
a) How many records does the file contain? How many fields are there perrecord?
b) What problem would you encounter if you wanted to produce a listing by city? How would you solve this problem by altering the file structure?
c) If you wanted to produce a listing of the file contents by last name, area code, city, state, or zip code, how would you alter the file structure?
d) What data redundancies do you detect? How could those redundancies lead to anomalies?
2. Given the file structure shown in the Figure 2, answer the following questions.
a) Identify and discuss the serious data redundancy problems exhibited by the file structure shown in Figure 2
b) Looking at the EMP_NAME and EMP_PHONE contents in Figure 2, what changes would you recommend?
c) Identify the different data sources in the file you examined in Problem 2a).
d) Given your answer to Problem 2c), what new files should you create to help eliminate the data redundancies found in the file shown in Figure 2?