Draw the pie chart or line chart of data anomalies

Assignment Help Database Management System
Reference no: EM132145197

Q1. Given the following file for assignment worker.com, identify data anomalies that must be removed before data can be loaded in data warehouse.

Worker_assignment ← -----------------on course web site

File is available in SAKAI

Assignment_worker(assignment_no, assignment_date, emp_number, chg_hour,assigned_hour, charges)

Where assignment number is the number assigned to an assignment
Assignment_date is the date assignment started
Emp_number is the number of employee assigned to that assignment
Chg_hour is amount paid to that employee for that assignment
Assigned_hour is the hours assigned to that employee for that assignment
Charges are the Total charges for that employee for that assignment (this is calculated as Chg_hour*assigned_hour)

Rules:
- Assignment numbers always start with a letter followed by a 1 and are ALWAYS four characters long
ex: A123, Z178

- Emp No IS always 3 CHARCATER LONG

- An employee can not work more than 40 hour on a given project

Requirement:
Count (using EXCEL formulas -- IF, countif etc. as done in class) four types of errors:

- Missing data
- Incorrect Format
o To check length of empno--you can use LEN(cell address) to get length of item in that cell
o check for assignment number format (BONUS +1 points)
- Zero values
- Incorrect Calculations
o check for charges
charges= chg_hour*Assigned_Hour
o check for employee working more than 40 hours

Once counted

- Draw the pie chart or line chart of data anomalies and
- Discuss what errors can be corrected and how. (submit in WORD)

Must submit the EXCEL worksheet where errors are calculated and graph is drawn

Q2

Data integrity is a required feature of data warehouses. P & G is building a data warehouse and have run in data integration problems. They need to get data from 2 different users and combine them to maintain data integrity in their data warehouse.

The sources are:

Asia region
North American Region

Both region have data stored in different formats in two different files (employee_asia and emp_NA

Both tables are available in account Aggarwal as READ ONLY. You must create a copy in your account before using it.

Or

you can create your own tables.

SHOW ALL QUERIES AND OUTPUTS

1. CLEAN the data in required format (for gender, country of origin, job_class and seniority)
a. Employee gender should be standardized, i.e., male should be changed to m and female to f
b. Country should be spelled completely, i.e, USA should be spelled out as United States of America
c. Ceylon no longer exists, change the name to Sri Lanka
d. Name is one attribute in dimension table, combine name as last and first, example Bora (last) and Lakshmi (first) should be modified to Bora, Lakshmi
e. Calculate both job_class and seniority

2. Create CLEAN_ASIA table

3. Create CLEAN_NA table

4. Combine the two using UNION to create following table

EMPLOYEE_DIM (Employee Id, Employee_name, seniority, gender, country, job_class)

5. Show the contents and structure of EMPLOYEE_DIM table.
6. Give a count of male and female employees

Q3 Revise the data warehouse based on new requirements (same as what we did in class)

Attachment:- Assignment.rar

Attachment:- archive 2.zip

Verified Expert

The solution file is prepared in ms word created tables in oracle and excel file. In oracle created Assignment_worker table with following attributes assignment_no, assignment_date, emp_number, chg_hour,assigned_hour, charges and in excel found the count of Missing data, Incorrect Format, check length of empno, check for assignment number format and represented graph for number of Zero values, Incorrect Calculations,check for charges, charges and check for employee working more than 40 hours. And also discussed about errors can be corrected and how. We created two more tables in oracle are employee_Asia and employee_na and inserted data values then executed required quires and join two tables uisgn union query. Finally created new data model for two way aggregation on employee _dim table.

Reference no: EM132145197

Questions Cloud

What will be some of the challenges for you : In your role as a business psychology student, you are likely to work and study with people from many backgrounds. In 500 words (two typed pages).
What is the strategy of break talk : What is the strategy of Break Talk? Are they succeeding or failing? Why?
What are some of the risks associated with a low cost : What are some of the risks associated with a low cost leadership strategy? Provide one original example of a company that you believe employs this strategy
Create a one page goal setting plan : Goal Setting is an excellent behavioral strategy for exercise promotion and adherence. Create a one page Goal Setting Plan for yourself using information found.
Draw the pie chart or line chart of data anomalies : INSS 422 - Business Intelligence - identify data anomalies that must be removed before data can be loaded in data warehouse
What challenges will there be in researching this market : A market domain is defined as something (a product or service) that people need, but that is NOT currently available to them.
What are the characteristics of bakery industry : How competitive is the landscape, etc? Any mega trends affecting the bakery industry which will affect how the BreadTalk competes?
What are the ethical principles and implications : You have been called in to consult on cases that may require mandated treatment. After reviewing the PSY699 The ethics of mandated treatment scenarios.
Questions of the server acting as the model : Sonya, the manager of a restaurant, has just hired three servers. As part of their training program, she wants them to first learn by watching another

Reviews

inf2145197

11/23/2018 1:18:21 AM

Hi, first of all, thank you for processing my request but I need my order as soon as possible because I need to submit it so kindly complete the process and send it to me as soon as possible. Thank you. I was very happy with this assignment because it was done according to the standards set in the questions. Even if the topic was a bit difficult, it was completed with great precision. Thank you.

len2145197

10/19/2018 11:28:45 PM

INSS 422 Mid Term Exam Fall Business Intelligence You will submit two files • Word file with q1 and q2 • EXCEL file for q1 (show all work) MUST BE DONE INDIVIDUALLY READ THE INSTRUCTIONS CAREFULLY!!!!!!!! NO LATE SUBMISSION!!!!!!! DUE DATE: Oct 21st by 10pm in SAKAI Note: Please DO NOT write any comments with your submission. • This is an EXAM, I can NOT answer any questions. • Email me for any GENERAL clarifications • Exam is to be done individually • Honesty and integrity is expected, any dishonesty will result in 0 for the EXAM and further action will be taken for all involved in dishonesty. • All work should be done on the exam itself except EXCEL file. ORACLE queries and output should be embedded in the exam. Make sure to include all Oracle statements that you use (including how many rows selected etc). I will be going and checking your queries in ORACLE in your account

Write a Review

Database Management System Questions & Answers

  What types of relationships are possible in a database

Define operational excellence. How can information systems help achieve it? What types of relationships are possible in a relational database? Describe and give an example of each.

  Need a system that networks its 3 campuses in the us and

need a system that networks its 3 campuses in the us and one campus in singapore. transaction data for all campuses

  Discuss the future of data management for large firms

Assignment: Case: Databse Structures, Languages and Architechures. Discuss the future of data management for large firms and the implications for IT management.

  Find candidate keys for the relation

Find candidate keys for the relation

  Implement a cluster using physical hardware

Implement a cluster using physical hardware, the process is very straightforward.But what if you want to implement a cluster using virtualization?

  Identify the different types of entities and attributes

In a group, identify the different types of entities, attributes, keys, and relationships Martex will want to consider when designing its relational database.

  List last name and first name of owner located in bowton

List the last name and first name of every owner located in Bowton.

  Conceptual data modeling is typically done in parallel

Conceptual data modeling is typically done in parallel with other requirements analysis and structuring steps during systems planning and selection.

  What difficulties did you encountered and how did you

write a 200- to 300-word short-answer response for the followingwhat difficulties did you encountered and how did you

  Dml triggers can be defined for a combination of dml

Normal 0 false false false EN-US X-NONE X-NONE DML triggers can be defined fo..

  Create the tables and relationships from the database design

Create the tables and relationships from the database design discussed in Lab 2. Create a query with all fields from the student table, where the student's.

  System analysis and design project

Need an expert in the fields of system design to handle this project This is a system analysis and design project, not a research project

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd