DBW624 Introduction to Datawarehousing Assignment

Assignment Help Database Management System
Reference no: EM133182375

DBW624 Introduction to Datawarehousing - Seneca College

ETL Application

Now we are going to focus on the reference tables and get a feel for what is involved with moving and cleansing data.

For each of the sources below, you need to create a script which will take the data from the government site and load it into your reference tables within our data warehouse.

Once in the warehouse, you need to clean the data, where necessary.

You can use the approach of creating a staging table which is the target of the IMPORT from the .csv files below - then - clean the data and move over to the final reference tables.

Here are the three reference tables we will use.
1. Names (male and female) in Ontario
b. There are two separate files to down load. One for males and one for females
c. Just search on "male baby names" and "female baby names" to find the source of data
d. File name is ontariopbabynames_male_1917-2010_english.csv
e. File name is ontariopbabynames_female_1917-2010_english.csv
f. You can merge the two into a single reference table - or - keep them separate.
2. Population Table
b. Look for file: demo05a-eng.csv
c. It is under the summary tables area
3. Average expected life span (male and female)

All of this assignment can be done with SQL, however, you are free to use any programming language you like.

DBW624 Business Analytic Application - Seneca College

Now that we have a complete data warehouse, loaded with data, we are going to build our Business Intelligence (BI) application which will actually answer the business questions below, which were highlighted in Assignment 1.

Here are the business analytics we will want to perform.
1. Sales Volumes Analysis by fiscal quarter
a. By Store
b. By Product
c. By Product Group (Age Group)
2. Sales Revenue Analysis by fiscal quarter
a. By Store,
b. By Product
c. By Product Group (Age Group)
3. Sales Profit Analysis by fiscal quarter
a. By Store,
b. By Product
c. By Product Group (Age Group)
4. Product Line Analysis by fiscal quarter, measured by revenue and profit
a. Which products have been the most / least successful
b. Which product groups have been the most / least successful
c. What is the product trends (growth or declining)?
5. Store Analysis by fiscal quarter, measured by revenue and profit
a. Which stores are the most / least successful
b. What is the growth trends for each store (growth or declining)?
6. Additional Analysis
a. Which names have been most successful by volume
b. Which gender has been most successful by volume
c. Who was the top sales person for the quarter?
d. What percentage of sales are cash versus credit card?
e. What percentage of sales were using a marketing campaign?
7. Analytics Against Reference Tables
a. Which ten cities should we open stores in, based on population?
b. Which names should we expect will be the most popular for our personalized products?

For each of the business questions above, write a SQL query which provides the business answer.

What you need to hand in is a copy of your BI application showing both the SQL used to run the required analytic query and the result set.

Attachment:- ETL Application.rar

Reference no: EM133182375

Questions Cloud

Evidence-based consumer-facing infographic : Use your research from Part 1 to create a single-page consumer-facing infographic. Be sure to include the following criteria:
Product diversity and sales of foreign products : How high is Kenyas nation's level of foreign product diversity and sales of foreign products within its market?
Discuss how mom and pop shops : Discuss how 'mom and pop' shops/small retailers can succeed when competing for the same customer with powerhouses like Wal-Mart or Amazon
How will the march transaction be shown : Cleveland paid back the $50,000 plus $1,250 in interest. How will the March transaction be shown in the 2021 statement of cash flows
DBW624 Introduction to Datawarehousing Assignment : DBW624 Introduction to Datawarehousing Assignment Help and Solution, Seneca College - Assessment Writing Service
The case of the disgruntled employee : For Jeff Greenwood, the day started out like any other workday. It would turn out to be anything but a normal day. In a few short hours, Jeff would look back
Discuss how they identify their customers : Discuss how they identify their customers. You can select any industry, and your discussion must address the issues of high/low volume customers
What is the inflation premium : You are considering an investment in 30-year bonds issued by Corporation A. The bonds have no special covenants. What is the inflation premium
How does this change the price of the tundra : The base platform for the Toyota Tundra truck line is ¥1,650,000. How does this change the price of the Tundra to Toyota's British subsidiary in British pounds

Reviews

Write a Review

Database Management System Questions & Answers

  Return a result table sorted by patient name

Return a result table that could be used to produce a report, sorted by patient name or date, for a particular week or after a particular date, or a listing of patient visits for patients assigned to a specific social worker

  List and explain the family members of nosql database

List and explain the family members of NoSQL database? Why is NoSQL an ideal database type for Big Data Analytics? Provide examples.

  Write summary on relational database theory

Write a two-page executive summary for your boss explaining how a relational data solution can be applied to a current business problem or area for improvement. Assume that your boss knows nothing about relational database theory.

  Implement your designed City Jail database using Oracle

HS2021 Database Design and Use Assignment - Implement your designed City Jail database using Oracle SQL developer 12c

  Create an er diagram from the scenario

ITDA1001 - Database Fundamentals - Write a T-SQL comment above each SQL statement you write, stating your name, the task you are solving

  Discuss about the database security

Discuss about the database security.We will focus three topics for this conference:Inference in ordinary databases or statistical databases,database privacy (through encryption),andcloud security.

  Create the tables and relationships from the database design

Create the tables and relationships from the database design, Create a query with all fields from the student table, where the student's last name is "Smith".

  Review the information on relational database models

Review the information on relational database models in the course text, Coronel, C. & Morris, S. (2015). Database systems: Design, implementation, and management (11th ed.). Stamford, CT: Cengage Learning.

  List full details of all hotels

List full details of all hotels

  Describe the role of business process reengineering bpr in

question 1 what is the role of business process reengineering bpr in enhancing competitiveness? give examples to

  Explain efficient algorithm for edge-labeled graph

Explain efficient algorithm that, provided edge-labeled graph G with distinguished vertex v0 and sequence s = σ1, σ2, ..., σk of characters from Σ, returns path in G which begins at v0.

  Calculates the percentage of books returned

Calculates the percentage of books returned based on the number of books returned and the quantity sold. Copy the formula from cell F7 to the range F8:F22.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd