Data Warehousing by Example - : Elephants, Olympic Judo and Data Warehouses

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

1. Why ?

The purpose of this document is to present our Best Practice approach to Data Warehouse design
based on more than 15 years experience.
We are publishing it on Kindle, as cheaply as possible, in order to encourage constructive criticism so
that we can improve the book.
We would be very glad to have your comments at barryw@databaseanswers.org.
After 5 years ago, a teacher emailed me to say that his students found my Data Models boring and
were falling asleep in class !!!
So I began to wonder how I could make them more interesting and still easy to relate to.
My conclusion was that I could take everyday events to use as examples.
That is why I have used Football, Malaysia and the Olympics.
Of course, a holiday in Malaysia is not something we do every day, and Olympics is not an everyday
event ;-0) Data Warehousing by Example | 3 Elephants, Olympic Judo and Data
Warehouses
2. The Approach
In this Section we will discuss our Approach to the design of an Enterprise Data Model with
associated Data Warehouses and how it applies to a Day at the Olympics and a Holiday in Malaysia.
2.1 Data Architecture
This Architecture supports Data Migration into an Enterprise Data Warehouse to meet BI
requirements.
It shows the major Layers in an End-to-End Architecture for Data Migration from Data Sources, into a
Data Warehouse and finally to a BI Layer to deliver data to the end-user.
BI Layer KPIs, Dashboards, etc.
Dimensional Models (Stars and Snowflakes)
Third-Normal Form Data Model (Single View of the Truth)
Data Integration Layer (Data Quality, ETL, etc)
Data Sources (Salesforce, SAP, Mobile, etc.) Data Warehousing by
Example | 4 Elephants, Olympic Judo and Data Warehouses
2.2 Some Definitions
A Data Warehouse can be either a Third-Normal Form (3NF) Data Model or a Dimensional Data
Model, or a combination of both.
One benefit of a 3NF Data Model is that it facilitates production of A Single Version of the Truth.
Multiple Data Marts will usually share common Dimensions, such as Dates, which we will call
Conformed Dimensions.
2.3 Steps
When all the Steps have been completed, Best Practice suggests that each item produced should be
reviewed and extended or modified as appropriate.
This includes the EDM, CDM, Subject Area Models, Canonical Data Models and Canonical Entities.
In this way, the library of existing work is confirmed in its accuracy and relevance.
Step 1 : Draft the Enterprise Data Model (EDM)
Start by reviewing the list of candidates on the Database Answers Web Site :-
http://www.databaseanswers.org/data_models/enterprise_data_models.htm

Step 2 : Review the Subject Areas


Start by reviewing the list of candidates :-
http://www.databaseanswers.org/data_models/subject_area_data_models.htm

Step 3 : Use the Canonical Data Model (CDM)


Here is the Link for the CDM :-
http://www.databaseanswers.org/data_models/canonical_data_models/index.htm

This is what the CDM looks like :-


As you can see, it is centred around an Events Entity.

You might also like