0% found this document useful (0 votes)

34 views3 pages

DataWarehousing - Testing Made Easy

The document discusses data warehousing concepts including dimensional modeling, fact and dimension tables, and schemas like star, snowflake, and hybrid. It defines dimensional modeling as subject-oriented, integrated, time-variant, and non-volatile. Fact tables contain numeric facts and dimension tables contain descriptive attributes used to analyze facts. Dimension tables are typically de-normalized while fact tables are highly normalized.

Uploaded by

RajeshCuddapah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views3 pages

DataWarehousing - Testing Made Easy

Uploaded by

RajeshCuddapah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

According to D.W.

Inmon :
DWH

Subject Oriented
Integrated
Time Varient
Non Volatile

D.W Implementation
Approach

Top Down Approach : D.W.Inmon

Bottom up Approach : Ralph Kimball

SQL Set Operators

Desgined Subject oriented to design

Analysis
Business info collected from various
sources
Allows to anlysis the data with time Eg:
Month, YOY
Once the data entered in DW cannot
change
First Develop EDW then Devlop
Datamarts
First Develop Datamarts then Develp
EDW

Syntax

UNION[ALL]
,MINUS,INTERSECT
String Functions
CONCAT

CONCAT(String1,String2 [..])

RTRIM

RTRIM('String')

LTRIM

LTRIN('String')

TRIM

TRIM('String')
SUBSTRING('String', Start integer, Length of
String)

SUBSTRING
Analytic function
COALESCE Similar to Case
IsNULL (allows only two
arguments)

Returns the first non-null expression in

the list
Replaces NULL with the specified
replacement value.

ROW_NUMBER

COALESCE ( expression [ ,...n ] )

ISNULL ( check_expression,
replacement_value )
ROW_NUMBER() OVER(ORDER BY Column
nam)

RANK

RANK() OVER (ORDER BY Col )

DENSE_RANK
ROW_NUMBER with
PARTITION

DENSE_RANK() OVER (ORDER BY Col )

ROW_NUMBER() OVER(PARTITION BY Col
ORDER BY Col ASC)

Ranking within your ordered partition

No ranks are skipped if there are ranks
with
multiple items
This is kinda like using a Row_number
with Group by

eg:CONVERT(VARCHAR(19),GETDATE())

General function that

converts an expression of one data
type to another

CONVERT()

Returns the sequential number

Null Functions
ISNULL(), NVL(), IFNULL() and
COALESCE()
Joins
Inner Join

Returns all rows when there is at least

one match in BOTH tables

Dimension Table

Return all rows from the left table, and

the matched rows from the right table
Return all rows from the right table,
and the matched rows from the left
table
Return all rows when there is a match
in ONE of the tables
If a table contains primary keys and it
gives the detailed info about business
then such a table
- Entry Points to the fact tables
- Typically in De-Normalized form
- Generally Static and descriptive fields
- Typically used by Group by in SQL
- Typically either Primarkey and
Dimensional attribu

Fact Table

A fact table which contains foreign keys

to dimension tables and numeric facts
- The term FACT represents a single
business measure. E.g. Sales, Qty Sold
- Facts can be detailed level facts or
summarized facts
- Typically the MOST NORMALIZED
TABLE in a dimensional model
- contain HUGE DATA VOLUMES
running into millions of rows

LEFT JOIN

RIGHT JOIN
FULL JOIN

Types of Dimension tables

Degnerated Dimension

Junk Dimeonsions

Slowley Changing Dimension

Conformed Dimension
Fast Chaning Dimensions

data that is dimensional in nature but

stored in a fact table
contain miscellaneous data like flags,
gender, text values etc which is not
useful for reporting
If the data values are changed slowly in
a column or in a row over the period of
time then that dimension table
If Dimension table shared with multiple
fact tables
Changes very fast Eg: Acc Bal,Income
etc

Types of Facts
Additive
Semi Additive
Non Additive
Transformations

Measures that can be added across all

dimensions
Measures that can be added across few
dimensions and not with others
Measures that cannot be added across
all dimensions
It is the process of transforming the
data into a required business format

Data Aggregation

process of integrating the data from

multiple input
Process of removing unwanted/error
out/inaccurate data
multiple detailed values are
summarized into a single unit

Data Purging

Earsing of the data completely

Data Profiling

examining data available from an

existing information source,collecting
stastics and summerise the data with
Aggrate functions (Min,MAX,AVG)

Data Merging
Data Cleansing/Scrubbing

Schemas

Star Schema

Snowflake Schema
Hybrid Schema/Galaxy
Schema/
Fact constellation

It has single fact table connected to

dimension tables like a star.
- The star schema is highly
denormalized
- Simple structure -> easy to
understand schema
- Relatively long time of loading data
into dimension tables (reduendent of
data)
- Performance less compare to Snow
flake
It is an extension of the star schema.In
snowflake schema, very large
dimension tables are normalized into
multiple tables. It is used when a
dimensional table becomes very big
- Highly normalized
- Complex compare to star schema
- Less time to load (due to normalized
data)
- Very good in performance
Combination of Star and Snowflake
schema

ITIL Process Assessment Framework - MacDonald
75% (4)
ITIL Process Assessment Framework - MacDonald
42 pages
Data Access For Highly Scalable Solutions
No ratings yet
Data Access For Highly Scalable Solutions
273 pages
ODI Class Notes
50% (2)
ODI Class Notes
149 pages
Data Management For Analytics Notes
No ratings yet
Data Management For Analytics Notes
21 pages
1.4.introduction To Agility and Agile Process
100% (2)
1.4.introduction To Agility and Agile Process
23 pages
Unit 3 OLAP and OLTP
No ratings yet
Unit 3 OLAP and OLTP
64 pages
SQL
No ratings yet
SQL
58 pages
3 - Data Warehousing and Business Intelligence
No ratings yet
3 - Data Warehousing and Business Intelligence
58 pages
Fiori Apps List
100% (2)
Fiori Apps List
10 pages
Databases and Data Management Systems
No ratings yet
Databases and Data Management Systems
64 pages
What Is A Data Warehouse
No ratings yet
What Is A Data Warehouse
11 pages
Chapter V
No ratings yet
Chapter V
38 pages
Chapter 4: Intermediate SQL: Database System Concepts, 6 Ed
No ratings yet
Chapter 4: Intermediate SQL: Database System Concepts, 6 Ed
52 pages
DW CrashCoursePPT
No ratings yet
DW CrashCoursePPT
24 pages
Data Warehousing: People Making Technology Wor K™
100% (1)
Data Warehousing: People Making Technology Wor K™
44 pages
Database Management Systems
No ratings yet
Database Management Systems
44 pages
Power BI DAX Training manual
No ratings yet
Power BI DAX Training manual
12 pages
dw4 - Dimension1
No ratings yet
dw4 - Dimension1
75 pages
Dimensional Modeling Tutorial
No ratings yet
Dimensional Modeling Tutorial
9 pages
Datawarehouse PPT
No ratings yet
Datawarehouse PPT
39 pages
Olap Ssas
No ratings yet
Olap Ssas
69 pages
Data Warehousin G Concepts
No ratings yet
Data Warehousin G Concepts
41 pages
Dimensional Modeling
100% (1)
Dimensional Modeling
19 pages
Methods of Incremental Loading in Data Warehouse
25% (4)
Methods of Incremental Loading in Data Warehouse
5 pages
OBIEE - Quick Guide
No ratings yet
OBIEE - Quick Guide
78 pages
Very Short Notes
No ratings yet
Very Short Notes
13 pages
Interview Questions and Answar
No ratings yet
Interview Questions and Answar
22 pages
BI- Chap 3 - Data Warehouses Design
No ratings yet
BI- Chap 3 - Data Warehouses Design
54 pages
What Are Schemas
No ratings yet
What Are Schemas
25 pages
Data Warehouse Ques
No ratings yet
Data Warehouse Ques
10 pages
Chapter 1 The Database Environment and Development Process
No ratings yet
Chapter 1 The Database Environment and Development Process
22 pages
Data Warehouse Implementation
No ratings yet
Data Warehouse Implementation
37 pages
CSIS 3300 W3 Denormalization StarSchema
No ratings yet
CSIS 3300 W3 Denormalization StarSchema
27 pages
Data Warehouse
No ratings yet
Data Warehouse
14 pages
DWM UNIT-II NOTES
No ratings yet
DWM UNIT-II NOTES
27 pages
DWH Interview Questions.
No ratings yet
DWH Interview Questions.
7 pages
Framework Manager-0124 IBM Cognos
No ratings yet
Framework Manager-0124 IBM Cognos
61 pages
MVA Implementing A Data Warehouse With SQL Jump Start Mod 1 Final
No ratings yet
MVA Implementing A Data Warehouse With SQL Jump Start Mod 1 Final
37 pages
Assignment#8 SQL
No ratings yet
Assignment#8 SQL
7 pages
1) Union and Union All
No ratings yet
1) Union and Union All
11 pages
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
No ratings yet
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
28 pages
Lecture 1 Notes: Dimension Tables
No ratings yet
Lecture 1 Notes: Dimension Tables
2 pages
GCP
No ratings yet
GCP
15 pages
7FE Project Framework
0% (1)
7FE Project Framework
5 pages
Unit 5 DW
No ratings yet
Unit 5 DW
12 pages
Business Intelligence Interview Questions and Answer
No ratings yet
Business Intelligence Interview Questions and Answer
12 pages
Infor Basics
No ratings yet
Infor Basics
15 pages
DW_unit 2
No ratings yet
DW_unit 2
11 pages
Advance SQL
No ratings yet
Advance SQL
12 pages
What Is Data Warehouse?: Explanatory Note
No ratings yet
What Is Data Warehouse?: Explanatory Note
11 pages
SQL Interview Questions 1725044566
No ratings yet
SQL Interview Questions 1725044566
4 pages
Data Warehouse Lec-3
No ratings yet
Data Warehouse Lec-3
38 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
7 pages
DDD Assignment Mark Scheme Autumn 2018
No ratings yet
DDD Assignment Mark Scheme Autumn 2018
13 pages
B3.2 - Cricket Data Analysis - Digital Business
No ratings yet
B3.2 - Cricket Data Analysis - Digital Business
18 pages
SQL Access Practice Work
No ratings yet
SQL Access Practice Work
3 pages
Data Warehouse and Data Modelling
No ratings yet
Data Warehouse and Data Modelling
11 pages
The Database Environment: Modern Database Management 12 Edition
No ratings yet
The Database Environment: Modern Database Management 12 Edition
30 pages
SQL Joins in Report
No ratings yet
SQL Joins in Report
6 pages
DWH Unit 2
No ratings yet
DWH Unit 2
13 pages
BI Assignment 1
No ratings yet
BI Assignment 1
6 pages
Notes
No ratings yet
Notes
14 pages
Data Warehouse Schema
No ratings yet
Data Warehouse Schema
10 pages
Accenture Loaded Labor Rates PDF
No ratings yet
Accenture Loaded Labor Rates PDF
11 pages
Auto Reader Proposal To Tony
No ratings yet
Auto Reader Proposal To Tony
14 pages
SRE Foundation V1 - 0 - Value Added Resources 11 - 2019
No ratings yet
SRE Foundation V1 - 0 - Value Added Resources 11 - 2019
8 pages
1734277634220
No ratings yet
1734277634220
4 pages
SQL Server Interview Questions : - BI Intelligence
No ratings yet
SQL Server Interview Questions : - BI Intelligence
8 pages
Data Warehouse Concepts
No ratings yet
Data Warehouse Concepts
11 pages
Datawarehousing Top50 Interview Questions
No ratings yet
Datawarehousing Top50 Interview Questions
10 pages
DATA WAREHOUSING EXAM GUIDE
No ratings yet
DATA WAREHOUSING EXAM GUIDE
4 pages
IDS NEXT Corporate Profile
No ratings yet
IDS NEXT Corporate Profile
18 pages
Ssas Real Time Interview Questions and Answers
No ratings yet
Ssas Real Time Interview Questions and Answers
7 pages
DW
No ratings yet
DW
29 pages
Database Approach
No ratings yet
Database Approach
12 pages
International Conference On Innovative Computing and Communication (ICICC 2020)
No ratings yet
International Conference On Innovative Computing and Communication (ICICC 2020)
11 pages
DWH Int Questions
100% (1)
DWH Int Questions
9 pages
Factless Fact Table
No ratings yet
Factless Fact Table
5 pages
DevOps Phases Across Software Development Lifecycl
No ratings yet
DevOps Phases Across Software Development Lifecycl
9 pages
Data Warehousing Interview Questions and Answers
No ratings yet
Data Warehousing Interview Questions and Answers
5 pages
Ma A Project Charter
No ratings yet
Ma A Project Charter
5 pages
Super Informatica Basics PDF
No ratings yet
Super Informatica Basics PDF
49 pages
SAP Delete CO Transaction Data
No ratings yet
SAP Delete CO Transaction Data
5 pages
ECC 6.0 Solutions: Data Modeling & ABAP Dictionary Exercises SAP Development ABAP Training Chapter 2 Questions
No ratings yet
ECC 6.0 Solutions: Data Modeling & ABAP Dictionary Exercises SAP Development ABAP Training Chapter 2 Questions
2 pages
Performance - Load Test Methodology
100% (1)
Performance - Load Test Methodology
5 pages
WIN (2019-20) SWE2006 ELA AP2019205000374 Reference Material I 19-Dec-2019 WIN (2019-20) SWE2006 ELA AP2019205000612 Reference Material I 17-Dec-2019 SQL1
100% (1)
WIN (2019-20) SWE2006 ELA AP2019205000374 Reference Material I 19-Dec-2019 WIN (2019-20) SWE2006 ELA AP2019205000612 Reference Material I 17-Dec-2019 SQL1
2 pages
Event Management
No ratings yet
Event Management
3 pages
Logical Physical Conceptual Datamodel
No ratings yet
Logical Physical Conceptual Datamodel
2 pages
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
Excel Techniques
From Everand
Excel Techniques
Online Trainees
2/5 (1)