ODI Scheduler Source Control Performance
ODI Scheduler Source Control Performance
ODI Scheduler Source Control Performance
ODI Performance
ODI Scheduling
ODI Deployment/Release
ULI BETHKE
Dublin based
Blog www.bi-q.ie
ODI 2007
Reviewer two ODI books
ODI articles OTN
Deputy chair OUG BI SIG. Next event 11th June
ODI advanced trainer
ODI PERFORMANCE
Target
- Least amount of roundtrips. Network (JDBC, XML)
- One target database server only (DW)
Another Server
- ODBC drivers
- JEE agent on Weblogic
- No support for target OS
- Resources on target
- DBA
INTERFACES
JDBC drivers
JVM
Type 4 or 5 JDBC drivers (Data Direct)
Array fetch size.
DB packet size.
Network packet size.
PERFORMANCE MONITORING
dbms_sqltune_util0.sqltext_to_sqlid
Link to Data Dictionary Tables
MACIEJ KOCON
Dublin based
ODI 2005 (Sunopsis)
Reviewer two ODI books
Blog www.bi-q.ie
maciek@bi-q.ie
ORCHESTRATING DWH
PROCESSES
Orchestration of Data Process Flow
Standard DWH Process flow orchestration
Packages in Oracle Data Integrator 10g
Load Plans in Oracle Data Integrator 11g
Process Flow use cases - efficiency analysis
Alternative scheduling
benefits
TYPICAL DATA FLOW in DWH
step 1
STAGE E-LT
DATA EXTRACT
loads data from
sources
TYPICAL DATA FLOW in DWH
step 1 ste 2
p
STAGE DIMs E-LT
INT_A
PRC_B
INT_C
PKG_DE
INT_D
INT_E
ORCHESTRATION ODI PACKAGES
using object directly using scenarios compiled
code
PKG_ABC SYNCHRONOUS PKG_ABCDE
INT_A INT_A
PRC_B PRC_B
INT_C INT_C
PKG_DE
PKG_DE
INT_D
INT_E
ORCHESTRATION ODI PACKAGES
using object directly using scenarios compiled
code
PKG_ABC SYNCHRONOUS PKG_ABCDE
INT_A INT_A
PRC_B PRC_B
INT_C INT_C
PKG_DE
PKG_DE
INT_D
INT_E
ASYNCHRONOUS PKG_ABCDE
INT_A
PRC_B
INT_C
PKG_DE
ODI 10g vs. ODI 11
STAGE DIMs FACTS
PKG_DM
PKG_ABC PKG_DE PKG_FG
INT_A INT_C INT_F ODI 10g
PRC_B PRC_D PRC_G Packages
INT_C
A D F
B E G
C
ODI 10g vs. ODI 11
STAGE DIMs FACTS
PKG_DM
PKG_ABC PKG_DE PKG_FG
INT_A INT_C INT_F ODI 10g
PRC_B PRC_D PRC_G Packages
INT_C
ODI 11
Load plans
ODI 10g vs. ODI 11
STAGE DIMs FACTS
PKG_DM
PKG_ABC PKG_DE PKG_FG
INT_A INT_C INT_F ODI 10g
PRC_B PRC_D PRC_G Packages
INT_C
ODI 11
Load plans
A D F
SAME
B E G EFFECT!
C
PROCESS FLOW EFFICIENCY ANALYSIS
Standard Flow Orchestration:
Stage-(stop)DIMs-(stop)Facts
A
30 sequential
10 10
D F
B 10 A
30 10
parallel
E G D F
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
PROCESS FLOW EFFICIENCY ANALYSIS
Standard Flow Orchestration:
Stage-(stop)DIMs-(stop)Facts
A
30 sequential
10 10
D F
B 10 A
30 10
parallel
E G D F
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
DOWNSIDES:
POSSIBLE INEFFICIENCIES (IDLE RESOURCES)
PROCESS FLOW EFFICIENCY ANALYSIS
OPTIMIZATION ATTEMPT
30
A 10 10
D F
10
B 30 10
E G
10
C
PROCESS FLOW EFFICIENCY ANALYSIS
OPTIMIZATION ATTEMPT
30
sequential
A 10 10
D F
10 A D
B
parallel
30 10
E G F
10
C B
30 10 10
E G
10 30 + 10
30 10 + 10 = 50 C
10 10 + 30
OPTIMIZATION ATTEMPT
30
sequential
A 10 10
D F
10 A D
B
parallel
30 10
E G F
10
C B
30 10 10
E G
10 30 + 10
30 10 + 10 = 50 C
10 10 + 30
OPTIMIZATION ATTEMPT
A
30 sequential
D
10
F
10 70
B
10 A
30 10
parallel
E G 70 D F
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
DOWNSIDE:
INEFFICIENCY EXISTS BUT CANT BE RESOLVED
CONSUMER WAITING & IMPACT
TRADITIONAL SCHEDULING - LIMITATIONS
SCHEDULER
DEPENDENCY DRIVEN SCHEDULING
A
D
B
E
C
A A A
B B B
C C C
D D D
E E E
A A A
B B B
C C C
D D D
E E E
DEPENDENCY DRIVEN SCHEDULING
A
D
PACKGAGES
B &
E LOAD PLANS
C
A A A
B B B
C C C
D D D
E E E
A A A
B B B
C C C
D D D
E E E
PROCESS FLOW EFFICIENCY ANALYSIS
A 30 sequential
D
10
F
10 70
B 10
30
A
parallel
10
E G 70 D F
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
30
A 10 10
D F
10
B 30 10
E G
10
C
10 10 10
10
30 30 10
PROCESS FLOW EFFICIENCY ANALYSIS
A 30 sequential
D
10
F
10 70
B 10
30
A
parallel
10
E G 70 D F
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
30
A 10 10
D F 30
10
B 30 10
E G 70
10
C
10 10 10
10
30 30 10
Notifications
errors (+auto-restartability)
finish summary
logging
Multiple/overlapping E-LT streams
load with different frequencies
Parameterization
improved system stress control
process prioritization
F I R S T RUN
10
processes
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES
10 584
processes processes
1389
DEPENDENCIES
12h43m
TIME
LOAD PLANS
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES