ODI Knowledge Module Introduction
ODI Knowledge Module Introduction
ODI Knowledge Module Introduction
com 1
Contact Us: bispconsulting@gmail.com
History:
Version Description Change Author Publish Date
0.1 Initial Draft Gourav Atalkar 03-June-2011
Table of Contents
b). Retrieves sub models, datastores, columns, unique keys, foreign keys, conditions from
the metadata provider to SNP_REV_SUB_MODEL,SNP_REV_TABLE,SNP_REV_COL,
C). Updates the model in the work repository by calling the OdiReverseSetMetaData tool.
6) RKM INFORMIX
7) RKM INFORMIX SE
8) RKM MSSQL
9) RKM ORACLE
14)RKM TERADATA
a) To check the consistency of existing data. This can be done on any datastore or
within interfaces, by setting the STATIC_CONTROL option to "Yes". In the first case,
the data checked is the data currently in the datastore. In the second case, data in the
target datastore is checked after it is loaded.
b) To check consistency of the incoming data before loading the records to a target
datastore. This is done by using the FLOW_CONTROL option. In this case, the CKM
simulates the constraints of the target datastore on the resulting flow prior to
writing to the target
The CKM accepts a set of constraints and the name of the table to check. It creates an "E$"
error table which it writes all the rejected records to. The CKM can also remove the
erroneous records from the checked result set.
The following figures show how a CKM operates in both STATIC_CONTROL and
FLOW_CONTROL modes.
(STATIC_CONTROL)
In STATIC_CONTROL mode, the CKM reads the constraints of the table and checks them
against the data of the table. Records that don't match the constraints are written to the
"E$" error table in the staging area.
(FLOW_CONTROL)
In FLOW_CONTROL mode, the CKM reads the constraints of the target table of the Interface.
It checks these constraints against the data contained in the "I$" flow table of the staging
area. Records that violate these constraints are written to the "E$" table of the staging area.
a) Create the "E$" error table on the staging area. The error table should contain the
same columns as the datastore as well as additional columns to trace error
messages, check origin, check date etc.
b) Isolate the erroneous records in the "E$" table for each primary key, alternate key,
foreign key, condition, mandatory column that needs to be checked.
c) If required, remove erroneous records from the table that has been checked
b) CKM NETEZZA
c) CKM ORACLE
d) CKM SQL
e) CKM SYBASE IQ
f) CKM TERADATA
The LKM creates the "C$" temporary table in the staging area. This table will hold records
loaded from the source server
a) The LKM obtains a set of pre-transformed records from the source server by
executing the appropriate transformations on the source. Usually, this is done by a
single SQL SELECT query when the source server is an RDBMS. When the source
doesn't have SQL capacities (such as flat files or applications), the LKM simply reads
the source data with the appropriate method (read file or execute API).
b) The LKM loads the records into the "C$" table of the staging area.
An interface may require several LKMs when it uses datastores from different sources.
When all source datastores are on the same data server as the staging area, no LKM is
required.
Therefore, the IKM simply needs to execute the "Staging and Target" transformations, joins
and filters on the "C$" tables, and tables located on the same data server as the staging
area. The resulting set is usually processed by the IKM and written into the "I$" temporary
table before loading it to the target. These final transformed records can be written in
several ways depending on the IKM selected in your interface. They may be simply
appended to the target, or compared for incremental updates or for slowly changing
dimensions. There are 2 types of IKMs: those that assume that the staging area is on the
same server as the target datastore, and those that can be used when it is not. These are
illustrated below:
When the staging area is on the target server, the IKM usually follows these steps:
a) The IKM executes a single set-oriented SELECT statement to carry out staging area
and target declarative rules on all "C$" tables and local tables (such as D in the
figure). This generates a result set.
b) Simple "append" IKMs directly write this result set into the target table. More
complex IKMs create an "I$" table to store this result set.
c) If the data flow needs to be checked against target constraints, the IKM calls a CKM
to isolate erroneous records and cleanse the "I$" table.
d) The IKM writes records from the "I$" table to the target following the defined
strategy (incremental update, slowly changing dimension, etc.).
f) Optionally, the IKM can call the CKM again to check the consistency of the target
datastore.
These types of KMs do not manipulate data outside of the target server. Data processing is
set-oriented for maximum efficiency when performing jobs on large volumes.
When the staging area is different from the target server, as shown in Figure, the IKM
usually follows these steps:
a) The IKM executes a single set-oriented SELECT statement to carry out declarative
rules on all "C$" tables and tables located on the staging area (such as D in the
figure). This generates a result set.
b) The IKM loads this result set into the target datastore, following the defined strategy
(append or incremental update).
A CKM cannot be used to perform a data integrity audit on the data being processed.
Data needs to be extracted from the staging area before being loaded to the target, which
may lead to performance issues.
changes, views on this table and one or more triggers or log capture programs as illustrated
below.
the different operations to generate for each datastores web service. Unlike other KMs,
SKMs do no generate an executable code but rather the Web Services deployment archive
files. SKMs are designed to generate Java code using Oracle Data Integrator's framework for
Web Services. The code is then compiled and eventually deployed on the Application
Server's containers.
1) SKM HSQL
2) SKM IBM UDB
3) SKM Informix
4) SKM Oracle
5) SKM SQL