Db2 Analytics Accelerator Version MDUG May2019
Db2 Analytics Accelerator Version MDUG May2019
Db2 Analytics Accelerator Version MDUG May2019
1
Technical Foundation
Jim Knisley jknisley@us.ibm.com
May 15, 2019
IBM z Analytics
IBM z Analytics Db2 Analytics Accelerator
Agenda
2
IBM z Analytics Db2 Analytics Accelerator
IBM Z Analytics
Keep your data in place – a different approach to enterprise analytics
3
IBM z Analytics Db2 Analytics Accelerator
SPEED
• Dramatically improve query response – up to 2000X faster –
to support time-sensitive decisions
• Right-time. Low latency. Trusted. Accurate.
SAVINGS
• Minimize data proliferation
• Lower the cost of storing and managing historical data
• Free up compute resources
SIMPLICITY
• Simplify infrastructure, reduce ETL and data movement off-platform
• Non-disruptive installation
SECURITY
• Safeguard valuable data under the control and security of Db2 for z/OS
• Protected. Secured. Governed.
4
IBM z Analytics Db2 Analytics Accelerator
Long name: Db2 Analytics Accelerator for z/OS Version 7.1, deployment on IBM
Integrated Analytics System (IAS)
Short name: Accelerator on IAS
Long name: Db2 Analytics Accelerator for z/OS Version 7.1, deployment on IBM Z
Short name: Accelerator on IBM Z
8
IBM z Analytics Db2 Analytics Accelerator
Agenda
• Value Proposition
• Version 7.1 Architecture and Deployment Options
• Version 7.1 Functionality
• Version 5 PTF7 Updates
• Customer Use Cases
9
10
IBM z Analytics Db2 Analytics Accelerator
Db2 Analytics Accelerator
In-memory column processing Multi-core and SIMD Patented compression Skips unnecessary
with dynamic movement of data parallelism technique preserves order -- processing of irrelevant data
from storage (Single instruction data can be used without
Multiple Data) decompressing
• Huge potential for faster ingest for incremental updates, and thereby less HTAP query delay!
• IBM’s premier analytics engine across many products
• Latest analytics technology innovations
• SQL compatibility across all IBM products
• High degree of concurrent users and queries
The new engine is replaced internally – external interfaces will stay the same.
The same Db2 subsystem can be connected to the existing and new generation.
11
IBM z Analytics Db2 Analytics Accelerator
OSA-Express
• Data Studio with Db2 Analytics
10 GbE
Accelerator Studio Plug-in
Patch Panel
• Data Server Manager 2.1.5 or higher
OSA-Express
10 GbE
Dedicated highly available
network connection
IBM Db2
Users/ Db2 for z/OS enabled for Analytics
Applications IBM Db2 Analytics Accelerator Accelerator
incl. Stored procedures
14
IBM z Analytics Db2 Analytics Accelerator
IBM Power 8
S822L 24 core
server 3.02GHz 15
IBM z Analytics Db2 Analytics Accelerator
16
IBM z Analytics Db2 Analytics Accelerator
Data Skipping
Skips unnecessary processing of irrelevant data
Actionable Compression
Patented compression technique that preserves order allowing data to be used
without decompressing first
17
IBM z Analytics Db2 Analytics Accelerator
Docker container
Components
Existing
Accelerator Db2 Warehouse Additional future
server engine functionality
SE / HCM Workload
Systems Manager Authentication
Monitoring
Accelerator Solution
Delivered as part of
Docker supported OS + management
Customer’s
Storage Storage (SAN) Filesystem
Management
Hardware Considerations
• 1 Accelerator = 1 LPAR – but multiple Accelerator LPARs on a single system possible
• Each LPAR requires IFLs, RAM, and storage
Minimum suggested configuration: • You can convert your z14 models • FCP or FICON attached
M01, M02, M03 servers by adding a • Flash storage is not
• 4 IFLs, 256 GB memory for dedicated drawer containing IFLs
test/dev required, but e.g. IBM
and memory at a very attractive price FlashSystem 900 is an
• 8 IFLs, 512 GB memory for • Also available on M04, M05 as a new effective option
production build inclusive of the drawer, as you
cannot add another drawer to M04
or M05 machines
20
IBM z Analytics Db2 Analytics Accelerator
21
IBM z Analytics Db2 Analytics Accelerator
Appliance UI
Private
network
M:N connectivity
– Multiple Db2 subsystems can
share an Accelerator
– Multiple Accelerators can
OSA 10 Gbps recommended
connect to a Db2 subsystem
Shared OSA is possible
23
IBM z Analytics Db2 Analytics Accelerator
24
IBM z Analytics Db2 Analytics Accelerator
POWER hardware and storage integrated in a self-contained Software appliance deployed on customer’s z hardware and
workload-optimized system for analytics storage infrastructure
HA support within appliance CA/DR support based on GDPS (active/passive and active/active)
Analytics for largest data volumes and highest performance Flexible, smaller, elastic deployment option
Integrated storage and management integration into existing z environment: hardware and storage
management, CA/DR infrastructure, support processes,
organizational structures,… – no new infrastructure needed 25
IBM z Analytics Db2 Analytics Accelerator
Connectivity options
Multiple Db2 systems can connect
to a single Accelerator
Represents all
deployment
options
Full flexibility for § residing in the same LPAR § being independent (non-data sharing)
Db2 systems and § residing in different LPARs § belonging to the same data sharing group
deployment options: § residing in different CECs § belonging to different data sharing groups 26
IBM z Analytics Db2 Analytics Accelerator
Superior
availability,
reliability, security
Workload
management
Superior
z/OS on IBM Z performance on
analytic queries
27
IBM z Analytics Db2 Analytics Accelerator
Agenda
• Value Proposition
• Version 7.1 Architecture and Deployment Options
• Version 7.1 Functionality
• Version 5 PTF7 Updates
• Customer Use Cases
28
IBM z Analytics
• Data synchronization
– Db2 table and partition load
– Incremental update
– Db2 Analytics Accelerator Loader
29
IBM z Analytics Db2 Analytics Accelerator
Queries executed
with Accelerator
30
IBM z Analytics Db2 Analytics Accelerator
Routing criteria
•Restrictions:
• No user defined functions (except inline SQL scalar UDF, compiled SQL scalar UDF)
• No multiple encoding schemes in the same statement
§ Scope: Row
Incremental Update § Scattered updates after “bulk” load § Based on Change Data Capture (CDC) of IBM
§ Reporting on continuously updated data (e.g., an ODS), InfoSphere Data Replication
Log-based capturing of
changes and propagation to considering most recent changes § Management integrated into stored
Accelerator with low latency § More efficient for smaller updates than full table procedures and Data Studio to:
(typically few minutes) refresh • Enable/Disable tables for replication
• Start/Stop replication 35
IBM z Analytics
§ Queries can be routed to accelerator when tables or partitions are being refreshed
§ Changes to DB2 data can prevented during load process using LOCKMODE option
• LOCKMODEs supported: TABLESET, TABLE, PARTITION, NONE
• Ensures consistent snapshot
§ Supports change detection: DB2 automatically determines if table/partition was changed based on DB2 real
time statistics
• Otherwise skips the table/partition in the load request 36
IBM z Analytics Db2 Analytics Accelerator
Table B Table A
Stored Procedures
38
IBM z Analytics
Incremental Update
§ Keeps data in DB2 and the Accelerator in synch in near real-time
§ Scope: Row
§ Based on the Change Data Capture (CDC) component of IBM InfoSphere Data Replication
§ INSERT/UPDATE/DELETE statements captured from DB2 log data and replicated
to the Accelerator
§ Default apply interval around 31 secondes
§ UPDATES are decomposed into DELETEs and INSERTs
§ Tables enabled for incremental update require either an enforced uniqueness (primary key,
unique index) or a defined informational constraint (via ACCEL_ADD_TABLES stored procedure)
§ Required for DELETEs
§ Continuous replication
§ Base table not locked while table initially loaded to the Accelerator
§ Replication not stopped if replication subscription is changed (tables added, removed, loaded, reloaded)
39
IBM z Analytics Db2 Analytics Accelerator
Stored Accelerator
Table Procedures Server Table
T1 T1
Table Table
T2 T2
Manages
Table Subscription CDC Table
T3 Access T3
Server
T T T
1 2 3
Db2 Log
Data
CDC CDC
Capture Replication
Agent Committed Data
Engine
40
IBM z Analytics Db2 Analytics Accelerator
Remote
SMF Data VSAM & Flat File
Db2 for z/OS
Other Relational
z/OS Syslog IMS
non z
#1
S-
A BLE
D_T
OA
L_L
T CE
AC
A
B
L
E ACCEL_LOAD_TABLES - #2
AC
CE
L_
LO
AD
_T
AB
LE
S-
#3
T
A
B
L
E High Availability Load
A one single load operation
Features
• Included with Accelerator Loader product
• Integrated backup, and fast recovery when needed
• Familiar DBA Functionality
• Fits into disaster recovery scenarios
PI70981: Backup & Recovery of Accelerator-only Tables
• Fast Restore to copy point
47
IBM z Analytics Db2 Analytics Accelerator
Write
requests
Asynchronous replication
OLTP
reads
Most recent
OLAP committed
reads data no
required? yes
yes
Most recent
committed no Initiate
data apply
Wait for committed data available?
• Default: 0 = No wait
replication
OLTP
Most recent
OLAP committed
reads data no
– WAITFORDATA = 0 required? yes
yes
• Immediately execute in accelerator (Current behavior, no delay)
Most recent
committed no Initiate
data apply
available?
Execute on
• The flow diagram shows the effect of the Timeout before N
Accelerator with
WAITFORDATA special register data arrived? "new" data
SQLCODE 0
Y
• It shows how the Accelerator processes a query,
dependent on settings and timings Execute on
Accelerator with
OPTION 13 N "old" data
= Fail ?
SQLCODE +904
Execute on
DB2
50
IBM z Analytics Db2 Analytics Accelerator
Accelerator-shadow table
Table 2 Table 2
• Data in DB2 and the Accelerator
Accelerator-archived table /
Not partition Table 3 Table 3
available • Empty read-only partition in DB2
for V7 yet • Partition data is in Accelerator only
Table 4 Table 4
Accelerator-only table (AOT)
• “Proxy table” in DB2
• Data is in Accelerator only
IBM z Analytics Db2 Analytics Accelerator
Accelerator-only tables
Supporting in-database transformation and multi-step processing
§ Can be used to store a set of data in Db2 Analytics Accelerator only, not on Db2 for z/OS, without using
the High Performance Storage Saver functionality
§ Accelerate in-database data transformations and data movement processes
§ Reduced need of data movement processes to other platforms for data transformation purposes
§ Enables multi-step reporting on the Accelerator
§ Saves disk space and CPU cost on IBM Z currently used for transformations and reporting steps
§ Allow data preparation steps for data mining and other advanced analytics to execute on the
Accelerator
53
IBM z Analytics Db2 Analytics Accelerator
§ SELECT and INSERT/UPDATE/DELETE operations using AOTs can only run on the Accelerator
• QUERY ACCELERATION behavior must be set to ENABLE/ELIGIBLE/ALL
• Accelerator-shadow tables and other AOTs can be used in the same statement
2
Temporary
results
n
Temporary
results
ELT logic
Customer Transaction
Summary and History
AOTs
Customer Summary
Analytics Mart AOTs
Advantages:
• Simpler to manage
• Better performance and
reduced latency Data for transactional and analytical processing
56
IBM z Analytics
• Incremental Update
• New datatypes: BINARY, DECFLOAT, ROWID
• Recursive SQL support
Accelerator on IBM Z • Support for Db2 Analytics Accelerator Loader
• Accelerator monitor in Data Studio
• Configuration Console access via ssh
• Support for additional IIAS hardware sizes and
configurations
– M4002 series including M4002-003 GoD
• Additionally supported IBM Z hardware
– z14 ZR1
– z13 and Linux One (check firmware levels with IBM)
Accelerator on IIAS (Appliance)
57
IBM z Analytics
Enhancements in 7.1.5
• Pass-through support for a set of OLAP built-in functions
Accelerator on IBM Z
• Support for Local Date Format in accelerated queries
• Support for multi-row insert for accelerator-only tables
• True HTAP: SMF counters for monitoring wait/expiration times of queries
• Accelerator on Z:
• GDPS support for fail-over scenarios
• Script based image deployments
• Coexistence V5/V7: Option to specify default accelerator for query routing
Accelerator on IIAS (Appliance) to support migration scenarios (Db2 enhancement only)
58
IBM z Analytics
•Passthru support for additional built-in functions. See this blog entry:
Newly supported builtin functions in Db2 12 for z/OS with Db2 Analytics Accelerator V7
•Accelerator on Z: Support for small deployments for small production, test/dev or
getting-started use cases. The minimum suggested configuration is: 2 IFLs, 64 GB
Accelerator on IBM Z
memory and 16 GB for each Change Data Capture (CDC) subscription (in case
Incremental Update is used).
62
IBM z Analytics
What’s next?
63
Roadmap
è The new “Multi-Node Deployment Option on IBM Z” will close that gap and
is designed to scale up to performance/capacity similar to M4002-020
Roadmap
HiperSocket
…
Db2 z/OS networ IDAA server Db2 WH Db2 WH
k Db2 WH • Data • Data
• Catalog partitions partitions
• No data
partitions
…
Storage
Roadmap
GDPS KSys
XRC instance z/OS LPAR B
GDPS RSys
GDPS K1 GDPS K2 XRC instance z/OS LPAR C
Metro instance z/OS LPAR A Metro instance z/OS LPAR B
IDAA-on-Z LPAR C
IDAA-on-Z LPAR A IDAA-on-Z LPAR B
GDPS Agent
GDPS Agent GDPS Agent
nt
iffere
B u ses d PAR A LPAR C uses same
LPAR data as L data as LPAR B
/
Disks
P P P GDPS-PPRC S S S GDPS-XRC DR DR DR
Roadmap
nt
iffere
R B uses d PAR A
LPA as L
/data
Disks
Flash
P P P GDPS-PPRC S S S XRC
DR DR DR Copy DS DS DS
IBM z Analytics Db2 Analytics Accelerator
Agenda
• Value Proposition
• Version 7.1 Architecture and Deployment Options
• Version 7.1 Functionality
• Version 5 PTF7 Updates
• Customer Use Cases
73
IBM z Analytics
74
IBM z Analytics Db2 Analytics Accelerator
Agenda
• Value Proposition
• Version 7.1 Architecture and Deployment Options
• Version 7.1 Functionality
• Version 5 PTF7 Updates
• Customer Use Cases
75
IBM z Analytics Db2 Analytics Accelerator
76
IBM z Analytics Db2 Analytics Accelerator
• What’s available?
• Product videos - for a quick glance at
product features and capabilities
• A guided demo - provides a guided
walk-through of the Accelerator’s
basic capabilities
• A hands-on lab - offers a full product
experience with a guided tutorial
77
Legal Disclaimer
• The information contained in this publication is provided for informational purposes only. While efforts were made to verify
the completeness and accuracy of the information contained in this publication, it is provided AS IS without warranty of any
kind, express or implied. In addition, this information is based on IBM’s current product plans and strategy, which are subject
to change by IBM without notice. IBM shall not be responsible for any damages arising out of the use of, or otherwise related
to, this publication or any other materials. Nothing contained in this publication is intended to, nor shall have the effect of,
creating any warranties or representations from IBM or its suppliers or licensors, or altering the terms and conditions of the
applicable license agreement governing the use of IBM software.
• References in this presentation to IBM products, programs, or services do not imply that they will be available in all countries
in which IBM operates. Product release dates and/or capabilities referenced in this presentation may change at any time at
IBM’s sole discretion based on market opportunities or other factors, and are not intended to be a commitment to future
product or feature availability in any way. Nothing contained in these materials is intended to, nor shall have the effect of,
stating or implying that any activities undertaken by you will result in any specific sales, revenue growth or other results.