0% found this document useful (0 votes)

11 views15 pages

Parallel Query Processing in PostgreSQL

Uploaded by

FrancesHsieh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views15 pages

Parallel Query Processing in PostgreSQL

Uploaded by

FrancesHsieh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Parallel query processing in PostgreSQL

Daniel Vojtek
12.2.2009
Content

 Motivation
 Query processing in PostgreSQL
 Introduction to parallelization
 Parallel processing of subquery
 Sorting
 Our approach and work
 Problems with parallelization

2
Motivation

 Databases are larger and larger

 More effective usage of resources
 More and more CPUs on one machine
 Speed up in query execution (linear)
 Scale up (linear)

3
Query processing

 For each session PostgreSQL creates one

backend process
 Processing query then involves:
 Parsing
 Apllying rewrite rules
 Creation of optimized execution plan
 Executing the plan
 Utility Processing (for DDL)

4
Parallelism in DB

 Usage of multiple CPUs to perform parts of a

single task
 Interquery parallelism – parallelism among
queries – already in PostgreSQL
 Intraquery parallelism – operations within query
are executed parallely
 Intraoperation - parallel subqueries
 Interoperation – parallel sort

5
Intraquery - interoperation

 Pipelining – output records of operation A are

consumed by a second operation B, even
before the first operation has produced the
entire set of records
 Saves space by not storing complete intermediate
results.
 Independent – operations do not depend on
each other – multiple joins (4 = 2 + 2)
 Mixed – more practical solution

6
Intraquery – interoperation cont`d

 Planner produces tree of plan Nodes

 No support of parallelism in planner
 Executor decides which branches of plan tree to
execute in separate thread
 Smart planner
 Adds new Parallel Nodes to plan
 Distribute – single input, multiple output
 Gather – multiple output, single input
 Rejects to use parallelization for simple queries
 Optimizes parallelization 7
Intraquery - intraoperation

 Parallel sorting – in memory quicksort

 Divide and conquer strategy – divides list into
two sublists
 Sublists can then be processed by separate
threads
 After sublists are sorted there is no need for
synchronization – sort is finished
 Without preprocessing there is a linear speedup

8
Other tasks

 Parallel plan scoring

 Planner can search more of the plan space
 Search for optimal plan is NPC problem
 Index rebuilding
 When they spawned many levels or have many
deleted leaf rows
 Importing large warehouse tables
 Partitioned tables
 Parallel processing of partitions
9
Our approach

 Implement intraquery parallelization with

threads
 Create global pool of threads for each backend,
so different phases of query processing can
use it

10
Problems

 Technical:
 PostgreSQL code is not thread safe
 Signal handling
 Logical: Structures like Locks are per process
based. Deadlock management. Decision about
parallelism in planner or in executor
 Support of threads differs on OS
 POSIX threads
 WinThreads
11
Competition

 Oracle
 Large support of parallelism
 Parallel hint for queries, parallel index, partitions
 MS SQL
 Index rebuilding, parallel query support for partitions
 DB2
 Parallel query, partitions.

12
Summary

 Speed up and scale up for processor-intensive

queries
 Intraquery paralllelism
 Implemented with threads
 Work in progress

13
Sources

 PostgreSQL source code

 High Performance Parallel Database
Processing and Grid Databases - David Taniar

14
Q&A

Q Tips: Fast, Scalable, and Maintainable Kdb+
From Everand
Q Tips: Fast, Scalable, and Maintainable Kdb+
Nick Psaris
No ratings yet
DFo Section 2 Quiz
80% (5)
DFo Section 2 Quiz
22 pages
OpenID Connect in Action v13
100% (1)
OpenID Connect in Action v13
264 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Postgresql Benchmark
No ratings yet
Postgresql Benchmark
36 pages
Cs6005 - Advanced Database Systems (Unit-1)
No ratings yet
Cs6005 - Advanced Database Systems (Unit-1)
136 pages
Query Parallelism
No ratings yet
Query Parallelism
8 pages
Parallel Execution in Oracle
No ratings yet
Parallel Execution in Oracle
17 pages
14 Queryexecution2
No ratings yet
14 Queryexecution2
6 pages
Mastering DuckDB: High-Performance Analytics Made Easy
From Everand
Mastering DuckDB: High-Performance Analytics Made Easy
Robert Johnson
No ratings yet
Parallel and Distributed Databases NOTES
No ratings yet
Parallel and Distributed Databases NOTES
98 pages
Optimizing Deep Learning Workloads with OneDNN: The Complete Guide for Developers and Engineers
From Everand
Optimizing Deep Learning Workloads with OneDNN: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Mastering Apache Cassandra - Second Edition
From Everand
Mastering Apache Cassandra - Second Edition
Nishant Neeraj
No ratings yet
ADBMS Parallel and Distributed Databases
No ratings yet
ADBMS Parallel and Distributed Databases
98 pages
Inter and Intra Query Parallelism
No ratings yet
Inter and Intra Query Parallelism
1 page
CO2 Session 11
No ratings yet
CO2 Session 11
25 pages
Dbms
No ratings yet
Dbms
14 pages
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
No ratings yet
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
27 pages
Parallel Databases
No ratings yet
Parallel Databases
11 pages
CO2 Session 13
No ratings yet
CO2 Session 13
25 pages
Advanced Java Data Structures: Techniques and Applications for Efficient Programming
From Everand
Advanced Java Data Structures: Techniques and Applications for Efficient Programming
Adam Jones
No ratings yet
Cap 5
No ratings yet
Cap 5
50 pages
Database Management Systems: Unit 4 - Parallel DBMS
No ratings yet
Database Management Systems: Unit 4 - Parallel DBMS
14 pages
Mastering Data Structure in Java: Advanced Techniques
From Everand
Mastering Data Structure in Java: Advanced Techniques
Ed A Norex
No ratings yet
Neon Serverless Postgres Engineering: The Complete Guide for Developers and Engineers
From Everand
Neon Serverless Postgres Engineering: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Query Execution: Intro To Database Systems Andy Pavlo
No ratings yet
Query Execution: Intro To Database Systems Andy Pavlo
63 pages
Scalable Computing with Dask: The Complete Guide for Developers and Engineers
From Everand
Scalable Computing with Dask: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
DBA Roadmap - Learn To Become A Database Administrator With Postg
No ratings yet
DBA Roadmap - Learn To Become A Database Administrator With Postg
8 pages
XNNPACK for Efficient Neural Network Inference on CPU: The Complete Guide for Developers and Engineers
From Everand
XNNPACK for Efficient Neural Network Inference on CPU: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Advanced PostgreSQL Mastery: In-Depth Database Techniques and Performance Tuning
From Everand
Advanced PostgreSQL Mastery: In-Depth Database Techniques and Performance Tuning
Adam Jones
No ratings yet
PostgreSQL_Interview_QA
No ratings yet
PostgreSQL_Interview_QA
9 pages
Postgresql Tuning Guide: Postgresql Architecture: Key Takeaways
No ratings yet
Postgresql Tuning Guide: Postgresql Architecture: Key Takeaways
8 pages
Learning Hadoop 2
From Everand
Learning Hadoop 2
Garry Turkington
4/5 (1)
DeepSparse for Efficient CPU Inference: The Complete Guide for Developers and Engineers
From Everand
DeepSparse for Efficient CPU Inference: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Apache Nemo Data Processing Optimization: The Complete Guide for Developers and Engineers
From Everand
Apache Nemo Data Processing Optimization: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Oracle Explain Plans EXPLAINED
100% (1)
Oracle Explain Plans EXPLAINED
35 pages
Learning Apache Spark 2
From Everand
Learning Apache Spark 2
Muhammad Asif Abbasi
No ratings yet
Lecture - 12 Transactions and Concurrency
No ratings yet
Lecture - 12 Transactions and Concurrency
13 pages
Java Concurrency Patterns: Mastering Multithreading and Asynchronous Techniques
From Everand
Java Concurrency Patterns: Mastering Multithreading and Asynchronous Techniques
Peter Jones
No ratings yet
Data Structure and Algorithms in Java: From Basics to Expert Proficiency
From Everand
Data Structure and Algorithms in Java: From Basics to Expert Proficiency
William Smith
No ratings yet
Execution
No ratings yet
Execution
37 pages
Query Execution and Query Plan Analysis
No ratings yet
Query Execution and Query Plan Analysis
28 pages
Recent Postgresql Optimizer Improvements: Tom Lane Postgresql - Red Hat Edition Group Red Hat, Inc
No ratings yet
Recent Postgresql Optimizer Improvements: Tom Lane Postgresql - Red Hat Edition Group Red Hat, Inc
34 pages
Mastering Python
From Everand
Mastering Python
Rick van Hattem
No ratings yet
Postgresql Query Optimization: Step by Step Techniques
No ratings yet
Postgresql Query Optimization: Step by Step Techniques
50 pages
To Paralelel or Not
No ratings yet
To Paralelel or Not
62 pages
OpenCL Programming and Architecture: Definitive Reference for Developers and Engineers
From Everand
OpenCL Programming and Architecture: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
NoSQL Essentials: Navigating the World of Non-Relational Databases
From Everand
NoSQL Essentials: Navigating the World of Non-Relational Databases
Kameron Hussain
No ratings yet
Efficient Data Science Workflows with Vaex: Definitive Reference for Developers and Engineers
From Everand
Efficient Data Science Workflows with Vaex: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Mastering the Art of Nix Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Art of Nix Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
Java Concurrency and Multithreading: Unlock the Secrets of Expert-Level Skills
From Everand
Java Concurrency and Multithreading: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Chapter 2
No ratings yet
Chapter 2
47 pages
Chapter One1
No ratings yet
Chapter One1
21 pages
Psycopg 2010 Stuttgart
No ratings yet
Psycopg 2010 Stuttgart
44 pages
Mastering Core Java：Advanced Techniques and Tricks
From Everand
Mastering Core Java：Advanced Techniques and Tricks
Ted Norice
No ratings yet
2 Algorithms For Query Processing Optimization
No ratings yet
2 Algorithms For Query Processing Optimization
46 pages
Ads Unit 3
No ratings yet
Ads Unit 3
8 pages
Vaex for Scalable Data Processing in Python: The Complete Guide for Developers and Engineers
From Everand
Vaex for Scalable Data Processing in Python: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Query Optimization
No ratings yet
Query Optimization
17 pages
PostgreSQL Server Programming 2nd Edition Usama Dar - Download The Ebook and Start Exploring Right Away
100% (1)
PostgreSQL Server Programming 2nd Edition Usama Dar - Download The Ebook and Start Exploring Right Away
81 pages
PostgreSQL Replication - Second Edition
From Everand
PostgreSQL Replication - Second Edition
Hans-Jurgen Schonig
No ratings yet
Couchbase Essentials: Definitive Reference for Developers and Engineers
From Everand
Couchbase Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
T-SQL Cheat Sheet
100% (1)
T-SQL Cheat Sheet
20 pages
T-SQL Improvements and Data Types
No ratings yet
T-SQL Improvements and Data Types
24 pages
Database Connection Strings
No ratings yet
Database Connection Strings
22 pages
Big Oh
No ratings yet
Big Oh
10 pages
Basic SQL: CHAPTER 4 (6/E) CHAPTER 8 (5/E)
No ratings yet
Basic SQL: CHAPTER 4 (6/E) CHAPTER 8 (5/E)
28 pages
Update SQL Server Statistics
No ratings yet
Update SQL Server Statistics
1 page
Transact-SQL Syntax Conventions (Transact-SQL)
No ratings yet
Transact-SQL Syntax Conventions (Transact-SQL)
2 pages
Robocopy and A Few Examples 2
No ratings yet
Robocopy and A Few Examples 2
7 pages
Robocopy and A Few Examples PDF
100% (1)
Robocopy and A Few Examples PDF
10 pages
Architecture Design
100% (1)
Architecture Design
40 pages
Logical Operators Group
No ratings yet
Logical Operators Group
14 pages
Oracle Vs SQL Server Issues
No ratings yet
Oracle Vs SQL Server Issues
2 pages
Oracle Metadata: Data Dictionary vs. Dynamic Performance Views
No ratings yet
Oracle Metadata: Data Dictionary vs. Dynamic Performance Views
1 page
Check SQL Server Port Availability
No ratings yet
Check SQL Server Port Availability
3 pages
SQL Server Query Optimization Techniques PDF
No ratings yet
SQL Server Query Optimization Techniques PDF
9 pages
Steps To Shrink VMware Disks
No ratings yet
Steps To Shrink VMware Disks
1 page
Resumable Space
No ratings yet
Resumable Space
2 pages
Rolling Upgrade To SQL Server Database Mirroring
No ratings yet
Rolling Upgrade To SQL Server Database Mirroring
4 pages
Oracle Database Memory Management
No ratings yet
Oracle Database Memory Management
2 pages
Rolling Upgrade To SQL Server Database Mirroring
No ratings yet
Rolling Upgrade To SQL Server Database Mirroring
4 pages
Oracle SQL Loader - Conventional Path vs. Direct Path
No ratings yet
Oracle SQL Loader - Conventional Path vs. Direct Path
2 pages
Change Data Capture Error 14234
No ratings yet
Change Data Capture Error 14234
2 pages
Difference Between Cache and Buffer
No ratings yet
Difference Between Cache and Buffer
2 pages
Host Credentials Oracle EM
No ratings yet
Host Credentials Oracle EM
1 page
Ofa & Omf
No ratings yet
Ofa & Omf
2 pages
Elementary Data Structures - Stacks, Queues, & Lists, Amortized Analysis Trees
No ratings yet
Elementary Data Structures - Stacks, Queues, & Lists, Amortized Analysis Trees
41 pages
Presentation Evaluation Form
No ratings yet
Presentation Evaluation Form
1 page
Format Model Modifiers
No ratings yet
Format Model Modifiers
4 pages
Automating Oracle Database Startup and Shutdown On Linux SOP
100% (1)
Automating Oracle Database Startup and Shutdown On Linux SOP
3 pages
Sysbench Manual
No ratings yet
Sysbench Manual
17 pages
rcv2 PDF
No ratings yet
rcv2 PDF
8 pages
Make Passive Income With Faceless TikTok, YouTube, Instagram
No ratings yet
Make Passive Income With Faceless TikTok, YouTube, Instagram
72 pages
Gov Response For New DPR Trial
No ratings yet
Gov Response For New DPR Trial
150 pages
FB Produck
No ratings yet
FB Produck
2 pages
3KA71234AA00 Datasheet en PDF
No ratings yet
3KA71234AA00 Datasheet en PDF
2 pages
Thermal Mass Flow Meter Manual - V2-1
No ratings yet
Thermal Mass Flow Meter Manual - V2-1
37 pages
Distillation Design and Control Using Aspen Simulation
No ratings yet
Distillation Design and Control Using Aspen Simulation
10 pages
Developing A Complete Project Scope Statement in 2 Days
No ratings yet
Developing A Complete Project Scope Statement in 2 Days
11 pages
Camera
No ratings yet
Camera
19 pages
Soft Sensing of Product Quality in The Debutanizer
No ratings yet
Soft Sensing of Product Quality in The Debutanizer
8 pages
ANTEX Sound Cards
No ratings yet
ANTEX Sound Cards
12 pages
RMT 323
No ratings yet
RMT 323
4 pages
Xtream Code 2021
No ratings yet
Xtream Code 2021
5 pages
Res Expansion Candidate Submittal Template
No ratings yet
Res Expansion Candidate Submittal Template
3 pages
Pavankumarreddy
No ratings yet
Pavankumarreddy
3 pages
FrederiksbergUserGuide 2 1
No ratings yet
FrederiksbergUserGuide 2 1
36 pages
Smart Data Services For Ngeniusone Splunk Integration
No ratings yet
Smart Data Services For Ngeniusone Splunk Integration
2 pages
Comptia Securityx Cas 005 Exam Objectives (3 0)
No ratings yet
Comptia Securityx Cas 005 Exam Objectives (3 0)
17 pages
SQL Project Module 7
No ratings yet
SQL Project Module 7
14 pages
Design of Index Based Round Robin Arbiter For NOC Router
No ratings yet
Design of Index Based Round Robin Arbiter For NOC Router
6 pages
Deep Learning - Assignment 11 Your Name, Roll Number 1. What Is The Difference Between Backpropagation Algorithm and Backpropagation Through Time (BPTT) Algorithm ?
No ratings yet
Deep Learning - Assignment 11 Your Name, Roll Number 1. What Is The Difference Between Backpropagation Algorithm and Backpropagation Through Time (BPTT) Algorithm ?
10 pages
Ir2117 Igbt Driver PDF
No ratings yet
Ir2117 Igbt Driver PDF
18 pages
CASIO QV-8000SX Digital Camara SM
No ratings yet
CASIO QV-8000SX Digital Camara SM
57 pages
Cspro 80
No ratings yet
Cspro 80
959 pages
Performance Evaluation of Supervised Machine Learning Techniques For Efficient Detection of Emotions From Online Content
No ratings yet
Performance Evaluation of Supervised Machine Learning Techniques For Efficient Detection of Emotions From Online Content
26 pages
DKG 972
No ratings yet
DKG 972
6 pages
Course Pack
No ratings yet
Course Pack
559 pages

Parallel Query Processing in PostgreSQL

Uploaded by

Parallel Query Processing in PostgreSQL

Uploaded by

Parallel query processing in PostgreSQL

 Databases are larger and larger

 For each session PostgreSQL creates one

 Usage of multiple CPUs to perform parts of a

 Pipelining – output records of operation A are

 Planner produces tree of plan Nodes

 Parallel sorting – in memory quicksort

 Parallel plan scoring

 Implement intraquery parallelization with

 Speed up and scale up for processor-intensive

 PostgreSQL source code

You might also like