Lec 7 Query Processing, Optimization & Indexing

The document covers the fundamentals of query processing in database management systems, detailing the stages of parsing, optimization, and execution. It discusses various optimization techniques, including heuristic and cost-based methods, as well as different join algorithms such as nested loop, hash, and merge joins. Additionally, it provides insights into execution plans and how to optimize SQL queries for efficiency.

Uploaded by

mhariskhan513

Query Processing, Optimization & Indexing
Lecture Agenda
⚫ Query Processing in DBMS
⚫ Query Parsing & Optimization
⚫ Execution Plans & Join Algorithms
⚫ Indexing & Index Structures
⚫ B-Trees, Hash Indexes
⚫ Unique, Composite, and Covering Indexes
What is Query Processing?
⚫ Query processing is the series of steps a database
management system (DBMS) follows to interpret and
execute a user's request (query) to retrieve or
manipulate data stored in a database.
⚫ In practice, this means translating a declarative SQL query into
low-level operations that the DBMS can execute.
Query Processing Phases
Query processing involves three stages: parsing, optimization,
and execution.
⚫ Parsing: Analyzing SQL syntax and semantics
⚫ Optimization: Finding the most efficient execution
plan
⚫ Execution: Running the query using the selected plan
Query Parsing
⚫ Query parsing is the process of analyzing a query
written in SQL (or another query language) to
understand its syntax and structure.
⚫ The parser checks whether the query follows the
correct grammar of the query language. If the syntax is
correct, it translates the query into a parse tree or
abstract syntax tree (AST), which is a tree
representation of the query's logical structure.
Steps in Query Parsing
⚫ Lexical Analysis: The query string is divided into
tokens (e.g., keywords, operators, identifiers).
⚫ Syntax Analysis: The tokens are checked against the
grammar of the query language to ensure correct
structure.
⚫ Semantic Analysis: The parser checks for semantic
errors (e.g., referencing non-existent tables or
columns).
⚫ Query Rewrite: Some queries are rewritten for
optimization purposes (e.g., transforming a subquery
into a join).
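The lexical analysis step above can be illustrated with a small tokenizer. This is a minimal sketch, not how any particular DBMS implements its lexer: the token categories, the keyword list, and the function name tokenize are assumptions made for illustration, and real SQL lexers handle many more cases (quoted identifiers, comments, escapes).

```python
import re

# Assumed, deliberately tiny keyword set; a real SQL grammar has many more.
KEYWORDS = {"SELECT", "FROM", "WHERE", "JOIN", "ON", "AND", "OR"}

# One named group per token category (hypothetical categories for this sketch).
TOKEN_RE = re.compile(r"""
    (?P<NUMBER>\d+(?:\.\d+)?)
  | (?P<STRING>'[^']*')
  | (?P<IDENT>[A-Za-z_][A-Za-z_0-9]*)
  | (?P<OP><=|>=|<>|!=|[=<>.,;()*])
  | (?P<WS>\s+)
""", re.VERBOSE)


def tokenize(sql):
    """Split a query string into (kind, text) tokens, skipping whitespace."""
    tokens = []
    pos = 0
    while pos < len(sql):
        match = TOKEN_RE.match(sql, pos)
        if match is None:
            raise ValueError(f"unexpected character {sql[pos]!r} at position {pos}")
        kind, text = match.lastgroup, match.group()
        pos = match.end()
        if kind == "WS":
            continue
        # Identifiers that match a reserved word are reclassified as keywords.
        if kind == "IDENT" and text.upper() in KEYWORDS:
            kind = "KEYWORD"
        tokens.append((kind, text))
    return tokens


if __name__ == "__main__":
    for token in tokenize("SELECT s.name FROM Student s WHERE s.age > 20;"):
        print(token)
```

The token list is what the syntax analysis step would then check against the grammar.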
Query Optimization
⚫ Query Optimization is the process where the database
system evaluates multiple ways to execute a query and
chooses the most efficient one, usually based on cost
estimates (e.g., time, memory, or I/O operations).
⚫ It happens after parsing and translating the query into a
logical plan, but before actual execution.
⚫ Why Optimize? Reduce I/O, CPU, memory use
Types of Query Optimization
⚫ Query optimization can be categorized in several ways,
based on how and when the optimization occurs.
⚫ The two key types are:
1. Heuristic query optimization
2. Cost-based Query optimization
Heuristic Query Optimization
⚫ Uses a set of rules of thumb (heuristics) to transform
and simplify the query into a more efficient form
without considering cost estimates.
Key Techniques:
⚫ Apply selection operations as early as possible (push
down selections).
⚫ Combine selections and projections to reduce the
number of columns/rows.
⚫ Use smaller tables first in joins.
⚫ Reorder joins and other operations based on known
patterns.
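The effect of pushing selections down can be seen in a small simulation. The sketch below is only an illustration under stated assumptions: the sample rows are made up, the helper names (join, late_selection, early_selection) are hypothetical, and the "cost" is simply the number of row pairs compared by a naive join.

```python
# Made-up sample rows following the Student/Course schema used in this lecture.
students = [
    {"student_id": 1, "name": "Ali", "age": 19, "course_id": 10},
    {"student_id": 2, "name": "Sara", "age": 22, "course_id": 10},
    {"student_id": 3, "name": "Omar", "age": 25, "course_id": 20},
    {"student_id": 4, "name": "Zain", "age": 18, "course_id": 20},
]
courses = [
    {"course_id": 10, "course_name": "Database"},
    {"course_id": 20, "course_name": "Networks"},
]


def join(left, right, key):
    """Naive join that also reports how many row pairs were compared."""
    comparisons = 0
    rows = []
    for l in left:
        for r in right:
            comparisons += 1
            if l[key] == r[key]:
                rows.append({**l, **r})
    return rows, comparisons


def late_selection():
    """Join first, then apply age > 20 to the joined rows."""
    joined, comparisons = join(students, courses, "course_id")
    return [row for row in joined if row["age"] > 20], comparisons


def early_selection():
    """Apply age > 20 first (selection pushed down), then join."""
    filtered = [s for s in students if s["age"] > 20]
    return join(filtered, courses, "course_id")


if __name__ == "__main__":
    print(late_selection()[1], "comparisons with late selection")
    print(early_selection()[1], "comparisons with early selection")
```

Both functions return the same rows; the early-selection version simply feeds fewer rows into the join.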
Heuristic optimization
Advantages:
⚫ Fast and simple
⚫ Good for basic improvements
Disadvantage:
⚫ Doesn’t always find the most efficient plan
Cost-Based Query Optimization
⚫ Evaluates multiple possible query execution plans using
statistics (e.g., table size, indexes, row selectivity) and chooses
the one with the lowest estimated cost.
Key Steps:
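⚫ Generate Candidate Execution Plans: The optimizer
generates alternative ways to execute the query using
different combinations of joins, scans, and sorting. In practice
the search space is pruned, because the number of possible join
orders grows very quickly with the number of tables.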
⚫ Estimate the Cost of Each Plan: Each plan’s cost is
estimated based on factors such as:
⚫ Number of I/O operations: Reading and writing data.
⚫ CPU time: Time spent processing data.
⚫ Memory usage: Storage used during query processing.
⚫ Select the Best Plan: The plan with the lowest cost is
selected as the optimal execution plan.
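The generate, estimate, and select steps can be sketched with a toy optimizer that chooses a join order. Everything numeric here is an assumption for illustration: the row counts, the selectivities, the extra Department table, and the cost model (sum of estimated intermediate result sizes for a left-deep plan). Real optimizers use far richer statistics and cost formulas.

```python
from itertools import permutations

# Hypothetical statistics (not from the lecture): rows per table.
ROWS = {"Student": 10_000, "Course": 100, "Department": 10}

# Hypothetical selectivity of each join predicate; pairs without an entry
# have no join condition, so joining them is a cross product (selectivity 1.0).
SELECTIVITY = {
    frozenset({"Student", "Course"}): 1 / 100,
    frozenset({"Course", "Department"}): 1 / 10,
}


def estimate_cost(order):
    """Estimate a left-deep plan's cost as the total size of its intermediate results."""
    joined = {order[0]}
    rows = ROWS[order[0]]
    cost = 0.0
    for table in order[1:]:
        selectivity = 1.0
        for other in joined:
            selectivity *= SELECTIVITY.get(frozenset({other, table}), 1.0)
        rows = rows * ROWS[table] * selectivity  # estimated rows after this join
        cost += rows
        joined.add(table)
    return cost


def best_plan(tables):
    """Enumerate every join order and keep the one with the lowest estimated cost."""
    return min(permutations(tables), key=estimate_cost)


if __name__ == "__main__":
    for order in permutations(ROWS):
        print(" -> ".join(order), estimate_cost(order))
    print("chosen:", " -> ".join(best_plan(list(ROWS))))
```

Under these assumed numbers, joining the two small tables first keeps the intermediate result small, which is why the cheapest orders bring in Student last.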
Cost Based optimization
Factors Affecting Query Cost:
⚫ Table Size: Larger tables require more I/O
operations to read.
⚫ Index Availability: If indexes are available, they
can speed up data retrieval.
⚫ Join Methods: Different join algorithms (e.g.,
nested loop, merge join, hash join) have different
costs.
⚫ Sort Operations: Sorting data can be expensive if
it requires additional memory or disk I/O.
Cost Based optimization
Advantages:
⚫ More accurate and efficient plans
⚫ Adapts to real data distribution
Disadvantages:
⚫ More time-consuming than heuristic optimization
⚫ Depends heavily on up-to-date statistics
Query Execution
⚫ A Query Execution Plan (QEP) is a step-by-step
strategy chosen by the database management system
(DBMS) to execute a SQL query efficiently. After
optimization, the DBMS selects the best plan and uses
it to retrieve or modify the data.
⚫ Generated by the optimizer
⚫ Describes how tables are accessed and joined
Execution Plan
⚫ Access Paths
How data will be accessed: full table scan, index scan, etc.
⚫ Join Methods
How tables will be joined: nested loop, hash join, merge
join.
⚫ Join Order
In what sequence tables will be joined.
⚫ Selection/Projection
When and how WHERE and SELECT clauses are applied.
⚫ Intermediate Steps
Temporary tables, sorting, filtering, grouping.
⚫ Estimated Costs
Estimated time, CPU usage, I/O operations, number of
rows processed at each step.
Example: Optimized Query Execution Plan
⚫ Student(student_id, name, age, course_id)
⚫ Course(course_id, course_name)
SELECT s.name, c.course_name
FROM Student s
JOIN Course c ON s.course_id = c.course_id
WHERE s.age > 20;
Execution Plan
Step | Operation           | Table            | Method          | Notes
1    | Index Scan          | Student          | B-Tree Index    | Use index on age to filter early
2    | Filter              | Student          | -               | s.age > 20
3    | Hash Join           | Student + Course | Hash Table      | Join on course_id
4    | Table Access (Full) | Course           | Full Table Scan | Load all course records
5    | Projection          | Result           | -               | Output s.name, c.course_name
How to View Execution Plans

⚫ MySQL: prefix the query with EXPLAIN (for example, EXPLAIN SELECT ...)
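To try this without a MySQL server, the sketch below uses SQLite's analogous EXPLAIN QUERY PLAN statement through Python's built-in sqlite3 module. Note that this is a swapped-in engine, so the output format and the chosen plan will differ from MySQL's EXPLAIN. The sample rows, the index name idx_student_age, and the helper explain are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Course (course_id INTEGER PRIMARY KEY, course_name TEXT);
    CREATE TABLE Student (
        student_id INTEGER PRIMARY KEY,
        name TEXT,
        age INTEGER,
        course_id INTEGER REFERENCES Course(course_id)
    );
    -- Assumed index, matching the "index on age" step in the example plan.
    CREATE INDEX idx_student_age ON Student(age);
    -- Made-up sample rows.
    INSERT INTO Course VALUES (1, 'Database'), (2, 'Networks');
    INSERT INTO Student VALUES
        (1, 'Ali', 19, 1), (2, 'Sara', 22, 1), (3, 'Omar', 25, 2);
""")

QUERY = """
    SELECT s.name, c.course_name
    FROM Student s
    JOIN Course c ON s.course_id = c.course_id
    WHERE s.age > 20
"""


def explain(connection, sql):
    """Return the textual detail column of SQLite's EXPLAIN QUERY PLAN output."""
    return [row[3] for row in connection.execute("EXPLAIN QUERY PLAN " + sql)]


plan_steps = explain(conn, QUERY)

if __name__ == "__main__":
    for step in plan_steps:
        print(step)
```

The exact wording of each step depends on the SQLite version and on the statistics available, so it is worth reading the output rather than assuming a particular plan.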

Join Algorithms Overview
The type of join used in the execution plan of a query
depends on various factors, such as:
⚫ The size of the tables involved
⚫ Availability of indexes
⚫ Join condition (e.g., equality vs inequality)
⚫ The database engine's optimizer
Selection depends on: Input size, Sort order, Join
condition
Types of Join Algorithms
⚫ Nested Loop Join
⚫ Hash Join
⚫ Merge Join
Nested Loop Join
⚫ Brute-force method: Compare every pair
⚫ Time Complexity: O(n × m)
⚫ Best For: Small datasets or indexed lookups
How it works: For each row in the outer table, the DBMS
searches for matching rows in the inner table.
Best when: One table is small, or there's an index on the join
column of the inner table.
⚫ Example in plan:
Nested Loop
-> Index Scan on Student
-> Index Lookup on Course
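A nested loop join can be sketched in a few lines. The version below is an illustration only, with made-up rows and hypothetical function names; the second function uses a Python dict as a stand-in for an index on the inner table's join column, which is the case where nested loops tend to perform well.

```python
# Made-up sample rows following the lecture's Student/Course schema.
students = [
    {"name": "Ali", "course_id": 10},
    {"name": "Sara", "course_id": 20},
    {"name": "Omar", "course_id": 10},
]
courses = [
    {"course_id": 10, "course_name": "Database"},
    {"course_id": 20, "course_name": "Networks"},
]


def nested_loop_join(outer, inner, key):
    """Compare every outer row with every inner row: O(n * m) comparisons."""
    result = []
    for o in outer:
        for i in inner:
            if o[key] == i[key]:
                result.append({**o, **i})
    return result


def index_nested_loop_join(outer, inner_index, key):
    """Look up matches per outer row in a dict that stands in for an index."""
    result = []
    for o in outer:
        for i in inner_index.get(o[key], []):
            result.append({**o, **i})
    return result


# Build the stand-in "index": join-key value -> list of inner rows.
course_index = {}
for course in courses:
    course_index.setdefault(course["course_id"], []).append(course)

if __name__ == "__main__":
    print(nested_loop_join(students, courses, "course_id"))
    print(index_nested_loop_join(students, course_index, "course_id"))
```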
Hash Join
⚫ Build Phase: Hash smaller table
⚫ Probe Phase: Scan larger table and match using hash
⚫ Efficient for equi-joins
How it works:
⚫ Build a hash table on the smaller table using the join key.
⚫ Scan the larger table and use the hash table to find matches.
Best when: No indexes, large tables, and equality joins.
Example in plan:
Hash Join
-> Seq Scan on Student
-> Hash on Course
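The build and probe phases can be sketched as follows. This is a minimal in-memory illustration with made-up rows and a hypothetical hash_join function; a real DBMS also has to handle hash tables that do not fit in memory, which this sketch ignores.

```python
from collections import defaultdict

# Made-up sample rows; Course is the smaller (build) side here.
courses = [
    {"course_id": 10, "course_name": "Database"},
    {"course_id": 20, "course_name": "Networks"},
]
students = [
    {"name": "Ali", "course_id": 10},
    {"name": "Sara", "course_id": 20},
    {"name": "Omar", "course_id": 10},
    {"name": "Zain", "course_id": 30},  # no matching course
]


def hash_join(small, large, key):
    """Equi-join two lists of dict rows on `key` using a hash table."""
    # Build phase: hash the smaller input on the join key.
    table = defaultdict(list)
    for row in small:
        table[row[key]].append(row)

    # Probe phase: scan the larger input once and look up matches.
    result = []
    for row in large:
        for match in table.get(row[key], []):
            result.append({**row, **match})
    return result


if __name__ == "__main__":
    for row in hash_join(courses, students, "course_id"):
        print(row["name"], row["course_name"])
```

Because matching relies on hashing the key, this approach only applies to equality conditions, which is why hash joins are tied to equi-joins.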
Merge Join
⚫ Works on sorted inputs
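⚫ Time Complexity: O(n + m) for the merge step when both inputs are
already sorted; sorting them first adds O(n log n + m log m)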
⚫ Efficient for large, pre-sorted data
How it works: Both tables are sorted on the join key,
and then merged together like in merge sort.
Best when: Tables are already sorted or sorting is
efficient.
Example in plan:
Merge Join
-> Sort on Student.course_id
-> Sort on Course.course_id
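The sort and merge steps can be sketched as below. This is an illustrative in-memory version with made-up rows and a hypothetical merge_join function; it sorts both inputs itself, whereas a DBMS would skip the sort when the data already arrives in join-key order (for example, from an index).

```python
# Made-up sample rows following the lecture's Student/Course schema.
students = [
    {"name": "Ali", "course_id": 10},
    {"name": "Sara", "course_id": 20},
    {"name": "Omar", "course_id": 10},
    {"name": "Zain", "course_id": 30},  # no matching course
]
courses = [
    {"course_id": 10, "course_name": "Database"},
    {"course_id": 20, "course_name": "Networks"},
    {"course_id": 40, "course_name": "AI"},  # no enrolled students
]


def merge_join(left, right, key):
    """Equi-join two lists of dict rows by sorting on `key` and merging."""
    left = sorted(left, key=lambda row: row[key])
    right = sorted(right, key=lambda row: row[key])
    result = []
    i = j = 0
    while i < len(left) and j < len(right):
        left_key, right_key = left[i][key], right[j][key]
        if left_key < right_key:
            i += 1
        elif left_key > right_key:
            j += 1
        else:
            # Find the run of right rows sharing this key value.
            j_end = j
            while j_end < len(right) and right[j_end][key] == left_key:
                j_end += 1
            # Pair every left row with that key against the whole run.
            while i < len(left) and left[i][key] == left_key:
                for match in right[j:j_end]:
                    result.append({**left[i], **match})
                i += 1
            j = j_end
    return result


if __name__ == "__main__":
    for row in merge_join(students, courses, "course_id"):
        print(row["name"], row["course_name"])
```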
Example
Consider Schema
Student(student_id, name, age, course_id)
Course(course_id, course_name)

Query:
Select the names of students enrolled in the 'Database'
course.
Query optimization & Joins Example
⚫ Plan 1 (subquery):
SELECT s.name FROM Student s
WHERE s.course_id IN
(SELECT course_id FROM Course WHERE course_name = 'Database');
⚫ Plan 2 (join):
SELECT s.name FROM Student s JOIN Course c
ON s.course_id = c.course_id
WHERE c.course_name = 'Database';
⚫ Plan 3 (common table expression):
WITH registration AS (SELECT course_id FROM Course
WHERE course_name = 'Database')
SELECT s.name FROM Student s JOIN registration r
ON s.course_id = r.course_id;
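One way to check that the three formulations above are interchangeable is to run them against the same data. The sketch below does this with Python's built-in sqlite3 module; the sample rows are made up, and the equivalence relies on the assumption that course_id is unique in Course. How each form is actually executed is up to the optimizer of the DBMS in use.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Course (course_id INTEGER PRIMARY KEY, course_name TEXT);
    CREATE TABLE Student (
        student_id INTEGER PRIMARY KEY,
        name TEXT,
        age INTEGER,
        course_id INTEGER
    );
    -- Made-up sample rows.
    INSERT INTO Course VALUES (1, 'Database'), (2, 'Networks');
    INSERT INTO Student VALUES
        (1, 'Ali', 19, 1), (2, 'Sara', 22, 1), (3, 'Omar', 25, 2);
""")

PLANS = {
    "subquery": """
        SELECT s.name FROM Student s
        WHERE s.course_id IN
            (SELECT course_id FROM Course WHERE course_name = 'Database')
    """,
    "join": """
        SELECT s.name FROM Student s
        JOIN Course c ON s.course_id = c.course_id
        WHERE c.course_name = 'Database'
    """,
    "cte": """
        WITH registration AS
            (SELECT course_id FROM Course WHERE course_name = 'Database')
        SELECT s.name FROM Student s
        JOIN registration r ON s.course_id = r.course_id
    """,
}


def run(sql):
    """Execute a query and return its rows in a stable order for comparison."""
    return sorted(conn.execute(sql).fetchall())


results = {label: run(sql) for label, sql in PLANS.items()}

if __name__ == "__main__":
    for label, rows in results.items():
        print(label, rows)
```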
Join Algorithm Comparison
⚫ Nested Loop: Small tables, O(n × m)
⚫ Hash Join: Medium/large tables, equi-joins, O(n + m) on average
⚫ Merge Join: Sorted data, O(n + m) once both inputs are sorted

Join Type        | Best For                      | Common in execution plan when
Nested Loop Join | Small + Large (with index)    | Index exists on inner table
Hash Join        | Large + Large (equality only) | No index, but equality join
Merge Join       | Sorted data                   | Data already sorted or sorted easily
Class Assignment
⚫ Optimize this query:
SELECT E.name, D.name
FROM Employees E
JOIN Departments D ON E.dept_id = D.id
WHERE E.salary > 70000;
⚫ Suggest two execution plans and recommend suitable
indexes.
