0% found this document useful (0 votes)

4 views13 pages

F09 - Lock Free Data Structures Stack and Queue

The lecture discusses the purpose and advantages of lock-free data structures, emphasizing their ability to scale with multiple threads without the limitations of locking mechanisms. It introduces key concepts such as atomic operations, non-blocking algorithms, and the differences between locking, lock-free, and transactional memory. The lecture also addresses potential issues like the ABA problem and provides examples of lock-free implementations using atomic compare and exchange operations.

Uploaded by

ejy jawa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views13 pages

F09 - Lock Free Data Structures Stack and Queue

Uploaded by

ejy jawa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Locking vs lock-free

Contents of Lecture 9
Purpose of using lock-free data structures
Terminology
Comparing locking, lock-free and transactional memory
Lock-free data structures: stack and fifo queue

Jonas Skeppstedt Lecture 9 2022 1 / 13

Purpose of using lock-free data structures

Suppose we need to scale our computations to use hundreds or

thousands of threads
Two important problems with locking:
Limited scaling due to serialization at a lock. Severity depends on lock
contention, of course.
Using fine grained locking may be complex (and lead to hard to find
bugs).
Similar to standing in a supermarket queue and the person paying
answers a phone call.
What can we do?

Jonas Skeppstedt Lecture 9 2022 2 / 13

Examples of ”unexpected delays”

The thread currently owning the lock may:

be preempted by OS kernel due to:
interrupt due to disk operation completed, network packet arrived, etc
another thread should run
get a page fault (page must be fetched from disk)
get a TLB fault
translation-lookaside buffer fault
a virtual memory page translation must be updated in the CPU
not part of this course
get a cache miss

Jonas Skeppstedt Lecture 9 2022 3 / 13

Key idea with lock-free data structures

Use atomic variables

Let multiple threads work on a data structure concurrently
Detect if some other thread modified it before us
If so, do something sensible such as update some variable and try again
How can we detect such modifications?

Jonas Skeppstedt Lecture 9 2022 4 / 13

Recall atomic operations

Using assignment operators ensures an atomic read-modify-write.

atomic_int a;

a += 1;

The following is not an atomic read-modify-write.

atomic_int a;

a = a + 1;

We would do one atomic read, an add, and an atomic write using

sequential consistency but there is no guarantee the new value is
exactly one more than the old.
For integers it is sometimes possible to use assignment operators but
not always!

Jonas Skeppstedt Lecture 9 2022 5 / 13

Another example

atomic_int a;

a = f(a);

For add, we can do +=

In the general case we need something else.
What can we do?

Jonas Skeppstedt Lecture 9 2022 6 / 13

This is what we want to do

atomic_int a;
int old_a;
int new_a;

old_a = a;

new_a = compute_a(old_a);

a = new_a; /∗ but only i f a == old_a ∗/

How can we do this?

Jonas Skeppstedt Lecture 9 2022 7 / 13

Recall atomic compare exchange from Lecture 6

bool atomic_compare_exchange_weak(
volatile A* ptr,
C* expected,
C value);

You can ignore the volatile

Recall how it is defined:
if (*ptr == *expected)
*ptr = value;
else
*expected = *ptr;

Operation introduced for IBM System 370

Also called atomic compare and swap and written CAS

Jonas Skeppstedt Lecture 9 2022 8 / 13

Using atomic compare exchange

atomic_int a;
int old_a;
int new_a;

old_a = a;
do
new_a = compute_a(old_a);
while (!atomic_compare_exchange_weak(&a, &old_a, new_a));

This modifies a only if a == old_a.

If they are not equal, the current value of a is copied to old_a
You may want to think of this function as:
Is it I who should modify the variable now? (or somebody else?)
What we essentially do is detecting a data-race and retry
But can we be sure no other threads modified a ?

Jonas Skeppstedt Lecture 9 2022 9 / 13

Answer to previous slide’s question

We cannot be sure.
a may have been incremented and decremented back to old_a
Sometimes that matters and at other times not.
It is called the ABA-problem.
x had value A, then B, and then A again.
It can cause chaos if the atomic variable is e.g. a pointer to a list, and
the pointer is both freed and malloced again. Then one thread may
think it still has the list pointer (and can use a next field) but that will
not work.
We will come back to it later in this lecture and see a solution in detail.

Jonas Skeppstedt Lecture 9 2022 10 / 13

Some terminology

An algorithm is blocking if one thread can delay another thread.

For example algorithms with mutexes are blocking.
An algorithm is non-blocking if one thread cannot delay other
threads.
An algorithm is lock-free if at least one thread can make progress
after a finite number of steps.
This means the program makes progress but individual threads may
have to wait a long time.
An algorithm is wait-free if every thread can make progress after a
finite number of steps.

Jonas Skeppstedt Lecture 9 2022 11 / 13

An example: slide 9

The code is non-blocking since there is no mutex

Is it lock-free ?
Or, will at least one thread leave the loop?
Yes, the thread that was lucky to read and write the variable
sufficiently close in time
Why? Trivial if we have an atomic instruction and also true if we have
load-and-reserve and store conditional, since only stores remove the
reservation of another thread.
Is it wait-free?
Or, will every thread make progress after a finite number of iterations?
No, an unlucky thread may loop an unbounded number of iterations

Jonas Skeppstedt Lecture 9 2022 12 / 13

Locking vs lock-free vs transactional memory

Locking is in some sense pessimistic

Locking assumes there will be conflicts and avoids them
Lock-free is optimistic
Lock-free assumes there will be no conflict and detects them if they
happen — and tries again
Lock-free algorithms are much more complex to implement than
blocking algorithms
Transactional memory is also optimistic but trivial to get correct but
can have performance problems when used in the wrong context.
Which is fastest depends on the algorithm and input

Jonas Skeppstedt Lecture 9 2022 13 / 13

Embedded Interview Questions
100% (1)
Embedded Interview Questions
14 pages
Embedded Interview Questions PDF
100% (1)
Embedded Interview Questions PDF
13 pages
(Dsilytc) Final Paper
No ratings yet
(Dsilytc) Final Paper
22 pages
Turbo Expander
100% (4)
Turbo Expander
47 pages
En Troubleshooting Guide
0% (1)
En Troubleshooting Guide
17 pages
Deep Excavation KLCC
100% (1)
Deep Excavation KLCC
20 pages
Frank Alexy Kuhne
No ratings yet
Frank Alexy Kuhne
31 pages
F06 - Threads and The Memory Model in ISO C C++ and Java
No ratings yet
F06 - Threads and The Memory Model in ISO C C++ and Java
69 pages
X X X - Old X - New Op (X - Old) Interlockedcompareexchange X X - Old
No ratings yet
X X X - Old X - New Op (X - Old) Interlockedcompareexchange X X - Old
2 pages
5.explain How Non-Blocking Algorithm Used To Resolve The Deadlock Issues? With Example?
No ratings yet
5.explain How Non-Blocking Algorithm Used To Resolve The Deadlock Issues? With Example?
3 pages
Dr. Dobb's - Writing Lock-Free Code - A Corrected Queue
No ratings yet
Dr. Dobb's - Writing Lock-Free Code - A Corrected Queue
4 pages
An Attempt To Illustrate Differences Between Memory Ordering and Atomic Access
No ratings yet
An Attempt To Illustrate Differences Between Memory Ordering and Atomic Access
15 pages
F08 - Transactional Memory in Clojure and C
No ratings yet
F08 - Transactional Memory in Clojure and C
44 pages
Lock-Free Programming (Or, Juggling Razor Blades) - Herb Sutter - CppCon 2014
No ratings yet
Lock-Free Programming (Or, Juggling Razor Blades) - Herb Sutter - CppCon 2014
47 pages
Synchronization
No ratings yet
Synchronization
9 pages
Final Practice
No ratings yet
Final Practice
8 pages
F05 - Memory Consistency Models Plus Introduction To Caches
No ratings yet
F05 - Memory Consistency Models Plus Introduction To Caches
48 pages
F12 Research Trends
No ratings yet
F12 Research Trends
13 pages
Concurrency 2
No ratings yet
Concurrency 2
54 pages
OSC Exam Questions
No ratings yet
OSC Exam Questions
6 pages
A Practical Wait-Free Simulation For Lock-Free Data Structures
No ratings yet
A Practical Wait-Free Simulation For Lock-Free Data Structures
12 pages
Transactional Memory: Companion Slides For by Maurice Herlihy & Nir Shavit
No ratings yet
Transactional Memory: Companion Slides For by Maurice Herlihy & Nir Shavit
64 pages
Live Lock-Free or Deadlock - Fedor Pikus - CppCon 2015
No ratings yet
Live Lock-Free or Deadlock - Fedor Pikus - CppCon 2015
112 pages
Au Multi Threaded Structures 1 PDF
No ratings yet
Au Multi Threaded Structures 1 PDF
11 pages
C01 Computer Systems - As
No ratings yet
C01 Computer Systems - As
17 pages
Locks 1
No ratings yet
Locks 1
61 pages
Transactional Memory in Practice - Brett Hall - CppCon 2015
No ratings yet
Transactional Memory in Practice - Brett Hall - CppCon 2015
62 pages
F04 - Java Synchronization and Pthreads For C
No ratings yet
F04 - Java Synchronization and Pthreads For C
51 pages
Data Races Are Evil
No ratings yet
Data Races Are Evil
10 pages
F11 - Cache Aware Programming For Multicores
No ratings yet
F11 - Cache Aware Programming For Multicores
20 pages
Basic Operating Systems Concepts
No ratings yet
Basic Operating Systems Concepts
7 pages
IITD Exam Solution
No ratings yet
IITD Exam Solution
9 pages
2007 Tocs
No ratings yet
2007 Tocs
61 pages
TOPCIT Reviewer OS and ComArch
No ratings yet
TOPCIT Reviewer OS and ComArch
20 pages
Chapter03-Memory Management
No ratings yet
Chapter03-Memory Management
48 pages
Lec06 Synchronization
No ratings yet
Lec06 Synchronization
34 pages
(MIT 6.1800) Spring 2025 Notes
No ratings yet
(MIT 6.1800) Spring 2025 Notes
17 pages
Itcsiu21194 Lab9 Os
No ratings yet
Itcsiu21194 Lab9 Os
16 pages
Transactional Memory: David Chisnall
No ratings yet
Transactional Memory: David Chisnall
21 pages
Modern Operating Systems - Midterm Exam Solutions - Spring 2013
No ratings yet
Modern Operating Systems - Midterm Exam Solutions - Spring 2013
10 pages
TCC Thesis BDC Defense
No ratings yet
TCC Thesis BDC Defense
51 pages
Herlihy 93 Transactional
No ratings yet
Herlihy 93 Transactional
12 pages
Transactional Memory: Architectural Support For Lock-Free Data Structures
No ratings yet
Transactional Memory: Architectural Support For Lock-Free Data Structures
12 pages
List Advantages and Disadvantages of Dynamic Memory Allocation vs. Static Memory Allocation.? Advantages
No ratings yet
List Advantages and Disadvantages of Dynamic Memory Allocation vs. Static Memory Allocation.? Advantages
39 pages
A Methodology For Implementing Highly Concurrent Data Objects by Maurice Herlihy
No ratings yet
A Methodology For Implementing Highly Concurrent Data Objects by Maurice Herlihy
17 pages
Back To Basics Concurrency Arthur Odwyer
No ratings yet
Back To Basics Concurrency Arthur Odwyer
58 pages
Cache Line Ping-Ponging:: Department CSE, SCAD CET
No ratings yet
Cache Line Ping-Ponging:: Department CSE, SCAD CET
2 pages
Lec07 Exclusion
No ratings yet
Lec07 Exclusion
33 pages
FileDirectory
No ratings yet
FileDirectory
29 pages
CH 4 Synchronization Models of Memory Consistency
100% (1)
CH 4 Synchronization Models of Memory Consistency
26 pages
11 Lock Freedom
No ratings yet
11 Lock Freedom
24 pages
Mute Xes
No ratings yet
Mute Xes
7 pages
4 Java Concurrent Patterns Advanced m4 Slides
No ratings yet
4 Java Concurrent Patterns Advanced m4 Slides
31 pages
Data Structures
No ratings yet
Data Structures
22 pages
Iqra University Islamabad Campus: Project Report
No ratings yet
Iqra University Islamabad Campus: Project Report
12 pages
Memory Management: Concept of Memory Hierarchy
No ratings yet
Memory Management: Concept of Memory Hierarchy
10 pages
Concurrent Java
No ratings yet
Concurrent Java
20 pages
Chapter03 Memory Management
No ratings yet
Chapter03 Memory Management
48 pages
Race Condition Is An Undesirable Situation That Occurs When A Device or System Attempts To
No ratings yet
Race Condition Is An Undesirable Situation That Occurs When A Device or System Attempts To
15 pages
Concurrent Programming Without Locks
No ratings yet
Concurrent Programming Without Locks
59 pages
Con Currency
No ratings yet
Con Currency
31 pages
L4 Atomics
No ratings yet
L4 Atomics
56 pages
Page Replacement Algorithms
No ratings yet
Page Replacement Algorithms
5 pages
Os Answer 1
No ratings yet
Os Answer 1
3 pages
IGNOU PGDCA MCS 208 Data Structure and Algorithm Previous Years Unsolved Papers
From Everand
IGNOU PGDCA MCS 208 Data Structure and Algorithm Previous Years Unsolved Papers
Manish Soni
No ratings yet
Gr7 Term 1 MIP LessonPlans 2025
No ratings yet
Gr7 Term 1 MIP LessonPlans 2025
67 pages
Exercises695Clus Solution - Doc Exercises695Clus Solution
No ratings yet
Exercises695Clus Solution - Doc Exercises695Clus Solution
7 pages
EOT Crane
0% (1)
EOT Crane
5 pages
Mathematics - Mathematics Form 2 - Zeraki Achievers 3.0 - Marking Scheme
No ratings yet
Mathematics - Mathematics Form 2 - Zeraki Achievers 3.0 - Marking Scheme
15 pages
Towards Holographic Beam-Forming Metasurface Technology For Next Generation Cubesats
No ratings yet
Towards Holographic Beam-Forming Metasurface Technology For Next Generation Cubesats
4 pages
Decimals Extra
No ratings yet
Decimals Extra
4 pages
Joseph Andrew Amacio Espinosa: Personal Statement
No ratings yet
Joseph Andrew Amacio Espinosa: Personal Statement
2 pages
Novice Nook: The Theory of Chess Improvement
No ratings yet
Novice Nook: The Theory of Chess Improvement
9 pages
Test 1-121 (A) - Key
No ratings yet
Test 1-121 (A) - Key
5 pages
Big Data Analytics (Unit-II)
No ratings yet
Big Data Analytics (Unit-II)
17 pages
Baking Tools, Utensils and Equipment in Making Bread, Cookies, Muffins and Biscuits
No ratings yet
Baking Tools, Utensils and Equipment in Making Bread, Cookies, Muffins and Biscuits
4 pages
Ear834 Bom
No ratings yet
Ear834 Bom
2 pages
Topic 14 Papermaking Pressing Text
100% (2)
Topic 14 Papermaking Pressing Text
21 pages
Vhlpx4 7w 3wh E.aspx
No ratings yet
Vhlpx4 7w 3wh E.aspx
5 pages
Solution Guide: Oil & Gas Industry
No ratings yet
Solution Guide: Oil & Gas Industry
59 pages
BAS 2 Accepted
No ratings yet
BAS 2 Accepted
57 pages
Theory Questions - QT
No ratings yet
Theory Questions - QT
3 pages
5 - Spiral Die
No ratings yet
5 - Spiral Die
42 pages
Technical Description MPPU
No ratings yet
Technical Description MPPU
10 pages
N156bga Eb2
No ratings yet
N156bga Eb2
44 pages
PSPICE Transient Simulation Plotting
No ratings yet
PSPICE Transient Simulation Plotting
7 pages
Instruction Manual
No ratings yet
Instruction Manual
52 pages
B Tech, Pgdom, PHD: Area of Interest
No ratings yet
B Tech, Pgdom, PHD: Area of Interest
25 pages
Maths BOT
No ratings yet
Maths BOT
12 pages
Joel
No ratings yet
Joel
11 pages

F09 - Lock Free Data Structures Stack and Queue

Uploaded by

F09 - Lock Free Data Structures Stack and Queue

Uploaded by

Locking vs lock-free

Jonas Skeppstedt Lecture 9 2022 1 / 13

Suppose we need to scale our computations to use hundreds or

Jonas Skeppstedt Lecture 9 2022 2 / 13

The thread currently owning the lock may:

Jonas Skeppstedt Lecture 9 2022 3 / 13

Use atomic variables

Jonas Skeppstedt Lecture 9 2022 4 / 13

Using assignment operators ensures an atomic read-modify-write.

The following is not an atomic read-modify-write.

We would do one atomic read, an add, and an atomic write using

Jonas Skeppstedt Lecture 9 2022 5 / 13

For add, we can do +=

Jonas Skeppstedt Lecture 9 2022 6 / 13

a = new_a; /∗ but only i f a == old_a ∗/

How can we do this?

Jonas Skeppstedt Lecture 9 2022 7 / 13

You can ignore the volatile

Operation introduced for IBM System 370

Jonas Skeppstedt Lecture 9 2022 8 / 13

This modifies a only if a == old_a.

Jonas Skeppstedt Lecture 9 2022 9 / 13

Jonas Skeppstedt Lecture 9 2022 10 / 13

An algorithm is blocking if one thread can delay another thread.

Jonas Skeppstedt Lecture 9 2022 11 / 13

The code is non-blocking since there is no mutex

Jonas Skeppstedt Lecture 9 2022 12 / 13

Locking is in some sense pessimistic

Jonas Skeppstedt Lecture 9 2022 13 / 13

You might also like