4 Distributed Algorithms

The document discusses distributed algorithms and covers topics like logical clocks for ordering events in distributed systems, using vector clocks to represent causality relationships between processes, defining global states and cuts of a distributed system, and an algorithm by Chandy and Lamport for taking snapshots of global states in a distributed system.

Uploaded by

pranavireddy.pranu2021

Distributed Systems:

Distributed algorithms

November 2005 Distributed systems: distributed algorithms

Overview of chapters
• Introduction
• Co-ordination models and languages
• General services
• Distributed algorithms
– Ch 10 Time and global states, 11.4-11.5
– Ch 11 Coordination and agreement, 12.1-12.5

• Shared data
• Building distributed services
This chapter: overview
• Introduction
• Logical clocks
• Global states
• Failure detectors
• Mutual exclusion
• Elections
• Multicast communication
• Consensus and related problems

Logical clocks
• Problem: ordering of events
– requirement for many algorithms
– physical clocks cannot be used
• Use causality:
– within a single process: observation
– between different processes: sending of a message happens before receiving the same message

Logical clocks (cont.)
• Formalization: happened-before relation
$x \rightarrow y$
• Rules:
– if x happens before y in any process p
then $x \rightarrow y$
– for any message m: send(m) $\rightarrow$ receive(m)
– if $x \rightarrow y$ and $y \rightarrow z$
then $x \rightarrow z$
• Implementation: logical clocks
Logical clocks (cont.)
• Logical clock
– counter appropriately incremented
– one counter per process

• Physical clock
– counts oscillations occurring in a crystal at a definite frequency

Logical clocks (cont.)
• Rules for incrementing local logical clock
1 for each event (including send) in process p:
$C_p := C_p + 1$
2 when a process sends a message m, it piggybacks on m the value of $C_p$
3 on receiving (m, t), a process q
• computes $C_q := \max(C_q, t)$
• applies rule 1: $C_q := C_q + 1$
$C_q$ is then the logical time of the event receive(m)
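
The three rules above can be sketched in a few lines of Python. This is only an illustration: the class name `LamportClock` and its method names are assumptions made for this example, not part of the slides.

```python
class LamportClock:
    """Scalar logical clock of one process (hypothetical helper)."""

    def __init__(self):
        self.time = 0

    def tick(self):
        # Rule 1: increment the counter for each local event.
        self.time += 1
        return self.time

    def send(self):
        # Rule 2: sending is an event; the returned value is piggybacked on m.
        return self.tick()

    def receive(self, t):
        # Rule 3: take the maximum of local and received time, then apply rule 1.
        self.time = max(self.time, t)
        return self.tick()


p, q = LamportClock(), LamportClock()
t = p.send()        # p sends m with timestamp 1
q.tick()            # unrelated local event at q, clock becomes 1
r = q.receive(t)    # max(1, 1) + 1 = 2
```

The receive event gets a timestamp larger than the send event, which is what the happened-before rule for messages requires.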
Logical clocks (cont.)
• Logical timestamps: example

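[Figure: timelines of processes P1, P2 and P3 with logical timestamps. P1: a = 1, c = 2, g = 3. P2: an unlabelled point at 0, d = 3, e = 4. P3: b = 1, f = 5.]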

Logical clocks (cont.)
• C(x) logical clock value for event x

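• Correct usage:
if $x \rightarrow y$ then $C(x) < C(y)$

• Incorrect usage:
if $C(x) < C(y)$ then $x \rightarrow y$
(the converse does not hold: concurrent events can also have ordered timestamps)
• Solution: logical vector clocks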
Logical clocks (cont.)
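• Vector clocks for N processes:
– at process $P_i$: $V_i[j]$ for j = 1, 2, …, N
– Properties:

if $x \rightarrow y$ then $V(x) < V(y)$

if $V(x) < V(y)$ then $x \rightarrow y$

($V(x) < V(y)$ means $V(x)[j] \le V(y)[j]$ for all j and $V(x) \ne V(y)$)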

Logical clocks (cont.)
• Rules for incrementing logical vector clock
1 for each event (including send) in process $P_i$:
$V_i[i] := V_i[i] + 1$
2 when a process $P_i$ sends a message m, it piggybacks on m the value of $V_i$
3 on receiving (m, t), a process $P_i$
• applies rule 1
• sets $V_i[j] := \max(V_i[j], t[j])$ for j = 1, 2, …, N
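
A minimal Python sketch of these rules follows. The names `VectorClock`, `happened_before` and `concurrent` are hypothetical, and processes are numbered from 0 here rather than from 1. The small scenario at the end assumes that p1 sends m1 to p2 and p2 sends m2 to p3; under that assumption it yields timestamps such as (2,1,0) and (2,2,2).

```python
class VectorClock:
    def __init__(self, i, n):
        self.i = i              # index of the owning process (0-based here)
        self.v = [0] * n

    def event(self):
        # Rule 1: increment the own component for each event.
        self.v[self.i] += 1
        return tuple(self.v)

    def send(self):
        # Rule 2: a send is an event; the resulting vector is piggybacked.
        return self.event()

    def receive(self, t):
        # Rule 3: apply rule 1, then take the component-wise maximum.
        self.v[self.i] += 1
        self.v = [max(a, b) for a, b in zip(self.v, t)]
        return tuple(self.v)


def happened_before(vx, vy):
    # V(x) < V(y): every component <= and the vectors differ.
    return all(a <= b for a, b in zip(vx, vy)) and vx != vy


def concurrent(vx, vy):
    return not happened_before(vx, vy) and not happened_before(vy, vx)


p1, p2, p3 = VectorClock(0, 3), VectorClock(1, 3), VectorClock(2, 3)
a = p1.event()        # (1, 0, 0)
b = p1.send()         # (2, 0, 0), carried by m1
c = p2.receive(b)     # (2, 1, 0)
d = p2.send()         # (2, 2, 0), carried by m2
e = p3.event()        # (0, 0, 1)
f = p3.receive(d)     # (2, 2, 2)
```

Unlike scalar clocks, the comparison detects concurrency: `concurrent(c, e)` is true because neither vector is component-wise smaller than the other.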

Logical clocks (cont.)
• Logical vector clocks : example

[Figure: vector timestamps along physical time. p1: a = (1,0,0), b = (2,0,0). p2: c = (2,1,0), d = (2,2,0). p3: e = (0,0,1), f = (2,2,2). Messages: m1 from p1 to p2, m2 from p2 to p3.]

This chapter: overview
• Introduction
• Logical clocks
• Global states
• Failure detectors
• Mutual exclusion
• Elections
• Multicast communication
• Consensus and related problems

Global states
• Detect global properties
– a. Garbage collection: p1 and p2; object reference, message in transit, garbage object
– b. Deadlock: p1 and p2 each wait for the other (wait-for cycle)
– c. Termination: p1 and p2 are both passive; an activate message is in transit

Global states (cont.)
• Local states & events
– Process $P_i$: events $e_i^k$
state $s_i^k$, before event k
– History of $P_i$:

$h_i = \langle e_i^0, e_i^1, e_i^2, \ldots \rangle$

– Finite prefix of history of $P_i$:

$h_i^k = \langle e_i^0, e_i^1, e_i^2, \ldots, e_i^k \rangle$

Global states (cont.)
• Global states & events
– Global history

$H = h_1 \cup h_2 \cup h_3 \cup \ldots \cup h_n$

– Global state (when?)

$S = (s_1^p, s_2^q, \ldots, s_n^u)$

consistent?
– Cut of the system's execution

$C = h_1^{c_1} \cup h_2^{c_2} \cup \ldots \cup h_n^{c_n}$

Global states (cont.)
• Example of cuts:

[Figure: events $e_1^0, e_1^1, e_1^2, e_1^3$ on p1 and $e_2^0, e_2^1, e_2^2$ on p2 along physical time, with messages m1 and m2; one inconsistent cut and one consistent cut are marked.]

Global states (cont.)
• Finite prefix of history of $P_i$:
$h_i^k = \langle e_i^0, e_i^1, e_i^2, \ldots, e_i^k \rangle$
• Cut of the system's execution
$C = h_1^{c_1} \cup h_2^{c_2} \cup \ldots \cup h_n^{c_n}$

• Consistent cut C
$\forall e \in C,\ f \rightarrow e \Rightarrow f \in C$
• Consistent global state
corresponds to consistent cut
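
The consistency condition can be checked mechanically. The sketch below assumes a cut is given as the number of events taken from each process history, and that the only cross-process happened-before pairs come from messages. Because a cut is a union of prefixes, it is then enough to check that every receive event in the cut has its send event in the cut too. The function name and the two messages in the example are hypothetical.

```python
def is_consistent(cut, messages):
    """cut[i]: number of events of process i inside the cut (a prefix).
    messages: list of ((sender, send_index), (receiver, receive_index)),
    with 0-based process and event indices."""
    for (ps, ks), (pr, kr) in messages:
        received_in_cut = kr < cut[pr]
        sent_in_cut = ks < cut[ps]
        if received_in_cut and not sent_in_cut:
            return False    # an effect is included without its cause
    return True


# Assumed scenario: m1 is sent at event 1 of process 0 and received at
# event 0 of process 1; m2 is sent at event 1 of process 1 and received
# at event 2 of process 0.
messages = [((0, 1), (1, 0)), ((1, 1), (0, 2))]

inconsistent = is_consistent([1, 1], messages)   # receive of m1 without its send
consistent = is_consistent([3, 2], messages)     # both messages fully inside
in_transit = is_consistent([2, 0], messages)     # m1 sent but not yet received
```

A message that has been sent but not received (the last case) does not make a cut inconsistent; it is simply part of the channel state.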
Global states (cont.)
• Model execution of a (distributed) system

$S_0 \rightarrow S_1 \rightarrow S_2 \rightarrow S_3 \rightarrow \ldots$

– Series of transitions between consistent states

– Each transition corresponds to one single event
• Internal event
• Sending message
• Receiving message
– Simultaneous events
$\Rightarrow$ order events
Global states (cont.)
• Definitions:
– Run = ordering of all events (in a global history) consistent with each local history's ordering
– Linearization (consistent run) = run that is also consistent with $\rightarrow$
– S' reachable from S
$\Leftrightarrow \exists$ linearization: $\ldots S \rightarrow \ldots \rightarrow S' \ldots$

Global states (cont.)
• Kinds of global state predicates:
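– Stable: $\varphi$ = true in S
$\Rightarrow \forall S',\ S \rightarrow \ldots \rightarrow S' \Rightarrow \varphi$ = true in S'

– Safety: $\alpha$ = undesirable property

$S_0$ = initial state of system
$\forall S,\ S_0 \rightarrow \ldots \rightarrow S \Rightarrow \alpha$ = false in S

– Liveness: $\beta$ = desirable property

$S_0$ = initial state of system
for every linearization starting in $S_0$, $\beta$ = true in some state S reachable from $S_0$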
Global states (cont.)
• Snapshot algorithm of Chandy & Lamport
– Record consistent global state
– Assumptions:
• Neither channels nor processes fail
• Channels are unidirectional and provide FIFO-ordered message delivery
• Graph of channels and processes is strongly connected
• Any process may initiate a global snapshot
• Processes may continue their execution during the snapshot

Global states (cont.)
• Snapshot algorithm of Chandy & Lamport
– Elements of algorithm
• Players: processes Pi with
– Incoming channels
– Outgoing channels
• Marker messages
• 2 rules
– Marker receiving rule
– Marker sending rule
– Start of algorithm
• The initiating process acts as if it had received a marker message

Global states (cont.)
Marker receiving rule for process pi
  On pi’s receipt of a marker message over channel c:
    if (pi has not yet recorded its state) it
      records its process state now;
      records the state of c as the empty set;
      turns on recording of messages arriving over other incoming channels;
    else
      pi records the state of c as the set of messages it has received over c since it saved its state.
    end if

Marker sending rule for process pi
  After pi has recorded its state, for each outgoing channel c:
    pi sends one marker message over c
    (before it sends any other message over c).
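
The two marker rules can be tried out in a small single-threaded simulation. Everything in this sketch is an assumption made for illustration: the class `Process`, the helper `connect`, the use of a single number as process state, and the hand-scheduled delivery order. Channels are modelled as FIFO queues, in line with the assumptions of the algorithm.

```python
from collections import deque

MARKER = "MARKER"


class Process:
    def __init__(self, name, balance):
        self.name = name
        self.balance = balance      # application state: a single number
        self.incoming = {}          # sender name -> FIFO channel (deque)
        self.outgoing = {}          # receiver name -> FIFO channel (deque)
        self.recorded = False
        self.recorded_state = None
        self.channel_state = {}     # sender name -> recorded channel content
        self.recording = {}         # channels still being recorded

    def _record(self, already_closed=None):
        self.recorded = True
        self.recorded_state = self.balance
        self.recording = {s: [] for s in self.incoming if s != already_closed}
        # Marker sending rule: one marker per outgoing channel, placed
        # before any later application message on that channel.
        for channel in self.outgoing.values():
            channel.append(MARKER)

    def start_snapshot(self):
        # The initiator acts as if it had received a marker.
        self._record()

    def send(self, dst, amount):
        self.balance -= amount
        self.outgoing[dst].append(amount)

    def deliver(self, src):
        msg = self.incoming[src].popleft()
        if msg == MARKER:
            if not self.recorded:
                self._record(already_closed=src)
                self.channel_state[src] = []    # state of c is empty
            else:
                self.channel_state[src] = self.recording.pop(src)
        else:
            self.balance += msg
            if src in self.recording:
                self.recording[src].append(msg)


def connect(a, b):
    channel = deque()
    a.outgoing[b.name] = channel
    b.incoming[a.name] = channel


p1, p2 = Process("p1", 100), Process("p2", 50)
connect(p1, p2)
connect(p2, p1)

p2.send("p1", 10)      # in transit when the snapshot starts
p1.start_snapshot()    # p1 records 100 and sends a marker to p2
p1.send("p2", 20)      # sent after the marker: not part of the snapshot
p1.deliver("p2")       # the 10 arrives while p1 records this channel
p2.deliver("p1")       # marker: p2 records 40 and sends its own marker
p2.deliver("p1")       # the 20 arrives after p2 has recorded
p1.deliver("p2")       # marker: p1 closes the channel from p2

snapshot_total = (p1.recorded_state + p2.recorded_state
                  + sum(p1.channel_state["p2"]) + sum(p2.channel_state["p1"]))
```

The recorded process states (100 and 40) plus the recorded channel content (10) add up to the 150 units that exist in the system, although no single instant of the real execution had exactly these values.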
Global states (cont.)
• Example:

[Figure: two processes p1 and p2 connected by channels c1 and c2. p1: account $1000, widgets (none). p2: account $50, widgets 2000.]

Global states (cont.)
1. Global state S0
p1 <$1000, 0>; c2: (empty); p2 <$50, 2000>
c1: (empty)

2. Global state S1
p1 <$900, 0>; c2: (Order 10, $100), M; p2 <$50, 2000>
c1: (empty)

3. Global state S2
p1 <$900, 0>; c2: (Order 10, $100), M; p2 <$50, 1995>
c1: (five widgets)

4. Global state S3
p1 <$900, 5>; c2: (Order 10, $100); p2 <$50, 1995>
c1: (empty); C1 = <(five widgets)>, C2 = <>

(M = marker message)
Global states (cont.)
1. Global state S0
p1 <$1000, 0>; c2: (empty); p2 <$50, 2000>
c1: (empty)

4. Global state S3
p1 <$900, 5>; c2: (Order 10, $100); p2 <$50, 1995>
c1: (empty); C1 = <(five widgets)>, C2 = <>

5. Global state S4
p1 <$900, 5>; c2: (Order 10, $100); p2 <$50, 1995>
c1: M; C1 = <(five widgets)>, C2 = <>

6. Global state S5
p1 <$900, 5>; c2: (Order 10, $100); p2 <$50, 1995>
c1: (empty); C1 = <(five widgets)>, C2 = <>

(M = marker message)
Global states (cont.)
• Observed state
– Corresponds to consistent cut
– Reachable!
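[Figure: the actual execution e0, e1, ... leads from Sinit to Sfinal; recording begins and ends along the way. A reordered execution passes through Ssnap: the pre-snap events e'0, e'1, ..., e'R-1 lead from Sinit to Ssnap, and the post-snap events e'R, e'R+1, ... lead from Ssnap to Sfinal.]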

Failure detectors
• Properties
– Unreliable failure detector: answers with
• Unsuspected
• Suspected: no “P is here” within T + E sec
– Reliable failure detector: answers with
• Unsuspected
• Failed: no “P is here” within T + A sec

• Implementation
– Every T sec: P multicasts “P is here”
– Maximum on message transmission time:
• Asynchronous system: estimate E
• Synchronous system: absolute bound A
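
A heartbeat-based detector along these lines might look as follows. This is a sketch under stated assumptions: the class `HeartbeatDetector`, its parameters and the explicit `now` arguments (used instead of a real clock so that the example is deterministic) are hypothetical. With `synchronous=False` the delay bound plays the role of the estimate E, with `synchronous=True` that of the absolute bound A.

```python
class HeartbeatDetector:
    def __init__(self, period, delay_bound, synchronous=False):
        self.period = period              # T: interval between "P is here" messages
        self.delay_bound = delay_bound    # E (estimate) or A (absolute bound)
        self.synchronous = synchronous
        self.last_seen = {}               # process -> arrival time of last heartbeat

    def heartbeat(self, process, now):
        # Called whenever a "P is here" message arrives.
        self.last_seen[process] = now

    def status(self, process, now):
        last = self.last_seen.get(process)
        if last is not None and now - last <= self.period + self.delay_bound:
            return "Unsuspected"
        # Only an absolute bound on transmission time justifies "Failed".
        return "Failed" if self.synchronous else "Suspected"


detector = HeartbeatDetector(period=5, delay_bound=2)
detector.heartbeat("P", now=0)
early = detector.status("P", now=6)    # within T + E = 7
late = detector.status("P", now=8)     # no heartbeat within T + E
```

In an asynchronous system the answer "Suspected" may be wrong: the process could be alive and its heartbeat merely delayed beyond the estimate.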

Mutual exclusion
• Problem: how to grant a privilege temporarily to a single process?
– Privilege = the right to access a (shared) resource
– Resource = file, device, window, …
• Assumptions
– clients execute the mutual exclusion algorithm
– the resource itself might be managed by a server
– reliable communication
Mutual exclusion (cont.)
• Basic requirements:
– ME1: at most one process may use the shared resource at any time (Safety)
– ME2: a process requesting access to the shared resource is eventually granted it (Liveness)
– ME3: access to the shared resource should be granted in happened-before order of the requests (Ordering or fairness)
Mutual exclusion (cont.)
• Solutions:
– central server algorithm
– distributed algorithm using logical clocks
– ring-based algorithm
– voting algorithm
• Evaluation
– Bandwidth (= #messages to enter and exit)
– Client delay (incurred by a process at enter and exit)
– Synchronization delay (delay between exit and enter)

Mutual exclusion (cont.)
central server algorithm
• Central server offering 2 operations:
– enter()
• if resource free
then operation returns without delay
else request is queued and return from operation is delayed
– exit()
• if request queue is empty
then resource is marked free
else a queued request is selected and its enter() operation returns
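
The server side of the two operations can be sketched as below. The class name `LockServer` is hypothetical, and selecting the oldest queued request (FIFO) is an assumption of this sketch; the slides only say that a request is selected. Messages are replaced by direct method calls.

```python
from collections import deque


class LockServer:
    def __init__(self):
        self.holder = None      # process currently using the resource
        self.queue = deque()    # delayed enter() requests

    def enter(self, pid):
        """Return True if granted at once, False if the request is queued."""
        if self.holder is None:
            self.holder = pid
            return True
        self.queue.append(pid)
        return False

    def exit(self, pid):
        """Release the resource; return the process granted next, if any."""
        if self.holder != pid:
            raise ValueError("only the current holder may call exit()")
        # FIFO selection is an assumption of this sketch.
        self.holder = self.queue.popleft() if self.queue else None
        return self.holder


server = LockServer()
first = server.enter(3)     # resource free: granted immediately
second = server.enter(4)    # resource held: queued
third = server.enter(2)     # queued behind 4
granted = server.exit(3)    # 3 releases; 4 is selected
```

After these calls process 4 holds the resource and process 2 is still waiting in the queue.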

Mutual exclusion (cont.)
central server algorithm
• Example:
[Figure sequence: a server with a request queue and the current user, and client processes P1 to P4.]
– Initially: Queue: (empty), User: (none); an enter() request is on its way to the server
– The resource is free, so the request is granted at once (User: 3, Queue: empty)
– A second enter() arrives while process 3 holds the resource and is queued (User: 3, Queue: 4)
– A third enter() arrives and is queued as well (User: 3, Queue: 4, 2)
– The user sends exit() and the resource is released (User: none, Queue: 4, 2)
– The server grants the first queued request (User: 4, Queue: 2); the other enter() remains blocked
Mutual exclusion (cont.)
central server algorithm
• Evaluation:
– ME3 not satisfied!
– Performance:
• single server is performance bottleneck
• Enter critical section: 2 messages (request, grant)
• Synchronization: 2 messages (release, grant) between exit of one process and enter of next
– Failure:
• Central server is single point of failure
• what if a client, holding the resource, fails?
• Reliable communication required
Mutual exclusion (cont.)
ring-based algorithm
• All processes arranged in a unidirectional, logical ring

• token passed in ring

• process with token has access to resource
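
A small sketch of the token circulation is given below. The function `circulate` and its parameters are hypothetical; processes are numbered 1 to n and the token always moves to the next higher number, wrapping around at n.

```python
def circulate(n, start, wants, hops):
    """Pass the token around a ring of n processes for `hops` steps.

    start: process that holds the token initially (1-based).
    wants: set of processes that want to use the resource.
    Returns the order in which processes used the resource.
    """
    order = []
    pending = set(wants)
    holder = start
    for _ in range(hops):
        if holder in pending:
            order.append(holder)     # holder uses the resource, then releases it
            pending.discard(holder)
        holder = holder % n + 1      # forward the token to the next neighbour
    return order


usage = circulate(n=6, start=2, wants={1, 2, 5}, hops=6)   # [2, 5, 1]
```

The order of access follows the ring positions, not the order in which the requests were made, and the token keeps travelling even when no process wants the resource.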

Mutual exclusion (cont.)
ring-based algorithm
• Example:
[Figure sequence: six processes P1 to P6 arranged in a ring.]
– P2 holds the token: P2 can use the resource
– P2 stopped using the resource and forwarded the token
– P3 does not need the resource and forwards the token
Mutual exclusion (cont.)
ring-based algorithm
• Evaluation:
– ME3 not satisfied
– efficiency
• high when high usage of resource
• high overhead when very low usage
– failure
• Process failure: loss of ring!
• Reliable communication required

Mutual exclusion (cont.)
distributed algorithm using logical clocks
• Distributed agreement algorithm
– multicast requests to all participating processes
– use resource when all other participants agree
(= reply received)
• Processes
– keep a logical clock; its value is included in all request messages
– behave as finite state machine:
• released
• wanted
• held
Mutual exclusion (cont.)
distributed algorithm using logical clocks
• Ricart and Agrawala’s algorithm: process Pj
– on initialization:
• state := released;
– to obtain resource:
• state := wanted;
• T = logical clock value for next event;
• multicast request to other processes <T, Pj>;
• wait for n-1 replies;
• state := held;
Mutual exclusion (cont.)
distributed algorithm using logical clocks
• Ricart and Agrawala’s algorithm: process Pj
– on receipt of request <Ti, Pi> :
• if (state = held) or
(state = wanted and (T,Pj) < (Ti,Pi) )
then queue request from Pi
else reply immediately to Pi
– to release resource:
• state := released;
• reply to any queued requests;
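
The rules of Ricart and Agrawala's algorithm can be exercised in a single-threaded simulation. The sketch below is only an illustration under several assumptions: the names `Process`, `run` and `net` are hypothetical, the network is one shared FIFO queue of (destination, kind, payload) tuples, processes are numbered from 0, and there are at least two processes. Requests are compared as (T, pid) tuples, so ties on T are broken by process number.

```python
from collections import deque


class Process:
    def __init__(self, pid, n, net):
        self.pid, self.n, self.net = pid, n, net
        self.state = "released"
        self.clock = 0          # logical clock
        self.request = None     # own pending request <T, Pj>
        self.replies = 0
        self.queue = []         # requests whose reply is deferred

    def obtain(self):
        self.state = "wanted"
        self.clock += 1
        self.request = (self.clock, self.pid)
        self.replies = 0
        for dst in range(self.n):
            if dst != self.pid:
                self.net.append((dst, "request", self.request))

    def on_request(self, req):
        self.clock = max(self.clock, req[0]) + 1
        if self.state == "held" or (self.state == "wanted" and self.request < req):
            self.queue.append(req)                      # defer the reply
        else:
            self.net.append((req[1], "reply", None))    # reply immediately

    def on_reply(self):
        self.replies += 1
        if self.state == "wanted" and self.replies == self.n - 1:
            self.state = "held"

    def release(self):
        self.state = "released"
        for _, pid in self.queue:
            self.net.append((pid, "reply", None))
        self.queue = []


def run(net, procs):
    # Deliver messages in FIFO order until the network is empty.
    while net:
        dst, kind, payload = net.popleft()
        if kind == "request":
            procs[dst].on_request(payload)
        else:
            procs[dst].on_reply()


net = deque()
procs = [Process(pid, 3, net) for pid in range(3)]
procs[0].obtain()       # both requests carry T = 1 ...
procs[1].obtain()       # ... so the lower process number wins
run(net, procs)
states_before = [p.state for p in procs]   # ["held", "wanted", "released"]
procs[0].release()
run(net, procs)
states_after = [p.state for p in procs]    # ["released", "held", "released"]
```

The third process never wants the resource and simply replies to every request, while the losing requester is held in the winner's queue until the release.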
Mutual exclusion (cont.)
distributed algorithm using logical clocks
• Ricart and Agrawala’s algorithm: example

– 3 processes
– P1 and P2 request the resource concurrently
– P3 not interested in using resource

Mutual exclusion (cont.)
distributed algorithm using logical clocks
• Ricart and Agrawala’s algorithm: example

– Initially: P1, P2 and P3 are all in state released, each with an empty queue