Distributed Shared Memory
Distributed shared memory
The DSM paradigm provides processes with a shared address
space.
Primitives for shared memory:
1. Read(address)
2. Write(address, data)
The shared-memory paradigm gives the system the illusion of
physically shared memory.
DSM refers to shared memory paradigm applied to
loosely coupled distributed memory systems.
Cont….
Shared memory exists only virtually.
Similar concept to virtual memory.
DSM is also known as DSVM (distributed shared virtual memory).
DSM provides a virtual address space shared among
processes on loosely coupled processors.
DSM is basically an abstraction that integrates the local
memory of different machines into a single logical entity
shared by cooperating processes.
DSM Architecture
Each node of the system consists of one or more CPUs
and a memory unit.
Nodes are connected by a high-speed communication
network.
A simple message-passing system allows nodes to exchange
information.
Main memory of individual nodes is used to cache pieces
of shared memory space.
Reduces network latency
Cont….
A memory-mapping manager routine maps local memory
onto the shared virtual memory space.
Shared memory space is partitioned into blocks.
Data caching is used in DSM system to reduce network
latency.
The basic unit of caching is a memory block.
Cont….
If the data is not available in local memory, a network
block fault is generated.
On a network block fault:
The missing block is migrated from the remote node to the
client process’s node and OS maps it into the application’s
address space.
Data blocks keep migrating from one node to another on
demand but no communication is visible to the user processes.
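The fault-and-migrate behavior described above can be sketched as follows. This is an illustrative toy model, not a real DSM implementation; the `Node` class, `directory`, and method names are all hypothetical.

```python
# Sketch of block migration on a network block fault (hypothetical names).

class Node:
    def __init__(self, name, owned_blocks):
        self.name = name
        self.memory = dict(owned_blocks)  # block_id -> data held locally

    def read(self, block_id, directory):
        """Return data for block_id, migrating the block on a fault."""
        if block_id not in self.memory:           # network block fault
            owner = directory[block_id]           # locate the current owner
            self.memory[block_id] = owner.memory.pop(block_id)  # migrate block
            directory[block_id] = self            # ownership moves with the block
        return self.memory[block_id]

a = Node("A", {1: "alpha"})
b = Node("B", {2: "beta"})
directory = {1: a, 2: b}

print(b.read(1, directory))  # fault on B: block 1 migrates from A to B
print(1 in a.memory)         # False: A no longer holds block 1
```

The migration itself stays hidden from the application: the process on node B simply issues a read and gets the data.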
Design and implementation issues
1. Granularity
2. Structure of Shared memory
3. Memory coherence and access synchronization
4. Data location and access
5. Replacement strategy
6. Thrashing
7. Heterogeneity
Granularity
Refers to the block size of the DSM system:
the unit of sharing and the unit of data transfer across the
network when a network block fault occurs.
Possible units are a few words, a page, or a few pages.
Memory Coherence & Access Synchronization
Replicated and shared data items may
simultaneously be available in the main memories of
a number of nodes.
Memory Coherence Problem
Deals with the consistency of a piece of shared data lying
in the main memories of two or more nodes.
Replacement strategy
If the local memory of a node is full, a cache miss at that
node implies not only a fetch of the accessed data block from
a remote node but also a replacement:
an existing data block must be evicted to make room for the new one.
Thrashing
Data blocks migrate between nodes on demand.
Therefore, if two nodes compete for write access to a
single data item, the corresponding data block may be
transferred back and forth in quick succession.
Granularity
Most visible parameter in the design of DSM system is
block size.
Sending a large packet of data is not much more expensive
than sending a small one, which favors larger block sizes.
Cont…
Factors influencing block size selection:
1. Paging overhead
2. Directory size
3. Thrashing
4. False sharing
Paging overhead
A process is likely to access a large region of its
shared address space in a small amount of time.
The paging overhead is less for a large block size than
for a small block size.
Directory size
The larger the block size, the smaller the
directory.
Result: reduced directory-management overhead
for larger block sizes.
Thrashing
The problem of thrashing may occur:
when data items in the same data block are
updated by multiple nodes at the same time.
It is possible with any block size, but more likely with
larger block sizes.
False sharing
Occurs when two different processes access
two unrelated variables that reside in the
same data block.
The larger the block size, the higher the probability of
false sharing.
False sharing of a block may lead to a thrashing problem.
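A small simulation illustrates the effect. This sketch merely counts how often a block would have to migrate under an alternating access trace; the trace, block placement, and function names are all illustrative, not measurements of a real system.

```python
# Sketch: why false sharing causes block ping-pong (illustrative only).

def count_transfers(accesses, block_of):
    """Count block migrations for an access trace.
    accesses: list of (node, variable); block_of: variable -> block id."""
    holder = {}        # block id -> node currently holding the block
    transfers = 0
    for node, var in accesses:
        blk = block_of[var]
        if holder.get(blk) not in (None, node):
            transfers += 1          # block must move to the accessing node
        holder[blk] = node
    return transfers

# Node 1 only ever touches x, node 2 only ever touches y.
trace = [(1, "x"), (2, "y")] * 4

shared = count_transfers(trace, {"x": 0, "y": 0})    # same block: ping-pong
separate = count_transfers(trace, {"x": 0, "y": 1})  # different blocks
print(shared, separate)  # 7 0
```

Even though the two variables are unrelated, placing them in one block forces a transfer on almost every access.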
Possible Solutions
Using page size as block size
The relative advantages and disadvantages of small
and large block sizes make it difficult for a DSM
designer to decide on a proper block size.
On an Intel Core 2 Duo 64-bit system, the page size
is 4 KB, which is typical of desktop
PC architectures.
Linux kernel 2.6.x versions also support large pages
(4 MB).
Advantages of using page size as
block size
Use of existing page-fault schemes to trigger a DSM page
fault.
Allows access-right control.
Page size does not impose undue communication overhead
at the time of a network page fault.
Page size is a suitable data-entity unit with respect to
memory contention.
Structure of Shared-Memory Space
Structure defines the abstract view of the shared
memory space for application programmers.
The structure and granularity of a DSM system are
closely related.
Consistency Models
Consistency requirements vary from application to
application.
A consistency model refers to the degree of consistency
that has to be maintained for the shared-memory data.
If a system supports a stronger consistency model,
weaker consistency model is automatically supported
but the converse is not true.
Types of Consistency Models
1. Strict Consistency model
2. Sequential Consistency model
3. Causal consistency model
4. Pipelined Random Access Memory consistency
model (PRAM)
5. Processor Consistency model
6. Weak consistency model
7. Release consistency model
Strict Consistency Model
The strongest form with most stringent consistency
requirement.
Value returned by a read operation on a memory
address is always the same as the value written by
the most recent write operation to that address.
All writes instantaneously become visible to all
processes.
Implementation requires the existence of an absolute
global time to synchronize clocks of all nodes.
Practically impossible.
Sequential Consistency Model
Proposed by Lamport in 1979.
All processes see the same order of all memory access
operations on the shared memory.
The exact order in which the access operations are
interleaved does not matter.
Example :
Operations performed in order
1. Read(r1)
2. write(w1)
3. Read(r2)
Cont…
The only acceptable ordering
for a strictly consistent memory is
(r1, w1, r2)
For the sequential consistency model
any of the orderings (r1,w1,r2), (r1,r2,w1), (w1,r1,r2),
(w1,r2,r1), (r2,r1,w1), (r2,w1,r1) is correct,
provided all processes see the same ordering.
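The six orderings listed above are exactly the 3! permutations of the three operations; a short sketch enumerates them. The operation labels r1, w1, r2 are taken from the example; the code itself is illustrative.

```python
# Enumerate all interleavings of the example's three operations.
from itertools import permutations

ops = ("r1", "w1", "r2")
orderings = list(permutations(ops))
print(len(orderings))                   # 6: every interleaving is a candidate
print(("r1", "w1", "r2") in orderings)  # True: the strictly consistent order
```

Sequential consistency accepts any one of these six, as long as every process observes the same one; strict consistency accepts only the real-time order (r1, w1, r2).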
Cont…
The consistency requirement of the sequential
consistency model is weaker than that of the strict
consistency model.
A sequentially consistent memory provides one-copy /
single-copy semantics.
Acceptable to most applications.
Causal Consistency Model
Proposed by Hutto and Ahamad in 1990.
All processes see memory reference operations that are
potentially causally related (PCR) in the same correct
order.
Operations that are not causally related may be seen by
different processes in different orders.
Requires constructing and maintaining dependency graphs
for memory access operations.
Pipelined Random Access Memory
(PRAM) Consistency model
Proposed by Lipton and Sandberg in 1988.
Provides weaker consistency semantics.
Ensures that all write operations performed by a single
process are seen by all other processes in the order in
which they were performed, as if all the write operations
of a single process were in a pipeline.
PRAM
P1 – W11 & W12
P2 – W21 & W22
P3 can see these as ((W11,W12), (W21,W22))
P4 can see these as ((W21,W22), (W11,W12))
Advantages:
Simple and easy to implement.
Good performance
Limitation:
All processes may not agree on the same order of memory
reference operations.
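The example above can be checked mechanically: a view is PRAM-consistent if each process's own writes appear in their issue order, even though different observers may disagree on the overall interleaving. The checker below is an illustrative sketch; its names (`pram_valid`, `issued`) are hypothetical.

```python
# Sketch: check whether an observed order is PRAM-consistent.

def pram_valid(observed, per_process_writes):
    """True if 'observed' preserves each process's own write order."""
    for writes in per_process_writes.values():
        positions = [observed.index(w) for w in writes]
        if positions != sorted(positions):
            return False
    return True

issued = {"P1": ["W11", "W12"], "P2": ["W21", "W22"]}

p3_view = ["W11", "W12", "W21", "W22"]
p4_view = ["W21", "W22", "W11", "W12"]

print(pram_valid(p3_view, issued))  # True
print(pram_valid(p4_view, issued))  # True: PRAM permits the disagreement
print(pram_valid(["W12", "W11", "W21", "W22"], issued))  # False: P1's order broken
```

Both P3's and P4's views are valid under PRAM, which is exactly the limitation stated above: the processes need not agree on a single global order.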
Processor Consistency model
Proposed by Goodman in 1989.
Very similar to the PRAM model, with the additional
restriction of memory coherence.
Memory coherence:
For any memory location, all processes agree on the same order
of all write operations to that location.
Processor consistency ensures that all write operations
performed on the same location are seen by all
processes in the same order.
Weak Consistency Model
Many applications do not require that the change made
by every write operation be immediately visible to other
processes.
Idea of weak consistency:
Better performance can be achieved if consistency
is enforced on a group of memory reference
operations rather than on individual memory
reference operations.
Uses a special variable called a
synchronization variable to synchronize
memory.
Cont…
Requirements:
1. All accesses to synchronization variables must obey
sequential consistency semantics.
2. All previous write operations must be completed
everywhere before an access to a synchronization
variable is allowed.
3. All previous accesses to synchronization variables must
be completed before access to a non-synchronization
variable is allowed.
The model offers better performance at the cost of putting an
extra burden on the programmer.
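The idea of deferring visibility until a synchronization point can be sketched as follows. This is a toy single-process model, not a real DSM: the class and method names are hypothetical, and `shared` stands in for what other processes can see.

```python
# Sketch of weak consistency: writes are buffered locally and only become
# visible at an access to the synchronization variable (hypothetical names).

class WeaklyConsistentStore:
    def __init__(self):
        self.shared = {}    # globally visible memory (other processes' view)
        self.pending = {}   # local writes not yet propagated

    def write(self, addr, value):
        self.pending[addr] = value      # local write, not yet visible

    def read_shared(self, addr):
        return self.shared.get(addr)    # what another process would read

    def synchronize(self):
        """Access the synchronization variable: complete all prior writes."""
        self.shared.update(self.pending)
        self.pending.clear()

m = WeaklyConsistentStore()
m.write("x", 1)
print(m.read_shared("x"))  # None: the write is not yet visible
m.synchronize()
print(m.read_shared("x"))  # 1: visible after synchronization
```

Batching many writes behind one synchronization access is exactly where the performance gain of the model comes from.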
Release consistency model
Enhancement of weak consistency model.
Use of two synchronization variables:
Acquire
Release
Cont…
Acquire
Used to tell the system that a process is
entering a critical section.
Results in propagating changes made by other
nodes to the process's node.
Release
Used to tell the system that a process has just
exited a critical section.
Results in propagating changes made by the
process to other nodes.
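The acquire/release pair can be sketched as a pull-then-push protocol. This is an illustrative toy model; `home` stands in for the other nodes' view of memory, and all names are hypothetical.

```python
# Sketch of release consistency: acquire pulls in remote changes,
# release pushes local changes out (hypothetical names).

class ReleaseConsistentNode:
    def __init__(self, home):
        self.home = home      # shared "master" copy held elsewhere
        self.local = {}
        self.dirty = {}

    def acquire(self):
        """Enter critical section: propagate others' changes inward."""
        self.local = dict(self.home)

    def write(self, addr, value):
        self.local[addr] = value
        self.dirty[addr] = value

    def release(self):
        """Exit critical section: propagate own changes outward."""
        self.home.update(self.dirty)
        self.dirty.clear()

home = {"x": 0}
n = ReleaseConsistentNode(home)
n.acquire()
n.write("x", 42)
print(home["x"])   # 0: the write is not yet released
n.release()
print(home["x"])   # 42: visible to other nodes after release
```

Under lazy release consistency, the `release` step would only record the changes and send them on demand, rather than pushing them immediately.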
Cont…
A variation of release consistency is lazy
release consistency proposed by Keleher in
1992.
Modifications are not sent to other nodes at the
time of release but only on demand.
Better performance.
Implementing Seq. Consistency Model
The protocol for implementing sequential consistency
depends on whether the DSM system allows
replication and/or migration
of shared-memory data blocks.
Strategies:
1. Nonreplicated, Nonmigrating blocks (NRNMB)
2. Nonreplicated, Migrating blocks (NRMB)
3. Replicated, migrating blocks (RMB)
4. Replicated, Nonmigrating blocks (RNMB)
NRNMB
Simplest strategy.
Each block of the shared memory has a single
copy whose location is always fixed.
Cont…
Enforcing sequential consistency is trivial.
Method is simple and easy to implement.
Drawbacks:
Serializing data access creates a bottleneck.
Parallelism is not possible.
Cont…
Data locating in the NRNMB strategy:
There is a single copy of each block in the entire
system, and the location of a block never changes.
Hence, a simple mapping function is sufficient to map a
block to a node.
NRMB
Each block of the shared memory has a single
copy in the entire system.
Migration is allowed.
The owner node of a block changes as soon as the
block migrates to a new node.
Only the processes executing on one node
can read or write a given data item at any
time.
Cont…
Advantages:
No communication costs are incurred when a
process accesses data currently held locally.
Exploits the locality of data access.
Drawbacks:
Prone to thrashing problem.
No parallelism.
Data locating in the NRMB strategy
There is a single copy of each block, but the location
of a block keeps changing dynamically.
Cont…
The following methods may be used:
1. Broadcasting
2. Centralized server algorithm
3. Fixed distributed server algorithm
4. Dynamic distributed server algorithm
Broadcasting
Each node maintains an owned block table
that contains an entry for each block for
which the node is the current owner.
Cont…
On a fault,
Fault handler of the faulting node broadcasts a
read/write request on the network.
Disadvantage:
Not scalable.
Centralized server algorithm
Cont…
A centralized server maintains a block table
that contains the location information for all
blocks in the shared-memory space.
Drawbacks:
A centralized server serializes location
queries, reducing parallelism.
The failure of the centralized server will cause
the DSM system to stop functioning.
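The server's role can be sketched in a few lines. This is an illustrative model only; the class and method names are hypothetical, and a real implementation would involve messages between nodes.

```python
# Sketch of the centralized-server algorithm: one block table maps each
# block to its current owner; faults are resolved by querying the server.

class CentralServer:
    def __init__(self, block_table):
        self.block_table = dict(block_table)     # block id -> owner node name

    def locate_and_migrate(self, block_id, requester):
        """Answer a faulting node's query and record the migration."""
        owner = self.block_table[block_id]
        self.block_table[block_id] = requester   # ownership moves on migration
        return owner

server = CentralServer({1: "A", 2: "B"})
print(server.locate_and_migrate(1, "C"))  # "A": fetch block 1 from node A
print(server.block_table[1])              # "C": node C is the new owner
```

Every lookup goes through this one object, which is precisely the serialization bottleneck and single point of failure noted above.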
Fixed distributed server algorithm
Cont..
A direct extension of the centralized server
scheme.
A block manager runs on each of several nodes.
Each block manager is given a predetermined
subset of data blocks to manage.
On a fault, a mapping function is used to find the
block manager responsible for the currently accessed block.
Dynamic distributed server algorithm
Cont…
Does not use any block manager; instead it
attempts to keep track of the ownership
information of all blocks at each node.
Each node has a block table that
contains the ownership information for all blocks.
Uses the notion of a probable owner for each block.
When a fault occurs, the faulting node extracts the
probable-owner information from its block table and sends its
request to that node; the request is forwarded along the chain
of hints until the true owner is reached.
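The probable-owner chase can be sketched as follows. This is an illustrative model in which each node's hints are a dictionary; the function name and the hint-update step (updating stale hints after a lookup) are assumptions in the spirit of the algorithm, not a definitive implementation.

```python
# Sketch of the probable-owner chase in the dynamic distributed-server
# algorithm (illustrative names; hint update is an assumed optimization).

def find_owner(block, probable, start):
    """Follow probable-owner hints from 'start' until the chain ends."""
    path, node = [], start
    while probable[node][block] != node:   # an owner points to itself
        path.append(node)
        node = probable[node][block]
    for visited in path:                   # update stale hints along the path
        probable[visited][block] = node
    return node

# A thinks B owns block 7, B thinks C does; C actually owns it.
probable = {"A": {7: "B"}, "B": {7: "C"}, "C": {7: "C"}}
print(find_owner(7, probable, "A"))  # "C"
print(probable["A"][7])              # "C": A's hint now names the true owner
```

Updating the hints on the way back keeps later chases short, much like path compression in union-find.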
RMB - Replicated, migrating blocks
Used to increase parallelism.
Read operations are carried out in parallel at multiple
nodes by accessing the local copy of the data.
Replication tends to increase the cost of write
operations.
It becomes a problem to maintain consistency among the copies.
Replication is an extra expense if the read/write ratio is small.
Cont…
Protocols to ensure sequential consistency:
1. Write-invalidate
2. Write-update
Write-invalidate
All copies of a piece of data except one are invalidated
before a write can be performed on it.
After invalidation of a block, only the node that performs
the write operation on the block holds the modified
version of the block.
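The invalidation step can be sketched as follows. This is a toy model: the per-node memories are dictionaries, and the function name is illustrative.

```python
# Sketch of write-invalidate: before a write, every other copy of the
# block is invalidated, so only the writer holds the modified version.

def write_invalidate(copies, writer, block, value):
    """copies: node name -> {block: value}. Invalidate, then write locally."""
    for node, mem in copies.items():
        if node != writer:
            mem.pop(block, None)        # invalidate every remote copy
    copies[writer][block] = value       # only the writer has the new version

copies = {"A": {1: "old"}, "B": {1: "old"}, "C": {1: "old"}}
write_invalidate(copies, "B", 1, "new")
print(copies["B"][1])    # "new"
print(1 in copies["A"])  # False: A's copy was invalidated
```

Nodes A and C will take a block fault on their next access and re-fetch the modified block from B, which is when the update actually propagates.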
Cont…
Write-update
A write operation is carried out by updating all copies of
the data on which the write is performed.
On a write fault, the fault handler copies the accessed
block from one of the block's current nodes to its own
node.
The write operation completes only after all the copies of
the block have been successfully updated.
Cont…
A global sequencer assigns a sequence number to each
modification and multicasts it to all the nodes where a
replica of the data block to be modified is located.
The write operations are processed at each node in
sequence-number order.
If the verification fails, the node requests the sequencer
to retransmit the missing modification.
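A replica's side of this scheme can be sketched as follows. The class and method names are illustrative, and real systems would carry the retransmission over the network; the gap check is the "verification" referred to above.

```python
# Sketch of the global-sequencer scheme for write-update: replicas apply
# modifications strictly in sequence-number order and detect gaps.

class Replica:
    def __init__(self):
        self.memory = {}
        self.expected = 1    # next sequence number this replica will apply

    def receive(self, seq, addr, value):
        """Apply in order; report a gap so the sender can retransmit."""
        if seq != self.expected:
            return "retransmit"          # a modification was missed
        self.memory[addr] = value
        self.expected += 1
        return "applied"

r = Replica()
print(r.receive(1, "x", 10))  # applied
print(r.receive(3, "y", 20))  # retransmit: update 2 was missed
print(r.receive(2, "z", 30))  # applied
```

Because every replica applies the same modifications in the same sequence-number order, all copies converge to the same state, which is what makes the scheme sequentially consistent.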
Cont…
The write-update approach is very expensive.
In the write-invalidate approach, updates are propagated
only when data is read, and several updates
can take place before communication is necessary.
A status tag associated with each block
indicates whether the block is valid, shared, read-only, or writable.
Data Locating in the RMB strategy
The following algorithms may be used:
1. Broadcasting
2. Centralized-server algorithm
3. Fixed distributed-server algorithm
4. Dynamic distributed-server algorithm
Replicated, Non-Migrating Block
A shared-memory block may be replicated at multiple
nodes of the system, but the location of each replica is
fixed.
All replicas of a block are kept consistent by updating
them all in case of a write access.
Sequential consistency is ensured by using a global
sequencer to sequence the write operations of all nodes.
Data locating in the RNMB strategy
Following characteristics:
1. The replica locations of a block never change.
2. All replicas of a data block are kept consistent.
3. A read request can be sent directly to one of the
nodes holding a replica, but all write requests have to
be sent to the sequencer.