Shared Memory Architecture
Shared memory systems form a major category of multiprocessors.
All processors share a global memory.
Communication between tasks running on different processors is performed through
writing to and reading from the global memory.
All interprocessor coordination and synchronization is also accomplished via the global
memory.
A shared memory computer system consists of a set of independent processors, a set of
memory modules, and an interconnection network.
Two main problems arise in designing a shared memory system:

Performance degradation due to contention. Performance degradation can occur when multiple processors attempt to access the shared memory simultaneously. A typical design uses caches to reduce this contention.

Coherence problems. Having multiple copies of data spread throughout the caches can lead to a coherence problem. The copies in the caches are coherent if they all hold the same value. However, if one of the processors writes a new value over one of the copies, that copy becomes inconsistent because it no longer equals the other copies.
CLASSIFICATION OF SHARED MEMORY SYSTEMS
The simplest shared memory system consists of one memory module (M) that can be accessed
from two processors (P1 and P2).
Requests arrive at the memory module through its two ports. An arbitration unit within the
memory module passes requests through to a memory controller. If the memory module is not
busy and a single request arrives, then the arbitration unit passes that request to the memory
controller and the request is satisfied.
The module is placed in the busy state while a request is being serviced. If a new request
arrives while the memory is busy servicing a previous request, the memory module sends a
wait signal, through the memory controller, to the processor making the new request.
In response, the requesting processor may hold its request on the line until the memory
becomes free or it may repeat its request some time later.
If the arbitration unit receives two requests, it selects one of them and passes it to the memory
controller. Again, the denied request can be either held to be served next or it may be repeated
some time later.
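The arbitration behavior described above can be sketched in a few lines of Python. This is a toy model, not any real hardware interface; the names `MemoryModule`, `submit`, and `complete` are invented for illustration.

```python
import random

class MemoryModule:
    """Toy model of a dual-ported memory module with an arbitration unit."""

    def __init__(self):
        self.busy = False

    def submit(self, requests):
        """Arbitrate among simultaneous requests arriving at the ports.

        Returns (granted, waiting): the request passed to the memory
        controller, and the requests that received a wait signal.
        """
        if self.busy or not requests:
            return None, list(requests)    # wait signal to every requester
        granted = random.choice(requests)  # arbitration unit selects one
        waiting = [r for r in requests if r is not granted]
        self.busy = True                   # busy while servicing the request
        return granted, waiting

    def complete(self):
        self.busy = False                  # request satisfied; module free

m = MemoryModule()
granted, waiting = m.submit(["P1 read X", "P2 write Y"])
# exactly one request is granted; the other must hold or retry later
```

A denied requester can either hold its request until the module frees up or resubmit it later, exactly as in the text.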
Based on the interconnection network used, shared memory systems can be classified into the following categories:
Uniform Memory Access (UMA)
Nonuniform Memory Access (NUMA)
Cache-Only Memory Architecture (COMA)
Uniform Memory Access (UMA)
A shared memory is accessible by all processors through an interconnection network in the
same way a single processor accesses its memory.
All processors have equal access time to any memory location.
The interconnection network used in the UMA can be a single bus, multiple buses, or a
crossbar switch.
Because access to the shared memory is balanced, these systems are also called SMP (symmetric multiprocessor) systems.
Each processor has an equal opportunity to read from and write to memory, with equal access speed.
A typical bus-structured SMP computer attempts to reduce contention for the bus by
fetching instructions and data from each processor's individual cache as much as possible. In
the extreme, bus contention can drop to zero once the caches have been loaded from
global memory, because it is then possible for all instructions and data to be contained
entirely within the caches. This memory organization is the most popular among shared
memory systems.
Nonuniform Memory Access (NUMA)
In NUMA systems, each processor has part of the shared memory attached to it. The memory has a single
address space, so any processor can access any memory location directly using its real
address. However, the access time to a module depends on its distance from the processor,
which results in nonuniform memory access times.
A number of architectures are used to interconnect processors to memory modules in a
NUMA. Among these are the tree and the hierarchical bus networks.
Cache-Only Memory Architecture (COMA)
Similar to the NUMA, each processor has part of the shared memory in the COMA. However,
in this case the shared memory consists of cache memory.
A COMA system requires that data be migrated to the processor requesting it. There is no
memory hierarchy and the address space is made of all the caches. There is a cache directory
(D) that helps in remote cache access.
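The migration-on-request behavior can be sketched as follows. This is a minimal illustration, not a real COMA implementation; the directory is modeled as a plain mapping from a block address to the cache currently holding it.

```python
def coma_read(proc, addr, caches, directory):
    """COMA sketch: the address space is the union of all caches.

    The directory locates the block; data migrates to the requester,
    which becomes the new holder. Names here are illustrative.
    """
    owner = directory[addr]
    if owner != proc:
        caches[proc][addr] = caches[owner].pop(addr)  # migrate the block
        directory[addr] = proc                        # update the directory
    return caches[proc][addr]

caches = {0: {"X": 42}, 1: {}}   # processor 0's cache holds block X
directory = {"X": 0}
coma_read(1, "X", caches, directory)   # X migrates from cache 0 to cache 1
```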
BUS-BASED SYMMETRIC MULTIPROCESSORS
A typical bus-based design uses caches to solve the bus contention problem.
High-speed caches, connected to each processor on one side and to the bus on the other,
allow local copies of instructions and data to be supplied at the highest possible rate.
If the local processor finds all of its instructions and data in the local cache, we say the hit rate
is 100%.
The miss rate of a cache is the fraction of the references that cannot be satisfied by the cache,
and so must be copied from the global memory, across the bus, into the cache, and then passed
on to the local processor.
One of the goals of the cache is to maintain a high hit rate (equivalently, a low miss rate) under
high processor loads. A high hit rate means the processors make less use of the bus.
BASIC CACHE COHERENCY METHODS
Multiple copies of data, spread throughout the caches, lead to a coherence problem among
the caches. The copies in the caches are coherent if they all equal the same value. However, if
one of the processors writes over the value of one of the copies, then the copy becomes
inconsistent because it no longer equals the value of the other copies. If data are allowed to
become inconsistent (incoherent), incorrect results will be propagated through the system,
leading to incorrect final results. Cache coherence algorithms are needed to maintain a level
of consistency throughout the parallel system.
• Cache–Memory Coherence.
• Cache–Cache Coherence.
• Shared Memory System Coherence
Cache–Memory Coherence.
In a single cache system, coherence between memory and the cache is maintained using
one of two policies:
(1) write-through,
(2) write-back.
When a task running on a processor P requests the data at memory location X, for
example, the contents of X are copied to the cache, from which they are passed on to P. When P
updates the value of X in the cache, the copy in memory must also be updated in
order to maintain consistency.
In write-through, the memory is updated every time the cache is updated.
In write-back, the memory is updated only when the block in the cache is being replaced.
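The two policies can be contrasted in a short sketch. `SingleCache`, `write`, and `evict` are made-up names for illustration, not any real cache API.

```python
class SingleCache:
    """One-block-at-a-time cache contrasting write-through and write-back."""

    def __init__(self, memory, policy="write-through"):
        self.memory = memory   # dict: address -> value (the global memory)
        self.policy = policy
        self.block = {}        # cached copies
        self.dirty = set()     # blocks modified but not yet written back

    def read(self, addr):
        if addr not in self.block:        # miss: copy X from memory
            self.block[addr] = self.memory[addr]
        return self.block[addr]

    def write(self, addr, value):
        self.block[addr] = value
        if self.policy == "write-through":
            self.memory[addr] = value     # memory updated on every write
        else:                             # write-back
            self.dirty.add(addr)          # memory updated only on replacement

    def evict(self, addr):
        if addr in self.dirty:            # write-back flushes the dirty block
            self.memory[addr] = self.block[addr]
            self.dirty.discard(addr)
        self.block.pop(addr, None)
```

Under write-through the memory is never stale; under write-back it can be stale until the block is replaced, as the text describes.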
Cache–Cache Coherence
In a multiprocessing system, when a task running on processor P requests the data at global
memory location X, for example, the contents of X are copied to processor P's local cache, from which
they are passed on to P. Now suppose processor Q also accesses X. What happens if Q wants to write
a new value over the old value of X? There are two fundamental cache coherence policies:
(1) write-invalidate,
(2) write-update.
Write-invalidate maintains consistency by allowing reads from local caches until a write occurs. When
any processor updates the value of X through a write, all other copies are invalidated and a dirty bit
is set for X. For example, when processor Q writes a new value into its cache, it invalidates all other
copies of X and sets the dirty bit for X. Q can then continue to change X without further notifications
to other caches, because Q holds the only valid copy of X. However, when processor P later wants to read X,
it must wait until X is updated and the dirty bit is cleared.
Write-update maintains consistency by immediately updating all copies in all caches. Dirty
bits are set during each write operation; after all copies have been updated, all dirty bits are
cleared.
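The two policies can be contrasted with a small snooping sketch. The class and method names are invented for illustration, and for simplicity the sketch also writes memory through on every write; a real protocol would combine the invalidate/update choice with a write-through or write-back memory policy as discussed below.

```python
class Bus:
    """Snooping bus connecting a set of caches (illustrative model)."""
    def __init__(self):
        self.caches = []

class SnoopingCache:
    def __init__(self, bus, policy):
        self.bus, self.policy, self.data = bus, policy, {}
        bus.caches.append(self)

    def read(self, addr, memory):
        if addr not in self.data:          # miss: fetch from global memory
            self.data[addr] = memory[addr]
        return self.data[addr]

    def write(self, addr, value, memory):
        self.data[addr] = value
        for other in self.bus.caches:      # other caches snoop the write
            if other is self or addr not in other.data:
                continue
            if self.policy == "write-invalidate":
                del other.data[addr]       # other copies are invalidated
            else:                          # write-update
                other.data[addr] = value   # all copies updated immediately
        memory[addr] = value               # simplification: write through
```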
Shared Memory System Coherence
• If we permit write-update with write-through directly on a global memory location X,
the bus starts to become busy and ultimately all processors sit idle while
waiting for writes to complete.
• In write-update with write-back, only the copies in the caches are updated; memory is
written later, when the block is replaced.
Write-Invalidate and Write-Through
In this simple protocol the memory is always consistent with the most recently
updated cache copy. Multiple processors can read block copies from main
memory safely until one processor updates its copy. At this time, all cache copies
are invalidated and the memory is updated to remain consistent.
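This protocol can be sketched in a few lines: the write goes through to memory, and every other cached copy is invalidated in the same step, so memory always matches the most recent write. The function name `write` and the dict-based caches are purely illustrative.

```python
# Write-invalidate with write-through: a minimal sketch, not a real protocol
# implementation. Caches and memory are modeled as plain dicts.

def write(writer, caches, memory, addr, value):
    memory[addr] = value         # write-through: memory stays consistent
    for c in caches:
        if c is not writer:
            c.pop(addr, None)    # all other cache copies are invalidated
    writer[addr] = value         # writer keeps the only cached copy

memory = {"X": 0}
p, q = {"X": 0}, {"X": 0}        # both caches hold a copy of block X
write(p, [p, q], memory, "X", 7)
# memory and p now hold 7; q's copy is gone (invalid)
```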
Write-Invalidate and Write-Back
In this protocol a valid block can be owned by memory and shared among multiple caches, which
then contain only shared (read-only) copies of the block.
Multiple processors can safely read these blocks from their caches until one processor updates
its copy. At this time, the writer becomes the only owner of the valid block and all other copies
are invalidated.
Write-Update and Partial Write-Through
In this protocol an update to one cache is written to memory at the same time it is broadcast to
other caches sharing the updated block. These caches snoop on the bus and perform updates
to their local copies. There is also a special bus line, which is asserted to indicate that at least
one other cache is sharing the block.
Write-Update and Write-Back
This protocol is similar to the previous one except that instead of writing through to the
memory whenever a shared block is updated, memory updates are done only when the
block is being replaced.
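The deferred memory write can be sketched as follows. The function names and the dirty-set representation are illustrative assumptions, not a standard protocol API.

```python
def write_update(writer_value_addr_sharers):
    raise NotImplementedError  # placeholder; see the functions below

def broadcast_write(sharers, addr, value, dirty):
    """Write-update with write-back: the new value is broadcast to every
    cache sharing the block, but the memory write is deferred (sketch)."""
    for cache in sharers:
        cache[addr] = value      # all cached copies updated immediately
    dirty.add(addr)              # memory will be updated on replacement

def replace(cache, addr, memory, dirty):
    if addr in dirty:
        memory[addr] = cache[addr]   # write back only at replacement time
        dirty.discard(addr)
    cache.pop(addr, None)

memory = {"X": 0}
c1, c2 = {"X": 0}, {"X": 0}
dirty = set()
broadcast_write([c1, c2], "X", 3, dirty)   # caches see 3; memory still 0
replace(c1, "X", memory, dirty)            # now memory is updated to 3
```

Compared with the partial write-through protocol above, this variant saves bus traffic to memory at the cost of memory being temporarily stale.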