CSC334 Parallel & Distributed Computing
Lecture # 06
Parallel Programming Models
Suliman Khan
Department of Computer Science
University of Lahore, Sargodha Campus
Parallel Computers
• Programming model types
– Shared Memory
– Distributed Memory
– Hybrid Model
Parallel Programming Models
• Parallel programming models exist as an abstraction above hardware and memory
architectures
• These models are NOT specific to a particular type of machine or memory
architecture
• These models can (theoretically) be implemented on any underlying hardware
• Examples from the past:
• SHARED memory model on a DISTRIBUTED memory machine: the Kendall Square Research (KSR)
ALLCACHE approach, “virtual shared memory”
• DISTRIBUTED memory model on a SHARED memory machine: Message Passing Interface (MPI)
on the SGI Origin 2000, which employed the CC-NUMA type of shared memory architecture,
although MPI is more commonly run over a network of distributed memory machines
• Which model to use?
• Combination of what is available and personal choice
Shared Memory
• Architecture
Processors have direct access to global memory and
I/O through a bus or a fast switching network
• Cache Coherency Protocol guarantees
consistency of memory and I/O accesses
• Each processor also has its own memory (cache)
• Data structures are shared in global address space
• Concurrent access to shared memory must be coordinated
• Programming Models
– Multithreading (Thread Libraries)
– OpenMP

[Figure: processors P0, P1, …, Pn, each with a private cache, connected by a shared bus to global shared memory]
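The coordination requirement above can be sketched in Python, used here as a stand-in for the C-based thread libraries the slides name: several threads update one variable in the shared address space, and a lock coordinates the concurrent access.

```python
import threading

counter = 0              # data structure in the shared address space
lock = threading.Lock()  # coordinates concurrent access to shared memory

def worker(n):
    global counter
    for _ in range(n):
        with lock:       # without the lock, interleaved updates could be lost
            counter += 1

threads = [threading.Thread(target=worker, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)  # 40000: every increment survives because access was coordinated
```

Removing the lock turns the read-modify-write of `counter` into a race, which is exactly the hazard the slide's "must be coordinated" point refers to.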
Threads Model
• Threads implementations commonly comprise:
• A library of subroutines that are called from within parallel source code
• A set of compiler directives embedded in either serial or parallel source code
• Historically, hardware vendors have implemented their own proprietary
versions of threads, making it difficult for programmers to develop
portable threaded applications
• Standardization efforts: POSIX Threads (IEEE POSIX 1003.1c) and
OpenMP (Industry standard)
• POSIX Part of Unix/Linux, Library based
• OpenMP Compiler directive based, Portable / multi-platform
• Other implementations: Microsoft threads, Java threads, Python threads, CUDA threads for GPUs
OpenMP
• OpenMP: portable shared memory parallelism
• Higher-level API for writing portable
multithreaded applications
• Provides a set of compiler directives and library routines
for parallel application programmers
• API bindings for Fortran, C, and C++
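OpenMP itself is expressed as compiler directives in C, C++, and Fortran (e.g. a `#pragma omp parallel for` with a reduction). As a rough analogy only, the same fork-join "parallel loop with a reduction" idea can be sketched with Python's standard thread pool:

```python
from concurrent.futures import ThreadPoolExecutor

# Analogy for: #pragma omp parallel for reduction(+:total)
def f(x):
    return x * x

# The pool forks a team of workers, the loop iterations are divided
# among them, and the partial results are combined (joined) at the end.
with ThreadPoolExecutor(max_workers=4) as pool:
    total = sum(pool.map(f, range(10)))

print(total)  # 285 = 0^2 + 1^2 + ... + 9^2
```

The real OpenMP directive does this work-sharing and reduction inside the compiler and runtime; the Python sketch only mirrors the structure of the idea.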
Distributed Memory
Architecture
• Each Processor has direct access only to its local memory
• Processors are connected via high-speed interconnect
• Data structures must be distributed
• Data exchange is done via explicit processor-to-processor communication: send/receive messages
• Programming Models
– Widely used standard: MPI
– Others: PVM, Express, P4, Chameleon, PARMACS, ...
[Figure: processors P0, P1, …, Pn, each with its own local memory, connected by a communication interconnect]
Message Passing Interface
MPI provides:
• Point-to-point communication
• Collective operations
– Barrier synchronization
– Gather/scatter operations
– Broadcast, reductions
• Different communication modes
– Synchronous/asynchronous
– Blocking/non-blocking
– Buffered/unbuffered
• Predefined and derived datatypes
• Virtual topologies
• Parallel I/O (MPI 2)
• C/C++ and Fortran bindings
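The point-to-point and reduction patterns above can be sketched without a real MPI installation. In the hypothetical sketch below, Python threads stand in for MPI ranks and per-rank queues stand in for the network; the actual MPI calls would be `MPI_Send`, `MPI_Recv`, and `MPI_Reduce` (with rank 0 as the root).

```python
import queue
import threading

NPROCS = 4
results = []
# One inbox per "rank"; sending a message = putting it in the receiver's queue.
inbox = [queue.Queue() for _ in range(NPROCS)]
chunks = [[1, 2], [3, 4], [5, 6], [7, 8]]  # each rank owns a different chunk

def rank(r, data):
    if r != 0:
        inbox[0].put((r, sum(data)))      # point-to-point send to rank 0
    else:
        partial = sum(data)
        for _ in range(NPROCS - 1):       # rank 0 receives and reduces
            _, value = inbox[0].get()     # blocking receive
            partial += value
        results.append(partial)

workers = [threading.Thread(target=rank, args=(r, chunks[r])) for r in range(NPROCS)]
for w in workers:
    w.start()
for w in workers:
    w.join()

print(results[0])  # 36 = sum of 1..8, reduced at rank 0
```

In real MPI the ranks are separate processes on different nodes with no shared memory at all, which is why the send/receive messages are the only way data moves.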
Hybrid Model
• A hybrid model combines more than one of the
previously described programming models.
• A common example of a hybrid model is the
combination of the message passing model
(MPI) with the threads model (OpenMP).
• Threads perform computationally intensive
kernels using local, on-node data
• Communications between processes on
different nodes occurs over the network using
MPI
Hybrid Model
• Another similar and increasingly popular example
of a hybrid model is using MPI with CPU-GPU
(Graphics Processing Unit) programming.
• MPI tasks run on CPUs using local memory and communicate with each other over a network.
• Computationally intensive kernels are off-loaded to on-node GPUs.
• Data exchange between node-local memory and the GPUs uses CUDA (or an equivalent interface).
High level programming model
• Single Program Multiple Data (SPMD)
• Multiple Program Multiple Data (MPMD)
SPMD
• Built upon any combination of the previously mentioned parallel
programming models
• SINGLE PROGRAM: All tasks execute their copy of the same program
simultaneously. This program can be threads, message passing, data
parallel or hybrid.
• MULTIPLE DATA: All tasks may use different data
– tasks do not necessarily have to execute the entire program
– perhaps only a portion of it
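Both SPMD properties can be shown in one small Python sketch (names like `program` and `rank` are illustrative, not part of any standard API): every task runs the same function, each task gets its own data, and the rank lets a task execute only a portion of the program.

```python
import threading

def program(rank, data, out):
    # SINGLE PROGRAM: every task executes this same function...
    if rank == 0:
        out[rank] = ("coordinator", len(data))  # ...but rank 0 takes a different branch
    else:
        out[rank] = ("worker", sum(data))       # MULTIPLE DATA: each rank sums its own chunk

chunks = [[0], [1, 2], [3, 4], [5, 6]]
out = [None] * 4
tasks = [threading.Thread(target=program, args=(r, chunks[r], out)) for r in range(4)]
for t in tasks:
    t.start()
for t in tasks:
    t.join()

print(out)  # [('coordinator', 1), ('worker', 3), ('worker', 7), ('worker', 11)]
```

This is the structure of a typical MPI program: one executable, launched as many tasks, branching on its rank.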
MPMD
• Built upon any combination of the previously mentioned parallel programming models
• MULTIPLE PROGRAM: Tasks may execute different programs
simultaneously.
• The programs can be threads, message passing, data parallel or
hybrid.
• MULTIPLE DATA: All tasks may use different data
MPMD applications are not as common
as SPMD applications
Parallel and Distributed
Programming Models
• OPENMP
• MPI
• For message passing systems
• MapReduce and BigTable
• For internet clouds and data centers
• Service clouds require extending Hadoop, EC2, and S3 to facilitate distributed computing over a distributed storage system
• CUDA
• For NVIDIA GPUs
• Open Grid Service Architecture (OGSA)
• For grid application development
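The MapReduce model listed above can be illustrated with a toy in-process word count, a sketch only: each input shard is mapped independently to partial counts, then the partials are merged in a reduce phase. A real Hadoop job distributes those map and reduce tasks across a cluster.

```python
from collections import Counter
from functools import reduce

docs = ["to be or not to be", "to code or not to code"]  # two input shards

# Map phase: each shard -> local word counts (independent, so it parallelizes).
mapped = [Counter(d.split()) for d in docs]

# Reduce phase: merge the partial counts into the final result.
total = reduce(lambda a, b: a + b, mapped, Counter())

print(total["to"])  # 4: "to" appears twice in each shard
```

The key property is that the map step has no cross-shard dependencies, which is what lets internet clouds and data centers scale it out.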
Performance, Security and Energy
Efficiency
• Performance Metrics
• CPU speed, FLOPS, job response time, network latency, system throughput, network bandwidth, system overhead (OS boot time, compile time, etc.)
• Scalability
• Machine (size), software,
application, and technology
scalability
• Amdahl’s law
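Amdahl's law bounds the speedup of a program whose parallel fraction is p when run on n processors: S(n) = 1 / ((1 - p) + p/n), so even with unlimited processors the speedup is capped at 1/(1 - p). A minimal calculation:

```python
def amdahl_speedup(p, n):
    """Speedup with parallel fraction p on n processors: 1 / ((1 - p) + p / n)."""
    return 1.0 / ((1.0 - p) + p / n)

# With 90% of the work parallelizable:
print(round(amdahl_speedup(0.9, 10), 2))    # 5.26 on 10 processors
print(round(amdahl_speedup(0.9, 1000), 2))  # 9.91 on 1000 -- near the 1/(1-p) = 10 cap
```

This is why the serial fraction, not the processor count, often dominates scalability in practice.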
Performance, Security and Energy
Efficiency
• Security
• Threats to system and network
• Confidentiality, integrity, and availability
• Copyright protection
• System Defense technologies
• Data protection infrastructures (IDS)
• Energy efficiency
• Distributed power management
• Unused servers’ energy consumption
• Reducing energy in active servers
That’s all for today!!