0% found this document useful (0 votes)

2K views18 pages

Pipeline and Vector Processing

The document discusses pipelining and vector processing techniques for improving parallel processing performance. It covers: 1) Pipelining which breaks down instructions into sequential sub-operations that can execute concurrently across multiple stages. This includes arithmetic, instruction, and RISC pipelines. 2) Vector processing which performs the same operation on multiple data elements simultaneously using array processors. 3) Parallel computers classified by Flynn's taxonomy based on the number of instruction and data streams as SISD, SIMD, MISD, and MIMD architectures.

Uploaded by

nancy_01

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2K views18 pages

Pipeline and Vector Processing

Uploaded by

nancy_01

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 18

Pipelining and Vector Processing 1

PIPELINING AND VECTOR PROCESSING

• Parallel Processing

• Pipelining

• Arithmetic Pipeline

• Instruction Pipeline

• RISC Pipeline

• Vector Processing

• Array Processors

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 2 Parallel Processing

PARALLEL PROCESSING

Execution of Concurrent Events in the computing

process to achieve faster Computational Speed

Levels of Parallel Processing

- Job or Program level

- Task or Procedure level

- Inter-Instruction level

- Intra-Instruction level

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 3 Parallel Processing

PARALLEL COMPUTERS
Architectural Classification

– Flynn's classification
» Based on the multiplicity of Instruction Streams and
Data Streams
» Instruction Stream
• Sequence of Instructions read from memory
» Data Stream
• Operations performed on the data in the processor

Number of Data Streams

Single Multiple

Number of Single SISD SIMD

Instruction
Streams Multiple MISD MIMD

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 4 Pipelining

PIPELINING
A technique of decomposing a sequential process
into suboperations, with each subprocess being
executed in a partial dedicated segment that
operates concurrently with all other segments.
Ai * Bi + Ci for i = 1, 2, 3, ... , 7
Ai Bi Memory Ci
Segment 1
R1 R2

Multiplier
Segment 2
R3 R4

Adder
Segment 3

R1  Ai, R2  Bi Load Ai and Bi

R3  R1 * R2, R4  Ci Multiply and load Ci
R5  R3 + R4 Add
Computer Organization Computer Architectures Lab
Pipelining and Vector Processing 5 Pipelining

OPERATIONS IN EACH PIPELINE STAGE

Clock Segment 1 Segment 2 Segment 3

Pulse
Number R1 R2 R3 R4 R5
1 A1 B1
2 A2 B2 A1 * B1 C1
3 A3 B3 A2 * B2 C2 A1 * B1 + C1
4 A4 B4 A3 * B3 C3 A2 * B2 + C2
5 A5 B5 A4 * B4 C4 A3 * B3 + C3
6 A6 B6 A5 * B5 C5 A4 * B4 + C4
7 A7 B7 A6 * B6 C6 A5 * B5 + C5
8 A7 * B7 C7 A6 * B6 + C6
9 A7 * B7 + C7

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 6 Pipelining

GENERAL PIPELINE
General Structure of a 4-Segment Pipeline
Clock

Input S1 R1 S2 R2 S3 R3 S4 R4

Space-Time Diagram
1 2 3 4 5 6 7 8 9 Clock cycles
Segment 1 T1 T2 T3 T4 T5 T6
2 T1 T2 T3 T4 T5 T6
3 T1 T2 T3 T4 T5 T6
4 T1 T2 T3 T4 T5 T6

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 7 Pipelining

PIPELINE SPEEDUP
n: Number of tasks to be performed

Conventional Machine (Non-Pipelined)

tn: Clock cycle
: Time required to complete the n tasks
 = n * tn

Pipelined Machine (k stages)

tp: Clock cycle (time to complete each suboperation)
: Time required to complete the n tasks
 = (k + n - 1) * tp

Speedup
Sk: Speedup

Sk = n*tn / (k + n - 1)*tp
tn
lim Sk = ( = k, if tn = k * tp )
n tp

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 8 Arithmetic Pipeline

ARITHMETIC PIPELINE
Floating-point adder Exponents
a b
Mantissas
A B
X = A x 2a
Y = B x 2b R R

[1] Compare the exponents Compare Difference

Segment 1: exponents
[2] Align the mantissa by subtraction
[3] Add/sub the mantissa
[4] Normalize the result
R

Segment 2: Choose exponent Align mantissa

Segment 3: Add or subtract

mantissas

R R

Segment 4: Adjust Normalize

exponent result

R R

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 9 Instruction Pipeline

INSTRUCTION CYCLE
Six Phases* in an Instruction Cycle
[1] Fetch an instruction from memory
[2] Decode the instruction
[3] Calculate the effective address of the operand
[4] Fetch the operands from memory
[5] Execute the operation
[6] Store the result in the proper place

* Some instructions skip some phases

* Effective address calculation can be done in
the part of the decoding phase
* Storage of the operation result into a register
is done automatically in the execution phase

==> 4-Stage Pipeline

[1] FI: Fetch an instruction from memory

[2] DA: Decode the instruction and calculate
the effective address of the operand
[3] FO: Fetch the operand
[4] EX: Execute the operation

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 10 Instruction Pipeline

INSTRUCTION PIPELINE

Execution of Three Instructions in a 4-Stage Pipeline

Conventional

i FI DA FO EX

i+1 FI DA FO EX

i+2 FI DA FO EX

Pipelined

i FI DA FO EX
i+1 FI DA FO EX
i+2 FI DA FO EX

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 11 Instruction Pipeline

INSTRUCTION EXECUTION IN A 4-STAGE PIPELINE

Segment1: Fetch instruction

from memory

Decode instruction
Segment2: and calculate
effective address

yes Branch?
no
Fetch operand
Segment3: from memory

Segment4: Execute instruction

Interrupt yes
Interrupt?
handling
no
Update PC

Empty pipe
Step: 1 2 3 4 5 6 7 8 9 10 11 12 13
Instruction 1 FI DA FO EX
2 FI DA FO EX
(Branch) 3 FI DA FO EX
4 FI FI DA FO EX
5 FI DA FO EX
6 FI DA FO EX
7 FI DA FO EX

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 12 RISC Pipeline

RISC PIPELINE
RISC
- Machine with a very fast clock cycle that
executes at the rate of one instruction per cycle
<- Simple Instruction Set
Fixed Length Instruction Format
Register-to-Register Operations

Instruction Cycles of Three-Stage Instruction Pipeline

Data Manipulation Instructions
I: Instruction Fetch
A: Decode, Read Registers, ALU Operations
E: Write a Register

Load and Store Instructions

I: Instruction Fetch
A: Decode, Evaluate Effective Address
E: Register-to-Memory or Memory-to-Register

Program Control Instructions

I: Instruction Fetch
A: Decode, Evaluate Branch Address
E: Write Register(PC)
Computer Organization Computer Architectures Lab
Pipelining and Vector Processing 13 RISC Pipeline

DELAYED LOAD
LOAD: R1  M[address 1]
LOAD: R2  M[address 2]
ADD: R3  R1 + R2
STORE: M[address 3]  R3
Three-segment pipeline timing
Pipeline timing with data conflict

clock cycle 1 2 3 4 5 6
Load R1 I A E
Load R2 I A E
Add R1+R2 I A E
Store R3 I A E

Pipeline timing with delayed load

clock cycle 1 2 3 4 5 6 7
Load R1 I A E
The data dependency is taken
Load R2 I A E care by the compiler rather
NOP I A E than the hardware
Add R1+R2 I A E
Store R3 I A E

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 14 RISC Pipeline

DELAYED BRANCH
Compiler analyzes the instructions before and after
the branch and rearranges the program sequence by
inserting useful instructions in the delay steps

Using no-operation instructions

Clock cycles: 1 2 3 4 5 6 7 8 9 10
1. Load I A E
2. Increment I A E
3. Add I A E
4. Subtract I A E
5. Branch to X I A E
6. NOP I A E
7. NOP I A E
8. Instr. in X I A E

Rearranging the instructions

Clock cycles: 1 2 3 4 5 6 7 8
1. Load I A E
2. Increment I A E
3. Branch to X I A E
4. Add I A E
5. Subtract I A E
6. Instr. in X I A E

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 15 Vector Processing

VECTOR PROCESSING
Vector Processing Applications
• Problems that can be efficiently formulated in terms of vectors
– Long-range weather forecasting
– Petroleum explorations
– Seismic data analysis
– Medical diagnosis
– Aerodynamics and space flight simulations
– Artificial intelligence and expert systems
– Mapping the human genome
– Image processing

Vector Processor (computer)

Ability to process vectors, and related data structures such as matrices
and multi-dimensional arrays, much faster than conventional computers

Vector Processors may also be pipelined

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 16 Vector Processing

VECTOR PROGRAMMING

DO 20 I = 1, 100
20 C(I) = B(I) + A(I)

Conventional computer

Initialize I = 0
20 Read A(I)
Read B(I)
Store C(I) = A(I) + B(I)
Increment I = i + 1
If I  100 goto 20

Vector computer

C(1:100) = A(1:100) + B(1:100)

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 17 Vector Processing

VECTOR INSTRUCTION FORMAT

Vector Instruction Format

Operation Base address Base address Base address Vector
code source 1 source 2 destination length

Pipeline for Inner Product

Source
A

Source Multiplier Adder

B pipeline pipeline

Computer Organization Computer Architectures Lab

Pipelining and Vector Processing 18 Vector Processing

MULTIPLE MEMORY MODULE AND INTERLEAVING

Multiple Module Memory

Address bus
M0 M1 M2 M3

AR AR AR AR

Memory Memory Memory Memory

array array array array

DR DR DR DR

Data bus

Address Interleaving

Different sets of addresses are assigned to

different memory modules

Computer Organization Computer Architectures Lab

Scribid ACA Important Topics With Answers
No ratings yet
Scribid ACA Important Topics With Answers
57 pages
Arithmatic Pipline Unit-3 (1)
No ratings yet
Arithmatic Pipline Unit-3 (1)
27 pages
Contact Session 8
No ratings yet
Contact Session 8
63 pages
Advanced Computer Architecture: Pipelined Processor
No ratings yet
Advanced Computer Architecture: Pipelined Processor
20 pages
Arithmatic Pipline Unit-3
No ratings yet
Arithmatic Pipline Unit-3
27 pages
CH7-Parallel and Pipelined Processing
No ratings yet
CH7-Parallel and Pipelined Processing
23 pages
6. Pipeline -3117 (1)
No ratings yet
6. Pipeline -3117 (1)
22 pages
Coa, Unit v, Notes
No ratings yet
Coa, Unit v, Notes
26 pages
Pipelining and Vector Processing Chapter 9
100% (6)
Pipelining and Vector Processing Chapter 9
29 pages
Coa Mod 4 5
No ratings yet
Coa Mod 4 5
91 pages
3.2 Pipeline Processing
No ratings yet
3.2 Pipeline Processing
18 pages
Important Questions Solution Coa Unit 5
No ratings yet
Important Questions Solution Coa Unit 5
8 pages
cao-unit-6
No ratings yet
cao-unit-6
21 pages
COA-UNIT-5
No ratings yet
COA-UNIT-5
20 pages
Presentation 5156 Content Document 20250301102853AM
No ratings yet
Presentation 5156 Content Document 20250301102853AM
40 pages
32 Hazards in Pipeline 06-04-2023
No ratings yet
32 Hazards in Pipeline 06-04-2023
24 pages
UNIT-5: Pipeline and Vector Processing
No ratings yet
UNIT-5: Pipeline and Vector Processing
63 pages
Pipeline & Parallel Processing
No ratings yet
Pipeline & Parallel Processing
19 pages
FINAL Presentation
No ratings yet
FINAL Presentation
31 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
37 pages
Unit 6 COA
No ratings yet
Unit 6 COA
37 pages
UNIT-4_Pipelining & Parallel processing
No ratings yet
UNIT-4_Pipelining & Parallel processing
34 pages
Unit 5-2 COA
No ratings yet
Unit 5-2 COA
52 pages
Unit-6 Pipelining
No ratings yet
Unit-6 Pipelining
63 pages
Lecture 8 Unit 4 Pipeline and Vector Processing 2019
No ratings yet
Lecture 8 Unit 4 Pipeline and Vector Processing 2019
36 pages
Coa Module 5
No ratings yet
Coa Module 5
10 pages
Pipelining 2
No ratings yet
Pipelining 2
43 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
72 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
39 pages
Pipelining
No ratings yet
Pipelining
33 pages
chapter9pipelining-200907163859
No ratings yet
chapter9pipelining-200907163859
13 pages
Pipelining and Vector Processing: - Parallel
No ratings yet
Pipelining and Vector Processing: - Parallel
37 pages
CPS Plus 7.2 Start-Up User Guide - EN - 6802974C10 - BH
No ratings yet
CPS Plus 7.2 Start-Up User Guide - EN - 6802974C10 - BH
90 pages
Unit-5 Computer Organization Notes
No ratings yet
Unit-5 Computer Organization Notes
16 pages
1.4-Parallel Computer Architecture
No ratings yet
1.4-Parallel Computer Architecture
22 pages
COA DR MVN 5 UNIT - Latest PDF
No ratings yet
COA DR MVN 5 UNIT - Latest PDF
24 pages
Pipeline and Vector
No ratings yet
Pipeline and Vector
29 pages
Logcat
No ratings yet
Logcat
61 pages
Pipeline and Vector Processing
83% (12)
Pipeline and Vector Processing
37 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
37 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
37 pages
Service Support Tool Version 1.63E Operation Manual: Revision 0
No ratings yet
Service Support Tool Version 1.63E Operation Manual: Revision 0
87 pages
Chap. 9 Pipeline and Vector Processing
0% (1)
Chap. 9 Pipeline and Vector Processing
12 pages
COA M3 BIT (1)
No ratings yet
COA M3 BIT (1)
4 pages
Unit 5
No ratings yet
Unit 5
23 pages
Windows MultiPoint Server 2012 Deployment Guide2 PDF
No ratings yet
Windows MultiPoint Server 2012 Deployment Guide2 PDF
74 pages
RD3912A10
0% (1)
RD3912A10
13 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
30 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
28 pages
Chapter 5 Pipelining and Vector Processing Modified
No ratings yet
Chapter 5 Pipelining and Vector Processing Modified
37 pages
Ca Unit 2.2
100% (2)
Ca Unit 2.2
22 pages
EIGRP Over The Top Routing (OTP)
No ratings yet
EIGRP Over The Top Routing (OTP)
52 pages
Chap. 9 Pipeline and Vector Processing
No ratings yet
Chap. 9 Pipeline and Vector Processing
16 pages
ASICS Factories - Key Users Training
No ratings yet
ASICS Factories - Key Users Training
50 pages
Chapter 9
No ratings yet
Chapter 9
28 pages
Pipelining Vector Processing
No ratings yet
Pipelining Vector Processing
27 pages
Module 5 Coa
No ratings yet
Module 5 Coa
11 pages
Parallel Processing
No ratings yet
Parallel Processing
32 pages
Antivirus Protection Methods Procedures and Tips
No ratings yet
Antivirus Protection Methods Procedures and Tips
18 pages
ExaCCGen1-UpgradeGridTo19c
No ratings yet
ExaCCGen1-UpgradeGridTo19c
14 pages
Unit-5-Parallel Processing
No ratings yet
Unit-5-Parallel Processing
11 pages
Geo SCADA Expert Performance Guidelines
No ratings yet
Geo SCADA Expert Performance Guidelines
12 pages
Dokumen - Tips - SwOS (MikroTik Switch OS) Administration Guide
No ratings yet
Dokumen - Tips - SwOS (MikroTik Switch OS) Administration Guide
22 pages
Information Technology Infrastructure: Vu Quang Nguyen
No ratings yet
Information Technology Infrastructure: Vu Quang Nguyen
12 pages
TP Limit Switch
No ratings yet
TP Limit Switch
11 pages
Chapter 7
No ratings yet
Chapter 7
26 pages
Chapter4 (Lect 41-44 Micro Operations)
No ratings yet
Chapter4 (Lect 41-44 Micro Operations)
37 pages
lastException_63872174381
No ratings yet
lastException_63872174381
13 pages
Parallel Processing
No ratings yet
Parallel Processing
33 pages
Basic SQL: Section 4.1-4.7
No ratings yet
Basic SQL: Section 4.1-4.7
30 pages
Nas DS2423+
No ratings yet
Nas DS2423+
10 pages
MIT App Inventor
100% (1)
MIT App Inventor
29 pages
Poor Man's Computing Revisited: Alexander Shchepetkin, I.G.P.P. UCLA
No ratings yet
Poor Man's Computing Revisited: Alexander Shchepetkin, I.G.P.P. UCLA
12 pages
Handeling Telephone Calls
No ratings yet
Handeling Telephone Calls
11 pages
AFL2-12A-HM65 UMN v1.00
No ratings yet
AFL2-12A-HM65 UMN v1.00
186 pages
DSL-2520U+D1+RU1 (1) .00+manual 20071211
No ratings yet
DSL-2520U+D1+RU1 (1) .00+manual 20071211
43 pages
Install Coovachilli On Ubuntu 20.04
No ratings yet
Install Coovachilli On Ubuntu 20.04
8 pages
Top 100 Networking Interview Questions
100% (1)
Top 100 Networking Interview Questions
26 pages
CSO Lecture Notes Unit - 5
No ratings yet
CSO Lecture Notes Unit - 5
11 pages
SC 8000
No ratings yet
SC 8000
12 pages
Relational Calculus: Database Management Systems, R. Ramakrishnan 1
No ratings yet
Relational Calculus: Database Management Systems, R. Ramakrishnan 1
17 pages
Chapter 17 (Lect 48 and Micro Programmed Control Intro.)
No ratings yet
Chapter 17 (Lect 48 and Micro Programmed Control Intro.)
15 pages
Chapter 12: File System Implementation
No ratings yet
Chapter 12: File System Implementation
11 pages
Stack Computers: The New Wave
From Everand
Stack Computers: The New Wave
Philip Koopman
No ratings yet
William Stallings Computer Organization and Architecture 6 Edition Reduced Instruction Set Computers
No ratings yet
William Stallings Computer Organization and Architecture 6 Edition Reduced Instruction Set Computers
14 pages
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
From Everand
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
Derek Molloy
4/5 (2)
Q1) Sort The Given List Using Heap Sort Technique:: AVL Tree Operations
No ratings yet
Q1) Sort The Given List Using Heap Sort Technique:: AVL Tree Operations
12 pages
Cisco Vs Juniper Commands
No ratings yet
Cisco Vs Juniper Commands
7 pages
Chapter 3 - Pipelining-And-Vector-Processing
100% (1)
Chapter 3 - Pipelining-And-Vector-Processing
29 pages
Usbbdm
No ratings yet
Usbbdm
11 pages
Basic Computer Organization and Design
No ratings yet
Basic Computer Organization and Design
9 pages
Tutorial PC Manager EN-US
No ratings yet
Tutorial PC Manager EN-US
6 pages
Addressing Modes
No ratings yet
Addressing Modes
6 pages
Addressing Modes
No ratings yet
Addressing Modes
6 pages
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
Lovely Professional University Homework: #3: CAP205 2009-12 MCA 210111 TB903
No ratings yet
Lovely Professional University Homework: #3: CAP205 2009-12 MCA 210111 TB903
5 pages
Dumpstate
No ratings yet
Dumpstate
2 pages
Recovery Mode Update For Yealink IP Phones
No ratings yet
Recovery Mode Update For Yealink IP Phones
5 pages
Pipeline and Vector Processing
100% (1)
Pipeline and Vector Processing
18 pages
h6811 Datadomain Ds
No ratings yet
h6811 Datadomain Ds
5 pages
WB 2
No ratings yet
WB 2
2 pages
Memory Hierarchy
No ratings yet
Memory Hierarchy
4 pages
Insertion Sort
No ratings yet
Insertion Sort
5 pages
Computer Hardware Quiz
No ratings yet
Computer Hardware Quiz
5 pages