CA Classes-116-120

The document discusses the execution of load and store instructions in computer architecture. It describes the subtasks involved in executing load and store instructions, as well as different approaches for processing load/store instructions sequentially or in parallel with other instructions.

Computer Architecture Unit 5

Let us first consider a load instruction. Its execution begins with the
determination of the effective memory address (EA) from which data is to
be fetched. In straightforward cases, such as RISC processors, this can be done
in two steps: fetching the referenced address register(s) and calculating the
effective address. For CISC processors, however, address calculation may
be a complex task, requiring multiple subsequent register fetches and
address calculations, as, for instance, in the case of indexed, post-
incremented, or relative addressing. Once the effective address is available,
the next step is usually to forward the effective (virtual) address to the MMU
for translation and to access the data cache. Here, and in the subsequent
discussion, we shall not go into details of whether the referenced cache is
physically or virtually addressed, and thus we neglect the corresponding
issues. Furthermore, we assume that the referenced data is available in the
cache and thus it is fetched in one or a few cycles. Usually, fetched data is
made directly available to the requesting unit, such as the FX or FP unit,
through bypassing. Finally, the last subtask to be performed is writing the
accessed data into the specified register.
For a store instruction, the address calculation phase is identical to that
already discussed for loads. However, subsequently both the virtual address
and the data to be stored can be sent out in parallel to the MMU and the
cache, respectively. This concludes the processing of the store instruction.
Figure 5.8 shows the subtasks involved in executing load and store
instructions.

Figure 5.8: Subtasks of Executing Load and Store Instructions
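The subtasks described above can be sketched as a small simulation. This is a minimal illustrative sketch in Python, not a real ISA model: the register file, TLB, and cache are hypothetical dictionary stand-ins, and a cache/TLB hit is assumed throughout, as in the text.

```python
# Minimal sketch of the load/store subtasks described above.
# regs, tlb and cache are illustrative stand-ins, not a real ISA model.

regs = {"r1": 0x1000, "r2": 0}    # architectural register file
tlb = {0x1000: 0x8000}            # MMU: virtual -> physical translation
cache = {0x8000: 42}              # data cache, keyed by physical address

def execute_load(dst, base_reg, offset=0):
    ea = regs[base_reg] + offset  # 1. fetch address register, compute EA
    pa = tlb[ea]                  # 2. forward virtual EA to the MMU
    data = cache[pa]              # 3. access the data cache (hit assumed)
    regs[dst] = data              # 4. write fetched data into the register
    return data

def execute_store(src_reg, base_reg, offset=0):
    ea = regs[base_reg] + offset  # address calculation, identical to loads
    pa = tlb[ea]                  # the address and the data can then be sent
    cache[pa] = regs[src_reg]     # to the MMU and the cache in parallel

execute_load("r2", "r1")          # r2 now holds the cached value 42
```

Note how the store has no write-back subtask: once the address and data have been sent out, its processing is complete, exactly as the text states.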

Manipal University Jaipur B1648 Page No. 116



5.5.2 The design space


While considering the design space of pipelined load/store processing, we
take into account only one aspect: whether load/store operations
are executed sequentially or in parallel with FX instructions (Figure 5.9).
In traditional pipeline implementations, load and store instructions are
processed by the master pipeline. Thus, loads and stores are executed
sequentially with other instructions (Figure 5.9).

Figure 5.9: Sequential vs. Parallel Execution of Load/Store Instructions

In this case, the required address calculation of a load/store instruction can
be performed by the adder of the execution stage. However, one instruction
slot is needed for each load or store instruction.


A more effective technique for load/store instruction processing is to do it in
parallel with data manipulations (see again Figure 5.9). Obviously, this
approach assumes the existence of an autonomous load/store unit which
can perform address calculations on its own.
Let’s discuss both these techniques in detail.
5.5.3 Sequential consistency of instruction execution
When a processor operates multiple EUs (execution units) in parallel,
instruction execution can finish very quickly. However, instruction execution
must still maintain sequential consistency, which has two aspects:
1. Processor consistency - the order in which instructions complete;
2. Memory consistency - the order in which memory is accessed.
Processor consistency: The phrase processor consistency refers to the
consistency of instruction completion with sequential instruction execution.
Superscalar processors exhibit two types of processor consistency, namely
weak and strong consistency.
Weak processor consistency allows instructions to complete out of program
order, provided that no data dependencies are violated. Data dependencies
must therefore be detected and resolved during execution.
Strong processor consistency forces instructions to complete in program
order. This can be attained through a ROB (reorder buffer), a storage area
through which results are written back in program order.
Memory consistency: Another aspect of superscalar instruction execution is
whether memory accesses are performed in the same order as in a
sequential processor.
Memory consistency is weak if memory accesses may occur out of the strict
sequential program order, provided that data dependencies are not violated.
Simply stated, weak consistency permits load and store reordering, as long
as memory data dependencies are detected and resolved.
Memory consistency is strong if memory accesses occur strictly in program
order, i.e., load/store reordering is prohibited.
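The weak-consistency rule above can be made concrete with a small check: reordering of memory accesses is legal as long as accesses to the same address (of which at least one is a store) keep their relative program order. This is a hedged sketch; the access sequences are invented examples, not taken from the text.

```python
# Sketch: does an observed memory-access order respect the data
# dependencies of the program order? Accesses are (kind, address) pairs.

def dependencies_preserved(program_order, observed_order):
    for i, a in enumerate(program_order):
        for b in program_order[i + 1:]:
            same_addr = a[1] == b[1]
            has_store = a[0] == "store" or b[0] == "store"
            if same_addr and has_store:
                # dependent pair: relative order must match program order
                if observed_order.index(a) > observed_order.index(b):
                    return False
    return True

program = [("store", "A"), ("load", "B"), ("load", "A")]
# Hoisting the independent load of B is a legal weak-consistency reordering:
assert dependencies_preserved(
    program, [("load", "B"), ("store", "A"), ("load", "A")])
# Moving the load of A above the store of A breaks a data dependence:
assert not dependencies_preserved(
    program, [("load", "A"), ("store", "A"), ("load", "B")])
```

Under strong memory consistency, only the identity ordering (observed order equal to program order) would be accepted.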


Load and Store reordering


Load and store instructions affect both the processor and the memory.
First, the ALU or a dedicated address unit computes the addresses; then the
load and store instructions are executed.
A load fetches the referenced data from the data cache once its address is
available; similarly, once the generated address is received, a store
instruction can send out its operand.
A processor implementing weak memory consistency permits memory
access reordering. This is advantageous for three reasons:
1. it permits load/store bypassing;
2. it makes speculative loads and stores feasible;
3. it allows cache misses to be hidden.
Load/Store bypassing
Load/store bypassing means that either loads can bypass pending stores or
vice versa, without violating memory data dependencies. Allowing loads to
bypass stores has the advantage of enabling the runtime overlapping of loop
iterations: loads at the beginning of an iteration can access memory without
having to wait until the stores at the end of the previous iteration are
finished. To prevent fetching a stale data value, a load may bypass pending
stores only if none of the preceding stores has the same target address as
the load. However, the addresses of some pending stores may not yet be
available.
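The bypass condition just stated can be expressed as a short predicate. This is a minimal sketch under the text's assumptions: a pending store whose address is still unknown is represented as `None`, and the load is then conservatively blocked.

```python
# Sketch of the load-bypass check described above: a load may bypass
# the pending stores only if no earlier store targets the same address
# and no earlier store address is still unknown (None).

def load_may_bypass(load_addr, pending_store_addrs):
    for store_addr in pending_store_addrs:
        if store_addr is None:        # store address not yet computed:
            return False              # conservatively block the load
        if store_addr == load_addr:   # same target address: true dependence
            return False
    return True

assert load_may_bypass(0x100, [0x200, 0x300])        # no conflict: bypass
assert not load_may_bypass(0x100, [0x200, 0x100])    # address match: wait
assert not load_may_bypass(0x100, [0x200, None])     # unknown address: wait
```

Speculative loads, discussed next, relax exactly the last case: instead of blocking on an unknown store address, the load proceeds and is checked later.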
Speculative loads
Speculative loads avoid memory access delays that would otherwise be
caused by addresses that have not yet been computed or by clashes among
addresses. Speculative loads must be checked for correctness and, if
necessary, corrective measures taken; in this respect they resemble
speculative branches.
To perform the address check, the computed target addresses of loads and
stores are written into the ROB (reorder buffer), where the address
comparison is carried out.
Reorder buffer (ROB)
The ROB was introduced in 1988 as a solution to the precise interrupt
problem. Today, the ROB is the mechanism that ensures sequentially
consistent execution when multiple EUs operate in parallel.

The ROB is a circular buffer with head and tail pointers. Instructions enter
the ROB in program order only, and an instruction can retire only if it has
finished and all previous instructions have already retired.
Sequential consistency is maintained by having instructions update the
program state, i.e., write their results into memory or the referenced
architectural register(s), in proper program order. The ROB can thus
successfully support both precise interrupt handling and speculative
execution.
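The circular-buffer behaviour just described can be sketched as follows. This is an illustrative model only, with invented field names; a real ROB also tracks destination registers, results, and exception status.

```python
# Minimal circular-buffer ROB sketch: instructions enter at the tail in
# program order and retire from the head only when they have finished
# and all earlier entries have already retired.

class ReorderBuffer:
    def __init__(self, size):
        self.entries = [None] * size   # circular storage
        self.head = 0                  # oldest instruction (next to retire)
        self.tail = 0                  # next free slot
        self.count = 0

    def dispatch(self, instr):
        """Enter an instruction at the tail, in program order."""
        assert self.count < len(self.entries), "ROB full"
        self.entries[self.tail] = {"instr": instr, "finished": False}
        self.tail = (self.tail + 1) % len(self.entries)
        self.count += 1

    def mark_finished(self, instr):
        """Record out-of-order completion of an instruction."""
        for e in self.entries:
            if e and e["instr"] == instr:
                e["finished"] = True

    def retire(self):
        """Retire finished instructions strictly from the head."""
        retired = []
        while self.count and self.entries[self.head]["finished"]:
            retired.append(self.entries[self.head]["instr"])
            self.entries[self.head] = None
            self.head = (self.head + 1) % len(self.entries)
            self.count -= 1
        return retired
```

For example, if instruction i2 finishes before i1, `retire()` returns nothing until i1 also finishes, after which both retire in program order; this is precisely how the ROB turns out-of-order completion into in-order state update.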
5.5.4 Instruction Issuing and parallel execution
In this phase, execution tuples are created, and it is then decided which
execution tuples can be issued. Checking the availability of data and
resources at run time is known as instruction issuing. From the instruction-
issue area, multiple pipelines are fed.
In figure 5.10 you can see a reorder buffer which follows FIFO order.

Figure 5.10: A Reorder Buffer.

In this buffer, entries are received and released in FIFO order. An instruction
can be executed as soon as its input operands are available; other
instructions may still be waiting in the issue stage.
Other constraints are associated with the buffers carrying the execution
tuples. In figure 5.11 you can see the Parallel Execution Schedule (PES) of
