PS5

The document outlines Problem Set #5 for a Computer Architecture course, detailing ten problems related to various concepts such as cache coherence protocols, atomic operations, and network bandwidth calculations. Each problem has specific points assigned and requires students to apply their knowledge of computer architecture principles. The problems range from implementing atomic operations to analyzing multi-threaded programs and network performance metrics.

Uploaded by

genopsyism

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views3 pages

PS5

Uploaded by

genopsyism

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Computer Architecture

ELE 475
Fall 2014
Problem Set #5
Total Points: 100

Problem #1 (10 Points): Page 254 in H&P5, Problem 3.13

For problem #1: Assume that 3 stall cycles after load means load has a latency of 4.

Problem #2 (10 Points): Assume that your architecture has a test-and-set instruction as its only atomic
primitive. Implement atomic compare-and-exchange out of the test-and-set primitive.

Problem #3 (10 Points): List the possible sequentially consistent outcomes for the variables i and j after
the completion of executing the three threads T1, T2, and T3. Assume that all threads begin executing
after ‘i’ has been set to 9 and ‘j’ is set to 10.

T1: T2: T3:

ADDI R1, R0, 30 ADDI R5, R0, 99 ADD R8, R0, 100
SW R1, 0(i) LW R6, 0(j) SW R8, 0(i)
LW R2, 0(j) ADD R7, R5, R6
SW R2, 0(j) SW R7, 0(j)

Problem # 4 (10 Points): You are writing a multi-threaded program that will count the number of
occurrences of a value in an array. The values in the array are between 0 and 1023. In effect, you will
be building a histogram. Assume that the list of numbers is very large, on the order of gigabytes large.
Extend the following program such that 100 threads (processors) can execute on the program
concurrently. Assume a sequentially consistent memory model. Add P() and V() semaphores where
appropriate and add any storage needed for the semaphores. Explain why the speedup of such a
solution may not be 100x. Note that the output lock array is assumed to be initialized to 1 (this allows
for a mutex).

// Sequential code, assume that the input and output arrays are created
// outside of the function
#define MAX_VALUE 1023
function(int input_array_size, int * input_array, int * output_array)
{
int counter;
for(counter = 0; counter < input_array_size; counter++)
{
assert(input_array[counter] <= MAX_VALUE);
assert(input_array[counter] >= 0);
output_array[input_array[counter]]++;
}
}

1
Problem # 5 (10 Points): Show for each cache line and cache what state it is in on every cycle assuming
three processors executing code as interleaved below. Assume a 64-byte cache line block size. Assume
all cores contain a direct mapped cache that is 4KB large. First, assume that the processors are using a
snoopy MSI cache coherence protocol. Second, repeat this for a MESI protocol.

Time P1: P2: P3:

1 LW R1, 4(R0)
2 LW R1, 16(R0)
3 LW R1, 4(R0)
4 SW R2, 100(R0)
5 LW R4, 104(R0)
6 LW R3, 100(R0)
7 SW R1, 0(R0)
8 LW R1, 4100(R0)
9 SW R2, 4100(R0)
10 SW R3, 4100(R0)
11 SW R5, 0(R0)

Problem #6 (10 Points): Calculate the bisection bandwidth for a 4-ary 3-cube without end-around, but
where each link is 32-bits wide and clocks at 800MHz. Calculate the bisection bandwidth of an 8-node
omega network with 64-bit links that clock at 1.2GHz.

Problem #7 (10 Points): How large of a credit counter is needed to provide full bandwidth on a link
where the link has one cycle for routing delay, two cycles for link delay, and the return credit takes two
cycles? What is the bandwidth as a proportion of the maximum if the credit size is two smaller than the
needed number?

Problem #8 (10 Points): Assume that a message is routed on a 2D dimension-ordered network that is 4
by 4. Assume that the link delay is one cycle and that the router delay in each hop is two cycles.
Assume that each link is one byte wide. Assume that the flit length is 4 bytes and the phit size is one
byte. How many cycles does it take to send a 32-byte message from location (0,0) to location (2,3)
assuming no insertion or destination delay assuming that the architecture implements store-and-
forward? Repeat assuming that the network is a wormhole/cut-through switched network.

Problem Set continued on next page

2
Problem #9 (10 Points): Show for each cache line, cache, and directory controller what state it is on
every load/store. Assume that the code is executing on three processors as interleaved below. Assume
that there is one centralized directory. Also, draw the share list that exists in the directory. Assume a
64-byte cache line block size. Assume all cores contain a direct mapped cache that is 4KB large. Assume
that a MSI protocol is used in the caches and a ESU protocol is used at the directory.

Time P1: P2: P3:

1 LW R1, 4(R0)
2 LW R1, 16(R0)
3 LW R1, 4(R0)
4 SW R2, 100(R0)
5 LW R4, 104(R0)
6 LW R3, 100(R0)
7 SW R1, 0(R0)
8 LW R1, 4100(R0)
9 SW R2, 4100(R0)
10 SW R3, 4100(R0)
11 SW R5, 0(R0)

Problem # 10 (10 Points): Page 420 in H&P5, Problem 5.11

Main Sol Midterm
No ratings yet
Main Sol Midterm
21 pages
Boolean Truth Table
No ratings yet
Boolean Truth Table
42 pages
Ccn-Matlab Soft
No ratings yet
Ccn-Matlab Soft
109 pages
(Common To Cse, It, Ai&Ml, DS) : Computer Networ Web Technology Laboratory Manual
No ratings yet
(Common To Cse, It, Ai&Ml, DS) : Computer Networ Web Technology Laboratory Manual
138 pages
User Manual LG 32LB5700
No ratings yet
User Manual LG 32LB5700
239 pages
HYUNDAI Placement Paper 2011
100% (3)
HYUNDAI Placement Paper 2011
14 pages
RCP8 Users Manual
No ratings yet
RCP8 Users Manual
259 pages
Sample Questions 2019 Test Code PCB (Short Answer Type)
No ratings yet
Sample Questions 2019 Test Code PCB (Short Answer Type)
24 pages
written_asst2
No ratings yet
written_asst2
27 pages
Ee547 (B) Assignment 1
No ratings yet
Ee547 (B) Assignment 1
11 pages
DS At32f403a V2.01 en
No ratings yet
DS At32f403a V2.01 en
88 pages
Networks
No ratings yet
Networks
30 pages
Komatsu 7codes
100% (4)
Komatsu 7codes
3 pages
IP in The ISAM
100% (1)
IP in The ISAM
62 pages
CDJ 9000 Nxs
No ratings yet
CDJ 9000 Nxs
158 pages
final
No ratings yet
final
20 pages
HCT222 - 22computer Architecture and Organization 2021 July Test1
No ratings yet
HCT222 - 22computer Architecture and Organization 2021 July Test1
6 pages
Chapter 05
No ratings yet
Chapter 05
19 pages
Csci 343 - Summer 2013 - Exam 1
No ratings yet
Csci 343 - Summer 2013 - Exam 1
6 pages
CAT - E601 - Split Body Ball Valve - 2020 PDF
100% (1)
CAT - E601 - Split Body Ball Valve - 2020 PDF
28 pages
CENG400-Final-Fall 2015
No ratings yet
CENG400-Final-Fall 2015
10 pages
sem3_pyqs
No ratings yet
sem3_pyqs
25 pages
CS3001-Fall2023-Mid-II - Solution - V2.2 Final
No ratings yet
CS3001-Fall2023-Mid-II - Solution - V2.2 Final
6 pages
Solutions Assignment4 Ceg3185 2014w
No ratings yet
Solutions Assignment4 Ceg3185 2014w
8 pages
Department of Computer Science & Engineering CSL718 Architecture of High Performance Systems Major Test Solution
No ratings yet
Department of Computer Science & Engineering CSL718 Architecture of High Performance Systems Major Test Solution
8 pages
Final Winter 2004
No ratings yet
Final Winter 2004
5 pages
CSGC 342
No ratings yet
CSGC 342
7 pages
CA PDF
No ratings yet
CA PDF
10 pages
Ael ZG626 Ec-2r First Sem 2023-2024
No ratings yet
Ael ZG626 Ec-2r First Sem 2023-2024
6 pages
ASSIGNMENT-SOLUTION-WEEK8
No ratings yet
ASSIGNMENT-SOLUTION-WEEK8
3 pages
Cao 2021 HW2
No ratings yet
Cao 2021 HW2
4 pages
HPC-bio-final-fall-2023
No ratings yet
HPC-bio-final-fall-2023
2 pages
Cambridge University - Computer Science Tripos - y2023PAPER5
No ratings yet
Cambridge University - Computer Science Tripos - y2023PAPER5
9 pages
Coa Applied
No ratings yet
Coa Applied
13 pages
Computer Architecture: Ph.D. Qualifiers Examination - Sample Questions
No ratings yet
Computer Architecture: Ph.D. Qualifiers Examination - Sample Questions
2 pages
COSS_2022-23 question paper
No ratings yet
COSS_2022-23 question paper
6 pages
Unit 2: Architecture of Microprocessor
No ratings yet
Unit 2: Architecture of Microprocessor
50 pages
350 Exam 2 Spring 2024
No ratings yet
350 Exam 2 Spring 2024
7 pages
Instructions: Csce 212: Final Exam Spring 2009
No ratings yet
Instructions: Csce 212: Final Exam Spring 2009
5 pages
Dchuynh HW4
No ratings yet
Dchuynh HW4
5 pages
ps2
No ratings yet
ps2
2 pages
National University of Computer and Emerging Sciences, Lahore Campus
No ratings yet
National University of Computer and Emerging Sciences, Lahore Campus
9 pages
CMSC 417 Midterm #1 (Fall 2001) - Solution
No ratings yet
CMSC 417 Midterm #1 (Fall 2001) - Solution
3 pages
finalsol
No ratings yet
finalsol
9 pages
Solutions: 18-742 Advanced Computer Architecture
No ratings yet
Solutions: 18-742 Advanced Computer Architecture
8 pages
School of Physics, Engineering and Technology: The Statement of Assessment
No ratings yet
School of Physics, Engineering and Technology: The Statement of Assessment
3 pages
JLN-205Mk2 Instruction Manual PDF
No ratings yet
JLN-205Mk2 Instruction Manual PDF
92 pages
Ca Model QB
No ratings yet
Ca Model QB
4 pages
PS1
No ratings yet
PS1
1 page
CS & IT CGPDTM Mains
No ratings yet
CS & IT CGPDTM Mains
6 pages
QP4_BRN32
No ratings yet
QP4_BRN32
7 pages
Next Generation Technology
No ratings yet
Next Generation Technology
4 pages
Exercise 1 - Introduction To Embedded Systems
No ratings yet
Exercise 1 - Introduction To Embedded Systems
3 pages
Quiz Questions
No ratings yet
Quiz Questions
2 pages
2005 Computer Architecture Solutions
No ratings yet
2005 Computer Architecture Solutions
11 pages
ACN Model Question
0% (1)
ACN Model Question
3 pages
Sample Solutions: TH TH TH
No ratings yet
Sample Solutions: TH TH TH
6 pages
National University of Computer and Emerging Sciences, Lahore Campus
No ratings yet
National University of Computer and Emerging Sciences, Lahore Campus
10 pages
DDL Commands
No ratings yet
DDL Commands
13 pages
Number (I) Max (Number (0), Number (1),, Number (N - 1) ) +1 J 0 J N J++) (Number (J) ! 0) &&
No ratings yet
Number (I) Max (Number (0), Number (1),, Number (N - 1) ) +1 J 0 J N J++) (Number (J) ! 0) &&
3 pages
Questions
No ratings yet
Questions
2 pages
Datasheet TFT Monitor Okuma Control
No ratings yet
Datasheet TFT Monitor Okuma Control
3 pages
ADC0820
No ratings yet
ADC0820
22 pages
Nintendo Age Ezine #5
No ratings yet
Nintendo Age Ezine #5
13 pages
Illinois Exam2 Practice Solfa08
No ratings yet
Illinois Exam2 Practice Solfa08
4 pages
CS Unit 1 EoT Exam Solutions
No ratings yet
CS Unit 1 EoT Exam Solutions
5 pages
Midterm 11 So Ls
No ratings yet
Midterm 11 So Ls
7 pages
cn1
No ratings yet
cn1
4 pages
15IF11 Multicore E PDF
No ratings yet
15IF11 Multicore E PDF
14 pages
Funtoo Linux Installation - Funtoo Linux
No ratings yet
Funtoo Linux Installation - Funtoo Linux
16 pages
COMSATS University Islamabad, Wah Campus: Final (Fall 2020)
100% (1)
COMSATS University Islamabad, Wah Campus: Final (Fall 2020)
3 pages
PHD Comprehensive Examination Department of Computer Science & Engineering
No ratings yet
PHD Comprehensive Examination Department of Computer Science & Engineering
5 pages
Pulsar Wiper Relay Location
100% (1)
Pulsar Wiper Relay Location
9 pages
ECE 341 Final Exam Solution: Problem No. 1 (10 Points)
No ratings yet
ECE 341 Final Exam Solution: Problem No. 1 (10 Points)
9 pages
EBB v21 SCH
No ratings yet
EBB v21 SCH
1 page
Brochure Geomagic Design X Software
No ratings yet
Brochure Geomagic Design X Software
4 pages
Experiment 3: Table 3.1: List of Components Sr. No Name of Components Value Specification 1 2 3 4 5 6 7
No ratings yet
Experiment 3: Table 3.1: List of Components Sr. No Name of Components Value Specification 1 2 3 4 5 6 7
5 pages
HP Pro Tower 400 G9 PCI Desktop PC
No ratings yet
HP Pro Tower 400 G9 PCI Desktop PC
3 pages
NetVR 100 Series Datasheet - tcm841 142502
No ratings yet
NetVR 100 Series Datasheet - tcm841 142502
3 pages
3.0 - BOQ and Pricing File - V2
No ratings yet
3.0 - BOQ and Pricing File - V2
4 pages
Error-Code 0xc0000034 Windows-8 Boot
No ratings yet
Error-Code 0xc0000034 Windows-8 Boot
4 pages
Nova Ii & Nova Iv Receivers
No ratings yet
Nova Ii & Nova Iv Receivers
4 pages
Leviton 47611-5GB 47611-8GB Gigabit Switch
No ratings yet
Leviton 47611-5GB 47611-8GB Gigabit Switch
2 pages
SecureCRT - USB Flash Memory
No ratings yet
SecureCRT - USB Flash Memory
2 pages
Telecommunications Part 1 - General 1.01
No ratings yet
Telecommunications Part 1 - General 1.01
3 pages
Freshers CV
No ratings yet
Freshers CV
2 pages
1993 Eurocopter BK-117-B2 7247
No ratings yet
1993 Eurocopter BK-117-B2 7247
3 pages
CCNA Certification Study Guide Volume 1: Exam 200-301 v1.1
From Everand
CCNA Certification Study Guide Volume 1: Exam 200-301 v1.1
Todd Lammle
5/5 (1)
IP Routing Protocols All-in-one: OSPF EIGRP IS-IS BGP Hands-on Labs
From Everand
IP Routing Protocols All-in-one: OSPF EIGRP IS-IS BGP Hands-on Labs
Redouane MEDDANE
No ratings yet
ROUTING INFORMATION PROTOCOL: RIP DYNAMIC ROUTING LAB CONFIGURATION
From Everand
ROUTING INFORMATION PROTOCOL: RIP DYNAMIC ROUTING LAB CONFIGURATION
Mulayam Singh
No ratings yet