Introduction To FPGA HSIO
Introduction To FPGA HSIO
Introduction To FPGA HSIO
HIGH SPEED IO
“Microsoft has had clear competitors in the past. It’s a good thing we have
museums to document that.” ~ Bill Gates
2
[Courtesy] https://percepticon.wordpress.com/material/open-data/internet-diffusion-and-usage-statistics/
Objective of this Seminar
High Speed I/O Problem Statement
Vocabulary
Techniques
Design Flow with Intel FPGAs
Hands on Lab
3
The World is Going Serial – PC back panel
Old School (90s) Newer School – 2010s
Ethernet
USB
HDMI
SATA
Parallel Interfaces Serial Interfaces
4
Why not Parallel interfaces?
Data
The
The
“Channel”
“Channel”
PCB B
Clock Source
Chip A
tsetup thold
Clock
C C
o o
n n
n n
e
c
e
c
Chip B
PCB A t
o
t
o
r r
Data
Valid
Slow timing transition Different wire lengths
Window
makes meeting data valid
Fast timing transition window difficult
Bit 2
5
Dynamic Phase Alignment (DPA) vs Clock and Data Recovery (CDR)
SERDES with DPA SERDES with DPA
Data
Parallel
Parallel
to
to Serial
Serial TX Clock RX Serial
Serial to
to
Parallel
Parallel
Serializer/Deserializer or SERDES is commonly used to describe both techniques. Intel PSG calls CDR Transceiver
6
Stages of Data transfer
7 6 5 4 3 2 1
7. Application
6. Presentation
5. Session
4. Transport
3. Network
2. Datalink
1. Physical
7
LVDS Dynamic Phase Aligner
This technique is used in many Intel PSG families - caps out at roughly 3 Gbps
LVDS = Low Voltage Differential Swing is a special type of CMOS I/O cell that can run at high data rates
8
Centering the clock with DPA circuitry – 8 tap PLL
D0 D1
Degree
Shift
0
45
90
135
180
225
270
315
9
Rates beyond DPA? CDR – high precision clock phase shift!
Buffer Buffer
Deserializer
Question: Hey where’s the clock? How can you get the data to align across the channel?
11
Clock Data Recovery Circuit
PFD PD Charge Pump & LF VCO
Measure Measure phase Translate between Electronic oscillator
phase/frequency differences between PD/PFD output and whose oscillation is
differences between serial data input phase VCO control voltages. controlled by a
reference clock and and divided output. voltage source.
divided output.
12
Clock Data Recovery Sequence
LOCK TO
REFERENCE
LOCK TO
DATA
13
Phase Detector Circuit
Alexander Phase Detector
Samples serial data in three consecutives clock LEADS: reference clock edge is early with
edges. respect to data edge.
PD determines if the clock leads or lags the data. LAGS: reference clock edge is late with respect
to data edge.
14
Voltage-Controlled Oscillator
15
Challenges in High Speed I/O
Standards of Data Exchange
Commonality in understanding the data
Integrity of Signals
16
Physical Coding Sublayer (PCS) to Physical Media Attach (PMA)
0.5GBps 0.5GBps 0.5GBps 0.5GBps
Fabric PCS PMA
Higher
voltage
0.5GBps 0.5GBps 0.5GBps 5GBps
x10 x10 x10
High speed
÷ (serial) clock
Lower speed (parallel) clock
17
Native PHY L/H Tile UG Pages: 231 – 295 : clocking section
Transmitter Physical Coding Sublayer (PCS)
Transmitter PCS consists of:
Phase Compensation FIFO
– Regulate the availability of data between 2 clock domains
Byte Serializer
– Convert wide parallel data into byte size (narrow parallel data)
– Eg: 16-bit wide into 8-bit wide
– Fast clock and a slow clock (half the speed of fast clock)
Encoder
– Converts information from one format to another for dc-balancing
– Schemes like 8b/10b, 64b/66b
18
How to handle long sequences of zeros and ones in a
row? Answer: Physical Coding Sublayer (PCS)
0 0
0 0
1 1
8 Bit word 1 1 Encoded 10 Bit word
6 One’s and 2 Zero’s 1 1 5 One’s and 5 Zero’s
(1 transition, unbalanced ones 1 1 (2 transitions, balanced
and zeros) 1 1 ones and zeros)
1 0
0
0
19
Coding: 8B10B
20% overhead to add transitions
(vs 100% overhead for
Manchester)
20
Physical Media Attach (PMA) – The Analog world
TX Transmission medium RX
Clean edges +
Boost signal
Recover clock
21
Native PHY L/H Tile UG Pages: 320 - 333
Transmission medium reaches and applications
22
Sources: Intel PAM4 App note and http://www.ethernetalliance.org/wp-content/uploads/2014/10/41014-DRAFT-TEF-56Gbs.pdf
TX – Getting the signal across
Transmission medium
Pre/De Emphasis
Pre-Tap
Post-Taps
23
RX – Signal recovery
RX Equalization – Calibration
Serial
Data
Serial
Data In VCM CTLE VGA DFE
CDR
Boost
Boost
R1
Serial
Clock
R1
C1
R2
R2
GND1
gnd
frequency
24
Native PHY L/H Tile UG Pages: 402 – 411 : Calibration section
Some common High Speed IO Protocols
• PCIe – Serial Computer Expansion bus
• Ethernet - networking
• Interlaken – chip to chip
• CPRI – Common Public Radio Interface – Tower to wireless basestation
• USB – Universal Serial Bus – computer to peripheral connectivity (and so much more)
• HDMI – High Definition Multimedia Interface
+ dozens more!
*supporting so many transceiver protocols makes the design challenging!
25
Coding Standards by Protocol
Standard Line Code
Ethernet 1-10 Mbps Manchester
Ethernet 100 Mbps 4b5b
Ethernet 1Gbps 8b10b
Ethernet 10Gbps 64b66b
Ethernet 40Gbps (4x10) 64b66b
Ethernet 100Gbps (10x10, 4x25) 64b66b
PCIe Gen 1 (2.5 Gbps) 8b10b
PCIe Gen 2 (5 Gbps) 8b10b
PCIe Gen 3 (8 Gbps) 128b/130b
PCIe Gen 4 (16 Gbps) 128b/130b
26
Eye diagram
27
History of Intel PSG Transceivers
Data Rate vs Process Node
Data Rate vs Year 56 56
60
40 28 28 20 11.1
3.19 6.38
20 11.1 0
3.19 6.38 0 130 90 60 40 28 20 14 10
0
2002 2004 2006 2008 2010 2014 2015 2017 Process Node
Year
Stratix History
Memory
Year Node Data Rate Transceivers KLEs 18x18 (Mbit) PLL
28
Transceiver Design Flow
1
optional
29