Cryptography and Data Security
Cryptography and Data Security
Cryptography and Data Security
www.crypto.rub.de
Table of Contents
2 Stream Ciphers 22
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.2 Some Remarks on Random Number Generators . . . . . . . . . . . . . . . . 26
2.3 General Thoughts on Security, One-Time Pad and Practical Stream Ciphers 27
2.4 Synchronous Stream Ciphers . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
i
2.4.1 Linear Feedback Shift Registers (LFSR) . . . . . . . . . . . . . . . . 31
2.4.2 Clock Controlled Shift Registers . . . . . . . . . . . . . . . . . . . . . 34
2.5 Known Plaintext Attack Against Single LFSRs . . . . . . . . . . . . . . . . 35
2.6 Lessons Learned — Stream Ciphers . . . . . . . . . . . . . . . . . . . . . . . 37
ii
4.4.2 Diffusion Layer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.4.3 Key Addition Layer . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.5 Decryption . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.6 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.6.1 Hardware . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.6.2 Software . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.7 Lessons Learned — AES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
iii
7 RSA 93
7.1 Cryptosystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
7.2 Computational Aspects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
7.2.1 Choosing p and q . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
7.2.2 Choosing a and b . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
7.2.3 Encryption/Decryption . . . . . . . . . . . . . . . . . . . . . . . . . . 100
7.3 Attacks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.3.1 Brute Force . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.3.2 Finding Φ(n) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.3.3 Finding a directly . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.3.4 Factorization of n . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.4 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
7.5 Lessons Learned — RSA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
iv
9.2.1 Diffie-Hellman Key Exchange . . . . . . . . . . . . . . . . . . . . . . 129
9.2.2 Menezes-Vanstone Encryption . . . . . . . . . . . . . . . . . . . . . . 130
9.3 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
v
14 Message Authentication Codes (MACs) 166
14.1 Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
14.2 MACs from Block Ciphers . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
14.3 MACs from Hash Functions: HMAC . . . . . . . . . . . . . . . . . . . . . . 170
14.4 Lessons Learned — Message Authentication Codes . . . . . . . . . . . . . . 171
vi
17.3 SSL Handshake Protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
17.3.1 Core Cryptographic Components of SSL . . . . . . . . . . . . . . . . 190
vii
***************************************************************************
1
Chapter 1
2
1.1 Literature Recommendations
1. W. Stallings [Sta02], Cryptography and Network Security. Prentice Hall, 2002. Very
accessible book on cryptography, well suited for self-study or the undergraduate level.
Covers cryptographic algorithms as well as an introduction to important practical
protocols such as IPsec and SSL.
2. D.R. Stinson [Sti02], Cryptography: Theory and Practice. CRC Press, 2002. A real
textbook. Much more mathematical than Stallings’, but very systematic treatment of
the material. The Lecture Notes by Christof Paar loosely follow the material presented
in this book.
4. B. Schneier [Sch96], Applied Cryptography. 2nd ed., Wiley, 1995. Very accessible
treatment of protocols and algorithms. Gives also a very nice introduction to cryptog-
raphy as a discipline. Is becoming a bit dated.
3
1.2 Overview on the Field of Cryptology
CRYPTOLOGY
Cryptography Cryptanalysis
Public-Key In 1976 the first public-key scheme was introduced by Diffie-Hellman key ex-
change protocol.
Hybrid Approach In today’s practical systems, very often hybrid schemes are applied
which use symmetric algorithms together with public-key algorithms (since both types
of algorithms have advantages and disadvantages.)
4
1.3 Symmetric Cryptosystems
1.3.1 Basics
Sometimes these schemes are also referred to as symmetric-key, single-key, and secret-key
approaches.
Problem Statement: Alice and Bob want to communication over an un-secure channel
(e.g., the Internet, a LAN or a cell phone link.) They want to prevent Oscar (the bad guy)
from listening.
Solution: Use of symmetric-key cryptosystems (these have been around for thousands of
years) such that if Oscar reads the encrypted version y of the message x over the un-secure
channel, he will not be able to understand its content because x is what really was sent.
Oscar
(bad)
x y x
Alice Encryption Decryption Bob
(good) e() d() (good)
k k
Secure Channel
Key
Generator
Remark: In this scenario we only consider the problem of confidentiality, that is, of hiding
the contents of the message from an eavesdropper. We will see later in these lecture notes
that there are many other things we can do with cryptography, such as preventing Oscar to
make changes to the message.
5
Some important definitions:
1a) x is called the “plaintext”
1b) P= {x1 , x2 , . . . , xp } is the (finite) “plaintext space”
2a) y is called the “ciphertext”
2b) C= {y1 , y2 , . . . , yc } is the (finite) “ciphertext space”
3a) k is called the “key”
3b) K= {k1 , k2 , . . . , kl } is the finite “key space”
4a) There are l encryption functions e ki : P→C (or: eki (x) = y)
4b) There are l decryption functions d ki : C→P (or: dki (y) = x)
4c) ek1 and dk2 are inverse functions if k1 = k2 : dki (y) = dki (eki (x)) = x for all ki ∈K
6
1.3.2 A Motivating Example: The Substitution Cipher
A → K
B → D
C → W
···
Attacks:
A search through such a key space is technically not feasible with today’s computer technol-
ogy.
Q: Other attacks?
A: But: Letter frequency analysis works!
The major weakness of the method is that each plaintext symbol always maps to the same
ciphertext symbol. That means that the statistical properties of the plaintext are preserved
in the ciphertext. For practical attacks, the following properties of language can be exploited:
1. Determine the frequencies of every ciphertext letter. The frequency distribution (even
of relatively short pieces of encrypted text) will be close to that of the given language
in general. In particular, the most frequent letters (for instance, in English: “e” is the
most frequent one with about 13%, “t” is the second most frequent one with about
9%, “a” is the third most frequent one with about 8%, ...) can often easily be spotted
in ciphertext.
7
2. The method above can be generalized by looking at pairs (or triples, or quadruples, or
...) of ciphertext symbols. For instance, in English (and German and other European
languages), the letter “q” is almost always followed by a “u”. This behavior can be
exploited for detecting the substitution of the letter “q” and the letter “u”.
3. If we assume that word separators (blanks) have been found (which is often an easy
task), one can often detect frequent short words such as “the”, “and”, ... , which leaks
all the letters in the words involved in those words
In practice the three techniques listed above are often combined to break substitution ciphers.
Lesson learned: Good ciphers should hide the statistical properties of the encrypted plain-
text. The ciphertext symbols should appear to be random. Also, a large key space alone is
not sufficient for a strong encryption function.
8
1.3.3 How Many Key Bits Are Enough?
The following table gives a rough indication of the security of symmetric ciphers with respect
to brute force attacks. As described in Subsection 1.3.2, a large key space is only a necessary
but not a sufficient condition for a secure symmetric cipher. The cipher must also be strong
against analytical attacks.
9
1.4 Cryptanalysis
What is cryptanalysis? The science of recovering the plaintext x from the ciphertext y.
Often cryptanalysis is understood as the science of recovering the plaintext through mathe-
matical analysis. However, there are other methods too such as:
• Side-channel analysis can be used for finding a secret key, for instance by measuring
the electrical power consumption of a smart card.
• Social engineering (bribing, black mailing, tricking) or classical espionage can be used
for obtaining a secret key by involving humans.
A cryptosystem should be secure even if the attacker (Oscar) knows all details about
the system, with the exception of the secret key. In particular, the system should
be secure when the attacker knows the encryption and decryption algorithm.
10
1.4.2 Attacks against Crypto Algorithms
1. Ciphertext-only attack
Oscar’s knowledge: some y1 = ek (x1 ), y2 = ek (x2 ), . . .
Oscar’s goal : obtain x1 , x2 , . . . or the key k.
11
1.5 Some Number Theory
Goal Find a finite set in which we can perform (most of) the standard arithmetic operations.
Example of a finite set in every day live: The hours on a clock. If you keep adding 1 hour
you get:
1h, 2h, 3h, . . . , 11h, 12h, 1h, 2h, 3h, . . .
Even though we keep adding one hour, we never leave the set. Let’s look at a general way
of dealing with arithmetic in such finite sets. We consider the set of the 9 numbers:
{0, 1, 2, 3, 4, 5, 9, 7, 8}
We can do regular arithmetic as long as the results are smaller than 9. For instance:
2×3 = 6
4+4 = 8
But what about 8 + 4? We try now the following rule: Perform regular integer arithmetic
and divide the result by 9. We then consider only the remainder rather than the original
result. Since 8 + 4 = 12, and 12/9 has a remainder of 3, we write:
8 + 4 ≡ 3 mod 9
12
Some remarks on the modulo operation:
0≤r ≤ m−1
13
Modulo operation in the C programming language
C programming command : “%” (C can return a negative value)
r = 42 % 9 returns r = 6
but r = -42 % 9 returns r = -6 → if remainder is negative, add modulus m:
−6 + 9 = 3 ≡ −42 mod 9
Let’s now look at the mathematical structure we obtain if we consider the set of integers
from zero to m together with the operations addition and multiplication:
• a + b ≡ c mod m (c ∈ Zm )
• a × b ≡ d mod m (d ∈ Zm )
Example: m = 9
Z9 = {0, 1, 2, 3, 4, 5, 6, 7, 8}
6 + 8 = 14 ≡ 5 mod 9
6 × 8 = 48 ≡ 3 mod 9
All rings (not only the ring Zm we consider here) have a set of properties which are listed in
the following:
14
Definition 1.5.3 Some important properties of the ring Zm = {0, 1, 2, . . . , m − 1}
1. The additive identity is the element zero “0”: a + 0 = a mod m, for any
a ∈ Zm .
2. The additive inverse “−a” of “a” is such that a+(−a) ≡ 0 mod m: −a = m−a,
for any a ∈ Zm .
6. The multiplicative identity is the element one “1”: a × 1 ≡ a mod m, for any
a ∈ Zm .
15
Some remarks on the ring Zm :
• Roughly speaking, a ring is a structure in which we can add, subtract, multiply, and
sometimes divide.
• Definition 1.5.4 If gcd(a, m) = 1, then a and m are “relatively prime” and the
multiplicative inverse of a exists.
Example:
i) Question: does multiplicative inverse exist with 15 mod 26?
Answer: yes — gcd(15, 26) = 1
ii) Question: does multiplicative inverse exist with 14 mod 26?
Answer: no — gcd(14, 26) 6= 1
• The ring Zm , and thus the integer arithmetic with the modulo operation, is of central
importance to modern public-key cryptography. In practice, the integers are repre-
sented with 150–2048 bits.
16
1.6 Simple Blockciphers
Recall:
Private-key Systems
Idea: The message string is divided into blocks (or cells) of equal length that are then
encrypted and decrypted.
Input: message string X̄ → X̄ = x1 , x2 , x3 , . . . , xn , where each xi is one block.
Cipher: Ȳ = y1 , y2 , y3 , . . . , yn ; with yi = ek (xi ) where the key k is fixed.
17
1.6.1 Shift Cipher
One of the most simple ciphers where the letters of the alphabet are assigned a number as
depicted in Table 1.2.
A B C D E F G H I J K L M
0 1 2 3 4 5 6 7 8 9 10 11 12
N O P Q R S T U V W X Y Z
13 14 15 16 17 18 19 20 21 22 23 24 25
Remark:
If k = 3 the the shift cipher is given a special name — “Caesar Cipher”.
18
Example:
k = 17,
plaintext:
X = x1 , x2 , . . . , x6 = AT T ACK.
X = x1 , x2 , . . . , x6 = 0, 19, 19, 0, 2, 10.
encryption:
y1 = x1 + k mod 26 = 0 + 17 = 17 mod 26 = R
y2 = y3 = 19 + 17 = 36 ≡ 10 mod 26 = K
y4 = 17 = R
y5 = 2 + 17 = 19 mod 26 = T
y6 = 10 + 17 = 27 ≡ 1 mod 26 = B
ciphertext: Y =y
¯ 1 , y2 , . . . , y6 = R K K R T B.
1. Ciphertext-only: Try all possible keys (|k| = 26). This is known as “brute force attack”
or “exhaustive search”.
Secure cryptosystems require a sufficiently large key space. Minimum requirement
today is |K| > 280 , however for long-term security, |K| ≥ 2100 is recommended.
2. Same cleartext maps to same ciphertext ⇒ can also easily be attacked with letter-
frequency analysis.
19
1.6.2 Affine Cipher
20
1.7 Lessons Learned — Introduction
• Never ever develop your own crypto algorithm unless you have a team of experienced
cryptanalysts checking your design.
• Do not use unproven crypto algorithms (i.e., symmetric, asymmetric, hash function)
or unproven protocols.
• A large key space by itself is no guarantee for a secure cipher: The cipher might still
be vulnerable against analytical attacks.
1. The time your crypto implementation will be used (often only a few years.)
2. The time the encrypted data should stay secure (depending on application: can
range from a day to several decades.)
• Key lengths for symmetric algorithms in order to thwart exhaustive key-search attacks:
1. 64 bits — unsecure except for data with extreme short term value.
3. 256 bits — as above, but also secure against attack by quantum computer.
21
Chapter 2
Stream Ciphers
2.1 Introduction
Remember classification:
Private-key Systems
• Encrypts blocks of bits at a time. In practice, xi (and yi ) are 64 or 128 bits long.
• The encryption function ek () requires complex operation. In practice all block ciphers
are iterative algorithms with multiple rounds. Examples: DES (Chapter 3) or AES
(Chapter 4).
22
Stream Cipher: y1 , y2 , . . . , yn = ez1 (x1 ), ez2 (x2 ), . . . , ezn (xn ),
where z1 , z2 , . . . , zn is the keystream.
Key features of stream ciphers:
• The main art of stream cipher design is the generation of the key stream.
23
Most popular en/decryption function: modulo 2 addition
Assume: xi , yi , zi ∈ {0, 1}
This leads to the following block diagram for a stream cipher encryption/decryption:
Zi Zi
Xi Yi Xi
Remarks:
1. Historical note: A machine realizing the functionality shown above was developed by
Vernam for teletypewriters in 1917. Vernam was alumni of Worcester Polytechnic
Institute (WPI). Further reading: [Kah67].
a b c = a + b mod 2
0 0 0 + 0 = 0 mod 2
0 1 0 + 1 = 1 mod 2
1 0 1 + 0 = 1 mod 2
1 1 1 + 1 = 0 mod 2
.
⇒ modulo 2 addition yields the same truth table as the XOR operation.
24
2. Encryption and decryption are the same operation, namely modulo 2 addition (or
XOR).
Why? We show that decryption of ciphertext bit yi yields the corresponding plaintext
bit.
Decryption: yi + zi = (xi + zi ) + zi = xi + (zi + zi ) ≡ xi mod 2.
| {z }
encryption
Note that zi + zi ≡ 0 mod 2 for zi = 0 and for zi = 1.
25
2.2 Some Remarks on Random Number Generators
We distinguish between three types of random number generators (RNG):
True Random Number Generators (TRNG) These are sequences of numbers gener-
ated from physical processes. Example: coin flipping, rolling of dices, semiconductor
noise, radioactive decay, ...
General Pseudo Random Generators (PRNG) These are sequences which are com-
puted from an initial seed value. Often they are computed recursively:
z0 = seed
zi+1 = f (zi )
z0 = seed
zi+1 ≡ a zi + b mod m,
It must be stressed that for stream cipher applications it is not sufficient for a pseudo random
generator to have merely good statistical properties. In addition, for stream ciphers only
cryptographically secure generators are useful. Important: The distinction between PRNG
and CSPRN and their relevance for stream ciphers is often not clear to non-cryptographers.
26
2.3 General Thoughts on Security, One-Time Pad and
Practical Stream Ciphers
Definition 2.3.1 Unconditional Security
A cryptosystem is unconditionally secure if it cannot be broken even
with infinite computational resources.
27
Remarks:
y0 = x0 + K0 mod 2
y1 = x1 + K1 mod 2
..
.
Question: In order to build practical stream generators, can we “emulate” a OTP by using
a short key?
k k
Oscar
key-stream key-stream
Alice generator generator Bob
zi zi
It should be stressed that practical stream ciphers are not unconditionally secure. In fact,
all known practical crypto algorithms (stream ciphers, block ciphers, public-key algorithms)
are at the most relative secure, which we define as follows:
28
Definition 2.3.3 Computational Security
A system is “computationally secure” if the best possible algorithm
for breaking it requires N operations, where N is very large and known.
Unfortunately, all known practical systems are only computational secure for known algo-
rithms.
Example
29
Classification of practical key-stream generators:
synchronous stream cipher
zi = f (k, zi−i , . . . , z1 )
Note that the receiver (Bob) has to match the exact zi to the correct yi in order to obtain the
correct cleartext. This requires synchronization of Alice’s and Bob’s key-stream generators.
f
asynch1
z asynch2
x y
30
2.4 Synchronous Stream Ciphers
The keystream z1 , z2 , . . . is a pseudo-random sequence which depends only on the key.
An LFSR consists of m storage elements (flip-flops) and a feedback network. The feedback
network computes the input for the “last” flip-flop as XOR-sum of certain flip-flops in the
shift register.
Example: We consider an LFSR of degree m = 3 with flip-flops K2 , K1 , K0 , and a feedback
path as shown below.
mod 2 addition / XOR
K2 K1 K0
Z2 Z1 Z0 Z0 Z 1 ........ Z 6
CLK
K2 K1 K0
1 0 0
0 1 0
1 0 1
1 1 0
1 1 1
0 1 1
0 0 1
1 0 0
31
z4 = z2 + z1 mod 2
z5 = z3 + z2 mod 2
..
.
general case: zi+3 = zi+1 + zi mod 2; i = 0, 1, 2, . . .
........
C m-1 C1 C0
K m-1 K1 K0
........ OUTPUT
CLK
C0 , C1 , . . . , Cm−1 are the feedback coefficients. Ci = 0 denotes an open switch (no connec-
tion), Ci = 1 denotes a closed switch (connection).
m−1
X
zi+m = Cj · zi+j mod 2; Cj ∈ {0, 1}; i = 0, 1, 2, . . .
j=0
Example:
k = {(C0 = 1, C1 = 1, C2 = 0), (z0 = 0, z1 = 0, z2 = 1), 3}
32
Proof:
There are only 2m different states (k0 , . . . , km ) possible. Since only the current
state is known to the LFSR, after 2m clock cycles a repetition must occur. The
all-zero state must be excluded since it repeats itself immediately.
Remarks:
1.) Only certain configurations (C0 , . . . , Cm−1 ) yield maximum length LFSRs.
For example:
if m = 4 then (C0 = 1, C1 = 1, C2 = 0, C3 = 0) has length of 2m − 1 = 15
but (C0 = 1, C1 = 1, C2 = 1, C3 = 1) has length of 5
2.) LFSRs are sometimes specified by polynomials.
such that the P (x) = xm + Cm−1 xm−1 + . . . + C1 x + C0 .
Maximum length LFSRs have “primitive polynomials”.
These polynomials can be easily obtained from literature (Table 16.2 in [Sch96]).
For example:
(C0 = 1, C1 = 1, C2 = 0, C3 = 0) ⇐⇒ P (x) = 1 + x + x4
33
2.4.2 Clock Controlled Shift Registers
LFSR1 Out1
LFSR2 Out2
CLK
LFSR3 Out3
Basic operation:
When Out1 = 1 then LFSR2 is clocked otherwise LFSR3 is clocked.
Out4 serves as the keystream and is a bitwise XOR of the results from LFSR2 and LFSR3.
• If the sequence lengths of all LFSRs are relatively prime to each other, then the
sequence length of the generator is the product of all three sequence lengths, i.e.,
L = L 1 · L2 · L3 .
• A secure generator should have LFSRs of roughly equal lengths and the length should
be at least 128: m1 ≈ m2 ≈ m3 ≈ 128.
34
2.5 Known Plaintext Attack Against Single LFSRs
Assumption:
For a known plaintext attack, we have to assume that m is known.
Idea:
This attack is based on the knowledge of some plaintext and its corresponding ciphertext.
i) Known plaintext → x0 , x1 , . . . , x2m−1 .
ii) Observed ciphertext → y0 , y1 , . . . , y2m−1 .
iii) Construct keystream bits → zi = xi + yi mod 2; i = 0, 1, . . . , 2m − 1.
Goal:
To find the feedback coefficients Ci .
Note:
35
Solving the matrix in (2.2) for the Ci coefficients we get:
−1
c0 z0 ... zm−1 zm
..
.. ..
..
. = . . · . mod 2 (2.3)
cm−1 zm−1 . . . z2m−2 z2m−1
Summary:
⇒ LFSRs by themselves are extremely unsecure! Even though they are PRNG with
good statistical properties, the are not cryptographically secure. However, combinations of
them such as the alternating stop-and-go generator can be secure.
36
2.6 Lessons Learned — Stream Ciphers
• Stream ciphers are less popular than block ciphers in most application domains such
as Internet security. There are exceptions, for instance the popular stream cipher RC4.
• Stream ciphers are often used in mobile (and presuming military) applications, such
as the A5 speech encryption algorithm of the GSM mobile network.
• Stream ciphers generally require fewer resources (e.g., code size or chip area) for an
implementation than block ciphers. They tend to encrypt faster than block ciphers.
• The one-time pad is highly impractical in most cases because the key length has to be
equal to the message length.
• The requirements for a cryptographically secure pseudo-random generator are far more
demanding than the requirements for pseudo-random generators in other (engineering)
applications such as simulation.
• Many pseudo-random generators with good statistical properties such as LFSRs are
not cryptographically secure at all. A stand-alone LFSR makes, thus, a poor stream
cipher.
37
Chapter 3
1. Confusion — encryption operation where the relationship between cleartext and ci-
phertext is obscured. Some examples are:
2. Diffusion — encryption by spreading out the influence of one cleartext letter over
many ciphertext letters. An example is:
38
Remarks:
1. Today → changing of one bit of cleartext should result on average in the change of
half the output bits.
x1 = 001010 → encr. → y1 = 101110.
x2 = 000010 → encr. → y2 = 001011.
2. Combining confusion with diffusion is a common practice for obtaining a secure scheme.
Data Encryption Standard (DES) is a good example of that.
product
cipher
39
3.2 Introduction to DES
General Notes:
• Expired in 1998.
System Parameters:
→ block cipher.
→ 64 input/output bits.
→ 56 bits of key.
Principle: 16 rounds of encryption.
K1 K16
40
3.2.1 Overview
Message X Key K
64
Initial Permutation
IP(X)
56
64
L0 R0
32
32 48
f Transform 1
32 K1
56
round 1
32
32
L1 R1
L 15 R 15
32
48
32
f Transform 16
32 K 16
round 16
32
32
L 16 R 16
Final Permutation
-1
IP (R16 , L16 )
41
3.2.2 Permutations
IP
58 50 42 34 26 18 10 2
60 52 44 36 28 20 12 4
62 54 46 38 30 22 14 6
64 56 48 40 32 24 16 8
57 49 41 33 25 17 9 1
59 51 43 35 27 19 11 3
61 53 45 37 29 21 13 5
63 55 47 39 31 23 15 7
1 50 58 64
IP(X)
1 2 40
40
-1
IP (Z)
1
42
Note:
IP −1 (IP (X)) = X.
General Description:
Li = Ri−1 .
Ri = Li−1 ⊕ f (Ri−1 , ki ).
The core iteration is the f-function that takes the right half
of the output of the previous round and the key as input.
E bit table
32 1 2 3 4 5
4 5 6 7 8 9
8 9 10 11 12 13
12 13 14 15 16 17
16 17 18 19 20 21
20 21 22 23 24 25
24 25 26 27 28 29
28 29 30 31 32 1
S-boxes:
Contain look-up tables (LUTs) with 64 numbers ranging from 0 . . . 15.
Input: Six bit code selecting one number.
Output: Four bit binary representation of one number out of 64.
43
R i-1
of single bits
Expansion
E(Ri-1 )
48
48
Ki
48
confusion: obscures
ciphertext/cleartext
6 6 relationship
f-function
S1 S8
4 4
L i-1 8 * 4 = 32
page 75 in Stinson
Permutation P
32
32
32
Ri
44
Example:
S1
14 4 13 1 2 15 11 8 3 10 6 12 5 9 0 7
0 15 7 4 14 2 13 1 10 6 12 11 9 5 3 8
4 1 14 8 13 6 2 11 15 12 9 7 3 10 5 0
15 12 8 2 4 9 1 7 5 11 3 14 10 0 6 13
S-Box 1
Input: Six bit vector with MSB and LSB selecting the row and four inner bits
selecting column.
b = (100101).
→ row = (11)2 = 3 (forth row).
→ column = (0010)2 = 2 (third column).
S1 (37 = 1001012 ) = 8 = 10002 .
Remark:
S-boxes are the most crucial elements of DES because they introduce a non-
linear function to the algorithm, i.e., S(a) XOR S(b) 6= S(a XOR b).
Note:
64
7 1 7 1
P P
P = parity bits
45
In practice the DES key is artificially enlarged with odd parity bits. These bits
are “stripped” in PC-1.
K
64
PC - 1
56
C0 D0
28 28
LS 1 LS 1
28 28
K1 PC - 2 C1 D1
48 56
28 28
LS 2 LS 2
LS 16 LS 16
K 16 PC - 2 C 16 D 16
48 56
46
Remark:
3.3 Decryption
One advantage of DES is that decryption is essentially the same as encryption. Only the
key schedule is reversed. This is due to the fact that DES is based on a Feistel network.
47
Cipher Y = DES(X) Key K
64 64
Initial Permutation
IP PC-1
64
56
d d
L0 R0
32
32 48
f Transform 16
32 K 16
32
32
d d 56
L1 R1
d d
L 15 R 15
32
48
32
f Transform 1
32 K1
32
32
d d
L 16 R 16
Final Permutation
IP -1
-1 -1
X = DES (Y) = DES (DES(X))
48
Reversed Key Schedule:
Question: Given K, how can we easily generate k16 ?
k16 = P C2(C16 , D16 ) = P C2(C0 , D0 ) = P C2(P C1(k)).
k15 = P C2(C15 , D15 ) = P C2(RS1 (C16 ), RS1 (D16 )) = P C2(RS1 (C0 ), RS1 (D0 )).
K
56
PC - 1
56
K 16 PC - 2 C0 = C 16 D0 = D 16
48 56
28 28
RS 1 RS 1
28 28
K 15 PC - 2 C 15 D 15
48 56
28 28
RS 2 RS 2
RS 15 RS 15
K1 PC - 2 C1 D1
48 56
49
3.4 Implementation
Note:
One design criteria for DES was fast hardware implementation.
3.4.1 Hardware
Since permutations and simple table look-ups are fast in hardware, DES can be implemented
very efficiently. An implementation of a single DES round can be done with approximately
5000 gates.
1. One of the faster reported ASIC implementations: 9 Gbit/s in 0.6 µm technology with
16 stage pipeline [WPR+ 99].
3.4.2 Software
A straightforward software implementation which follows the data flow of most DES descrip-
tions, such as the one presented in this chapter, results in a very poor performance! There
have been numerous method proposed for accelerating DES software implementations. Here
are two representative ones:
2. The well known and fairly fast crypto library Crypto++ by Weidai claims a perfor-
mance of about 100Mbit/sec on a 850 MHz Celeron processor. See also http://www.eskimo.co
50
3.5 Attacks
There have been two major points of criticism about DES from the beginning:
i) key size is too small (allowing a brute-force attack),
ii) the S-boxes contained secret design criteria (allowing an analytical attack).
1977 Diffie & Hellman, estimate cost of key search machine (underestimate)
1990 Biham & Shamir propose differential cryptoanalysis (247 chosen ciphertexts)
1993 Mike Wiener proposes detailed hardware design for key search machine:
average search time of 36 h @ $100,000
1993 Matsui proposes linear cryptoanalysis (243 chosen ciphertexts)
Jun. 1997 DES Challenge I broken, distributed effort took 4.5 months
Feb. 1998 DES Challenge II–1 broken, distributed effort took 39 days
Jul. 1998 DES Challenge II–2 broken, key-search machine built by the
Electronic Frontier Foundation (EFF), 1800 ASICs, each with 24
search units, $250K, 15 days average (actual time 56 hours)
Jan. 1999 DES Challenge III broken, distributed effort combined with EFF’s
key-search machine, it took 22 hours and 15 minutes.
51
3.6 DES Alternatives
There exists a wealth of other block ciphers. A small collection of as of yet unbroken ciphers
is:
52
3.7 Lessons Learned — DES
• Standard DES with 56 bits key length can relatively easily be broken nowadays through
an exhaustive key search.
• DES is very robust against known analytical attacks: DES is resistant against differ-
ential and linear cryptanalysis. However, the key length is too short.
• DES is only reasonably efficient in software but very fast and small in hardware.
• The most conservative alternative to DES is triple DES which has Effective key lengths
of 112 bits.
53
Chapter 4
4.1 Introduction
• Successor to DES.
• Unlike DES, the AES selection was an open (i.e., public) process.
54
4.1.2 Chronology of the AES Process
A lot of work went into software and hardware performance analysis of the AES candidate
algorithms. Here are representative numbers:
55
4.2 Rijndael Overview
128 128
x y
Rijndael
k
128/192/256
• Both block size and key length of Rijndael are variable. Sizes shown in Figure 4.2 are
the ones required by the AES Standard. The number of rounds (or iterations) is a
function of the key length:
• However, Rijndael also allows block sizes of 192 and 256 bits. For those block sizes the
number of rounds must be increased.
Important: Rijndael does not have a Feistel structure. Feistel networks do not encrypt
an entire block per iteration (e.g., in DES, 64/2 = 32 bits are encrypted in one iteration).
Rijndael encrypts all 128 bits in one iteration. As a consequence, Rijndael has a comparably
small number of rounds.
Rijndael uses three different types of layers. Each layer operates on all 128 bits of a block:
56
1. Key Addition Layer: XORing of subkey.
3. Diffusion Layer: provides diffusion over all 128 (or 192 or 256) block bits. It is split
in two sub-layers:
57
x
ByteSubstitution Layer
ShiftRow SubLayer
rounds 1 ... n r - 1 Diffusion Layer
MixColumn Sublayer
ByteSubstitution Layer
58
4.3 Some Mathematics: A Very Brief Introduction to
Galois Fields
“Galois fields” are used to perform substitution and diffusion in Rijndael.
Theorem 1 Let p be a prime. GF (p) is a “prime field,” i.e., a Galois field with a
prime number of elements. All arithmetic in GF (p) is done modulo p.
Theorem 4.3.1 For every power pm , p a prime and m a positive integer, there exists
a finite field with pm elements, denoted by GF (pm ).
59
Examples:
- GF (12) = GF (3·22 ) is NOT a finite field (in fact, the notation is already incorrect
and you should pretend you never saw it).
A(x) = x7 + x6 + x4 + 1
B(x) = x4 + x2 + 1
C(x) = x7 + x6 + x2
60
where:
c00 = a0 b0 mod p
c01 = a0 b1 + a1 b0 mod p
..
.
A(x) = x3 + x2 + 1
B(x) = x2 + x
x4 = 1 · P (x) + (x + 1)
x4 ≡ x + 1 mod P (x)
x5 ≡ x2 + x mod P (x)
A(x) · B(x) ≡ x3
Note: in a typical computer representation, the multiplication would assign the follow-
ing unusually looking operations:
A · B = C
(1 1 0 1) · (0 1 1 0) = (1 0 0 0)
61
Example 2: x4 + x3 + x + 1 is reducible since x4 + x3 + x + 1 = (x2 + x + 1)(x2 + 1).
t0 = 0, t1 = 1
x3 + x + 1 = [x]x2 + [x + 1] t2 = t0 − q1 t1 = −q1 = −x = x
x + 1 = [1]x + 1 t 3 = t 1 − q 2 t2 = 1 − q 2 x = 1 − x = x + 1
x = [x]1 + 0
⇒ (x2 )−1 = t(x) = t3 = x + 1
Remark: In every iteration of the Euclidean algorithm, you should use long division (not
shown above) to uniquely determine qi and ri .
• Each byte A is considered an element of GF (28 ) and undergoes the following substi-
tution individually
62
1. B = A−1 ∈ GF (28 ) where P (x) = x8 + x4 + x3 + x + 1
The entire substitution can be realized as a look-up in a 256×8-bit table with fixed
entries.
Remark: Unlike DES, Rijndael applies the same S-Box to each byte.
• Unlike the non-linear substitution layer, the diffusion layer performs a linear operation
on input words A, B. That means:
63
ShiftRow Sublayer
1. Write an input word A as 128/8 = 16 bytes and order them in a square array:
Input A = (a0 , a1 , · · · , a15 )
a0 a4 a8 a12
a1 a5 a9 a13
a2 a6 a10 a14
a3 a7 a11 a15
a0 a4 a8 a12 0 positions
a5 a9 a13 a1 − − − −→ 3 positions right shift
a10 a14 a2 a6 −− −→ 2 positions right shift
a15 a3 a7 a11 − −→ 1 position right shift
MixColumn Sublayer
Question: How?
Each 4-byte column is considered as a vector and multiplied by a 4 × 4 matrix. The matrix
contains constant entries. Multiplication and addition of the coefficients is done in GF (28 ).
c0 02 03 01 01 b0
c1 01 02 03 01 b1
=
c 01 01 02 03
b2
2
c3 03 01 01 02 b3
Remarks:
64
2. The small values {01, 02, 03} allow for a very efficient implementation of the coefficient
multiplication in the matrix. In software implementations, multiplication by 02 and
03 can be done through table look-up in a 256-by-8 table.
4.5 Decryption
Unlike DES and other Feistel ciphers, all of Rijndael layers must actually be inverted.
65
y
66
4.6 Implementation
4.6.1 Hardware
Compared to DES, Rijndael requires considerable more hardware resources for an implemen-
tation. However, Rijndael can still be implemented with very high throughputs in modern
ASIC or FPGA technology. Two representative implementation reports are:
4.6.2 Software
Unlike DES, Rijndael was designed such that an efficient software implementation is possible.
A naive implementation of Rijndael which directly follows the data path description, such
as the description given in this chapter, is not particularly efficient, though. In a naive
implementation all time critical functions (Byte Substitution, Mix Row, Shift Row) operate
on individual bytes. Processing 1 byte per instruction is inefficient on modern 32 or 64 bit
processors.
However, the Rijndael designers proposed a method which results in fast software implemen-
tations. The core idea is to merge all round functions (except the rather trivial key addition)
into one table look-up. This results in 4 tables, each of which consists of 256 entries, where
each entry is 32 bit wide. These tables are named “T-Box”. Four table accesses yield 32 bit
output bits of one round. Hence, one round can be computed with 16 table look-ups.
A detailed description of the construction of the T-Boxes can be found in [DR98, Section 5].
67
4.7 Lessons Learned — AES
• The AES selection was an open process. It appears to be extremely unlikely that the
designers included hidden weaknesses (trapdoors) in Rijndael.
• The AES key lengths provide long term security against brute force attacks for several
decades.
• AES is a relativley new cipher. At the moment it can not be completely excluded that
there will be analytical attacks against Rijndael in the future, even though this does
not seem very likely.
• The fact that AES is a “standard” is currently only relevant for US Government ap-
plications.
68
Chapter 5
Further Reading:
Section 8.1 in [Sch96].
Note:
The following modes are applicable to all block ciphers ek (X).
69
5.1.1 Electronic Codebook Mode (ECB)
X0 X1 X2 e Y0 Y1 Y2 e-1 X0 X1 X2
K K
General Description:
e−1 −1
k (Yi ) = ek (ek (Xi )) = Xi ; where the encryption can, for instance, be DES.
Problem:
This mode is susceptible to substitution attack because same Xi are mapped to same Yi .
Example: Bank transfer.
Block # 1 2 3 4 5
Sending Sending Receiving Receiving Amount
Bank A Account # Bank B Account # $
2. Send $1.00 transfer to own account at bank B repeatedly → block 4 can be identified
and recorded.
70
5.1.2 Cipher Block Chaining Mode (CBC)
i=0 i=0
IV IV
Y i-1 Y i-1
Y i-1 Y i-1
Xi e e-1 Xi
Yi
k k
Beginning: Y0 = ek (X0 ⊕ IV ).
X0 = IV ⊕ e−1 −1
k (Y0 ) = IV ⊕ ek (ek (X0 ⊕ IV )) = X0 .
71
5.1.3 Cipher Feedback Mode (CFB)
Assumption: block cipher with b bits block width and message with block width l, 1 ≤
l ≤ b.
SR l SR l
b b
~
zi ~
zi
l l
e e
b b
Procedure:
4. Encrypt data: Y0 = X0 ⊕ z0 .
5. Shift the shift register and load Y0 into the rightmost SR position.
72
5.1.4 Counter Mode
Notes:
• Counter Mode does not rely on previous ciphertext for encrypting the next block.
⇒ well suited for parallel (hardware) implementation, with several encryption blocks
working in parallel.
• Counter Mode stems from the Security Group of the ATM Forum, where high data
rates required parallelization of the encryption process.
LFSR
k e
n n
X Y
1. An n-bit initial vector (IV) is loaded into a (maximum length) LFSR. The IV can be
publically known, although a secret IV (i.e., the IV is considered part of the private
key) turns the counter mode systems into a non-deterministic cipher which makes
cryptoanalysis harder.
3. The block cipher output is considered a pseudorandom mask which is XORed with the
plaintext.
73
4. The LFSR is clocked once (note: all input bits of the block cipher are shifted by one
position).
5. Goto to Step 2.
Note that the period of a counter mode is n · 2n which is very large for modern block ciphers,
e.g., 128 · 2128 = 2135 for AES algorithms.
74
5.2 Key Whitening
Xi e Yi
k2 k1 k3
75
5.3 Multiple Encryption
-1
e (X) = z (1)
i e (Y) = z (2)
j
ki kj
X e e Y
n z
k
ki kj
Procedure:
(1)
1. Compute a look-up table for all (zi , ki ), i = 1, 2, . . . , 2k and store it in memory.
Number of entries in the table is 2k with each entry being n bits wide.
(2)
2. Find matching zj .
(2)
(a) compute e−1 0
kj (y ) = zj
76
(c) if ki and kj give matching encryptions stop; otherwise go back to (a) and try
different key kj .
Question: How many additional pairs (x00 , y 00 ), (x000 , y 000 ), . . . should we test?
1. In the first step there are 2lk possible key combinations for the mapping E(x0 ) =
e(· · · (e(e(x0 )) · · ·) = y 0 but only 2n possible values for x0 and y 0 . Hence, there are
2lk
2n
mappings E(x0 ) = y 0 . Note that only one mapping is done by the correct key!
2n
X’ Y’
2. We use now a candidate key from step 1 and check whether E(x00 ) = y 00 . There are 2n
possible outcomes y for the mapping E(x00 ). If a random key is used, the likelyhood
that E(x00 ) = y 00 is
1
2n
If we check additionally a third pair (x000 , y 000 ) under the same “random” key from step
1, the likelyhood that E(x00 ) = y 00 and E(x000 ) = y 000 is
1
22n
77
If we check t − 1 additional pairs (x00 , y 00 ), (x000 , y 000 ), . . . (x(t) , y (t) ) the likelyhood that a
random key fulfills E(x00 ) = y 00 , E(x000 ) = y 000 , . . . is
1
2(t−1)n
2n
X’’ Y’’
2n mappings E(x’’) = y
2lk
3. Since there are 2n
candidate keys in step 1, the likelyhood that at least one of the
candidate keys fulfills all E(x00 ) = y 00 , E(x000 ) = y 000 , . . . is
1 2lk
= 2lk−tn
2(t−1)n 2n
Example: Double encryption with DES. We use two pairs (x0 , y 0 ), (x00 , y 00 ). The likelyhood
that an incorrect key pair ki , kj is picked is
If we use three pairs (x0 , y 0 ), (x00 , y 00 ), (x000 , y 000 ), the likelyhood that an incorrect key pair ki , kj
is picked is
2lk−tn = 2112−192 = 2−80
Computational complexity:
78
5.3.2 Triple Encryption
Option 1:
Y = ek1 (e−1
k2 (ek3 (X))); if k1 = k2 → Y = ek3 (X).
Option 2:
Y = ek3 (ek2 (ek1 (X))); where |k| ≈ 22k
X e e e Y
z
1
k1 k2 k3
Note:
Meet in the middle attack can be used in a similar way by storing zi results in
memory. The computational complexity of this approach is 2k · 2k = 22k .
79
5.4 Lessons Learned — More About Block Ciphers
• The ECB mode has security weaknesses.
• The counter mode allows parallelization of encryption and is thus suited for high speed
hardware implementations.
• Double encryption with a given block cipher only marginally improves the attack re-
sistance against brute force attacks.
• Triple encryption with a given block cipher roughly doubles the key length. Triple DES
(“3DES”) has, thus, an effective key length of 112 bits.
• Key whithening enlarges the DES key length without too much effort.
80
Chapter 6
Introduction to Public-Key
Cryptography
6.1 Principle
Quick review of symmetric-key cryptography
e Y dk
X k X
k k
1. The algorithm requires same secret key for encryption and decryption.
81
Analogy for symmetric key algorithms
Symmetric key schemes are analogous to a safe box with a strong lock. Everyone
with the key can deposit messages in it and retrieve messages.
2. In a network environment, each pair of users has to have a different key resulting in
too many keys (n · (n − 1) ÷ 2 key pairs).
New Idea:
Make a slot in the safe box so that everyone can deposit a message, but only the
receiver can open the safe and look at the content of it. This idea was proposed
in [DH76] in 1976 by Diffie/Hellman.
82
Protocol:
3. Alice encrypts her message with Bob’s public key and sends the ciphertext.
K pub
( K pub , K pr ) = K
2.) X
Y
3.) Y = eK (X) Y
pub
X = dK (Y)
4.) pr
83
Mechanisms that can be realized with public-key algorithms
1. Key establishment protocols (e.g., Diffi-Hellman key exchange) and key transport pro-
tocols (e.g., via RSA) without prior exchange of a joint secret
3. Encryption
It looks as though public-key schemes can provide all functionality needed in modern security
protocols such as SSL/TLS. However, the major drawback in practice is that encryption
of data is extremely computationally demanding with public-key algorithms. Many block
and stream ciphers can encrypt 1000 times faster in software than public-key algorithms.
On the other hand, symmetric algorithms are poor at providing digital signatures and key
establishment/transport functionality. Hence, most practical protocols are hybrid protocols
which incorporate both symmetric and public-key algorithms.
84
6.2 One-Way Functions
All public-key algorithms are based on one-way functions.
85
6.3 Overview of Public-Key Algorithms
There are three families of Public-Key (PK) algorithms of practical relevance:
In addition, there are many other public-key schemes, such as NTRU or systems based on
hidden field equations, which are not in wide spread use. Often, their security is not very
well understood.
Block cipher 80
Table 6.1: Bit lengths of public-key algorithms for a security level of approximately 280
computations for a successful attack.
Remark: The long operands lead to a high computationally complexity of public-key algo-
rithms. This can be a bottleneck in applications with constrained microprocessors (e.g.,
mobile applications) or on the server side of networks, where many public-key operations
per time unit have to be executed.
86
– Key establishment algorithms
– Signature algorithms
Note: IEEE P1363 does not recommend any bit lengths or security levels.
ANSI# Subject
FIPS# Subject
87
6.5 More Number Theory
Basic Form
Given r0 and r1 with one larger than the other, compute the gcd(r0 , r1 ).
Example 1:
r0 = 22, r1 = 6.
gcd(r0 , r1 ) =?
r0 6
6 6
4 gcd(22,6) = gcd(6,4)
r1 4
2
gcd(6,4) = gcd(4,2)
2
2
r2 gcd(4,2) = 2
r3 2
Example 2:
r0 = 973; r1 = 301.
973 = 3 · 301 + 70.
301 = 4 · 70 + 21.
70 = 3 · 21 + 7.
21 = 3 · 7 + 0.
gcd(973, 301) = gcd(301, 70) = gcd(70, 21) = gcd(21, 7) = 7.
Algorithm:
88
input: r0 , r1
r0 = q 1 · r1 + r 2 gcd(r0 , r1 ) = gcd(r1 , r2 )
r1 = q 2 · r2 + r 3 gcd(r1 , r2 ) = gcd(r2 , r3 )
.. ..
. .
rm−2 = qm−1 · rm−1 + rm gcd(rm−2 , rm−1 ) = gcd(rm−1 , rm )
rm−1 = qm · rm + 0 ← † gcd(r0 , r1 ) = gcd(rm−1 , rm ) = rm
† - termination criteria
Theorem 6.5.1 Given two integers r0 and r1 , there exist two other integers s and t
such that s · r0 + t · r1 = gcd(r0 , r1 ).
89
Now: s = sm , t = tm
Recursive formulae:
s0 = 1, t0 = 0
s1 = 0, t1 = 1
si = si−2 − qi−1 · si−1 , ti = ti−2 − qi−1 · ti−1 ; i = 2, 3, 4 . . .
Remark:
b) For fast software implementation, the “binary extended Euclidean algorithm” is more
efficient [AM97] because it avoids the division required in each iteration of the extended
Euclidean algorithm shown above.
Example 1:
m = 6; Z6 = {0, 1, 2, 3, 4, 5}
gcd(0, 6) = 6
gcd(1, 6) = 1 ←
gcd(2, 6) = 2
gcd(3, 6) = 3
gcd(4, 6) = 2
gcd(5, 6) = 1 ←
Φ(6) = 2
90
Example 2:
m = 5; Z5 = {0, 1, 2, 3, 4}
gcd(0, 5) = 5
gcd(1, 5) = 1 ←
gcd(2, 5) = 1 ←
gcd(3, 5) = 1 ←
gcd(4, 5) = 1 ←
Φ(5) = 4
n
Y
Φ(m) = (pei i − pei i −1 )
i=1
aΦ(m) ≡ 1 mod m
Example: m = 6; a = 5
Φ(6) = Φ(3 · 2) = (3 − 1)(2 − 1) = 2
5Φ(6) = 52 = 25 ≡ 1 mod 6
91
6.6 Lessons Learned — Basics of Public-Key Cryptog-
raphy
• Public-key algorithms have capabilities that symmetric ciphers don’t have, in particular
digital signature and key establishment functions.
• Public-key algorithms are computationally intensive (= slow), and are hence poorly
suited for bulk data encryption.
• Most modern protocols are hybrid protocols which use symmetric as well as public-key
algorithms.
• There are considerably fewer established public-key algorithms than there are symmet-
ric ciphers.
• Computing Euler’s phi function of an integer number is easy if one knows the factor-
ization of the number. Otherwise it is very hard.
92
Chapter 7
RSA
3. Was patented in the USA (not in the rest of the world) until 2000.
93
7.1 Cryptosystem
Set-up Stage
1. Choose two large primes p and q.
2. Compute n = p · q.
b · a ≡ 1 mod Φ(n).
94
Example:
Alice sends encrypted message (x = 4) to Bob after Bob
sends her the public key.
Alice Bob
(1) choose p = 3; q = 11
(2) n = p · q = 33
(3) Φ(n) = (3 − 1)(11 − 1) = 2 · 10 = 20
(4) choose b = 3; gcd(20, 3) = 1
kpub (3,33)
x=4 ←− (5) a = b−1 = 7 mod 20
y=31
y = xb mod n = 43 = 64 ≡ 31 mod 33 −→ x = y a = 317 ≡ 4 mod 33
95
Why does RSA work?
We have to show that: dkpr (y) = dkpr (ekpub (x)) = x.
dkpr = y a = xba = xab mod n.
a · b ≡ 1 mod Φ(n) ⇐⇒ a · b = 1 + t · Φ(n), where t is an integer
dkpr = xab = xt·Φ(n) · x1 = (xΦ(n) )t · x mod n.
96
7.2 Computational Aspects
Problem: Finding two large primes p, q (for instance, each ≈ 512 bits).
Approach: Choose a random large integer and apply a primality test. In practice, a “Monte
Carlo” test, for instance the Miller-Rabin [Sti02] test, is used. Note that a primality test
does not require factorization, and is in fact enormously faster than factorization.
In practice, the above algorithm is run 3 times (for a 1000 bit prime) and upto 12 times (for
a 150 bit prime) [AM97, Table 4.4 page 148] with different parameters r. If the answer is
always “p is prime”, then p is with very high probability a prime.
97
Question: What is the likelihood that a randomly picked integer p (or q) is prime?
1
Answer: P(p is prime ) ≈ ln(p)
.
1 1
P(p is prime) ≈ ≈
ln(2512 ) 355
This means that on average about 1 in 355 random integers with a length of
512 bit is a prime. Since we can spot even numbers right away, we only have to
generate and test on average 355/2 ≈ 173 numbers before we find a prime of this
bit length.
Conclusion: Primes are relatively frequent, even for numbers with large bit lengths. To-
gether with an efficient primality test, this results in a very practical way of finding random
prime numbers.
98
7.2.2 Choosing a and b
3. Calculate a:
Question: What is t · b mod Φ(n)?
t · b = (−s)Φ(n) + 1
⇒ t · b ≡ 1 mod Φ(n)
Remark:
It is not necessary to find s for the computation of a.
99
7.2.3 Encryption/Decryption
The goal now is to find an efficient way of performing exponentiations with very large num-
bers. Note that all parameters n, x, y, a, b are in general very large numbers1 . Nowadays, in
actual implementations, these parameters are typically chosen in the range of 1024–4096 bit!
The straightforward way of exponentiation:
x, x2 , x3 , x4 , x5 , . . .
does not work here, since the exponents a, b have in actual applications values in the range of
21024 . Straightforward exponentiation would thus require around 21024 multiplications. Since
the number of atoms in the visible universe is estimated to be around 2150 , computing 21024
multiplications for setting up one secure session for our web browser is not too tempting.
The central question is whether there are considerably faster methods for exponentiation
available. The answer is, luckily, yes (otherwise we could forget about RSA and pretty much
all other public-key cryptosystem in use today.) In order to develop the method, let’s look
at some absurdly small example of an exponentiation:
Question: How many multiplications are required for computing x8 ?
Answer: With the straightforward method (x, x2 , x3 , x4 , x5 , x6 , x7 , x8 ) we need 7 multipli-
cations. However, alternatively we can do something much smarter:
2 2 2 4 4 4 8
| · x{z= x};
x x
| · x{z = x }; x
| · x{z = x }.
1. MUL 2. MUL 3. MUL
1
The only exception is the public exponent b, which is often chosen to be a short number, e.g., b = 17.
100
Question: OK, that worked fine, but the exponent 8 is a very special case since it’s a power
of two (23 = 8) after all. Is there are fast way of computing an exponentiation with an
arbitrary exponent? Let’s look at the exponent 13. How many multiplications are required
for computing x13 ?
2 2 3 3 3 6 6 6 12 12 13
Answer: x
| · x{z= x}; x
| · x{z= x}; x
| · x{z = x }; x
| · x{z= x }; x
| · x{z= x }.
SQ MUL SQ SQ MUL
Observation: Apparently, we have to perform squarings (of the current result) and multi-
plying by x (of the current result) in order to achieve the over-all exponentiation.
Question: Is there a systematic way for finding the sequence in which we have to perform
squarings and multiplications (by x) for a given exponent B?
Square-and-multiply Algorithm
101
Example: x13 = x11012 = x(b3 ,b2 ,b1 ,b0 )2
Why the algorithms works becomes clear if we look at a more general form of a 4-bit expo-
nentiation:
Binary representation of the exponent → xB ; B ≤ 15
B = b 3 · 23 + b 2 · 22 + b 1 · 21 + b 0
B = (b3 · 2 + b2 )22 + b1 · 2 + b0 = ((b3 · 2 + b2 )2 + b1 )2 + b0
xB = x((b3 ·2+b2 )2+b1 )2+b0
Step xB
#1 xb3 ·2
#2 (xb3 ·2 · xb2 )
#3 (xb3 ·2 · xb2 )2
#4 (xb3 ·2 · xb2 )2 · xb1
#5 ((xb3 ·2 · xb2 )2 · xb1 )2
#6 ((xb3 ·2 · xb2 )2 · xb1 )2 · xb0
102
Of course, the algorithm also works for general exponents with more than 4 bits. In the
following, the square-and-multiply algorithms is given in pseudo code. Compare this pseudo
code with the verbal description in the first paragraph after the headline Square-and-multiply
Algorithm.
Pl−1
Algorithm [Sti02]: computes z = xB mod n, where B = i=0 bi 2
i
1. z = x
2. FOR i = l − 2 DOWNTO 0:
(a) z = z 2 mod n
(b) IF (bi = 1)
z = z · x mod n
Remark 1: Remember to apply modulo reduction after every multiplication and squaring
operation, in oder to keep the intermediate results small.
Remark 2: Bear in mind that each individual multiplication and squaring involves in
practice a number with 1024 or more bits. Thus, a single multiplication (or squaring)
consists typically of 100s of 32 integer multiplications on a desktop PC (and even more
integer multiplications on an 8 bit smart card processor.)
103
Remark 3 (exponenent lengths in practice): The public exponent b is often chosen to
be a short integer, for instance, the value b = 17 is popular. This makes encryption of a
message (and verification of an RSA signature) a very fast operation. However, the private
exponent a needs to have full length, i.e., the same length as the modulus n, for security
reasons. Note that a short exponent b does not cause a to be short.
104
7.3 Attacks
There have been several attacks proposed against RSA implementations. They typically
exploited weaknesses in the way RSA was implemented rather than breaking the actual RSA
algorithms. The following is a list of attacks against the actual algorihtm that could, in
theory, be exploited. However, the only known method for breaking the RSA algorithm is
by factoring the modulus.
Given y = xb mod n, try all possible keys a; 0 ≤ a < Φ(n) to obtain x = y a mod n. In
practice |K| = Φ(n) ≈ n > 2500 ⇒ impossible.
7.3.4 Factorization of n
Factoring Algorithms:
105
1. Quadratic Sieve (QS): speed depends on the size of n; record: in 1994 factoring of
n =RSA129, log10 n = 129 digits, log2 n = 426 bits.
2. Elliptic Curve: similar to QS; speed depends on the size of the smallest prime factor
of n, i.e., on p and q.
3. Number Field Sieve: asymptotically better than QS; record: in 1999 factoring of
n =RSA155; log10 n = 155 digits; log2 n = 512 bits.
Algorithm Complexity
√
Quadratic Sieve O(e(1+o(1)) ln(n) ln(ln(n)) )
√
Elliptic Curve O(e(1+o(1)) 2 ln(p) ln(ln(p)) )
1/3 (ln(ln(n)))2/3
Number Field Sieve O(e(1.92+o(1))(ln(n)) )
106
7.4 Implementation
Some representative performance numbers:
• Software (Pentium at a few 100MHz): 1024 bit decryption in 43 ms; 1024 bit encryption
with short public exponent in 0.65 ms.
In practice, hybrid systems consisting of public-key and symmetric-key algorithms are com-
monly used:
1. key exchange and digital signatures are performed with (slow) public-key algorithm
2. bulk data encryption is performed with (fast) block ciphers or stream ciphers
107
7.5 Lessons Learned — RSA
• RSA is the most widely used public-key cryptosystems. In the future, elliptic curves
cryptosystems will probably catch up in popularity.
• RSA is mainly used for key transport (i.e., encryption of keys) and digital signatures.
• The public key b can be a short integer. The private key a needs to have the full length
of the modulus.
• Decryption with the long private key is computationally demanding and can be a
bottleneck on small processors, e.g., in mobile applications.
1. Currently, 1024 bit (about 310 decimal digits) numbers cannot be factored.
108
Chapter 8
5. . . . . . .
109
8.1 Some Algebra
Further Reading: [Big85].
8.1.1 Groups
Definition 8.1.1 A group is a set G of elements together with a binary operation
“o” such that:
1. If a, b ∈ G then a ◦ b = c ∈ G → (closure).
2. If (a ◦ b) ◦ c = a ◦ (b ◦ c) → (associativity).
Examples:
1. G= Z = {. . . , −2, −1, 0, 1, 2, . . .}
◦ = addition
(Z, +) is a group with e = 0 and ã = −a
2. G= Z
◦ = multiplication
(Z, ×) is NOT a group since inverses ã do not exist except for a = 1
u − iv
ã = a−1 =
u2 + v 2
110
Definition 8.1.2 “Zn∗ ” denotes the set of numbers i, 0 ≤ i < n, which are relatively
prime to n.
111
Examples:
1. Z9∗ = {1, 2, 4, 5, 7, 8}
2. Z7∗ = {1, 2, 3, 4, 5, 6}
Multiplication Table
∗ mod 9 1 2 4 5 7 8
1 1 2 4 5 7 8
2 2 4 8 1 5 7
4 4 8 7 2 1 5
5 5 1 2 7 8 4
7 7 5 1 8 4 2
8 8 7 5 4 2 1
Theorem 8.1.1 Zn∗ forms a group under modulo n multiplication. The identity ele-
ment is e = 1.
Remark:
The inverse of a ∈ Zn∗ can be found through the extended Euclidean algorithm.
112
8.1.2 Finite Groups
Definition 8.1.3 A group (G, ◦) is finite if it has a finite number of g elements.
We denote the cardinality of G by |G|.
Examples:
Definition 8.1.4 The order of an element a ∈ (G, ◦) is the smallest positive integer
o such that a ◦ a ◦ . . . ◦ a = ao = 1.
∗
Example: (Z11 , ×), a = 3
Question: What is the order of a = 3?
a1 = 3
a2 = 3 2 = 9
a3 = 33 = 27 ≡ 5 mod 11
a4 = 34 = 33 · 3 = 5 · 3 = 15 ≡ 4 mod 11
a5 = a4 · a = 4 · 3 = 12 ≡ 1 mod 11
⇒ ord(3) = 5
113
Definition 8.1.5 A group G which contains elements α with maximum order
ord(α) = |G| is said to be cyclic. Elements with maximum order are called gen-
erators or primitive elements.
∗
Example: 2 is a primitive element in Z11
∗
|Z11 | = |{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}| = 10
a=2
a2 = 4
a3 = 8
a4 = 16 ≡ 5
a5 = 10;
a6 = 20 ≡ 9
a7 = 18 ≡ 7
a8 = 14 ≡ 3;
a9 = 6
a10 = 12 ≡ 1
a11 = 2 = a.
∗
⇒ ord(a = 2) = 10 = |Z11 |
∗
⇒ (1) |Z11 | is cyclic
⇒ (2) a = 2 is a primitive element
∗
Observation (important): 2i ; i = 1, 2, . . . , 10 generates all elements of Z11
i 1 2 3 4 5 6 7 8 9 10
2i 2 4 8 5 10 9 7 3 6 1
114
Some properties of cyclic groups:
∗ ∗
Example: Z11 ; |Z11 | = 10
1. Φ(10) = (2 − 1)(5 − 1) = 1 · 4 = 4
2. a = 3 → 310 = (35 )2 = 12 = 1
3. homework . . .
8.1.3 Subgroups
Definition 8.1.6 A subset H of a group G is called a subgroup of G if the elements
of H form a group under the group operation of G.
?
Example: G = Z11 :
31 = 3
32 = 9
33 ≡ 5
34 ≡ 4
35 ≡ 1
⇒ 3 is a generator of H = {1, 3, 4, 5, 9} which is a subgroup of G
115
H
8
1 3
9
4 5
7 2
G 6
10
?
Figure 8.1: Subgroup H of G = Z11
Multiplication Table
H 1 3 4 5 9
1 1 3 4 5 9
3 3 9 1 4 5
4 4 1 5 9 3
5 5 4 9 3 1
9 9 5 3 1 4
ord(α) = t
116
?
Example (1): ord(3) = 5 in Z11
⇒ 3 generates H with 5 elements.
Remarks:
?
Example (2): |Z11 | = 10
⇒ possible subgroup orders: 1, 2, 3 (,10).
{1} α=1
{1, 10} α = 10
{1, 3, 4, 5, 9} α = 3, 4, 5, 9
117
8.2 The Generalized DL Problem
Given a cyclic subgroup (G, ◦) and a primitive element α. Let
i
β=α
| ◦ α{z. . . α} = α
i times
be an arbitrary element in G.
General DL Problem:
Given G, α, β = αi , find i.
i = logα (β)
Examples:
i 1 2 3 4 5 6 7 8 9 10 11
2i 2 4 6 8 10 1 3 5 7 9 0
Let i = 7: β = 7 · 2 ≡ 3 mod 11
Question: given α = 2, β = 3 = i · 2, find i
Answer: i = 2−1 · 3 mod 11
Euclid’s algorithm can be used to compute i thus this example is NOT a one-way
function.
∗
2. (Z11 , ×); α = 2; β = 2| · 2 ·{z. . . · 2} = 2i
i times
β = 3 = 2i mod 11
Question: i = log2 (3) = log2 (2i ) = ?
Very hard computational problem!
118
8.3 Attacks for the DL Problem
1. Brute force:
check:
?
α1 = β
?
α2 = β
..
.
?
αi = β
Complexity: O(|G|) steps.
p−1
Example: DL in Zp∗ ≈ 2
tests
minimum security requirement ⇒ p − 1 = |G| ≥ 280
3. Pohlig-Hellman algorithm:
Let |G| = p1 · p2 · · · pl
|{z}
√ largest prime
Complexity: O( pl ) steps.
Example: DL in Zp∗ : pl of (p − 1) must be ≥ 2160
minimum security requirement ⇒ pl ≥ 2160
4. Index-Calculus method:
Further reading: [AM97].
Applies only to Zp∗ and Galois fields GF(2k )
√
Complexity: O (e(1+O(1)) ln(p) ln(ln(p)) ) steps.
Example: DL in Zp∗ : minimum security requirement ⇒ p ≥ 21024
Remark: Index-Calculus is more powerful against DL in Galois Fields GF(2k ) than
119
against DL in Zp∗ .
120
8.4 Diffie-Hellman Key Exchange
Remarks:
8.4.1 Protocol
Set-up:
Protocol:
Alice Bob
pick kprA = aA ∈ {2, 3, . . . , p − 1} pick kprB = aB ∈ {2, 3, . . . , p − 1}
compute kpubA = bA = αaA mod p compute kpubB = bB = αaB mod p
bA
−→
bB
←−
kAB = baBA = (αaB )aA kAB = baAB = (αaA )aB
121
8.4.2 Security
Diffie-Hellman Problem:
Given bA = αaA mod p, bB = αaB mod p, and α find αaA ·aB mod p.
Note:
There is no proof that the DL problem is the only solution to the D-H problem!
However, it is conjectured.
122
8.5 Lessons Learned — Diffie-Hellman Key Exchange
• The Diffie-Hellman protocol is a widely used method for key exchange. It is based on
cyclic groups.
• In practice, the multiplicative group of the prime field Zp or the group of an elliptic
curve are most often used.
• If the parameters are chosen carefully, the Diffie-Hellman protocol is secure against
passive (i.e., attacker can only eavesdrop) attacks.
• For the Diffie-Helmann protocol in Zp? , the prime p should be at least 1024 bit long.
This provides a security roughly equivalent to an 80 bit symmetric cipher. For a better
long term security, a prime of length 2048 bit should be chosen.
123
Chapter 9
Further Reading:
Chapter 6 in [Kob94].
Book by Alfred Menezes [Men93].
Remarks:
• It is believed to be more secure than RSA/DL in Zp∗ , but uses arithmetic with much
shorter numbers (≈ 160 – 256 bits vs. 1024 – 2048 bits).
Drawbacks:
124
9.1 Elliptic Curves
Goal: To find another instance for the DL problem in cyclic groups.
Question: What is the equation x2 + y 2 = r 2 over reals?
Answer: It is a circle.
y
r2
Note:
There are only certain points (x,y) which fulfill the equation. For example the
point (x = r, y = 1) fulfills the equation of a circle.
125
Definition 9.1.1 The elliptic curve over Zp , p > 3, is a set of all pairs (x, y) ∈ Zp
which fulfill:
y 2 ≡ x3 + a · x + b mod p
where
a, b, ∈ Zp
and
4 · a3 + 27 · b2 6= 0 mod p
Q+Q=2Q
Q
P
x
P+Q
Goal: Finding a (cyclic) group (G, ◦) so that we can use the DL problem as a one-way
function.
We have a set (points on the curve). We “only” need a group operation on the points.
126
Group G: Points on the curve given by (x, y).
Operation ◦: P + Q = (x1 , y1 ) + (x2 , y2 ) = R = (x3 , y3 ).
a) P 6= Q → line through P and Q and mirror point of third interception along the x-axis.
x3 = λ2 − x1 − x2 mod p
y3 = λ(x1 − x3 ) − y1 mod p
where
y2 −y1
x2 −x1
mod p ; if P 6= Q
λ=
3x21 +a
2y1
mod p ; if P = Q
Remarks:
• Additive inverse of any point (x, y) = P is P +(−P ) = O such that (x, y)+(x, −y) = O.
127
Remark: Under certain conditions all points on an elliptic curve form a cyclic group as
the following example shows.
128
9.2 Cryptosystems
1. Choose E: y 2 ≡ x3 + a · x + b mod p.
Protocol:
Alice Bob
choose kprA = aA ∈ {2, 3, . . . , #E − 1} choose kprB = aB ∈ {2, 3, . . . , #E − 1}
compute kpubA = bA = aA · α = (xA , yA ) compute kpubB = bB = aB · α = (xB , yB )
b
A
−→
b
B
←−
compute aA · bB = aA · aB · α = (xk , yk ) compute aB · bA = aB · aA · α = (xk , yk )
kAB = xk ∈ Zp kAB = xk ∈ Zp
Security:
Oscar knows: E, p, α, bA = aA · α, bB = aB · α
Diffie-Hellman problem for elliptic curves
Oscar wants to know: kAB = aA · aB · α
2. Compute aA · bB = aA · aB · α.
129
Attacks:
• Only possible attacks against elliptic curves are the Pohlig-Hellman scheme together
with Shank’s algorithm or Pollard’s-Rho method.
⇒ #E must have one large prime factor pl
⇒ 2160 ≤ pl ≤ 2250 .
• For supersingular elliptic curves over GF(2n), DL in elliptic curves can be solved by
solving DL in GF(2k·n); k ≤ 6.
⇒ stay away from supersingular curves despite of possible faster implementations.
Set-up:
1. Choose E: y 2 ≡ x3 + a · x + b mod p.
4. Compute a · α = β = (xβ , yβ ).
130
Encryption:
Decryption:
1. Compute a · Y0 = (c1 , c2 ).
a · Y0 = a · k · α = k · β = (c1 , c2 ).
(x1 , x2 ).
# bits y 4dlog2 pe
= =2
# bits x 2dlog2 pe
9.3 Implementation
1. Hardware:
• Approximatly 0.2 msec for an elliptic curve point multiplication with 167 bits on
an FPGA [OP00].
2. Software:
• One elliptic curve point multiplication a · P in less than 10 msec over GF(2155 ).
131
Chapter 10
10.1 Cryptosystem
Remarks:
• Published in 1985.
Principle:
Alice Bob
choose private key kprA = aA choose private key kprB = aB
compute kpubA = αaA mod p = bA compute kpubB = αaB mod p = bB
b
A
−→
b
B
←−
kAB = baBA = αaA aB mod p kAB = baAB = αaB aA mod p
y
y = x · kAB mod p −→
−1
x = y · kAB mod p
132
ElGamal:
Set-up:
4. Compute β = αa mod p,
Public Key: Kpub = (p, α, β),
Private Key: Kpr = (a).
Encryption:
2. Y1 = αk mod p.
3. Y2 = x · β k mod p.
Decryption:
133
Question: How does the ElGamal scheme work?
Protocol:
Alice Bob
message x < p set-up phase steps 1-4
kpub = (p, α, β)
kpr = (a)
kpub =(p,α,β)
←−
choose k ∈ {2, 3, · · · , p − 2}
Y1 = αk mod p
Y2 = x · β k mod p
(Y1 ,Y2 )
−→
x = Y2 (Y1a )−1
134
Remarks:
• ElGamal is non-deterministic.
10.2.1 Encryption
Y1 = αk mod p
apply the square-and-multiply for exponentiation
Y2 = x · β k mod p
10.2.2 Decryption
be = bq(p−1)+r = (bp−1 )q · br
= 1q · br mod p
= br mod p
⇒ e = r mod (p − 1)
135
Thus, be ≡ be mod (p−1)
mod p, where b ∈ Zp∗ and e ∈ Z
−a mod (p−1)
(Y1a )−1 = Y1−a = Y1 mod p
= Y1p−1−a mod p
2. Y2 · β −k = x ← easy.
• In both cases Oscar has to compute the DL problem in finite fields (Zp∗ or GF(2k )).
He can use index-calculus method which forces us to implement schemes with at least
1024 bits.
136
Chapter 11
Digital Signatures
Protocols use:
• Symmetric-key algorithms.
• Public-key algorithms.
• Digital Signatures.
• Hash functions.
as building blocks. In practice, protocols are often the most vulnerable part of a cryp-
tosystem. The following chapters deal with digital signature, message authentication codes
(MACs), and hash functions.
137
11.1 Principle
The idea is similar to a conventional signature on paper: Given a message x, a digital
signature ist appended to the message. As with conventional signatures, only the person
who sends the message must be capable of generating a valid signature. In order to achieve
this with cryptography, we make the signature a function of a private key, so that only the
holder of the private key can sign a message. In order to make sure that a signature changes
with each document, we also make the signature a function of the message that is being
signed.
message x
The main advantage which digital signatures have is that they enable communication parties
to prove that one party has actually generated the message. Such a “proof” can even have
legal meaning, for instance as in the German Signaturgesetz (signature law.)
138
message space
signature space
sig (x) = y
Kpr
y
x
true if y = sig(x)
ver (x, y)=
Kpub false if y == sig(x)
Basic protocol:
3. Alice runs the verification function verkpub (x, y) with Bob’s public key.
139
Properties of digital signatures:
• Integrity: Message x cannot be altered since that would be detected through verifica-
tion.
• Non-repudiation: The receiver of the message can prove that the sender had actually
send the message.
It is important to note that the last property, sender non-repudiation, can only be achieved
with public-key cryptography. Sender authentication and integrity can also be achieved via
symmetric techniques, i.e., through message authentication codes (MACs).
140
11.2 RSA Signature Scheme
Set-up: kpr = (p, q, a); kpub = (n, b).
General Protocol:
3. Alice verifies:
= x ⇒ true
verkpub (x, y) = dkpub (y) = y b
6= x ⇒ false
Remark:
• The role of public/private key are exchanged if compared with RSA public-key encryp-
tion.
141
Drawback/possible attack:
Oscar can generate a valid signature for a random message x:
1. Choose signature y ∈ Zn .
The attack above can be prevented by formatting rules for the message x. For instance, a
simple rule could be that the first and last 100 bits of x must all be zero (or one or any
other specific bit pattern.) It is extremely unlikely that a random message x shows this bit
pattern. Such a formatting scheme imposes a rule which distinguishes between valid and
invalid messages.
142
11.3 ElGamal Signature Scheme
Remarks:
143
Set-up:
1. Choose a prime p.
4. Compute β = αa mod p.
Public key: kpub = (p, α, β).
Private key: kpr = (a).
Signing:
2. Compute signature:
γ = αk mod p
δ = (x − a · γ)k −1 mod p − 1
Public verification:
γ δ
= αx mod p valid signature
verkpub (x, (γ, δ)) = β · γ
6= αx mod p invalid signature
−1
β γ · γ δ = (αa )γ (αk )(x−a·γ)k mod (p−1)
mod p
−1 (x−a·γ)
= αa·γ · αk·k mod p
= αa·γ−a·γ+x = αx
144
11.4 Lessons Learned — Digital Signatures
• Digital signatures provide message integrity, sender authentication, and non-repudiation.
• RSA is the currently most widely used digital signature algorithm. Competitors are the
Digital Signature Standard (DSA) and the Elliptic Curve Digital Signature Standard
(ECDSA.)
• RSA verification can be done with short public keys b, whereas the signature key a
must have the full length of the modulus n. Hence, RSA verification is fast and signing
is slower.
• RSA digital signature is almost identical to RSA encryption, except that the private
key is applied first to the message (signing), and the public key is applied to the signed
message in the second step (verification.)
• As with RSA encryption, the modulus n should be at least 1024 bit long. This provides
a long-term security roughly equivalent to an 80 bit symmetric cipher. For a better
long-term security, a prime of length 2048 bit should be chosen.
145
Chapter 12
introduces
errors and Channel
eavesdropping
146
Source Coding (Data Compression) Most data, such as text, has redundancy in it.
This means the standard representation of the message, e.g., English text, uses more
bits than necessary to uniquely represent the message contents. Source coding tech-
niques extract the redundany and, thus, reduce the message length.
Encryption (Reminder: Pretty much everything in these lecture notes) The goal of encryp-
tion is to disguise the contents of a message. Only the owner of cryptographic keys
should be able to recover the original content. Encryption can be viewed as a form of
coding.
Channel Coding (Error Coding) The purpose of channel codes is to make the data ro-
bust against errors introduced during transmission over the channel.
147
PSfrag replacements
Alice Bob
M M̂ M
encode channel decode
The goal of channel codes it to make the data robust against errors on the transmission
channel. The basic idea of channel codes is to introduce extra information (i.e., to add
extra bits) to the data before transmission, so that there is a certain functional relationship
between the message bits and the extra bits.
PSfrag replacements
extra
message M
bits
M̂
If this is done in a smart way it is unlikely (but not impossible) that a random error during
data transmission changes the bits in such a way that the relationship between the bits
are destroyed. The receiver (Bob) checks for the relationship between the message bits and
the extra bit. Note that channel coding adds redundancy to the message, i.e., makes the
transmitted data longer, which is the opposite of source coding.
148
There are two types of channel codes:
In the remainder of this chapter, only error detection codes are discussed.
10010101........................01 1
M P
From this, the construction of P for a given message follows trivally as:
l
X
P ≡ mi mod 2
i=1
A consequence of this coding scheme is that the number of bits that have the value “1” in a
message together with the parity bit is always even. Hence, this coding scheme is also called
even parity.
149
Example: Transmission of the ASCII character “s”= 1010 011
parity bit P = 0
transmitted (M |P ) = 1010 0110
received (M |P )0 = 1010 0010 (bit error in position 6)
error will be detected since the mod 2 sum of the bits received is not equal to 0.
• they detec all odd number of bit errors, i.e., single bit errors, 3-bit errors, 5-bit errors,
...
150
The functional relationship between the message (the first 9 digits) and the check sum is the
following. Let
M = m10 , m9 , . . . , m2
M CRC
m r
n=m+r
Encoding:
151
Example:
(x13 +x12 +x10 +x8 +x7 +x5 +x4 ) : (x4 +x +1) = x9 + x8 + x3 + x + H(x)/G(x)
−(x13 +x10 +x9 )
(x12 +x9 +x8 +x7 +x5 +x4 )
−(x12 +x9 +x8 )
(x7 +x5 +x4 )
−(x7 +x4 +x3 )
(x5 +x3 )
−(x5 +x2 +x)
(x3 +x2 +x) = H(x)
Remark:
Decoding: Divide the received polynomial R(x) by G(x). If the remainder is not zero, an
error occured. Otherwise, we assume no error occured:
(
=0 error free
R(x) mod G(x) =
6= 0 error occured
• Channel: R(x) = T (x) + E(x) (all bits at error positions are flipped)
Example: R(x) = x13 + x12 + x10 + x7 + x5 + x4 + x3 + x2 + x + 1
152
• Decoder:
153
Chapter 13
Hash Functions
13.1 Introduction
The problem with digital signatures is that long messages require very long signatures. We
would like for performance as well as for security reasons to have one signature for a message
of arbitrary length. The solution to this problem are hash functions.
Note: There are many other applications of hash functions in cryptography beyond digital
signatures. In particular, hash functions have become very popular for message authentica-
tion codes (MACs.)
154
x
x x is of arbitrary length
zi = h ( xi ||zi-1 )
z z is of fixed length
sig (z)
kpr
Remarks:
• h(x) is public.
Basic Protocol:
Alice Bob
1) z = h(x)
2) y = sigkpr (z)
3) (x,y)
←−
4) z = h(x)
5) verkpub (z, y)
155
Naı̈ve approach: Use of error detection codes as hash functions
Principle of error correction codes: Given a message x, the sender computes f (x), where
f () is a publically known function and sends x||f (x). The receiver obtains x0 and checks
?
f (x0 ) = f (x).
Sender Receiver
1) x
2) compute f (x)
3) (x,f (x))
←−
4) check if f (x0 ) = f (x)
Important: Error detection codes are designed to detect errors introduced during trans-
mission on the channel, e.g., errors due to noise.
Let’s try to use a column-wise parity check code. In this method, we compute an even parity
check bit for every bit position. An even parity bit is defined such that the sum of all bits
in the column is “1” if the XOR sum of all column bits is “1”, and “0” if the XOR of sum
of all column bits is “0”.
E.g., consider a text x = (x1 , x2 , ..., xl ) consisting of ASCII symbols xi . We can compute the
parity bits P = (p1 , p2 , ..., pl ) by bitwise XOR of the column entries:
x1 = 00101010
⊕ x2 = 01010011
..
.
⊕ xl = 11101000
156
157
The problem with error detection codes is, that they were designed to detect random errors,
and not “errors” introduced by an intelligent opponent such as Oscar. Oscar can easily alter
the message and additionally alter the parity bits such that the changes go undetected.
Discussion:
• (4) if h(x) is not one-way, Oscar can compute x from h(x) in cases where x is encrypted.
158
• (5) if h(x) is not weak collision free, Oscar can replace x with x0 .
• (6) if h(x) is not strong collission free, Oscar runs the following attack:
b) Alter x1 and x2 at “non-visible” location, i.e. replace tabs through spaces, append
returns, etc., until h(x01 ) = h(x02 ) (Note: e.g. 64 alteration locations allow 264
versions of a message with 264 different hash values).
159
Question: Why is there no collision free hash function?
Answer: There exist far more x than z!
Z
h(x)
PSfrag replacements X
h(x) is the map from X to Z, where |X| >> |Z| with x ∈ X, z ∈ Z. A minimum in possible
collisions is the objective of any hash function. The function h(x) (and the size of Z) has to
assure that a construction of two different x’s with the same hash value z is time-consuming
and, thus, not feasible.
160
13.2 Security Considerations
Question: How many people are needed at a party so that there is a 50% chance that at
least two people have the same birthday?
Thus,
k−1
Y i 1 2 3 k−1
P (no collision) ≈ e− n = e − n e− n e− n · · · e− n
i=1
k−1
Y i 1+2+3+···+k−1
e− n = e − n
i=1
Rewriting the exponent with the help of the following identity:
1 + 2 + 3 + · · · + k − 1 = k(k − 1)/2
We obtain,
k(k−1)
P (no collission) ≈ e− 2n
Define as
DEF k(k−1)
P (at least one collission) = ≈ 1 − e− 2n
k(k−1)
1− ≈ e− 2n
k(k − 1)
ln (1 − ) ≈ −
2n
1
k(k + 1) ≈ −2n ln (1 − ) = 2n ln
1−
161
If k >> 1, then
2 1
k ≈ k(k − 1) ≈ 2n ln
s
1−
1
k ≈ 2n ln
1−
Example: s
1
√ √ √
k( = 0.5) ≈ 2n ln = 2 ln 2 n = 1.18 n
1 − 0.5
√
⇒ A collission in a set of n values is found after about n trials with a probability of 0.5.
In other words, given a hash function with 40 bit output ⇒ collission after approximately
√
240 = 220 trials.
⇒ In order to provide collision resistance in practice, the output space of the hash function
should contain at least 2160 elements, that is, the hash function should have at least 160
√
output bits. Finding a collision takes then roughly 2160 = 280 steps.
162
13.3 Hash Algorithms
Overview:
Hash Algorithms
a) MD4–family
1. SHA-1
2. RIPE-MD 160
163
b) Hash functions from block ciphers
xi
H i-1 m
g e
K Hi = e g(H )
( xi ) xi
i-1
Hi
where g is a simple n-to-m bit mapping function (if n = m, g can be the identity
mapping)
Last output Hl is the hash of the whole message x1 ,x2 ,. . .,xl
Remark:
For block ciphers with less than 128 bit block length, different techniques
must be used (Sec. 9.4.1 (ii) in [AM97])
164
13.4 Lessons Learned — Hash Functions
• Hash functions are key-less. They serve as auxiliary functions in many cryptographic
protocols.
• Among the two most important applications of hash functions are: support function
for digital signatures and core function for building message authentication codes, e.g.,
HMAC.
• Hash functions should have at least 160 bit output length in order to withstand collision
attacks. 256 or more bits are better.
165
Chapter 14
Message authentication codes are widely used in practice for providing message integretiy
and message authentication in cases where the two communication parties share a secret key.
MACs are much faster than digital signatures since they are based on symmetric ciphers or
hash functions.
166
14.1 Principle
Similar to digital signatures, MACs append an “authentication tag” to a message. The main
difference is that MACs use a symmetric key on both the sender and receiver side.
message space
"signing"
signature space
MACK (x)
y
x
?
MACK (x) = y ; verification
Protocol:
Alice Bob
1) y = MACK (x)
(x,y)
2) ←−
3) y 0 = MACK (x)
?
y0 = y
Note: For MAC verification, Alice performs exactly the same steps that Bob used for
generating the MAC. This is quite different from digital signatures.
167
Properties of MACs:
168
14.2 MACs from Block Ciphers
MAC generation: Run block cipher in CBC mode
y0 = ek (x0 ⊕ IV ) = ek (x0 ⊕ 0000 . . .)
yi = ek (xi ⊕ yi−1 )
X = x0 , x1 , . . . , xm−1
M ACk (x) = ym−1
i=1 IV i=1 IV
Y i-1 Y i-1
Y i-1 Y i-1
i=n
X n , ... , X2 , X 1 Y n X n , ... , X2 , X 1 Yi
e e
Y’n
?
k k
Yn
X n , ... , X2 , X 1
MAC Verification: Run the same process that was used for MAC generation on the
receiving end.
169
14.3 MACs from Hash Functions: HMAC
• Popular in modern protocols such as SSL.
• Basic idea: Hash a secret key K together with the message M and consider the hash
output the authentication tag for the message: H(K||M ).
• Details
where
K + = K padded with zeros on the left so that the result is b bits in length (where b
is the number of bits in a block).
170
14.4 Lessons Learned — Message Authentication Codes
• MACs provide the two security services message integrity and message authentication
using symmetric techniques. MACs are widely used in protocols in practice.
• Both of these services are also provided by digital signatures but MACs are much
faster.
171
Chapter 15
Security Services
Information Information
source destination
(e) Fabrication
172
Remarks:
15.2 Introduction
Security Services are goals which information security systems try to achieve. Note that
cryptography is only one module in information security systems.
• Non-repudiation. Ensures that the sender of a message can not deny the creation of
the message.
Remark: Message Authentication implies data integrity; the opposite is not true.
15.3 Privacy
Tool: Encryption algorithm.
173
e Y dk
X k X
k k
a) Symmetric-Key
Provides:
−privacy
−message authentication and thus
only if Bob can distinguish
−integrity between valid and invalid X
−no non-repudiation
and if there are only two parties.
Remark:
b) Public-Key
e Y dkpr_B
X kpub_B X
ekpub_B (x)
kpub_B kpr_B
Provides:
- privacy
- no message authentication
174
15.4 Integrity and Sender Authentication
Recall: Sender authentication implies integrity.
(x, y) (x, y)
x x
y
y = sig (h(x)) x
Kpr_A
x h(x) sig h(x) ver true / false
Kpr_A Kpub_A
Provides:
- integrity
- sender authentication
15.4.2 MACs
(x, y) (x, y)
x x
K K
Provides:
175
- integrity
- authentication
- no non-repudiation
(x, y)
x e d
eK (x, y)
y
y x compare
x K
h(x) K h(x)
y’
Provides:
- privacy
- integrity
- authentication
- no non-repudiation
Remark:
• Instead of hash functions, MACs are also possible. In this case: c = eK1 (x, MACK2 (y)).
176
Chapter 16
Key Establishment
16.1 Introduction
Remark:
Some schemes make use of trusted authority (TA) which is trusted by and can
communicate with all users.
177
16.2 Symmetric-Key Approaches
Example: n = 4 users.
KAB KAC KAD KAB KBC KBD
A B
TA secure channels
D C
KAD KBD KCD KAC KBC KCD
Drawbacks:
n(n−1) n2
• TA must generate 2
≈ 2
keys
• every new network user makes updates at all other user as of necessary ⇒ scales badly
178
16.2.2 Key Distribution Center (KDC)
TA is a KDC: TA shares secret key with each user and generates session keys.
a) Basic protocol:
- kA,KDC = secret key between Alice and KDC (Key encryption key, KEK)
- kB,KDC = secret key between Bob and KDC (Key encryption key, KEK)
Remarks:
179
16.3 Public-Key Approaches
Protocol:
Alice Bob
pick kprA = aA ∈ {2, 3, . . . , p − 2} pick kprB = aB ∈ {2, 3, . . . , p − 2}
compute kpubA = bA = αaA mod p compute kpubB = bB = αaB mod p
b
A
−→
b
B
←−
kAB = baBA = αaA aB mod p kAB = baAB = αaA aB mod p
Security:
1. passive attacks
⇒ security relies on Diffie-Hellman problem thus p > 21000 .
2. active attack
⇒ Man-in-the-middle attack:
180
Remarks:
16.3.2 Certificates
2. MACs (symmetric)
Alice Bob
(x,y)
y = sigKprA (x) −→ verKpubA (x, y)?
181
Sfrag replacements
RQST
Bob
ID(B), KpubB
RQST
CA
ID(A), KpubA C(B) = sigKprCA (ID(B), KpubB )
Alice
1. Each user U :
General requirement: all users have the correct verification algorithm verT A with TA’s public
key.
Remarks:
• Certificate structures are specified in X.509, authentication services for the X.500 di-
rectory recommendation (CCITT).
182
ID(U)
KprU
Version
Serial Number
Algorithm Identifier:
- Algorithm
- Parameters
Issuer
Period of Validity:
- Not Before Date
- Not After Date
Subject
Signature
183
16.3.3 Diffie-Hellman Exchange with Certificates
Idea: Same as standard Diffie-Hellman key exchange, but each users’s public key is authen-
ticated by a certificate.
Alice Bob
KpubA = bA KpubB = bB
KprA = aA KprB = aB
C(B)=(ID(B),bB ,sigCA (ID(B),bB ))
←−
C(A)=(ID(A),bA ,sigCA (ID(A),bA ))
−→
1.) verCA (ID(B), bB ) 1.) verCA (ID(A), bA )
2.) kAB = baBA = αaB aA = αaA aB 2.) kAB = baAB = αaA aB
Question: Does Oscar have any further possibilities for an attack?
Answer:
1. Oscar impersonates Alice to obtain a certificate of the CA (with his key but Alices’
identity)
KpubCA
KpubCA KpubCA
KpubO
184
16.3.4 Authenticated Key Agreement
Idea: Alice and Bob sign their own public keys. Signatures can be correctly verified through
certificates.
Set-up:
• public prime p
Protocol:
Alice TA Bob
C(A)=(ID(A),verA ,sigT A (ID(A),verA ))
←−
C(B)=(ID(B),verB ,sigT A (ID(B),verB ))
−→
1.) kprA = aA
b
2.) kpubA = bA = αaA mod p A
−→
3.) kprB = aB
4.) kpubB = bB = αaB mod p
5.) kAB = baAB = αaA aB mod p
(C(B),bB ,yB )
←− 6.) yB = sigB (bB , bA )
7.) verT A (C(B)): true/false
8.) verB (yB ): true/false
9.) kAB = baBA = αaA aB mod p
(C(A),yA )
10.) yA = sigA (bA , bB ) −→
11.) verT A (C(A)): true/false
12.) verA (yA ): true/false
Remark:
This scheme is also known as station-to-station protocol and is the basis for
ISO 9798-3.
185
Chapter 17
Note:
This chapter describes the most important security mechanisms of the SSL Pro-
tocol. For more details references [Sta02] and Netscape’s SSL web page are
recommended.
17.1 Introduction
• SSL was developed by Netscape.
• TLS (Transport Layer Security) is the IETF standard version of SSL. TLS is very close
to SSL.
• SSL is algorithm independent: for both public-key and symmetric-key operations, sev-
eral algorithms are possible. Algorithms are negotiated on a per-session basis.
186
HTTP FTP SMTP
SSL or TLS
TCP
IP
Handshake Protocol : provides shared secret key using public-key techniques and
mutual entity authentication.
187
17.2 SSL Record Protocol
The SSL Record Protocol provides two main services:
1. Confidentiality: SSL payloads are encrypted with a symmetric cipher. The keys are for
the symmetric cipher and they must be established during the preceding handshake
protocol.
2. Message Integrity: the integrity of the message is provided through HMAC, a message
authentication code.
Application data
Fragment
Add MAC
Encrypt
Append SSL
record header
Description:
• MAC: a derivative of the popular HMAC message authentication code. HMACs are
based on hash functions.
188
where:
H = hash algorithm; either MD5 or SHA-1.
secret-key = shared secret session key.
pad1 = the byte 0x36 (0011 0110) repeated 48 times (384 bits) for MD5 and 40
times (320 bits) for SHA-1.
pad2 = the byte 0x5C (0101 1100) repeated 48 times for MD5 and 40 times for
SHA-1.
seq-num = the sequence number of the message.
fragment-length = length of the fragment (plaintext).
fragment = the plaintext block for which the MAC is computed.
1. Block ciphers:
2. Stream ciphers:
189
17.3 SSL Handshake Protocol
Remark: Most complex part of SSL, requires costly public-key operations
CLIENT SERVER
PHASE 1
random, cipher suite
certificate
PHASE 2
certificate
PHASE 3
Explanation:
190
(a) RSA: the secret key is encrypted with the receiver’s public RSA-
key. Certificates are required.
(d) Fortezza
Certificate : authenticated public key for any key exchange method except
anonymous Diffie-Hellman.
191
Chapter 18
192
Overview:
ID techniques
⇒ passwords and PINs are weak since they violate requirement 1 below.
To achieve these goals, Alice has to perform a proof of knowledge which in general involves
a challenge-and-response protocol.
193
18.1 Symmetric-key Approach
Challenge-and-response (CR) protocol:
Assumption: Alice and Bob share a secret key kAB and a keyed one-way function f (x).
Alice Bob
1) generate challengex
x
←−
y
2) y = fkAB (x) −→
3) y 0 = fkAB (x)
?
4) verification: y = y 0
Example:
a) fk (x) = DESk (x).
b) fk (x) = H(k||x).
c) fk (x) = xk mod p.
Remarks:
• There are many variations to the above protocol, e.g., including time stamps or serial
numbers in the response.
• Instead of block ciphers, public-key algorithms and keyed hash functions can be used.
194
Alice Bob
1) y = ekAB (T S, ID(Bob))
y
−→
2) (T S 0 , ID 0 (Bob) = e−1
kAB (y)
? ?
T S ≤ time ≤ T S +
195
Bibliography
[AM97] S.A. Vanstone A.J. Menezes, P.C. Oorschot. Handbook of Applied Cryptography.
CRC Press, 1997.
[Big85] N.L. Biggs. Discrete Mathematics. Oxford University Press, New York, 1985.
[DR98] J. Daemen and V. Rijmen. AES Proposal: Rijndael. In First Advanced Encryp-
tion Standard (AES) Conference, Ventura, California, USA, 1998.
[Kah67] D. Kahn. The Codebreakers. The Story of Secret Writing. Macmilian, 1967.
[LTG+ 02] A. K. Lutz, J. Treichler, F.K. Gurkaynak, H. Kaeslin, G. Basler, A. Erni, S. Re-
ichmuth, P. Rommens, S. Oetiker, , and W. Fichtner. 2Gbit/s Hardware Realiza-
196
tions of RIJNDAEL and SERPENT: A comparative analysis. In Çetin K. Koç
Burt Kaliski and Christof Paar, editors, Proceedings of the Fourth Workshop
on Cryptographic Hardware and Embedded Systems (CHES), Berlin, Germany,
August 2002. Springer-Verlag.
[Men93] A.J. Menezes. Elliptic Curve Public Key Cryptosystems. Kluwer Academic Pub-
lishers, 1993.
[Sch96] B. Schneier. Applied Cryptography. John Wiley & Sons Inc., New York, New
York, USA, 2nd edition, 1996.
[Sta02] W. Stallings. Cryptography and Network Security: Principles and Practice. Pren-
tice Hall, Upper Saddle River, New Jersey, USA, 3rd edition, 2002.
[Sti02] D. R. Stinson. Cryptography, Theory and Practice. CRC Press, 2nd edition,
2002.
[WPR+ 99] D. C. Wilcox, L. Pierson, P. Robertson, E. Witzke, and K. Gass. A DES ASIC
Suitable for Network Encryption at 10 Gbps and Beyond. In Ç. Koç and C. Paar,
197
editors, Workshop on Cryptographic Hardware and Embedded Systems - CHES
’99, volume LNCS 1717, pages 37–48, Worcester, Massachusetts, USA, August
1999. Springer-Verlag.
198