Set Theory Math
Set Theory Math
Set Theory Math
JEAN-LOUIS KRIVINE
TRANSLATED BY: CHRISTIAN ROSENDAL
0. Typesetter’s Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
3. Relativization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.1. Consistency of the Axiom of Foundation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.2. Inaccessible Ordinals and Models of ZFC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.3. The Reflection Scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4. Formalizing Logic in U . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
4.1. Model Theory for U-formulas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
7. Forcing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
7.1. Generic Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
7.2. Mostowski Collpase of a Well-founded Relation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
7.3. Construction of Generic Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
7.4. Definition of Forcing (Strong Forcing) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
7.5. The Truth Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
7.6. Consistency of ZFC + ¬CH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
SET THEORY AND FORCING 1
0. Typesetter’s Introduction
These notes provide a great introduction to axiomatic set theory and topics therein appropriate
for a first class for a graduate or upper level undergraduate student. I was taught set theory by
Professor Anush Tserunyan at the university of Illinois at Urbana-Champaign in the Spring of 2018
following mostly these notes. My goal in TeXing these notes is to stick to Rosendal’s original
handwritten translation of Krivine’s notes as closely as possible so as to represent them well and
avoid making mistakes myself. However, I have taken the liberty in a couple sections to change
minor details. These include things like grammar or notation (for example, when discussing forcing,
Professor Tserunyan took the liberty of underlining variable names as opposed to barring the top,
so as to be able to write vectors as well when dealing with multiple variables, and I also employ
this). The overall message of these notes is unchanged. Where I include notes of my own, I precede
them with TN:, for Typesetter’s Note. I am also greatly indebted to both Professor Tserunyan
and Ms. Jenna Zomback. Without Professor Tserunyan, I would not have any good grasp of the
topics herein, and Ms. Zomback provided the last set of notes that were mysteriously missing from
both Rosendal’s handwritten pdf and my own notebook. If you happen to notice any typos, issues,
compliments, or complaints with these notes, please feel free to email me at schirlem (at) uci (dot)
edu.
2 KRIVINE
That is, two sets with the same members are identical.
Definition 1.2. Given sets x and y, we can define the ordered pair or couple {{x}, {x, y}} by
repeated application of the pairing axiom. We denote this by (x, y).
Lemma 1.3. If x, y, a, b are sets and (x, y) = (a, b), then x = a and y = b.
Proof.
If x = y, then (x, y) = {{x}, {x, y}} = {{x}, {x}} = {{x}} has only a single element, implying
that also (a, b) = {{a}, {a, b}} has a single element which must be {a} = {x}. Thus, a = b =
x = y.
If x 6= y, then (x, y) has two elements, namely, a singleton {x} and a doubleton {x, y}. It
follows that also (a, b) must contain a unique singleton, namely {a}, and a unique doubleton,
namely {a, b}. From the uniqueness and the axiom of extensionality, a = x and then also b = y.
aTN: For those more familiar with model theory, (U , ∈) will be a model of ZFC, and objects within this model
will be sets. It will take us a while to fully describe the theory ZFC, as we want the reader to have an appreciation
for each of the axioms therein.
SET THEORY AND FORCING 3
Definition 1.4. For any sets x1 , x2 , ..., xn , we can inductively define n-tuples (x1 , x2 , ..., xn ) by
(x1 , x2 , x3 ) := (x1 , (x2 , x3 ))
We call it the union over x. For example, (x, y) = {{x}, {x, y}} = {x, y}.
S S
So, by induction on n, we can for any sets x1 , x2 , ..., xn construct a set {x1 , x2 , ..., xn } whose
members are exactly {x1 , x2 , ..., xn }. For this,
[
{x1 , x2 , ..., xn } := {x1 }, {x2 , ..., xn } .b
Also, if x and y are sets, we denote by
[
x ∪ y := {x, y}
and call it the union of x and y.
Fact:
The operation ∪ is associative, i.e.,
x ∪ (y ∪ z) = (x ∪ y) ∪ z
so we denote (x1 ∪ (x2 ∪ ...(xn−1 ∪ xn )...)) unambiguously by x1 ∪ x2 ∪ ... ∪ xn .
aTN: Prof. Tserunyan explains this in a very intuitive way. Think of x as your car, and its members z are the
grocery bags, and the sets u are the actual groceries. The set y is the fridge where you empty all of your groceries
into. So, for any car full of groceries, there is a fridge where you’ve emptied all the grocery bags into.
bTN: Note that up to this point, there is no way to construct such a set. Repeated application of the pairing
axiom will not allow for pairing a doubleton and singleton to achieve a set of three elements. One needs to do this
and then apply the union axiom.
4 KRIVINE
CAUTION: One must be careful when understanding the power set axiom. For the
variable z only refers to objects in U and not subsets of x that happen not to be in U. In
fact, it is a basic idea in the construction of universes to make judicious choices of which
subsets of a set to include in U and which to leave out. So, in such a U, P(x) will only
consist of the subsets of x that are actually in U.
The language of set theory is the usual first-order language including the logical symbol = and the
extra-logical symbol ∈. Now, if φ(x, y, z) is a formula with at most three free variables x, y, z, and
possibly having parameters a1 , a2 , ..., an , we have a corresponding relation Rφ on U defined by
Rφ (a, b, c) ⇐⇒ φ holds of a, b, c in U.
For example, if φ is ∀z(z ∈ x −→ z ∈ y), then Rφ is the inclusion relation ⊆.
The relations so obtained are called class relations, and unary class relations are also just called
classes. Later on, we shall misuse notation and write (b1 , ..., bn ) ∈ Rφ to denote that φ(b1 , ..., bn )
holds even though Rφ is in general not a set.
Example:
φ(x) := ∀u (u ∈ x −→ ∃v(v ∈ x ∧ ∀t (t ∈ v ←→ t = u ∨ t ∈ u)))
defines the class of all sets x such that if u ∈ x, then also u ∪ {u} ∈ x.
Class Functions:
Suppose φ(x1 , x2 , ..., xn , y) is a formula. We say that φ defines a class function Rφ if the following
holds in U:
∀x1 ∀x2 ...∀xn ∀y∀y 0 (φ(x1 , ..., xn , y) ∧ φ(x1 , ..., xn , y 0 ) −→ y = y 0 ).
In this case, the formula ∃yφ(x1 , ..., xn , y) in variables x1 , ...xn defines the domain of Rφ while
∃x1 ∃x2 ...∃xn φ(x1 , ..., xn , y) defines the image of Rφ . To simplify notation, we shall often write
aTN: It will be noted that both Prof. Tserunyan and I believe that, should ZFC be inconsistent, the inconsistency
will be at the hands of this particular axiom (well, or maybe the set existence axiom, to come later). As we’ll see
later, the Axiom of Choice won’t introduce any inconsistencies when added to ZF, so there’s no need to fear choice.
SET THEORY AND FORCING 5
Russell’s Paradox:
Note that φ(x, y) given by "(x = y) ∧ (y ∈
/ y)" is a functional relation. So if we allowed the
unrestricted replacement axiom, then there would be a set u such that y ∈ u ←→ y 6∈ y.
But then u ∈ u −→ u 6∈ u and u 6∈ u −→ u ∈ u, i.e. u ∈ u ←→ u 6∈ u, which is a
contradiction. So, we must restrict the replacement axiom to a specific set z.a
TN: At this point, we have covered a lot of the axioms of ZFC. There still remain the axiom of
infinity, the axiom of foundation, and the axiom of choice. These will be introduced later as we
build up more machinery to be able to discuss them in proper context.
Exercise:
Given two sets a and b, show how to define (and prove the existence of) the sets
arb a∩b a × b = {(x, y)|x ∈ a ∧ y ∈ b}.
1.2. Functions.
Suppose ψ(x, y, ~a) is a formula defining a class function Rψ whose domain is not just a class, but
a set. I.e., there is a set z such that
∀x ∃y ψ(x, y, ~a) ←→ x ∈ z .
Then Rψ is itself a set, i.e., for some set u,
∀x∀y ψ(x, y, ~a) ←→ (x, y) ∈ u)
To see this, just let v be the image of z by Rψ . Then v is the whole image of Rψ and
u = {(x, y) ∈ z × v | Rψ (x, y)}
is a set.
We say then that u is a function from z to v and denote this by u : z → v. So functions are always
a set of pairs and hence are identified with their graphs.
Definition 1.6. A function f : a → b from a set a to a set b is a subset f ⊆ a × b satisfying
∀x (x ∈ a −→ ∃y ∈ b (x, y) ∈ f )
!
∀x∀y∀y 0 (x, y) ∈ f ∧ (x, y 0 ) ∈ f −→ (y = y 0 )
CAUTION:
When I = ∅, then X and hence depends on the choice of X.
T S
ai =
i∈I
8 KRIVINE
Observation:
Y ⊆ X is an initial segment if and only if Y = X or Y = δx (X) for some x ∈ X.
Proof.
If Y 6= X, just let x be the minimum element in X r Y .
2.1. Classes and Sets.
Recall that a class C is simply the collection of all x satisfying some formula φ(x, ~a) with parame-
ters. Note that we do not give classes any formal existence, in the sense that they do not belong to
U, and any statement about the class C is just a shorthand for a more complex statement involving
the formula φ(x, ~a).
On the other hand, suppose there is a set z containing all members in C, i.e.
∀x(φ(x, ~a) −→ x ∈ z)a
Then, by comprehension, we can identify C with the set
y = {x ∈ z | φ(x, ~a)}.
aTN: Note that z needs only to contain C. It may have other elements.
SET THEORY AND FORCING 9
Similarly, any set z can be identified with a class, namely the class given by the formula
φ(x) := (x ∈ z).
Slogan:
Classes are collections that sometimes are too large to be sets, while on the other
hand, all sets are classes.
Definition 2.2. A class C is a proper class when it is not a set, i.e., when there is no set whose
elements are exactly those belonging to C.
Examples:
• U is a proper class given by the formula "x = x"
• Russell’s class given by the formula "x 6∈ x" is also a proper class (TN: Assuming AF,
to come later, this class is U)
Notation:
If φ(x, ~a) is a formula with a single free variable x and parameters ~a, we let {x | φ(x, ~a)} denote
the, possibly proper, class defined by φ(x, ~a).
So, sets are the special classes given by expressions {x ∈ z | φ(x, ~a)} where z is another set.
2.2. Well-Orderings and Ordinals.
Definition 2.3. A class relation R defining a strict linear ordering of a class C is said to be a
well-ordering if for any x in C, the class initial segment δx (C) = {y | R(y, x)} is a set that is
well-ordered by R.
Definition 2.4. A set x is said to be transitive if ∀y (y ∈ x −→ y ⊆ x). I.e., z ∈ y ∈ x =⇒ z ∈ x.
Definition 2.5. An ordinal is a transitive set α that is well-ordered by the class relation ∈.
Lemma 2.6. Ordinals form a class relation called Orda, or, sometimes On.
Proof.
(
Ord = α | ∀y(y ∈ α −→ y ⊆ α) ∧
∀x∀y (x ∈ α) ∧ (y ∈ α) ∧ (x 6= y) −→ (x ∈ y) ∨ (y ∈ x) ∧
∀x(x ∈ α −→ x 6∈ x) ∧
)
∀x (x ⊆ α) ∧ (x 6= ∅) −→ ∃y ∈ x ∀z ∈ x (y ∈ z ∨ y = z) .
Example:
∅, {∅}, ∅, {∅} are ordinals.
Facts:
Let α be an ordinal. Then...
• ...the initial segments of α are α itself and the elements of α.
• ...so is any β ∈ α.
• α 6∈ α. (For if γ ∈ α, then γ 6∈ γ as otherwise α would not be well-ordered by ∈.)
Lemma 2.7. If α and β are ordinals, then either
α∈β α=β or α 3 β.
Proof.
Let γ = α ∩ β. Then γ is an initial segment of α, β. For if x ∈ γ and y ∈ x, then as α, β are
transitive, also y ∈ α ∩ β = γ.
So, as any initial segment of an ordinal is either the ordinal itself or an element of it, there are
four possible cases:
(1) γ = α = β
(2) γ = α and γ ∈ β, hence α ∈ β
(3) γ = β and γ ∈ α, hence β ∈ α
(4) γ ∈ α and γ ∈ β. But γ is an ordinal, and γ ∈ α ∩ β = γ, which is impossible.
Proposition 2.8. The class Ord is well-ordered by the class relation ∈.
Proof.
We know that ∈ linearly orders Ord, for if α ∈ β ∈ γ, then by transitivity of γ, α ∈ γ and
irreflexivity has been checked. Moreover, for any ordinal α,
δα (Ord) = {β | β is an ordinal and β ∈ α} = α
is a set well-ordered by ∈.
Lemma 2.9. Ord is a proper class.
Proof.
For if Ord was a set a, then a would itself be an ordinal and hence a ∈ a, which is impossible.
(To see that a is transitive, note that for α ∈ β ∈ a, since any element of an ordinal is an ordinal,
also α ∈ a).
Note:
For ordinals α, β, α ⊆ β ⇐⇒ α ∈ β or α = β.
Terminology: In the following, we will always consider the class Ord with the well-ordering ∈,
and we sometimes also write < instead. That is, for ordinals α, β:
α < β :⇐⇒ α ∈ β ⇐⇒ α ( β
So for any ordinal α, α = {γ | γ is an ordinal and γ < α}.
Lemma 2.11. If X is a set of ordinals, then sup X = X. I.e., X is an ordinal greater than or
S S
equal to all elements of X.
Proof.
S X = {x | ∃α ∈ X, x ∈ α} is a set ofSordinals and hence is well-ordered by ∈. To see that
a
S
X is transitive, note that if y ∈ x S ∈ X, then there is α ∈ X such that x ∈ α, hence, by
transitivity of α, also y ∈ α, i.e., yS∈ X.
Notice, if α ∈ X, then α ⊆ X, so α ≤ X. And S if β < X, i.e. β ∈ X, then
S S S
∃α ∈ X, β ∈ α, i.e. β is not an upper bound for X. So X = sup X.
Lemma 2.12. Suppose α, β are ordinals and f : α → β is a strictly increasing function, i.e. for
γ, ξ < α, γ < ξ =⇒ f (γ) < f (ξ). Then α ≤ β and γ ≤ f (γ) for all γ < α.
Proof.
Suppose towards a contradiction that there is some γ < α such that f (γ) < γ. Take the minimal
such γ.b Then, since f (γ) < γ and γ is minimal, f f (γ) ≥ f (γ), but, on the other hand, since
f is strictly increasing
f (γ) < γ =⇒ f f (γ) < f (γ),
which is a contradiction.
But then if β < α, β ≤ f (β) ∈ β, i.e. β ≤ f (β) < β, which is a contradiction again.
Theorem 2.13. Suppose f : α → β is a function which is an isomorphism of the ordered sets
(α, <) and (β, <). Then α = β and the isomorphism is unique, i.e. f = idα . In other words, the
structure (α, <) is rigid.
Proof.
α = β follows from the lemma applied to f and f −1 . Also, this gives us for any γ < α, γ ≤ f (γ)
and γ ≤ f −1 (γ), hence also f (γ) ≤ f (f −1 (γ)) = γ, i.e. γ = f (γ).
aTN: One may reasonably be concerned that this notation denotes unrestricted comprehension, as we do not
stipulate that the element x be in another set. However, recall that this notation is valid for classes, and it’s been
established that taking the union of a set is a valid set operation.
bTN: As a quick comment, the fact that ordinals are well-ordered becomes a phenomenally useful fact it proofs,
as then one can always take the minimal ordinal that satisfies some property. Keep this trick in your back pocket,
or front pocket, or, really, just always on hand.
12 KRIVINE
Theorem 2.14. For any well-ordered set (X, <X ), there is a unique isomorphism onto an ordinal
(α, <).
Proof.
Uniqueness: Follows from the theorem above.
Existence: Let Y = {x ∈ X | (δx , <X ) is isomorphic to an ordinal}a where δx := δx (X).
By the uniqueness part, for any x ∈ Y , there is a unique ordinal β(x) isomorphic to (δx , <X ).
Note that for x <X y and y ∈ Y , by the isomorphism of δy with β(y), δx ⊆ δy is set to an initial
segment of β(y), which is itself an ordinal β(x) < β(y). So x ∈ Y and Y is an initial segment of
X.
Now x 7−→ β(x) is a function defined on Y , so by the axiom of replacement, Z = {β(x) | x ∈
Y } is a set. Moreover, Z is an initial segment of the ordinal numbers. For if γ < β(x) for some
x ∈ Y , then the isomorphism from δx to β(x) takes some y ∈ δx to γ, hence δy is mapped onto
the set of ordinals below γ, i.e. γ itself. So γ = β(y).
So Z itself, being a set and an initial segment, is an ordinal Z = α and x ∈ Y 7−→ β(x) < α
is an isomorphism from Y to α. Now, if Y ( X, let x0 be minimal in X r Y b, so δx0 = Y ∼ = α,
contradicting that x0 6∈ Y .
There is also a class version of this theorem. Namely, if R(x, y) is a class relation that well-orders
a proper class C, then there is a class function F from C to Ord which is an isomorphism of
the orderings R and ∈. Just define F by F (x) = α ⇐⇒ there is an order-isomorphism between
(δx (C), R) and (α, ∈).
2.3. Inductive Definitions.
Suppose φ(x) is a formula (possibly with parameters). Then
φ(x) holds for every ordinal α
iff
(∗)
∀α ∀β β < α −→ φ(β) −→ φ(α)
For the nontrivial implication, if (∗) holds, but φ(α) is not true for all ordinals, then one gets a
contradiction by looking at the least α such that ¬φ(α).
Whereas proving ∀α φ(α) by proving (∗) constitutes a proof by induction on ordinals, we can also
give definitions by induction.
Suppose F is a class function of one variable and a is a subset of the domain of F , i.e., ∀x ∈
α ∃y, F (x) = y. Then we let F a denote the function obtained by restricting F to a, i.e. for
b = {F (x) | x ∈ a}, which is a set by replacement, Fa = {(x, y) ∈ a × b | F (x) = y}.
Now, suppose H is any class function of one variable. We say that a function f is H-inductive
if:
• α =dom(f ) is an ordinal, and
• ∀β < α, fβ is in the domain of H and f (β) = H(fβ ).
aTN: Perhaps it ought to be mentioned that these sets where English is used in the set notation are also well-
defined. One needs only to find first order formulas which describe the English appropriately, and this is a fairly
simple exercise for the reader should they not already be convinced of this truth.
bTN: I told you this would come up a lot.
SET THEORY AND FORCING 13
Lemma 2.15. For any class function H and ordinal α, there is at most one H-inductive function
f with dom(f ) = α.
Proof.
Suppose f : α → X and g : α → Y are distinct H-inductive functions and let β < α be
minimal such that f (β) 6= g(β). Then f β = gβ and so f (β) = H(f β ) = H(gβ ) = g(β), a
contradiction.
Lemma 2.16. Suppose H is a class function and α is an ordinal such that any function f : β → X
with domain β will belong to the domain of H. Then there is an H-inductive function g : α → Y .
Proof.
Let τ = {β < α | there is an H-inductive function fβ : β → X}. Then τ is a set and is seen to
be an initial segment of α, hence τ is an ordinal ≤ α. Moreover, by uniqueness, the assignment
β < τ 7−→ fβ is well-defined, and since for any γ < β, fβ γ is also H-inductive, by uniqueness,
we have
γ < β < τ =⇒ fβ γ = fγ .
Thus, f = fβ is an H-inductive function with domain σ = sup β =
S S
β.
β<τ β<τ β<τ
Assume towards a contradiction that σ < α. Then f˜ defined by f ˜ σ = f and f˜(σ) = H(f ) is an
H-inductive function with domain σ + 1 = σ ∪ {σ} 6∈ τ , which is impossible.
With only slight adjustments, it suffices that for some class A, H takes values in A and any function
f : β → X, where X is a subset of A, belongs to the domain of H.
Theorem 2.17. Suppose A is a class and M is the class of all functions f : α → X with domain
an ordinal and range X a subset of A. Suppose H is a class function of one variable defined on all
of M and with values in A. Then there is a unique class function F , i.e., given by a formula of set
theory, such that
• F is defined on Ord
• ∀α F (α) = H(Fα )
Proof.
The class function F is defined by
y = F (α) ⇐⇒ there is an H-inductive function f : α → X, X ⊆ A, and y = H(f ).
2.4. Stratified or Ranked Classes.
A class W is said to be stratified or ranked if there is a class function ρ with domain W and taking
values in Ord such that for any ordinal α, the following class is a set.
Wα = {x | W (x) ∧ ρ(x) < α}
14 KRIVINE
Theorem 2.18. Suppose W is a stratified class with corresponding stratification ρ. Also let M
be the class of all functions with domain Wα for some α and H(α, f ) = y a class function with
domain W × M . Then there is a unique class function F with domain W such that for any a in
W : F (a) = H(a, FWρ(a) ).
Note:
For a in W , a ∈ Wρ(a)+1 r Wρ(a) .
Proof.
Set Wα0 = {x | W (x) ∧ ρ(x) = α} and note that Wα0 is a set and Wα = Wβ0 . By induction on
S
β<α
Ord, i.e., by applying the preceding theorem, we find a class function G defined onhordinals suchi
that for every α, G(α) = “the function φ with domain Wα0 such that φ(x) := H x,
S
G(β)
β<α
for all x ∈ Wα0 ”.
(Note that G(α) is written as a function of Gα , so the theorem applies.)
Uniqueness is an exercise.
(8) Axiom of Choice or (AC)
For any set X and A ⊆ P(X) set of pairwise disjoint non-empty subsets of X, there is a
set T ⊆ X such that ∀a ∈ A, T ∩ a contains exactly one element.
Proposition 2.19. AC⇐⇒AC ⇐⇒AC (given the background theory of axioms (1)–(7))a
0 00
aTN: The proof of this is omitted from the original notes, and I do not provide it here, as this would make for a
good exercise. As a further note, there are many other statements to come which are also equivalent to the axiom of
choice. The reader is encouraged to attempt equivalence proofs of these later statements.
SET THEORY AND FORCING 15
(?) H is defined on the class of H-inductive functions. Also, H-inductive functions are injective.
For if f : α → X is H-inductive, then for any β < α, f (β) = H(fβ ) ∈ XrIm(fβ ), hence
for any γ < β, f (γ) 6= f (β). It thus follows that f is injective. If also f were surjective,
then this would induce a well-ordering of X, contradicting our assumption.
By (?), we know that there is an H-inductive class function F : Ord → X, which is injective by
(?). But then since Im(f ) is a set (TN: Why?), F −1 : Im(F ) → Ord is a function from a set
onto Ord, which is impossible.
Note:
Any well-ordered set admits a choice function.
Theorem 2.21 (Zorn’s Lemma). Suppose (X, 6) is a partially ordered set (or poset) all of whose
linearly ordered subsets admits an upper bound. Then (X, 6) has a maximal element, i.e. there is
y ∈ X such that ∀x ∈ X, y 6< x.
Proof.
Let A = {Y ⊆ X | ∃x ∈ X, ∀y ∈ Y, y < x} and let π : P(X) r {∅} → X be a choice function.
Define p : A → X by p(Y ) = π {x ∈ X | ∀y ∈ Y, y < x} .
(†) Any H-inductive function f : α → X is strictly increasing, i.e., β < γ < α =⇒ f (β) <
f (γ).
It follows from (†) that the image of any H-inductive function f : α → X is linearly ordered, and
hence has an upper bound xf ∈ X, i.e. ∀β < α, f (β) < xf . Now, if f : α → X is H-inductive,
but H is not defined on f , then Im(f ) has no strict majorant, hence f (β) = xf for some β < α
and xf thus is the maximum in Im(f ) and a maximal element of X.
If, on the other hand, H is defined on the class of H-inductive functions, then we can find an
H-inductive class function F : Ord → X, which by (†) is strictly increasing. But then F would
define an injection of a proper class into the set X, which is impossible.
16 KRIVINE
TN: Before moving onto ordinal arithmetic, it should be noted that Zorn’s Lemma, Zermelo’s
Theorem, and the three statements for choice are all equivalent. The reader should verify this if
they have not already done so. Further questions and observations include wondering what is yellow
and equivalent to the axiom of choice,b and noting that Jesus was in fact pro-choice.c
2.5. Ordinal Arithmetic. Recall that if α is an ordinal, the successor of α, i.e. the smallest
ordinal strictly larger than α, is α + 1 = α ∪ {α}.
Note that if (X, <X ) and (Y, <Y ) are well-ordered sets, then we can define well-orderings on the
sets
(X × {0}) ∪ (Y × {1}) and X ×Y
by
i = j = 0 and a <X b, or
For all we know till now, multiplication could be commutative, but this fails after the next axiom.
Note that the natural numbers form an initial segment of the ordinals. So, if we let ω
denote the smallest infinite ordinal, we see that ω is a limit ordinal.
Remark:
ω · 2 = ω + ω 6= 2 · ω = ω
ω + 2 6= ω = 2 + ω
The proof of uniqueness and existence of f follows along the same lines as the general proof for
inductive definitions (or can be deduced from it directly).
TN: Even further, the statement “every set X has a cardinality” is equivalent to (AC). To see
this, we actually prove the equivalence to Zermelo’s theorem, which is equivalent to AC. Note that
if every set can be well-ordered, then there is a bijection from a set to a well-ordered set, and as
seen previously, an isomorphism from a well-ordered set to an ordinal, and then of course, from an
ordinal to its cardinality. On the other hand, if an arbitrary set X has a cardinality, then there is
a bijection from it to an ordinal, and this induces a well-ordering on X.
A common joke is that Zermelo’s theorem is clearly false, the axiom of choice is clearly true, and
Zorn’s lemma is too difficult to understand to say one way or another. I find this equivalence of
Zermelo’s theorem to the fact that every set has a cardinality very compelling in convincing one of
the truth of Zermelo’s theorem.
Theorem 2.29 (Cantor-Schröder-Bernstein). X and Y are equinumerous if and only if X injects
into Y and vice versa.
Proof.
This is trivial given (AC). But it is possible to give a proof without. (TN: The reader is
encouraged to try this for themselves. A back-and-forth technique should prove useful.)
Theorem 2.30 (Cantor). For any set X, |X| < |P(X)|.
Proof.
Suppose for contradiction that there is a surjection π : X P(X) and define Y := {x ∈ X | x 6∈
π(x)}. Then if Y = π(y), we have
y ∈ π(y) ⇐⇒ y ∈ Y ⇐⇒ y 6∈ π(y)
which is impossible.
Definition 2.31. An ordinal number κ is said to be a cardinal if |κ| = κ.
aTN: It’s not immediately obvious why the cardinality of a set should always exist. That is, why must there exist
any bijection to an ordinal for every set? This follows from Zermelo’s Theorem, which is equivalent to AC, which
we’ve included as an axiom.
SET THEORY AND FORCING 19
For example, ℵ0 = ω and ℵα+1 is the smallest cardinal number larger than ℵα .
Since the cardinal numbers are well-ordered and unbounded, for any cardinal number κ there is a
smallest cardinal greater than κ, which we denote κ+ . So, for example, n+ = n + 1 for n finite, and
α = ℵα+1 .
ℵ+
Note:
For any ordinal γ, we have |γ| ≤ γ < |γ|+ . For if |γ|+ ≤ γ, then |γ|+ ⊆ γ and thus |γ|+ would
inject into the smaller cardinal |γ|, which is impossible.
Definition 2.35. Cardinals of the form κ+ are called successors, while non-zero non-successors are
called limit cardinals.
Proposition 2.36. The function ℵ is continuous with respect to the order-topology, that is, for
any limit ordinal λ,
ℵλ = sup ℵξ
ξ<λ
Proof.
Set γ = sup ℵξ . Then |γ| ≤ γ < |γ|+ . Now, if γ < ℵλ , then there is some ξ0 < λ such that
ξ<λ
ℵξ0 = |γ|, hence
sup ℵξ = γ < |γ|+ = ℵξ0 +1 ≤ sup ℵξ
ξ<λ ξ<λ
which is impossible. So
sup ℵξ ≤ ℵλ ≤ sup ℵξ .
ξ<λ ξ<λ
aTN: Cofinality is defined a bit later, but all you need to know is essentially that if you pick some ordinal, there’s
a cardinal above it.
20 KRIVINE
Cardinal Arithmetic.
Definition 2.38. For cardinal numbers κ and λ, we set
κ ⊗ λ := |κ × λ|
κ ⊕ λ := κ × {0} ∪ λ × {1}
Note that both cardinal multiplication and addition are commutative and associative.
aTN: This statement is well-known to be independent of the axioms of ZFC (meaning there are models in which
it is true, and models in which it is false). We shall prove this by the end of these notes.
bTN: This is not quite equivalent to choice, but the statement that for every infinite set A, that A × A and A
are equinumerous is equivalent to choice.
cTN: This is not equivalent to AC. Try a Cantor-Schröder-Bernstein argument.
dTN: Yes, this is ambiguous with ordinal exponentiation. In general, this tends to be the more common usage
of this notation, and if you mean ordinal exponentiation, you should specify. Well, in general, you should always
specify, but c’est la vie.
SET THEORY AND FORCING 21
Note:
cof(β) is always a cardinal.
Observation:
There is a strictly increasing cofinal map h : cof(β) → β.
Proof.
If f : cof(β) → β is cofinal, define h(ξ) := max f (ξ), sup h(γ) + 1 .
γ<ξ
Proposition 2.44. If α and β are limit ordinals and f : α → β is a strictly increasing cofinal map,
then cof(α) = cof(β).
Proof.
The fact that cof(β) ≤ cof(α) is clear. Conversely, if g : cof(β) → β is a cofinal map, then we
can define h : cof(β) → α by
h(ξ) = min γ < α | f (γ) > g(ξ) .
Then h is cofinal in α.
Proof.
Fix a cofinal map f : λ → κ and consider any function G : κ → κλ . We have to show that G is
not surjective. We think of κλ as the set of functions from λ to κ. So define h : λ → κ by
h(ξ) = min(κ r G(α)(ξ) | α ≤ f (ξ) )
Note then that if h = G(α) for some α < κ, pick ξ < λ such that f (ξ) ≥ α. Then h(ξ) 6= G(α)(ξ),
which is a contradiction.
SET THEORY AND FORCING 23
Theorem 2.52 (König). Let I be a set and (Ai )i∈I , (Bi )i∈I be sequences of sets. If, for all i ∈ I,
we have that |Ai | < |Bi |, then
G Y
Ai < Bi .
i∈I i∈I
Proof.
First, use (AC) more than once to define an injection i∈I Ai ,→F i∈I Bi . Q
F Q
Suppose towards a contradiction that there is a surjection g : i∈I Ai i∈I Bi , and then
define x ∈ i∈I Bi that is not in the image of g by choosing the value x(i) such that it precludes
Q
x from being in the g-image of {i} × Ai . The main point is that for each i ∈ I, the map
gi : Ai → Bi defined by a 7−→ g(i, a)(i) is not surjective, so one can choose a value from
Bi r gi [Ai ] as x(i) (where gi [Ai ] denotes the image of Ai under the map gi ).
2.7. Foundation.
(10) Axiom of Foundation or (AF)
The axiom of foundation states that ∈ admits no infinite descending chains, or, equivalently
∀x (x 6= ∅) −→ ∃y ∈ x ∀z ∈ y, z 6∈ x
Note that, if (un )n∈ω were an infinite sequence, i.e., formally a function with domain ω,
such that un+1 ∈ un for every n ∈ ω, then the set x = {un }n∈ω would contradict (AF).
Note:
(AF) =⇒ ∀x, x 6∈ x.
Without assuming the axiom of foundation, we can define a class function V : Ord → U by
transfinite induction: [
Vβ := P(Vα ).
α<β
aTN: Here I am explicitly following the notation and proof sketch given during a homework exercise in the class
I took with Professor Tserunyan.
24 KRIVINE
(⇒) : Now suppose that X does not belong to V , but that AF holds. Then X ⊆ cl(X) and we
claim that Y = y ∈ cl(X) | ¬V (y) is non-empty. For if not, then cl(X) and hence also X
would be a subset of V and thus belong to V itself, which is not the case.
On the other hand, if y ∈ Y , then ¬V (y) and hence y cannot be a subset of V . Thus, for some
z ∈ y, ¬V (z). Since cl(X) is transitive, also z ∈ cl(x) and thus z ∈ Y too. It follows that Y is
a counterexample to AF.
SET THEORY AND FORCING 25
S
Proposition 2.58. cl(X) = X ∪ y∈X cl(y).
Proof.
X ⊆ cl(X) and the latter is transitive, so y ⊆ cl(x) for all y ∈ X. Since cl(y)
S is the minimal
transitive set containing y, also cl(y) ⊆ cl(x) for any y ∈ X. So cl(X) ⊇ X ∪ y∈X cl(y).
For the other direction, it suffices to note that the right hand side is transitive.
Definition 2.59. The theory ZF− consists of all axioms given previously except for AC and AF.
Moreover,
ZF = ZF− + AF
ZFC = ZF + AC = ZF− + AF + AC
ZFC− = ZF− + AC.
Definition 2.60. A set X is extensional if
∀x, y ∈ X (x ∩ X = y ∩ X) −→ x = y
That is, (X, ∈) |= axiom of extensionality.
Note that every transitive set is extensional simply because the axiom of extensionality holds.
Theorem 2.61 (The Mostowski Collapse (ZF)). For any extensional set X, there is a unique
isomorphism π : (X, ∈) → (Y, ∈) onto a transitive set Y .
Proof.
Uniqueness: Follows by induction on the rank of elements of X.
Existence: We define π(x) by induction on the rank of x ∈ X.
π(x) = π(a) : a ∈ x ∩ X
(Formally, this is induction on a stratified class).
That is, note that by extensionality of X, there is a unique element a0 ∈ X of minimal rank.
Set π(a0 ) = ∅. Now, if x ∈ X and π(a) has been defined for all a ∈ X with rk(a) < rk(x), we
set π(x) = π(a) : a ∈ x ∩ X .
By induction on the rank, we see that π is injective and also that π[X] = Y is a transitive
set. Finally, π is an isomorphism; for if x, y ∈ X and x ∈ y, then x ∈ y ∩ X and so
π(x) ∈ π(a) : a ∈ y ∩ X = π(y);
and, conversely, if x, y ∈ X and π(x) ∈ π(y), then π(x) = π(a) for some a ∈ y ∩ X, so by
injectivity, x = a ∈ y.
26 KRIVINE
3. Relativization
Suppose C is a class. We define for every formula φ(~x, ~a) in parameters ~a = (a1 , ...an ), ai belonging
to C, the relativized formula φC (~x, ~a) by induction on the construction of φ:
• if φ is quantifier-free, then φC = φ
• (¬φ)C = ¬φC , (φ ∨ C C
ψ) C = φ ∨ ψ
C
C
• (∃yφ) = ∃y C(y) ∧ φ
• (∀yφ)C = ∀y C(y) −→ φC
So φC simply relativizes all quantifications to C. So for any φ(~x) without parameters and any
parameters ~a in C, we have
(U, ∈) |= φC (~a) ⇐⇒ (C, ∈) |= φ(~a)
3.1. Consistency of the Axiom of Foundation.
Something that should be on our mind is the consistency of the axioms we have stated so far. So
proofs of relative consistency are highly important. It follows from Gödel’s Second Incompleteness
Theorem that we cannot prove the consistency of ZFC from ZFC itself, so the only results we can
hope for are relative consistency results, such as
Theorem (3.1*). Suppose U is a universe of sets satisfying the theory ZF− . Then the class V
constructed in U will be a universe of sets in which ZF holds.
Proof.
Recall that ZF− is axiomatized bya
(i) the axiom of extensionality
(ii) the union axiom
(iii) the powerset axiom
(iv) the axiom scheme of replacement
(v) set existence
(vi) the axiom of infinity
We thus S have to prove that if U is a buniverse of sets and V is the class of sets defined by
V = Vα , then (V, ∈) |= (i), ..., (vi). I.e., for any axiom φ among (i)-(vi), φV holds.
α∈Ord
(i): Suppose x, y belong to V and that for any z in V , z ∈ x ←→ z ∈ y. Then as x, y are subsets
of V , we have ∀z (z ∈ x ←→ z ∈ y), so since extensionality holds in U, we have x = y and
thus (x = y)V too.
(ii): Suppose x belongs to V . Then x = z : ∃y ∈ x, z ∈ x is a subset of V and hence itself
S
belongs to V .
aTN: And recall that pairing and comprehension follow from these as well.
bTN: This notation is slightly improper, as replacement actually has infinitely many axioms, but the idea is clear.
Note that it’s obviously not enough just to show that AF holds. We still want V to be a model of ZF− !
SET THEORY AND FORCING 27
(iii): If X belongs to V , then so does P(x); for any subset of x will be a subset of V and thus
belong to V , hence P(x) is a subset of V and thus an element of V .
(iv): Suppose φ(x, y) is a formula with parameters in V that defines a class function in V , i.e.
V
∀x∃≤1 y φ(x, y)
or more explicitly
∀x V (x) −→ ∃≤1 y V (y) ∧ φV (x, y) .
Then the formula ψ(x, y) := V (x) ∧ V (y) ∧ φV (x, y) defines a functional relation in U, and
thus, for any set A, there is (by replacement in U) a set B such that
y ∈ B ⇐⇒ ∃x ∈ A, ψ(x, y) ⇐⇒ ∃x ∈ A, φV (x, y) ∧ V (y)
Lemma 3.2. If κ is a strongly inaccessible cardinal, then |Vκ | = κ and for any a ⊆ Vκ , we have
a ∈ Vκ ⇐⇒ |a| < κ.
Proof.
Since κ ⊆ Vκ , notice |Vκ | ≥ κ. Conversely, by induction we show that for any ξ < κ, |Vξ | < κ,
which implies that |Vκ | = κ. For the induction, note that if |Vξ | < κ, then |Vξ+1 | = |P(Vξ )| =
2|Vξ | < κ. And if |Vξ | < κ for all ξ < λ < κ, λ limit, then |Vλ | = | ξ<λ Vξ | ≤ sup |Vξ | < κ by
S
ξ<λ
regularity of κ.
Thus, if a ⊆ Vκ , |a| < κ, then rk : a → κ cannot be cofinal in κ since κ is regular. So for some
β < κ, a ⊆ Vβ , hence a ∈ Vβ+1 ⊆ Vκ .
aTN: If you’ve already done the exercise to show that for any ordinal α, rk α = α + 1, then you’re already done,
as this implies that for any ordinal α, α ∈ Vα+1 and hence belongs to V .
28 KRIVINE
Lemma 3.7. Suppose φ(~x) is a formula without parameters in prenex-form and that (Xu )u∈ω is
an increasing sequence of sets. If φ and allSits subformulas are absolute for every Xn , then φ and
all of its subformulas are absolute for X = n∈ω Xn .
30 KRIVINE
Proof.
The result is proved by induction on the length of the prefix of φ.
If φ is quantifier-free, then φ is absolute for any class or set, so the result is trivial.
Suppose now that the result is true for ψ(y, x1 , ..., xn ) and let φ(~x) = ∃y ψ(y, ~x). Then for
any ~c ∈ X, choose k < ω such that ~c ∈ Xk . Now, if φ(~c) holds, then since φ is absolute for Xk ,
also φXk (~c) holds, so for some b ∈ Xk , ψ Xk (b, ~c) holds. As ψ is absolute for Xk , we get that
ψ(b, ~c), and as ψ is absolute for X, we get that ψ X (b, ~c). Thus, finally, ∃b ∈ X ψ X (b, ~c), i.e.
φX (~c).
Conversely, if φX (~c), then for some b ∈ X, ψ X (b, ~c). Since ψ is absolute for X, also ψ(b, ~c)
and so ∃yψ(y, ~c), i.e. φ(~c).
Universal quantification is proved similarly.
Theorem 3.8 (The Reflection Scheme (ZF)). Suppose φ(~x) is a formula without parameters. Then
for every α, there is a limit ordinal β > α such that φ is absolute for Vβ .
Proof.
Without loss of generality, we can suppose that φ is in prenex-form. We show by induction on
the length of the quantifier-prefix of φ that:
∀α ∃β > α a limit (every subformula of φ is absolute for Vβ ).
The base case when φ is quantifier-free is trivial, since φ is absolute for Vα+ω .
Now, suppose that the induction hypothesis holds for ψ(y, ~x) and let φ(~x) = ∃yψ(y, ~x). Then,
by the induction hypothesis, for any α there is β > α limit such that ψ and all its subformulas
are absolute for Vβ . Fix α.
We define a class function F (~x) = z by "z = F (~x) is the set of all y of minimal rank such
that ψ(y, ~x)." Thus, ~x belongs to the domain of F if and only if ∃y ψ(y, ~x) ∧ y ∈ F (~x) .
We now define a strictly increasing sequence of odrinals (βn )n∈ω as follows:
• β0 = α.
• β2n+1 = smallest ordinal > β2n such that F (~c) ∈ Vβ2n+1 for every tuple ~c = (c1 , ..., ck ) in
the domain of F with c1 , ..., ck ∈ Vβ2n .
• β2n+2 = smallest ordinal > β2n+1 such that ψ and all its subformulas are absolute for
Vβ2n+2 .
Now set β = sup βn , which is a limit ordinal > α. Also, since Vβ = n<ω Vβ2n+2 , the previous
S
n<ω
lemma implies that ψ and all its subformulas are absolute for Vβ . To finish the proof of the
induction step, it thus suffices to prove that also φ is absolute for Vβ .
We fix c1 , ..., ck ∈ Vβ , say c1 , ..., ck ∈ Vβ2u+1 . First, if φVβ (~c), then there is b ∈ Vβ such that
ψ (b, ~c). Since ψ is absolute for Vβ , also ψ(b, ~c), hence ∃yψ(y, ~c), i.e. φ(~c).
Vβ
Conversely, if φ(~c), then there is some b of minimal rank such that ψ(b, ~c), hence b ∈ F (~c). It
follows that F (~c) ∈ Vβ2u+2 , and so also b ∈ F (~c) ⊆ Vβ2u+2 ⊆ Vβ . Thus, as ψ is absolute for Vβ ,
we have ψ Vβ (b, ~c) and hence ∃y ∈ Vβ , ψ Vβ (y, ~c), i.e. φVβ (~c).
The case of universal quantifiers is similar. Alternatively, by using ∀ = ¬∃¬, one can reduce
it to existential quantifiers.
Corollary 3.9. For any true sentence σ without parameters, there are arbitrarily large limit
ordinals β such that σ Vβ holds.
SET THEORY AND FORCING 31
Using the preceding arguments, one can prove a more general statement:
4. Formalizing Logic in U
Our universe of sets U should be a place for all mathematics to be done. That is, all groups,
manifolds, function spaces, etc. can be constructed as elements of U and all reasoning about these
objects should ultimately hark back to an underlying reasoning based on ZFC. Thus, in many
ways, the set theoretical language is our machine language, while concepts such as fiber-bundles,
C ∞ -maps, solution spaces of partial differential equations are special kinds of sets defined by more
or less involved definitions upon definitions.
As all other mathematical topics, logic also admits a formalization in U, in such a way that
formulas, proofs, and models simply are objects within U. We shall give a cursory treatment of
this.
Finally, F = Fn . Elements of F0 are called atomic U-formulas, while the elements of F are
S
n<ω
simply U-formulas. For f ∈ F, l(f ) = length(f ) = minimal n < ω such that f ∈ Fn .
Lemma 4.2 (Unique Readabilityb). For any U-formula f ∈ F, exactly one of the following holds:
(i) f is an atomic U-formula
(ii) f = ( ¬˙ , g) for some unique g ∈ F
(iii) f = ( ∨˙ , g, h) for some unique g, h ∈ F
(iv) f = ( ∃˙ , x, g) for some unique g ∈ F, x ∈ V.
Moreover, in each of these cases, l(g), l(h) < l(f ).
Notation:
For simplicity of notation, we shall write
˙ y), ( ¬˙ f ), (f ∨˙ g), ∃˙ x(f )
(x ∈˙ y), (x =
for the U-formulas
˙ , x, y), ( ¬˙ , f ), ( ∨˙ , f, g), ( ∃˙ , x, f ).
( ∈˙ , x, y), ( =
Similarly, the U-formulas
¬˙ ∃˙ x( ¬˙ f )
( ¬˙ f ) ∨˙ g , ¬˙ ( ¬˙ f ) ∨˙ ( ¬˙ g) ,
are written
∀˙ x(f )
˙ g), (f ∧˙ g),
(f −→
aTN: In class, we struggled to come up with a way to represent the same logical symbols we know and love, while
making sure that there is little confusion about which are meta-logical symbols, and which are logical symbols that
are really sets in U . We settled on the convention of dotting the logical symbols inside our universe, language, etc.
bTN: Professor Tserunyan finds the inclusion of such a lemma to be useless. The truth of this lemma ought to
be painfully obvious from the construction.
SET THEORY AND FORCING 33
By induction on l(f ), we define for any f ∈ F the set var(f ) of free variables in f by
– if f is (x ∈˙ y) or (x =
˙ y), then var(f ) = {x, y}
– var( ¬˙ f ) = var(f )
– var(f ∨˙ g) = var(f ) ∪ var(g)
– var( ∃˙ x(f )) = var(f ) r {x}.
Also f ∈ F is said to be a U-sentence if var(f ) = ∅.
Note:
For any formula φ(~x) of set theory, there is a corresponding U-formula f which we will denote
by pφq. Thus, while φ is an object of our metalanguage, pφq is a set belonging to our universe U.
So, for example, it makes sense to quantify over U-formulas in the language of set theory, which
is not the case for true formulas of the metalanguage.
Also, note that if our universe U contains non-standard natural numbers, then there may be
non-standard U-formulas, i.e. U-formulas f not of the form pφq for some φ of the language of set
theory.
4.1. Model Theory for U-formulas. By induction on the length of f ∈ F, we define for every
non-empty set X, a set V al(f, X) by:
(i) V al (x ∈˙ y), X = δ ∈ X {x,y} δ(x) ∈ δ(y) a
Note:
For any formula φ(x1 , ..., xn ) of our metalanguage, we have (modulo changing variables)
V al(pφq, X) = δ : {x1 , ..., xn } → X φX (δx1 , ..., δxn ) holds .
Suppose f is a U-formula with free variables among x1 , ..., xn , written f (x1 , ..., xn ). Assume also
that γ is a function from a subset of var(f ) into a set X. Then we say that (f, γ) is a U-formula
with parameters in X.
For simplicity of notation, if f (x1 , ..., xn , y1 , ..., yk ) is given with x1 , ..., xn ∈ var(f ) and γ :
{x1 , ..., xn } → X with γ(xi ) = ai , we write f (a1 , ..., an , y1 , ..., yk ) or just f (~a, ~y ) for (f, γ). In this
case, var(f (~a, ~y )) = var(f ) r {x1 , ..., xn }.
A U-formula f (possibly with parameters) is said to be a U-sentence if var(f ) = ∅. Also,
V al (f, γ), X = δ ∈ X var(f,γ) | δ ∪ γ ∈ V al(f, X)
Theorem 4.3 (Löwenheim-Skolem (AC)). Suppose P ⊆ X are sets. Then there is a subset Y ⊆ X
containing P , |Y | ≤ |P | ⊕ ℵ0 , such that for any U-sentence f with parameters in Y ,
X |= f ⇐⇒ Y |= f.
Proof.
Fix a choice function π : P(X) r {∅} → X, i.e. such that π(A) ∈ A for any non-empty subset
A of X. We define inductively an increasing sequence (Pn )n<ω of subsets of X as follows:
– P0 = P
– Given Pn , let Rn = g(~a, x) | g(~a, x) is a U-formula with parameters ~a in X and X |=
∃˙ x g(~a, x)
– For any g(~a, x) ∈ Rn , let bg(~a,x) = π {b ∈ X X |= g(~a, b)}
Theorem 4.4. Suppose X ⊆ Y are sets with X ∈ Y and f (~a, ~x) is a U-formula with parameters
in X. Then
V al(f (~a, ~x), X) = V al(f X (~a, ~x), Y ) ∩ X var(f (~a,~x)) .
SET THEORY AND FORCING 35
Definition 5.1 (ZF). Let OD be the class of ordinal definable sets given by:
there are ordinals α1 , ..., αk , k < ω and β and a U-formula
OD(a) ⇐⇒ f (x, α1 , ..., αk ) with one free variable such that α1 , ..., αk < β,
a ∈ Vβ and V al(f (x, α1 , ..., αk ), Vβ ) = {a}.
Observation:
Note that every ordinal is ordinal definable by using itself as a parameter.
Proposition 5.2. Suppose φ(x, α1 , ..., αn ) is a formula of the language of set theory with ordinal
parameters α1 , ..., αn and suppose that a is the unique set satisfying φ(x, α1 , ..., αn ). Then a is
ordinal definable, i.e. belongs to OD.
Proof.
By the reflection scheme, we can find an ordinal β satisfying
– α1 , ..., αn < β
– a ∈ Vβ
– φ(x, y1 , ..., yn ) is absolute for Vβ .
In particular, a is the unique element of Vβ satisfying φVβ (x, α1 , ..., αn ) and hence
V al(pφ(x, α1 , ..., αn )q, Vβ ) = {a}.
So a is ordinal definable.
Proposition 5.3. There is a formula ψ(x, y) of the language of set theory such that for any set a,
⇐⇒ ∃γ ordinal ∀x ψ(x, γ) ←→ x = a
OD(a)
⇐⇒ ∃γ ordinal ψ(a, γ).
Thus, this formula ψ provides a uniform characterization of ordinal definability.
Proof.
Let S be the class of all ordinal valued functions s : n → Ord, with dom(s) = n a finite ordinal.
That is, S is the class of finite sequences of ordinals. For s, t in S, we put
sup(s) < sup(t) or
Then one can check that <0 defines a well-ordering of S whose proper initial segments are sets.
It follows that there is an order preserving class function B : Ord → S.a
aTN: The character used for this function is a backwards cyrillic B. The handwritten notes were a bit hard to
distinguish here, leading some in the class to speculate as to which character was used, and this was the funniest
answer, and was kept.
36 KRIVINE
Then, we have
⇐⇒ ∃γ ordinal ∀x ψ(x, γ) ←→ x = a
OD(a)
⇐⇒ ∃γ ordinal ψ(a, γ).
Definition 5.4. The class HOD of hereditarily ordinal definable sets is given by
HOD(a) ⇐⇒ OD(a) ∧ cl(a) is a subset of OD.
Lemma 5.5. HOD(a) ⇐⇒ OD(a) ∧ ∀x ∈ a, HOD(x).
This follows from the fact that [
cl(a) = a ∪ cl(x).
x∈a
Union: Note that if HOD(a) and b = x, then b is a subset of HOD. So to see that b
S
x∈a
belongs to HOD, we only need to see that b is ordinal definable. So pick α such that ψ(a, α)
holds. Then b is the unique object x satisfying
φ(x, a) := ∀y y ∈ x ←→ ∃z (z ∈ a ∧ y ∈ z) .
Hence, the formula ∃v ψ(v, α) ∧ φ(x, v) defines b. Thus, OD(b) and HOD(b).
Powerset: Suppose HOD(a) and let b = P(a) ∩ HOD be the set of all hereditarily ordinal
definable subsets of a. Then b is a subset of HOD and can be seen to be ordinal definable by
methods as above. Thus b = P HOD (a).
Replacement: Suppose σ(x, y, a1 , ..., ak ) is a formula with parameters a1 , ..., ak from HOD that
defines a class function in HOD, i.e.
∀x HOD(x) −→ ∃≤1 y (HOD(y) ∧ σ HOD (x, y, ~a) .
Suppose X is in HOD and Y is the set of images in HOD of elements of X by this class function.
Notice, Y is a subset of HOD, so we need only show that Y is in OD.
So fix ordinals β, α1 , ..., αk such that ψ(X, β), ψ(a1 , α1 ), ..., ψ(ak , αk ). Then Y is the unique
object satisfying
ψ(x, X, a1 , ..., ak ) := ∀z z ∈ x ←→ ∃u u ∈ X ∧ HOD(z) ∧ σ HOD (u, z, ~a) .
Hence, Y is defined by
∃v∃y1 ...∃yk ψ(v, β) ∧ ψ(y, α1 ∧ ... ∧ ψ(yk , αk ) ∧ φ(x, v, y1 , ..., yk ) .
Infinity: Since HOD(ω), HOD satisfies the axiom of infinity.
That is, we well-order elements of HOD according to the minimal ordinal defining them via ψ.
So, if X 6= ∅ belongs to HOD, then
R = (a, b) ∈ X 2 | a <0 b
Proposition 5.8. U satisfies the principle of choice if and only if ∀x OD(x) holds.
Proof.
Note that in U, ≺ defines a well-ordering of OD. Thus, if "∀x OD(x)", i.e. U = OD, then ≺ is a
well-ordering of U.
Conversely, suppose φ(x, y) is a formula without parameters defining a well-ordering of U.
Then there is a unique class function F : Ord → U such that α < β ⇐⇒ φ(F (α), F (β)). It
follows that for every α, F (α) is ordinal definable with parameters α, and hence, ∀x OD(x).
Thus: principle of choice ⇐⇒ U = OD ⇐⇒ U = HOD.
SET THEORY AND FORCING 39
Again, this is a first order property in X and A, so we can define the set
D(A) = X ⊆ A | X is definable with parameters in A .
Example:
Suppose φ(x, y1 , ..., yn ) is a formula in the language of set theory and a1 , ..., an ∈ A are such that
X = {x ∈ A | A |= φ(x, ~a)}. Then
X = V al pφ(x, ~a)q, A ∈ D(A).
Remark: (ZFC)
Suppose |A| ≥ ℵ0 . Then |D(A)| = |A|.
To see this, note that the set of U-formulas with parameters in A has size |A| itself, and thus
also |D(A)| = |A|. In particular, for infinite A, |D(A)| < |P(A)|.
Also, A ∈ D(A) for any A. But if A ⊆ B is not definable, then A 6∈ D(B) and so D(A) 6⊆ D(B).
Nevertheless, we have the following theorem.
so X ∈ D(B).
By transfinite induction, we now define a hierarchy of sets by
[
Lα := D(Lβ ).
β<α
So L0 = ∅ and Lβ ⊆ Lα for β < α. Also, for β < α, Lβ ∈ D(Lβ ) ⊆ Lα , and hence, by the
preceding theorem, D(Lβ ) ⊆ D(Lα ). It follows that the hierarchy can alternatively be described by
[
L0 = ∅ , Lα+1 = D(Lα ) , Lλ = Lξ for λ limit.
ξ<λ
Moreover, since D(A) ⊆ P(A) for every set A, we see by induction on α that Lα ⊆ Vα .
Let L be the class of constructible sets defined by L := Lα . So L ⊆ V .
S
α∈Ord
Definition 6.3. The axiom of constructibility is the statement U = L, which, assuming AF, is just
V = L, i.e. ∀x ∃α, x ∈ Lα .
40 KRIVINE
Lemma 6.4. L is a transitive class, that is, if A is constructible, then so is every element of A.
Also, if A ∈ Lα , then A ⊆ Lβ for some β < α.
Proof.
Note that if A ∈ Lξ+1 = D(Lξ ), then A ⊆ Lξ .
By the proof, we see that any Lα is a transitive set.
Definition 6.5. For any constructible set X, we let order(X) = min(α | X ∈ Lα ).
Theorem 6.6. Ord ⊆ L and Ord ∩Lα = α, ∀α. So any ordinal α is constructible with order α + 1.
Proof.
By induction on α, we prove Ord ∩ Lα = α. So suppose this holds for all α less than some
ordinal β.
If β is a limit, then
[ [
Ord ∩ Lβ = Ord ∩ Lα = α = sup α = β.
α<β
α<β α<β
Definition 6.7. The class of Σ1 -formulas is the smallest class of first order formulas of the language
{∈} such that
(i) any quantifier-free formula is Σ1
(ii) if φ, ψ are Σ1 , then so are φ ∧ ψ and φ ∨ ψ
(iii) if φ is Σ1 , then so is ∃x φ
(iv) if φ is Σ1 , then so is ∀x ∈ y φ, i.e. ∀x (x ∈ y −→ φ).
A class, class function, or class relation is said to be Σ1 if it is defined in the universe U by some
Σ1 -formula. So a class function is Σ1 if it has a Σ1 graph.
SET THEORY AND FORCING 41
Remark:
The subclass of ∆0 -formulas is obtained by replacing (iii) by the following two conditions and
otherwise changing "Σ1 " to "∆0 ".
(a) if φ is ∆0 , then so is ∃x ∈ y, φ
(b) if φ is ∆0 , then so is ¬φ.
Theorem 6.8. Suppose φ(x1 , ..., xk ) is Σ1 and M ⊆ N are classes with M transitive. Then for
any a1 , ..., ak in M :
(?) φM (a1 , ..., ak ) =⇒ φN (a1 , ..., ak ).
Remark:
Before proving this, we should acknowledge that most formulas are not Σ1 , but only equivalent
to Σ1 -formulas in some background theory. However, suppose T is a theory and φ(~x), ψ(~x) are
formulas with φ Σ1 such that T |= (φ(~x) −→ ψ(~x)). Assume also that M, N satisfy T . Then (?)
implies that also ψ M (~a) =⇒ ψ N (~a) for any ~a in M .
Proof.
We prove the result by induction on the construction of φ by the principles (i)–(iv). (i) and (ii)
are trivial, so suppose that ψ(x, x1 , ..., xk ) satisfies the induction hypothesis and that a1 , ..., ak
belong to M .
Lemma 6.10. Suppose a is a set defined by a Σ1 -formula φ(x), i.e. x = a ⇐⇒ φ(x), and that
ψ(y, ~z) is Σ1 . Then also ψ(a, ~z) is Σ1 .
Proof.
ψ(a, ~z) ⇐⇒ ∃x (φ(x) ∧ ψ(x, ~z)).
Fact: (ZF)
Ord is a ∆0 class and ω is defined by a Σ1 -formula.
Proof.
Note that
Ord(x) ⇐⇒ ∀y ∈ x ∀z ∈ x (y ∈ z ∨ z ∈ y ∨ z = y) ∧ ∀y ∈ x ∀z ∈ y, z ∈ x.
For the definition of ω, we check that the following formulas are Σ1 .
x = ∅: ∀y ∈ x, y 6= y
y = x ∪ {x}: ∀z ∈ y (z ∈ x ∨ z = x) ∧ ∀z ∈ x, z ∈ y ∧ x ∈ y
x = ω: ∅ ∈ x ∧ ∀y ∈ x, y ∪ {y} ∈ x ∧ ∀y ∈ x (y = ∅ ∨ ∃z y = z ∪ {z})
Proposition 6.11 (ZF). Suppose H(−) is a Σ1 class function of one variable defined on all functions
with ordinal domain. Then the unique class function F defined on all ordinals and satisfying
F (α) = H(Fα ) is Σ1 too.
Proof.
Again, we successively verify that certain objects, classes, and class relations are Σ1 :
z = {x, y}: x ∈ z ∧ y ∈ z ∧ ∀u ∈ z (x = u ∨ y = u)
z = (x, y): z = {{x}, {x, y}}
y ⊆ x: ∀z ∈ y, z ∈ x
z = x ∪ y: ∀u ∈ z (u ∈ x ∧ u ∈ y) ∧ ∀u ∈ x, u ∈ z ∧ ∀u ∈ y, u ∈ z
z = x ∩ y: ∀u ∈ z (u ∈ x ∧ u ∈ y) ∧ ∀u ∈ x (u ∈ y −→ u ∈ z)
z = x r y: ∀u ∈ z (u ∈ x ∧ u 6∈ y) ∧ ∀u ∈ x (u 6∈ y −→ u ∈ z)
z ⊆ x × y: ∀u ∈ z∃a ∃b (a ∈ x ∧ b ∈ y ∧ u = (a, b))
z ⊇ x × y: ∀a ∈ x ∀b ∈ y ((a, b) ∈ z)
f is a function from x to y:
f ⊆ x × y ∧ ∀z ∈ x ∃v (v ∈ y ∧ (z, v) ∈ f ) ∧
∀z ∈ x ∀v ∈ y ∀u ∈ y ((z, v) 6∈ f ∨ (z, u) 6∈ f ∨ v = u)
[Here, (z, v) 6∈ f ⇐⇒ ∃a (a 6∈ f ∧ a = (z, v)) is Σ1 in the variables z, v, f ]
f is a function with domain x: ∃y f : x → y
g = fx : ∃a ∃b (x ⊆ a ∧ f : a → b ∧ g : x → b ∧ g ⊆ f )
f is a function:∃a ∃b f : a → b
f is a function and f (x) = y: f is a function and (x, y) ∈ f .
Fact:
The following are Σ1 .
f is an injection from x to y:
(f : x → y) ∧ ∀a ∈ x ∀b ∈ x ∀c ∈ y (a, c) 6∈ f ∨ (b, c) 6∈ f ∨ (a = b)
f is a U-formula: ∃k (f ∈ Fk )
z = F: the set of U-formulas
f ∈ F ∧ a = var(f ) is Σ1 in the variables f and a.
We now show that the class relation "Y = X var(f ) ∧ f ∈ F" in the three variables X, Y, f
is Σ1 . Note, however, that the class relation "Y = X a " is not Σ1 . To see this, observe that
if Z is a transitive set, then for Y, X, a ∈ Z,
(Y = X a )Z ⇐⇒ Y = {g ∈ Z | g : a → X}.
Since we can construct transitive sets Z with elements Y, X, a such that
Y = {g ∈ Z | g : a → X} ( {g | g : a → X},
being the set of functions from a to X is not preserved from Z to U and hence cannot be
Σ1 .
aTN: All the FOL symbols, parens, and everything else made this obnoxious to TeX. In making revisions to this
pdf, I’m reluctant to redo all the various sizes of parens, brackets, and others to make it readable. I’ll do so iff people
care, aka send me emails.
44 KRIVINE
k < ω ∧ Y = X k is Σ1 in k, Y, X:
::::::::::::::
∈ F ∧ Y = X var(f ) :
f::::::::::::::::::
f ∈ F ∧ ∃p ∃k < ω (p : var(f ) → k is a bijection) ∧
(∀g ∈ X k , g ◦ p ∈ Y ) ∧ (∀h ∈ Y ∃g ∈ X k , g ◦ p = h)
f is a U-sentence with parameters in X: ∃g ∈ F ∃δ : var(g) → X, f = (g, δ)
z = FX 0
where this latter is the set of U-sentences with parameters in X:
∀f ∈ z (f is a U-sentence with parameters in X) ∧ ∀f ∈ F ∀δ ∈ X var(f ) , (f, δ) ∈ z
f is a U-formula with parameters in X and x as a single free variable
z = FX x
where the latter denotes the set of U-formulas with parameters in X and a single
free variable x
f ∈ FX 0
∧ t = V al(f, X) is Σ1 in variables f, X, t and where t can take the values 0, 1.
To see this, note that t = V al(f, X) if and only if there is a function satisfying Tarski’s
recursive definition of truth eventually ending up with t.
x
f ∈ FX ∧ y = V al(f, X)
x
y ∈ D(X): ∃x ∈ V ∃f ∈ FX , y = V al(f, X)
x
z = D(X): ∀y ∈ z y ∈ D(X) ∧ ∀x ∈ V ∀f ∈ FX (V al(f, X) ∈ z)
Corollary 6.13 (ZF). The class function L : Ord → U is Σ1 .
Proof.
L is a class function defined by transfinite recursion from the Σ1 class function D(−)
Theorem 6.14 (ZF). L satisfies ZF + "V = L"
Proof.
Recall that L is a transitive class. Assume that L satisfies ZF. Then, as the class function
α 7−→ Lα is well-defined in any model of ZF, we have that
L
α 7−→ Lα
is a class function defined on all ordinals α in L. Moreover, since y = Lα is Σ1 in the variables
y, α, we have for all y, α in L:
L
y = Lα =⇒ y = Lα .
Now, suppose x belongs to L and find an ordinal α such that x ∈ Lα (not necessarily the least
such ordinal). Then α belongs to L and, since the class of ordinals is ∆0 and thus has Σ1
complement, we have
L
Ord(α) .
SET THEORY AND FORCING 45
Now, let y be the set in L such that [y = Lα ]L . Then also y = Lα and so x ∈ y, hence also
[x ∈ y]L . So, finally, [x ∈ Lα ]L , showing that
L
∀x ∃α x ∈ Lα
i.e., [V = L]L .
It now remains to show that L satisfies ZF.
Extensionality holds in L since L is transitive.
Union: Suppose a is constructible, say a ∈ Lα . Then b = x∈a x ⊆ Lα since Lα is transitive
S
and
b = V al( ∃˙ y (y ∈˙ a ∧˙ x ∈˙ y), Lα ) ∈ Lα+1 .
Powerset: Suppose a is in L and let b = {x ⊆ a | x is in L}. Find also α sufficiently large
such that b ⊆ Lα and thus also a ∈ Lα . But then
b = V al( ∀˙ y (y ∈˙ x −→ ˙ y ∈˙ a), Lα ) ∈ Lα+1 .
Infinity: ω belongs to L.
Replacement: Suppose φ(x, y, ~a) is a formula with parameters a1 , ..., ak in L that defines a
class function in L. Suppose c is a constructible set and set
b = y | L(y) ∧ ∃x ∈ c φL (x, y, ~a) .
Then b ⊆ Lα for some α large enough such that a1 , .., ak , c ∈ Lα . By the reflection scheme,
we can find β > α a limit such that
∀x, y, ~z ∈ Lβ φLβ (x, y, ~z) ←→ φL (x, y, ~z)
It follows that
b = V al(p∃x ∈ x φ(x, y, ~a)q, Lβ ) ∈ Lβ+1 .
Foundation: If a 6= ∅ belongs to L, pick b ∈ a of minimal rank. Then b belongs to L and
a ∩ b = ∅.
Theorem 6.15. V = L =⇒ principle of choice.
In particular, the principle of choice is consistent. Also,
V = L =⇒ V = L = OD = HOD.
Proof.
List the set of U-formulas as (fn )n<ω . Suppose X is a set well-ordered by a relation . We then
define a well-ordering 0 of D(X) as follows:
• The ordering of X canonically induces a well-ordering 1 of the set X <ω of finite se-
quences of elements of X.
• Now, if A, B ∈ D(X), put
there is fn (x, ~y ) ∈ F and ~a ∈ X <ω with
A = V al fn (x, ~a), X and for any fm (x, ~z) ∈ F and
A 2 B ⇐⇒
b ∈ X <ω with B = V al fm (x, ~b), X , either
n < m, or n = m ∧ ~a 1 ~b.
46 KRIVINE
Now, p∃y φ(a, y)q ∈ A and so Y |= p∃y φ(a, y)q, hence for some b ∈ Y , we have φY (a, b). Since
φ is Σ1 and Y is transitive, it follows that also φ(a, b), hence F (a) = b ∈ Y . Again, as Y is
transitive, F (a) ⊆ Y and so
|F (a)| ≤ |Y | = |X| ≤ | cl(a)| ⊕ ℵ0 .
Remark:
Since |P(ω)| > ℵ0 = | cl(ω)| ⊕ ℵ0 , we see that the class function P(−) cannot be Σ1 .
Remark:
If a is a Σ1 definable set, i.e. the statement x = a is Σ1 in the variable x, then |a| ≤ ℵ0 . For in
this case, a = F (∅) for a Σ1 class function.
aTN: Recall that we mentioned earlier that we would be proving the independence of CH. This is technically the
first part of that proof. We are now showing that ZFC + "CH" is consistent if ZFC is, and we will later show, using
methods of forcing, that ZFC + "¬CH" is also consistent if ZFC is.
bTN: This is one of the rare times that definitions work out very nicely.
48 KRIVINE
Theorem 6.20 (ZF). If an arithmetical statement φ is provable from ZFC+"V =L"+GCH, then
φ is provable from ZF.
Proof.
Vω = Lω ⊆ L, φ is Σ1 and so from ZF, we get that φL =⇒ φ. Now, suppose ψ1 , ψ2 , ..., ψn , φ is
a proof of φ from the axioms of ZFC+"V =L". If ψi is an axiom, then also ψiL holds (as can be
prove only supposing ZF i U), so ψ1L , ψ2L , ..., ψnL , φL , φ is a proof of φ only using axioms of ZF.
SET THEORY AND FORCING 49
7. Forcing
Whereas Gödel’s construction of L provided us with a model of ZFC+"V =L"+GCH, we shall now
present Paul Cohen’s method of forcing giving us a model of ZFC+¬CH.
Main Idea:
If U is a model of ZFC and M is a countable, transitive set in U, then forcing is a
method for adjoining a new set x to M , assumed to be somehow generic, to obtain a new
countable set M [x] still satisfying ZF.
We also have tools for studying this adjoining of x to M and to control the properties of M [x] in
terms of M and x. First, let us see how we can obtain countable transitive set models of ZFC.
Theorem 7.1. Suppose T is a theory in the language of set theory extending ZFC and let
m be a new constant symbol. Then, if T is consistent, so is the theory T ∗ := T + T m +
“m is a countable, transitive set00 .
Proof.
Suppose towards a contradiction that T ∗ is inconsistent. Then there is a finite fragment of T ∗
that is inconsistent,a and so there are sentences φ1 , ..., φn ∈ T such that
^n
00
T+ i + “m is a countable, transitive set
φm
i=1
is inconsistent.
Now, since T is consistent, let U be a model of T . By the reflection scheme, find an ordinal
Vn V α
α such that i=1 φi holds. Also, by Löwenheim-Skolem, there is a countable set X ⊆ Vα
such that for any U-formula f , we have X |= f ⇐⇒ Vα |= f . In particular, since Vα satisfies the
axiom of extensionality, so does X, and as Vα |= pφi q for all i ≤ n, we have X |= pφi q for all
i ≤ n as well.
Let j : X → Y be the canonical map from X onto its Mostowski collapse. Then Y is a
countable
Vn transitive set and Y |= pφi q for all i ≤00n. We can therefore expand U to a model
of T + i=1 φm i + “m is a countable, transitive set by interpreting m as Y , contradicting our
assumption.
7.1. Generic Extensions.
In the following, suppose M is a countable transitive set satisfying ZF in a universe U satisfying
ZFC. Assume also that (P, ≤) ∈ M is a poset (partially ordered set) in M , P 6= ∅.
Definition 7.2.
– Elements of P are called forcing conditions, and if p ≤ q, we say that p is stronger than q.a
– Two forcing conditions p, q are said to be compatible if ∃r ∈ P (r ≤ p ∧ r ≤ q). Otherwise,
p and q are incompatible, written p⊥q.
– A subset D ⊆ P is dense if ∀p ∈ P ∃q ∈ D (q ≤ p)
– D is said to be saturated if ∀p ∈ D ∀q ≤ p (q ∈ D).
– D is said to be predense if ∀p ∈ P ∃q ∈ D (p and q are compatible).
– For any set X ⊆ P, let
X̃ := {p ∈ P | ∃q ∈ X p ≤ q}
denote the saturation of X. Note that if X is predense, then X̃ is dense.
Definition 7.3 (Generic Extension). Now, suppose G ⊆ P is a subset, not necessarily belonging
to M , but only to U. We say that G is P-generic over M if
(i) ∀p ∈ G ∀q ∈ P (p ≤ q −→ q ∈ G)
(that is, G is closed upwards)
(ii) ∀p, q ∈ G, p and q are compatible
(iii) ∀D ∈ M (if D is a dense subset of P, then D ∩ G 6= ∅)
Since P-generics are upwards closed, we see that (iii) can be replaced by either
(iii)’ ∀D ∈ M (if D is a predense subset of P, then D ∩ G 6= ∅) or
(iii)” ∀D ∈ M (if D is a dense and saturated subset of P, then D ∩ G 6= ∅)
Lemma 7.4. Suppose G is P-generic over M . Then ∀p ∈ P p 6∈ G ←→ ∃q ∈ G (p⊥q) .
Proof.
Since any two elements of G are compatible, if q ∈ G and p⊥q, then p 6∈ G.
Conversely, suppose p 6∈ G and consider the set
D = {q ∈ P | (q ≤ p) ∨ (q⊥p)}.
We claim that D is dense. For if r ∈ P is given, then either r⊥p, in which case r ∈ D, or there
is q ∈ P such that q ≤ r ∧ q ≤ p, in which case q ∈ D, showing density.
Also, since M satisfies ZF, the construction of D can be done inside M , and so D ∈ M . In
other words, D ∈ M is a dense subset of P. So, as G is P-generic over M , we have G ∩ D 6= ∅.
So, let q ∈ G ∩ D be any element. Note that if q ≤ p, then as G is closed upwards, also p ∈ G,
which is by assumption not the case. So instead we must have p⊥q.
Lemma 7.5. Suppose G is P-generic over M . Then ∀p, q ∈ G ∃r ∈ G (r ≤ p ∧ r ≤ q). That is, any
two elements of G have a common minorant.
Proof.
Fix p, q ∈ G and let
D = r ∈ P | r⊥p ∨ (r ≤ p ∧ r⊥q) ∨ (r ≤ p ∧ r ≤ q) .
Again, since M satisfies ZF, the construction of D can be done in M , and so D ∈ M . Moreover,
D is dense: for given any t ∈ P, either t⊥p, and so t ∈ D, or there is s ∈ P with s ≤ t ∧ s ≤ p.
In the latter case, either s⊥q, where s ∈ D, or there is some r ≤ s ≤ p ∧ r ≤ q, hence r ∈ D.
aTN: This is not a typo.
SET THEORY AND FORCING 51
So pick some r ∈ G ∩ D. Since any two elements of G are compatible, this must mean that
r ≤ p ∧ r ≤ q, and so p, q have a common minorant in G.
By induction, we see that
Lemma 7.6. Any finite subset of G has a common minorant in G.
Definition 7.7. Suppose D ⊆ P and p ∈ P. We say that D is dense below p if
∀q ∈ P q ≤ p −→ ∃r ∈ D (r ≤ q) .
Lemma 7.8. Suppose G is P-generic over M and assume D ∈ M is dense below some p ∈ G. Then
G ∩ D 6= ∅.
Proof.
Note that E = D ∪ {q ∈ P | q⊥p} ∈ M and E is dense in P. For if q ∈ P and q has no minorant
in D, then q and p cannot have any common minorant, hence p⊥q and thus also q ∈ D.
It follows that G ∩ E 6= ∅ and so, as any two elements of G are compatible, also G ∩ D 6= ∅.
Definition 7.9. A subset X ⊆ P is an antichain if ∀p ∈ X ∀q ∈ X (p 6= q −→ p⊥q).
An antichain is said to be maximal if it is not contained in any larger antichain.
Note:
An antichain is maximal if and only if it is predense in P.
In particular, if X ∈ M is a maximal antichain and G is P-generic over M , then G ∩ X 6= ∅. (In
particular, being a maximal antichain is ∆0 .)
Lemma 7.10 (Assuming M satisfies AC). Suppose D ∈ M is a dense subset of P. Then there is
a maximal antichain X ⊆ D, X ∈ M .
Proof.
Work in M:
We order the set of all antichains X ⊆ D by inclusion and note that by Zorn’s Lemma, this
family has a maximal element X, which then is predense in (D, ≤). So for any p ∈ P, there is
q ∈ D, q ≤ p, and so , by predensity of X in D, some r ∈ X compatible with q and thus also
with p. So X is predense in P and hence a maximal antichain.
Theorem 7.11 (Assuming M satisfies AC). Assume G ⊆ P. Then G is P-generic over M if and
only if
(i) any two elements of G are compatible
(ii) if X ∈ M is a maximal antichain in P, then G ∩ X 6= ∅.
Proof.
We have already seen that if G is P-generic over M , then (i) and (ii) hold. For the inverse,
note that if G intersects any maximal antichain X ∈ M , then G also intersects any dense subset
D ∈ M.
Finally, to see that G is closed upwards, suppose p ∈ G, q ∈ P and p ≤ q. We let Dq :=
{r ∈ P | r⊥q} and let X ⊆ Dq , X ∈ M be a maximal antichain of the poset (Dq , ≤). Then also
52 KRIVINE
The following result tells us that for the purposes of forcing, we can work with (D, ≤) instead of
(P, ≤) for any dense subset D ∈ M .
that {q ∈ Dn | q ≤ pn } =
6 ∅, hence this sequence is well-defined).
Then p0 ≥ p1 ≥ ... and pn+1 ∈ Dn for any n < ω. Letting G = {q ∈ P | ∃n < ω, pn ≤ q}, we
see that G is upwards closed, any two elements of G are compatible, and G intersects any dense
D ⊆ P, D ∈ M . Moreover, p ∈ G.
aTN: Wikipedia says that this theorem "is one of the most fundamental facts used in the technique of forcing."
SET THEORY AND FORCING 53
dom(Φ) by Ψ(b) = Φ(b) for b ∈ dom(Φ), and Ψ(a) = {Φ(b) | b ∈ A ∧ bRa}. Then Ψ is an
R-contraction defined on a strictly larger downwards closed subset of A, which is absurd.
7.3. Construction of Generic Extensions.
Suppose M is a countable transitive set model of ZF and (P, ≤) ∈ M is a poset. Suppose also that
G is P-generic over M . We define the following relation R on M using G:
For x, y ∈ M
xRy ⇐⇒ ∃p ∈ G (x, p) ∈ y .
Since xRy =⇒ rk(x) < rk(y), we see that R is well-founded on M .
By the preceding theorem, there is a unique R-contraction, Φ with domain M , and we shall denote
the image Φ[M ] by M [G]. M [G] is called a generic extension of M . Our goal is now to show that
M [G] itself is a countable transitive set model of ZF and eventually study the relation between
M, (P, ≤), G, and M [G].
54 KRIVINE
aTN: I know I’ve already said this, but well-orderings are great.
56 KRIVINE
(3) p x 6∈ y ⇐⇒ ∀q ≤ p q 6 x ∈ y
(4) p x = y ⇐⇒ ∀q ≤ p q 6 x 6= y .
The symbol is called the forcing relation and, e.g., "p x ∈ y" should be read as "p forces x ∈ y".
Since the statements (1), (2), (3), and (4) are defined by induction on the rank of x, y, it is
intuitively clear that this is well-defined. But to see that they are class relations, we defined a
well-founded ordering ≺ of M × M by
or
max (rk(x0 ), rk(y0 )) < max (rk(x1 ), rk(y1 )),
(x0 , y0 ) ≺ (x1 , y1 ) ⇐⇒ max (rk(x0 ), rk(y0 )) = max (rk(x1 ), rk(y1 )) and
min (rk(x0 ), rk(y0 )) < min (rk(x1 ), rk(y1 )).
So, using F , we see that (1), (2), (3), (4) are class relations in M .
Having defined (1), (2), (3), (4), we extend the forcing relation to arbitrary formulas with
parameters by induction on their construction over :::::: literals “x ∈ y”, “x 6∈ y”, “x = y”, “x 6= y”.
If φ(a1 , ..., an ), ψ(a1 , ..., an ), σ(y, a1 , ..., an ) are formulas of the language of set theory with pa-
rameters a1 , ..., an ∈ M , let
(5) p φ(~a) ∨ ψ(~a) ⇐⇒ p φ(~a) or p ψ(~a)
WARNING
Contrary to classical logic, for the forcing relation, we consider formulas as being built
up from literals
::::::
and NOT from atomic formulas.
E.g.
p ¬(x 6∈ y) ⇐⇒ ∀q ≤ p, q 6 x 6∈ y ⇐⇒ ∀q ≤ p ∃r ≤ q, r x ∈ y,
which is weaker than p x ∈ y.
Proof.
Suppose first that φ is one of the formulas a ∈ b, a 6= b, a 6∈ b, or a = b. Then the result is
proved on the stratification ρ. Now for any other formula φ, the result is proved by induction
on the construction of φ using (5), (6), and (7).
Formulas involving ∀ and ∧ are introduced by defining
∀x φ ⇐⇒ ¬∃x ¬φ
φ ∧ ψ ⇐⇒ ¬(¬φ ∨ ¬ψ).
Thus,
(8) p ∀y σ(y, ~a) ⇐⇒ p ¬∃y ¬σ(y, ~a) ⇐⇒ ∀q ≤ p ∀b, q 6 ¬σ(b, ~a)
⇐⇒ ∀b ∀q ≤ p ∃r ≤ q, r σ(b, ~a).
(9) p ∀q ≤ p ∃r ≤ q, r φ(~a) ∧ ∃s ≤ q, s ψ(~a) .
φ(~a) ∧ ψ(~a) ⇐⇒
Lemma 7.23. For all p ∈ P and a, b ∈ M with (a, p) ∈ b, p a = a and p a ∈ b.
Proof.
We show this by induction on rk(a). So suppose q c = c for all q ∈ P and c ∈ M with
rk(c) < rk(a). Then
p a = a ⇐⇒ ¬∃q ≤ p, q a 6= a ⇐⇒ ¬∃q ≤ p ∃r ≥ q ∃c, (c, r) ∈ a ∧ q c ∈ 6 a
6 a. Then
Now, suppose towards a contradiction that q ≤ p, r ≥ q, and (c, r) ∈ a with q c ∈
rk(c) < rk(a), and so q c = c, hence by (1), q c ∈ a. By (3), this contradicts q c 6∈ a.
Therefore, p a = a. To see that p a ∈ b, we apply (1) directly.
7.5. The Truth Lemma.
We begin this subsection with a preliminary lemma which is not the truth lemma.
Lemma 7.24. For any formula φ(~a) with parameters a1 , ..., an ∈ M , there is p ∈ G such that
p φ(~a) or p ¬φ(~a).
Proof.
Note that by (7), the set
D := p ∈ P p φ(~a) ∨ p ¬φ(~a)
is dense and also belongs to M . So G ∩ D 6= ∅.
Lemma 7.25. If a, b ∈ M , we have
Φ(a) ∈ Φ(b) ⇐⇒ ∃p ∈ G, p a∈b
Φ(a) 6= Φ(b) ⇐⇒ ∃p ∈ G, p a 6= b.
58 KRIVINE
Proof.
The proof is by induction on ρ(a, b). So assume the result holds for all c, d ∈ M with ρ(c, d) <
ρ(a, b).
Suppose Φ(a) ∈ Φ(b) and pick c ∈ M, q ∈ G such that Φ(a) = Φ(c), (c, q) ∈ b. Choose also,
by the previous lemma, some p ≤ q, p ∈ G, such that either p c 6= a or p ¬(c 6= a), i.e.,
p c = a.
Since ρ(c, a) < ρ(a, b), we have
p c 6= a =⇒ Φ(c) 6= Φ(a)
and we must therefore have p c = a. By (1), it follows that p a ∈ b.
Conversely, if p a ∈ b for some p ∈ G, then by (1), there is r ≥ p and c ∈ M such that
(c, r) ∈ b and p c = a. Again, ρ(c, a) < ρ(a, b), so if Φ(c) 6= Φ(a), then by the induction
hypothesis, there is q ≤ p, q ∈ G such that q c 6= a, contradicting p c = a. Thus,
Φ(a) = Φ(c) ∈ Φ(b).
The proof for the second equivalence is similar.
Lemma 7.26 (Truth Lemma). Suppose ψ(~x) is a formula without parameters and a1 , ..., an ∈ M .
Then
ψ M [G] Φ(a1 ), ..., Φ(an ) ∃p ∈ G, p ψ(~a)a
⇐⇒
Proof.
While the previous lemma is proved by induction on ρ, this is proved by induction on the
construction of ψ over literals “x ∈ y”, “x 6∈ y”, “x = y”, “x 6= y”.
Now if ψ is either x1 ∈ x2 or x1 6= x2 , then the result is proved by the previous lemma.
Suppose ψ is x1 6∈ x2 . Then for a1 , a2 ∈ M ,
∃p ∈ G, p a1 6∈ a2 ⇐⇒ ∃p ∈ G ∀q ≤ p, q 6 a1 ∈ a2 .
So if Φ(a1 ) ∈ Φ(a2 ), there is r ∈ G such that r a1 ∈ a2 , and so for any p ∈ G, there is
q ≤ r, q ≤ p such that q a1 ∈ a2 , contradicting that ∃p ∈ G, p a1 6∈ a2 .
Thus,
∃p ∈ G, p a1 6∈ a2 =⇒ Φ(a1 ) 6∈ Φ(a2 )
=⇒ ∀p ∈ G, p 6 a1 ∈ a2 =⇒ ∃p ∈ G, p a1 6∈ a2
Similarly for x1 = x2 and ψ = ¬φ.
Suppose ψ(~x) = ∃y σ(y, ~x) and the lemma holds for σ. Then for any a1 , ..., an ∈ M ,
ψ M [G] Φ(a1 ), ..., Φ(an ) ⇐⇒ ∃b ∈ M, σ M [G] Φ(b), Φ(a1 ), ..., Φ(an )
aTN: If you don’t immediately see why this lemma is so beautiful and powerful for us, then stop, go back to the
beginning of this section, and reread until you realize how useful this is about to become.
SET THEORY AND FORCING 59
Notation:
Instead of writing p a, ~b ∈ M , we simplify notation and write p
n , b1 , ..., bk for ~
ψ ab1 , ..., ac
ψ(a1 , ..., an , b1 , ..., bk ).
So, "un-underlined" a’s refer to themselves in the generic extension M [G], while underlined b’s
refer to Φ(b) in M [G]. So the truth lemma reads:
For any formula ψ(x1 , ..., xn , y1 , ..., yk ) and any a1 , ..., an , b1 , ..., bk ∈ M :
ψ M [G] ~a, Φ(b1 ), ..., Φ(bk ) ∃p ∈ G, p ψ(~a, ~b).
⇐⇒
Theorem 7.27. M [G] satisfies ZF.
Proof.
We need only check the power set axiom and replacement scheme.
For the power set axiom, suppose Φ(a) ∈ M [G] for some a ∈ M . We let
a0 := (x, p) ∈ cl(a) × P ∃q ≥ p, (x, q) ∈ a ∈ M
and b = P M (a0 ) × P ∈ M .
We claim that Φ(b) = P M [G] Φ(a) . To see this, suppose Φ(u) ∈ Φ(b) and assume without
loss of generality that there is r ∈ G such that (u, r) ∈ b. Then u ∈ a0 , and so for any (x, p) ∈ u,
there is q ≥ p with (x, p) ∈ a. Since G is upwards closed, this means that if Φ(x) ∈ Φ(u), also
Φ(x) ∈ Φ(a), hence Φ(u) ⊆ Φ(a) and Φ(b) ⊆ P M [G] Φ(a) .
Conversely, if Φ(u) ⊆ Φ(a), we need to find v ⊆ a0 such that Φ(v) = Φ(u). It follows then
that (v, r) ∈ b for any r ∈ G, and so Φ(u) = Φ(v) ∈ Φ(b). Set
v := (x, p) ∈ a0 | p x ∈ u .
Then for Φ(v) = Φ(u), suppose that Φ(x) ∈ Φ(v) for some (x, p) ∈ a0 , p x ∈ u and p ∈ G. Then
by the truth lemma, also Φ(x) ∈ Φ(u), so Φ(v) ⊆ Φ(u). Conversely, if Φ(x) ∈ Φ(u) ⊆ Φ(a) for
some (x, q) ∈ a with q ∈ G, there is by the truth lemma some r ∈ G such that r x ∈ u. Now,
if p ∈ G, p ≤ r, q, then p x ∈ u and so (x, p) ∈ v and thus also Φ(x) ∈ Φ(v). So Φ(u) ⊆ Φ(v).
For replacement, suppose Φ(a) ∈ M [G] and ψ x, y, Φ(a1 ), ..., Φ(an ) is a formula that in M [G]
So F is a class function in M and thus, by applying replacement in M , we see that the following
is a set in M :
b = (y, p) ∈ M × P ∃x ∃q ≥ p, (x, q) ∈ a ∧ y ∈ F (x, p) .
Now, if Φ(y) ∈ Φ(b) for some p ∈ G such that (y, p) ∈ b, we find x and q ≥ p such that (x, q) ∈ a
and y ∈ F (x, p), i.e., p ψ(x, y, ~a). By the truth lemma, ψ M [G] Φ(x), Φ(y), Φ(a1 ), ..., Φ(an )
and so Φ(y) ∈ B.
Conversely, if Φ(y) ∈ B, find Φ(x) ∈ Φ(a) such that ψ M Φ(x), Φ(y), Φ(a1 ), ..., Φ(an ) and
assume without loss of generality that (x, q) ∈ a for some q ∈ G. By the truth lemma, there is
60 KRIVINE
p ∈ G such that p ψ(x, y, ~a), and we can assume that p ≤ q. Pick y0 ∈ F (x, p) 6= ∅. Then
p ψ(x, y0 , ~a) and so, by the truth lemma,a
ψ M [G] Φ(x), Φ(y0 ), Φ(a1 ), ..., Φ(an ) .
Since ψ defined a class function in M [G], this implies that Φ(y) = Φ(y0 ). As also (y0 , p) ∈ b, we
have Φ(y) = Φ(y0 ) ∈ Φ(b). Therefore, B ⊆ Φ(b) and so B = Φ(b).
Theorem 7.28. M and M [G] contain the same ordinals.
Proof.
Recall that as M, M [G] satisfy ZF and are transitive, the notion of ordinal is absolute for both.
So the result follows from M ∩ Ord = M [G] ∩ Ord.
Theorem 7.29 (Proof omitted). M [G] is the smallest transitive set model of ZF such that M ⊆
M [G] and G ∈ M [G].
Theorem 7.30. b
If M satisfies AC, then so does M [G].
Proof.
Suppose a[G] ∈ M [G]. It suffices to find in M [G] a surjection from an ordinal onto a superset of
a[G]. This induces a well-ordering on a[G] by defining an injection from a[G] to this ordinal by
mapping a0 ∈ a[G] to the least pre-image of a0 under this surjection. And finally, an injection
from a set to an ordinal obviously induces a well-ordering on that set.
So by AC in M , let f : α cl(a) be a surjection from an ordinal α onto the transitive closure
of a. It should be immediately obvious that a[G] ∈ cl(a)[G]. Using the image of f under our
collapse map should work. This is because being a function and being an ordinal are absolute,
so the image of f under the collapse map will still be a function with domain an ordinal that
maps onto a superset of a[G].
7.6. Consistency of ZFC + ¬CH.
Recall that not only is CH consistent with ZFC, but GCH is consistent with ZFC. During this final
section, we will use the preceding useful lemmas to force a new model which forces |2ω | > ℵ1 .
Here’s the general outline of our steps:
<ω
• We look at the poset ℵ7 × ω → 2 of finite partial functions from ℵ7 × ω to 2 ordered
First, one should verify that the poset we’ve given is actually a poset and that for some transitive
model M satisfying ZFC, that the given poset is genuinely S in M .
Also, if G ⊆ P is a P-generic ultrafiltera, set g := G. Then, we want to show that g is a
surjective function onto the codomain, and, setting for each α ∈ ℵ7 , gα := g(α, −) : ω → 2, viewing
g as a function g 0 : ℵ7 → 2ω given by α 7−→ gα , g 0 is injective. These tasks are left as exercises.
What follows is Professor Tserunyan’s graph theoretic proof of the ∆-system lemma.
Definition 7.31. A set (think family of sets) F is called a ∆-system is there is a set r (called the
root) such that for any distinct a, b ∈ F, a ∩ b = r.
Definition 7.32. A graph is locally countable if every vertex has at most countably many neigh-
bors.
A quick observation is that a graph is locally countable if and only if each of its connected compo-
nents is countable.
For a graph G on a set V , a subset I ⊆ V is called independent if no two vertices in it are
adjacent.
Lemma 7.33 (Uses AC). A locally countable graph G on an uncountable set V admits an inde-
pendent set I ⊆ V with |I| = |V |.
Proof.
Each connected component is countable, so their collection has cardinality |V |. Choosing a point
from each connected component gives a desired independent set.
The intersection graph on any family V of sets is defined by putting an edge between distinct
u, v S∈ V exactly when u ∩ v 6= ∅. Call a family V of sets point-locally countable if for each
a ∈ V , the set Va := {v ∈ V | a ∈ v} is countable.
Lemma 7.34 (Uses Countable-AC). The intersection graph on any point-locally countable family
V of countable sets is locally countable.
Proof.
For each v ∈ V , each a ∈ v is contained in at most countably many other vertices in V . Since v
is countable and countable union of countable sets is countable, it intersects at most countably
many other sets in V .
The previous two lemmas give us the following:
Proposition 7.35 (Uses AC). The intersection graph on a point-locally countable uncountable
family V of countable sets admits a pairwise disjoint collection I ⊆ V with |I| = |V |.
aTN: A filter is simply a non-empty subset of a poset that is closed upwards and any two elements are compatible;
an ultrafilter would be a maximal filter; thus, we’ve already been working a lot with filters and ultrafilters through
P-generic subsets.
62 KRIVINE
Corollary 7.36 (The ∆-system Lemma, Uses AC). Every uncountable family V of finite sets
contains an uncountable ∆-system.
Proof.
By shrinking V using uncountable-to-countable Pigeonhole Principle (involves AC), we may
assume that for some n ∈ N, all sets in V have the same size n and we prove the statement by
induction on n. The base case n = 1 is trivial, so suppose the statement is true for n − 1 ≥ 1
and let V be an uncountable family of sets of size n.
If V is point-locally countable, then Proposition 7.35 finishes the proof, so suppose that V is
not point-locally countable and let a ∈ V be such that the set Va is uncountable. Removing
S
a from the sets in Va , applying induction, and putting a back yields an uncountable ∆-system
Va0 ⊆ Va .
Lemma 7.37. For sets A, B with 2 ≤ |B| ≤ ℵ0 , the poset [A → B] , ⊇ has the countable
a <ω