Last Lec
Last Lec
Last Lec
Contents
1 Basic Theorems of Real Analysis 1
1.1 The Least Upper Bound Principle . . . . . . . . . . . . . . . . 1
1.2 Monotonic Sequences . . . . . . . . . . . . . . . . . . . . . . . 1
1.3 Upper and Lower Limits of Bounded Sequences of Real Numbers 2
1.4 Cauchy’s Criterion for Convergence . . . . . . . . . . . . . . . 4
1.5 The Bolzano-Weierstrass Theorem . . . . . . . . . . . . . . . . 5
1.6 The Intermediate Value Theorem . . . . . . . . . . . . . . . . 6
3 Metric Spaces 23
3.1 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.2 Convergence of Sequences in a Metric Space . . . . . . . . . . 24
3.3 Continuity of Functions between Metric Spaces . . . . . . . . 25
3.4 Open Sets in Metric Spaces . . . . . . . . . . . . . . . . . . . 26
i
3.5 Closed Sets in a Metric Space . . . . . . . . . . . . . . . . . . 28
3.6 Continuous Functions and Open and Closed Sets . . . . . . . 29
3.7 Homeomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . 31
5 Topological Spaces 51
5.1 Topological Spaces: Definitions and Examples . . . . . . . . . 51
5.2 Hausdorff Spaces . . . . . . . . . . . . . . . . . . . . . . . . . 53
5.3 Subspace Topologies . . . . . . . . . . . . . . . . . . . . . . . 53
5.4 Continuous Functions between Topological Spaces . . . . . . . 55
5.5 Homeomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . 57
5.6 Sequences and Convergence . . . . . . . . . . . . . . . . . . . 57
5.7 Neighbourhoods, Closures and Interiors . . . . . . . . . . . . . 58
5.8 Product Topologies . . . . . . . . . . . . . . . . . . . . . . . . 60
5.9 Cut and Paste Constructions . . . . . . . . . . . . . . . . . . . 63
5.10 Identification Maps and Quotient Topologies . . . . . . . . . . 65
5.11 Connected Topological Spaces . . . . . . . . . . . . . . . . . . 68
6 Compact Spaces 73
6.1 Compact Topological Spaces . . . . . . . . . . . . . . . . . . . 73
6.2 Compact Metric Spaces . . . . . . . . . . . . . . . . . . . . . . 79
6.3 The Lebesgue Lemma and Uniform Continuity . . . . . . . . . 83
6.4 The Equivalence of Norms on a Finite-Dimensional Vector Space 84
ii
1 Basic Theorems of Real Analysis
1.1 The Least Upper Bound Principle
A widely-used basic principle of analysis, from which many important theo-
rems ultimately derive, is the Least Upper Bound Principle.
Let D be a subset of the set R of real numbers. A real number u is said
to be an upper bound of the set D of x ≤ u for all x ∈ D. The set D is said
to be bounded above if such an upper bound exists.
Example The real number 2 is the least upper bound of the sets {x ∈ R :
x ≤ 2} and {x ∈ R : x < 2}. Note that the first of these sets contains its
least upper bound, whereas the second set does not.
1
Proof Let a1 , a2 , a3 , . . . be a non-decreasing sequence of real numbers that is
bounded above. It follows from the Least Upper Bound Principle that there
exists a least upper bound l for the set {an : n ∈ N}. We claim that the
sequence converges to l.
Let ε > 0 be given. We must show that there exists some natural num-
ber N such that |an − l| < ε whenever n ≥ N . Now l − ε is not an upper
bound for the set {an : n ∈ N} (since l is the least upper bound), and there-
fore there must exist some natural number N such that aN > l − ε. But then
l − ε < an ≤ l whenever n ≥ N , since the sequence is non-decreasing and
bounded above by l. Thus |an − l| < ε whenever n ≥ N . Therefore an → l
as n → +∞, as required.
If the sequence a1 , a2 , a3 , . . . is non-increasing and bounded below then
the sequence −a1 , −a2 , −a3 , . . . is non-decreasing and bounded above, and
is therefore convergent. It follows that the sequence a1 , a2 , a3 , . . . is also
convergent.
The sets S1 , S2 , S3 , . . . are all bounded. It follows that there exist well-defined
infinite sequences u1 , u2 , u3 , . . . and l1 , l2 , l3 , . . . of real numbers, where uj =
sup Sj and lj = inf Sj for all positive integers j. Now Sj+1 is a subset of Sj for
each positive integer j, and therefore uj+1 ≤ uj and lj+1 ≥ lj for each positive
integer j. It follows that the bounded infinite sequence (uj : j ∈ N) is a non-
increasing sequence, and is therefore convergent (Theorem 1.1). Similarly
the bounded infinite sequence (lj : j ∈ N) is a non-decreasing sequence, and
is therefore convergent. We define
The quantity lim sup aj is referred to as the upper limit of the sequence
j→+∞
a1 , a2 , a3 , . . .. The quantity lim inf aj is referred to as the lower limit of the
j→+∞
sequence a1 , a2 , a3 , . . ..
2
Note that every bounded infinite sequence a1 , a2 , a3 , . . .. of real num-
bers has a well-defined upper limit lim sup aj and a well-defined lower limit
j→+∞
lim inf aj .
j→+∞
Proof For each positive integer j, let uj = sup Sj and lj = inf Sj , where
c − 12 ε ≤ lj ≤ uj ≤ c + 12 ε
as required.
3
1.4 Cauchy’s Criterion for Convergence
Definition An infinite sequence a1 , a2 , a3 , . . . of real numbers said to be
a Cauchy sequence if, given any positive real number ε, there exists some
positive integer N such that |aj − ak | < ε for all j and k satisfying j ≥ N
and k ≥ N .
This shows that any convergent sequence of real numbers is a Cauchy se-
quence.
Next let a1 , a2 , a3 , . . . be a Cauchy sequence of real numbers. We must
prove that this sequence is convergent. First we show that it is bounded. Now
there exists some natural number M such that |aj − ak | < 1 for all positive
integers j and k satisfying j > M and k > M . Let R be the maximum of
the real numbers
|a1 |, |a2 |, . . . , |aM −1 |, |aM | + 1.
It is clear that |aj | ≤ R when j < M . If j ≥ M then |aj − aM | < 1, and
therefore |aj | < |aM | + 1 ≤ R. Thus |aj | ≤ R for all positive integers j. This
proves that the Cauchy sequence is bounded.
For each positive integer j, let
4
Thus if d = lim sup aj − lim inf aj then 0 ≤ d ≤ 2ε for all positive real
j→+∞ j→+∞
numbers ε. It must therefore be the case that d = 0. Thus lim sup aj =
j→+∞
lim inf aj . It now follows from Proposition 1.2 that the Cauchy sequence
j→+∞
a1 , a2 , a3 , . . . is convergent, as required.
It follows that c is not an upper bound for the set {aj : j ≥ N }, and therefore
there exists some positive integer satisfying j ≥ N for which aj > c. We
conclude from this that there does not exist any positive integer N with the
property that aj ≤ c whenever j ≥ N . Therefore {j ∈ N : aj > c} is not a
finite set. The result follows.
uN = sup{aN , aN +1 , aN +2 , . . .} = sup{aj : j ≥ N }
5
uN < s + ε. But then aj < s + ε whenever j ≥ N . It follows that the
number of positive integers j for which aj ≥ s + ε is finite. Also it follows
from Lemma 1.4 that the number of positive integers j for which aj > s − ε
is infinite. Putting these two facts together, we see that the number of
positive integers j for which s − ε < aj < s + ε is infinite. (Indeed let
S1 = {j ∈ N : aj > s − ε} and S2 = {j ∈ N : aj ≥ s + ε}. Then S1 is an
infinite set, S2 is a finite set, and therefore S1 \ S2 is an infinite set. Moreover
s − ε < aj < s + ε for all j ∈ S1 \ S2 .)
Now given any positive integer j, and given any positive number mj
such that |amj − s| < j −1 , there exists some positive integer mj+1 such that
mj+1 > mj and |amj+1 − s| < (j + 1)−1 . It follow from this that there exists
a subsequence am1 , am2 , am3 , . . . of the infinite sequence a1 , a2 , a3 , . . ., where
m1 < m2 < m3 < · · ·, which has the property that |amj − s| < j −1 for all
positive integers j. This subsequence converges to s as required.
Proof Let
and let s = sup S. Now s ∈ [a, b], and therefore the function f is continuous
at s. Therefore there exists some real number δ satisfying δ > 0 such that
|f (x) − f (s)| < 21 for all x ∈ [a, b] satisfying |x − s| < δ. But the function f
is integer-valued. It follows that f (x) = f (s) for all x ∈ [a, b] satisfying
|x − s| < δ. Now s − δ is not an upper bound for the set S. Therefore
there exists some element x0 of S satisfying s − δ < x0 ≤ s. But then
f (s) = f (x0 ) = f (a), and therefore the function f is constant on the interval
[a, x] for all x ∈ [a, b] satisfying s ≤ x < s + δ. Thus x ∈ [a, b] ∩ [s, s + δ) ⊂ S.
In particular s ∈ S. Now S cannot contain any elements x of [a, b] satisfying
x > s. Therefore [a, b] ∩ [s, s + δ) = {s}, and therefore s = b. This shows
that b ∈ S, and thus the function f is constant on the interval [a, b], as
required.
6
Theorem 1.8 (The Intermediate Value Theorem) Let a and b be real num-
bers satisfying a < b, and let f : [a, b] → R be a continuous function defined
on the interval [a, b]. Let c be a real number which lies between f (a) and f (b)
(so that either f (a) ≤ c ≤ f (b) or else f (a) ≥ c ≥ f (b).) Then there exists
some s ∈ [a, b] for which f (s) = c.
Proof Let c be a real number which lies between f (a) and f (b), and let
gc : R \ {c} → Z be the continuous integer-valued function on R \ {c} de-
fined such that gc (x) = 0 whenever x < c and gc (x) = 1 if x > c. Suppose
that c were not in the range of the function f . Then the composition func-
tion gc ◦ f : [a, b] → R would be a continuous integer-valued function defined
throughout the interval [a, b]. This function would not be constant, since
gc (f (a)) 6= gc (f (b)). But every continuous integer-valued function on the in-
terval [a, b] is constant (Proposition 1.7). It follows that every real number c
lying between f (a) and f (b) must belong to the range of the function f , as
required.
Proof Let x1 and x2 be distinct real numbers belonging to the interval [a, b]
then either x1 < x2 , in which case f (x1 ) < f (x2 ) or x1 > x2 , in which
case f (x1 ) > f (x2 ). Thus f (x1 ) 6= f (x2 ) whenever x1 6= x2 . It follows that
the function f is injective. The Intermediate Value Theorem (Theorem 1.8)
ensures that f is surjective. It follows that the function f has a well-defined
inverse f −1 : [c, d] → [a, b]. It only remains to show that this inverse function
is continuous.
Let y be a real number satisfying c < y < d, and let x be the unique real
number such that a < x < b and f (x) = y. Let ε > 0 be given. We can then
choose x1 , x2 ∈ [a, b] such that x − ε < x1 < x < x2 < x + ε. Let y1 = f (x1 )
and y2 = f (x2 ). Then y1 < y < y2 . Choose δ > 0 such that δ < y − y1
and δ < y2 − y. If v ∈ [c, d] satisfies |v − y| < δ then y1 < v < y2 and
therefore x1 < f −1 (v) < x2 . But then |f −1 (v) − f −1 (y)| < ε. We conclude
that the function f −1 : [c, d] → [a, b] is continuous at all points in the interior
of the interval [a, b]. A similar argument shows that it is continuous at the
endpoints of this interval. Thus the function f has a continuous inverse, as
required.
7
2 Analysis in Euclidean Spaces
2.1 Euclidean Spaces
We denote by Rn the set consisting of all n-tuples (x1 , x2 , . . . , xn ) of real
numbers. The set Rn represents n-dimensional Euclidean space (with respect
to the standard Cartesian coordinate system). Let x and y be elements of
Rn , where
x = (x1 , x2 , . . . , xn ), y = (y1 , y2 , . . . , yn ),
and let λ be a real number. We define
x+y =
(x1 + y1 , x2 + y2 , . . . , xn + yn ),
x−y (x1 − y1 , x2 − y2 , . . . , xn − yn ),
=
λx =
(λx1 , λx2 , . . . , λxn ),
x·y x1 y1 + x2 y2 + · · · + xn yn ,
=
q
|x| = x21 + x22 + · · · + x2n .
The quantity x · y is the scalar product (or inner product) of x and y, and
the quantity |x| is the Euclidean norm of x. Note that |x|2 = x · x. The
Euclidean distance between two points x and y of Rn is defined to be the
Euclidean norm |y − x| of the vector y − x.
Proof We note that |λx + µy|2 ≥ 0 for all real numbers λ and µ. But
so that (|x|2 |y|2 − (x · y)2 ) |y|2 ≥ 0. Thus if y 6= 0 then |y| > 0, and hence
8
It follows easily from Schwarz’ Inequality that |x + y| ≤ |x| + |y| for all
x, y ∈ Rn . For
It follows that
|z − x| ≤ |z − y| + |y − x|
for all points x, y and z of Rn . This important inequality is known as the
Triangle Inequality. It expresses the geometric fact the the length of any
triangle in a Euclidean space is less than or equal to the sum of the lengths
of the other two sides.
Proof Let xji and pi denote the ith components of xj and p. Then |xji −pi | ≤
|xj − p| for all j. It follows directly from the definition of convergence that
if xj → p as j → +∞ then xji → pi as j → +∞.
Conversely suppose that, for each i, xji → pi as j → +∞. Let ε > 0 be
√ Then there exist natural numbers N1 , N2 , . . . , Nn such that |xji −pi | <
given.
ε/ n whenever j ≥ Ni . Let N be the maximum of N1 , N2 , . . . , Nn . If j ≥ N
then n
2
X √
|xj − p| = (xji − pi )2 < n(ε/ n)2 = ε2 ,
i=1
so that xj → p as j → +∞.
9
Proof Let x1 , x2 , x3 , . . . be a bounded sequence of points in Rn . Let us
(i)
denote by xj the ith component of the point xj , so that
(1) (2) (n)
xj = (xj , xj , . . . , xj )
for all positive integers j. Suppose that, for some integer s between 1 and
n − 1, the sequence x1 , x2 , x3 , . . . has a subsequence xp1 , xp2 , xp3 , . . . with
the property that, for each integer i satisfying 1 ≤ i ≤ s, the ith compo-
nents of the members of this subsequence constitute a convergent sequence
(i) (i) (i) (s+1)
xp1 , xp2 , xp3 , . . . of real numbers. Let aj = xpj for each positive integer j.
Then a1 , a2 , a3 , . . . is a bounded sequence of real numbers. It follows from
the Bolzano-Weierstrass Theorem (Theorem 1.6) that this sequence has a
convergent subsequence am1 , am2 , am3 , . . ., where m1 < m2 < m3 < · · ·. Let
qj = pmj for each positive integer j. Then xq1 , xq2 , xq3 , . . . is a subsequence of
the original bounded sequence x1 , x2 , x3 , . . . which has the property that, for
each integer i satisfying 1 ≤ i ≤ s + 1, the ith components of the members
(i) (i) (i)
of the subsequence constitute a convergent sequence xq1 , xq2 , xq3 , . . . of real
numbers.
Repeated applications of this result show that the bounded sequence
x1 , x2 , x3 , . . . has a subsequence xr1 , xr2 , xr3 , . . . with the property that, for
each integer i satisfying 1 ≤ i ≤ n, the ith components of the members
of the subsequence constitute a convergent sequence of real numbers. Let
z = (z1 , z2 , . . . , zn ) where, for each value of i between 1 and n, the ith com-
(i) (i) (i)
ponent zi of z is the limit of the sequence xr1 , xr2 , xr3 , . . . of ith components
of the members of the subsequence xr1 , xr2 , xr3 , . . . . Then this subsequence
converges to the point z, as required.
10
whenever j ≥ N and k ≥ N .
Thus |pj | ≤ R for all positive integers j. We conclude that the Cauchy
sequence (pj : j ∈ N ) is bounded.
Now every bounded sequence of points in a Euclidean space has a con-
vergent subsequence (Theorem 2.3). In particular, the Cauchy sequence
(pj : j ∈ N ) has a convergent subsequence (pkj : j ∈ N ), where k1 , k2 , k3 , . . .
are positive integers satisfying k1 < k2 < k3 < · · ·. Let the point q of Rn be
the limit of this subsequence. Then, given any positive number ε satisfying
ε > 0, there exists some positive integer M such that |pkm −q| < 21 ε whenever
m ≥ M . Also there exists some positive integer N such that |pj − pk | < 12 ε
whenever j ≥ N and k ≥ N . Choose m large enough to ensure that m ≥ M
and km ≥ N . If j ≥ N then
2.4 Continuity
Definition Let X and Y be a subsets of Rm and Rn respectively. A function
f : X → Y from X to Y is said to be continuous at a point p of X if and
only if the following criterion is satisfied:—
given any real number ε satisfying ε > 0 there exists some δ >
0 such that |f (x) − f (p)| < ε for all points x of X satisfying
|x − p| < δ.
11
Lemma 2.6 The functions s: R2 → R and p: R2 → R defined by s(x, y) =
x + y and p(x, y) = xy are continuous.
for all points (x, y) of R2 . Thus if the distance from (x, y) to (u, v) is less
than δ then |x − u| < δ and |y − v| < δ, and hence |p(x, y) − p(u, v)| <
δ 2 + (|u| + |v|)δ. Let ε > 0 is given. If δ > 0 is chosen to be the minimum of
1 and ε/(1 + |u| + |v|) then δ 2 + (|u| + |v|)δ < (1 + |u| + |v|)δ < ε, and thus
|p(x, y) − p(u, v)| < ε for all points (x, y) of R2 whose distance from (u, v) is
less than δ. This shows that p: R2 → R is continuous at (u, v).
Proof Let ε > 0 be given. Then there exists some η > 0 such that |g(y) −
g(f (p))| < ε for all y ∈ Y satisfying |y − f (p)| < η. But then there exists
some δ > 0 such that |f (x) − f (p)| < η for all x ∈ X satisfying |x − p| < δ.
It follows that |g(f (x)) − g(f (p))| < ε for all x ∈ X satisfying |x − p| < δ,
and thus g ◦ f is continuous at p, as required.
12
Proof Let ε > 0 be given. Then there exists some δ > 0 such that
|f (x) − f (p)| < ε for all x ∈ X satisfying |x − p| < δ, since the func-
tion f is continuous at p. Also there exists some natural number N such
that |xj − p| < δ whenever j ≥ N , since the sequence x1 , x2 , x3 , . . . con-
verges to p. Thus if j ≥ N then |f (xj ) − f (p)| < ε. Thus the sequence
f (x1 ), f (x2 ), f (x3 ), . . . converges to f (p), as required.
and
lim (aj bj ) = lim p(aj , bj ) = p lim (aj , bj ) = p(l, m) = lm.
j→+∞ j→+∞ j→+∞
Also the sequence (−bj : j ∈ N) converges to −m, and therefore lim (aj −
j→+∞
bj ) = l − m. Now the reciprocal function r: R \ {0} → R is continuous on
13
R \ {0}, where r(x) = 1/x for all non-zero real numbers x. It follows from
Lemma 2.8 that if bj 6= 0 for all positive integers j then 1/bj converges to
1/m as j → +∞. But then lim (aj /bj ) = l/m. This completes the proof of
j→+∞
Proposition 2.9.
14
Proof Note that f + g = s ◦ h and f · g = p ◦ h, where h: X → R2 ,
s: R2 → R and p: R2 → R are given by h(x) = (f (x), g(x)), s(u, v) = u + v
and p(u, v) = uv for all x ∈ X and u, v ∈ R. If the functions f and g are
continuous at p then so is the function h (Proposition 2.10). The functions
s and p are continuous on R2 . It therefore follows from Lemma 2.7 that the
composition functions s ◦ h and p ◦ h are continuous at p. Thus the functions
f + g and f · g are continuous at p. Now f − g = f + (−g), and both f and
−g are continuous. Therefore f − g is continuous.
Now suppose that g(x) 6= 0 for all x ∈ X. Note that 1/g = r ◦ g, where
r: R \ {0} → R is the reciprocal function, defined by r(t) = 1/t. Now the
reciprocal function r is continuous. It now follows on applying Lemma 2.7
that the function 1/g is continuous at p. The function f /g, being the product
of the functions f and 1/g is therefore continuous at p.
15
point p itself. It follows directly from the definition of continuity that any
function between subsets of Euclidean space is continuous at all the isolated
points of its domain.
for all x ∈ X, and let q = (q1 , q2 , . . . , qn ). Then lim f (x) = q if and only if
x→p
lim fi (x) = qi for i = 1, 2, . . . , n.
x→p
16
Proposition 2.15 Let X be a subset of some Euclidean space, let p be a
limit point of X, and let f : X → Rn and g: X → Rn be functions on X
taking values in some Euclidean space Rn . Suppose that the limits lim f (x)
x→p
and lim g(x) exist. Then
x→p
If moreover lim g(x) 6= 0 and the function g is non-zero throughout its do-
x→p
main X then
lim f (x)
f (x) x→p
lim = .
x→p g(x) lim g(x)
x→p
|f (x) − f (p)| ≤ |f (x) − fj (x)| + |fj (x) − fj (p)| + |fj (p) − f (p)|
< 13 ε + 13 ε + 13 ε = ε
17
2.9 Open Sets in Euclidean Spaces
Given a point p of Rn and a non-negative real number r, the open ball B(p, r)
of radius r about p is defined to be the subset of Rn given by
(Thus B(p, r) is the set consisting of all points of Rn that lie within a sphere
of radius r centred on the point p.)
Proof Let x be an element of B(p, r). We must show that there exists some
δ > 0 such that B(x, δ) ⊂ B(p, r). Let δ = r − |x − p|. Then δ > 0, since
|x − p| < r. Moreover if y ∈ B(x, δ) then
|y − p| ≤ |y − x| + |x − p| < δ + |x − p| = r,
by the Triangle Inequality, and hence y ∈ B(p, r). Thus B(x, δ) ⊂ B(p, r).
This shows that B(p, r) is an open set, as required.
18
Proof Let x be a point of Rn satisfying |x − p| > r, and let y be any point
of Rn satisfying |y − x| < δ, where δ = |x − p| − r. Then
|x − p| ≤ |x − y| + |y − p|,
by the Triangle Inequality, and therefore
|y − p| ≥ |x − p| − |y − x| > |x − p| − δ = r.
Thus B(x, δ) is contained in the given set. The result follows.
Proof The empty set ∅ is an open set by convention. Moreover the definition
of an open set is satisfied trivially by the whole space Rn . This proves (i).
Let A be any collection of open sets in Rn , and let U denote the union
of all the open sets belonging to A. We must show that U is itself open in
Rn . Let x ∈ U . Then x ∈ V for some set V belonging to the collection A.
It follows that there exists some δ > 0 such that B(x, δ) ⊂ V . But V ⊂ U ,
and thus B(x, δ) ⊂ U . This shows that U is open in Rn . This proves (ii).
Finally let V1 , V2 , V3 , . . . , Vk be a finite collection of subsets of Rn that
are open in Rn , and let V denote the intersection V1 ∩ V2 ∩ · · · ∩ Vk of these
sets. Let x ∈ V . Now x ∈ Vj for j = 1, 2, . . . , k, and therefore there
exist strictly positive real numbers δ1 , δ2 , . . . , δk such that B(x, δj ) ⊂ Vj for
j = 1, 2, . . . , k. Let δ be the minimum of δ1 , δ2 , . . . , δk . Then δ > 0. (This is
where we need the fact that we are dealing with a finite collection of sets.)
Now B(x, δ) ⊂ B(x, δj ) ⊂ Vj for j = 1, 2, . . . , k, and thus B(x, δ) ⊂ V . Thus
the intersection V of the sets V1 , V2 , . . . , Vk is itself open in Rn . This proves
(iii).
19
Example The set
{(x, y, z) ∈ R3 : (x − n)2 + y 2 + z 2 < 1
4
for some n ∈ Z}
1
is an open set in R3 , since it is the union of the open balls of radius 2
about
the points (n, 0, 0) for all integers n.
Example For each natural number k, let
Vk = {(x, y, z) ∈ R3 : k 2 (x2 + y 2 + z 2 ) < 1}.
Now each set Vk is an open ball of radius 1/k about the origin, and is therefore
an open set in R3 . However the intersection of the sets Vk for all natural
numbers k is the set {(0, 0, 0)}, and thus the intersection of the sets Vk for all
natural numbers k is not itself an open set in R3 . This example demonstrates
that infinite intersections of open sets need not be open.
Lemma 2.20 A sequence x1 , x2 , x3 , . . . of points in Rn converges to a point p
if and only if, given any open set U which contains p, there exists some
natural number N such that xj ∈ U for all j satisfying j ≥ N .
Proof Suppose that the sequence x1 , x2 , x3 , . . . has the property that, given
any open set U which contains p, there exists some natural number N such
that xj ∈ U whenever j ≥ N . Let ε > 0 be given. The open ball B(p, ε) of
radius ε about p is an open set by Lemma 2.17. Therefore there exists some
natural number N such that xj ∈ B(p, ε) whenever j ≥ N . Thus |xj −p| < ε
whenever j ≥ N . This shows that the sequence converges to p.
Conversely, suppose that the sequence x1 , x2 , x3 , . . . converges to p. Let
U be an open set which contains p. Then there exists some ε > 0 such that
the open ball B(p, ε) of radius ε about p is a subset of U . Thus there exists
some ε > 0 such that U contains all points x of Rn that satisfy |x − p| < ε.
But there exists some natural number N with the property that |xj − p| < ε
whenever j ≥ N , since the sequence converges to p. Therefore xj ∈ U
whenever j ≥ N , as required.
20
Definition Let X be a subset of Rn . A subset V of X is said to be open
in X if and only if, given any point p of V , there exists some δ > 0 such that
BX (p, δ) ⊂ V .
By convention, we regard the empty set ∅ as being an open subset of X.
(The criterion given above is satisfied vacuously in the case when V is the
empty set.)
S 2 = {(x, y, z) ∈ R3 : x2 + y 2 + z 2 = 1}
21
Let A be some collection of subsets of a set X. Then
[ \ \ [
X\ S= (X \ S), X\ S= (X \ S)
S∈A S∈A S∈A S∈A
22
3 Metric Spaces
3.1 Metric Spaces
Definition A metric space (X, d) consists of a set X together with a distance
function d: X × X → [0, +∞) on X satisfying the following axioms:
(i) d(x, y) ≥ 0 for all x, y ∈ X,
(ii) d(x, y) = d(y, x) for all x, y ∈ X,
(iii) d(x, z) ≤ d(x, y) + d(y, z) for all x, y, z ∈ X,
(iv) d(x, y) = 0 if and only if x = y.
The quantity d(x, y) should be thought of as measuring the distance be-
tween the points x and y. The inequality d(x, z) ≤ d(x, y)+d(y, z) is referred
to as the Triangle Inequality. The elements of a metric space are usually re-
ferred to as points of that metric space.
Note that if X is a metric space with distance function d and if A is a
subset of X then the restriction d|A × A of d to pairs of points of A defines
a distance function on A satisfying the axioms for a metric space.
The set R of real numbers becomes a metric space with distance function d
given by d(x, y) = |x − y| for all x, y ∈ R. Similarly the set C of complex
numbers becomes a metric space with distance function d given by d(z, w) =
|z − w| for all z, w ∈ C, and n-dimensional Euclidean space Rn is a metric
space with with respect to the Euclidean distance function d, given by
v
u n
uX
d(x, y) = |x − y| = t (xi − yi )2
i=1
23
3.2 Convergence of Sequences in a Metric Space
Definition Let X be a metric space with distance function d. A sequence
x1 , x2 , x3 , . . . of points in X is said to converge to a point p in X if, given
any strictly positive real number ε, there exists some natural number N such
that d(xj , p) < ε whenever j ≥ N .
We refer to p as the limit lim xj of the sequence x1 , x2 , x3 , . . . .
j→+∞
Proof Let ε > 0 be given. We must show that there exists some natural
number N such that |d(xj , y) − d(p, y)| < ε whenever j ≥ N . However N
can be chosen such that d(xj , p) < ε whenever j ≥ N . But
for all j, and hence |d(xj , y) − d(p, y)| < ε whenever j ≥ N , as required.
24
3.3 Continuity of Functions between Metric Spaces
Definition Let X and Y be metric spaces with distance functions dX and
dY respectively. A function f : X → Y from X to Y is said to be continuous
at a point p of X if and only if the following criterion is satisfied:—
• given any strictly positive real number ε, there exists some strictly
positive real number δ such that dY (f (x), f (p)) < ε for all points x
of X satisfying dX (x, p) < δ.
The function f : X → Y is said to be continuous on X if and only if it is
continuous at p for every point p of X.
25
3.4 Open Sets in Metric Spaces
Definition Let (X, d) be a metric space. Given a point p of X and r ≥ 0,
the open ball BX (p, r) of radius r about p in X is defined by
Lemma 3.4 Let X be a metric space with distance function d, and let p be
a point of X. Then, for any r > 0, the open ball BX (p, r) of radius r about
p is an open set in X.
Proof Let q ∈ BX (p, r). We must show that there exists some δ > 0 such
that BX (q, δ) ⊂ BX (p, r). Now d(q, p) < r, and hence δ > 0, where δ =
r − d(q, p). Moreover if x ∈ BX (q, δ) then
by the Triangle Inequality, hence x ∈ BX (p, r). Thus BX (q, δ) ⊂ BX (p, r),
showing that BX (p, r) is an open set, as required.
Lemma 3.5 Let X be a metric space with distance function d, and let p be
a point of X. Then, for any r ≥ 0, the set {x ∈ X : d(x, p) > r} is an open
set in X.
Proof Let q be a point of X satisfying d(q, p) > r, and let x be any point
of X satisfying d(x, q) < δ, where δ = d(q, p) − r. Then
26
(i) the empty set ∅ and the whole set X are both open sets;
(ii) the union of any collection of open sets is itself an open set;
(iii) the intersection of any finite collection of open sets is itself an open set.
Proof The empty set ∅ is an open set by convention. Moreover the definition
of an open set is satisfied trivially by the whole set X. Thus (i) is satisfied.
Let A be any collection of open sets in X, and let U denote the union of
all the open sets belonging to A. We must show that U is itself an open set.
Let x ∈ U . Then x ∈ V for some open set V belonging to the collection A.
Therefore there exists some δ > 0 such that BX (x, δ) ⊂ V . But V ⊂ U , and
thus BX (x, δ) ⊂ U . This shows that U is open. Thus (ii) is satisfied.
Finally let V1 , V2 , V3 , . . . , Vk be a finite collection of open sets in X, and let
V = V1 ∩ V2 ∩ · · · ∩ Vk . Let x ∈ V . Now x ∈ Vj for all j, and therefore there
exist strictly positive real numbers δ1 , δ2 , . . . , δk such that BX (x, δj ) ⊂ Vj
for j = 1, 2, . . . , k. Let δ be the minimum of δ1 , δ2 , . . . , δk . Then δ > 0.
(This is where we need the fact that we are dealing with a finite collection
of open sets.) Moreover BX (x, δ) ⊂ BX (x, δj ) ⊂ Vj for j = 1, 2, . . . , k, and
thus BX (x, δ) ⊂ V . This shows that the intersection V of the open sets
V1 , V2 , . . . , Vk is itself open. Thus (iii) is satisfied.
Lemma 3.7 Let X be a metric space. A sequence x1 , x2 , x3 , . . . of points
in X converges to a point p if and only if, given any open set U which
contains p, there exists some natural number N such that xj ∈ U for all
j ≥ N.
Proof Let x1 , x2 , x3 , . . . be a sequence satisfying the given criterion, and let
ε > 0 be given. The open ball BX (p, ε) of radius ε about p is an open set
(see Lemma 3.4). Therefore there exists some natural number N such that,
if j ≥ N , then xj ∈ BX (p, ε), and thus d(xj , p) < ε. Hence the sequence (xj )
converges to p.
Conversely, suppose that the sequence (xj ) converges to p. Let U be
an open set which contains p. Then there exists some ε > 0 such that
BX (p, ε) ⊂ U . But xj → p as j → +∞, and therefore there exists some
natural number N such that d(xj , p) < ε for all j ≥ N . If j ≥ N then
xj ∈ BX (p, ε) and thus xj ∈ U , as required.
Definition Let (X, d) be a metric space, and let x be a point of X. A
subset N of X is said to be a neighbourhood of x (in X) if and only if there
exists some δ > 0 such that BX (x, δ) ⊂ N , where BX (x, δ) is the open ball
of radius δ about x.
It follows directly from the relevant definitions that a subset V of a metric
space X is an open set if and only if V is a neighbourhood of v for all v ∈ V .
27
3.5 Closed Sets in a Metric Space
A subset F of a metric space X is said to be a closed set in X if and only
if its complement X \ F is open. (Recall that the complement X \ F of F
in X is, by definition, the set of all points of the metric space X that do not
belong to F .) The following result follows immediately from Lemma 3.4 and
Lemma 3.5.
Lemma 3.8 Let X be a metric space with distance function d, and let x0 ∈
X. Given any r ≥ 0, the sets
are closed. In particular, the set {x0 } consisting of the single point x0 is a
closed set in X.
(i) the empty set ∅ and the whole set X are both closed sets;
(iii) the union of any finite collection of closed sets in X is itself a closed
set.
Lemma 3.10 Let F be a closed set in a metric space X and let (xj : j ∈ N)
be a sequence of points of F . Suppose that xj → p as j → +∞. Then p also
belongs to F .
28
Proof Suppose that the limit p of the sequence were to belong to the com-
plement X \ F of the closed set F . Now X \ F is open, and thus it would
follow from Lemma 3.7 that there would exist some natural number N such
that xj ∈ X \ F for all j ≥ N , contradicting the fact that xj ∈ F for all j.
This contradiction shows that p must belong to F , as required.
Proof Let x be a point of X with the property that, given any ε > 0, there
exists some a ∈ A satisfying d(x, a) < ε. Let F be any closed subset of X
containing A. If x did not belong to F then there would exist some ε > 0
with the property that BX (x, ε) ∩ F = ∅, where BX (x, ε) denotes the open
ball of radius ε about x. But this would contradict the fact that BX (x, ε) ∩ A
is non-empty for all ε > 0. Thus the point x belongs to every closed subset F
of X that contains A, and therefore x ∈ A, by definition of the closure A of
A.
Conversely let x ∈ A, and let ε > 0 be given. Let F be the complement
X \ BX (x, ε) of BX (x, ε). Then F is a closed subset of X, and the point x
does not belong to F . If BX (x, ε) ∩ A = ∅ then A would be contained in F ,
and hence x ∈ F , which is impossible. Therefore there exists a ∈ A satisfying
d(x, a) < ε, as required.
29
if, given any ε > 0, there exists some δ > 0 such that f maps BX (p, δ) into
BY (f (p), ε) (where BX (p, δ) and BY (f (p), ε) denote the open balls of radius
δ and ε about p and f (p) respectively).
Let f : X → Y be a function from a set X to a set Y . Given any subset V
of Y , we denote by f −1 (V ) the preimage of V under the map f , defined by
f −1 (V ) = {x ∈ X : f (x) ∈ V }.
Also a subset of a metric space is closed if and only if its complement is open.
The following result therefore follows directly from Proposition 3.12.
30
Let f : X → Y be a continuous function from a metric space X to a metric
space Y . Then, for any point y of Y , the set {x ∈ X : f (x) = y} is a closed
subset of X. This follows from Corollary 3.13, together with the fact that
the set {y} consisting of the single point y is a closed subset of the metric
space Y .
Let X be a metric space, and let f : X → R be a continuous function from
X to R. Then, given any real number c, the sets
are closed subsets of X. Also, given real numbers a and b satisfying a < b,
the set
{x ∈ X : a < f (x) < b}
is an open subset of X, and the set
{x ∈ X : a ≤ f (x) ≤ b}
is a closed subset of X.
Similar results hold for continuous functions f : X → C from X to C.
Thus, for example,
3.7 Homeomorphisms
Let X and Y be metric spaces. A function h: X → Y from X to Y is said to
be a homeomorphism if it is a bijection and both h: X → Y and its inverse
h−1 : Y → X are continuous. If there exists a homeomorphism h: X → Y
from a metric space X to a metric space Y , then the metric spaces X and Y
are said to be homeomorphic.
The following result follows directly on applying Proposition 3.12 to
h: X → Y and to h−1 : Y → X.
31
Lemma 3.14 Any homeomorphism h: X → Y between metric spaces X and
Y induces a one-to-one correspondence between the open sets of X and the
open sets of Y : a subset V of Y is open in Y if and only if h−1 (V ) is open
in X.
32
4 Complete Metric Spaces, Normed Vector
Spaces and Banach Spaces
4.1 Complete Metric Spaces
Definition Let X be a metric space with distance function d. A sequence
x1 , x2 , x3 , . . . of points of X is said to be a Cauchy sequence in X if and only if,
given any ε > 0, there exists some positive integer N such that d(xj , xk ) < ε
for all j and k satisfying j ≥ N and k ≥ N .
whenever j ≥ N and k ≥ N .
33
Conversely, suppose that A is complete. Suppose that A were not closed.
Then the complement X \ A of A would not be open, and therefore there
would exist a point p of X \ A with the property that BX (p, δ) ∩ A is non-
empty for all δ > 0, where BX (p, δ) denotes the open ball in X of radius δ
centred at p. We could then find a sequence a1 , a2 , a3 , . . . of points of A
satisfying d(aj , p) < 1/j for all positive integers j. This sequence would be a
Cauchy sequence in A which did not converge to a point of A, contradicting
the completeness of A. Thus A must be closed, as required.
The following result follows directly from Theorem 2.5 and Lemma 4.1.
• the identities
(λµ)x = λ(µx), 1x = x
are satisfied for all x, y ∈ X and λ, µ ∈ F.
Elements of the field F are referred to as scalars. We consider here only real
vector spaces and complex vector spaces: these are vector spaces over the
fields of real numbers and complex numbers respectively.
34
(ii) kx + yk ≤ kxk + kyk for all x, y ∈ X,
(iii) kλxk = |λ| kxk for all x ∈ X and for all scalars λ,
Note that any normed complex vector space can also be regarded as a
normed real vector space.
Example Let k.k1 , k.k2 and k.k∞ be the real-valued functions on Cn defined
by
n
X
kzk1 = |zj |,
j=1
n
! 12
X
kzk2 = |zj |2 ,
j=1
kzk∞ = max(|z1 |, |z2 |, . . . , |zn |),
for each z ∈ Cn , where z = (z1 , z2 , . . . , zn ). Then k.k1 , k.k2 and k.k∞ are
norms on Cn . In particular, if we regard Cn as a 2n-dimensional real vector
space naturally isomorphic to R2n (via the isomorphism
(z1 , z2 , . . . , zn ) 7→ (x1 , y1 , x2 , y2 , . . . , xn , yn ),
where xj and yj are the real and imaginary parts of zj for j = 1, 2, . . . , n) then
k.k2 represents the Euclidean norm on this space. The inequality kz + wk2 ≤
kzk2 + kwk2 satisfied for all z, w ∈ Cn is therefore just the standard Triangle
Inequality for the Euclidean norm.
35
Example Let
+∞
! 12
X
kzk2 = |zj |2 ,
j=1
kzk∞ = sup{|z1 |, |z2 |, |z3 |, . . .}.
Lemma 4.3 Let X be a normed vector space over the field F, where F =
R or C. Let (xj ) and (yj ) be convergent sequences in X, and let (λj ) be
36
a convergent sequence in F. Then the sequences (xj + yj ) and (λj xj ) are
convergent in X, and
kλj xj − λxk = kλj (xj − x) + (λj − λ)xk ≤ |λj | kxj − xk + |λj − λ|kxk
≤ (|λ| + 1) kxj − xk + |λj − λ|kxk < ε.
37
We say that a normed vector space X is complete A normed vector space is
complete if and only if every Cauchy sequence in X is convergent. A complete
normed vector space is referred to as a Banach space. (The basic theory of
such spaces was extensively developed by the famous Polish mathematician
Stefan Banach and his colleagues.)
+∞
X +∞
X
xn ≤ kxn k.
n=1 n=1
sn = x 1 + x 2 + · · · + x n .
+∞
P +∞
P
Let ε > 0 be given. We can find N such that kxn k < ε, since kxn k is
n=N n=1
convergent. Let sn = x1 + x2 + · · · + xn . If j ≥ N , k ≥ N and j < k then
k
X k
X +∞
X
ksk − sj k = xn ≤ kxn k ≤ kxn k < ε.
n=j+1 n=j+1 n=N
as required.
38
4.3 Bounded Linear Transformations
Let X and Y be real or complex vector spaces. A function T : X → Y is said
to be a linear transformation if T (x + y) = T x + T y and T (λx) = λT x for
all elements x and y of X and scalars λ. A linear transformation mapping
X into itself is referred to as a linear operator on X.
Lemma 4.5 Let X and Y be normed vector spaces, and let S: X → Y and
T : X → Y be bounded linear transformations. Then S + T and λS are
bounded linear transformations for all scalars λ, and
Proof kT Sxk ≤ kT k kSxk ≤ kT k kSk kxk for all x ∈ X. The result fol-
lows.
(i) T : X → Y is continuous,
(ii) T : X → Y is continuous at 0,
39
(iii) T : X → Y is bounded.
Proof Obviously (i) implies (ii). We show that (ii) implies (iii) and (iii)
implies (i). The equivalence of the three conditions then follows immediately.
Suppose that T : X → Y is continuous at 0. Then there exists δ > 0 such
that kT xk < 1 for all x ∈ X satisfying kxk < δ. Let C be any positive real
number satisfying C > 1/δ. If x is any non-zero element of X then kλxk < δ,
where λ = 1/(Ckxk), and hence
Proof We have already shown that B(X, Y ) is a normed vector space (see
Lemma 4.5). Thus it only remains to show that B(X, Y ) is complete.
Let S1 , S2 , S3 , . . . be a Cauchy sequence in B(X, Y ). Let x ∈ X. We
claim that S1 x, S2 x, S3 x, . . . is a Cauchy sequence in Y . This result is trivial
if x = 0. If x 6= 0, and if ε > 0 is given then there exists some positive
integer N such that kSj − Sk k < ε/kxk whenever j ≥ N and k ≥ N . But
then kSj x − Sk xk ≤ kSj − Sk k kxk < ε whenever j ≥ N and k ≥ N .
This shows that S1 x, S2 x, S3 x, . . . is indeed a Cauchy sequence. It therefore
converges to some element of Y , since Y is a Banach space.
Let the function S: X → Y be defined by Sx = lim Sn x. Then
n→+∞
40
Next we show that Sn → S in B(X, Y ) as n → +∞. Let ε > 0 be given.
Then there exists some positive integer N such that kSj −Sn k < 21 ε whenever
j ≥ N and n ≥ N , since the sequence S1 , S2 , S3 , . . . is a Cauchy sequence in
B(X, Y ). But then kSj x − Sn xk ≤ 21 εkxk for all j ≥ N and n ≥ N , and thus
41
Proposition 4.10 Let T be a bounded linear operator on a Banach space X.
Suppose that kT k < 1. Then the operator I − T has a bounded inverse
(I − T )−1 (where I denotes the identity operator on X). Moreover
(I − T )−1 = I + T + T 2 + T 3 + · · · .
1 + kT k + kT k2 + kT k3 + · · ·
is convergent (since kT k < 1). It follows from Corollary 4.9 that the infinite
series
I + T + T2 + T3 + ···
converges to some bounded linear operator S on X. Now
kf k = sup |f (x)|.
x∈X
One can readily verify that k.k is a norm on the vector space C(X, Rn ). We
shall show that C(X, Rn ), with the supremum norm, is a Banach space (i.e.,
the supremum norm on C(X, Rn ) is complete). The proof of this result will
make use of the following characterization of continuity for functions whose
range is Rn .
Theorem 4.11 The normed vector space C(X, Rn ) of all bounded continu-
ous functions from some metric space X to Rn , with the supremum norm, is
a Banach space.
42
Proof Let f1 , f2 , f3 , . . . be a Cauchy sequence in C(X, Rn ). Then, for each
x ∈ X, the sequence f1 (x), f2 (x), f3 (x), . . . is a Cauchy sequence in Rn (since
|fj (x) − fk (x)| ≤ kfj − fk k for all positive integers j and k), and Rn is a com-
plete metric space. Thus, for each x ∈ X, the sequence f1 (x), f2 (x), f3 (x), . . .
converges to some point f (x) of Rn . We must show that the limit function f
defined in this way is bounded and continuous.
Let ε > 0 be given. Then there exists some positive integer N with the
property that kfj − fk k < 13 ε for all j ≥ N and k ≥ N , since f1 , f2 , f3 , . . .
is a Cauchy sequence in C(X, Rn ). But then, on taking the limit of the left
hand side of the inequality |fj (x) − fk (x)| < 13 ε as k → +∞, we deduce that
|fj (x)−f (x)| ≤ 13 ε for all x ∈ X and j ≥ N . In particular |fN (x)−f (x)| ≤ 31 ε
for all x ∈ X. It follows that |f (x)| ≤ kfN k + 13 ε for all x ∈ X, showing that
the limit function f is bounded.
Next we show that the limit function f is continuous. Let p ∈ X and ε > 0
be given. Let N be chosen large enough to ensure that |fN (x)−f (x)| ≤ 31 ε for
all x ∈ X. Now fN is continuous. It follows from the definition of continuity
for functions between metric spaces that there exists some real number δ
satisfying δ > 0 such that |fN (x) − fN (p)| < 13 ε for all elements x of X
satisfying dX (x, p) < δ, where dX denotes the distance function on X. Thus
if x ∈ X satisfies dX (x, p) < δ then
|f (x) − f (p)| ≤ |f (x) − fN (x)| + |fN (x) − fN (p)| + |fN (p) − f (p)|
< 13 ε + 13 ε + 13 ε = ε.
43
some function f : X → Rn . Let x be some point of X. Then fj (x) → f (x) as
j → +∞. But then f (x) ∈ F , since fj (x) ∈ F for all j, and F is closed in Rn .
This shows that f ∈ C(X, F ), and thus the Cauchy sequence f1 , f2 , f3 , . . .
converges in C(X, F ). We conclude that C(X, F ) is a complete metric space,
as required.
44
since T : X → X is a continuous function, and thus x is a fixed point of T .
If x0 were another fixed point of T then we would have
But this is impossible unless x0 = x, since λ < 1. Thus the fixed point x of
the contraction map T is unique.
Proof Solving the differential equation with the initial condition x(t0 ) = x0
is equivalent to finding a continuous function ϕ: I → R satisfying the integral
equation Z t
ϕ(t) = x0 + F (ϕ(s), s) ds.
t0
where I denotes the closed interval [t0 − δ, t0 + δ]. (Note that any continuous
function ϕ satisfying this integral equation is automatically differentiable,
since the indefinite integral of a continuous function is always differentiable.)
Let K = |F (x0 , t0 )| + 1. Using the continuity of the function F , together
with the fact that U is open in R2 , one can find some δ0 > 0 such that the
open disk of radius δ0 about (x0 , t0 ) is contained in U and |F (x, t)| ≤ K for
all points (x, t) in this open disk. Now choose δ > 0 such that
√
δ 1 + K 2 < δ0 and M δ < 1.
45
Let J denote the closed interval [x0 − Kδ, x0 + Kδ]. The space C(I, J)
of continuous functions from the interval I to the interval J is a complete
metric space, by Corollary 4.12. Define T : C(I, J) → C(I, J) by
Z t
T (ϕ)(t) = x0 + F (ϕ(s), s) ds.
t0
We claim that T does indeed map C(I, J) into itself and is a contraction
mapping.
Let ϕ: I → J be an element of C(I, J). Note that if |t − t0 | ≤ δ then
|T (ϕ)(t) − x0 | ≤ Kδ
46
differential equation must belong to the space C(I, J) defined in the proof of
Theorem 4.14. The uniqueness of the fixed point of the contraction mapping
T : C(I, J) → C(I, J) then shows that ψ = ϕ, where ϕ: [t0 − δ, t0 + δ] → R
is the solution to the differential equation whose existence was proved in
Theorem 4.14. This shows that the solution to the differential equation is in
fact unique on the interval [t0 − δ, t0 + δ].
Lemma 4.15 Let X be a metric space with distance function d, let (xj ) and
(yj ) be Cauchy sequences of points in X, and let dj = d(xj , yj ) for all positive
integers j. Then (dj ) is a Cauchy sequence of real numbers.
dj ≤ d(xj , xk ) + dk + d(yk , yj )
Lemma 4.16 Let X be a metric space with distance function d, and let (xj ),
(yj ) and (zj ) be Cauchy sequences of points in X. Then
Proof This follows immediately on taking limits of both sides of the Triangle
Inequality.
47
Lemma 4.17 Let X be a metric space with distance function d, and let (xj ),
(yj ) and (zj ) be Cauchy sequences of points in X. Suppose that
lim d(xj , yj ) = 0 and lim d(yj , zj ) = 0.
j→+∞ j→+∞
Similarly lim d(x0j , yj0 ) ≤ lim d(xj , yj ). It follows that lim d(xj , yj ) =
j→+∞ j→+∞ j→+∞
lim d(x0j , yj0 ), as required.
j→+∞
It follows from Lemma 4.18 that the value of d(x̃, ỹ) does not depend on the
choice of Cauchy sequences (xj ) and (yj ) representing x̃ and ỹ. We obtain
in this way a distance function on the set X̃. This distance function satisfies
the Triangle Inequality (Lemma 4.16) and the other metric space axioms.
Therefore X̃ with this distance function is a metric space. We refer to the
space X̃ as the completion of the metric space X.
We can regard the metric space X as being embedded in its completion
X̃, where a point x of X is represented in X̃ by the equivalence class of the
constant sequence x, x, x, . . ..
48
Example The completion of the space Q of rational numbers is the space R
of real numbers.
49
Remark In a paper published in 1872, Cantor gave a construction of the real
number system in which real numbers are represented as Cauchy sequences
of rational numbers. The real numbers represented by two Cauchy sequences
of rational numbers are equal if and only if the difference of the Cauchy
sequences converges to zero. Thus the construction of the completion of
a metric space, described above, generalizes Cantor’s construction of the
system of real numbers from the system of rational numbers.
50
5 Topological Spaces
The theory of topological spaces provides a setting for the notions of continu-
ity and convergence which is more general than that provided by the theory
of metric spaces. In the theory of metric spaces one can find necessary and
sufficient conditions for convergence and continuity that do not refer explic-
itly to the distance function on a metric space but instead are expressed in
terms of open sets. Thus a sequence of points in a metric space X converges
to a point p of X if and only if every open set which contains the point p
also contains all but finitely many members of the sequence. Also a function
f : X → Y between metric spaces X and Y is continuous if and only if the
preimage f −1 (V ) of every open set V in Y is an open set in X. It follows
from this that we can generalize the notions of convergence and continuity
by introducing the concept of a topological space: a topological space consists
of a set together with a collection of subsets termed open sets that satisfy
appropriate axioms. The axioms for open sets in a topological space are
satisfied by the open sets in any metric space.
(ii) the union of any collection of open sets is itself an open set,
(iii) the intersection of any finite collection of open sets is itself an open set.
The collection consisting of all the open sets in a topological space X is
referred to as a topology on the set X.
51
is an open set, and any finite intersection of open sets in a metric space is an
open set. Thus the topological space axioms are satisfied by the collection
of open sets in any metric space. We refer to this collection of open sets as
the topology generated by the distance function d on X.
Any subset X of n-dimensional Euclidean space Rn is a topological space:
a subset V of X is open in X if and only if, given any point v of V , there
exists some δ > 0 such that
{x ∈ X : |x − v| < δ} ⊂ V.
Example Given any set X, one can define a topology on X where every
subset of X is an open set. This topology is referred to as the discrete
topology on X.
Example Given any set X, one can define a topology on X in which the
only open sets are the empty set ∅ and the whole set X.
(i) the empty set ∅ and the whole set X are closed sets,
(ii) the intersection of any collection of closed sets is itself a closed set,
(iii) the union of any finite collection of closed sets is itself a closed set.
52
5.2 Hausdorff Spaces
Definition A topological space X is said to be a Hausdorff space if and only
if it satisfies the following Hausdorff Axiom:
• if x and y are distinct points of X then there exist open sets U and V
such that x ∈ U , y ∈ V and U ∩ V = ∅.
Proof Let X be a metric space with distance function d, and let x and y be
points of X, where x 6= y. Let ε = 21 d(x, y). Then the open balls BX (x, ε)
and BX (y, ε) of radius ε centred on the points x and y are open sets. If
BX (x, ε) ∩ BX (y, ε) were non-empty then there would exist z ∈ X satisfying
d(x, z) < ε and d(z, y) < ε. But this is impossible, since it would then follow
from the Triangle Inequality that d(x, y) < 2ε, contrary to the choice of ε.
Thus x ∈ BX (x, ε), y ∈ BX (y, ε), BX (x, ε) ∩ BX (y, ε) = ∅. This shows that
the metric space X is a Hausdorff space.
53
Lemma 5.3 Let X be a metric space with distance function d, and let A be
a subset of X. A subset W of A is open with respect to the subspace topology
on A if and only if, given any point w of W , there exists some δ > 0 such
that
{a ∈ A : d(a, w) < δ} ⊂ W.
Thus the subspace topology on A coincides with the topology on A obtained
on regarding A as a metric space (with respect to the distance function d).
{x ∈ X : d(x, w) < δ} ⊂ U.
But then
{a ∈ A : d(a, w) < δ} ⊂ U ∩ A = W.
Conversely, suppose that W is a subset of A with the property that, for
any w ∈ W , there exists some δw > 0 such that
{a ∈ A : d(a, w) < δw } ⊂ W.
Define U to be the union of the open balls BX (w, δw ) as w ranges over all
points of W , where
The set U is an open set in X, since each open ball BX (w, δw ) is an open set
in X, and any union of open sets is itself an open set. Moreover
54
• a subset B of A is closed in A (relative to the subspace topology on A)
if and only if B = A ∩ F for some closed subset F of X;
55
Lemma 5.6 Let X and Y be topological spaces, let f : X → Y be a function
from X to Y , and let X = A1 ∪ A2 ∪ · · · ∪ Ak , where A1 , A2 , . . . , Ak are closed
sets in X. Suppose that the restriction of f to the closed set Ai is continuous
for i = 1, 2, . . . , k. Then f : X → Y is continuous.
56
of two continuous functions. Similarly γ|[ 12 , 1] is continuous. The subinter-
vals [0, 12 ] and [ 12 , 1] are closed in [0, 1], and [0, 1] is the union of these two
subintervals. It follows from Lemma 5.6 that γ: [0, 1] → Y is continuous.
5.5 Homeomorphisms
Definition Let X and Y be topological spaces. A function h: X → Y is said
to be a homeomorphism if and only if the following conditions are satisfied:
57
Proof Suppose that p and q were limits of the sequence (xj ), where p 6= q.
Then there would exist open sets U and V such that p ∈ U , q ∈ V and
U ∩ V = ∅, since X is a Hausdorff space. But then there would exist natural
numbers N1 and N2 such that xj ∈ U for all j satisfying j ≥ N1 and xj ∈ V
for all j satisfying j ≥ N2 . But then xj ∈ U ∩ V for all j satisfying j ≥ N1
and j ≥ N2 , which is impossible, since U ∩ V = ∅. This contradiction shows
that the sequence (xj ) has at most one limit.
Proof Let V be an open set in Y which contains the point f (p). Then
f −1 (V ) is an open set in X which contains the point p. It follows that
there exists some natural number N such that xj ∈ f −1 (V ) whenever j ≥
N . But then f (xj ) ∈ V whenever j ≥ N . We deduce that the sequence
f (x1 ), f (x2 ), f (x3 ), . . . converges to f (p), as required.
58
Proof It follows directly from the definition of neighbourhoods that an open
set V is a neighbourhood of any point belonging to V . Conversely, suppose
that V is a subset of X which is a neighbourhood of each v ∈ V . Then, given
any point v of V , there exists an open set Uv such that v ∈ Uv and Uv ⊂ V .
Thus V is an open set, since it is the union of the open sets Uv as v ranges
over all points of V .
Proof If F is any closed set containing A then xj ∈ F for all j, and therefore
p ∈ F , by Lemma 5.8. Therefore p ∈ A by definition of A.
59
5.8 Product Topologies
The Cartesian product X1 × X2 × · · · × Xn of sets X1 , X2 , . . . , Xn is defined
to be the set of all ordered n-tuples (x1 , x2 , . . . , xn ), where xi ∈ Xi for i =
1, 2, . . . , n.
The sets R2 and R3 are the Cartesian products R × R and R × R × R
respectively.
Cartesian products of sets are employed as the domains of functions of
several variables. For example, if X, Y and Z are sets, and if an element
f (x, y) of Z is determined for each choice of an element x of X and an
element y of Y , then we have a function f : X × Y → Z whose domain is the
Cartesian product X × Y of X and Y : this function sends the ordered pair
(x, y) to f (x, y) for all x ∈ X and y ∈ Y .
{p} ⊂ V1 × V2 × · · · × Vn ⊂ D ⊂ E.
Thus E is open in X.
Let U = U1 ∩U2 ∩· · ·∩Um , where U1 , U2 , . . . , Um are open sets in X, and let
p be a point of U . Then there exist open sets Vki in Xi for k = 1, 2, . . . , m and
i = 1, 2, . . . , n such that {p} ⊂ Vk1 × Vk2 × · · · × Vkn ⊂ Uk for k = 1, 2, . . . , m.
Let Vi = V1i ∩ V2i ∩ · · · ∩ Vmi for i = 1, 2, . . . , n. Then
60
Lemma 5.13 Let X1 , X2 , . . . , Xn and Z be topological spaces. Then a func-
tion f : X1 × X2 × · · · × Xn → Z is continuous if and only if, given any point
p of X1 × X2 × · · · × Xn , and given any open set U in Z containing f (p),
there exist open sets Vi in Xi for i = 1, 2, . . . , n such that p ∈ V1 × V2 · · · × Vn
and f (V1 × V2 × · · · × Vn ) ⊂ U .
Proof Let Vi be an open set in Xi for i = 1, 2, . . . , n, and let U be an open set
in Z. Then V1 ×V2 ×· · ·×Vn ⊂ f −1 (U ) if and only if f (V1 ×V2 ×· · ·×Vn ) ⊂ U .
It follows that f −1 (U ) is open in the product topology on X1 ×X2 ×· · ·×Xn if
and only if, given any point p of X1 ×X2 ×· · ·×Xn satisfying f (p) ∈ U , there
exist open sets Vi in Xi for i = 1, 2, . . . , n such that f (V1 ×V2 ×· · ·×Vn ) ⊂ U .
The required result now follows from the definition of continuity.
Let X1 , X2 , . . . , Xn be topological spaces, and let Vi be an open set in
Xi for i = 1, 2, . . . , n. It follows directly from the definition of the product
topology that V1 × V2 × · · · × Vn is open in X1 × X2 × · · · × Xn .
Theorem 5.14 Let X = X1 × X2 × · · · × Xn , where X1 , X2 , . . . , Xn are
topological spaces and X is given the product topology, and for each i, let
pi : X → Xi denote the projection function which sends (x1 , x2 , . . . , xn ) ∈ X
to xi . Then the functions p1 , p2 , . . . , pn are continuous. Moreover a function
f : Z → X mapping a topological space Z into X is continuous if and only if
pi ◦ f : Z → Xi is continuous for i = 1, 2, . . . , n.
Proof Let V be an open set in Xi . Then
p−1
i (V ) = X1 × · · · × Xi−1 × V × Xi+1 × · · · × Xn ,
61
so that Nz ⊂ f −1 (U ). It follows that f −1 (U ) is the union of the open sets Nz
as z ranges over all points of f −1 (U ). Therefore f −1 (U ) is open in Z. This
shows that f : Z → X is continuous, as required.
Proof We must show that a subset U of Rn is open with respect to the usual
topology if and only if it is open with respect to the product topology.
Let U be a subset of Rn that is open with respect to the usual topology,
and let u ∈ U . Then there exists some δ > 0 such that B(u, δ) ⊂ U , where
{u} ⊂ I1 × I2 × · · · × In ⊂ B(u, δ) ⊂ U,
since n 2
2
X
2 δ
|x − u| = (xi − ui ) < n √ = δ2
i=1
n
for all x ∈ I1 × I2 × · · · × In . This shows that any subset U of Rn that is
open with respect to the usual topology on Rn is also open with respect to
the product topology on Rn .
Conversely suppose that U is a subset of Rn that is open with respect
to the product topology on Rn , and let u ∈ U . Then there exist open
sets V1 , V2 , . . . , Vn in R containing u1 , u2 , . . . , un respectively such that V1 ×
V2 × · · · × Vn ⊂ U . Now we can find δ1 , δ2 , . . . , δn such that δi > 0 and
(ui − δi , ui + δi ) ⊂ Vi for all i. Let δ > 0 be the minimum of δ1 , δ2 , . . . , . . . , δn .
Then
B(u, δ) ⊂ V1 × V2 × · · · Vn ⊂ U,
for if x ∈ B(u, δ) then |xi − ui | < δi for i = 1, 2, . . . , n. This shows that any
subset U of Rn that is open with respect to the product topology on Rn is
also open with respect to the usual topology on Rn .
62
Corollary 5.16 Let X be a topological space and let f : X → Rn be a function
from X to Rn . Let us write
63
(u, v) of the square are related if and only if at least one of the following
conditions is satisfied:
• s = u and t = v;
• s = 0, u = 1 and t = v;
• s = 1, u = 0 and t = v;
• t = 0, v = 1 and s = u;
• t = 1, v = 0 and s = u;
• (s, t) and (u, v) both belong to {(0, 0), (0, 1), (1, 0), (1, 1)}.
Note that if 0 < s < 1 and 0 < t < 1 then the equivalence class of the
point (s, t) is the set {(s, t)} consisting of that point. If s = 0 or 1 and
if 0 < t < 1 then the equivalence class of (s, t) is the set {(0, t), (1, t)}.
Similarly if t = 0 or 1 and if 0 < s < 1 then the equivalence class of (s, t) is
the set {(s, 0), (s, 1)}. The equivalence class of each corner of the square is
the set {(0, 0), (1, 0), (0, 1), (1, 1)} consisting of all four corners. Thus each
equivalence class contains either one point in the interior of the square, or
two points on opposite edges of the square, or four points at the four corners
of the square. Let T 2 denote the set of these equivalence classes. We have
a map q: [0, 1] × [0, 1] → T 2 which sends each point (s, t) of the square to
its equivalence class. Each element of the set T 2 is the image of one, two
or four points of the square. The elements of T 2 represent points on the
torus obtained from the square by first joining together two opposite sides of
the square to form a cylinder and then joining together the boundary circles
of this cylinder as described above. We say that the torus T 2 is obtained
from the square [0, 1] × [0, 1] by identifying the points (0, t) and (1, t) for all
t ∈ [0, 1] and identifying the points (s, 0) and (s, 1) for all s ∈ [0, 1].
The topology on the square [0, 1]×[0, 1] induces a corresponding topology
on the set T 2 , where a subset U of T 2 is open in T 2 if and only if q −1 (U )
is open in the square [0, 1] × [0, 1]. (The fact that these open sets in T 2
constitute a topology on the set T 2 is a consequence of Lemma 5.18.) The
function q: [0, 1] × [0, 1] → T 2 is then a continuous surjection. We say that
the topological space T 2 is the identification space obtained from the square
[0, 1] × [0, 1] by identifying points on the sides to the square as described
above. The continuous map q from the square to the torus is an example of
an identification map, and the topology on the torus T 2 is referred to as the
quotient topology on T 2 induced by the identification map q: [0, 1] × [0, 1] →
T 2.
64
Another well-known identification space obtained from the square is the
Klein bottle (Kleinsche Flasche). The Klein bottle K 2 is obtained from the
square [0, 1] × [0, 1] by identifying (0, t) with (1, 1 − t) for all t ∈ [0, 1] and
identifying (s, 0) with (s, 1) for all s ∈ [0, 1]. These identifications correspond
to an equivalence relation on the square, where points (s, t) and (u, v) of the
square are equivalent if and only if one of the following conditions is satisfied:
• s = u and t = v;
• s = 0, u = 1 and t = 1 − v;
• s = 1, u = 0 and t = 1 − v;
• t = 0, v = 1 and s = u;
• t = 1, v = 0 and s = u;
• (s, t) and (u, v) both belong to {(0, 0), (0, 1), (1, 0), (1, 1)}.
65
It follows directly from the definition that any identification map is con-
tinuous. Moreover, in order to show that a continuous surjection q: X → Y
is an identification map, it suffices to prove that if V is a subset of Y with
the property that q −1 (V ) is open in X then V is open in Y .
66
Y ∈ τ . If {Vα : α ∈ A} is any collection of subsets of Y indexed by a set A,
then it is a straightforward exercise to verify that
[ [ \ \
q −1 (Vα ) = q −1 Vα , q −1 (Vα ) = q −1 Vα
α∈A α∈A α∈A α∈A
(i.e., given any collection of subsets of Y , the union of the preimages of the
sets is the preimage of the union of those sets, and the intersection of the
preimages of the sets is the preimage of the intersection of those sets). It
follows easily from this that unions and finite intersections of sets belonging
to τ must themselves belong to τ . Thus τ is a topology on Y , and the
function q: X → Y is an identification map with respect to the topology τ .
Clearly τ is the unique topology on Y for which the function q: X → Y is an
identification map.
67
Example The Klein bottle K 2 is the identification space obtained from the
square [0, 1] × [0, 1] by identifying (0, t) with (1, 1 − t) for all t ∈ [0, 1] and
identifying (s, 0) with (s, 1) for all s ∈ [0, 1]. Let q: [0, 1] × [0, 1] → K 2
be the identification map determined by these identifications. Let Z be a
topological space. A function g: [0, 1] × [0, 1] → Z mapping the square into Z
which satisfies g(0, t) = g(1, 1 − t) for all t ∈ [0, 1] and g(s, 0) = g(s, 1) for all
s ∈ [0, 1], determines a corresponding function f : K 2 → Z, where g = f ◦ q.
It follows from Lemma 5.19 that the function f : K 2 → Z is continuous if and
only if g: [0, 1] × [0, 1] → Z is continuous.
68
Let Z be the set of integers with the usual topology (i.e., the subspace
topology on Z induced by the usual topology on R). Then {n} is open for
all n ∈ Z, since
{n} = Z ∩ {t ∈ R : |t − n| < 12 }.
It follows that every subset of Z is open (since it is a union of sets consisting
of a single element, and any union of open sets is open). It follows that
a function f : X → Z on a topological space X is continuous if and only if
f −1 (V ) is open in X for any subset V of Z. We use this fact in the proof of
the next theorem.
Proposition 5.21 A topological space X is connected if and only if every
continuous function f : X → Z from X to the set Z of integers is constant.
Proof Suppose that X is connected. Let f : X → Z be a continuous function.
Choose n ∈ f (X), and let
U = {x ∈ X : f (x) = n}, V = {x ∈ X : f (x) 6= n}.
Then U and V are the preimages of the open subsets {n} and Z \ {n} of
Z, and therefore both U and V are open in X. Moreover U ∩ V = ∅, and
X = U ∪ V . It follows that V = X \ U , and thus U is both open and closed.
Moreover U is non-empty, since n ∈ f (X). It follows from the connectedness
of X that U = X, so that f : X → Z is constant, with value n.
Conversely suppose that every continuous function f : X → Z is constant.
Let S be a subset of X which is both open and closed. Let f : X → Z be
defined by
1 if x ∈ S;
f (x) =
0 if x 6∈ S.
Now the preimage of any subset of Z under f is one of the open sets ∅,
S, X \ S and X. Therefore the function f is continuous. But then the
function f is constant, so that either S = ∅ or S = X. This shows that X is
connected.
Lemma 5.22 The closed interval [a, b] is connected, for all real numbers a
and b satisfying a ≤ b.
Proof Let f : [a, b] → Z be a continuous integer-valued function on [a, b]. We
show that f is constant on [a, b]. Indeed suppose that f were not constant.
Then f (τ ) 6= f (a) for some τ ∈ [a, b]. But the Intermediate Value Theorem
would then ensure that, given any real number c between f (a) and f (τ ), there
would exist some t ∈ [a, τ ] for which f (t) = c, and this is clearly impossible,
since f is integer-valued. Thus f must be constant on [a, b]. We now deduce
from Proposition 5.21 that [a, b] is connected.
69
Example Let X = {(x, y) ∈ R2 : x 6= 0}. The topological space X is not
connected. Indeed if f : X → Z is defined by
1 if x > 0,
f (x, y) =
−1 if x < 0,
• if U and V are open sets in X such that A∩U and A∩V are non-empty
and A ⊂ U ∪ V then A ∩ U ∩ V is also non-empty.
Proof It follows from the definition of the closure of A that A ⊂ F for any
closed subset F of X for which A ⊂ F . On taking F to be the complement
of some open set U , we deduce that A ∩ U = ∅ for any open set U for which
70
A ∩ U = ∅. Thus if U is an open set in X and if A ∩ U is non-empty then
A ∩ U must also be non-empty.
Now let U and V be open sets in X such that A ∩ U and A ∩ V are
non-empty and A ⊂ U ∪ V . Then A ∩ U and A ∩ V are non-empty, and
A ⊂ U ∪ V . But A is connected. Therefore A ∩ U ∩ V is non-empty, and
thus A ∩ U ∩ V is non-empty. This shows that A is connected.
(i) Sx is connected,
(ii) Sx is closed,
71
Proof Let f : Sx → Z be a continuous integer-valued function on Sx , for
some x ∈ X. Let y be any point of Sx . Then, by definition of Sx , there exists
some connected set A containing both x and y. But then f is constant on A,
and thus f (x) = f (y). This shows that the function f is constant on Sx .
We deduce that Sx is connected. This proves (i). Moreover the closure Sx is
connected, by Lemma 5.24. Therefore Sx ⊂ Sx . This shows that Sx is closed,
proving (ii).
Finally, suppose that x and y are points of X for which Sx ∩ Sy 6= ∅. Let
f : Sx ∪ Sy → Z be any continuous integer-valued function on Sx ∪ Sy . Then
f is constant on both Sx and Sy . Moreover the value of f on Sx must agree
with that on Sy , since Sx ∩ Sy is non-empty. We deduce that f is constant
on Sx ∪ Sy . Thus Sx ∪ Sy is a connected set containing both x and y, and
thus Sx ∪ Sy ⊂ Sx and Sx ∪ Sy ⊂ Sy , by definition of Sx and Sy . We conclude
that Sx = Sy . This proves (iii).
72
6 Compact Spaces
6.1 Compact Topological Spaces
Let X be a topological space, and let A be a subset of X. A collection of
subsets of X in X is said to cover A if and only if every point of A belongs to
at least one of these subsets. In particular, an open cover of X is collection
of open sets in X that covers X.
If U and V are open covers of some topological space X then V is said to
be a subcover of U if and only if every open set belonging to V also belongs
to U.
We now show that any closed bounded interval in the real line is compact.
This result is known as the Heine-Borel Theorem. The proof of this theorem
uses the least upper bound principle which states that, given any non-empty
set S of real numbers which is bounded above, there exists a least upper
bound (or supremum) sup S for the set S.
Proof Let U be a collection of open sets in R with the property that each
point of the interval [a, b] belongs to at least one of these open sets. We must
show that [a, b] is covered by finitely many of these open sets.
Let S be the set of all τ ∈ [a, b] with the property that [a, τ ] is covered
by some finite collection of open sets belonging to U, and let s = sup S. Now
s ∈ W for some open set W belonging to U. Moreover W is open in R, and
therefore there exists some δ > 0 such that (s − δ, s + δ) ⊂ W . Moreover
s − δ is not an upper bound for the set S, hence there exists some τ ∈ S
satisfying τ > s − δ. It follows from the definition of S that [a, τ ] is covered
by some finite collection V1 , V2 , . . . , Vr of open sets belonging to U.
73
Let t ∈ [a, b] satisfy τ ≤ t < s + δ. Then
[a, t] ⊂ [a, τ ] ∪ (s − δ, s + δ) ⊂ V1 ∪ V2 ∪ · · · ∪ Vr ∪ W,
Proof The range f (X) of the function f is covered by some finite collection
I1 , I2 , . . . , Ik of open intervals of the form (−m, m), where m ∈ N, since f (X)
is compact (Lemma 6.4) and R is covered by the collection of all intervals of
this form. It follows that f (X) ⊂ (−M, M ), where (−M, M ) is the largest of
the intervals I1 , I2 , . . . , Ik . Thus the function f is bounded above and below
on X, as required.
74
Proof Let m = inf{f (x) : x ∈ X} and M = sup{f (x) : x ∈ X}. There
must exist v ∈ X satisfying f (v) = M , for if f (x) < M for all x ∈ X then
the function x 7→ 1/(M − f (x)) would be a continuous real-valued function
on X that was not bounded above, contradicting Lemma 6.5. Similarly
there must exist u ∈ X satisfying f (u) = m, since otherwise the function
x 7→ 1/(f (x)−m) would be a continuous function on X that was not bounded
above, again contradicting Lemma 6.5. But then f (u) ≤ f (x) ≤ f (v) for all
x ∈ X, as required.
Proof Let p be a point of X that does not belong to A, and let f (x) =
d(x, p), where d is the distance function on X. It follows from Proposition 6.6
that there is a point q of A such that f (a) ≥ f (q) for all a ∈ A, since A is
compact. Now f (q) > 0, since q 6= p. Let δ satisfy 0 < δ ≤ f (q). Then the
open ball of radius δ about the point p is contained in the complement of
A, since f (x) < f (q) for all points x of this open ball. It follows that the
complement of A is an open set in X, and thus A itself is closed in X.
Proof For each point y ∈ K there exist open sets Vx,y and Wx,y such that
x ∈ Vx,y , y ∈ Wx,y and Vx,y ∩ Wx,y = ∅ (since X is a Hausdorff space). But
then there exists a finite set {y1 , y2 , . . . , yr } of points of K such that K is
contained in Wx,y1 ∪ Wx,y2 ∪ · · · ∪ Wx,yr , since K is compact. Define
75
Proposition 6.10 Let X be a Hausdorff topological space, and let K1 and
K2 be compact subsets of X, where K1 ∩ K2 = ∅. Then there exist open sets
U1 and U2 such that K1 ⊂ U1 , K2 ⊂ U2 and U1 ∩ U2 = ∅.
Proof It follows from Proposition 6.8 that, for each point x of K1 , there
exist open sets Vx and Wx such that x ∈ Vx , K2 ⊂ Wx and Vx ∩ Wx = ∅. But
then there exists a finite set {x1 , x2 , . . . , xr } of points of K1 such that
76
Proposition 6.13 A continuous surjection f : X → Y from a compact topo-
logical space X to a Hausdorff space Y is an identification map.
so that Nx ⊂ V . It follows that V is the union of the open sets Nx for all
x ∈ V . Thus V is itself an open set in X, as required.
77
Proof It suffices to prove that the product of two compact topological spaces
X and Y is compact, since the general result then follows easily by induction
on the number of compact spaces in the product.
Let U be an open cover of X × Y . We must show that this open cover
possesses a finite subcover.
Let x be a point of X. The set {x}×Y is a compact subset of X ×Y , since
it is the image of the compact space Y under the continuous map from Y to
X ×Y which sends y ∈ Y to (x, y), and the image of any compact set under a
continuous map is itself compact (Lemma 6.4). Therefore there exists a finite
collection U1 , U2 , . . . , Ur of open sets belonging to the open cover U such that
{x} × Y is contained in U1 ∪ U2 ∪ · · · ∪ Ur . Let Vx denote the set of all points
x0 of X for which {x0 } × Y is contained in U1 ∪ U2 ∪ · · · ∪ Ur . Then x ∈ Vx ,
and Lemma 6.14 ensures that Vx is an open set in X. Note that Vx × Y is
covered by finitely many of the open sets belonging to the open cover U.
Now {Vx : x ∈ X} is an open cover of the space X. It follows from the
compactness of X that there exists a finite set {x1 , x2 , . . . , xr } of points of X
such that X = Vx1 ∪ Vx2 ∪ · · · ∪ Vxr . Now X × Y is the union of the sets
Vxj × Y for j = 1, 2, . . . , r, and each of these sets can be covered by a finite
collection of open sets belonging to the open cover U. On combining these
finite collections, we obtain a finite collection of open sets belonging to U
which covers X × Y . This shows that X × Y is compact.
78
6.2 Compact Metric Spaces
We recall that a metric or topological space is said to be compact if every
open cover of the space has a finite subcover. We shall obtain some equivalent
characterizations of compactness for metric spaces (Theorem 6.22); these
characterizations do not generalize to arbitrary topological spaces.
Proposition 6.17 Every sequence of points in a compact metric space has
a convergent subsequence.
Proof Let X be a compact metric space, and let x1 , x2 , x3 , . . . be a sequence
of points of X. We must show that this sequence has a convergent subse-
quence. Let Fn denote the closure of {xn , xn+1 , xn+2 , . . .}. We claim that
the intersection of the sets F1 , F2 , F3 , . . . is non-empty. For suppose that
this intersection were the empty set. Then X would be the union of the
sets V1 , V2 , V3 , . . ., where Vn = X \ Fn for all n. But V1 ⊂ V2 ⊂ V3 ⊂ · · ·,
and each set Vn is open. It would therefore follow from the compactness of
X that X would be covered by finitely many of the sets V1 , V2 , V3 , . . ., and
therefore X = Vn for some sufficiently large n. But this is impossible, since
Fn is non-empty for all natural numbers n. Thus the intersection of the sets
F1 , F2 , F3 , . . . is non-empty, as claimed, and therefore there exists a point p
of X which belongs to Fn for all natural numbers n.
We now obtain, by induction on n, a subsequence xn1 , xn2 , xn3 , . . . which
satisfies d(xnj , p) < 1/j for all natural numbers j. Now p belongs to the
closure F1 of the set {x1 , x2 , x3 , . . .}. Therefore there exists some natural
number n1 such that d(xn1 , p) < 1. Suppose that xnj has been chosen so
that d(xnj , p) < 1/j. The point p belongs to the closure Fnj +1 of the set
{xn : n > nj }. Therefore there exists some natural number nj+1 such that
nj+1 > nj and d(xnj+1 , p) < 1/(j + 1). The subsequence xn1 , xn2 , xn3 , . . .
constructed in this manner converges to the point p, as required.
We shall also prove the converse of Proposition 6.17: if X is a metric
space, and if every sequence of points of X has a convergent subsequence,
then X is compact (see Theorem 6.22 below).
Let X be a metric space with distance function d. A Cauchy sequence in
X is a sequence x1 , x2 , x3 , . . . of points of X with the property that, given
any ε > 0, there exists some natural number N such that d(xj , xk ) < ε for
all j and k satisfying j ≥ N and k ≥ N .
A metric space (X, d) is said to be complete if every Cauchy sequence
in X converges to some point of X.
Proposition 6.18 Let X be a metric space with the property that every
sequence of points of X has a convergent subsequence. Then X is complete.
79
Proof Let x1 , x2 , x3 , . . . be a Cauchy sequence in X. This sequence then has
a subsequence xn1 , xn2 , xn3 , . . . which converges to some point p of X. We
claim that the given Cauchy sequence also converges to p.
Let ε > 0 be given. Then there exists some natural number N such that
d(xm , xn ) < 12 ε whenever m ≥ N and n ≥ N , since x1 , x2 , x3 , . . . is a Cauchy
sequence. Moreover nj can be chosen large enough to ensure that nj ≥ N
and d(xnj , p) < 12 ε. If n ≥ N then
d(xn , p) ≤ d(xn , xnj ) + d(xnj , p) < 21 ε + 12 ε = ε.
This shows that the Cauchy sequence x1 , x2 , x3 , . . . converges to the point p.
Thus X is complete, as required.
80
A subset A of a totally bounded metric space X is itself totally bounded.
For if X is the union of the subsets B1 , B2 , . . . , Bk , where diam Bn < ε
for n = 1, 2, . . . , k, then A is the union of A ∩ Bn for n = 1, 2, . . . , k, and
diam A ∩ Bn < ε.
Proof Suppose that X were not totally bounded. Then there would exist
some ε > 0 with the property that no finite collection of subsets of X of
diameter less than 3ε covers the set X. There would then exist an infinite
sequence x1 , x2 , x3 , . . . of points of X with the property that d(xm , xn ) ≥ ε
whenever m 6= n. Indeed suppose that points x1 , x2 , . . . , xk−1 of X have
already been chosen satisfying d(xm , xn ) ≥ ε whenever m < k, n < k and
m 6= n. The diameter of each open ball BX (xm , ε) is less than or equal to
2ε. Therefore X could not be covered by the sets BX (xm , ε) for m < k, and
thus there would exist a point xk of X which does not belong to B(xm , ε)
for any m < k. Then d(xm , xk ) ≥ ε for all m < k. In this way we can
successively choose points x1 , x2 , x3 , . . . to form an infinite sequence with
the required property. However such an infinite sequence would have no
convergent subsequence, which is impossible. This shows that X must be
totally bounded, as required.
Proof Let X be some totally bounded metric space. Suppose that there
exists an open cover V of X which has no finite subcover. We shall prove the
existence of a Cauchy sequence x1 , x2 , x3 , . . . in X which cannot converge to
any point of X. (Thus if X is not compact, then X cannot be complete.)
Let ε > 0 be given. Then X can be covered by finitely many closed sets
whose diameter is less than ε, since X is totally bounded and every subset of
X has the same diameter as its closure (Lemma 6.19). At least one of these
closed sets cannot be covered by a finite collection of open sets belonging to
V (since if every one of these closed sets could be covered by a such a finite
collection of open sets, then we could combine these collections to obtain
a finite subcover of V). We conclude that, given any ε > 0, there exists a
closed subset of X of diameter less than ε which cannot be covered by any
finite collection of open sets belonging to V.
We claim that there exists a sequence F1 , F2 , F3 , . . . of closed sets in X
satisfying F1 ⊃ F2 ⊃ F3 ⊃ · · · such that each closed set Fn has the following
properties: diam Fn < 1/2n , and no finite collection of open sets belonging
81
to V covers Fn . For if Fn is a closed set with these properties then Fn is itself
totally bounded, and thus the above remarks (applied with Fn in place of
X) guarantee the existence of a closed subset Fn+1 of Fn with the required
properties. Thus the existence of the required sequence of closed sets follows
by induction on n.
Choose xn ∈ Fn for each natural number n. Then d(xm , xn ) < 1/2n for
any m > n, since xm and xn belong to Fn and diam Fn < 1/2n . Therefore
the sequence x1 , x2 , x3 , . . . is a Cauchy sequence. Suppose that this Cauchy
sequence were to converge to some point p of X. Then p ∈ Fn for each
natural number n, since Fn is closed and xm ∈ Fn for all m ≥ n. (If a
sequence of points belonging to a closed subset of a metric or topological
space is convergent then the limit of that sequence belongs to the closed set.)
Moreover p ∈ V for some open set V belonging to V, since V is an open
cover of X. But then there would exist δ > 0 such that BX (p, δ) ⊂ V , where
BX (p, δ) denotes the open ball of radius δ in X centred on p. Thus if n were
large enough to ensure that 1/2n < δ, then p ∈ Fn and diam Fn < δ, and
hence Fn ⊂ BX (p, δ) ⊂ V , contradicting the fact that no finite collection of
open sets belonging to V covers the set Fn . This contradiction shows that
the Cauchy sequence x1 , x2 , x3 , . . . is not convergent.
We have thus shown that if X is a totally bounded metric space which is
not compact then X is not complete. Thus every complete totally bounded
metric space must be compact, as required.
Theorem 6.22 Let X be a metric space with distance function d. The fol-
lowing are equivalent:—
(i) X is compact,
Proof Propositions 6.17, 6.18 6.20 and 6.21 show that (i) implies (ii), (ii)
implies (iii), and (iii) implies (i). It follows that (i), (ii) and (iii) are all
equivalent to one another.
82
6.3 The Lebesgue Lemma and Uniform Continuity
Lemma 6.23 (Lebesgue Lemma) Let (X, d) be a compact metric space. Let
U be an open cover of X. Then there exists a positive real number δ such that
every subset of X whose diameter is less than δ is contained wholly within
one of the open sets belonging to the open cover U.
Proof Every point of X is contained in at least one of the open sets belonging
to the open cover U. It follows from this that, for each point x of X, there
exists some δx > 0 such that the open ball B(x, 2δx ) of radius 2δx about
the point x is contained wholly within one of the open sets belonging to the
open cover U. But then the collection consisting of the open balls B(x, δx )
of radius δx about the points x of X forms an open cover of the compact
space X. Therefore there exists a finite set x1 , x2 , . . . , xr of points of X such
that
B(x1 , δ1 ) ∪ B(x2 , δ2 ) ∪ · · · ∪ B(xr , δr ) = X,
where δi = δxi for i = 1, 2, . . . , r. Let δ > 0 be given by
δ = minimum(δ1 , δ2 , . . . , δr ).
But B(xi , 2δi ) is contained wholly within one of the open sets belonging to
the open cover U. Thus A is contained wholly within one of the open sets
belonging to U, as required.
83
Proof Let dX and dY denote the distance functions for the metric spaces X
and Y respectively. Let f : X → Y be a continuous function from X to Y .
We must show that f is uniformly continuous.
Let ε > 0 be given. For each y ∈ Y , define
for all x ∈ X.
Lemma 6.25 Two norms k.k and k.k∗ on a real or complex vector space X
are equivalent if and only if they induce the same topology on X.
84
Proof Suppose that the norms k.k and k.k∗ induce the same topology on X.
Then there exists some δ > 0 such that
since the set {x ∈ X : kxk∗ < 1} is open with respect to the topology on X
induced by both k.k∗ and k.k. Let C be any positive real number satisfying
Cδ > 1. Then
1 1
x = < δ,
Ckxk C
and hence
1
kxk∗ = Ckxk x < Ckxk.
Ckxk ∗
for all non-zero elements x of X, and thus kxk∗ ≤ Ckxk for all x ∈ X. On
interchanging the roles of the two norms, we deduce also that there exists a
positive real number c such that kxk ≤ (1/c)kxk∗ for all x ∈ X. But then
ckxk ≤ kxk∗ ≤ Ckxk for all x ∈ X. We conclude that the norms k.k and
k.k∗ are equivalent.
Conversely suppose that the norms k.k and k.k∗ are equivalent. Then
there exist constants c and C, where 0 < c ≤ C, such that ckxk ≤ kxk∗ ≤
Ckxk for all x ∈ X. Let U be a subset of X that is open with respect to the
topology on X induced by the norm k.k∗ , and let u ∈ U . Then there exists
some δ > 0 such that
But then
showing that U is open with respect to the topology induced by the norm k.k.
Similarly any subset of X that is open with respect to the topology induced
by the norm k.k must also be open with respect to the topology induced by
k.k∗ . Thus equivalent norms induce the same topology on X.
It follows immediately from Lemma 6.25 that if k.k, k.k∗ and k.k] are
norms on a real (or complex) vector space X, if the norms k.k and k.k∗ are
equivalent, and if the norms k.k∗ and k.k] are equivalent, then the norms k.k
and k.k] are also equivalent. This fact can easily be verified directly from the
definition of equivalence of norms.
We recall that the usual topology on Rn is that generated by the Euclidean
norm on Rn .
85
Lemma 6.26 Let k.k be a norm on Rn . Then the function x 7→ kxk is
continuous with respect to the usual topology on on Rn .
x = (x1 , x2 , . . . , xn ), y = (y1 , y2 , . . . , yn ).
n
! 21 n
! 12
X X
≤ (xj − yj )2 kej k2 = Ckx − yk2 ,
j=1 j=1
where
C 2 = ke1 k2 + ke2 k2 + · · · + ken k2
and kx − yk2 denotes the Euclidean norm of x − y, defined by
n
! 12
X
kx − yk2 = (xj − yj )2 .
j=1
Theorem 6.27 Any two norms on Rn are equivalent, and induce the usual
topology on Rn .
86
Proof Let k.k be any norm on Rn . We show that k.k is equivalent to the
Euclidean norm k.k2 . Let S n−1 denote the unit sphere in Rn , defined by
and let f : S n−1 → R be the real-valued function on S n−1 defined such that
f (x) = kxk for all x ∈ S n−1 . Now the function f is a continuous function on
S n−1 (Lemma 6.26). Also the function f is non-zero at each point of S n−1 ,
and therefore the function sending x ∈ S n−1 to 1/f (x) is continuous. Now
any closed bounded set in Rn is compact (Theorem 6.16), and any continuous
real-valued function on a compact topological space is bounded (Lemma 6.5).
It follows that there exist positive real numbers C and D such that f (x) ≤ C
and 1/f (x) ≤ D for all x ∈ S n−1 . Let c = D−1 . Then c ≤ kxk ≤ C for all
x ∈ S n−1 .
Now
kxk
= f kxk−1
2 x
kxk2
for all x ∈ Rn \ {0}. (This is an immediate consequence of the fact that
kλxk = |λ| kxk for all x ∈ Rn and λ ∈ R.) It follows that ckxk2 ≤ kxk ≤
Ckxk2 for all x ∈ Rn \ {0}. These inequalities also hold when x = 0. The
result follows.
87