Introduction to linear algebra
Teo Banica
Real matrices, The determinant, Complex matrices, Calculus and applications,
Infinite matrices, Special matrices
08/20
Foreword
These are slides written in the Fall 2020, on linear algebra.
Presentations available at my Youtube channel.
1. Real matrices and their properties ... 3
2. The determinant of real matrices ... 19
3. Complex matrices and diagonalization ... 35
4. Linear algebra and calculus questions ... 51
5. Infinite matrices and spectral theory ... 67
6. Special matrices and matrix tricks ... 83
Real matrices and their properties
Teo Banica
"Introduction to linear algebra", 1/6
08/20
Rotations 1/3
Problem: what’s the formula of the rotation of angle t?
Rotations 2/3
The points in the plane
2 × 2 matrices “act” on
a
c
R2 can be represented as vectors
such vectors, as follows:
ax + by
x
b
=
cx + dy
y
d
x
y
. The
Many simple transformations (symmetries, projections..) can be
written in this form. What about the rotation of angle t?
Rotations 3/3
A quick picture shows that we must have:
cos t
1
∗ ∗
=
sin t
0
∗ ∗
Also, by paying attention to positives and negatives:
− sin t
0
∗ ∗
=
cos t
1
∗ ∗
Thus, the matrix of our rotation can only be:
cos t − sin t
Rt =
sin t cos t
By "linear algebra”, this is the correct answer.
Linear maps 1/4
Theorem. The maps f : R2 → R2 which are linear, in the sense
that they map lines through 0 to lines through 0, are:
x
ax + by
f
=
y
cx + dy
Remark. If we make the multiplication convention
ax + by
x
a b
=
cx + dy
y
c d
the theorem says f (v ) = Av , with A being a 2 × 2 matrix.
Linear maps 2/4
Examples. The identity and null maps are given by:
x
x
1 0
=
,
0 1
y
y
x
0
0 0
=
0 0
y
0
The projections on the horizontal and vertical axes are given by:
x
x
x
0
1 0
0 0
=
=
,
0 0
0 1
y
y
0
y
The symmetry with respect
0
1
to the x = y diagonal is given by:
x
y
1
=
0
y
x
We have as well the rotation of angle t ∈ R, studied before.
Linear maps 3/4
Theorem. The maps f : RN → RN which are linear, in the sense
that they map lines through 0 to lines through 0, are:
a11 x1 + . . . + a1N xN
x1
..
f ... =
.
xN
aN1 x1 + . . . + aNN xN
Remark. With the matrix multiplication convention
a11 . . . a1N
x1
a11 x1 + . . . + a1N xN
..
.. .. =
..
.
. .
.
aN1 . . . aNN
xN
aN1 x1 + . . . + aNN xN
the theorem says f (v ) = Av , with A being a N × N matrix.
Linear maps 4/4
Example. Consider the all-1 matrix. This acts as follows:
1 ... 1
x1
x 1 + . . . + xN
.. . . .. ..
..
.
. . . =
.
1 ... 1
xN
x 1 + . . . + xN
But this formula
1
1 .
.
N .
1
can be written as follows:
x1
... 1
1
.. x1 + . . . + xN ..
. . ..
.
. . . =
N
1
... 1
xN
And this latter map is the projection on the all-1 vector.
Theory 1/4
Definition. We can multiply M × N matrices with N × K matrices,
a11 . . . a1N
b11 . . . b1K
..
..
.. ..
.
.
. .
aM1 . . . aMN
bN1 . . . bNK
the product being a M × K matrix, given by the formula
a11 b11 + . . . + a1N bN1 . . . . . . a11 b1K + . . . + a1N bNK
..
..
.
.
..
..
.
.
aM1 b11 + . . . + aMN bN1 . . . . . . aM1 b1K + . . . + aMN bNK
obtained via the rule “multiply rows by columns”.
Theory 2/4
Better definition. The matrix multiplication is given by
X
(AB)ij =
Aik Bkj
k
with Aij being the entry on the i-th row and j-th column.
Theorem. The linear maps f : RN → RM are those of the form
fA (v ) = Av
with A being a N × M matrix.
Remark. Size check (N × 1) = (N × M)(M × 1), ok.
Theory 3/4
Theorem. With the above convention fA (v ) = Av , we have
fA fB = fAB
"the product of matrices corresponds to the composition of maps".
Theorem. A linear map f : RN → RN is invertible when the matrix
A ∈ MN (R) which produces it is invertible, and we have:
(fA )−1 = fA−1
Theory 4/4
Theorem. The inverses of the 2 × 2 matrices are given by:
a b
c d
−1
1
=
ad − bc
d −b
−c a
Proof. When ad = bc the columns are proportional, so the matrix
cannot be invertible. When ad − bc 6= 0, let us solve:
a b
c d
−1
1
=
ad − bc
∗ ∗
∗ ∗
We must solve the following equations:
ad − bc
0
∗ ∗
a b
=
0
ad − bc
∗ ∗
c d
But this leads to the formula in the statement.
Eigenvectors 1/4
Definition. Let A ∈ MN (R) be a square matrix, and assume that A
multiplies by λ ∈ R in the direction of a vector v ∈ RN :
Av = λv
In this case, we say that:
(1) v ∈ RN is an eigenvector of A.
(2) λ ∈ R is the corresponding eigenvalue.
Eigenvectors 2/4
Examples. The identity has all vectors as eigenvectors, with λ = 1:
x
x
1 0
=
0 1
y
y
The same goes for the null matrix, with λ = 0 this time:
x
0
0 0
=
0 0
y
0
For the projection on the horizontal axis, P yx = x0 , we have:
Pv = λv
⇐⇒
v=
0
x
, λ = 0 or v =
,λ = 1
y
0
A similar result holds for the projection on the vertical axis.
Eigenvectors 3/4
More examples. For the symmetry S
x
y
=
y
x
, we have:
x
x
Sv = λv ⇐⇒ v =
, λ = 1 or v =
, λ = −1
−x
x
For the transformation T yx = (00 10 ) yx = y0 we have:
Tv = λv
⇐⇒
x
v=
,λ = 0
0
For the rotation of angle t 6= 0, we must have v = 0, λ = 0.
Eigenvectors 4/4
Definition. We say that a matrix A ∈ MN (R) is diagonalizable if it
has N eigenvectors v1 , . . . , vN which form a basis of RN .
Remark. When A is diagonalizable, in that basis we can write:
λ1
..
A=
.
λN
This means that we have A = PDP −1 , with D diagonal.
Problems. Which matrices are diagonalizable? And, how to
diagonalize them?
The determinant of real matrices
Teo Banica
"Introduction to linear algebra", 2/6
08/20
Definition 1/3
Definition. Associated to any vectors v1 , . . . , vN ∈ RN is the volume
det+ (v1 . . . vN ) = vol < v1 , . . . , vN >
of the parallelepiped made by these vectors.
Remark. This notion is useful, for instance because v1 , . . . , vN are
linearly dependent precisely when det+ (v1 . . . vN ) = 0.
Definition 2/3
Theorem. In 2 dimensions we have the formula
+ a b
det
= |ad − bc|
c d
valid for any two vectors
a
c
,
b
d
∈ R2 .
Proof. We must
show
that the area of the parrallelogram formed by
the vectors ca , db equals the quantity |ad − bc|.
But this latter quantity is a difference of areas of two rectangles,
and this can be done in “puzzle” style.
Comment. This is nice, but with ad − bc as "answer", which is
linear in a, b, c, d, it would be even nicer.
Definition 3/3
Convention. A system of vectors v1 , . . . , vN ∈ RN is called:
(1) Oriented (+), if one can pass from the standard basis to it.
(2) Unoriented (-), otherwise.
Definition. Associated to v1 , . . . , vN ∈ RN is the signed volume
det(v1 . . . vN ) = vol ± < v1 , . . . , vN >
of the parallelepiped made by these vectors.
a b
Remark. We have det
c d
= ad − bc, which is nice.
Properties 1/4
Notation. Given a matrix A ∈ MN (R), we write det A, or just |A|,
for the determinant of the system of column vectors of A.
Notation. Given a linear map, written as f (v ) = Av , we call the
number det A the “inflation coefficient” of f .
Remark. The inflation coefficient of f is the signed volume of the
image f (N ) of the unit cube N ∈ RN .
Properties 2/4
Theorem. The determinant det A of the matrices A ∈ MN (R) has
the following properties:
(1) It is a linear function of the columns of A.
(2) We have det(AB) = det A · det B.
(3) We have det(AB) = det(BA).
Proof. (1) By doing some geometry, we obtain indeed:
det(u + v , {wi }) = det(u, {wi }) + det(v , {wi })
det(λu, {wi }) = λ det(u, {wi })
(2) This follows from fAB = fA fB , by looking at "inflation".
(3) Follows from (2), both quantities being det A · det B.
Properties 3/4
Theorem. Assuming that a matrix A ∈ MN (R) is diagonalizable,
with eigenvalues λ1 , . . . , λN , we have:
det A = λ1 . . . λN
Proof. This is clear from the "inflation" viewpoint, because in the
basis formed by the eigenvectors v1 , . . . , vN , we have:
fA (vi ) = λi vi
Alternatively, A = PDP −1 with D = diag (λ1 , . . . , λN ), so
det(A) = det(PDP −1 ) = det(DP −1 · P) = det(D)
and by linearity det(D) = λ1 . . . λN · det(1N ) = λ1 . . . λN .
Properties 4/4
Theorem. We have the following formula, for any λ ∈ R:
det(u, v , {wi }i ) = det(u − λv , v , {wi }i )
Theorem. For an upper triangular matrix we have
λ1
..
.
∗
= λ1 . . . λN
λN
and a similar result holds for the lower triangular matrices.
Proofs. The first theorem follows from linearity, because we have
det(v , v , {wi }i ) = 0, and the second theorem follows from it.
Examples 1/4
Theorem. In 2 dimensions, the determinant is given by:
a b
= ad − bc
c d
Proof. This is something that we already know, but that we can
recover by using the general theory developed above:
a b
c d
=
a − b · c/d
c − d · c/d
b
d
a − bc/d b
0
d
= (a − bc/d)d
=
Thus, we obtain the formula in the statement.
Examples 2/4
Theorem. In 3 dimensions, the determinant is given by
a b c
d e f = aei + bfg + cdh − ceg − bdi − afh
g h i
and this can be memorized by using Sarrus’ triangle method.
Proof. This follows a bit as in 2 dimensions, by using the "Gauss
method". We will be back later with a more conceptual proof.
Examples 3/4
Theorem. The determinant of a projection is always 0, unless the
projection is the identity, and the determinant is 1.
Proof. This is clear with the "inflation" viewpoint. Alternatively, P
is diagonalizable, with 1 eigenvalues on the image, and 0 outside:
1
P ∼ ...
0
By making the product we obtain det P = 1 . . . 1 · 0 . . . 0, with at
least one 0 in the case P 6= 1N , as claimed.
Examples 4/4
Example. For the symmetry with respect to x = y , we have:
0 1
= 0 · 0 − 1 · 1 = −1
1 0
Example. For the rotation of angle t ∈ R, we have:
cos t − sin t
= cos2 t + sin2 t = 1
sin t cos t
These formulae follow as well without computations, by "inflation".
Remark. The "basic" matrices tend to have determinant −1, 0, 1.
Theory 1/4
Theorem. The determinant can be fully computed by using the
Gauss method, namely:
(1) Multiplying row by scalars.
(2) Substracting rows.
Theorem. The determinant function
det : RN × . . . × RN → R
is multilinear, alternate and unital, and unique with these properties.
Proofs. The first theorem is something that we already know, and
the second theorem follows from it, by uniqueness.
Theory 2/4
Definition. A permutation of {1, . . . , N} is a bijection, as follows:
σ : {1, . . . , N} → {1, . . . , N}
The set of such permutations is denoted SN .
Theorem. There are N! = 1.2.3 . . . N such permutations.
Proof. We have N choices for σ(1), then N − 1 choices for σ(2),
and so on, up to 1 choice for σ(N).
Definition. The signature of a permutation ε(σ) ∈ {±1} is the
number of inversions, i < j with σ(i) > σ(j).
Theory 3/4
Theorem. The determinant is given by the formula
X
ε(σ)A1σ(1) . . . ANσ(N)
det A =
σ∈SN
with the signature function being the one introduced above.
Proof. This follows either by using the Gauss method, or by using
the abstract characterization of the determinant.
Remark. At N = 3 we obtain in this way the Sarrus formula.
Theory 4/4
Theorem. The eigenvalues of a matrix A ∈ MN (R) must satisfy
PA (λ) = 0
where PA = det(A − λ1N ) is the characteristic polynomial.
Proof. Given a vector v ∈ RN and a number λ ∈ R, we have:
Av = λv ⇐⇒ (A − λ1N )v = 0
But this latter equation has nonzero solutions when
B = det(A − λ1N )
is not invertible, and so when det B = 0.
Complex matrices and diagonalization
Teo Banica
"Introduction to linear algebra", 3/6
08/20
Complex numbers 1/3
The complex numbers are z = a + ib, with i 2 = −1.
They can be represented in the plane, with z being ba .
√
We have z = re it , with r = a2 + b 2 , and tan t = b/a.
The equation x 2 = −1 has two solutions, x = ±i.
In fact, the equation P(x) = 0 has N = deg P solutions.
Also, complex numbers are important in quantum physics.
Complex numbers 2/3
Consider the rotation of angle t ∈ R:
cos t − sin t
Rt =
sin t cos t
This rotation has 2 complex eigenvectors (!), because:
1
cos t − i sin t
1
=
Rt
= e −it
i
sin t + i cos t
i
1
cos t + i sin t
1
it
=
Rt
=e
−i
sin t − i cos t
−i
Thus, good news, Rt is diagonalizable over C.
Complex numbers 3/3
More magics. When identifying R2 with the complex plane C, the
rotation of angle t ∈ R becomes a 1 × 1 matrix (!):
Rt = (e it )
Thus, with complex numbers, this rotation Rt of angle t ∈ R in the
plane is something completely trivial. Very nice.
Theory 1/4
The theory from the real case extends to this setting:
Theorem. Any linear map f : CN → CN is of the form f (v ) = Av ,
with A ∈ MN (C).
Theorem. More generally, any linear map f : CN → CM is of the
form f (v ) = Av , with A ∈ MM×N (C).
Theorem. With fA (v ) = Av , we have fAB = fA fB . In particular fA is
invertible when A is invertible, and fA−1 = fA−1 .
Theory 2/4
The theory of the determinant extends as well:
Definition. The determinant of a matrix A ∈ MN (C) is
X
ε(σ)A1σ(1) . . . ANσ(N)
det A =
σ∈SN
where ε(σ) = (−1)c , c being the number of inversions.
Theorem. The determinant is subject to the following rules:
(1) det(λu, {wi }) = λ det(u, {wi }).
(2) det(u, v , {wi }) = det(u − v , v , {wi }).
Also, we have det(AB) = det A · det B, det(At ) = det A.
Theory 3/4
The theory of the eigenvalues extends as well:
Definition. Given A ∈ MN (C), if v ∈ CN and λ ∈ C satisfy
Av = λv
we say that v is an eigenvector of A, with eigenvalue λ.
Theorem. The eigenvalues are the roots of the polynomial
P(λ) = det(A − λ1N )
called characteristic polynomial of the matrix.
Theory 4/4
Theorem. Consider a 2 × 2 real or complex matrix:
a b
A=
c d
(1) The characteristic polynomial is P(λ) = λ2 − Sλ + P, with:
S =a+d
,
P = ad − bc
(2) We have two complex eigenvalues, given by:
λ1 + λ2 = S
,
λ 1 λ2 = P
(3) Equivalently, we have the following formula:
√
S ± S 2 − 4P
λ1,2 =
2
Diagonalization 1/4
Theorem. Given A ∈ MN (C), consider its characteristic polynomial
P(x) = det(A − x1N ), and decompose it into factors:
P(x) = (−1)N (x − λ1 ) . . . (x − λN )
For λ ∈ {λ1 , . . . , λN } consider the corresponding eigenspace:
Eλ = {v ∈ CN Av = λv }
We have then dimension inequalities as follows, for any λ,
1 ≤ dim(Eλ ) ≤ #(λ ∈ {λ1 , . . . , λN })
and A is diagonalizable precisely when we have equalities at right.
Diagonalization 2/4
In practice, the above result can be used as follows:
(1) Compute the characteristic polynomial P(x) = det(A − x1N ),
and factorize it as P(x) = (−1)N (x − λ1 ) . . . (x − λN ).
(2) Remark: if λi are distinct, A is certainly diagonalizable. Also, if
λi ∈
/ R for some i, A is not diagonalizable over R.
(3) Solve Av = λi v for any i. If a space of solutions Eλi satisfies
dim(Eλi ) < #(λ ∈ {λ1 , . . . , λN }), A is not diagonalizable.
(4) Otherwise, find a basis of each of these spaces Eλi , and put all
eigenvectors found into a matrix P (the "passage matrix").
(5) Put as well all eigenvalues found on the diagonal of a matrix D.
Compute P −1 . We have then A = PDP −1 .
Diagonalization 3/4
Some tricks and tips:
(1) In 2 dimensions, where A = (ac bd ), the eigenvalues are best
computed by using x + y = a + d, xy = ad − bc.
(2) In fact, in N dimensions, it is known that the eigenvalues satisfy
λ1 + . . . + λN = Tr (A) and λ1 . . . λN = det A.
(3) If P has integer coefficients, P ∈ Z[X ], look first for integer
roots, λ ∈ Z. These must divide the coefficient of X 0 .
Diagonalization 4/4
More tricks and tips:
(1) When computing eigenspaces Eλi , start with eigenvalues having
big multiplicity, because the computation here might lead to the
conclusion that A is not diagonalizable, and so you’re done.
(2) Always check and doublecheck your computations. If your
matrix depends on a parameter t, plug in t = 0 or so from time to
time, in order to doublecheck. Good luck!
Advanced 1/4
Theorem. With respect to < x, y >=
P
i
xi ȳi we have
< Ax, y >=< x, A∗ y >
with A∗ being the adjoint matrix, given by (A∗ )ij = Āji .
Theorem. For a matrix U ∈ MN (A), the following are equivalent:
(1) U is a unitary, < Ux, Uy >=< x, y >.
(2) U satisfies the equation U ∗ = U −1 .
Proof. We have indeed < Ux, Uy >=< x, U ∗ Uy >, as desired.
Advanced 2/4
Theorem. The matrices which are normal, in the sense that
AA∗ = A∗ A
are diagonalizable.
Theorem. The matrices which are self-adjoint, in the sense that
A = A∗
are diagonalizable. Moreover, their eigenvalues are real.
Theorem. The matrices which are unitary, in the sense that
U ∗ = U −1
are diagonalizable. Their eigenvalues are on the unit circle.
Advanced 3/4
Theorem. The following happen, inside MN (C):
(1) The matrices having distinct eigenvalues are dense.
(2) The diagonalizable matrices are dense.
Proof. Here (1) follows by using the resultant R(P, P ′ ), because
the equation R = 0 defines a hypersurface in MN (C), having dense
complement. As for (2), this follows from (1).
Comment. This is interesting, because it tells us that "any formula
which is true for diagonalizable matrices is true in general".
Advanced 4/4
Theorem. Any matrix A ∈ MN (C) can be put in Jordan form
J1
..
A∼
.
Jk
with each Jordan block being of the
λ 1
..
.
J=
following type,
..
. 1
λ
..
.
with the numbers λ ranging over the eigenvalues of A.
Linear algebra and calculus questions
Teo Banica
"Introduction to linear algebra", 4/6
08/20
Systems 1/3
Theorem. Any linear system of equations
a11 x1 + a12 x2 + . . . + a1N xN = v1
a21 x1 + a22 x2 + . . . + a2N xN = v2
..
.
aN1 x1 + aN2 x2 + . . . + aNN xN = vN
can be written in matrix form, as follows,
a11 a12 . . . a1N
x1
v1
a21 a22 . . . a2N x2 v2
..
.. .. = ..
.
. . .
aN1 aN2 . . . aNN
xN
vN
and when A is invertible, its solution is given by x = A−1 v .
Systems 2/3
Theorem. Any linear recurrence system
xk+1 = a11 xk + a12 yk + a13 zk + . . .
yk+1 = a21 xk + a22 yk + a23 zk + . . .
zk+1 = a31 xk + a32 yk + a33 zk + . . .
..
.
can be written in matrix form,
a11
xk+1
yk+1 a21
zk+1 = a31
..
..
.
.
as follows,
xk
a12 a13 . . .
a22 a23 . . . yk
a32 a33 . . .
zk
..
..
..
.
.
.
and the solution is obtained by applying Ak to the inital data.
Systems 3/3
In order to compute Ak , we must diagonalize the matrix,
A = PDP −1
and then the powers are given by the following formula:
Ak = PD k P −1
This formula holds in fact for any k ∈ Z, or even k ∈ R.
Calculus 1/4
Theorem. Any function can be locally approximated as
f (x + t) ≃ f (x) + at
where a = f ′ (x) is the derivative of f at the point x.
Proof. Let us recall indeed the definition of the derivative:
f (x + t) − f (x)
t→0
t
f ′ (x) = lim
But this gives the formula in the statement.
Calculus 2/4
Theorem. Any function of several variables, written as
f = (f1 , . . . , fN )
can be locally approximated as follows,
f (x + t) ≃ f (x) + At
with A being the matrix of partial derivatives at x,
∂fi
(x)
A=
∂xj
ij
acting on the vectors t by usual multiplication.
Calculus 3/4
Theorem. We have the change of variable formula
Z
b
f (x)dx =
a
Z
d
f (ϕ(t))ϕ′ (t)dt
c
where c = ϕ−1 (a) and d = ϕ−1 (b).
Proof. This follows with f = F ′ from the rule
(F ϕ)′ (t) = F ′ (ϕ(t))ϕ′ (t)
by integrating between c and d.
Calculus 4/4
Theorem. Given a transformation in several variables,
ϕ = (ϕ1 , . . . , ϕN )
we have the following change of variable formula,
Z
Z
f (x)dx =
f (ϕ(t))Jϕ (t)dt
E
ϕ−1 (E )
with the Jϕ quantity, called Jacobian, being given by:
"
#
∂ϕi
Jϕ (t) = det
(x)
∂xj
ij
Polar coordinates 1/4
Theorem. We have polar coordinates in 2 dimensions,
(
x = r cos t
y = r sin t
and the corresponding Jacobian is J(r , t) = r .
Proof. The Jacobian is by definition given by:
cos t −r sin t
=r
sin t r cos t
Thus, we have indeed the formula in the statement.
Polar coordinates 2/4
Z
2
e −x dx =?
R
Polar coordinates 3/4
Theorem. We have the following formula:
Z
√
2
e −x dx = π
R
Proof. The square of the integral is given by:
Z Z
2
2
2
I =
e −x −y dxdy
R R
Z 2π Z ∞
2
=
re −r drdt
0
0
Z 2π " −r 2 #∞
e
dt
−
=
2
0
0
We obtain I 2 = (2π) ×
1
2
= π, and so I =
√
π.
Polar coordinates 4/4
Definition. The normal law of parameter 1 is:
1
2
g1 = √ e −x /2 dx
2π
More generally, the normal law of parameter t > 0 is:
gt = √
1
2
e −x /2t dx
2πt
√
Remark. The Gauss formula gives with x = y / 2t
Z
√
2
e −y /2t dy = 2πt
R
so these laws have indeed mass 1.
Spheres 1/4
Theorem. We have spherical coordinates in 3 dimensions,
x = r cos s
y = r sin s cos t
z = r sin s sin t
and the corresponding Jacobian is J(r , s, t) = r 2 sin s.
Proof. The Jacobian is given by:
cos s
−r sin s
0
sin s cos t r cos s cos t −r sin s sin t = r 2 sin s
sin s sin t r cos s sin t r sin s cos t
Thus, we have indeed the formula in the statement.
Spheres 2/4
Theorem. We have spherical coordinates in N dimensions,
x1 = r cos t1
x2 = r sin t1 cos t2
..
.
xN−1 = r sin t1 . . . sin tN−2 cos tN−1
x
= r sin t1 . . . sin tN−2 sin tN−1
N
and the corresponding Jacobian is:
J(r , t) = r N−1 sinN−2 t1 sinN−3 t2 . . . sin2 tN−3 sin tN−2
Remark. This generalizes the previous coordinates at N = 2, 3.
Spheres 3/4
Theorem. The volume of the sphere in RN is given by
π [N/2]
V
1
=
N
2
(N + 1)!!
2
with N!! = (N − 1)(N − 3)(N − 5) . . ., stopping at 1 or 2.
(1) At N = 1 we obtain V /2 = 1, so V = 2.
(2) At N = 2 we obtain V /4 = π/2 · 1/2, so V = π.
(3) At N = 3 we obtain V /8 = π/2 · 1/3, so V = 4π/3.
(4) At N = 4 we obtain V /16 = π 2 /4 · 1/8, so V = π 2 /2.
Spheres 4/4
Proof. By using spherical coordinates, and Fubini, we are left with
computing integrals over the circle. But these are given by
2
π
Z
π/2
0
δ(p,q)
2
p!!q!!
cos t sin t dt =
π
(p + q + 1)!!
p
q
where δ(a, b) = 0 if both a, b are even, and δ(a, b) = 1 otherwise,
and by plugging in these quantities, we obtain the result.
Infinite matrices and spectral theory
Teo Banica
"Introduction to linear algebra", 5/6
08/20
Linear spaces 1/3
Definition. A complex vector space is a set V with operations
(u, v ) → u + v
,
(λ, u) → λu
having the following properties:
(1) u + v = v + u.
(2) (u + v ) + w = u + (v + w ).
(3) (λ + µ)u = λu + µu.
(4) (λµ)u = λ(µu).
(5) λ(u + v ) = λu + λv .
Examples. CN , C∞ , MN (C), C [0, 1] and many other.
Linear spaces 2/3
Definition. A map f : V → W is called linear when:
(1) f (u + v ) = f (u) + f (v ).
(2) f (λu) = λf (u).
Theorem. Let f : V → W be a linear map.
(1) ker f = {v ∈ V |f (v ) = 0} is a linear space.
(2) Im f = {f (v )|v ∈ V } is a linear space.
(3) dim ker f + dim Im f = dim V .
Linear spaces 3/3
Theorem. In finite dimensions, any vector space V has a basis {ei },
which is such that any v ∈ V can be written, uniquely, as:
v = v 1 e 1 + . . . + vN e N
Thus we have V = CN , the identification being given by:
v1
..
v = .
vN
As a consequence, any linear map f : V → W between finite
dimensional vector spaces corresponds to a matrix.
Hilbert spaces 1/4
Definition. A scalar product on a complex vector space H is an
operation (x, y ) →< x, y >, satisfying:
(1) < x, y > is linear in x, and antilinear in y .
(2) < x, y > =< y , x >, for any x, y .
(3) < x, x >> 0, for any x 6= 0.
Theorem. If we set ||x|| =
√
< x, x > then:
(1) | < x, y > | ≤ ||x|| · ||y ||.
(2) ||x + y || ≤ ||x|| + ||y ||.
(3) d(x, y ) = ||x − y || is a distance.
Proof. (1) follows from the fact that the degree 2 polynomial
f (t) = ||tx + y ||2 is positive, and (1) =⇒ (2) =⇒ (3).
Hilbert spaces 2/4
Definition. A Hilbert space is a complex vector space H with a
scalar product < x, y >, which is complete with respect to
||x|| =
√
< x, x >
in the sense that the Cauchy sequences with respect to the
associated distance d(x, y ) = ||x − y || converge.
Examples.
(1) H = CN , with < x, y >=
P
i xi ȳi .
P
(2) H =
with < x, y >= i xi ȳi .
R
(3) H = L2 (X ), with < f , g >= X f (x)g (x)dx.
l 2 (N),
Hilbert spaces 3/4
Theorem. Any Hilbert space H has an orthonormal basis {ei }i∈I ,
and so we have an indentification H = l 2 (I ).
Proof. The basis can be constructed by starting with an "algebraic”
basis, and using the Gram-Schmidt method.
Warning. For spaces like H = L2 [0, 1], this is something not trivial.
Theorem. Let H be a Hilbert space, with basis {ei }i∈I . We have
L(H) ⊂ MI (C)
with T : H → H linear corresponding to the following matrix:
Mij =< Tej , ei >
In particular, when dim(H) = N < ∞, we obtain L(H) ≃ MN (C).
Hilbert spaces 4/4
Theorem. Given a Hilbert space H, the linear operators T : H → H
which are bounded, in the sense that
||T || = sup ||Tx||
||x||≤1
is finite, form a complex algebra with unit B(H), which:
(1) is complete with respect to ||.|| (Banach algebra).
(2) has an involution T → T ∗ , < Tx, y >=< x, T ∗ y >.
The norm and involution are related by ||TT ∗ || = ||T ||2 .
Proof. Here "complex algebra" is elementary, (1) follows by setting
Tx = limn→∞ Tn x, (2) comes from the fact that ϕ(x) =< Tx, y >
is linear, and (3) can be proved by double inequality.
Spectral theory 1/4
Definition. A C ∗ -algebra is a complex algebra with unit A, with:
(1) A norm a → ||a||, making it a Banach algebra.
(2) An involution a → a∗ , such that ||aa∗ || = ||a||2 , ∀a ∈ A.
Definition. The spectrum of an element a ∈ A is the set:
σ(a) = λ ∈ C a − λ 6∈ A−1
Theorem. σ(ab) = σ(ba) outside {0}.
Proof. Indeed, c = (1 − ab)−1 =⇒ 1 + cba = (1 − ba)−1 .
Remark. In infinite dimensions, S ∗ S = 1, SS ∗ 6= 1 (shift).
Spectral theory 2/4
Theorem. We have the following formula, for any rational function
f ∈ C(X ) having its poles outside σ(a):
σ(f (a)) = f (σ(a))
Proof. In the polynomial case, f ∈ C[X ], we can factorize,
f (X ) − λ = c(X − r1 ) . . . (X − rn )
and the result can be proved as follows:
λ∈
/ σ(f (a))
⇐⇒
a − r1 , . . . , a − rn ∈ A−1
⇐⇒
λ∈
/ f (σ(a))
⇐⇒
r1 , . . . , rn ∈
/ σ(a)
In the general case, f = P/Q, we can use F = P − λQ.
Spectral theory 3/4
Definition. Given an element a ∈ A, its spectral radius ρ(a) is the
radius of the smallest disk centered at 0 containing σ(a).
Theorem. Let A be a C ∗ -algebra.
(1) The spectrum of a norm 1 element is in the unit disk.
(2) The spectrum of a unitary (a∗ = a−1 ) is on the unit circle.
(3) The spectrum of a self-adjoint element (a = a∗ ) is real.
(4) ρ of a normal element (aa∗ = a∗ a) equals its norm.
Spectral theory 4/4
(1) Clear from (1 − a)−1 = 1 + a + a2 + . . . for ||a|| < 1.
(2) Follows by using f (z) = z −1 . Indeed, we have:
σ(a)−1 = σ(a−1 ) = σ(a∗ ) = σ(a)
(3) Follows from (2), by using f (z) = (z + it)/(z − it).
(4) By (1) we have ρ(a) ≤ ||a||. Given ρ > ρ(a), we have:
!
Z
Z
∞
X
zn
dz =
z n−k−1 dz ak = an−1
z
−
a
|z|=ρ
|z|=ρ
k=0
By applying the norm and taking n-th roots we obtain:
ρ ≥ lim ||an ||1/n
n→∞
If a = a∗ we are done. In general, we can use ||aa∗ || = ||a||2 .
Advanced 1/4
Theorem. Given a compact space X , the complex algebra
C (X ) = {f : X → C continuous}
is a C ∗ -algebra, with norm and involution given by:
||f || = sup |f (x)| ,
f ∗ (x) = f (x)
x∈X
This algebra is commutative, in the sense that fg = gf .
Proof. It is well-known that C (X ) is complete with respect to the
sup norm, and the other conditions are trivially satisfied.
Advanced 2/4
Theorem. Any commutative C ∗ -algebra is the form C (X ), with its
“spectrum” X = Spec(A) consisting of the characters:
χ:A→C
Proof. Set X = Spec(A), with topology making continuous all the
evaluation maps eva : χ → χ(a). Then X is a compact space, and
a → eva is a morphism of algebras ev : A → C (X ).
(1) ev involutive. Using real + imaginary parts, we must prove that
eva∗ = eva∗ when a = a∗ . But this follows from σ(a) ⊂ R.
(2) ev isometric. Follows from ||eva || = ρ(a) = ||a||.
(3) ev surjective. Follows from Stone-Weierstrass.
Advanced 3/4
Theorem. Assume that a ∈ A is normal, and let f ∈ C (σ(a)).
(1) We can define f (a) ∈ A, with f → f (a) being a morphism.
(2) We have the formula σ(f (a)) = f (σ(a)).
Proof. Since a is normal, B =< a > is commutative, and the
Gelfand theorem gives B = C (X ), with X = Spec(B).
The map X → σ(a) given by evaluation at a being bijective, we
have X = σ(a). Thus B = C (σ(a)), and we are done.
Advanced 4/4
Definition. Given an arbitrary C ∗ -algebra A, we can write
A = C (X )
and call X a "noncommutative compact space".
Special matrices and matrix tricks
Teo Banica
"Introduction to linear algebra", 6/6
08/20
Fourier 1/3
Theorem. We have the Vandermonde formula:
1
x1
..
.
1
x2
..
.
1
xN
..
.
...
...
x1N−1 x2N−1 . . . xNN−1
=
Y
i>j
(xi − xj )
Proof. The determinant D is a polynomial in x1 , . . . , xN , of degree
N − 1 in each variable. Since xi = xj makes D = 0, we obtain:
Y
D=c
(xi − xj )
i>j
The constant c ∈ R can be computed by recurrence, we get c = 1.
Fourier 2/3
Definition. The Fourier matrix FN is given by:
FN = (w ij )ij
,
w = e 2πi/N
With matrices indices i, j = 0, 1, . . . , N − 1, we have:
1
1
1
...
1
1
w
w2
. . . w N−1
2
4
2(N−1)
1
w
w
.
.
.
w
FN =
..
..
..
..
.
.
.
.
2
N−1
(2N−1)
(N−1)
1 w
w
... w
This is a Vandermonde matrix, with xi = w i .
Fourier 3/3
Theorem. The rescaled matrix FN =
√1 (w ij )ij
N
is unitary.
Proof. We have the following computation:
X
(FN FN∗ )ij =
(FN )ik (F̄N )jk
k
=
X
k
=
X
w ik · w −jk
(w i−j )k
k
= Nδij
√
Thus the rescaled matrix FN = FN / N is unitary.
Special matrices 1/4
Theorem. For a matrix H ∈ MN (C), the following are equivalent,
(1) H is circulant, Hij = ξj−i for some ξ ∈ CN .
(2) H is Fourier-diagonal, H = FQF ∗ with Q diagonal.
where F = FN . In addition, the first row vector of H is
√
ξ = Fq/ N
where qi = Qii is the vector formed by the diagonal entries of Q.
Special matrices 2/4
Proof. If Hij = ξj−i is circulant then Q = F ∗ HF is diagonal:
Qij =
X
1 X jl−ik
w
ξl−k = δij
w jr ξr
N
r
kl
Also, if Q = diag (q) is diagonal then H = FQF ∗ is circulant:
Hij =
X
k
Fik Qkk F̄jk =
1 X (i−j)k
w
qk
N
k
√
This formula proves as well the last assertion, ξ = Fq/ N.
Special matrices 3/4
Theorem. The various sets of circulant matrices are as follows,
(1) MN (C)circ = {FQF ∗ |q ∈ CN }.
(2) UNcirc = {FQF ∗ |q ∈ TN }.
(3) ONcirc = {FQF ∗ |q ∈ TN , q̄i = q−i , ∀i}.
with the convention Q = diag (q), for q ∈ CN .
Proof. (1) This is something that we already know.
(2) This is because the eigenvalues must be on the unit circle T.
(3) For q ∈ CN we have Fq = F q̃, with q̃i = q̄−i , and so ξ = Fq
is real if and only if q̄i = q−i for any i. This gives the result.
Special matrices 4/4
Theorem. The groups BN ⊂ ON and CN ⊂ UN of bistochastic
matrices (sum 1 on each row and column) are given by:
BN ≃ ON−1
,
CN ≃ UN−1
Proof. The all-1 vector ξ being equal to
Uξ = ξ
⇐⇒
⇐⇒
⇐⇒
√
NFe0 , we have:
UFe0 = Fe0
F ∗ UFe0 = e0
F ∗ UF = diag (1, w )
Thus we have isomorphisms as in the statement.
Hadamard matrices 1/4
Definition. A complex Hadamard matrix is a square matrix
H ∈ MN (C)
whose entries are on the unit circle, Hij ∈ T, and whose rows are
pairwise orthogonal, with respect to the scalar product of CN .
Example. For the Fourier matrix, FN = (w ij ) with w = e 2πi/N , the
scalar products between rows are:
X
X
w (a−b)j = Nδab
w aj w −bj =
< Ra , Rb >=
j
j
Thus the Fourier matrix FN is Hadamard.
Hadamard matrices 2/4
Theorem. Given a finite abelian group G , with group dual
Gb = {χ : G → T}
consider the Fourier coupling G × Gb → T:
(i, χ) → χ(i)
(1) Via the standard isomorphism G ≃ Gb, this Fourier coupling is a
square matrix, FG ∈ MG (T), which is complex Hadamard.
(2) For a cyclic group G = ZN we obtain in this way, via the
standard identification ZN = {1, . . . , N}, the Fourier matrix FN .
(3) In general, when using a decomposition G = ZN1 × . . . × ZNk ,
the corresponding Fourier matrix is FG = FN1 ⊗ . . . ⊗ FNk .
Hadamard matrices 3/4
Examples. (1) For the cyclic group Z2 we obtain the Fourier matrix
F2 , also denoted W2 , and called first Walsh matrix:
1 1
W2 =
1 −1
(2) For the Klein group Z2 × Z2 we obtain the tensor product
W4 = W2 ⊗ W2 , called second Walsh matrix:
1 1
1
1
1 −1 1 −1
W4 =
1 1 −1 −1
1 −1 −1 1
(3) In general, for the group Zn2 we obtain the n-th Walsh matrix
WN = W2⊗n , having size N = 2n . Useful in radio, coding.
Hadamard matrices 4/4
Hadamard Conjecture. There is at least one real Hadamard matrix
H ∈ MN (±1)
for any integer N ∈ 4N.
Comment. Verified so for up to N = 666.
Rotations 1/4
Theorem. For a matrix U ∈ MN (C), the following are equivalent:
(1) U preserves the scalar product, < Ux, Uy >=< x, y >.
(2) U preserves the norm, ||Ux|| = ||x||, where ||x|| =
√
< x, x >.
(3) U is unitary, in the sense that U ∗ = U −1 , where (U ∗ )ij = Ūji .
(4) U has its eigenvalues on the unit circle T.
Proof. The equivalences (1) ⇐⇒ (2) ⇐⇒ (3) follow by using
< Mx, y >=< x, M ∗ y >, and (4) is something that we know.
Rotations 2/4
Theorem. The unitaries in M2 (C) of determinant 1 are
a b
U=
−b̄ ā
with a, b ∈ C satisfying |a|2 + |b|2 = 1.
Proof. For U = (ac bd ) of determinant 1, U ∗ = U −1 reads:
d −b
ā c̄
=
−c a
b̄ d¯
Thus c = −b̄, d = ā. Finally, det U = 1 gives |a|2 + |b|2 = 1.
Rotations 3/4
Theorem. The unitaries in M3 (R) of determinant 1 are
2
x + y 2 − z2 − t2
2(xt + yz)
O=
2(yt − xz)
2(yz − xt)
x2 + z2 − y 2 − t2
2(xy + zt)
2(xz + yt)
2(zt − xy )
2
2
2
2
x +t −y −z
with x, y , z, t ∈ R satisfying x 2 + y 2 + z 2 + t 2 = 1.
Proof. With a = x + iy , b = z + it, the previous formula reads:
x + iy z + it
U=
−z + it x − iy
But we must have "O + 1 = ad(U)", and this gives the result.
Rotations 4/4
Conclusion. We can now:
• do some serious engineering
• or write 3D games software.
References 1/2
[1] M.F. Atiyah and I.G. MacDonald, Introduction to commutative algebra,
Addison-Wesley (1969).
[2] R. Brauer, On algebras which are connected with the semisimple continuous
groups, Ann. of Math. 38 (1937), 857–872.
[3] P. Deligne, Catégories tannakiennes, in “Grothendieck Festchrift”, Birkhauser
(1990), 111–195.
[4] P. Diaconis and M. Shahshahani, On the eigenvalues of random matrices, J.
Applied Probab. 31 (1994), 49–62.
[5] S. Doplicher and J. Roberts, A new duality theory for compact groups, Invent.
Math. 98 (1989), 157–218.
[6] V.G. Drinfeld, Quantum groups, Proc. ICM Berkeley (1986), 798–820.
[7] R. Hartshorne, Algebraic geometry, Springer (1977).
[8] F. Klein, Vergleichende Betrachtungen über neuere geometrische Forschungen,
Math. Ann. 43 (1893), 63–100.
References 2/2
[9] S. Lang, Algebra, Addison-Wesley (1993).
[10] W. Rudin, Real and complex analysis, McGraw-Hill (1966).
[11] J.P. Serre, Linear representations of finite groups, Springer (1977).
[12] I.R. Shafarevich, Basic algebraic geometry, Springer (1974).
[13] G.C. Shephard and J.A. Todd, Finite unitary reflection groups, Canad. J. Math. 6
(1954), 274–304.
[14] T. Tannaka, Über den Dualitätssatz der nichtkommutativen topologischen
Gruppen, Tôhoku Math. J. 45 (1939), 1–12.
[15] H. Weyl, The classical groups: their invariants and representations, Princeton
(1939).
[16] S.L. Woronowicz, Compact matrix pseudogroups, Comm. Math. Phys. 111
(1987), 613–665.