Lecture Notes On SVD For Math 54
Lecture Notes On SVD For Math 54
Lecture Notes On SVD For Math 54
and
{v1 , . . . , vn } is an orthonormal basis of Rn .
where (
1 if i = j
δij =
0 6 j.
if i =
(ii) Furthermore, the eigenvalues of S = AT A are nonnegative:
λj ≥ 0 (j = 1, . . . , n).
1
Proof. We use (1) to calculate that
λ1 ≥ · · · ≥ λr > λr+1 = · · · = λn = 0.
Then
DEFINITION. We write
1
ui = Avi (i = 1, . . . , r).
σi
The key point is that we can use the orthonomal basis {u1 , . . . , um }
of Rm and the orthonormal basis {v1 , . . . , vn } of Rn to convert our
matrix A into a simpler form. Here is how to do it:
2
NOTATION. Introduce the m × m orthogonal matrix
U = (u1 |u2 | . . . |um ),
whose ith column is ui (i = 1, . . . , m). Likewise, introduce the n × n orthog-
onal matrix
V = (v1 |v2 | . . . |vn ).
Then
(5) U U T = U T U = I, V V T = V T V = I.
THEOREM 1. We have
σ1 0 ... 0
0 σ2 ... 0
(6) U T AV = ... ... ..
.
.. .
. O
0 0 . . . σr
O O
REMARK. Thus if we write Σ for the m × n matrix on the right hand side
of (6), we obtain using (5) the singular value decomposition (SVD)
(7) A = U ΣV T
of our matrix A.
This is similar to the familiar orthogonal diagonalization formula for a
symmetric n × n matrix, but in (6) and (7) the matrix A need not be
symmetric nor square.
Proof. Since
AV = A(v1 |v2 | . . . |vn ) = (Av1 |Av2 | . . . |Avn ),
it follows that
u1 · Av1 u1 · Av2 . . . u1 · Avn
u2 · Av1 u2 · Av2 . . . u2 · Avn
(8) U T AV = .
.. .. ... ..
. . .
um · Av1 um · Av2 . . . um · Avn
3
Now if j ∈ {r + 1, . . . , n}, then Avj = 0. If j ∈ {1, . . . , r} and i ∈ {r +
1, . . . , m}, then
ui · Avj = σj ui · uj = 0.
Finally, if i, j ∈ {1, . . . , r}, then
1 λi
ui · Avj = Avi · Avj = vi · vj = σi δij .
σi σi
Using these formulas in (8) gives (6).
3. Let 1
σj = λj2 (j = 1, . . . , n);
then
σ1 ≥ · · · ≥ σr > σr+1 = · · · = σn = 0.
4. Define
1
ui = Avi (i = 1, . . . , r).
σi
5. Extend {u1 , . . . , ur } to an orthonormal basis {u1 , . . . , um } of Rm .
We compute
T 1 0
S = A A = 25 .
0 4
4
The eigenvalues of S are λ1 = 100, λ2 = 25, with corresponding orthonormal
eigenvectors
0 1
v1 = , v2 = .
1 0
Therefore
σ1 = 10, σ2 = 5
and
1 1 3 1 1 −4
u1 = Av1 = , u2 = Av2 = .
σ1 5 4 σ2 5 3
So
0 1 1 3 −4 10 0
V = , U= , Σ= .
1 0 5 4 3 0 5