Canonical Forms: 1) Triangular Form
Canonical Forms: 1) Triangular Form
Canonical Forms: 1) Triangular Form
CANONICAL FORMS
1) Triangular Form
Definition :
The linear transformations S , T ∈ A (V ) are said to be similar if there exists an invertible element
Definition :
The relation on A (V) defined by similarity is an equivalence relation, the equivalence class of
an element will be called its similarity class.
Note :
To check the two linear transformations are similar or not is difficult. Therefore, we can use
similarity class which matrix in some basis. These matrices will be called the Canonical Forms.
Definition :
Lemma :
V
If W ⊂ V is invarient under T, then T induces a linear transformation T on defined by
W
T ( v + W ) = T ( v ) + W . If T satisfies the polynomial q ( x ) ∈ F [ x ] then so does T . If p1 ( x ) is the
minimal polynomial for T over F and if p ( x ) is that for T then p1 ( x) | p ( x ) .
54
V
Proof : Let W ⊂ V is inveriant under T. T is linear transformation and V = be vector space which
w
contain the element as v = v + W for v ∈ V .
Define, T ( v ) = T ( v ) + W
∴ v1 +W = v2 +W ⇒ ( v1 − v2 )+ W = 0 ⇒ v1 − v2 ∈ W
∴T ( v1 ) −T ( v2 ) ∈ W = T ( v1 ) + W = T ( v2 ) + W
α , β ,∈ F , α v1 + β v2 ∈V for v1, v2 ∈ V .
∴T (α v1 + β v 2 ) = T ( α v1 + β v 2 ) = T ( α v1 + βv 2 ) +W
= ( α T ( v1 ) + β T ( v2 ) ) + W = α T ( v1 ) + W + β T ( v2 ) + W
= α T ( v1 ) + W + β T ( v2 ) + W = α T ( v1 ) + β T ( v2 )
Hence, T is linear..
Now if v = v + W ∈V then T 2 ( v ) = T 2 ( v ) + W = T ( T ( v ) ) + W
⇒ T 2 ( v ) = T (T ( v ) + W ) = T ( T ( v + W ) ) = ( T ) ( v )
2
Q for q ( x ) = α 0 + α1 x + .... + α n x n
∴ q ( T ) = α 0 + α1T + .... + α nT m
( )
⇒ q ( T ) = α 0 + α1T + .... + α nT m = α 0 + α1T + .... + α nT m
Qα i ∈ F
n
= α 0 + α1T + .... + α n T
= α 0 + α1T + .... + α n ( T ) = q (T )
n
Q (1)
55
For any q ( x ) ∈ F [ x ] with q ( T ) = 0 Q 0 is the zero transformation on V .
0 = q (T ) = q (T ) .... (2)
∴ p (T ) = 0 by (2)
⇒ p1 ( x) | p ( x ) by (3)
Hence proof.
Theorem :
If T ∈ A (V ) has all its charactristics roots in F, then there is a basis of V in which the matrix of
T is triangular.
Proof :
We prove this theorem by using induction on the dimensions of V over F.
If dimV = 1 then every element in A (V) is a scalar and also the theorem is true here.
Suppose that the theorem is true for all vectr spaces over F of dimension n–1 and let V be of
dimension n over F.
The linear transformation T on V has all its characteristic roots in F let λ1 ∈ F be a characteristic
root of T. There exists a non-zero vector v1 in V such that T ( v1 ) = λ1v1 .
Let V = V / W we know “If V is finite dimensional vector space and W be subspace of V then
dim V = dimV − dimW ”
T ( v + W ) = T (v) + W
56
T induces a linear transformation T on V whose minimal polynomial over F divides the
minimal polynomial of T over F.
Thus all the roots of the minimal polynomial of T , being roots of the minimal polynomial of T,,
must lie in F.
The linear transformation T in its action on V satisfies the hypothesis of the theorem.
Since V is (n–1) dimensional over F, by our induction hypothesis there is a basis v2 ,..., vn of
over F such that
[ Q v2 ,..., vn be a basis if they are lineary independent and v2 , v3 ,...., vn maps into these
elementry ∴ v2 , v3 ,...., vn linearly independent. Therefore we have if v1, v2 ,...., vn linearly
independent then Tv1, Tv2 ,...., Tvn linearly independent Now let v1, v2 ,...., vn and
α1,...., α n ∈ F such that α1v1 + .... + α nvn = 0 . i.e. α1 = 0 the this linearly independent we are throw
T ( v2 ) = α 21v1 + α 22 v2
∴T ( vi ) − α i1v1 + αi 2 v2 + ..... + α ii vi
The basis v1,...., vn of V over F provides us with a basis where every T ( vi ) is a linear
combination of vi and its precedessors in the basis. Therefore the matrix of T in the basis is triangular..
57
Theorem :
Proof : By previous theorem we can find a basis v1,...., vn of V over F such that
Equivalenty ( T − λ1 ) v1 = 0 , ( T − λ2 ) v2 = α 2 1v1....
2) Nilpotent Transformations
Lemma :
A1 0 L 0
0 A2 L 0
M
0 0 L An
where each Ai is an ni × ni matrix and is the matrix of the linear transformation induced by T on Vi .
58
(1) (1) (1) (2) (2) (2)
Choose a basis of V as follows : v1 , v2 ,...., vn1 is a basis of V1 , v1 , v2 ,...., vn2 is a basis
(n ) ( n ) (n )
of V2 and so on .... v1 , v2 ,...., vnn is a basis of Vn . Since each Vi is invariant under T.
T.
( )
T v j (i ) ∈ Vi so is a linear combination of v1 , v2 ,...., vni and only these. Thus the matrix of
(i ) (i ) (i )
T in the basis so chosen is of the desired form. That each Ai is the matrix of Ti , the linear transformation
induced on Vi by T is clear from the very definition of the matrix of a linear transformation.
Lemma :
Proof :
1 r −1
S S2 r −1 S
(α 0 + S ) − 2 + 3 + .... + ( −1) =1
α 0r
if S r = 0
α0 α0 α0
S r = (α1T +α 2T 2 + .... + α mT m ) = 0
r
QT r = 0
Definition :
M n1 0 L 0
0 M n2 0
M O
0 0 L M nr
59
Proof : T ∈ A (V ) is nilpotent with index of nilpotence n1 .
∴T n1 = 0 but T n1−1 ≠ 0
We can find a vector v ∈ V such that vT n1−1 ≠ 0 .
Suppose that these are linearly dependent i.e. there are scalars α1, α 2 ,...., α ni ∈ F not all
zero such that
( )
T s −1 α s + α s +1T + .... + α n1T n1 −s ( v ) = 0
∴V1 is invariant under T and have basis v1, v2 ,...., vn1 , T can be induces the linear transformation
So far we have produced the upper left-hand corner of the matrix of the theorem.
60
Lemma :
Proof : Since u ∈ V ,
Lemma :
Proof :
Let W be a subspace of V of largest possible dimension such that
Since T n1 = 0 , there exists an integer k, 0 < k < n, such that T k ( z ) ∈V 1 +W and such that
0 = T k ( z) = T
n1 − k
(T k ( z ) ) = T n1− k ( u ) + T n1− k ( w)
However, since both V1 and W are invariant under T, T n1− k ( u ) ∈ V1 and T ni − k ( w ) ∈ W .
T n1− k (u ) = − Tn1− k ( w) V1 I W = ( 0) ⇒ T ni −k ( u ) = 0
We know “If u ∈ V1 is nsuch that T n1− k u = 0 where 0 < k ≤ n1 then u = T k u0 for some
u0 ∈ V1 .”
61
We have T k ( u0 ) = u for some u0 ∈ V1 .
Therefore T k ( z) = u + w = T k u0 + w
W ⊂ W1 , the dimension of W1 must be larger than that of W, moreover, since T k ( z1 ) ∈W and since
W is invariant under T, W1 must be invariant under T..
Problem : Let V = V1 ⊕ W where W is invariant under T when we can find a basis of V, so that
M n1 0
matrix representation of T in this basis is of the form .
0 A2
62
Definition : The integers n1, n2 ,....., nr are called the invariants of T..
1) T m ( M ) = ( 0 ) , T m−1 ( M ) ≠ ( 0 )
Lemma : If M of dimension m, is cyclic with respect to T, then the dimension of T k ( M ) is m–k for
all k ≤ m .
Example : For a nilpotent T in A (V). Find integers n1 ≥ n2 ≥ ..... ≥ nr and subspaces V1, V2 ,...., Vr of
V cyclic with respect to T and of dimensions n1, n2 ,....., nr respectively such that V = V1 ⊕ ..... ⊕ V2 .
Show that these are unique integers.
∴T n1 = 0 and T n1−1 ≠ 0
63
Suppose at the last we get V1, V2 ,...., Vr be subspace each invariant under T and
Suppose there are other integers m1 ≥ m2 ≥ .... ≥ ms and subspace U1 ,....,U s of V cyclic
w.r.t. to T and of dimensions m1 ,...., ms respectively such that V = U1 ⊕ ..... ⊕ Us .
Claim :
s = r and m1 = n1,...., mr = nr
Suppose that this were not the case then there is a first integer i such that mi ≠ ni we may
assume that mi < ni .
Therefore above
∴ dim (T i V ) ≥ ( n1 − mi ) + ( n2 − mi ) + .... +( ni − mi )
m
.... (1)
Thus dim T
mi
(V ) = ( m1 − mi ) + ( m2 − mi ) + .... + ( mi −1 − mi )
Contradict to the equation (1) Q ni − mi > 0 thus there is unique set of integers
n1 ≥ n2 ≥ ..... ≥ nr such that V = V1 ⊕ ..... ⊕ Vr .
Equivalently we have shown that the invariants of T are unique.
Theorem : Two nilpotent linear transformation are similar if and only if they have the same invariants.
Proof : The above Example has proved that if the two nilpotent linear transformations have different
invariants, then they cannot be similar for their respective matrices.
64
M n1 M m1
M n2 M m2
and
O O cannot be similar.
M nr M mr
In the other direction, if the two nilpotent linear transform S and T have the same invariants
n1 ≥ ..... ≥ nr .
We know the result “If T ∈ A (V ) is nilpotent of index of nilpotent n1 then basis of V can be
M n1
O when n ≥ ..... ≥ n and
found such that the matrix of T in this basis has of the form
M nr
1 r
n1 + n2 + ..... + nr = dimV .”
Therefore there are basis v1, v2 ,.....vn and w1,.....wn of V such that the matrix of S in v1,.....vn
M n1
O .
and that of T in w1 ,....., wn are each equal to
M nr
0 1 1
Example : Let m (T ) = 0 0 0 ∈ F3 . Find similar matrix. T act on F (3) .
0 0 0
Let v1 = u1 , v2 = T ( u1 ) = u2 + u3 , v3 = u3
w.r.t. this basis
1 0 0 0 1 1 1 0 0
1 0 0
ATA = 0 1 1 0 0 0 0 0 0
−1
A = 0 1 1 and
0 0 1 0 0 0 0 1 1
0 0 1
65
A Decomposition of V : Jordan Form
Example : Let V be finite dimensional vector space over F, T ∈ A (V ) , V subspace of V invariant
under T. T induces a linear transformation T1 on V1 defined by T1 ( u ) = T ( u ) for every u ∈ V1 . Show
that for any polynomial q ( x ) ∈ F [ x ] , the linear transformation induced by q ( T ) on V1 is precisely
q ( T1 ) . In particular q ( T ) = 0 then q ( T1 ) = 0 .
Lemma : Suppose that V = V1 ⊕ V2 where V1 and V2 are subspaces of V, invariant under T. Let T1
and T2 be the linear transformations induced by T on V1 and V2 respectively. If the minimal polynomial
of T1 over F is p1 ( x ) while that of T2 is p2 ( x ) , then the minimal polynomial for T over F is the least
common multiple of p1 ( x ) and p2 ( x ) .
Proof : If p ( x ) is the minimal polynomial for T over F, as we know above example both p ( T1 ) and
p ( T2 ) are zero, whence p1 ( x) | p ( x ) and p2 ( x ) | p ( x ) . But then the least common multiple of
p1 ( x ) and p2 ( x ) must also divide p ( x ) .
q ( T ) ( v1 ) = q ( T1 )( v1 ) = 0 similarly for v2 ∈ V 2
q ( T ) ( v ) = q (T ) ( v1 + v2 ) = q ( T ) ( v1 ) + q ( T ) ( v2 ) = 0
⇒ p ( x) = q ( x)
66
Corollary :
If V = V1 ⊕ ..... ⊕ Vk where each Vi is invariant under K and T if pi ( x ) is the minimal
polynomial over F of Ti the linear transformation induced by T on Vi then minimal polynomial of T
over F is least common multiple of p1 ( x) , p2 ( x ) ,..., pk ( x ) .
Theorem :
If k = 1 the V = V1 and there is nothing that needs proving suppoose then that k > 1 and
l lk
h1 ( x ) = q2 ( x ) 2 ....qk ( x )
l l lk
h2 ( x ) = q1 ( x ) 1 q3 ( x ) 3 ....qk ( x )
M
lj
hi ( x ) = ∏ q j ( x )
i≠ j
M
l l k −1
hk ( x ) = q1 ( x ) 1 ....qk −1 ( x )
l l
qi (T ) i ( w) = qi (T ) i ( hi (T ) v ) = p ( T ) ( v ) = 0
hi ( T) ( v j ) = 0
67
The polynomials h1 ( x ) , h2 ( x ) ,...., hk ( x ) are relatively prime.
v = v ⋅ 1 = ( a1 ( T ) ) h1 (T ) + .... + ak ( T ) hk ( T ) ( v )
= a1 (T ) h1 ( T ) ( v ) + .... + ak ( T ) hk ( T) ( v )
Thus Vi = V1 + V2 + .... + Vk .
We must now verify that this sum is a direct sum. To show this, it is enough to prove that if
u1 + u2 + .... + uk = 0 with each ui = 0 . So suppose that u1 + u2 + .... + uk = 0 and that some ui say
But q1 ( T ) l1 ( u1 ) = 0 and since h1 ( x ) and q1 ( x ) are relatively prime we are led to u1 = 0 which is
of course in consistent with assumption that u1 ≠ 0 .
l
By definition of Vi , since qi (T ) li Vi = 0 , qi (Ti ) i = 0 whence the minimal equation of Ti
The minimal polynomial of T over F is the least common multiple of qi ( x ) fi ,...., qk ( x ) f k and
68
Note : If all the characteristic roots of T should happen to lie in F, then the minimal polynomial of T
l lk
takes on the especially nice form q ( x ) = ( x − λ1 ) 1 .... ( x − λk ) where λ1 ,...., λk are the distinct
characteristic roots of T. The irreducible factors qi ( x ) are x − λi . Noe that on Vi , Ti has only λi as
a characteristic root.
Corollary :
If all the distinct characteristic roots λi ,...., λk of T lie in F, then V can be written as
{ l
}
V = V1 ⊕ ..... ⊕ Vk where Vi = u ∈ V | (T − λi ) i v = 0 and where Ti has only one characteristic
root λi on Vi .
Note :
QV = V1 ⊕ ..... ⊕ Vk if dimVi = ni then we can find a basis of V such that in this basis the
A1
A2
matrix of T is of the form O where each Ai is an ni × ni matrix and is in fact the
Ak
matrix of Ti .
0 1 0 L 0 0
0 0 1 M M
= Mn
Notation M 0 1
0 0
L 0 n×n
λ 1 0 L 0
0 λ 1 L 0
M
Definition : The matrix 0
0 0 L 1 with λ ' s on the diagonal, 1 is on the super diagonal
0 0 L λ
0
69
Theorem : Let T ∈ Ap (V ) have all its distinct roots, λi ,...., λk in F. Then a basis of V can be fund in
J1 Bi1
J2
Bi 2
which the matrix T is of the form O where each J i = and
O
J k
Bir
where Bi1,....Bi ri are basic Jordan blocks belonging to λi .
0 1 0 L 0 0
0 0 1 L 0 0
Mm = M
0 0 0 L 0 1
0 0 0 L 0 0 m×m
We know that “If V = V1 ⊕V2 ⊕..... ⊕Vk where each subspace Vi is of dimension ni and is
invariant under T, an element of A (V), then a basis of V can be found so that the matrix of T in this
A1 0 L 0
0 A2 L 0
basis is of the form M .
0 0 L Ak
Where each Ai is an ni × ni matrix and is the matrix of the linear transformation induced by T
on Vi .”
Also we know that “If all the distinct characteristic roots λi ,...., λk of T lie in F, then V can be
{ l
}
written as V = V1 ⊕V2 ⊕..... ⊕Vk where Vi = v ∈ V | ( T − λi ) i ( v ) = 0 and where Ti has only
Therefore, we can reduce to the case when T has only one characteristic root λ , that is T − λ
is nilpotent.
70
Thus T = λ + (T − λ ) and since T − λ is nilpotent, there is a basis in which its matrix is of the
M n1 0
O
form
0 M nr
M n1 0 L 0
0 M n2 L 0
such that the matrix of T in this basis has the form M .
0 0 L M nr
λ M
n1 Bn1
λ
+ O = O
O
M nr Bnr
λ
Using the first remark made in this proof about the relation of a basic Jordan block and the
Mm’s we have the required.
Example :
−2 5 1 0
−2 4 1 0
A=
1) −1 2 1 0
−1 2 0 1
rank of A − λ I is 2.
Therefore, geometric multiplicity of equation is 2.
Hence there are two Jordan blocks.
71
The minimal polynomial is ( x − 1) 3
Example : Let T ∈ A (V ) and F [ x ] ring of polynomials in x over F and define for any f ( x ) in
Problems :
1 1 1
1. Prove that the matrix −1 − 1 − 1 is nilpotent, and find its invariants and Jordan form.
1 1 0
2. Find all possible Jordan forms for all 8 × 8 matrices having x 2 ( x − 1) 3 as minimal polynomial.
3. If the multiplicity of each characteristic root of T is 1, and if all the characteristic roots of T are
in F, prove that T is diagonalizable over F.
❏❏❏
72