Intro To Galois Theory
Intro To Galois Theory
Intro To Galois Theory
Contents
3 Introduction to Galois Theory 2
3.1 Rings and Fields . . . . . . . . . . . . . . . . . . . . . . . . . 2
3.2 Ideals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.3 Quotient Rings and Homomorphisms . . . . . . . . . . . . . . 5
3.4 The Characteristic of a Ring . . . . . . . . . . . . . . . . . . . 7
3.5 Polynomial Rings . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.6 Gauss’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.7 Eisenstein’s Irreducibility Criterion . . . . . . . . . . . . . . . 12
3.8 Field Extensions and the Tower Law . . . . . . . . . . . . . . 12
3.9 Algebraic Field Extensions . . . . . . . . . . . . . . . . . . . . 14
3.10 Ruler and Compass Constructions . . . . . . . . . . . . . . . . 16
3.11 Splitting Fields . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.12 Normal Extensions . . . . . . . . . . . . . . . . . . . . . . . . 24
3.13 Separability . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.14 Finite Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.15 The Primitive Element Theorem . . . . . . . . . . . . . . . . . 30
3.16 The Galois Group of a Field Extension . . . . . . . . . . . . . 31
3.17 The Galois correspondence . . . . . . . . . . . . . . . . . . . . 33
3.18 Quadratic Polynomials . . . . . . . . . . . . . . . . . . . . . . 35
3.19 Cubic Polynomials . . . . . . . . . . . . . . . . . . . . . . . . 35
3.20 Quartic Polynomials . . . . . . . . . . . . . . . . . . . . . . . 36
3.21 The Galois group of the polynomial x4 − 2 . . . . . . . . . . . 37
3.22 The Galois group of a polynomial . . . . . . . . . . . . . . . . 39
3.23 Solvable polynomials and their Galois groups . . . . . . . . . . 39
3.24 A quintic polynomial that is not solvable by radicals . . . . . 43
1
3 Introduction to Galois Theory
3.1 Rings and Fields
Definition A ring consists of a set R on which are defined operations of
addition and multiplication satisfying the following axioms:
• x+y = y+x for all elements x and y of R (i.e., addition is commutative);
• (x + y) + z = x + (y + z) for all elements x, y and z of R (i.e., addition
is associative);
• there exists an an element 0 of R (known as the zero element) with the
property that x + 0 = x for all elements x of R;
• given any element x of R, there exists an element −x of R with the
property that x + (−x) = 0;
• x(yz) = (xy)z for all elements x, y and z of R (i.e., multiplication is
associative);
• x(y + z) = xy + xz and (x + y)z = xz + yz for all elements x, y and z
of R (the Distributive Law ).
Lemma 3.2 Let R be a ring. Then (−x)y = −(xy) and x(−y) = −(xy) for
all elements x and y of R.
2
A ring R is said to be unital if it possesses a (necessarily unique) non-zero
multiplicative identity element 1 satisfying 1x = x = x1 for all x ∈ R.
3
Proof A field is a unital commutative ring. Let x and y be non-zero elements
of a field K. Then there exist elements x−1 and y −1 of K such that xx−1 = 1
and yy −1 = 1. Then xyy −1 x−1 = 1. It follows that xy 6= 0, since 0(y −1 x−1 ) =
0 and 1 6= 0.
3.2 Ideals
Definition Let R be a ring. A subset I of R is said to be an ideal of R if
0 ∈ I, a + b ∈ I, −a ∈ I, ra ∈ I and ar ∈ I for all a, b ∈ I and r ∈ R. An
ideal I of R is said to be a proper ideal of R if I 6= R.
Lemma 3.4 A unital commutative ring R is a field if and only if the only
ideals of R are {0} and R.
4
We denote by (f1 , f2 , . . . , fk ) the ideal of R generated by any finite subset
{f1 , f2 , . . . , fk } of R. We say that an ideal I of the ring R is finitely generated
if there exists a finite subset of I which generates the ideal I.
Proof Let I be the subset of R consisting of all these finite sums. If J is any
ideal of R which contains the set X then J must contain each of these finite
sums, and thus I ⊂ J. Let a and b be elements of I. It follows immediately
from the definition of I that 0 ∈ I, a + b ∈ I, −a ∈ I, and ra ∈ I for all
r ∈ R. Also ar = ra, since R is commutative, and thus ar ∈ I. Thus I
is an ideal of R. Moreover X ⊂ I, since the ring R is unital and x = 1x
for all x ∈ X. Thus I is the smallest ideal of R containing the set X, as
required.
Lemma 3.6 Every ideal of the ring Z of integers is generated by some non-
negative integer n.
Proof The zero ideal is of the required form with n = 0. Let I be some
non-zero ideal of Z. Then I contains at least one strictly positive integer
(since −m ∈ I for all m ∈ I). Let n be the smallest strictly positive integer
belonging to I. If j ∈ I then we can write j = qn + r for some integers q
and r with 0 ≤ r < n. Now r ∈ I, since r = j − qn, j ∈ I and qn ∈ I.
But 0 ≤ r < n, and n is by definition the smallest strictly positive integer
belonging to I. We conclude therefore that r = 0, and thus j = qn. This
shows that I = nZ, as required.
5
x, x0 , y and y 0 are elements of R satisfying I + x = I + x0 and I + y = I + y 0
then
(x + y) − (x0 + y 0 ) = (x − x0 ) + (y − y 0 ),
xy − x0 y 0 = xy − xy 0 + xy 0 − x0 y 0 = x(y − y 0 ) + (x − x0 )y 0 .
6
The verification of the following result is a straightforward exercise.
Proof Let p = charR. Clearly p 6= 1. Suppose that p > 1 and p = jk, where
j and k are positive integers. Then (j.1)(k.1) = (jk).1 = p.1 = 0. But R is
an integral domain. Therefore either j.1 = 0, or k.1 = 0. But if j.1 = 0 then
p divides j and therefore j = p. Similarly if k.1 = 0 then k = p. It follows
that p is a prime number, as required.
a0 + a1 x + a2 x 2 + a3 x 3 + · · · ,
7
where the coefficients a0 , a1 , a2 , a3 , . . . of the polynomial are elements of the
ring R and only finitely many of these coeffients are non-zero. If ak = 0 then
the term ak xk may be omitted when writing down the expression defining
the polynomial. Therefore every polynomial can therefore be represented by
an expression of the form
a0 + a1 x + a2 x 2 + · · · + am x m
a0 + a1 x + a2 x 2 + · · · + am x m
f (x) = a0 + a1 x + a2 x2 + a3 x3 + · · ·
and
g(x) = b0 + b1 x + b2 x2 + b3 x3 + · · ·
then
and
f (x)g(x) = u0 + u1 x + u2 x2 + u3 x3 + · · ·
where, for each integer i, the coefficient ui of xi in f (x)g(x) is the sum
of the products aj bk for all pairs (j, k) of non-negative integers satisfying
j + k = i. (Thus u0 = a0 b0 , u1 = a0 b1 + a1 b0 , u2 = a0 b2 + a1 b1 + a2 b0
etc.). Straightforward calculations show that the set R[x] of polynomials
with coefficients in a ring R is itself a ring with these operations of addition
and multiplication. The zero element of this ring is the polynomial whose
coefficients are all equal to zero.
We now consider various properties of polynomials whose coefficients be-
long to a field K (such as the field of rational numbers, real numbers or
complex numbers).
8
Lemma 3.10 Let K be a field, and let f ∈ K[x] be a non-zero polynomial
with coefficients in K. Then, given any polynomial h ∈ K[x], there exist
unique polynomials q and r in K[x] such that h = f q + r and either r = 0
or else deg r < deg f .
Proof If deg h < deg f then we may take q = 0 and r = h. In general we
prove the existence of q and r by induction on the degree deg h of h. Thus
suppose that deg h ≥ deg f and that any polynomial of degree less than deg h
can be expressed in the required form. Now there is some element c of K
for which the polynomials h(x) and cf (x) have the same leading coefficient.
Let h1 (x) = h(x) − cxm f (x), where m = deg h − deg f . Then either h1 = 0
or deg h1 < deg h. The inductive hypothesis then ensures the existence
of polynomials q1 and r such that h1 = f q1 + r and either r = 0 or else
deg r < deg f . But then h = f q + r, where q(x) = cxm + q1 (x). We now
verify the uniqueness of q and r. Suppose that f q + r = f q + r, where
q, r ∈ K[x] and either r = 0 or deg r < deg f . Then (q − q)f = r − r. But
deg((q − q)f ) ≥ deg f whenever q 6= q, and deg(r − r) < deg f whenever
r 6= r. Therefore the equality (q − q)f = r − r cannot hold unless q = q and
r = r. This proves the uniqueness of q and r.
Any polynomial f with coefficients in a field K generates an ideal (f )
of the polynomial ring K[x] consisting of all polynomials in K[x] that are
divisible by f .
Lemma 3.11 Let K be a field, and let I be an ideal of the polynomial ring
K[x]. Then there exists f ∈ K[x] such that I = (f ), where (f ) denotes the
ideal of K[x] generated by f .
Proof If I = {0} then we can take f = 0. Otherwise choose f ∈ I such
that f 6= 0 and the degree of f does not exceed the degree of any non-zero
polynomial in I. Then, for each h ∈ I, there exist polynomials q and r in K[x]
such that h = f q + r and either r = 0 or else deg r < deg f . (Lemma 3.10).
But r ∈ I, since r = h − f q and h and f both belong to I. The choice of f
then ensures that r = 0 and h = qf . Thus I = (f ).
Definition Polynomials f1 , f2 , . . . , fk with coefficients in some field K. are
said to be coprime if there is no non-constant polynomial that divides all of
them.
Theorem 3.12 Let f1 , f2 , . . . , fk be coprime polynomials with coefficients in
some field K. Then there exist polynomials g1 , g2 , . . . , gk with coefficients in
K such that
f1 (x)g1 (x) + f2 (x)g2 (x) + · · · + fk (x)gk (x) = 1.
9
Proof Let I be the ideal in K[x] generated by f1 , f2 , . . . , fk . It follows from
Lemma 3.11 that the ideal I is generated by some polynomial d. Then d
divides all of f1 , f2 , . . . , fk and is therefore a constant polynomial, since these
polynomials are coprime. It follows that I = K[x]. The existence of the
required polynomials g1 , g2 , . . . , gk then follows using Lemma 3.5.
Proof Suppose that f does not divide g. We must show that f divides
h. Now the only polynomials that divide f are constant polynomials and
multiples of f . No multiple of f divides g. Therefore the only polynomials
that divide both f and g are constant polynomials. Thus f and g are coprime.
It follows from Proposition 3.12 that there exist polynomials u and v with
coefficients in K such that 1 = ug + vf . Then h = ugh + vf h. But f divides
ugh + vf h, since f divides gh. It follows that f divides h, as required.
Proposition 3.14 Let K be a field, and let (f ) be the ideal of K[x] generated
by an irreducible polynomial f with coefficients in K. Then K[x]/(f ) is a
field.
Proof Let I = (f ). Then the quotient ring K[x]/I is commutative and has
a multiplicative identity element I +1. Let g ∈ K[x]. Suppose that I +g 6= I.
Now the only factors of f are constant polynomials and constant multiples
of f , since f is irreducible. But no constant multiple of f can divide g, since
g 6∈ I. It follows that the only common factors of f and g are constant
polynomials. Thus f and g are coprime. It follows from Proposition 3.12
that there exist polynomials h, k ∈ K[x] such that f h + gk = 1. But then
(I +k)(I +g) = I +1 in K[x]/I, since f h ∈ I. Thus I +k is the multiplicative
inverse of I + g in K[x]/I. We deduce that every non-zero element of K[x]/I
is invertible, and thus K[x]/I is a field, as required.
10
Definition A polynomial with integer coefficients is said to be primitive if
there is no prime number that divides all the coefficients of the polynomial
11
3.7 Eisenstein’s Irreducibility Criterion
Proposition 3.17 (Eisenstein’s Irreducibility Criterion) Let
f (x) = a0 + a1 x + a2 x2 + · · · + an xn
be a polynomial of degree n with integer coefficients, and let p be a prime
number. Suppose that
• p does not divide an ,
• p divides a0 , a1 , . . . , an−1 ,
• p2 does not divide a0 .
Then the polynomial f is irreducible over the field Q of rational numbers.
Proof Suppose that f (x) = g(x)h(x), where g and h are polynomials with
integer coefficients. Let g(x) = b0 + b1 x + b2 x2 + · · · + br xr and h(x) =
c0 +c1 x+c2 x2 +· · ·+cs xs . Then a0 = b0 c0 . Now a0 is divisible by p but is not
divisible by p2 . Therefore exactly one of the coefficients b0 and c0 is divisible
by p. Suppose that p divides b0 but does not divide c0 . Now p does not divide
all the coefficients of g(x), since it does not divide all the coefficients of f (x).
Let j be the smallest value of i for which p does not divide bi . Then p divides
j−1
P
aj − bj c0 , since aj − bj c0 = bi cj−i and bi is divisible by p when i < j. But
i=0
bj c0 is not divisible by p, since p is prime and neither bj nor c0 is divisible by
p. Therefore aj is not divisible by p, and hence j = n and deg g ≥ n = deg f .
Thus deg g = deg f and deg h = 0. Thus the polynomial f does not factor
as a product of polynomials of lower degree with integer coefficients, and
therefore f is irreducible over Q (Proposition 3.16).
12
If L: K is a field extension then we can regard L as a vector space over
the field K. If L is a finite-dimensional vector space over K then we say that
the extension L: K is finite. The degree [L: K] of a finite field extension L: K
is defined to be the dimension of L considered as a vector space over K.
{xi yj : 1 ≤ i ≤ m and 1 ≤ j ≤ n}
13
is a basis of M , considered as a vector space over K. We conclude that the
extension M : K is finite, and
[M : K] = mn = [M : L][L: K],
as required.
Proof Let L: K be a finite field extension, and let n = [L: K]. Let α ∈ L.
Then either the elements 1, α, α2 , . . . , αn are not all distinct, or else these
elements are linearly dependent over the field K (since a linearly inde-
pendent subset of L can have at most n elements.) Therefore there exist
c0 , c1 , c2 , . . . , cn ∈ K, not all zero, such that
c0 + c1 α + c2 α2 + · · · + cn αn = 0.
14
Lemma 3.20 Let K be a field and let α be an element of some extension
field L of K. Suppose that α is algebraic over K. Then there exists a unique
irreducible monic polynomial m ∈ K[x], with coefficients in K, characterized
by the following property: f ∈ K[x] satisfies f (α) = 0 if and only if m divides
f in K[x].
Proof Suppose that the field extension K(α): K is finite. It then follows
from Lemma 3.19 that α is algebraic over K.
Conversely suppose that α is algebraic over K. Let R = {f (α) : f ∈
K[x]}. Now f (α) = 0 if and only if the minimum polynomial m of α over
K divides f . It follows that f (α) = 0 if and only if f ∈ (m), where (m) is
the ideal of K[x] generated by m. The ring homomorphism from K[x] to R
that sends f ∈ K[x] to f (α) therefore induces an isomorphism between the
quotient ring K[x]/(m) and the ring R. But K[x]/(m) is a field, since m is
irreducible (Proposition 3.14). Therefore R is a subfield of K(α) containing
K ∪ {α}, and hence R = K(α).
15
Let z ∈ K(α). Then z = g(α) for some g ∈ K[x]. But then there exist
polynomials l and f belonging to K[x] such that g = lm + f and either f = 0
or deg f < deg m (Lemma 3.10). But then z = f (α) since m(α) = 0.
Suppose that z = h(α) for some polynomial h ∈ K[x], where either h = 0
or deg h < deg m. Then m divides h−f , since α is a zero of h−f . But if h−f
were non-zero then its degree would be less than that of m, and thus h − f
would not be divisible by m. We therefore conclude that h = f . Thus any
element z of K(α) can be expressed in the form z = f (α) for some uniquely
determined polynomial f ∈ K[x] satisfying either f = 0 or deg f < deg m.
Thus if n = deg m then 1, α, α2 . . . , αn−1 is a basis of K(α) over K. It follows
that the extension K(α): K is finite and [K(α): K] = deg m, as required.
• the construction of the edge of a cube having twice the volume of some
given cube;
16
Definition Let P0 and P1 be the points of the Euclidean plane given by
P0 = (0, 0) and P1 = (1, 0). We say that a point P of the plane is constructible
using straightedge and compasses alone if P = Pn for some finite sequence
P0 , P1 , . . . , Pn of points of the plane, where P0 = (0, 0), P1 = (1, 0) and, for
each j > 1, the point Pj is one of the following:—
• the point at which a straight line joining two points belonging to the
set {P0 , P1 , . . . , Pj−1 } intersects a circle which is centred on a point of
this set and passes through another point of the set;
Constructible points of the plane are those that can be constructed from
the given points P0 and P1 using straightedge (i.e., unmarked ruler) and
compasses alone.
One can apply this criterion to show that there is no geometrical con-
struction that enables one to trisect an arbitrary angle using straightedge
and compasses alone. The same method can be used to show the impos-
sibility of ‘duplicating a cube’ or ‘squaring a circle’ using straightedge and
compasses alone.
17
Example We show that there is no geometrical construction for the trisec-
tion of an angle of π3 radians (i.e., 60◦ ) using straightedge and compasses
alone. Let a = cos π9 and b = sin π9 . Now the point (cos π3 , sin π3 ) (i.e, the
√
point ( 12 , 12 3)) is constructible. Thus if an angle of π3 radians could be tri-
sected using straightedge and compasses alone, then the point (a, b) would
be constructible. Now
cos 3θ = cos θ cos 2θ − sin θ sin 2θ = cos θ(cos2 θ − sin2 θ) − 2 sin2 θ cos θ
= 4 cos3 θ − 3 cos θ
Lemma 3.24 If the endpoints of any line segment in the plane are con-
structible, then so is the midpoint.
18
Proof Let P and Q be constructible points in the plane. Let S and T be the
points where the circle centred on P and passing through Q intersects the
circle centred on Q and passing through P . Then S and T are constructible
points in the plane, and the point R at which the line ST intersects the
line P Q is the midpoint of the line segment P Q. Thus this midpoint is a
constructible point.
Lemma 3.25 If any three vertices of a parallelogram in the plane are con-
structible, then so is the fourth vertex.
Theorem 3.26 Let K denote the set of all real numbers x for which the
point (x, 0) is constructible using straightedge and compasses alone. Then K
is a subfield of the field of real numbers, and a point (x, y) of the plane is
constructible using straightedge and compass √ alone if and only if x ∈ K and
y ∈ K. Moreover if x ∈ K and x > 0 then x ∈ K.
19
since it is the fourth vertex of a parallelogram which has three vertices at the
constructible points (x, 0), (0, y) and (0, 1) (Lemma 3.25). But the line which
passes through the two constructible points (0, y) and (x, y − 1) intersects
the x-axis at the point (xy, 0). Therefore the point (xy, 0) is constructible,
and thus xy ∈ K.
Now suppose that x ∈ K, y ∈ K and y 6= 0. The point (x, 1 − y) is
constructible, since it is the fourth vertex of a parallelogram with vertices
at the constructible points (x, 0), (0, y) and (0, 1). The line segment joining
the constructible points (0, 1) and (x, 1 − y) intersects the x-axis at the point
(xy −1 , 0). Thus xy −1 ∈ K.
The above results show that K is a subfield of the field of real numbers.
Moreover if x ∈ K and y ∈ K then the point (x, y) is constructible, since it is
the fourth vertex of a rectangle with vertices at the constructible points (0, 0),
(x, 0) and (0, y). Conversely, suppose that the point (x, y) is constructible.
We claim that the point (x, 0) is constructible and thus x ∈ K. This result is
obviously true if y = 0. If y 6= 0 then the circles centred on the points (0, 0)
and (1, 0) and passing through (x, y) intersect in the two points (x, y) and
(x, −y). The point (x, 0) is thus the point at which the line passing through
the constructible points (x, y) and (x, −y) intersects the x-axis, and is thus
itself constructible. The point (0, y) is then the fourth vertex of a rectangle
with vertices at the constructible points (0, 0), (x, 0) and (x, y), and thus is
itself constructible. The circle centred on the origin and passing though (0, y)
intersects the x-axis at (y, 0). Thus (y, 0) is constructible, and thus y ∈ K.
We have thus shown that a point (x, y) is constructible using straightedge
and compasses alone if and only if x ∈ K and y ∈ K.
Suppose that x ∈ K and that x > 0. Then 21 (1 − x) ∈ K. Thus if
C = (0, 12 (1 − x)) then C is a constructible point. Let (u, 0) be the point at
which the circle centred on C and passing through the constructible point
(0, 1) intersects the x-axis. (The circle does intersect the x-axis since it passes
through (0, 1) and (0, −x), and x > 0.) The radius of this circle is 12 (1 + x)),
and therefore 14 (1 − x)2 + u2 = 14 (1 + x)2 (Pythagoras’ Theorem.) But then
2
√ = x. But (u, 0) is a constructible point. Thus if x ∈ K and x > 0 then
u
x ∈ K, as required.
20
reduces to that of determining which regular polygons with an odd number
of sides are constructible. Moreover it is not difficult to reduce down to the
case where n is a power of some odd prime number.
Gauss discovered that a regular 17-sided polygon was constructible in
1796, when he was 19 years old. Techniques of Galois Theory show that the
regular n-sided polygon is constructible using straightedge and compass if
and only if n = 2s p1 p2 · · · pt , where p1 , p2 , . . . , pt are distinct Fermat primes:
a Fermat prime is a prime number that is of the form 2k +1 for some integer k.
If k = uv, where u and v are positive integers and v is odd, then 2k + 1 =
wv + 1 = (w + 1)(wv−1 − wv−2 + · · · − w + 1), where w = 2u , and hence
m
2k + 1 is not prime. Thus any Fermat prime is of the form 22 + 1 for some
non-negative integer m. Fermat observed in 1640 that Fm is prime when
m ≤ 4. These Fermat primes have the values F0 = 3, F1 = 5, F2 = 17,
F3 = 257 and F4 = 65537. Fermat conjectured that all the numbers Fm were
prime. However it has been shown that Fm is not prime for any integer m
between 5 and 16. Moreover F16 = 265536 + 1 ≈ 1020000 . Note that the five
Fermat primes 3, 5, 17, 257 and 65537 provide only 32 constructible regular
polygons with an odd number of sides.
It is not difficult to see that the geometric problem of constructing a
regular n-sided polygon using straightedge and compasses is equivalent to
the algebraic problem of finding a formula to express the nth roots of unity
in the complex plane in terms of integers or rational numbers by means of
algebraic formulae which involve finite addition, subtraction, multiplication,
division and the successive extraction of square roots. Thus the problem is
closely related to that of expressing the roots of a given polynomial in terms
of its coefficients by means of algebraic formulae which involve only finite
addition, subtraction, multiplication, division and the successive extraction
of pth roots for appropriate prime numbers p.
21
Definition Let L: K be a field extension, and let f ∈ K[x] be a polynomial
with coefficients in K. The field L is said to be a splitting field for f over K
if the following conditions are satisfied:—
• the polynomial f does not split over any proper subfield of L that
contains the field K.
We shall prove below that splitting fields always exist and that any two
splitting field extensions for a given polynomial over a field K are isomorphic.
Given any homomorphism σ: K → M of fields, we define
22
Proof Let g be an irreducible factor of f , and let L = K[x]/(g), where (g)
is the ideal of K[x] generated by g. For each a ∈ K let i(a) = a + (g). Then
i: K → L is a monomorphism. We embed K in L on identifying a ∈ K with
i(a).
Now L is a field, since g is irreducible (Proposition 3.14). Let α = x+(g).
Then g(α) is the image of the polynomial g under the quotient homomor-
phism from K[x] to L, and therefore g(α) = 0. But g is a factor of the
polynomial f . Therefore f (α) = 0, as required.
Corollary 3.29 Let K be a field and let f ∈ K[x]. Then there exists a
splitting field for f over K.
Proof We use induction on the degree deg f of f . The result is trivially true
when deg f = 1 (since f then splits over K itself). Suppose that the result
holds for all fields and for all polynomials of degree less than deg f . Now it
follows from Theorem 3.28 that there exists a field extension K1 : K of K and
an element α of K1 satisfying f (α) = 0. Moreover f (x) = (x − α)g(x) for
some polynomial g with coefficients in K(α). Now deg g < deg f . It follows
from the induction hypothesis that there exists a splitting field L for g over
K(α). Then f splits over L.
Suppose that f splits over some field M , where K ⊂ M ⊂ L. Then
α ∈ M and hence K(α) ⊂ M . But M must also contain the roots of g,
since these are roots of f . It follows from the definition of splitting fields
that M = L. Thus L is the required splitting field for the polynomial f over
K.
Any two splitting fields for a given polynomial with coefficients in a field K
are K-isomorphic. This result is a special case of the following theorem.
23
Let g and h be polynomials with coefficients in K1 . Now g(α) = h(α)
if and only if m divides g − h. Similarly σ∗ (g)(β) = σ∗ (h)(β) if and only if
σ∗ (m) divides σ∗ (g) − σ∗ (h). Therefore σ∗ (g)(β) = σ∗ (h)(β) if and only if
g(α) = h(α), and thus there is a well-defined isomorphism ϕ: K1 (α) → K2 (β)
which sends g(α) to σ∗ (g)(β) for any polynomial g with coefficients in K.
Now L1 and L2 are splitting fields for the polynomials f and σ∗ (f ) over the
fields K1 (α) and K2 (β) respectively, and [L1 : K1 (α)] < [L1 : K1 ]. The induc-
tion hypothesis therefore ensures the existence of an isomorphism τ : L1 → L2
extending ϕ: K1 (α) → K2 (β). Then τ : L1 → L2 is the required extension of
σ: K1 → K2 .
Note that a field extension L: K is normal if and only if, given any ele-
ment α of L, the minimum polynomial of α over K splits over L.
24
Proof Suppose that L: K is both finite and normal. Then there exist alge-
braic elements α1 , α2 , . . . , αn of L such that L = K(α1 , α2 , . . . , αn ) (Corol-
lary 3.22). Let f (x) = m1 (x)m2 (x) · · · mn (x), where mj ∈ K[x] is the mini-
mum polynomial of αj over K for j = 1, 2, . . . , n. Then mj splits over L since
mj is irreducible and L: K is normal. Thus f splits over L. It follows that
L is a splitting field for f over K, since L is obtained from K by adjoining
roots of f .
Conversely suppose that L is a splitting field over K for some polynomial
f ∈ K[x]. Then L is obtained from K by adjoining the roots of f , and
therefore the extension L: K is finite. (Corollary 3.22).
Let g ∈ K[x] be irreducible, and let M be a splitting field for the polyno-
mial f g over L. Then L ⊂ M and the polynomials f and g both split over
M . Let β and γ be roots of g in M . Now the polynomial f splits over the
fields L(β) and L(γ). Moreover if f splits over any subfield of M containing
K(β) then that subfield must contain L (since L is a splitting field for f over
K) and thus must contain L(β). We deduce that L(β) is a splitting field for
f over K(β). Similarly L(γ) is a splitting field for f over K(γ).
Now there is a well-defined K-isomorphism σ: K(β) → K(γ) which sends
h(β) to h(γ) for all polynomials h with coefficients in K, since two such poly-
nomials h1 and h2 take the same value at a root of the irreducible polyno-
mial g if and only if their difference h1 −h2 is divisible by g. This isomorphism
σ: K(β) → K(γ) extends to an K-isomorphism τ : L(β) → L(γ) between L(β)
and L(γ), since L(β) and L(β) are splitting fields for f over the field K(β) and
K(γ) respectively (Theorem 3.30). Thus the extensions L(β): K and L(γ): K
are isomorphic, and [L(β): K] = [L(γ): K]. But [L(β): K] = [L(β): L][L: K]
and [L(γ): K] = [L(γ): L][L: K] by the Tower Law (Theorem 3.18). It follows
that [L(β): L] = [L(γ): L]. In particular β ∈ L if and only if γ ∈ L. This
shows that that any irreducible polynomial with a root in L must split over
L, and thus L: K is normal, as required.
3.13 Separability
Let K be a field. We recall that nk is defined inductively for all integers n
and for all elements k of K so that 0k = 0 and (n + 1)k = nk + k for all
n ∈ Z and k ∈ K. Thus 1k = k, 2k = k + k, 3k = k + k + k etc., and
(−n)k = −(nk) for all n ∈ Z.
25
n
jcj xj−1 .
P
of f is defined by the formula (Df )(x) =
j=1
and hence (Df )(α) = 0. It follows that the minimum polynomial of α over
K is a non-constant polynomial with coefficients in K which divides both f
and Df .
Conversely let f ∈ K[x] be a polynomial with the property that f and
Df are both divisible by some non-constant polynomial g ∈ K[x]. Let L be
a splitting field for f over K. Then g splits over L (since g is a factor of f ).
Let α ∈ L be a root of g. Then f (α) = 0, and hence f (x) = (x − α)e(x)
for some polynomial e ∈ L[x]. On differentiating, we find that (Df )(x) =
e(x) + (x − α)De(x). But (Df )(α) = 0, since g(α) = 0 and g divides Df
in K[x]. It follows that e(α) = (Df )(α) = 0, and thus e(x) = (x − α)h(x)
for some polynomial h ∈ L[x]. But then f (x) = (x − α)2 h(x), and thus the
polynomial f has a repeated root in the splitting field L, as required.
26
Corollary 3.34 Let K be a field. An irreducible polynomial f is inseparable
if and only if Df = 0.
27
p
that p divides for all j satisfying 0 < j < p. But px = 0 for all x ∈ K,
j
since charK = p. Therefore (x + y)p = xp + y p for all x, y ∈ K. The identity
(xy)p = xp y p is immediate from the commutativity of K.
Corollary 3.38 There exists a finite field GF(pn ) of order pn for each prime
number p and positive integer n. Two finite fields are isomorphic if and only
if they have the same number of elements.
28
The field GF(pn ) is referred to as the Galois field of order pn .
The non-zero elements of a field constitute a group under multiplication.
We shall prove that all finite subgroups of the group of non-zero elements of
a field are cyclic. It follows immediately from this that the group of non-zero
elements of a finite field is cyclic.
For each positive integer n, we denote by ϕ(n) the number of integers
X x
satisfying 0 ≤ x < n that are coprime to n. We show that the sum ϕ(d)
d|n
of ϕ(d) taken over all divisors of a positive integer n is equal to n.
X
Lemma 3.39 Let n be a positive integer. Then ϕ(d) = n.
d|n
The set of all non-zero elements of a field is a group with respect to the
operation of multiplication.
Proof Let n be the order of the group G. It follows from Lagrange’s Theorem
that the order of every element of G divides n. For each divisor dX
of n, let ψ(d)
denote the number of elements of G that are of order d. Clearly ψ(d) = n.
d|n
Let g be an element of G of order d, where d is a divisor of n. The elements
1, g, g 2 , . . . , g d−1 are distinct elements of G and are roots of the polynomial
xd − 1. But a polynomial of degree d with coefficients in a field has at most
d roots in that field. Therefore every element x of G satisfying xd = 1 is g k
29
for some uniquely determined integer k satisfying 0 ≤ k < d. If k is coprime
to d then g k has order d, for if (g k )n = 1 then d divides kn and hence d
divides n. Conversely if g k has order d then d and k are coprime, for if e is
a common divisor of k and d then (g k )d/e = g d(k/e) = 1, and hence e = 1.
Thus if there exists at least one element g of G that is of order d then the
elements of G that are of order d are the elements g k for those integers k
satisfying 0 ≤ k < d that are coprime to d. It follows that if ψ(d) > 0 then
ψ(d) = ϕ(d), where ϕ(d) is the number of integers k satisfying 0 ≤ k < d
that are coprime to d. X
Now 0 ≤ ψ(d) ≤ ϕ(d) for each divisor d of n. But ψ(d) = n and
d|n
X
ϕ(d) = n. It follows that ψ(d) = φ(d) for each divisor d of n. In
d|n
particular ψ(n) = ϕ(n) ≥ 1. Thus there exists an element of G whose order
is the order n of G. This element generates G, and thus G is cyclic, as
required.
30
common root of g and h. It follows that x − γ is a highest common factor of
g and h in the polynomial ring K(θ)[x], and therefore γ ∈ K(θ). But then
β ∈ K(θ), since β = θ − cγ and c ∈ K. It follows that L = K(θ).
It now follows by induction on m that if L = K(α1 , α2 , . . . , αm ), where K
is infinite, α1 , α2 , . . . , αm are algebraic over K, and L: K is separable, then
the extension L: K is simple. Thus all finite separable field extensions are
simple, as required.
Proof It follows from the Primitive Element Theorem (Theorem 3.42) that
there exists some element α of L such that L = K(α). Let λ be an element
of L. Then λ = g(α) for some polynomial g with coefficients in K. But then
σ(λ) = g(σ(α)) for all σ ∈ Γ(L: K), since the coefficients of G are fixed by
σ. It follows that each automorphism σ in Γ(L: K) is uniquely determined
once σ(α) is known
If f be the minimum polynomial of α over K then f (σ(α)) = σ(f (α)) = 0
for all σ ∈ Γ(L: K) since the coefficients of f are in K and are therefore fixed
by σ. Thus σ(α) is a root of f . It follows that the order |Γ(L: K)| of the
Galois group is bounded above by the number of roots of f that belong to
L, and is thus bounded above by the degree deg f of f . But deg f = [L: K]
(Theorem 3.21). Thus |Γ(L: K)| ≤ [L: K], as required.
(x − α1 )(x − α2 ) · · · (x − αk ),
where α1 , α2 , . . . , αk are distinct and are the elements of the orbit of α under
the action of G on L.
31
Proof Let f (x) = (x − α1 )(x − α2 ) · · · (x − αm ). Then the polynomial f is
invariant under the action of G, since each automorphism in the group G
permutes the elements α1 , α2 , . . . , αk and therefore permutes the factors of
f amongst themselves. It follows that the coefficients of the polynomial f
belong to the fixed field K of G. Thus α is algebraic over K, as it is a root
of the polynomial f .
Now, given any root αi of f , there exists some σ ∈ G such that αi =
σ(α). Thus if g ∈ K[x] is a polynomial with coefficients in K which satisfies
g(α) = 0 then g(αi ) = σ(g(α)) = 0, since the coefficients of g are fixed by σ.
But then f divides g. Thus f is the minimum polynomial of α over K, as
required.
Proof It follows from Proposition 3.44 that, for each α ∈ L, the minimum
polynomial of α over K splits over L and has no multiple roots. Thus the
extension L: K is both normal and separable.
Let M be any field satisfying K ⊂ M ⊂ L for which the extension M : K
is finite. The extension M : K is separable, since L: K is separable. It follows
from the Primitive Element Theorem (Theorem 3.42) that the extension
M : K is simple. Thus M = K(α) for some α ∈ L. But then [M : K] is equal
to the degree of the minimum polynomial of α over K (Theorem 3.21). It
follows from Proposition 3.44 that [M : K] is equal to the number of elements
in the orbit of α under the action of G on L. Therefore [M : K] divides |G|
for any intermediate field M for which the extension M : K is finite.
Now let the intermediate field M be chosen so as to maximize [M : K].
If λ ∈ L then λ is algebraic over K, and therefore [M (λ): M ] is finite. It
follows from the Tower Law (Theorem 3.18) that [M (λ): K] is finite, and
[M (λ): K] = [M (λ): M ][M : K]. But M has been chosen so as to maximize
[M : K]. Therefore [M (λ): K] = [M : K], and [M (λ): M ] = 1. Thus λ ∈ M .
We conclude that M = L. Thus L: K is finite and [L: K] divides |G|.
The field extension L: K is a Galois extension, since it has been shown to
be finite, normal and separable. Now G ⊂ Γ(L: K) and |Γ(L: K)| ≤ [L: K]
(Lemma 3.43). Therefore |Γ(L: K)| ≤ [L: K] ≤ |G| ≤ |Γ(L: K)|, and thus
G = Γ(L: K) and |G| = [L: K], as required.
32
Theorem 3.46 Let Γ(L: K) be the Galois group of a finite field extension
L: K. Then |Γ(L: K)| divides [L: K]. Moreover |Γ(L: K)| = [L: K] if and only
if L: K is a Galois extension, in which case K is the fixed field of Γ(L: K).
Proof Let M be the fixed field of Γ(L: K). It follows from Theorem 3.45
that L: M is a Galois extension and |Γ(L: K)| = [L: M ]. Now [L: K] =
[L: M ][M : K] by the Tower Law (Theorem 3.18). Thus |Γ(L: K)| divides
[L: K]. If |Γ(L: K)| = [L: K] then M = K. But then L: K is a Galois
extension and K is the fixed field of Γ(L: K).
Conversely suppose that L: K is a Galois extension. We must show that
|Γ(L: K)| = [L: K]. Now the extension L: K is both finite and separable. It
follows from the Primitive Element Theorem (Theorem 3.42) that there exists
some element θ of L such that L = K(θ). Let f be the minimum polynomial
of θ over K. Then f splits over L, since f is irreducible and the extension
L: K is normal. Let θ1 , θ2 , . . . , θn be the roots of f in L, where θ1 = θ and
n = deg f . If σ is a K-automorphism of L then f (σ(θ)) = σ(f (θ)) = 0, since
the coefficients of the polynomial f belong to K and are therefore fixed by
σ. Thus σ(θ) = θj for some j. We claim that, for each root θj of f , there is
exactly one K-automorphism σj of L satisfying σj (θ) = θj .
Let g(x) and h(x) be polynomials with coefficients in K. Suppose that
g(θ) = h(θ). Then g − h is divisible by the minimum polynomial f of θ.
It follows that g(θj ) = h(θj ) for any root θj of f . Now every element of
L is of the form g(θ) for some g ∈ K[x], since L = K(θ). We deduce
therefore that there is a well-defined function σj : L → L with the property
that σj (g(θ)) = g(θj ) for all g ∈ K[x]. The definition of this function ensures
that it is the unique automorphism of the field L that fixes each element of
K and sends θ to θj .
Now the roots of the polynomial f in L are distinct, since f is irreducible
and L: K is separable. Moreover the order of the Galois group Γ(L: K) is
equal to the number of roots of f , since each root determines a unique element
of the Galois group. Therefore |Γ(L: K)| = deg f . But deg f = [L: K] since
L = K(θ) and f is the minimum polynomial of θ over K (Theorem 3.21).
Thus |Γ(L: K)| = [L: K], as required.
33
is irreducible over K and L: K is a normal extension. Also the roots of fK in
L are distinct, since L: K is a separable extension. But fM divides fK , since
fK (α) = 0 and the coefficients of fK belong to M . It follows that fM also
splits over L, and its roots are distinct. We deduce that the finite extension
L: M is both normal and separable, and is therefore a Galois extension.
The finite extension M : K is clearly separable, since L: K is separable.
Thus if M : K is a normal extension then it is a Galois extension.
34
3.18 Quadratic Polynomials
We consider the problem of expressing the roots of a polynomial of low degree
in terms of its coefficients. Then the well-known procedure for locating the
roots of a quadratic polynomial with real or complex coefficients generalizes
to quadratic polynomials with coefficients in a field K whose characteristic
does not equal 2. Given a quadratic polynomial ax2 + bx + c with coefficients
a and b belonging to some such field K, let us adjoin to K an element δ sat-
isfying δ 2 = b2 − 4ac. Then the polynomial splits over K(δ), and its roots are
(−b ± δ)/(2a). We shall describe below analogous procedures for expressing
the roots of cubic and quartic polynomials in terms of their coefficients.
f (u + v) = u3 + v 3 + (3uv − p)(u + v) − q.
where the two cube roots must be chosen so as to ensure that their product
is equal to 13 p. It follows that the cubic polynomial x3 − px − q splits over the
field K(, ξ, ω), where 2 = 14 q 2 − 27
1 3
p and ξ 3 = 12 q + and where ω satisfies
35
ω 3 = 1 and ω 6= 1. The roots of the polynomial in this extension field are α,
β and γ, where
p p p
α=ξ+ , β = ωξ + ω 2 , γ = ω2ξ + ω3 .
3ξ 3ξ 3ξ
Now let us consider the possibilities for the Galois group Γ(L: K), where
L is a splitting field for f over K. Now L = K(α, β, γ), where α, β and γ
are the roots of f . Also a K-automorphism of L must permute the roots
of f amongst themselves, and it is determined by its action on these roots.
Therefore Γ(L: K) is isomorphic to a subgroup of the symmetric group Σ3
(i.e., the group of permutations of a set of 3 objects), and thus the possibilities
for the order of Γ(L: K) are 1, 2, 3 and 6. It follows from Corollary 3.31 that
f is irreducible over K if and only if the roots of K are distinct and the
Galois group acts transitively on the roots of K. By considering all possible
subgroups of Σ3 it is not difficult to see that f is irreducible over K if and
only if |Γ(L: K)| = 3 or 6. If f splits over K then |Γ(L: K)| = 1. If f factors
in K[x] as the product of a linear factor and an irreducible quadratic factor
then |Γ(L: K)| = 2.
Let δ = (α−β)(α−γ)(β −γ). Then δ 2 is invariant under any permutation
of α β and γ, and therefore δ 2 is fixed by all automorphisms in the Galois
group Γ(L: K). Therefore δ 2 ∈ K. The element δ 2 of K is referred to as
the discriminant of the polynomial f . A straightforward calculation shows
that if f (x) = x3 − px − q then δ 2 = 4p3 − 27q 2 . Now δ changes sign under
any permutation of the roots α, β and γ that transposes two of the roots
whilst leaving the third root fixed. But δ ∈ K if and only if δ is fixed by all
elements of the Galois group Γ(L: K), in which case the Galois group must
induce only cyclic permutations of the roots α, β and γ. Therefore Γ(L: K)
is isomorphic to the cyclic group of order 3 if and only if f is irreducible
and the discriminant 4p3 − 27q 2 of f has a square root in the field K. If f
is irreducible but the discriminant does not have a square root in K then
Γ(L: K) is isomorphic to the symmetric group Σ3 , and |Γ(L: K)| = 6.
36
µ = (α + γ)(β + δ) = −(α + γ)2 ,
ν = (α + δ)(β + γ) = −(α + δ)2 .
It follows that g(x) = x3 + 2px2 + (p2 + 4r)x + q 2 . We can use the formulae
for the roots of a cubic polynomial to express the roots λ, µ and ν of g in
terms of the coefficients of f , and thus determine the roots α, β, γ and δ of
f in terms of the coefficients of f .
37
Q(ξ). Another application of Theorem 3.21 now shows that [L: Q(ξ)] =
[Q(ξ, i): Q(ξ)] = 2. It follows from the Tower Law (Theorem 3.18) that
[L: Q] = [L: Q(ξ)][Q(ξ): Q] = 8. Moreover the extension L: Q is a Galois
extension, and therefore its Galois group Γ(L: Q) is a group of order 8 (The-
orem 3.46).
Another application of the Tower Law now shows that [L: Q(i)] = 4,
since [L: Q] = [L: Q(i)][Q(i): Q] and [Q(i): Q] = 2. Therefore the minimum
polynomial of ξ over Q(i) is a polynomial of degree 4 (Theorem 3.21). But ξ is
a root of x4 −2. Therefore x4 −2 is irreducible over Q(i), and is the minimum
polynomial of ξ over Q(i). Corollary 3.31 then ensures the existence of an
automorphism σ of L that sends ξ ∈ L to iξ and fixes each element of Q(i).
Similarly there exists an automorphism τ of L that sends i to −i and fixes
each element of Q(ξ). (The automorphism τ is in fact the restriction to L
of the automorphism of C that sends each complex number to its complex
conjugate.)
Now the automorphisms σ, σ 2 , σ 3 and σ 4 fix i and therefore send ξ to
iξ, −ξ, −iξ and ξ respectively. Therefore σ 4 = ι, where ι is the identity
automorphism of L. Similarly τ 2 = ι. Straightforward calculations show
that τ σ = σ 3 τ , and (στ )2 = (σ 2 τ )2 = (σ 3 τ )2 = ι. It follows easily from this
that Γ(L: Q) = {ι, σ, σ 2 , σ 3 , τ, στ, σ 2 τ, σ 3 τ }, and Γ(L: Q) is isomorphic to the
dihedral group of order 8 (i.e., the group of symmetries of a square in the
plane).
The Galois correspondence is a bijective correspondence between the sub-
groups of Γ(L: Q) and subfields of L that contain Q. The subfield of L cor-
responding to a given subgroup of Γ(L: Q) is set of all elements of L that
are fixed by all the automorphisms in the subgroup. One can verify that
the correspondence between subgroups of Γ(L: Q) and their fixed fields is as
follows:—
Subgroup of Γ(L: Q) Fixed field
Γ(L: K) Q
{ι, σ, σ 2 , σ 3 } Q(i)
√
{ι, σ 2 , τ, σ 2 τ } Q( √2)
{ι, σ 2 , στ, σ 3 τ } √ 2)
Q(i
{ι, σ 2 } Q( 2, i)
{ι, τ } Q(ξ)
{ι, σ 2 τ } Q(iξ)
{ι, στ } Q((1 − i)/ξ)
{ι, σ 3 τ } Q((1 + i)/ξ)
{ι} Q(ξ, i)
38
3.22 The Galois group of a polynomial
Definition Let f be a polynomial with coefficients in some field K. The
Galois group ΓK (f ) of f over K is defined to be the Galois group Γ(L: K) of
the extension L: K, where L is some splitting field for the polynomial f over
K.
We recall that all splitting fields for a given polynomial over a field K
are K-isomorphic (see Theorem 3.30), and thus the Galois groups of these
splitting field extensions are isomorphic. The Galois group of the given poly-
nomial over K is therefore well-defined (up to isomorphism of groups) and
does not depend on the choice of splitting field.
Let f be a polynomial with coefficients in some field K and let the roots
of f is some splitting field L be α1 , α2 , . . . , αn . An element σ of Γ(L: K) is
a K-automorphism of L, and therefore σ permutes the roots of f . Moreover
two automorphism σ and τ in the Galois group Γ(L: K) are equal if and only
if σ(αj ) = τ (αj ) for j = 1, 2, . . . , n, since L = K(α1 , α2 , . . . , αn ). Thus the
Galois group of a polynomial can be represented as a subgroup of the group
of permutations of its roots. We deduce immediately the following result.
39
It follows from the definition above that a polynomial with coefficients in
a field K is solvable by radicals if and only if there exist fields K0 , K1 , . . . , Km
such that K0 = K, the polynomial f splits over Km , and, for each integer i
between 1 and m, the field Ki is obtained on adjoining to Ki−1 an element αi
with the property that αipi ∈ Ki−1 for some positive integer pi . Moreover we
can assume, without loss of generality that p1 , p2 , . . . , pm are prime numbers,
since an nth root α of an element of a given field can be adjoined that field
by successively adjoining powers αn1 , αn2 , . . . , αnk of α chosen such that n/n1
is prime, ni /ni−1 is prime for i = 2, 3, . . . , k, and nk = 1.
We shall prove that a polynomial with coefficients in a field K of charac-
teristic zero is solvable by radicals if and only if its Galois group ΓK (f ) over
K is a solvable group.
Let L be a field, and let p be a prime number that is not equal to the
characteristic of L. Suppose that the polynomial xp − 1 splits over L. Then
the polynomial xp − 1 has distinct roots, since its formal derivative pxp−1 is
non-zero at each root of xp − 1. An element ω of L is said to be a primitive
pth root of unity if ω p = 1 and ω 6= 1. The primitive pth roots of unity are
the roots of the polynomial xp−1 +xp−2 +· · ·+1, since xp −1 = (x −1)(xp−1 +
xp−2 + · · · + 1). Also the group of pth roots of unity in L is a cyclic group
over order p which is generated by any primitive pth root of unity.
Lemma 3.51 Let K be a field, and let p be a prime number that is not
equal to the characteristic of K. If ω is a primitive pth root of unity in
some extension field of K then the Galois group of the extension K(ω): K is
Abelian.
40
where αp = c and ω is some primitive pth root of unity. Now K(ω): K
is a normal extension, since K(ω) is a splitting field for the polynomial
xp − 1 over K (Theorem 3.32). On applying the Galois correspondence
(Theorem 3.48), we see that Γ(M : K(ω)) is a normal subgroup of Γ(M : K),
and Γ(M : K)/Γ(M : K(ω)) is isomorphic to Γ(K(ω): K). But Γ(K(ω): K) is
Abelian (Lemma 3.51). It therefore suffices to show that Γ(M : K(ω)) is also
Abelian.
Now the field M is obtained from K(ω) by adjoining an element α sat-
isfying αp = c. Therefore each automorphism σ in Γ(M : K(ω)) is uniquely
determined by the value of σ(α). Moreover σ(α) is also a root of xp − c, and
therefore σ(α) = αω j for some integer j. Thus if σ and τ are automorphisms
of M belonging to Γ(M : K(ω)), and if σ(α) = αω j and τ (α) = αω k , then
σ(τ (α)) = τ (σ(α)) = αω j+k , since σ(ω) = τ (ω) = ω. Therefore σ ◦ τ = τ ◦ σ.
We deduce that Γ(M : K(ω)) is Abelian, and thus Γ(M : K) is solvable, as
required.
41
Theorem 3.54 Let f be a polynomial with coefficients in a field K of char-
acteristic zero. Suppose that f is solvable by radicals. Then the Galois group
ΓK (f ) of f is a solvable group.
Proof The Galois group Γ(L: K) is a cyclic group of order p, since its order is
equal to the degree p of the extension L: K. Let σ be a generator of Γ(L: K),
let β be an element of L \ K, and let
αj = β0 + ω j β1 + ω 2j β2 + · · · + ω (p−1)j βp−1
α0 + α1 + α2 + · · · + αp−1 = pβ,
42
Proof Let ω be a primitive pth root of unity. Then ΓK(ω) (f ) is isomorphic
to a subgroup of ΓK (f ) (Lemma 3.49) and is therefore solvable (Proposi-
tion 2.49). Moreover f is solvable by radicals over K if and only if f is
solvable by radicals over K(ω), since K(ω) is obtained from K by adjoining
an element ω whose pth power belongs to K. We may therefore assume,
without loss of generality, that K contains a primitive pth root of unity for
each prime p that divides |ΓK (f )|.
The result is trivial when |ΓK (f )| = 1, since the polynomial f splits over
K. We prove the result by induction on the degree |ΓK (f )| of the Galois
group. Thus suppose that the result holds when the order of the Galois group
is less than |ΓK (f )|. Let L be a splitting field for f over K. Then L: K is
a Galois extension and Γ(L: K) ∼ = ΓK (f ). Now the solvable group Γ(L: K)
contains a normal subgroup H for which the corresponding quotient group
Γ(L: K)/H is a cyclic group of order p for some prime number p dividing
Γ(L: K). Let M be the fixed field of H. Then Γ(L: M ) = H and Γ(M : K) ∼ =
Γ(L: K)/H. (Theorem 3.48), and therefore [M : K] = |Γ(L: K)/H| = p. It
follows from Lemma 3.55 that M = K(α) for some element α ∈ M satisfying
αp ∈ K. Moreover ΓM (f ) ∼ = H, and H is solvable, since any subgroup of
a solvable group is solvable (Proposition 2.49). The induction hypothesis
ensures that f is solvable by radicals when considered as a polynomial with
coefficients in M , and therefore the roots of f lie in some extension field of
M obtained by successively adjoining radicals. But M is obtained from K by
adjoining the radical α. Therefore f is solvable by radicals, when considered
as a polynomial with coefficients in K, as required.
43
of f , and therefore |G| is divisible by p. It follows from a theorem of Cauchy
(Theorem 2.42) that G has an element of order p. Moreover an element of
G is determined by its action on the roots of f . Thus an element of G is of
order p if and only if it cyclically permutes the roots of f .
The irreducibility of f ensures that f has distinct roots (Corollary 3.35).
Let α1 and α2 be the two roots of f that are not real. Then α1 and α2 are
complex conjugates of one another, since f has real coefficients. We have
already seen that G contains an element of order p which cyclically permutes
the roots of f . On taking an appropriate power of this element, we obtain
an element σ of G that cyclically permutes the roots of f and sends α1 to
α2 . We label the real roots α3 , α4 , . . . , αp of f so that αj = σ(αj−1 ) for
j = 2, 3, 4, . . . , p. Then σ(αp ) = α1 . Now complex conjugation restricts to a
Q-automorphism τ of L that interchanges α1 and α2 but fixes αj for j > 2.
But if 2 ≤ j ≤ p then σ 1−j τ σ j−1 transposes the roots αj−1 and αj and fixes
the remaining roots. But transpositions of this form generate the whole of
the group of permutations of the roots. Therefore every permutation of the
roots of f is realised by some element of the Galois group G of f , and thus
G∼ = Σp , as required.
The above example demonstrates that there cannot exist any general
formula for obtaining the roots of a quintic polynomial from its coefficients in
a finite number of steps involving only addition, subtraction, multiplication,
division and the extraction of nth roots. For if such a general formula were
to exist then every quintic polynomial with rational coefficients would be
solvable by radicals.
44