Lecture note_Polynomial Theory
Lecture note_Polynomial Theory
COLLEGE OF EDUCATION
POLYNOMIAL THEORY
VINH 2024
CONTENTS
Contents 1
Introduction 2
1 Rings of Polynomials 3
1.1. Polynomials in One Variable . . . . . . . . . . . . . . . . . . 3
1.2. Division Algorithm . . . . . . . . . . . . . . . . . . . . . . . 9
2 Factoring Polynomials 18
2.1. Ideals of F[x] . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.2. Properties of the GCD . . . . . . . . . . . . . . . . . . . . . 19
2.3. Irreducible Polynomials . . . . . . . . . . . . . . . . . . . . . 21
3 Substitution in Polynomials 28
3.1. Polynomial Functions . . . . . . . . . . . . . . . . . . . . . . 28
3.2. Polynomials over Z and Q . . . . . . . . . . . . . . . . . . . . 31
3.3. Eisenstein’s Irreducibility Criterion . . . . . . . . . . . . . . . 34
3.4. Polynomials over R and C . . . . . . . . . . . . . . . . . . . . 35
3.5. Solution of Cubic and Quartic Equations by Formulas . . . . . 36
4 Extensions of Fields 60
4.1. Algebraic and Transcendental Elements . . . . . . . . . . . . . 60
4.2. The Minimum Polynomial . . . . . . . . . . . . . . . . . . . . 62
4.3. Basic Theorem on Field Extensions . . . . . . . . . . . . . . . 64
4.4. Degrees of field extensions . . . . . . . . . . . . . . . . . . . . 66
1
INTRODUCTION
2
CHAPTER 1
RINGS OF POLYNOMIALS
an π n + an−1 π n−1 + . . . + a1 π + a0
3
polynomial expressions in π with integer coefficients.
But the set of all the polynomial expressions in π with integer coefficients
is a ring (it is a ring of because it is obvious that the sum and product of
any two polynomials in π is again a polynomial in π). This ring contains
Z because every integer a is a polynomial with a constant term only, and
it also contains π.
Thus, if we wish to enlarge the ring Z by adjoining to it the new number
π, it turns out that the “next largest” ring after Z which contains as a
subring and includes π, is exactly the ring of all the polynomials in π with
coefficients in Z.
As this example shows, aside from their practical applications, polyno-
mials play an important role in the scheme of ring theory because they
are precisely what we need when we wish to enlarge a ring by adding new
elements to it.
In elementary algebra one considers polynomials whose coefficients are
real numbers, or in some cases, complex numbers. As a matter of fact,
the properties of polynomials are pretty much independent of the exact
nature of their coefficients. All we need to know is that the coefficients
are contained in some ring. For convenience, we will assume this ring is a
commutative ring with unity.
Let A be a commutative ring with unity. Up to now we have used letters
to denote elements or sets, but now we will use the letter x in a different
way. In a polynomial expression such as ax2 + bx + c, where a, b, c ∈ A, we
do not consider x to be an element of A, but rather x is a symbol which we
use in an entirely formal way. Later we will allow the substitution of other
things for x, but at present x is simply a placeholder.
Notationally, the terms of a polynomial may be listed in either ascending
or descending order. For example, 4x3 − 3x2 + x + 1 and 1 + x − 3x2 + 4x3
denote the same polynomial. In elementary algebra descending order is
preferred, but for our purposes ascending order is more convenient.
4
Let A be a commutative ring with unity, and x an arbitrary symbol.
Every expression of the form
a0 + a1 x + . . . + an xn
5
The familiar sigma notation for sums is useful for polynomials. Thus,
n
X
n
a(x) = a0 + . . . + an x = ak xk .
k=0
a(x) = a0 + . . . + an xn ,
b(x) = b0 + . . . + bn xn .
Here we do not assume that a(x) and b(x) have the same degree, but allow
ourselves to insert zero coefficients if necessary to achieve uniformity of
appearance.
We add polynomials by adding corresponding coefficients. Thus,
Note that the degree of a(x) + b(x) is less than or equal to the higher of
the two degrees, deg a(x) and deg b(x). Multiplication is more difficult, but
quite familiar:
6
If A is any ring, the symbol A[x] designates the set of all the polyno-
mials in x whose coefficients are in A, with addition and multiplication of
polynomials as we have just defined them.
1.1.1 Theorem. Let A be a commutative ring with unity. Then A[x] is a
commutative ring with unity.
Proof. To prove this theorem, we must show systematically that A[x] satis-
fies all the axioms of acommutative ring with unity. Throughout the proof,
let a(x), b(x), and c(x) stand for the following polynomials:
a(x) = a0 + . . . + an xn ,
b(x) = b0 + . . . + bn xn ,
c(x) = c0 + . . . + cn xn .
The axioms which involve only addition are easy to check: for example,
addition is commutative because
a(x) + b(x) = (a0 + b0 ) + (a1 + b1 )x + . . . + (an + bn )xn
= (b0 + a0 ) + (b1 + a1 )x + . . . + (bn + an )xn
= b(x) + a(x).
The associative law of addition is proved similarly, and is left as an exercise.
The zero polynomial has already been described, and the negative of a(x)
is
−a(x) = (−a0 ) + (−a1 )x + . . . + (−an )xn ,
To prove that multiplication is associative requires some care. Let
b(x)c(x) = d(x), where d(x) = d0 + d1 x + . . . + d2n x2n . By the definition of
polynomial multiplication, the k th coefficient of b(x)c(x) is
X
dk = b i cj .
i+j=k
Then a(x)[b(x)c(x)] = a(x)d(x) = e(x), where e(x) = e0 +e1 x+. . .+e3n x3n .
Now, the ℓth coefficient of a(x)d(x) is
X X X
ek = ah dk = ah ( bi cj ).
h+k=ℓ h+k=ℓ i+j=k
7
It is easy to see that the sum on the right consists of all the terms ah bi cj
such that h + i + j = ℓ. Thus,
X
ek = ah bi cj .
h+i+j=ℓ
a(x)[b(x)c(x)] = [a(x)b(x)]c(x).
To prove the distributive law, let a(x)[b(x) + c(x)] = d(x) where d(x) =
d0 + d1 x + . . . + d2n x2n . By the definitions of polynomial addition and mul-
tiplication, the k th coefficient a(x)[b(x) + c(x)] is
X X X X
dk = ai (bj + cj ) = ai bj + ai cj = ai b j + ai cj .
i+j=k i+j=k i+j=k i+j=k
Proof. If a(x) and b(x) are nonzero polynomials, we must show that their
product a(x)b(x) is not zero. Let an be the leading coefficient of a(x), and
bm the leading coefficient of b(x). By definition, an ̸= 0, and bm ̸= 0. Thus
an bm ̸= 0 because A is an integral domain. It follows that a(x)b(x) has a
nonzero coefficient (namely, an bm ), so it is not the zero polynomial.
8
If A is an integral domain, we refer to A[x] as a domain of polynomials,
because A[x] is an integral domain. Note that by the preceding proof, if
an and bm are the leading coefficients of a(x) and b(x), then an bm is the
leading coefficient of a(x)b(x). Thus, deg a(x)b(x) = n + m: In a domain
of polynomials A[x], where A is an integral domain,
x2 = (x − 2) (x + 2) + |{z}
|{z} 4 .
| {z } | {z }
a(x) b(x) c(x) r(x)
1.2.1 Theorem (Division algorithm for polynomials). If a(x) and b(x) are
polynomials over a field F,and b(x) ̸= 0, there exist polynomials q(x) and
9
r(x) over F such that
Proof. Let b(x) remain fixed, and let us show that every polynomial a(x)
satisfies the following condition: There exist polynomials q(x) and r(x) over
F such that
a(x) = b(x)q(x) + r(x),
10
where r(x) = 0 or deg r(x) < deg b(x). But then
an n−m
A(x) = a(x) + x b(x)
bm
an
= b(x)p(x) + r(x) + xn−m b(x)
bm
an n−m
= b(x)(p(x) + x ) + r(x).
bm
If we let p(x) + (an /bm )xn − m be renamed q(x), then
11
EXERCISES
12
equal to a polynomial of degree 4? Explain.
3. Write all the quadratic polynomials in Z5 [x]. How many are there?
How many cubic polynomials are there in Z5 [x]? More generally, how many
polynomials of degree m are there in Zn [x]?
4. Let A be an integral domain, prove the following statements:
(a) If (x + 1)2 = x2 + 1 in A[x], then A must have characteristic 2,
(b) If (x + 1)4 = x4 + 1 in A[x], then A must have characteristic 2,
(c) If (x + 1)6 = x6 + 2x3 + 1 in A[x], then A must have characteristic 3.
5. Find an example of each of the following in Z8 [x]: a divisor of zero,
an invertible element (find nonconstant examples).
6. Explain why x cannot be invertible in any A[x], hence no domain of
polynomials can ever be a field.
7. There are rings in which every element ̸= 0, 1 is a divisor of zero.
Explain why this cannot happen in any ring of polynomials A[x], even
when A is not an integral domain.
8. Show that in every A[x], there are elements ̸= 0, 1 which are not
idempotent, and elements ̸= 0, 1 which are not nilpotent.
13
find polynomials a(x) and b(x) such that deg a(x)b(x) < deg a(x)+deg b(x).
6. Show that if A is an integral domain, the only invertible elements in
A[x] are the constant polynomials with inverses in A. Then show that in
Z4 [x] there are invertible polynomials of all degrees.
7. Give all the ways of factoring x2 into polynomials of degree 1 in Z9 [x];
in Z5 [x]. Explain the difference in behavior.
8. Find all the square roots of x2 + x + 4 in Z5 [x]. Show that in Z8 [x],
there are infinitely many square roots of 1.
14
3. Let S be the set of all the polynomials a(x) in A[x] for which every
coefficient ai for odd i is equal to zero. Show that S is a subring of A[x].
Why is the same not true when “odd” is replaced by “even”?
4. Let J consist of all the elements in A[x] whose constant coefficient is
equal to zero. Prove that J is an ideal of A[x].
5. Let J consist of all the polynomials a0 + a1 x + . . . + an xn in A[x] such
that a0 + a1 + . . . + an = 0. Prove that J is an ideal of A[x].
6 Prove that the ideals in both parts 4 and 5 are prime ideals (assume
A is an integral domain).
h(a0 + a1 x + . . . + an xn ) = a0 + a1 cx + a2 c2 x2 + . . . + an cn xn .
15
kernel K. Define h : A[x] → B[x] by
A[x1 , x2 ] denotes the ring of all the polynomials in two letters x1 and x2
with coefficients in A. For example, x2 − 2xy + y 2 + x − 5 is a quadratic
polynomial in Q[x, y]. More generally, A[x1 , . . . , xn ] is the ring of the poly-
nomials in n letters x1 , . . . , xn with coefficients in A. Formally it is defined
as follows: Let A[x1 ] be denoted by A1 ; then A1 [x2 ] is A[x1 , x2 ]. Contin-
uing in this fashion, we may adjoin one new letter xi at a time, to get
A[x1 , . . . , xn ].
1. Prove that if A is an integral domain, then A[x1 , . . . , xn ] is an integral
domain.
2. Give a reasonable definition of the degree of any polynomial p(x, y)
in A[x, y] and then list all the polynomials of degree ≤ 3 in Z3 [x, y].
aij xi y j
P
Let us denote an arbitrary polynomial p(x, y) in A[x, y] by
P
where ranges over some pairs i, j of nonnegative integers.
16
3. Imitating the definitions of sum and product of polynomials in A[x],
give a definition of sum and product of polynomials in A[x, y].
4. Prove that deg a(x, y)b(x, y) = deg a(x, y) + deg b(x, y) if A is an
integral domain.
In the division algorithm, prove that q(x) and r(x) are uniquely deter-
mined. [HINT: Suppose
and subtract these two expressions, which are both equal to a(x).]
17
CHAPTER 2
FACTORING POLYNOMIALS
Just as every integer can be factored into primes, so every polynomial can
be factored into “irreducible” polynomials which cannot be factored further.
As a matter of fact, polynomials behave very much like integers when it
comes to factoring them. This is especially true when the polynomials have
all their coefficients in a field.
Throughout this chapter, we let F represent some field and we consider
polynomials over F. It will be found that F[x] has a considerable number
of properties in common with Z. To begin with, all the ideals of F[x] are
principal ideals, which was also the case for the ideals of Z. Note carefully
that in F[x], the principal ideal generated by a polynomial a(x) consists of
all the products a(x)s(x) as a(x) remains fixed and s(x) ranges over all the
members of F[x].
Proof. Let J be any ideal of F[x]. If J contains nothing but the zero poly-
nomial, J is the principal ideal generated by 0.
If there are nonzero polynomials in J, let b(x) be any polynomial of
lowest degree in J. We will show that J = ⟨b(x)⟩, which is to say that
every element of J is a polynomial multiple b(x)q(x) of b(x).
Indeed, if a(x) is any element of J, we may use the division algorithm to
write a(x) = b(x)q(x) + r(x), where r(x) = 0 or deg r(x) < deg b(x). Now,
r(x) = a(x) − b(x)q(x); but a(x) was chosen in J, and b(x) ∈ J; hence
18
b(x)q(x) ∈ J. It follows that r(x) is in J.
If r(x) ̸= 0, its degree is less than the degree of b(x). But this is im-
possible because b(x) is a polynomial of lowest degree in J. Therefore, of
necessity, r(x) = 0. Thus, finally, a(x) = b(x)q(x); so every member of J
is a multiple of b(x), as claimed.
a(x) = a0 + a1 x + . . . + an xn ,
then
a0 a1 an
a(x) = c( + x + . . . + xn ),
c c c
hence c | a(x). A polynomial a(x) is invertible iff it is a divisor of the unity
polynomial 1. But if a(x)b(x) = 1, this means that a(x) and b(x) both have
degree 0, that is, are constant polynomials: a(x) = a, b(x) = b, and ab = 1.
Thus, we have the following statement.
2.2.1 Proposition. The invertible elements of F[x] are all the nonzero
constant polynomials.
19
a(x) = b(x)c(x) and b(x) = a(x)d(x) for some c(x) and d(x). If this
happens to be the case, then
hence d(x)c(x) = 1 because F[x] is an integral domain. But then c(x) and
d(x) are constant polynomials, and therefore a(x) and b(x) are constant
multiples of each other. Thus, we have the following statement.
2.2.2 Proposition. In F[x], a(x) and b(x) are associates iff they are con-
stant multiples of each other.
(2) For any u(x) in F[x], if u(x) | a(x) and u(x) | b(x), then u(x) | d(x).
According to this definition, two different gcd’s of a(x) and b(x) divide
each other, that is, are associates. Of all the possible gcd’s of a(x) and
b(x), we select the monic one, call it the gcd of a(x) and b(x), and denote
it by gcd[a(x), b(x)]. It is important to know that any pair of polynomials
always has a greatest common divisor.
20
2.2.4 Theorem. Any two nonzero polynomials a(x) and b(x) in F[x] have
a gcd d(x). Furthermore, d(x) can be expressed as a “linear combination”
u(x)a(x) + v(x)b(x)
as u(x) and v(x) range over F[x], then J is an ideal of F[x], say the ideal
⟨d(x)⟩ generated by d(x). Now a(x) = la(x) + 0b(x) and b(x) = 0a(x) +
1b(x), so a(x) and b(x) are in J. But every element of J is a multiple of
d(x), so d(x) | a(x) and d(x) | b(x).
If k(x) is any common divisor of a(x) and b(x), this means there are
polynomials F(x) and g(x) such that a(x) = k(x)f (x) and b(x) = k(x)g(x).
Now, d(x) ∈ J, so d(x) can be written as a linear combination
d(x) = r(x)a(x) + s(x)b(x)
= r(x)k(x)f (x) + s(x)g(x)g(x)
= k(x)[r(x)f (x) + s(x)g(x)].
hence k(x) | d(x). This confirms that d(x) is the gcd of a(x) and b(x).
2.2.5 Definition. Polynomials a(x) and b(x) in F[x] are said to be rela-
tively prime if their gcd is equal to 1.
This is equivalent to saying that their only common factors are constants
in F.
21
Because b(x) and c(x) both have positive degrees, and the sum of their
degrees is deg a(x), each has degree less than deg a(x).
2.3.5 Corollary. Let q1 (x), . . . , qr (x) and p(x) be monic irreducible poly-
nomials. If p(x) | q1 (x), . . . qr (x), then p(x) is equal to one of the factors
q1 (x), . . . , qr (x).
If this were not true, we could choose a polynomial a(x) of lowest de-
gree among those which cannot be factored into irreducibles. If a(x) is
22
reducible, so a(x) = b(x)c(x) where b(x) and c(x) have lower degree than
a(x). But this means that b(x) and c(x) can be factored into irreducibles,
and therefore a(x) can also. This is contradictory.
The proof is the same, in all major respects, as the corresponding proof
for Z ; it is left as an exercise.
In the next chapter we will be able to improve somewhat on the last two
results in the special cases of R[x] and C[x]. Also, we will learn more about
factoring polynomials into irreducibles.
23
EXERCISES
24
field of n elements?
4 How many irreducible cubics are there over a field of n elements?
25
1. Prove that every common divisor of a(x) and b(x) is a common divisor
of b(x) and r1 (x).
It follows from part 1 that the gcd of a(x) and b(x) is the same as the
gcd of b(x) and r1 (x). This procedure can now be repeated on b(x) and
r1 (x); divide b(x) by r1 (x):
next
r1 (x) = r2 (x)q3 (x) + r3 (x),
..
.
Finally,
rn−1 (x) = rn (x)qn+1 (x) + 0.
In other words, we continue to divide each remainder by the succeeding
remainder. Since the remainders continually decrease in degree, there must
ultimately be a zero remainder. But we have seen that
Since rn (x) is a divisor of rn−1 (x), it must be the gcd of rn (x) and rn−1 .
Thus,
rn (x) = gcd[a(x), b(x)].
This method is called the euclidean algorithm for finding the gcd.
2. Find the gcd of x3 + 1 and x4 + x3 + 2x2 + x − 1. Express this gcd as
a linear combination of the two polynomials.
3. Do the same for x24 − 1 and x15 − 1.
4. Find the gcd of x3 + x2 + x + 1 and x4 + x3 + 2x2 + 2x in Z3 [x].
G. A Transformation of F[x]
Let G be the subset of F[x] consisting of all polynomials whose constant
term is nonzero. Let h : G → G be defined by
h(a0 + a1 x + . . . + an xn ) = an + an−1 x + . . . + a0 xn .
26
1. Prove that:
(a) h preserves multiplication, that is, h[a(x)b(x)] = h[a(x)]h[b(x)].
(b) h is injective and surjective and h.h = 1G .
(c) a0 + a1 x + . . . + an xn is irreducible iff an + an−1 x + . . . + a0 xn is
irreducible.
2. Let a0 + a1 x + . . . + an xn = (b0 + . . . + bm xm )(c0 + . . . + cq xq ). Factor
an + an−1 x + . . . + a0 xn .
3. Let a(x) = a0 + a1 x + . . . + an xn and b
a(x) = an + an−1 x + . . . + a0 xn .
If c ∈ F , prove that a(c) = 0 iff b
a(1/c) = 0.
27
CHAPTER 3
SUBSTITUTION IN POLYNOMIALS
28
If a(x) is a polynomial with coefficients in F, and c is an element of F
such that a(c) = 0 then we call c a root of a(x). For example, 2 is a root
of the polynomial 3x2 + x − 14 ∈ R[x], because 3.22 + 2 − 14 = 0.
There is an absolutely fundamental connection between roots of a poly-
nomial and factors of that polynomial. This connection is explored in the
following pages, beginning with the next theorem:
0 = a(c) = (c − c)q(c) + r = 0 + r = r
(x − c1 )(x − c2 ) . . . (x − cm )
is a factor of a(x).
29
Proof. To prove this, let us first make a simple observation: if a polynomial
a(x) can be factored, any root of a(x) must be a root of one of its factors.
Indeed, if a(x) = s(x)t(x) and a(c) = 0, then s(c)t(c) = 0, and therefore
either s(c) = 0 or t(c) = 0.
Let cl , . . . , cm be distinct roots of a(x). By Theorem 3.1.1,
Repeating this argument for each of the remaining roots gives us our result.
It was stated earlier in this chapter that the difference between polyno-
mials and polynomial functionsis mainly a difference of viewpoint. Mainly,
but not entirely! Remember that two polynomials a(x) and b(x) are equal
iff corresponding coefficients are equal, whereas two functions a(x) and b(x)
are equal iff a(x) = b(x) for every x in their domain. These two notions of
equality do not always coincide!
For example, consider the following two polynomials in Z5 [x]:
a(x) = x5 + 1
b(x) = x − 4.
30
You may check that a(0) = b(0), a(1) = b(1), . . . , a(4) = b(4); hence a(x)
and b(x) are equal functions from Z5 to Z5 . But as polynomials, a(x) and
b(x) are quite distinct (they do not even have the same degree)!
It is reassuring to know that this cannot happen when the field F is
infinite. Suppose a(x) and b(x) are polynomials over a field F which has
infinitely many elements. If a(x) and b(x) are equal as functions, this
means that a(c) = b(c) for every c ∈ F . Define the polynomial d(x) to
be the difference of a(x) and b(x): d(x) = a(x) − b(x). Then d(c) = 0
for every c ∈ F. Now, if d(x) were not the zero polynomial, it would be a
polynomial (with some finite degree n) having infinitely many roots, and
by Theorem 3.1.3 this is impossible! Thus, d(x) is the zero polynomial (all
its coefficients are equal to zero), and therefore a(x) is the same polynomial
as b(x) (they have the same coefficients).
This tells us that if F is a field with infinitely many elements (such as
Q, R, or C), there is no need to distinguish between polynomials and poly-
nomial functions. The difference is, indeed, just a difference of viewpoint.
31
The polynomial b(x) has integer coefficients; and since it differs from a(x)
only by a constant factor, it has the same roots as a(x). Thus, for every
polynomial with rational coefficients, there is a polynomial with integer
coefficients having the same roots. Therefore, for the present we will confine
our attention to polynomials with integer coefficients. The next theorem
makes it easy to find all the rational roots of such polynomials:
3.2.1 Theorem. Let s/t be a rational number in simplest form (that is,
the integers s and t do not have a commonfactor greater than 1). Let
a(x) = a0 + . . . + an xn be a polynomial with integer coefficients. If s/t is a
root of a(x), then s | a0 and t | an .
a0 tn + a1 stn−1 + . . . + an sn = 0 (3.2.1.1)
We may now factor out s from all but the first term to get
As an example of the way Theorem 3.2.1 may be used, let us find the
rational roots of
3.2.2 Lemma. Let a(x) = b(x)c(x), where a(x), b(x), and c(x) have integer
coefficients. If a prime number p divides every coefficient of a(x), it either
divides every coefficient of b(x) or every coefficient of c(x).
Proof. If this is not the case, let br be the first coefficient of b(x) not divisible
by p, and let ct be the first coefficient of c(x) not divisible by p. Now,
a(x) = b(x)c(x), so
a(x) = b(x)c(x),
where b(x) and c(x) have rational coefficients. Then there are polynomials
B(x) and C(x) with integer coefficients, which are constant multiples of
b(x) and c(x), respectively, such that
a(x) = B(x)C(x).
33
Proof. Let k and l be integers such that kb(x) and lc(x) have integer coef-
ficients. Then kla(x) = [kb(x)][lc(x)]. By Lemma 3.2.2, each prime factor
of kl may now be canceled with a factor of either kb(x) or lc(x).
a(x) = a0 + a1 x + . . . + an xn
34
Let s be the smallest integer such that p ∤ cs . We have
as = b0 cs + b1 cs−1 + . . . + bs c0
35
be factored into a(x) = k(x − c1 ) . . . (x − cn ), where cl , . . . , cn are complex
numbers (some of which may be real).
For our closing comments, we need the following lemma:
F(a0 + a1 r + . . . + an rn ) = a0 + a1 r + . . . + an rn .
Now let a(x) be any polynomial with real coefficients, and let r = a + bi
be a complex root of a(x). Then r is also a root of a(x), so
36
We now place emphasis on a different point of view. Finding the zeros
of a polynomial
a0 + a1 x + . . . + an−1 xn−1 + an xn = 0.
ax + b = 0
by the formula
b
x=−
a
and the solution of quadratic equations
ax2 + bx + c = 0
by the formula √
b2 − 4ac
−b ±
x=
2a
long ago prompted mathematicians to seek similar formulas for equations
of higher degree with real coefficients.
In the 16th century, Italian mathematicians named Ferro, Tartaglia, Fer-
rari, and Cardano developed methods for solving third- and fourth-degree
equations with real coefficients by the use of formulas that involved only
the operations of addition, subtraction, multiplication, division, and the
extraction of roots. For more than two hundred years afterward, mathe-
maticians struggled to obtain similar formulas for equations with degree
higher than 4 or to prove that such formulas did not exist. It was in the
early 19th century that the Norwegian mathematician Abel proved that
37
it was impossible to obtain such formulas for equations with degree greater
than 4.
The proof of Abel’s result is beyond the level of this text, but the for-
mulas for cubic and quartic (third- and fourth-degree) equations with real
coefficients are within our reach.
We consider first the solution of the general cubic equation
a0 + a1 x + a2 x2 + a3 x3 = 0
x3 + ax2 + bx + c = 0.
cos 0 + i sin 0 = 1,
√
2π 2π −1 + i 3
cos + i sin = ,
3 3 2
√
4π 4π −1 − i 3
cos + i sin = .
3 3 2
√
−1 + i 3
If we let ω = the distinct cube roots of 1 are ω, ω 2 , and
2 √
ω 3 = 1. For an arbitrary nonzero complex number z, let 3 z denote any
√ √ √
fixed cube root of z. Then each of the numbers 3 z, ω 3 z, and ω 2 3 z and is
a cube root of z, and they are clearly distinct. Thus the three cube roots
of z are given by
√ √ √
3
z, ω 3 z, ω 2 3 z,
√
−1 + i 3
where ω = . This result is used in solving the cubic equation in
2
Theorem 3.5.2.
38
The following two theorems lead to formulas for the solutions of the
general cubic equation
x3 + ax2 + bx + c = 0.
y 3 + py + q = 0,
a2 ab 2a3
where p = b − , q = c − + .
3 3 27
Proof. The theorem can be proved by direct substitution, but the details
are neater if we first consider a substitution of the form x = y + h, where
h is unspecified at this point. This substitution yields
y 3 + py + q = 0,
39
√ r r
−1 + i 3 q q 2 p 3 q q 2 p 3
and let ω = ,A=− + + ,B=− − + .
2 2 2 3 2 2 3
The solutions to y 3 + py + q = 0 are given by
√ √ √ √ √ √
A + B, ω A + ω 2 B, ω 2 A + ω B
3 3 3 3 3 3
√ √
where 3 A and 3 B denote (real or complex) cube roots of A and B chosen
√ √ p
so that 3 A 3 B = − .
3
Proof. For an efficient proof, we resort to a “trick” substitution: We let
p
y=z−
3z
in y 3 + py + q = 0. This substitution yields
p 3 p
z− +p z− + q = 0.
3z 3z
This equation then simplifies to
3 p3
z − +q =0
27z 3
and then to
6 3 p3
z + qz − = 0.
27
This is a quadratic equation in z 3 and we can use the quadratic formula to
obtain r
4p3
−q ± q2 + q
r
q 2 p 3
z3 = 27 = − ± + .
2 2 2 3
With A and B as given in the statement of the theorem, we have
z 3 = A or z 3 = B.
Noting that
r ! r !
q q 2 p 3 q q 2 p 3
AB = − + + − − +
2 2 3 2 2 3
q 2 q 2 p 3
= − +
2 2 3
p3
=− ,
27
40
√
3
√
3
we see that A and B need to be chosen so that
√
3
√
3 2
√
3
√
3
√
3 2
√
3
A, ω A, ω A, B, ω B, ω B.
3.5.3 Example. We shall use the formulas in Theorem 3.5.2 to solve the
equation
y 3 − 9y − 12 = 0.
41
√ !2 √ !
√
2 3
√
3 −1 + i 3 √ 3 −1 + i 3 √ 3
ω 9+ω 3= 9+ 3
2 2
√ ! √ !
−1 − i 3 √ 3 −1 + i 3 √3
= 9+ 3
2 2
√
1 √ 3
√
3 i 3 √
3
√ 3
= − ( 9 + 3) − ( 9 − 3),
2 2
The results of Theorem 3.5.1 and Theorem 3.5.2 combine to yield the
following theorem. The formulas in the theorem are known as Cardano’s
Formulas.
x3 + ax2 + bx + c = 0,
are given by
√ √ a √ √ a √ √ a
B − , ω A + ω2 B − , ω2 A + ω B − ,
3 3 3 3 3 3
A+
3 3 3
where √
−1 + i 3 a2 ab a3
ω= ,p = b − ,q = c − + ,
2 3 3 27
r r
q q 2 p 3 q q 2 p 3
A=− + + ,B = − − + ,
2 2 3 2 2 3
√ √ √ √ p
with 3 A and 3 B chosen so that 3 A 3 B = − .
3
The use of Theorems 3.5.4 is demonstrated in the following example.
x3 − 3x2 − 6x − 4 = 0.
2 2
√
√3
√
2 3 1 √ 3
√
2 3 i 3 √ 3
√
2 3
ω 9 + ω 3 + 1 = − (ω 9 + ω 3 − 2) − (ω 9 − ω 3).
2 2
We turn our attention now to the solution of quartic equations. As in
the case of the cubic equation, there is no loss of generality in assuming
that the equation is monic. Thus we assume an equation of the form
x4 + ax3 + bx2 + cx + d = 0.
y 4 + py 2 + qy + r = 0.
y 4 + py 2 + qy + r = 0
43
which can be written as
y 4 = −py 2 − qy − r.
The basic idea of our method, which was devised by Ferrari, is to add an
expression to each side of the last equation that will make both sides perfect
squares (squares of binomials). With this idea in mind, we add
2 t2
ty +
4
to both sides, where t is yet to be determined. This gives
t2 t2
y 4 + ty 2 + = −py 2 − qy − r + ty 2 + ,
4 4
or 2
t2
2 t 2
y + = (t − p)y − qy + −r .
2 4
We recall that a quadratic polynomial Ay 2 + By + C is the square of a
binomial
Ay 2 + By + C = (Dy + E)2
t3 − t2 − 24t + 20 = 0.
t 2
2
2 2 t
y + = (t − p)y − qy + −r
2 4
becomes
5 2 1 2
2 2 1
y + = 4y + 2y + = 2y + .
2 4 2
Equating square roots, we obtain
5 1 5 1
y 2 + = 2y + or y 2 + = − 2y +
2 2 2 2
and then
y 2 − 2y + 2 = 0 or y 2 + 2y + 3 = 0.
45
We can now describe a method of solution for an arbitrary quartic equa-
tion
x4 + ax3 + bx2 + cx + d = 0.
y 4 + py 2 + qy + r = 0.
We next use the method of Example 3.5.7 to find the four solutions y1 , y2 , y3
and y4 of the equation in y. Then the solutions to the original equation are
given by
a
xj = yj − , for j = 1, 2, 3, 4.
4
This is illustrated in Example 3.5.8.
x4 + 4x3 + 7x2 + 4x + 6 = 0.
a
The substitution formula x = y − yields x = y − 1 and the resulting
4
equation
y 4 + y 2 − 2y + 6 = 0.
46
Just as the discriminant b2 −4ac can be used to characterize the solutions
of the quadratic equation ax2 +bx+c = 0, the discriminant of a polynomial
equation can be used to characterize its solutions. In particular, we will see
that a cubic equation will have either exactly one real solution or exactly
three real solutions. We begin with the next definition.
f (y) = y 3 + py + q
where
√
3
√
3
c1 = A+
B,
√ √
c2 = ω A + ω 2 B,
3 3
√ √
c3 = ω 2 A + ω B
3 3
and √
−1 + i 3
ω= ,
2
r
q q 2 p 3
A=− + + ,
2 2 3
47
r
q q 2 p 3
B=− − + .
2 2 3
The discriminant is
√ √ √ √
= A + B − ω A − ω2 B
3 3 3 3
√ √
= (1 − ω)( A − ω 2 B),
3 3
√ √ √ √
c1 − c3 = ( A + B) − (ω 2 A + ω B)
3 3 3 3
√ √ √ √
= A + B − ω2 A − ω B
3 3 3 3
√ √
= −ω 2 (1 − ω)( A − ω B),
3 3
√ √ √ √
c2 − c3 = (ω A + ω 2 B) − (ω 2 A + ω B)
3 3 3 3
√ √ √ √
= ω A + ω2 B − ω2 A − ω B
3 3 3 3
√ 3
√
3
= ω(1 − ω)( A − B).
Then
√ √ √ √ √ √
D = −(1 − ω)( A − ω 2 B)ω 2 (1 − ω)( A − ω B)ω(1 − ω)( A − B)
3 3 3 3 3 3
√ √ √ √ √ √
= −ω 3 (1 − ω)3 ( A − ω 2 B)( A − ω B)( A − B)
3 3 3 3 3 3
√ √ √ √ √ √ √
= 3i 3( A − ω 2 B)( A − ω B)( A − B)
3 3 3 3 3 3
√ √ 3
√
3 3
√ √
3 3 3
√3
√3
√
3
√
3
= 3i 3( A − ω B + ω A B − A B − ω A2 B
3 3 2 2
√ √3
√
3
√ √ √3
+ ω A B 2 − ω 2 A2 B + ω 2 A B 2 )
3 3 3
√ √ 3
√
3
√ 3
√3
= 3i 3(A − B + A B − A B 2 2
√3
√ √ √3
√
3
√ √ √
3
− ω( A2 B − A B 2 ) − ω 2 ( A2 B − A B 2 ))
3 3 3 3
√ √3
√ √ √
3
= 3i 3(A − B + (−1 − ω − ω 2 )( A2 B − A B 2 ))
3 3
√
= 3i 3(A − B)
√
r
4
= 3i 3 q 2 + p3
27
48
since −1 − ω − ω 2 = 0 and
r r !
q q 2 p 3 q q 2 p 3
A−B =− + + − − − +
2 2 3 2 2 3
r
q 2 p 3
=2 +
r 2 3
4
= q 2 + p3 .
27
Thus !2
√
r
4 3
D2 = 3i 3 q 2 + p
27
= −27q 2 − 4p3 .
y 3 + py + q = 0.
49
and
D2 = (2bi((a − c1 )2 + b2 ))2
= −4b2 ((a − c1 )2 + b2 )2 < 0
since b ̸= 0. Thus, if there is a nonreal solution, then the discriminant
is negative. It follows that if the discriminant is nonnegative, then the
solutions must all be real.
50
EXERCISES
In order to find a root of a(x) in a finite field F, the simplest method (if
F is small) is to test every element of F by substitution into the equation
a(x) = 0.
1. Find all the roots of the following polynomials in Z5 [x], and factor
the polynomials:
x3 + x2 + x + 1; 3x4 + x2 + 1; x5 + 1; x4 + 1; x4 + 4.
2. Use Fermat’s theorem to find all the roots of the following polynomials
in Z7 [x]:
x100 − 1; 3x98 + x19 + 3; 2x74 − x55 + 2x + 6.
3. Using Fermat ’s theorem, find polynomials of degree ≤ 6 which
determine the same functions as the following polynomials in Z7 [x]:
3x75 − 5x54 + 2x13 − x2 ; 4x108 + 6x101 − 2x81 ; 3x103 − x73 + 3x55 − x25 .
4. Explain why every polynomial in Zp [x] has the same roots as a poly-
nomial of degree < p.
Let F be a field.
1. Prove that the following statements are true in F[x]:
(a) The remainder of p(x), when divided by x − c, is p(c).
(b) (x − c) | (p(x) − p(c)).
(c) Every polynomial has the same roots as any of its associates.
2. If a(x) and b(x) have the same roots in F, are they necessarily asso-
ciates? Explain.
3. Prove: If a(x) is a monic polynomial of degree n, and a(x) has n
roots c1 , . . . , cn ∈ F , then a(x) = (x − c1 ) . . . (x − cn ).
4. Suppose a(x) and b(x) have degree < n. If a(c) = b(c) for n values of
c, prove that a(x) = b(x).
7. There are infinitely many irreducible polynomials in Z5 [x].
8. How many roots does x2 − x have in Z10 ? In Z11 ? Explain the
difference.
52
y = x + c, it does. It is important to note that if a(x) can be factored into
p(x)q(x), then certainly a(x+c) can be factored into p(x+c)q(x+c). Thus,
the irreducibility of a(x + c) implies the irreducibility of a(x).
(a) Use the change of variable y = x + 1 to show that x4 + 4x + 1 is
irreducible in Q[x] (In other words, test (x+1)4 +4(x+1)+1 by Eisenstein’s
criterion).
(b) Find an appropriate change of variable to prove that the following
are irreducible in Q[x]:
x4 + 2x2 − 1; x3 − 3x + 1; x4 + 1; x4 − 10x2 + 1.
(x − 1)(xp − 1 + xp − 2 + . . . + x + 1) = xp − 1
hence
p−1 p−2 xp − 1
x + x + ... + x + 2 = .
x−1
Use the change of variable y = x + 1, and expand by the binomial theorem.
4. By Exercise
G1(c)
h(a0 + a1 x + . . . + an xn ) = an + an−1 x + . . . + a0 xn
x4 − 5x2 + 1; 3x4 − x2 − 2; x4 + x3 + 3x + 1.
2x3 + x2 + 4x + 1; x4 + 2; x4 + 4x2 + 2; x4 + 1.
54
1. If h(a(x)) is irreducible in Zn [x] and a(x) is monic, then a(x) is
irreducible in Z[x];
2. x4 +10x3 +7 is irreducible in Z[x] by using the natural homomorphism
from Z to Z5 ;
3. The following are irreducible in Z[x] (find the right value of n and
use the natural homomorphism from Z to Zn ):
It is a useful fact that Theorems 3.1.1, 3.1.2, and 3.1.3 are still true in
A[x] when A is not a field, but merely an integral domain. The proof of
Theorem 3.1.1 must be altered a bit to avoid using the division algorithm.
We proceed as follows: If a(x) = a0 + a1 x + . . . + an xn and c is a root of
a(x), consider
2. Conclude from part 1 that a(x) − a(c) = (x − c)q(x) for some q(x).
3. Complete the proof of Theorem 3.1.1, explaining why this particular
proof is valid when A is an integral domain, not necessarily a field.
4 Check that Theorems 3.1.2 and 3.1.3 are true in A[x] when A is an
integral domain.
H. Polynomial Interpolation
55
y = f (x) such that f (a0 ) = b0 , f (a1 ) = b1 , . . . , f (an ) = bn . The simplest
and most useful kind of function for this purpose is a polynomial function
of the lowest possible degree.
We now consider a commonly used technique for constructing a poly-
nomial p(x) of degree n which assumes given values b0 , b1 , . . . , bn are given
points a0 , a1 , . . . , an . That is,
56
2. Prove that xp − x has p roots in Zp [x], for any prime p. Draw the
conclusion that in Zp [x], xp − x can be factored as
3. Prove that if a(x) and b(x) determine the same function in p[x], then
(xp − x) | (a(x) − b(x)).
In the next four parts, let F be any finite field.
4. Let a(x) and b(x) be in F[x]. Prove that if a(x) and b(x) determine
the same function, and if the number of elements in F exceeds the degree
of a(x) as well as the degree of b(x), then a(x) = b(x).
5. Prove: The set of all a(x) which determine the zero function is an
ideal of F[x]. What its generator?
6. Let F(F ) be the ring of all functions from F to F, defined in the
same way as F(R). Let h : F [x] → F(F ) send every polynomial a(x) to the
polynomial function which it determines. Show that h is a homomorphism
from F[x] onto F(F ).
(NOTE: To show that h is onto, use Exercise H4).
7. Let F = {c1 , . . . , cn } and p(x) = (x − c1 ) . . . (x − cn ). Prove that
F[x]/⟨p(x)⟩ ∼
= F(F).
57
(1) x3 − 15x − 30 = 0,
(2) x3 − 9x + 12 = 0,
(3) x3 − 12x − 20 = 0,
(4) x3 + 15x − 20 = 0,
(5) x3 − 6x − 6 = 0,
(6) x3 + 6x − 2 = 0,
(7) x3 + 9x + 6 = 0,
(8) x3 + 9x − 6 = 0,
(9) 2x3 + 6x − 3 = 0,
(10) 2x3 − 6x − 5 = 0,
(11) x3 − 6x2 + 33x − 92 = 0,
(12) x3 + 3x2 + 21x + 13 = 0,
(13) 8x3 + 12x2 + 150x + 25 = 0,
(14) 8x3 − 12x2 + 54x − 9 = 0,
(15) x4 + x2 − 2x + 6 = 0,
(16) x4 − 2x2 + 8x − 3 = 0,
(17) x4 + 4x3 + 3x2 + 4x + 2 = 0,
(18) x4 − 4x3 + 4x2 − 8x + 4 = 0.
3. Characterize the solutions to the following equations by evaluating
the discriminant D2 .
(1) x3 − 91x + 90 = 0,
(2) x3 − 32x + 24 = 0,
(3) x3 − 55x − 72 = 0,
(4) x3 − 124x − 240 = 0,
(5) x3 − 47x − 136 = 0,
(6) x3 − 3x + 52 = 0.
a
4. Prove Theorem 3.5.6: The change of variable x = y − in
4
x4 + ax3 + bx2 + cx + d = 0
58
yields an equation of the form
y 4 + py 2 + qy + r = 0.
1
5. Show that the change of variable x = y − an−1 in
n
xn + an−1 xn−1 + an−2 xn−2 + . . . + a1 x + a0 = 0
or
y n + bn−2 y n−2 + . . . + b1 x + b0 = 0.
59
CHAPTER 4
EXTENSIONS OF FIELDS
60
In the matter of factoring polynomials and extracting their roots, is
utopia! In every polynomial a(x) of degree n has exactly n roots c1 , . . . , cn
and can therefore be factored as
This ideal situation is not enjoyed by all fields - far from it! In an arbitrary
field F, a polynomial of degree n may have any number of roots, from no
roots to n roots, and there may be irreducible polynomials of any degree
whatever. This is a messy situation, which does not hold the promise of an
elegant theory of solutions to polynomial equations. However, it turns out
that F always has a suitable extension E such that any polynomial a(x)
of degree n over F has exactly n solutions in E. Therefore, a(x) can be
factored in E[x] as
61
Let Jc denote the kernel of σc ; since the kernel of any homomorphism is
an ideal, Jc is an ideal of F[x]. An element c in E is called algebraic over
F if it is the root of some nonzero polynomial a(x) in F[x]. Otherwise, c is
called transcendental over F. Obviously c is algebraic over F iff Jc contains
nonzero polynomials, and transcendental over F iff Jc = {0}.
We will confine our attention now to the case where c is algebraic. The
transcendental case will be examined in Exercise G at the end of this chap-
ter.
62
above, σ√2 : Q[x] → R is a homomorphism and its kernel consists of all
√
the polynomials in Q[x] which have 2 as one of their roots. The monic
√
polynomial of least degree in Q[x] having 2 as a root is p(x) = x2 − 2;
√
hence x2 − 2 is the minimum polynomial of 2 over Q.
Now, let us turn our attention to the range of σc . Since σc is a homomor-
phism, its range is obviously closed with respect to addition, multiplication,
and negatives, but it is not obviously closed with respect to multiplicative
inverses. Not obviously, but in fact it is closed for multiplicative inverses,
which is far from self-evident, and quite a remarkable fact. In order to prove
this, let f (c) be any nonzero element in the range of σc . Since f (c) ̸= 0, f (x)
is not in the kernel of σc . Thus, f (x) is not a multiple of p(x), and since p(x)
is irreducible, it follows that f (x) and p(x) are relatively prime. Therefore
there are polynomials s(x) and t(x) such that s(x)f (x) + t(x)p(x) = 1. But
then
s(c)f (c) + t(c)p(c) = 1.
We have just seen that range σc is a field. In fact, it is the smallest field con-
taining F and c: indeed, any other field containing F and c would inevitably
contain every element of the form
a0 + a1 c + . . . + an cn (a0 , . . . , an ∈ F )
63
Now, here is what we have, in a nutshell: σc is a homomorphism with
domain F[x], range F(c), and kernel Jc = ⟨p(x)⟩. Thus, by the fundamental
homomorphism theorem
F(c) ∼
= F [x]/⟨p(x)⟩.
64
Proof. To begin with, a(x) can be factored into irreducible polynomials
in F[x]. If p(x) is any nonconstant irreducible factor of a(x), it is clearly
sufficient to find an extension of F containing a root of p(x), since such a
root will also be a root of a(x).
In Exercise D4 of Chapter 2, the reader was asked to supply the simple
proof that, if p(x) is irreducible in F[x], then ⟨p(x)⟩ is a maximal ideal of
F[x]. Furthermore, if ⟨p(x)⟩ is a maximal ideal of F[x], then the quotient
ring F[x]/⟨p(x)⟩ is a field.
It remains only to prove that F[x]/⟨p(x)⟩ is the desired field extension
of F. When we write J = ⟨p(x)⟩, let us remember that every element of
F[x]/J is a coset of J. We will prove that F[x]/J is an extension of F by
identifying each element a in F with its coset a + J.
To be precise, define h : F → F [x]/J by h(a) = a + J. Note that h is
the function which matches every a in F with its coset a + J in F[x]/J. We
will now show that h is an isomorphism.
By the familiar rules of coset addition and multiplication, h is a homo-
morphism. Now, every homomorphism between fields is injective (this is
true because the kernel of a homomorphism is an ideal, and a field has no
nontrivial ideals). Thus, h is an isomorphism between its domain and its
range.
What is the range of h? It consists of all the cosets a + J where a ∈ F ,
that is, all the cosets of constant polynomials (if a is in F, then a is a
constant polynomial). Thus, F is isomorphic to the subfield of F[x]/J
containing all the cosets of constant polynomials. This subfield is therefore
an isomorphic copy of F, which may be identified with F, so F[x]/J is an
extension of F.
Finally, if p(x) = a0 + a1 x + . . . + an xn , let us show that the coset x + J
is a root of p(x) in F[x]/J. Of course, in F[x]/J, the coefficients are not
actually a0 , a1 , . . . , an , but their cosets a0 + J, a1 + J, . . . , an + J. Writing
a0 = a0 + J, . . . , an = an + J, and x = x + J.
65
We must prove that
Well
a0 + a1 x + . . . + an xn = (a0 + J) + (a1 + J)(x + J) + . . . + (an + J)(x + J)n
= (a0 + J) + (a1 x + J) + . . . + (an xn + J)
= p(x) + J
= J ( because p(x) ∈ J).
This completes the proof of the basic theorem of field extensions. Observe
that we may use this theorem several times in succession to get the follow-
ing: Let a(x) be a polynomial of degree n in F[x]. There is an extension
field E of F which contains all n roots of a(x).
66
containing F and c must contain F (c). We saw in the above that if c is
algebraic over F , then F (c) consists of all the elements of the form a(c),
for all a(x) in F [x]. Since F (c) is an extension of F , we may regard it as a
vector space over F . Is F (c) a finite extension of F ?
Well, let c be algebraic over F , and let p(x) be the minimum polynomial
of c over F (that is, p(x) is the monic polynomial of lowest degree having
c root). Let the degree of the polynomial p(x) be equal to n. It turns out,
then, that the n elements
1, c, c2 , . . . , cn−1
are linearly independent and span F (c). We will prove this fact in a mo-
ment, but meanwhile let us record what it means. It means that the set
of n “vectors” {1, c, c2 , . . . , cn−1 } is a basis of F (c); hence F (c) is a vector
space of dimension n over the field F . This may be summed up concisely
as follows:
4.4.1 Theorem. The degree of F (c) over F is equal to the degree of the
minimum polynomial of c over F .
Proof. It remains only to show that the n elements 1, c, . . . , cn−1 span F (c)
and are linearly independent. Well, if a(c) is any element of F (c), use the
division algorithm to divide a(x) by p(x):
Therefore,
a(c) = p(c)q(c) + r(c) = 0 + r(c) = r(c).
This shows that every element of F (c) is of the form r(c) where r(x) has
degree n − 1 or less. Thus, every element of F (c) can be written in the form
a0 + a1 c + . . . + an−1 cn−1
67
Finally, to prove that 1, c, . . . , cn−1 are linearly independent, suppose
that a0 + a1 c + . . . + an−1 cn−1 = 0. If the coefficients a0 , a1 , . . . , an−1 were
not all zero, c would be the root of a nonzero polynomial of degree n − 1
or less, which is impossible because the minimum polynomial of c over F
has degree n. Thus, a0 = a1 = . . . = an−1 = 0.
√ √
4.4.2 Example. Let us look at Q( 2): the number 2 is not a root of any
monic polynomial of degree 1 over Q. For such a polynomial would have to
√ √
be x − 2, and the latter is not in Q[x] because 2 is irrational. However,
√ √
2 is a root of x2 − 2, which is therefore the minimum polynomial of 2
over Q, and which has degree 2. Thus,
√
[Q( 2) : Q] = 2.
√
In particular, every element in Q( 2) is therefore a linear combination of
√ √
1 and 2, that is, a number of the form a + b 2 where a, b ∈ Q.
Proof. To prove that the set {ai bj } spans E, note that each element c in E
68
can be written as a linear combination
c = k1 b1 + . . . + kn bn
[E : F ] = [E : K][K : F ].
69
F (c), and then adjoin d to F (c). The resulting field is denoted F (c, d) and
is the smallest field containing F , c and d. [Indeed, any field containing F ,
c and d must contain F (c), hence also F (c, d)]. It does not matter whether
we first adjoin c and then d or vice versa.
If c1 , . . . , cn are algebraic over F , we let F (c1 , . . . , cn ) be the smallest
field containing F and c1 , . . . , cn . We call it the field obtained by adjoining
c1 , . . . , cn to F . We may form F (c1 , . . . , cn ) step by step, adjoining one ci
at a time, and the order of adjoining the ci is irrelevant.
An extension F (c) formed by adjoining a single element to F is called
a simple extension of F . An extension F (c1 , . . . , cn ) formed by adjoining
a finite number of elements c1 , . . . , cn is called an iterated extension. It is
called “iterated” because it can be formed step by step, one simple extension
at a time:
70
4.4.6 Theorem. If K is a finite extension of F , every element of K is
algebraic over F .
71
needed later.
By the comments immediately preceding Theorem 4.4.1, every element
of F (c1 ) is a linear combination of powers of c1 , with coefficients in F . That
is, every element of F (c1 ) is of the form ki ci1 where the ki are in F . For
P
i
the same reason, every element of F (c1 , c2 ) is of the form lj ci1 where the
P
j
coefficients lj are in F (c1 ). Thus, each coefficient lj is equal to a sum of
the form (2). But then, clearing brackets, it follows that every element of
F (c1 , c2 ) is of the form kij ci1 cj2 where the coefficients kij are in F .
P
i
If we continue this process, it is easy to see that every element of
F (c1 , c2 , . . . , cn ) is a sum of terms of the form kci11 ci22 . . . cinn where the coef-
ficient k of each term is in F .
72
EXERCISES
73
(a) 1 + 2i,
√
(b) 1 + 2,
√
(c) 1 + 2i,
p √
(d) 2 + 3 3,
√ √
(e) 3 + 5,
p √
(f) 1 + 2.
√
2. Show that the minimum polynomial of 2 + i is
√
(a) x2 − 2 2x + 3 over R,
(b) x4 − 2x2 + 9 over Q,
(c) x2 − 2ix − 3 over Q(i).
3. Find the minimum polynomial of the following numbers over the
indicated fields:
√ √
(a) 3 + i over R; over Q; over Q(i); over Q( 3),
p√ √
(b) 2 + i over R; over Q; over Q(i); over Q( 2).
4. For each of the following polynomials p(x), find a number a such that
p(x) is the minimum polynomial of a over Q:
(a) x2 + 2x − 1,
(b) x4 + 2x2 − 1,
(c) x4 − 10x2 + 1.
5 Find a monic irreducible polynomial p(x) such that Q[x]/⟨p(x)⟩ is
isomorphic to
√
(a) Q( 2),
√
(b) Q(1 + 2),
p √
(c) Q( 1 + 2).
74
1. Prove: Every element in F(c) can be written as r(c), for some r(x) of
degree < n in F[x].
[HINT: Given any element t(c) ∈ F (c), use the division algorithm to
divide t(x) by p(x)].
2. If s(c) = t(c) in F(c), where s(x) and t(x) have degree < n, prove
that s(x) = t(x).
3. Conclude from parts 1 and 2 that every element in F(c) can be written
uniquely as r(c), with deg r(x) < n.
4. Using part 3, explain why there are exactly four elements in Z2 [x]/⟨x2 +
x + 1⟩. List these four elements, and give their addition and multiplication
tables.
[HINT: Identify Z2 [x]/<x2 + x + 1⟩ with Z2 (c), where c is a root of
x2 + x + 1. Write the elements of Z2 (c) as in part 3. When computing the
multiplication table, use the fact that c2 + c + 1 = 0].
5. Describe Z2 [x]/⟨x3 + x + 1⟩, as in part 4.
6. Describe Z3 [x]/⟨x3 + x2 + 2⟩, as in part 4.
75
both roots of p(x).
E. Simple Extensions
76
1. Prove that, if F is a field whose characteristic is ̸= 2, any quadratic
√
extension of F is of the form F( a), for some a ∈ F (HINT: Complete the
square, and use Exercise E1(d)).
Let F be a finite field, and F∗ the multiplicative group of nonzero ele-
ments of F. Obviously H = {x2 | x ∈ F ∗ } is a subgroup of F∗ ; since every
square x2 in F∗ is the square of only two different elements, namely ±x,
exactly half the elements of F∗ are in H. Thus, H has exactly two cosets:
H itself, containing all the squares, and aH (where a ∈ / H), containing all
a
the nonsquares. If a and b are nonsquares, then ab−1 = ∈ H. Thus: if a
a b
and b are nonsquares, is a square. Use these remarks in the following:
b
2. Let F be a finite field. If a, b ∈ F , let p(x) = x2 − a and q(x) = x2 − b
√ √
be irreducible in F[x], and let a and b denote roots of p(x) and q(x) in
a a
an extension of F. Explain why is a square, say = c2 for some c ∈ F.
b b
Prove that is a root of p(cx).
√
3. Use part 2 to prove that F[x]/⟨p(cx)⟩ ∼ = F ( b); then use Exercise
√ √
E1(e) to conclude that F( a) ∼ = F ( b).
4. Use part 3 to prove: Any two quadratic extensions of a finite field are
isomorphic.
a
is a square (why?). Use the same
5. If a and b are nonsquares in R,
b
argument as in part 4 to prove that any two simple extensions of R are
isomorphic (hence isomorphic to C).
77
4. If c is transcendental over F, every element in F(c) but not in F is
transcendental over F.
Let F be a field, and let a(x), b(x) ∈ F [x]. Prove the following:
1. If a(x) and b(x) have a common root c in some extension of F,
they have a common factor of positive degree in F[x]. [Use the fact that
a(x), b(x) ∈ ker σc ].
2. If a(x) and b(x) are relatively prime in F[x], they are relatively prime
in K[x], for any extension K of F. Conversely, if they are relatively prime
in K[x], then they are relatively prime in F[x].
78
3. If F has characteristic p ̸= 0, and a′ (x) = 0, prove that the only
nonzero terms of a(x) are of the form amp xmp for some m [that is, a(x) is
a polynomial in powers of xp ].
J. Multiple Roots
a(x) = (x − c1 ) . . . (x − cn )
79
This important result is stated as follows: A polynomial a(x) in F[x] has
a multiple root iff a(x) and a′ (x) have a common factor of degree > 1 in
F[x].
7. Show that each of the following polynomials has no multiple roots in
any extension of its field of coefficients:
The preceding example is most interesting: it shows that there are 100
different hundredth roots of 1 over Z7 . (The roots ±1 are in Z7 , while the
remaining 98 roots are in extensions of Z7 ). Corresponding results hold for
most other fields.
80
Let F be a field of characteristic ̸= 2. Let a ̸= b be in F .
√ √ √ √
1. Prove that any field F containing a + b also contains a and b.
√ √ √ √
Conclude that F ( a + b) = F ( a, b).
√ √ √
[HINT: Compute ( a + b)2 and show that ab ∈ F . Then compute
√ √ √
ab( a + b), which is also in F ].
√ √
2. Prove that if b ̸= x2 a for any x ∈ F , then b ∈ / F ( a). Conclude
√ √
that F ( a, b) is of degree 4 over F .
√ √
3. Show that x = a + b satisfies x4 − 2(a + b)x2 + (a − b)2 = 0. Show
p √
that x = a + b + 2 ab also satisfies this equation. Conclude that
q √ √ √
F ( a + b + 2 ab) = F ( a, b).
81
D. Degrees of Extensions (Applications of Theorem 4.4.5)
82
F. Further Properties of Degrees of Extensions
83
4. Let Q(a0 , . . . , an ) = Q1 . Since a(x) ∈ Q1 [x], c is algebraic over Q1 .
Prove that:
(a) Q1 (c) is a finite extension of Q1 hence a finite extension of Q. Why?
(b) c ∈ A.
Conclusion: The roots of any polynomial whose coefficients are algebraic
numbers are themselves algebraic numbers.
A field F is called algebraically closed if the roots of every polynomial
in F [x] are in F . We have thus proved that A is algebraically closed.
84
BIBLIOGRAPHY
[1] Nguyen Thi Hong Loan, Pham Hung Quy and Nguyen Thanh Quang
(20023), Dai so dai cuong, Vinh University Publications.
85