0% found this document useful (0 votes)
7 views52 pages

Lecture Notes KW2

These lecture notes by Mark Veraar cover measure and integration theory as part of a real analysis course at TU Delft. The document includes topics such as σ-algebras, measures, measurable functions, integration theory, convergence theorems, Lp spaces, and applications to Fourier series. It serves as a foundational resource for various applied mathematics courses and includes references for further reading.

Uploaded by

Ramesh Kadambi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views52 pages

Lecture Notes KW2

These lecture notes by Mark Veraar cover measure and integration theory as part of a real analysis course at TU Delft. The document includes topics such as σ-algebras, measures, measurable functions, integration theory, convergence theorems, Lp spaces, and applications to Fourier series. It serves as a foundational resource for various applied mathematics courses and includes references for further reading.

Uploaded by

Ramesh Kadambi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 52

MEASURE AND INTEGRATION

Lecture notes by Mark Veraar

Contents
Introduction 2
1. σ-algebras 3
2. Measures 6
3. Construction of measures 9
4. Measurable functions 13
5. Construction of the integral 18
5.1. Integral for simple functions 18
5.2. Integral for positive measurable functions 19
5.3. Integral for measurable functions 21
6. Convergence theorems and applications 25
6.1. The three main convergence results 25
6.2. Consequences and applications 26
7. Lp -spaces 30
7.1. Minkowski and Hölder’s inequalities 30
7.2. Completeness of Lp 31
7.3. Lp -spaces on intervals 32
8. Applications to Fourier series 36
8.1. Fourier coefficients 36
8.2. Weierstrass’ approximation result and uniqueness 37
8.3. Fourier series in L2 (0, 2π) 39
8.4. Fourier series in C([0, 2π]) 41
Appendix A. Dynkin’s lemma 46
Appendix B. Carathéodory’s extension theorem 48
Appendix C. Non-measurable sets 51
References 52

Date: January 18, 2019.


1
2 MEASURE AND INTEGRATION

Introduction
These notes have been created for the “Measure and integration theory” part of a course on
real analysis at the TU Delft. Together with the first part of the course on metric spaces, these
notes form the mathematical basis for several bachelor courses and master courses in applied
mathematics at TU Delft.
I would like to thank Emiel Lorist and Bas Nieraeth for their support in the preparation of
these notes. All the figures have been created by Emiel Lorist. I would also like to thank the
students of the course on real analysis for pointing out the typos in the manuscript.

In Section 1 and 2 we introduce σ-algebras and measures. The Lebesgue measure is constructed
in Section 3 and is based on Appendix B on Carathéodory’s theorem. Uniqueness questions are
addressed in Appendix A on Dynkin’s monotone class theorem. The amount of books on measure
theory is almost not measurable. The lecture notes are based on [1], [8], [16] and [17]. A very
complete treatment of measure theory is given in the impressive works [5].
In Sections 5, 6, 7 we introduce the integration theory and the Lebesgue spaces Lp . This theory
is fundamental in modern (applied) mathematics. There are many excellent books which give
more detailed treatments on the subject. See for instance [1], [4], [6], [15] for detailed treatments.
In Section 8 we give a brief introduction to the theory of Fourier series. More thorough treat-
ments can be found in for example [9], [10], [13], [14] and [19]. A full course on Fourier Analysis
is offered as 3rd elective course based on the lecture notes [10]. The theory of Fourier series will
be used in the 2nd year bachelor course on Partial Differential Equations [7], but also in several
other parts of Mathematical Physics and Numerical Analysis.

We end this brief introduction with a quote from a historical note of Zygmund [18]:

“The Lebesgue integral did not arise via the theory of Fourier series but was created through the
necessities of measuring geometric figures. But once it was introduced, it had an enormous impact
on Analysis through Fourier series”:
• The Riesz-Fischer theorems 7.5, 8.7, in its initial formulation primarily a theorem on
Fourier series.
• The M. Riesz interpolation theorem.
• Structure of sets of measure 0.
• The theory of trigonometric series has become a workshop of new methods in analysis, a
place where new methods are first discovered before they are generalized and applied in
other contexts.
• The theory of Fourier series gave a fresh impulse to problems of the differentiability of
functions (Sobolev spaces etc.).
MEASURE AND INTEGRATION 3

1. σ-algebras
For a set S we write P(S) for its power set.
Definition 1.1. Let S be a set. A collection R ⊆ P(S) is called a ring if
(i) ∅ ∈ R;
(ii) A, B ∈ R =⇒ B \ A ∈ R;
n
S
(iii) n ∈ N, A1 , A2 , . . . , An ∈ R =⇒ Aj ∈ R.
j=1

Remark 1.2.
(1) An equivalent definition is obtained if one replaces (iii) by A, B ∈ R =⇒ A ∪ B ∈ R. This
follows by induction.
(2) If R is a ring, then for all A, B ∈ R one has A ∩ B ∈ R. Indeed, this follows from the identity
A ∩ B = A \ (A \ B).
Definition 1.3. Let S be a set. A family A ⊆ P(S) is called a σ-algebra1 if
(i) ∅, S ∈ A;
(ii) A ∈ A =⇒ Ac ∈ A;

S
(iii) A1 , A2 , . . . , ∈ A =⇒ Aj ∈ A.
j=1
The sets A ∈ A are often called the measurable subsets of S.
Remark 1.4.

T
(1) If A is a σ-algebra, then for all A1 , A2 , . . . , ∈ A one has Aj ∈ A. This follows from the
j=1
∞  S∞ c
Acj .
T
identity Aj =
j=1 j=1
(2) Every σ-algebra is a ring. This follows from the identity B \ A = B ∩ Ac .
Example 1.5. Let S be a set.
(a) A = {S, ∅} is the smallest possible σ-algebra on S.
(b) A = P(S) is the largest possible σ-algebra on S.
Example 1.6. Let S = {1, 2, 3}.
(a) Let A = {∅, S, {1}, {2, 3}}. Then A is a σ-algebra.
(b) Let B = {∅, S, {1}, {2, 3}, {1, 2}}. Then B is not a σ-algebra.
Example 1.7. Let S be a set.
(a) The set R = {A ⊆ S : A is finite} is a ring.
(b) Let2 A = {A ⊆ S : A is countable or Ac is countable}. Then A is a σ-algebra (see Exercise
1.2). It is called the countable-cocountable σ-algebra and is useful for counterexamples.
We continue with a more serious example which plays a crucial role in later constructions.
Example 1.8.
(a) Let S = R. Let I 1 be the collection of all sets of the form (a, b] with a ≤ b. These intervals
will be called half-open intervals. Then I 1 is not a ring since for instance (0, 3] \ (1, 2] =
(0, 1] ∪ (2, 3] is not in I 1 .
(b) Let S = R. Let F 1 be the collection of sets which can be written as a finite union of half-open
intervals (thus of the form (a, b] with a ≤ b). Then F 1 is not a σ-algebra for several reasons.3
We check that F 1 is a ring. (i) follows from ∅ = (1, 1] ∈ F 1 . (iii) is clear since a finite union
of a finite union of intervals of the form (a, b] is again a finite union. It remains to check (ii).
1In part of the literature a σ-algebra is also called a σ-field
2Recall that a set A ⊆ S is countable if A is finite or there is a bijection f : N → A.

3For instance S (0, 1 − 1 ] = (0, 1) is not in F 1 .
n
n=1
4 MEASURE AND INTEGRATION

For this we first note that it is simple to check that for A, B ∈ F 1 one has A ∩ B ∈ F 1 and by
induction this extends to the intersections of finitely many sets. For two intervals (a, b] and
(c, d] using B \ A = B ∩ Ac and R \ (a, b] = (−∞, a] ∪ (b, ∞) we find
(c, d] \ (a, b] = (c, d] ∩ (R \ (a, b])
 
= (c, d] ∩ (−∞, a] ∪ (c, d] ∩ (b, ∞)
= (c, a ∧ d] ∪ (b ∨ c, d].
m n
This is in F 1 again. Now if A = (cj , dj ] are in F 1 , then
S S
(ai , bi ] and B =
i=1 j=1
n
[ n \
[ m
B\A= (cj , dj ] \ A = (cj , dj ] \ (ai , bi ].
j=1 j=1 i=1

and by the previous observations this is in F 1 again.


(c) Let S = Rd . For a, b ∈ Rd with a = (α1 , . . . , αd ) and b = (β1 , . . . , βd ) with αj ≤ βj for
j ∈ {1, . . . , k} the half-open rectangles are given by
(a, b] = (α1 , β1 ] × . . . × (αd , βd ].
Let F d be the collection of sets which can be written as a finite unions of half-open rectangles.
Then F d is a ring (see Exercise 1.7).
The proof of the next result is Exercise 1.3.
Proposition
T 1.9 (Intersection of σ-algebras). Suppose that Ai is a σ-algebra on S for every i ∈ I.
Then Ai is a σ-algebra.
i∈I

Definition 1.10. Let S be a set and let F ⊆ P(S). We write σ(F) for the smallest σ-algebra
which contains F. Then σ(F) is called the σ-algebra generated by F. More precisely:4
\
σ(F) = {A : A is a σ-algebra on S and F ⊆ A}.

Example 1.11. Let S = {1, 2, 3}, F = {{1, 2}} and G = {{2, 3}, {1, 2}}.
(a) σ(F) = {∅, S, {1, 2}, {3}}.
(b) σ(G) = P(S). Indeed, {2, 3}c = {1}, {1, 2}c = {3} and {2, 3} ∩ {1, 2} = {2}. Thus the
singletons {1}, {2} and {3} are in σ(G). Therefore, the required result follows since we can
form every subset of S by taking suitable finite unions.
Example 1.12. Let S = N and F = {{n} : n ∈ N}. Then σ(F) = P(N).
Definition 1.13. Let (S, d) be a metric space.5 Let B(S) be the σ-algebra generated by the open
sets in S. Thus
B(S) = σ{open sets in S}.
The σ-algebra B(S) is called6 the Borel σ-algebra of S. The sets of B(S) are called the Borel
subsets of S.
Example 1.14. One of the most important σ-algebras is the Borel σ-algebra of R which is usually
denoted by B(R). Later on we will show that B(R) 6= P(R).
The following lemma will be useful in some of the exercises on the Borel σ-algebra of R and Rd .
d d
S 1.15 (Lindelöf). Let A ⊆ R . Assume that for each
Lemma S i ∈ I let Oi ⊆ R be open. If
A⊆ Oi , then there exists a countable J ⊆ I such that A ⊆ Oi
i∈I i∈J

4Note that Proposition 1.9 ensures that σ(F ) is indeed a σ-algebra. The intersection makes sure we obtain the
smallest possible one
5More generally one could take any topological space here
6Named after the French mathematician Félix Borel 1871-1956
MEASURE AND INTEGRATION 5

Proof. Choose for each x ∈ A, ix ∈ I and rx > 0 such that B(x, rx ) ⊆ Oix . For each x ∈ A choose
ax ∈ Qd and sx ∈ Q ∩ (0,
S ∞) such that x ∈ B(ax , sx ) ⊆ B(x, rx ) ⊆ Oix . Let F = {B(ax , sx ) : x ∈
A}. Then clearly A ⊆ B(ax , sx ). Moreover, F has at most countably many sets. Indeed, this
x∈A
follows from the fact that it is a subset of {B(q, r) : q ∈ Qd , r ∈ Q ∩ (0, ∞)} which is countable.
Therefore, we can write F = {B(axn , sxn ) : n ∈ N}
S with xn ∈ A for each n ∈ N.
Now let J = {ixn ∈ I : n ∈ N}. Then A ⊆ Oi . Indeed, if x ∈ A, then x ∈ B(ax , sx ) and
i∈J S
choosing n ∈ N such that axn = ax and sxn = sx we find that x ∈ Oixn ⊆ Oi ,
i∈J

Exercises
Exercise 1.1. Let S = R and F = {A ⊆ R : A ⊆ [0, 1] or Ac ⊆ [0, 1]}. Is F a ring?
Exercise∗ 1.2. Prove that the collection in Example 1.7 (b) is a σ-algebra.
Exercise 1.3.
(a) Prove Proposition 1.9.
(b) Give an example of two σ-algebras A and B on S = {1, 2, 3} such that A∪B is not a σ-algebra.
Exercise∗ 1.4. Let S be a set and let F = {{s} : s ∈ S} be the collection consisting of all
sets which contain one element of S. Show that σ(F) coincides with the countable-cocountable
σ-algebra of Example 1.7.
Hint: Use the followsing (well-known) facts: The subset of a countable set is again countable;
The countable union of countable sets is again countable.
Exercise 1.5. Show that N, Q, R \ Q ∈ B(R). That is N, Q and R \ Q are Borel subsets of R.
Exercise∗ 1.6. Consider the following collection B0 = {(−∞, x) : x ∈ R}) of subsets of R.
(a) Show that σ(B0 ) contains all open intervals.
(b) Show that every open set in R can be written as the union of countably many open intervals.
Hint: Use Lindelöf’s Lemma 1.15.
(c) Conclude that σ(B0 ) = B(R).
Exercise∗ 1.7. Let I d ⊆ F d be the collection of half-open rectangles of Example 1.8. Prove the
following assertions:
(a) If I, J ∈ I d then I ∩ J ∈ I d .
(b) If I, J ∈ I d , then I \J is the union of finitely many disjoint sets from I d , and thus I \J ∈ F d .
Hint: Use induction on the dimension d. Use Example 1.8 (b) for d = 1.
(c) Each A ∈ F d can be written as union of finitely many disjoint sets in I d .
n
Ik with I1 , . . . , In ∈ I d .
S
Hint: Use induction on n to prove this for all sets of the form A =
k=1
(d) F d is a ring.
Exercise∗∗ 1.8. Prove that a σ-algebra is either finite or uncountable.7
Hint: Recall that P(N) is uncountable.

7This shows that σ-algebras are either easy finite sets or quite complicated
6 MEASURE AND INTEGRATION

2. Measures
Definition 2.1. Let S be a set and R ⊆ P(S) a ring. Let µ : R → [0, ∞] be a function with8
µ(∅) = 0
(i) µ is called additive if for all disjoint9 A1 , . . . , An ∈ R one has
[ n  X n
µ Aj = µ(Aj ).
j=1 j=1

S
(ii) µ is called σ-additive if for each disjoint sequence (An )n≥1 in R which satisfies An ∈ R
n=1
it holds that

[  X∞
µ Aj = µ(Aj ).
j=1 j=1

Remark 2.2.
(1) By an induction argument it suffices to consider n = 2 in the definition of additive. (See
Exercise 2.1).
(2) If µ is σ-additive, then it is additive as follows by taking Am = ∅ for m ≥ n + 1.
Definition 2.3. Let S be a set and let A be a σ-algebra on S.
(i) The pair (S, A) is called a measurable space.
(ii) A function µ : A → [0, ∞] which satisfies µ(∅) = 0 and which is σ-additive on A is called a
measure. In this case the triple (S, A, µ) is called a measure space.
(iii) If additionally to (ii) µ(S) < ∞, then µ is called a finite measure. If moreover, µ(S) = 1,
then µ is called a probability measure and (S, A, µ) is called a probability space. 10
Example 2.4 (Counting measure). Let S = N and A = P(N). We write #A for the number of
elements of a finite set A, and we set #A = ∞ if A is infinite. Let µ : A → [0, ∞] be given by
µ(A) = #A. Then µ is a measure. Often µ is denoted by τ and called the counting measure.
Example 2.5 (Dirac measure/Dirac’s delta function). Let S = R, A = P(R). Let x ∈ R. Let
δx : A → [0, ∞] be given by δx (A) = 1 if x ∈ A and δx (A) = 0 if x ∈ R \ A. Then δx is a measure.
It is usually called the Dirac measure11 at x.
Example 2.6. Let S be a set and let µ : P(S) → [0, ∞] be given by µ(∅) = 0 and µ(A) = 1 if
A 6= ∅. If S contains at least two elements, then µ is not a measure.
Example 2.7 (Length of an interval). 12 Let R = F 1 as in Example 1.8. For a ≤ b let λ((a, b]) =
b − a (length of the interval). If A = (a1 , b1 ] ∪
P.n. . ∪ (an , bn ] is a union of disjoint sets such that
aj ≤ bj for j = 1, . . . , n, then we let13 λ(A) = j=1 bj − aj . Then λ is additive. Later we will see
that λ is σ-additive on F 1 and has an extension to a measure on σ(F 1 ) = B(R).
Theorem 2.8. Let R be a ring and µ : R → [0, ∞] be additive. The following assertions hold:
(i) If A, B ∈ R and A ⊆ B, then µ(A) ≤ µ(B) (monotonicity).

S
(ii) If A1 , A2 , . . . ∈ R are disjoint and Aj ∈ R, then
j=1

[  X∞
µ Aj ≥ µ(Aj ).
j=1 j=1

8This assumption will always be made.


9Here we mean A ∩ A = ∅ if i 6= j
i j
10Measure theory is at the very heart of probability theory. See the third year elective course
11Named after the English theoretical physicist Paul Dirac 1902-1984. The “delta function” is actually not a
function, but can be interpreted as a generalized function or as measure.
12See Section 3 for a further construction of the so-called Lebesgue measure
13One can check that this does not depend on the way we write the set A. See below Definition 3.6.
MEASURE AND INTEGRATION 7


S
(iii) If A1 , A2 , . . . ∈ R and Aj ∈ R and µ is σ-additive on R, then
j=1

[  X∞
µ Aj ≤ µ(Aj ) (σ-subadditivity).
j=1 j=1

Proof. (i): Write B = A ∪ (B \ A). Then


µ(B) = µ(A ∪ (B \ A)) = µ(A) + µ(B \ A) ≥ µ(A).
n
S ∞
S
(ii): Let n ∈ N. Then Aj ⊆ Aj and therefore by additivity of µ
j=1 j=1
n
X n
[ ∞
 (i)  [ 
µ(Aj ) = µ Aj ≤ µ Aj .
j=1 j=1 j=1

The result follows by letting n → ∞.


(iii): Let
(2.1) B1 = A1 , B2 = A2 \ A1 , B3 = A3 \ (A1 ∪ A2 ), etc.

S ∞
S
Then (Bn )n≥1 is a disjoint sequence and Bj = Aj . Therefore, by the σ-additivity of µ
j=1 j=1

 ∞
[   ∞
[  ∞
X ∞
(i) X
µ Aj =µ Bj = µ(Bj ) ≤ µ(Aj ).
j=1 j=1 j=1 j=1

,
Let (an )n≥1 be a sequence of real numbers. We write an ↑ a if (an )n≥1 is an increasing sequence
which converges to a. Similarly, we write an ↓ a if it decreases and converges to a. This notation
will now be extended to sets.
Definition 2.9. Let S be a set.
(i) A sequence (An )n≥1 of subsets of S will be called increasing if An ⊆ An+1 for all n ∈ N.

S
In this case we write An ↑ A, where A = An .
n=1
(ii) A sequence (An )n≥1 of subsets of S will be called decreasing if An ⊇ An+1 for all n ∈ N.

T
In this case we write An ↓ A, where A = An .
n=1

Theorem 2.10. Let (S, A, µ) be a measure space and let (An )n≥1 be a sequence in A.
(i) If An ↑ A, then µ(An ) ↑ µ(A).
(ii) If An ↓ A and µ(A1 ) < ∞, then µ(An ) ↓ µ(A).
Proof. (i): Define (Bn )n≥1 as in (2.1). Then (Bn )n≥1 is a disjoint sequence and the following

S n
S
identities hold: Bj = A and Bj = An . Therefore, the σ-additivity of µ gives
j=1 j=1

X n
X
µ(A) = µ(Bj ) = lim µ(Bj ) = lim µ(An ).
n→∞ n→∞
j=1 j=1

(ii): See Exercise 2.4. ,


The following result will help us to check σ-additivity on rings. It will be used in the construction
of the Lebesgue measure in Lemma 3.9.
Lemma 2.11 (Sufficient condition for σ-additivity). Let R be a ring and let µ : R → [0, ∞] be
such that µ(∅) = 0 and µ is additive. Suppose that for each sequence (An )n≥1 with An ↓ ∅ one
has µ(An ) → 0. Then µ is σ-additive on R.
8 MEASURE AND INTEGRATION


S
Proof. Let (Bj )j≥1 be a disjoint sequence in R with B := Bj ∈ R. We need to show that
j=1

X
(2.2) µ(B) = µ(Bj ).
j=1

S
Let An = Bj = B \ (B1 ∪ . . . ∪ Bn−1 ). Then An ∈ R and An ↓ ∅. Now the assumption yields
j=n
µ(An ) → 0. On the other hand
n−1
X
µ(B) = µ(An ∪ B1 ∪ B2 ∪ . . . ∪ Bn−1 ) = µ(An ) + µ(Bj ).
j=1
n−1
X
Therefore, µ(B) − µ(Bj ) = µ(An ) → 0 and (2.2) follows. ,
j=1

Exercises
Exercise 2.1. Let R be a ring on a set S. Assume µ : R → [0, ∞] satisfies µ(∅) = 0 and
µ(A ∪ B) = µ(A) + µ(B) for all sets A, B ∈ R with A ∩ B = ∅. Show that µ is additive.
Exercise 2.2. Let R be a ring on a set S. Let µ : R → [0, ∞] be additive.
(a) Prove that for A, B ∈ R with µ(A) < ∞ and A ⊆ B one has
µ(B \ A) = µ(B) − µ(A).
(b) Prove that for A, B ∈ R with µ(A) < ∞ one has
µ(A ∪ B) = µ(A) + µ(B) − µ(A ∩ B).
(c) Show that for any n ∈ N and any sets (Aj )nj=1 in R,
finite subadditivity µ(A1 ∪ . . . ∪ An ) ≤ µ(A1 ) + . . . + µ(An ).

X2.3. Let (an )n≥1 be numbers in [0, ∞). Set µ(∅) = 0 and define µ : P(N) → [0, ∞] by
Exercise
µ(A) = an . Prove that µ is a measure on N.
n∈A

Exercise∗ 2.4.
(a) Prove Theorem 2.10 (ii).
(b) Give an example of a measure space (S, A, µ) and sets An ∈ A such that An ↓ ∅ and
µ(An ) = ∞ for all n ∈ N.
Hint: Use the counting measure.
Exercise∗ 2.5. Let (S, A, µ) be a measure space. For A1 , A2 , . . . ⊆ S define
∞ [
\ ∞
lim sup An = An .
n→∞
k=1 n=k
(a) Show that s ∈ lim sup An if and only if there are infinitely many n ∈ N such that s ∈ An .
n→∞
(b) Assume A1 , A2 , . . . ∈ A. Show that lim sup An ∈ A.
n→∞

X
(c) Assume A1 , A2 , . . . ∈ A satisfy µ(An ) < ∞. Show that µ(lim sup An ) = 0.14
n→∞
n=1

Exercise∗ 2.6. Let A be the σ-algebra from Example 1.7 (b) with S = R. Define µ : A → [0, ∞]
by µ(A) = 0 if A is countable and µ(A) = 1 if Ac is countable. Show that µ is a measure.

14This is called the Borel-Cantelli lemma in probability theory.


MEASURE AND INTEGRATION 9

3. Construction of measures
It is not a simple task to construct a measure. In this section we will construct the Lebesgue
measure on Rd of which we have previously shown it is an additive mapping on the ring F 1 in
Example 2.7. To extend it to the Borel σ-algebra we use a deep result of Carathéodory.15 His
result basically says that it is enough to check that a measure is σ-additive on a ring generating
the desired σ-algebra. A detailed proof can be found in Theorem B.6 in the appendix, but it will
do no harm if one takes the result for granted.16
Theorem 3.1 (Carathéodory’s extension theorem). Let S be a set and let R ⊆ P(S) be a ring.
Suppose that µ : R → [0, ∞] is σ-additive on R and µ(∅) = 0. Then µ extends to a measure µ on
σ(R). More precisely, there exists a measure µ on (S, σ(R)) such that µ(A) = µ(A) for all A ∈ R.
Remark 3.2. The measure µ is often unique.17 When there is no danger of confusion we will write
µ again for the extension to σ(R). However, in general one has to be careful about uniqueness.
For instance if we define µ on the ring F 1 by µ(A) = ∞ if A ∈ F 1 is nonempty, then µ has at
least two extensions: the counting measure on B(R) is an extension of µ, but also the measure
ν : B(R) → [0, ∞] given by ν(A) = ∞ if A 6= ∅ is an extension of µ.
We continue with a uniqueness result which will be proved in the appendix.
Definition 3.3 (π-system). A collection E ⊆ P(S) is called a π-system if for all A, B ∈ E one
has A ∩ B ∈ E.
Example 3.4.
(a) Every ring is a π-system.
(b) Let S = Rd . The half-open rectangles I d are a π-system.
The following result will be proved in Proposition A.7.
Proposition 3.5 (Uniqueness). Let µ1 and µ2 both be measures on measurable space (S, A).
Assume the following conditions:
(i) E ⊆ A is a π-system with σ(E) = A;
(ii) µ1 (S) = µ2 (S) < ∞ and µ1 (E) = µ2 (E) for all E ∈ E.
Then µ1 = µ2 on A.
We continue with the construction of the Lebesgue 18 measure λ. In Example 1.8 we intro-
duced the half-open intervals (a, b] ∈ I 1 with a ≤ b. Also recall that F 1 denotes the collection
of all finite unions of half-open intervals. We have seen that F 1 is a ring. Moreover, in of course
every set in F 1 can be written as a finite union of disjoint half-open intervals.
Definition 3.6 (on unions of half-open intervals). For A ∈ F 1 of the form A = (a1 , b1 ] ∪ . . . ∪
(an , bn ] with disjoint ((aj , bj ])nj=1 in I 1 define λ : F 1 → [0, ∞] as the sum of the lengths:
n
X
λ(A) = (bj − aj ).
j=1

The above is well-defined. To see this assume A = (c1 , d1 ]∪. . . (cm , dm ] is another representation
of A as a union of disjoint intervals. Let Iij = (ci , di ] ∩ (aj , bj ]. Then either Iij is empty or a half
open interval,
m
[ [n
Iij = (aj , bj ] and Iij = (cj , dj ].
i=1 j=1

15Carathéodory 1873-1950 was a Greek mathematician working in Analysis, but also on Thermodynamics.
16The appendix is not part of the exam
17For instance when µ is a finite measure. See Proposition A.7.
18Henri Lebesgue 1875–1941 was a French mathematician well-known for his integration theory. See Section 5.
10 MEASURE AND INTEGRATION

From the definition and the disjointness of the (Iij )m,n


i,j=1 we obtain
n
X n m
[  Xn X
m m X
n m n
[  Xm
 X X X 
λ (aj , bj ] = λ Iij = λ(Iij ) = λ(Iij ) = λ Iij = λ (ci , di ] ,
j=1 j=1 i=1 j=1 i=1 i=1 j=1 i=1 j=1 i=1

which proves the well-definedness. Alternatively, one can observe that λ(A) coincides with the
Riemann integral of 19 1A . Indeed, fix an interval I such that A ⊆ I. By linearity of the Riemann
integral
Z Xn Z n
X
1A dx = 1(aj ,bj ] dx = (bj − aj ) = λ(A).
I j=1 I j=1

In Example 1.8 we introduced the half-open rectangles (a, b] ∈ I d with a = (α1 , . . . , αd ) and
b = (β1 , . . . , βd ) and αj ≤ βj for j ∈ {1, . . . , d}. Also recall that F d denotes the collection of all
finite unions of half-open rectangles. By Exercise 1.7 (d), F d is a ring. Moreover, in Exercise
1.7 (c) it was shown that every set in F d can be written as a finite union of disjoint half-open
rectangles.
Definition 3.7 (on unions of half-open rectangles). For a half open rectangle I = (a, b] ∈ I d with
a = (α1 , . . . , αd ) and b = (β1 , . . . , βd ) let its volume be denoted by
d
Y
|I| = (βj − αj ).
j=1

For A ∈ F d of the form A = I1 ∪ . . . ∪ In with disjoint (Ij )nj=1 in I d define λd : F d → [0, ∞] by


n
X
λd (A) = |Ij |.
j=1

This is well-defined since for a rectangle R ⊆ Rd with A ⊆ R, we again have


Z
λd (A) = 1A dx,
R
where the latter is a d-dimensional Riemann integral.
Remark 3.8.
(1) When the dimension is fixed and there is no danger of confusion we will write λ for λd . It is
common to write |A| for A ∈ F d as well.
(2) For A ∈ F d , λ(A) = λd (A) = |A| equals the volume of A.
Next we want to extend λd to σ(F d ) = B(Rd ) (see Exercise 3.1 for this identity). To apply
Theorem 3.1, we first need to check the σ-additivity of λ on F d . This will be done via Lemma
2.11.
Lemma 3.9. The function λ : F d → [0, ∞] is σ-additive on F d .
Proof. By Lemma 2.11 it suffices to prove each sequence (An )n≥1 in F d with An ↓ ∅, satisfies
µ(An ) → 0. Fix ε > 0. We have to find N ∈ N such that λ(An ) < ε for all n ≥ N .
Step 1: For each n ∈ N choose a Bn ∈ F d such that Bn ⊆ An and λ(An \ Bn ) ≤ 2−n ε.20 Since

Bn = ∅. It follows that {(Bn )c : n ∈ N} is an open cover of the set A1 which is
T
Bn ⊆ An also
n=1
N
(Bn )c . It
S
compact by the Heine-Borel theorem. Therefore, there exists an N such that A1 ⊆
n=1
N N
Bn ⊆ Ac1 . Since all for all n ≥ 1, Bn ⊆ A1 , we must have that
T T
follows that Bn = ∅.
n=1 n=1

19Recall that 1 (x) = 1 if x ∈ A and 1 = 0 if x ∈ Ac .


A A
20So we just choose a set B which is slightly smaller than A .
n n
MEASURE AND INTEGRATION 11

n
T n
S n
S
Step 2: Let Cn = Bj for n ≥ 1. For every n ≥ 1, An \ Cn = (An \ Bj ) ⊆ (Aj \ Bj ).
j=1 j=1 j=1
Therefore, using Theorem 2.8 (i) in (∗) and Exercise 2.2 (c) in (∗∗), we find
(∗)  [n n
 (∗∗) X Xn
λ(An \ Cn ) ≤ λ (Aj \ Bj ) ≤ λ(Aj \ Bj ) ≤ 2−j ε < ε.
j=1 j=1 j=1

Since Cn = ∅ for all n ≥ N , we can conclude that λ(An ) = λ(An \ Cn ) < ε for every n ≥ N . ,
We can now deduce the main result of this section.
Theorem 3.10 (Lebesgue measure). There exists a unique measure λ on (Rd , B(Rd )) such that
for all half-open rectangles I, one has λ(I) = |I|, where |I| is the volume of I. Moreover, for all
h ∈ Rd and A ∈ B(Rd ), λ(A + h) = λ(A).21
In the above A + h := {x + h : x ∈ A}.
Proof. Step 1: Existence. In Lemma 3.9 we have shown that λ is σ-additive on the ring F d .
Therefore, by Theorem 3.1 λ extends to a measure on σ(F d ) = B(Rd ) (see Exercise 3.1).
Step 2: Uniqueness. Let µ be another measure such that µ(I) = |I| for half-open rectangles
I ∈ I d . Fix n ∈ N and let Sn = (−n, n]d . Define λ(n) and µ(n) on B(Rd ) by
λ(n) (A) = λ(A ∩ Sn ) and µ(n) (A) = µ(A ∩ Sn ).
Then λ(n) and µ(n) are measures and λ(n) (Rd ) = λ(Sn ) = |Sn | and similarly µ(n) (Rd ) = |Sn |.
Since λ(n) and µ(n) coincide on I d , it follows from Example 3.4 (b) and Proposition 3.5 that
λ(n) = µ(n) on B(Rd ). Therefore, for any A ∈ B(Rd ), since A ∩ Sn ↑ A Theorem 2.10 yields
λ(n) (A) = λ(A ∩ Sn ) → λ(A) and µ(n) (A) = µ(A ∩ Sn ) → µ(A).
Thus λ(A) = µ(A).
Step 3: Translation invariance: Let h ∈ Rd . We claim that for every A ∈ B(Rd ) one has
A + h ∈ B(Rd ). For this let Ah = {A ∈ B(Rd ) : A + h ∈ B(Rd )}. By definition Ah ⊆ B(Rd ).
One can check that Ah is a σ-algebra. For each open set A one has A + h is open and hence
A + h ∈ B(Rd ). Therefore, B(Rd ) = σ({open sets}) ⊆ Ah , and the claim follows.
Define µh on B(Rd ) by µh (A) = λ(A + h). Then µh is a measure and for any half-open
rectangle I, µh (I) = |I + h| = |I| = λ(I). By the uniqueness of step 2, we find µh (A) = λ(A) for
all A ∈ B(Rd ) and this proved the result. ,
Remark 3.11. From Theorem B.6 one can actually see that for any A ∈ B(Rd ),

nX ∞
[ o
λ(A) = inf |Ij | : A ⊆ Ij , where (Ij )j≥1 are disjoint half-open rectangles ,
j=1 j=1

but we will not use this formula.

Exercises on the Lebesgue measure


If the dimension is fixed we write λ instead of λd for simplicity.
Exercise∗ 3.1. Let F d be as in Example 1.8. Show that σ(F d ) is the Borel σ-algebra B(Rd ).
Hint: Use the Lindelöf Lemma 1.15.
Exercise 3.2.
(a) Show that any countable subset A ⊆ Rd is in B(Rd ).
Hint: First show that {x} ∈ B(Rd ) for every x ∈ Rd .
(b) Show that any countable subset A ⊆ Rd satisfies λ(A) = 0. In particular, λ(Qd ) = 0.
Hint: First show that λ({x}) = 0.
21This is called translation invariance. Up to a scaling factor λ is the only measure on B(Rd ) which satisfies
this property. See Exercise 3.6.
12 MEASURE AND INTEGRATION

Exercise∗ 3.3. For a, b ∈ Rd with a = (α1 , . . . , αd ) and b = (β1 , . . . , βd ) with αj ≤ βj for


j ∈ {1, . . . , k} let
(a, b) = (α1 , β1 ) × . . . × (αd , βd ) and [a, b] = [α1 , β1 ] × . . . × [αd , βd ]
be the open and closed rectangle, respectively. Prove that
  
λ (a, b) = λ [a, b] = λ (a, b]
and thus all coincide with the volume of the rectangle.
Hint: Use Theorem 2.10.
Exercise∗ 3.4 (Uncountable sets can have measure zero). Show that the Cantor set C ⊆ [0, 1] is
in B(Rd ) and satisfies λ(C) = 0.
Hint: What can you say about λ(Cn ) ?
Exercise∗ 3.5. For A ⊆ R and t ≥ 0 let tA = {tx : x ∈ A}. Show that for each A ∈ B(R),
λ(tA) = tλ(A).
Hint: Use the same method as in the proof of Theorem 3.10.
Exercise∗ 3.6. Let µ : B(R) → [0, ∞] be a measure such that for all h ∈ R and A ∈ B(R),
µ(A + h) = µ(A). Let c = µ((0, 1]) and assume c ∈ (0, ∞). Prove the following assertions: 22
 
(a) For each x ≥ 0 and q ∈ N,µ (0, qx] = qµ (0, x] .
(b) For each p, q ∈ N, µ (0, pq ] = c pq .

(c) For each x ≥ 0, µ (0, x] = cx.
(d) For each a ≤ b, µ (a, b] = c(b − a).
(e) For each A ∈ B(R), µ(A) = cλ(A).
Exercise∗∗ 3.7 (Lebesgue-Stieltjes23 measure). Let a < b and let F : R → R be right-continuous
and increasing. Show that µ((a, b]) = F (b) − F (a) for a ≤ b extends to a measure on B(R).
Exercises on general measures
Exercise∗ 3.8. Let A be a σ-algebra on S and let T ⊆ S. Define the restricted σ-algebra AT on
T by
AT = {A ∩ T : A ∈ A}.
(a) Show that AT is a σ-algebra.
(b) If T ∈ A, show that AT = {A ⊆ T : A ∈ A}.
(c) Let µ be a measure on (S, A). If T ∈ A show that the restriction of µ to AT is a measure
again.
(d) If A is the Borel σ-algebra on metric space (S, d), then AT coincides with the Borel σ-algebra
on (T, d).
Exercise 3.9 (Non-uniqueness of extensions I). Let S = {1, 2, 3, 4} and let F = {{1, 2}, {1, 3}}.
Define µ : F → [0, ∞] by µ({1, 2}) = µ({1, 3}) = 21 . Find two different extensions of µ to
σ(F) = P(S). Why does this not contradict Proposition 3.5?
Exercise∗ 3.10 (Non-uniqueness of extensions II). Let S = N and let F = {n, n+1, . . .} : n ∈ N .


(a) Show that F is a π-system.


(b) Show that σ(F) = P(N)
(c) Let τ be the counting measure and let µ : P(N) → [0, ∞] be defined by µ(A) = 2τ (A). Show
that τ = µ on F. Why does this not contradict Proposition A.7?

22From this exercise we see that up to a scaling factor, λ is the only translation invariant measure on B(R). The
case for dimensions d ≥ 2 holds as well and can be proved in a similar way.
23Thomas Stieltjes 1856-1894 was a Dutch mathematician working in Analysis. He has even worked in Delft.
MEASURE AND INTEGRATION 13

4. Measurable functions
One of the aims will be to integrate functions f : S → R with respect to a measure µ on a
measurable space (S, A). A way to do this is to use discretization in the range24 of f . So we would
like to know the measure of for instance the set Ay,ε = {s ∈ S : f (s) ∈ [y, y + ε]}. Knowing this
for all y ∈ R and all ε > 0 makes it possible estimate the “area” under f . Of course we do need
the sets to be in A to make this work. This is the motivation of the definition of measurability.
See Section 5 for details on integration.
The natural setting to introduce measurability of functions is a follows:
Definition 4.1. Let (S, A) and (T, B) be two measurable spaces. A function f : S → T is called
measurable if for each B ∈ B, one has25 f −1 (B) ∈ A.
Remark 4.2.
(1) The composition of two measurable function is again measurable (see Exercise 4.1).
(2) Instead of f −1 (B) or {s ∈ S : f (s) ∈ B} one sometimes writes {f ∈ B} for the same set.
(3) In probability theory measurable functions are called random variables.
It suffices to check measurability on a generating collection F ⊆ B:
Lemma 4.3. Let (S, A) and (T, B) be two measurable spaces and let f : S → T . Suppose F ⊆ B
is such that σ(F) = B. If f −1 (F ) ∈ A for all F ∈ F, then f is measurable.

Proof. Define Be = {B ∈ B : f −1 (B) ∈ A}. Our aim is to show that Be = B. We claim that Be
is a σ-algebra. Indeed, since f −1 (∅) = ∅ ∈ A also ∅ ∈ B.e Similarly, since f −1 (T ) = S ∈ A,
−1 −1
e then f (T \ B) = S \ f (B) ∈ A, which implies that B c ∈ B.
we find T ∈ B. e If B ∈ B, e If
B1 , B2 , . . . ∈ B, then
e

[  [∞
f −1 Bn = f −1 (Bn ) ∈ A.
n=1 n=1
Therefore, the claim follows.
Now since F ⊆ B, e the claim yields B = σ(F) ⊆ Be ⊆ B. This implies Be = B. ,
In the sequel a metric space X will always be equipped with its Borel σ-algebra B(X) (unless
otherwise stated).
Proposition 4.4 (Continuous mappings are measurable). Let (X, d) and (Y, ρ) be metric spaces.
If f : X → Y is continuous, then f is measurable.26
Proof. By the continuity of f we find that for all open O ⊆ Y the inverse image is f −1 (O) open
in X and hence in B(X). Since the open sets of Y generate the Borel σ-algebra, the result follows
from Lemma 4.3 with F = {O ⊆ Y : O is open}. ,
The most frequent case we will encounter is when f : S → R and (S, A) is a measurable space.
Unless otherwise stated we consider the Borel σ-algebra B(R) on R. The following characterization
of measurability will be useful.
Proposition 4.5 (Real valued functions). Let (S, A) be a measurable space. For f : S → R the
following are equivalent:
(i) f is measurable.
(ii) For all r ∈ R, one has f −1 ((−∞, r]) ∈ A.
(iii) For all r ∈ R, one has f −1 ((−∞, r)) ∈ A.

24In Riemann integration of functions f : Rd → R the discretization is always done in the domain of the function.
This is one of the major differences with Lebesgue integration
25Recall that f −1 (B) is called the inverse image of B by f and is defined by f −1 (B) = {s ∈ S : f (s) ∈ B}
26The same result holds for topological spaces and the proof is the same. In the setting of Borel-σ-algebras,
measurable functions are often called Borel measurable.
14 MEASURE AND INTEGRATION

Proof. (i) ⇒ (ii) and (i) ⇒ (iii) are trivial.


(iii) ⇒ (i): Let F = {(−∞, r) : r ∈ R}. In Exercise 1.6 we have seen that σ(F) = B(R).
Therefore, (i) follows from Lemma 4.3.
(ii) ⇒ (i): This can be proved as before by proving the required version of Exercise 1.6. ,
Example 4.6. Let S = {1, 2, 3} and A = {S, ∅, {1, 2}, {3}}. Let f : S → R be given by f (s) = 2016
if s = 1 and f (s) = 0 if s 6= 1. Then f is not measurable, because f −1 ({2016}) = {1} ∈
/ A. If we
replace A by (for example) A0 := {S, ∅, {1}, {2, 3}}, then f becomes measurable.
Recall that x ∨ y = max{x, y} and x ∧ y = min{x, y}.
Theorem 4.7. Let (S, A) be a measurable space. Let f, g : S → R be measurable functions and
let α ∈ R. Then the following functions are all measurable as well:
1
f + g, f − g, f · g, f ∨ g, f ∧ g, f + := f ∨ 0, f − := (−f ) ∨ 0, |f |, α · f, (if f 6= 0 on S).
f
Proof. We first claim that h : S → R2 given by h(s) = (f (s), g(s)) is measurable. To prove this
observe that for all half-open rectangles I = I1 × I2 ⊆ R2 , one has
h−1 (I) = {s ∈ S : f (s) ∈ I1 and g(s) ∈ I2 } = f −1 (I1 ) ∩ g −1 (I2 ) ∈ A.
By Exercise 3.1 σ(half open rectangles) = σ(F 2 ) = B(R2 ), we can use Lemma 4.3 to find that h
is measurable.
To prove the statements we use that continuous functions are measurable (see Proposition 4.4)
and the fact that the composition of measurable functions is again measurable (see Exercise 4.1).
For instance let ϕ : R2 → R be given by ϕ(x, y) = x + y. Then ϕ is continuous and therefore
measurable. Now writing f + g = ϕ ◦ h the required measurability follows.
The proofs for the difference, product, maximum, minimum are similar. Note that the maximum
x ∨ y = 12 (x + y) + 12 |x − y| and this is a continuous function from R2 to R. For the minimum
there is an analogue formula.
The measurability of f ± , |f |, α · f and f1 all follow in the same way by rewriting them as φ ◦ f
for a suitable continuous function φ.
The case f1 requires some comment. Let φ : R\{0} → R be given by φ(x) = x1 . Note that in this
case R \ {0} is a metric space on which φ is continuous, and therefore measurable by Proposition
4.4. Now for all open sets B ∈ B(R), C := φ−1 (B) is open in R \ {0} and hence in B(R). Thus
(φ ◦ f )−1 (B) = f −1 (C) ∈ A. Therefore, the measurability of φ ◦ f follows from Lemma 4.3. ,
Let R = [−∞, ∞]. It will be useful to introduce measurability of functions f : S → R as well.
For this, we introduce an analogue of the Borel σ-algebra on R.
Definition 4.8. Let B(R) be the the σ-algebra generated by the sets {∞}, {−∞} and B ∈ B(R).
The σ-algebra B(R) will be called the Borel σ-algebra of R.
From Lemma 4.3 we see that a function f : S → R is measurable if and only if {s ∈ S : f (s) =
±∞} ∈ A and f −1 (B) ∈ A for each B ∈ B(R). We extend addition, multiplication, etc. to R in
the following way:
∞ + a = a + ∞ = ∞, for all a ∈ (−∞, ∞]
−∞ + a = a − ∞ = −∞, for all a ∈ [−∞, ∞)
∞ · a = a · ∞ = ∞, for all a ∈ (0, ∞]
∞ · a = a · ∞ = −∞, for all a ∈ [−∞, 0)
a a
∞·0=0·∞= = =0 for all a ∈ (−∞, ∞)
∞ −∞
In this setting Theorem 4.7 remains true27 for functions f, g : S → R.

27We do not define ∞ − ∞, so some cases need to be excluded. For the proof one additionally needs to check
in each of the cases that inverse images of {∞}, {−∞}, {0} are measurable. We leave this to the reader.
MEASURE AND INTEGRATION 15

The next result shows that measurability is preserved under taking countable suprema, count-
able infimum, limits of sequences, etc. For a sequence of numbers (xn )n≥1 in R, let
lim sup xn = lim sup xn and lim inf xn = lim inf xn .
n→∞ k→∞ n≥k n→∞ k→∞ n≥k

The limit of yk := supn≥k xn exists28 since (yk )k≥1 is decreasing. Moreover,


(4.1) lim sup xn = lim yk = inf yk = inf sup xn .
n→∞ k→∞ k≥1 k≥1 n≥k

Similar formulas hold for the lim inf n→∞ xn . Recall that the lim supn→∞ xn and lim inf n→∞ xn
always exist. Moreover they both coincide with limn→∞ xn if and only if (xn )n≥1 converges in R.
Theorem 4.9. Let (S, A) be a measurable space. For each n ∈ N let fn : S → R be a measurable
function. Then each of the following functions is measurable as well:29
sup fn , inf fn , lim sup fn , lim inf fn .
n≥1 n≥1 n→∞ n→∞

30
Moreover, if fn → f pointwise , then f is measurable again.
Proof. Let g = supn≥1 fn . Then for each r ∈ R,

\
g −1 ([−∞, r]) = {s ∈ S : g(s) ≤ r} = {s ∈ S : fn (s) ≤ r for all n ∈ N} = fn−1 ([−∞, r]) ∈ A.
n=1

Since σ({[−∞, r] : r ∈ R}) = B(R), the measurability of g follows from Lemma 4.3. The case of
infima follows from inf n≥1 fn = − supn≥1 (−fn ).
By (4.1) we can write lim supn→∞ fn = inf k≥1 supn≥k fk . Therefore, the measurability follows
from the previous cases. The remaining cases follow from lim inf n→∞ fn = − lim supn→∞ (−fn )
and limn→∞ fn = lim supn→∞ fn . ,
Definition 4.10. A function f : S → R is called a simple function31 if f is measurable and
takes only finitely many values.
Letting x1 , . . . , xn ∈ R denote the distinct values of f and Aj = {s ∈ S : f (s) = xj }, we can
always represent a simple function as
X n
f= xj · 1Aj .
j=1

Of course if xj = 0 for some j ∈ {1, . . . , n}, we could leave it out from the sum.
Example 4.11. Let S = R and A = B(R). The following functions are simple functions:
(a) f = π · 1(0,1) − 4 · 1(13,14] + 5 · 1Z∩(−∞,0) .
(b) f = 1Q .
Next we show that measurable functions can be written as limits of simple functions. For this
we discretize in the range space in a suitable way. The result plays a crucial role in the integration
theory in Section 5.
Theorem 4.12. Let (S, A) be a measurable space. 32
(i) Let f : S → [0, ∞] be measurable. Then there exists a sequence of simple functions (fn )n≥1
such that 0 ≤ f1 (s) ≤ f2 (s) ≤ . . . and limn→∞ fn (s) = f (s) for each s ∈ S.
(ii) Let f : S → R be measurable. Then there exists a sequence of simple functions (fn )n≥1 such
that limn→∞ fn (s) = f (s) for all s ∈ S.

28Here we allow divergence to ±∞


29Here it is important what we work with countable suprema, infimum, etc.
30Here we allow divergence to ±∞
31In part of the literature this is called a step function, but we will use this name for a different class of functions
32In the following we allow divergence to ±∞
16 MEASURE AND INTEGRATION

33
Proof. (i): For each n ∈ N and j ∈ {0, 1, . . . , 4n − 1}, let
An,j = {s ∈ S : j2−n ≤ f (s) < (j + 1)2−n } and An = {s ∈ S : f (s) ≥ 2n }.
Then for each n, j one has An,j , An ∈ A. Define34
n
4X −1
n j
fn = 2 1An + 1A .
j=0
2n n,j
It is clear that each fn takes finitely many values. Moreover, by Exercise 4.5 and Theorem 4.7,
each fn is measurable. Thus each fn is a simple function.
Now fix s ∈ S. We first prove 0 ≤ fn (s) ≤ fn+1 (s) for each n ∈ N. First assume f (s) < 2n .
Then selecting the unique j ∈ {0, . . . , 4n − 1} such that s ∈ An,j we find that fn (s) = j2−n .
Similarly, we can select k ∈ {0, . . . , 4n+1 − 1} such that s ∈ An+1,k and we find that fn+1 (s) =
k2−(n+1) . Since, f (s) ≥ j2−n = 2j2−(n+1) , we find that k ≥ 2j and therefore
fn (s) = j2−n ≤ k2−(n+1) = fn+1 (s).
The case f (s) ≥ 2n can be treated similarly and is left to the reader.
To prove that fn (s) → f (s), first assume f (s) < ∞. Let ε > 0. Choose N ∈ N so large that
f (s) < 2N and 2−N < ε. Let n ≥ N . Selecting j ∈ {0, . . . , 4n − 1} such that s ∈ An,j we find that
|f (s) − fn (s)| ≤ 2−n ≤ 2−N < ε.
Therefore, fn (s) → f (s) in this case. If f (s) = ∞, then fn (s) = 2n for every n ∈ N and thus
fn (s) → ∞ = f (s).
(ii): Write f = f + − f − . Then by (i) we can find simple functions fn,+ , fn,− : S → R such
that fn,+ → f + and fn,− → f − . Let fn = fn,+ − fn,− for n ∈ N. Define the sets A+ and A− by
A± = {s ∈ S : ±f (s) ∈ [0, ∞]}. Then A+ ∪ A− = S. If s ∈ A+ , then
fn (s) = fn,+ (s) → f + (s) = f (s).
The case s ∈ A− is similar.
,

Exercises
Exercise 4.1. Let (Sj , Aj ) for j = 1, 2, 3 be measurable spaces. Assume f : S1 → S2 and
g : S2 → S3 are both measurable. Show that the composition g ◦ f : S1 → S3 is measurable.
Exercise 4.2. Let (S, A) be a measurable space. Let f, g : S → R be measurable functions. Show
that {s ∈ S : f (s) = g(s)} ∈ A and {s ∈ S : f (s) < g(s)} ∈ A.
Exercise 4.3. Let (S, A) be a measurable space. Let f : S → R be a measurable function and
p ∈ (0, ∞). Show that the function |f |p is measurable.
Exercise 4.4. Let S be a set. For a function f : S → R let Af = {f −1 (B) : B ∈ B(R)}. Show
that Af is a σ-algebra.35
Exercise 4.5. Let (S, A) be a measurable space and let A ⊆ S. Show that A ∈ A if and only if
the function 1A : S → R is measurable.
Exercise 4.6. Let (S, A) be a measurable space. Let f1 , f2 , P. . . : S → R be measurable functions

and A1 , A2 , . . . ∈ A be disjoint. Show that the function f = n=1 1An fn is measurable.
Exercise∗ 4.7 (Vector-valued functions). Let (S, A) be a measurable space. For each j ∈ {1, . . . , d}
let fj : S → R be a function and let f : S → Rd be given by f = (f1 , . . . , fd ). Prove that f is
measurable if and only if fj is measurable for each j ∈ {1, . . . , d}.
Hint: For the “if part” one can use the same technique as in Theorem 4.7.
33For the proof make a picture where you make a partition into intervals of length 2−n on the y-axis. For the
picture put the set S is on the x-axis
34The idea is that for each n ∈ N we approximate f up to 2−n on the set {f < 2n }.
35The σ-algebra A is called the σ-algebra generated by f . It is the smallest σ-algebra for which f is measurable.
f
MEASURE AND INTEGRATION 17

Exercise∗ 4.8 (Monotone functions). Assume that f : R → R is increasing. Show that f is


measurable, where as usually on R we consider the Borel σ-algebra.
Hint: Use Proposition 4.5.
Exercise∗ 4.9 (Set of convergence). Let (S, A) be a measurable space. Let f1 , f2 , . . . : S → R be
measurable functions. Define
A = {s ∈ S : (fn (s))n≥1 is convergent}.
(a) Explain why A = {s ∈ S : (fn (s))n≥1 is a Cauchy sequence}.
(b) Show that for each k, n, m ∈ N the set A(k, n, m) = {s ∈ S : |fn (s) − fm (s)| < k1 } is in A.
∞ S
T ∞ ∞
T T∞
(c) Show that A = A(k, n, m).
k=1 N =1 m=N n=N
Hint: s ∈ A if and only if ∀k ∈ N ∃N ∈ N ∀n ∈ N ∀m ∈ N : |fn (s) − fm (s)| < k1 and connect
the universal quantifier with an intersection and the existential quantifier with a union.
(d) Conclude that A ∈ A.
18 MEASURE AND INTEGRATION

5. Construction of the integral


In this section (S, A, µ) is a measure space. Our goal is to construct an integral for measurable
functions f : S → R. Notation:
Z Z
f dµ or f (s) dµ(s).
S S

The integral will be built in three steps:


(1) For simple functions f : S → [0, ∞);
(2) For measurable functions f : S → [0, ∞];
(3) For (certain) measurable functions f : S → R.
The advantage of this setting is that it works for any measure space (S, Σ, µ). Moreover, in the
special case of the Lebesgue measure it extends the Riemann integral.
It will be convenient to use the following terminology.
Definition 5.1. For measurable functions f, g : S → R we say that f = g almost everywhere
if µ({s ∈ S : f (s) 6= g(s)}) = 0. Notation: f = g a.e.36
Similarly, one can define f < ∞ a.e., etc.

5.1. Integral for simple functions.


Definition 5.2 (Integral for simple functions). Let f : S → [0, ∞] be a simple function given by
n
X
(5.1) f= xj · 1Aj ,
j=1

with x1 , . . . , xn ∈ [0, ∞] and (Aj )nj=1 disjoint sets in A. For E ∈ A let37


Z Xn
f dµ = xj · µ(E ∩ Aj ).
E j=1

This is called the integral of f over the set E.


Remark 5.3. By using a common refinement one checks that if a different representation for the
functionRf from (5.1) is used, then the integral over E gives the same value (see Lemma 5.4 (ii)).
Clearly E f dµ ∈ [0, ∞] for every E ∈ A.
We continue with some basic properties of the integral.
Lemma 5.4. Let f, g : S → [0, ∞] be simple functions. Then the following hold:
R R
(i) For all E ∈ A, E f dµ = S 1E f dµ. R R
(ii) (monotonicity I) If E ∈ A and f ≤ g on E, then ER f dµ ≤ ER g dµ.
(iii) (monotonicity II) If E, F ∈ A satisfy E ⊆ F R, then E f dµ ≤ FRf dµ. R
(iv) (linearity) For all E ∈ A and α, β ∈ [0, ∞), RE αf + βg dµ = R α E f dµR+ β E g dµ.
(v) (additivity) For all disjoint sets E1 , E2 ∈ A, E1 ∪E2 f dµ = E1 f dµ + E2 f dµ.
(vi) S f dµ = 0 if and only if f = 0 almost everywhere.38.
R

Proof. (i). This is immediate from the definition and the identity 1E∩A = 1E · 1A .
For the proof of the remaining assertions we continue with a preliminary observation. Write
Pm m Pn
f = i=1 xi · 1Ai with (Ai )m
S
i=1 disjoint sets in A with Ai = S, and g = j=1 yj · 1Bj with
i=1

36In probability theory this is usually called almost surely and this is abbreviated as a.s.
37Here we use the convention 0 · ∞ = 0
38See Definition 5.1
MEASURE AND INTEGRATION 19

m
(Bj )nj=1 disjoint sets in A with Bi = S. Let Ci,j = Ai ∩ Bj for all i and j. Then (Ci,j )m,n
S
i,j=1 are
i=1
disjoint sets in A. By additivity
Z Xm m X
X n
(5.2) f dµ = xi · µ(E ∩ Ai ) = xi · µ(E ∩ Ci,j )
E i=1 i=1 j=1
Z n
X Xn X m
(5.3) g dµ = yj · µ(E ∩ Bj ) = yj · µ(E ∩ Ci,j ).
E j=1 j=1 i=1

(ii): For disjoint sets A, B ⊆ S one has 1A∪B = 1A + 1B and hence


m X
X n n X
X m
(5.4) 1E f = xi · 1E∩Ci,j and 1E g = yj · 1E∩Ci,j .
i=1 j=1 j=1 i=1

Therefore, xi ≤ yj whenever i, j satisfy E ∩ Ci,j 6= ∅. This together with (5.2), (5.3), yields (ii).
(iii): This follows from µ(E ∩ Ai ) ≤ µ(F ∩ Ai ) whichPis immediate from the monotonicity of µ.
m Pn
(iv): By (5.4) with E = S, we can write αf + βg = i=1 j=1 (αxi + βyj ) · 1Ci,j . Therefore,
Z Xm Xn
αf + βg dµ = (αxi + βyj )µ(E ∩ Ci,j )
E i=1 j=1
Xm X n X m
n X Z Z
=α xi µ(E ∩ Ci,j ) + β yj µ(E ∩ Ci,j ) = α f dµ + β g dµ,
i=1 j=1 j=1 i=1 E E

where we used (5.2) and (5.3).


(v): Since 1E1 ∪E2 f = 1E1 f + 1E2 f ,
Z Z Z Z Z Z
(i) (iv) (i)
f dµ = 1E1 ∪E2 f dµ = 1E1 f dµ + 1E2 f dµ = f dµ + f dµ.
E1 ∪E2 S S S E1 E2

(vi): By leaving out some of those k for which S xk = 0, we canR assume xiP > 0 for all i. Let
n m
A = {s ∈ S : f (s) > 0} and observe that A = i=1 Ai . Now S f dµ = i=1 xi µ(Ai ) with
µ(Ai ) ≥P0 for each i. Therefore, if the integral is zero, then µ(Ai ) = 0 for each i and hence
n
µ(A) = i=1 µ(Ai ) = 0. Conversely, if f = 0 a.e., monotonicity yields µ(Ai ) ≤ µ(A) = 0, and the
result follows. ,

Example 5.5. Let λ be the Lebesgue measure on R. Since Q ∈ B(R) it Rfollows from Exercises 1.5
and 4.5 that 1Q is measurable and by Exercise 3.2 for any E ∈ B(R), E 1Q dλ = λ(E ∩ Q) = 0.
Note that 1Q is not Riemann integrable on any interval [a, b] with a < b. More on the Riemann
integral can be found in Example 5.16.
5.2. Integral for positive measurable functions. In order to extend the definition of the
integral to arbitrary measurable functions f : S → [0, ∞] we use the following lemma. In the
sequel, we write fn ↑ f if for all s ∈ S, (fn (s))n≥1 is increasing and fn (s) → f (s).
Lemma 5.6 (Consistency). Let f : S → [0, ∞] be a measurable function. Suppose that (fn )n≥1
Z such that 0 Z≤ fn ↑ f . Suppose g : S → [0, ∞] is a simple function
is sequence of simple functions
such that 0 ≤ g ≤ f . Then g dµ ≤ lim fn dµ for every E ∈ A.
E n→∞ E
Z
Observe that lim fn dµ exists in [0, ∞], because it is an increasing sequence of real numbers.
n→∞ E

Proof. Let ε ∈ (0, 1) and set En = {s ∈ E : (1 − ε)g(s) ≤ fn (s)} for n ∈ N. Then using the
indicated parts of Lemma 5.4 in the estimates below, we find
(ii) (iii) (ii)
Z Z Z Z Z
(iv)
(1 − ε) g dµ = (1 − ε)g dµ ≤ fn dµ ≤ fn dµ ≤ lim fn dµ.
En En En E n→∞ E
20 MEASURE AND INTEGRATION

Pm
It remains to prove En g dµ → E g dµ. For this write g = i=1 xi · 1Ai with (Ai )m
R R
i=1 in A disjoint
and x1 , . . . , xm ≥ 0. Since En ∩ Ai ↑ E ∩ Ai as n → ∞, Theorem 2.10 yields
Z Xm m
X Z
(5.5) g dµ = xi · µ(En ∩ Ai ) → xi · µ(E ∩ Ai ) = g dµ.
En i=1 i=1 E

,
Definition 5.7 (Integral for positive functions). For a measurable function f : S → [0, ∞] and a
sequence of simple functions (fn )n≥1 with 0 ≤ fn ↑ f we define the integral of f over E ∈ A as
Z Z
f dµ = lim fn dµ in [0, ∞].
E n→∞ E
R
The above limit exists in [0, ∞] since the numbers an := E fn dµ form an increasing sequence
(an )n≥1 in [0, ∞]. Note that by Theorem 4.12 we can always find simple functions fn : S → [0, ∞)
such that 0 ≤ fn ↑ f . However, we need to check that the above definition does not depend on the
choice of the sequence (fRn )n≥1 . Let (gm )m≥1 be another sequence of simple functions such that
0 ≤ gm ↑ f and let bm = E gm dµ. It suffices to show that limn→∞ an = limm→∞ bm . By Lemma
5.6 for each m ≥ 1, Z Z
bm = gm dµ ≤ lim fn dµ = lim an .
E n→∞ E n→∞
From this we obtain limm→∞ bm ≤ limn→∞ an . Reversing the roles of gm and fn , one sees that
the converse holds as well.
Next we can extend the properties of the integral of Lemma 5.4.
Proposition 5.8. Let f, g : S → [0, ∞] be measurable functions. Then the following hold:
R R
(i) For all E ∈ A, E f dµ = S 1E f dµ. R R
(ii) (monotonicity I) If E ∈ A and f ≤ g on E, then ER f dµ ≤ ER g dµ.
(iii) (monotonicity II) If E, F ∈ A satisfy E ⊆ F R, then E f dµ ≤ FRf dµ. R
(iv) (linearity) For all E ∈ A and α, β ∈ [0, ∞), RE αf + βg dµ = R α E f dµR+ β E g dµ.
(v) (additivity) For all disjoint sets E1 , E2 ∈ A, E1 ∪E2 f dµ = E1 f dµ + E2 f dµ.
R
(vi) S f dµ = 0 if and only if f = 0 almost everywhere.
Proof. (i): Let (fn )n≥1 be simple functions such that 0 ≤ fn ↑ f . Then by Lemma 5.4 (i),
Z Z Z Z
f dµ = lim fn dµ = lim 1E fn dµ = 1E f dµ.
E n→∞ E n→∞ S S
(ii)–(v): See Exercise 5.2.
(vi): Assume f = 0 a.e. Choose a sequence of simple functions (fn )n≥1 such that 0 ≤ fn ↑ f .
Then fn = 0 a.e. for each n ∈ N and thus by Lemma 5.4 (vi),
Z Z
f dµ = lim fn dµ = 0.
S n→∞ S
For the converse we use contraposition. Assume one does not have f = 0 a.e. Then E = {s ∈
S : f (s) > 0} satisfies µ(E) > 0. Letting En = {s ∈ S : f (s) ≥ n1 } we find En ↑ E, so by Theorem
2.10, µ(En ) → µ(E). Therefore, there exists an n ∈ N such that µ(En ) > 0 and thus
(iii) (ii)
Z Z Z
1 1
f dµ ≥ f dµ ≥ n dµ = n µ(En ) > 0.
S En En
,
Example 5.9 (Series are integrals). Let S = N and A = P(N). Let τ : P(N) → [0, ∞] denote the
counting measure.PLet f : N → [0, ∞] be arbitrary. Then f is clearly measurable. Now define for
n
each n ≥ 1, fn = j=1 f (j)1{j} . Since each fn is a simple function and 0 ≤ fn ↑ f we find
Z Z n
X ∞
X
f dτ = lim fn dτ = lim f (j) = f (j).
N n→∞ N n→∞
j=1 j=1
MEASURE AND INTEGRATION 21

5.3. Integral for measurable functions. The final step is to define the integral for measurable
functions f : S → R by using the splitting f = f + − f − . Recall that f + = max{f, 0} and
f − = max{−f, 0}. Note that |f | = f + + f − .
5.10. A measurable function f : S → R is called integrable when both S f + dµ and
R
Definition
R −
S
f dµ are finite. In this case the integral of f over E ∈ A is defined as
Z Z Z
f dµ = f + dµ − f − dµ.
E E E

Remark 5.11.
(1) If f : S → R is integrable,R then by monotonicity E f + dµ ∈ [0, ∞) and E f − dµ are finite for
R R

every E ∈ A. Therefore, E f dµ is a number in R. R


(2) A measurable function f : S → [0, ∞] is integrable if and only if S f dµ < ∞.
(3) If f : S → R is integrable, then µ({s ∈ S : f (s) = ±∞}} = 0 (see Exercise 5.1).
To check integrability the following characterization is very useful.
Proposition 5.12. For a measurable function f : S → R the following are equivalent:
(i) f is integrable;
(ii) |f | is integrable.
Moreover, in this case for each E ∈ A,
Z Z
(triangle inequality) f dµ ≤ |f | dµ.
E E

Proof. First observe that |f | = f + + f − . Therefore, both assertions are equivalent to I + , I − < ∞,
where I ± = S f ± dµ and hence the result follows. To prove the required estimate note that by
R

the triangle inequality and linearity of the integral for positive functions:
Z Z Z
f dµ = |I + − I − | ≤ I + + I − = f + + f − dµ = |f | dµ.
E E E
,

By considering the positive and negative part separately one can check Proposition 5.8 (i),(ii),
(v) hold for all integrable functions f, g : S → R again (see Exercise 5.4). The following example
shows that Parts (iii) and (vi) do not extend to this setting.
Example 5.13. Let S = R and λ the Lebesgue measure. Let f = 1(0,1] − 1(1,2] . Then f + = 1(0,1]
and f − = 1(1,2] and
Z Z
f dλ = f + dλ = λ((0, 1]) = 1,
(0,1] (0,1]
Z Z Z
f dλ = f + dλ − f − dλ = λ((0, 1]) − λ((1, 2]) = 1 − 1 = 0.
(0,2] (0,2] (0,2]

The extension of the linearity (iv) is more difficult and proved below.
Proposition 5.14. Let f, g : S → R be integrable functions and α, β ∈ R. Then39 αf + βg is
integrable, and for all E ∈ A
Z Z Z
(linearity) αf + βg dµ = α f dµ + β g dµ.
E E E

Proof. Note that by Proposition 5.12 each of the following functions is integrable f ± , g ± , |f |, |g|.
Therefore, |α| |f | + |β| |g| is integrable by Proposition 5.8 (iv). Since |αf + βg| ≤ |α| |f | + |β| |g|,
the function αf + βg is integrable as well (see Exercise 5.7).

39Of course we only consider those functions for which αf + β + g is well-defined.


22 MEASURE AND INTEGRATION

It remains to prove the identity for each E ∈ A. To do this we first consider the case α = β = 1.
Write f + g = φ − ψ, where φ = f + + g + and ψ = (f − + g − ). Then by Proposition 5.8 (iv) φ and
ψ are integrable and Exercise 5.3 yields
Z Z Z
f + g dµ = φ dµ − ψ dµ
E
(5.6) ZE E
Z Z Z Z Z
= f + dµ + g + dµ − f − dµ − g − dµ = f dµ + g dµ.
E E E E E E
It remains to show that for all α ∈ R,
Z Z
αf dµ = α f dµ.
E E
If α ≥ 0, then αf = (αf )+ − (αf )− = αf + − αf − . Proposition 5.8 (iv) yields
Z Z Z Z Z

αf dµ = +
αf dµ − αf dµ = α +
f dµ − α f − dµ.
E E E E E
In case α < 0, then by (5.6) and the previous case, we find
Z Z Z Z Z
0= αf + (−α)f dµ = αf dµ + (−α)f dµ = αf dµ + (−α) f dµ
E E E E E
and the result follows by subtracting the second term on both sides. ,
Pn
Example 5.15. Every simple function f : S → R given by f = j=1 xj 1Aj with A1 , . . . , An ∈ A
and µ(Aj ) < ∞ for all j, is integrable and by Proposition 5.14,
Z Xn Z Xn
f dµ = xj 1Aj dµ = xj µ(E ∩ Aj ).
E j=1 E j=1

In the next example we show that for continuous functions the integral with respect to the
Lebesgue measure coincides with the Riemann integral.40
Example 5.16. Let a < b be real numbers. Let λ be the restriction of the Lebesgue measure to
B([a, b]) and let f : [a, b] → R be continuous. Note that f is measurable by Proposition 4.4.
Moreover, since there is a constant M ≥ 0 such that |f | ≤ M , we can use Exercise 5.7 to deduce
that f is (Lebesgue) integrable. Since f is continuous, it is Riemann integrable. Below we show
that L(f ) = R(f ), where we used the abbreviations
Z Z b
L(f ) := f dλ (Lebesgue integral) and R(f ) := f (x) dx (Riemann integral).
[a,b] a

To prove this identity let ε > 0. Since f is uniformly continuous we can find a δ > 0 such that
ε
for all x, y ∈ [a, b], |x − y| < δ implies |f (x) − f (y)| < b−a . Let n ∈ N be such that b−a
n < δ. Let
b−a
xj = a + j n for j = 0, . . . , n. LetPmj = min{f (x) : x ∈ [xj−1 , xj ]} and Mj P = max{f (x) : x ∈
n n
[xj−1 , xj ]}. Let g(x) = 1{a} f (a) + j=1 1(xj−1 ,xj ] mj and G(x) = 1{a} f (a) + j=1 1(xj−1 ,xj ] Mj .
Then g and G are integrable in the sense of Riemann and Lebesgue and one can check
n
X Xn
L(g) = R(g) = (xj − xj−1 )mj =: α and L(G) = R(G) = (xj − xj−1 )Mj =: β.
j=1 j=1

Since g ≤ f ≤ G, by monotonicity we find that L(f ), R(f ) ∈ [α, β]. Therefore,


n n
X X b−a ε
|R(f ) − L(f )| ≤ β − α = (xj − xj−1 )(Mj − mj ) ≤ = ε,
j=1 j=1
n b−a

Since ε > 0 is arbitrary, we find that R(f ) = L(f ).


40It is even known that every Riemann integrable function f is Lebesgue integrable and the integrals coincide
(see [1, Theorem 23.6]). The proof uses a similar method as ours. However, one should be aware that not every
improper Riemann integral is Lebesgue integrable. See Theorem 6.12 and Exercise 6.10 for more details on improper
Riemann integrals
MEASURE AND INTEGRATION 23

Example 5.17. Let S = R, A = P(R). Let x ∈ R and let δx be the Dirac measure from Example
2.5. Then for any f : R → R. Z
f dδx = f (x).
R
Indeed, for simple functions f this is obvious. For f : S → [0, ∞] this follows by approximation
from below by simple functions. For general f : S → [0, ∞] this follows by writing f = f + − f − .
Example 5.18. 41 Consider the setting
P∞ of the counting measure of Example 5.9. A function f :
N → R is integrable if and only if j=1 |f (j)| < ∞. In that case
Z X∞
f dτ = f (j).
N j=1

Finally we briefly indicate how one can extend the integral to complex functions f : S → C.
On the complex numbers C we consider its Borel σ-algebra. If f : S → C is measurable, then we
write f = u + iv with u, v : S → R and u and v are measurable (see Exercise 5.11).
Definition 5.19. A measurable function f : S → C given by f = u + iv with u, v : S → R is
called integrable if both u and v are integrable. In this case let
Z Z Z
f dµ = u dµ + i v dµ, E ∈ A.
E E E
Exercise 5.11 yields that Propositions 5.12 and 5.14 extend to the complex setting.

Exercises about general integrals


In the exercises below (S, A, µ) denotes a measure space.
Exercise 5.1. Let f : S → [0, ∞] be an integrable function. Show that f < ∞ a.e.
Exercise 5.2.
(a) Prove Proposition 5.8 (iv).
Hint: Approximate by simple functions as in the definition of the integral.
(b) Prove Proposition 5.8 (ii),
Hint: Write 1E g = 1E (g − f ) + 1E f and use Proposition 5.8 (iv).
(c) Prove Proposition 5.8 (iii).
(d) Prove Proposition 5.8 (v).
The following was used in the proof of Proposition 5.14.
Exercise 5.3. Assume g, h : S → [0, ∞] and f : S → R are all integrable functions and f = g − h.
Show that Z Z Z
f dµ = g dµ − h dµ, E ∈ A.
E E E
Hint: Note that f + + h = f − + g and use Proposition 5.8 (iv).
Exercise 5.4. Extend the following results to integrable functions f, g : S → R:
(a) Proposition 5.8 (i). First show that 1E f is integrable for E ∈ A.
(b) Proposition 5.8 (ii).
(c) Proposition 5.8 (v)
Exercise 5.5. Let f, g : S → R both be measurable functions. Assume f is integrable and f = g
a.e. Show that g is integrable and
Z Z
f dµ = g dµ, E ∈ A.
E E
Hint: First try to prove this in the case f and g take values in R. In this case apply Proposition
5.8 (vi) to h = |f − g|. For the case where f and g are R-valued one needs a careful analysis of
positive and negative parts.
41For details see Exercise 5.8
24 MEASURE AND INTEGRATION

Exercise 5.6. Let λ denote the Lebesgue measure on (R, B(R)). Let f : R → R be an integrable
function. Show that for all −∞ < a < b < ∞
Z Z
f dλ = f dλ.
[a,b] (a,b)

Is this true if we replace λ by an arbitrary measure on B(R)?


Exercise∗ 5.7 (Domination). Let f, g : S → R be measurable functions.
(a) Assume |f | ≤ g and g is integrable. Prove
P∞ that f is integrable.
(b) Show that f is integrable if and only if n=−∞ 2n µ({s ∈ S : 2n < |f (s)| ≤ 2n+1 }) < ∞.
Hint: Use (a) for a suitable function g : S → [0, ∞].
In the next exercise you are asked to give the details of Example 5.18
Exercise∗ 5.8. Consider the setting of the counting measure of Example 5.9.
P∞
(a) Show that
P∞ a function f : N → R is integrable if and only if n=1 |f (n)| < ∞.
(b) Assume n=1 |f (n)| < ∞. Show that
Z X∞
f dτ = f (j).
N j=1

+
Hint: Write f = f − f and approximate f + and f − in a similar way as in Example 5.9.
Exercise∗ 5.9 (Chebyshev’s inequality). Let f : S → [0, ∞] be a measurable function. Prove that
for every t > 0,
1
Z
µ({s ∈ S : f (s) ≥ t}) ≤ f dµ.
t S
R
Hint: Let At = {s ∈ S : f (s) ≥ t} and write µ(At ) = S 1At dµ.
Exercise∗ 5.10. Let λ be the Lebesgue measure on (Rd , B(Rd )). Let f : Rd → R be a measurable
function. For h ∈ Rd define the translation fh : Rd → R by fh (x) = f (x − h).
(a) Show that fh is measurable.
(b) Assume that f is integrable. Show that fh is integrable and
Z Z
fh dλ = f dλ.
Rd Rd
∗∗
Exercise 5.11 (Complex functions).
(a) Let f : S → C be a measurable function and write f = u + iv with u, v : S → R. Show that u
and v are measurable.
(b) Prove Proposition 5.14 for integrable functions f, g : S → C and α, β ∈ C.
(c) Prove Proposition 5.12 for integrable functions f : S → C.
(d) Which parts of Proposition 5.8 remain true for integrable functions f, g : S → C?
MEASURE AND INTEGRATION 25

6. Convergence theorems and applications


In this section (S, A, µ) is a measure space.
One of the problems with the Riemann integral is that the cases where limit and integral can
be interchanged are rather limited. In modern mathematics it is crucial to have better “tools”
for this. We use the integration theory developed so far in order to obtain these tools. In Section
6.1 we prove three famous convergence results. In Section 6.2 we give several consequences and
applications.
We start with several simple examples which illustrate some difficulties.
Example 6.1. For instance if (xj )j≥1 is an enumerate of Q, and An = {x1 , . . . , xn }, then 1An → 1Q
pointwise. Each 1An is Riemann integrable, but 1Q is not.
1
Example 6.2. Let fn : R → R be defined by one of the following n1(0, n1 ] , n 1(n,2n) or 1(n,∞) for
R
n ≥ 1. Then fn → 0, but in each case R fn dλ 6→ 0.
6.1. The three main convergence results. In this section we prove the three most famous
convergence results of integration theory:
• Monotone Convergence Theorem (MCT);42
• Fatou’s lemma;43
• Dominated Convergence Theorem (DCT).44
Theorem 6.3 (Monotone Convergence Theorem (MCT)). Let (fn )n≥1 be measurable functions
such that 0 ≤ fn ↑ f . Then f is measurable and
Z Z
lim fn dµ = f dµ.
n→∞ S S

Proof. By Theorem 4.9, f : S → [0, ∞] is measurable. By monotonicity of the integral


Z Z
α := lim fn dµ ≤ f dµ.
n→∞ S S
R
It remains to prove S f dµ ≤ α. R For this choose a sequence of simple functions (gm )m≥1 such
that 0 ≤ gm ↑ f . We claim that S gm dµ ≤ α for each m ≥R 1. The required estimate follows from
the claim by letting m → ∞ and using the definition of S f dµ. To prove the claim we repeat
part of the argument of Lemma 5.6. Fix m ∈ N and write g = gm .
Let ε ∈ (0, 1) and for each n ≥ 1, set En = {s ∈ S : (1 − ε)g(s) ≤ fn (s)}. Then using the
indicated parts of Proposition 5.8 in the estimates below, we find
(ii) (iii) (ii)
Z Z Z Z
(iv)
(1 − ε) g dµ = (1 − ε)g dµ ≤ fn dµ ≤ fn dµ ≤ α.
En En En S
R R
Finally, En g dµ → S g dµ follows from (5.5) with E = S in Lemma 5.6. Therefore, we can
R
conclude (1 − ε) S g dµ ≤ α and the claim follows since ε ∈ (0, 1) was arbitrary. ,
Lemma 6.4 (Fatou’s Lemma). Let (fn )n≥1 be a sequence of measurable functions with values in
[0, ∞]. Then Z Z
lim inf fn dµ ≤ lim inf fn dµ.
S n→∞ n→∞ S

Proof. Note that f = lim inf n→∞ fn is a measurable function by Theorem 4.9. Fix n ∈ N and let
gn = inf fk . For all m ≥ n, we have gn ≤ fm and by the monotonicity of the integral this gives
R k≥n R

S
gn dµ ≤ S fm dµ. Therefore, taking the infimum over all m ≥ n, we find
Z Z
(6.1) gn dµ ≤ inf fm dµ.
S m≥n S

42This result is due to Beppo Levi 1875–1961 who was an Italian mathematician.
43This result is named after the French mathematician Pierre Fatou 1878–1929.
44This is due to Lebesgue (see footnote 18)
26 MEASURE AND INTEGRATION

Note that 0 ≤ gn ↑ f and thus


(6.1)
Z Z Z Z Z
(MCT)
f dµ = lim gn dµ = lim gn dµ ≤ lim inf fm dµ = lim inf fn dµ.
S S n→∞ n→∞ S n→∞ m≥n S n→∞ S
,
Theorem 6.5 (Dominated Convergence Theorem (DCT)). Let (fn )n≥1 be a sequence of integrable
functions such that lim fn = f pointwise. If there exists an integrable function g : S → [0, ∞]
n→∞
such that |fn | ≤ g for all n ≥ 1, then f is integrable and
Z Z
(6.2) lim fn dµ = f dµ.
n→∞ S S

Proof. The function f is measurable


R R 4.9. Moreover, since also |f | ≤ g, we see that
by Theorem
f is integrable. Let xn := S fn dµ and x := S f dµ. It suffices to show that lim sup xn ≤ x ≤
n→∞
lim inf xn . It follows that
n→∞
Z Z Z
g dµ ± x = g ± f dµ = lim (g ± fn ) dµ (linearity)
S S S n→∞
Z
≤ lim inf g ± fn dµ (Fatou’s lemma with g ± fn ≥ 0)
n→∞ S
Z 
= lim inf g dµ ± xn (linearity).
n→∞ S
Z
= g dµ + lim inf (±xn ).
S n→∞
R
Since S g dµ < ∞, it follows that ±x ≤ lim inf (±xn ). This implies the estimates
n→∞
x ≤ lim inf xn and lim sup xn ≤ x,
n→∞ n→∞

where for the second part we used lim inf (−xn ) = − lim sup xn . ,
n→∞ n→∞

For functions f, f1 , f2 : S → R we say that fn → f a.e. if there exists a set A ∈ A such that
µ(A) = 0 and for all s ∈ S \ A, fn (s) → f (s). For the details of the following remark we refer to
Exercise 6.5.
Remark 6.6. Let f, f1 , f2 , . . . : S → R be measurable functions.
(1) Theorem 6.3 also holds under the weaker assumption that 0 ≤ fn ↑ f a.e.
(2) Theorem 6.5 also holds under the weaker assumptions that fn → f a.e. and |fn | ≤ g a.e.
Example 6.7. Let f :R R → R be integrable with respect to the Lebesgue measure λ. Let F : R → R
be given by F (x) = (−∞,x] f dλ. Then F is continuous on R. Indeed, if xn < x and xn → x, then
Z Z
(DCT)
F (xn ) = 1(−∞,xn ] f dλ −→ 1(−∞,x) f dλ = F (x),
R R
where the last step follows from Exercise 5.5 and the fact that 1(−∞,x) f = 1(−∞,x] f almost
everywhere. Similarly, one checks that F (xn ) → F (x) if xn ≥ x and xn → x.
6.2. Consequences and applications. We continue with several consequences and applications.
We start with a result on integration of series of positive functions.
Corollary 6.8 (Series and integrals). Let f1 , f2 , . . . : S → [0, ∞] be measurable functions. Then
Z X ∞ X ∞ Z
(6.3) fn dµ = fn dµ.
S n=1 n=1 S

Proof. This follows from the MCT (See Exercise 6.4). ,


From a measure µ one can build many other measures in the following way:
MEASURE AND INTEGRATION 27

Theorem 6.9 (Density). Let f : S → [0, ∞] be a measurable function. Define ν : A → [0, ∞] by


Z
ν(A) = f dµ.
A
45
Then ν is a measure. Moreover, g : S → R is integrable with respect to ν if and only if f g is
integrable with respect to µ. In this case
Z Z
(6.4) g dν = f g dµ
S S

S
Proof. Step 1: Clearly, ν(∅) = 0. For (An )n≥1 a disjoint sequence in A, and A := An ,
n=1
Z ∞
Z X ∞ Z ∞
(6.3) X X
ν(A) = 1A f dµ = 1An f dµ = 1An f dµ = ν(An ).
S S n=1 n=1 S n=1

Step 2: First we
R show that (6.4) holds
R for all measurable g : S → [0, ∞]. For g = 1A this is
immediate from S 1A dν = ν(A) = S 1A f dµ. For simple functions g : S → [0, ∞) this follows
by linearity. For a measurable function g : S → [0, ∞], by Theorem 4.12 we can find a sequence
of simple functions (gn )n≥1 such that 0 ≤ gn ↑ g. By the previous case, we obtain
Z Z Z Z
(MCT) (MCT)
g dν = lim gn dν = lim f gn dµ = f g dµ.
S n→∞ S n→∞ S S

Step 3: To prove the “if and only if” assertion and (6.4), let g : S → R be a measurable
function. Since step 2 yields that
Z Z
±
g dν = f g ± dµ,
S S

both the equivalence and (6.4) follows by writing g = g + − g − . ,


Example 6.10. Let f : R → [0, ∞] be measurable. Define ν : B(Rd ) → [0, ∞) by
Z
ν(A) = f dλ, A ∈ B(R).
A
x 2
Then ν is a measure. For example one could take f (x) = √1 e− 2 . Then ν is the standard

Gaussian measure on R.
In Example 5.16 we have discussed a connection between the Riemann and Lebesgue integral.
A similar connection holds for improper Riemann integrals.
R∞
Definition 6.11. Let f : R → R be a continuous function. We say that −∞ f (x) dx exists as an
Rt R0
improper Riemann integral if the limits L1 = limt→∞ 0 f (x) dx and L2 = limt→−∞ t f (x) dx
R∞
exist in R. In that case define −∞ f (x) dx = L1 + L2 .
We show that there is a connection with the Lebesgue integral with respect λ.
Theorem 6.12. Let f : R → R be a continuous function. Then the following are equivalent:
(i) fR is integrable;

(ii) −∞ |f (x)| dx exists as an improper Riemann integral.
R∞
Moreover, in this case −∞ f dx exists as am improper Riemann integral and
Z Z ∞
(6.5) f dλ = f dx.
R −∞
R∞
It may happen that −∞
f dx exists as an improper Riemann integral without f being integrable
(see Exercise 6.10).

45The function f is usually called the density of ν.


28 MEASURE AND INTEGRATION

Proof. (i)⇒(ii): First we make a general observation. Let g : R → R be a continuous and integrable
Rt R R
function. In Example 5.16 we have seen that 0 g(x) dx = R 1[0,t] g dλ. Setting L1 = R 1[0,∞] g dλ,
we find
Z t Z Z Z
L1 − g(x) dx = L1 − 1[0,t] g dλ = 1[t,∞) g dλ ≤ 1[t,∞) |g| dλ.
0 R R R
R R∞
Now the DCT yields that R 1[t,∞) |g| dλ → 0 as t → ∞, and we may conclude that 0 g(x) dx =
L1 exists. The part on (−∞, 0] goes similarly and we find (6.5) with f replaced by g.
Now (i) ⇒ (ii) follows by letting g = |f |. Moreover, (6.5) for f follows by taking g = f .
(ii)⇒ (i): By Example 5.16, monotonicity of improper Riemann integrals,
Z Z n Z ∞
1[−n,n] |f | dλ = |f (x)| dx ≤ |f (x)| dx =: M < ∞.
R −n −∞
R R
Therefore, by the MCT, R
|f | dλ = limn→∞
1
R [−n,n]
|f | dλ ≤ M , and hence f is integrable. ,
n
Example 6.13. Let fn : [0, ∞) → R be defined by fn (x) = 1 + nx e−2x . Below we show that
Rn n
limn→∞ 0 fn (x) dx = 1. Recall the standard limit 0 ≤ 1 + nx ↑ ex for every x ∈ [0, ∞) and
−x
thus 0 ≤ 1[0,n] fn ↑ f , where f (x) = e . Therefore, by Example 5.16
Z n Z Z Z ∞
(MCT) (6.5)
fn (x) dx = 1[0,n] fn dλ −→ f dλ = f (x) dx = lim (1 − e−t ) = 1.
0 [0,∞) [0,∞) 0 t→∞

We end this section with a standard application of the DCT to calculus.


Theorem 6.14 (Differentiating under the integral sign). Suppose f : R × S → R is such that the
following hold:
(i) f is continuous and differentiable with respect to its first coordinate and for each y0 ∈ (a, b)
there exists a δ > 0 and integrable function g : S → [0, ∞] such that
| ∂f
∂y (y, s)| ≤ g(s), y ∈ (y0 − δ, y0 + δ), s ∈ S.

(ii) s 7→ f (y, s) is integrable with respect to µ.


Then for all y ∈ R, Z Z
d ∂f
f (y, s) dµ(s) = (y, s) dµ(s).
dy S S ∂y

Proof. Fix y0 ∈ R and let δ and g be as in (i). Let hn ∈ (0, δ) for n ≥ 1 be such that hn → 0. Let
φn : (y0 − δ, y0 + δ) × S → R and F : (y0 − δ, y0 + δ) → R be given by
f (y + hn , s) − f (y, s)
Z
φn (y, s) = and F (y) = f (y, s) dµ(s).
hn S
∂f
Then φn (y, s) → ∂y (y, s) for each y ∈ R and s ∈ S. Therefore, Theorem 4.12 yields that
s 7→ φn (y, s) is measurable. From the mean value theorem46 we obtain φn (y0 , s) = ∂f ∂y (yn , s) for
some yn ∈ (y0 , y0 + δ), and hence |φn (y0 , s)| ≤ g(s) for all s ∈ S. It follows that
F (y0 + hn ) − F (y0 )
Z Z
(DCT) ∂f
F 0 (y0 ) = lim = lim φn (y0 , s) dµ(s) = (y0 , s) dµ(s).
n→∞ hn n→∞ S S ∂y
,

Exercises
Z
Exercise 6.1. Use convergence theorems to find lim fn dλ in each of the following cases:
n→∞ R
n
(a) fn (x) = 1[4,32] (x) 1 + log(4x)

n (use MCT; answer is 2016).

46See [11, Theorem 6.2.3]: if g : [a, b] → R is continuous on [a, b] and differentiable on (a, b), then there exists a
g(b)−g(a)
point c ∈ (a, b) such that g 0 (c) = b−a
.
MEASURE AND INTEGRATION 29

sin(nx)
(b) fn (x) = 1(1,∞) (x) (use DCT; answer is 0).
nx2
nxn sin(nx) − n √
(c) fn (x) = 1[0,1] (x) √ (use DCT; answer is − 12 2).
x + 2n 2

Exercise 6.2. Assume f, f1 , f2 , . . . : S → R are measurable functions such that


R
(i) There is a constant M ≥ 0 such that for all n ∈ N, S |fn | dµ ≤ M
(ii) fn → f pointwise.
R
Use Fatou’s lemma to show that S |f | dµ ≤ M .

X1
Exercise 6.3. Find lim .
n→∞
m=1
+ n2m2
Hint: Use the counting measure and the DCT.
Pn
Exercise 6.4. Deduce Corollary 6.8 from the MCT applied to sn = j=1 fj for n ≥ 1.

Exercise 6.5. Give the details of Remark 6.6 (use Exercise 5.5).
Exercise 6.6. Let (fn )n≥1 be a sequence of measurable functions with values in R or C and
fn → f pointwise. Assume there exists an integrable function g : S → [0, ∞) such that |fn | ≤ g.
R
(a) Show that S |fn − f | dµ → 0 as n → ∞.
Hint: Apply the DCT inRa suitable way.
R
(b) In the real case we know S fn dµ → S f dµ. Derive this in the complex case as well.
Hint: Use (a) and Exercise 5.11 (c).
Exercise∗ 6.7. Let f : S → R be a measurable function. Define ν : B(R) → [0, ∞] by ν(B) =
µ(f −1 (B)). Prove the following:
(a) ν is a measure.
(b) For every measurable g : R → [0, ∞] one has
Z Z
g(t) dν(t) = g(f (s)) dµ(s).
R S
Hint: Argue as in Theorem 6.9 step 2.
(c) A function g : R → R is integrable with respect to ν if and only if g ◦ f is integrable with
respect to µ. Moreover, in this case
Z Z
g(t) dν(t) = g(f (s)) dµ(s).
R S
Hint: Argue as in Theorem 6.9 step 3.
Exercise∗ 6.8. Assume f : R → R is integrable with respect to λ.
(a) Show that for each y ∈ R the function x 7→ sin(xy)f (x) is integrable with respect to λ.
Define g : R → R by Z
g(y) = sin(xy)f (x) dλ(x)
R
(b) Show that g is continuous.
Hint: Use the sequential characterization of continuity and one of the convergence theorems.
Exercise∗∗ 6.9. Use induction and Theorem 6.14 to show that for each integer n ≥ 0,
Z ∞
n!
xn e−yx dx = n+1 , y > 0.
0 y
R ∞ n −x
In particular, setting y = 1 one obtains: 0 x e dx = n!.
Exercise∗∗ 6.10. Let f : [0, ∞) → R be given by f (x) = sin(x)x if x 6= 0 and f (0) = 1.
(a) Show that f is not integrable with respect to λ.
Hint: Use Rthe estimate | sin(x)| ≥ 12 on [πn + 31 π, πn + 23 π] for all integers n ≥ 0.

(b) Show that 0 f (x) dx exists as an improper Riemann integral.47
47Using some smart tricks one could actually show that the integral equals π .
2
30 MEASURE AND INTEGRATION

7. Lp -spaces
In this section (S, A, µ) is measure space. In this section we want to allow the scalar field to be
complex as well and we use the notation K for this. So K = R or K = C.
Definition 7.1. For p ∈ [1, ∞) let
n Z o
Lp (S) = f : S → K : f is measurable and |f |p dµ < ∞ .
S

For f ∈ Lp (S) let


Z  p1
kf kp := |f |p dµ .
S
1
Note that L (S) coincides with the set of integrable functions f : S → K.
If kf − gkp = 0, then from Proposition 5.8 (vi) we can conclude f = g a.e. However, we would
like to have f = g. Therefore, we will identify f and g whenever f = g a.e.48 So one has to be
rather careful if one talks about f (s) for a certain fixed s ∈RS. In integration
R theory this usually
does not lead to any problems since f = g a.e. implies that E f dµ = E g dµ for any E ∈ A.
Example 7.2. Let S = R with the Lebesgue measure λ. Then 1{0} = 1Q = 0 and 1[0,1]\Q = 1[0,1]
in Lp (R).
7.1. Minkowski and Hölder’s inequalities. Note that for scalars α ∈ K, kαf kp = |α|kf kp .
Therefore, the next result can be used to show that Lp (S) is a normed vector space.
49
Proposition 7.3 (Minkowski’s inequality ). For all f, g ∈ Lp (S) we have f + g ∈ Lp (S) and
kf + gkp ≤ kf kp + kgkp .
Proof. Observe that for all a, b ∈ [0, ∞) one has (See Exercise 7.1)
(7.1) (a + b)p = inf θ1−p ap + (1 − θ)1−p bp .
θ∈(0,1)

It follows from (7.1) that for all θ ∈ (0, 1) and all s ∈ S,


|f (s) + g(s)|p ≤ (|f (s)| + |g(s)|)p ≤ θ1−p |f (s)|p + (1 − θ)1−p |g(s)|p .
Therefore, by monotonicity and linearity of the integral, we obtain that for all θ ∈ (0, 1),
Z Z Z
|f + g|p dµ ≤ θ1−p |f |p dµ + (1 − θ)1−p |g|p dµ.
S S S

Stated differently, this says that for all θ ∈ (0, 1),


kf + gkpp ≤ θ1−p kf kpp + (1 − θ)1−p kgkpp .
Now the result follows by taking the infimum over all θ ∈ (0, 1) and applying (7.1). ,

Another famous inequality which can be proved with the same method is the following.
1 1
Proposition 7.4 (Hölder’s inequality50). Let p, q ∈ (1, ∞) satisfy51 p + q = 1. If f ∈ Lp (S) and
g ∈ Lq (S), then f g ∈ L1 (S) and
kf gk1 ≤ kf kp kgkq .
Proof. See Exercise 7.2. ,

48More precisely, one can build an equivalent relation f ∼ g if f = g almost everywhere and then consider a
quotient space. We will use the above imprecise but more intuitive definition.
49Hermann Minkowski (1864-1909) was a German mathematician who worked in geometry. He also was Albert
Einstein’s teacher and provided the 4-dimensional mathematical framework for part of Einstein’s relativity theory.
50Otto Hölder (1859-1937) is most famous for this result and for the notion of Hölder continuity of a function.
51These exponents are called conjugate exponents. If p = 2, then q = 2 and in this case the inequality is known
as the Cauchy-Schwarz inequality.
MEASURE AND INTEGRATION 31

7.2. Completeness of Lp .
Theorem 7.5 (Riesz–Fischer52). Let p ∈ [1, ∞). Then (Lp (S), k · kp ) is a Banach space.53
Proof. Let (fk )∞ p
k=1 be a Cauchy sequence with respect to the norm k · kp of L (S). By a standard

argument it suffices to show that (fk )k=1 has a subsequence which is convergent in Lp (S).
Recursively, we can find a subsequence (fkn )n≥1 such that
1
(7.2) kfkn+1 − fkn kp ≤ n , n = 1, 2, . . .
2
For notational convenience let φn = fkn for n ∈ N. Also let φ0 = 0. We will show that there exists
an f ∈ Lp (S) such that kφn − f kp → 0.
Define g, g1 , g2 , . . . : S → [0, ∞] by

X m−1
X
g := |φn+1 − φn | and gm := |φn+1 − φn |, ,
n=0 n=0
By Minkowski’s inequality we obtain for each m ≥ 1,
m−1
X ∞
X (7.2) X
kgm kp ≤ kφn+1 − φn kp = kφn+1 − φn kp ≤ kφ1 kp + 2−n ≤ kφ1 kp + 1.
n=0 n=0 n≥1

Since 0 ≤ gm ↑ g, the MCT yields,


Z Z
p
|g| dµ = lim |gm |p dµ = lim kgm kpp ≤ (kφ1 kp + 1)p .
S m→∞ S m→∞
c
Letting A = {g < ∞}, Exercise 5.1 yields µ(A ) = 0. Therefore, we can define f : S → R by

X
f= 1A (φn+1 − φn ),
n=0
where the series is absolutely convergent and f is measurable by Theorem 4.9. By a telescoping
argument it follows that pointwise on S
m−1
X
f = lim 1A (φn+1 − φn ) = lim 1A φm .
m→∞ m→∞
n=0
Clearly,
m−1
X m−1
X
|φm | = (φn+1 − φn ) ≤ |φn+1 − φn | ≤ |g|
n=0 n=0
and by letting m → ∞ we see that also |f | ≤ |g| and in particular f ∈ Lp (S). It follows that
|f − φm |p ≤ (|f | + |φm |)p ≤ (2|g|)p = 2p |g|p .
Since |f − φm |p → 0 a.e. and 2p |g|p is integrable, it follows from the DCT (see the a.e. version of
Remark 6.6) that Z
lim kf − φm kpp = lim |f − φm |p dµ = 0.
m→∞ m→∞ S
,
In the sequel we will say that fn → f in Lp (S) if kfn − f kp → 0.
From the proof of Theorem 7.5 we deduce the following result.
Corollary 7.6. Let p ∈ [1, ∞). Suppose f, f1 , f2 , . . . ∈ Lp (S). If fk → f in Lp (S), then there
exists a subsequence (fkn )n≥1 such that fkn → f a.e.
52Frigyes Riesz (1880–1956) was a Hungarian mathematician who worked in functional analysis. Ernst Fischer
(1875–1954) was a Austrian mathematician who worked in analysis.
53Recall that a Banach space is a complete normed vector space. Stefan Banach (1892–1945) was a Polish
mathematician who is one of the world’s most important 20th-century mathematicians. He is most famous for his
book on functional analysis [2].
32 MEASURE AND INTEGRATION

In general one does not have fn → f a.e. (see Exercise 7.7).


Proof. Indeed, since (fn )n≥1 is convergent it is a Cauchy sequence in Lp (S). In the proof of
Theorem 7.5 there exists a subsequence (fkn )n≥1 and a function f˜ such that fkn → f˜ in Lp (S)
and fkn → f˜ a.e. Now Minkowski’s inequality yields:
kf − f˜kp ≤ kf − fkn kp + kfkn − f˜kp → 0.
Thus f = f˜ a.e. and thus fkn → f a.e. ,
For f, g ∈ L2 (S) we define Z
hf, gi = f (s)g(s) dµ(s),
S
where g(s) stands for the complex conjugate of the number g(s) ∈ K. Then h·, ·i is an inner
product and hf, f i = kf k22 .54
The next theorem is immediate from Theorem 7.5.
Theorem 7.7 (Riesz–Fischer). L2 (S) is a Hilbert space.55
Example 7.8. Let p ∈ [1, ∞). If µ = τ is the counting measure on N, then56 `p := Lp (N) coincides
with a space of sequences and
X  p1
k(an )n≥1 kp = |an |p .
n≥1
p q
If p ≤ q < ∞, then ` ⊆ ` (see Exercise 7.8).
Example 7.9. Assume µ(S) < ∞. If 1 ≤ p ≤ q < ∞, then Lq (S) ⊆ Lp (S) (see Exercise 7.3).
We end this section with a simple density result. Clearly, the definition of a simple function
extends to the complex setting. The following result shows that any Lp -function can be approxi-
mated by simple functions.
Proposition 7.10 (Density of simple functions). Let p ∈ [1, ∞). The set of simple functions is
dense in Lp (S).
Proof. Write f = u + iv, where u and v are both real-valued. Write u = g − h, where g = u+
and h = u− . By Theorem 4.12 we can find simple functions gn and hn such that 0 ≤ gn ↑ g
and 0 ≤ hn ↑ h. Then un := gn − hn is a simple function, un → u pointwise, and |un | ≤
gn + hn ≤ g + h ≤ |u|. Similarly, one constructs simple functions vn such that vn → v pointwise,
|vn | ≤ |v|. We can conclude that fn := un + ivn is a simple function, fn → f pointwise and
1 1
|fn | ≤ (|un |2 + |vn |2 ) 2 ≤ (|u|2 + |v|2 ) 2 ≤ |f |. Now since |fn − f |p ≤ (|fn | + |f |)p ≤ 2p |f |p and the
latter is integrable, it follows from the DCT that kfn − f kp → 0 as n → ∞. ,
7.3. Lp -spaces on intervals. In the remaining part of this section we discuss Lp (I), where I is
an interval and µ = λ is the Lebesgue measure on I. These results will not be used in Section 8
or in the exercises.
For an interval I ⊆ R let the set of step functions Step(I) be defined by
Step(I) = span{1J : J ⊆ I is an interval with finite length}.
One can check that each φ ∈ Step(I) is a simple function. The converse does not hold. For
instance 1Q∩(0,1) is not a step function. Step function have a lot of structure and often questions
on Lp can be reduced to this special class by the following density result.

54Note that the conjugation is need otherwise the square of a complex number can be negative. Physicists often
put the conjugation on f (s) instead of g(s). If K = R, then the conjugation does not play a role.
55Recall that a Hilbert space is a Banach space where kxk = hx, xi 21 . David Hilbert (1862–1943) is sometimes
said to be the last universal mathematician (which means he knew “all” mathematics of his time). He was one of
the most influential mathematicians of the 19th and early 20th centuries.
56Sometimes we wish to use the counting measure on Z instead of N. In this case we write `p (Z) := Lp (Z).
MEASURE AND INTEGRATION 33

1 2 3 4 5

Figure 7.1. Example step function 14 1 1 + 32 1 5 − 13 1 5 + 1[4,5]


(− 2 ,1] (1, 2 ] ( 2 ,4)

Theorem 7.11. Let λ be the Lebesgue measure on I = (a, b) with −∞ ≤ a < b ≤ ∞ and let
p ∈ [1, ∞). Then Step(I) is dense in Lp (I).
Proof. Let f ∈ Lp (I) and let ε > 0. We will construct a step function φ such that kf − φkp < 3ε.
Step 1: Reduction to bounded I. Let fn = 1(−n,n) f for n ∈ N. Then fn → f pointwise.
Moreover, |fn − f |p ≤ |f |p . Therefore, the DCT yields kfn − f kp → 0. Therefore, for n ∈ N large
enough kf − gkp < ε, where g = fn ∈ Lp (I).
Step 2: By Step 1 we can assume I is bounded, and moreover we can assume I = (a, b] with
−∞ < a < b < ∞. By Proposition Pn 7.10 there exists a simple function h : I → K such that
kg − hkp < ε. We can write h = j=1 1Aj xj with (Aj )nj=1 in B(I) disjoint.
Step 3: By Exercise 7.5 there exist Fj ∈ F(a,b] such that λ(Aj 4Fj ) < n(|xjε|+1) . Now let
Pn
φ = j=1 1Fj xj . Observe that |1Aj − 1Fj | = 1Aj 4Fj . Therefore, Minkowski’s inequality yields
that
n n n
X X X ε|xj |
kh − φkp ≤ |xj |k1Aj 4Fj kp = |xj |µ(Aj 4Fj ) ≤ < ε.
j=1 j=1 j=1
n(|xj | + 1)
Conclusion: Clearly, φ is a step function. Moreover, by Minkowski’s inequality we find
kf − φkp ≤ kf − gkp + kg − hkp + kh − φkp ≤ 3ε.
,
Corollary 7.12. Let λ be the Lebesgue measure on I = [a, b] with −∞ < a < b < ∞ and let
p ∈ [1, ∞). Then C([a, b]) is dense in Lp (I).
Proof. Let f ∈ Lp (I) and let ε > 0. By Theorem 7.11 there exists a step function φ such that
kf − φkp < ε. It remains find ψ ∈ C([a, b]) such that kφ − ψkp < ε. For this it suffices to
approximate an arbitrary 1J . Using Figure 7.2 and the DCT, the reader can easily convince him
or herself that this can indeed be done. ,

2 2 2

1 1 1

1 2 3 4 1 2 3 4 1 2 3 4

Figure 7.2. Approximation of 1(1, 52 ] in Lp by continuous functions

Exercises
34 MEASURE AND INTEGRATION

Exercise 7.1. Prove the identity (7.1).


Hint: First consider the case where a, b > 0 and minimize the right-hand side of (7.1).
Exercise∗ 7.2. Let p, q ∈ (1, ∞) be such that 1
p + 1
q = 1.
(a) Show that for all a, b ≥ 0,
 tp ap bq 
ab = inf + .
p t>0 qtq
(b) Use the above identity to prove Proposition 7.4.
Hint: Argue as in Proposition 7.3, but use the identity from (a) instead.
Exercise 7.3. Assume µ(S) < ∞ and 1 ≤ q ≤ p < ∞. Show that Lp (S) ⊆ Lq (S) and for all
f ∈ Lp (S),
1 1
kf kq ≤ µ(S) q − p kf kp .
Hint: Apply Hölder’s inequality to |f |q · 1.
Exercise 7.4. Let p ∈ [1, ∞) and λ be the Lebesgue measure on R. Determine for which α ∈ R
one has f ∈ Lp (R) in each of the following cases:
(a) f (x) = 1(0,1) (x)xα .
(b) f (x) = 1(1,∞) (x)xα .
Explain why Lp (R) * Lq (R) for all p, q ∈ [1, ∞) with p 6= q.
For sets A, B ⊆ R, A4B = (A \ B) ∪ (B \ A) denotes the symmetric difference of A and B.
Exercise∗ 7.5. Let λ be the Lebesgue measure restricted to ((a, b], B((a, b])). Let F(a,b] be the
finite unions of half-open intervals in (a, b].57 Let
A = {A ∈ B((a, b]) : ∀ε > 0 ∃F ∈ F(a,b] such that λ(A4F ) < ε}.
(a) For A, B ∈ B((a, b]) show that A4B = Ac 4B c . S S
(b) Let I be an index set and Ai , Bi ⊆ B((a, b]) for all i ∈ I. Let A = Ai and B = Bi .
S i∈I i∈I
Show that A4B ⊆ Ai 4Bi .
i∈I
(c) Deduce from (a) that A ∈ A implies Ac ∈ A.

S
(d) Assume (An )n≥1 is a disjoint sequence in A. Deduce from (b) that An ∈ A.
n=1

S
Hint: Use that limn→∞ λ( Ak ) = 0 in order to reduce to finitely many sets.
k=n
(e) Show that A = B((a, b]).
Hint: Use Exercises 3.1 and 3.8.
Exercise∗ 7.6. Let −∞ < a < b < ∞ and f ∈ L1 ((a, b]) and assume (a,t] f dλ = 0 for all
R

t ∈ (a, b]. We will derive that f = 0 in L1 (R). By considering real and imaginary part separately,
one can reduce to the case where f is real valued.
(a) Show that for all A ∈ F 1 with A ⊆ (a, b], A f dλ = 0.
R

(b) Let A ∈ B(R) be such that A ⊆ (a, b]. Construct sets A1 , A2 , . . . ∈ F 1 such that Ak ⊆ (−a, b]
and 1An → 1A a.e.
Hint: Use k1A − 1B k1 = λ(A4B) for all A, B ∈ B(R) and Exercise 7.5. Now apply Corollary
7.6 and the DCT. R
(c) Show that for all A ∈ B(R) with A ⊆ (a, b], A f dλ = 0.
Hint: Use (a) and (b).
(d) Derive that f = 0 in L1 (R).
Hint: Consider A = {x ∈ R : f (x) ≥ 0} and A = {x ∈ R : f (x) ≤ 0}.

57By Exercise 1.7 of the lecture notes we can take them disjoint
MEASURE AND INTEGRATION 35

Exercise∗∗ 7.7. Let λ be the Lebesgue measure on R. Observe that each n ∈ N can be uniquely
written as n = 2k + j with k ∈ N0 and j ∈ {0, . . . , 2k − 1}. Now for such n define fn =
1(j2−k ,(j+1)2−k ] . Show that fn → 0 in L1 (R), but for all x ∈ (0, 1], there exists infinitely many
n ∈ N such that fn (x) = 1.
Hint: First make a picture for k = 1 and j = 0, 1, and k = 2 and j = 0, 1, 2, 3.
Exercise∗∗ 7.8. Let 1 ≤ p ≤ q < ∞).
(a) Prove `p ⊆ `q and that for all (an )n≥1 ∈ `p one has k(an )n≥1 kq ≤ k(an )n≥1 kp .
Hint: By homogeneity one can assume k(an )n≥1 k`p = 1, and therefore |an | ≤ 1 for all n ∈ N.
(b) Let an = nα for n ∈ N. For which α ∈ R does one have (an )n≥1 ∈ `p ?
There is a natural limiting space of Lp (S) for p → ∞:
Exercise∗∗ 7.9. A measurable function f : S → K is said to be in L∞ (S) if there exists an M ≥ 0
such that µ({|f | > M }) = 0.58 Define
kf k∞ = inf{M ≥ 0 : µ({|f | > M }) = 0}.
As usual we identify functions f and g in L∞ (S) if f = g a.e.
(a) Show that (L∞ (S), k · k∞ ) is a Banach space.
(b) Show that for all f ∈ L∞ (S) and g ∈ L1 (S), f g ∈ L1 (S) and
kf gk1 ≤ kf k∞ kgk1 .
1
(c) Assume µ(S) < ∞ and p ∈ [1, ∞). Show that L∞ (S) ⊆ Lp (S) and kf kp ≤ µ(S) p kf k∞ .
(d) Assume S = N with the counting measure and p ∈ [1, ∞). Let `∞ := L∞ (N). Show that
`p ⊆ `∞ and k(an )n≥1 k∞ ≤ k(an )n≥1 kp .
(e) Assume I is a finite interval and µ = λ is the Lebesgue measure. Show that the simple
functions are dense in L∞ (I), but the step functions are not.

58In order words |f | ≤ M a.e.


36 MEASURE AND INTEGRATION

8. Applications to Fourier series


Fourier59 analysis plays a role in a large part of mathematics. In particular, it is one of the
central mathematical tools in Physics and Electrical Engineering. A Fourier series is of the form60
X a0 X
cn einx , or equivalently + an cos(nx) + bn sin(nx),
2
n∈Z n≥1

where x ∈ [0, 2π]. Of course one can also use x ∈ R here. Clearly, the above functions will be
periodic whenever they are well-defined.
One of the reasons that Fourier series naturally arise in mathematics is that each of the functions
d2
e±inx , cos(nx), sin(nx) as an eigenfunctions of dx 2 with eigenvalue −n .
2
Indeed, for instance
00 2
cos(nx) = −n cos(nx).
We have seen that Taylor series can be used to represent functions which are smooth enough.61
Fourier series provides another tool to represent functions. The class of functions which can be
represented as a Fourier series will turn out to be enormous.
In this section we will prove a couple of central results in the theory of Fourier series. The
interested reader can read more on the subject in [9], [10], [13], [14] and [19]. In particular, very
interesting but mostly elementary applications to geometry, ergodicity, number theory and PDEs
can be found in [13].
8.1. Fourier coefficients. In this section S = [0, 2π] and λ is the Lebesgue measure on (0, 2π).
For notational convenience we will write
Z b Z
f (x) dx := f dλ.
a [a,b]

for f ∈ L1 (0, 2π) and [a, b] ⊆ [0, 2π].


Definition 8.1. Let ek : [0, 2π] → C be given by ek (x) = eikx for k ∈ Z.62 Let f ∈ L1 (0, 2π).
(i) For k ∈ Z the k-th order Fourier coefficient is defined by63
Z 2π
1
fb(k) = f (x)ek (x) dx.
2π 0
(ii) for n ∈ N0 , the n-th partial sum of the Fourier series sn (f ) : [0, 2π] → C is defined by
X
(8.1) sn (f ) = fb(k)ek .
|k|≤n
P
(iii) A function of the form |k≤n ck ek with n ∈ N0 and (ck )|k|≤n in C is called a trigonometric
polynomial.
Remark 8.2.
(1) The reason to use the notion “trigonometric polynomial” is that ek (x) = (eix )k . Also note
that ek (x) = cos(kx) + i sin(kx) by Euler’s formula.
(2) Observe that by (the complex version of) Proposition 5.12 for each k ∈ Z,
Z 2π
1 1
(8.2) |fb(k)| ≤ |f (x)ek (x)| dx = kf k1 .
2π 0 2π
In Exercise 8.4 it will be shown that one even has limk→∞ fb(k) = 0.
(3) If f, g ∈ L1 (0, 2π), the following linearity property holds: (f\ + g)(k) = fb(k) + gb(k) for all
k ∈ Z. This follows directly from the linearity of the integral.
59Fourier analysis is named after the French mathematician Jean-Baptiste Fourier (1768–1830), and were intro-
duced in order to solve differential equations such as the heat equation
60Recall Euler’s formula: eix = cos(x) + i sin(x) for x ∈ R
61In Complex Function Theory these functions will be characterized as the so-called analytic functions
62Note that e = 1
0 [0,2π] .
63To calculate Fourier transforms numerically one can use the so-called fast Fourier transform FFT (see [13,
Section 7.1.3])
MEASURE AND INTEGRATION 37

Example 8.3. Given f ∈ L1 (0, 2π), the function sn (f ) is a trigonometric polynomial for each
n ∈ N. Each of the functions
einx + e−inx einx − e−inx
cos(nx) = , sin(nx) =
2 2i
is a trigonometric polynomial as well.
Finally note that if P is a trigonometric polynomial, then for any j ∈ N0 , P j is a trigonometric
polynomial as well.

The main questions in this section is whether we can reconstruct f from its Fourier coefficients.
More precisely:
P
• (representation) Which functions f : [0, 2π] → C can we write as f = k∈Z ck ek for
certain coefficients (ck )k∈Z ?
• (convergence) In what sense does the above series converge?
• (uniqueness) Does fb(k) = gb(k) for all k ∈ Z imply f = g ?
When considering convergence of Fourier series we will always consider the convergence of
X n
X
ck e k = ck ek as n → ∞.
|k|≤n k=−n

8.2. Weierstrass’ approximation result and uniqueness. Before we consider convergence of


Fourier series, we first we prove a fundamental result about the approximation by trigonometric
polynomials. It will be an essential ingredient in the uniqueness result in Theorem 8.5.

Theorem 8.4 (Weierstrass’ approximation theorem64 for periodic functions). The trigonometric
polynomials are dense in {f ∈ C([0, 2π]) : f (0) = f (2π)}.

Proof. Let f ∈ C([0, 2π]) be such that f (0) = f (2π) and let ε > 0 be arbitrary. It suffices to show
that there exists a trigonometric polynomial P such that kf − P k∞ < ε. We extend f periodically
to a function f : R → C. Since f is also uniformly continuous we can choose δ ∈ (0, π) such that
|x − y| < δ implies |f
P(x) −Pf (y)| < ε/2.
n
Define65 Fn = n1 k=1 |j|≤k−1 ej which is a periodic function as well. Define Pn : [0, 2π] → C
R 2π
by Pn (x) = 0 Fn (x − y)f (y) dy. Then since ej (x − y) = e2πix e−2πiy the following identity holds
2π n Z 2π n
1 1 X X 1X X
Z
Pn (x) = Fn (x−y)f (y) dz = ej (x) ej (−y)f (y) dy = ej (x)fb(j).
2π 0 2πn 0 n
k=1 |j|≤k−1 k=1 |j|≤k−1

This shows that Pn is a trigonometric polynomial.


By Exercise 8.3 the following identity holds

sin2 (nz/2)
(8.3) Fn (z) = , z ∈ (0, 2π).
n sin2 (z/2)

Therefore, Fn (z) ≥ 0, and thus we can write


2π n Z 2π n
1X X 1X
Z
kFn k1 = Fn (z) dz = ej (z) dz = 2π = 2π.
0 n 0 n
k=1 |j|≤k−1 k=1

64Originally Weierstrass proved that the polynomials are dense in C([a, b]). This can be derived from our version
of the theorem as indicated in Exercise 8.11
65This is called Féjer’s kernel
38 MEASURE AND INTEGRATION

Fix x ∈ [0, 2π]. It follows that


Z 2π Z 2π
2π|f (x) − Pn (x)| = f (x) Fn (x − y) dy − f (y)Fn (x − y) dy
0 0
Z 2π
= Fn (x − y)(f (x) − f (y)) dy
0
Z 2π
≤ |f (x) − f (y)|Fn (x − y) dy ≤ T1 + T2 ,
0
where T1 and T2 are the integrals over I := [x−δ, x+δ] and J := [0, x−δ]∪[x+δ, 2π], respectively.
On the interval I we can use the uniform continuity of f to estimate
Z
ε
T1 ≤ Fn (x − y) dy ≤ πε.
2 I
On J := [0, x − δ] ∪ [x + δ, 2π] we can estimate
Z Z
T2 ≤ 2kf k∞ Fn (x − y) dy = 2kf k∞ Fn (z) dz,
J Bδ

where we substituted z := x − y and where Bδ = [δ, 2π − δ]. Therefore, using (8.3) again and the
fact that | sin(z/2)| ≥ sin(δ/2) for z ∈ Bδ (recall that δ ≤ π) we obtain
|Bδ| 2π
Z
Fn (z) dz ≤ 2 ≤ .
Bδ n sin (δ/2) n sin2 (δ/2)
2kf k∞ ε
So choosing n ≥ 1 so large that n sin2 (δ/2)
< 2 we obtain T2 < πε.
T1 +T2
Therefore, combining the the estimates can conclude that |f (x) − Pn (x)| ≤ 2π < ε. Since
x ∈ [0, 2π] was arbitrary it follows that kf − Pn k∞ < ε as required. ,
Now we can deal with the uniqueness question for Fourier series. This is the most technical
part of this section and could be skipped it at first reading.
Theorem 8.5 (Uniqueness). If f ∈ L1 (0, 2π) satisfies fb(n) = 0 for all n ∈ Z, then f = 0 in
L1 (0, 2π).
Proof. 66 Step 1: First assume f ∈ C([0, 2π]) and f (0) = f (2π). By linearity it follows that for
each trigonometric polynomial P we have
Z 2π
f (x)P (x) dx = 0.
0
By Theorem 8.4 we can find trigonometric polynomials (Pn ) such that Pn → f uniformly. There-
fore, it follows that
Z 2π Z 2π
2
|f (x)| dx = lim f (x)Pn (x) dx = 0.
0 n→∞ 0
This implies f = 0.
Step 2: Next let f ∈ L1 (0, 2π) and assume fb(n) = 0 for all n ∈ Z. Let F : [0, 2π] → R be
defined by
Z t
F (t) = f (x) dx.
0
Then F ∈ C([0, 2π]) (see Example 6.7), and F (0) = F (2π) = 0.67 By Exercise 8.6 (b) Fb(k) =
fb(k)
ik = 0 for all k 6= 0. Now let g = F − C, where C = F (0). Then g ∈ C([0, 2π]), g(0) = g(2π)
b
and gb(k) = 0 for all k ∈ Z. Therefore, g = 0 by step 1 and hence F = C. Since F (0) = 0, this
yields F (x) = 0 for all x ∈ [0, 2π]. By Exercise 7.6 we find f = 0.
,
66There is a much better proof in the literature using the Féjer kernel as an approximate identity. Since
approximate identities are not part of these lecture notes, we proceed differently.
67Note that F (2π) = (2π)fb(0) = 0.
MEASURE AND INTEGRATION 39

8.3. Fourier series in L2 (0, 2π). In this section we will consider Fourier series in the Hilbert
space L2 (0, 2π). Here the inner product is given by
Z 2π
hf, gi = f (x)g(x) dx.
0

Note that L2 (0, 2π) ⊆ L1 (0, 2π) (see Exercise 7.3). Therefore, if f ∈ L2 (0, 2π) the Fourier coeffi-
cients are well-defined and fb(k) = (2π)−1 hf, ek i.
Let us recall as special case of Proposition 7.3 for f, g ∈ L2 (0, 2π),
(8.4) |hf, gi| ≤ kf k2 kgk2 (Cauchy–Schwarz inequality).
2
We will say that f, g ∈ L (0, 2π) are orthogonal if hf, gi = 0. Note that in this case the following
form of Pythagoras theorem holds68
(8.5) kf + gk22 = kf k22 + kgk22 .
Lemma 8.6 (Orthogonality). For j, k ∈ Z,

2π, if j = k;
hej , ek i =
0, if j 6= k.
Consequently, if finitely many (cj )j∈Z in C are nonzero, then
X 1
X  12
(8.6) cj ej = (2π) 2 |cj |2 .
2
j∈Z j∈Z

Proof. Indeed, if j 6= k, then using ek (x) = e−ikx we find that


Z 2π Z 2π Z 2π
i(j−k)x
hej , ek i = e dx = cos((j − k)x) dx + i sin((j − k)x) dx = 0
0 0 0
by periodicity of cos and sin. Similarly, one sees hej , ej i = 2π.
The final statement follows from
X 2 XX X
cj ej = cj ck hej , ek i = 2π |cj |2 .
2
j∈Z j∈Z k∈Z j∈Z

,
We extend this result to series using the completeness of L2 (0, 2π).
Theorem 8.7 (Riesz–Fischer, Convergence of Fourier series in L2 ).
X
(i) If (cn )n∈Z ∈ `2 , then g := cn en converges in L2 (0, 2π), and gb(n) = cn for all n ∈ Z, and
n∈Z
1
(8.7) kgk2 = (2π) 2 k(cn )n∈Z k`2 (Parseval’s identity)
X
(ii) If f ∈ L2 (0, 2π), then (fb(n))n∈Z in `2 and f = fb(n)en in L2 (0, 2π) and (8.7) holds with
n∈Z
g = f and cn = fb(n) for n ∈ Z.
Part (ii) shows that every L2 -function can be represented as a Fourier series. A similar result
holds for series of sine and cosine functions and can be derived as a consequence of the above
result (see Exercise 8.8).
P
Proof. (i): Let gn = |k|≤n ck ek for n ∈ N. We show that (gn )n≥1 is a Cauchy sequence. Let
ε > 0 and choose N ∈ N such that
 X  12 ε
|ck |2 < 1 .

|k|≥N
(2π) 2

68This follows by writing out kf + gk2 = hf + g, f + gi.


2
40 MEASURE AND INTEGRATION

Then for all integers n > m ≥ N by Lemma 8.6,


X 1
 X  21 1
 X  12
kgn − gm k2 = ck ek = (2π) 2 |ck |2 ≤ (2π) 2 |ck |2 < ε.
2
m<|k|≤n m<|k|≤n |k|≥N

This proves that (gn )n≥1 is a Cauchy sequence. By completeness (see Theorem 7.7), g :=
limn→∞ gn exists in L2 (0, 2π).
To check (8.7) note that by the continuity of k · k2 and Lemma 8.6,
1
 X  12 1
kgk2 = lim kgn k2 = lim (2π) 2 |ck |2 = (2π) 2 k(cn )k`2 .
n→∞ n→∞
|k|≤n
−1
Finally, note that gb(k) = (2π) hg, ek i = limn→∞ (2π)−1 hgn , ek i = ck .69
(ii): Fix n ∈ N. Since hf − sn (f ), ek i = 0 for each |k| ≤ n, also hf − sn (f ), sn (f )i = 0 and
hence (8.5) yields
(8.8) kf k22 = kf − sn (f ) + sn (f )k22 = kf − sn (f )k22 + ksn (f )k22 ≥ ksn (f )k2 .
Let ck = fb(k) for k ∈ Z. Then (8.8) yields:
(8.6) X
kf k22 ≥ ksn (f )k2 = 2π|ck |2 .
|k|≤n
−1
Letting n → ∞, we find k(ck )P k∈Z k`2 ≤ (2π) kf k2 < ∞.
By (i) we can define g = n∈Z cn en where the series converges in L2 (0, 2π). We claim that
f = g in L2 (0, 2π). To see this note that by (i), gb(n) = cn = fb(n). Therefore, the claim follows
from the uniqueness Theorem 8.5 applied to f − g. ,
For f ∈ L2 (0, 2π) Theorem 8.7 yields that f − sn (f ) = |k|>n fb(k)ek and by (8.7)
P
X
(8.9) (L2 -error estimate) kf − sn (f )k22 = 2π |fb(k)|2 .
|k|>n

Moreover, since sn (f ) is a trigonometric polynomial, we also find the following:70


Corollary 8.8. The trigonometric polynomials are dense in L2 (0, 2π).
Example 8.9 (Sawtooth function). Let f : [0, 2π) → R be defined by f (x) = x − π and extended
periodically on R. For k ∈ Z \ {0}, by the Fundamental Theorem of Calculus (see [11, Theorem
7.3.1]) and integration by parts,
Z 2π Z 2π
1 −ikx 1 h (x − π)e−ikx i2π 1 1
(8.10) f (k) =
b (x − π)e dx = − e−ikx dx = − .
2π 0 2π −ik 0 2π 0 ik
Clearly, fb(0) = 0. Therefore, Theorem 8.7 yield that f = − k∈Z\{0} eikk with convergence in
P
P∞
L2 (0, 2π). Moreover, using that 2 sin(kx) = ek −e i
−k
we also find that f = −2 k=1 sin(k·)k with
convergence in L2 (0, 2π) (see Figure 8.1) for plots of the partial sums).
The L2 -error can be estimated using (8.9):
X 1 Z ∞
X 1 4π
kf − sn (f )k22 = 2π |fb(k)|2 ≤ 4π 2
≤ 4π 2
dx = .
k n x n
|k|>n k≥n+1

One can show that sn (f ) will not converge to f uniformly (also see Figure 8.1). This is in particular
clear for x = 0 and x = 2π, because f (0) = π and f (2π) = −π, but sn (f )(0) = sn (f )(2π) = 0.
By applying (8.7) one can obtains a remarkable identity:
X 1
kf k22 = 2π .
k2
k∈Z\{0}

69Here we used |hg − g , e i| ≤ kg − g k ke k as follows from (8.5).


n k n 2 k 2
70With more advanced techniques one can show that the trigonometric polynomials are dense in any Lp (0, 2π)
with p ∈ [1, ∞).
MEASURE AND INTEGRATION 41

On the other hand, if we calculate kf k22 with the fundamental theorem of calculus, we obtain
Z 2π h i2π 2
kf k22 = (x − π)2 dx = 13 (x − π)3 = π3 .
0 0 3
P∞
Combining both identities gives k∈Z\{0} k12 = 13 π 2 , and so we find k=1 k12 = 16 π 2 .
P

4 4 4

2 2 2

−10 −5 5 10 −10 −5 5 10 −10 −5 5 10


−2 −2 −2

−4 −4 −4

Figure 8.1. The Fourier series of the sawtooth function with n = 2, n = 5 and
n = 10.

8.4. Fourier series in C([0, 2π]). In this section we will give some sufficient condition on f which
imply the Fourier series is uniformly convergent (or equivalently convergent in C([0, 2π]) with the
supremum norm k · k∞ ). Note that fn → f uniformly implies that fn → f in L2 (0, 2π). Indeed,
this follows from
Z 2π Z 2π
(8.11) kfn − f k22 ≤ |fn (x) − f (x)|2 dx ≤ kfn − f k2∞ 1 dx = 2πkfn − f k2∞ .
0 0
From the above we see that convergence of Fouier series in C([0, 2π]) is stronger than conver-
gence in L2 (0, 2π). However, there are example of functions f ∈ C([0, 2π]) with f (0) = f (2π) for
which the uniform convergence (and even the pointwise convergence) fails (see [1, Example 35.11]
and [10, Example 2.5.1]). So apparently more restrictive conditions are needed.
All the different types of convergence can be confusing. Let us summarize some convergence
results for a sequence (fn )n≥1 in L2 (0, 2π).71
L2 -conv. =⇒ L1 -conv. =⇒ a.e.-conv. subsequence.
=⇒

uniform conv.
=⇒

pointwise conv. =⇒ a.e.-conv.


L1 and L2 -convergence are only implied by a.e. conv. under additional assumptions on the (fn )n≥1
(as given for instance in the DCT).
The following result provides sufficient conditions for uniform convergence.
Theorem 8.10 (Absolute and uniform convergence of Fourier series). Let f ∈ C([0, 2π]). If
(fb(k))k∈Z ∈ `1 , then f (x) =
P
k∈Z f (k)ek (x), x ∈ [0, 2π] where the series is absolutely and
b
uniformly convergent.
As a consequence we see that f (0) = f (2π) holds in this situation, because ek (0) = ek (2π).
Proof. For all x ∈ [0, 2π],
X X
|fb(k)ek (x)| = |fb(k)| < ∞.
k∈Z k∈Z

71For completeness we note that a.e.-conv. =⇒ conv. in measure =⇒ conv. in distribution.


42 MEASURE AND INTEGRATION

P
Therefore, we can let g = k∈Z fb(k)ek , where the series is absolutely convergent. Moreover,
X
kg − sn (f )k∞ ≤ |fb(k)| → 0,
|k|>n

and hence g ∈ C([0, 2π]) and sn (f ) → g uniformly. By (8.11) the convergence holds in L2 (0, 2π)
as well, and hence
gb(k) = hg, ek i = lim hsn (f ), ek i = fb(k)
n→∞
and therefore, g = f a.e. by Theorem 8.5. Let A = {s ∈ [0, 2π] : f (s) = g(s)}. Then A is closed
and λ(A) = 2π. We claim that A is dense. Indeed, if not then there exists an nonempty open
interval I ⊆ [0, 2π] \ A. It follows that 0 < λ(I) ≤ λ([0, 2π] \ A) = λ([0, 2π]) − λ(A) = 0. This is a
contradiction and thus the claim follows. Since A is also closed in [0, 2π], the claim implies that
A = [0, 2π]. ,
From the proof we see that the following error estimate holds:
X
(8.12) (uniform error estimate) kf − sn (f )k∞ ≤ |fb(k)|.
|k|>n

The condition of Theorem 8.10 holds in the following situation:


Corollary 8.11. Assume f ∈ L2 (0, 2π) satisfies fb(0) = 0. Suppose c0 ∈ C and F : [0, 2π] → K is
Rt
given by F (t) = c0 + 0 f (x) dx. Then (Fb(k))k∈Z ∈ `1 and F = k∈Z Fb(k)ek where the series is
P
absolutely and uniformly convergent.
Proof. Since f ∈ L2 (0, 2π) ⊆ L1 (0, 2π), it follows from Example 6.7 that F is continuous on [0, 2π].
Moreover, for every t ∈ [0, 2π],
Z 2π
|F (t)| ≤ |c0 | + |f (x)| dx ≤ |c0 | + kf k1 .
0
1
In particular, by monotonicity we see that kF k2 ≤ (2π) 2 (|c0 | + kf k1 ).
By Exercise 8.6, Fb(k) = fik
b(k)
for k 6= 0. By Theorem 8.7, kfb(k))k∈Z k`2 = kf k2 . Therefore, by
the Cauchy–Schwarz inequality (8.4)
X X |fb(k)| C
|Fb(k)| = |Fb(0)| + ≤ |Fb(0)| + Ck(fb(k))k∈Z k`2 ≤ |Fb(0)| + 1 kf k2 ,
|k| (2π) 2
k∈Z k∈Z\{0}

where used (8.7) in the last step. Therefore, the absolute and uniform convergence follows from
Theorem 8.10. ,
Example 8.12. If g ∈ C([0, 2π]) satisfiesPg(0) = g(2π), g is piecewise continuously differentiable
on (0, 2π) and g 0 ∈ L2 (0, 2π), then g = k∈Z gb(k)ek where the series is absolutely and uniformly
Rt
convergent. Indeed, let F = g − g(0). Then F (t) = −g(0) + 0 g 0 (x) dx, where f := g 0 satisfies the
assumptions of Corollary 8.11.

Exercises
Exercise 8.1. Let f : [0, 2π) → R be given by f = 1[0,π] .
1
(a) Show that fb(k) = 0 for even k 6= 0, and fb(k) = πik for odd k, and fb(0) = 12 .
(b) Write f asPa series of sines as in Example 8.9 and give an estimate of L2 -error given by (8.9).
1
(c) Evaluate j∈Z (2j+1) 2.

Hint: Argue as in Example 8.9.


Exercise 8.2. Let f : [0, 2π) → R be given by f (x) = |x−π|. Example 8.12 yields that the Fourier
series of f is absolutely and uniformly convergent. Calculate the Fourier series explicitly and give
estimates for the L2 -error (8.9) and uniform error (8.12). One can also obtain the exact form of
another famous series k∈Z |fb(k)|2 using Parseval and the fundamental theorem of calculus.
P
MEASURE AND INTEGRATION 43

Exercise 8.3 (Special Fourier series and kernels). The following kernel’s play a central role in
more advanced theory of Fourier series. Prove the identities below for x ∈ (0, 2π). For each
Pn n+1
exercise you should use the geometric sum k=0 ak = 1−a 1−a for a ∈ C \ {1}.
X sin((n− 12 )x)
(a) (Dirichlet kernel) Show that Dn (x) := ek (x) = for n ≥ 1.
sin( 12 x)
|k|≤n−1
n
1X 1 sin2 (n x2 )
(b) (Féjer kernel) Show that Fn (x) := Dj (x) = for n ≥ 1.
n j=1 n sin2 ( 12 x)

Exercise∗ 8.4 (Riemann-Lebesgue lemma).


(a) Show that for any step function f : [0, 2π] → C (see Section 7.3) one has lim|k|→∞ fb(k) = 0.
Hint: By linearity it suffices to consider f = 1(a,b) , where (a, b) ⊆ [0, 2π].
(b) Show that for any f ∈ L1 (0, 2π) one has lim|k|→∞ fb(k) = 0.
Hint: Use Theorem 7.11 and (a).
Exercise∗ 8.5.
(a) Let (hH, h·, ·i) be a Hilbert space (over the complex scalars). Prove that for all u, v ∈ H,
(polarization) 4hu, vi = ku + vk2 − ku − vk2 + iku + ivk2 − iku − ivk2 .
(b) Use (a) and (8.7) to prove that for all f, g ∈ L2 (0, 2π):
Z 2π X
f (x)g(x) dx = 2π fb(k)b
g (k).
0 k∈Z
∗ 1 72 1
Exercise 8.6. Let g ∈ C ([0, 2π]) and f ∈ L (0, 2π). Define F : [0, 2π] → C by F (t) =
Rt
0
f (x) dx. By Example 6.7, F is continuous.
(a) Prove the following integration by parts formula:
Z 2π Z 2π
f (x)g(x) dx = F (2π)g(2π) − F (0)g(0) − F (x)g 0 (x) dx.
0 0
Hint: For continuous f this is just the standard integration by parts formula. Use Corollary
7.12 and approximation to deduce the general case.
(b) Show that fb(k) = fb(0) + ik Fb(k) for all k ∈ Z \ {0}.
Hint: Apply (a) with g = e−k .
Exercise∗ 8.7. Assume F : [0, 2π] → C is continuously differentiable and satisfies F (0) = F (2π)
and Fb(0) = 0. Let f = F 0 .
(a) Show that fb(k) = ik Fb(k) for all k ∈ Z.
Hint: Apply Exercise 8.6 (b).
(b) Show that kF k2 ≤ kf k2 and that equality holds if and only if F = c1 e1 +c−1 e−1 for c1 , c−1 ∈ C.
Hint: Apply (8.7).
Exercise∗ 8.8. Consider Γ = { 21 1[0,2π] } ∪ {cos(n·) : n ∈ N} ∪ {sin(n·) : n ∈ N} ⊆ L2 (0, 2π).
(a) Show that φ, ψ ∈ Γ with φ 6= ψ are orthogonal and kφk2 = π.
(b) Let f ∈ L1 (0, 2π) be such that hf, φi = 0 for all φ ∈ Γ. Show that f = 0.
Hint: Use Theorem 8.5
(c) Show that for every (an )n≥0 , (bn )n≥1 ∈ `2 the following series converges in L2 (0, 2π).
a0 X X
g := + an cos(n·) + bn sin(n·)
2
n≥1 n≥1

Hint: Argue as in Theorem P8.7.


(d) Show that kgk2L2 (0,2π) = π n≥0 |an |2 + π n≥1 |bn |2 .
P

Hint: Argue as in Theorem 8.7.


72That means g is differentiable and its derivative is continuous on [0, 2π]
44 MEASURE AND INTEGRATION

(e) Show that (an )n≥0 and (bn )n≥1 satisfy


1 2π 1 2π
Z Z
an = cos(nx)g(x) dx, bn = sin(nx)g(x) dx.
π 0 π 0
Hint: Argue as in Theorem 8.7.
(f) Show that every f ∈ L2 (0, 2π) can be written as
a0 X
f= + an cos(n·) + bn sin(n·)
2
n≥1
2
with converges in L (0, 2π).
Hint: Apply Theorem 8.7 or argue as in Theorem 8.7.
kf k1
It follows from the previous exercise and (8.2) that for C 1 -functions F one has |Fb(k)| ≤ 2π|k|
for all k ∈ Z \ {0}. Moreover, in Exercise 8.4 we have seen that for general F ∈ L1 (0, 2π) one has
Fb(k) → 0 as |k| → ∞. In the next exercise we show that the convergence can be arbitrary slow
even for periodic functions F ∈ C([0, 2π]).
Exercise∗∗ 8.9. Show that for any sequence (ck )k≥1 with ck 6= 0 and ck → 0 there exists a
function F ∈ C([0, 2π]) with F (0) = F (2π)
P∞such that |F (k)| ≥ |ck | for infinitely many k ∈ N.
b
Hint: Choose a subsequence such that n=1 |ckn | < ∞.
Finally we deduce Weierstrass’ classical approximation result. We first need an elementary
result about even trigonometric polynomials.
Pn
Exercise∗∗ 8.10. Let P = k=−n ck ek be a trigonometric polynomial.
(a) Show that there exists a polynomial qn of degree n such that cos(nx) = qn (cos(x)).
Hint: Use induction and the recursion formula cos(kx)+cos((k −2)x) = 2 cos((k −1)x) cos(x).
(b) Show that there exists a polynomial qn of degree n such that sin(nx)
sin(x) = rn (cos(x)).
Hint: Use induction and the recursion formula sin((k + 1)x) + sin((k − 1)x) = 2 cos(kx) sin(x).
(c) From (a) and (b) derive that there there exit polynomials q, r : [−π, π] → R of degree n such
that P (x) = q(cos(x)) + r(cos(x)) sin(x).
(d) If additionally P is even (i.e. P (−x) = P (x) for x ∈ [−π, π]). Show that there exists a
polynomial q of degree n such that P (x) = q(cos(x)).
Hint: Write 2P (x) = P (x) + P (−x) and use (c).
Exercise∗∗ 8.11 (Weierstrass’ approximation theorem for continuous functions). Let f : [−1, 1] →
R be continuous and let ε > 0. Let g : [−π, π] → R given by g(x) = f (cos(x)).
(a) Show that there is an even trigonometric polynomial P such that kg − P k∞ < ε.
Hint: First apply Theorem 8.4 on [−π, π] to obtain a trigonometric polynomial P such that
kg − P k∞ < ε. Now consider Pe(x) = P (−x)+P
2
(x)
.
(b) Use Exercise 8.10 (d) to find a a polynomial q such that kf − qk∞ < ε.
(c) The above shows that the polynomials are dense in C([−1, 1]). Use a scaling argument to
show that the polynomials are dense in C([a, b]).
MEASURE AND INTEGRATION 45

Final remarks:
• One can show that kf − Fn ∗ f kp → 0 for all f ∈ Lp (0, 2π) with p ∈ [1, ∞). In Theo-
rem 8.4 The convergence holds uniformly if f ∈ C([0, 2π]) satisfies f (0) = f (2π). Here
Fn ∗ f is the so-called convolution product of Fn and f and is defined by Fn ∗ f (t) =
R 2π
0
Fn (t − x)f (x) dx. Using the definition of Fn one can check that Fn ∗ f is a trigono-
metric polynomial.
• Similar results for Dn are true as well as long as p ∈ (1, ∞), but this a much deeper
result. Since Dn ∗ f = sn (f ), this implies that kf − sn (f )kp → 0 for all f ∈ Lp (0, 2π) with
p ∈ (1, ∞).
• Finally, we note that sn (f ) → f a.e. for any f ∈ Lp (0, 2π) with p > 1. This is one of the
deepest result in the theory of Fourier series and was proved by Carleson for p = 2 and
extended to p > 1 by Hunt in 1968. It was proved a long time before that the result fails
for p = 1 by Kolmogorov in 1923.

For details we refer to the elective Bachelor course on Fourier analysis !


46 MEASURE AND INTEGRATION

Appendix A. Dynkin’s lemma


The results of this section are not part of the exam material. We will proof the uniqueness
result of Proposition 3.5. In this section S denotes a set.
Definition A.1 (π-system). A collection E ⊆ P(S) is called a π-system if for all A, B ∈ E one
has A ∩ B ∈ E.
Example A.2. Every ring is a π-system.
Definition A.3. A collection D ⊆ P(S) is called a Dynkin-system73 if the following conditions
hold:
(D1) S ∈ D;
(D2) A, B ∈ D and A ⊆ B implies B \ A ∈ D;
(D3) If (An )n≥1 in D and 74 An ↑ A, then A ∈ D.
Example A.4. Let S = {1, 2, 3, 4} and D = {∅, S, {1, 2}, {3, 4}, {1, 3}, {2, 4}}. Then D is a Dynkin
system, but it is not a π-system.
Proposition A.5. For a collection F ⊆ P(S) the following are equivalent:
(i) F is a σ-algebra;
(ii) F is a Dynkin system and a π-system.
Proof. (i)⇒(ii): This is Exercise A.1.
(ii)⇒(i): ∅, S ∈ F follows from (D1) and (D2) of the definition of a Dynkin system. Let

S n
S
(An )n≥1 be a sequence in F. Let A = Aj and Bn = Aj for n ≥ 1. Since F is a π-system it
j=1 j=1
n
 T c
is closed under finite intersections. Therefore, using (D2) we obtain Bn = Acj ∈ F. Since
j=1
Bn ↑ A, it follows from (D3) that A ∈ F. ,
Lemma A.6 (Dynkin). Let E ⊆ P(S) be a π-system and D ⊆ P(S) be a Dynkin system. If
E ⊆ D, then σ(E) ⊆ D.
Proof. Let D0 denote the intersection of all Dynkin system which contain E. Observation: E ⊆
D0 ⊆ D and D0 is a Dynkin system (see Exercise A.2). We claim that D0 is a π-system as well.
As soon as we have proved this claim, Proposition A.5 yields that D0 is a σ-algebra. Therefore,
from the observation it follows that σ(E) ⊆ D0 ⊆ D. To prove the claim we need two steps.
Step 1: Define a new collection by
D1 = {D ∈ D0 : D ∩ E ∈ D0 for each E ∈ E}.
Since E is a π-system also E ⊆ D1 . The collection D1 is a Dynkin-system again. Indeed, S ∈ D1 is
clear. If A, B ∈ D1 and B ⊆ A, then for each E ∈ E we find (B \ A) ∩ E = (B ∩ E) \ (A ∩ E) ∈ D0 ,
because A ∩ E, B ∩ E ∈ D0 and D0 is a π-system, and thus B \ A ∈ D1 . Next let (An )n≥1 in D1

S
with An ↑ A. Then for each E ∈ E, A ∩ E = (An ∩ E) ∈ D0 since An ∩ E ∈ D0 . Since D0 ⊆ D1
n=1
and D1 is a Dynkin system which contains E, we find D1 = D0 .
Step 2: Define a new collection by
D2 = {D ∈ D0 : D ∩ C ∈ D0 for each C ∈ D0 }.
Since D1 = D0 , we find that E ⊆ D2 . As before one checks that D2 is a Dynkin system. Moreover,
as before this yields D2 = D0 . This proves the claim. -
Proposition A.7 (Uniqueness). Let µ1 and µ2 both be measures on measurable space (S, A).
Assume the following conditions:
(i) E ⊆ A is a π-system with σ(E) = A;
73Eugene Dynkin 1924–2014 was a Russian mathematician who worked on Algebra and Probability theory.
74See Definition 2.9 for the meaning of A ↑ A
n
MEASURE AND INTEGRATION 47

(ii) µ1 (S) = µ2 (S) < ∞ and µ1 (E) = µ2 (E) for all E ∈ E.


Then µ1 = µ2 on A.
Proof. Let D = {A ∈ A : µ1 (A) = µ2 (A)}. Then E ⊆ D. We claim that D is a Dynkin system.
From the claim and Lemma A.6 it follows that A = σ(E) ⊆ D ⊆ A. This implies D = A and the
required result follows from the Definition of D.
To prove the claim note that S ∈ D by assumption. If A, B ∈ D with A ⊆ B, then µ1 (B \ A) =
µ1 (B) − µ1 (A) = µ2 (B) − µ2 (A) = µ2 (B \ A) and hence B \ A ∈ D. Finally, if (An )n≥1 in D with
An ↑ A, then by Theorem 2.10, µj (An ) ↑ µj (A) for j = 0, 1. Since µ1 (An ) = µ2 (An ), this yields
µ1 (A) = µ2 (A) and thus A ∈ D. ,

Exercises
Exercise A.1. Prove Proposition A.5 (i) ⇒ (ii).
Exercise∗ A.2. Prove that the intersection of Dynkin systems is again a Dynkin system.
Exercise∗ A.3. Find a version of Proposition A.7 which for measures with µ1 (S) = µ2 (S) = ∞.
Hint: See the proof of Theorem 3.10.
48 MEASURE AND INTEGRATION

Appendix B. Carathéodory’s extension theorem


The results of this section are not part of the exam material (although the statement of the
Carathéodory’s extension in Theorem 3.1 is part of the exam).
In order to state and prove Carathéodory extension Theorem 3.1 we first a new concept of
measurability associated to a mapping α : P → [0, ∞] which satisfies α(∅) = 0.
Definition B.1. Let α : P(S) → [0, ∞] be a mapping which satisfies α(∅) = 0. We say that
A ⊆ S is α-measurable if
α(Q) = α(Q ∩ A) + α(Q ∩ Ac ), for all Q ∈ P(S).
The collection of all α-measurable sets is denoted by Mα .
The mapping α could be rather general and it will not be additive in general. However, the
cleverly defined collection Mα turns out to be a ring on which α is additive.
Lemma B.2. Let α : P(S) → [0, ∞] be a mapping which satisfies α(∅) = 0. Then Mα is a ring
and α is additive on Mα .
Proof. In order to check that Mα is a ring we check the following:
(i) ∅ ∈ Mα ;
(ii) A ∈ Mα =⇒ Ac ∈ Mα ;
(iii) A, B ∈ Mα =⇒ A ∩ B ∈ Mα .
Given these properties it is straightforward to check that Mα is a ring. Indeed, this follows from
the formulas B \ A = B ∩ Ac and A ∪ B = (Ac ∩ B c )c .
Properties (i) and (ii) are clear. It remains to check (iii). Let A, B ∈ Mα and write C = A ∩ B.
Let Q ∈ P(S) be arbitrary. Observation: A ∩ B c = C c ∩ A and Ac = C c ∩ Ac . One has
α(Q) = α(Q ∩ A) + α(Q ∩ Ac ) since A ∈ Mα
c c
= α(Q ∩ A ∩ B) + α(Q ∩ A ∩ B ) + α(Q ∩ A ) since B ∈ Mα
c c c
= α(Q ∩ C) + α(Q ∩ C ∩ A) + α(Q ∩ C ∩ A ) by the observation
c
= α(Q ∩ C) + α(Q ∩ C ) since A ∈ Mα .
Therefore, A ∩ B = C ∈ Mα which yields (iii).
To check that α is additive fix two disjoint sets A, B ∈ Mα and let Q = A ∪ B. Then Q ∩ A = A
and Q ∩ Ac = B. Therefore, since A ∈ Mα we find
α(A ∪ B) = α(Q) = α(Q ∩ A) + α(Q ∩ Ac ) = α(A) + α(B).
,

In Lemma B.2 we have seen that α is additive. In order to obtain a measure we need a further
condition on α.
Definition B.3. Let S be a set. A function α : P(S) → [0, ∞] is called an outer measure if
(i) α(∅) = 0;
(ii) (monotonicity) A ⊆ B =⇒ α(A) ≤ α(B).

 S  ∞
P
(iii) (σ-subadditivity) For each sequence (An )n≥1 in P(S) one has α An ≤ α(An ).
n=1 n=1

An outer measure is not necessarily a measure. For instance α(∅) = 0 and α(A) = 1 if A ⊆ S
is non-empty is an example of an outer measure which is not a measure (if S contains at least two
elements).
Next, we show that being an outer measure is the right additional ingredient to prove that α
is a measure on Mα .
Lemma B.4. Let α be an outer measure. Then Mα is a σ-algebra and α is a measure on (S, Mα ).
MEASURE AND INTEGRATION 49

Proof. Lemma B.2 gives that Mα is a ring and α is additive on Mα . It remains to check that for
any disjoint sequence (An )n≥1 ,

[ ∞
X
(B.1) A := An ∈ Mα and α(A) = α(An ).
n=1 n=1
n
S
Let Bn = Aj for each n ≥ 1. Fix an arbitrary Q ⊆ S. For every n ∈ N the following holds:
j=1
Pn
j=1 α(Q ∩ Aj ) + α(Q ∩ Ac ) = α(Q ∩ Bn ) + α(Q ∩ Ac ) (by Lemma B.2)
≤ α(Q ∩ Bn ) + α(Q ∩ Bnc ) (since Ac ⊆ Bnc )
= α(Q) (since Bn ∈ Mα )
Using first the σ-subadditivity of α and then the arbitrariness of n ∈ N in the above, we deduce

X
(B.2) α(Q ∩ A) + α(Q ∩ Ac ) ≤ α(Q ∩ Aj ) + α(Q ∩ Ac ) ≤ α(Q).
j=1

On the other hand by subadditivity also the converse estimate holds: α(Q) ≤ α(Q∩A)+α(Q∩Ac ),
and hence A ∈ Mα . Moreover, all inequalities in (B.2) have to be identities75, and hence (B.1)
follows by taking Q = A in (B.2). -
In the above results we have seen that with outer measures one can construct measures on
certain σ-algebras. Our next aim is to show that there is a natural outer measure associated to
an additive mapping µ on a ring R.
Lemma B.5. Let S be a set and R ⊆ P(S) be a ring. Suppose µ : R → [0, ∞] is additive and
satisfies µ(∅) = 0. For A ⊆ S define

nX ∞
[ o
(B.3) µ∗ (A) = inf µ(Bj ) : A ⊆ Bj , where Bj ∈ R for j ≥ 1 ,
j=1 j=1

where we let µ (A) = ∞ if the above set is empty. Then µ∗ is an outer measure

Proof. The mapping µ∗ : P(S) → [0, ∞] clearly satisfies (i) and (ii). In order to check (iii) let
(An )n≥1 in P and let ε > 0. If µ∗ (An ) = ∞ for some n ≥ 1, then (iii) is trivial. Next assume
µ∗ (An ) < ∞ for all n ≥ 1. Then by definition of µ∗ for each fixed n ≥ 1 we can find Bn,j ∈ R
such that
[∞ X∞
An ⊆ Bn,j and µ∗ (An ) + 2−n ε ≥ µ(Bn,j ).
j=1 j=1
∞ ∞
Bn,j and again by the definition of µ∗ we find
S S
Then An ⊆
n=1 n,j=1

[  ∞
X ∞
X ∞
X
µ∗ An ≤ µ(Bn , j) ≤ µ∗ (An ) + 2−n ε = ε + µ∗ (An ).
n=1 n,j=1 n=1 n=1

Since ε > 0 was arbitrary, this prove the required estimate. ,


Theorem B.6 (Carathéodory’s extension theorem). Let R ⊆ P(S) be a ring and µ : R → [0, ∞]
be σ-additive on R and µ(∅) = 0. Then µ∗ defined by (B.3) satisfies the following properties:
(i) µ∗ is a measure on the σ-algebra Mµ∗ ;
(ii) µ∗ (A) = µ(A) for all A ∈ R.
(iii) R ⊆ Mµ∗ ;
In particular, µ : σ(R) → [0, ∞] defined by µ(A) = µ∗ (A) defines a measure.
Clearly, Theorem 3.1 follows from the above statement and actually shows that there is a further
extension to the possibly larger σ-algebra Mµ∗ .
75Clearly, x ≤ y ≤ z ≤ x enforces x = y = z
50 MEASURE AND INTEGRATION

Proof. In Lemma B.5 we have proved that µ∗ is an outer measure. Therefore, assertion (i) follows
from Lemma B.4. In remains to prove (ii) and (iii). The assertion concerning µ follows from
(i)–(iii) as the restriction of a measure to a smaller σ-algebra is a measure again.
Step 1: Proof of (ii): Let A ∈ R. It is clear that µ∗ (A) ≤ µ(A). Indeed, take B1 = A and
Bn = ∅ for n ≥ 2 in (B.3). For the converse estimate the case µ∗ (A) = ∞ is clear. Now suppose

µ∗ (A) < ∞ and let B1 , B2 , . . . , ∈ R be such that A ⊆
S
Bn . Then by the σ-additivity of µ on
n=1

S
R and Theorem 2.8 (iii) applied to A = A ∩ Bn , we find
n=1

X ∞
X
µ(A) ≤ µ(A ∩ Bn ) ≤ µ(Bn ).
n=1 n=1
Taking the infimum over all (Bn )n≥1 as above yields µ(A) ≤ µ∗ (A).
Step 2: Proof of (iii): Let A ∈ R and Q ⊆ S. Since µ∗ is subadditive we find µ∗ (Q) ≤
µ (Q ∩ A) + µ∗ (Q ∩ Ac ). For the converse estimate the case µ∗ (Q) = ∞ is trivial. In case µ∗ (Q) <


Bn . Then Bn ∩ A, Bn ∩ Ac ∈ R for all n ≥ 1 and
S
∞, choose B1 , B2 , . . . , ∈ R such that Q ⊆
n=1

[ ∞
[
Q∩A⊆ Bn ∩ A and Q ∩ Ac ⊆ Bn ∩ Ac .
n=1 n=1
Therefore, using first the definition of µ∗ and then the additivity of µ on R, we find
X∞ X∞ ∞
X
µ∗ (Q ∩ A) + µ∗ (Q ∩ Ac ) ≤ µ(Bn ∩ A) + µ(Bn ∩ Ac ) = µ(Bn )
n=1 n=1 n=1
Taking the infimum over all (Bn )n≥1 as above gives µ∗ (Q ∩ A) + µ∗ (Q ∩ Ac ) ≤ µ∗ (Q). Combining
both estimates we can conclude A ∈ Mµ∗ . ,

Exercises
In the following exercise we show that Mµ∗ 6= P(S) in general.76
Exercise B.1. Let S = {1, 2, 3} and define a σ-algebra by A = {∅, S, {1, 2}, {3}}. Assume µ is a
measure satisfying µ({1, 2}) = µ({3}) = 12 .
(a) Show that µ∗ ({1}) = µ∗ ({2}) = 12 .
(b) Show that {1}, {2} ∈
/ Mµ∗ .
Exercise B.2. Let α : P(S) → [0, ∞] be an outer measure and suppose that A ⊆ P(S) satisfies
α(A) = 0. Show that A ∈ Mα .
Exercise∗ B.3. Assume the conditions of Theorem B.6 and assume µ is σ-finite on R, that means

S
there exists a sequence (Sn )n≥1 in R such that µ(Sn ) < ∞ for all n ≥ 1 and Sn = S. Prove
n=1
that the following are equivalent:
(a) A ∈ Mµ∗ ;
(b) There exists a B ∈ σ(R) such that A ⊆ B and µ∗ (B \ A) = 0
Hint: First reduce to the case of finite measure by intersecting with Sn . Use the definition of µ∗
given in (B.3).

76For the Lebesgue measure one also has M ∗ 6= P(R), but this is much harder to prove. See Appendix C
µ
MEASURE AND INTEGRATION 51

Appendix C. Non-measurable sets


Let λ be the Lebesgue measure on B(Rd ). Let λ∗ : P(S) → [0, ∞] be the outer measure
associated with λ (see Lemma B.5). The σ-algebra M(Rd ) := Mλ∗ introduced in Definition B.1
is usually called the Lebesgue σ-algebra. It follows from Theorem B.6 that F d ⊆ M(Rd ) and
thus also B(Rd ) ⊆ M(Rd ) and λ∗ is a measure on M(Rd ). In the sequel we write λ again for this
measure as it is just an extension of λ.
By Exercise B.3 for every A ∈ M(Rd ) there exists a B ∈ B(Rd ) such that A ⊆ B and λ(B \A) =
0. This shows that the Lebesgue σ-algebra is almost the same as the Borel σ-algebra up to sets
of measure zero. Some strange things can happen with nonmeasurable sets.
The balls of Banach and Tarski
One can cut a ball of radius one in Rd in such a way that it can be used to form two balls of
radius one. Of course something has to be nonmeasurable there. See:
https://en.wikipedia.org/wiki/Banach%E2%80%93Tarski_paradox

A set which Lebesgue measurable but not Borel measurable


There exist a set A ∈ M(Rd ) with λ(A) = 0, but A ∈ / B(Rd ). See [3, Appendix C] and [12,
page 53]
http://onlinelibrary.wiley.com/doi/10.1002/9781118032732.app3/pdf
http://www.math3ma.com/mathema/2015/8/9/lebesgue-but-not-borel
You can also read about this in the Bachelor thesis of Gerrit Vos:
http://resolver.tudelft.nl/uuid:30d69b56-b846-435e-9d44-6a31b840a836

There exist sets which are not Lebesgue measurable A subset of R which is not in M(Rd )
is given by Vitali’s example (see for example [4, Theorem 16.31]):
https://en.wikipedia.org/wiki/Vitali_set
52 MEASURE AND INTEGRATION

References
[1] C. D. Aliprantis and O. Burkinshaw. Principles of real analysis. Academic Press, Inc., San Diego, CA, third
edition, 1998.
[2] S. Banach. Théorie des opérations linéaires. Éditions Jacques Gabay, Sceaux, 1993. Reprint of the 1932 original.
[3] F. Burk. Lebesgue measure and integration. Pure and Applied Mathematics (New York). John Wiley & Sons,
Inc., New York, 1998. An introduction, A Wiley-Interscience Publication.
[4] N. L. Carothers. Real analysis. Cambridge University Press, Cambridge, 2000.
[5] D. H. Fremlin. Measure theory. Vol. 1, 2, 3, 4. Torres Fremlin, Colchester, 2004-2006.
[6] D.J.H. Garling. Inequalities: a journey into linear analysis. Cambridge University Press, Cambridge, 2007.
[7] R. Haberman. Applied Partial Differential Equations with Fourier Series and Boundary Valve Problems.
Pearson Higher Ed, 2012.
[8] O. Kallenberg. Foundations of modern probability. Probability and its Applications (New York). Springer-
Verlag, New York, second edition, 2002.
[9] Y. Katznelson. An introduction to harmonic analysis. Cambridge Mathematical Library. Cambridge University
Press, Cambridge, third edition, 2004.
[10] J Korevaar. Fourier analysis and related topics. https://staff.fnwi.uva.nl/j.korevaar/Foubook.pdf.
[11] S.R. Lay. Analysis. With an introduction to proof. 5th ed. Pearson, 2015.
[12] H. Royden and P. Fitzpatrick. Real analysis. 4th ed. New York, NY: Prentice Hall, 4th ed. edition, 2010.
[13] E.M. Stein and R. Shakarchi. Fourier analysis, volume 1 of Princeton Lectures in Analysis. Princeton University
Press, Princeton, NJ, 2003. An introduction.
[14] E.M. Stein and R. Shakarchi. Real analysis. Princeton Lectures in Analysis, III. Princeton University Press,
Princeton, NJ, 2005. Measure theory, integration, and Hilbert spaces.
[15] E.M. Stein and R. Shakarchi. Functional analysis, volume 4 of Princeton Lectures in Analysis. Princeton
University Press, Princeton, NJ, 2011. Introduction to further topics in analysis.
[16] D. Werner. Einführung in die höhere Analysis. Springer-Lehrbuch. [Springer Textbook]. Springer-Verlag,
Berlin, corrected edition, 2009.
[17] D. Williams. Probability with martingales. Cambridge Mathematical Textbooks. Cambridge University Press,
Cambridge, 1991.
[18] A. Zygmund. The role of Fourier series in the development of analysis. In Proceedings of the American Academy
Workshop on the Evolution of Modern Mathematics (Boston, Mass., 1974), volume 2, pages 591–594, 1975.
[19] A. Zygmund. Trigonometric series. Vol. I, II. Cambridge Mathematical Library. Cambridge University Press,
Cambridge, third edition, 2002. With a foreword by Robert A. Fefferman.

You might also like