Lecture Notes KW2
Lecture Notes KW2
Contents
Introduction 2
1. σ-algebras 3
2. Measures 6
3. Construction of measures 9
4. Measurable functions 13
5. Construction of the integral 18
5.1. Integral for simple functions 18
5.2. Integral for positive measurable functions 19
5.3. Integral for measurable functions 21
6. Convergence theorems and applications 25
6.1. The three main convergence results 25
6.2. Consequences and applications 26
7. Lp -spaces 30
7.1. Minkowski and Hölder’s inequalities 30
7.2. Completeness of Lp 31
7.3. Lp -spaces on intervals 32
8. Applications to Fourier series 36
8.1. Fourier coefficients 36
8.2. Weierstrass’ approximation result and uniqueness 37
8.3. Fourier series in L2 (0, 2π) 39
8.4. Fourier series in C([0, 2π]) 41
Appendix A. Dynkin’s lemma 46
Appendix B. Carathéodory’s extension theorem 48
Appendix C. Non-measurable sets 51
References 52
Introduction
These notes have been created for the “Measure and integration theory” part of a course on
real analysis at the TU Delft. Together with the first part of the course on metric spaces, these
notes form the mathematical basis for several bachelor courses and master courses in applied
mathematics at TU Delft.
I would like to thank Emiel Lorist and Bas Nieraeth for their support in the preparation of
these notes. All the figures have been created by Emiel Lorist. I would also like to thank the
students of the course on real analysis for pointing out the typos in the manuscript.
In Section 1 and 2 we introduce σ-algebras and measures. The Lebesgue measure is constructed
in Section 3 and is based on Appendix B on Carathéodory’s theorem. Uniqueness questions are
addressed in Appendix A on Dynkin’s monotone class theorem. The amount of books on measure
theory is almost not measurable. The lecture notes are based on [1], [8], [16] and [17]. A very
complete treatment of measure theory is given in the impressive works [5].
In Sections 5, 6, 7 we introduce the integration theory and the Lebesgue spaces Lp . This theory
is fundamental in modern (applied) mathematics. There are many excellent books which give
more detailed treatments on the subject. See for instance [1], [4], [6], [15] for detailed treatments.
In Section 8 we give a brief introduction to the theory of Fourier series. More thorough treat-
ments can be found in for example [9], [10], [13], [14] and [19]. A full course on Fourier Analysis
is offered as 3rd elective course based on the lecture notes [10]. The theory of Fourier series will
be used in the 2nd year bachelor course on Partial Differential Equations [7], but also in several
other parts of Mathematical Physics and Numerical Analysis.
We end this brief introduction with a quote from a historical note of Zygmund [18]:
“The Lebesgue integral did not arise via the theory of Fourier series but was created through the
necessities of measuring geometric figures. But once it was introduced, it had an enormous impact
on Analysis through Fourier series”:
• The Riesz-Fischer theorems 7.5, 8.7, in its initial formulation primarily a theorem on
Fourier series.
• The M. Riesz interpolation theorem.
• Structure of sets of measure 0.
• The theory of trigonometric series has become a workshop of new methods in analysis, a
place where new methods are first discovered before they are generalized and applied in
other contexts.
• The theory of Fourier series gave a fresh impulse to problems of the differentiability of
functions (Sobolev spaces etc.).
MEASURE AND INTEGRATION 3
1. σ-algebras
For a set S we write P(S) for its power set.
Definition 1.1. Let S be a set. A collection R ⊆ P(S) is called a ring if
(i) ∅ ∈ R;
(ii) A, B ∈ R =⇒ B \ A ∈ R;
n
S
(iii) n ∈ N, A1 , A2 , . . . , An ∈ R =⇒ Aj ∈ R.
j=1
Remark 1.2.
(1) An equivalent definition is obtained if one replaces (iii) by A, B ∈ R =⇒ A ∪ B ∈ R. This
follows by induction.
(2) If R is a ring, then for all A, B ∈ R one has A ∩ B ∈ R. Indeed, this follows from the identity
A ∩ B = A \ (A \ B).
Definition 1.3. Let S be a set. A family A ⊆ P(S) is called a σ-algebra1 if
(i) ∅, S ∈ A;
(ii) A ∈ A =⇒ Ac ∈ A;
∞
S
(iii) A1 , A2 , . . . , ∈ A =⇒ Aj ∈ A.
j=1
The sets A ∈ A are often called the measurable subsets of S.
Remark 1.4.
∞
T
(1) If A is a σ-algebra, then for all A1 , A2 , . . . , ∈ A one has Aj ∈ A. This follows from the
j=1
∞ S∞ c
Acj .
T
identity Aj =
j=1 j=1
(2) Every σ-algebra is a ring. This follows from the identity B \ A = B ∩ Ac .
Example 1.5. Let S be a set.
(a) A = {S, ∅} is the smallest possible σ-algebra on S.
(b) A = P(S) is the largest possible σ-algebra on S.
Example 1.6. Let S = {1, 2, 3}.
(a) Let A = {∅, S, {1}, {2, 3}}. Then A is a σ-algebra.
(b) Let B = {∅, S, {1}, {2, 3}, {1, 2}}. Then B is not a σ-algebra.
Example 1.7. Let S be a set.
(a) The set R = {A ⊆ S : A is finite} is a ring.
(b) Let2 A = {A ⊆ S : A is countable or Ac is countable}. Then A is a σ-algebra (see Exercise
1.2). It is called the countable-cocountable σ-algebra and is useful for counterexamples.
We continue with a more serious example which plays a crucial role in later constructions.
Example 1.8.
(a) Let S = R. Let I 1 be the collection of all sets of the form (a, b] with a ≤ b. These intervals
will be called half-open intervals. Then I 1 is not a ring since for instance (0, 3] \ (1, 2] =
(0, 1] ∪ (2, 3] is not in I 1 .
(b) Let S = R. Let F 1 be the collection of sets which can be written as a finite union of half-open
intervals (thus of the form (a, b] with a ≤ b). Then F 1 is not a σ-algebra for several reasons.3
We check that F 1 is a ring. (i) follows from ∅ = (1, 1] ∈ F 1 . (iii) is clear since a finite union
of a finite union of intervals of the form (a, b] is again a finite union. It remains to check (ii).
1In part of the literature a σ-algebra is also called a σ-field
2Recall that a set A ⊆ S is countable if A is finite or there is a bijection f : N → A.
∞
3For instance S (0, 1 − 1 ] = (0, 1) is not in F 1 .
n
n=1
4 MEASURE AND INTEGRATION
For this we first note that it is simple to check that for A, B ∈ F 1 one has A ∩ B ∈ F 1 and by
induction this extends to the intersections of finitely many sets. For two intervals (a, b] and
(c, d] using B \ A = B ∩ Ac and R \ (a, b] = (−∞, a] ∪ (b, ∞) we find
(c, d] \ (a, b] = (c, d] ∩ (R \ (a, b])
= (c, d] ∩ (−∞, a] ∪ (c, d] ∩ (b, ∞)
= (c, a ∧ d] ∪ (b ∨ c, d].
m n
This is in F 1 again. Now if A = (cj , dj ] are in F 1 , then
S S
(ai , bi ] and B =
i=1 j=1
n
[ n \
[ m
B\A= (cj , dj ] \ A = (cj , dj ] \ (ai , bi ].
j=1 j=1 i=1
Definition 1.10. Let S be a set and let F ⊆ P(S). We write σ(F) for the smallest σ-algebra
which contains F. Then σ(F) is called the σ-algebra generated by F. More precisely:4
\
σ(F) = {A : A is a σ-algebra on S and F ⊆ A}.
Example 1.11. Let S = {1, 2, 3}, F = {{1, 2}} and G = {{2, 3}, {1, 2}}.
(a) σ(F) = {∅, S, {1, 2}, {3}}.
(b) σ(G) = P(S). Indeed, {2, 3}c = {1}, {1, 2}c = {3} and {2, 3} ∩ {1, 2} = {2}. Thus the
singletons {1}, {2} and {3} are in σ(G). Therefore, the required result follows since we can
form every subset of S by taking suitable finite unions.
Example 1.12. Let S = N and F = {{n} : n ∈ N}. Then σ(F) = P(N).
Definition 1.13. Let (S, d) be a metric space.5 Let B(S) be the σ-algebra generated by the open
sets in S. Thus
B(S) = σ{open sets in S}.
The σ-algebra B(S) is called6 the Borel σ-algebra of S. The sets of B(S) are called the Borel
subsets of S.
Example 1.14. One of the most important σ-algebras is the Borel σ-algebra of R which is usually
denoted by B(R). Later on we will show that B(R) 6= P(R).
The following lemma will be useful in some of the exercises on the Borel σ-algebra of R and Rd .
d d
S 1.15 (Lindelöf). Let A ⊆ R . Assume that for each
Lemma S i ∈ I let Oi ⊆ R be open. If
A⊆ Oi , then there exists a countable J ⊆ I such that A ⊆ Oi
i∈I i∈J
4Note that Proposition 1.9 ensures that σ(F ) is indeed a σ-algebra. The intersection makes sure we obtain the
smallest possible one
5More generally one could take any topological space here
6Named after the French mathematician Félix Borel 1871-1956
MEASURE AND INTEGRATION 5
Proof. Choose for each x ∈ A, ix ∈ I and rx > 0 such that B(x, rx ) ⊆ Oix . For each x ∈ A choose
ax ∈ Qd and sx ∈ Q ∩ (0,
S ∞) such that x ∈ B(ax , sx ) ⊆ B(x, rx ) ⊆ Oix . Let F = {B(ax , sx ) : x ∈
A}. Then clearly A ⊆ B(ax , sx ). Moreover, F has at most countably many sets. Indeed, this
x∈A
follows from the fact that it is a subset of {B(q, r) : q ∈ Qd , r ∈ Q ∩ (0, ∞)} which is countable.
Therefore, we can write F = {B(axn , sxn ) : n ∈ N}
S with xn ∈ A for each n ∈ N.
Now let J = {ixn ∈ I : n ∈ N}. Then A ⊆ Oi . Indeed, if x ∈ A, then x ∈ B(ax , sx ) and
i∈J S
choosing n ∈ N such that axn = ax and sxn = sx we find that x ∈ Oixn ⊆ Oi ,
i∈J
Exercises
Exercise 1.1. Let S = R and F = {A ⊆ R : A ⊆ [0, 1] or Ac ⊆ [0, 1]}. Is F a ring?
Exercise∗ 1.2. Prove that the collection in Example 1.7 (b) is a σ-algebra.
Exercise 1.3.
(a) Prove Proposition 1.9.
(b) Give an example of two σ-algebras A and B on S = {1, 2, 3} such that A∪B is not a σ-algebra.
Exercise∗ 1.4. Let S be a set and let F = {{s} : s ∈ S} be the collection consisting of all
sets which contain one element of S. Show that σ(F) coincides with the countable-cocountable
σ-algebra of Example 1.7.
Hint: Use the followsing (well-known) facts: The subset of a countable set is again countable;
The countable union of countable sets is again countable.
Exercise 1.5. Show that N, Q, R \ Q ∈ B(R). That is N, Q and R \ Q are Borel subsets of R.
Exercise∗ 1.6. Consider the following collection B0 = {(−∞, x) : x ∈ R}) of subsets of R.
(a) Show that σ(B0 ) contains all open intervals.
(b) Show that every open set in R can be written as the union of countably many open intervals.
Hint: Use Lindelöf’s Lemma 1.15.
(c) Conclude that σ(B0 ) = B(R).
Exercise∗ 1.7. Let I d ⊆ F d be the collection of half-open rectangles of Example 1.8. Prove the
following assertions:
(a) If I, J ∈ I d then I ∩ J ∈ I d .
(b) If I, J ∈ I d , then I \J is the union of finitely many disjoint sets from I d , and thus I \J ∈ F d .
Hint: Use induction on the dimension d. Use Example 1.8 (b) for d = 1.
(c) Each A ∈ F d can be written as union of finitely many disjoint sets in I d .
n
Ik with I1 , . . . , In ∈ I d .
S
Hint: Use induction on n to prove this for all sets of the form A =
k=1
(d) F d is a ring.
Exercise∗∗ 1.8. Prove that a σ-algebra is either finite or uncountable.7
Hint: Recall that P(N) is uncountable.
7This shows that σ-algebras are either easy finite sets or quite complicated
6 MEASURE AND INTEGRATION
2. Measures
Definition 2.1. Let S be a set and R ⊆ P(S) a ring. Let µ : R → [0, ∞] be a function with8
µ(∅) = 0
(i) µ is called additive if for all disjoint9 A1 , . . . , An ∈ R one has
[ n X n
µ Aj = µ(Aj ).
j=1 j=1
∞
S
(ii) µ is called σ-additive if for each disjoint sequence (An )n≥1 in R which satisfies An ∈ R
n=1
it holds that
∞
[ X∞
µ Aj = µ(Aj ).
j=1 j=1
Remark 2.2.
(1) By an induction argument it suffices to consider n = 2 in the definition of additive. (See
Exercise 2.1).
(2) If µ is σ-additive, then it is additive as follows by taking Am = ∅ for m ≥ n + 1.
Definition 2.3. Let S be a set and let A be a σ-algebra on S.
(i) The pair (S, A) is called a measurable space.
(ii) A function µ : A → [0, ∞] which satisfies µ(∅) = 0 and which is σ-additive on A is called a
measure. In this case the triple (S, A, µ) is called a measure space.
(iii) If additionally to (ii) µ(S) < ∞, then µ is called a finite measure. If moreover, µ(S) = 1,
then µ is called a probability measure and (S, A, µ) is called a probability space. 10
Example 2.4 (Counting measure). Let S = N and A = P(N). We write #A for the number of
elements of a finite set A, and we set #A = ∞ if A is infinite. Let µ : A → [0, ∞] be given by
µ(A) = #A. Then µ is a measure. Often µ is denoted by τ and called the counting measure.
Example 2.5 (Dirac measure/Dirac’s delta function). Let S = R, A = P(R). Let x ∈ R. Let
δx : A → [0, ∞] be given by δx (A) = 1 if x ∈ A and δx (A) = 0 if x ∈ R \ A. Then δx is a measure.
It is usually called the Dirac measure11 at x.
Example 2.6. Let S be a set and let µ : P(S) → [0, ∞] be given by µ(∅) = 0 and µ(A) = 1 if
A 6= ∅. If S contains at least two elements, then µ is not a measure.
Example 2.7 (Length of an interval). 12 Let R = F 1 as in Example 1.8. For a ≤ b let λ((a, b]) =
b − a (length of the interval). If A = (a1 , b1 ] ∪
P.n. . ∪ (an , bn ] is a union of disjoint sets such that
aj ≤ bj for j = 1, . . . , n, then we let13 λ(A) = j=1 bj − aj . Then λ is additive. Later we will see
that λ is σ-additive on F 1 and has an extension to a measure on σ(F 1 ) = B(R).
Theorem 2.8. Let R be a ring and µ : R → [0, ∞] be additive. The following assertions hold:
(i) If A, B ∈ R and A ⊆ B, then µ(A) ≤ µ(B) (monotonicity).
∞
S
(ii) If A1 , A2 , . . . ∈ R are disjoint and Aj ∈ R, then
j=1
∞
[ X∞
µ Aj ≥ µ(Aj ).
j=1 j=1
∞
S
(iii) If A1 , A2 , . . . ∈ R and Aj ∈ R and µ is σ-additive on R, then
j=1
∞
[ X∞
µ Aj ≤ µ(Aj ) (σ-subadditivity).
j=1 j=1
∞
[ ∞
[ ∞
X ∞
(i) X
µ Aj =µ Bj = µ(Bj ) ≤ µ(Aj ).
j=1 j=1 j=1 j=1
,
Let (an )n≥1 be a sequence of real numbers. We write an ↑ a if (an )n≥1 is an increasing sequence
which converges to a. Similarly, we write an ↓ a if it decreases and converges to a. This notation
will now be extended to sets.
Definition 2.9. Let S be a set.
(i) A sequence (An )n≥1 of subsets of S will be called increasing if An ⊆ An+1 for all n ∈ N.
∞
S
In this case we write An ↑ A, where A = An .
n=1
(ii) A sequence (An )n≥1 of subsets of S will be called decreasing if An ⊇ An+1 for all n ∈ N.
∞
T
In this case we write An ↓ A, where A = An .
n=1
Theorem 2.10. Let (S, A, µ) be a measure space and let (An )n≥1 be a sequence in A.
(i) If An ↑ A, then µ(An ) ↑ µ(A).
(ii) If An ↓ A and µ(A1 ) < ∞, then µ(An ) ↓ µ(A).
Proof. (i): Define (Bn )n≥1 as in (2.1). Then (Bn )n≥1 is a disjoint sequence and the following
∞
S n
S
identities hold: Bj = A and Bj = An . Therefore, the σ-additivity of µ gives
j=1 j=1
∞
X n
X
µ(A) = µ(Bj ) = lim µ(Bj ) = lim µ(An ).
n→∞ n→∞
j=1 j=1
∞
S
Proof. Let (Bj )j≥1 be a disjoint sequence in R with B := Bj ∈ R. We need to show that
j=1
∞
X
(2.2) µ(B) = µ(Bj ).
j=1
∞
S
Let An = Bj = B \ (B1 ∪ . . . ∪ Bn−1 ). Then An ∈ R and An ↓ ∅. Now the assumption yields
j=n
µ(An ) → 0. On the other hand
n−1
X
µ(B) = µ(An ∪ B1 ∪ B2 ∪ . . . ∪ Bn−1 ) = µ(An ) + µ(Bj ).
j=1
n−1
X
Therefore, µ(B) − µ(Bj ) = µ(An ) → 0 and (2.2) follows. ,
j=1
Exercises
Exercise 2.1. Let R be a ring on a set S. Assume µ : R → [0, ∞] satisfies µ(∅) = 0 and
µ(A ∪ B) = µ(A) + µ(B) for all sets A, B ∈ R with A ∩ B = ∅. Show that µ is additive.
Exercise 2.2. Let R be a ring on a set S. Let µ : R → [0, ∞] be additive.
(a) Prove that for A, B ∈ R with µ(A) < ∞ and A ⊆ B one has
µ(B \ A) = µ(B) − µ(A).
(b) Prove that for A, B ∈ R with µ(A) < ∞ one has
µ(A ∪ B) = µ(A) + µ(B) − µ(A ∩ B).
(c) Show that for any n ∈ N and any sets (Aj )nj=1 in R,
finite subadditivity µ(A1 ∪ . . . ∪ An ) ≤ µ(A1 ) + . . . + µ(An ).
X2.3. Let (an )n≥1 be numbers in [0, ∞). Set µ(∅) = 0 and define µ : P(N) → [0, ∞] by
Exercise
µ(A) = an . Prove that µ is a measure on N.
n∈A
Exercise∗ 2.4.
(a) Prove Theorem 2.10 (ii).
(b) Give an example of a measure space (S, A, µ) and sets An ∈ A such that An ↓ ∅ and
µ(An ) = ∞ for all n ∈ N.
Hint: Use the counting measure.
Exercise∗ 2.5. Let (S, A, µ) be a measure space. For A1 , A2 , . . . ⊆ S define
∞ [
\ ∞
lim sup An = An .
n→∞
k=1 n=k
(a) Show that s ∈ lim sup An if and only if there are infinitely many n ∈ N such that s ∈ An .
n→∞
(b) Assume A1 , A2 , . . . ∈ A. Show that lim sup An ∈ A.
n→∞
∞
X
(c) Assume A1 , A2 , . . . ∈ A satisfy µ(An ) < ∞. Show that µ(lim sup An ) = 0.14
n→∞
n=1
Exercise∗ 2.6. Let A be the σ-algebra from Example 1.7 (b) with S = R. Define µ : A → [0, ∞]
by µ(A) = 0 if A is countable and µ(A) = 1 if Ac is countable. Show that µ is a measure.
3. Construction of measures
It is not a simple task to construct a measure. In this section we will construct the Lebesgue
measure on Rd of which we have previously shown it is an additive mapping on the ring F 1 in
Example 2.7. To extend it to the Borel σ-algebra we use a deep result of Carathéodory.15 His
result basically says that it is enough to check that a measure is σ-additive on a ring generating
the desired σ-algebra. A detailed proof can be found in Theorem B.6 in the appendix, but it will
do no harm if one takes the result for granted.16
Theorem 3.1 (Carathéodory’s extension theorem). Let S be a set and let R ⊆ P(S) be a ring.
Suppose that µ : R → [0, ∞] is σ-additive on R and µ(∅) = 0. Then µ extends to a measure µ on
σ(R). More precisely, there exists a measure µ on (S, σ(R)) such that µ(A) = µ(A) for all A ∈ R.
Remark 3.2. The measure µ is often unique.17 When there is no danger of confusion we will write
µ again for the extension to σ(R). However, in general one has to be careful about uniqueness.
For instance if we define µ on the ring F 1 by µ(A) = ∞ if A ∈ F 1 is nonempty, then µ has at
least two extensions: the counting measure on B(R) is an extension of µ, but also the measure
ν : B(R) → [0, ∞] given by ν(A) = ∞ if A 6= ∅ is an extension of µ.
We continue with a uniqueness result which will be proved in the appendix.
Definition 3.3 (π-system). A collection E ⊆ P(S) is called a π-system if for all A, B ∈ E one
has A ∩ B ∈ E.
Example 3.4.
(a) Every ring is a π-system.
(b) Let S = Rd . The half-open rectangles I d are a π-system.
The following result will be proved in Proposition A.7.
Proposition 3.5 (Uniqueness). Let µ1 and µ2 both be measures on measurable space (S, A).
Assume the following conditions:
(i) E ⊆ A is a π-system with σ(E) = A;
(ii) µ1 (S) = µ2 (S) < ∞ and µ1 (E) = µ2 (E) for all E ∈ E.
Then µ1 = µ2 on A.
We continue with the construction of the Lebesgue 18 measure λ. In Example 1.8 we intro-
duced the half-open intervals (a, b] ∈ I 1 with a ≤ b. Also recall that F 1 denotes the collection
of all finite unions of half-open intervals. We have seen that F 1 is a ring. Moreover, in of course
every set in F 1 can be written as a finite union of disjoint half-open intervals.
Definition 3.6 (on unions of half-open intervals). For A ∈ F 1 of the form A = (a1 , b1 ] ∪ . . . ∪
(an , bn ] with disjoint ((aj , bj ])nj=1 in I 1 define λ : F 1 → [0, ∞] as the sum of the lengths:
n
X
λ(A) = (bj − aj ).
j=1
The above is well-defined. To see this assume A = (c1 , d1 ]∪. . . (cm , dm ] is another representation
of A as a union of disjoint intervals. Let Iij = (ci , di ] ∩ (aj , bj ]. Then either Iij is empty or a half
open interval,
m
[ [n
Iij = (aj , bj ] and Iij = (cj , dj ].
i=1 j=1
15Carathéodory 1873-1950 was a Greek mathematician working in Analysis, but also on Thermodynamics.
16The appendix is not part of the exam
17For instance when µ is a finite measure. See Proposition A.7.
18Henri Lebesgue 1875–1941 was a French mathematician well-known for his integration theory. See Section 5.
10 MEASURE AND INTEGRATION
which proves the well-definedness. Alternatively, one can observe that λ(A) coincides with the
Riemann integral of 19 1A . Indeed, fix an interval I such that A ⊆ I. By linearity of the Riemann
integral
Z Xn Z n
X
1A dx = 1(aj ,bj ] dx = (bj − aj ) = λ(A).
I j=1 I j=1
In Example 1.8 we introduced the half-open rectangles (a, b] ∈ I d with a = (α1 , . . . , αd ) and
b = (β1 , . . . , βd ) and αj ≤ βj for j ∈ {1, . . . , d}. Also recall that F d denotes the collection of all
finite unions of half-open rectangles. By Exercise 1.7 (d), F d is a ring. Moreover, in Exercise
1.7 (c) it was shown that every set in F d can be written as a finite union of disjoint half-open
rectangles.
Definition 3.7 (on unions of half-open rectangles). For a half open rectangle I = (a, b] ∈ I d with
a = (α1 , . . . , αd ) and b = (β1 , . . . , βd ) let its volume be denoted by
d
Y
|I| = (βj − αj ).
j=1
n
T n
S n
S
Step 2: Let Cn = Bj for n ≥ 1. For every n ≥ 1, An \ Cn = (An \ Bj ) ⊆ (Aj \ Bj ).
j=1 j=1 j=1
Therefore, using Theorem 2.8 (i) in (∗) and Exercise 2.2 (c) in (∗∗), we find
(∗) [n n
(∗∗) X Xn
λ(An \ Cn ) ≤ λ (Aj \ Bj ) ≤ λ(Aj \ Bj ) ≤ 2−j ε < ε.
j=1 j=1 j=1
Since Cn = ∅ for all n ≥ N , we can conclude that λ(An ) = λ(An \ Cn ) < ε for every n ≥ N . ,
We can now deduce the main result of this section.
Theorem 3.10 (Lebesgue measure). There exists a unique measure λ on (Rd , B(Rd )) such that
for all half-open rectangles I, one has λ(I) = |I|, where |I| is the volume of I. Moreover, for all
h ∈ Rd and A ∈ B(Rd ), λ(A + h) = λ(A).21
In the above A + h := {x + h : x ∈ A}.
Proof. Step 1: Existence. In Lemma 3.9 we have shown that λ is σ-additive on the ring F d .
Therefore, by Theorem 3.1 λ extends to a measure on σ(F d ) = B(Rd ) (see Exercise 3.1).
Step 2: Uniqueness. Let µ be another measure such that µ(I) = |I| for half-open rectangles
I ∈ I d . Fix n ∈ N and let Sn = (−n, n]d . Define λ(n) and µ(n) on B(Rd ) by
λ(n) (A) = λ(A ∩ Sn ) and µ(n) (A) = µ(A ∩ Sn ).
Then λ(n) and µ(n) are measures and λ(n) (Rd ) = λ(Sn ) = |Sn | and similarly µ(n) (Rd ) = |Sn |.
Since λ(n) and µ(n) coincide on I d , it follows from Example 3.4 (b) and Proposition 3.5 that
λ(n) = µ(n) on B(Rd ). Therefore, for any A ∈ B(Rd ), since A ∩ Sn ↑ A Theorem 2.10 yields
λ(n) (A) = λ(A ∩ Sn ) → λ(A) and µ(n) (A) = µ(A ∩ Sn ) → µ(A).
Thus λ(A) = µ(A).
Step 3: Translation invariance: Let h ∈ Rd . We claim that for every A ∈ B(Rd ) one has
A + h ∈ B(Rd ). For this let Ah = {A ∈ B(Rd ) : A + h ∈ B(Rd )}. By definition Ah ⊆ B(Rd ).
One can check that Ah is a σ-algebra. For each open set A one has A + h is open and hence
A + h ∈ B(Rd ). Therefore, B(Rd ) = σ({open sets}) ⊆ Ah , and the claim follows.
Define µh on B(Rd ) by µh (A) = λ(A + h). Then µh is a measure and for any half-open
rectangle I, µh (I) = |I + h| = |I| = λ(I). By the uniqueness of step 2, we find µh (A) = λ(A) for
all A ∈ B(Rd ) and this proved the result. ,
Remark 3.11. From Theorem B.6 one can actually see that for any A ∈ B(Rd ),
∞
nX ∞
[ o
λ(A) = inf |Ij | : A ⊆ Ij , where (Ij )j≥1 are disjoint half-open rectangles ,
j=1 j=1
22From this exercise we see that up to a scaling factor, λ is the only translation invariant measure on B(R). The
case for dimensions d ≥ 2 holds as well and can be proved in a similar way.
23Thomas Stieltjes 1856-1894 was a Dutch mathematician working in Analysis. He has even worked in Delft.
MEASURE AND INTEGRATION 13
4. Measurable functions
One of the aims will be to integrate functions f : S → R with respect to a measure µ on a
measurable space (S, A). A way to do this is to use discretization in the range24 of f . So we would
like to know the measure of for instance the set Ay,ε = {s ∈ S : f (s) ∈ [y, y + ε]}. Knowing this
for all y ∈ R and all ε > 0 makes it possible estimate the “area” under f . Of course we do need
the sets to be in A to make this work. This is the motivation of the definition of measurability.
See Section 5 for details on integration.
The natural setting to introduce measurability of functions is a follows:
Definition 4.1. Let (S, A) and (T, B) be two measurable spaces. A function f : S → T is called
measurable if for each B ∈ B, one has25 f −1 (B) ∈ A.
Remark 4.2.
(1) The composition of two measurable function is again measurable (see Exercise 4.1).
(2) Instead of f −1 (B) or {s ∈ S : f (s) ∈ B} one sometimes writes {f ∈ B} for the same set.
(3) In probability theory measurable functions are called random variables.
It suffices to check measurability on a generating collection F ⊆ B:
Lemma 4.3. Let (S, A) and (T, B) be two measurable spaces and let f : S → T . Suppose F ⊆ B
is such that σ(F) = B. If f −1 (F ) ∈ A for all F ∈ F, then f is measurable.
Proof. Define Be = {B ∈ B : f −1 (B) ∈ A}. Our aim is to show that Be = B. We claim that Be
is a σ-algebra. Indeed, since f −1 (∅) = ∅ ∈ A also ∅ ∈ B.e Similarly, since f −1 (T ) = S ∈ A,
−1 −1
e then f (T \ B) = S \ f (B) ∈ A, which implies that B c ∈ B.
we find T ∈ B. e If B ∈ B, e If
B1 , B2 , . . . ∈ B, then
e
∞
[ [∞
f −1 Bn = f −1 (Bn ) ∈ A.
n=1 n=1
Therefore, the claim follows.
Now since F ⊆ B, e the claim yields B = σ(F) ⊆ Be ⊆ B. This implies Be = B. ,
In the sequel a metric space X will always be equipped with its Borel σ-algebra B(X) (unless
otherwise stated).
Proposition 4.4 (Continuous mappings are measurable). Let (X, d) and (Y, ρ) be metric spaces.
If f : X → Y is continuous, then f is measurable.26
Proof. By the continuity of f we find that for all open O ⊆ Y the inverse image is f −1 (O) open
in X and hence in B(X). Since the open sets of Y generate the Borel σ-algebra, the result follows
from Lemma 4.3 with F = {O ⊆ Y : O is open}. ,
The most frequent case we will encounter is when f : S → R and (S, A) is a measurable space.
Unless otherwise stated we consider the Borel σ-algebra B(R) on R. The following characterization
of measurability will be useful.
Proposition 4.5 (Real valued functions). Let (S, A) be a measurable space. For f : S → R the
following are equivalent:
(i) f is measurable.
(ii) For all r ∈ R, one has f −1 ((−∞, r]) ∈ A.
(iii) For all r ∈ R, one has f −1 ((−∞, r)) ∈ A.
24In Riemann integration of functions f : Rd → R the discretization is always done in the domain of the function.
This is one of the major differences with Lebesgue integration
25Recall that f −1 (B) is called the inverse image of B by f and is defined by f −1 (B) = {s ∈ S : f (s) ∈ B}
26The same result holds for topological spaces and the proof is the same. In the setting of Borel-σ-algebras,
measurable functions are often called Borel measurable.
14 MEASURE AND INTEGRATION
27We do not define ∞ − ∞, so some cases need to be excluded. For the proof one additionally needs to check
in each of the cases that inverse images of {∞}, {−∞}, {0} are measurable. We leave this to the reader.
MEASURE AND INTEGRATION 15
The next result shows that measurability is preserved under taking countable suprema, count-
able infimum, limits of sequences, etc. For a sequence of numbers (xn )n≥1 in R, let
lim sup xn = lim sup xn and lim inf xn = lim inf xn .
n→∞ k→∞ n≥k n→∞ k→∞ n≥k
Similar formulas hold for the lim inf n→∞ xn . Recall that the lim supn→∞ xn and lim inf n→∞ xn
always exist. Moreover they both coincide with limn→∞ xn if and only if (xn )n≥1 converges in R.
Theorem 4.9. Let (S, A) be a measurable space. For each n ∈ N let fn : S → R be a measurable
function. Then each of the following functions is measurable as well:29
sup fn , inf fn , lim sup fn , lim inf fn .
n≥1 n≥1 n→∞ n→∞
30
Moreover, if fn → f pointwise , then f is measurable again.
Proof. Let g = supn≥1 fn . Then for each r ∈ R,
∞
\
g −1 ([−∞, r]) = {s ∈ S : g(s) ≤ r} = {s ∈ S : fn (s) ≤ r for all n ∈ N} = fn−1 ([−∞, r]) ∈ A.
n=1
Since σ({[−∞, r] : r ∈ R}) = B(R), the measurability of g follows from Lemma 4.3. The case of
infima follows from inf n≥1 fn = − supn≥1 (−fn ).
By (4.1) we can write lim supn→∞ fn = inf k≥1 supn≥k fk . Therefore, the measurability follows
from the previous cases. The remaining cases follow from lim inf n→∞ fn = − lim supn→∞ (−fn )
and limn→∞ fn = lim supn→∞ fn . ,
Definition 4.10. A function f : S → R is called a simple function31 if f is measurable and
takes only finitely many values.
Letting x1 , . . . , xn ∈ R denote the distinct values of f and Aj = {s ∈ S : f (s) = xj }, we can
always represent a simple function as
X n
f= xj · 1Aj .
j=1
Of course if xj = 0 for some j ∈ {1, . . . , n}, we could leave it out from the sum.
Example 4.11. Let S = R and A = B(R). The following functions are simple functions:
(a) f = π · 1(0,1) − 4 · 1(13,14] + 5 · 1Z∩(−∞,0) .
(b) f = 1Q .
Next we show that measurable functions can be written as limits of simple functions. For this
we discretize in the range space in a suitable way. The result plays a crucial role in the integration
theory in Section 5.
Theorem 4.12. Let (S, A) be a measurable space. 32
(i) Let f : S → [0, ∞] be measurable. Then there exists a sequence of simple functions (fn )n≥1
such that 0 ≤ f1 (s) ≤ f2 (s) ≤ . . . and limn→∞ fn (s) = f (s) for each s ∈ S.
(ii) Let f : S → R be measurable. Then there exists a sequence of simple functions (fn )n≥1 such
that limn→∞ fn (s) = f (s) for all s ∈ S.
33
Proof. (i): For each n ∈ N and j ∈ {0, 1, . . . , 4n − 1}, let
An,j = {s ∈ S : j2−n ≤ f (s) < (j + 1)2−n } and An = {s ∈ S : f (s) ≥ 2n }.
Then for each n, j one has An,j , An ∈ A. Define34
n
4X −1
n j
fn = 2 1An + 1A .
j=0
2n n,j
It is clear that each fn takes finitely many values. Moreover, by Exercise 4.5 and Theorem 4.7,
each fn is measurable. Thus each fn is a simple function.
Now fix s ∈ S. We first prove 0 ≤ fn (s) ≤ fn+1 (s) for each n ∈ N. First assume f (s) < 2n .
Then selecting the unique j ∈ {0, . . . , 4n − 1} such that s ∈ An,j we find that fn (s) = j2−n .
Similarly, we can select k ∈ {0, . . . , 4n+1 − 1} such that s ∈ An+1,k and we find that fn+1 (s) =
k2−(n+1) . Since, f (s) ≥ j2−n = 2j2−(n+1) , we find that k ≥ 2j and therefore
fn (s) = j2−n ≤ k2−(n+1) = fn+1 (s).
The case f (s) ≥ 2n can be treated similarly and is left to the reader.
To prove that fn (s) → f (s), first assume f (s) < ∞. Let ε > 0. Choose N ∈ N so large that
f (s) < 2N and 2−N < ε. Let n ≥ N . Selecting j ∈ {0, . . . , 4n − 1} such that s ∈ An,j we find that
|f (s) − fn (s)| ≤ 2−n ≤ 2−N < ε.
Therefore, fn (s) → f (s) in this case. If f (s) = ∞, then fn (s) = 2n for every n ∈ N and thus
fn (s) → ∞ = f (s).
(ii): Write f = f + − f − . Then by (i) we can find simple functions fn,+ , fn,− : S → R such
that fn,+ → f + and fn,− → f − . Let fn = fn,+ − fn,− for n ∈ N. Define the sets A+ and A− by
A± = {s ∈ S : ±f (s) ∈ [0, ∞]}. Then A+ ∪ A− = S. If s ∈ A+ , then
fn (s) = fn,+ (s) → f + (s) = f (s).
The case s ∈ A− is similar.
,
Exercises
Exercise 4.1. Let (Sj , Aj ) for j = 1, 2, 3 be measurable spaces. Assume f : S1 → S2 and
g : S2 → S3 are both measurable. Show that the composition g ◦ f : S1 → S3 is measurable.
Exercise 4.2. Let (S, A) be a measurable space. Let f, g : S → R be measurable functions. Show
that {s ∈ S : f (s) = g(s)} ∈ A and {s ∈ S : f (s) < g(s)} ∈ A.
Exercise 4.3. Let (S, A) be a measurable space. Let f : S → R be a measurable function and
p ∈ (0, ∞). Show that the function |f |p is measurable.
Exercise 4.4. Let S be a set. For a function f : S → R let Af = {f −1 (B) : B ∈ B(R)}. Show
that Af is a σ-algebra.35
Exercise 4.5. Let (S, A) be a measurable space and let A ⊆ S. Show that A ∈ A if and only if
the function 1A : S → R is measurable.
Exercise 4.6. Let (S, A) be a measurable space. Let f1 , f2 , P. . . : S → R be measurable functions
∞
and A1 , A2 , . . . ∈ A be disjoint. Show that the function f = n=1 1An fn is measurable.
Exercise∗ 4.7 (Vector-valued functions). Let (S, A) be a measurable space. For each j ∈ {1, . . . , d}
let fj : S → R be a function and let f : S → Rd be given by f = (f1 , . . . , fd ). Prove that f is
measurable if and only if fj is measurable for each j ∈ {1, . . . , d}.
Hint: For the “if part” one can use the same technique as in Theorem 4.7.
33For the proof make a picture where you make a partition into intervals of length 2−n on the y-axis. For the
picture put the set S is on the x-axis
34The idea is that for each n ∈ N we approximate f up to 2−n on the set {f < 2n }.
35The σ-algebra A is called the σ-algebra generated by f . It is the smallest σ-algebra for which f is measurable.
f
MEASURE AND INTEGRATION 17
Proof. (i). This is immediate from the definition and the identity 1E∩A = 1E · 1A .
For the proof of the remaining assertions we continue with a preliminary observation. Write
Pm m Pn
f = i=1 xi · 1Ai with (Ai )m
S
i=1 disjoint sets in A with Ai = S, and g = j=1 yj · 1Bj with
i=1
36In probability theory this is usually called almost surely and this is abbreviated as a.s.
37Here we use the convention 0 · ∞ = 0
38See Definition 5.1
MEASURE AND INTEGRATION 19
m
(Bj )nj=1 disjoint sets in A with Bi = S. Let Ci,j = Ai ∩ Bj for all i and j. Then (Ci,j )m,n
S
i,j=1 are
i=1
disjoint sets in A. By additivity
Z Xm m X
X n
(5.2) f dµ = xi · µ(E ∩ Ai ) = xi · µ(E ∩ Ci,j )
E i=1 i=1 j=1
Z n
X Xn X m
(5.3) g dµ = yj · µ(E ∩ Bj ) = yj · µ(E ∩ Ci,j ).
E j=1 j=1 i=1
Therefore, xi ≤ yj whenever i, j satisfy E ∩ Ci,j 6= ∅. This together with (5.2), (5.3), yields (ii).
(iii): This follows from µ(E ∩ Ai ) ≤ µ(F ∩ Ai ) whichPis immediate from the monotonicity of µ.
m Pn
(iv): By (5.4) with E = S, we can write αf + βg = i=1 j=1 (αxi + βyj ) · 1Ci,j . Therefore,
Z Xm Xn
αf + βg dµ = (αxi + βyj )µ(E ∩ Ci,j )
E i=1 j=1
Xm X n X m
n X Z Z
=α xi µ(E ∩ Ci,j ) + β yj µ(E ∩ Ci,j ) = α f dµ + β g dµ,
i=1 j=1 j=1 i=1 E E
(vi): By leaving out some of those k for which S xk = 0, we canR assume xiP > 0 for all i. Let
n m
A = {s ∈ S : f (s) > 0} and observe that A = i=1 Ai . Now S f dµ = i=1 xi µ(Ai ) with
µ(Ai ) ≥P0 for each i. Therefore, if the integral is zero, then µ(Ai ) = 0 for each i and hence
n
µ(A) = i=1 µ(Ai ) = 0. Conversely, if f = 0 a.e., monotonicity yields µ(Ai ) ≤ µ(A) = 0, and the
result follows. ,
Example 5.5. Let λ be the Lebesgue measure on R. Since Q ∈ B(R) it Rfollows from Exercises 1.5
and 4.5 that 1Q is measurable and by Exercise 3.2 for any E ∈ B(R), E 1Q dλ = λ(E ∩ Q) = 0.
Note that 1Q is not Riemann integrable on any interval [a, b] with a < b. More on the Riemann
integral can be found in Example 5.16.
5.2. Integral for positive measurable functions. In order to extend the definition of the
integral to arbitrary measurable functions f : S → [0, ∞] we use the following lemma. In the
sequel, we write fn ↑ f if for all s ∈ S, (fn (s))n≥1 is increasing and fn (s) → f (s).
Lemma 5.6 (Consistency). Let f : S → [0, ∞] be a measurable function. Suppose that (fn )n≥1
Z such that 0 Z≤ fn ↑ f . Suppose g : S → [0, ∞] is a simple function
is sequence of simple functions
such that 0 ≤ g ≤ f . Then g dµ ≤ lim fn dµ for every E ∈ A.
E n→∞ E
Z
Observe that lim fn dµ exists in [0, ∞], because it is an increasing sequence of real numbers.
n→∞ E
Proof. Let ε ∈ (0, 1) and set En = {s ∈ E : (1 − ε)g(s) ≤ fn (s)} for n ∈ N. Then using the
indicated parts of Lemma 5.4 in the estimates below, we find
(ii) (iii) (ii)
Z Z Z Z Z
(iv)
(1 − ε) g dµ = (1 − ε)g dµ ≤ fn dµ ≤ fn dµ ≤ lim fn dµ.
En En En E n→∞ E
20 MEASURE AND INTEGRATION
Pm
It remains to prove En g dµ → E g dµ. For this write g = i=1 xi · 1Ai with (Ai )m
R R
i=1 in A disjoint
and x1 , . . . , xm ≥ 0. Since En ∩ Ai ↑ E ∩ Ai as n → ∞, Theorem 2.10 yields
Z Xm m
X Z
(5.5) g dµ = xi · µ(En ∩ Ai ) → xi · µ(E ∩ Ai ) = g dµ.
En i=1 i=1 E
,
Definition 5.7 (Integral for positive functions). For a measurable function f : S → [0, ∞] and a
sequence of simple functions (fn )n≥1 with 0 ≤ fn ↑ f we define the integral of f over E ∈ A as
Z Z
f dµ = lim fn dµ in [0, ∞].
E n→∞ E
R
The above limit exists in [0, ∞] since the numbers an := E fn dµ form an increasing sequence
(an )n≥1 in [0, ∞]. Note that by Theorem 4.12 we can always find simple functions fn : S → [0, ∞)
such that 0 ≤ fn ↑ f . However, we need to check that the above definition does not depend on the
choice of the sequence (fRn )n≥1 . Let (gm )m≥1 be another sequence of simple functions such that
0 ≤ gm ↑ f and let bm = E gm dµ. It suffices to show that limn→∞ an = limm→∞ bm . By Lemma
5.6 for each m ≥ 1, Z Z
bm = gm dµ ≤ lim fn dµ = lim an .
E n→∞ E n→∞
From this we obtain limm→∞ bm ≤ limn→∞ an . Reversing the roles of gm and fn , one sees that
the converse holds as well.
Next we can extend the properties of the integral of Lemma 5.4.
Proposition 5.8. Let f, g : S → [0, ∞] be measurable functions. Then the following hold:
R R
(i) For all E ∈ A, E f dµ = S 1E f dµ. R R
(ii) (monotonicity I) If E ∈ A and f ≤ g on E, then ER f dµ ≤ ER g dµ.
(iii) (monotonicity II) If E, F ∈ A satisfy E ⊆ F R, then E f dµ ≤ FRf dµ. R
(iv) (linearity) For all E ∈ A and α, β ∈ [0, ∞), RE αf + βg dµ = R α E f dµR+ β E g dµ.
(v) (additivity) For all disjoint sets E1 , E2 ∈ A, E1 ∪E2 f dµ = E1 f dµ + E2 f dµ.
R
(vi) S f dµ = 0 if and only if f = 0 almost everywhere.
Proof. (i): Let (fn )n≥1 be simple functions such that 0 ≤ fn ↑ f . Then by Lemma 5.4 (i),
Z Z Z Z
f dµ = lim fn dµ = lim 1E fn dµ = 1E f dµ.
E n→∞ E n→∞ S S
(ii)–(v): See Exercise 5.2.
(vi): Assume f = 0 a.e. Choose a sequence of simple functions (fn )n≥1 such that 0 ≤ fn ↑ f .
Then fn = 0 a.e. for each n ∈ N and thus by Lemma 5.4 (vi),
Z Z
f dµ = lim fn dµ = 0.
S n→∞ S
For the converse we use contraposition. Assume one does not have f = 0 a.e. Then E = {s ∈
S : f (s) > 0} satisfies µ(E) > 0. Letting En = {s ∈ S : f (s) ≥ n1 } we find En ↑ E, so by Theorem
2.10, µ(En ) → µ(E). Therefore, there exists an n ∈ N such that µ(En ) > 0 and thus
(iii) (ii)
Z Z Z
1 1
f dµ ≥ f dµ ≥ n dµ = n µ(En ) > 0.
S En En
,
Example 5.9 (Series are integrals). Let S = N and A = P(N). Let τ : P(N) → [0, ∞] denote the
counting measure.PLet f : N → [0, ∞] be arbitrary. Then f is clearly measurable. Now define for
n
each n ≥ 1, fn = j=1 f (j)1{j} . Since each fn is a simple function and 0 ≤ fn ↑ f we find
Z Z n
X ∞
X
f dτ = lim fn dτ = lim f (j) = f (j).
N n→∞ N n→∞
j=1 j=1
MEASURE AND INTEGRATION 21
5.3. Integral for measurable functions. The final step is to define the integral for measurable
functions f : S → R by using the splitting f = f + − f − . Recall that f + = max{f, 0} and
f − = max{−f, 0}. Note that |f | = f + + f − .
5.10. A measurable function f : S → R is called integrable when both S f + dµ and
R
Definition
R −
S
f dµ are finite. In this case the integral of f over E ∈ A is defined as
Z Z Z
f dµ = f + dµ − f − dµ.
E E E
Remark 5.11.
(1) If f : S → R is integrable,R then by monotonicity E f + dµ ∈ [0, ∞) and E f − dµ are finite for
R R
Proof. First observe that |f | = f + + f − . Therefore, both assertions are equivalent to I + , I − < ∞,
where I ± = S f ± dµ and hence the result follows. To prove the required estimate note that by
R
the triangle inequality and linearity of the integral for positive functions:
Z Z Z
f dµ = |I + − I − | ≤ I + + I − = f + + f − dµ = |f | dµ.
E E E
,
By considering the positive and negative part separately one can check Proposition 5.8 (i),(ii),
(v) hold for all integrable functions f, g : S → R again (see Exercise 5.4). The following example
shows that Parts (iii) and (vi) do not extend to this setting.
Example 5.13. Let S = R and λ the Lebesgue measure. Let f = 1(0,1] − 1(1,2] . Then f + = 1(0,1]
and f − = 1(1,2] and
Z Z
f dλ = f + dλ = λ((0, 1]) = 1,
(0,1] (0,1]
Z Z Z
f dλ = f + dλ − f − dλ = λ((0, 1]) − λ((1, 2]) = 1 − 1 = 0.
(0,2] (0,2] (0,2]
The extension of the linearity (iv) is more difficult and proved below.
Proposition 5.14. Let f, g : S → R be integrable functions and α, β ∈ R. Then39 αf + βg is
integrable, and for all E ∈ A
Z Z Z
(linearity) αf + βg dµ = α f dµ + β g dµ.
E E E
Proof. Note that by Proposition 5.12 each of the following functions is integrable f ± , g ± , |f |, |g|.
Therefore, |α| |f | + |β| |g| is integrable by Proposition 5.8 (iv). Since |αf + βg| ≤ |α| |f | + |β| |g|,
the function αf + βg is integrable as well (see Exercise 5.7).
It remains to prove the identity for each E ∈ A. To do this we first consider the case α = β = 1.
Write f + g = φ − ψ, where φ = f + + g + and ψ = (f − + g − ). Then by Proposition 5.8 (iv) φ and
ψ are integrable and Exercise 5.3 yields
Z Z Z
f + g dµ = φ dµ − ψ dµ
E
(5.6) ZE E
Z Z Z Z Z
= f + dµ + g + dµ − f − dµ − g − dµ = f dµ + g dµ.
E E E E E E
It remains to show that for all α ∈ R,
Z Z
αf dµ = α f dµ.
E E
If α ≥ 0, then αf = (αf )+ − (αf )− = αf + − αf − . Proposition 5.8 (iv) yields
Z Z Z Z Z
−
αf dµ = +
αf dµ − αf dµ = α +
f dµ − α f − dµ.
E E E E E
In case α < 0, then by (5.6) and the previous case, we find
Z Z Z Z Z
0= αf + (−α)f dµ = αf dµ + (−α)f dµ = αf dµ + (−α) f dµ
E E E E E
and the result follows by subtracting the second term on both sides. ,
Pn
Example 5.15. Every simple function f : S → R given by f = j=1 xj 1Aj with A1 , . . . , An ∈ A
and µ(Aj ) < ∞ for all j, is integrable and by Proposition 5.14,
Z Xn Z Xn
f dµ = xj 1Aj dµ = xj µ(E ∩ Aj ).
E j=1 E j=1
In the next example we show that for continuous functions the integral with respect to the
Lebesgue measure coincides with the Riemann integral.40
Example 5.16. Let a < b be real numbers. Let λ be the restriction of the Lebesgue measure to
B([a, b]) and let f : [a, b] → R be continuous. Note that f is measurable by Proposition 4.4.
Moreover, since there is a constant M ≥ 0 such that |f | ≤ M , we can use Exercise 5.7 to deduce
that f is (Lebesgue) integrable. Since f is continuous, it is Riemann integrable. Below we show
that L(f ) = R(f ), where we used the abbreviations
Z Z b
L(f ) := f dλ (Lebesgue integral) and R(f ) := f (x) dx (Riemann integral).
[a,b] a
To prove this identity let ε > 0. Since f is uniformly continuous we can find a δ > 0 such that
ε
for all x, y ∈ [a, b], |x − y| < δ implies |f (x) − f (y)| < b−a . Let n ∈ N be such that b−a
n < δ. Let
b−a
xj = a + j n for j = 0, . . . , n. LetPmj = min{f (x) : x ∈ [xj−1 , xj ]} and Mj P = max{f (x) : x ∈
n n
[xj−1 , xj ]}. Let g(x) = 1{a} f (a) + j=1 1(xj−1 ,xj ] mj and G(x) = 1{a} f (a) + j=1 1(xj−1 ,xj ] Mj .
Then g and G are integrable in the sense of Riemann and Lebesgue and one can check
n
X Xn
L(g) = R(g) = (xj − xj−1 )mj =: α and L(G) = R(G) = (xj − xj−1 )Mj =: β.
j=1 j=1
Example 5.17. Let S = R, A = P(R). Let x ∈ R and let δx be the Dirac measure from Example
2.5. Then for any f : R → R. Z
f dδx = f (x).
R
Indeed, for simple functions f this is obvious. For f : S → [0, ∞] this follows by approximation
from below by simple functions. For general f : S → [0, ∞] this follows by writing f = f + − f − .
Example 5.18. 41 Consider the setting
P∞ of the counting measure of Example 5.9. A function f :
N → R is integrable if and only if j=1 |f (j)| < ∞. In that case
Z X∞
f dτ = f (j).
N j=1
Finally we briefly indicate how one can extend the integral to complex functions f : S → C.
On the complex numbers C we consider its Borel σ-algebra. If f : S → C is measurable, then we
write f = u + iv with u, v : S → R and u and v are measurable (see Exercise 5.11).
Definition 5.19. A measurable function f : S → C given by f = u + iv with u, v : S → R is
called integrable if both u and v are integrable. In this case let
Z Z Z
f dµ = u dµ + i v dµ, E ∈ A.
E E E
Exercise 5.11 yields that Propositions 5.12 and 5.14 extend to the complex setting.
Exercise 5.6. Let λ denote the Lebesgue measure on (R, B(R)). Let f : R → R be an integrable
function. Show that for all −∞ < a < b < ∞
Z Z
f dλ = f dλ.
[a,b] (a,b)
Proof. Note that f = lim inf n→∞ fn is a measurable function by Theorem 4.9. Fix n ∈ N and let
gn = inf fk . For all m ≥ n, we have gn ≤ fm and by the monotonicity of the integral this gives
R k≥n R
S
gn dµ ≤ S fm dµ. Therefore, taking the infimum over all m ≥ n, we find
Z Z
(6.1) gn dµ ≤ inf fm dµ.
S m≥n S
42This result is due to Beppo Levi 1875–1961 who was an Italian mathematician.
43This result is named after the French mathematician Pierre Fatou 1878–1929.
44This is due to Lebesgue (see footnote 18)
26 MEASURE AND INTEGRATION
where for the second part we used lim inf (−xn ) = − lim sup xn . ,
n→∞ n→∞
For functions f, f1 , f2 : S → R we say that fn → f a.e. if there exists a set A ∈ A such that
µ(A) = 0 and for all s ∈ S \ A, fn (s) → f (s). For the details of the following remark we refer to
Exercise 6.5.
Remark 6.6. Let f, f1 , f2 , . . . : S → R be measurable functions.
(1) Theorem 6.3 also holds under the weaker assumption that 0 ≤ fn ↑ f a.e.
(2) Theorem 6.5 also holds under the weaker assumptions that fn → f a.e. and |fn | ≤ g a.e.
Example 6.7. Let f :R R → R be integrable with respect to the Lebesgue measure λ. Let F : R → R
be given by F (x) = (−∞,x] f dλ. Then F is continuous on R. Indeed, if xn < x and xn → x, then
Z Z
(DCT)
F (xn ) = 1(−∞,xn ] f dλ −→ 1(−∞,x) f dλ = F (x),
R R
where the last step follows from Exercise 5.5 and the fact that 1(−∞,x) f = 1(−∞,x] f almost
everywhere. Similarly, one checks that F (xn ) → F (x) if xn ≥ x and xn → x.
6.2. Consequences and applications. We continue with several consequences and applications.
We start with a result on integration of series of positive functions.
Corollary 6.8 (Series and integrals). Let f1 , f2 , . . . : S → [0, ∞] be measurable functions. Then
Z X ∞ X ∞ Z
(6.3) fn dµ = fn dµ.
S n=1 n=1 S
Step 2: First we
R show that (6.4) holds
R for all measurable g : S → [0, ∞]. For g = 1A this is
immediate from S 1A dν = ν(A) = S 1A f dµ. For simple functions g : S → [0, ∞) this follows
by linearity. For a measurable function g : S → [0, ∞], by Theorem 4.12 we can find a sequence
of simple functions (gn )n≥1 such that 0 ≤ gn ↑ g. By the previous case, we obtain
Z Z Z Z
(MCT) (MCT)
g dν = lim gn dν = lim f gn dµ = f g dµ.
S n→∞ S n→∞ S S
Step 3: To prove the “if and only if” assertion and (6.4), let g : S → R be a measurable
function. Since step 2 yields that
Z Z
±
g dν = f g ± dµ,
S S
Proof. (i)⇒(ii): First we make a general observation. Let g : R → R be a continuous and integrable
Rt R R
function. In Example 5.16 we have seen that 0 g(x) dx = R 1[0,t] g dλ. Setting L1 = R 1[0,∞] g dλ,
we find
Z t Z Z Z
L1 − g(x) dx = L1 − 1[0,t] g dλ = 1[t,∞) g dλ ≤ 1[t,∞) |g| dλ.
0 R R R
R R∞
Now the DCT yields that R 1[t,∞) |g| dλ → 0 as t → ∞, and we may conclude that 0 g(x) dx =
L1 exists. The part on (−∞, 0] goes similarly and we find (6.5) with f replaced by g.
Now (i) ⇒ (ii) follows by letting g = |f |. Moreover, (6.5) for f follows by taking g = f .
(ii)⇒ (i): By Example 5.16, monotonicity of improper Riemann integrals,
Z Z n Z ∞
1[−n,n] |f | dλ = |f (x)| dx ≤ |f (x)| dx =: M < ∞.
R −n −∞
R R
Therefore, by the MCT, R
|f | dλ = limn→∞
1
R [−n,n]
|f | dλ ≤ M , and hence f is integrable. ,
n
Example 6.13. Let fn : [0, ∞) → R be defined by fn (x) = 1 + nx e−2x . Below we show that
Rn n
limn→∞ 0 fn (x) dx = 1. Recall the standard limit 0 ≤ 1 + nx ↑ ex for every x ∈ [0, ∞) and
−x
thus 0 ≤ 1[0,n] fn ↑ f , where f (x) = e . Therefore, by Example 5.16
Z n Z Z Z ∞
(MCT) (6.5)
fn (x) dx = 1[0,n] fn dλ −→ f dλ = f (x) dx = lim (1 − e−t ) = 1.
0 [0,∞) [0,∞) 0 t→∞
Proof. Fix y0 ∈ R and let δ and g be as in (i). Let hn ∈ (0, δ) for n ≥ 1 be such that hn → 0. Let
φn : (y0 − δ, y0 + δ) × S → R and F : (y0 − δ, y0 + δ) → R be given by
f (y + hn , s) − f (y, s)
Z
φn (y, s) = and F (y) = f (y, s) dµ(s).
hn S
∂f
Then φn (y, s) → ∂y (y, s) for each y ∈ R and s ∈ S. Therefore, Theorem 4.12 yields that
s 7→ φn (y, s) is measurable. From the mean value theorem46 we obtain φn (y0 , s) = ∂f ∂y (yn , s) for
some yn ∈ (y0 , y0 + δ), and hence |φn (y0 , s)| ≤ g(s) for all s ∈ S. It follows that
F (y0 + hn ) − F (y0 )
Z Z
(DCT) ∂f
F 0 (y0 ) = lim = lim φn (y0 , s) dµ(s) = (y0 , s) dµ(s).
n→∞ hn n→∞ S S ∂y
,
Exercises
Z
Exercise 6.1. Use convergence theorems to find lim fn dλ in each of the following cases:
n→∞ R
n
(a) fn (x) = 1[4,32] (x) 1 + log(4x)
n (use MCT; answer is 2016).
46See [11, Theorem 6.2.3]: if g : [a, b] → R is continuous on [a, b] and differentiable on (a, b), then there exists a
g(b)−g(a)
point c ∈ (a, b) such that g 0 (c) = b−a
.
MEASURE AND INTEGRATION 29
sin(nx)
(b) fn (x) = 1(1,∞) (x) (use DCT; answer is 0).
nx2
nxn sin(nx) − n √
(c) fn (x) = 1[0,1] (x) √ (use DCT; answer is − 12 2).
x + 2n 2
7. Lp -spaces
In this section (S, A, µ) is measure space. In this section we want to allow the scalar field to be
complex as well and we use the notation K for this. So K = R or K = C.
Definition 7.1. For p ∈ [1, ∞) let
n Z o
Lp (S) = f : S → K : f is measurable and |f |p dµ < ∞ .
S
Another famous inequality which can be proved with the same method is the following.
1 1
Proposition 7.4 (Hölder’s inequality50). Let p, q ∈ (1, ∞) satisfy51 p + q = 1. If f ∈ Lp (S) and
g ∈ Lq (S), then f g ∈ L1 (S) and
kf gk1 ≤ kf kp kgkq .
Proof. See Exercise 7.2. ,
48More precisely, one can build an equivalent relation f ∼ g if f = g almost everywhere and then consider a
quotient space. We will use the above imprecise but more intuitive definition.
49Hermann Minkowski (1864-1909) was a German mathematician who worked in geometry. He also was Albert
Einstein’s teacher and provided the 4-dimensional mathematical framework for part of Einstein’s relativity theory.
50Otto Hölder (1859-1937) is most famous for this result and for the notion of Hölder continuity of a function.
51These exponents are called conjugate exponents. If p = 2, then q = 2 and in this case the inequality is known
as the Cauchy-Schwarz inequality.
MEASURE AND INTEGRATION 31
7.2. Completeness of Lp .
Theorem 7.5 (Riesz–Fischer52). Let p ∈ [1, ∞). Then (Lp (S), k · kp ) is a Banach space.53
Proof. Let (fk )∞ p
k=1 be a Cauchy sequence with respect to the norm k · kp of L (S). By a standard
∞
argument it suffices to show that (fk )k=1 has a subsequence which is convergent in Lp (S).
Recursively, we can find a subsequence (fkn )n≥1 such that
1
(7.2) kfkn+1 − fkn kp ≤ n , n = 1, 2, . . .
2
For notational convenience let φn = fkn for n ∈ N. Also let φ0 = 0. We will show that there exists
an f ∈ Lp (S) such that kφn − f kp → 0.
Define g, g1 , g2 , . . . : S → [0, ∞] by
∞
X m−1
X
g := |φn+1 − φn | and gm := |φn+1 − φn |, ,
n=0 n=0
By Minkowski’s inequality we obtain for each m ≥ 1,
m−1
X ∞
X (7.2) X
kgm kp ≤ kφn+1 − φn kp = kφn+1 − φn kp ≤ kφ1 kp + 2−n ≤ kφ1 kp + 1.
n=0 n=0 n≥1
54Note that the conjugation is need otherwise the square of a complex number can be negative. Physicists often
put the conjugation on f (s) instead of g(s). If K = R, then the conjugation does not play a role.
55Recall that a Hilbert space is a Banach space where kxk = hx, xi 21 . David Hilbert (1862–1943) is sometimes
said to be the last universal mathematician (which means he knew “all” mathematics of his time). He was one of
the most influential mathematicians of the 19th and early 20th centuries.
56Sometimes we wish to use the counting measure on Z instead of N. In this case we write `p (Z) := Lp (Z).
MEASURE AND INTEGRATION 33
1 2 3 4 5
Theorem 7.11. Let λ be the Lebesgue measure on I = (a, b) with −∞ ≤ a < b ≤ ∞ and let
p ∈ [1, ∞). Then Step(I) is dense in Lp (I).
Proof. Let f ∈ Lp (I) and let ε > 0. We will construct a step function φ such that kf − φkp < 3ε.
Step 1: Reduction to bounded I. Let fn = 1(−n,n) f for n ∈ N. Then fn → f pointwise.
Moreover, |fn − f |p ≤ |f |p . Therefore, the DCT yields kfn − f kp → 0. Therefore, for n ∈ N large
enough kf − gkp < ε, where g = fn ∈ Lp (I).
Step 2: By Step 1 we can assume I is bounded, and moreover we can assume I = (a, b] with
−∞ < a < b < ∞. By Proposition Pn 7.10 there exists a simple function h : I → K such that
kg − hkp < ε. We can write h = j=1 1Aj xj with (Aj )nj=1 in B(I) disjoint.
Step 3: By Exercise 7.5 there exist Fj ∈ F(a,b] such that λ(Aj 4Fj ) < n(|xjε|+1) . Now let
Pn
φ = j=1 1Fj xj . Observe that |1Aj − 1Fj | = 1Aj 4Fj . Therefore, Minkowski’s inequality yields
that
n n n
X X X ε|xj |
kh − φkp ≤ |xj |k1Aj 4Fj kp = |xj |µ(Aj 4Fj ) ≤ < ε.
j=1 j=1 j=1
n(|xj | + 1)
Conclusion: Clearly, φ is a step function. Moreover, by Minkowski’s inequality we find
kf − φkp ≤ kf − gkp + kg − hkp + kh − φkp ≤ 3ε.
,
Corollary 7.12. Let λ be the Lebesgue measure on I = [a, b] with −∞ < a < b < ∞ and let
p ∈ [1, ∞). Then C([a, b]) is dense in Lp (I).
Proof. Let f ∈ Lp (I) and let ε > 0. By Theorem 7.11 there exists a step function φ such that
kf − φkp < ε. It remains find ψ ∈ C([a, b]) such that kφ − ψkp < ε. For this it suffices to
approximate an arbitrary 1J . Using Figure 7.2 and the DCT, the reader can easily convince him
or herself that this can indeed be done. ,
2 2 2
1 1 1
1 2 3 4 1 2 3 4 1 2 3 4
Exercises
34 MEASURE AND INTEGRATION
t ∈ (a, b]. We will derive that f = 0 in L1 (R). By considering real and imaginary part separately,
one can reduce to the case where f is real valued.
(a) Show that for all A ∈ F 1 with A ⊆ (a, b], A f dλ = 0.
R
(b) Let A ∈ B(R) be such that A ⊆ (a, b]. Construct sets A1 , A2 , . . . ∈ F 1 such that Ak ⊆ (−a, b]
and 1An → 1A a.e.
Hint: Use k1A − 1B k1 = λ(A4B) for all A, B ∈ B(R) and Exercise 7.5. Now apply Corollary
7.6 and the DCT. R
(c) Show that for all A ∈ B(R) with A ⊆ (a, b], A f dλ = 0.
Hint: Use (a) and (b).
(d) Derive that f = 0 in L1 (R).
Hint: Consider A = {x ∈ R : f (x) ≥ 0} and A = {x ∈ R : f (x) ≤ 0}.
57By Exercise 1.7 of the lecture notes we can take them disjoint
MEASURE AND INTEGRATION 35
Exercise∗∗ 7.7. Let λ be the Lebesgue measure on R. Observe that each n ∈ N can be uniquely
written as n = 2k + j with k ∈ N0 and j ∈ {0, . . . , 2k − 1}. Now for such n define fn =
1(j2−k ,(j+1)2−k ] . Show that fn → 0 in L1 (R), but for all x ∈ (0, 1], there exists infinitely many
n ∈ N such that fn (x) = 1.
Hint: First make a picture for k = 1 and j = 0, 1, and k = 2 and j = 0, 1, 2, 3.
Exercise∗∗ 7.8. Let 1 ≤ p ≤ q < ∞).
(a) Prove `p ⊆ `q and that for all (an )n≥1 ∈ `p one has k(an )n≥1 kq ≤ k(an )n≥1 kp .
Hint: By homogeneity one can assume k(an )n≥1 k`p = 1, and therefore |an | ≤ 1 for all n ∈ N.
(b) Let an = nα for n ∈ N. For which α ∈ R does one have (an )n≥1 ∈ `p ?
There is a natural limiting space of Lp (S) for p → ∞:
Exercise∗∗ 7.9. A measurable function f : S → K is said to be in L∞ (S) if there exists an M ≥ 0
such that µ({|f | > M }) = 0.58 Define
kf k∞ = inf{M ≥ 0 : µ({|f | > M }) = 0}.
As usual we identify functions f and g in L∞ (S) if f = g a.e.
(a) Show that (L∞ (S), k · k∞ ) is a Banach space.
(b) Show that for all f ∈ L∞ (S) and g ∈ L1 (S), f g ∈ L1 (S) and
kf gk1 ≤ kf k∞ kgk1 .
1
(c) Assume µ(S) < ∞ and p ∈ [1, ∞). Show that L∞ (S) ⊆ Lp (S) and kf kp ≤ µ(S) p kf k∞ .
(d) Assume S = N with the counting measure and p ∈ [1, ∞). Let `∞ := L∞ (N). Show that
`p ⊆ `∞ and k(an )n≥1 k∞ ≤ k(an )n≥1 kp .
(e) Assume I is a finite interval and µ = λ is the Lebesgue measure. Show that the simple
functions are dense in L∞ (I), but the step functions are not.
where x ∈ [0, 2π]. Of course one can also use x ∈ R here. Clearly, the above functions will be
periodic whenever they are well-defined.
One of the reasons that Fourier series naturally arise in mathematics is that each of the functions
d2
e±inx , cos(nx), sin(nx) as an eigenfunctions of dx 2 with eigenvalue −n .
2
Indeed, for instance
00 2
cos(nx) = −n cos(nx).
We have seen that Taylor series can be used to represent functions which are smooth enough.61
Fourier series provides another tool to represent functions. The class of functions which can be
represented as a Fourier series will turn out to be enormous.
In this section we will prove a couple of central results in the theory of Fourier series. The
interested reader can read more on the subject in [9], [10], [13], [14] and [19]. In particular, very
interesting but mostly elementary applications to geometry, ergodicity, number theory and PDEs
can be found in [13].
8.1. Fourier coefficients. In this section S = [0, 2π] and λ is the Lebesgue measure on (0, 2π).
For notational convenience we will write
Z b Z
f (x) dx := f dλ.
a [a,b]
Example 8.3. Given f ∈ L1 (0, 2π), the function sn (f ) is a trigonometric polynomial for each
n ∈ N. Each of the functions
einx + e−inx einx − e−inx
cos(nx) = , sin(nx) =
2 2i
is a trigonometric polynomial as well.
Finally note that if P is a trigonometric polynomial, then for any j ∈ N0 , P j is a trigonometric
polynomial as well.
The main questions in this section is whether we can reconstruct f from its Fourier coefficients.
More precisely:
P
• (representation) Which functions f : [0, 2π] → C can we write as f = k∈Z ck ek for
certain coefficients (ck )k∈Z ?
• (convergence) In what sense does the above series converge?
• (uniqueness) Does fb(k) = gb(k) for all k ∈ Z imply f = g ?
When considering convergence of Fourier series we will always consider the convergence of
X n
X
ck e k = ck ek as n → ∞.
|k|≤n k=−n
Theorem 8.4 (Weierstrass’ approximation theorem64 for periodic functions). The trigonometric
polynomials are dense in {f ∈ C([0, 2π]) : f (0) = f (2π)}.
Proof. Let f ∈ C([0, 2π]) be such that f (0) = f (2π) and let ε > 0 be arbitrary. It suffices to show
that there exists a trigonometric polynomial P such that kf − P k∞ < ε. We extend f periodically
to a function f : R → C. Since f is also uniformly continuous we can choose δ ∈ (0, π) such that
|x − y| < δ implies |f
P(x) −Pf (y)| < ε/2.
n
Define65 Fn = n1 k=1 |j|≤k−1 ej which is a periodic function as well. Define Pn : [0, 2π] → C
R 2π
by Pn (x) = 0 Fn (x − y)f (y) dy. Then since ej (x − y) = e2πix e−2πiy the following identity holds
2π n Z 2π n
1 1 X X 1X X
Z
Pn (x) = Fn (x−y)f (y) dz = ej (x) ej (−y)f (y) dy = ej (x)fb(j).
2π 0 2πn 0 n
k=1 |j|≤k−1 k=1 |j|≤k−1
sin2 (nz/2)
(8.3) Fn (z) = , z ∈ (0, 2π).
n sin2 (z/2)
64Originally Weierstrass proved that the polynomials are dense in C([a, b]). This can be derived from our version
of the theorem as indicated in Exercise 8.11
65This is called Féjer’s kernel
38 MEASURE AND INTEGRATION
where we substituted z := x − y and where Bδ = [δ, 2π − δ]. Therefore, using (8.3) again and the
fact that | sin(z/2)| ≥ sin(δ/2) for z ∈ Bδ (recall that δ ≤ π) we obtain
|Bδ| 2π
Z
Fn (z) dz ≤ 2 ≤ .
Bδ n sin (δ/2) n sin2 (δ/2)
2kf k∞ ε
So choosing n ≥ 1 so large that n sin2 (δ/2)
< 2 we obtain T2 < πε.
T1 +T2
Therefore, combining the the estimates can conclude that |f (x) − Pn (x)| ≤ 2π < ε. Since
x ∈ [0, 2π] was arbitrary it follows that kf − Pn k∞ < ε as required. ,
Now we can deal with the uniqueness question for Fourier series. This is the most technical
part of this section and could be skipped it at first reading.
Theorem 8.5 (Uniqueness). If f ∈ L1 (0, 2π) satisfies fb(n) = 0 for all n ∈ Z, then f = 0 in
L1 (0, 2π).
Proof. 66 Step 1: First assume f ∈ C([0, 2π]) and f (0) = f (2π). By linearity it follows that for
each trigonometric polynomial P we have
Z 2π
f (x)P (x) dx = 0.
0
By Theorem 8.4 we can find trigonometric polynomials (Pn ) such that Pn → f uniformly. There-
fore, it follows that
Z 2π Z 2π
2
|f (x)| dx = lim f (x)Pn (x) dx = 0.
0 n→∞ 0
This implies f = 0.
Step 2: Next let f ∈ L1 (0, 2π) and assume fb(n) = 0 for all n ∈ Z. Let F : [0, 2π] → R be
defined by
Z t
F (t) = f (x) dx.
0
Then F ∈ C([0, 2π]) (see Example 6.7), and F (0) = F (2π) = 0.67 By Exercise 8.6 (b) Fb(k) =
fb(k)
ik = 0 for all k 6= 0. Now let g = F − C, where C = F (0). Then g ∈ C([0, 2π]), g(0) = g(2π)
b
and gb(k) = 0 for all k ∈ Z. Therefore, g = 0 by step 1 and hence F = C. Since F (0) = 0, this
yields F (x) = 0 for all x ∈ [0, 2π]. By Exercise 7.6 we find f = 0.
,
66There is a much better proof in the literature using the Féjer kernel as an approximate identity. Since
approximate identities are not part of these lecture notes, we proceed differently.
67Note that F (2π) = (2π)fb(0) = 0.
MEASURE AND INTEGRATION 39
8.3. Fourier series in L2 (0, 2π). In this section we will consider Fourier series in the Hilbert
space L2 (0, 2π). Here the inner product is given by
Z 2π
hf, gi = f (x)g(x) dx.
0
Note that L2 (0, 2π) ⊆ L1 (0, 2π) (see Exercise 7.3). Therefore, if f ∈ L2 (0, 2π) the Fourier coeffi-
cients are well-defined and fb(k) = (2π)−1 hf, ek i.
Let us recall as special case of Proposition 7.3 for f, g ∈ L2 (0, 2π),
(8.4) |hf, gi| ≤ kf k2 kgk2 (Cauchy–Schwarz inequality).
2
We will say that f, g ∈ L (0, 2π) are orthogonal if hf, gi = 0. Note that in this case the following
form of Pythagoras theorem holds68
(8.5) kf + gk22 = kf k22 + kgk22 .
Lemma 8.6 (Orthogonality). For j, k ∈ Z,
2π, if j = k;
hej , ek i =
0, if j 6= k.
Consequently, if finitely many (cj )j∈Z in C are nonzero, then
X 1
X 12
(8.6) cj ej = (2π) 2 |cj |2 .
2
j∈Z j∈Z
,
We extend this result to series using the completeness of L2 (0, 2π).
Theorem 8.7 (Riesz–Fischer, Convergence of Fourier series in L2 ).
X
(i) If (cn )n∈Z ∈ `2 , then g := cn en converges in L2 (0, 2π), and gb(n) = cn for all n ∈ Z, and
n∈Z
1
(8.7) kgk2 = (2π) 2 k(cn )n∈Z k`2 (Parseval’s identity)
X
(ii) If f ∈ L2 (0, 2π), then (fb(n))n∈Z in `2 and f = fb(n)en in L2 (0, 2π) and (8.7) holds with
n∈Z
g = f and cn = fb(n) for n ∈ Z.
Part (ii) shows that every L2 -function can be represented as a Fourier series. A similar result
holds for series of sine and cosine functions and can be derived as a consequence of the above
result (see Exercise 8.8).
P
Proof. (i): Let gn = |k|≤n ck ek for n ∈ N. We show that (gn )n≥1 is a Cauchy sequence. Let
ε > 0 and choose N ∈ N such that
X 12 ε
|ck |2 < 1 .
|k|≥N
(2π) 2
This proves that (gn )n≥1 is a Cauchy sequence. By completeness (see Theorem 7.7), g :=
limn→∞ gn exists in L2 (0, 2π).
To check (8.7) note that by the continuity of k · k2 and Lemma 8.6,
1
X 12 1
kgk2 = lim kgn k2 = lim (2π) 2 |ck |2 = (2π) 2 k(cn )k`2 .
n→∞ n→∞
|k|≤n
−1
Finally, note that gb(k) = (2π) hg, ek i = limn→∞ (2π)−1 hgn , ek i = ck .69
(ii): Fix n ∈ N. Since hf − sn (f ), ek i = 0 for each |k| ≤ n, also hf − sn (f ), sn (f )i = 0 and
hence (8.5) yields
(8.8) kf k22 = kf − sn (f ) + sn (f )k22 = kf − sn (f )k22 + ksn (f )k22 ≥ ksn (f )k2 .
Let ck = fb(k) for k ∈ Z. Then (8.8) yields:
(8.6) X
kf k22 ≥ ksn (f )k2 = 2π|ck |2 .
|k|≤n
−1
Letting n → ∞, we find k(ck )P k∈Z k`2 ≤ (2π) kf k2 < ∞.
By (i) we can define g = n∈Z cn en where the series converges in L2 (0, 2π). We claim that
f = g in L2 (0, 2π). To see this note that by (i), gb(n) = cn = fb(n). Therefore, the claim follows
from the uniqueness Theorem 8.5 applied to f − g. ,
For f ∈ L2 (0, 2π) Theorem 8.7 yields that f − sn (f ) = |k|>n fb(k)ek and by (8.7)
P
X
(8.9) (L2 -error estimate) kf − sn (f )k22 = 2π |fb(k)|2 .
|k|>n
One can show that sn (f ) will not converge to f uniformly (also see Figure 8.1). This is in particular
clear for x = 0 and x = 2π, because f (0) = π and f (2π) = −π, but sn (f )(0) = sn (f )(2π) = 0.
By applying (8.7) one can obtains a remarkable identity:
X 1
kf k22 = 2π .
k2
k∈Z\{0}
On the other hand, if we calculate kf k22 with the fundamental theorem of calculus, we obtain
Z 2π h i2π 2
kf k22 = (x − π)2 dx = 13 (x − π)3 = π3 .
0 0 3
P∞
Combining both identities gives k∈Z\{0} k12 = 13 π 2 , and so we find k=1 k12 = 16 π 2 .
P
4 4 4
2 2 2
−4 −4 −4
Figure 8.1. The Fourier series of the sawtooth function with n = 2, n = 5 and
n = 10.
8.4. Fourier series in C([0, 2π]). In this section we will give some sufficient condition on f which
imply the Fourier series is uniformly convergent (or equivalently convergent in C([0, 2π]) with the
supremum norm k · k∞ ). Note that fn → f uniformly implies that fn → f in L2 (0, 2π). Indeed,
this follows from
Z 2π Z 2π
(8.11) kfn − f k22 ≤ |fn (x) − f (x)|2 dx ≤ kfn − f k2∞ 1 dx = 2πkfn − f k2∞ .
0 0
From the above we see that convergence of Fouier series in C([0, 2π]) is stronger than conver-
gence in L2 (0, 2π). However, there are example of functions f ∈ C([0, 2π]) with f (0) = f (2π) for
which the uniform convergence (and even the pointwise convergence) fails (see [1, Example 35.11]
and [10, Example 2.5.1]). So apparently more restrictive conditions are needed.
All the different types of convergence can be confusing. Let us summarize some convergence
results for a sequence (fn )n≥1 in L2 (0, 2π).71
L2 -conv. =⇒ L1 -conv. =⇒ a.e.-conv. subsequence.
=⇒
uniform conv.
=⇒
P
Therefore, we can let g = k∈Z fb(k)ek , where the series is absolutely convergent. Moreover,
X
kg − sn (f )k∞ ≤ |fb(k)| → 0,
|k|>n
and hence g ∈ C([0, 2π]) and sn (f ) → g uniformly. By (8.11) the convergence holds in L2 (0, 2π)
as well, and hence
gb(k) = hg, ek i = lim hsn (f ), ek i = fb(k)
n→∞
and therefore, g = f a.e. by Theorem 8.5. Let A = {s ∈ [0, 2π] : f (s) = g(s)}. Then A is closed
and λ(A) = 2π. We claim that A is dense. Indeed, if not then there exists an nonempty open
interval I ⊆ [0, 2π] \ A. It follows that 0 < λ(I) ≤ λ([0, 2π] \ A) = λ([0, 2π]) − λ(A) = 0. This is a
contradiction and thus the claim follows. Since A is also closed in [0, 2π], the claim implies that
A = [0, 2π]. ,
From the proof we see that the following error estimate holds:
X
(8.12) (uniform error estimate) kf − sn (f )k∞ ≤ |fb(k)|.
|k|>n
where used (8.7) in the last step. Therefore, the absolute and uniform convergence follows from
Theorem 8.10. ,
Example 8.12. If g ∈ C([0, 2π]) satisfiesPg(0) = g(2π), g is piecewise continuously differentiable
on (0, 2π) and g 0 ∈ L2 (0, 2π), then g = k∈Z gb(k)ek where the series is absolutely and uniformly
Rt
convergent. Indeed, let F = g − g(0). Then F (t) = −g(0) + 0 g 0 (x) dx, where f := g 0 satisfies the
assumptions of Corollary 8.11.
Exercises
Exercise 8.1. Let f : [0, 2π) → R be given by f = 1[0,π] .
1
(a) Show that fb(k) = 0 for even k 6= 0, and fb(k) = πik for odd k, and fb(0) = 12 .
(b) Write f asPa series of sines as in Example 8.9 and give an estimate of L2 -error given by (8.9).
1
(c) Evaluate j∈Z (2j+1) 2.
Exercise 8.3 (Special Fourier series and kernels). The following kernel’s play a central role in
more advanced theory of Fourier series. Prove the identities below for x ∈ (0, 2π). For each
Pn n+1
exercise you should use the geometric sum k=0 ak = 1−a 1−a for a ∈ C \ {1}.
X sin((n− 12 )x)
(a) (Dirichlet kernel) Show that Dn (x) := ek (x) = for n ≥ 1.
sin( 12 x)
|k|≤n−1
n
1X 1 sin2 (n x2 )
(b) (Féjer kernel) Show that Fn (x) := Dj (x) = for n ≥ 1.
n j=1 n sin2 ( 12 x)
Final remarks:
• One can show that kf − Fn ∗ f kp → 0 for all f ∈ Lp (0, 2π) with p ∈ [1, ∞). In Theo-
rem 8.4 The convergence holds uniformly if f ∈ C([0, 2π]) satisfies f (0) = f (2π). Here
Fn ∗ f is the so-called convolution product of Fn and f and is defined by Fn ∗ f (t) =
R 2π
0
Fn (t − x)f (x) dx. Using the definition of Fn one can check that Fn ∗ f is a trigono-
metric polynomial.
• Similar results for Dn are true as well as long as p ∈ (1, ∞), but this a much deeper
result. Since Dn ∗ f = sn (f ), this implies that kf − sn (f )kp → 0 for all f ∈ Lp (0, 2π) with
p ∈ (1, ∞).
• Finally, we note that sn (f ) → f a.e. for any f ∈ Lp (0, 2π) with p > 1. This is one of the
deepest result in the theory of Fourier series and was proved by Carleson for p = 2 and
extended to p > 1 by Hunt in 1968. It was proved a long time before that the result fails
for p = 1 by Kolmogorov in 1923.
Exercises
Exercise A.1. Prove Proposition A.5 (i) ⇒ (ii).
Exercise∗ A.2. Prove that the intersection of Dynkin systems is again a Dynkin system.
Exercise∗ A.3. Find a version of Proposition A.7 which for measures with µ1 (S) = µ2 (S) = ∞.
Hint: See the proof of Theorem 3.10.
48 MEASURE AND INTEGRATION
In Lemma B.2 we have seen that α is additive. In order to obtain a measure we need a further
condition on α.
Definition B.3. Let S be a set. A function α : P(S) → [0, ∞] is called an outer measure if
(i) α(∅) = 0;
(ii) (monotonicity) A ⊆ B =⇒ α(A) ≤ α(B).
∞
S ∞
P
(iii) (σ-subadditivity) For each sequence (An )n≥1 in P(S) one has α An ≤ α(An ).
n=1 n=1
An outer measure is not necessarily a measure. For instance α(∅) = 0 and α(A) = 1 if A ⊆ S
is non-empty is an example of an outer measure which is not a measure (if S contains at least two
elements).
Next, we show that being an outer measure is the right additional ingredient to prove that α
is a measure on Mα .
Lemma B.4. Let α be an outer measure. Then Mα is a σ-algebra and α is a measure on (S, Mα ).
MEASURE AND INTEGRATION 49
Proof. Lemma B.2 gives that Mα is a ring and α is additive on Mα . It remains to check that for
any disjoint sequence (An )n≥1 ,
∞
[ ∞
X
(B.1) A := An ∈ Mα and α(A) = α(An ).
n=1 n=1
n
S
Let Bn = Aj for each n ≥ 1. Fix an arbitrary Q ⊆ S. For every n ∈ N the following holds:
j=1
Pn
j=1 α(Q ∩ Aj ) + α(Q ∩ Ac ) = α(Q ∩ Bn ) + α(Q ∩ Ac ) (by Lemma B.2)
≤ α(Q ∩ Bn ) + α(Q ∩ Bnc ) (since Ac ⊆ Bnc )
= α(Q) (since Bn ∈ Mα )
Using first the σ-subadditivity of α and then the arbitrariness of n ∈ N in the above, we deduce
∞
X
(B.2) α(Q ∩ A) + α(Q ∩ Ac ) ≤ α(Q ∩ Aj ) + α(Q ∩ Ac ) ≤ α(Q).
j=1
On the other hand by subadditivity also the converse estimate holds: α(Q) ≤ α(Q∩A)+α(Q∩Ac ),
and hence A ∈ Mα . Moreover, all inequalities in (B.2) have to be identities75, and hence (B.1)
follows by taking Q = A in (B.2). -
In the above results we have seen that with outer measures one can construct measures on
certain σ-algebras. Our next aim is to show that there is a natural outer measure associated to
an additive mapping µ on a ring R.
Lemma B.5. Let S be a set and R ⊆ P(S) be a ring. Suppose µ : R → [0, ∞] is additive and
satisfies µ(∅) = 0. For A ⊆ S define
∞
nX ∞
[ o
(B.3) µ∗ (A) = inf µ(Bj ) : A ⊆ Bj , where Bj ∈ R for j ≥ 1 ,
j=1 j=1
where we let µ (A) = ∞ if the above set is empty. Then µ∗ is an outer measure
∗
Proof. The mapping µ∗ : P(S) → [0, ∞] clearly satisfies (i) and (ii). In order to check (iii) let
(An )n≥1 in P and let ε > 0. If µ∗ (An ) = ∞ for some n ≥ 1, then (iii) is trivial. Next assume
µ∗ (An ) < ∞ for all n ≥ 1. Then by definition of µ∗ for each fixed n ≥ 1 we can find Bn,j ∈ R
such that
[∞ X∞
An ⊆ Bn,j and µ∗ (An ) + 2−n ε ≥ µ(Bn,j ).
j=1 j=1
∞ ∞
Bn,j and again by the definition of µ∗ we find
S S
Then An ⊆
n=1 n,j=1
∞
[ ∞
X ∞
X ∞
X
µ∗ An ≤ µ(Bn , j) ≤ µ∗ (An ) + 2−n ε = ε + µ∗ (An ).
n=1 n,j=1 n=1 n=1
Proof. In Lemma B.5 we have proved that µ∗ is an outer measure. Therefore, assertion (i) follows
from Lemma B.4. In remains to prove (ii) and (iii). The assertion concerning µ follows from
(i)–(iii) as the restriction of a measure to a smaller σ-algebra is a measure again.
Step 1: Proof of (ii): Let A ∈ R. It is clear that µ∗ (A) ≤ µ(A). Indeed, take B1 = A and
Bn = ∅ for n ≥ 2 in (B.3). For the converse estimate the case µ∗ (A) = ∞ is clear. Now suppose
∞
µ∗ (A) < ∞ and let B1 , B2 , . . . , ∈ R be such that A ⊆
S
Bn . Then by the σ-additivity of µ on
n=1
∞
S
R and Theorem 2.8 (iii) applied to A = A ∩ Bn , we find
n=1
∞
X ∞
X
µ(A) ≤ µ(A ∩ Bn ) ≤ µ(Bn ).
n=1 n=1
Taking the infimum over all (Bn )n≥1 as above yields µ(A) ≤ µ∗ (A).
Step 2: Proof of (iii): Let A ∈ R and Q ⊆ S. Since µ∗ is subadditive we find µ∗ (Q) ≤
µ (Q ∩ A) + µ∗ (Q ∩ Ac ). For the converse estimate the case µ∗ (Q) = ∞ is trivial. In case µ∗ (Q) <
∗
∞
Bn . Then Bn ∩ A, Bn ∩ Ac ∈ R for all n ≥ 1 and
S
∞, choose B1 , B2 , . . . , ∈ R such that Q ⊆
n=1
∞
[ ∞
[
Q∩A⊆ Bn ∩ A and Q ∩ Ac ⊆ Bn ∩ Ac .
n=1 n=1
Therefore, using first the definition of µ∗ and then the additivity of µ on R, we find
X∞ X∞ ∞
X
µ∗ (Q ∩ A) + µ∗ (Q ∩ Ac ) ≤ µ(Bn ∩ A) + µ(Bn ∩ Ac ) = µ(Bn )
n=1 n=1 n=1
Taking the infimum over all (Bn )n≥1 as above gives µ∗ (Q ∩ A) + µ∗ (Q ∩ Ac ) ≤ µ∗ (Q). Combining
both estimates we can conclude A ∈ Mµ∗ . ,
Exercises
In the following exercise we show that Mµ∗ 6= P(S) in general.76
Exercise B.1. Let S = {1, 2, 3} and define a σ-algebra by A = {∅, S, {1, 2}, {3}}. Assume µ is a
measure satisfying µ({1, 2}) = µ({3}) = 12 .
(a) Show that µ∗ ({1}) = µ∗ ({2}) = 12 .
(b) Show that {1}, {2} ∈
/ Mµ∗ .
Exercise B.2. Let α : P(S) → [0, ∞] be an outer measure and suppose that A ⊆ P(S) satisfies
α(A) = 0. Show that A ∈ Mα .
Exercise∗ B.3. Assume the conditions of Theorem B.6 and assume µ is σ-finite on R, that means
∞
S
there exists a sequence (Sn )n≥1 in R such that µ(Sn ) < ∞ for all n ≥ 1 and Sn = S. Prove
n=1
that the following are equivalent:
(a) A ∈ Mµ∗ ;
(b) There exists a B ∈ σ(R) such that A ⊆ B and µ∗ (B \ A) = 0
Hint: First reduce to the case of finite measure by intersecting with Sn . Use the definition of µ∗
given in (B.3).
76For the Lebesgue measure one also has M ∗ 6= P(R), but this is much harder to prove. See Appendix C
µ
MEASURE AND INTEGRATION 51
There exist sets which are not Lebesgue measurable A subset of R which is not in M(Rd )
is given by Vitali’s example (see for example [4, Theorem 16.31]):
https://en.wikipedia.org/wiki/Vitali_set
52 MEASURE AND INTEGRATION
References
[1] C. D. Aliprantis and O. Burkinshaw. Principles of real analysis. Academic Press, Inc., San Diego, CA, third
edition, 1998.
[2] S. Banach. Théorie des opérations linéaires. Éditions Jacques Gabay, Sceaux, 1993. Reprint of the 1932 original.
[3] F. Burk. Lebesgue measure and integration. Pure and Applied Mathematics (New York). John Wiley & Sons,
Inc., New York, 1998. An introduction, A Wiley-Interscience Publication.
[4] N. L. Carothers. Real analysis. Cambridge University Press, Cambridge, 2000.
[5] D. H. Fremlin. Measure theory. Vol. 1, 2, 3, 4. Torres Fremlin, Colchester, 2004-2006.
[6] D.J.H. Garling. Inequalities: a journey into linear analysis. Cambridge University Press, Cambridge, 2007.
[7] R. Haberman. Applied Partial Differential Equations with Fourier Series and Boundary Valve Problems.
Pearson Higher Ed, 2012.
[8] O. Kallenberg. Foundations of modern probability. Probability and its Applications (New York). Springer-
Verlag, New York, second edition, 2002.
[9] Y. Katznelson. An introduction to harmonic analysis. Cambridge Mathematical Library. Cambridge University
Press, Cambridge, third edition, 2004.
[10] J Korevaar. Fourier analysis and related topics. https://staff.fnwi.uva.nl/j.korevaar/Foubook.pdf.
[11] S.R. Lay. Analysis. With an introduction to proof. 5th ed. Pearson, 2015.
[12] H. Royden and P. Fitzpatrick. Real analysis. 4th ed. New York, NY: Prentice Hall, 4th ed. edition, 2010.
[13] E.M. Stein and R. Shakarchi. Fourier analysis, volume 1 of Princeton Lectures in Analysis. Princeton University
Press, Princeton, NJ, 2003. An introduction.
[14] E.M. Stein and R. Shakarchi. Real analysis. Princeton Lectures in Analysis, III. Princeton University Press,
Princeton, NJ, 2005. Measure theory, integration, and Hilbert spaces.
[15] E.M. Stein and R. Shakarchi. Functional analysis, volume 4 of Princeton Lectures in Analysis. Princeton
University Press, Princeton, NJ, 2011. Introduction to further topics in analysis.
[16] D. Werner. Einführung in die höhere Analysis. Springer-Lehrbuch. [Springer Textbook]. Springer-Verlag,
Berlin, corrected edition, 2009.
[17] D. Williams. Probability with martingales. Cambridge Mathematical Textbooks. Cambridge University Press,
Cambridge, 1991.
[18] A. Zygmund. The role of Fourier series in the development of analysis. In Proceedings of the American Academy
Workshop on the Evolution of Modern Mathematics (Boston, Mass., 1974), volume 2, pages 591–594, 1975.
[19] A. Zygmund. Trigonometric series. Vol. I, II. Cambridge Mathematical Library. Cambridge University Press,
Cambridge, third edition, 2002. With a foreword by Robert A. Fefferman.