LLLLLLLLL
LLLLLLLLL
Ilkka Holopainen
1 Lp-spaces
1.1 Measure space
Definition 1.2. Let X be an arbitrary set and P(X) = {A : A ⊂ X} its power set. A family
Γ ⊂ P(X) is called a σ-algebra (”sigma algebra”) on X if
(1) ∅ ∈ Γ;
(2) A ∈ Γ ⇒ X \ A ∈ Γ; (denote Ac = X \ A)
S
(3) Ai ∈ Γ, i ∈ N ⇒ ∞i=1 Ai ∈ Γ.
Then µ : P(X) → [0, +∞] is a measure (so-called Dirac measure at x ∈ X). Usually it is
denoted by µ = δx .
although A ⊂ E and µ(E) = 0. Note: In the case of a general measure space it is possible that
A ⊂ E, µ(E) = 0, but A 6∈ Γ
Example 1.6. Consider a measure space (Rn , Bor Rn , µ), µ = mn | Bor Rn . Then there exists
B ∈ Bor Rn , µ(B) = 0, and A ⊂ B s.t. A 6∈ Bor Rn .
Let’s prove the case n ≥ 2: (n = 1 later). Let A ⊂ R be a non-Lebesgue measurable set and
f : R → Rn ,
f (x) = (x, 0, . . . , 0) .
Then f is continuous and
m∗n (f A) ≤ m∗n {(x1 , . . . , xn ) ∈ Rn : xi = 0 ∀i = 2, . . . , n} = 0,
hence f A ∈ Leb Rn .
Claim: f A 6∈ Bor Rn .
Assume on the contrary: f A ∈ Bor Rn . Then the pre-image f −1 (f A) = A is a Borel set since
f is continuous [see (1.8)]. This leads to a contradiction since A is not Lebesgue measurable.
Let G ∈ Bor Rn . We say that a mapping g : G → Rm is a Borel mapping (or just Borel) if
(1) g −1 ∅ = ∅ ∈ Bor Rn ⇒ ∅ ∈ Γ;
G \ g−1 V ∈ Bor Rn ⇒ V c ∈ Γ;
(2) V ∈ Γ ⇒ g −1 V c = |{z}
| {z }
∈Bor Rn ∈Bor Rn
S S S
(3) Vi ∈ Γ, i ∈ N ⇒ g −1 i∈N Vi = g −1 V ∈ Bor Rn . ⇒ Vi ∈ Γ.
i∈N | {z }i i∈N
∈Bor Rn
U ⊂ Rm open ⇒ g −1 U ∈ Bor Rn ⇒ U ∈ Γ .
The situation as in Example 1.6 (i.e. A ⊂ E, µ(E) = 0, A 6∈ Γ) does not occur if µ is so called
complete measure.
E ∈ Γ, µ(E) = 0, F ⊂ E ⇒ F ∈ Γ .
This is a minor problem and usually does not cause problems since a (non compete) measure µ
can be completed:
(1) Γ̄ is a σ-algebra on X;
(3) µ = µ̄|Γ.
µ̄ is called the completion of µ (and (X, Γ̄, µ̄) is the completion of (X, Γ, µ)).
A∈Γ
E ∈ Γ, µ(E) = 0
F ⊂E
Proof. Let’s proof some claims (the rest are left as an exercise).
(1) (i): ∅ ∈ Γ̄.
(ii): Let B ∈ Γ̄, B = A ∪ F, where A ∈ Γ, F ⊂ E ∈ Γ and µ(E) = 0. Claim: X \ B ∈ Γ̄. Proof:
Since
X \ B = X \ (A ∪ F ) = X \ (A ∪ E) ∪ E \ (A ∪ F )
| {z } | {z }
∈Γ ⊂E
X \ (A ∪ F )
E
A F
S
(iii): If Bi ∈ Γ̄, i ∈ N, then clearly i∈N Bi ∈ Γ̄ (Exerc.).
2017 5
A1 ⊂ A1 ∪ F1 = A2 ∪ F2 ⊂ A2 ∪ E2
Similarly,
µ(A2 ) ≤ µ(A1 ) ,
and therefore µ(A1 ) = µ(A2 ) = µ̄(B).
(ii): µ̄ is a measure. (Exerc.)
(iii): µ̄ is complete. (Exerc.)
(3) (Exerc.)
Then we can not conclude that f is a Borel function. Instead, we can conclude that f is Lebesgue
measurable (see [Ho, L. 2.23 and 2.27]). (Reason: m complete.)
Remark 1.15. If a measure µ is complete, it makes sense to talk about measurability of functions
that are defined µ-a.e.
1 − p1
I = J1,1 ∪ I1,1 ∪ J1,2 , where J1,1 and J1,2 are closed intervals of length .
2
J1,1 I1,1 J1,2
E1
E3
.. ..
. .
6 Real Analysis I
Next we erase from the middle of each J1,k the open interval I2,k whose length is = p2 ℓ(J1,k ) =
p2 (1−p1 )
2 .
What remains is the set
∞ 2[ j−1 ∞ [
2 j
[ \
E=I\ Ij,k = Jj,k or equivalently
j=1 k=1 j=1 k=1
∞ 2 j
\ [
E= Ej , where Ej = Jj,k is compact .
j=1 k=1
In this course, we say that E = E(p) = is the Cantor set determined by the sequence p.
[Ho, L. 1.60]
E1 ⊃ E2 ⊃ · · · , m(E1 ) < ∞ =⇒
m(E) = lim m(Ej ) = lim (1 − p1 )(1 − p2 ) · · · (1 − pj )
j→∞ j→∞
Y∞
= (1 − pj ) .
j=1
If pj = 1/3 ∀j, the corresponding Cantor set E is called the Cantor 13 -set, in which case
2 j
m(E) = lim = 0.
j→∞ 3
Remark 1.17. The numbers pj can be chosen such that m(E) takes any given value on the
interval [0, 1[. (Exerc.) [Hint: By taking log from the (infinite) product that gives m(E) we obtain
an infinite series. Then choose the numbers pj so that this series is a geometric series (whose sum
we can, of couse, compute). Or more simpler: Let a = m(E) ∈ ]0, 1[ . Choose 0 < p1 < 1 s.t.
a < 1 − p1 < a + 1, p2 ∈ ]0, 1[ s.t. a < (1 − p1 )(1 − p2 ) < a + 1/2 etc.]
(b) If E(p) and E(q) are Cantor sets determined by sequences p and q, then there exists a home-
omorphism
f : E(p) → E(q).
(c) E is uncountable.
Remark 1.19. (i) E is closed and does not contain any (non-empty) open sets ⇒ E is nowhere
dense. [Def. A ⊂ Rn is nowhere dense if int Ā = ∅.]
2017 7
Proof. (a):
Ej closed ⇒ E closed.
⇒ E compact
E ⊂ I, I compact
The construction ⇒ E does not contain open intervals.
(b): If x ∈ E, the clearly ∃ a unique sequence of closed intervals I ⊃ J1,k1 ⊃ J2,k2 ⊃ · · · s.t.
∞
\
Jj,kj = {x} .
j=1
j→∞
because m Jj,kj −−−→ 0.
Define f : E(p) → E(q) as follows:
∞
\ ∞
\
if {x} = Jj,kj (p) , then {f (x)} = Jj,kj (q) .
j=1 j=1
j→∞
δj (p) −−−→ 0 .
E3
If x, y ∈ E(p) and |x − y| < δj (p), then necessarily x and y belong to the same interval Jj,k (p), and
therefore
Concluding similarly we get that f −1 is continuos (or: f continuous bijection and E(p) compact
⇒ f homeomorphism).
(c): If m(E) > 0, then E must be uncountable. Let then E = E(p) be such that m(E) = 0.
Choose a sequence q s.t. m E(q) > 0.
(b) ⇒ ∃ homeomorphism f : E(p) → E(q)
⇒ E(p) uncountable.
E(q) uncountable
Example 1.20. Let us now prove the claim in Example 1.6 in the remaining case n = 1. In other
words, there exists a Lebesgue measurable subset of R that is not Borel.
Proof: Choose Cantor sets E and E ′ s.t. m(E) > 0 and m(E ′ ) = 0. Then there exists a non-
Lebesgue measurable set F ⊂ E.
Now
∞ 2[j−1
[
f: Ij,k → I
j=1 k=1
| {z }
=A
Define f (y) as the above limit at points y ∈ E \{0, 1} and as the one-sided limit at points y ∈ {0, 1}.
We obtain f : I → I that satisfies:
(a) f is a continuous and increasing surjection;
(c) f E = I (because f is constant on each open interval Ij,k ⊂ I \ E and their end points belong
to E).
The function f is called the Cantor 1/3-function ( the Cantor ternary function, ”Devil’s Staircase”).
We will return to this function later.
Let us define
Z
1
L (A) = {f : A → Ṙ | f measurable and |f |dµ < ∞} .
A
Thus: f ∈ L1 ⇐⇒ f : X → Ṙ is integrable.
Remark 1.25. If f : X → Ṙ is integrable, then f (x) ∈ R µ-a.e. We define f ∗ : X → R,
(
f (x), if f (x) ∈ R ,
f ∗ (x) =
0, if f (x) ∈ {−∞, +∞} .
Then f ∗ is integrable and Z Z
∗
f dµ = f dµ .
X X
Threfore we may often assume that the functions in question are real valued.
If f, g ∈ L1 , a, b ∈ R, then af + bg ∈ L1 . Hence L1 is a vector space with real coefficients.
We denote Z Z
kf k1 = kf k1,X = |f |dµ = |f |dµ .
X
Theorem 1.26. k·k satisfies:
(1) kf k1 ≥ 0,
(2) kλf k1 = |λ|kf k1 , ∀λ ∈ R,
(3) kf + gk1 ≤ kf k1 + kgk1 ,
(4) kf k1 = 0 ⇐⇒ f = 0 a.e.
Proof. Clear.
(4’) kf˜k1 = 0 ⇐⇒ f˜ = 0,
where 0 = 0̃ = {f ∈ L1 : f = 0 a.e.}. In what follows we will get rid of the notation L̃1 and we say:
a normed space L1 . We also talk about (L1 -)functions rather than equivalence classes, that is, we
identify functions that agree almost everywhere.
If S = ∅, we set kf k∞ = ∞.
R
{x ∈ X : |f (x)| > α}
α
−α
Theorem 1.29. The space L∞ is a normed space equipped with the norm k·k∞ .
Proof. Clearly:
Moreover:
12 Real Analysis I
Hence
µ {x ∈ X : |f (x) + g(x)| > kf k∞ + kgk∞ } = 0,
and so kf + gk∞ ≤ kf k∞ + kgk∞ and f + g ∈ L∞ .
Hence
f ∈ Lp ⇐⇒ 1 ≤ p < 2 .
2017 13
Proof. Exercise.
aα bβ ≤ αa + βb .
log b
log(αa + βb)
α log a + β log b
αa + βb b
log a
Z Z 1/p Z 1/q
p q
|f g|dµ ≤ |f | dµ |g| dµ .
X X X
Proof. If kf kp = 0, then f = 0 a.e.,and consequently kf gk1 = 0 and the claim follows. Similarly, if
kgkq = 0. Hence we may assume that
kf kp , kgkq > 0 .
Furthermore, we may assume that f (x), g(x) ∈ R ∀x (see Remark 1.25). By applying Young’s
inequality with
|f (x)|p |g(x)|q 1 1
a= p , b = q , α= , β= ,
kf kp kgkq p q
14 Real Analysis I
By integrating over X:n (notice that the functions are measurable) we get
Remark 1.36. The constants p, q > 1, for which p1 + 1q = 1, are called Hölder conjugates (of each
p
other). Often we denote q = p′ = p−1 . The exponent 2 is the only constant that is conjugate with
itself.
kf gk1 ≤ kf k2 kgk2 .
Example 1.38. Let X = {1, 2, . . . , n}, µ : P(X) → [0, ∞[, µ(A) = card A = the number of
elements in A.If 1p + 1q = 1 and a1 , b1 , a2 , b2 , . . . , an , bn ∈ R, then
n n
!1/p n
!1/q
X X X
|ai bi | ≤ |ai |p |bi |q .
i=1 i=1 i=1
Proof: Choose X X
f= ai χ{i} , g= bi χ{i} .
i i
In general:
X = {xi : i = 1, 2, . . .} , Γ = P(X) , and µ(A) = card A .
If f : X → R, then
Z 1/p ∞
!1/p
X
p
kf kp = |f | dµ = |f (xi )|p .
X i=1
Lp (X) = ℓp (X) .
1 1
If f ∈ ℓp (X), g ∈ ℓq (X), p + q = 1, and ai = f (xi ), bi = g(xi ), then the Hölder inequality takes
the form
∞ ∞
!1/p ∞
!1/q
X X X
|ai bi | ≤ |ai |p |bi |q .
i=1 i=1 i=1
kf + gkp ≤ kf kp + kgkp .
2017 15
p 1 1
Proof. The case p = 1 is already proven. Let p > 1 and q = p−1 so that p + q = 1 (Hölder
conjugates).
If a, b ≥ 0, then p p
(a + b)p ≤ 2 max(a, b) = 2p max(a, b) ≤ 2p (ap + bp ) .
We may assume that f (x), g(x) ∈ R ∀x (Remark 1.25). Then
|f + g|p ≤ (|f | + |g|)p ≤ 2p |f |p + |g|p ⇒ f + g ∈ Lp .
and q f +g∈Lp
|f + g|p−1 = |f + g|p =⇒ |f + g|p−1 ∈ Lq .
By Hölder’s inequality we get
Z Z Z
p p p−1
kf + gkp = |f + g| ≤ |f | |f + g| + |g| |f + g|p−1
|{z} | {z } |{z} | {z }
∈Lp ∈Lq ∈Lp ∈Lq
Z 1/q Z 1/q
p−1 q
p−1 q
≤ kf kp |f + g| + kgkp |f + g|
Z 1/q Z 1/q
= kf kp |f + g|p + kgkp |f + g|p
= kf kp + kgkp kf + gkp/q
p
p/q=p−1
=⇒ kf + gkp ≤ kf kp + kgkp .
We have proved:
Theorem 1.40. Lp is a normed space when equipped with the norm k·kp .
d(x, y) = kx − yk .
A normed space (V, k·k) is a Banach space if, for every Cauchy sequence (xj ) in Y , there exists
x ∈ V s.t.
j→∞
kxj − xk −−−→ 0 .
Theorem 1.43. If (fj ) is a Cauchy sequence in Lp , 1 ≤ p < ∞, then there exists a subsequence
(fjk ) that converges (pointwise) almost everywhere.
Note that by Remark 1.25 we may assume that all functions (above) are real valued. We define
By Minkowski’s inequality,
k
X
kgk kp =
|fj1 | + |fjν+1 − fjν |
p
ν=1
k
Minkowski X
≤ kfj1 kp + kfjν+1 − fjν kp
ν=1
k
X 1
≤kfj1 kp + ν
≤ kfj1 kp + 1 ∀k
ν=1
2
2017 17
We obtain
k
X
fjk+1 = fj1 + fjν+1 − fjν → f a.e.
ν=1
Remark 1.44. The condition fj → f in Lp does not (in general) imply that (the whole sequence)
fj → f a.e.
Example: Let Ik be the closed subinterval of I = [0, 1] as in the picture.
I1
I2 I3
I4 I5 I6 I7
jne
Hence fk → 0 in Lp .
Claim: fk (x) 6→ 0 as k → ∞ for any x ∈ I.
Proof: Let x ∈ I and k0 ∈ N be arbitrary.
[
Ik = I ⇒ ∃ k1 > k0 s.t. x ∈ Ik1 and fk1 (x) = 1 .
k>k0
Proof. (a) 1 ≤ p < ∞: Let (fj ) be a Cauchy sequence in Lp . Theorem 1.43 implies that there
exists a subsequence (fjk ) s.t. fjk → f a.e.
Claim: f ∈ Lp and fjk → f in Lp .
Proof: Let ε > 0. Then there exists j0 ∈ N s.t.
kfi − fj kp < ε as i, j ≥ j0 .
If j ≥ j0 , we have
Z Z Z
Fatou
p p
|fj − f | dµ = lim |fj − fjk | dµ ≤ lim inf |fj − fjk |p dµ
k→∞ k→∞
= lim inf kfj − fjk kpp ≤ εp
k→∞
p
fj − f ∈ L
f = fj − (fj − f ) ∈ Lp
⇒ ⇒
j→∞
kf − f k −
j p −−→ 0 fj → f in Lp .
Then µ(Aj ) = 0 = µ(Aj,k ) (this follows directly from the definition of k·k∞ ). Denote
[ [
A= Aj ∪ Aj,k , hence µ(A) = 0 .
j j,k
If x ∈ Ac , then
The Cauchy criterion for the uniform convergence and (1.47) imply that
fj → f uniformly in Ac .
for j ≥ j0 . If x ∈ Ac , we have
⇒ kf k∞ ≤ M , because µ(A) = 0 .
⇒ f ∈ L∞ .
2 Approximation in Lp spaces
2.1 Absolute continuity of measures
Let (X, Γ, µ) be a measure space and let σ : Γ → [0, +∞] be another measure.
µ(A) = 0 ⇒ σ(A) = 0 .
2. Let X = R and σ : Leb R → [0, +∞] be the counting measure. Then the Lebesgue measure
m({0}) = 0 but σ({0}) = 1, and therefore σ 6≪ m.
3. Let X = R, x ∈ R, and δx : Leb R → [0, +∞] be the Dirac measure at x (or, in fact, the
restriction of the Dirac measure to Leb R). Then m({x}) = 0 but δ({x}) = 1, and therefore
δ 6≪ m.
Remark 2.4. Let (X, Γ, µ) be a measure space and ϕ : Γ → Ṙ. We say that
(i) ϕ(∅) = 0,
20 Real Analysis I
P
(ii) if A1 , A2 , . . . ∈ Γ are pairwise disjoint, then i ϕ(Ai ) is defined and
X
ϕ(Ai ) = ϕ ∪i Ai .
i
(c) ϕ is σ-finite if
X = ∪j Aj , Aj ∈ Γ, |ϕ(Aj )| < ∞.
Unfortunately, we shall not prove the following Radon-Nikodym theorem in this course: If (X, Γ, µ)
is σ-finite measure space and if ϕ : Γ → Ṙ is countably additive, σ-finite, and ϕ ≪ µ, then there
exists a measurable function f : X → R s.t.
Z
ϕ(E) = f dµ, ∀E ∈ Γ.
E
If, in addition, g is another function satisfying the equation above, then f = g a.e.
Theorem 2.5. Let (X, Γ, µ) be a measure space σ : Γ → [0, +∞) a measure s.t. σ(X) < ∞. Then
σ ≪ µ ⇐⇒
(2.6) ∀ε > 0 ∃ δ > 0 s.t. µ(A) < δ ⇒ σ(A) < ε .
Denote
[ ∞
\
Ak = Ei , A= Ak .
i≥k k=1
Ak ⊃ Ek ⇒ σ(Ak ) ≥ σ(Ek ) ≥ ε ∀k .
A1 ⊃ A2 ⊃ · · ·
⇒ σ(A) = lim σ(Ak ) ≥ ε .
σ(A1 ) ≤ σ(X) < ∞ k→∞
∞
X 1 1
µ(A) ≤ µ(Ak ) ≤ i
= k−1 ∀k
2 2
i=k
σ≪µ
⇒ µ(A) = 0 =⇒ σ(A) = 0 . Contradiction
⇐ If the condition (2.6) holds, then σ ≪ µ trivially (µ(A) = 0 ⇒ σ(A) < ε ∀ε > 0 ⇒ σ(A) =
0).
2. The proof of Theorem 2.9 is the same as the one in the case of (Rn , Leb Rn , m).
1
If x ∈ H and k ∈ N, then ∃ lk ∈ N s.t. |fm (x) − f (x)| < k ∀m ≥ lk ⇒ x ∈ Ek,lk . Hence
∞
[
H⊂ Ek,l ∀k .
l=1
The sets H and Ek,l are measurable and µ(H) = µ(X) since fm → f a.e.
Ek,l ⊂ Ek,l+1 ⇒
∞
[
(2.13) µ(X) ≥ lim µ(Ek,l ) = µ Ek,l ≥ µ(H) = µ(X)
l→∞
l=1
1
|fm (x) − f (x)| < , if x ∈ Ek,lk and m ≥ lk .
k
1
⇒ |fm (x) − f (x)| < , if x ∈ F and m ≥ lk .
k
⇒ fm |F → f |F uniformly (the index lk is independent of x ∈ F ).
Theorem 2.14 (Lusin’s Theorem). Let A ⊂ Rn be measurable, m(A) < ∞, and let f : A → R be
measurable. Then
Lemma 2.12 ⇒ ∃ compact sets Fi ⊂ Ai s.t. m(Ai \ Fi ) < ε/k. Then F = F1 ∪ · · · ∪ Fk is compact
and Fi ’s are disjoint. If x ∈ F, then ∃ r > 0 s.t. B(x, r) ∩ Fi 6= ∅ for exactly one i.
Fj compact
(Reason: x ∈ Fi ⇒ dist (x, Fj ) = inf |x − y| = min |x − y| > 0 ∀j 6= i .)
y∈Fj y∈Fj
Hence
f (y) = f (x) = ai ∀y ∈ B(x, r) ∩ F ,
and therefore f |F is locally constant and hence f |F is continuous. Also
k
[ k
disjoint X
m(A \ F ) = m (Ai \ Fi ) = m(Ai \ Fi ) < ε .
i=1 i=1
(b): Suppose then that f ≥ 0 is measurable. L. 2.9 ⇒ ∃ simple functions fj s.t. fj ր f. The part
(a) of the proof ⇒ ∃ compact sets Fj ⊂ A s.t.
ε
fj |Fj continuous and m(A \ Fj ) < j .
2
T
Let F0 = j Fj , then
[ X X ε
m(A \ F0 ) = m (A \ Fj ) ≤ m(A \ Fj ) < = ε.
2j
j j j
Egorov’s Theorem ⇒
∃ compact F ⊂ F0 s.t. fj |F → f |F uniformly and m(F0 \ F ) < ε .
Now
fj |F compact
⇒ f |F continuous,
fj |F → f |F uniformly
(c): Suppose finally that f is measurable and write f = f + − f − . The part (b) of the proof ⇒ ∃
compact sets F1 , F2 ⊂ A s.t. f + |F1 , f − |F2 are continuous and m(A \ Fi ) < ε/2, i = 1, 2. The set
F = F1 ∩ F2 fulfils the requirements.
Remark 2.15. 1. The assumption that f is real valued is essential in Egorov’s and Lusin’s
theorems, that is the values ±∞ are not allowed.
2. The assumption µ(X) < ∞ is essential in Egorov’s theorem: Example: X = R, fj = χ[j,∞[.
Then fj (x) → 0 ∀x ∈ R. Denote f = 0. If F ⊂ R s.t. fj |F → f |F uniformly, then ∃j0 s.t.
1
|fj (x) − f (x)| = |fj (x)| <, if j ≥ j0 , x ∈ F
2
⇒ F ∩ [j, ∞[= ∅ ∀j ≥ j0
⇒ [j, ∞[ ⊂ R \ F ⇒ m(R \ F ) = ∞ .
24 Real Analysis I
3. Lusin’s theorem holds also in the case m(A) = ∞ if we just require that the set F is closed.
(Exerc.)
2.16 Convolution in Rn
In this section we consider the measure space (Rn , Leb Rn , m).
Let f, g ∈ L1 (Rn ). The mapping ϕ : Rn → Rn , ϕ(y) = x − y, where x ∈ Rn is a constant,
satisfies:
ϕ(A) measurable ⇐⇒ A measurable.
Hence
y 7→ f (x − y) is measurable
⇒ y 7→ f (x − y)g(y) is measurable.
f0 = f a.e.
g0 = g a.e.
fj ր f + ,
k
X
fj = ai χ A i .
i=1
2017 25
(Note that (ϕj ) need not be increasing ⇒ limj→∞ ϕj need not exist.)
Similarly ∃ a Borel function ϕ− = f − a.e.
Now f0 = ϕ+ − ϕ− is a Borel function and f0 = f a.e.
Similarly for g.
The values of integrals (2.18) and (2.19) do not change if f and g are replaced by f0 and g0 . ⇒
We mays assume: f, g are Borel functions.
Claim: F : Rn × Rn → R, F (x, y) = f (x − y)g(y), is a Borel function.
Proof: The mappings u : Rn × Rn → Rn , u(x, y) n n n
= x − y, and v : R × R → R , v(x, y) = y, are
continuous. Now F (x, y) = f u(x, y) g v(x, y) , eli
F = (f ◦ u)(g ◦ v).
since u : R2n → Rn is continuous (ks. L. 1.7). Hence f ◦ u is Borel. Similarly, we see that g ◦ v is
Borel, and therefore is a Borel function as a product of two Borel functions. By Exercise 1/4, the
”original” F agrees with the ”new” F a.e. in R2n , and therefore the original F is measurable.
We may thus apply Fubini’s theorems. Fubini 1. ⇒
Z Z Z Z
|F (x, y)|dy dx = |F (x, y)|dx dy
Rn Rn Rn Rn
Z Z
= |g(y)| |f (x − y)|dx dy
Rn Rn
= kf k1 kgk1 < ∞ ,
because
Z
(2.21) |f (x − y)|dx = kf k1 .
Rn
1
Lemma 2.12 ⇒ ∃ Fσ -set Bi ⊂ Ai s.t. m(Ai \ Bi ) = 0.
26 Real Analysis I
Remark 2.22. The equation (2.21) holds by the translatoin invariance of the Lebesgue measure:
If
Xk
f= aj χ A j
j=1
k
X
f ◦ϕ= aj χϕ−1 Aj and m(ϕ−1 Aj ) = m(Aj )
j=1
Z k
X k
X Z
⇒ f ◦ϕ = aj m(ϕ−1 Aj ) = aj m(Aj ) = f.
j=1 j=1
In other words, (2.21) holds for simple functions. After this, the general case follows from the
definition of integral.
Question: Why we used Borel sets/functions and not just measurable sets/functions?
Reason: g : Rn → Rm measurable and E ∈ Leb Rm 6⇒ g −1 E ∈ Leb Rn .
Let U ⊂ Rn be open, k ∈ N, f : U → R.
Proof. Let’s make a simple change of variables x − y 7→ y. The value of the integral does not change
(cf. Remark 2.22), and therefore
Z
f ∗ g(x) = f (y)g(x − y)dy
Rn
2017 27
Here M = max|g| (the maximum exists since g is continuous and compactly supported).
Z
f ∗ g(x + h) − f ∗ g(x) = f (y) g(x − y + h) − g(x − y) dy
Rn ⇒ ∀ε > 0 ∃ δ > 0 s.e.
g uniformly continuous in Rn
Z
|f ∗ g(x + h) − f ∗ g(x)| ≤ |f (y)| |g(x − y + h) − g(x − y)| dy
Rn | {z }
<ε
n
< ε kf k1 , for |h| < δ and x ∈ R
| {z }
<∞
⇒ f ∗ g continuous at x.
Let x ∈ Rn . Then
f ∗ g(x + te ) − f ∗ g(x)
i
lim − f ∗ Di g(x)
t→0
t
Z g(x + tei − y) − g(x − y)
= lim f (y) − Di g(x − y) dy
t→0 n t
Z R
≤ lim |f (y)| |ϕ(x − y, t)| dy
t→0 Rn | {z }
≤σ(t)
≤ lim σ(t)kf k1 = 0 ,
t→0
and therefore
Di f ∗ g (x) = f ∗ Di g(x) .
28 Real Analysis I
2.24
Di g ∈ C0 (Rn ) =⇒ Di (f ∗ g) ∈ C(Rn ). Repeating the above we get
D f ∗ g = f ∗ Dg ,
Remark 2.28. Theorems 2.24 and 2.26 hold also for functions f ∈ Lp (Rn ), 1 ≤ p ≤ ∞. Reason:
in the proofs we need not integrate over the whole Rn , it suffices to integrate over a sufficiently
large compact set K since g is compactly supperted. Indeed, by Hölder’s inequality
Z Z
p−1 1/p p−1
|f | ≤ m(K) p |f |p ≤ m(K) p kf kp
K K
p−1
if p > 1 (we interprete above p = 1 if p = ∞).
Our next goal is to use convolution in approximating Lp functions (1 ≤ p < ∞) by C0∞ func-
tions.
For this purpose:
Denote
Z
Ih (A) = |f (x + h) − f (x)|p dx , A ∈ Leb Rn ,
A
Bk = B(0, k) = {x ∈ Rn : |x| < k} , k > 1.
⇒ ∃ k s.t.
Similarly,
Z Z
p p p
Ih (A) ≤ 2 |f | + |f | , if A ∈ Leb Rn and A + h = {a + h : a ∈ A}.
A+h A
2017 29
The integral is absolutely continuous with respect to the Lebesgue measure, and therefore Theorem
2.5 implies ∃ δ > 0 s.t.
F compact ⇒
f |F uniformly continuous.
⇒ ∃ η ∈ ]0, 1[ s.t.
ε
(2.32) |f (x + h) − f (x)|p < , if |h| < η and x, x + h ∈ F .
4m(F )
Let h ∈ B(0, η) be arbitrary. Denote
A1 = {x ∈ F : x + h ∈ F },
A2 = {x : x + h ∈ Bk+1 \ F },
A3 = Bk+1 \ F .
Bk ∩ F ⊂ {x ∈ F : x + h ∈ Bk+1 } ⊂ {x ∈ F : x + h ∈ F } ∪ {x ∈ F : x + h ∈ Bk+1 \ F }
| {z } | {z }
=A1 ⊂A2
⊂ A1 ∪ A2
⇒ Bk = (Bk \ F ) ∪ (Bk ∩ F ) ⊂ A1 ∪ A2 ∪ A3
| {z } | {z }
⊂A3 ⊂A1 ∪A2
Z
n
Ih (R ) = |f (x + h) − f (x)|p dx < ε .
Rn
30 Real Analysis I
Define η : R → [0, ∞[ ,
( 1
e t2 −1 , if |t| < 1,
η(t) =
0, if |t| ≥ 1.
We may choose
gk : Rn → [0, ∞[ continuous,
spt gk ⊂ B̄1/k ,
Z
gk = 1 .
Rn
lim kf − f ∗ gk kp = 0 .
k→∞
2017 31
If p > 1,
Z p
|f (x) − f ∗ gk (x)|p ≤ |f (x) − f (x − y)|gk (y)dy
R n
Z p p
1/p 1/q
= |f (x) − f (x − y)| gk (y) gk (y) dy (where q = )
Rn | {z } p−1
=gk (y)
Z Z q p/q
p
≤ |f (x) − f (x − y)| gk (y)dy gk (y)1/q dy
Hölder Rn n
|R {z }
=1
Z
p
= |f (x) − f (x − y)| gk (y)dy .
Rn
Hence ∀p ≥ 1:
Z
kf − f ∗ gk kpp = |f (x) − f ∗ gk (x)|p dx
Rn
Z Z
≤ |f (x) − f (x − y)|p gk (y)dy dx
n Rn
ZR Z
Fubini 1. p
= gk (y) |f (x) − f (x − y)| dx dy .
Rn Rn
Since spt gk ⊂ B̄1/k , we may assume that |y| ≤ 1/k in the inner integration.
Z
Thm. 2.29 ⇒ |f (x) − f (x − y)|p dx → 0,
Rn
as k → ∞ and |y| ≤ 1/k
⇒ claim.
Z
gk = 1
Rn
Clearly the spaces C0 (Rn ) and C0k (Rn ), k = 1, 2, . . . , ∞, vector subspaces of Lp (Rn ) and, when
equipped with the norm k·kp , they are also normed subspaces of Lp (Rn ).
Definition 2.35. If W is a subspace of a normed space (V, k·k), we say that W is dense in V if
∀v ∈ V ∃ a sequence w1 , w2 , . . . ∈ W s.t. kwi − vk → 0 as i → ∞. (That is, W̄ = V .)
kf − ψk kp → 0 as k → ∞.
(a): Suppose that spt f is compact. Then f ∈ L1 . Choose functions ϕk as inn (2.33). Thm. 2.26
⇒
f ∗ ϕk ∈ C ∞ (Rn ).
If d(x, spt f ) > 1/k and y ∈ spt ϕk (⊂ B̄(0, 1/k)), then x − y 6∈ spt f. and therefore
Z
f ∗ ϕk (x) = f (x − y)ϕk (y)dy = 0 ⇒ spt(f ∗ ϕk ) kompakti.
Rn
k→∞
Thm. 2.34 ⇒ kf − f ∗ ϕk kp −−−→ 0.
Hence we may choose ψk = f ∗ ϕk . (Note: Above spt(f ∗ ϕk ) is compact since it is both closed and
bounded.)
(b): General case: spt f not necessarily compact. Let ε > 0. Denote
kf − fj kp < ε/2 ∀j ≥ j0 .
Then
kf − ψkj j kp < ε ∀j ≥ j0 .
3 Derivative
In this section we study differentiability of integrals, in particular, the question when a function
f : [a, b] → R can be recovered by integrating its derivative f ′ ?
2. There is a more general result (Lebesgue’s theorem): Let g : [a, b] → R be integrable. Then
the function G : [a, b] → R, Z x
G(x) = g(t)dt,
a
is differentiable a.e. and
G′ (x) = g(x) a.e. x ∈ [a, b].
2017 33
4. Let f : [0, 1] → [0, 1] be the Cantor 1/3 function. Then f is a continuous increasing surjection
and f ′ (t) = 0 for a.e. t ∈ [0, 1] (f ′ measurable), but
Z 1 Z 1
′
f (t)dt = 0dt = 0 6= 1 = f (1) − f (0).
0 0
We will study these questions by using ”covering theorems” as a tool. Here we try to ”almost”
cover a given subset of Rn by closed, pairwise disjoint balls. The disjointness of the balls is required
in order to apply the countable additivity of the Lebesgue measure.
Theorem 3.3 (Basic covering theorem). Let F be an arbitrary family of balls in Rn s.t.
D = sup{d(B) : B ∈ F} < ∞,
where d(B) = sup{|x − y| : x, y ∈ B} is the diameter of B. Then there exists a countable (possibly
finite) family G ⊂ F s.t.
Proof. 1. Denote
Fj = {B ∈ F : D/2j < d(B) ≤ D/2j−1 }, j ∈ N,
S
hence F = ∞ j=1 Fj.
Define inductively families Gj ⊂ Fj :
B ∈ F1 ⇒ ∃B ′ ∈ G1 s.t. B ∩ B ′ 6= ∅.
(Thus: We can not add to G1 any ball of F1 without destroying the pairwise disjointness.)
(b) Suppose that families G1 , . . . , Gk−1 are chosen. Let Gk be any maximal collection of pairwise
disjoint balls of Fk s.t.
k−1
[
B ∩ B ′ = ∅ ∀B ′ ∈ Gj .
j=1
34 Real Analysis I
Denote
∞
[
G= Gj ,
j=1
and let us prove that Gj,i is finite (possibly empty), hence Gj ’s and therefore G is countable.
B ∈ Gj,i ⇒ d(B) > D/2j and B ⊂ B̄(0, i) (∗)
⇒ Gj,i finite.
B̄(0, i) compact
Reason for (∗): assume on the contrary: there are infinitely many disjoint balls in Gj,i ⇒ ∃a
sequence xk ∈ B̄(0, i), k ∈ N, s.t.
(for example, every xk is a center of some ball in G j,i .) B̄(0, i) compact ⇒ ∃ a convergent
subsequence of (xk ) that is a contradiction with (3.4).
3. Claim: For all B ∈ F there exists B ′ ∈ G s.t. B ∩ B ′ 6= ∅ and B ⊂ 5B ′ . In particular, then
[ [
B⊂ 5B .
B∈F B∈G
Sk
Proof: If B ∈ F, then B ∈ Fk for some k ∈ N. Since Gk is maximal, there exists B ′ ∈ j=1 Gj s.t.
B ∩ B ′ 6= ∅. On the other hand,
d(B ′ ) > D/2k and d(B) ≤ D/2k−1 ⇒ d(B) < 2d(B ′ )
⇒ B ⊂ 5B ′ .
B ∩ B ′ 6= ∅
5B ′
B B′
Remark 3.5. 1. The part 2 in the proof follows also from the disjointness of the balls in G and
the separability of Rn (Qn is a countable dense subset of Rn ).
2. The basic covering theorem (stated as above) holds for some metric spaces (X, d) provided cer-
tain additional assumptions hold. Read the proof again and think of what kind of properties
(X, d) should have in order to the proof work.
2017 35
Definition 3.6. Let V be a family of balls in Rn . We say that V is a Vitali covering of a set
E ⊂ Rn if
∀x ∈ E and ∀ε > 0 ∃B ∈ V s.t. x ∈ B and d(B) < ε .
Such a family V is a closed Vitali covering (respectively, open) if every ball B ∈ V is closed (resp.
open).
{B ∈ V : d(B) < R}
Corollary 3.8. Let V be a closed Vitali covering of E ⊂ Rn s.t. d(B) < R ∀B ∈ V. Then there
exists a countable family G ⊂ V of pairwise disjoint balls such that for every finite G ∗ ⊂ G we have:
[ [
E\ B⊂ 5B .
B∈G ∗ B∈G\G ∗
Proof. Let G ⊂ V be as in the proof of Theorem 3.3 and let G ∗ = {B1 , B2 , . . . , Bm } ⊂ G be arbitrary.
If
[m
E⊂ Bi , we are done.
i=1
Sm Sm
Otherwise, let x ∈ E \ i=1 Bi . Then i=1 Bi is compact, and therefore
m
[ m
[ m
[
d x, Bi = inf{|x − y| : y ∈ Bi } = min{|x − y| : y ∈ Bi } > 0 .
i=1 i=1 i=1
Sm
Since V is a Vitali covering of E, ∃ B ∈ V s.t. x ∈ B and B ∩ i=1 Bi = ∅ (d(B) small enough).
E
Bi
x
B∈V
It follows from the part 3 of the proof of Theorem S 3.3 that∃ B ′ ∈ G s.t. B ∩ B ′ 6= ∅ and B ⊂ 5B ′ .
m
In particular x ∈ 5B ′ . Now B ′ 6∈ G ∗ , because B ∩ ′ ∗
i=1 Bi = ∅. Thus B ∈ G \ G , and so
[ [
E\ B⊂ 5B .
B∈G ∗ B∈G\G ∗
36 Real Analysis I
Theorem 3.9 (Vitali’s covering theorem). Let E ⊂ Rn (not necessarily measurable) and let V be
a closed Vitali covering of E. Then there exists a countable subfamily G ⊂ V of pairwise disjoint
balls s.t. [
m E\ B = 0.
B∈G
Proof. 1. Suppose fist that E is bounded. Then we may assume that there exists a bounded open
H ⊂ Rn s.t. B ⊂ H ∀B ∈ V. Let G be as in the proof of Theorem 3.3 (and of Corollary 3.8). Let
ε > 0. We will show: [
m∗ E \ B <ε
B∈G
that implies the claim (since ε > 0 arbitrary). The balls in G are pairwise disjoint and ⊂ H, and
hence X [
m(B) = m B ≤ m(H) < ∞ .
B∈G B∈G
⇒ ∃ a finite G∗ ⊂ G s.t. X
m(B) < ε/5n .
B∈G\G ∗
Corollary 3.8 ⇒
[ [ [
m∗ E \ B ≤ m∗ E \ B ≤m 5B
B∈G B∈G ∗ B∈G\G ∗
| {z } | {z }
⊂E\∪B∈G ∗ B ⊂ ∪B∈G\G ∗ 5B
X X
≤ m(5B) = 5n m(B) < ε .
B∈G\G ∗ B∈G\G ∗
ε > 0 arbitrary ⇒ [
m E\ B = 0.
B∈G
Idea: Applying the part 1 to the sets E ∩ Ai we obtain subfamilies Gi ⊂ V. We must take care
that the balls in Gi and Gj intersect each other. This can be dealt with as follows: Ai open,
x ∈ Ai ⇒ ∃ rx > 0 s.t. B(x, r) ⊂ Ai ∀r ≤ rx ⇒ Vi = {B ∈ V : B ⊂ Ai } is a Vitali covering of
E ∩ Ai .
Ai ’s disjoint ⇒
[ ∞
X [ (3.12)
⇒ m E\ B ≤ m (E ∩ Ai ) \ B = 0.
B∈G i=1 B∈Gi
Remark 3.13. Also the covering theorem holds for certain ”metric measure spaces” (X, d, Γ, µ).
Read the proof again and think of what kind of requirements (X, d) and the measure µ should
fulfil.
Lemma 3.15. Let f : X → [0, +∞] be measurable and 0 < p < ∞. Then
Z Z ∞
p
(3.16) f dµ = p tp−1 µ {x : f (x) > t} dt .
X 0
Proof. Idea: (i) Suppose first that f is simple and prove that (by direct computation) that (3.16)
holds. (ii) In the general case, choose a sequence of simple functions fk ր f and apply the monotone
convergence theorem. Details are left as an exercise.
R
f
A
t
{x ∈ X : f (x) > t} X
38 Real Analysis I
R R∞
In the picture above X f dµ = 0 µ {x ∈ X : f (x) > t} dt can be interpreted as the ”product
measure” (µ × m1 )(A) of the shaded area A.
Denote f ∈ L1loc (Rn ) if f : Rn → Ṙ is measurable and
Z
|f | < ∞ ∀ compact K ⊂ Rn .
K
2. f ∈ L1 (Rn ) ⇒ f ∈ L1loc (Rn ). The converse ”⇐” does not hold: for instance f (x) ≡ 1.
Our aim is to prove:
Z
1
f∈ L1loc (Rn ) ⇒ lim f (y)dy = f (x) a. e. x ∈ Rn .
r→0+ m B(x, r) B(x,r)
where B is an (arbitrary) open ball that contains x. Then function M f : Rn → [0, ∞] is the
(Hardy-Littlewood) maximal function of f .
Note: There are different kind of maximal functions in the literature. For instance, the supre-
mum can be taken over balls B(x, r) centered at x. Then we obtain a ”centered” maximal function
of f ∈ L1loc (Rn ), denoted by M̃ f ,
Z
M̃ f (x) = sup |f (y)|dy.
r>0
B(x,r)
Then ∀y ∈ B we have: Z
sup
M f (y) ≥ |f (y)|dy > t ⇒ y ∈ Et .
B
Hence B ⊂ Et and therefore Et is open.
0 2|x|
x
R
If x ∈ Rn \ B(0, R), then B(0, R) ⊂ B(x, 2|x|) (see the picture) and
Z Z
1 1
M f (x) ≥ |f (y)|dy ≥ |f (y)|dy .
m B(x, 2|x|) B(x,2|x|) cn (2|x|)n B(0,R)
| {z }
=I>0
Z Z Z
⇒ M f (x)dx ≥ M f (x)dx ≥ I c−1
n (2|x|)
−n
dx = ∞
Rn Rn \B(0,R) Rn \B(0,R)
since
Z ∞ Z
X
c−1
n (2|x|)
−n
dx = c−1 (2|x|)−n dx
Rn \B(0,R) B(0,2i+1 R)\B(0,2i R) |n {z }
i=0
≥c−1
n (2
i+2 R)−n
∞
X
≥ m B(0, 2i+1 R) \ B(0, 2i R) c−1
n (2
i+2
R)−n
i=0
| {z }
=cn ((2i+1 R)n −(2i R)n )
∞ ∞
X 2n − 1 X
= = c = ∞.
4n
i=0 | {z } i=0
=c>0
3. We say that a measurable function f : Rn → Ṙ belongs to the weak L1 -space, weak-L1 (Rn ), if
there is a constant c = cf < ∞ such that (3.21) holds ∀t > 0. Thus: L1 (Rn ) ⊂ weak-L1 (Rn ),
but weak-L1 (Rn ) 6⊂ L1 (Rn ).
It turns out that the maximal function M f of an integrable function f ∈ L1 (Rn ) satisfies
(3.21). This is one of the most important properties of M f . The proof is based on the Basic
covering theorem 3.3.
Theorem 3.22 (Hardy-Littlewood). If f ∈ L1 (Rn ), then
5n kf k1
(3.23) m {x ∈ Rn : M f (x) > t} ≤ ∀t > 0.
t
Proof. Fix t > 0 and write Mt = {x ∈ Rn : M f (x) > t}. Then ∀x ∈ Mt ∃ an open ball Bx ∋ x (not
necessarily centered at x) s.t. Z
|f (y)|dy > t.
Bx
In other words,
Z
1 kf k1
(3.24) m(Bx ) ≤ |f (y)|dy (≤ ).
t Bx t
We may apply the Basic covering theorem 3.3 ⇒ ∃ countable subfamily G = {B1 , B2 , . . .} ⊂ F of
disjoint open balls s.t. [ [
Mt ⊂ B⊂ 5Bi .
B∈F Bi ∈G
Hence
countable union X X
m(Mt ) ≤ m ∪i 5Bi ≤ m(5Bi ) = 5n m(Bi )
i i
Z
5n
≤ |f (y)|dy.
t Rn
Theorem 3.25. Let 1 < p < ∞ and f ∈ Lp (Rn ). Then M f ∈ Lp (Rn ) and there is a constant
c = c(p, n) s.t.
kM f kp ≤ ckf kp .
Proof. The proof uses Lemma 3.15, the Hardy-Littlewood theorem and Fubini’s theorem.
We omit the details.
x
x0
A
A
x
x0 B(x0 , r) ∩ (A \ {x0 })
∀x ∈ Rn (see the proof of part 3 below). On the other hand, Lusin’s theorem says that a measurable
function is ”almost continuous” (f measurable, ε > 0 ⇒ ∃ closed F ⊂ Rn s.t. m(Rn \ F ) < ε and
f |F continuous), hence a question arises: in what sense (3.27) holds for locally integrable functions.
Theorem 3.28 (Lebesgue differentiation theorem). Let f ∈ L1loc (Rn ) be a locally integrable func-
tion. Then
Z
(3.29) lim |f (y) − f (x)|dy = 0 a.e. x ∈ Rn .
r→0+
B(x,r)
In particular,
Z
(3.30) lim f (y)dy = f (x) a.e. x ∈ Rn .
r→0+
B(x,r)
(Note: Λf (x) is defined ∀x ∈ Rn and the value depends on f (x), thus in particular, on the chosen
representative of the equivalence class.)
Then Λf satisfies:
42 Real Analysis I
1. Λf (x) ≥ 0 ∀x ∈ Rn (clear).
2. Λ is sub linear, i.e.
Λ(f + g) ≤ Λf + Λg, ∀f, g ∈ L1loc (Rn ).
Reason:
Z
Λ(f + g)(x) = lim sup |f (y) + g(y) − f (x) − g(x)|dy
r→0
B(x,r)
△-ineq. Z Z
≤ lim sup |f (y) − f (x)|dy + |g(y) − g(x)|dy
r→0
B(x,r) B(x,r)
Z Z
≤ lim sup |f (y) − f (x)|dy + lim sup |g(y) − g(x)|dy
r→0 r→0
B(x,r) B(x,r)
n
= Λf (x) + Λg(x), ∀x ∈ R .
3. g ∈ C(Rn ) ⇒ Λg(x) = 0 ∀x ∈ Rn .
Reason: Fix x ∈ Rn and ε > 0.
g continuous at x ⇒ ∃ δ > 0 s.t. |g(y) − g(x)| < ε ∀y ∈ B(x, δ).
Hence ∀ 0 < s ≤ δ we have:
Z Z
1 εm B(x, s)
|g(y) − g(x)| dy < εdy = =ε
| {z } m B(x, s) B(x,s) m B(x, s)
B(x,s) <ε
Z !
⇒ Λg(x) = lim sup |g(y) − g(x)|dy ≤ε
r→0 0<s<r
B(x,s)
| {z }
<ε, if 0<s≤δ
4. Λf ≤ M f + |f |.
Reason:
Z △-ey.
Z Z
|f (y) − f (x)|dy ≤ |f (y)|dy + |f (x)|dy ≤ M f (x) + |f (x)|.
B(x,r) B(x,r) B(x,r)
| {z } | {z }
≤M f (x) =|f (x)|
Hence
m∗ {x : Λf (x) > t} ≤ m {x : M (f − g)(x) > t/2} + m {x : |f (x) − g(x)| > t/2}
| {z } | {z }
H.-L. Cheb.
≤ 2·5n kf −gk1 /t ≤ 2kf −gk1 /t
Finally,
Z Z Z Z
f (y)dy − f (x) = f (y)dy − f (x)dy = f (y) − f (x) dy
B(x,r) B(x,r) B(x,r) B(x,r)
| {z }
=f (x)
Z
r→0+
≤ |f (y) − f (x)|dy −−−−→ 0 a.e. x ∈ Rn .
B(x,r)
Furthermore,
m E ∩ B(x, r)
lim = 0 a.e. x ∈ Rn \ E.
r→0+ m B(x, r)
44 Real Analysis I
Furthermore,
Z
m E ∩ B(x, r)
χE (y)dy = ∀x ∈ Rn , r > 0.
m B(x, r)
B(x,r)
This property depends on the value f (x) and hence on the choice of the representative f ∈ L1loc (Rn ).
To get rid of this dependence we say that a point x ∈ Rn b elongs to the Lebesgue set, Leb(f ), of
f if ∃ A = A(x) ∈ R s.t.
Z
(3.34) lim |f (y) − A|dy = 0.
r→0+
B(x,r)
2. Lebesgue diff. theorem 3.28: If f ∈ L1loc (Rn ), then almost every x ∈ Rn belongs to Leb(f ).
Furthermore, A = f (x) for a.e. x ∈ Rn . Thus by modifying f in a set of measure 0 (by setting
f (x) = A(x)) we may assume in what follows that
Z
lim |f (y) − f (x)|dy = 0 ∀x ∈ Leb(f ).
r→0+
B(x,r)
rj
Ej
x
Theorem 3.37. Let f ∈ L1loc (Rn ) and x ∈ Leb(f ). If a sequence Ej shrinks nicely to x, then
Z
lim f (y)dy = f (x).
j→∞
Ej
Proof.
Z Z
Z
f (y)dy − f (x) = f (y)dy − f (x) ≤ |f (y)dy − f (x)|dy
Ej Ej Ej
Z
m B(x, rj ) rj →0
≤ |f (y) − f (x)|dy −−−→ 0 since x ∈ Leb(f ).
m(Ej )
| {z } B(x,rj )
≤ c
(3.36)
Rx
Now we can study differentiability of the function F (x) = a f (t)dt if f : [a, b] → R is integrable
(see Example 3.1.2).
Theorem 3.38. Let f : [a, b] → R be integrable and
Z x
F (x) = f (t)dt, x ∈ [a, b].
a
YThen F is differentiable a.e. and F ′ (x) = f (x) for a.e. x ∈ [a, b].
Proof. Extend f to a function f : R → R by setting f (t) = 0 ∀t ∈ R \ [a, b], hence f ∈ L1 (R).
It suffices to show:
F ′ (x) = f (x) ∀x ∈ Leb(f ) ∩ [a, b]
since almost every x ∈ [a, b] belongs to Leb(f ) (Lebesgue diff. theorem 3.28).
Let x ∈ Leb(f )∩[a, b] and let rj > 0, j ∈ N, be an arbitrary
sequence rj → 0. Denote Ej =]x, x+rj [,,
and so Ej ⊂]x − rj , x + rj [= B(x, rj ) and m B(x, rj ) = 2m(Ej ). Thus the sequence Ej shrinks
nicely to x. Theorem 3.37 ⇒
Z
F (x + rj ) − F (x) 1 x+rj j→∞
= f (t)dt −−−→ f (x).
rj rj x
Similarly, we see that
Z x
F (x − rj ) − F (x) 1 j→∞
= f (t)dt −−−→ f (x)
−rj rj x−rj
by using sets Ej′ =]x − rj , x[. Since (rj ) is an arbitrary sequence, we get
F (x + h) − F (x)
lim = f (x).
h→0 h
46 Real Analysis I
Above the set of discontinuity points of f is countable. This holds for all monotonic functions:
Lemma 3.41. A monotonic function f : [a, b] → R has at most countably many points of discon-
tinuity.
f increasing and bounded in a neighborhood of x ⇒ the limit exists and H(x) ≥ 0. We denote
and prove that Hk is finite. Suppose that x1 , . . . , x2j ∈ Hk s.t. x1 < x2 < · · · < x2j .
For i = 2, 3, . . . , j choose arbitrary x and y s.t. x2i−2 < y < x2i−1 < x < x2i .
Since f is increasing,
which is countable.
Theorem 3.42. Let f : [a, b] → R be a monotonic function. Then the derivative f ′ (x) exists for
a.e. x ∈ [a, b].
f (z) − f (y)
Df (x) = lim inf : a ≤ y ≤ x ≤ z ≤ b, 0 < z − y < ε .
ε→0+ z−y
Indeed, if Df (x) = Df (x) < ∞, then the derivative f ′ (x) = Df (x) = Df (x) exists.
We split the proof of (3.43) in parts:
1. Write
Clearly
\
{x ∈ [a, b] : Df (x) = ∞} = Ek
k∈N
[
{x ∈ [a, b] : Df (x) < Df (x)} = Fs,t . (Note: countable union.)
0<s<t
s,t∈Q
because then
c
m∗ {x : Df (x) = ∞} ≤ ∀k ⇒ m∗ {x : Df (x) = ∞} = 0
| {z } k
⊂Ek ∀k
X ⇒ (3.43).
∗ ∗
m {x : Df (x) < Df (x)} ≤ m (Fs,t ) = 0
| {z }
0<s<t =0
s,t∈Q
f (b)−f (a)
2. Claim: m∗ (Ek ) ≤ k , k ∈ N.4
4
Rd
Compare: g ∈ C 1 , g ′ (t) > k > 0 ∀t ∈ [c, d] ⇒ g(d) − g(c) = c
g ′ (t)dt > k(d − c) ⇒ m([c, d]) = d − c <
(g(d) − g(c))/k.
48 Real Analysis I
f (z) − f (y)
(3.44) > k, thus m (]f (y), f (z)[) > k m [y, z] .
z−y | {z } | {z }
=f (z)−f (y) =z−y
Hence such intervals Ix, ε = [y, z] form a closed Vitali cover of Ek . By Vitali’s covering theorem 3.9
there exists a countable subfamily of disjoint closed intervals Ij = [yj , zj ] ⊂ [a, b] s.t. (3.44) holds
and [
m Ek \ Ij = 0.
j∈N
Now
=0
[ z }| [ { [ X
m∗ (Ek ) ≤ m∗ Ek ∩ Ij + m∗ Ek \ Ij ≤ m Ij ≤ m(Ij )
j∈N j∈N j∈N j∈N
(3.44)
1X
≤ m (]f (yj ), f (zj )[) .
k
j∈N
f increasing, intevals Ij = [yj , zj ] disjoint ⇒ open intervals ]f (yj ), f (zj )[ are disjoint. Hence
1X disjoint 1
[
m∗ (Ek ) ≤ m (]f (yj ), f (zj )[) = m ]f (yj ), f (zj )[
k k
j∈N j∈N
| {z }
⊂[f (a),f (b)]
f (b) − f (a)
≤ .
k
3. Claim: m∗ (Fs,t ) = 0 ∀s, t ∈ Q, 0 < s < t.
Proof: We use Vitali’s covering theorem twice. It follows from the definition of outer measure that
∀ε > 0 ∃ an open G ⊃ Fs,t s.t.
m(G) < m∗ (Fs,t ) + ε.
3a. We apply Vitali’s covering theorem to the set Fs,t (arguing as in part 2).
It follows from definitions of Df (x) and Fs,t that ∀x ∈ Fs,t ∃ arbitrary small closed intervals
[y, z], y < z, s.t. x ∈ [y, z] ⊂ G ∩ [a, b] and
m ]f (y), f (z)[ f (z) − f (y)
(3.45) = < s.
m [y, z] z−y
(Note: G ∋ x open.)
By Vitali’s covering theorem there exists disjoint closed intervals Ij = [yj , zj ] ⊂ G ∩ [a, b], j ∈ N,
s.t. (3.45) holds and [
m Fs,t \ Ij = 0.
j∈N
2017 49
Then
[ disjoint X (3.45) X
m ]f (yj ), f (zj )[ = m ]f (yj ), f (zj )[ < s m [yj , zj ]
j∈N j∈N j∈N
disjoint
[
(3.46) = sm [yj , zj ] ≤ s m(G) < s(m∗ (Fs,t ) + ε).
j∈N
| {z }
⊂G
Furthermore,
[
(3.47) m Fs,t \ ]yj , zj [ = 0,
j∈N
S
because {yj , zj : j ∈ N} is of measure 0. Denote A = j∈N ]yj , zj [.
3b. We apply Vitali’s covering theorem to the set Fs,t ∩ A.
It follows from the definitions of Df (x) and Fs,t that ∀x ∈ Fs,t ∩ A there exists arbitrary small
closed intervals [u, v], u < v, s.t. x ∈ [u, v],
m ]f (u), f (v)[ f (v) − f (u)
(3.48) = > t,
m [u, v] v−u
S
and [u, v] ⊂ ]yj , zj [ for some j ∈ N (this is possible because A = j ]yj , zj [ is a disjoint union of
open intervals).
By Vitali’s covering theorem there exists disjoint closed intervals Jk = [uk , vk ], k ∈ N, as above s.t.
every Jk ⊂ ]yj , zj [ for a suitable j = jk and
[
(3.49) m Fs,t ∩ A \ Jk = 0.
k∈N
S
Denote B = k Jk , hence B ⊂ A and
(3.47) ⇒ Fs,t = (Fs,t ∩ A) ∪ A0 , where A0 = Fs,t \ A of measure 0
⇒
(3.49) ⇒ Fs,t ∩ A = (Fs,t ∩ A ∩ B) ∪ B0 , where B0 = Fs,t ∩ A \ B of measure 0
B⊂A
Fs,t = (Fs,t ∩ A) ∪ A0 = (Fs,t ∩ A ∩ B) ∪ B0 ∪ A0 = (Fs,t ∩ B) ∪ A0 ∪ B0 ,
S
where A0 ∪ B0 is of measure 0. Thus m∗ (Fs,t ) ≤ m(B), B = k Jk . Then
[ disjoint X (3.48) 1 X
m∗ (Fs,t ) ≤ m Jk = m(Jk ) < m ]f (uk ), f (vk )[
t
k k∈N k∈N
disjoint 1
[ 1 [ (3.46) s
= m ]f (uk ), f (vk )[ ≤ m ]f (yj ), f (zj )[ < m∗ (Fs,t ) + ε .
t t t
k∈N j∈N
| S {z }
⊂ j ]f (yj ),f (zj )[
ε > 0 arbitrary ⇒
s
m∗ (Fs,t ) ≤ m∗ (Fs,t )
t s
0<s<t ⇒ <1 ⇒ m∗ (Fs,t ) = 0 ∀s, t ∈ Q, 0 < s < t.
∗
t
m (Fs,t ) ≤ m [a, b] = b − a < ∞
50 Real Analysis I
Remark 3.50. The statement of Theorem 3.42 is the best possible. Indeed:
Let A ⊂ R be an arbitrary set of measure 0. Then there exists a continuous and increasing function
f : R → R for which
Df (x) = ∞ ∀x ∈ A.
Theorem 3.51. If f : [a, b] → R is increasing, the derivative f ′ is integrable and
Z b
(3.52) f ′ (x)dx ≤ f (b) − f (a).
a
Proof. Extend f y setting f (x) = f (b) ∀x > b. Theorem 3.42 ⇒ ∃ derivative f ′ (x) a.e. x ∈ [a, b],
i.e.
′ f x + k1 − f (x)
(3.53) f (x) = lim a.e. x ∈ [a, b].
k→∞ 1/k
Furthermore, f is measurable as an increasing function, hence the function
f x + k1 − f (x)
x 7→
1/k
are measurable ∀k ∈ N, and consequently
1
f x+ k − f (x)
x 7→ lim sup
k→∞ 1/k
is measurable. Since
f x + k1 − f (x) f x + k1 − f (x)
lim sup = lim = f ′ (x) a.e. x ∈ [a, b],
k→∞ 1/k k→∞ 1/k
f ′ is measurable. We notice (by performing a change of variables x + 1/k 7→ x) that
Z b Z b+1/k Z b
1
k f x+ − f (x) dx = k f (x)dx − k f (x)dx
a k a+1/k a
Z b+1/k Z a+1/k
=k f (x) dx − k f (x) dx
b |{z} a |{z}
=f (b) ≥f (a)
| {z }
=f (b)
≤ f (b) − f (a) ∀k ∈ N.
f increasing ⇒
1
k f x+ − f (x) ≥ 0 ∀x ∈ [a, b], k ∈ N,
k
and so by Fatou’s lemma
Z b Z b Z b
′ (3.53) 1 Fatou 1
f (x)dx = lim k f x + − f (x) dx ≤ lim inf k f x+ − f (x) dx
a a k→∞ | k k→∞ k
{z } |a {z }
≥0 and measurable ≤f (b)−f (a)
≤ f (b) − f (a).
Next we will study when an equality holds in (3.52), cf. Example 3.1.4.
2017 51
k
X
Vf (a, x) = sup |f (xi ) − f (xi−1 )|,
i=1
where the supremum is taken over all divisions a = x0 < x1 < x2 < · · · < xk = x of [a, x]. We say
that f is of bounded variation (or has bounded variation) on the interval [a, b] (abbr. f ∈ BV ) if
Vf (a, b) < ∞. A function g : R → R is of bounded variation if Vg (R) = supa<b Vg (a, b) < ∞.
A trivial observation: |f (b) − f (a)| ≤ Vf (a, b), since the points a, b form a division of [a, b].
Example 3.56. 1. f ∈ C 1 [a, b] ⇒ f ∈ BV.
Proof f ∈ C 1 [a, b] ⇒ f ′ : [a, b] → R continuous ⇒ ∃M = max{|f ′ (x)| : x ∈ [a, b]} < ∞.
Let a = x0 < x1 < x2 < · · · < xk = b be an arbitrary division of [a, b]. Then
k k
X VAL X
|f (xi ) − f (xi−1 )| ≤ M (xi − xi−1 ) = M (b − a)
i=1 i=1
sup
=⇒ Vf (a, b) ≤ M (b − a).
2. f ∈ C [a, b] 6⇒ f ∈ BV.
Let f : [0, 1] → R,
(
x sin x1 , jos 0 < x ≤ 1,
f (x) =
0, jos x = 0.
We want to prove that every function of bounded variation can be expressed as a difference of
two increasing functions.
Proof. Suppose that f is increasing. Let a = x0 < x1 < x2 < · · · < xk = b be a division of [a, b].
Then
k
X k
f incr. X
|f (xi ) − f (xi−1 )| = f (xi ) − f (xi−1 ) = f (xk ) − f (x0 ) = f (b) − f (a)
i=1 i=1
sup
=⇒ Vf (a, b) = f (b) − f (a) < ∞.
If f is decresing, then −f is increasing, and therefore Vf (a, b) = V−f (a, b) = f (a) − f (b).
be arbitrary divisions of [a, c] and [c, b], respectively. Then a = x0 < x1 < · · · < xk = y0 < y1 <
· · · < yn = b is a division of [a, b], and therefore
k
X n
X
|f (xi ) − f (xi−1 )| + |f (yi ) − f (yi−1 )| ≤ Vf (a, b).
i=1 i=1
Taking sup over all divisions of [a, c] and [c, b], we obtain
k−1
X n
X
≤ |f (xi ) − f (xi−1 )| + |f (c) − f (xk−1 )| + |f (xk ) − f (c)| + |f (xi ) − f (xi−1 )|
|i=1 {z } |
i=k+1
{z }
≤Vf (a,c) ≤Vf (c,b)
≤ Vf (a, c) + Vf (c, b)
sup
=⇒ Vf (a, b) ≤ Vf (a, c) + Vf (c, b).
Claim: x 7→ Vf (a, x) and x 7→ Vf (a, x) − f (x) are increasing on the interval [a, b].
Proof: Let a ≤ x1 ≤ x2 ≤ b. Lemma 3.58 ⇒
and
Vf (a, x2 ) − f (x2 ) − Vf (a, x1 ) − f (x1 ) = Vf (a, x1 ) + Vf (x1 , x2 ) − f (x2 ) − Vf (a, x1 ) + f (x1 )
= Vf (x1 , x2 ) − f (x2 ) − f (x1 ) ≥ 0.
| {z }
≥|f (x2 )−f (x1 )|
Consequences:
3. f ′ on integrable.
The next result concerning ”integral functions” will be very useful in Section 3.67. We also get
more examples of functions of bounded variation.
Theorem 3.61. Let f : [a, b] → R be integrable, f ∈ L1 [a, b] , and
Z x
F (x) = f (t)dt, x ∈ [a, b].
a
Then F is of bounded variation on the interval [a, b], and the variation of F is
Z b
(3.62) VF (a, b) = |f (t)|dt = kf k1 .
a
Proof. Let a = x0 < x1 < x2 < · · · < xn = b be an arbitrary division of [a, b]. Then
n
X n Z
X xi Z xi−1 X n Z xi
|F (xi ) − F (xi−1 )| = f (t)dt − f (t)dt = f (t)dt
i=1 i=1 a a i=1 xi−1
Xn Z xi Z b
≤ |f (t)|dt = |f (t)|dt.
i=1 xi−1 a
Let ε > 0 be arbitrary. Since g is uniformly continuous ([a, b] closed interval), ∃δ > 0 s.t.
|x − y| < δ
⇒ |g(x) − g(y)| < ε.
x, y ∈ [a, b]
Let a = x0 < x1 < x2 < · · · < xn = b be a division of [a, b] s.t. |xi − xi−1 | < δ ∀i = 1, 2, . . . , n.
Applying △-inequality (several times) we get ∀i = 1, 2, . . . , n
Z xi Z xi
|G(xi ) − G(xi−1 )| = g(t)dt = g(t) − g(xi−1 ) dt + (xi − xi−1 )g(xi−1 )
xi−1 xi−1
Z xi
≥ (xi − xi−1 )|g(xi−1 )| − g(t) − g(xi−1 ) dt
x
Z xi−1
i
≥ (xi − xi−1 )|g(xi−1 )| − |g(t) − g(xi−1 )| dt
xi−1 | {z }
<ε
Z xi
≥ |g(xi−1 )|dt − ε(xi − xi−1 )
xi−1
Z xi
≥ |g(t)| − |g(t) − g(xi−1 )| dt − ε(xi − xi−1 )
x
Z xi−1
i
Z xi
= |g(t)|dt − |g(t) − g(xi−1 )| dt − ε(xi − xi−1 )
xi−1 xi−1 | {z }
<ε
Z xi
≥ |g(t)|dt − 2ε(xi − xi−1 ).
xi−1
ε > 0 mv. ⇒ VG (a, b) ≥ kgk1
(3.64) ⇒ VG (a, b) = kgk1
(3.63)
if g : [a, b] → R is continuous.
B. General case f ∈ L1 [a, b] : We set f (x) = 0 ∀x ∈ R \ [a, b], and so f ∈ L1 (R).
Theorem 2.36 implies that for all ε > 0 there exists g ∈ C(R) s.t. kf − gk1 < ε.
Denote g0 = g|[a, b], hence in particular
Z b
kf − g0 k1 = |f (t) − g(t)|dt ≤ kf − gk1 < ε.
a
2017 55
Define Z x
G0 (x) = g0 (t)dt, x ∈ [a, b].
a
Then
△-ineq.
VG0 (a, b) = VG0 −F +F (a, b) ≤ VG −F (a, b) +VF (a, b) < ε + VF (a, b)
| 0 {z }
≤ kg0 −f k1 <ε
(3.63)
(3.64)
⇒ VF (a, b) ≥ VG0 (a, b) − ε = kg0 k1 − ε = kg0 − f + f k1 − ε
Minkowski
≥ kf k1 − kg0 − f k1 −ε > kf k1 − 2ε.
| {z }
<ε
ε > 0 arbitr. ⇒ VF (a, b) ≥ kf k1
⇒ VF (a, b) = kf k1 .
(3.63)
Proof. Corollary 2.7 (”absolute continuity of integrals”) implies that for all ve > 0 there exists
δ > 0 s.t.
Z
(3.71) E ∈ Leb [a, b] , m(E) < δ ⇒ |f (t)|dt < ε.
E
then
k
X Xk Z bi Z ai
|F (bi ) − F (ai )| = f (t)dt − f (t)dt
i=1 i=1 | a {z a }
Z bi
= f (t)dt
ai
k Z bi Z
X disjoint (3.71)
≤ |f (t)|dt = |f (t)|dt < ε.
i=1 ai E
1. f is (uniformly) continuous,
2. f is of bounded variation,
4. f ′ is integrable.
2. Exerc.
2017 57
2. 3.60
3. f absolutely continuous ⇒ f of bounded variation ⇒ ∃f ′ (x) for a.e. x ∈ [a, b].
2. 3.60
4. f absolutely continuous ⇒ f of bounded variation ⇒ f ′ integrable.
Then
k
X k
X
|f (bi ) − f (ai )| ≤ L (bi − ai ) < ε.
i=1 i=1
”Uniqueness theorem”
Theorem 3.74. If f : [a, b] → R is absolutely continuous and f ′ (x) = 0 for a.e. x ∈ [a, b], then f
is constant.
Proof. Let us show that f (c) = f (a) ∀c ∈]a, b[. Fix c ∈]a, b[ and denote
hence m(E) = c − a. Let ε > 0 be arbitrary and choose δ > 0 as in the definition of absolutely
continuity of f . For every x ∈ E (⊂]a, c[) there exist arbitrary short intervals [x, x+h] ⊂]a, c[, h > 0,
s.t.
|f (x + h) − f (x)|
< ε,
h
and therefore such intervals form a closed Vitali covering of E. The Vitali covering theorem ⇒ ∃
disjoint intervals Ij = [xj , yj ] ⊂]a, c[, j ∈ N, s.t.
and [
m E\ Ij = 0.
j∈N
because
k
[ [ [
lim m ]a, c[\ Ij = m ]a, c[\ Ij = m E \ Ij = 0.
k→∞
j=1 j∈N j∈N
We may assume
a < x1 < y1 < x2 < y2 < · · · < xk < yk < c,
Sk
and hence ]a, c[\ j=1 Ij is a disjoint unionof open intervals ∆j =]pj , qj [, j = 1, . . . , k + 1, (see the
picture) and
k+1
X k
[
(qj − pj ) = m E \ Ij < δ.
j=1 j=1
a I1 I2 Ik c
∆1 ∆2 ∆k+1
We obtain
Xk k+1
X
|f (c) − f (a)| = f (yj ) − f (xj ) + f (qj ) − f (pj )
j=1 j=1
k
X k+1
X
≤ |f (yj ) − f (xj )| + |f (qj ) − f (pj )|
| {z }
j=1 j=1
< ε(yj −xj ) | {z }
(3.75)
< ε
abs. cont.
k
X
<ε (yj − xj ) +ε
j=1
| {z }
≤c−a
≤ ε(c − a + 1).
Example 3.76. Let f : [0, 1] → [0, 1] be the Cantor 1/3-function. L. 3.59 ⇒ f of bounded variation
(f increasing). On the other hand, f ′ (x) = 0 a.e., but f is not a constant, hence f is not absolutely
continuous.
1. f is absolutely continuous,
2. ⇒ 3. Choose g = f ′ .
3. ⇒ 1. Suppose that ∃ integrable g : [a, b] → R s.t.
Z x
f (x) = f (a) + g(t)dt ∀x ∈ [a, b].
a
Z x
Denote G(x) = g(t)dt, x ∈ [a, b] 3.70
a =⇒ G abs. continuous
assumption: g integrable
Choosing x = b ⇒ claim.
Conversely: Suppose that
Z b
(3.80) f ′ (t)dt = f (b) − f (a).
a
Definition 3.81. A function f : [a, b] → R of bounded variation is singular if f ′ (x) = 0 for a.e.
x ∈ [a, b].
(Note: f of bounded variation ⇒ ∃f ′ (x) for a.e. x ∈ [a, b].)
2. Another characterization for absolutely continuous functions (see, for instance, [GZ, 7.45]).
Let f : [a, b] → R. Then f is absolutely continuous ⇐⇒
(a) f continuous,
(b) f is of bounded variation,
(c) E ⊂ [a, b], m(E) = 0 ⇒ m(f E) = 0 ”condition (N )”.
3. Let f : [a, b] → R be a function s.t. ∃f ′ (x) ∀x ∈ [a, b] and f ′ is integrable. Then f s absolutely
continuous (see, for instance, [GZ, 7.47]).
THE END
62 Real Analysis I
References
[EG] Evans, Lawrence and Gariepy Ronald. Measure theory and fine properties of functions, CRC
Press, 1992.
[Fr] Friedman, Avner. Foundations of modern analysis, Dover Publications Inc., 1982.
[GZ] Gariepy, Ronald and Ziemer, William. Modern real analysis, PWS Publishing Company,
1994.
[HS] Hewitt, Edwin and Stromberg, Karl. Real and abstract analysis, Springer-Verlag, 1975.
[Jo] Jones, Frank. Lebesgue integration on Euclidean space, Jones and Bartlett Publishers, 1993.
[Mat] Mattila, Pertti. Geometry of sets and measures in Euclidean spaces, Cambridge University
Press, 1995.
[MW] McDonald, John N. and Weiss, Neil A. A course in real analysis, Academic Press Inc., 1999.
[Ru] Rudin, Walter. Real and complex analysis, McGraw-Hill Book Co., 1987.