Convex Analysis
Convex Analysis
Bauschke · Combettes
Convex Analysis and Monotone Operator Theory in Hilbert Spaces
This book presents a largely self-contained account of the main results of convex analysis,
monotone operator theory, and the theory of nonexpansive operators in the context
of Hilbert spaces. Unlike existing literature, the novelty of this book, and indeed its Canadian
central theme, is the tight interplay among the key notions of convexity, monotonicity,
Mathematical Society
and nonexpansiveness. The presentation is accessible to a broad audience and attempts
to reach out in particular to the applied sciences and engineering communities, where Heinz H. Bauschke Société mathématique
these tools have become indispensable. Graduate students and researchers in pure and
applied mathematics will benefit from this book. It is also directed to researchers in
engineering, decision sciences, economics, and inverse problems, and can serve as a
Patrick L. Combettes du Canada
reference book.
1 Convex Analysis
and Monotone
9 78 1 44 1 994660
Foreword
vii
Preface
Three important areas of nonlinear analysis emerged in the early 1960s: con-
vex analysis, monotone operator theory, and the theory of nonexpansive map-
pings. Over the past four decades, these areas have reached a high level of
maturity, and an increasing number of connections have been identified be-
tween them. At the same time, they have found applications in a wide ar-
ray of disciplines, including mechanics, economics, partial differential equa-
tions, information theory, approximation theory, signal and image process-
ing, game theory, optimal transport theory, probability and statistics, and
machine learning.
The purpose of this book is to present a largely self-contained account
of the main results of convex analysis, monotone operator theory, and the
theory of nonexpansive operators in the context of Hilbert spaces. Authori-
tative monographs are already available on each of these topics individually.
A novelty of this book, and indeed, its central theme, is the tight interplay
among the key notions of convexity, monotonicity, and nonexpansiveness. We
aim at making the presentation accessible to a broad audience, and to reach
out in particular to the applied sciences and engineering communities, where
these tools have become indispensable. We chose to cast our exposition in the
Hilbert space setting. This allows us to cover many applications of interest
to practitioners in infinite-dimensional spaces and yet to avoid the technical
difficulties pertaining to general Banach space theory that would exclude a
large portion of our intended audience. We have also made an attempt to
draw on recent developments and modern tools to simplify the proofs of key
results, exploiting for instance heavily the concept of a Fitzpatrick function
in our exposition of monotone operators, the notion of Fejér monotonicity to
unify the convergence proofs of several algorithms, and that of a proximity
operator throughout the second half of the book.
The book in organized in 29 chapters. Chapters 1 and 2 provide back-
ground material. Chapters 3 to 7 cover set convexity and nonexpansive op-
erators. Various aspects of the theory of convex functions are discussed in
Chapters 8 to 19. Chapters 20 to 25 are dedicated to monotone operator the-
ix
x Preface
ory. In addition to these basic building blocks, we also address certain themes
from different angles in several places. Thus, optimization theory is discussed
in Chapters 11, 19, 26, and 27. Best approximation problems are discussed
in Chapters 3, 19, 27, 28, and 29. Algorithms are also present in various
parts of the book: fixed point and convex feasibility algorithms in Chap-
ter 5, proximal-point algorithms in Chapter 23, monotone operator splitting
algorithms in Chapter 25, optimization algorithms in Chapter 27, and best
approximation algorithms in Chapters 27 and 29. More than 400 exercises
are distributed throughout the book, at the end of each chapter.
Preliminary drafts of this book have been used in courses in our institu-
tions and we have benefited from the input of postdoctoral fellows and many
students. To all of them, many thanks. In particular, HHB thanks Liangjin
Yao for his helpful comments. We are grateful to Hédy Attouch, Jon Borwein,
Stephen Simons, Jon Vanderwerff, Shawn Wang, and Isao Yamada for helpful
discussions and pertinent comments. PLC also thanks Oscar Wesler. Finally,
we thank the Natural Sciences and Engineering Research Council of Canada,
the Canada Research Chair Program, and France’s Agence Nationale de la
Recherche for their support.
1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Sets in Vector Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Nets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 The Extended Real Line . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.6 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.7 Topological Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.8 Two–Point Compactification of the Real Line . . . . . . . . . . . . . . 9
1.9 Continuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.10 Lower Semicontinuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.11 Sequential Topological Notions . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.12 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2 Hilbert Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.1 Notation and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.2 Basic Identities and Inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.3 Linear Operators and Functionals . . . . . . . . . . . . . . . . . . . . . . . . 31
2.4 Strong and Weak Topologies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
2.5 Weak Convergence of Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . 36
2.6 Differentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3 Convex Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.1 Definition and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.2 Best Approximation Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.3 Topological Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.4 Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
xi
xii Contents
13 Conjugation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
13.1 Definition and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
13.2 Basic Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
13.3 The Fenchel–Moreau Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194
16 Subdifferentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
16.1 Basic Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
16.2 Convex Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
16.3 Lower Semicontinuous Convex Functions . . . . . . . . . . . . . . . . . . 229
16.4 Subdifferential Calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240
xiv Contents
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 429
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 449
Chapter 5
Fejér Monotonicity and Fixed Point
Iterations
75
76 5 Fejér Monotonicity and Fixed Point Iterations
Consequently, since (dC (xn ))n∈N was seen in Proposition 5.4(iii) to converge,
(PC xn )n∈N is a Cauchy sequence in the complete set C. ⊓
⊔
Corollary 5.8 Let (xn )n∈N be a sequence in H, let C be a nonempty closed
convex subset of H, and let x ∈ C. Suppose that (xn )n∈N is Fejér monotone
with respect to C and that xn ⇀ x. Then PC xn → x.
5.1 Fejér Monotone Sequences 77
Proposition 5.9 Let (xn )n∈N be a sequence in H and let C be a closed affine
subspace of H. Suppose that (xn )n∈N is Fejér monotone with respect to C.
Then the following hold:
(i) (∀n ∈ N) PC xn = PC x0 .
(ii) Suppose that every weak sequential cluster point of (xn )n∈N belongs to
C. Then xn ⇀ PC x0 .
α2 kPC xn − PC x0 k2 = kPC xn − yα k2
≤ kxn − PC xn k2 + kPC xn − yα k2
= kxn − yα k2
≤ kx0 − yα k2
= kx0 − PC x0 k2 + kPC x0 − yα k2
= d2C (x0 ) + (1 − α)2 kPC xn − PC x0 k2 . (5.3)
Proof. Take x ∈ int C and ρ ∈ R++ such that B(x; ρ) ⊂ C. Define a sequence
(zn )n∈N in B(x; ρ) by
x, if xn+1 = xn ;
(∀n ∈ N) zn = xn+1 − xn (5.4)
x − ρ , otherwise.
kxn+1 − xn k
Thus, n∈N kxn+1 −xn k ≤ kx0 −xk2 /(2ρ) and (xn )n∈N is therefore a Cauchy
P
sequence. ⊓
⊔
Proof. Theorem 5.11 and (5.6) imply that (xn )n∈N converges strongly to some
point x ∈ C. On the other hand, (5.1) yields
Proof. From Example 5.3, (xn )n∈N is Fejér monotone with respect to Fix T .
(i): Let x be a weak sequential cluster point of (xn )n∈N , say xkn ⇀ x.
Since T xkn − xkn → 0, Corollary 4.18 asserts that x ∈ Fix T . Appealing to
Theorem 5.5, the assertion is proved.
(ii): Since D = −D is convex, 0 ∈ D and, since T is odd, 0 ∈ Fix T .
Therefore, by Fejér monotonicity, (∀n ∈ N) kxn+1 k ≤ kxn k. Thus, there
exists ℓ ∈ R+ such that kxn k ↓ ℓ. Now let m ∈ N. Then, for every n ∈ N,
(i): It follows from Corollary 2.14 and the nonexpansiveness of T that, for
every y ∈ Fix T and n ∈ N,
Remark 5.19 It follows from Remark 4.24(iii) that Corollary 5.18 is appli-
cable to firmly nonexpansive operators and, a fortiori, to projection operators
by Proposition 4.8.
p
X
(∀n ∈ N) xn+1 = ωk Ti(k,1) · · · Ti(k,mk ) xn . (5.18)
k=1
Proof. For every i ∈ I, set αi = βi /2 ∈ ]0, 1[. Since, for every i ∈ I, Proposi-
tion 4.8 asserts that PCi is firmly nonexpansive, Corollary 4.29 implies that
Ti is αi -averaged. Borrowing notation from Corollary 5.18, we note that for
every k ∈ {1, . . . , p}, maxi∈Ik αi ∈ ]0, 1[, which implies that ρk ∈ ]0, 1[ and
thus that α ∈ ]0, 1[. Altogether, the result follows from Corollary 5.18 with
λn ≡ 1. ⊓
⊔
where, for every i ∈ I, Pi denotes the projector onto Ci . Then (xn )n∈N
converges weakly to a point in C.
P
Proof. This is an application of Corollary 5.16(iii) with T = i∈I ωi Pi . In-
deed, since the operators (Pi )i∈I are firmly nonexpansive by Proposition 4.8,
their convex combination T is also firmly nonexpansive
T by Example
T 4.31.
Moreover, Proposition 4.34 asserts that Fix T = i∈I Fix Pi = i∈I Ci = C.
Alternatively, apply Corollary 5.18. ⊓
⊔
xn ⇀ y1 = T1 y2 , (5.21)
Tm xn ⇀ ym = Tm y1 , (5.22)
Tm−1 Tm xn ⇀ ym−1 = Tm−1 ym , (5.23)
..
.
T3 · · · Tm xn ⇀ y3 = T3 y4 , (5.24)
T2 · · · Tm xn ⇀ y2 = T2 y3 . (5.25)
Corollary 5.25 Let m be a strictly positive integer, set I = {1, T . . . , m}, let
(Ci )i∈I be a family of closed convex subsets of H such that C = i∈I Ci 6= ∅,
let (Pi )i∈I denote their respective projectors, and let x0 ∈ H. Set (∀n ∈ N)
xn+1 = P1 · · · Pm xn . Then (xn )n∈N converges weakly to a point in C.
Remark 5.26 If, in Corollary 5.25, all the sets are closed affine subspaces,
so is C and we derive from Proposition 5.9(i) that xn ⇀ PC x0 . Corollary 5.28
is classical, and it states that the convergence is actually strong in this case.
In striking contrast, the example constructed in [146] provides a closed hy-
perplane and a closed convex cone in ℓ2 (N) for which alternating projections
converge weakly but not strongly.
Exercises 85
The next result will help us obtain a sharper form of Corollary 5.25 for
closed affine subspaces.
Then xn → PC x0 .
Exercises
Exercise 5.4 Let C be a nonempty subset of H and let (xn )n∈N be a se-
quence in H that is Fejér monotone with respect to C. Show that (xn )n∈N is
Fejér monotone with respect to conv C.
Exercise 5.7 Let m be a strictly positive integer, set I = {1, T. . . , m}, let
(Ci )i∈I be a family of closed convex subsets of H such that C = i∈I Ci 6= ∅,
and let (Pi )i∈I be their respective projectors. Derive parts (ii) and (iii) from
(i) and Theorem 5.5, and also from Corollary 5.18.
(i) Let i ∈ I, let x ∈ Ci , and let y ∈ H. Show that kPi y − xk2 ≤ ky − xk2 −
kPi y − yk2 .
(ii) Set x0 ∈ H and
1
(∀n ∈ N) xn+1 = P1 xn + P1 P2 xn + · · · + P1 · · · Pm xn . (5.34)
m
(a) Let x P∈ C and n ∈ N. Show that kxn+1 − xk2 ≤ kxn − xk2 −
(1/m) i∈I kPi xn − xk2 .
(b) Let x be a weak sequential cluster point of (xn )n∈N . Show that
x ∈ C.
(c) Show that (xn )n∈N converges weakly to a point in C.
(iii) Set x0 ∈ H and
1
(∀n ∈ N) xn+1 = P1 P2 xn +P2 P3 xn +· · ·+Pm−1 Pm xn . (5.35)
m−1
(a) Let x ∈ C and n ∈ N. Show that kxn+1 − xk2 ≤ kxn − xk2 −
Pm−1 2 2
i=1 (kPi+1 xn − xn k + kPi Pi+1 xn − Pi+1 xn k )/(m − 1).
(b) Let x be a weak sequential cluster point of (xn )n∈N . Show that
x ∈ C.
(c) Show that (xn )n∈N converges weakly to a point in C.
Index
460
INDEX 461
cocoercive operator, 60, 61, 68, 70, cyclically monotone operator, 326
270, 294, 298, 325, 336, 339,
355, 370, 372, 377, 379, 438 Debrunner–Flor theorem, 315
coercive function, 158–161, 165, decreasing function, 5
202–204, 210 decreasing sequence of convex sets,
cohypomonotone operator, 337 48, 417
common fixed points, 71 demiclosedness principle, 63
compact set, 7, 8, 13 dense hyperplane, 123
complementarity problem, 376 dense set, 7, 33, 123, 232
complementary slackness, 291, 422 descent direction, 248, 249
complete metric space, 17 descent lemma, 270
composition of set-valued opera- diameter of a set, 16
tors, 2 directed set, 3, 4, 22, 27
concave function, 113 directional derivative, 241, 247
cone, 1, 87, 285, 287, 376, 387, 389, discontinuous linear functional,
425 32, 123, 169
conical hull, 87 discrete entropy, 140
conjugate function, 181 distance, 27
conjugation, 181, 197, 226, 230 distance to a set, 16, 20, 24, 32, 34,
constrained minimization prob- 44, 49, 98, 167, 170, 173, 177,
lem, 283, 285, 383 183, 185, 188, 238, 271, 272
continuity, 9 domain of a function, 5, 6, 113
continuous affine minorant, 168 domain of a set-valued operator, 2
continuous convex function, 123, domain of continuity, 11
136 Douglas–Rachford algorithm, 366,
continuous function, 11 376, 401, 404
continuous linear functional, 31 dual cone, 96
continuous operator, 9, 63 dual optimal value, 214
convergence of a net, 7 dual problem, 212, 214, 275, 279,
convex combination, 44 408
convex cone, 87, 89, 179, 183, 285, dual solution, 275, 279
425 duality, 211, 213, 275
convex feasibility problem, 81, 84 duality gap, 212, 214–216, 221
convex function, 113, 155 Dykstra’s algorithm, 431, 432
convex hull, 43, 44
Eberlein–Šmulian theorem, 35
convex integrand, 118, 138, 193,
effective domain of a function, 6
238
Ekeland variational principle, 19
convex on a set, 114, 125
Ekeland–Lebourg theorem, 263
convex programming problem, 290
enlargement of a monotone opera-
convex set, 43
tor, 309
convexity with respect to a cone,
entropy of a random variable, 139
285
epi-sum, 167
core, 90, 95, 123, 207, 210, 214,
epigraph, 5, 6, 12, 15, 113, 119,
216, 241, 271
133, 168
counting measure, 140
equality constraint, 283
462 INDEX
weak closure, 53
weak convergence, 33, 36, 79–81
weak sequential closure, 53
weak topology, 33
weakly closed, 33–35, 45, 53
weakly compact, 33–35
weakly continuous operator, 33,
35, 62, 418