Ergodic Theory: A Volume in the Encyclopedia of Complexity and Systems Science, Second Edition (Cesar E. Silva, Alexandre I. Danilenko, Eds.)
Cesar E. Silva
Alexandre I. Danilenko
Editors
Ergodic
Theory
A Volume in the Encyclopedia of
Complexity and Systems Science,
Second Edition
Encyclopedia of Complexity and
Systems Science Series
Editor-in-Chief
Robert A. Meyers, Ramtech Limited, Palm Desert, CA, USA
The Encyclopedia of Complexity and Systems Science series of topical volumes provides an authoritative source for understanding and applying the concepts of complexity theory, together with the tools and measures for analyzing complex systems in all fields of science and engineering. Many phenomena at all scales in science and engineering have the characteristics of complex systems, and can be fully understood only through the transdisciplinary perspectives, theories, and tools of self-organization, synergetics, dynamical systems, turbulence, catastrophes, instabilities, nonlinearity, stochastic processes, chaos, neural networks, cellular automata, adaptive systems, genetic algorithms, and so on. Examples of near-term problems and major unknowns that can be approached through complexity and systems science include: the structure, history and future of the universe; the biological basis of consciousness; the integration of genomics, proteomics and bioinformatics as systems biology; human longevity limits; the limits of computing; sustainability of human societies and life on earth; predictability, dynamics and extent of earthquakes, hurricanes, tsunamis, and other natural disasters; the dynamics of turbulent flows; lasers or fluids in physics, microprocessor design; macromolecular assembly in chemistry and biophysics; brain functions in cognitive neuroscience; climate change; ecosystem management; traffic management; and business cycles. All these seemingly diverse kinds of phenomena and structure formation have a number of important features and underlying structures in common. These deep structural similarities can be exploited to transfer analytical methods and understanding from one field to another. This unique work will extend the influence of complexity and system science to a much wider audience than has been possible to date.
With 50 Figures
Editors
Cesar E. Silva
Mathematics
Williams College
Williamstown, MA, USA
Alexandre I. Danilenko
B. Verkin Institute for Low Temperature Physics and Engineering of the NAS of Ukraine
Kharkiv, Ukraine
This Springer imprint is published by the registered company Springer Science+Business Media,
LLC, part of Springer Nature.
The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.
Series Preface
Each entry in each of the Series books was selected and peer reviews organized by one of our university-based book Editors with advice and consultation provided by our eminent Board Members and the Editor-in-Chief. This level of coordination assures that the reader can have a level of confidence in the relevance and accuracy of the information far exceeding that generally found on the World Wide Web. Accessibility is also a priority and for this reason each entry includes a glossary of important terms and a concise definition of the subject. In addition, we are pleased that the mathematical portions of our Encyclopedia have been selected by Math Reviews for indexing in MathSciNet. Also, ACM, the world’s largest educational and scientific computing society, recognized our Computational Complexity: Theory, Techniques, and Applications book, which contains content taken exclusively from the Encyclopedia of Complexity and Systems Science, with an award as one of the notable Computer Science publications. Clearly, we have achieved prominence at a level beyond our expectations, but consistent with the high quality of the content!
Volume Preface
This volume is for researchers and students interested in ergodic theory and
dynamics. We thank Bryna Kra who edited the first edition. We are grateful to
all the researchers who have contributed to this volume.
Robert A. Meyers
President: RAMTECH Limited
Manager, Chemical Process Technology, TRW Inc.
Postdoctoral Fellow: California Institute of Technology
Ph.D. Chemistry, University of California at Los Angeles
B.A. Chemistry, California State University, San Diego
Biography
Dr. Meyers was manager of Energy and Environmental Projects at TRW (now Northrop Grumman) in Redondo Beach, CA, and is now president of RAMTECH Limited. He is coinventor of the Gravimelt process for desulfurization and demineralization of coal for air pollution and water pollution control and was manager of the Department of Energy project leading to the construction and successful operation of a first-of-a-kind Gravimelt Process Integrated Test Plant. Dr. Meyers is the inventor of and was project manager for the DOE-sponsored Magnetohydrodynamics Seed Regeneration Project which has resulted in the construction and successful operation of a pilot plant for production of potassium formate, a chemical utilized for plasma electricity generation and air pollution control. He also managed TRW efforts in magnetohydrodynamics electricity generating combustor and plasma channel development.
Dr. Meyers managed the pilot-scale DoE project for determining the hydrodynamics of synthetic fuels. He is a coinventor of several thermooxidative stable polymers which have
common features. This is discussed in the entry ▶ “Dynamical Systems of Probabilistic Origin: Gaussian and Poisson Systems.”
A classification of ergodic transformations via certain systems of parameters, such as spectrum and entropy, is the central classic problem of ergodic theory. ▶ “The Complexity and the Structure and Classification of Dynamical Systems,” based mostly on examples from ergodic theory, explains the methods for studying the complexity of structure, classification, and anti-classification of the systems via descriptive set theory.
Ergodic, spectral, and joining properties of translation flows on translation surfaces (and their Poincaré maps, interval exchange transformations) and smooth area-preserving locally Hamiltonian flows are discussed in ▶ “Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces.”
The entry ▶ “Operator Ergodic Theory” focuses on the asymptotic behavior of powers of a power-bounded operator in a Banach space. The asymptotic behavior is studied in different operator topologies and in various modes of convergence. That is closely related to the classic topic of ergodic theorems (see ▶ “Ergodic Theorems”).
Some aspects of the interplay between dynamical systems and operator algebras are discussed in ▶ “Dynamical Systems and C*-Algebras.”
While this volume contains articles that address recent areas of ergodic theory, the subject continues to develop at a fast rate, and due to space and time constraints, we have not been able to cover every possible new development.
We want to thank all the authors who have contributed to this volume, as well as the staff at Springer for their help in completing this edition.
Ergodic Theory: Basic Examples and Constructions

Matthew Nicol¹ and Karl Petersen²
¹ Department of Mathematics, University of Houston, Houston, TX, USA
² Department of Mathematics, University of North Carolina, Chapel Hill, NC, USA

Article Outline

Glossary
Definition of the Subject and its Importance
Introduction
Examples
Constructions
Future Directions
Bibliography

Glossary

• A transformation $T$ of a measure space $(X, \mathcal{B}, \mu)$ is measure-preserving if $\mu(T^{-1}A) = \mu(A)$ for all measurable $A \in \mathcal{B}$.
• A measure-preserving transformation $(X, \mathcal{B}, \mu, T)$ is ergodic if $T^{-1}(A) = A$ (mod $\mu$) implies $\mu(A) = 0$ or $\mu(A^c) = 0$ for each measurable set $A \in \mathcal{B}$.
• A measure-preserving transformation $(X, \mathcal{B}, \mu, T)$ of a probability space is weak-mixing if $\lim_{n\to\infty} \frac{1}{n}\sum_{i=0}^{n-1} |\mu(T^{-i}A \cap B) - \mu(A)\mu(B)| = 0$ for all measurable sets $A, B \in \mathcal{B}$.
• A measure-preserving transformation $(X, \mathcal{B}, \mu, T)$ of a probability space is strong-mixing if $\lim_{n\to\infty} \mu(T^{-n}A \cap B) = \mu(A)\mu(B)$ for all measurable sets $A, B \in \mathcal{B}$.
• A continuous transformation $T$ of a compact metric space $X$ is uniquely ergodic if there is only one $T$-invariant Borel probability measure on $X$.
• Suppose $(X, \mathcal{B}, \mu)$ is a probability space. A finite partition $\mathcal{P}$ of $X$ is a finite collection of disjoint (mod $\mu$, i.e., up to sets of measure 0) measurable sets $\{P_1, \dots, P_n\}$ such that $X = \bigcup_i P_i$ (mod $\mu$). The entropy of $\mathcal{P}$ with respect to $\mu$ is $H(\mathcal{P}) = -\sum_i \mu(P_i) \ln \mu(P_i)$ (other bases are sometimes used for the logarithm).
• The metric (or measure-theoretic) entropy of $T$ with respect to $\mathcal{P}$ is $h_\mu(T, \mathcal{P}) = \lim_{n\to\infty} \frac{1}{n} H(\mathcal{P} \vee \dots \vee T^{-n+1}(\mathcal{P}))$, where $\mathcal{P} \vee \dots \vee T^{-n+1}(\mathcal{P})$ is the partition of $X$ into sets of points with the same coding with respect to $\mathcal{P}$ under $T^i$, $i = 0, \dots, n-1$, that is, $x, y$ are in the same set of the partition $\mathcal{P} \vee \dots \vee T^{-n+1}(\mathcal{P})$ if and only if $T^i(x)$ and $T^i(y)$ lie in the same set of the partition $\mathcal{P}$ for $i = 0, \dots, n-1$.
• The metric entropy $h_\mu(T)$ of $(X, \mathcal{B}, \mu, T)$ is the supremum of $h_\mu(T, \mathcal{P})$ over all finite measurable partitions $\mathcal{P}$.
• If $T$ is a continuous transformation of a compact metric space $X$, then the topological entropy of $T$ is the supremum of the metric entropies $h_\mu(T)$, where the supremum is taken over all $T$-invariant Borel probability measures.
• A system $(X, \mathcal{B}, \mu, T)$ is loosely Bernoulli if it is isomorphic to the first-return system to a subset of positive measure of an irrational rotation or a (positive or infinite entropy) Bernoulli system.
• Two systems are spectrally isomorphic if the unitary operators that they induce on their $L^2$ spaces are unitarily equivalent.
• A smooth dynamical system consists of a differentiable manifold $M$ and a differentiable map $f : M \to M$. The degree of differentiability may be specified.
• Two submanifolds $S_1, S_2$ of a manifold $M$ intersect transversely at $p \in M$ if $T_p(S_1) + T_p(S_2) = T_p(M)$.
• An ($\epsilon$-) small $C^r$ perturbation of a $C^r$ map $f$ of a manifold $M$ is a map $g$ such that $d_{C^r}(f, g) < \epsilon$, i.e., the distance between $f$ and $g$ is less than $\epsilon$ in the $C^r$ topology.
• A map $T$ of an interval $I = [a, b]$ is piecewise smooth ($C^k$ for $k \ge 1$) if there is a finite set of
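The glossary definitions of $H(\mathcal{P})$ and of $\frac{1}{n} H(\mathcal{P} \vee \dots \vee T^{-n+1}(\mathcal{P}))$ can be tried out on a small example. The following Python sketch is illustrative only: it assumes that $T$ is a Bernoulli shift with given symbol weights and that $\mathcal{P}$ is the partition by the time-zero symbol, so that the cells of the joined partition are cylinder sets of length $n$. The function names are hypothetical and do not come from the text.

```python
import math
from itertools import product

def partition_entropy(masses):
    """H(P) = -sum_i mu(P_i) ln mu(P_i), with 0 ln 0 taken as 0."""
    return -sum(m * math.log(m) for m in masses if m > 0)

def join_masses(weights, n):
    """Masses of the cells of P v T^{-1}P v ... v T^{-n+1}P.

    Assumption: T is the Bernoulli shift with these symbol weights and P is
    the time-zero partition, so each cell is a cylinder set of length n whose
    mass is the product of the weights of its symbols.
    """
    return [math.prod(weights[s] for s in word)
            for word in product(range(len(weights)), repeat=n)]

def entropy_estimate(weights, n):
    """(1/n) H(P v ... v T^{-n+1}P), the n-th term of the limit defining h_mu(T, P)."""
    return partition_entropy(join_masses(weights, n)) / n
```

Under these assumptions the cells are independent products, so every term of the sequence already equals $H(\mathcal{P})$; for a general system the terms only converge to $h_\mu(T, \mathcal{P})$.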
only if there are no integers $m_1, \dots, m_k$, not all 0, which satisfy $m_1\alpha_1 + \dots + m_k\alpha_k \in \mathbb{Z}$.

which the dynamics is topologically conjugate to an adding machine (de Melo and van Strien 1993).
$h_\sigma(\mu)$ is the measure-theoretic entropy of $\sigma$ with respect to $\mu$. The study of full shifts or shifts of finite type has played a prominent role in the development of the hyperbolic theory of dynamical systems as physical systems with “chaotic” dynamics “typically” possess an invariant set with induced dynamics topologically conjugate to a shift of finite type (see the discussion by Smale in (Smale 1980, p. 147)). Dynamical systems in which there are transverse homoclinic connections are a common example (Guckenheimer and Holmes 1990, Theorem 5.3.5). Furthermore, in certain settings positive metric entropy implies the existence of shifts of finite type. One result along these lines is a theorem of Katok (Katok 1980). Let $h_{top}(f)$ denote the topological entropy of a map $f$ and $h_\mu(f)$ denote metric entropy with respect to an invariant measure $\mu$.

Theorem 4.1 (Katok) Suppose $T : M \to M$ is a $C^{1+\epsilon}$ diffeomorphism of a closed manifold and $\mu$ is an invariant measure with positive metric entropy (i.e., $h_\mu(T) > 0$). Then for any $0 < \epsilon < h_\mu(T)$, there exists an invariant set $\Lambda$ topologically conjugate to a transitive shift of finite type with $h_{top}(T|_\Lambda) > h_\mu(T) - \epsilon$.

More Examples of Subshifts
We consider some further examples of systems that are given by the shift transformation on a subset of the set of (usually doubly infinite) sequences on a finite alphabet, usually {0, 1}. Associated with each subshift is its language, the set of all finite blocks seen in all sequences in the subshift. These languages are extractive (or factorial) (every subword of a word in the language is also in the language) and insertive (or extendable) (every word in the language extends on both sides to longer words in the language). In fact, these two properties characterize the languages (subsets of the set of finite-length words on an alphabet) associated with subshifts.

Prouhet-Thue-Morse
An interesting (and often rediscovered) element of $\{0, 1\}^{\mathbb{Z}_+}$ is produced as follows. Start with 0, and at each stage write down the opposite ($0' = 1$, $1' = 0$) or mirror image of what is available so far. Or, repeatedly apply the substitution $0 \to 01$, $1 \to 10$.

0
01
0110
01101001
...

The n’th entry is the sum, mod 2, of the digits in the dyadic expansion of n. Using Keane’s block multiplication (Keane 1968) according to which if $B$ is a block, $B \times 0 = B$, $B \times 1 = B'$, and $B \times (\omega_1 \dots \omega_n) = (B \times \omega_1) \dots (B \times \omega_n)$, we may also obtain this sequence as

$$0 \times 01 \times 01 \times 01 \times \dots.$$

The orbit closure of this sequence is uniquely ergodic (there is a unique shift-invariant Borel probability measure, which is then necessarily ergodic). It is isomorphic to a skew product (see section “Skew Products”) over the von Neumann-Kakutani adding machine, or odometer (see section “Adding Machines”). Generalized Morse systems, that is, orbit closures of sequences like $0 \times 001 \times 001 \times 001 \times \dots$, are also isomorphic to skew products over compact group rotations.

Chacon System
This is the orbit closure of the sequence generated by the substitution $0 \to 0010$, $1 \to 1$. It is uniquely ergodic and is one of the first systems shown to be weakly mixing but not strongly mixing. It is prime (has no nontrivial factors) (del Junco 1978) and in fact has minimal self joinings (del Junco et al. 1980). It also has a nice description by means of cutting up the unit interval and stacking the pieces, using spacers (see section “Cutting and Stacking”). This system has singular spectrum. It is not known whether or not its Cartesian square is loosely Bernoulli.

Sturmian Systems
Take the orbit closure of the sequence $\omega_n = \chi_{[1-\alpha, 1)}(n\alpha)$, where $\alpha$ is irrational. This is a uniquely ergodic system that is isomorphic to rotation by $\alpha$ on the unit interval. These systems
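The descriptions of the Prouhet-Thue-Morse sequence given earlier on this page (appending the opposite block, iterating the substitution $0 \to 01$, $1 \to 10$, taking the parity of binary digit sums, and Keane's block multiplication) can be compared on finite prefixes. The Python sketch below is a minimal illustration; the function names are hypothetical, and blocks are represented as strings over the characters "0" and "1".

```python
def by_substitution(n_iter):
    """Apply the substitution 0 -> 01, 1 -> 10 to the word "0", n_iter times."""
    rules = {"0": "01", "1": "10"}
    word = "0"
    for _ in range(n_iter):
        word = "".join(rules[c] for c in word)
    return word

def opposite(block):
    """The opposite block B': exchange the symbols 0 and 1."""
    return block.translate(str.maketrans("01", "10"))

def by_opposite(n_iter):
    """Start with 0 and repeatedly append the opposite of what is available so far."""
    word = "0"
    for _ in range(n_iter):
        word += opposite(word)
    return word

def by_digit_sum(length):
    """n-th entry = sum mod 2 of the digits in the dyadic expansion of n."""
    return "".join(str(bin(n).count("1") % 2) for n in range(length))

def block_product(b, w):
    """Keane's product: B x 0 = B, B x 1 = B', extended letter by letter over w."""
    return "".join(b if c == "0" else opposite(b) for c in w)

def by_keane_product(n_iter):
    """The finite product 0 x 01 x 01 x ... with n_iter factors 01."""
    word = "0"
    for _ in range(n_iter):
        word = block_product(word, "01")
    return word
```

All four functions return the same prefix of length $2^k$ after $k$ steps, which is a quick way to see that the constructions agree.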
Piecewise $C^2$ Expanding Maps
The main statistical features of the examples in section “Smooth Expanding Interval Maps” generalize to a broader class of expanding maps of the interval. For example:
Let $X = [0, 1]$ and let $\mathcal{P} = \{I_1, \dots, I_n\}$ ($n \ge 2$) be a partition of $X$ into intervals (closed, half-open, or open) such that $I_i \cap I_j = \emptyset$ if $i \ne j$. Let $I_i^o$ denote the interior of $I_i$. Suppose $T : X \to X$ satisfies:

(a) For each $i = 1, \dots, n$, $T|_{I_i}$ has a $C^2$ extension to the closure $\bar{I}_i$ of $I_i$ and $|T'(x)| \ge \alpha > 1$ for all $x \in I_i^o$.
(b) $T(I_j) = \bigcup_{i \in P_j} I_i$ Lebesgue a.e. for some nonempty subset $P_j \subset \{1, \dots, n\}$.
(c) For each $I_j$, there exists $n_j$ such that $T^{n_j}(I_j) = [0, 1]$ Lebesgue a.e.

Then $T$ has an invariant measure $\mu$ which is absolutely continuous with respect to Lebesgue measure $m$, and there exists $C > 0$ such that

$$\frac{1}{C} \le \frac{d\mu}{dm} \le C.$$

Furthermore, $T$ is ergodic with respect to $\mu$ and displays the same statistical properties listed above for the $C^2$ expanding maps (Boyarsky and Góra 1997a) (See the “Folklore Theorem” in the article on Measure-Preserving Systems).

More Interval Maps

Continued Fraction Map
This is the map $T : [0, 1] \to [0, 1]$ given by $Tx = 1/x \bmod 1$, and it corresponds to the shift $[0; a_1, a_2, \dots] \to [0; a_2, a_3, \dots]$ on the continued fraction expansions of points in the unit interval (a map on $\mathbb{N}^{\mathbb{N}}$). It preserves a unique finite measure equivalent to Lebesgue measure, the Gauss measure $dx/((\log 2)(1 + x))$. It is Bernoulli with entropy $\pi^2/(6 \log 2)$ (in fact the natural partition into intervals is a weak Bernoulli generator; for definition and details, see (Phillips and Varadhan 1975)). By using the Ergodic Theorem, Khintchine and Lévy showed that
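$$(a_1 \cdots a_n)^{1/n} \to \prod_{k=1}^{\infty} \left(1 + \frac{1}{k^2 + 2k}\right)^{\log k/\log 2} \quad \text{a.e. as } n \to \infty,$$

if $[0; a_1, \dots, a_n] = p_n/q_n$, then

$$\frac{1}{n} \log q_n \to \frac{\pi^2}{12 \log 2} \quad \text{a.e.};$$

$$\frac{1}{n} \log \left| x - \frac{p_n(x)}{q_n(x)} \right| \to -\frac{\pi^2}{6 \log 2} \quad \text{a.e.}$$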
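The shift description of the continued fraction map and the convergents $p_n/q_n$ appearing in these limits can be sketched in a few lines of Python. This is a minimal illustration using exact rational arithmetic, so it only handles rational inputs, whose expansions terminate; floating-point inputs would lose accuracy after a few digits. The function names and the convention $T0 = 0$ are assumptions made for the example.

```python
from fractions import Fraction
from math import floor

def gauss_map(x):
    """T x = 1/x mod 1 for x in (0, 1]; T 0 = 0 is a convention assumed here."""
    if x == 0:
        return Fraction(0)
    y = 1 / Fraction(x)
    return y - floor(y)

def cf_digits(x, max_digits=50):
    """Digits a_1, a_2, ... of x = [0; a_1, a_2, ...], read off as a_n = floor(1 / T^{n-1} x)."""
    x = Fraction(x)
    digits = []
    while x != 0 and len(digits) < max_digits:
        digits.append(floor(1 / x))
        x = gauss_map(x)
    return digits

def convergents(digits):
    """Convergents p_n / q_n of [0; a_1, ..., a_n] via the standard recurrence."""
    p_prev, q_prev = 1, 0   # p_{-1}, q_{-1}
    p, q = 0, 1             # p_0, q_0 (the integer part is 0)
    out = []
    for a in digits:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        out.append(Fraction(p, q))
    return out
```

For instance, `cf_digits(Fraction(93, 415))` gives `[4, 2, 6, 7]`, and applying `gauss_map` once drops the first digit, which is the shift $[0; a_1, a_2, \dots] \to [0; a_2, a_3, \dots]$.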
Moreover, if $m$ is Lebesgue measure (or any equivalent measure) and $\mu$ is Gauss measure, then for each interval $I$, $m(T^{-n}I) \to \mu(I)$, in fact exponentially fast, with a best constant 0.30366 . . . See (Billingsley 1978a; Mayer 1991).

The Farey Map
This is the map $U : [0, 1] \to [0, 1]$ given by $Ux = x/(1 - x)$ if $0 \le x \le 1/2$, $Ux = (1 - x)/x$ if $1/2 \le x \le 1$. It is ergodic for the $\sigma$-finite infinite measure $dx/x$ (Rényi/Parry). It is also ergodic for the Minkowski measure $d?$, which is a measure of maximal entropy. This map corresponds to the shift on the Farey tree of rational numbers which provide the intermediate convergents (best one-sided) as well as the continued fraction (best two-sided) rational approximations to irrational numbers. See (Lagarias 1991, 1992).

F-Expansions
Generalizing the continued fraction map, let $f : [0, 1] \to [0, 1]$ and let $\{I_n\}$ be a finite or infinite partition of $[0, 1]$ into subintervals. We study the map $f$ by coding itineraries with respect to the partition $\{I_n\}$. For many examples, absolutely continuous (with respect to Lebesgue measure) invariant measures can be found and their
dynamical properties determined. See (Schweiger 1995a).

β-Shifts
This is the special case of f-expansions when $f(x) = \beta x \bmod 1$ for some fixed $\beta > 1$. This map of the interval is called the β-transformation. With a proper choice of partition, it is represented by the shift on a certain subshift of the set of all sequences on the alphabet $D = \{0, 1, \dots, \lfloor\beta\rfloor\}$, called the β-shift. A point $x$ is expanded as an infinite series in negative powers of $\beta$ with coefficients from this set; $d_\beta(x)_n = \lfloor \beta f^n(x) \rfloor$. (By convention, terminating expansions are replaced by eventually periodic ones.) A one-sided sequence on the alphabet $D$ is in the β-shift if and only if all of its shifts are lexicographically less than or equal to the expansion $d_\beta(1)$ of 1 base $\beta$. A one-sided sequence on the alphabet $D$ is the valid expansion of 1 for some $\beta$ if and only if it lexicographically dominates all its shifts. These were first studied by Bissinger, Rényi, and Parry; there are good summaries by Bertrand-Mathis (1986) and Blanchard (1989).
For $\beta = \frac{1 + \sqrt{5}}{2}$, $d_\beta(1) = 10101010 \dots$.
For $\beta = \frac{3}{2}$, $d_\beta(1) = 101000001 \dots$ (not eventually periodic).
Every β-shift is coded.
The topological entropy of a β-shift is $\log \beta$.
There is a unique measure of maximal entropy $\log \beta$.
A β-shift is a shift of finite type if and only if the β-expansion of 1 is finite. It is sofic if and only if the expansion of 1 is eventually periodic. If $\beta$ is a Pisot-Vijayaraghavan number (algebraic integer all of whose other conjugates have modulus less than 1), then the β-shift is sofic. If the β-shift is sofic, then $\beta$ is a Perron number (algebraic integer of maximum modulus among its conjugates).

Theorem 4.2 Parry (1966) Every strongly transitive (for every nonempty open set $U$, $\bigcup_{n > 0} T^n U = X$) piecewise monotonic map on $[0, 1]$ is topologically conjugate to a β-transformation.

Gaussian Systems
Consider a real-valued stationary process $\{f_k : -\infty < k < \infty\}$ on a probability space $(\Omega, \mathcal{F}, P)$. The process (and the associated measure-preserving system consisting of the shift and a shift-invariant measure on $\mathbb{R}^{\mathbb{Z}}$) is called Gaussian if for each $d \ge 1$, any $d$ of the $f_k$ form an $\mathbb{R}^d$-valued Gaussian random variable on $\Omega$. This means that with $E(f_k) = m$ for all $k$ and

$$A_{ij} = \int_\Omega (f_{k_i} - m)(f_{k_j} - m)\, dP = C(k_i - k_j) \quad \text{for } i, j = 1, \dots, d,$$

for each Borel set $B \subset \mathbb{R}^d$,
$$P\{\omega : (f_{k_1}(\omega), \dots, f_{k_d}(\omega)) \in B\} = \frac{1}{(2\pi)^{d/2} \sqrt{\det A}} \int_B \exp\left(-\frac{1}{2} (x - (m, \dots, m))^{tr} A^{-1} (x - (m, \dots, m))\right) dx_1 \dots dx_d.$$
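As a sanity check on this density formula, the following Python sketch builds the matrix $A_{ij} = C(k_i - k_j)$ for $d = 2$ and evaluates the integrand, using the explicit inverse of a 2×2 matrix. The covariance function $C(k) = \rho^{|k|}$ and all function names are illustrative assumptions, not taken from the text; the probability of a rectangle is approximated by a plain midpoint rule.

```python
import math

def covariance_matrix(C, ks):
    """A_ij = C(k_i - k_j) for the chosen time indices ks."""
    return [[C(ki - kj) for kj in ks] for ki in ks]

def gaussian_density_2d(A, m, x):
    """Integrand of the density formula for d = 2 at the point x = (x1, x2)."""
    (a, b), (c, d) = A
    det = a * d - b * c
    u0, u1 = x[0] - m, x[1] - m
    # (x - m)^tr A^{-1} (x - m), with A^{-1} = (1/det) [[d, -b], [-c, a]]
    quad = (d * u0 * u0 - (b + c) * u0 * u1 + a * u1 * u1) / det
    return math.exp(-0.5 * quad) / (2 * math.pi * math.sqrt(det))

def box_probability(A, m, lo, hi, steps=160):
    """Midpoint-rule approximation of P{(f_k1, f_k2) in [lo, hi]^2}."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        for j in range(steps):
            x = (lo + (i + 0.5) * h, lo + (j + 0.5) * h)
            total += gaussian_density_2d(A, m, x)
    return total * h * h

RHO = 0.5                                            # hypothetical correlation parameter
A = covariance_matrix(lambda k: RHO ** abs(k), [0, 1])
```

With these choices `A` is `[[1.0, 0.5], [0.5, 1.0]]`, and integrating over a large box should return a value close to 1.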
The function $C(k)$ is positive semidefinite and hence has an associated measure $\sigma$ on $[0, 2\pi]$ such that

$$C(k) = \int_0^{2\pi} e^{ikt}\, d\sigma(t).$$

Theorem 4.3 The Gaussian system is ergodic if and only if the “spectral measure” $\sigma$ is continuous (i.e., nonatomic), in which case it is also weakly mixing. It is mixing if and only if $C(k) \to 0$ as $|k| \to \infty$. If $\sigma$ is singular with respect to Lebesgue measure, then the entropy is 0; otherwise the entropy is infinite (de la Rue 1993).
For more details, see (Cornfeld et al. 1982a).

Hamiltonian Systems
This paragraph is from the article on Measure-Preserving Systems. Many systems that model physical situations can be studied by means of
Hamilton’s equations. The state of the entire system at any time is specified by a vector $(q, p) \in \mathbb{R}^{2n}$, the phase space, with $q$ listing the coordinates of the positions of all of the particles, and $p$ listing the coordinates of their momenta. We assume there is a time-independent Hamiltonian function $H(q, p)$ such that the time development of the system satisfies Hamilton’s equations:

$$\frac{dq_i}{dt} = \frac{\partial H}{\partial p_i}, \quad \frac{dp_i}{dt} = -\frac{\partial H}{\partial q_i}, \quad i = 1, \dots, n. \qquad (4)$$

Often in applications, the Hamiltonian function is the sum of kinetic and potential energy:

$$H(q, p) = K(p) + U(q). \qquad (5)$$

Solving these equations with initial state $(q, p)$ for the system produces a flow $(q, p) \to T_t(q, p)$ in phase space which moves $(q, p)$ to its position $T_t(q, p)$ $t$ units of time later. According to Liouville’s formula (Mañé 1987a, Theorem 3.2), this flow preserves Lebesgue measure on $\mathbb{R}^{2n}$. Calculating $dH/dt$ by means of the Chain Rule

$$\frac{dH}{dt} = \sum_i \left( \frac{\partial H}{\partial p_i} \frac{dp_i}{dt} + \frac{\partial H}{\partial q_i} \frac{dq_i}{dt} \right)$$

and using Hamilton’s equations shows that $H$ is constant on orbits of the flow, and thus each set of constant energy $X(H_0) = \{(q, p) : H(q, p) = H_0\}$ is an invariant set. There is a natural invariant measure on a constant energy set $X(H_0)$ for the restricted flow, namely the measure given by rescaling the volume element $dS$ on $X(H_0)$ by the factor $1/\|\nabla H\|$.

Billiard Systems
These form an important class of examples in ergodic theory and dynamical systems, motivated by natural questions in physics, particularly the behavior of gas models. Consider the motion of a particle inside a bounded region $D$ in $\mathbb{R}^d$ with piecewise smooth ($C^1$ at least) boundaries. In the case of planar billiards, we have $d = 2$. The particle moves in a straight line with constant speed until it hits the boundary, at which point it undergoes a perfectly elastic collision with the angle of incidence equal to the angle of reflection and continues in a straight line until it next hits the boundary. It is usual to normalize and consider unit speed, as we do in this discussion for convenience. We take coordinates $(x, v)$ given by the Euclidean coordinates in $x \in D$ together with a direction vector $v \in S^{d-1}$. A flow $\phi_t$ is defined with respect to Lebesgue almost every $(x, v)$ by translating $x$ a distance $t$ defined by the direction vector $v$, taking account of reflections at boundaries. $\phi_t$ preserves a measure absolutely continuous with respect to Riemannian volume on $(x, v)$ coordinates. The flow we have described is called a billiard flow. The corresponding billiard map is formed by taking the Poincaré map corresponding to the cross-section given by the boundary $\partial D$. We will describe the planar billiard map; the higher dimensional generalization is clear. The billiard map is a map $T : \partial D \to \partial D$, where $\partial D$ is coordinatized by $(s, \theta)$, $s \in [0, L]$, where $L$ is the length of $\partial D$ and $\theta \in (0, \pi)$ measures the angle that inward pointing vectors make with the tangent line to $\partial D$ at $s$. Given a point $(s, \theta)$, the angle $\theta$ defines an oriented line $l(s, \theta)$ which intersects $\partial D$ in two points $s$ and $s'$. Reflecting $l$ in the tangent line to $\partial D$ at the point $s'$ gives another oriented line passing through $s'$ with angle $\theta'$ (measured with respect to the angular coordinate system based at $s'$). The billiard map is the map $T(s, \theta) = (s', \theta')$. $T$ preserves a measure $\mu = \sin\theta\, ds\, d\theta$. The billiard flow may be modeled as a suspension flow over the billiard map (see section “Suspension Flows”).
If the region $D$ is a polygon in the plane (or polyhedron in $\mathbb{R}^d$), then $\partial D$ consists of the faces of the polyhedron. The dynamical behavior of the billiard map or flow in regions with only flat (noncurved) boundaries is quite different to that of billiard flows or maps in regions $D$ with strictly convex or strictly concave boundaries. The topological entropy of a flat polygonal billiard is zero. Research interest focuses on the existence and density of periodic or transitive orbits. It is known that if all the angles between sides are rational multiples of $\pi$, then there are periodic orbits (Boshernitzan 1992; Vorobets et al. 1992; Masur 1986) and they are dense in the phase space
(Boshernitzan et al. 1998). It is also known that a residual set of polygonal billiards are topologically transitive and ergodic (Zemljakov and Katok 1975; Kerckhoff et al. 1986).
On the other hand, billiard maps in which $\partial D$ has strictly convex components are physical examples of nonuniformly hyperbolic systems (with singularities). The meaning of concave or convex varies in the literature. We will consider a billiard flow inside a circle to be a system with a strictly concave boundary, while a billiard flow on the torus from which a circle has been excised to be a billiard flow with strictly convex boundary. The class of billiards with some strictly convex boundary components, sometimes called dispersing billiards or Sinai billiards, was introduced by Sinai (Sinaĭ 1970) who proved many of their fundamental properties. Lazutkin (Lazutkin 1973) proved that planar billiards with generic strictly concave boundary are not ergodic. Nevertheless, Bunimovich (Bunimovič 1974; Bunimovich 1979) produced a large billiard system, Bunimovich billiards, with strictly concave boundary segments (perhaps with some flat boundaries as well) which were ergodic and nonuniformly hyperbolic. For more details, see (Chernov and Markarian 2006a; Katok et al. 1986; Liverani and Wojtkowski 1995; Tabachnikov 2005). We will discuss possibly the simplest example of a dispersing billiard, namely a toral billiard with a single convex obstacle. Take the torus $T^2$ and consider a single strictly convex subdomain $S$ with $C^\infty$ boundary. The domain of the billiard map is $[0, L] \times (0, \pi)$, where $L$ is the length of $\partial S$. The measure $\sin(\theta)\, ds\, d\theta$ is preserved. If the curvature of $\partial S$ is everywhere nonzero, then the billiard map $T$ has positive topological entropy, periodic points are dense, and in fact the system is isomorphic to a Bernoulli shift (Gallavotti and Ornstein 1974).

KAM-Systems and Stably Nonergodic Behavior
A celebrated theorem of Kolmogorov, Arnold, and Moser (the KAM theorem) implies that the set of ergodic area-preserving diffeomorphisms of a compact surface without boundary is not dense in the $C^r$ topology for $r \ge 4$. This has important implications, in that there are natural systems in which ergodicity is not generic. The constraint of perturbing in the class of area-preserving diffeomorphisms is an appropriate imposition in many physical models. We will take the version of the KAM theorem as given in (Mañé 1987a, Theorem 5.1) (original references include (Kolmogorov 1954; Arnol’d 1963; Moser 1962)). An elliptic fixed point for an area-preserving diffeomorphism $T$ of a surface $M$ is called a nondegenerate elliptic fixed point if there is a local $C^r$, $r \ge 4$, change of coordinates $h$ so that in polar coordinates

$$hTh^{-1}(r, \theta) = (r, \theta + \alpha_0 + \alpha_1 r) + F,$$

where all derivatives of $F$ up to order 3 vanish, $\alpha_1 \ne 0$, and $\alpha_0 \ne 0, \pm\frac{\pi}{2}, \pi, \pm\frac{2\pi}{3}$. A map of the form

$$\tau(r, \theta) = (r, \theta + \alpha_0 + \alpha_1 r),$$

where $\alpha_1 \ne 0$, is called a twist map. Note that a twist map leaves invariant the circle $r = k$, any constant $k$, and rotates each invariant curve by a rigid rotation $\alpha_1 r$, the magnitude of the rotation depending upon $r$. With respect to two-dimensional Lebesgue measure, a twist map is certainly not ergodic.

Theorem Suppose $T$ is a volume-preserving diffeomorphism of class $C^r$, $r \ge 4$, of a surface $M$. If $x$ is a nondegenerate elliptic fixed point, then for every $\epsilon > 0$ there exists a neighborhood $U_\epsilon$ of $x$ and a set $U_{0,\epsilon} \subset U_\epsilon$ with the properties:

(a) $U_{0,\epsilon}$ is a union of $T$-invariant simple closed curves of class $C^{r-1}$ containing $x$ in their interior.
(b) The restriction of $T$ to each such invariant curve is topologically conjugate to an irrational rotation.
(c) $m(U_\epsilon - U_{0,\epsilon}) \le \epsilon\, m(U_\epsilon)$, where $m$ is Lebesgue measure on $M$.

As a corollary, we have:

Corollary Let $M$ be a compact surface without boundary and $\mathrm{Diff}^r(M)$ the space of $C^r$ area-preserving diffeomorphisms with the $C^r$ topology. Then the set of $T \in \mathrm{Diff}^r(M)$ which are ergodic with respect to the probability measure
determined by normalized area is not dense in $\mathrm{Diff}^r(M)$ for $r \ge 4$.

Smooth Uniformly Hyperbolic Diffeomorphisms and Flows
Time series of measurements on deterministic dynamical systems sometimes display limit laws exhibited by independent identically distributed random variables, such as the central limit theorem, and also various mixing properties. The models of hyperbolicity we discuss in this section have played a key role in showing how this phenomenon of “chaotic behavior” arises in deterministic dynamical systems. Hyperbolic sets and their associated dynamics have also been pivotal in studies of structural stability. A smooth system is $C^r$ structurally stable if a small perturbation in the $C^r$ topology gives rise to a system which is topologically conjugate to the original. When modeling a physical system, it is desirable that slight changes in the modeling parameters do not greatly affect the qualitative or quantitative behavior of the ensemble of orbits considered as a whole. The orbit of a point may change drastically under perturbation (especially if the system has sensitive dependence on initial conditions) but the collection of all orbits should ideally be “similar” to the original unperturbed system. In the latter case, one would hope that statistical properties also vary only slightly under perturbation. Structural stability is one, quite strong, notion of stability. The conclusion of a body of work on structural stability is that a system is $C^1$ structurally stable if and only if it is uniformly hyperbolic and satisfies a technical assumption called strong transversality (see below for details).
Suppose $M$ is a $C^1$ compact Riemannian manifold equipped with metric $d$ and tangent space $TM$ with norm $\|\cdot\|$. Suppose also that $U \subset M$ is a nonempty open subset and $T : U \to T(U)$ is a $C^1$ diffeomorphism. A compact $T$-invariant set $\Lambda \subset U$ is called a hyperbolic set if there is a splitting of the tangent space $T_pM$ at each point $p \in \Lambda$ into two invariant subspaces, $T_pM = E^u(p) \oplus E^s(p)$, and a number $0 < \lambda < 1$ such that for $n \ge 0$

$$\|D_pT^n v\| \le C\lambda^n \|v\| \quad \text{for } v \in E^s(p),$$
$$\|D_pT^{-n} v\| < C\lambda^n \|v\| \quad \text{for } v \in E^u(p).$$

The subspace $E^u$ is called the unstable or expanding subspace and the subspace $E^s$ the stable or contracting subspace. The stable and unstable subspaces may be integrated to produce stable and unstable manifolds

$$W^s(p) = \{y : d(T^n p, T^n y) \to 0\} \quad \text{as } n \to \infty,$$
$$W^u(p) = \{y : d(T^{-n} p, T^{-n} y) \to 0\} \quad \text{as } n \to \infty.$$

The stable and unstable manifolds are immersions of Euclidean spaces of the same dimension as $E^s(p)$ and $E^u(p)$, respectively, and are of the same differentiability as $T$. Moreover, $T_p(W^s(p)) = E^s(p)$ and $T_p(W^u(p)) = E^u(p)$. It is also useful to define local stable manifolds and local unstable manifolds by
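The hyperbolic splitting and the contraction estimates on this page can be checked by hand on a linear example. The Python sketch below assumes a specific map that is not part of the text: the linear toral automorphism induced by the matrix with rows (2, 1) and (1, 1), for which the splitting $E^u \oplus E^s$ is the same at every point and is given by the two eigendirections. The names and the choice $C = 1$, $\lambda = (3 - \sqrt{5})/2$ are assumptions of the example.

```python
import math

# Hypothetical example: the matrix of a linear automorphism of the 2-torus
# and its inverse (the determinant is 1, so the inverse is again integral).
A = ((2, 1), (1, 1))
A_INV = ((1, -1), (-1, 2))

LAM_U = (3 + math.sqrt(5)) / 2   # expanding eigenvalue, > 1
LAM_S = (3 - math.sqrt(5)) / 2   # contracting eigenvalue, in (0, 1)

# Eigenvectors: (A - lam I) v = 0 gives v = (1, lam - 2).
E_U = (1.0, LAM_U - 2)           # spans the unstable direction
E_S = (1.0, LAM_S - 2)           # spans the stable direction

def apply(mat, v):
    """Matrix-vector product for 2x2 matrices stored as nested tuples."""
    return (mat[0][0] * v[0] + mat[0][1] * v[1],
            mat[1][0] * v[0] + mat[1][1] * v[1])

def iterate(mat, v, n):
    """Apply mat to v a total of n times (the derivative of the n-th iterate)."""
    for _ in range(n):
        v = apply(mat, v)
    return v

def norm(v):
    """Euclidean norm of a 2-vector."""
    return math.hypot(v[0], v[1])

def contraction_ratio(mat, v, n):
    """||mat^n v|| / ||v||, to be compared with lambda^n."""
    return norm(iterate(mat, v, n)) / norm(v)
```

In this example `contraction_ratio(A, E_S, n)` and `contraction_ratio(A_INV, E_U, n)` both equal `LAM_S ** n` up to rounding, so the two estimates hold with equality in place of the inequalities.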
Finally, we discuss the notion of strong transversality. We say a point $x$ is nonwandering if for each open neighborhood $U$ of $x$ there exists an $n > 0$ such that $T^n(U) \cap U \ne \emptyset$. The set $NW$ of nonwandering points is called the nonwandering set. We say a dynamical system has the strong transversal property if $W^s(x)$ intersects $W^u(y)$ transversely for each pair of points $x, y \in NW$.
In the $C^r$, $r \ge 1$ topology, Robbin (1971), de Melo (1973), and Robinson (1975, 1976) proved that dynamical systems with the strong transversal property are structurally stable, and Robinson (1973) in addition showed that strong transversality was also necessary. Mañé (1988) showed that a $C^1$ structurally stable diffeomorphism must be uniformly hyperbolic, and Hayashi (1997)
Ergodic Theory: Basic Examples and Constructions 15
extended this to flows. Thus a $C^1$ diffeomorphism or flow on a compact manifold is structurally stable if and only if it is uniformly hyperbolic and satisfies the strong transversality condition.

\[ f_t(p, v) = \bigl(\gamma_{p,v}(t), \dot{\gamma}_{p,v}(t)\bigr), \]

where (p, v) ∈ TM. Since geodesics have constant speed, if ‖v‖ = 1 then $\|\dot{\gamma}_{p,v}(t)\| = 1$ for all t, and thus the unit tangent bundle $T^1M = \{(p, v) \in TM : \|v\| = 1\}$ is preserved under the geodesic flow. The geodesic flow and its restriction to the unit tangent bundle both preserve a volume form, Liouville measure. In 1934, Hedlund (1934) proved that the geodesic flow on the unit tangent bundle of a surface of strictly negative constant sectional curvature is ergodic, and in 1939 Hopf (1939) extended this result to manifolds of arbitrary dimension and strictly negative (not necessarily constant) curvature. Hopf’s technique of proof of ergodicity (the Hopf argument) was extremely influential and used the foliation of the tangent space into stable and unstable manifolds. For a clear exposition of this technique, and the property of absolute continuity of the foliations into stable and unstable manifolds, see (Liverani and Wojtkowski 1995). The geodesic flow on manifolds of constant negative sectional curvature is an Anosov flow (see section “Anosov Systems”). We remark that for surfaces sectional curvature is the same as Gaussian curvature. Recently, the time-one map of the geodesic flow on the unit tangent bundle of a surface with constant negative curvature, which is a partially hyperbolic system (see section “Partially Hyperbolic Dynamical Systems”), was shown to be stably ergodic (Grayson et al. 1994), so the geodesic flow is still playing a major role in the development of ergodic theory.

Since each matrix ±I corresponds to the identity transformation, we consider matrices in PSL(2, ℝ) ≔ SL(2, ℝ)/{±I}. The unit tangent bundle, Sℋ+, of the upper half-plane can be identified with PSL(2, ℝ). Then the geodesic flow corresponds to the transformations

\[ t \in \mathbb{R} \mapsto \begin{pmatrix} e^{t} & 0 \\ 0 & e^{-t} \end{pmatrix} \]

seen as acting on PSL(2, ℝ). The unstable foliation of an element A ∈ PSL(2, ℝ) ≅ Sℋ+ is given by

\[ \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix} A, \quad t \in \mathbb{R}, \]

and the flow along this foliation, given by

\[ t \in \mathbb{R} \mapsto \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}, \]

is called the horocycle flow, similarly for the flow induced on the unit tangent bundle of each quotient of the upper half-plane by a discrete group of linear fractional transformations. The geodesic and horocycle flows acting on a (finite-volume) surface of constant negative
curvature form the fundamental example of a transverse pair of actions. The geodesic flow often has many periodic orbits and many invariant measures, has positive entropy, and is in fact Bernoulli with respect to the natural measure (Ornstein and Weiss 1973), while the horocycle flow is often uniquely ergodic (Furstenberg 1973; Marcus 1975) and of entropy zero, although mixing of all orders (Marcus 1978). See (Hasselblatt and Katok 2003a) for more details.

Markov Partitions and Coding
If (X, T, ℬ, μ) is a dynamical system, then a finite partition of X always induces a coding of the orbits and a semiconjugacy with a subshift on a symbol space (it may not of course be a full conjugacy). For hyperbolic systems, a special class of partitions, Markov partitions, induce a conjugacy for the invariant dynamics to a subshift of finite type. A Markov partition P for an invariant subset Λ of a diffeomorphism T of a compact manifold M is a finite collection of sets $R_i$, 1 ≤ i ≤ n, called rectangles. The rectangles have the property that, for some ϵ > 0, if x, y ∈ $R_i$ then $W^s_\epsilon(x) \cap W^u_\epsilon(y) \in R_i$. This is sometimes described as being closed under local product structure. We let $W^u(x, R_i)$ denote $W^u_\epsilon(x) \cap R_i$ and $W^s(x, R_i)$ denote $W^s_\epsilon(x) \cap R_i$. Furthermore, we require for all i, j:

1. Each $R_i$ is the closure of its interior.
2. $\Lambda \subset \bigcup_i R_i$.
3. $R_i \cap R_j = \partial R_i \cap \partial R_j$ if i ≠ j.
4. If $x \in R_i^{o}$ and $T(x) \in R_j^{o}$, then $W^u(T(x), R_j) \subset T(W^u(x, R_i))$ and $W^s(x, R_i) \subset T^{-1}(W^s(T(x), R_j))$.

Anosov Systems
An Anosov diffeomorphism (Anosov 1967) is a uniformly hyperbolic system in which the entire manifold is a hyperbolic set. Thus, an Anosov diffeomorphism is a $C^1$ diffeomorphism T of M with a DT-invariant splitting (which is a continuous splitting) of the tangent space $T_pM$ at each point p into a direct sum

\[ T_pM = E^u(p) \oplus E^s(p) \]

and 0 < λ < 1, constant C such that $\|DT^{n}v\| < C\lambda^{n}\|v\|$ for all $v \in E^s(p)$ and $\|DT^{-n}w\| \le C\lambda^{n}\|w\|$ for all $w \in E^u(p)$.

A similar definition holds for Anosov flows f : ℝ × M → M. A flow is Anosov if there is a splitting of the tangent bundle into flow-invariant subspaces $E^u$, $E^s$, and $E^c$, so that $D_pf_t E^s_p = E^s_{f_t(p)}$, $D_pf_t E^u_p = E^u_{f_t(p)}$, and $D_pf_t E^c_p = E^c_{f_t(p)}$, and at each point p ∈ M

\[ T_pM = E^s_p \oplus E^u_p \oplus E^c_p, \]
\[ \|D_pf_{t} v\| < C\lambda^{t}\|v\| \ \text{ for } v \in E^s(p), \]
\[ \|D_pf_{-t} v\| < C\lambda^{t}\|v\| \ \text{ for } v \in E^u(p), \]

for some 0 < λ < 1. The tangent to the flow direction $E^c(p)$ is a neutral direction:

\[ \|D_pf_t v\| = \|v\| \ \text{ for } v \in E^c(p). \]

Anosov proved that Anosov flows and diffeomorphisms which preserve a volume form are ergodic (Anosov 1967) and are also structurally stable. Sinai (1968) constructed Markov partitions for Anosov diffeomorphisms and hence coded trajectories via a subshift of finite type. Using ideas from statistical physics in (Sinaĭ 1972), Sinai constructed Gibbs measures for Anosov systems. An SRB measure (see section “Physically Relevant Measures and Strange Attractors”) is a type of Gibbs measure corresponding to the potential $-\log|\det DT|_{E^u}|$ and is characterized by the property of absolutely continuous conditional measures on unstable manifolds.

The simplest examples of Anosov diffeomorphisms are perhaps the two-dimensional hyperbolic toral automorphisms (the n > 2 generalization is clear). Suppose A is a 2 × 2 matrix with integer entries

\[ A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \]

such that det(A) = 1 and A has no eigenvalues of modulus 1. Then A defines a transformation of the two-dimensional torus $\mathbb{T}^2 = S^1 \times S^1$ such that if $v \in \mathbb{T}^2$,
a partially hyperbolic diffeomorphism with central direction given by the flow direction. There is no expansion or contraction in the central direction.

Nonuniformly Hyperbolic Systems
The assumption of uniform hyperbolicity is quite restrictive, and few “chaotic systems” found in applications are likely to exhibit uniform hyperbolicity. A natural weakening of this assumption, and one that is nontrivial and greatly extends the applicability of the theory, is to require the hyperbolic splitting (no longer uniform) to hold only at almost every point of phase space. A systematic theory was built by Pesin (1976, 1977) on the assumption that the system has nonzero Lyapunov exponents μ-almost everywhere, where μ is a Lebesgue-equivalent invariant probability measure. Recall that a number λ is a Lyapunov exponent for p ∈ M if $\|D_pT^{n}v\| \sim e^{\lambda n}$ for some unit vector $v \in T_pM$. Oseledets’ theorem (Oseledec 1968) (see also Walters 1982a, p. 232), which is also called the Multiplicative Ergodic Theorem, implies that if T is a $C^1$ diffeomorphism of M, then for any T-invariant ergodic measure μ almost every point has well-defined Lyapunov exponents. One of the highlights of Pesin theory is the following structure theorem: If T : M → M is a $C^{1+\epsilon}$ diffeomorphism with a T-invariant Lebesgue-equivalent Borel measure μ such that T has nonzero Lyapunov exponents with respect to μ, then T has at most a countable number of ergodic components $\{C_i\}$ on each of which the restriction of T is either Bernoulli or Bernoulli times a rotation (by which we mean the support of $\mu_i = \mu|_{C_i}$ consists of a finite number $n_i$ of sets $S^i_1, \ldots, S^i_{n_i}$ cyclically permuted and $T^{n_i}$ is Bernoulli when restricted to each $S^i_j$) (Pesin 1977; Young 1993). This structure theorem has been generalized to SRB measures with nonzero Lyapunov exponents (Ledrappier 1984; Pesin 1977).

Physically Relevant Measures and Strange Attractors
(This paragraph is from the article on Measure-Preserving Systems.) For Hamiltonian systems and other volume-preserving systems, it is natural to consider ergodicity (and other statistical properties) of the system with respect to Lebesgue measure. In dissipative systems, a measure equivalent to Lebesgue may not be invariant (for example, the solenoid). Nevertheless, Lebesgue measure has a distinguished role since sampling by experimenters is done with respect to Lebesgue measure. The idea of a physically relevant measure μ is that it determines the statistical behavior of a positive Lebesgue measure set of orbits, even though the support of μ may have zero Lebesgue measure. An example of such a situation in the uniformly hyperbolic setting is the solenoid Λ, where the attracting set Λ has Lebesgue measure zero and is (locally) topologically the product of a two-dimensional Cantor set and a line segment. Nevertheless, Λ determines the behavior of all points in a solid torus in $\mathbb{R}^3$. More generally, suppose that T : M → M is a diffeomorphism on a compact Riemannian manifold and that m is a version of Lebesgue measure on M, given by a smooth volume form. Although Lebesgue measure m is a distinguished physically relevant measure, m may not be invariant under T, and the system may even be volume contracting in the sense that $m(T^{n}A) \to 0$ for all measurable sets A. Nevertheless, an experimenter might observe long-term “chaotic” behavior whenever the state of the system gets close to some compact invariant set X which attracts a positive m-measure of orbits in the sense that these orbits limit on X. Possibly m(X) = 0, so that X is effectively invisible to the observer except through its effects on orbits not contained in X. The dynamics of T restricted to X can in fact be quite complicated: maybe a full shift, or a shift of finite type, or some other complicated topological dynamical system. Suppose there is a T-invariant measure μ supported on X such that for all continuous functions f : M → ℝ

\[ \frac{1}{n}\sum_{k=0}^{n-1} f \circ T^{k}(x) \to \int_X f \, d\mu, \qquad (6) \]

for a positive m-measure of points x ∈ M. Then the long-term equilibrium dynamics of an observable set of points x ∈ M (i.e., a set of points of positive m measure) is described by (X, T, μ). In this situation, μ is described as a physical
measure. There has been a great deal of research on the properties of systems with attractors supporting physical measures.

In the dissipative nonuniformly hyperbolic setting, the theory of “physically relevant” measures is best developed in the theory of SRB (for Sinai, Ruelle, and Bowen) measures. These dynamically invariant measures may be supported on a set of Lebesgue measure zero yet determine the asymptotic behavior of points in a set of positive Lebesgue measure.

If T is a diffeomorphism of M and μ is a T-invariant Borel probability measure with positive Lyapunov exponents which may be integrated to unstable manifolds, then we call μ an SRB measure if the conditional measure that μ induces on the unstable manifolds is absolutely continuous with respect to the Riemannian volume element on these manifolds. The reason for this definition is technical but can be gleaned from the following observation. Suppose that the diffeomorphism has no zero Lyapunov exponents with respect to μ. Since T is a diffeomorphism, this implies T has negative Lyapunov exponents as well as positive Lyapunov exponents and corresponding local stable manifolds as well as local unstable manifolds. Suppose that a T-invariant set A consists of a union of unstable manifolds and is the support of an ergodic SRB measure μ and that f : M → ℝ is a continuous function. Since μ has absolutely continuous conditional measures on unstable manifolds with respect to conditional Lebesgue measure on the unstable manifolds, almost every point x in the union of unstable manifolds U satisfies

\[ \lim_{n\to\infty} \frac{1}{n}\sum_{j=0}^{n-1} f \circ T^{j}(x) = \int f \, d\mu. \qquad (7) \]

If $y \in W^s_\epsilon(x)$ for such an x ∈ U, then $d(T^{n}x, T^{n}y) \to 0$ and hence (Eq. 7) implies

\[ \lim_{n\to\infty} \frac{1}{n}\sum_{j=0}^{n-1} f \circ T^{j}(y) = \int f \, d\mu. \]

Furthermore, if the holonomy between unstable manifolds defined by sliding along stable manifolds is absolutely continuous (takes sets of zero Lebesgue measure on $W^u$ to sets of zero Lebesgue measure on $W^u$), there is a positive Lebesgue measure of points (namely an unstable manifold and the union of stable manifolds through it) satisfying (Eq. 7). Thus an SRB measure with absolutely continuous holonomy maps along stable manifolds is a physically relevant measure. If the stable foliation possesses this property, it is called absolutely continuous. An Axiom A attractor for a $C^2$ diffeomorphism is an example of an SRB attractor (Bowen 1975; Ruelle 1976, 1978; Sinaĭ 1972). The examples we have given of SRB measures and attractors have been uniformly hyperbolic.

Recently, much progress has been made in understanding the statistical properties of nonuniformly hyperbolic systems by using a tower (see section “Induced Transformations”) to construct SRB measures. We refer to Young’s original papers (Young 1998, 1999), to the book by Baladi (2000a), and to (Young 1993) for a recent survey on SRB measures in the nonuniformly hyperbolic setting.

Unimodal Maps
Maps of an interval to itself are simple examples of nonuniformly hyperbolic systems that have played an important role in the development of dynamical systems theory. Suppose I ⊂ ℝ is an interval; for simplicity, we take I = [0, 1]. A unimodal map is a map T : [0, 1] → [0, 1] such that there exists a point 0 < c < 1 and

• T is $C^2$.
• T′(x) > 0 for x < c, T′(x) < 0 for x > c.
• T′(c) = 0.

Such a map is clearly not uniformly expanding, as |T′(x)| < 1 for points in a neighborhood of c. The family of maps $T_\mu(x) = \mu x(1 - x)$, 0 < μ ≤ 4, is a family of unimodal maps with c = 1/2 and $T_2(1/2) = 1/2$, $T_4(1/2) = 1$.

We could have taken the interval I to be [−1, 1] or indeed any interval with an obvious modification of the definition above. A well-studied family of unimodal maps in this setting is the logistic family $f_a : [-1, 1] \to [-1, 1]$, $f_a(x) = 1 - ax^2$, a ∈ (0, 2). The families are equivalent under a
smooth coordinate change, so statements about one family may be translated into statements about the other.

Unimodal maps are studied because of the insights they offer into transitions from regular or periodic to chaotic behavior as a parameter (e.g., μ or a) is varied, the existence of absolutely continuous measures, and rates of decay of correlations of regular observations for nonuniformly hyperbolic systems.

A result of Jakobson (1981) and Benedicks and Carleson (1985) implies that in the case of the logistic family there is a positive Lebesgue measure set of a such that $f_a$ has an absolutely continuous ergodic invariant measure $\mu_a$. It has been shown by Young (1999) and Keller and Nowicki (1992) that if $f_a$ is mixing with respect to $\mu_a$ then the decay of correlations for Lipschitz observations on I is exponential. It is also known that the set of a such that $f_a$ is mixing with respect to $\mu_a$ has positive Lebesgue measure. There is a well-developed theory concerning the bifurcations the maps $T_\mu$ undergo as μ varies (Collet and Eckmann 1980a). We briefly describe the period-doubling route to chaos in the family $T_\lambda(x) = \lambda x(1 - x)$. For a good account, see (Hasselblatt and Katok 2003a). We let $c_\lambda$ denote the fixed point $\frac{\lambda - 1}{\lambda}$. For $3 < \lambda \le 1 + \sqrt{6}$, all points in [0, 1] except for 0, $c_\lambda$, and their preimages are attracted to a unique periodic orbit $O(p_\lambda)$ of period 2. There is a monotone sequence of parameter values $\lambda_n$ ($\lambda_1 = 3$) such that for $\lambda_n < \lambda \le \lambda_{n+1}$, $T_\lambda$ has a unique attracting periodic orbit $O(\lambda_n)$ of period $2^n$ and for each k = 1, 2, . . . , n − 1 a unique repelling orbit of period $2^k$. All points in the interval [0, 1] except for the repelling periodic orbits and their preimages are attracted to the attracting periodic orbit of period $2^n$. At $\lambda = \lambda_n$, the periodic orbit $O(\lambda_n)$ undergoes a period-doubling bifurcation. Feigenbaum (1978) found that the limit

\[ \delta = \lim_{n\to\infty} \frac{\lambda_n - \lambda_{n-1}}{\lambda_{n+1} - \lambda_n} \approx 4.669\ldots \]

exists and that in a wide class of unimodal maps this period-doubling cascade occurs and the differences between successive bifurcation parameters give the same limiting ratio, an example of universality. At the end of the period-doubling cascade at a parameter $\lambda_\infty \approx 3.569\ldots$, $T_{\lambda_\infty}$ has an invariant Cantor set C (the Feigenbaum attractor) which is topologically conjugate to the dyadic adding machine coexisting with isolated repelling orbits of period $2^n$, n = 0, 1, 2, . . . There is a unique repelling orbit of period $2^n$ for n ≥ 1 along with two fixed points. The Cantor set is the ω-limit set for all points that are not periodic or preimages of periodic orbits. C is the set of accumulation points of periodic orbits. Despite this picture of incredible complexity, the topological entropy is zero for $\lambda \le \lambda_\infty$. For $\lambda > \lambda_\infty$, the map $T_\lambda$ has positive topological entropy and infinitely many periodic orbits whose periods are not powers of 2. For each $\lambda \ge \lambda_\infty$, $T_\lambda$ possesses an invariant Cantor set which is repelling for $\lambda > \lambda_\infty$. We say that $T_\lambda$ is hyperbolic if there is only one attracting periodic orbit and the only recurrent sets are the attracting periodic orbit, repelling periodic orbits, and possibly a repelling invariant Cantor set. It is known that the set of λ ∈ [0, 4] for which $T_\lambda$ is hyperbolic is open and dense (Graczyk and Światek 1997). Remarkably, by Jakobson’s result (Jakobson 1981) there is also a positive Lebesgue measure set of parameters λ for which $T_\lambda$ has an absolutely continuous invariant measure $\mu_\lambda$ with a positive Lyapunov exponent.

Intermittent Maps
Maps of the unit interval T : [0, 1] → [0, 1] which are expanding except at the point x = 0, where they are locally $x \mapsto x + x^{1+\alpha}$, α > 0, have been extensively studied both for the insights they give into rates of decay of correlations for nonuniformly hyperbolic systems (hyperbolicity is lost at the point x = 0, where the derivative is 1) and for their use as models of intermittent behavior in turbulence (Manneville and Pomeau 1980). A fixed point where the derivative is 1 is sometimes called an indifferent fixed point. It is a model of intermittency in the sense that orbits close to 0 will stay close for many iterates (since the expansion is very weak there), and hence a time series of observations will be quite uniform for long periods of time before displaying chaotic type behavior after moving away from the indifferent fixed point into that part of the domain where the map is uniformly expanding.

A particularly simple model (Liverani et al. 1999) is provided by
D(T) be the measurable union of the collection of wandering sets for T. The transformation T is conservative with respect to μ if X \ D(T) = X (mod μ) (for more details, see the recent survey by A. Danilenko and C. Silva in this volume). It is usually necessary to assume T conservative with respect to μ to say anything interesting about its behavior. For example, if T(x) = x + α, α > 0, is a translation of the real line, then D(T) = X. The definition of ergodicity in this setting remains the same: T is ergodic if A ∈ ℬ and $T^{-1}A = A$ mod μ implies that μ(A) = 0 or $\mu(A^c) = 0$. However, the equivalence of ergodicity of T with respect to μ and the equality of time and space averages for $L^1(\mu)$ functions no longer holds. Thus, in general μ ergodic does not imply that

\[ \lim_{n\to\infty} \frac{1}{n}\sum_{j=0}^{n-1} f \circ T^{j}(x) = \int_X f \, d\mu \quad \mu\text{-a.e. } x \in X \]

for all $f \in L^1(\mu)$. In the example of the intermittent map with γ ∈ (1, 2), the orbit of Lebesgue almost every x ∈ X is dense in X, yet the fraction of time spent near the indifferent fixed point x = 0 tends to one for Lebesgue almost every x ∈ X. In fact, it may be shown (Aaronson 1997, Section 2.4) that when μ(X) = ∞ there are no constants $a_n > 0$ such that

\[ \lim_{n\to\infty} \frac{1}{a_n}\sum_{j=0}^{n-1} f \circ T^{j}(x) = \int_X f \, d\mu \quad \mu\text{-a.e. } x \in X. \]

Nevertheless, it is sometimes possible to obtain distributional limits, rather than almost sure limits, of Birkhoff sums under suitable normalization. We refer the reader to Aaronson’s book (Aaronson 1997) for more details.

Constructions

We give examples of some of the standard constructions in dynamical systems. Often these constructions appear in modeling situations (for example, skew products are often used to model systems which react to inputs from other systems, and continuous time systems are often modeled as suspension flows over discrete-time dynamics) or to reduce systems to simpler components (often a factor system or induced system is simpler to study). Unless stated otherwise, in the following we discuss measure-preserving transformations on Lebesgue spaces (see the article on Measure-Preserving Systems).

Products
Given measure-preserving systems (X, ℬ, μ, T) and (Y, 𝒞, ν, S), their product consists of their completed product measure space with the transformation T × S : X × Y → X × Y defined by (T × S)(x, y) = (Tx, Sy) for all (x, y) ∈ X × Y. Neither ergodicity nor transitivity is in general preserved by taking products; for example, the product of an irrational rotation on the unit circle with itself is not ergodic. For a list of which mixing properties are preserved under the taking of products, see (Walters 1982a). Given any countable family of measure-preserving transformations on probability spaces, their direct product is defined similarly.

Factors
We say that a measure-preserving system (Y, 𝒞, ν, S) is a factor of a measure-preserving system (X, ℬ, μ, T) if (possibly after deleting a set of measure 0 from X) there is a measurable onto map f : X → Y such that

\[ f^{-1}\mathcal{C} \subset \mathcal{B}, \qquad fT = Sf, \qquad \text{and} \qquad \mu f^{-1} = \nu. \qquad (8) \]

For Lebesgue spaces, factors of (X, ℬ, μ, T) correspond perfectly with T-invariant complete sub-σ-algebras of ℬ. According to Rokhlin’s theory of Lebesgue spaces (Rohlin 1952) (see the article on Measure-Preserving Systems), factors also correspond perfectly to certain kinds of partitions of X. A factor map f : X → Y between Lebesgue spaces is an isomorphism if and only if it has a measurable inverse, or equivalently $f^{-1}\mathcal{C} = \mathcal{B}$ up to sets of measure 0.
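The failure of ergodicity for the product of an irrational rotation with itself, mentioned under Products, can be seen numerically. The following is a minimal sketch, not taken from the text: the rotation number `ALPHA`, the observables, and all function names are illustrative assumptions. The function (x, y) ↦ cos 2π(x − y) is invariant under the product map and is not constant, so its time averages depend on the starting point instead of converging to the space average 0, whereas for the single rotation the time average of cos 2πx does approach 0.

```python
import math

ALPHA = math.sqrt(2) - 1  # an irrational rotation number (illustrative choice)

def rotation(x):
    """Rotation of the circle [0, 1) by ALPHA."""
    return (x + ALPHA) % 1.0

def product_rotation(point):
    """The product map (T x T)(x, y) = (Tx, Ty) on the torus."""
    x, y = point
    return (rotation(x), rotation(y))

def time_average(observable, step, start, n):
    """Birkhoff average (1/n) * sum over k < n of observable(step^k(start))."""
    total, state = 0.0, start
    for _ in range(n):
        total += observable(state)
        state = step(state)
    return total / n

def circle_observable(x):
    """cos(2 pi x); its integral over the circle is 0."""
    return math.cos(2 * math.pi * x)

def difference_observable(point):
    """cos(2 pi (x - y)); invariant under the product map, integral 0."""
    x, y = point
    return math.cos(2 * math.pi * (x - y))

n = 20000
single = time_average(circle_observable, rotation, 0.1, n)
avg_a = time_average(difference_observable, product_rotation, (0.0, 0.0), n)
avg_b = time_average(difference_observable, product_rotation, (0.0, 0.5), n)
print(single, avg_a, avg_b)
```

Up to rounding, the first average is close to 0 while the other two are close to 1 and −1, so the time averages for the product are not almost everywhere equal to the space average.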
\[ (T \ltimes S)(x, y) = (Tx, S_x y). \qquad (9) \]

The space Y is called the fiber of the skew product and the space X the base. Sometimes in the literature, the word skew product has a more general meaning and refers to the structure $(T \ltimes S)(x, y) = (Tx, S_x y)$ (without any assumption of measure-preservation), where the action of the map on the fiber Y is determined or “driven” by the map T : X → X.

Some common examples of skew products include the following:

Random Dynamical Systems
Suppose $S_x$ is considered a (random) choice of a mapping Y → Y from the set $\{S_x : x \in X\}$. We suppose T : X → X to be the full shift. Then the projection onto Y of the orbits of $(Tx, S_x y)$ give the orbits of a point y ∈ Y under a random composition of maps $S_{T^{n}x} \circ \cdots \circ S_{Tx} \circ S_x$. More generally, we could consider the choice of maps $S_x$ that are composed to come from any ergodic dynamical system (T, X, μ), to model the effect of perturbations by a stationary ergodic “noise” process.

Group Extensions of Dynamical Systems
Suppose Y is a group, ν is a measure on Y invariant under a left group action, and $S_x y := g(x)y$ is given by a group-valued function g : X → Y. In this setting, g is often called a cocycle, since upon defining $g^{(n)}(x)$ by $(T \ltimes S)^{n}(x, y) = (T^{n}x, g^{(n)}(x)y)$ we have a cocycle relation, namely $g^{(m+n)}(x) = g^{(m)}(T^{n}x)\,g^{(n)}(x)$. Group extensions arise often in modeling systems with symmetry (Field and Nicol 2004). Common examples are provided by

is finite μ a.e. We may define the first-return map by

\[ T_B x = T^{n_B(x)} x. \qquad (11) \]

Then (after perhaps discarding as usual a set of measure 0) $T_B : B \to B$ is a measurable transformation which preserves the probability measure $\mu_B = \mu/\mu(B)$. The system $(B, \mathcal{B} \cap B, \mu_B, T_B)$ is called an induced, first-return, or derived transformation. If (T, X, μ, ℬ) is ergodic, then $(B, \mathcal{B} \cap B, \mu_B, T_B)$ is ergodic, but the converse is not in general true.

The construction of the transformation $T_B$ allows us to represent the forward orbit of points in B via a tower or skyscraper over B. For each n = 1, 2, . . . , let

\[ B_n = \{x \in B : n_B(x) = n\}. \qquad (12) \]

Then $\{B_1, B_2, \ldots\}$ form a partition of B, which we think of as the bottom floor or base of the tower. The next floor is made up of $TB_2, TB_3, \ldots$, which form a partition of TB \ B, and so on. All these sets are disjoint. A column is a part of the tower of the form $B_n \cup TB_n \cup \cdots \cup T^{n-1}B_n$ for some n = 1, 2, . . . . The action of T on the entire tower is pictured as mapping each x not at the top of its column straight up to the point Tx above it on the next level, and mapping each point on the top level to $T^{n_B}x \in B$. An equivalent way to describe the transformation on the tower is to write for each n and j < n, $T^{j}B_n$ as $\{(x, j) : x \in B_n\}$, and then the transformation F on the tower becomes
Ergodicity of (T, X, μ) implies the ergodicity of (Ts, XR, ν).

Cutting and Stacking
Many of the most interesting examples in ergodic theory have been constructed by this method; in fact, because of Rokhlin’s Lemma (see section “Rokhlin’s Lemma”) every ergodic measure-preserving transformation on a Lebesgue space is isomorphic to one constructed by cutting and stacking. We could mention especially the von Neumann-Kakutani adding machine (or 2-odometer) (section “Adding Machines”), the Chacon weakly mixing but not strongly mixing system (section “Chacon System”), Ornstein’s mixing rank one examples (see Nadkarni 1998a, p. 160 ff.), and many more.
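As an illustration of the method, here is a minimal sketch of the standard cutting and stacking construction of the von Neumann-Kakutani adding machine on X = [0, 1). It is an assumption that this matches the presentation in the section “Adding Machines”, which is not reproduced here; the function names are hypothetical. At each stage the single column is cut into two halves of equal width and the right half is stacked on the left half. The resulting tower map is compared with the usual closed formula T(x) = x − 1 + 2^(−n) + 2^(−(n+1)) for x in [1 − 2^(−n), 1 − 2^(−(n+1))).

```python
from fractions import Fraction

def cut_and_stack(stages):
    """Return (levels, width): left endpoints of the levels of the single
    column after `stages` cut-and-stack steps, listed from floor to roof."""
    levels = [Fraction(0)]      # stage 0: one column with floor [0, 1)
    width = Fraction(1)
    for _ in range(stages):
        width /= 2
        # Cut every level in half; stack the right halves on the left halves.
        levels = levels + [a + width for a in levels]
    return levels, width

def tower_map(x, levels, width):
    """Translate x to the level directly above it; None on the roof,
    where T is not yet defined at this stage."""
    for k, a in enumerate(levels[:-1]):
        if a <= x < a + width:
            return x - a + levels[k + 1]
    return None

def adding_machine(x):
    """Closed formula for the limiting transformation on [0, 1)."""
    n = 0
    while x >= 1 - Fraction(1, 2 ** (n + 1)):
        n += 1
    return x - 1 + Fraction(1, 2 ** n) + Fraction(1, 2 ** (n + 1))

levels, width = cut_and_stack(3)
print([str(a) for a in levels])
print(tower_map(Fraction(13, 16), levels, width), adding_machine(Fraction(13, 16)))
```

After three stages T is defined everywhere except on the roof [7/8, 1), whose measure halves at each further stage, so T becomes defined a.e. in the limit. Exact rational arithmetic (`Fraction`) is used only to keep the comparison free of rounding.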
We construct a Lebesgue measure-preserving transformation T on an interval X (bounded or maybe unbounded) by defining it as a translation on each of a pairwise disjoint countable collection of subintervals. The construction proceeds by stages, at each stage defining T on an additional part of X, until eventually T is defined a.e.

At each stage, X is represented as a tower, which is defined to be a disjoint union of columns. A column is defined to be a finite disjoint union of intervals of equal length, which are numbered from 0, for the “floor,” to the last one, for the “roof,” and which we picture as lying each above the preceding-numbered one. T is defined on each level of a column (i.e., each interval in the column) except the roof by mapping it by translation to the next higher interval in the column.

At stage 0, we have just one column, consisting of all of X as the floor, and T is not defined anywhere. To pass from one stage to the next, the columns are cut and stacked. This means that each column is divided, by vertical cuts, into a disjoint union of subcolumns of equal height (but maybe not equal width), and then some of these subcolumns are stacked above others (of the same width) so as to form a new tower. This allows the definition of T to be extended to some parts of X that were previously tops of towers, since they now may have levels above them. (Sometimes columns of height 1 are thought of as forming a reservoir for “spacers” to be inserted between subcolumns that are being stacked.) If the measure of the union of the tops of the columns tends to 0, eventually T becomes defined a.e. This description in words can be made precise with cumbersome notation, but the process can also be given a neater graphical description, which we sketch in the next section.

Adic Transformations
A.M. Vershik has introduced a family of models, called adic or Bratteli-Vershik transformations, into ergodic theory and dynamical systems. One begins with a graph which is arranged in levels, finitely many vertices on each level, with connections only from each level to the adjacent ones. The space X consists of the set of all infinite paths in this graph; it is a compact metric space in a natural way. We are given an order on the set of edges into each vertex, and then X is partially ordered as follows: x and y are comparable if they agree from some point on, in which case we say that x < y if at the last level n, where they traverse different edges, the edge $x_n$ of x is smaller than the edge $y_n$ of y. A map T is defined by letting Tx be the smallest y that is larger than x, if there is one. In nice situations, T is a homeomorphism after defining it and its inverse on perhaps countably many maximal and minimal elements. Invariant measures can sometimes be defined by assigning weights to edges, which are then multiplied to define the measure of each cylinder set. This is a nice combinatorial way to present the cutting and stacking method of constructing measure-preserving transformations, allows for more convenient analysis of questions such as orbit equivalence, and leads to the construction of many interesting examples, such as those based on the Pascal or Euler graphs (Bailey et al. 2006; Frick and Petersen; Méla and Petersen 2005). Odometers and generalizations are natural examples of adic systems. Vershik showed that in fact every ergodic measure-preserving transformation on a Lebesgue space is isomorphic to a uniquely ergodic adic transformation. See (Vershik and Livshits 1992).

Rokhlin’s Lemma
The following result is the fundamental starting point for many constructions in ergodic theory, from representing arbitrary systems in terms of cutting and stacking or adic systems, to constructing useful partitions and symbolic codings of abstract systems, to connecting convergence theorems in abstract ergodic theory with those in harmonic analysis. It allows us to picture arbitrarily long stretches of the action of a measure-preserving transformation as a translation within the set of integers. In the ergodic nonatomic case, the statement follows readily from the construction of derived transformations.

Lemma 5.1 (Rokhlin’s Lemma) Let T : X → X be a measure-preserving transformation on a probability space (X, ℬ, μ). Suppose that (X, ℬ, μ) is nonatomic and T : X → X is ergodic, or, more generally, (T, X, ℬ, μ) is aperiodic: that is to say, the set {x ∈ X : there is n ∈ ℕ such that $T^{n}x =$
x} of periodic points has measure 0. Then given n ∈ ℕ and ϵ > 0, there is a measurable set B ⊂ X such that the sets $B, TB, \ldots, T^{n-1}B$ are pairwise disjoint and $\mu\bigl(\bigcup_{k=0}^{n-1} T^{k}B\bigr) > 1 - \epsilon$.

Inverse Limits
Suppose that for each i = 1, 2, . . . we have a Lebesgue probability space $(X_i, \mathcal{B}_i, \mu_i)$ and a measure-preserving transformation $T_i : X_i \to X_i$. Suppose also that for each i ≤ j there is a factor map $f_{ji} : (T_j, X_j, \mathcal{B}_j, \mu_j) \to (T_i, X_i, \mathcal{B}_i, \mu_i)$, such that each $f_{jj}$ is the identity on $X_j$ and $f_{ji} f_{kj} = f_{ki}$ whenever k ≥ j ≥ i. Let

\[ X = \Bigl\{ x \in \prod_{i=1}^{\infty} X_i : f_{ji} x_j = x_i \text{ for all } j \ge i \Bigr\}. \qquad (15) \]

Because $f_{ji}\pi_j = \pi_i$ for all j ≥ i, the $\pi_j^{-1}\mathcal{B}_j$ are increasing, and so their union is an algebra. The set function μ can, with some difficulty, be shown to be countably additive on this algebra: Since we are dealing with Lebesgue spaces, by means of measure-theoretic isomorphisms it is possible to replace the entire situation by compact metric spaces and continuous maps, then use regularity of the measures involved; see (Parthasarathy 2005, p. 137 ff.). Thus, by Carathéodory’s Theorem (see the article on Measure-Preserving Systems), μ extends to all of ℬ.

Define T : X → X by $T(x_j) = (T_j x_j)$. Then (T, X, ℬ, μ) is a measure-preserving system which has all the $(T_j, X_j, \mathcal{B}_j, \mu_j)$ as factors, and any system that factors onto all the $(T_j, X_j, \mathcal{B}_j, \mu_j)$ also factors onto (T, X, ℬ, μ).
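A small concrete instance may help fix the definitions. The sketch below is an illustration chosen for this purpose, not an example from the text: it takes $X_i = \mathbb{Z}/2^i\mathbb{Z}$ with uniform measure, $T_i(x) = x + 1$, and $f_{ji}$ equal to reduction mod $2^i$, whose inverse limit is the dyadic odometer. Points of the inverse limit are truncated to finitely many coordinates, and all names are hypothetical. The code checks the compatibility conditions and that the coordinatewise map preserves the coherence condition in (15).

```python
def T(i, x):
    """T_i: add 1 on X_i = Z / 2^i Z."""
    return (x + 1) % (2 ** i)

def f(j, i, x):
    """Factor map f_ji : X_j -> X_i for j >= i, reduction mod 2^i."""
    assert j >= i
    return x % (2 ** i)

def is_coherent(point):
    """Check f_ji(x_j) == x_i for all j >= i on a truncated point (x_1..x_d)."""
    d = len(point)
    return all(f(j, i, point[j - 1]) == point[i - 1]
               for i in range(1, d + 1) for j in range(i, d + 1))

def inverse_limit_map(point):
    """T(x_j) = (T_j x_j), applied coordinatewise."""
    return tuple(T(j, x) for j, x in enumerate(point, start=1))

def point_from_integer(n, depth):
    """Truncated coherent sequence determined by the integer n."""
    return tuple(n % (2 ** i) for i in range(1, depth + 1))

def intertwines(j, i):
    """Verify f_ji T_j = T_i f_ji on all of X_j."""
    return all(f(j, i, T(j, x)) == T(i, f(j, i, x)) for x in range(2 ** j))

def fiber_sizes(j, i):
    """Sizes of the fibers of f_ji; equal sizes mean uniform measure
    on X_j is pushed to uniform measure on X_i."""
    counts = [0] * (2 ** i)
    for x in range(2 ** j):
        counts[f(j, i, x)] += 1
    return counts

p = point_from_integer(15, 4)
print(p, inverse_limit_map(p), is_coherent(inverse_limit_map(p)))
```

In this example each $T_j$ is a finite rotation, yet the inverse limit is an infinite system that has every one of them as a factor.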
\[ \Omega := \{(x_0, x_1, x_2, \ldots) : x_n = T(x_{n+1}),\ x_n \in X,\ n = 0, 1, 2, \ldots\} \]

with σ : Ω → Ω defined by $\sigma((x_0, x_1, x_2, \ldots)) = (T(x_0), x_0, x_1, \ldots)$. The map σ is invertible on Ω. Given the invariant measure μ, we define the invariant measure $\hat{\mu}$ for the natural extension on Ω by defining it first on cylinder sets $C(A_0, A_1, \ldots, A_k)$ by

and then extending it to Ω using Kolmogorov’s extension theorem. We think of $(x_0, x_1, x_2, \ldots)$ as being an inverse branch of $x_0 \in X$ under the mapping T : X → X. The maps $\sigma, \sigma^{-1} : \Omega \to \Omega$ are ergodic with respect to $\hat{\mu}$ if (T, X, ℬ, μ) is ergodic (Walters 1982a). If π : Ω → X is projection onto the first component, i.e., $\pi(x_0, \ldots, x_n, \ldots) = x_0$, then $\pi \circ \sigma^{n}(x_0, \ldots, x_n, \ldots) = T^{n}(x_0)$ for all $x_0$ and thus the natural extension yields all information about the orbits of X under T.

The natural extension is an inverse limit. Let (X, ℬ, μ) be a Lebesgue probability space and T : X → X a map such that $T^{-1}\mathcal{B} \subset \mathcal{B}$ and $\mu T^{-1} = \mu$. For each i = 1, 2, . . . let $(T_i, X_i, \mathcal{B}_i, \mu_i)$ = (T, X, ℬ,
28 Ergodic Theory: Basic Examples and Constructions
m), and fji ¼ T ji for each j > i. Then the inverse properties of dynamical systems will continue to
limit T, X, ℬ, m of this system is an invertible be an active research area for the foreseeable
future.
measure-preserving system which is the natural
The directions will include, among others, the
extension of (T, X, ℬ, m). We have
following:
1
T ðx1 , x2 , . . .Þ ¼ ðx2 , x3 , . . .Þ: ð17Þ 1. Establishing statistical and ergodic properties
under weakened dependence assumptions.
The original system (T, X, ℬ, m) is a factor of There is a “hierarchy” of probabilistic limit
T, X, ℬ, m (using any πi as the factor map), and theorems including, among others, ergodicity
any factor mapping from an invertible system (akin to the strong law of large numbers for
onto (T, X, ℬ, m) consists of a factor mapping integrable observations); central limit theorem
onto T, X, ℬ, m followed by projection onto (distributional convergence to a Gaussian for
the first coordinate. the scaled Birkhoff sums of an observable with
finite second moment); law of the iterated log-
Joinings arithm (almost sure rate of growth of scaled
Given measure-preserving systems (T, X, ℬ, m) Birkhoff sums); and the almost sure invariance
and (S, Y, C , n), a joining of the two systems is a invariance principle (a strong form of approx-
T S-invariant measure P on their product mea- imation by Brownian motion). Some observ-
surable space that projects to m and n, respectively, ables on some systems exhibit the same limit
under the projections of X Y to X and Y, respec- laws as iid processes, for example, Hölder
tively. That is, if π1 : X Y ! X is the projection observables on smooth hyperbolic systems sat-
onto the first component, i.e., π1(x, y) ¼ x, then isfy the almost sure invariance principle. In
P p11 ðAÞ ¼ mðAÞ for all A ℬ and similarly
other settings, the determinism of the system
for π2 : X Y ! Y. plays a key role, for example, the return time
This concept is the ergodic-theoretic version of statistics of a system to a periodic orbit may
the notion in probability theory of a coupling. The best be described by a compound Poisson pro-
product measure m n is always a joining of the cess rather than a Poisson law (Hirata 1993).
two systems. If product measure is the only join- An important strand of research is determining
ing of the two systems, then we say that they are conditions, usually dynamical and mixing con-
disjoint and write X ⊥ Y (Furstenberg 1967). If D ditions, on a system to determine the form of
is any family of systems, we write D ⊥ for statistics that observables on the system will
the family of all measure-preserving systems display. This research has also produced a rich
which are disjoint from every system in D: Exten- class of counterexamples. A good reference is
sive recent accounts of the use of joinings in (Melbourne and Török 2004).
ergodic theory are in (Glasner 2003a; Rudolph 2. The study of systems which display “anoma-
1990b; Thouvenot 1995a). lous statistics.” By “anomalous statistics” is
usually meant non-Gaussian limit laws such
as the convergence of a scaled observable to a
Future Directions stable law rather than a Gaussian. This arises
basically in two ways in a dynamical system:
The basic examples and constructions presented (1) a nonintegrable observable on a fast mixing
here are idealized, and many of the underlying system; (2) a regular (usually Hölder) observ-
assumptions (such as uniform hyperbolicity) are able on a slowly mixing dynamical system. For
seldom satisfied in applications, yet they have example, scaled Birkhoff sums of a non-
given important insights into the behavior of integrable observable such as d(x, x0)1 on a
real-world physical systems. The ergodic rapidly mixing system such as the doubling
map will converge to a stable law as will scaled
dynamics (Proc. Conf., Yale Univ., New Haven, Conn., 1972; in honor of Gustav Arnold Hedlund), Springer, Berlin, pp. 95–115. Lecture Notes in Math, vol. 318. MR0393339 (52 #14149)
Gallavotti G, Ornstein DS (1974) Billiards and Bernoulli schemes. Commun Math Phys 38:83–101. MR0355003 (50 #7480)
Glasner E (2003a) Ergodic Theory via Joinings, Mathematical Surveys and Monographs, vol 101. American Mathematical Society, Providence. MR1958753 (2004c:37011)
Gouëzel S (2004) Central limit theorem and stable laws for intermittent maps. Probab Theory Relat Fields 128(1):82–122. https://doi.org/10.1007/s00440-003-0300-4. MR2027296
Graczyk J, Świątek G (1997) Generic hyperbolicity in the logistic family. Ann Math 146(1):1–52. MR1469316 (99b:58079)
Guckenheimer J, Holmes P (1990) Nonlinear oscillations, dynamical systems, and bifurcations of vector fields, Applied Mathematical Sciences, vol. 42. Springer-Verlag, New York. Revised and corrected reprint of the 1983 original. MR1139515 (93e:58046)
Grayson M, Pugh C, Shub M (1994) Stably ergodic diffeomorphisms. Ann Math 140(2):295–329. MR1298715 (95g:58128)
Hasselblatt B, Katok A (2003a) A first course in dynamics. Cambridge University Press, New York. With a panorama of recent developments. MR1995704 (2004f:37001)
Hayashi S (1997) Connecting invariant manifolds and the solution of the $C^1$ stability and Ω-stability conjectures for flows. Ann Math 145(1):81–137. MR1432037 (98b:58096)
Hedlund GA (1934) On the metrical transitivity of the geodesics on closed surfaces of constant negative curvature. Ann Math 35(4):787–808. MR1503197
Hirata M (1993) Poisson law for Axiom A diffeomorphisms. Ergodic Theory Dyn Syst 13(3):533–556. https://doi.org/10.1017/S0143385700007513. MR1245828
Hopf E (1939) Statistik der geodätischen Linien in Mannigfaltigkeiten negativer Krümmung. Ber Verh Sächs Akad Wiss Leipzig 91:261–304. (German). MR0001464 (1,243a)
Host B (1995) Nombres normaux, entropie, translations. Israel J Math 91(1–3):419–428. (French, with English summary). MR1348326 (96g:11092)
Huyi H (2004) Decay of correlations for piecewise smooth maps with indifferent fixed points. Ergodic Theory Dyn Syst 24(2):495–524. MR2054191 (2005a:37064)
Jakobson MV (1981) Absolutely continuous invariant measures for one-parameter families of one-dimensional maps. Commun Math Phys 81(1):39–88. MR630331 (83j:58070)
Katok A (1980) Lyapunov exponents, entropy and periodic orbits for diffeomorphisms. Inst Hautes Études Sci Publ Math 51:137–173. MR573822 (81i:28022)
Katok A, Strelcyn J-M, Ledrappier F, Przytycki F (1986) Invariant manifolds, entropy and billiards; smooth maps with singularities, Lecture Notes in Mathematics, vol. 1222, Springer-Verlag, Berlin. MR872698 (88k:58075)
Keane M (1968) Generalized Morse sequences. Z Wahrscheinlichkeitstheorie Verw Gebiete 10:335–353. MR0239047 (39 #406)
Keane M (1977) Non-ergodic interval exchange transformations. Israel J Math 26(2):188–196. MR0435353 (55 #8313)
Keller G, Nowicki T (1992) Spectral theory, zeta functions and the distribution of periodic points for Collet-Eckmann maps. Commun Math Phys 149(1):31–69. MR1182410 (93i:58123)
Kerckhoff S, Masur H, Smillie J (1986) Ergodicity of billiard flows and quadratic differentials. Ann Math 124(2):293–311. MR855297 (88f:58122)
Kolmogorov AN (1954) On conservation of conditionally periodic motions for a small change in Hamilton's function. Dokl Akad Nauk SSSR (NS) 98:527–530. (Russian). MR0068687 (16,924c)
Krieger W (2000) On subshifts and topological Markov chains, Numbers, information and complexity (Bielefeld, 1998). Kluwer Academic Publishers, Boston, pp. 453–472. MR1755380 (2001g:37010)
Lagarias JC (1991) The Farey shift, manuscript
Lagarias JC (1992) Number theory and dynamical systems. In: The unreasonable effectiveness of number theory (Orono, ME, 1991), Proceedings of Symposia in Pure Mathematics, vol. 46, American Mathematical Society, Providence, pp. 35–72. MR1195841 (93m:11143)
Lazutkin VF (1973) Existence of caustics for the billiard problem in a convex domain. Izv Akad Nauk SSSR Ser Mat 37:186–216. (Russian). MR0328219 (48 #6561)
Ledrappier F (1984) Propriétés ergodiques des mesures de Sinaï. Inst Hautes Études Sci Publ Math 59:163–188. (French). MR743818 (86f:58092)
Lind D, Marcus B (1995) An introduction to symbolic dynamics and coding. Cambridge University Press, Cambridge. MR1369092 (97a:58050)
Liverani C, Saussol B, Vaienti S (1999) A probabilistic approach to intermittency. Ergodic Theory Dyn Syst 19(3):671–685. MR1695915 (2000d:37029)
Lyons R (1988) On measures simultaneously 2- and 3-invariant. Israel J Math 61(2):219–224. MR941238 (89e:28031)
Liverani C, Wojtkowski MP (1995) Ergodicity in Hamiltonian systems, Dynamics reported. Dynam. Report. Expositions Dynam. Systems (N.S.), vol. 4, Springer, Berlin, pp. 130–202. MR1346498 (96g:58144)
Manneville P, Pomeau Y (1980) Different ways to turbulence in dissipative dynamical systems. Phys D 1(2):219–226. MR581352 (81h:58041)
Mañé R (1988) A proof of the $C^1$ stability conjecture. Inst Hautes Études Sci Publ Math 66:161–210. MR932138 (89e:58090)
Mañé R (1987a) Ergodic theory and differentiable dynamics. Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol. 8, Springer-Verlag, Berlin, Translated from the Portuguese by Silvio Levy. MR889254 (88c:58040)
Marcus B (1978) The horocycle flow is mixing of all degrees. Invent Math 46(3):201–209. MR0488168 (58 #7731)
Marcus B (1975) Unique ergodicity of the horocycle flow: variable negative curvature case. Israel J Math 21(2–3):133–144. Conference on Ergodic Theory and Topological Dynamics (Kibbutz Lavi, 1974). MR0407902 (53 #11672)
Masur H (1986) Closed trajectories for quadratic differentials with an application to billiards. Duke Math J 53(2):307–314. MR850537 (87j:30107)
Mayer DH (1991) Continued fractions and related transformations. Ergodic theory, symbolic dynamics, and hyperbolic spaces (Trieste, 1989), Oxford Sci. Publ., Oxford University Press, New York, pp. 175–222. MR1130177
Melbourne I, Török A (2004) Statistical limit theorems for suspension flows. Israel J Math 144:191–209. MR2121540 (2006c:37005)
Moser J (1962) On invariant curves of area-preserving mappings of an annulus. Nachr Akad Wiss Göttingen Math-Phys Kl II 1962:1–20. MR0147741 (26 #5255)
Méla X, Petersen K (2005) Dynamical properties of the Pascal adic transformation. Ergodic Theory Dyn Syst 25(1):227–256. MR2122921 (2005k:37012)
Nadkarni MG (1998a) Spectral theory of dynamical systems. Birkhäuser Advanced Texts: Basler Lehrbücher. [Birkhäuser Advanced Texts: Basel Textbooks], Birkhäuser Verlag, Basel. MR1719722 (2001d:37001)
Ornstein D (1970) Bernoulli shifts with the same entropy are isomorphic. Adv Math 4:337–352. MR0257322 (41 #1973)
Ornstein DS, Weiss B (1973) Geodesic flows are Bernoullian. Israel J Math 14:184–198. MR0325926 (48 #4272)
Oseledec VI (1968) A multiplicative ergodic theorem. Characteristic Ljapunov, exponents of dynamical systems. Trudy Moskov Mat Obšč 19:179–210. (Russian). MR0240280 (39 #1629)
Parry W (1966) Symbolic dynamics and transformations of the unit interval. Trans Am Math Soc 122:368–378. MR0197683 (33 #5846)
Parry W (1996) Squaring and cubing the circle – Rudolph's theorem. Ergodic theory of $\mathbb{Z}^d$ actions (Warwick, 1993), London Math Soc Lecture Note Ser, vol. 228, Cambridge University Press, Cambridge, pp. 177–183. MR1411219 (97h:28009)
Parthasarathy KR (2005) Probability measures on metric spaces. AMS Chelsea Publishing, Providence. Reprint of the 1967 original. MR2169627 (2006d:60004)
Pesin JB (1976) Families of invariant manifolds that correspond to nonzero characteristic exponents. Izv Akad Nauk SSSR Ser Mat 40(6):1332–1379, 1440 (Russian). MR0458490 (56 #16690)
Pesin JB (1977) Characteristic Ljapunov exponents, and smooth ergodic theory. Uspehi Mat Nauk 32 no. 4 (196), 55–112, 287 (Russian). MR0466791 (57 #6667)
Phillips E, Varadhan S (eds) (1975) Ergodic theory. Courant Institute of Mathematical Sciences New York University, New York. A seminar held at the Courant Institute of Mathematical Sciences, New York University, New York, 1973–1974; With contributions by S. Varadhan, E. Phillips, S. Alpern, N. Bitzenhofer and R. Adler. MR0486431 (58 #6177)
Pugh C, Shub M (2004) Stable ergodicity. Bull Am Math Soc (N.S.) 41(1):1–41 (electronic). With an appendix by Alexander Starkov. MR2015448 (2005f:37011)
Robbin JW (1971) A structural stability theorem. Ann Math 94:447–493. MR0287580 (44 #4783)
Robinson C (1973) $C^r$ structural stability implies Kupka-Smale. Dynamical systems (Proc Sympos, Univ Bahia, Salvador, 1971). Academic Press, New York, pp. 443–449. MR0334282 (48 #12601)
Robinson C (1975) Errata to: "Structural stability of vector fields" (Ann Math (2) 99 (1974), 154–175). Ann Math 101:368. MR0365630 (51 #1882)
Robinson C (1976) Structural stability of $C^1$ diffeomorphisms. J Differ Equ 22(1):28–73. MR0474411 (57 #14051)
Rohlin VA (1952) On the fundamental ideas of measure theory. Am Math Soc Transl 1952(71):55. MR0047744 (13,924e)
Rudolph DJ (1990a) $\times 2$ and $\times 3$ invariant measures and entropy. Ergodic Theory Dyn Syst 10(2):395–406. MR1062766 (91g:28026)
Rudolph DJ (1990b) Fundamentals of measurable dynamics. Oxford Science Publications, The Clarendon Press Oxford University Press, New York. Ergodic theory on Lebesgue spaces. MR1086631 (92e:28006)
Ruelle D (1978) Thermodynamic formalism, Encyclopedia of Mathematics and its Applications, vol. 5, Addison-Wesley Publishing Co., Reading. The mathematical structures of classical equilibrium statistical mechanics; With a foreword by Giovanni Gallavotti and Gian-Carlo Rota. MR511655 (80g:82017)
Ruelle D (1976) A measure associated with axiom-A attractors. Am J Math 98(3):619–654. MR0415683 (54 #3763)
Sarig O (2002) Subexponential decay of correlations. Invent Math 150(3):629–653. MR1946554 (2004e:37010)
Schweiger F (1995a) Ergodic theory of fibred systems and metric number theory. Oxford Science Publications, The Clarendon Press Oxford University Press, New York. MR1419320 (97h:11083)
Sinaĭ JG (1970) Dynamical systems with elastic reflections. Ergodic properties of dispersing billiards. Uspehi Mat Nauk 25(2):141–192 (Russian). MR0274721 (43 #481)
Sinaĭ JG (1968) Markov partitions and U-diffeomorphisms. Funkcional Anal i Priložen 2(1):64–89. (Russian). MR0233038 (38 #1361)
Sinaĭ JG (1972) Gibbs measures in ergodic theory. Uspehi Mat Nauk 27(4):21–64 (Russian). MR0399421 (53 #3265)
Smale S (1980) The mathematics of time. Springer-Verlag, New York. Essays on dynamical systems, economic processes, and related topics. MR607330 (83a:01068)
Tabachnikov S (2005) Geometry and billiards, Student Mathematical Library, vol 30. American Mathematical Society, Providence. MR2168892 (2006h:51001)
Thouvenot J-P (1995a) Some properties and applications of joinings in ergodic theory. Ergodic theory and its connections with harmonic analysis (Alexandria, 1993), London Math Soc Lecture Note Ser, vol. 205. Cambridge University Press, Cambridge, pp. 207–235. MR1325699 (96d:28017)
Vershik AM, Livshits AN (1992) Adic models of ergodic transformations, spectral theory, substitutions, and related topics, Representation theory and dynamical systems. Adv Soviet Math, vol. 9, American Mathematical Society, Providence, pp. 185–204. MR1166202 (93i:46131)
Vorobets YB, Gal'perin GA, Stëpin AM (1992) Periodic billiard trajectories in polygons: generation mechanisms. Uspekhi Mat. Nauk 47(285):9–74, 207 (Russian, with Russian summary); English transl, Russian Math Surveys 47(3) (1992):5–80. MR1185299 (93h:58088)
Walters P (1982a) An Introduction to Ergodic Theory, Graduate Texts in Mathematics, vol 79. Springer-Verlag, New York. MR648108 (84e:28017)
Young L-S (1998) Statistical properties of dynamical systems with some hyperbolicity. Ann Math 147(3):585–650. MR1637655 (99h:58140)
Young L-S (1999) Recurrence times and rates of mixing. Israel J Math 110:153–188. MR1750438 (2001j:37062)
Young L-S (1993) Ergodic theory of chaotic dynamical systems. In: From Topology to Computation: Proceedings of the Smalefest (Berkeley, CA, 1990), Springer, New York, pp. 201–226. MR1246120 (94i:58112)
Young L-S (2017) Generalizations of SRB measures to nonautonomous, random, and infinite dimensional systems. J Stat Phys 166(3–4):494–515. https://doi.org/10.1007/s10955-016-1639-0. MR3607578
Zemljakov AN, Katok AB (1975) Topological transitivity of billiards in polygons. Mat Zametki 18(2):291–300. (Russian). MR0399423 (53 #3267)

Books and Reviews
Baladi V (2000b) Positive transfer operators and decay of correlations, Advanced Series in Nonlinear Dynamics, vol 16. World Scientific Publishing Co. Inc., River Edge. MR1793194 (2001k:37035)
Billingsley P (1978b) Ergodic theory and information. Robert E. Krieger Publishing Co., Huntington. Reprint of the 1965 original. MR524567 (80b:28017)
Billingsley P (1995) Probability and measure, 3rd ed., Wiley Series in Probability and Mathematical Statistics. John Wiley & Sons Inc., New York. A Wiley-Interscience Publication. MR1324786 (95k:60001)
Bonatti C, Díaz LJ, Viana M (2005) Dynamics beyond uniform hyperbolicity, Encyclopaedia of Mathematical Sciences, vol. 102, Springer-Verlag, Berlin. A global geometric and probabilistic perspective; Mathematical Physics, III. MR2105774 (2005g:37001)
Boyarsky A, Góra PL (1997b) Laws of chaos, Probability and its Applications. Birkhäuser Boston Inc., Boston. Invariant measures and dynamical systems in one dimension. MR1461536 (99a:58102)
Brin M, Stuck G (2002) Introduction to dynamical systems. Cambridge University Press, Cambridge. MR1963683 (2003m:37001)
Carleson L, Gamelin TW (1993b) Complex dynamics, Universitext: Tracts in Mathematics. Springer-Verlag, New York. MR1230383 (94h:30033)
Chernov N, Markarian R (2006b) Chaotic billiards, Mathematical Surveys and Monographs, vol 127. American Mathematical Society, Providence. MR2229799 (2007f:37050)
Collet P, Eckmann J-P (1980b) Iterated maps on the interval as dynamical systems, Progress in Physics, vol 1. Birkhäuser, Boston. MR613981 (82j:58078)
Cornfeld IP, Fomin SV, Sinaĭ YG (1982b) Ergodic Theory. Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 245, Springer-Verlag, New York. Translated from the Russian by A. B. Sosinskiĭ. MR832433 (87f:28019)
Denker M, Grillenberger C, Sigmund K (1976) Ergodic theory on compact spaces. Springer-Verlag, Berlin. Lecture Notes in Mathematics, vol. 527. MR0457675 (56 #15879)
Friedman NA (1970) Introduction to Ergodic Theory. Van Nostrand Reinhold Co., New York. Van Nostrand Reinhold Mathematical Studies, No. 29. MR0435350 (55 #8310)
Glasner E (2003b) Ergodic Theory via Joinings, Mathematical Surveys and Monographs, vol 101. American Mathematical Society, Providence. MR1958753 (2004c:37011)
Halmos PR (1960) Lectures on Ergodic Theory. Chelsea Publishing Co., New York. MR0111817 (22 #2677)
Hasselblatt B, Katok A (2003b) A first course in dynamics. Cambridge University Press, New York. With a panorama of recent developments. MR1995704 (2004f:37001)
Hopf E (1937) Ergodentheorie, 1st ed., Ergebnisse der Mathematik und ihrer Grenzgebiete 5. Bd., 2. Jaden Springer, Berlin
Jacobs K (1965) Einige neuere Ergebnisse der Ergodentheorie. Jber Deutsch Math-Verein 67(Abt. 1):143–182 (German). MR0186789 (32 #4244)
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems. In: Encyclopedia of mathematics and its applications, vol. 54, Cambridge University Press, Cambridge. With a supplementary chapter by Katok and Leonardo Mendoza. MR1326374 (96c:58055)
Keller G (1998) Equilibrium States in Ergodic Theory, London Mathematical Society Student Texts, vol 42. Cambridge University Press, Cambridge. MR1618769 (99e:28022)
Lucarini V, Faranda D, Moreira ACGM, de Freitas J, Milhazes M, de Freitas M, Holland TK, Nicol M, Todd M, Vaienti S (eds) (2016) Extremes and recurrence in dynamical systems, Pure and Applied Mathematics (Hoboken). John Wiley & Sons, Inc., Hoboken. MR3558780
Mañé R (1987b) Ergodic Theory and Differentiable Dynamics. Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol. 8, Springer-Verlag, Berlin. Translated from the Portuguese by Silvio Levy. MR889254 (88c:58040)
Nadkarni MG (1998b) Spectral theory of Dynamical Systems. Birkhäuser Advanced Texts: Basler Lehrbücher [Birkhäuser advanced texts: Basel textbooks]. Birkhäuser Verlag, Basel. MR1719722 (2001d:37001)
Ornstein DS, Rudolph DJ, Weiss B (1982) Equivalence of measure preserving transformations. Mem Am Math Soc 37(262):xii+116. MR653094 (85e:28026)
Parry W, Pollicott M (1990) Zeta functions and the periodic orbit structure of hyperbolic dynamics. Astérisque 187–188:268 (English, with French summary). MR1085356 (92f:58141)
Petersen K (1989) Ergodic theory, Cambridge Studies in Advanced Mathematics, vol 2. Cambridge University Press, Cambridge. Corrected reprint of the 1983 original. MR1073173 (92c:28010)
Royden HL (1988) Real analysis, 3rd edn. Macmillan Publishing Company, New York. MR1013117 (90g:00004)
Rudolph DJ (1990c) Fundamentals of measurable dynamics. Oxford Science Publications, The Clarendon Press Oxford University Press, New York, Ergodic theory on Lebesgue spaces. MR1086631 (92e:28006)
Schweiger F (1995b) Ergodic theory of fibred systems and metric number theory. Oxford Science Publications, The Clarendon Press Oxford University Press, New York. MR1419320 (97h:11083)
Thouvenot J-P (1995b) Some properties and applications of joinings in ergodic theory. Ergodic theory and its connections with harmonic analysis (Alexandria, 1993), London Math Soc Lecture Note Ser, vol. 205, Cambridge University Press, Cambridge, pp. 207–235. MR1325699 (96d:28017)
Walters P (1982b) An Introduction to Ergodic Theory, Graduate Texts in Mathematics, vol 79. Springer-Verlag, New York. MR648108 (84e:28017)
Ergodicity and Mixing Properties

Terrence Adams¹ and Anthony Quas²
¹ Department of Mathematics and Statistics, State University of New York, Albany, NY, USA
² Department of Mathematics and Statistics, University of Victoria, Victoria, BC, Canada

Article Outline

Glossary
Introduction
Ergodicity
Mixing
Hyperbolicity and Decay of Correlations
Representations, Realizations, and Genericity
Future Directions
References

Ergodicity is the condition under which the strong law of large numbers holds for a dynamical system. A case can be made that ergodicity holds in situations much broader than is typically utilized in statistics, machine learning, and other scientific disciplines. This is not surprising given the power of the pointwise ergodic theorem and the ability to decompose a space into ergodic components. Also, in the first half of the twentieth century, it was shown that topologically generic dynamical systems are weak mixing, rigid, and thus, zero-entropy (and not i.i.d.). This chapter will give a comprehensive account of mixing properties in ergodic theory including Bernoulli, strong mixing, weak mixing, mild, partial, and light mixing. A hierarchy of mixing properties is presented (Fig. 2), along with important examples, as well as applications using the various properties. This chapter places many of the breakthroughs from ergodic theory into a common framework with an eye toward emerging mathematical and scientific areas and concludes with a collection of important unsolved problems.

Glossary

Bernoulli shift: Mathematical abstraction of the scenario in statistics or probability in which one performs repeated independent identical experiments.
Markov chain: A probability model describing a sequence of observations made at regularly spaced time intervals such that at each time, the probability distribution of the subsequent observation depends only on the current observation and not on prior observations.
Measure-preserving transformation: A map from a measure space to itself such that each measurable subset of the space has the same measure as its inverse image under the map.
Measure-theoretic entropy: Non-negative (possibly infinite) real number describing the complexity of a measure-preserving transformation.
Product transformation: Given a pair of measure-preserving transformations, $T$ of $X$ and $S$ of $Y$, the product transformation is the map of $X \times Y$ given by $(T \times S)(x, y) = (T(x), S(y))$.
Strong mixing: Given two sets, over time the probability that points from one set end up in the other set converges to the product of the probabilities of the two sets.
Weak mixing: There exists a sequence of natural numbers with density 1 in the set of all natural numbers such that the transformation is strong mixing on that sequence.

Introduction

The term "ergodic" was introduced by Boltzmann (1871, 1909) in his work on statistical mechanics, where he was studying Hamiltonian systems with large numbers of particles. The system is described at any time by a point of phase space, a subset of $\mathbb{R}^{6N}$ where $N$ is the number of particles. The configuration describes the three-dimensional position and velocity of each of the $N$ particles. It has long
been known that the Hamiltonian (i.e., the overall energy of the system) is invariant over time in these systems. Thus, given a starting configuration, all future configurations as the system evolves lie on the same energy surface as the initial one. Boltzmann's ergodic hypothesis was that the trajectory of the configuration in phase space would fill out the entire energy surface. The term "ergodic" is thus an amalgamation of the Greek words for work and path. This hypothesis then allowed Boltzmann to conclude that the long-term average of a quantity as the system evolves would be equal to its average value over the phase space.

Subsequently, it was realized that this hypothesis is rarely satisfied. The ergodic hypothesis was replaced in 1911 by the quasi-ergodic hypothesis of the Ehrenfests (1911), which stated instead that each trajectory is dense in the energy surface, rather than filling out the entire energy surface. The modern notion of ergodicity (to be defined below) is due to Birkhoff and Smith (1924). Koopman (1931) suggested studying a measure-preserving transformation by means of the associated isometry on Hilbert space, $U_T : L^2(X) \to L^2(X)$ defined by $U_T(f) = f \circ T$. This point of view was used by von Neumann (1932) in his proof of the mean ergodic theorem. This was followed closely by Birkhoff (1931) proving the pointwise ergodic theorem. An ergodic measure-preserving transformation enjoys the property that Boltzmann first intended to deduce from his hypothesis: that long-term averages of an observable quantity coincide with the integral of that quantity over the phase space.

These theorems allow one to deduce a form of independence on the average: given two sets of configurations $A$ and $B$, one can consider the volume of the phase space consisting of points that are in $A$ at time 0 and in $B$ at time $t$. In an ergodic measure-preserving transformation, if one computes the average of the volumes of these regions over time, the ergodic theorems mentioned above allow one to deduce that the limit is simply the product of the volume of $A$ and the volume of $B$. This is the weakest mixing-type property. This chapter outlines a rather full range of mixing properties with ergodicity at the weakest end and the Bernoulli property at the strongest end.

This chapter sets out in some detail the various mixing properties, basing the study on a number of concrete examples sitting at various points of this hierarchy. Many of the mixing properties may be characterized in terms of the Koopman operators mentioned above (i.e., they are spectral properties), but the strongest mixing properties are not spectral in nature. Also, this chapter highlights many of the connections between mixing properties and their spectral reformulations; however, the interested reader is referred to the chapter ▶ "Spectral Theory of Dynamical Systems" (Lemancyzk and Kanigowski 2022) for detailed definitions and results focused mainly on the spectrum of measure preserving systems.

This chapter will also bring to light some of the connections between the range of mixing properties and measure-theoretic entropy. In measure-preserving transformations that arise in practice, there is a correlation between strong mixing properties and positive entropy, although many of these properties are logically independent.

One important issue for which many questions remain open is that of higher-order mixing. Here, instead of asking that the observations at two times separated by a large time $T$ be approximately independent, one asks whether observations made at more times, each pair suitably separated, can be expected to be approximately independent. This issue has an analogue in probability theory, where it is well-known that it is possible to have a collection of random variables that are pairwise independent, but not mutually independent.

Basics, Examples, and Highlighted Applications
In this chapter, except where otherwise stated, the measure-preserving transformations under consideration are defined on probability spaces. More specifically, given a measurable space $(X, \mathcal{B})$ and a probability measure $\mu$ defined on $\mathcal{B}$, a measure-preserving transformation of $(X, \mathcal{B}, \mu)$ is a $\mathcal{B}$-measurable map $T : X \to X$ such that $\mu(T^{-1}B) = \mu(B)$ for all $B \in \mathcal{B}$.

While this definition makes sense for arbitrary measures, not simply probability measures, most of the results and definitions below only make sense in the probability measure case. Sometimes it will be helpful to make the assumption that the underlying probability space is a Lebesgue space (i.e., the space
together with its completed σ-algebra agrees up to a measure-preserving bijection a.e. with the unit interval with Lebesgue measure and the usual σ-algebra of Lebesgue measurable sets). Although this sounds like a strong restriction, in practice it is barely a restriction at all, as almost all of the spaces that appear in the theory (and all of those that appear in this chapter) turn out to be Lebesgue spaces. For a detailed treatment of the theory of Lebesgue spaces, the reader is referred to Rudolph's book (1990). The reader is referred also to the chapter on "Measure Preserving Systems".

While many of the definitions presented are valid for both invertible and noninvertible measure-preserving transformations, the strongest mixing conditions are most useful in the case of invertible transformations.

It will be helpful to present a selection of foundational examples, relative to which ergodicity and the various notions of mixing are explored. These examples and the lemmas necessary to show that they are measure-preserving transformations as claimed may be found in the books of Petersen (1983), Rudolph (1990), Walters (1982), and Silva (2008). More details on these examples can also be found in the chapter on ▶ "Ergodic Theory: Basic Examples and Constructions."

form $\ldots x_{-2} x_{-1} \cdot x_0 x_1 x_2 \ldots$ (the $\cdot$ is a placeholder that allows us to distinguish (for example) between the sequences $\ldots 01010 \cdot 10101 \ldots$ and $\ldots 10101 \cdot 01010 \ldots$).

A Bernoulli shift is built on the shift map $S$, defined on $A^{\mathbb{N}}$ by $(S(x))_n = x_{n+1}$ and on $A^{\mathbb{Z}}$ by the same formula. Note that $S$ is invertible as a transformation on $A^{\mathbb{Z}}$ but noninvertible as a transformation on $A^{\mathbb{N}}$.

It is necessary to equip $A^{\mathbb{N}}$ and $A^{\mathbb{Z}}$ with measures. This is done by defining the measure of a preferred class of sets, checking certain consistency conditions and appealing to the Kolmogorov extension theorem. Here the preferred sets are the cylinder sets. Given $m \le n$ in the invertible case and a sequence $a_m \ldots a_n$, let $[a_m \ldots a_n]_m^n$ denote $\{x \in A^{\mathbb{Z}} : x_m = a_m, \ldots, x_n = a_n\}$ and define $\mu([a_m \ldots a_n]_m^n) = P_{a_m} P_{a_{m+1}} \cdots P_{a_n}$. This is then shown to uniquely define a measure $\mu$ on the σ-algebra of $A^{\mathbb{Z}}$ generated by the cylinder sets. It is immediate that for any cylinder set $C$, $\mu(S^{-1}C) = \mu(C)$, and it follows that $S$ is a measure-preserving transformation of $(A^{\mathbb{Z}}, \mathcal{B}, \mu)$. The construction is exactly analogous in the noninvertible case. See the chapter on "Measure Preserving Systems" or the books of Walters (1982) or Rudolph (1990) for more details of defining measures in these systems.
Some Iconic Examples
Example 1 (Rotation on the circle). Let α ℝ.
Example 4 (Markov Shift). The spaces Aℕ and
Let Rα : [0, 1) ! [0, 1) be defined by Rα(x) ¼ x þ
Aℤ are exactly as above, as is the shift map. All
α mod 1. It is straightforward to verify that Rα
that changes is the measure.
preserves the restriction of Lebesgue measure l to
To define a Markov shift, it is necessary to have
[0, 1) (it is sufficient to check that l R1
a ðJ Þ ¼ a stochastic matrix P (i.e., a matrix with non-
lðJ Þ for an interval J) negative entries whose rows sum to (1) with rows
and columns indexed by A and a left eigenvector π
Example 2 (Doubling Map). Let
for P with eigenvalue 1 with the property that the
M2 : [0, 1) ! [0, 1) be defined by M2(x) ¼ 2x
entries of π are non-negative and sum to 1. The
mod 1. Again, Lebesgue measure is invariant
existence of such an eigenvector is a consequence
under M2 (to see this, one observes that for an
of the Perron-Frobenius theory of positive matri-
interval J, M1
2 ðJ Þ consists of two intervals, each ces. Provided that the matrix P is irreducible (for
of half the length of J). This may be generalized in each a and a0 in A, there is an n > 0 such that
the obvious way to a map Mk for any integer k 2. Pna,a0 > 0), the eigenvector π is unique.
Given the pair (P, π), one defines the measure
Example 3 (Bernoulli Shift). Let A be a finite set
of a cylinder set by m ½am . . . an nm ¼
and fix a vector ( pi)i A of positive numbers that
sum to 1. Let Aℕ denote the set of sequences of the pam Pam amþ1 . . . Pan1 an and extends m as before to
form x0x1x2. . ., where xn A for each n ℕ and a probability measure on Aℕ or Aℤ.
let Aℤ denote the set of bi-infinite sequences of the
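The cylinder-set formula for a Markov shift can be checked by hand on a small example. The following is a minimal sketch, assuming a hypothetical two-state matrix `P` and vector `pi` chosen only for illustration (neither appears in the text). It uses exact rational arithmetic to confirm that `pi` is a left eigenvector with eigenvalue 1, that the cylinders of a fixed length have total measure 1, and that each cylinder has the same measure as its preimage under the one-sided shift, which is a disjoint union over the symbol placed in front of the word.

```python
from fractions import Fraction as Fr
from itertools import product

# Hypothetical two-state chain on the alphabet A = {0, 1}; rows of P sum to 1.
P = [[Fr(1, 2), Fr(1, 2)],
     [Fr(1, 4), Fr(3, 4)]]
# Candidate left eigenvector with eigenvalue 1, entries summing to 1.
pi = [Fr(1, 3), Fr(2, 3)]


def is_stationary(pi, P):
    """Check pi P = pi exactly."""
    k = len(pi)
    return all(sum(pi[a] * P[a][b] for a in range(k)) == pi[b] for b in range(k))


def cylinder_measure(word, pi, P):
    """pi_{a_m} P_{a_m a_{m+1}} ... P_{a_{n-1} a_n} for the word a_m ... a_n."""
    mu = pi[word[0]]
    for a, b in zip(word, word[1:]):
        mu *= P[a][b]
    return mu


def preimage_measure(word, pi, P):
    """Measure of S^{-1}[word]: sum over the symbol placed in front of the word."""
    return sum(cylinder_measure((a,) + tuple(word), pi, P) for a in range(len(pi)))


words = list(product(range(2), repeat=3))
total = sum(cylinder_measure(w, pi, P) for w in words)
invariant = all(preimage_measure(w, pi, P) == cylinder_measure(w, pi, P) for w in words)
print(is_stationary(pi, P), total, invariant)
```

The invariance check succeeds precisely because `pi` is stationary; replacing `pi` by a non-stationary probability vector would make the preimage comparison fail for some words.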
isomorphism, while others are not. If a property is invariant under spectral isomorphism, it is said that it is a spectral property.
There are a number of mixing type properties that occur in the probability literature ($\alpha$-mixing, $\beta$-mixing, $\phi$-mixing, $\psi$-mixing, etc.) (see Bradley's (2005) survey for a description of these conditions). In the case of a stationary process, $X_i$ for $-\infty < i < \infty$, the process can be modeled using a measure preserving transformation $T$ and measurable function $f$ such that $X_i = f \circ T^{i-1}$. These mixing properties from the probability literature often constrain the function $f$ and are not preserved under measure-theoretic isomorphism. One example of this distinction is that finite ergodic Markov chains are measure-theoretically isomorphic to Bernoulli shifts; however, these systems are viewed as distinct varieties of stationary processes. For this reason, these properties are not widely used in ergodic theory, although $\beta$-mixing turns out to be equivalent to the so-called weak Bernoulli property (which turns out to be stronger than the Bernoulli property that is discussed in this chapter; see Smorodinsky's (1971) paper) and $\alpha$-mixing is equivalent to strong mixing.
A basic construction (see the chapter on ▶ "Ergodic Theory: Basic Examples and Constructions") that is required in what follows is the product of a pair of measure-preserving transformations: Given transformations $T$ of $(X, \mathcal{B}, \mu)$ and $S$ of $(Y, \mathcal{F}, \nu)$, the product transformation $T \times S$ is defined on $(X \times Y, \mathcal{B} \otimes \mathcal{F}, \mu \times \nu)$ by $(T \times S)(x, y) = (Tx, Sy)$.
One issue that is faced on occasion is that it is sometimes convenient to deal with noninvertible measure-preserving transformations. It turns out that given a noninvertible measure-preserving transformation, there is a natural way to uniquely associate an invertible measure-preserving transformation sharing almost all of the ergodic properties of the original transformation. Specifically, given a noninvertible measure-preserving transformation $T$ of $(X, \mathcal{B}, \mu)$, one lets $\bar{X} = \{(x_0, x_1, \ldots) : x_n \in X \text{ and } T(x_n) = x_{n-1} \text{ for all } n \ge 1\}$, $\bar{\mathcal{B}}$ be the $\sigma$-algebra generated by sets of the form $\bar{A}_n = \{\bar{x} \in \bar{X} : x_n \in A\}$, $\bar{\mu}(\bar{A}_n) = \mu(A)$ and $\bar{T}(x_0, x_1, \ldots) = (T(x_0), x_0, x_1, \ldots)$. The transformation $\bar{T}$ of $(\bar{X}, \bar{\mathcal{B}}, \bar{\mu})$ is called the natural extension of the transformation $T$ of $(X, \mathcal{B}, \mu)$ (see the chapter on "Ergodic Theory: Basic Examples and Constructions" for more details). In situations where one wants to use invertibility, it is often possible to pass to the natural extension, work there, and then derive conclusions about the original noninvertible transformation.
Arguably, the simplest example of an extension is a two-point extension. Given an ergodic measure-preserving transformation $T$ acting on $(X, \mathcal{B}, \mu)$, this can be viewed as selecting a switching set $A \subset X$ of positive measure. The two-point extension is composed of two copies $(X_1, \mathcal{B}, \mu, T_1)$ and $(X_2, \mathcal{B}, \mu, T_2)$ with mirror copies $A_1$, $A_2$ of the switching set $A$. Also, there is a measure-preserving bijection $\phi : X_1 \to X_2$ between the two mirror copies such that $A_2 = \phi(A_1)$. The two-point extension $S$ is defined as,
$$S(x) = \begin{cases} T_1(x) & \text{if } x \in X_1 \setminus A_1 \\ T_2(x) & \text{if } x \in X_2 \setminus A_2 \\ T_2(\phi(x)) & \text{if } x \in A_1 \\ T_1(\phi^{-1}(x)) & \text{if } x \in A_2. \end{cases}$$
Establishing that mixing properties such as Bernoulli extend to a two-point extension is already nontrivial (Rudolph 1978; Kammeyer 1990). These notions generalize to more general group extensions, as well as nonalgebraic extensions such as the roof over a transformation.
A roof over a transformation is constructed by identifying a subset $A \subset X$ of positive measure and extending the measure space using a measurable set $A'$ and invertible measure preserving isomorphism $\phi$ such that $A' = \phi(A)$ is disjoint from $X$. The set $A'$ is a disjoint copy of $A$. In this case, the extension $S$ is defined as,
$$S(x) = \begin{cases} T(x) & \text{if } x \in X \setminus A \\ \phi(x) & \text{if } x \in A \\ T(\phi^{-1}(x)) & \text{if } x \in A'. \end{cases}$$
The normalized measure $\mu'$ defined as $\mu'(B) = \mu(B)/(1 + \mu(A'))$ is an invariant probability
measure under the action of $S$. The first weak mixing, nonstrong mixing transformation due to Kakutani and von Neumann was constructed as a carefully designed roof over an irrational rotation (see previous footnote 1).
The notion of induced (or derived) transformation (Kakutani 1943) is the reverse notion from the "roof" extension. Given a subset $A \subset X$ of positive measure, the induced transformation $T_A$ is defined on $A$ as
$$T_A(x) = T^{r_A(x)}(x),$$
where $r_A(x) = \inf\{n \ge 1 : T^n(x) \in A\}$. If $T$ is ergodic, invertible, and measure-preserving, then so is $T_A$. Also, the entropy of an induced transformation may be computed directly from the entropy $h(T)$ of $T$, as $h(T_A) = \frac{1}{\mu(A)} h(T)$ (Abramov 1959). Interestingly, it was established that for an ergodic transformation, there exists a dense collection of sets $A$ such that $T_A$ is weak mixing (Chacon 1966a), and then a dense collection of $A$ such that $T_A$ is strongly mixing (Friedman and Ornstein 1973). Hence, one method for obtaining a zero-entropy strong mixing transformation is to apply the result of Friedman-Ornstein (1973) to any zero-entropy ergodic transformation. The notion of an induced transformation has led to the theory of Kakutani equivalence and many important results for both transformations and flows. For example, see results of Ambrose and Kakutani (1942) and Rudolph (1976).

Highlighted Applications and Connections
Application 1 (Hard Sphere Gases and Billiards). We wish to model the behavior of a gas in a bounded region. We make the assumption that the gas consists of a large number $N$ of identical balls which move at constant velocity until two balls collide, whereupon they elastically swap momentum along the direction of contact. The phase space for this system is a region of $\mathbb{R}^{6N}$ (with $N$ 3-dimensional position vectors and $N$ 3-dimensional velocity vectors). More abstractly, the system is equivalent to the motion of a single point particle in a region of $\mathbb{R}^M \times \mathbb{R}^M$ (with the first $M$-vector representing position and the second representing velocity). The system is constrained in that its position is required to lie in a bounded region $S$ of $\mathbb{R}^M$ with a piecewise smooth boundary. The system evolves by moving the position at a constant rate in the direction of the velocity vector until the point reaches $\partial S$, at which time the component of the velocity parallel to the normal to $\partial S$ is reversed. This then defines a flow (i.e., a family of maps $(T_t)_{t \in \mathbb{R}}$ satisfying $T_{t+s} = T_t \circ T_s$) on the phase space. Since the magnitude of the velocity is conserved, it is convenient to restrict to flows with speed 1. This system is clearly the closest of the examples that we consider to the situation envisaged by Boltzmann. Perhaps not surprisingly, proofs of even the most basic properties for this system are much harder than the other examples that we consider.

Application 2 (Combinatorial Number Theory). The legendary mathematician Paul Erdős was known to disavow life's material possessions and instead give complete devotion to mathematics, and of course, his own mother. He used all disposable cash to fund a growing number of bounties on unsolved math problems. His first major award went to Szemerédi for proving that any sequence of natural numbers with positive upper density contains arbitrarily long arithmetic progressions. The award was $1000 in 1974. Fast forward to 2004, mathematicians Green and Tao extended Szemerédi's results to prove that the prime numbers contain arbitrarily long arithmetic progressions (Green and Tao 2008). A conjecture of Erdős continues to remain open: given any increasing sequence of natural numbers $n_i$ such that $\sum_{i=1}^{\infty} \frac{1}{n_i} = \infty$, the sequence $n_i$ contains arbitrarily long arithmetic progressions. A positive answer to this question would imply the primes contain arbitrarily long arithmetic progressions.
In the meantime (circa 1977), Hillel Furstenberg proved Szemerédi's theorem using ergodic theory techniques (Furstenberg 1977). Furstenberg established a transference principle that recast number theoretic questions into properties of measure-preserving dynamical systems. Furstenberg, Bergelson, and others spearheaded new research on multiple
recurrence of dynamical systems to obtain far-reaching generalizations of the original Szemerédi's theorem, including to higher order group actions (Furstenberg and Katznelson 1978; Bergelson and Leibman 2002) and probabilistic versions for occurrences of arithmetic progressions based on multidimensional ergodic theorems (Bergelson 1987; Conze and Lesigne 1984; Zhang 1996; Frantzikinakis and Kra 2005; Tao 2008; Austin 2010). Many of the ideas from Furstenberg's ergodic theoretic approach find themselves in the ideas of Green and Tao. To better comprehend the impact of Furstenberg's breakthroughs, as well as Tao and Green's, here is an excerpt from Tao's 2006 Fields Medal citation:

Of special note is his joint work with Ben Green, a Clay Research Fellow from 2005 through 2007. In their 2004 paper, "The primes contain arbitrarily long arithmetic progressions," the authors answered in the affirmative a long-standing conjecture that had resisted many attempts.

VC-dimension for almost all i.i.d. samples (Vapnik and Chervonenkis 2015).
Using ergodic theory techniques, it was shown that Vapnik-Chervonenkis uniform convergence holds for stationary ergodic processes (Adams and Nobel 2010).

Theorem 1 (Adams and Nobel 2010). Let $\mathcal{X}$ be a complete separable metric space equipped with its Borel measurable subsets $\mathcal{S}$, and let $\mathcal{C} \subseteq \mathcal{S}$ be any countable family of sets. If $\dim(\mathcal{C}) < \infty$, then for every stationary ergodic process $\mathbf{X} = X_1, X_2, \ldots$ taking values in $(\mathcal{X}, \mathcal{S})$,
$$\Gamma_m(\mathcal{C} : \mathbf{X}) = \sup_{C \in \mathcal{C}} \left| \frac{1}{m} \sum_{i=1}^{m} I(X_i \in C) - P(X \in C) \right| \to 0 \quad \text{wp1} \qquad (1)$$
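The quantity in (1) can be computed along a single sample path. The sketch below is only an illustration under stated assumptions: the process is taken to be the orbit of one point under an irrational circle rotation (stationary and ergodic when the starting point is Lebesgue-distributed), the family of sets is a finite collection of intervals $[0, q)$, and the names `ALPHA`, `THRESHOLDS`, `gamma_m` and the starting point `x0` are hypothetical choices made here, not taken from Adams and Nobel (2010).

```python
import math

ALPHA = (math.sqrt(5) - 1) / 2               # irrational rotation angle (assumed for illustration)
THRESHOLDS = [k / 20 for k in range(1, 20)]  # the family C = {[0, q)} for q on a finite grid


def gamma_m(m, x0=0.123):
    """Sup over the family of |empirical frequency of [0, q) - Lebesgue measure q|.

    The sample X_1, ..., X_m is the orbit x0, x0 + ALPHA, ... taken mod 1.
    """
    counts = [0] * len(THRESHOLDS)
    x = x0
    for _ in range(m):
        for j, q in enumerate(THRESHOLDS):
            if x < q:
                counts[j] += 1
        x = (x + ALPHA) % 1.0
    return max(abs(c / m - q) for c, q in zip(counts, THRESHOLDS))


for m in (10, 100, 1000, 10000):
    print(m, gamma_m(m))
```

Running the loop shows the supremum shrinking as the sample length grows, which is the behavior the theorem asserts for almost every sample path; a single orbit is of course only a numerical illustration, not a proof.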
algorithms for approximately learning transition and emission probabilities. HMMs have been studied in the context of ergodic theory under different frameworks (e.g., subshifts of finite type, sofic measures). See Marcus et al. (2011) for more details on applications of ergodic theory and entropy to HMMs. In McGoff et al. (2015), the consistency of maximum likelihood estimates for HMMs was established under general conditions including observations with noise.
More recently, the availability of big data has led to a resurgence in neural networks and in particular, deep neural networks (DNNs) for data modeling and prediction. While DNNs may be successful in modeling data (examples include Google translate, Apple Siri, Amazon Alexa), the DNNs are generally viewed as black boxes with little understanding of why and when they work. This lack of understanding has spurred new research toward applying rigorous mathematics to improve understanding of deep neural networks, both in the training phase and in the understanding of failure modes after a system is trained. During 2018–2019, several research papers demonstrated how over-parameterization of neural networks can lead to consistent modeling. A key ingredient in this approach is the use of Hoeffding's inequality to bound the probability that the sum of bounded independent random variables deviates from its expected value by more than a certain amount. This is a computational version of the ergodic theorem which does not hold in all situations. However, there is research to extend Hoeffding's inequality beyond the i.i.d. case (Impagliazzo and Kabanets 2010; Schmidt et al. 1993; Pelekis and Ramon 2017), in some cases developing new concentration inequalities. For more details on recent research in this area, see Du et al. (2019); Arora et al. (2019); Arora et al. (2018); and Raginsky et al. (2017).

Ergodicity
Given a measure-preserving transformation $T : X \to X$, if $T^{-1}A = A$, then $T^{-1}A^c = A^c$ also. This allows us to decompose the space $X$ into two pieces $A$ and $A^c$ and study the transformation $T$ separately on each. In fact the same situation holds if $T^{-1}A$ and $A$ agree up to a set of measure 0. For this reason, a set $A$ is called invariant if $\mu(T^{-1}A \,\Delta\, A) = 0$.
Returning to Boltzmann's ergodic hypothesis, existence of an invariant set of measure between 0 and 1 would be a bad situation as his essential idea was that the orbit of a single point would "see" all of $X$, whereas if $X$ were decomposed in this way, the most that a point in $A$ could see would be all of $A$, and similarly the most that a point in $A^c$ could see would be all of $A^c$.
A measure-preserving transformation will be called ergodic if it has no nontrivial decomposition of this form. More formally, let $T$ be a measure-preserving transformation of a probability space $(X, \mathcal{B}, \mu)$. The transformation $T$ is said to be ergodic if for all invariant sets, either the set or its complement has measure 0.
Unlike the remaining concepts discussed in this chapter, this definition of ergodicity applies also to infinite measure-preserving transformations and even to certain non-measure-preserving transformations. See Aaronson's book (Aaronson 1997) for more information.
The following lemma is often useful:

Lemma 2 Let $(X, \mathcal{B}, \mu)$ be a probability space and let $T : X \to X$ be a measure-preserving transformation. Then $T$ is ergodic if and only if the only measurable functions $f$ satisfying $f \circ T = f$ (up to sets of measure 0) are constant almost everywhere.
For the straightforward proof, notice that if the condition in the lemma holds and $A$ is an invariant set, then $1_A \circ T = 1_A$ almost everywhere, so that $1_A$ is an a.e. constant function and so $A$ or $A^c$ is of measure 0. Conversely, if $f$ is an invariant function, it is seen that for each $\alpha$, $\{x : f(x) < \alpha\}$ is an invariant set and hence of measure 0 or 1. It follows that $f$ is constant almost everywhere. Note that it is sufficient to check that the bounded measurable invariant functions are constant.
The following corollary of the lemma shows that ergodicity is a spectral property.
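A concrete non-ergodic case makes the role of invariant sets and invariant functions tangible. The sketch below is an illustration under assumptions chosen here, not taken from the text: the rotation of $[0, 1)$ by the rational angle $1/4$, and the set $A = \bigcup_{k=0}^{3} [k/4, k/4 + 1/8)$. It uses exact rational arithmetic to check, on a finite grid only, that $1_A \circ R = 1_A$ and that $A$ occupies half of the grid, and then shows that the time average of $1_A$ depends on the starting point, so the orbit of a point in $A$ never "sees" $A^c$.

```python
from fractions import Fraction as Fr

ALPHA = Fr(1, 4)  # rational angle (assumed for illustration)


def rotate(x):
    """R(x) = x + ALPHA mod 1."""
    return (x + ALPHA) % 1


def in_A(x):
    """Membership in A = union of [k/4, k/4 + 1/8) for k = 0, 1, 2, 3."""
    return x % Fr(1, 4) < Fr(1, 8)


def indicator_time_average(x, n):
    """Birkhoff average (1/n) * sum over i < n of 1_A(R^i x)."""
    hits = 0
    for _ in range(n):
        hits += in_A(x)
        x = rotate(x)
    return Fr(hits, n)


grid = [Fr(k, 64) for k in range(64)]
invariant_on_grid = all(in_A(x) == in_A(rotate(x)) for x in grid)
fraction_in_A = Fr(sum(in_A(x) for x in grid), len(grid))
print(invariant_on_grid, fraction_in_A)
print(indicator_time_average(Fr(1, 16), 100), indicator_time_average(Fr(3, 16), 100))
```

Here $1_A$ is a nonconstant invariant function, so by Lemma 2 this rotation is not ergodic; the grid check is a sanity test rather than a proof of invariance for every point.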
tion of the circle, it is claimed that the transformation is ergodic if and only if the 'angle' $\alpha$ is
set $A$ of positive measure satisfies, $\mu\left(\bigcup_{i=0}^{\infty} T^i(A)\right) = 1$. This proves that every rank-
A detailed proof of ergodic decomposition is not given here. The theorem can be proved using the Birkhoff ergodic theorem and the Riesz Representation theorem identifying the dual space of the space of continuous functions on a compact space as
property that there is convergence in the strong Cesàro sense. That is, a measure-preserving transformation $T$ is weak mixing if $\frac{1}{N} \sum^{N-1} \cdots$
On the other hand, now it is shown that any Bernoulli shift is strong mixing. To see this, let $A$ and $B$ be arbitrary measurable sets. By standard measure-theoretic arguments, $A$ and $B$ may each be approximated arbitrarily closely by a finite union of cylinder sets. Since, if $A'$ and $B'$ are finite unions of cylinder sets, $\mu(A' \cap T^{-n}B')$ is equal to $\mu(A')\mu(B')$ for large $n$, it is easy to deduce that $\mu(A \cap T^{-n}B) \to \mu(A)\mu(B)$ as required. Since the doubling map is measure-theoretically isomorphic to a one-sided Bernoulli shift, it follows that the doubling map is also strong mixing.
Similarly, if a Markov Chain is irreducible (i.e., for any states $i$ and $j$, there exists an $n \ge 0$ such that $P^n_{ij} > 0$) and aperiodic (there is a state $i$ such that $\gcd\{n : P^n_{ii} > 0\} = 1$), then given any pair of cylinder sets $A'$ and $B'$, it follows from standard theorems of Markov chains that $\mu(A' \cap T^{-n}B') \to \mu(A')\mu(B')$. The same argument as above then shows that an aperiodic irreducible Markov Chain is strong mixing. On the other hand, if a Markov chain is periodic ($d = \gcd\{n : P^n_{ii} > 0\} > 1$), then letting $A = B = \{x : x_0 = i\}$, it follows that $\mu(A \cap T^{-n}B) = 0$ whenever $d \nmid n$. Thus, $T^d$ is not ergodic, so that $T$ is not weak mixing. Note that a mixing Markov Chain is actually measure-theoretically isomorphic to a Bernoulli shift (Friedman and Ornstein 1970; Adler et al. 1972; Keane and Smorodinsky 1979b).
So far in this chapter, all examples of explicit mixing transformations have been isomorphic to Bernoulli transformations. Ornstein's construction of rank-one mixing using random spacers produced the first known examples of zero-entropy mixing (Ornstein 1972). Since these examples, tools have been developed to establish explicit rank-one mixing transformations (e.g., staircase transformations) (Adams 1998; Creutz and Silva 2010). Amazingly, it has already been established by Danilenko that any discrete countable infinite amenable group admits a zero-entropy mixing action (Danilenko 2016).
Both weak and strong mixing have formulations in terms of functions:

1. $T$ is weak mixing if and only if for every $f, g \in L^2$ one has
$$\frac{1}{N} \sum_{n=0}^{N-1} \left| \langle f, g \circ T^n \rangle - \langle f, 1 \rangle \langle 1, g \rangle \right| \to 0 \quad \text{as } N \to \infty.$$
2. $T$ is strong mixing if and only if for every $f, g \in L^2$, one has
$$\langle f, g \circ T^N \rangle \to \langle f, 1 \rangle \langle 1, g \rangle \quad \text{as } N \to \infty.$$

Using this, one can see that both mixing conditions are spectral properties.

Corollary 10 Weak and strong mixing are spectral properties.
This is an opportune time to highlight a couple of major results that establish mixing properties for interval exchange transformations. In Katok (1980a), Katok shows that no interval exchange transformation is strong mixing. On the other hand, if the underlying permutation $\nu$ is irreducible and not a rotation,² then the interval exchange transformation is weak mixing for almost all divisions of the interval (with respect to Lebesgue measure on the $(k - 1)$-dimensional simplex). The case $k = 3$ was proved by Katok and Stepin (1967) and the general case including $k > 3$ was established by Avila and Forni (2007). (This research is cited as a primary contributor to Avila's 2014 Fields Medal (https://www.mathunion.org/imu-awards/fields-medal/fields-medal-2014/fields-medallists-2014-awardees-brief-citations).)
While on the face of it the formulation of weak mixing is considerably less natural than that of strong mixing, the notion of weak mixing turns out to be extremely natural from a spectral point of view. Given a measure-preserving transformation $T$, let $U_T$ be the Koopman operator described above. Since this operator is an isometry, any eigenvalue must lie on the unit circle. The constant function 1 is always an eigenfunction with eigenvalue 1. Note that if $z = x + iy$ is in the complex
plane, then its complex conjugate is $\bar{z} = x - iy$. If $T$ is ergodic and $g$ and $h$ are eigenfunctions of $U_T$ with eigenvalue $\lambda$, then $g\bar{h}$ is an eigenfunction with eigenvalue 1, hence invariant, so that $g = Kh$ for some constant $K$. Notice that for ergodic transformations, up to rescaling, there is at most one eigenfunction with any given eigenvalue.
If $U_T$ has a nonconstant eigenfunction $f$, then one has $|\langle U_T^n f, f \rangle| = \|f\|^2$ for each $n$, whereas by Cauchy-Schwarz, $|\langle f, 1 \rangle|^2 < \|f\|^2$. It follows that $|\langle U_T^n f, f \rangle - \langle f, 1 \rangle \langle 1, f \rangle| \ge c$ for some positive constant $c$, so that using Lemma 9, $T$ is not weak mixing.
The converse can be shown using the spectral theorem. For a detailed proof, see section 2.6 of Petersen (1983). Also, another chapter ▶ "Spectral Theory of Dynamical Systems" (Lemańczyk and Kanigowski 2022) is devoted to a study of spectral theory and its connections to ergodic theory. We encourage the interested reader to refer to this chapter for many of the detailed definitions and results on spectral theory.

Theorem 11 The measure-preserving transformation $T$ is weak mixing if and only if $U_T$ has no nonconstant eigenfunctions.
Of course this also shows that weak mixing is a spectral property. Equivalently, this says that the transformation $T$ is weak mixing if and only if apart from the constant eigenfunction, the operator $U_T$ has only continuous spectrum (i.e., the operator has no other eigenfunctions).
Using this theory, one can establish the following:

Theorem 12
1. $T$ is weak mixing if and only if $T \times T$ is ergodic;
2. If $T$ and $S$ are ergodic, then $T \times S$ is ergodic if and only if $U_S$ and $U_T$ have no common eigenvalues other than 1.

For a measure-preserving transformation $T$, let $K$ be the subspace of $L^2$ spanned by the eigenfunctions of $U_T$. It is a remarkable fact that $K$ may be identified as $L^2(X, \mathcal{B}', \mu)$ where $\mathcal{B}'$ is a sub-$\sigma$-algebra of $\mathcal{B}$. The space $K$ is called the Kronecker factor of $T$. The terminology comes from the fact that any sub-$\sigma$-algebra $\mathcal{F}$ of $\mathcal{B}$ gives rise to a factor mapping $\pi : (X, \mathcal{B}, \mu) \to (X, \mathcal{F}, \mu)$ with $\pi(x) = x$. By construction $L^2(X, \mathcal{B}', \mu)$ is the closed linear span of the eigenfunctions of $T$ considered as a measure-preserving transformation of $(X, \mathcal{B}', \mu)$. By the Discrete Spectrum Theorem of Halmos and von Neumann (1942), $T$ acting on $(X, \mathcal{B}', \mu)$ is measure-theoretically isomorphic to a rotation on a compact group.
This allows one to split $L^2(X, \mathcal{B}, \mu)$ as $L^2_d(X, \mathcal{B}, \mu) \oplus L^2_c(X, \mathcal{B}, \mu)$, where, as mentioned above, the first part is the discrete spectrum part, spanned by eigenfunctions, and the second part is the continuous spectrum part, consisting of functions whose spectral measure is continuous. Since $L^2$ is split into a discrete part and a continuous part, it is natural to ask whether the underlying transformation $T$ can be split up in some way into a weak mixing part and a discrete spectrum (compact group rotation) part, somewhat analogously to the ergodic decomposition. Unfortunately, there is no such decomposition available. However, for some applications, for example, to multiple recurrence (starting with the work of Furstenberg (1977, 1981)), the decomposition of $L^2$ (possibly into more complicated parts) plays a crucial role (see the chapters "Ergodic Theory: Recurrence" and "Ergodic Theory: Interactions with Combinatorics and Number Theory").
For noninvertible measure-preserving transformations, the transformation is weak or strong mixing if and only if its natural extension has that property.
The understanding of weak mixing in terms of the discrete part of the spectrum of the operator also extends to total ergodicity. $T^n$ is ergodic if and only if $T$ has no eigenvalues of the form $e^{2\pi i p/n}$ other than 1. From this, it follows that an ergodic measure-preserving transformation $T$ is totally ergodic if and only if it has no rational spectrum (i.e., no eigenvalues of the form $e^{2\pi i p/q}$ other than the simple eigenvalue 1).
An intermediate mixing condition between strong and weak mixing is that a measure-preserving transformation is mild-mixing if whenever $f \circ T^{n_i} \to f$ for an $L^2$ function $f$ and a sequence $n_i \to \infty$, then $f$ is a.e. constant. Clearly mild-mixing is a spectral property. If a
transformation has an eigenfunction $f$, then it is straightforward to find a sequence $n_i$ such that $f \circ T^{n_i} \to f$; thus, it follows that mild-mixing implies weak mixing. To see that strong mixing implies mild-mixing, suppose that $T$ is strong mixing and that $f \circ T^{n_i} \to f$. Then $\langle f \circ T^{n_i}, f \rangle \to \|f\|^2$. On the other hand, the strong mixing property implies that $\langle f \circ T^{n_i}, f \rangle \to |\langle f, 1 \rangle|^2$. The equality of these implies that $f$ is a.e. constant. Mild-mixing has a useful reformulation in terms of ergodicity of general (not necessarily probability) measure-preserving transformations:
A transformation $T$ is mild-mixing if and only if for every conservative ergodic measure-preserving transformation $S$, $T \times S$ is ergodic. See Furstenberg and Weiss' article (1978) for further information on mild-mixing.
If there exists a sequence $n_i \to \infty$ such that for every $f \in L^2$, $f \circ T^{n_i} \to f$, then $T$ is said to be rigid. The sequence $n_i$ is called a rigidity sequence for $T$. Mild mixing transformations are those transformations with no rigid factor. There has been much new research on the nature of rigidity sequences, spurred largely by (Bergelson et al. 2014). See (Eisner and Grivaux 2011; Aaronson et al. 2014; Adams 2015; Fayad and Kanigowski 2015; Grivaux 2013; Le 2017; Bayless and Yancey 2015; Grivaux and Roginskaya 2013; Robertson 2019) for a selection of recent results on rigidity and nonrecurrent sequences. If there exists $\alpha > 0$ such that for every measurable set $A$,
$$\liminf_{n \to \infty} \mu(A \cap T^{-n}A) \ge \alpha \mu(A),$$
then $T$ is said to be partially rigid, or more specifically $\alpha$-rigid. Strong mixing transformations are not partially rigid for any $\alpha > 0$. While mild mixing transformations are not rigid, they can be partially rigid, and in particular, both Chacon transformations (Chacon-2 and Chacon-3) are partially rigid.
In the case where the space $X$ is endowed with a topology, a transformation $T$ is topologically mixing if given two nonempty open sets $U$ and $V$, there exists $N \in \mathbb{N}$ such that for all $n > N$, the set $V \cap T^{-n}U$ is nonempty. Topological mixing does not imply ergodicity (Muehlegger et al. 1997). If every open set has positive measure, then strong mixing implies topological mixing. Mild mixing does not imply topological mixing. There is a measure theoretic property which is a close counterpart to topological mixing. The property is commonly referred to as light mixing and was first introduced by Walters as intermixing (Walters 1972). A transformation $T$ is lightly mixing if given any two sets $A$ and $B$ of positive measure,
$$\liminf_{n \to \infty} \mu(A \cap T^{-n}B) > 0.$$
If $T$ is lightly mixing, then $T$ is mildly mixing (since $\liminf_{n \to \infty} \mu(A \cap T^{-n}A^c) = 0$ for $A$ in a rigid factor). Also, there exist transformations which are lightly mixing, but are not strong mixing. In particular, the transformation first published by Chacon as a weak mixing, nonstrong mixing transformation is actually lightly mixing (Chacon 1969; Friedman and King 1991).
If there exists $\alpha > 0$ such that for all measurable sets $A$ and $B$,
$$\liminf_{n \to \infty} \mu(A \cap T^{-n}B) \ge \alpha \mu(A)\mu(B),$$
then $T$ is partially mixing, or more specifically $\alpha$-mixing. It is not difficult to show that $\alpha \le 1$ for $T$ measure-preserving. There are transformations which are partially mixing, but are not strongly mixing. Also, Chacon's transformation from (Friedman and King 1991) is lightly mixing, but not partially mixing. In Friedman (1989), Friedman constructs explicit transformations with an optimal combination of mixing and rigidity. In particular, for each $\alpha$ such that $0 < \alpha < 1$, a transformation is constructed that is simultaneously $\alpha$-rigid and $(1 - \alpha)$-mixing.
See Fig. 2 for a hierarchy of mixing properties. The stronger, more restrictive properties start at the top and become weaker as one descends the tree. Note that there are known examples distinguishing these properties, except in the case of strong mixing and mixing of all orders. The dashed line between these properties signifies that it is an open question whether strong mixing (i.e., 2-mixing) implies mixing of all orders. Many of the branches are labeled with references to
[Fig. 2: hierarchy of mixing properties. Node labels recoverable from the figure: Countable Lebesgue Spectrum; Mixing of All Orders; Zero-Entropy Mixing; Strong Mixing.]
articles where these properties were defined or important examples were presented.
The strongest spectral property considered is that of having countable Lebesgue spectrum. A detailed discussion of spectral theory is not included here, although this special case can be described simply. Specifically, let $T$ be an invertible measure-preserving transformation. Then $T$ has countable Lebesgue spectrum if there is a sequence of functions $f_1, f_2, \ldots$ such that $\{1\} \cup \{U_T^n f_j : n \in \mathbb{Z}, j \in \mathbb{N}\}$ forms an orthonormal basis for $L^2(X)$.
To see that this property is stronger than strong mixing, simply observe that it implies that $\langle U_T^t U_T^n f_j, U_T^m f_k \rangle \to 0$ as $t \to \infty$. Then by approximating $f$ and $g$ by their expansions with respect to a finite part of the basis, it can be deduced that $\langle U_T^n f, g \rangle \to \langle f, 1 \rangle \langle 1, g \rangle$ as required.
Since already strong mixing is atypical from the topological point of view, it follows that countable Lebesgue spectrum has to be atypical. In fact, Yuzvinskii (1967) showed that the typical invertible measure-preserving transformation has simple singular spectrum.
The property of countable Lebesgue spectrum is by definition a spectral property. Since it completely describes the transformation up to spectral isomorphism, there can be no stronger spectral properties. The remaining properties that are examined are invariant under measure-theoretic isomorphisms only.
An invertible measure-preserving transformation $T$ of $(X, \mathcal{B}, \mu)$ is said to be K (for Kolmogorov) if there is a sub-$\sigma$-algebra $\mathcal{F}$ of $\mathcal{B}$ such that
1. $\bigcap_{n=1}^{\infty} T^{-n}\mathcal{F}$ is the trivial $\sigma$-algebra up to sets of measure 0 (i.e., the intersection consists only of null sets and sets of full measure).
2. $\bigvee_{n=1}^{\infty} T^{n}\mathcal{F} = \mathcal{B}$ (i.e., the smallest $\sigma$-algebra containing $T^n\mathcal{F}$ for all $n > 0$ is $\mathcal{B}$).

The K property has a useful reformulation in terms of entropy as follows: $T$ is K if and only if for every nontrivial partition $P$ of $X$, the entropy of $T$ with respect to the partition $P$ is positive:
T has completely positive entropy. See the chapter on Entropy in Ergodic Theory for the relevant definitions. The equivalence of the K property and completely positive entropy was shown by Rokhlin and Sinai (1961). For a general transformation T, one can consider the collection of all subsets B of X such that, with respect to the partition $\mathcal{P}_B = \{B, B^c\}$, $h(\mathcal{P}_B) = 0$. One can show that this is a σ-algebra. This σ-algebra is known as the Pinsker σ-algebra. The above reformulation allows us to say that a transformation is K if and only if it has a trivial Pinsker σ-algebra.

The K property implies countable Lebesgue spectrum (see Parry’s (1981) book for a proof). To see that K is not implied by countable Lebesgue spectrum, it can be pointed out that certain transformations derived from Gaussian systems (see, for example, the paper of Newton and Parry (1966)) have countable Lebesgue spectrum but zero entropy.

The fact that (two-sided) Bernoulli shifts have the K property follows from Kolmogorov’s 0–1 law by taking $\mathcal{F} = \bigvee_{n=0}^{\infty} T^{-n}\mathcal{P}$, where $\mathcal{P}$ is the partition into cylinder sets (see Williams’s (1991) book for details of the 0–1 law).

Although the K property is explicitly an invertible property, it has a noninvertible counterpart, namely, exactness. A transformation T of (X, ℬ, m) is exact if $\bigcap_{n=0}^{\infty} T^{-n}\mathcal{B}$ consists entirely of null sets and sets of measure 1. It is not hard to see that a noninvertible transformation is exact if and only if its natural extension is K.

The final and strongest property in our list is that of being measure-theoretically isomorphic to a Bernoulli shift. If T is measure-theoretically isomorphic to a Bernoulli shift, it can be said that T has the Bernoulli property. While in principle this could apply to both invertible and noninvertible transformations, in practice the definition applies to a large class of invertible transformations, but occurs comparatively seldom for noninvertible transformations. For this reason, we will restrict ourselves to a discussion of the Bernoulli property for invertible transformations (see however work of Hoffman and Rudolph (2002) and Heicklen and Hoffman (2002) for work on the one-sided Bernoulli property).

In the case of invertible Bernoulli shifts, Ornstein (1970, 1974) developed in the early 1970s a powerful isomorphism theory, showing that two Bernoulli shifts are measure-theoretically isomorphic if and only if they have the same entropy. Entropy had already been identified as an invariant by Kolmogorov and Sinai (Kolmogorov 1958; Sinai 1959), so this established that it was a complete invariant for Bernoulli shifts. Keane and Smorodinsky (1979a) gave a proof which showed that two Bernoulli shifts of the same entropy are isomorphic using a conjugating map that is continuous almost everywhere. With other authors, this theory was extended to show that the property of being isomorphic to a Bernoulli shift applied to a surprisingly large class of measure-preserving transformations (e.g., geodesic flows on manifolds of constant negative curvature (Ornstein and Weiss 1973), aperiodic irreducible Markov chains (Friedman and Ornstein 1970), toral automorphisms (Katznelson 1971), and more generally many Gibbs measures for hyperbolic dynamical systems (see the book of Bowen (1975))).

Initially, it was conjectured that the properties of being K and Bernoulli were the same, but since then a number of measure-preserving transformations that are K but not Bernoulli have been identified. The earliest was due to Ornstein (1973a). Ornstein and Shields (1973) then provided an uncountable family of nonisomorphic K automorphisms. Katok (1980b) gave an example of a smooth diffeomorphism that is K but not Bernoulli, and Kalikow (1982) gave a very natural probabilistic example of a transformation that has this property (the $T, T^{-1}$ process).

While in systems that one regularly encounters there is a correlation between positive entropy and the stronger mixing properties discussed, these properties are logically independent; for example, taking the product of a Bernoulli shift and the identity transformation gives a positive entropy transformation that fails to be ergodic; also, besides rank-one mixing examples, there are zero-entropy Gaussian systems which are strong mixing and have countable Lebesgue spectrum.

In many of the mixing criteria discussed above, a pair of sets A and B is considered and one asks for asymptotic independence of A and B (so that for large n, A and $T^{-n}B$ become independent).
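Ornstein's isomorphism theorem discussed above reduces the isomorphism question for Bernoulli shifts to comparing a single number. The following is a minimal sketch of that comparison, assuming the standard formula $h = -\sum_i p_i \log p_i$ for the entropy of the Bernoulli shift with probability vector $(p_1, \ldots, p_k)$. The formula is not stated in this section, and the function names `bernoulli_entropy` and `same_entropy` are hypothetical, chosen only for this illustration.

```python
import math


def bernoulli_entropy(probs):
    """Entropy -sum p_i log p_i (natural log) of the Bernoulli shift on probs."""
    if any(p < 0 for p in probs) or not math.isclose(sum(probs), 1.0):
        raise ValueError("probs must be a probability vector")
    return -sum(p * math.log(p) for p in probs if p > 0)


def same_entropy(probs_a, probs_b, tol=1e-12):
    """True if the two Bernoulli shifts have equal entropy up to rounding."""
    return math.isclose(
        bernoulli_entropy(probs_a),
        bernoulli_entropy(probs_b),
        rel_tol=tol,
        abs_tol=tol,
    )


four_symbols = [1 / 4] * 4
five_symbols = [1 / 2] + [1 / 8] * 4
print(same_entropy(four_symbols, five_symbols))   # both entropies are 2 log 2
print(same_entropy([1 / 2, 1 / 2], [1 / 3] * 3))  # log 2 versus log 3
```

The two shifts in the first comparison have different alphabets but the same entropy, so by Ornstein's theorem they are isomorphic; the floating-point tolerance is a convenience of this sketch, not part of the theorem.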
It is natural to ask, given a finite collection of sets $A_0, A_1, \ldots, A_k$, under what conditions $m(A_0 \cap T^{-n_1}A_1 \cap \ldots \cap T^{-n_k}A_k)$ converges to $\prod_{j=0}^{k} m(A_j)$.

A measure-preserving transformation T is said to be mixing of order k + 1 if for all measurable sets $A_0, \ldots, A_k$,

$$\lim_{n_1 \to \infty,\ n_{j+1} - n_j \to \infty} m(A_0 \cap T^{-n_1}A_1 \cap \ldots \cap T^{-n_k}A_k) = \prod_{j=0}^{k} m(A_j).$$

Bergelson (1987) generalized this by showing that weak mixing implies a polynomial version of weak mixing of all orders:

$$\lim_{n \to \infty,\ n \notin J} m(A_0 \cap T^{-p_1(n)}A_1 \cap \ldots \cap T^{-p_k(n)}A_k) = \prod_{i=0}^{k} m(A_i)$$
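For a Bernoulli shift, the limit in the definition of mixing of order k + 1 can be checked exactly on cylinder sets. The sketch below assumes the left shift T on $\{0,1\}^{\mathbb{Z}}$ with the product measure giving the symbol 1 probability p; the function names and the encoding of each set as a pair (shift, word) are hypothetical and serve only this illustration.

```python
from fractions import Fraction


def joint_measure(shifted_words, p=Fraction(1, 2)):
    """Measure of the intersection of the sets T^{-n} A_w over (n, w) in shifted_words.

    A_w is the cylinder {x : x[0], ..., x[len(w)-1] == w}, so T^{-n} A_w fixes
    the coordinates n, ..., n + len(w) - 1. The measure is the Bernoulli
    product measure with P(x[i] == 1) = p.
    """
    fixed = {}  # coordinate -> required symbol
    for n, word in shifted_words:
        for i, symbol in enumerate(word):
            if fixed.setdefault(n + i, symbol) != symbol:
                return Fraction(0)  # contradictory requirements: empty set
    measure = Fraction(1)
    for symbol in fixed.values():
        measure *= p if symbol == 1 else 1 - p
    return measure


def product_of_measures(words, p=Fraction(1, 2)):
    """Product of the measures of the unshifted cylinders A_w."""
    result = Fraction(1)
    for word in words:
        result *= joint_measure([(0, word)], p)
    return result


words = [(1, 0), (1, 1), (0, 1)]  # the cylinders A_0, A_1, A_2
p = Fraction(1, 3)
far_apart = joint_measure([(0, words[0]), (5, words[1]), (10, words[2])], p)
overlapping = joint_measure([(0, words[0]), (1, words[1])], p)
print(far_apart == product_of_measures(words, p))  # gaps exceed the word length
print(overlapping)  # coordinate 1 would have to be both 0 and 1
```

Once the gaps between the shifts exceed the lengths of the words, the constrained coordinates are disjoint and the joint measure equals the product exactly, which is the independence that the definition asks for only in the limit.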
system is mixing (i.e., given any measurable sets A and B, $m(A \cap T_1^{-n_1} T_2^{-n_2} B) \to m(A)m(B)$ as $\|(n_1, n_2)\| \to \infty$). Ledrappier showed that the system fails to be 3-mixing. Subsequently Masser (2004) established necessary and sufficient conditions for similar higher-dimensional algebraic actions to be mixing of order k but not order k + 1 for any given k.

Hyperbolicity and Decay of Correlations

One class of systems in which the stronger mixing properties are often found is the class of smooth systems possessing uniform hyperbolicity (i.e., the tangent space to the manifold at each point splits into stable and unstable subspaces $E^s(x)$ and $E^u(x)$ such that $\|DT|_{E^s(x)}\| \le a < 1$ for all x and $\|DT^{-1}|_{E^u(x)}\| \le a$ and $DT(E^s(x)) = E^s(T(x))$ and $DT(E^u(x)) = E^u(T(x))$). In some cases, similar conclusions are found in systems possessing nonuniform hyperbolicity. See Katok and Hasselblatt’s (1995) book for an overview of hyperbolic dynamical systems, as well as the chapter in this volume on ▶ “Smooth Ergodic Theory.”

In the simple case of expanding piecewise continuous maps of the interval (i.e., maps for which the absolute value of the derivative is uniformly bounded below by a constant greater than 1), it is known that if they are totally ergodic and topologically transitive (i.e., the forward images of any interval cover the entire interval), then provided that the map has sufficient smoothness (e.g., the map is $C^1$ and the derivative satisfies a certain additional summability condition), the map has a unique absolutely continuous invariant measure which is exact and whose natural extension is Bernoulli (see the paper of Góra (1994) for results of this type proved under some of the mildest hypotheses). These results were originally established for maps that were twice continuously differentiable, and the hypotheses were progressively weakened, approaching, but never meeting, $C^1$. Subsequent work of Quas (1996b, a) provided examples of $C^1$ expanding maps of the interval for which Lebesgue measure was invariant, but respectively not ergodic and not weak mixing.

Some of the key tools in controlling mixing in one-dimensional expanding maps that are absent in the $C^1$ case are bounded distortion estimates. Here, there is a constant $1 \le C < \infty$ such that given any interval I on which some power $T^n$ of T acts injectively and any subinterval J of I, one has $1/C \le (|T^n J| / |T^n I|) / (|J| / |I|) \le C$. An early place in which bounded distortion estimates appear is the work of Rényi (1957).

One important class of results for expanding maps establishes an exponential decay of correlations. Here, one starts with a pair of smooth functions f and g and one estimates $\int f \cdot g \circ T^n \, dm - \int f \, dm \int g \, dm$, where m is an absolutely continuous invariant measure. If m is mixing, it is expected that this will converge to 0. In fact though, in good cases this converges to 0 at an exponential rate for each pair of functions f and g belonging to a sufficiently smooth class. In this case, the measure-preserving transformation T is said to have exponential decay of correlations. See Liverani’s (2004) article for an introduction to a method of establishing this based on cones. Exponential decay of correlations implies in particular that the natural extension is Bernoulli.

Hu (2004) has studied the situation of maps of the interval for which the derivative is bigger than 1 everywhere except at a fixed point, where the local behavior is of the form $x \mapsto x + x^{1+\alpha}$ for $0 < \alpha < 1$. In this case, rather than exhibiting exponential decay of correlations, the map has polynomial decay of correlations with a rate depending on α.

In Young’s (1995) survey, a variety of techniques are outlined for understanding the strong ergodic properties of nonuniformly hyperbolic diffeomorphisms. In her article (Young 1998), methods are introduced for studying many classes of nonuniformly hyperbolic systems by looking at suitably high powers of the map, for which the power has strong hyperbolic behavior. The article shows how to understand the ergodic behavior of these systems. These methods are applied (for
There are a number of Baire category results addressing this. In order to state them, one needs a set of measure-preserving transformations and a topology on them. As mentioned earlier, it is effectively no restriction to assume that a transformation is a Lebesgue-measurable map on the unit interval preserving Lebesgue measure. The classical category results are then on the collection of invertible Lebesgue-measure-preserving transformations of the unit interval. One topology on these is the “weak” topology, where a subbase is given by sets of the form $N(T, A, \epsilon) = \{S : \lambda(S(A) \,\Delta\, T(A)) < \epsilon\}$. With respect to this topology, Halmos (1944) showed that a residual set (i.e., a dense $G_\delta$ set) of invertible measure-preserving transformations is weak mixing (see also work of Alpern (1976)), while Rokhlin (1948) showed that the set of strong mixing transformations is meagre (i.e., a nowhere dense $F_\sigma$ set), allowing one to conclude that with respect to this topology, a typical transformation is weak mixing but not strong mixing.

Tikhonov (2007) introduces a natural metric (i.e., leash topology) which makes the set of mixing transformations into a complete separable metric space. It is shown that in this metric, a generic mixing transformation is mixing of all orders and has simple singular spectrum and all its powers are disjoint. See the chapter ▶ “Spectral Theory of Dynamical Systems” (Lemańczyk and Kanigowski 2022) for further details on simple singular spectrum, as well as the notion of disjointness. While it is shown there that the conjugacy class of a generic mixing transformation is dense, it is not until Bashtanov (2013) that the conjugacy class of every mixing transformation is shown to be dense in the leash topology. Also, in Bashtanov (2013), it is shown that rank-one is generic in the leash topology.

Viewing classes of ergodic measure-preserving transformations from a descriptive set theoretic viewpoint (Becker and Kechris 1996; Foreman 2000) has shed new light on the challenges of characterizing broad classes of transformations. In particular, in Foreman and Weiss (2011), it is shown that the isomorphism relation is a complete analytic set and in particular, not Borel. This gives an obstacle to concrete (countably) verifiable invariants for determining when two transformations are isomorphic. However, it is also shown that there are topologically generic classes of transformations (i.e., rank-one) that are Borel. At this time, it is an open question to establish an easily verifiable method for determining when two rank-one transformations are isomorphic. Nevertheless, there has been recent progress toward characterizing rank-one transformations; see Gao and Hill (2014, b); Adams et al. (2017); and Foreman et al. (2019).

Future Directions

Problem 1 (Mixing of all orders). Does mixing imply mixing of all orders? Can the results of Kalikow, Ryzhikov, and Host be extended to larger classes of measure-preserving transformations? Thouvenot observed that it is sufficient to establish the result for measure-preserving transformations of entropy 0. This observation (whose proof is based on the Pinsker σ-algebra) was stated in Kalikow’s (1984) paper and is reproduced as Proposition 3.2 in recent work of de la Rue (2006) on the mixing of all orders problem.

Problem 2 (Multiple weak mixing). As mentioned above, Bergelson (1987) showed that if T is a weak mixing transformation, then there is a subset J of the integers of density 0 such that

$$\lim_{n \to \infty,\ n \notin J} m(A_0 \cap T^{-p_1(n)}A_1 \cap \ldots \cap T^{-p_k(n)}A_k) = \prod_{i=0}^{k} m(A_i)$$

whenever $p_1(n), \ldots, p_k(n)$ are nonconstant integer-valued polynomials such that $p_i(n) - p_j(n)$ is unbounded for $i \ne j$. It is natural to ask what is the most general class of times that can replace the sequences $(p_1(n)), \ldots, (p_k(n))$. In unpublished notes, Bergelson and Håland considered as times the values taken by a family of integer-valued generalized polynomials (those functions of an integer variable that can be obtained by the
operations of addition, multiplication, addition of or multiplication by a real constant, and taking integer parts (e.g., $g(n) = \lfloor \sqrt{2}\,\lfloor \pi n \rfloor + \lfloor \sqrt{3}\,n \rfloor \rfloor$)). They conjectured necessary and sufficient conditions for the analogue of Bergelson’s weak mixing polynomial ergodic theorem to hold and proved the conjecture in certain cases.

In a recent paper of McCutcheon and Quas (2007), the analogous question was addressed in the case where T is a mild-mixing transformation.

machinery by Ornstein and Weiss (1975). The Weak Pinsker Conjecture states that if a measure-preserving transformation T has entropy h > 0, then for all ϵ > 0, T may be expressed as a product of a Bernoulli shift and a measure-preserving transformation with entropy less than ϵ.

This problem was recently solved in the affirmative by T. Austin (2018). New results on measure concentration were developed to prove the weak Pinsker conjecture.
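The generalized polynomials of Problem 2 are easy to evaluate exactly with integer arithmetic. The following is a minimal sketch for one hypothetical example, $g(n) = \lfloor \sqrt{2}\,n \rfloor + \lfloor \sqrt{3}\,n \rfloor^2$, which is chosen only for illustration and is not taken from the works cited above. It assumes $n \ge 0$ and uses the identity $\lfloor \sqrt{c}\,n \rfloor = \lfloor \sqrt{c n^2} \rfloor$, which avoids floating-point rounding.

```python
import math


def floor_sqrt_multiple(c, n):
    """Return floor(sqrt(c) * n) exactly, for integers c >= 0 and n >= 0."""
    if c < 0 or n < 0:
        raise ValueError("this sketch only handles c >= 0 and n >= 0")
    return math.isqrt(c * n * n)


def g(n):
    """Hypothetical generalized polynomial floor(sqrt(2) n) + floor(sqrt(3) n)**2."""
    return floor_sqrt_multiple(2, n) + floor_sqrt_multiple(3, n) ** 2


print([g(n) for n in range(5)])
```

The function is built from exactly the operations listed in Problem 2: multiplication by a real constant, taking integer parts, multiplication, and addition.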
Austin T (2018) Measure concentration and the weak Pinsker property. Publications mathématiques de l’IHÉS 128(1):1–119
Avila A, Forni G (2007) Weak-mixing for interval exchange transformations and translation flows. Ann Math 165:637–664
Bashtanov AI (2013) Generic mixing transformations are rank 1. Math Notes 93:209–216
Bayless RL, Yancey KB (2015) Weakly mixing and rigid rank-one transformations preserving an infinite measure. New York J Math 21:615–636
Becker H, Kechris AS (1996) The descriptive set theory of Polish group actions, volume 232. Cambridge University Press
Berend MBD, Kolesnik G (2001) Irrational dilations of Pascal’s triangle. Mathematika 48:159–168
Bergelson V (1987) Weakly mixing PET. Ergodic Theory Dynam Syst 7:337–349
Bergelson V, Leibman A (2002) A nilpotent Roth theorem. Inventiones mathematicae 147(2):429–470
Bergelson V, del Junco A, Lemańczyk M, Rosenblatt J (2014) Rigidity and non-recurrence along sequences. Ergodic Theory Dynam Syst 34(5):1464–1502
Birkhoff GD (1931) Proof of the ergodic theorem. Proc Nat Acad Sci 17:656–660
Birkhoff GD, Smith PA (1924) Structural analysis of surface transformations. J Undergrad Math 7:345–379
Boltzmann L (1871) Einige allgemeine Sätze über Wärmegleichgewicht. Wiener Berichte 63:679–711
Boltzmann L (1909) Wissenschaftliche Abhandlungen. Akad. Wissenschaften Berlin
Bowen R (1975) Equilibrium states and the ergodic theory of Anosov diffeomorphisms. Springer
Bozgan F, Sanchez A, Silva CE, Stevens D, Wang J (2015) Subsequence bounded rational ergodicity of rank-one transformations. Dyn Syst 30(1):70–84
Bradley RC (2005) Basic properties of strong mixing conditions. A survey and some open questions. Probab Surv 2:107–144
Chacon RV (1966a) Change of velocity in flows. J Math Mechanics 16(5):417–431
Chacon RV (1966b) Transformations having continuous spectrum. J Math Mechanics 16(5):399–415
Chacon RV (1969) Weakly mixing transformations which are not strongly mixing. Proc Am Math Soc 22:559–562
Choquet G (1956a) Existence des représentations intégrales au moyen des points extrémaux dans les cônes convexes. C R Acad Sci Paris 243:699–702
Choquet G (1956b) Unicité des représentations intégrales au moyen de points extrémaux dans les cônes convexes réticulés. C R Acad Sci Paris 243:555–557
Conze J-P, Lesigne E (1984) Théorèmes ergodiques pour des mesures diagonales. Bull de la Soc Math de France 112:143–175
Creutz D, Silva CE (2010) Mixing on rank-one transformations. Stud Math 199(1):43–72
Danilenko AI (2016) Mixing actions of zero entropy for countable amenable groups. In: Colloquium Mathematicum, volume 145. Instytut Matematyczny Polskiej Akademii Nauk, pp 179–186
de la Rue T (2006) 2-fold and 3-fold mixing: why 3-dot-type counterexamples are impossible in one dimension. Bull Braz Math Soc (NS) 37(4):503–521
Du SS, Zhai X, Poczos B, Singh A (2019) Gradient descent provably optimizes over-parameterized neural networks. In: International conference on learning representations 2019
Ehrenfest P, Ehrenfest T (1911) Begriffliche Grundlage der statistischen Auffassung in der Mechanik. Number 4 in Encyklopädie der mathematischen Wissenschaften. Teubner
Eisner T, Grivaux S (2011) Hilbertian Jamison sequences and rigid dynamical systems. J Funct Anal 261(7):2013–2052
Fayad B, Kanigowski A (2015) Rigidity times for a weakly mixing dynamical system which are not rigidity times for any irrational rotation. Ergodic Theory Dynam Syst 35(8):2529–2534
Feller W (1950) An introduction to probability theory and its applications. Wiley
Ferenczi S (1997) Systems of finite rank. Colloq Math 73(1):35–65
Fields medalists 2014 awardees with brief citations. https://www.mathunion.org/imu-awards/fields-medal/fields-medal-2014/fields-medallists-2014-awardees-brief-citations
Foreman M (2000) A descriptive view of ergodic theory. Descriptive Set Theory Dynam Syst:87–171
Foreman DJRM, Weiss B (2011) The conjugacy problem in ergodic theory. Ann Math 173(3):1529–1586
Foreman M et al (2019) Rank-one transformations, odometers, and finite factors. arXiv:1910.126J5, pp 1–20
Frantzikinakis N, Kra B (2005) Convergence of multiple ergodic averages for some commuting transformations. Ergodic Theory Dynam Syst 25(3):799–809
Friedman NA (1970) Introduction to ergodic theory, Van Nostrand Reinhold Mathematical Studies, No. 29. Van Nostrand Reinhold Co, New York-Toronto, Ont.-London
Friedman N (1989) Partial mixing, partial rigidity, and factors. Contemp Math 94:141–145
Friedman N (1992) Replication and stacking in ergodic theory. Am Math Mon 99(1):31–41
Friedman N, King J (1991) Rank one lightly mixing. Israel J Math 73:281–288
Friedman NA, Ornstein DS (1970) On isomorphism of weak Bernoulli transformations. Adv Math 5:365–394
Friedman NA, Ornstein DS (1973) Ergodic transformations induce mixing transformations. Adv Math 10(1):147–163
Furstenberg H (1977) Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J Analyse Math 31:204–256
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton
Furstenberg H, Katznelson Y (1978) An ergodic Szemerédi theorem for commuting transformations. J d’Analyse Math 34(1):275–291
Furstenberg H, Weiss B (1978) The finite multipliers of infinite ergodic transformations. In: The structure of
attractors in dynamical systems (Proc. Conf., North Dakota State Univ., Fargo, N.D., 1977). Springer
Gao S, Hill A (2014) A model for rank one measure preserving transformations. Topol Appl 174:25–40
Garsia AM (1965) A simple proof of E. Hopf’s maximal ergodic theorem. J Math Mech 14:381–382
Góra P (1994) Properties of invariant measures for piecewise expanding one-dimensional transformations with summable oscillations of derivative. Ergodic Theory Dynam Syst 14:475–492
Green B, Tao T (2008) The primes contain arbitrarily long arithmetic progressions. Ann Math 167(2):481–547
Grivaux S (2013) IP-Dirichlet measures and IP-rigid dynamical systems: an approach via generalized Riesz products. Stud Math 215(3):237–259
Grivaux S, Roginskaya M (2013) Some new examples of recurrence and non-recurrence sets for products of rotations on the unit circle. Czechoslov Math J 63(3):603–627
Halmos PR (1944) In general a measure-preserving transformation is mixing. Ann Math 45:786–792
Halmos P (1956) Lectures on ergodic theory. Chelsea
Halmos P, von Neumann J (1942) Operator methods in classical mechanics II. Ann Math 43:332–350
Hoffman C, Heicklen D (2002) Rational maps are d-adic Bernoulli. Ann Math 156:103–114
Hoffman C, Rudolph DJ (2002) Uniform endomorphisms which are isomorphic to a Bernoulli shift. Ann Math 156:79–101
Host B (1991) Mixing of all orders and pairwise independent joinings of systems with singular spectrum. Israel J Math 76:289–298
Hu H (2004) Decay of correlations for piecewise smooth maps with indifferent fixed points. Ergodic Theory Dynam Syst 24:495–524
Impagliazzo R, Kabanets V (2010) Constructive proofs of concentration bounds. In: Serna M, Shaltiel R, Jansen K, Rolim J (eds) Approximation, randomization, and combinatorial optimization. Algorithms and techniques. Springer, Berlin, Heidelberg, pp 617–631
Jewett RI (1970) The prevalence of uniquely ergodic systems. J Math Mechanics 19(8):717–729
Kakutani S (1943) 131. Induced measure preserving transformations. Proc Imperial Acad 19(10):635–641
Kakutani S (1986) Selected papers. Birkhäuser
Kalikow S (1982) $T, T^{-1}$ transformation is not loosely Bernoulli. Ann Math 115:393–409
Kalikow S (1984) Twofold mixing implies threefold mixing for rank one transformations. Ergodic Theory Dynam Syst 4:237–259
Kamae T (1982) A simple proof of the ergodic theorem using non-standard analysis. Israel J Math 42:284–290
Kammeyer JW (1990) A complete classification of the two-point extensions of a multidimensional Bernoulli shift. J d’Analyse Math 54(1):113–163
Katok A (1980a) Interval exchange transformations and some special flows are not mixing. Israel J Math 35:301–310
Katok A (1980b) Smooth non-Bernoulli K-automorphisms. Invent Math 61:291–299
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems. Cambridge
Katok A, Stepin AM (1967) Approximations in ergodic theory. Uspekhi Mat Nauk 22:5(137):81–106
Katznelson Y (1971) Ergodic automorphisms of $T^n$ are Bernoulli shifts. Israel J Math 10:186–195
Katznelson Y, Weiss B (1982) A simple proof of some ergodic theorems. Israel J Math 42:291–296
Keane MS, Petersen KE (2006) Nearly simultaneous proofs of the ergodic theorem and maximal ergodic theorem. In: Dynamics and stochastics: Festschrift in Honor of M. S. Keane. Institute of Mathematical Statistics, pp 248–251
Keane M, Smorodinsky M (1979a) Bernoulli schemes of the same entropy are finitarily isomorphic. Ann Math 109:397–406
Keane M, Smorodinsky M (1979b) Finitary isomorphisms of irreducible Markov shifts. Israel J Math 34(4):281–286
Kolmogorov AN (1958) New metric invariant of transitive dynamical systems and endomorphisms of Lebesgue spaces. Dokl Russ Acad Sci 119:861–864
Koopman BO (1931) Hamiltonian systems and Hilbert space. Proc Nat Acad Sci 17:315–318
Kramli A, Simányi N, Szász D (1991) The K-property of three billiard balls. Ann Math 133:37–72
Krieger W et al (1972) On unique ergodicity. In: Proceedings of the sixth Berkeley symposium on mathematical statistics and probability, volume 2: Probability theory. The Regents of the University of California
Le AN (2017) Nilsequences and multiple correlations along subsequences. Ergodic Theory Dynam Syst:1–21
Ledrappier F (1978) Un champ Markovien peut être d’entropie nulle et mélangeant. C R Acad Sci Paris Sér A-B 287:561–563
Lehrer E (1987) Topological mixing and uniquely ergodic systems. Israel J Math 57(2):239–255
Lemańczyk M, Kanigowski A (2022) Spectral theory of dynamical systems. Encyclopedia of Complexity and Systems Science
Liverani C (2004) Decay of correlations. Ann Math 159:1275–1312
Marcus B, Petersen K, Weissman T (2011) Entropy of hidden Markov processes and connections to dynamical systems. London Mathematical Society
Maruyama G (1949) The harmonic analysis of stationary stochastic processes. Memoirs of the Faculty of Science, Kyushu University. Series A Math 4(1):45–106
Masser DW (2004) Mixing and linear equations over groups in positive characteristic. Israel J Math 142:189–204
Masur H (1982) Interval exchange transformations and measured foliations. Ann Math 115:169–200
McCutcheon R, Quas A (2007) Generalized polynomials and mild mixing systems. Can J Math. to appear
McGoff ANK, Mukherjee S, Pillai N (2015) Consistency of maximum likelihood estimation for some dynamical systems. Ann Stat 43(1):1–29
Méla X, Petersen K (2005) Dynamical properties of the Pascal adic transformation. Ergodic Theory Dynam Syst 25:227–256
Muehlegger E, Raich A, Silva C, Zhao W (1997) Lightly mixing on dense algebras. Real Anal Exchange 23:259–265
Newton D, Parry W (1966) On a factor automorphism of a normal dynamical system. Ann Math Statist 37:1528–1533
Ornstein DS (1970) Bernoulli shifts with the same entropy are isomorphic. Adv Math 4:337–352
Ornstein DS (1972) On the root in ergodic theory. In: Proceedings of the sixth Berkeley symposium on mathematical statistics and probability (Univ. California, Berkeley, Calif., 1970/1971), Vol. II: Probability theory. University California Press, pp 347–356
Ornstein DS (1973a) An example of a Kolmogorov automorphism that is not a Bernoulli shift. Adv Math 10:49–62
Ornstein DS (1973b) A K-automorphism with no square root and Pinsker’s conjecture. Adv Math 10:89–102
Ornstein DS (1973c) A mixing transformation for which Pinsker’s conjecture fails. Adv Math 10:103–123
Ornstein DS (1974) Ergodic theory, randomness, and dynamical systems. Yale University Press
Ornstein DS, Shields PC (1973) An uncountable family of K-automorphisms. Adv Math 10:63–88
Ornstein DS, Weiss B (1973) Geodesic flows are Bernoullian. Israel J Math 14:184–198
Ornstein DS, Weiss B (1975) Unilateral codings of Bernoulli systems. Israel J Math 21:159–166
Oxtoby JC (1952) Ergodic sets. Bull Amer Math Soc 58:116–136
Parry W (1981) Topics in ergodic theory. Cambridge
Pelekis C, Ramon J (2017) Hoeffding’s inequality for sums of dependent random variables. Mediterr J Math 14(6):243
Petersen K (1983) Ergodic theory. Cambridge
Petersen K, Schmidt K (1997) Symmetric Gibbs measures. Trans Am Math Soc 349:2775–2811
Phelps R (1966) Lectures on Choquet’s theorem. Van Nostrand
Pinsker MS (1960) Dynamical systems with completely positive or zero entropy. Soviet Math Dokl 1:937–938
Pollard D (1990) Empirical processes: theory and applications. In: NSF-CBMS regional conference series in probability and statistics. JSTOR, pp 1–86
Quas A (1996a) A $C^1$ expanding map of the circle which is not weak-mixing. Israel J Math 93:359–372
Quas A (1996b) Non-ergodicity for $C^1$ expanding maps and g-measures. Ergodic Theory Dynam Syst 16:531–543
Raginsky M, Rakhlin A, Telgarsky M (2017) Non-convex learning via stochastic gradient Langevin dynamics: a nonasymptotic analysis. In: Kale S, Shamir O (eds) Proceedings of the 2017 Conference on learning theory, volume 65 of Proceedings of machine learning research. PMLR, Amsterdam, pp 1674–1703
Rényi A (1957) Representations for real numbers and their ergodic properties. Acta Math Acad Sci Hungar 8:477–493
Riesz F (1938) Some mean ergodic theorems. J London Math Soc 13:274–278
Robertson D (2019) Mild mixing of certain interval-exchange transformations. Ergodic Theory Dynam Syst 39(1):248–256
Rokhlin VA (1948) A ‘general’ measure-preserving transformation is not mixing. Dokl Akad Nauk SSSR Ser Mat 60:349–351
Rokhlin VA (1949) On endomorphisms of compact commutative groups. Izvestiya Akad Nauk SSSR Ser Mat 13:329–340
Rokhlin VA, Sinai Y (1961) Construction and properties of invariant measurable partitions. Dokl Akad Nauk SSSR 141:1038–1041
Rudin W (1966) Real and complex analysis. McGraw Hill
Rudolph D (1976) A two-valued step coding for ergodic flows. Math Z 150(3):201–220
Rudolph DJ (1978) If a two-point extension of a Bernoulli shift has an ergodic square, then it is Bernoulli. Israel J Math 30(1–2):159–180
Rudolph DJ (1990) Fundamentals of measurable dynamics. Oxford
Ryzhikov VV (1993) Joinings and multiple mixing of the actions of finite rank. Funct Anal Appl 27:128–140
Schmidt JP, Siegel A, Srinivasan A (1993) Chernoff-Hoeffding bounds for applications with limited independence. In: Proceedings of the fourth annual ACM-SIAM symposium on discrete algorithms, SODA 93. Society for Industrial and Applied Mathematics, pp 331–340
Shields P, Thouvenot J-P (1975) Entropy zero × Bernoulli processes are closed in the d̄-metric. Ann Probab 3:732–736
Silva CE (2008) Invitation to ergodic theory. Am Math Soc 42
Simányi N (2003) Proof of the Boltzmann-Sinai ergodic hypothesis for typical hard disk systems. Invent Math 154:123–178
Simányi N (2004) Proof of the ergodic hypothesis for typical hard ball systems. Ann Henri Poincaré 5:203–233
Simányi N, Szász D (1999) Hard ball systems are completely hyperbolic. Ann Math 149:35–96
Sinai YG (1959) On the notion of entropy of a dynamical system. Dokl Russ Acad Sci 124:768–771
Sinai YG (1964) On a weak isomorphism of transformations with invariant measure. Mat Sb (NS) 63:23–42
Sinai YG (1970) Dynamical systems with elastic reflections. Ergodic properties of dispersing billiards. Uspehi Mat Nauk 25:141–192
Sinai Y (1976) Introduction to ergodic theory. Princeton. Translation of the 1973 Russian original
Sinai YG, Chernov NI (1987) Ergodic properties of some systems of two-dimensional disks and three-dimensional balls. Uspekhi Mat Nauk 42:153–174
Smorodinsky M (1971) A partition on a Bernoulli shift which is not weakly Bernoulli. Math Syst Th 5:201–203
Tao T (2008) Norm convergence of multiple ergodic averages for commuting transformations. Ergodic Theory Dynam Syst 28(2):657–688
Tikhonov SV (2007) A complete metric in the set of mixing transformations. Mat Sb 198(4):135–158
Vapnik V (2013) The nature of statistical learning theory. Springer
Vapnik VN, Chervonenkis AY (2015) On the uniform convergence of relative frequencies of events to their probabilities. Springer, Cham, pp 11–30
Veech W (1982) Gauss measures for transformations on the space of interval exchange maps. Ann Math 115:201–242
Vershik A (1974) A description of invariant measures for actions of certain infinite-dimensional groups. Soviet Math Dokl 15:1396–1400
Vershik A (1981) Uniform algebraic approximation of shift and multiplication operators. Soviet Math Dokl 24:97–100
Vershik A, Livshits A (1992) Adic models of ergodic transformations, spectral theory, substitutions, and related topics. Rep Theory Dynam Syst 9:185–204
von Neumann J (1932) Proof of the quasi-ergodic hypothesis. Proc Natl Acad Sci U S A 18:70–82
Walters P (1972) Some invariant sigma algebras for measure preserving transformations. Trans Am Math Soc 163:357–368
Walters P (1982) An introduction to ergodic theory. Springer
Williams D (1991) Probability with martingales. Cambridge
Young L-S (1995) Ergodic theory of differentiable dynamical systems. In: Real and complex dynamical systems (Hillerød, 1993). Kluwer, pp 293–336
Young L-S (1998) Statistical properties of dynamical systems with some hyperbolicity. Ann Math 147:585–650
Yuzvinskii SA (1967) Metric automorphisms with a simple spectrum. Soviet Math Dokl 8:243–245
Zhang Q (1996) On convergence of the averages. Monatshefte für Mathematik 122(3):275–300

Ergodic Theory: Recurrence

Nikos Frantzikinakis¹ and Randall McCutcheon²
¹Department of Mathematics, University of Crete, Heraklion, Greece
²Department of Mathematics, University of Memphis, Memphis, TN, USA

Article Outline

Glossary
Definition of the Subject and Its Importance
Introduction
Quantitative Poincaré Recurrence
Subsequence Recurrence
Multiple Recurrence
Connections with Combinatorics and Number Theory
Future Directions
References

Glossary

Almost every, essentially Given a Lebesgue measure space (X, ℬ, m), a property P(x) predicated of elements of X is said to hold for almost every x ∈ X, if the set X \ {x: P(x) holds} has zero measure. Two sets A, B ∈ ℬ are essentially disjoint if m(A ∩ B) = 0.

Conservative system Is an infinite measure preserving system such that for no set A ∈ ℬ with positive measure are $A, T^{-1}A, T^{-2}A, \ldots$ pairwise essentially disjoint.

T: 𝕋 → 𝕋, defined by Tx = 2x mod 1, preserves Lebesgue measure, hence induces a measure preserving system on 𝕋.

Ergodic system Is a measure preserving system (X, ℬ, m, T) (finite or infinite) such that every A ∈ ℬ that is T-invariant (i.e., $T^{-1}A = A$) satisfies either m(A) = 0 or m(X \ A) = 0. (One can check that the rotation $R_a$ is ergodic if and only if a is irrational and that the doubling map is ergodic.)

Ergodic decomposition Every measure-preserving system $(X, \mathcal{X}, m, T)$ can be expressed as an integral of ergodic systems; for example, one can write $m = \int m_t \, d\lambda(t)$, where λ is a probability measure on [0, 1] and $m_t$ are T-invariant probability measures on $(X, \mathcal{X})$ such that the systems $(X, \mathcal{X}, m_t, T)$ are ergodic for t ∈ [0, 1].

Ergodic theorem States that if (X, ℬ, m, T) is a measure preserving system and $f \in L^2(m)$, then $\lim_{N \to \infty} \left\| \frac{1}{N} \sum_{n=1}^{N} T^n f - Pf \right\|_{L^2(m)} = 0$, where Pf denotes the orthogonal projection of the function f onto the subspace $\{f \in L^2(m) : Tf = f\}$.

Hausdorff a-measure Let (X, ℬ, m, T) be a measure preserving system endowed with an m-compatible metric d. The Hausdorff a-measure $\mathcal{H}_a(X)$ of X is an outer measure defined for all subsets of X as follows: First, for A ⊂ X and ε > 0, let $\mathcal{H}_{a,\varepsilon}(A) = \inf \sum_{i=1}^{\infty} r_i^{a}$, where the infimum is taken over all countable coverings of A by sets $U_i \subset X$ with diameter $r_i < \varepsilon$. Then define $\mathcal{H}_a(A) = \limsup_{\varepsilon \to 0} \mathcal{H}_{a,\varepsilon}(A)$.

Infinite measure-preserving system Same as measure preserving system, but m(X) = ∞.
Invertible system Is a measure-preserving sys-
(cn)-Conservative system If (cn)n ℕ is a decreas-
tem (X, ℬ, m, T) (finite or infinite), with the
ing sequence of positive real numbers, a conser-
property that there exists X0 X, with
vative ergodic measure preserving transformation
m(X\X0) ¼ 0, and such that the transformation
T is (cn)-conservative if for some nonnegative
T: X0 ! X0 is bijective, with T 1 measurable.
function f L1(m), 1 n¼1 cn f ðT xÞ ¼ 1 a.e.
n
Measure-preserving system Is a quadruple (X,
Doubling map If is the interval [0, 1] with its
ℬ, m, T), where X is a set, ℬ is a s-algebra of
endpoints identified and addition performed
subsets of X (i.e., ℬ is closed under countable
modulo 1, the (non-invertible) transformation
unions and complementation), m is a probabil- principle was first exploited by Poincaré in his
ity measure (i.e., a countably additive function 1890 King Oscar prize-winning memoir that stud-
from ℬ to [0, 1] with m(X) ¼ 1), and T: X ! X is ied planetary motion. Using the prototype of an
measurable (i.e., T1A ¼ {x X: T x ergodic theoretic argument, he showed that in any
A} ℬ for A ℬ) and m-preserving (i.e., system of point masses having fixed total energy
m(T 1A) ¼ m(A)). Moreover, throughout the that restricts its dynamics to bounded subsets of its
discussion, we assume that the measure space phase space, the typical state of motion (characterized
(X, ℬ, m) is Lebesgue (see Section 1.0 of by configurations and velocities) must recur to an
(Aaronson 1997)). arbitrary degree of approximation.
m-Compatible metric Is a separable metric on Among the recurrence principle’s more spec-
X, where (X, ℬ, m) is a probability space, hav- tacularly counterintuitive ramifications is that iso-
ing the property that open sets measurable. lated ideal gas systems that do not lose energy will
Positive definite sequence Is a complex-valued return arbitrarily closely to their initial states, even
sequence (an)n ℤ such that for any when such a return entails a decrease in entropy
z1 , . . . , zk ℂ, ki,j¼1 aij zi z j 0. from equilibrium, in apparent contradiction to the
Rotations on If is the interval [0, 1] with its second law of thermodynamics. Such concerns,
endpoints identified and addition performed previously canvassed by Poincaré himself, were
modulo 1, then for every a ℝ, the transfor- more infamously expounded by Zermelo (1896)
mation Ra: ! , defined by Rax ¼ x + a, in 1896. Subsequent clarifications by Boltzmann,
preserves Lebesgue measure on and hence Maxwell, and others led to an improved under-
induces a measure preserving system on . standing of the second law’s primarily statistical
Syndetic set Is a subset E ℤ having bounded nature. (For an interesting historical/philosophical
gaps. If G is a general discrete group, a set discussion, see Sklar (2004) and also Bergelson
E G is syndetic if G ¼ F E for some finite (2000). For a probabilistic analysis of the likeli-
set F G. hood of observing second law violations in small
systems over short time intervals, see Evans and
Upper density Is the number dðLÞ ¼
Searles (2002).)
lim supN!1 jL\f2Nþ1
N, ..., N gj
where L ℤ (ass-
, These discoveries had a profound impact in
uming the limit to exist). Alternatively for mea- dynamics, and the theory of measure-preserving
surable E ℝm, d ðEÞ ¼ lim suplðSÞ!1 mmðS\E Þ
ðSÞ , transformations (ergodic theory) evolved from
where S ranges over all cubes in ℝ and l(S)
m
these developments. Since then, the Poincaré
denotes the length of the shortest edge of S. recurrence principle has been applied to a variety
Notation The following notation will be used of different fields in mathematics, physics, and
throughout the article: Tf ¼ f T, {x} ¼ x [x], information theory. In this article we survey the
D-limn!1(an) ¼ a , d({n: |an a| > e} ¼ 0 impact it has had in ergodic theory, especially as
for every e > 0. pertains to the field of ergodic Ramsey theory.
(The heavy emphasis herein on the latter reflects
authorial interest and is not intended to transmit a
Definition of the Subject and Its proportionate image of the broader landscape of
Importance research relating to recurrence in ergodic theory.)
Background information we assume in this article
The basic principle that lies behind several recur- can be found in the books (Einsiedler and Ward
rence phenomena is that the typical trajectory of a 2011; Furstenberg 1981; Glasner 2003; Host and
system with finite volume comes back infinitely Kra 2018; Petersen 1989; Walters 1982) (see also
often to any neighborhood of its initial point. This the chapter “Measure Preserving Systems” by
K. Petersen in this volume). Related information can also be found on the survey articles (Bergelson 1996, 2006a, b; Frantzikinakis 2016; Kra 2006b, 2007, 2011).

Introduction

In this section we shall give several formulations of the Poincaré recurrence principle using the language of ergodic theory. Roughly speaking, the principle states that in a finite (or conservative) measure-preserving system, every set of positive measure (or almost every point) comes back to itself infinitely many times under iteration. Despite the profound importance of these results, their proofs are extremely simple.

Theorem 2.1 (Poincaré Recurrence for Sets) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) > 0$. Then $\mu(A \cap T^{-n}A) > 0$ for infinitely many $n \in \mathbb{N}$.

Proof Since $T$ is measure preserving, the sets $A, T^{-1}A, T^{-2}A, \ldots$ have the same measure. These sets cannot be pairwise essentially disjoint, since then the union of finitely many of them would have measure greater than $\mu(X) = 1$. Therefore, there exist $m, n \in \mathbb{N}$, with $n > m$, such that $\mu(T^{-m}A \cap T^{-n}A) > 0$. Again since $T$ is measure preserving, we conclude that $\mu(A \cap T^{-k}A) > 0$, where $k = n - m > 0$. Repeating this argument for the iterates $A, T^{-m}A, T^{-2m}A, \ldots$, for all $m \in \mathbb{N}$, we easily deduce that $\mu(A \cap T^{-n}A) > 0$ for infinitely many $n \in \mathbb{N}$.

We remark that the above argument actually shows that $\mu(A \cap T^{-n}A) > 0$ for some $n \leq \frac{1}{\mu(A)} + 1$.

Theorem 2.2 (Poincaré Recurrence for Points) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$. Then for almost every $x \in A$, we have that $T^n x \in A$ for infinitely many $n \in \mathbb{N}$.

Proof Let $B$ be the set of $x \in A$ such that $T^n x \notin A$ for all $n \in \mathbb{N}$. Notice that $B = A \setminus \bigcup_{n \in \mathbb{N}} T^{-n}A$; in particular, $B$ is measurable. Since the iterates $B, T^{-1}B, T^{-2}B, \ldots$ are pairwise essentially disjoint, we conclude (as in the proof of Theorem 2.1) that $\mu(B) = 0$. This shows that for almost every $x \in A$, we have that $T^n x \in A$ for some $n \in \mathbb{N}$. Repeating this argument for the transformation $T^m$ in place of $T$ for all $m \in \mathbb{N}$, we easily deduce the advertised statement.

Next we give a variation of Poincaré recurrence for measure-preserving systems endowed with a compatible metric.

Theorem 2.3 (Poincaré Recurrence for Metric Systems) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system, and suppose that $X$ is endowed with a $\mu$-compatible metric. Then for almost every $x \in X$, we have

$$\liminf_{n \to \infty} d(x, T^n x) = 0.$$

The proof of this result is similar to the proof of Theorem 2.2 (see Furstenberg 1981, p. 61). Applying this result to the doubling map $Tx = 2x$ on $\mathbb{T}$, we get that for almost every $x \in X$, every string of zeros and ones in the dyadic expansion of $x$ occurs infinitely often.

We remark that all three formulations of the Poincaré Recurrence Theorem that we have given hold for conservative systems as well. See, e.g., Aaronson (1997) for details.

This article is structured as follows. In section “Quantitative Poincaré Recurrence,” we give a few quantitative versions of the previously mentioned qualitative results. In sections “Subsequence Recurrence” and “Multiple Recurrence,” we give several refinements of the Poincaré recurrence theorem, by restricting the scope of the return time $n$ and by considering multiple intersections (for simplicity we focus on $\mathbb{Z}$-actions). In section “Connections with Combinatorics and Number Theory,” we give various implications of the recurrence results in combinatorics and number theory (see also the chapter “Ergodic Theory: Interactions with Combinatorics and Number Theory” by T. Ward in the present volume). Lastly, in section “Future Directions,” we give several open problems related to the material
presented in sections “Subsequence Recurrence,” “Multiple Recurrence,” and “Connections with Combinatorics and Number Theory.”

Quantitative Poincaré Recurrence

Early Results

For applications it is desirable to have quantitative versions of the results mentioned in the previous section. For example one would like to know how large $\mu(A \cap T^{-n}A)$ can be made and for how many $n$.

Theorem 3.1 (Khintchine 1934) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$. Then for every $\varepsilon > 0$, we have $\mu(A \cap T^{-n}A) > \mu(A)^2 - \varepsilon$ for a set of $n \in \mathbb{N}$ that has bounded gaps.

By considering the doubling map $Tx = 2x$ on $\mathbb{T}$ and letting $A = [0, 1/2)$, it is easy to check that the lower bound of the previous result cannot be improved. We also remark that it is not possible to estimate the size of the gap by a function of $\mu(A)$ alone. One can see this by considering the rotations $R_k x = x + 1/k$ for $k \in \mathbb{N}$, defined on $\mathbb{T}$, and letting $A = [0, 1/3]$.

Concerning the second version of the Poincaré recurrence theorem, it is natural to ask whether for almost every $x \in X$ the set of return times $S_x = \{n \in \mathbb{N} : T^n x \in A\}$ has bounded gaps. This is not the case, as one can see by considering the doubling map $Tx = 2x$ on $\mathbb{T}$ with the Lebesgue measure and letting $A = [0, 1/2)$. Since Lebesgue almost every $x \in \mathbb{T}$ contains arbitrarily large blocks of ones in its dyadic expansion, the set $S_x$ has unbounded gaps. Nevertheless, as an easy consequence of the Birkhoff ergodic theorem (Birkhoff 1931), one has the following.

Theorem 3.2 Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) > 0$. Then for almost every $x \in X$, the set $S_x = \{n \in \mathbb{N} : T^n x \in A\}$ has well-defined density and $\int d(S_x) \, d\mu(x) = \mu(A)$. Furthermore, for ergodic measure-preserving systems, we have $d(S_x) = \mu(A)$ a.e.

Another question that arises naturally is, given a set $A$ with positive measure and an $x \in A$, how long should one wait till some iterate $T^n x$ of $x$ hits $A$. By considering an irrational rotation $R_\alpha$ on $\mathbb{T}$, where $\alpha$ is very near to, but not less than, $\frac{1}{100}$, and letting $A = [0, 1/2]$, one can see that the first return time is a member of the set $\{1, 50, 51\}$. So it may come as a surprise that the average first return time does not depend on the system (as long as it is ergodic) but only on the measure of the set $A$.

Theorem 3.3 (Kac 1947) Let $(X, \mathcal{B}, \mu, T)$ be an ergodic measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) > 0$. For $x \in X$ define $R_A(x) = \min\{n \in \mathbb{N} : T^n x \in A\}$. Then for $x \in A$ the expected value of $R_A(x)$ is $1/\mu(A)$, i.e., $\int_A R_A(x) \, d\mu = 1$.

More Recent Results

As we mentioned in the previous section, if the space $X$ is endowed with a $\mu$-compatible metric $d$, then for almost every $x \in X$, we have that $\liminf_{n \to \infty} d(x, T^n x) = 0$. A natural question is how much iteration is needed to come back within a small distance of a given typical point. Under some additional hypothesis on the metric $d$, we have the following answer.

Theorem 3.4 (Boshernitzan 1993) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system endowed with a $\mu$-compatible metric $d$. Assume that the Hausdorff $\alpha$-measure $\mathcal{H}_\alpha(X)$ of $X$ is $\sigma$-finite (i.e., $X$ is a countable union of sets $X_i$ with $\mathcal{H}_\alpha(X_i) < \infty$). Then for almost every $x \in X$

$$\liminf_{n \to \infty} n^{1/\alpha} \, d(x, T^n x) < \infty.$$

Furthermore, if $\mathcal{H}_\alpha(X) = 0$, then for almost every $x \in X$

$$\liminf_{n \to \infty} n^{1/\alpha} \, d(x, T^n x) = 0.$$
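The quantitative statements of Theorems 3.3 and 3.4 can be checked numerically for circle rotations. The following is a minimal sketch, not taken from the cited papers: the function names are hypothetical, the rotation angles are floating-point stand-ins for irrational numbers, and integrals over $A$ are approximated by averaging over the midpoints of a uniform grid. It estimates the mean first return time to $A = [0, 1/2)$ under the golden-mean rotation (Kac's formula predicts $1/\mu(A) = 2$), lists the return times that occur for an angle slightly above $1/100$, and evaluates $n \, d(x, R_\alpha^n x)$, which for a rotation equals $n$ times the distance from $n\alpha$ to the nearest integer and does not depend on $x$.

```python
import math

# Floating-point stand-in for an irrational rotation angle (an assumption of this sketch).
GOLDEN = (math.sqrt(5) - 1) / 2


def first_return_time(x, alpha, a, b, max_iter=100_000):
    """Smallest n >= 1 such that x + n*alpha (mod 1) lies in [a, b)."""
    y = x
    for n in range(1, max_iter + 1):
        y = (y + alpha) % 1.0
        if a <= y < b:
            return n
    raise RuntimeError("no return within max_iter steps")


def return_times(alpha, a, b, samples=20_000):
    """First return times of the midpoints of a uniform grid on A = [a, b)."""
    width = b - a
    return [
        first_return_time(a + width * (i + 0.5) / samples, alpha, a, b)
        for i in range(samples)
    ]


def circle_distance(t):
    """Distance from t to the nearest integer, i.e. d(x, x + t) on the circle."""
    return abs(t - round(t))


# Kac: the mean return time to A = [0, 1/2) should be close to 1/mu(A) = 2.
times = return_times(GOLDEN, 0.0, 0.5)
mean_return = sum(times) / len(times)

# Angle slightly above 1/100 with A = [0, 1/2): which return times occur?
near_hundredth = sorted(set(return_times(0.0101, 0.0, 0.5)))

# Theorem 3.4 with exponent 1 on the circle: the sequence n * ||n * alpha||.
scaled = [n * circle_distance(n * GOLDEN) for n in range(1, 10_001)]
tail_min = min(scaled[999:])  # smallest value over 1000 <= n <= 10000

print(mean_return, near_hundredth, min(scaled), tail_min)
```

For the golden-mean angle the scaled distances stay bounded away from zero while repeatedly dipping close to $1/\sqrt{5}$, so the liminf in Theorem 3.4 is finite but positive in this example.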
One can see from rotations by “badly approximable” vectors $\alpha \in \mathbb{T}^k$ that the exponent $1/k$ in the previous theorem cannot be improved. Several applications of Theorem 3.4 to billiard flows, dyadic transformations, symbolic flows, and interval exchange transformations are given in Boshernitzan (1993). For a related result dealing with mean values of the limits in Theorem 3.4, see Shkredov (2002).

An interesting connection between rates of recurrence and entropy of an ergodic measure-preserving system was established by Ornstein and Weiss (1993), following earlier work of Wyner and Ziv (1989).

Theorem 3.5 (Ornstein and Weiss 1993) Let $(X, \mathcal{B}, \mu, T)$ be an ergodic measure-preserving system and $\mathcal{P}$ be a finite partition of $X$. Let $\mathcal{P}_n(x)$ be the element of the partition $\bigvee_{i=0}^{n-1} T^{-i}\mathcal{P} = \big\{ \bigcap_{i=0}^{n-1} T^{-i}P^{(i)} : P^{(i)} \in \mathcal{P},\ 0 \leq i < n \big\}$ that contains $x$. Then for almost every $x \in X$, the first return time $R_n(x)$ of $x$ to $\mathcal{P}_n(x)$ is asymptotically equivalent to $e^{h(T,\mathcal{P})n}$, where $h(T, \mathcal{P})$ denotes the entropy of the system with respect to the partition $\mathcal{P}$. More precisely,

$$\lim_{n \to \infty} \frac{\log R_n(x)}{n} = h(T, \mathcal{P}).$$

An extension of the above result to some classes of infinite measure-preserving systems was given in Galatolo et al. (2006).

Another connection of recurrence rates, this time with the local dimension of an invariant measure, is given by the next result.

Theorem 3.6 (Barreira 2001) Let $(X, \mathcal{B}, \mu, T)$ be an ergodic measure-preserving system. Define the upper and lower recurrence rates

$$\underline{R}(x) = \liminf_{r \to 0} \frac{\log \tau_r(x)}{-\log r} \quad \text{and} \quad \overline{R}(x) = \limsup_{r \to 0} \frac{\log \tau_r(x)}{-\log r},$$

where $\tau_r(x)$ is the first return time of $T^k x$ in $B(x, r)$ and the upper and lower pointwise dimensions

$$\underline{d}_\mu(x) = \liminf_{r \to 0} \frac{\log \mu(B(x, r))}{\log r} \quad \text{and} \quad \overline{d}_\mu(x) = \limsup_{r \to 0} \frac{\log \mu(B(x, r))}{\log r}.$$

Then for almost every $x \in X$, we have

$$\underline{R}(x) \leq \underline{d}_\mu(x) \quad \text{and} \quad \overline{R}(x) \leq \overline{d}_\mu(x).$$

Roughly speaking, this theorem asserts that for typical $x \in X$ and for small $r$, the first return time of $x$ in $B(x, r)$ is at most $r^{-d_\mu(x)}$. Since $\underline{d}_\mu(x) \leq \mathcal{H}_\alpha(X)$ for almost every $x \in X$, we can conclude the first part of Theorem 3.4 from Theorem 3.6. For related results the interested reader should consult the survey (Barreira 2005) and the bibliography therein.

We also remark that the previous results and related concepts have been applied to estimate the dimension of certain strange attractors (see Hasley and Jensen (2004) and the references therein) and the entropy of certain Gibbsian systems (Chazottes and Ugalde 2005).

We end this section with a result that connects “wandering rates” of sets in infinite measure-preserving systems with their “recurrence rates.” The next theorem follows easily from a result about lower bounds on ergodic averages for measure-preserving systems due to Leibman (2002); a weaker form for conservative, ergodic systems can be found in Aaronson (1981).

Theorem 3.7 Let $(X, \mathcal{B}, \mu, T)$ be an infinite measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) < \infty$. Then for all $N \in \mathbb{N}$, $\mu\big(\bigcup_{n=0}^{N-1} T^{-n}A\big)$ [...]
(viii) $R = \{[a(1)], [a(2)], \ldots\}$, where $a(x) = x^c$ for any $c > 0$. This follows from Theorem 4.3 below and standard exponential sum estimates; see also Boshernitzan et al. (2005) for a more general result regarding Hardy sequences.
(ix) The set of values of a random non-lacunary sequence. More precisely, pick $n \in \mathbb{N}$ inde- [...]

[...] Then $R$ is a set of recurrence.

We sketch a proof for this result. First, recall Herglotz’s theorem: if $(a_n)_{n \in \mathbb{Z}}$ is a positive definite sequence, then there is a unique measure $\sigma$ on the torus $\mathbb{T}$ such that $a_n = \int e^{2\pi i n t} \, d\sigma(t)$. The case of interest to us is $a_n = \int f(x) f(T^n x) \, d\mu$, where $T$ is measure preserving and $f \in L^\infty(\mu)$; $(a_n)$ is positive definite, and we call $\sigma = \sigma_f$ the spectral measure of $f$.

Let now $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) > 0$. Putting $f = 1_A$, one has [...]

$$\lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} e^{2\pi i k x_n} = 0.$$

This criterion becomes especially useful when paired with van der Corput’s so-called third principal property: if, for every $h \in \mathbb{N}$, $(x_{n+h} - x_n)_{n \in \mathbb{N}}$ is uniformly distributed mod 1, then $(x_n)_{n \in \mathbb{N}}$ is
uniformly distributed mod 1. Using the foregoing criteria and some standard (albeit nontrivial) exponential sum estimates, one can verify, for example, that the sets (iv) and (vii) in Theorem 4.2 are good for recurrence.

In light of the connection elucidated above between uniform distribution mod 1 and recurrence, it is not surprising that van der Corput’s method has been adapted by modern ergodic theorists for use in establishing recurrence properties directly.

[...]

$$D\text{-}\lim_{m \to \infty} \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \langle x_{n+m}, x_n \rangle = 0,$$

then

$$\lim_{N \to \infty} \Big\| \frac{1}{N} \sum_{n=1}^{N} x_n \Big\| = 0.$$

Let us illustrate how one uses this “van der Corput trick” by showing that $S = \{n^2 : n \in \mathbb{N}\}$ is a set of recurrence. We will actually establish the following stronger fact: If $(X, \mathcal{B}, \mu, T)$ is a measure-preserving system and $f \in L^\infty(\mu)$ is nonnegative and $f \neq 0$, then

$$\liminf_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \int f(x) \, f(T^{n^2} x) \, d\mu > 0. \tag{3}$$

Then our result follows by setting $f = 1_A$ for some $A \in \mathcal{B}$ with $\mu(A) > 0$. The main idea is one that occurs frequently in ergodic theory; split the function $f$ into two components, one of which contributes zero to the limit appearing in (3) and the other one being much easier to handle than $f$. To do this consider the $T$-invariant subspace of $L^2(X)$ defined by

$$\mathcal{H} = \overline{\{ f \in L^2(\mu) : \text{there exists } k \in \mathbb{N} \text{ with } T^k f = f \}}. \tag{4}$$

Write $f = g + h$ where $g \in \mathcal{H}$ and $h \perp \mathcal{H}$, and expand the average in (3) into a sum of four averages involving the functions $g$ and $h$. Two of these averages vanish because iterates of $g$ are orthogonal to iterates of $h$. So in order to show that the only contribution comes from the average that involves the function $g$ alone, it suffices to establish that

$$\lim_{N \to \infty} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{n^2} h \Big\|_{L^2(\mu)} = 0. \tag{5}$$

[...]

$$\lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \langle x_{n+m}, x_n \rangle = \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \int T^{n^2 + 2nm + m^2} h \cdot T^{n^2} h \, d\mu.$$

Applying the von Neumann ergodic theorem (von Neumann 1932) to the transformation $T^{2m}$ and using the fact that $h \perp \mathcal{H}$, we get that the last limit is 0. This implies (5).

Thus far we have shown that in order to compute the limit in (3), we can assume that $f = g \in \mathcal{H}$ ($g$ is also nonnegative and $g \neq 0$). By the definition of $\mathcal{H}$, given any $\varepsilon > 0$, there exists a function $f' \in \mathcal{H}$ such that $T^k f' = f'$ for some $k \in \mathbb{N}$ and $\| f - f' \|_{L^2(\mu)} \leq \varepsilon$. Then the limit in (3) is at least $1/k$ times the limit

$$\liminf_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \int f(x) \, f(T^{(kn)^2} x) \, d\mu.$$

Applying the triangle inequality twice, we get that this is greater than or equal to

$$\lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \int f'(x) \, f'(T^{(kn)^2} x) \, d\mu - c\varepsilon = \int (f'(x))^2 \, d\mu - c\varepsilon \geq \Big( \int f'(x) \, d\mu \Big)^2 - c\varepsilon,$$

for some constant $c$ that does not depend on $\varepsilon$ (we used that $T^k f' = f'$ and the Cauchy-Schwarz inequality). Choosing $\varepsilon$ small enough, we
conclude that the last quantity is positive, completing the proof.

Multiple Recurrence

Simultaneous multiple returns of positive measure sets to themselves were first considered by H. Furstenberg (1977), who gave a new proof of Szemerédi’s theorem (Szemerédi 1975) on arithmetic progressions by deriving it from the following theorem.

Theorem 5.1 (Furstenberg 1977) Let $(X, \mathcal{B}, \mu, T)$ be a measure-preserving system and $A \in \mathcal{B}$ with $\mu(A) > 0$. Then for every $k \in \mathbb{N}$, there is some $n \in \mathbb{N}$ such that

$$\mu(A \cap T^{-n}A \cap \cdots \cap T^{-kn}A) > 0. \tag{6}$$

Furstenberg’s proof came by means of a new structure theorem allowing one to decompose an arbitrary measure-preserving system into component elements exhibiting one of two extreme types of behavior: compactness, characterized by regular, “almost periodic” trajectories, and weak mixing, characterized by irregular, “quasi-random” trajectories. On $\mathbb{T}$, these types of behavior are exemplified by rotations and by the doubling map, respectively. To see the point, imagine trying to predict the initial digit of the dyadic expansion of $T^n x$ given knowledge of the initial digits of $T^i x$, $1 \leq i < n$. We use the case $k = 2$ to illustrate the basic idea.

It suffices to show that if $f \in L^\infty(\mu)$ is nonnegative and $f \neq 0$, one has

$$\liminf_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \int f(x) \, f(T^n x) \, f(T^{2n} x) \, d\mu > 0. \tag{7}$$

[...] components. Let $\mathcal{K}$ be the closure in $L^2$ of the subspace spanned by the eigenfunctions of $T$, i.e., the functions $f \in L^2(\mu)$ that satisfy $f(Tx) = e^{2\pi i \alpha} f(x)$ for some $\alpha \in \mathbb{R}$. We write $f = g + h$, where $g \in \mathcal{K}$ and $h \perp \mathcal{K}$. It can be shown that $g, h \in L^\infty(\mu)$ and $g$ is again nonnegative with $g \neq 0$. We expand the average in (7) into a sum of eight averages involving the functions $g$ and $h$. In order to show that the only nonzero contribution to the limit comes from the term involving $g$ alone, it suffices to establish that

$$\lim_{N \to \infty} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^n g \cdot T^{2n} h \Big\|_{L^2(\mu)} = 0, \tag{8}$$

(and similarly with $h$ and $g$ interchanged, and with $g = h$, which is similar). To establish (8), we use the Hilbert space van der Corput lemma on $x_n = T^n g \cdot T^{2n} h$. Some routine computations and a use of the ergodic theorem reduce the task to showing that

$$D\text{-}\lim_{m \to \infty} \int h(x) \, h(T^{2m} x) \, d\mu = 0.$$

But this is well known for $h \perp \mathcal{K}$ (e.g., in virtue of the fact that for $h \perp \mathcal{K}$ the spectral measure $\sigma_h$ is continuous). We are left with the average (7) when $f = g \in \mathcal{K}$. In this case $f$ can be approximated arbitrarily well by a linear combination of eigenfunctions, which easily implies that given $\varepsilon > 0$, one has $\| T^n f - f \|_{L^2(\mu)} \leq \varepsilon$ for a set of $n \in \mathbb{N}$ with bounded gaps. Using this fact and the triangle inequality, one finds that for a set of $n \in \mathbb{N}$ with bounded gaps,

$$\int f(x) \, f(T^n x) \, f(T^{2n} x) \, d\mu \geq \Big( \int f \, d\mu \Big)^3 - c\varepsilon$$

[...]
To expedite discussion of some of these developments, we introduce a definition:

Definition 5.2 Let $R \subset \mathbb{Z}$ and $k \in \mathbb{N}$. Then $R$ is a set of $k$-recurrence if for every invertible measure-preserving system $(X, \mathcal{B}, \mu, T)$ and $A \in \mathcal{B}$ with $\mu(A) > 0$ there is some nonzero $n \in R$ such that

$$\mu(A \cap T^{-n}A \cap \cdots \cap T^{-kn}A) > 0.$$

The notions of $k$-recurrence are distinct for different values of $k$. An example of a difference set that is a set of 1-recurrence but not a set of 2-recurrence was given in (Furstenberg 1977); sets of $k$-recurrence that are not sets of $(k+1)$-recurrence for general $k$ were given in Frantzikinakis et al. (2006) ($R_k = \{ n \in \mathbb{N} : \{ n^{k+1} \sqrt{2} \} \in [1/4, 3/4] \}$ is such).

Aside from difference sets, the sets of (1-)recurrence given in Theorem 4.2 may well be sets of $k$-recurrence for every $k \in \mathbb{N}$, though this has not been verified in all cases. Let us summarize the current state of knowledge. The following are sets of $k$-recurrence for every $k$: sets of the form $\bigcup_{n \in \mathbb{N}} \{a_n, 2a_n, \ldots, na_n\}$ where $a_n \in \mathbb{N}$ (this follows from a uniform version of Theorem 5.1 that can be found in Bergelson et al. (2000)); every IP-set (Furstenberg and Katznelson 1985); the set $\{p(n), n \in \mathbb{N}\}$ where $p$ is any nonconstant integer polynomial with $p(0) = 0$ (Bergelson and Leibman 1996) and, more generally, when the range of the polynomial contains multiples of an arbitrary integer (Frantzikinakis 2008); the set $\{p(n), n \in S\}$ where $p$ is an integer polynomial with $p(0) = 0$ and $S$ is any IP-set (Bergelson and McCutcheon 2000); and the set of values of an admissible generalized polynomial (Bergelson and McCutcheon 2010; McCutcheon 2005). Moreover, it was shown in Frantzikinakis et al. (2007) for $k = 2$ and in Wooley and Ziegler (2012) for general $k \in \mathbb{N}$ that the sets of shifted primes $\{p - 1, p \text{ prime}\}$ (or the set $\{p + 1, p \text{ prime}\}$) are sets of $k$-recurrence (see also Bergelson et al. (2011), Frantzikinakis et al. (2013), Koutsogiannis (2018a), and Sun (2015) for related work). Several other multiple recurrence results were obtained in the last 10 years, including results for Hardy sequences (Bergelson et al. “Single and multiple recurrence. . .”; Frantzikinakis 2009, 2010; Frantzikinakis and Wierdl 2009), integer part polynomial sequences (Karageorgos and Koutsogiannis “Integer part independent. . .”; Koutsogiannis 2018a, b), random sequences (Frantzikinakis et al. 2012, 2016), and sets of arithmetic nature (Bergelson et al. “A structure theorem for. . .”; Frantzikinakis and Host 2017a, b).

More generally, one would like to know for which sequences of integers $a_1(n), \ldots, a_k(n)$ it is the case that for every invertible measure-preserving system $(X, \mathcal{B}, \mu, T)$ and $A \in \mathcal{B}$ with $\mu(A) > 0$, there is some nonzero $n \in \mathbb{N}$ such that

$$\mu(A \cap T^{-a_1(n)}A \cap \cdots \cap T^{-a_k(n)}A) > 0. \tag{9}$$

Unfortunately, a criterion analogous to the one given in Theorem 4.3 for 1-recurrence is not yet available for $k$-recurrence when $k > 1$. Nevertheless, there have been some notable positive results, such as the following:

Theorem 5.3 (Bergelson and Leibman 1996) Let $(X, \mathcal{B}, \mu, T)$ be an invertible measure-preserving system and $p_1(n), \ldots, p_k(n)$ be integer polynomials with zero constant term. Then for every $A \in \mathcal{B}$ with $\mu(A) > 0$, there is some $n \in \mathbb{N}$ such that

$$\mu(A \cap T^{-p_1(n)}A \cap \cdots \cap T^{-p_k(n)}A) > 0. \tag{10}$$

Furthermore, it has been shown that the $n$ in (10) can be chosen from any IP set (Bergelson and McCutcheon 2000) and the polynomials $p_1, \ldots, p_k$ can be chosen to belong to the more general class of admissible generalized polynomials (McCutcheon 2005) or the class of intersective polynomials (Bergelson et al. 2008).

An important boost in the area of multiple recurrence was given by a breakthrough of Host and Kra (2005). Building on work of Conze and Lesigne (1984, 1988) and Furstenberg and Weiss (Furstenberg and Weiss 1996) (see also the excellent survey of Kra (2006a), exploring close
parallels with Green and Tao (2008), and the seminal paper of Gowers (2001)), they isolated the structured component (or factor) of a measure-preserving system that one needs to analyze in order to prove several multiple recurrence and convergence results. This allowed them, in particular, to prove existence of $L^2$ limits for the so-called “Furstenberg ergodic averages” $\frac{1}{N} \sum_{n=1}^{N} \prod_{i=0}^{k} f(T^{in} x)$, which had been a major open problem since the original ergodic proof of Szemerédi’s theorem. Subsequently Ziegler in Ziegler (2007) gave a new proof of the aforementioned limit theorem and established minimality of the factor in question. It turns out that this minimal component admits of a purely algebraic characterization; it is a nilsystem, i.e., a rotation on a homogeneous space of a nilpotent Lie group. This fact, coupled with some recent results about nilsystems (see Leibman (2005a, b) for example), makes the analysis of some otherwise intractable multiple recurrence problems much more manageable. These developments have made it possible to obtain new multiple recurrence results, and they also allowed us to estimate the size of the multiple intersection in (6) for $k = 2, 3$ (the case $k = 1$ is Theorem 3.1).

Theorem 5.4 (Bergelson et al. 2005) Let $(X, \mathcal{B}, \mu, T)$ be an ergodic measure-preserving system and $A \in \mathcal{B}$. Then for $k = 2, 3$ and for every $\varepsilon > 0$,

$$\mu(A \cap T^{-n}A \cap \cdots \cap T^{-kn}A) > \mu^{k+1}(A) - \varepsilon \tag{11}$$

[...]

When the polynomials $n, 2n, \ldots, kn$ are replaced by linearly independent polynomials $p_1, p_2, \ldots, p_k$ with zero constant term, similar lower bounds hold for every $k \in \mathbb{N}$ without assuming ergodicity (Frantzikinakis and Kra 2006). The case where the polynomials $n, 2n, 3n$ are replaced with general polynomials $p_1, p_2, p_3$ with zero constant term is treated in Frantzikinakis (2008) (see also Donoso et al. “Optimal lower bounds for multiple. . .”), and more general results involving Hardy field sequences and polynomials evaluated at the primes are obtained in Donoso et al. “Optimal lower bounds for multiple. . ..”

Connections with Combinatorics and Number Theory

The combinatorial ramifications of ergodic-theoretic recurrence were first observed by Furstenberg, who perceived a correspondence between recurrence properties of measure-preserving systems and the existence of structures in sets of integers having positive upper density. This gave rise to the field of ergodic Ramsey theory, in which problems in combinatorial number theory are treated using techniques from ergodic theory. The following formulation is from Bergelson (1987b).

Theorem 6.1 Let $\Lambda$ be a subset of the integers. There exists an invertible measure-preserving system $(X, \mathcal{B}, \mu, T)$ and a set $A \in \mathcal{B}$ with $\mu(A) = \bar{d}(\Lambda)$ such that [...]
[...] we can find an increasing sequence of integers $(N_m)_{m \in \mathbb{N}}$ such that $\lim_{m \to \infty} |\Lambda \cap [1, N_m]| / N_m = \bar{d}(\Lambda)$ and such that

$$\lim_{m \to \infty} \frac{|(\Lambda^{i_1} - n_1) \cap (\Lambda^{i_2} - n_2) \cap \cdots \cap (\Lambda^{i_r} - n_r) \cap [1, N_m]|}{N_m} \tag{13}$$

exists for every $n_1, \ldots, n_r \in \mathbb{Z}$, and $i_1, \ldots, i_r \in \{0, 1\}$. For $n_1, n_2, \ldots, n_r \in \mathbb{Z}$, and $i_1, i_2, \ldots, i_r \in \{0, 1\}$, we define the measure $\mu$ of the cylinder set $\{x_{n_1} = i_1, x_{n_2} = i_2, \ldots, x_{n_r} = i_r\}$ to be the limit (13). Thus defined, $\mu$ extends to a premeasure on the algebra of sets generated by cylinder sets and hence by Carathéodory’s extension theorem (Carathéodory 1968) to a probability measure on $\mathcal{B}$. It is easy to check that $\mu(A) = \bar{d}(\Lambda)$, the shift transformation $T$ preserves the measure $\mu$ and (12) holds.

Using this principle for $k = 1$, one may check that any set of recurrence is intersective, that is intersects $E - E$ for every set $E$ of positive density. Using it for $n_1 = n, n_2 = 2n, \ldots, n_k = kn$, together with Theorem 5.1, one gets an ergodic proof of Szemerédi’s theorem (Szemerédi 1975), stating that every subset of the integers with positive upper density contains arbitrarily long arithmetic progressions (conversely, one can easily deduce Theorem 5.1 from Szemerédi’s theorem and that intersective sets are sets of recurrence). Making the choice $n_1 = n^2$ and using part (iv) of Theorem 4.2, we get an ergodic proof of the surprising result of Sárközy (1978) stating that every subset of the integers with positive upper density contains two elements whose difference is a perfect square. More generally, using Theorem 6.1, one can translate all of the recurrence results of the previous two sections to results in combinatorics. (This is not straightforward for Theorem 5.4 because of the ergodicity assumption made there. We refer the reader to Bergelson et al. (2005) for the combinatorial consequence of this result.) We mention explicitly only the combinatorial consequence of Theorem 5.3.

Theorem 6.2 (Bergelson and Leibman 1996) Let $\Lambda \subset \mathbb{Z}$ with $\bar{d}(\Lambda) > 0$ and $p_1, \ldots, p_k$ be integer polynomials with zero constant term. Then $\Lambda$ contains infinitely many configurations of the form $\{x, x + p_1(n), \ldots, x + p_k(n)\}$.

The ergodic proof is the only one known for this result, although very recently some special cases were covered in Peluse and Prendiville (“Quantitative bounds in the. . .”) and Prendiville (2017) using more elementary (but complicated) arguments.

Ergodic-theoretic contributions to the field of geometric Ramsey theory were made by Furstenberg et al. (1990), who showed that if $E$ is a positive upper density subset of $\mathbb{R}^2$, then (i) $E$ contains points with any large enough distance (see also Bourgain (1986) and Falconer and Marstrand (1986)) and (ii) every $\delta$-neighborhood of $E$ contains three points forming a triangle congruent to any given large enough dilation of a given triangle (in Bourgain (1986), it is shown that if the three points lie on a straight line, one cannot always find three points with this property in $E$ itself). Moreover, a generalization of property (ii) to arbitrary finite configurations of $\mathbb{R}^m$ was obtained by Ziegler (2006).

We also mention some exciting connections of multiple recurrence with some structural properties of the set of prime numbers. The first one is in the work of Green and Tao (2008), where the existence of arbitrarily long arithmetic progressions of primes was demonstrated; the authors, in addition to using Szemerédi’s theorem outright, use several ideas from its ergodic-theoretic proofs, as appearing in Furstenberg (1977) and Furstenberg et al. (1982). The second one is in the recent work of Tao and Ziegler (2008); a quantitative version of Theorem 5.3 was used to prove that the primes contain arbitrarily long polynomial progressions. Furthermore, results in ergodic theory related to the structure of the minimal characteristic factors of certain multiple ergodic averages play an important role in the work of Green, Tao, and Ziegler (Green and Tao 2010, 2012a, b, 2014; Green et al. 2012), where they get asymptotic formulas for the number of $k$-term arithmetic progressions of primes up to $x$. This work verifies an interesting special case of
Ergodic Theory: Recurrence 73
the Hardy-Littlewood k-tuple conjecture predicting the asymptotic growth rate of $N_{a_1, \ldots, a_k}(x)$ = the number of configurations of primes having the form $\{p, p + a_1, \ldots, p + a_k\}$ with p ≤ x.

In a more recent development, the tools developed in the last two decades to deal with delicate multiple recurrence problems have played an instrumental role in analyzing the structure of measure-preserving systems naturally associated with bounded multiplicative functions. These results were used in the last 2 years in works of Tao and Teräväinen (The structure of logarithmically. . .; The structure of correlations. . .) to make progress on the Chowla and Elliott conjectures and in works of Frantzikinakis and Host (2018, Furstenberg systems of bounded. . .) to make progress on the Möbius disjointness conjecture of Sarnak. It appears that this interplay of ergodic theory and number theory will be useful for the resolution of several notoriously difficult problems concerning higher-order correlations of multiplicative and other number theoretic functions.

Finally, we remark that in this article we have restricted attention to multiple recurrence and Furstenberg correspondence for ℤ-actions, while in fact there is a wealth of literature on extensions of these results to general commutative, amenable, and even non-amenable groups. For an excellent exposition of these and other recent developments, the reader is referred to the survey articles (Austin “Multiple recurrence and finding. . .”; Bergelson 1996, 2006a, b). Here, we give just one notable combinatorial corollary to some work of this kind, a density version of the classical Hales-Jewett coloring theorem (Hales and Jewett 1963).

Theorem 6.3 (Furstenberg and Katznelson (1991); see also Polymath (2012)) Let $W_n(A)$ denote the set of words of length n with letters in the alphabet $A = \{a_1, \ldots, a_k\}$. For every ε > 0, there exists $N_0 = N_0(\varepsilon, k)$ such that if $n \geq N_0$, then any subset S of $W_n(A)$ with $|S| \geq \varepsilon k^n$ contains a combinatorial line, i.e., a set consisting of k n-letter words, having fixed letters in l positions, for some 0 ≤ l < n, the remaining n − l positions being occupied by a variable letter x, for $x = a_1, \ldots, a_k$. (For example, in $W_4(A)$, the sets $\{(a_1, x, a_2, x) : x \in A\}$ and $\{(x, x, x, x) : x \in A\}$ are combinatorial lines.)

At first glance, the uninitiated reader may not appreciate the importance of this “master” density result, so it is instructive to derive at least one of its immediate consequences. Let A = {0, 1, . . . , k − 1} and interpret $W_n(A)$ as integers in base k having at most n digits. Then a combinatorial line in $W_n(A)$ is an arithmetic progression of length k; for example, the line $\{(a_1, x, a_2, x) : x \in A\}$ corresponds to the progression {m, m + n, m + 2n, m + 3n}, where $m = a_1 + a_2 d^2$ and $n = d + d^3$. This allows one to deduce Szemerédi’s theorem. Similarly, one can deduce from Theorem 6.3 multidimensional and IP extensions of Szemerédi’s theorem (Furstenberg and Katznelson 1979, 1985) and some related results about vector spaces over finite fields (Furstenberg and Katznelson 1985).

Future Directions

In this section we formulate a few open problems relating to the material in the previous three sections. It should be noted that this selection reflects the authors’ interests and does not strive for completeness. A more extensive list of problems related to ergodic theory of ℤ-actions can be found in Frantzikinakis (2016).

We start with an intriguing question of Katznelson (2001) about sets of topological recurrence. A set S ⊂ ℕ is a set of Bohr recurrence if for every $\alpha_1, \ldots, \alpha_k \in \mathbb{R}$ and ε > 0 there exists s ∈ S such that $\{s\alpha_i\} \in [0, \varepsilon] \cup [1 - \varepsilon, 1)$ for i = 1, . . . , k.

Problem 1 Is every set of Bohr recurrence a set of topological recurrence?

Background for this problem and evidence for a positive answer can be found in Katznelson (2001) and Weiss (2000). A negative answer for a related question concerning measure theoretic recurrence is given in Kriz (1987) (see also Griesmer “Bohr topology and difference sets. . .”).
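The Bohr recurrence condition stated above can be probed numerically for a concrete set. The following is a minimal sketch under stated assumptions, not part of the original article: the candidate set (perfect squares), the frequencies √2 and √3, the tolerance 0.1, and the function names `in_bohr_window` and `first_witness` are all illustrative choices. A finite search of this kind can only exhibit a witness s for one particular choice of frequencies and ε; it says nothing about Problem 1 itself.

```python
import math

def in_bohr_window(s, alphas, eps):
    """True if the fractional part {s * a} lies in [0, eps] or [1 - eps, 1) for every a in alphas."""
    for a in alphas:
        frac = (s * a) % 1.0  # fractional part of s * a (floating point)
        if not (frac <= eps or frac >= 1.0 - eps):
            return False
    return True

def first_witness(candidates, alphas, eps):
    """Return the first s among candidates that passes the window test, or None if none does."""
    for s in candidates:
        if in_bohr_window(s, alphas, eps):
            return s
    return None

# Illustrative (hypothetical) data: S = perfect squares, two irrational frequencies.
squares = [n * n for n in range(1, 20001)]
alphas = (math.sqrt(2), math.sqrt(3))
eps = 0.1
witness = first_witness(squares, alphas, eps)
print(witness)
```

The test `frac <= eps or frac >= 1 - eps` is a direct transcription of the condition {sα_i} ∈ [0, ε] ∪ [1 − ε, 1); floating-point rounding limits how large s and how small ε can sensibly be taken.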
Bergelson V, Host B, McCutcheon R, Parreau F (2000) Aspects of uniformity in recurrence. Colloq Math 84/85(Part 2):549–576
Bergelson V, Host B, Kra B, with an appendix by Ruzsa I (2005) Multiple recurrence and nilsequences. Invent Math 160(2):261–303
Bergelson V, Håland-Knutson I, McCutcheon R (2006) IP systems, generalized polynomials and recurrence. Ergodic Theory Dyn Syst 26:999–1019
Bergelson V, Leibman A, Lesigne E (2008) Intersective polynomials and the polynomial Szemerédi theorem. Adv Math 219(1):369–388
Bergelson V, Leibman A, Ziegler T (2011) The shifted primes and the multidimensional Szemerédi and polynomial van der Waerden Theorems. Comptes Rendus Mathematique 349(3–4):123–125
Bergelson V, Kolesnik G, Son Y (2019) Uniform distribution of subpolynomial functions along primes and applications. J Anal Math 137(1):135–187
Bergelson V, Kułaga-Przymus J, Lemańczyk M. A structure theorem for level sets of multiplicative functions and applications. To appear in Int Math Res Not. arXiv:1708.02613
Bergelson V, Moreira J, Richter FK. Single and multiple recurrence along non-polynomial sequences. Preprint, arXiv:1711.05729
Bhattacharya B, Ganguly S, Shao X, Zhao Y. Upper tails large deviations for arithmetic progressions in a random set. To appear in Int Math Res Note. arXiv:1605.02994
Birkhoff G (1931) A proof of the ergodic theorem. Proc Natl Acad Sci U S A 17:656–660
Boshernitzan M (1993) Quantitative recurrence results. Invent Math 113:617–631
Boshernitzan M, Kolesnik G, Quas A, Wierdl M (2005) Ergodic averaging sequences. J Anal Math 95:63–103
Bourgain J (1986) A Szemerédi type theorem for sets of positive density in ℝ^k. Israel J Math 54(3):307–316
Bourgain J (1988) On the maximal ergodic theorem for certain subsets of the positive integers. Israel J Math 61:39–72
Briët J, Gopi S. Gaussian width bounds with applications to arithmetic progressions in random settings. To appear in Int Math Res Note. arXiv:1711.05624
Briët J, Dvir Z, Gopi S (2017) Outlaw distributions and locally decodable codes. Proc ITCS arXiv:1609.06355
Brown T, Graham R, Landman B (1999) On the set of common differences in van der Waerden’s theorem on arithmetic progressions. Can Math Bull 42:25–36
Carathéodory C (1968) Vorlesungen über reelle Funktionen, 3rd edn. Chelsea Publishing, New York
Chazottes J, Ugalde E (2005) Entropy estimation and fluctuations of hitting and recurrence times for Gibbsian sources. Discrete Contin Dyn Syst Ser B 5(3):565–586
Christ M. On random multilinear operator inequalities. Unpublished manuscript. Available at arXiv:1108.5655
Conze J, Lesigne E (1984) Théorèmes ergodiques pour des mesures diagonales. Bull Soc Math France 112(2):143–175
Conze J, Lesigne E (1988) Sur un théorème ergodique pour des mesures diagonales. In: Probabilités, Publ Inst Rech Math Rennes, 1987-1. University of Rennes I, Rennes, pp 1–31
Donoso S, Le AN, Moreira J, Sun W. Optimal lower bounds for multiple recurrence. To appear in Ergodic Theory Dyn Syst. arXiv:1809.06912
Einsiedler M, Ward T (2011) Ergodic theory with a view towards number theory. Graduate texts in mathematics, vol 259. Springer London, London
Evans D, Searles D (2002) The fluctuation theorem. Adv Phys 51:1529–1585
Falconer K, Marstrand J (1986) Plane sets with positive density at infinity contain all large distances. Bull Lond Math Soc 18:471–474
Forrest A (1991) The construction of a set of recurrence which is not a set of strong recurrence. Israel J Math 76:215–228
Frantzikinakis N (2008) Multiple ergodic averages for three polynomials and applications. Trans Am Math Soc 360(10):5435–5475
Frantzikinakis N (2009) Equidistribution of sparse sequences on nilmanifolds. J Anal Math 109:353–395
Frantzikinakis N (2010) Multiple recurrence and convergence for Hardy sequences of polynomial growth. J Anal Math 112:79–135
Frantzikinakis N (2016) Some open problems on multiple ergodic averages. Bull Hell Math Soc 60:41–90
Frantzikinakis N, Host B (2017a) Higher order Fourier analysis of multiplicative functions and applications. J Am Math Soc 30:67–157
Frantzikinakis N, Host B (2017b) Multiple ergodic theorems for arithmetic sets. Trans Am Math Soc 369(10):7085–7105
Frantzikinakis N, Host B (2018) The logarithmic Sarnak conjecture for ergodic weights. Ann Math 187:869–931
Frantzikinakis N, Host B. Furstenberg systems of bounded multiplicative functions and applications. To appear in Int Math Res Note IMRN. arXiv:1804.08556
Frantzikinakis N, Kra B (2006) Ergodic averages for independent polynomials and applications. J London Math Soc 74(1):131–142
Frantzikinakis N, Wierdl M (2009) A Hardy field extension of Szemerédi’s theorem. Adv Math 222:1–43
Frantzikinakis N, Lesigne E, Wierdl M (2006) Sets of k-recurrence but not (k + 1)-recurrence. Ann Inst Fourier 56(4):839–849
Frantzikinakis N, Host B, Kra B (2007) Multiple recurrence and convergence for sets related to the primes. J Reine Angew Math 611:131–144
Frantzikinakis N, Lesigne E, Wierdl M (2012) Random sequences and pointwise convergence of multiple ergodic averages. Indiana Univ Math J 61:585–617
Frantzikinakis N, Host B, Kra B (2013) The polynomial multidimensional Szemerédi Theorem along shifted primes. Israel J Math 194:331–348
Frantzikinakis N, Lesigne E, Wierdl M (2016) Random differences in Szemerédi’s theorem and related results. J Anal Math 130:91–133
Furstenberg H (1977) Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J Anal Math 31:204–256
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton University Press, Princeton
Furstenberg H, Katznelson Y (1979) An ergodic Szemerédi theorem for commuting transformations. J Anal Math 34:275–291
Furstenberg H, Katznelson Y (1985) An ergodic Szemerédi theorem for IP-systems and combinatorial theory. J Anal Math 45:117–168
Furstenberg H, Katznelson Y (1991) A density version of the Hales-Jewett theorem. J Anal Math 57:64–119
Furstenberg H, Weiss B (1996) A mean ergodic theorem for $\frac{1}{N}\sum_{n=1}^{N} f(T^n x)\,g(T^{n^2} x)$. Convergence in ergodic theory and probability (Columbus, OH, 1993), Ohio State Univ Math Res Inst Publ, vol 5, de Gruyter, Berlin, pp 193–227
Furstenberg H, Katznelson Y, Ornstein D (1982) The ergodic theoretical proof of Szemerédi’s theorem. Bull Am Math Soc 7(3):527–552
Furstenberg H, Katznelson Y, Weiss B (1990) Ergodic theory and configurations in sets of positive density. In: Mathematics of Ramsey theory. Algorithms combinatorics, vol 5. Springer, Berlin, pp 184–198
Galatolo S, Kim DH, Park KK (2006) The recurrence time for ergodic systems with infinite invariant measures. Nonlinearity 19:2567–2580
Glasner E (2003) Ergodic theory via joinings. Mathematical surveys and monographs, vol 101. American Mathematical Society, Providence
Gowers W (2001) A new proof of Szemerédi’s theorem. Geom Funct Anal 11:465–588
Graham RL (1994) Recent trends in Euclidean Ramsey theory. Trends in discrete mathematics. Discrete Math 136(1–3):119–127
Green B, Tao T (2008) The primes contain arbitrarily long arithmetic progressions. Ann Math 167:481–547
Green B, Tao T (2010) Linear equations in primes. Ann Math 171:1753–1850
Green B, Tao T (2012a) The quantitative behaviour of polynomial orbits on nilmanifolds. Ann Math 175:465–540
Green B, Tao T (2012b) The Möbius function is strongly orthogonal to nilsequences. Ann Math 175:541–566
Green B, Tao T (2014) On the quantitative distribution of polynomial nilsequences- erratum. Ann Math 179:1175–1183, arXiv:1311.6170v3
Green B, Tao T, Ziegler T (2012) An inverse theorem for the Gowers $U^{s+1}[N]$-norm. Ann Math 176(2):1231–1372
Griesmer J. Bohr topology and difference sets for some abelian groups. Preprint, arXiv:1608.01014
Hales A, Jewett R (1963) Regularity and positional games. Trans Am Math Soc 106:222–229
Hasley T, Jensen M (2004) Hurricanes and butterflies. Nature 428:127–128
Host B, Kra B (2005) Nonconventional ergodic averages and nilmanifolds. Ann Math 161:397–488
Host B, Kra B (2018) Nilpotent structures in Ergodic theory. Mathematical surveys and monographs, vol 236. American Mathematical Society, Providence
Kac M (1947) On the notion of recurrence in discrete stochastic processes. Bull Am Math Soc 53:1002–1010
Kamae T, Mendés-France M (1978) Van der Corput’s difference theorem. Israel J Math 31:335–342
Karageorgos D, Koutsogiannis A. Integer part independent polynomial averages and applications along primes. To appear in Studia Math. arXiv:1708.06820
Katznelson Y (2001) Chromatic numbers of Cayley graphs on ℤ and recurrence. Paul Erdös and his mathematics (Budapest, 1999). Combinatorica 21(2):211–219
Khintchine A (1934) Eine Verschärfung des Poincaréschen “Wiederkehrsatzes”. Comp Math 1:177–179
Koutsogiannis A (2018a) Closest integer polynomial multiple recurrence along shifted primes. Ergodic Theory Dyn Syst 38:666–685
Koutsogiannis A (2018b) Integer part polynomial correlation sequences. Ergodic Theory Dyn Syst 38:1525–1542
Kra B (2006a) The Green-Tao theorem on arithmetic progressions in the primes: an ergodic point of view. Bull Am Math Soc 43:3–23
Kra B (2006b) From combinatorics to ergodic theory and back again. In: Proceedings of international congress of mathematicians, vol III. Madrid, pp 57–76
Kra B (2007) Ergodic methods in additive combinatorics. In: Additive combinatorics, CRM proceedings and lecture notes, vol 43. American Mathematical Society, Providence, pp 103–143
Kra B (2011) Poincaré recurrence and number theory: thirty years later. Bull Am Math Soc 48:497–501
Kriz I (1987) Large independent sets in shift invariant graphs. Solution of Bergelson’s problem. Graphs Comb 3:145–158
Leibman A (2002) Lower bounds for ergodic averages. Ergodic Theory Dyn Syst 22:863–872
Leibman A (2005a) Pointwise convergence of ergodic averages for polynomial sequences of rotations of a nilmanifold. Ergodic Theory Dyn Syst 25:201–213
Leibman A (2005b) Pointwise convergence of ergodic averages for polynomial actions of ℤ^d by translations on a nilmanifold. Ergodic Theory Dyn Syst 25:215–225
McCutcheon R (1995) Three results in recurrence. In: Ergodic theory and its connections with harmonic analysis (Alexandria, 1993). London mathematical society lecture note series, vol 205. Cambridge University Press, Cambridge, pp 349–358
McCutcheon R (2005) FVIP systems and multiple recurrence. Israel J Math 146:157–188
Meyerovitch T. On multiple and polynomial recurrent extensions of infinite measure preserving transformations. Unpublished. Available at arXiv:0703914v2
Ornstein D, Weiss B (1993) Entropy and data compression schemes. IEEE Trans Inform Theory 39:78–83
Peluse S, Prendiville S. Quantitative bounds in the non-linear Roth theorem. Preprint, arXiv:1903.02592
Petersen K (1989) Ergodic theory. Cambridge studies in advanced mathematics, vol 2. Cambridge University Press, Cambridge
Poincaré H (1890) Sur le problème des trois corps et les équations de la dynamique. Acta Math 13:1–270
Polymath DHJ (2012) A new proof of the density Hales-Jewett theorem. Ann Math 175:1283–1327
Prendiville S (2017) Quantitative bounds in the polynomial Szemerédi theorem: the homogeneous case. Discrete Anal 5:1–34
Rosenblatt J, Wierdl M (1995) Pointwise ergodic theorems via harmonic analysis. In: Ergodic theory and its connections with harmonic analysis (Alexandria, 1993). London mathematical society lecture note series, vol 205. Cambridge University Press, Cambridge, pp 3–151
Sárközy A (1978) On difference sets of integers III. Acta Math Acad Sci Hungar 31:125–149
Shkredov I (2002) Recurrence in the mean. Mat Zametki 72(4):625–632; translation in Math Notes (2002) 72(3–4):576–582
Sklar L (2004) Philosophy of statistical mechanics. In: Zalta EN (ed) The Stanford encyclopedia of philosophy (Summer 2004 Edition). http://plato.stanford.edu/archives/sum2004/entries/statphys-statmech/
Sun W (2015) Multiple recurrence and convergence for certain averages along shifted primes. Ergodic Theory Dyn Syst 35(5):1592–1609
Szemerédi E (1975) On sets of integers containing no k elements in arithmetic progression. Acta Arith 27:299–345
Tao T, Teräväinen J. The structure of logarithmically averaged correlations of multiplicative functions, with applications to the Chowla and Elliott conjectures. To appear in Duke Math J. arXiv:1708.02610
Tao T, Teräväinen J. The structure of correlations of multiplicative functions at almost all scales, with applications to the Chowla and Elliott conjectures. Preprint, arXiv:1809.02518
Tao T, Ziegler T (2008) The primes contain arbitrarily long polynomial progressions. Acta Math 201:213–305
von Neumann J (1932) Proof of the Quasi-ergodic hypothesis. Proc Natl Acad Sci U S A 18(1):70–82
Walters P (1982) An introduction to ergodic theory. Graduate texts in mathematics, vol 79. Springer, New York/Berlin
Weiss B (2000) Single orbit dynamics. CBMS regional conference series in mathematics, vol 95. American Mathematical Society, Providence
Wooley T, Ziegler T (2012) Multiple recurrence and convergence along the primes. Am J Math 134:1705–1732
Wyner A, Ziv J (1989) Some asymptotic properties of the entropy of a stationary ergodic data source with applications to data compression. IEEE Trans Inform Theory 35:1250–1258
Zermelo E (1896) Über einen Satz der Dynamik und die mechanische Wärmetheorie. Ann Phys 57:485–494; English translation (1966) On a theorem of dynamics and the mechanical theory of heat. In: Brush SG (ed) Kinetic theory, vol II. Oxford, pp 208–217
Ziegler T (2006) Nilfactors of ℝ^m-actions and configurations in sets of positive upper density in ℝ^m. J Anal Math 99:249–266
Ziegler T (2007) Universal characteristic factors and Furstenberg averages. J Am Math Soc 20:53–97
Ergodic Theorems

Andrés del Junco
Department of Mathematics, University of Toronto, Toronto, ON, Canada

Article Outline

Glossary
Definition of the Subject
Introduction
Ergodic Theorems for Measure-Preserving Maps
Generalizations to Continuous Time and Higher-Dimensional Time
Pointwise Ergodic Theorems for Operators
Subadditive and Multiplicative Ergodic Theorems
Entropy and the Shannon–McMillan–Breiman Theorem
Amenable Groups
Subsequence and Weighted Theorems
Ergodic Theorems and Multiple Recurrence
Rates of Convergence
Ergodic Theorems for Non-amenable Groups
Future Directions
Bibliography

Glossary

Automorphism a dynamical system T : X → X, where X is a measure space and T is an invertible map preserving measure.
Dynamical system in its broadest sense, any set X, with a map T : X → X. The classical example is: X is a set whose points are the states of some physical system and the state x is succeeded by the state Tx after one unit of time.
Ergodic average if f is a function on X let $A_n f(x) = n^{-1}\sum_{i=0}^{n-1} f(T^i x)$; the average of the values of f over the first n points in the orbit of x.
Ergodic theorem an assertion that ergodic averages converge in some sense.
Iteration repeated applications of the map T above to arrive at the state of the system after n units of time.
Maximal inequality an inequality which allows one to bound the pointwise oscillation of a sequence of functions. An essential tool for proving pointwise ergodic theorems.
Mean ergodic theorem an assertion that ergodic averages converge with respect to some norm on a space of functions.
Operator any linear operator U on a vector space of functions on X, for example one arising from a dynamical system T by setting Uf(x) = f(Tx). More generally any linear transformation on a real or complex vector space.
Orbit of x the forward images x, Tx, T²x, . . . of x ∈ X under iteration of T. When T is invertible one may consider the forward, backward or two-sided orbit of x.
Pointwise ergodic theorem an assertion that ergodic averages $A_n f(x)$ converge for some or all x ∈ X, usually for a.e. x.
Positive contraction an operator T on a space of functions endowed with a norm ‖ · ‖ such that T maps positive functions to positive functions and ‖Tf‖ ≤ ‖f‖.
Stationary process a sequence (X_1, X_2, . . .) of random variables (real or complex-valued measurable functions) on a probability space whose joint distributions are invariant under shifting (X_1, X_2, . . .) to (X_2, X_3, . . .).
Uniform distribution a sequence {x_n} in [0, 1] is uniformly distributed if for each interval I ⊂ [0, 1], the time it spends in I is asymptotically proportional to the length of I.

Definition of the Subject

Ergodic theorems are assertions about the long-term statistical behavior of a dynamical system. The subject arose out of Boltzmann’s ergodic hypothesis which sought to equate the spatial average of a function over the set of states in a physical system having a fixed energy with the
© Springer-Verlag 2009 79
C. E. Silva, A. I. Danilenko (eds.), Ergodic Theory,
https://doi.org/10.1007/978-1-0716-2388-6_176
Originally published in
R. A. Meyers (ed.), Encyclopedia of Complexity and Systems Science, © Springer-Verlag 2009
https://doi.org/10.1007/978-3-642-27737-5_176
time average of the function observed by starting with a particular state and following its evolution over a long time period.

Introduction

Suppose that (X, ℬ, μ) is a measure space and T : (X, ℬ, μ) → (X, ℬ, μ) is a measurable and measure-preserving transformation, that is $\mu(T^{-1}E) = \mu(E)$ for all E ∈ ℬ. One important motivation for studying such maps is that a Hamiltonian physical system (see the article by Petersen in this collection) gives rise to a one-parameter group $\{T_t : t \in \mathbb{R}\}$ of maps in the phase space of the system which preserve Lebesgue measure. The ergodic theorem of Birkhoff asserts that for $f \in L^1(\mu)$

$$\frac{1}{n}\sum_{i=0}^{n-1} f(T^i x) \qquad (1)$$

converges a.e. and that if T is ergodic (to be defined shortly) then the limit is $\int f \, d\mu$. This may be viewed as a justification for Boltzmann’s ergodic hypothesis that “space averages equal time averages”. See Zund (2002) for some history of the ergodic hypothesis. For physicists, then, the problem is reduced to showing that a given physical system is ergodic, which can be very difficult. However ergodic systems arise in many natural ways in mathematics. One example is rotation z ↦ λz of the unit circle if λ is a complex number of modulus one which is not a root of unity. Another is the shift transformation on a sequence of i.i.d. random variables, for example a coin-tossing sequence. Another is an automorphism of a compact Abelian group. Often a transformation possesses an invariant measure which is not obvious at first sight. Knowledge of such a measure can be a very useful tool. See Petersen’s article for more examples.

If (X, ℬ, μ) is a probability space and T is ergodic then Birkhoff’s ergodic theorem implies that if A is a measurable subset of X then for almost every x, the frequency with which x visits A is asymptotically equal to μ(A), a very satisfying justification of intuition. For example, applying this to the coin-tossing sequence one obtains the strong law of large numbers which asserts that almost every infinite sequence of coin tosses has tails occurring with asymptotic frequency 1/2. One also obtains Borel’s theorem on normal numbers which asserts that for almost all x ∈ [0, 1] each digit 0, 1, 2, . . ., 9 occurs with limiting frequency 1/10. The so-called continued fraction transformation $x \mapsto x^{-1} \bmod 1$ on (0, 1) has a finite invariant measure $\frac{dx}{1+x}$. (Throughout this article x mod 1 denotes the fractional part of x.) Applying Birkhoff’s theorem then gives precise information about the frequency of occurrence of any n ∈ ℕ in the continued fraction expansion of x, for a.e. x. See for example Billingsley (Billingsley 1965).

These are the classical roots of the subject of ergodic theorems. The subject has evolved from these simple origins into a vast field in its own right, quite independent of physics or probability theory. Nonetheless it still has close ties to both these areas and has also forged new links with many other areas of mathematics.

Our purpose here is to give a broad overview of the subject in a historical perspective. There are several excellent references, notably the books of Krengel (1985) and Tempelman (1992) which give a good picture of the state of the subject at the time they appeared. There has been tremendous progress since then. The time is ripe for a much more comprehensive survey of the field than is possible here.

Many topics are necessarily absent and many are only glimpsed. For example this article will not touch on random ergodic theorems. See the articles (Durand and Schneider 2003; Lemańczyk et al. n.d.) for some references on this topic.

I thank Mustafa Akcoglu, Ulrich Krengel, Michael Lin, Dan Rudolph and particularly Joe Rosenblatt and Vitaly Bergelson for many helpful comments and suggestions. I would like to dedicate this article to Mustafa Akcoglu who has been such an important contributor to the development of ergodic theorems over the past 40 years. He has
also played a vital role in my mathematical development as well as of many other mathematicians. He remains a source of inspiration to me, and a valued friend.

Ergodic Theorems for Measure-Preserving Maps

The first and most fundamental result about endomorphisms is the celebrated recurrence theorem of Poincaré (1987).

Theorem 1 Suppose μ is finite, A ∈ ℬ and μ(A) > 0. Then for a.e. x ∈ A there is an n > 0 such that $T^n x \in A$, in fact there are infinitely many such n.
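Theorem 1 can be observed numerically on a simple measure-preserving map. The following is a minimal sketch under stated assumptions, not part of the original article: it uses the rotation x ↦ x + α mod 1 on [0, 1) (the additive form of the circle rotation mentioned in the Introduction), with an illustrative irrational angle, an illustrative set A = [0, 0.1), and a hypothetical helper `visit_times`. It records the times at which the orbit of a point of A comes back to A, and the fraction of time spent in A, which Birkhoff's theorem predicts should approach the Lebesgue measure of A.

```python
import math

def visit_times(x0, alpha, lo, hi, n_steps):
    """Times n in 1..n_steps with T^n(x0) in [lo, hi), where T(x) = x + alpha mod 1."""
    times = []
    x = x0
    for n in range(1, n_steps + 1):
        x = (x + alpha) % 1.0  # one application of the rotation T
        if lo <= x < hi:
            times.append(n)
    return times

alpha = (math.sqrt(5) - 1) / 2  # an irrational rotation angle (illustrative choice)
lo, hi = 0.0, 0.1               # the set A = [0, 0.1), of Lebesgue measure 0.1
n_steps = 10_000
times = visit_times(0.05, alpha, lo, hi, n_steps)  # start at a point of A

first_return = times[0]            # first n > 0 with T^n x in A
frequency = len(times) / n_steps   # fraction of the first n_steps iterates lying in A
print(first_return, frequency)
```

A finite simulation of one orbit is only an illustration: the theorem is a statement about almost every starting point and about infinitely many returns, neither of which a finite run can verify.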
Von Neumann’s theorem is usually quoted as above but to be historically accurate he dealt with unitary operators indexed by a continuous time parameter.

Inspired by von Neumann’s theorem Birkhoff very soon proved his pointwise ergodic theorem (Birkhoff 1931). In spite of the later publication by von Neumann his result did come first. See (Bergelson 2007; Zund 2002) for an interesting discussion of the history of the two theorems and of the interaction between Birkhoff and von Neumann.

Theorem 4 Suppose (X, ℬ, μ) is a measure space and T is an endomorphism of (X, ℬ, μ). Then for any $f \in L^1 = L^1(\mu)$ there is a T-invariant function $g \in L^1$ such that

$$A_n f(x) = \frac{1}{n}\sum_{i=0}^{n-1} f(T^i x) \to g(x) \quad \text{a.e.} \qquad (4)$$

Moreover if μ is finite then the convergence also holds with respect to the $L^1$ norm and one has $\int_E g \, d\mu = \int_E f \, d\mu$ for all T-invariant subsets E.

Again this formulation of Birkhoff’s theorem is not historically accurate as he dealt with a smooth flow on a manifold. It was soon observed that the theorem, and its proof, remain valid for an abstract automorphism of a measure space although the realization that T need not be invertible seems to have taken a little longer.

The notation $A_n f = A_n(T)f$ as above will occur often in the sequel. Whenever T is an endomorphism one uses the notation Tf = f ∘ T and with this notation $A_n(T) = \frac{1}{n}\sum_{i=0}^{n-1} T^i$. When the scalars (ℝ or ℂ) for an $L^1$ space are not specified the notation should be understood as referring to either possibility. In most of the theorems in this article the complex case follows easily from the real and any indications about proofs will refer to the real case.

Although von Neumann originally used spectral theory to prove his result, there is a quick proof, attributed to Riesz by Hopf in his 1937 book (Hopf 1937), which uses only elementary properties of Hilbert space. Let I denote the (closed) subspace of U-invariant vectors and $I_0$ the (usually not closed) subspace of vectors of the form f − Uf. It is easy to check that any vector orthogonal to $I_0$ must be in I, whence the subspace $I + I_0$ is dense in ℋ. For any vector of the form $x = y + y_0$, y ∈ I, $y_0 \in I_0$ it is clear that $A_n x = \frac{1}{n}\sum_{i=0}^{n-1} U^i x$ converges to y = Px, since $A_n y = y$ and if $y_0 = z - Uz$ then the telescoping sum $A_n y_0 = n^{-1}(z - U^n z)$ converges to 0. This establishes the desired convergence for $x \in I + I_0$ and it is easy to extend it to the closure of $I + I_0$ since the operators $A_n$ are contractions of ℋ ($\|A_n\| \leq 1$). Lorch (1939) used a soft argument in a similar spirit to extend von Neumann’s theorem from the case of a unitary operator on a Hilbert space to that of an arbitrary linear contraction on any reflexive Banach space. Sine (1970) gave a necessary and sufficient condition for the strong convergence of the ergodic averages of a contraction on an arbitrary Banach space.

Birkhoff’s theorem has the distinction of being one of the most reproved theorems of twentieth century mathematics. One approach to the pointwise convergence, parallel to the argument just seen, is to find a dense subspace E of $L^1$ so that the convergence holds for all f ∈ E and then try to extend the convergence to all $f \in L^1$ by an approximation argument. The first step is not too hard. For simplicity assume that μ is a finite measure. As in the proof of von Neumann’s theorem the subspace E spanned by the T-invariant $L^1$ functions together with functions of the form g − Tg, $g \in L^1$, is dense in $L^1(\mu)$. This can be seen by using the Hahn–Banach theorem and the duality of $L^1$ and $L^\infty$. (Here one needs finiteness of μ to know that $L^\infty \subset L^1$.) The pointwise convergence of $A_n f$ for invariant f is trivial and for f = g − Tg it follows from telescoping of the sum and the fact that $n^{-1}T^n g \to 0$ a.e. This last can be shown by using the Borel–Cantelli lemma. The second step, extending pointwise convergence, as opposed to norm convergence, for f in a dense subspace to all f in $L^1$ is a delicate matter, requiring a maximal inequality.

Roughly speaking a maximal inequality is an inequality which bounds the pointwise oscillation of $A_n f$ in terms of the norm of f. The now standard
maximal inequality in the context of Birkhoff's theorem is the following, due to Kakutani and Yosida (1939). Birkhoff's proof of his theorem includes a weaker version of this result. Let $S_n f = nA_n f$, the $n$th partial sum of the iterates of $f$.

Theorem 5 Given any real $f \in L^1$ let $A = \bigcup_{n \ge 1}\{S_n f \ge 0\}$. Then

$$\int_A f\,dm \ge 0. \qquad (5)$$

Moreover if one sets $Mf = \sup_{n \ge 1} A_n f$ then for any $\alpha > 0$

$$m\{Mf > \alpha\} \le \frac{1}{\alpha}\|f\|_1. \qquad (6)$$

A distributional inequality such as (6) will be referred to as a weak $L^1$ inequality. Note that (6) follows easily from (5) by applying (5) to $f - \alpha$, at least in the case when $m$ is finite. With the maximal inequality in hand it is straightforward to complete the proof of Birkhoff's theorem. For a real-valued function $f$ let

$$\operatorname{Osc} f = \limsup A_n f - \liminf A_n f. \qquad (7)$$

$\operatorname{Osc} f = 0$ a.e. if and only if $A_n f$ converges a.e. (to a possibly infinite limit). One has $\limsup A_n f \le Mf \le M|f|$ and by symmetry $\liminf A_n f \ge -M|f|$, so $\operatorname{Osc} f \le 2M|f|$. To establish the convergence of $A_n f$ for a real-valued $f \in L^1$ let $\epsilon > 0$ and write $f = g + h$ with $g \in E$ (the subspace where convergence has already been established), $h \in L^1$ and $\|h\|_1 < \epsilon$. Then since $\operatorname{Osc} g = 0$ one has $\operatorname{Osc} f = \operatorname{Osc} h$. Thus for any fixed $\alpha > 0$, using (6)

$$m\{\operatorname{Osc} f > \alpha\} = m\{\operatorname{Osc} h > \alpha\} \le m\left\{M|h| > \frac{\alpha}{2}\right\} \le \frac{2\|h\|_1}{\alpha} < \frac{2\epsilon}{\alpha}. \qquad (8)$$

Since $\epsilon > 0$ was arbitrary one concludes that $m\{\operatorname{Osc} f > \alpha\} = 0$ and since $\alpha > 0$ is arbitrary it follows that $m\{\operatorname{Osc} f > 0\} = 0$, establishing the a.e. convergence. Moreover a simple application of Fatou's lemma shows that the limiting function is in $L^1$, hence finite a.e.

There are many proofs of (5). Two of particular interest are Garsia's (1970), perhaps the shortest and most mysterious, and the proof via the filling scheme of Chacón and Ornstein (1960), perhaps the most intuitive, which goes like this. Given a function $g \in L^1$ write $g^+ = \max(g, 0)$, $g^- = g^+ - g$ and let $Ug = Tg^+ - g^-$. Interpretation: the region between the graph of $g^+$ and the $X$-axis is a sheaf of vertical spaghetti sticks, the intervals $[0, g^+(x)]$, $x$ in $X$, and $g^-$ is a hole. Now move the spaghetti (horizontally) by $T$ and then let it drop (vertically) into the hole leaving a new hole and a new sheaf which are the negative and positive parts of $Ug$. Now let $E' = \bigcup_{n \ge 1}\{U^n f \ge 0\}$, the set of points $x$ at which the hole is eventually filled after finitely many iterations of $U$.

The key point is that $E = E'$ (in this argument $E$ denotes the set called $A$ in Theorem 5). Indeed if $S_n f(x) \ge 0$ for some $n$, then the total linear height of sticks over $x, Tx, \ldots, T^{n-1}x$ is greater than the total linear depth of holes at these points. The only way that spaghetti can escape from these points is by first filling the hole at $x$, which shows $x \in E'$. Similar thinking shows that if $x \in E'$ and the hole at $x$ is filled for the first time at time $n$ then $S_n f(x) \ge 0$, so $x \in E$, and that all the spaghetti that goes into the hole at $x$ comes from points $T^i x$ which belong to $E'$. This shows that $E = E'$ and that the part of the hole lying beneath $E$ is eventually filled by spaghetti coming from $E' = E$. Thus the amount of spaghetti over $E$ is no less than the size of the hole under $E$, that is $\int_E f\,dm \ge 0$.

Most proofs of Birkhoff's theorem use a maximal inequality in some form but a few avoid it altogether, for example (Katok and Hasselblatt 1995; Shields 1987). It is also straightforward to deduce Birkhoff's theorem directly from a maximal inequality, as Birkhoff does, without first establishing convergence on a dense subspace. However the technique of proving a pointwise convergence theorem by finding an appropriate dense subspace and a suitable maximal inequality has proved extremely useful, not only in ergodic theory.
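As a concrete check of (5) and (6), here is a minimal sketch (an illustration only, not part of the original argument). It assumes the simplest possible measure-preserving system: the cyclic shift $x \mapsto x + 1 \pmod N$ on $N$ points with the uniform probability measure, for which both inequalities must hold exactly. The name `maximal_data`, the value $N = 200$ and the random test function are choices made for this example.

```python
import random

def maximal_data(f):
    """Cyclic shift T(x) = x + 1 (mod N) with the uniform probability measure.

    Returns (in_A, Mf): in_A[x] is True when S_n f(x) >= 0 for some 1 <= n <= N,
    and Mf[x] is the maximum of the averages A_n f(x) over 1 <= n <= N.
    Because the system has period N, larger n contribute nothing new.
    """
    N = len(f)
    in_A, Mf = [], []
    for x in range(N):
        s, best, hit = 0.0, float("-inf"), False
        for n in range(1, N + 1):
            s += f[(x + n - 1) % N]   # S_n f(x) = f(x) + f(Tx) + ... + f(T^{n-1} x)
            hit = hit or s >= 0
            best = max(best, s / n)   # A_n f(x) = S_n f(x) / n
        in_A.append(hit)
        Mf.append(best)
    return in_A, Mf

random.seed(0)
N = 200
f = [random.uniform(-1.0, 1.0) for _ in range(N)]
in_A, Mf = maximal_data(f)

# Inequality (5): the integral of f over the set A is non-negative.
integral_over_A = sum(f[x] for x in range(N) if in_A[x]) / N

# Inequality (6): m{Mf > alpha} <= ||f||_1 / alpha for each alpha > 0.
l1_norm = sum(abs(v) for v in f) / N
weak_type = {a: (sum(1 for v in Mf if v > a) / N, l1_norm / a)
             for a in (0.1, 0.25, 0.5)}

print("integral of f over A:", integral_over_A)
for a, (measure, bound) in weak_type.items():
    print(f"alpha={a}: m{{Mf > alpha}} = {measure:.3f} <= {bound:.3f}")
```

On a finite system the integrals are plain averages, so the comparison involves no sampling error; only floating-point rounding separates the computed values from the exact ones.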
Indeed, in some sense maximal inequalities are unavoidable: this is the content of the following principle proved already in 1926 by Banach. The principle has many formulations; the following one is a slight simplification of the one to be found in (Krengel 1985). Suppose $B$ is a Banach space, $(X, \mathcal{B}, m)$ is a finite measure space and let $E$ denote the space of $m$-equivalence classes of measurable real-valued functions on $X$. A linear map $T : B \to E$ is said to be continuous in measure if for each $\epsilon > 0$

$$\|x_n - x\| \to 0 \implies m\{|Tx_n - Tx| > \epsilon\} \to 0.$$

Suppose that $T_n$ is a sequence of linear maps from $B$ to $E$ which are continuous in measure and let $Mx = \sup_n |T_n x|$. Of course if $T_n x$ converges a.e. to a finite limit then $Mx < \infty$ a.e.

Theorem 6 (Banach Principle) Suppose $Mx < \infty$ a.e. for each $x \in B$. Then there is a function $C(\lambda)$ such that $C(\lambda) \to 0$ as $\lambda \to \infty$ such that for all $x \in B$ and $\lambda > 0$ one has

There is a special setting where one has uniform convergence in the ergodic theorem. Suppose $T$ is a homeomorphism of a compact metric space $X$. By a theorem of Krylov and Bogoliouboff (1937) there is at least one probability measure on the Borel $\sigma$-algebra of $X$ which is invariant under $T$. $T$ is said to be uniquely ergodic if there is only one Borel probability measure, say $m$, invariant under $T$. It is easy to see that when this is the case then $T$ is an ergodic automorphism of $(X, \mathcal{B}, m)$. As an example, if $\alpha$ is an irrational number then the rotation $z \mapsto e^{2\pi i\alpha}z$ is a uniquely ergodic transformation of the circle $\{|z| = 1\}$. Equivalently $x \mapsto x + \alpha \bmod 1$ is a uniquely ergodic map on $[0, 1]$. A quick way to see this is to show that the Fourier coefficients $\hat{m}(n)$ of any invariant probability $m$ are zero for $n \ne 0$. The Jewett–Krieger theorem (see Jewett (1969) and Krieger (1972)) guarantees that unique ergodicity is ubiquitous in the sense that any ergodic automorphism of a probability space is measure-theoretically isomorphic to a uniquely ergodic homeomorphism. The following important result is due to Oxtoby (1952).
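The uniform convergence available for the irrational rotation can be seen numerically. The sketch below is an illustration under stated assumptions: the rotation number $\alpha = \sqrt{2} - 1$, the test function $f(x) = \cos 2\pi x$ and the grid of starting points are all choices made here, not taken from the text. For this particular $f$, summing a geometric series gives $|A_n f(x)| \le 1/(n\,|\sin \pi\alpha|)$ for every $x$, so the averages tend to $\int f\,dm = 0$ at a rate that does not depend on the starting point.

```python
import math

def ergodic_average(f, x, alpha, n):
    """A_n f(x) for the rotation x -> x + alpha (mod 1)."""
    return sum(f((x + k * alpha) % 1.0) for k in range(n)) / n

alpha = math.sqrt(2) - 1                      # irrational rotation number (illustrative)
f = lambda x: math.cos(2 * math.pi * x)       # continuous, with integral 0 over [0, 1]
starts = [j / 50 for j in range(50)]          # sample of starting points

def sup_deviation(n):
    """Largest |A_n f(x) - 0| over the sampled starting points."""
    return max(abs(ergodic_average(f, x, alpha, n)) for x in starts)

def bound(n):
    """Uniform bound obtained by summing the geometric series."""
    return 1.0 / (n * abs(math.sin(math.pi * alpha)))

deviations = {n: sup_deviation(n) for n in (10, 100, 1000)}
for n, dev in deviations.items():
    print(f"n={n}: sup |A_n f| = {dev:.5f}, bound = {bound(n):.5f}")
```

The same experiment with a non-uniquely-ergodic map would generally show deviations that depend on the starting point, which is why the example is restricted to the rotation.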
Generalizations to Continuous Time and Higher-Dimensional Time

A (measure-preserving) flow is a one-parameter group $\{T_t, t \in \mathbb{R}\}$ of automorphisms of $(X, \mathcal{B}, m)$, that is $T_{t+s} = T_t T_s$, such that $T_t x$ is measurable as a function of $(t, x)$. It will always be implicitly assumed that the map $(t, x) \mapsto T_t x$ from $\mathbb{R} \times X$ to $X$ is measurable. Theorem 4 generalizes to flows by replacing sums with integrals and this generalization follows without difficulty from Theorem 4. (As already observed this observation reverses the historical record.) Theorem 4 may be viewed as a theorem about the “discrete

One may then ask whether $A_E f$ converges to a limit, either in the mean or pointwise, as $E$ varies through some sequence of sets which “grow large” or, in case $G = \mathbb{R}^k$, “shrink to 0”. The second case is referred to as a local ergodic theorem. In the case of ergodic theorems at infinity the continuous and discrete theories are rather similar and often the continuous analogue of a discrete result can be deduced from the discrete result.

Wiener (1939) proved the following result for actions of $G = \mathbb{R}^k$ and ergodic averages over Euclidean balls $B_r = \{x \in \mathbb{R}^k : \|x\|_2 \le r\}$.

Theorem 12 Suppose $T$ is an action of $\mathbb{R}^k$ on $(X, \mathcal{B}, m)$ and $f \in L^1(m)$.

(a) For $f \in L^1$, $\lim_{r \to \infty} A_{B_r} f = g$ exists a.e. If $m$ is finite the convergence also holds with respect to the $L^1$-norm, $g$ is $T$-invariant and $\int_I g\,dm = \int_I f\,dm$ for every $T$-invariant set $I$.
(b) $\lim_{r \to 0} A_{B_r} f = f$ a.e.

where $C$ is a constant depending only on the dimension $d$. (In fact one may take $C = 3^d$.) In the case when $T$ is the action of $\mathbb{R}^k$ on itself by translation (13) is the well-known maximal inequality for the Hardy–Littlewood maximal function (Krantz and Parks 1999, Lemma 3.5.3). Wiener proves (13) by way of the following covering lemma. If $B$ is a ball in $\mathbb{R}^k$ let $B'$ denote the concentric ball with three times the radius.
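To make the constant $C = 3^d$ concrete, the following sketch checks a weak-type maximal inequality with $C = 3$ in the simplest discrete analogue: $\mathbb{Z}$ acting on itself by translation with counting measure, so $d = 1$. This discrete setting, the centered "balls" $\{n - r, \ldots, n + r\}$, and the helper name `centered_maximal` are assumptions made for the illustration; the text itself treats $\mathbb{R}^k$. The tripling argument carries over because a tripled discrete interval has $6r + 1 \le 3(2r + 1)$ points.

```python
import random

def centered_maximal(g, sites, max_radius):
    """Centered maximal function on Z for a non-negative g supported on 0..len(g)-1.

    Mg(n) = max over 0 <= r <= max_radius of the average of g over {n-r, ..., n+r}.
    """
    L = len(g)
    prefix = [0.0]
    for v in g:
        prefix.append(prefix[-1] + v)

    def window_sum(a, b):
        # Sum of g over the integers a..b inclusive (g vanishes outside 0..L-1).
        a, b = max(a, 0), min(b, L - 1)
        return prefix[b + 1] - prefix[a] if a <= b else 0.0

    return {n: max(window_sum(n - r, n + r) / (2 * r + 1)
                   for r in range(max_radius + 1))
            for n in sites}

random.seed(2)
L = 100
# Sparse non-negative function: mostly zeros with a few random spikes.
g = [random.uniform(0.0, 5.0) if random.random() < 0.1 else 0.0 for _ in range(L)]
l1_norm = sum(g)                                  # ||g||_1 for counting measure

Mg = centered_maximal(g, sites=range(-L, 2 * L), max_radius=2 * L)
checks = {lam: (sum(1 for v in Mg.values() if v > lam), 3 * l1_norm / lam)
          for lam in (0.05, 0.2, 1.0)}
for lam, (count, bound) in checks.items():
    print(f"lambda={lam}: #{{Mg > lambda}} = {count} <= {bound:.1f}")
```

Only sites in a finite window are counted, which can only make the left-hand side smaller, so the comparison remains a valid (if slightly generous) check.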
transform (actually for the $n$-dimensional Hilbert transform).

Theorem 17 Suppose $\{T_t\}$ is a measure-preserving flow on a probability space $(X, \mathcal{B}, m)$ and $f \in L^1$. Then

exists for a.a. $x$.

Motivated by Cotlar's result Calderón (1968) proved a general transfer principle which allows one to transfer maximal inequalities and convergence theorems for functions of a real or integer variable to the ergodic setting. Although stated for functions on $\mathbb{R}$ it applies equally well to $\mathbb{R}^k$ or $\mathbb{Z}^k$. This principle subsumes Birkhoff's theorem, Wiener's theorem and Cotlar's result. For simplicity only a special case of Calderón's result, in the discrete case, will be stated here.

Let $T$ be an automorphism of a probability space $(X, \mathcal{B}, m)$, let $\bar{m}$ denote counting measure on $\mathbb{Z}$ and suppose $\sigma$ is a probability measure on $\mathbb{Z}$. For $g \in L^1(\bar{m})$ define $\sigma(g)(n) = \sum_{i \in \mathbb{Z}} \sigma(i)g(n + i)$. For $f \in L^1(m)$ let $\sigma(T)f = \sum_{i \in \mathbb{Z}} \sigma(i)T^i f$, which is easily seen to converge a.e. and in $L^1$-norm. Given a fixed sequence $\sigma_n$ of probabilities define $Mg = \sup_n \sigma_n(g)$ and $M(T)f = \sup_n \sigma_n(T)f$.

Theorem 18 (Calderón) Suppose there is a constant $C$ such that

$$\bar{m}\{Mg > \lambda\} < \frac{C}{\lambda}\|g\|_1 \quad \text{for all } g \in L^1(\bar{m}). \qquad (19)$$

Then one also has

$$m\{M(T)f > \lambda\} < \frac{C}{\lambda}\|f\|_1 \quad \text{for all } T \text{ and } f \in L^1(m). \qquad (20)$$

In other words, in order to prove a maximal inequality for general $T$ it suffices to prove it in case $T$ is the shift map on the integers. The idea of transference could already be seen in Wiener's proof of the $\mathbb{R}^n$ ergodic theorem. Transfer principles in various forms have become an important tool in the study of ergodic theorems. See Bellow (1999) for a very readable overview.

Early in the history of ergodic theorems there were attempts to generalize the ergodic theorem to more general linear operators on $L^p$ spaces, that is, operators which do not arise by composition with a mapping of $X$. In the case $p = 1$ the main motivation for this comes from the theory of Markov processes.

If $(X, \mathcal{B})$ is a measurable space a sub-stochastic kernel on $X$ is a non-negative function $P$ on $X \times \mathcal{B}$ such that

(a) for each $x \in X$, $P(x, \cdot) = P_x$ is a measure on $\mathcal{B}$ such that $P_x(X) \le 1$ and,
(b) $P(\cdot, A)$ is a measurable function for each $A \in \mathcal{B}$.

It is most intuitive to think about the stochastic case, namely when each $P_x$ is a probability measure. One then views $P(x, A)$ as the probability that the point $x$ moves into the set $A$ in one unit of time, so one has stochastic dynamics as opposed to deterministic dynamics, namely the case when $P_x = \delta_{Tx}$ for a map $T$. In this case the measures $P_x$ are called transition probabilities.

If $m$ is a $\sigma$-finite measure on $X$ one may define the measure $Pm = \int P_x\,dm(x)$. $P\nu$ is also meaningful if $\nu$ is a finite signed measure. The case when $Pm = m$ is the stochastic analogue of measure-preserving dynamics and the case when $Pm \ll m$ is the analogue of non-singular dynamics. It is easy to see that given any $\sigma$-finite measure $\lambda$ there is always a $m$ such that $\lambda \ll m$ and $Pm \ll m$. Let $\mathcal{L}_1$ denote the space of finite signed measures $\nu$ such that $\nu \ll m$, which is identified with $L^1 = L^1(m, \mathbb{R})$ via the Radon–Nikodym theorem. If $Pm \ll m$ then $P$ maps $\mathcal{L}_1(m)$ into itself so the
In 1963 Chacón (1963a) proved a very general theorem for non-positive operators which includes the Chacón–Ornstein theorem as well as the Dunford–Schwartz theorem.

Theorem 22 Suppose $T$ is a contraction of $L^1$ and $p_n \ge 0$ is a sequence of measurable functions with the property that

$$g \in L^1,\ |g| \le p_n \implies |Tg| \le p_{n+1}. \qquad (23)$$

Then

$$\frac{\sum_{i=0}^{n} T^i f}{\sum_{i=0}^{n} p_i} \qquad (24)$$

converges a.e. to a finite limit on the set $\left\{\sum_{i=0}^{\infty} p_i > 0\right\}$.

If $T$ is an $L^1$, $L^\infty$-contraction and $p_n = 1$ for all $n$ then the hypotheses of this theorem are satisfied, so Theorem 22 reduces to the result of Dunford and Schwartz. See (Chacon 1963b) for a concise overview of all of the above theorems in this section and the relations between them.

The identification of the limit in the Chacón–Ornstein theorem on the conservative part $C$ of $X$ is a difficult problem. It was solved by Neveu (1961) in case $C = X$, and in general by Chacón (1962). Chacón (1964) has shown that there is a non-singular automorphism $\tau$ of $(X, \mathcal{B}, m)$ such that for the associated invertible isometry $T$ of $L^1(m)$ given by (11) there is an $f \in L^1(m)$ such that $\limsup A_n f = \infty$ and $\liminf A_n f = -\infty$.

In 1975 Akcoglu (1975) solved a major open problem when he proved the following celebrated theorem.

Theorem 23 Suppose $T : L^p \to L^p$ is a positive contraction. Then $A_n f = \frac{1}{n}\sum_{i=0}^{n-1} T^i f$ converges a.e. Moreover one has the strong $L^p$ inequality

form to (10) the classical strong $L^p$ inequality for automorphisms. (25) was proved by A. Ionesco–Tulcea (now Bellow) (Tulcea 1964) in the case of positive invertible isometries of $L^p$. It is a result of Banach (1993), see also (Lamperti 1958), that in this case $T$ arises from a non-singular automorphism $\tau$ of $(X, \mathcal{B}, m)$ in the form $Tf = \rho^{1/p} f \circ \tau$. By a series of reductions Bellow was able to show that in this case (25) can be deduced from (10).

Akcoglu's brilliant idea was to consider a dilation $S$ of $T$ which is a positive invertible isometry on a larger $L^p$ space $\tilde{L}^p = L^p(Y, \mathcal{C}, \nu)$. What this means is that there is a positive isometric injection $D : L^p \to \tilde{L}^p$ and a positive projection $P$ on $\tilde{L}^p$ whose range is $D(L^p)$ such that $DT^n = PS^nD$ for all $n \ge 0$. Given the existence of such an $S$ it is not hard to deduce (25) for $T$ from (25) for $S$. In fact Akcoglu constructs a dilation only in the case when $L^p$ is finite dimensional and shows how to reduce the proof of (25) to this case. In the finite dimensional case the construction is very concrete and $P$ is a conditional expectation operator. Later Akcoglu and Kopp (1977) gave a construction in the general case. It is noteworthy that the proof of Akcoglu's theorem consists ultimately of a long string of reductions to the classical strong $L^p$ inequality (10), which in turn is a consequence of (5).

Subadditive and Multiplicative Ergodic Theorems

Consider a family $\{X_{n,m}\}$ of real-valued random variables on a probability space indexed by the set of pairs $(n, m) \in \mathbb{Z}^2$ such that $0 \le n < m$. $\{X_{n,m}\}$ is called a (stationary) subadditive process if

(a) the joint distribution of $\{X_{n,m}\}$ is the same as that of $\{X_{n+1,m+1}\}$,
(b) $X_{n,m} \le X_{n,l} + X_{l,m}$ whenever $n < l < m$.
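One family satisfying condition (b) comes from norms of matrix products, since any submultiplicative norm gives $\log\|BA\| \le \log\|B\| + \log\|A\|$. The sketch below is a hedged illustration: the i.i.d. random $2 \times 2$ matrices (which make condition (a) hold), the max-row-sum norm, and the names `mats`, `matmul`, `norm` and `X` are all choices made for this example.

```python
import math
import random

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def norm(a):
    """Maximum absolute row sum, a submultiplicative matrix norm."""
    return max(sum(abs(v) for v in row) for row in a)

random.seed(1)
n_steps = 60
# i.i.d. positive matrices M_0, M_1, ...; independence gives stationarity (a).
mats = [[[random.uniform(0.1, 2.0) for _ in range(2)] for _ in range(2)]
        for _ in range(n_steps)]

def X(n, m):
    """X_{n,m} = log || M_{m-1} ... M_{n+1} M_n || for 0 <= n < m."""
    prod = mats[n]
    for k in range(n + 1, m):
        prod = matmul(mats[k], prod)
    return math.log(norm(prod))

# Largest violation of X_{n,m} <= X_{n,l} + X_{l,m}; it should not be positive
# beyond rounding error.
worst = max(X(n, m) - X(n, l) - X(l, m)
            for n in range(0, 15)
            for l in range(n + 1, 16)
            for m in range(l + 1, 17))

growth = {n: X(0, n) / n for n in (10, 30, 60)}
print("largest violation of subadditivity:", worst)
print("X_{0,n} / n:", growth)
```

The printed values of $X_{0,n}/n$ give a rough feel for how such normalized quantities settle down along one sample path; the run is far too short to be read as evidence of a limit.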
underlying endomorphism $T$ such that $X_{n,m} \circ T = X_{n+1,m+1}$. In 1968 Kingman (1968) proved the following generalization of Birkhoff's theorem. $\gamma = \inf \frac{1}{n}\int X_{0,n}\,dm$ is called the time constant of the process.

Theorem 24 If the $X_{n,m}$ are integrable and $\gamma > -\infty$ then $\frac{1}{n}X_{0,n}$ converges a.e. and in $L^1$-norm to a $T$-invariant limit $X \in L^1(m)$ satisfying $\int X\,dm = \gamma$.

It is easy to deduce from the above that if one assumes only that $X^+_{0,1}$ is integrable then $\frac{1}{n}X_{0,n}$ still converges a.e. to a $T$-invariant limit $X$ taking values in $[-\infty, \infty)$.

Subadditive processes first arose in the work of Hammersley and Welsh (1965) on percolation theory. Here is an example. Let $G$ be the graph with vertex set $\mathbb{Z}^2$ and with edges joining every pair of nearest neighbors. Let $E$ denote the edge set and let $\{T_e : e \in E\}$ be non-negative integrable i.i.d. random variables. To each finite path $P$ in $G$ associate the “travel time” $T(P) = \sum_{e \in P} T_e$. For integers $m > n \ge 0$ let $X_{n,m}$ be the infimum of $T(P)$ over all paths $P$ joining $(0, n)$ to $(0, m)$. This is a subadditive process with $0 \le \gamma < \int T_e\,dm$ and it is not hard to see that the underlying endomorphism is ergodic. Thus Kingman's theorem yields the result that $\frac{1}{n}X_{0,n} \to \gamma$ a.e.

Suppose now that $T$ is an ergodic automorphism of a probability space $(X, \mathcal{B}, m)$ and $P$ is a function on $X$ taking values in the space of $d \times d$ real matrices. Define $P_{n,m} = P(T^{m-1}x)P(T^{m-2}x) \cdots P(T^n x)$ and let $P_{n,m}(i, j)$ denote the $i, j$ entry of $P_{n,m}$. Then $X_{n,m} = \log(\|P_{n,m}\|)$ (use any matrix norm) is a subadditive process so one obtains the first part of the following result of Furstenberg and Kesten (1960) originally proved by more elaborate methods. The second part can also be deduced from the subadditive theorem with a little more work. See Kingman (1976) for details and for some other applications of subadditive processes.

Theorem 25 (a) Suppose $\int \log^+(\|P\|)\,dm < \infty$. Then $\|P_{0,n}\|^{1/n}$ converges a.e. to a finite limit.

(b) Suppose that for each $i, j$, $P(i, j)$ is a strictly positive function such that $\log P(i, j)$ is integrable. Then the limit $p = \lim (P_{0,n}(i, j))^{1/n}$ exists a.e. and is independent of $i$ and $j$.

Partial results generalizing Kingman's theorem to the multiparameter case were obtained by Smythe (1976) and Nguyen (1979). In 1981 Akcoglu and Krengel (1981) obtained a definitive multi-parameter subadditive theorem. They consider an action $\{T_m\}$ of the semigroup $G = \mathbb{Z}^d_{\ge 0}$ by endomorphisms of a measure space $(X, \mathcal{B}, m)$. Using the standard total ordering $\prec$ of $G$ an interval in $G$ is any set of the form $\{k \in G : m \prec k \prec n\}$ for any $m \prec n \in G$. Let $\mathcal{I}$ denote the set of non-empty intervals. Reversing the direction of the inequality, they define a superadditive process as a collection of integrable functions $F_I$, $I \in \mathcal{I}$, such that

(a) $F_I \circ T_m = F_{I+m}$,
(b) $F_I \ge F_{I_1} + \ldots + F_{I_k}$ whenever $I$ is the disjoint union of $I_1, \ldots, I_k$ and
(c) $\gamma = \sup_{I \in \mathcal{I}} |I|^{-1}\int F_I\,dm < \infty$.

A sequence $\{I_n\}$ of sets in $\mathcal{I}$ is called regular if there is an increasing sequence $I'_n$ such that $I_n \subset I'_n$ and $|I'_n| \le C|I_n|$ for some constant $C$.

Theorem 26 (Akcoglu–Krengel) Suppose $F_I$ is a superadditive process and $\{I_n\}$ is regular. Then $|I_n|^{-1}F_{I_n}$ converges a.e.

$\{F_I\}$ is additive if the inequality in (b) is replaced by equality. In this case $F_I = \sum_{n \in I} f \circ T_n$ where $f$ is an integrable function. Thus in the additive case the Akcoglu–Krengel result is a theorem about ordinary multi-dimensional ergodic averages, which is in fact a special case of an earlier result of Tempel’man (1972a) (see section “Amenable Groups” below).

Kingman's proof of Theorem 24 hinged on the existence of a certain (typically non-unique) decomposition for subadditive processes. Akcoglu and Krengel's proof of the multi-parameter result does not depend on a Kingman-
type decomposition, in fact they show that there is no such decomposition in general. They prove a weak maximal inequality

$$m\left\{\sup_n |I_n|^{-1}F_{I_n} > \lambda\right\} < \frac{C}{\lambda}\gamma, \qquad (26)$$

where $C$ is a constant depending only on the dimension, and show that this is sufficient to prove their result. In the case $d = 1$ the Akcoglu–Krengel argument provides a new and more natural proof of Kingman's theorem, similar in spirit to Wiener's arguments.

Akcoglu and Sucheston (1978) have proved a ratio ergodic theorem for subadditive processes with respect to a positive $L^1$ contraction, generalizing both the Chacón–Ornstein theorem and Kingman's theorem.

In 1968 Oseledec (1968) proved his celebrated multiplicative ergodic theorem, which gives very precise information about the random matrix products studied by Furstenberg and Kesten. His theorem is an important tool for the study of Lyapunov exponents in differentiable dynamics, see notably Pesin (1977). If $A$ is a $d \times d$ matrix let $\|A\| = \sup\{\|Ax\| : \|x\| = 1\}$ where $\|x\|$ is the Euclidean norm on $\mathbb{R}^d$.

Theorem 27 Suppose $T$ is an endomorphism of the probability space $(X, \mathcal{B}, m)$. Suppose $P$ is a measurable function on $X$ whose values are $d \times d$ real matrices such that $\int \log^+\|P\|\,dm < \infty$ and let $P_n(x) = P(T^{n-1}x)P(T^{n-2}x) \ldots P(x)$. Then there is a $T$-invariant subset $X_0$ of $X$ with measure 1 such that for $x \in X_0$ the following hold.

(a) $\lim_{n \to \infty} \left(P_n^*(x)P_n(x)\right)^{\frac{1}{2n}} = A(x)$ exists.
(b) Let $0 \le \exp\lambda_1(x) < \exp\lambda_2(x) < \ldots < \exp\lambda_r(x)$ be the distinct eigenvalues of $A(x)$ ($r = r(x)$ may depend on $x$ and $\lambda_1$ may be $-\infty$) with multiplicities $m_1(x), \ldots, m_r(x)$. Let $E_i(x)$ be the eigenspace corresponding to $\exp(\lambda_i(x))$ and set

$$F_i(x) = E_1(x) + \ldots + E_i(x).$$

Then for each $u \in F_i(x) \setminus F_{i-1}(x)$

$$\lim \frac{1}{n}\log\|P_n(x)u\| = \lambda_i(x).$$

(c) The functions $m_i$ and $\lambda_i$ are $T$-invariant.
(d) If $T$ is ergodic, $\det P(x) = 1$ a.e. and

$$\limsup_n \frac{1}{n}\int \log\|P_n\|\,dm > 0$$

then the $\lambda_i$ are constants, $\lambda_1 < 0$ and $\lambda_r > 0$.

Raghunathan (1979) gave a much shorter proof of Oseledec's theorem, valid for matrices with entries in a locally compact normed field. He showed that it could be reduced to the Furstenberg–Kesten theorem by considering the exterior powers of $P$. Ruelle (1982) extended Oseledec's theorem to the case where $P$ takes values in the set of bounded operators on a Hilbert space. Walters (1993) has given a proof (under slightly stronger hypotheses) which avoids the matrix calculations and tools from multilinear algebra used in other proofs.

Entropy and the Shannon–McMillan–Breiman Theorem

The notion of entropy was introduced by Shannon in his landmark work (Shannon 1948) which laid the foundations for a mathematical theory of information. Suppose $(X, \mathcal{B}, m)$ is a probability space, $P$ is a finite measurable partition of $X$ and $T$ is an automorphism of $(X, \mathcal{B}, m)$. $P(x)$ denotes the atom of $P$ containing $x$. The entropy of $P$ is

$$h(P) = -\sum_{p \in P} m(p)\log(m(p)) = -\int \log(m(P(x)))\,dm(x) \ge 0. \qquad (27)$$

$-\log(m(A))$ may be viewed as a quantitative measure of the amount of information contained in the statement that a randomly chosen $x \in X$
happens to belong to $A$. So $h(P)$ is the expected information if one is about to observe which atom of $P$ a randomly chosen point falls in. See Billingsley (1965) for more motivation of this concept. See also the article in this collection on entropy by J. King or any introductory book on ergodic theory, e.g. Petersen (1989).

If $P$ and $Q$ are partitions $P \vee Q$ denotes the common refinement which consists of all sets $p \cap q$, $p \in P$, $q \in Q$. It is intuitive and not hard to show that $h(P \vee Q) \le h(P) + h(Q)$. Now let $P_0^n = \bigvee_{i=0}^{n-1} T^i P$ and $h_n = h(P_0^n)$. The sub-

convergence and Breiman (1957) obtained the a.e. convergence.

The original proofs of a.e. convergence used the martingale convergence theorem, were not very intuitive and did not generalize to $\mathbb{Z}^n$-actions, where the martingale theorem is not available. Ornstein and Weiss (1983) found a beautiful and more natural argument which bypasses the martingale theorem and allows generalization to a class of groups which includes $\mathbb{Z}^n$.
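The entropy of a finite partition and the subadditivity $h(P \vee Q) \le h(P) + h(Q)$ are easy to compute on a finite probability space. The following sketch is an illustration only; the 12-point uniform space, the three partitions, and the helper names `entropy` and `join` are assumptions made for the example.

```python
import math
from collections import Counter

def entropy(labels, weights):
    """h(P) = -sum over atoms p of m(p) log m(p); labels[x] names the atom containing x."""
    mass = Counter()
    for lab, w in zip(labels, weights):
        mass[lab] += w
    return -sum(p * math.log(p) for p in mass.values() if p > 0)

def join(labels_p, labels_q):
    """Common refinement P v Q: the atom of x is the pair (P(x), Q(x))."""
    return list(zip(labels_p, labels_q))

points = range(12)
weights = [1 / 12] * 12                 # uniform probability measure on 12 points

P = [x % 2 for x in points]             # two atoms: even / odd
Q = [x % 3 for x in points]             # three atoms, independent of P
R = [x % 4 for x in points]             # four atoms, refining P

h_P, h_Q, h_R = (entropy(part, weights) for part in (P, Q, R))
h_PQ = entropy(join(P, Q), weights)     # equality case: P and Q are independent
h_PR = entropy(join(P, R), weights)     # strict inequality: R already determines P

print(f"h(P v Q) = {h_PQ:.4f}, h(P) + h(Q) = {h_P + h_Q:.4f}")
print(f"h(P v R) = {h_PR:.4f}, h(P) + h(R) = {h_P + h_R:.4f}")
```

The two cases bracket the inequality: independent partitions give equality, while joining with a refinement adds no information beyond the finer partition.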
most of the results to be seen actually hold for a general locally compact group.

Amenability of $G$ is equivalent to the existence of a finitely additive left invariant probability measure on $G$. It is not hard to see that any Abelian, and more generally any solvable, group is amenable. On the other hand the free group $F_2$ on two generators is not amenable. See Paterson (1988) for more information on amenable groups. The Følner property by itself is enough to give a mean ergodic theorem.

Theorem 29 Any Følner sequence is mean good in $L^p$ for $1 \le p < \infty$.

The proof of this result is rather similar to the proof of Theorem 3. In fact Theorem 29 is only a special case of quite general results concerning amenable semi-groups acting on abstract Banach spaces. See the book of Paterson (1988) for more on this.

Turning to pointwise theorems, the Følner condition alone does not yield a pointwise theorem, even when $G = \mathbb{Z}$ and the $F_n$ are intervals. For example Akcoglu and del Junco (1975) have shown that when $G = \mathbb{Z}$ and $F_n = [n, n + \sqrt{n}] \cap \mathbb{Z}$ the pointwise ergodic theorem fails for any aperiodic $T$ and for some characteristic function $f$. See also del Junco and Rosenblatt (1979).

The following pointwise result of Tempelman (1972a) is often quoted. A Følner sequence $\{F_n\}$ is called regular if there is a constant $C$ such that $|F_n^{-1}F_n| \le C|F_n|$ and there is an increasing sequence $F'_n$ such that $F_n \subset F'_n$ and $|F'_n| \le C|F_n|$.

Theorem 30 Any regular Følner sequence is pointwise good in $L^1$.

In case the $F_n$ are intervals in $\mathbb{Z}^n$ this result can be proved by a variant of Wiener's covering argument and in the general case by an abstraction thereof. The condition $|F_n^{-1}F_n| \le C|F_n|$ captures the property of rectangles which is needed for the covering argument. Emerson (1974) independently proved a very similar result.

The work on ergodic theorems for abstract locally compact groups was pioneered by Calderón (1953) who built on Wiener's methods. The main result in this paper is somewhat technical but it already contains the germ of Tempelman's theorem. Other ergodic theorems for amenable groups, whose main interest lies in the case of continuous groups, include Tempel’man (1967), Renaud (1971), Greenleaf (1973) and Greenleaf and Emerson (1974). The discrete versions of these results are all rather close to Tempelman's theorem.

Among pointwise theorems for discrete groups Tempelman's result was essentially the best available for a long time. It was not known whether every amenable group had a Følner sequence which is pointwise good for some $L^p$. In 1988 Shulman (1988) introduced the notion of a tempered Følner sequence $\{F_n\}$, namely one for which

$$\left|\bigcup_{i<n} F_i^{-1}F_n\right| < C|F_n|, \qquad (31)$$

for some constant $C$. The advantage of the tempered condition is that any Følner sequence has a tempered subsequence, and in particular any amenable group has a tempered Følner sequence. Shulman proved a maximal inequality in $L^2$ for such $F_n$ which implies that $\{F_n\}$ is pointwise good in $L^2$. An account of this work may be found in Section 5.6 of Tempelman's book (1992). Lindenstrauss (1999) was able to extend the result to $L^1$.

Theorem 31 Any tempered Følner sequence is pointwise good in $L^1$.

The key new idea in his proof is to use a probabilistic argument to establish a covering lemma. In the discrete case Ornstein and Weiss (2003) have given a non-probabilistic proof of Lindenstrauss's covering lemma. Lindenstrauss also generalizes the a.e. convergence in the Shannon–McMillan–Breiman theorem to this setting. $L^1$ convergence was already established by Kieffer (1975) in 1975.
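The tempered condition (31) can be explored by direct computation in $G = \mathbb{Z}$, where $F_i^{-1}F_n$ becomes the difference set $F_n - F_i$ in additive notation. The sketch below is illustrative; the two interval sequences, the finite range of $n$, and the helper name `tempered_ratio` are assumptions made here. It compares the initial segments $F_n = \{0, \ldots, n-1\}$ with the drifting intervals $F_n = \{n^2, \ldots, n^2 + n - 1\}$.

```python
def tempered_ratio(F, n):
    """|union over i < n of (F_n - F_i)| / |F_n| for finite subsets F(i) of Z.

    In additive notation F_i^{-1} F_n is the difference set {b - a : a in F_i, b in F_n}.
    """
    Fn = list(F(n))
    union = set()
    for i in range(n):
        union.update(b - a for a in F(i) for b in Fn)
    return len(union) / len(Fn)

def initial(n):
    """F_n = {0, ..., n-1}."""
    return range(n)

def drifting(n):
    """F_n = {n^2, ..., n^2 + n - 1}: intervals of length n that move off quickly."""
    return range(n * n, n * n + n)

ns = (5, 10, 20, 40)
ratios_initial = {n: tempered_ratio(initial, n) for n in ns}
ratios_drifting = {n: tempered_ratio(drifting, n) for n in ns}

print("initial segments:", ratios_initial)     # stays below 2 in this range
print("drifting intervals:", ratios_drifting)  # keeps growing in this range
```

In this computation the ratio for the initial segments stays below 2, consistent with (31) holding with $C = 2$, while for the drifting intervals it grows with $n$ over the range tested, which is the kind of behavior that passing to a subsequence is meant to repair.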
Subsequence and Weighted Theorems

In this section $G$ and $T$ are as in the previous section and $\sigma_n$ is a sequence of complex measures on $G$.

This section will be concerned with conditions on $\sigma_n$ that ensure that it is mean or pointwise good. For the most part $G$ will be $\mathbb{Z}$.

Hopf's ergodic theorem gives a class of examples for free. Choose any probability measure $\lambda$ on $G$, define the operator $T_\lambda = \lambda(T)$ and observe that $(T_\lambda)^n = T_{\lambda^n}$, where $\lambda^n$ denotes the convolution power. Since $T_\lambda$ satisfies the hypotheses of Hopf's theorem it follows that the sequence $\sigma_n = \frac{1}{n}\sum_{i=0}^{n-1} \lambda^i$ is pointwise good in $L^1$.

Another sort of example is given by choosing a sequence $g_n$ in $G$ and letting $\sigma_n = \frac{1}{n}\sum_{i=0}^{n-1} \delta_{g_i}$. If one has convergence for such a sequence one speaks of a subsequence ergodic theorem. This section will mainly focus on the case $G = \mathbb{Z}$ and sequences which are the increasing enumeration of a subset $S \subset \mathbb{N}$. Given $S \subset \mathbb{N}$ write $\sigma_{S,n}$ for the corresponding probabilities $\sigma_n$ and say $S$ is good if $\sigma_{S,n}$ is good.

Perhaps the first subsequence ergodic theorem is due to Blum and Hanson (1960) who proved that an automorphism $T$ of a probability space is strongly mixing if and only if $\frac{1}{n}\sum_{i=0}^{n-1} T^{m_i}f$ converges in $L^2$ norm for every $f \in L^2$ and every increasing $\{m_i\}$. Strong mixing means that $m(T^nA \cap B) \to m(A)m(B)$. In 1969 Brunel and Keane (1969) proved the first pointwise subsequence ergodic theorem.

Theorem 32 Suppose that $T$ is a translation on a compact Abelian group $G$ with Haar measure $\lambda$, $g \in G$, and $E \subset G$ is any Borel set with $\lambda(E) > 0$ and $\lambda(\partial E) = 0$. Let $S = \{i > 0 : T^i g \in E\}$. Then $S$ is pointwise good in $L^1$.

Krengel (1971) constructed the first example of a sequence $S \subset \mathbb{N}$ which is pointwise universally bad, in the strong sense that for any aperiodic $T$ the a.e. convergence of $\sigma_{S,n}(T)f$ fails for some characteristic function $f$. Bellow (1983) proved that any lacunary sequence (meaning $a_{n+1} > ca_n$ for some $c > 1$) is pointwise universally bad in $L^1$.

Later Akcoglu et al. (1996) were able to show that for lacunary sequences $\{\sigma_{S,n}\}$ is even strongly sweeping out. A sequence $\{\sigma_n\}$ of probability measures on $\mathbb{Z}$ is said to be strongly sweeping out if for any ergodic $T$ and for all $\delta > 0$ there is a characteristic function $f$ with $\int f\,dm < \delta$ such that $\limsup \sigma_n(T)f = 1$ a.e. It is not difficult to show that if $\{\sigma_n\}$ is strongly sweeping out then there are characteristic functions $f$ such that $\liminf \sigma_n(T)f = 0$ and $\limsup \sigma_n(T)f = 1$. Thus for lacunary sequences the ergodic theorem fails in the worst possible way.

Bellow and Losert (1985) gave the first example of a sequence $S \subset \mathbb{Z}$ of density 0 which is universally good for pointwise convergence, answering a question posed by Furstenberg. They construct an $S$ which is pointwise good in $L^1$. This paper also contains a good overview of the progress on weighted and subsequence ergodic theorems at that time.

Weyl's theorem on uniform distribution (Theorem 9) suggests the possibility of an ergodic theorem for the sequence $\{n^2\}$. It is not hard to see that $\{n^2\}$ is mean good in $L^2$. In fact the spectral theorem and the dominated convergence theorem show that it is enough to prove that the $L^\infty$-bounded sequence of functions $\frac{1}{n}\sum_{i=0}^{n-1} z^{i^2}$ on the unit circle converges at each point $z$ of the unit circle. When $z$ is not a root of unity the sequence converges to 0 by Weyl's result and when $z$ is a root of unity the convergence is trivial because $z^{n^2}$ is periodic. In 1987 Bourgain (1987, 1988d) proved his celebrated pointwise ergodic theorem for polynomial subsequences.

Theorem 33 If $p$ is any polynomial with rational coefficients taking integer values on the integers then $S = \{p(n)\}$ is pointwise good in $L^2$.

The first step in Bourgain's argument is to reduce the problem of proving a maximal inequality to the case of the shift map on the integers, via Calderón's transfer principle. Then the problem is transferred to the circle by using Fourier transforms. At this point the problem becomes a very delicate question about exponential sums and a whole arsenal of tools is brought to bear. See
Rosenblatt and Wierdl (1995) and Quas and Wierdl (Bergelson 2006) (Appendix B) for nice expositions of Bourgain's methods.

Bourgain subsequently improved this to all $L^p$, $p > 1$ and also extended it to sequences $\{[q(n)]\}$ where now $q$ is an arbitrary real polynomial and $[\cdot]$ denotes the greatest integer function. He also announced that his methods can be used to show that the sequence of primes is pointwise good in $L^p$ for any $p > \frac{1 + \sqrt{3}}{2}$. Wierdl (1988) soon extended the result for primes to all $p > 1$.

Theorem 34 The primes are pointwise good in $L^p$ for $p > 1$.

It has remained a major open question for quite some time whether any of these results hold for $p = 1$. In 2005 there appeared a preprint of Mauldin and Buczolich (2005), which remains unpublished, showing that polynomial sequences are $L^1$-universally bad.

Another major result of Bourgain's is the so-called return times theorem (Bourgain 1988e). A simplification of Bourgain's original proof was published jointly with Furstenberg, Katznelson and Ornstein as an appendix to an article (Bourgain 1989b) of Bourgain. To state it let us agree to say that a sequence of complex numbers $\{a(n)\}_{n \ge 0}$ has property P if the sequence of complex measures $\sigma_n = \frac{1}{n}\sum_{i=0}^{n-1} a(i)\delta_i$ has property P, where $\delta_i$ denotes the point mass at $i$.

Theorem 35 (Bourgain) Suppose $T$ is an automorphism of a probability space $(X, \mathcal{B}, m)$, $1 \le p, q \le \infty$ are conjugate exponents and $f \in L^p(m)$. Then for almost all $x$ the sequence $\{f(T^n x)\}$ is pointwise good in $L^q$.

Applying this to characteristic functions $f = 1_E$ one sees that the return time sequence $\{i > 0 : T^i x \in E\}$ is good for pointwise convergence in $L^1$. Theorem 32 is a very special case. It is also easy to see that Theorem 35 contains the Wiener–Wintner theorem.

In 1998 Rudolph (1998) proved a far-reaching generalization of the return times theorem using the technique of joinings. For an introduction to joinings see the article by de la Rue in this collection and also Thouvenot (1995), Glasner (2003) and Rudolph's book (1990). Rudolph's result concerns the convergence of multiple averages

$$\frac{1}{N}\sum_{n=0}^{N-1}\prod_{j=1}^{k} f_j(T_j^n x_j) \qquad (32)$$

where each $T_j$ is an automorphism of a probability space $(X_j, \mathcal{B}_j, m_j)$ and the $f_j$ are $L^\infty$ functions. The point is that the convergence occurs whenever each $x_j \in X'_j$, sets of measure one which may be chosen sequentially for $j = 1, \ldots, k$ without knowing what $T_i$ or $f_i$ are for any $i > j$. He actually proves something stronger, namely he identifies an intrinsic property of a sequence $\{a_i\}$, which he calls fully generic, such that the following hold.

(a) The constant sequence $\{1\}$ is fully generic.
(b) If $\{a_i\}$ is fully generic then for any $T$ and $f \in L^\infty$ the sequence $a_i f(T^i x)$ is fully generic for almost all $x$.
(c) Fully generic implies pointwise good in $L^1$.

The definition of fully generic will not be quoted here as it is somewhat technical.

For a proof of the basic return times theorem using joinings see Rudolph (1994). Assani, Lesigne and Rudolph (1995) took a first step towards the multiple theorem, a Wiener–Wintner version of the return times theorem. Also Assani (2000) independently gave a proof of Rudolph's result in the case when all the $T_j$ are weakly mixing.

Ornstein and Weiss (1992) have proved the following version of the return times theorem for abstract discrete groups. As with $\mathbb{Z}$, let us say that a sequence $\{a_g\}_{g \in G}$ of complex numbers has property P for $\{F_n\}$ if the sequence $\sigma_n = \frac{1}{|F_n|}\sum_{g \in F_n} a(g)\delta_g$ of complex measures has property P.

Theorem 36 Suppose that the increasing Følner sequence $\{F_n\}$ satisfies the Tempelman condition $\sup_n |F_n^{-1}F_n| / |F_n| < \infty$ and $\bigcup F_n = G$. If $b \in L^\infty$ then for a.a. $x$ the sequence $\{b(T_g x)\}$ is pointwise good in $L^1$ for $\{F_n\}$.

Recently Demeter, Lacey, Tao and Thiele (2008) have proved that the return times theorem
remains valid for any $1 < p \le \infty$ and $q \ge 2$. On the other hand Assani, Buczolich and Mauldin (2005) showed that it fails for $p = q = 1$.

Bellow, Jones and Rosenblatt have a series of papers (Bellow et al. 1989, 1990, 1992, 1994) studying general weighted averages associated to a sequence $\sigma_n$ of probability measures on $\mathbb{Z}$, and, in some cases, more general groups. The following are a few of their results. Bellow et al. (1990) is concerned with $\mathbb{Z}$-actions and moving block averages given by $\sigma_n = m_{I_n}$, where the $I_n$ are finite intervals and $m_I$ denotes normalized counting measure on $I$. They resolve the problem completely, obtaining a checkable necessary and sufficient condition for such a sequence to be pointwise good in $L^1$.

Bellow et al. (1992) gives sufficient conditions on a sequence $\sigma_n$ for it to be pointwise good in $L^p$, $p > 1$, via properties of the Fourier transforms $\hat{\sigma}_n$. A particular consequence is that if $\lim_{n \to \infty} \sum_{k \in \mathbb{Z}} |\sigma_n(k) - \sigma_n(k-1)| = 0$ then $\{\sigma_n\}$ has a subsequence which is pointwise good in $L^p$, $p > 1$. In (Bellow et al. 1994) they obtain convergence results for sequences $\sigma_n = \sigma^n$, the convolution powers of a probability measure $\sigma$. A consequence of one of their main results is that if the expectation $\sum_{k \in \mathbb{Z}} k\,\sigma(k)$ is zero, the second moment $\sum_{k \in \mathbb{Z}} k^2 \sigma(k)$ is finite and $\sigma$ is aperiodic (its support is not contained in any proper coset in $\mathbb{Z}$) then $\sigma^n$ is pointwise good in $L^p$ for $p > 1$. Bellow and Calderón (1999) later showed that this last result is valid also for $p = 1$. This is a consequence of the following sufficient condition for a sequence $\{\sigma_n\}$ to satisfy a weak $L^1$ inequality. Given an automorphism $T$ of a probability space $(X, \mathcal{B}, m)$ let $Mf = \sup_n |\sigma_n(T) f(x)|$ be the maximal operator associated to $\{\sigma_n\}$.

Theorem 37 (Bellow and Calderón) Suppose there is an $\alpha \in (0, 1]$ and $C > 0$ such that for each $n > 1$ one has
\[
|\sigma_n(x + y) - \sigma_n(x)| \le C\,\frac{|y|^{\alpha}}{|x|^{1+\alpha}} \quad \text{for all } x, y \in \mathbb{Z} \text{ such that } 0 < 2|y| \le |x|.
\]
Then there is a constant $D$ such that
\[
m\{Mf > \lambda\} \le \frac{D}{\lambda}\,\|f\|_1 \quad \text{for all } T,\ f \in L^1(m) \text{ and } \lambda > 0.
\]

Ergodic Theorems and Multiple Recurrence

Suppose $S \subset \mathbb{N}$. The upper density of $S$ is
\[
\bar{d}(S) = \limsup_{n} \frac{|S \cap [1, n]|}{n} \tag{33}
\]
and the density $d(S)$ is the limit of the same quantity, if it exists. In 1975 Szemerédi (1975) proved the following celebrated theorem, answering an old question of Erdős and Turán.

Theorem 38 Any subset of $\mathbb{N}$ with positive upper density contains an arithmetic progression of length $k$ for each $k \ge 1$.

This result has a distinctly ergodic-theoretic flavor. Letting $T$ denote the shift map on $\mathbb{Z}$, it says that for each $k$ there is an $n$ such that $S_0 = \bigcap_{i=1}^{k} T^{-in} S$ is non-empty. In fact the result gives more: there is an $n$ for which $\bar{d}(S_0) > 0$. In this light Szemerédi’s theorem becomes a multiple recurrence theorem for the shift map on $\mathbb{N}$, equipped with the invariant “measure-like” quantity $\bar{d}$. Of course $\bar{d}$ is not even finitely additive so it is not a measure. $d$, however, is at least finitely additive, when defined, and $d(\mathbb{N}) = 1$.

This point of view suggests the following multiple recurrence theorem.

Theorem 39 Suppose $T$ is an automorphism of a probability space $(X, \mathcal{B}, m)$, $m(B) > 0$ and $k \ge 1$. Then there is an $n > 0$ such that $m\left(\bigcap_{i=1}^{k} T^{-in} B\right) > 0$.

In 1977, Furstenberg (1977) proved the following ergodic theorem which implies the multiple recurrence theorem. He also established a general correspondence principle which puts the shaky analogy between the multiple recurrence theorem
and Szemerédi’s theorem on a firm footing and so allows each to be deduced from the other. Thus he obtained an ergodic theoretic proof of Szemerédi’s combinatorial result.

Theorem 40 Suppose $T$ is an automorphism of a probability space $(X, \mathcal{B}, m)$, $f \in L^\infty$, $f \ge 0$, $\int f\,dm > 0$ and $k \ge 1$. Then
\[
\liminf_{N} \frac{1}{N} \sum_{n=0}^{N-1} \int \prod_{i=1}^{k} T^{in} f \, dm > 0. \tag{34}
\]

Furstenberg’s result opened the door to the study of so-called ergodic Ramsey theory which has yielded a vast array of deep results in combinatorics, many of which have no non-ergodic proof as yet. The focus of this article is not on this direction but the reader is referred to Furstenberg’s book (1981) for an excellent introduction and to Bergelson (1996, 2006) for surveys of later developments. There is also the article by Frantzikinakis and McCutcheon in this collection.

Furstenberg’s proof relies on a deep structure theorem for a general automorphism which was also developed independently by Zimmer (1976a, b) in a more general context. A factor of $T$ is any sub-$\sigma$-algebra $\mathcal{F} \subset \mathcal{B}$ such that

only their absolute versions (i.e. relative to the trivial $\sigma$-algebra $\{\emptyset, X\}$) will be described here. $T$ is said to be compact if for each $f \in L^2$ the orbit $\{T^i f : i \in \mathbb{Z}\}$ is pre-compact in the norm topology of $L^2$. This turns out to be equivalent to the statement that $T$ is a translation on a compact Abelian group endowed with its Haar measure. The property of weak mixing is a fundamental notion in ergodic theory which has many equivalent definitions. The most appropriate for our purposes is that $T$ is weakly mixing if it has no compact factors. This turns out to be equivalent to the ergodicity of $T \times T$ acting on the product measure space $(X, \mathcal{B}, m) \times (X, \mathcal{B}, m)$.

The verification of (34) in the case of compact $T$ is rather easy. In this case it is not hard to prove that for any $f \in L^\infty$ and $\epsilon > 0$ the set $\{n \in \mathbb{Z} : \|T^n f - f\|_2 < \epsilon\}$ has bounded gaps and (34) follows easily. In the case when $T$ is weakly mixing (34) is a consequence of the following theorem which Furstenberg proves in (1977) (as a warm-up for its much harder relative version).

Theorem 41 If $T$ is weakly mixing and $f_1, f_2, \ldots, f_k$ are $L^\infty$ functions then
\[
\lim_{N} \frac{1}{N} \sum_{n=0}^{N-1} \int \prod_{i=1}^{k} T^{in} f_i \, dm = \prod_{i=1}^{k} \int f_i \, dm. \tag{35}
\]
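The positivity asserted in (34) can be checked numerically in the compact case. The following is a minimal sketch that is not taken from the source: it assumes the rotation $Tx = x + \alpha \pmod 1$ on $[0, 1)$ with Lebesgue measure, takes $f = 1_B$ for the interval $B = [0, b)$, and estimates each $m(T^{-n}B \cap \cdots \cap T^{-kn}B)$ on a finite grid. The function name `multiple_recurrence_average` and the parameters `alpha`, `b`, `k`, `N`, `M` are illustrative choices only.

```python
import math


def multiple_recurrence_average(alpha, b, k, N, M=1000):
    """Approximate (1/N) * sum_{n=0}^{N-1} m(T^{-n}B ∩ ... ∩ T^{-kn}B)
    for the rotation T x = x + alpha (mod 1) and B = [0, b).

    Each measure is estimated by counting grid midpoints, so the result
    is a numerical approximation rather than an exact value.
    """
    grid = [(j + 0.5) / M for j in range(M)]
    total = 0.0
    for n in range(N):
        hits = 0
        for x in grid:
            # x lies in T^{-in}B exactly when x + i*n*alpha (mod 1) lies in B.
            if all((x + i * n * alpha) % 1.0 < b for i in range(1, k + 1)):
                hits += 1
        total += hits / M
    return total / N


# Assumed illustrative parameters: an irrational rotation number and a short interval.
alpha = (math.sqrt(5) - 1) / 2
b, k = 0.3, 3
avg = multiple_recurrence_average(alpha, b, k, N=200)
print(f"average = {avg:.4f}, m(B)^k = {b ** k:.4f}")
```

With these parameters the computed average is positive and exceeds $m(B)^k$. That is consistent with a rotation being compact rather than weakly mixing, so the product formula (35) is not expected to hold for it.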
it have been a key tool in subsequent developments in ergodic Ramsey theory and in the convergence results to be discussed in this section. Bergelson (1987) used it to prove the following mean ergodic theorem for weakly mixing automorphisms.

Theorem 43 Suppose $T$ is weakly mixing, $f_1, \ldots, f_k$ are $L^\infty$ functions and $p_1, \ldots, p_k$ are polynomials with rational coefficients taking integer values on the integers such that no $p_i - p_j$ is constant for $i \ne j$. Then
\[
\lim_{N \to \infty} \left\| \frac{1}{N} \sum_{n=0}^{N-1} \prod_{i=1}^{k} T^{p_i(n)} f_i - \prod_{i=1}^{k} \int f_i \, dm \right\|_2 = 0. \tag{36}
\]

Theorems 40 and 41 immediately raise the question of convergence of the multiple averages $\frac{1}{N} \sum_{n=0}^{N-1} \prod_{i=1}^{k} T^{in} f_i$ for a general $T$. Several authors obtained partial results on the question of mean convergence. It was finally resolved only recently by Host and Kra (2005b), who proved the following landmark theorem.

Theorem 44 Suppose $f_1, f_2, \ldots, f_k \in L^\infty$. Then there is a $g \in L^\infty$ such that
\[
\lim_{N} \left\| \frac{1}{N} \sum_{n=0}^{N-1} \prod_{i=1}^{k} T^{in} f_i - g \right\|_2 = 0. \tag{37}
\]

Independently and somewhat later Ziegler (2007) obtained the same result by somewhat different methods. Furstenberg had already established Theorem 44 for $k = 2$ in (Furstenberg 1977). It was proved for $k = 3$ in the case of a totally ergodic $T$ by Conze and Lesigne (1988) and in general by Host and Kra (2001). It can also be obtained using the methods developed by Furstenberg and Weiss (1996). In this paper Furstenberg and Weiss proved a result for polynomial powers of $T$ in the case $k = 2$. They also formalized the key notion of a characteristic factor. A factor $\mathcal{C}$ of $T$ is said to be characteristic for the averages (37) if, roughly speaking, the $L^2$ limiting behavior of the averages is unchanged when any one of the $f_i$’s is replaced by its conditional expectation on $\mathcal{C}$. This means that the question of convergence of these averages may be reduced to the case when the $f_i$ are all $\mathcal{C}$-measurable. So the problem is to find the right (smallest) characteristic factor and prove convergence for that factor.

The importance of characteristic factors was already apparent in Furstenberg’s original paper (1977), where he showed that the maximal distal factor is characteristic for the averages (37). In fact he showed that for a given $k$ a $k$-step distal factor is characteristic. (An automorphism is $k$-step distal if it is the top rung in a $k$-step ladder of factors as in the Furstenberg–Zimmer structure theorem.) It turns out, though, that the right characteristic factor for (37) is considerably smaller. In their seminal paper (Conze and Lesigne 1988) Conze and Lesigne identified the characteristic factor for $k = 3$, now called the Conze–Lesigne factor. As shown in (Host and Kra 2005b; Ziegler 2007), the characteristic factor for a general $k$ is (isomorphic to) an inverse limit of $k$-step nilflows. A $k$-step nilflow is a compact homogeneous space $N/\Gamma$ of a $k$-step nilpotent Lie group $N$, endowed with its unique left-invariant probability measure, on which $T$ acts via left translation by an element of $N$. Ergodic properties of nilflows have been studied for some time in ergodic theory, for example in Parry (1969). In this way the problem of $L^2$-convergence of (37) is reduced to the case when $T$ is a nilflow. In this case one has more: the averages converge pointwise by a result of Leibman (2005b) (see also Ziegler (2005)).

There have already been a good many generalizations of (37). Host and Kra (2005a), Frantzikinakis and Kra (2005b, 2006), and Leibman (2005a) have proved results which replace linear powers of $T$ by polynomial powers. In increasing degrees of generality Conze and Lesigne (1988), Frantzikinakis and Kra (2005a) and Tao (2008) have obtained results which replace the maps $T, T^2, \ldots, T^k$ in (37) by commuting maps $T_1, \ldots, T_k$. Bergelson and Leibman (2002, 2004) have obtained results, both positive and negative, in the case of two non-commuting maps.
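As a worked illustration of Theorem 44 outside the weakly mixing setting, consider the following computation, which is not taken from the source. It assumes $k = 2$, the rotation $Tx = x + \alpha \pmod 1$ on $[0, 1)$ with $\alpha$ irrational, and the characters $e_a(x) = e^{2\pi i a x}$ for integers $a$. With the convention $T^n f(x) = f(T^n x)$, the averages in (37) can be evaluated explicitly:

```latex
\[
\frac{1}{N}\sum_{n=0}^{N-1} T^{n}e_a(x)\, T^{2n}e_b(x)
  = e_{a+b}(x)\,\frac{1}{N}\sum_{n=0}^{N-1} e^{2\pi i (a+2b) n \alpha}
  \;\longrightarrow\;
  \begin{cases}
    e_{a+b}(x), & a + 2b = 0,\\
    0, & a + 2b \neq 0.
  \end{cases}
\]
```

The convergence is uniform in $x$, hence also in $L^2$. For $a = -2$, $b = 1$ the limit $g = e_{-1}$ is nonzero although $\int e_a\,dm \int e_b\,dm = 0$, so for a rotation the limit in (37) need not be the product of integrals that appears in (35) for weakly mixing $T$.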
In the direction of pointwise convergence the only general result is the following theorem of Bourgain (1990) which asserts pointwise convergence in the case $k = 2$.

Theorem 45 Suppose $S$ and $T$ are powers of a single automorphism $R$ and $f, g \in L^\infty$. Then $\frac{1}{N} \sum_{n=1}^{N} f(T^n x)\, g(S^n x)$ converges a.e.

See Part 1 of Derriennic (2006) for a selection of other results in this direction. In spite of these negative results one can obtain quantitative estimates by reformulating the ergodic theorem in various ways. Bishop (1967) proved the following result which is purely finite and constructive in nature and evidently implies the a.e. convergence in Birkhoff’s theorem. If $y = (y_1, \ldots, y_n)$ is a finite sequence of real numbers and $a < b$, an upcrossing of $y$ over $[a, b]$ is a minimal integer interval $[k, l] \subset [1, n]$ satisfying $y_k < a$ and $y_l > b$.

where $C$ is a constant depending only on $\|f\|_1/\epsilon$. If $f \in L^\infty$ then
\[
m\{x : F(\epsilon, f, x) \ge k\} \le A e^{-Bk}, \tag{41}
\]
the pointwise ergodic theorem fails in $L^2$ while Jones, Lacey and Wierdl (1999) have shown that an only slightly faster rate permits a sequence which is pointwise good in $L^2$. How well or badly does the ergodic theorem succeed or fail depending on the rate of convergence of $l_n$? In particular is there a (slow) rate which still guarantees strong sweeping out? (Jones et al. 1999) contains some interesting conjectures in this direction.

There are also interesting questions concerning the mean and pointwise ergodic theorems for subsequences which are chosen randomly in some sense. See (Bourgain 1988c; Jones et al. 1999) for some results in this direction. Again (Jones et al. 1999) contains some interesting conjectures along these lines.

In a recent paper (Bergelson and Leibman 2007) Bergelson and Leibman prove some very interesting and surprising results about the distribution of generalized polynomials. A generalized polynomial is any function which can be built starting with polynomials in $\mathbb{R}[x]$ using the operations of addition, multiplication and taking the greatest integer. As a consequence they derive a generalization of von Neumann’s mean ergodic theorem to averages along generalized polynomial sequences. The following is a special case.

Theorem 51 Suppose $p$ is a generalized polynomial taking integer values on the integers and $U$ is a unitary operator on $\mathcal{H}$. Then $\frac{1}{n} \sum_{i=0}^{n-1} U^{p(i)} x$ is norm convergent for all $x \in \mathcal{H}$.

This raises the question: does one have pointwise convergence? If so, this would be a far-reaching generalization of Bourgain’s polynomial ergodic theorem.

There are also lots of questions concerning the nature of Følner sequences $\{F_n\}$ in an amenable group which give a pointwise theorem. For example Lindenstrauss (1999) has shown that in the lamplighter group, a semi-direct product of $\mathbb{Z}$ with $\bigoplus_{i \in \mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$ on which $\mathbb{Z}$ acts by the shift, there is no sequence satisfying Tempelman’s condition and that any $\{F_n\}$ satisfying the Shulman condition must grow super-exponentially. So, it is natural to ask for slower rates of growth. In particular, in any amenable group is there always a sequence $\{F_n\}$ which is pointwise good and grows at most exponentially? Can one do better either in general or in particular groups? Lindenstrauss’s theorem at least guarantees the existence of Følner sequences which are pointwise good in $L^1$ but in particular groups there are often natural sequences which one hopes might be good. For example in $\bigoplus_{i=1}^{\infty} \mathbb{Z}$ one may take $F_n$ to be a cube based at 0 of side length $l_n$ and dimension $d_n$ (that is, all but the first $d_n$ co-ordinates are zero), where both sequences increase to $\infty$. What conditions on $l_n$ and $d_n$ will give a good sequence? Note that no such sequence is regular in Tempelman’s sense. If $d_n = n$ then $\{l_n\}$ must be super exponential to ensure Shulman’s condition. Can one do better? What about $l_n = d_n = n$?

Bibliography

Akcoglu MA (1975) A pointwise ergodic theorem in Lp-spaces. Can J Math 27(5):1075–1082
Akcoglu MA, Chacon RV (1965) A convexity theorem for positive operators. Z Wahrsch Verw Gebiete 3:328–332
Akcoglu MA, Chacon RV (1970) A local ratio theorem. Can J Math 22:545–552
Akcoglu MA, del Junco A (1975) Convergence of averages of point transformations. Proc Amer Math Soc 49:265–266
Akcoglu MA, Kopp PE (1977) Construction of dilations of positive Lp-contractions. Math Z 155(2):119–127
Akcoglu MA, Krengel U (1981) Ergodic theorems for superadditive processes. J Reine Angew Math 323:53–67
Akcoglu MA, Sucheston L (1978) A ratio ergodic theorem for superadditive processes. Z Wahrsch Verw Gebiete 44(4):269–278
Akcoglu M, Bellow A, Jones RL, Losert V, Reinhold-Larsson K, Wierdl M (1996) The strong sweeping out property for lacunary sequences, Riemann sums, convolution powers, and related matters. Ergod Theory Dyn Sys 16(2):207–253
Akcoglu M, Jones RL, Rosenblatt JM (2000) The worst sums in ergodic theory. Michigan Math J 47(2):265–285
Alaoglu L, Birkhoff G (1939) General ergodic theorems. Proc Nat Acad Sci USA 25:628–630
Assani I (2000) Multiple return times theorems for weakly mixing systems. Ann Inst H Poincaré Probab Statist 36(2):153–165
Assani I (2003) Wiener–Wintner ergodic theorems. World Scientific Publishing Co. Inc, River Edge
Assani I, Lesigne E, Rudolph D (1995) Wiener–Wintner return-times ergodic theorem. Israel J Math 92(1–3):375–395
Assani I, Buczolich Z, Mauldin RD (2005) An L1 counting problem in ergodic theory. J Anal Math 95:221–241
Auslander L, Green L, Hahn F (1963) Flows on homogeneous spaces. With the assistance of Markus L, Massey W and an appendix by Greenberg L. Annals of Mathematics Studies, No 53. Princeton University Press, Princeton, NJ
Banach S (1993) Théorie des opérations linéaires. Éditions Jacques Gabay, Sceaux, reprint of the 1932 original
Bellow A (1983) On “bad universal” sequences in ergodic theory II. In: Belley JM, Dubois J, Morales P (eds) Measure theory and its applications. Lecture notes in math, vol 1033. Springer, Berlin, pp 74–78
Bellow A (1999) Transference principles in ergodic theory. In: Christ M, Kenig CE, Sadowsky C (eds) Harmonic analysis and partial differential equations, Chicago lectures in math. University of Chicago Press, Chicago, pp 27–39
Bellow A, Calderón A (1999) A weak-type inequality for convolution products. In: Christ M, Kenig CE, Sadowsky C (eds) Harmonic analysis and partial differential equations, Chicago lectures in math. University of Chicago Press, Chicago, pp 41–48
Bellow A, Jones R (eds) (1991) Almost everywhere convergence, II. Academic Press, Boston
Bellow A, Losert V (1985) The weighted pointwise ergodic theorem and the individual ergodic theorem along subsequences. Trans Am Math Soc 288(1):307–345
Bellow A, Jones R, Rosenblatt J (1989) Almost everywhere convergence of powers. In: Edgar GA, Sucheston L (eds) Almost everywhere convergence. Academic, Boston, pp 99–120
Bellow A, Jones R, Rosenblatt J (1990) Convergence for moving averages. Ergod Theory Dyn Sys 10(1):43–62
Bellow A, Jones RL, Rosenblatt J (1992) Almost everywhere convergence of weighted averages. Math Ann 293(3):399–426
Bellow A, Jones R, Rosenblatt J (1994) Almost everywhere convergence of convolution powers. Ergod Theory Dyn Sys 14(3):415–432
Bergelson V (1987) Weakly mixing PET. Ergod Theory Dyn Sys 7(3):337–349
Bergelson V (1996) Ergodic Ramsey theory - an update. In: Pollicott M, Schmidt K (eds) Ergodic theory of Zd actions. London Math Soc Lecture Note Ser, vol 228. Cambridge University Press, Cambridge, pp 1–61
Bergelson V (2006) Combinatorial and Diophantine applications of ergodic theory. In: Hasselblatt B, Katok A (eds) Handbook of dynamical systems, vol 1B, Appendix A by Leibman A, Appendix B by Quas A, Wierdl M. Elsevier, Amsterdam, pp 745–869
Bergelson V (2007) Some historical remarks and modern questions around the ergodic theorem. Int Math Nachr 205:1–10
Bergelson V, Leibman A (2002) A nilpotent Roth theorem. Invent Math 147(2):429–470
Bergelson V, Leibman A (2004) Failure of the Roth theorem for solvable groups of exponential growth. Ergod Theory Dyn Sys 24(1):45–53
Bergelson V, Leibman A (2007) Distribution of values of bounded generalized polynomials. Acta Math 198(2):155–230
Berkson E, Bourgain J, Gillespie TA (1991) On the almost everywhere convergence of ergodic averages for power-bounded operators on Lp-subspaces. Integral Equ Oper Theory 14(5):678–715
Billingsley P (1965) Ergodic theory and information. Wiley, New York
Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci U S A 17:656–660
Bishop E (1966) An upcrossing inequality with applications. Michigan Math J 13:1–13
Bishop E (1967/1968) A constructive ergodic theorem. J Math Mech 17:631–639
Blum JR, Hanson DL (1960) On the mean ergodic theorem for subsequences. Bull Amer Math Soc 66:308–311
Bourgain J (1987) On pointwise ergodic theorems for arithmetic sets. C R Acad Sci 305(10):397–402
Bourgain J (1988a) Almost sure convergence and bounded entropy. Israel J Math 63(1):79–97
Bourgain J (1988b) An approach to pointwise ergodic theorems. In: Lindenstrauss J, Milman VD (eds) Geometric aspects of functional analysis (1986/87), Lecture notes in math, vol 1317. Springer, Berlin, pp 204–223
Bourgain J (1988c) On the maximal ergodic theorem for certain subsets of the integers. Israel J Math 61(1):39–72
Bourgain J (1988d) On the pointwise ergodic theorem on Lp for arithmetic sets. Israel J Math 61(1):73–84
Bourgain J (1988e) Temps de retour pour les systèmes dynamiques. C R Acad Sci 306(12):483–485
Bourgain J (1989a) Almost sure convergence in ergodic theory. In: Edgar GA, Sucheston L (eds) Almost everywhere convergence. Academic, Boston, pp 145–151
Bourgain J (1989b) Pointwise ergodic theorems for arithmetic sets. Inst Hautes Études Sci Publ Math (69):5–45, with an appendix by the author, Furstenberg H, Katznelson Y, Ornstein DS
Bourgain J (1990) Double recurrence and almost sure convergence. J Reine Angew Math 404:140–161
Breiman L (1957) The individual ergodic theorem of information theory. Ann Math Statist 28:809–811
Breiman L (1968) Probability. Addison-Wesley, Reading
Brunel A, Keane M (1969) Ergodic theorems for operator sequences. Z Wahrsch Verw Gebiete 12:231–240
Burkholder DL (1962) Semi–Gaussian subspaces. Trans Am Math Soc 104:123–131
Calderon AP (1953) A general ergodic theorem. Ann Math 2(58):182–191
Calderón AP (1968) Ergodic theory and translation-invariant operators. Proc Natl Acad Sci U S A 59:349–353
Calderon AP, Zygmund A (1952) On the existence of certain singular integrals. Acta Math 88:85–139
Chacon RV (1962) Identification of the limit of operator averages. J Math Mech 11:961–968
Chacon RV (1963a) Convergence of operator averages. In: Wright FB (ed) Ergodic theory. Academic, New York, pp 89–120
Chacon RV (1963b) Linear operators in L1. In: Wright FB (ed) Ergodic theory. Academic, New York, pp 75–87
Chacon RV (1964) A class of linear transformations. Proc Am Math Soc 15:560–564
Chacon RV, Ornstein DS (1960) A general ergodic theorem. Ill J Math 4:153–160
Conze JP, Lesigne E (1984) Théorèmes ergodiques pour des mesures diagonales. Bull Soc Math France 112(2):143–175
Conze JP, Lesigne E (1988) Sur un théorème ergodique pour des mesures diagonales. In: Probabilités, Publ Inst Rech Math Rennes, vol 1987. University Rennes I, Rennes, pp 1–31
Cotlar M (1955a) On ergodic theorems. Math Notae 14:85–119. (1956)
Cotlar M (1955b) A unified theory of Hilbert transforms and ergodic theorems. Rev Mat Cuyana 1:105–167. (1956)
Day M (1942) Ergodic theorems for Abelian semigroups. Trans Am Math Soc 51:399–412
del Junco A, Rosenblatt J (1979) Counterexamples in ergodic theory and number theory. Math Ann 245(3):185–197
Demeter C, Lacey M, Tao T, Thiele C (2008) Breaking the duality in the return times theorem. Duke Math J 143(2):281–355
Derrien JM, Lesigne E (1996) Un théorème ergodique polynomial ponctuel pour les endomorphismes exacts et les K-systèmes. Ann Inst H Poincaré Probab Statist 32(6):765–778
Derriennic Y (2006) Some aspects of recent works on limit theorems in ergodic theory with special emphasis on the “central limit theorem”. Discrete Contin Dyn Syst 15(1):143–158
Doob JL (1938) Stochastic processes with an integral-valued parameter. Trans Amer Math Soc 44(1):87–150
Dunford N (1951) An individual ergodic theorem for non-commutative transformations. Acta Sci Math Szeged 14:1–4
Dunford N, Schwartz J (1955) Convergence almost everywhere of operator averages. Proc Natl Acad Sci U S A 41:229–231
Dunford N, Schwartz JT (1988) Linear operators, part I. Wiley Classics Library, Wiley, New York. General theory, with the assistance of Bade WG, Bartle RG, Reprint of the 1958 original, A Wiley–Interscience Publication
Durand S, Schneider D (2003) Random ergodic theorems and regularizing random weights. Ergod Theory Dyn Sys 23(4):1059–1092
Eberlein WF (1949) Abstract ergodic theorems and weak almost periodic functions. Trans Am Math Soc 67:217–240
Edgar G, Sucheston L (eds) (1989) Almost everywhere convergence. Academic Press, Boston
Emerson WR (1974) The pointwise ergodic theorem for amenable groups. Am J Math 96:472–487
Feldman J (2007) A ratio ergodic theorem for commuting, conservative, invertible transformations with quasi-invariant measure summed over symmetric hypercubes. Ergod Theory Dyn Sys 27(4):1135–1142
Foguel SR (1969) The ergodic theory of Markov processes. In: Van Nostrand Mathematical Studies, No 21. Van Nostrand Reinhold, New York
Frantzikinakis N, Kra B (2005a) Convergence of multiple ergodic averages for some commuting transformations. Ergod Theory Dyn Sys 25(3):799–809
Frantzikinakis N, Kra B (2005b) Polynomial averages converge to the product of integrals. Israel J Math 148:267–276
Frantzikinakis N, Kra B (2006) Ergodic averages for independent polynomials and applications. J London Math Soc (2) 74(1):131–142
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Sys Theory 1:1–49
Furstenberg H (1977) Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J Analyse Math 31:204–256
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton University Press, Princeton, M B Porter Lectures
Furstenberg H, Kesten H (1960) Products of random matrices. Ann Math Statist 31:457–469
Furstenberg H, Weiss B (1996) A mean ergodic theorem for $\frac{1}{N}\sum_{n=1}^{N} f(T^n x)\,g(T^{n^2} x)$. In: Bergelson V, March P, Rosenblatt J (eds) Convergence in ergodic theory and probability, Ohio State Univ Math Res Inst Publ, vol 5. de Gruyter, Berlin, pp 193–227
Garsia AM (1970) Topics in almost everywhere convergence. In: Lectures in advanced mathematics, vol 4. Markham, Chicago
Glasner E (2003) Ergodic theory via joinings, Mathematical surveys and monographs, vol 101. American Mathematical Society, Providence
Gowers WT (2001) A new proof of Szemerédi’s theorem. Geom Funct Anal 11(3):465–588
Green B, Tao T (2004) The primes contain arbitrarily large arithmetic progressions. http://arxivorg/abs/mathNT/0404188
Greenleaf FP (1973) Ergodic theorems and the construction of summing sequences in amenable locally compact groups. Commun Pure Appl Math 26:29–46
Greenleaf FP, Emerson WR (1974) Group structure and the pointwise ergodic theorem for connected amenable groups. Adv Math 14:153–172
Guivarc'h Y (1969) Généralisation d’un théorème de von Neumann. C R Acad Sci Paris Sér A-B 268:A1020–A1023
Halmos PR (1946) An ergodic theorem. Proc Natl Acad Sci U S A 32:156–161
Halmos PR (1949) A nonhomogeneous ergodic theorem. Trans Am Math Soc 66:284–288
Halmos PR (1960) Lectures on ergodic theory. Chelsea, New York
Hammersley JM, Welsh DJA (1965) First-passage percolation, subadditive processes, stochastic networks, and generalized renewal theory. In: Proc Internat Res Semin. Statist Lab, University of California, Berkeley. Springer, New York, pp 61–110
Hopf E (1937) Ergodentheorie. In: Ergebnisse der Mathematik und ihrer Grenzgebiete, vol 5. Springer, Berlin
Hopf E (1954) The general temporally discrete Markoff process. J Ration Mech Anal 3:13–45
Host B, Kra B (2001) Convergence of Conze–Lesigne averages. Ergod Theory Dyn Sys 21(2):493–509
Host B, Kra B (2005a) Convergence of polynomial ergodic averages. Israel J Math 149:1–19
Host B, Kra B (2005b) Nonconventional ergodic averages and nilmanifolds. Ann Math 161(1):397–488
Hurewicz W (1944) Ergodic theorem without invariant measure. Ann Math 2(45):192–206
Ivanov VV (1996a) Geometric properties of monotone functions and the probabilities of random oscillations. Sibirsk Mat Zh 37(1):117–115
Ivanov VV (1996b) Oscillations of averages in the ergodic theorem. Dokl Akad Nauk 347(6):736–738
Jewett RI (1969/1970) The prevalence of uniquely ergodic systems. J Math Mech 19:717–729
Jones RL (1987) Necessary and sufficient conditions for a maximal ergodic theorem along subsequences. Ergod Theory Dyn Sys 7(2):203–210
Jones RL (1993) Ergodic averages on spheres. J Anal Math 61:29–45
Jones RL, Wierdl M (1994) Convergence and divergence of ergodic averages. Ergod Theory Dyn Sys 14(3):515–535
Jones RL, Olsen J, Wierdl M (1992) Subsequence ergodic theorems for Lp contractions. Trans Am Math Soc 331(2):837–850
Jones R, Rosenblatt J, Tempelman A (1994) Ergodic theorems for convolutions of a measure on a group. Ill J Math 38(4):521–553
Jones RL, Kaufman R, Rosenblatt JM, Wierdl M (1998) Oscillation in ergodic theory. Ergod Theory Dyn Sys 18(4):889–935
Jones RL, Lacey M, Wierdl M (1999) Integer sequences with big gaps and the pointwise ergodic theorem. Ergod Theory Dyn Sys 19(5):1295–1308
Jones RL, Rosenblatt JM, Wierdl M (2001) Oscillation inequalities for rectangles. Proc Am Math Soc 129(5):1349–1358. (electronic)
Jones RL, Rosenblatt JM, Wierdl M (2003) Oscillation in ergodic theory: higher dimensional results. Israel J Math 135:1–27
Kac M (1947) On the notion of recurrence in discrete stochastic processes. Bull Amer Math Soc 53:1002–1010
Kachurovskii AG (1996a) Rates of convergence in ergodic theorems. Uspekhi Mat Nauk 51(4(310)):73–124
Kachurovskii AG (1996b) Spectral measures and convergence rates in the ergodic theorem. Dokl Akad Nauk 347(5):593–596
Kakutani S (1940) Ergodic theorems and the Markoff process with a stable distribution. Proc Imp Acad Tokyo 16:49–54
Kalikow S, Weiss B (1999) Fluctuations of ergodic averages. In: Proceedings of the conference on probability, ergodic theory, and analysis, vol 43, pp 480–488
Kamae T (1982) A simple proof of the ergodic theorem using nonstandard analysis. Israel J Math 42(4):284–290
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems, Encyclopedia of Mathematics and its Applications, vol 54. Cambridge University Press, Cambridge, with a supplementary chapter by Katok A and Mendoza L
Katznelson Y, Weiss B (1982) A simple proof of some ergodic theorems. Israel J Math 42(4):291–296
Kieffer JC (1975) A generalized Shannon–McMillan theorem for the action of an amenable group on a probability space. Ann Probab 3(6):1031–1037
Kingman JFC (1968) The ergodic theory of subadditive stochastic processes. J Roy Stat Soc Ser B 30:499–510
Kingman JFC (1976) Subadditive processes. In: École d’Été de Probabilités de Saint–Flour, V–1975, Lecture notes in math, vol 539. Springer, Berlin, pp 167–223
Koopman B (1931) Hamiltonian systems and transformations in Hilbert spaces. Proc Natl Acad Sci U S A 17:315–318
Krantz SG, Parks HR (1999) The geometry of domains in space. Birkhäuser Advanced Texts: Basler Lehrbücher. Birkhäuser Boston Inc, Boston
Krengel U (1971) On the individual ergodic theorem for subsequences. Ann Math Stat 42:1091–1095
Krengel U (1978/79) On the speed of convergence in the ergodic theorem. Monatsh Math 86(1):3–6
Krengel U (1985) Ergodic theorems. In: de Gruyter studies in mathematics, vol 6. de Gruyter, Berlin. With a supplement by Antoine Brunel
Krengel U, Lin M, Wittmann R (1990) A limit theorem for order preserving nonexpansive operators in L1. Israel J Math 71(2):181–191
Krieger W (1972) On unique ergodicity. In: Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, vol II. Probability Theory, University of California Press, Berkeley, pp 327–346
Kryloff N, Bogoliouboff N (1937) La théorie générale de la mesure dans son application à l’étude des systèmes dynamiques de la mécanique non linéaire. Ann Math 38(1):65–113
Kuipers L, Niederreiter H (1974) Uniform distribution of sequences. Wiley, New York, Pure and Applied Mathematics
Lamperti J (1958) On the isometries of certain function-spaces. Pac J Math 8:459–466
Leibman A (2005a) Convergence of multiple ergodic averages along polynomials of several variables. Israel J Math 146:303–315
Leibman A (2005b) Pointwise convergence of ergodic averages for polynomial actions of ℤd by translations on a nilmanifold. Ergod Theory Dyn Sys 25(1):215–225
Lemańczyk M, Lesigne E, Parreau F, Volný D, Wierdl M. n.d.
Lesigne E (1989) Théorèmes ergodiques pour une translation sur un nilvariété. Ergod Theory Dyn Sys 9(1):115–126
Lindenstrauss E (1999) Pointwise theorems for amenable groups. Electron Res Announc Amer Math Soc 5:82–90
Loomis LH (1946) A note on the Hilbert transform. Bull Amer Math Soc 52:1082–1086
Lorch ER (1939) Means of iterated transformations in reflexive vector spaces. Bull Amer Math Soc 45:945–947
Mauldin D, Buczolich Z (2005) Concepts behind divergent ergodic averages along the squares. In: Assani I (ed) Ergodic theory and related fields, Contemp math, vol 430. American Mathematical Society, Providence, pp 41–56
McMillan B (1953) The basic theorems of information theory. Ann Math Stat 24:196–219
Merlevède F, Peligrad M, Utev S (2006) Recent advances in invariance principles for stationary sequences. Probab Surv 3:1–36
Neveu J (1961) Sur le théorème ergodique ponctuel. C R Acad Sci Paris 252:1554–1556
Neveu J (1965) Mathematical foundations of the calculus of probability. Trans. Amiel Feinstein. Holden-Day, San Francisco
Nevo A (1994) Harmonic analysis and pointwise ergodic theorems for noncommuting transformations. J Am Math Soc 7(4):875–902
Nevo A (2006) Pointwise ergodic theorems for actions of groups. In: Hasselblatt B, Katok A (eds) Handbook of dynamical systems, vol 1B. Elsevier, Amsterdam, pp 871–982
Reinhold mathematical studies, no 34. Van Nostrand Reinhold Co., London
Ornstein DS (1960) On invariant measures. Bull Amer Math Soc 66:297–300
Ornstein D (1970) Bernoulli shifts with the same entropy are isomorphic. Adv Math 4:337–352
Ornstein D (1971) A remark on the Birkhoff ergodic theorem. Ill J Math 15:77–79
Ornstein D, Weiss B (1983) The Shannon–McMillan–Breiman theorem for a class of amenable groups. Israel J Math 44(1):53–60
Ornstein D, Weiss B (1992) Subsequence ergodic theorems for amenable groups. Israel J Math 79(1):113–127
Oseledec VI (1968) A multiplicative ergodic theorem. Characteristic Ljapunov, exponents of dynamical systems. Trudy Moskov Mat Obšč 19:179–210
Oxtoby JC (1952) Ergodic sets. Bull Amer Math Soc 58:116–136
Parry W (1969) Ergodic properties of affine transformations and flows on nilmanifolds. Am J Math 91:757–771
Paterson A (1988) Amenability, mathematical surveys and monographs, vol 29. American Mathematical Society, Providence
Peck JEL (1951) An ergodic theorem for a non-commutative semigroup of linear operators. Proc Am Math Soc 2:414–421
Pesin JB (1977) Characteristic Ljapunov exponents, and smooth ergodic theory. Uspehi Mat Nauk 32(4(196)):55–112,287
Petersen K (1983) Another proof of the existence of the ergodic Hilbert transform. Proc Am Math Soc 88(1):39–43
Petersen K (1989) Ergodic theory, Cambridge studies in advanced mathematics, vol 2. Cambridge University Press, Cambridge
Pitt HR (1942) Some generalizations of the ergodic theorem. Proc Camb Philos Soc 38:325–343
Poincaré H (1987) Les méthodes nouvelles de la mécanique céleste. Tome I, II, III. Les Grands Classiques Gauthier–Villars. Librairie Scientifique et Technique Albert Blanchard, Paris
Raghunathan MS (1979) A proof of Oseledec’s multiplicative ergodic theorem. Israel J Math 32(4):356–362
Renaud PF (1971) General ergodic theorems for locally compact groups. Am J Math 93:52–64
Rosenblatt JM, Wierdl M (1995) Pointwise ergodic theorems via harmonic analysis. In: Peterson KE, Salama IA (eds) Ergodic theory and its connections with harmonic analysis, London Mathematical Society lecture note series, vol 205. Cambridge University Press, Cambridge, pp 3–151
Nevo A, Stein EM (1994) A generalization of Rudolph DJ (1990) Fundamentals of measurable dynam-
Birkhoff’s pointwise ergodic theorem. Acta Math ics. In: Fundamentals of measurable dynamics: ergodic
173(1):135–154 theory on Lebesgue spaces. Oxford Science Publica-
Nguyen XX (1979) Ergodic theorems for subadditive spa- tions, The Clarendon Press, Oxford University Press,
tial processes. Z Wahrsch Verw Gebiete 48(2):159–176 New York
Orey S (1971) Lecture notes on limit theorems for Markov Rudolph DJ (1994) A joinings proof of Bourgain’s return
chain transition probabilities. In: Van Nostrand time theorem. Ergod Theory Dyn Sys 14(1):197–203
Rudolph DJ (1998) Fully generic sequences and a multiple-term return-times theorem. Invent Math 131(1):199–228
Ruelle D (1982) Characteristic exponents and invariant manifolds in Hilbert space. Ann Math 115(2):243–290
Ryll-Nardzewski C (1951) On the ergodic theorems, II. Ergodic theory of continued fractions. Stud Math 12:74–79
Ryzhikov V (1994) Joinings, intertwining operators, factors and mixing properties of dynamical systems. Russian Acad Sci Izv Math 42:91–114
Shah NA (1998) Invariant measures and orbit closures on homogeneous spaces for actions of subgroups generated by unipotent elements. In: Dani SG (ed) Lie groups and ergodic theory. Tata Inst Fund Res Stud Math, vol 14. Tata Institute of Fundamental Research, Bombay, pp 229–271
Shalom Y (1998) Random ergodic theorems, invariant means and unitary representation. In: Dani SG (ed) Lie groups and ergodic theory. Tata Inst Fund Res Stud Math, vol 14. Tata Institute of Fundamental Research, Bombay, pp 273–314
Shannon CE (1948) A mathematical theory of communication. Bell System Tech J 27:379–423, 623–656
Shields PC (1987) The ergodic and entropy theorems revisited. IEEE Trans Inform Theory 33(2):263–266
Shulman A (1988) Maximal ergodic theorems on groups. Dep Lit NIINTI No. 2184
Sine R (1970) A mean ergodic theorem. Proc Am Math Soc 24:438–439
Smythe RT (1976) Multiparameter subadditive processes. Ann Probab 4(5):772–782
Szemerédi E (1975) On sets of integers containing no k elements in arithmetic progression. Acta Arith 27:199–245
Tao T (2005) The Gaussian primes contain arbitrarily shaped constellations. http://arxiv.org/abs/math/0501314
Tao T (2008) Norm convergence of multiple ergodic averages for commuting transformations. Ergod Theory Dyn Syst 28(2):657–688
Tao T, Ziegler T (2006) The primes contain arbitrarily long polynomial progressions. http://front.math.ucdavis.edu/06105050
Tempelman A (1992) Ergodic theorems for group actions, mathematics and its applications, vol 78. Kluwer, Dordrecht, informational and thermodynamical aspects, Translated and revised from the 1986 Russian original
Tempel'man AA (1967) Ergodic theorems for general dynamical systems. Dokl Akad Nauk SSSR 176:790–793
Tempel'man AA (1972a) Ergodic theorems for general dynamical systems. Trudy Moskov Mat Obšč 26:95–132
Tempel'man AA (1972b) A generalization of a certain ergodic theorem of Hopf. Teor Verojatnost i Primenen 17:380–383
Thouvenot JP (1995) Some properties and applications of joinings in ergodic theory. In: Peterson KE, Salama IA (eds) Ergodic theory and its connections with harmonic analysis, London Math Soc Lecture Note Ser, vol 205. Cambridge University Press, Cambridge, pp 207–235
Tulcea AI (1964) Ergodic properties of positive isometries. Bull AMS 70:366–371
von Neumann J (1932) Proof of the quasi-ergodic hypothesis. Proc Natl Acad Sci USA 18:70–82
Walters P (1993) A dynamical proof of the multiplicative ergodic theorem. Trans Am Math Soc 335(1):245–257
Weber M (1998) Entropie métrique et convergence presque partout, Travaux en Cours, vol 58. Hermann, Paris
Weiss B (2003) Actions of amenable groups. In: Bezuglyi S, Kolyada S (eds) Topics in dynamics and ergodic theory, London Math Soc Lecture Note Ser, vol 310. Cambridge University Press, Cambridge, pp 226–262
Weyl H (1916) Über die Gleichverteilung von Zahlen mod Eins. Math Ann 77(3):313–352
Wiener N (1939) The ergodic theorem. Duke Math J 5(1):1–18
Wiener N, Wintner A (1941) Harmonic analysis and ergodic theory. Am J Math 63:415–426
Wierdl M (1988) Pointwise ergodic theorem along the prime numbers. Israel J Math 64(3):315–336. (1989)
Wittmann R (1995) Almost everywhere convergence of ergodic averages of nonlinear operators. J Funct Anal 127(2):326–362
Yosida K (1940a) An abstract treatment of the individual ergodic theorem. Proc Imp Acad Tokyo 16:280–284
Yosida K (1940b) Ergodic theorems of Birkhoff–Khintchine’s type. Jap J Math 17:31–36
Yosida K, Kakutani S (1939) Birkhoff’s ergodic theorem and the maximal ergodic theorem. Proc Imp Acad, Tokyo 15:165–168
Ziegler T (2005) A non-conventional ergodic theorem for a nilsystem. Ergod Theory Dyn Sys 25(4):1357–1370
Ziegler T (2007) Universal characteristic factors and Furstenberg averages. J Am Math Soc 20(1):53–97. (electronic)
Zimmer RJ (1976a) Ergodic actions with generalized discrete spectrum. Ill J Math 20(4):555–588
Zimmer RJ (1976b) Extensions of ergodic group actions. Ill J Math 20(3):373–409
Zund JD (2002) George David Birkhoff and John von Neumann: a question of priority and the ergodic theorems, 1931–1932. Hist Math 29(2):138–156
Zygmund A (1951) An individual ergodic theorem for non-commutative transformations. Acta Sci Math Szeged 14:103–110
Spectral Theory of Dynamical Systems

Adam Kanigowski¹ and Mariusz Lemańczyk²
¹Department of Mathematics, University of Maryland at College Park, College Park, MD, USA
²Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland

Article Outline

Glossary and Notation
Definition of the Subject
Introduction
Maximal Spectral Type of a Koopman Representation: Alexeyev’s Theorem
Spectral Theory of Weighted Operators
The Multiplicity Function
Rokhlin Cocycles
Rank-1 and Related Systems
Spectral Theory of Dynamical Systems of Probabilistic Origin
Inducing and Spectral Theory
Rigid Sequences
Spectral Theory of Parabolic Dynamical Systems
Spectral Theory for Locally Compact Groups of Type I
Future Directions
Bibliography

Glossary and Notation

AT property of an automorphism: An automorphism $T$ of a standard probability Borel space $(X, \mathcal{B}, \mu)$ is called approximatively transitive (AT for short) if for every $\varepsilon > 0$ and every finite set $f_1, \ldots, f_n$ of non-negative $L^1$-functions on $(X, \mathcal{B}, \mu)$ we can find $f \in L^1(X, \mathcal{B}, \mu)$, also non-negative, such that $\| f_i - \sum_j a_{ij}\, f \circ T^{n_j} \|_{L^1} < \varepsilon$ for all $i = 1, \ldots, n$ (for some $a_{ij} \ge 0$, $n_j \in \mathbb{N}$).

Ergodicity, weak mixing, mild mixing, mixing, and rigidity of $\mathcal{T}$: A measure-preserving $\mathbb{A}$-action $\mathcal{T} = (T_a)_{a \in \mathbb{A}}$ is called ergodic if $\chi_0 \equiv 1$ is a simple eigenvalue of $U_{\mathcal{T}}$. It is weakly mixing if $U_{\mathcal{T}}$ has a continuous spectrum on the subspace $L^2_0(X, \mathcal{B}, \mu)$ of zero mean functions. $\mathcal{T}$ is said to be rigid if there is a sequence $(a_n)$ going to infinity in $\mathbb{A}$ such that the sequence $(U_{T_{a_n}})$ goes to the identity in the strong (or weak) operator topology; $\mathcal{T}$ is said to be mildly mixing if it has no non-trivial rigid factors. We say that $\mathcal{T}$ is mixing if the operator equal to zero is the only limit point of $\{ U_{T_a}|_{L^2_0(X, \mathcal{B}, \mu)} : a \in \mathbb{A} \}$ in the weak operator topology.

Cocycles and group extensions: If $T$ is an ergodic automorphism, $G$ is an l.c.s.c. Abelian group, and $\varphi : X \to G$ is measurable, then the pair $(T, \varphi)$ generates a cocycle $\varphi^{(\cdot)}(\cdot) : \mathbb{Z} \times X \to G$, where

$$\varphi^{(n)}(x) = \begin{cases} \varphi(x) + \ldots + \varphi(T^{n-1}x) & \text{for } n > 0, \\ 0 & \text{for } n = 0, \\ -\left(\varphi(T^{n}x) + \ldots + \varphi(T^{-1}x)\right) & \text{for } n < 0. \end{cases}$$

(That is, $(\varphi^{(n)})$ is a standard 1-cocycle in the algebraic sense for the $\mathbb{Z}$-action $n(f) = f \circ T^n$ on the group of measurable functions on $X$ with values in $G$; hence, the function $\varphi : X \to G$ itself is often called a cocycle.) Assume additionally that $G$ is compact. Using the cocycle $\varphi$, we define a group extension $T_\varphi$ on $(X \times G, \mathcal{B} \otimes \mathcal{B}(G), \mu \otimes \lambda_G)$ ($\lambda_G$ stands for Haar measure of $G$), where $T_\varphi(x, g) = (Tx, \varphi(x) + g)$.

Induced automorphism: Assume that $T$ is an automorphism of a standard probability Borel space $(X, \mathcal{B}, \mu)$. Let $A \in \mathcal{B}$, $\mu(A) > 0$. The induced automorphism $T_A$ is defined on the conditional space $(X, \mathcal{B}_A, \mu_A)$, where $\mathcal{B}_A$ is the trace of $\mathcal{B}$ on $A$, $\mu_A(B) = \mu(B)/\mu(A)$ for $B \in \mathcal{B}_A$, and $T_A(x) = T^{k_A(x)} x$, where $k_A(x)$ is the smallest $k \ge 1$ for which $T^k x \in A$.

Kolmogorov group property: An $\mathbb{A}$-action $\mathcal{T}$ satisfies the Kolmogorov group property if $\sigma_{U_{\mathcal{T}}} * \sigma_{U_{\mathcal{T}}} \ll \sigma_{U_{\mathcal{T}}}$.
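As an illustrative aside on the entry "Cocycles and group extensions", the following Python sketch evaluates $\varphi^{(n)}(x)$ for positive, zero, and negative $n$ and checks numerically the cocycle identity $\varphi^{(m+n)}(x) = \varphi^{(m)}(x) + \varphi^{(n)}(T^m x)$. The irrational rotation $Tx = x + \alpha \bmod 1$, the choice $G = \mathbb{R}$, the sample function $\varphi(x) = \cos 2\pi x$, and all function names are assumptions made for this example; none of them comes from the article.

```python
import math

ALPHA = (math.sqrt(5) - 1) / 2  # assumed rotation angle (golden mean), illustration only


def T(x, k=1):
    """k-th iterate (k may be negative) of the rotation x -> x + ALPHA mod 1."""
    return (x + k * ALPHA) % 1.0


def phi(x):
    """Sample measurable function with values in G = R (an assumption)."""
    return math.cos(2 * math.pi * x)


def cocycle(n, x):
    """phi^(n)(x): forward sum for n > 0, 0 for n = 0, minus a backward sum for n < 0."""
    if n > 0:
        return sum(phi(T(x, k)) for k in range(n))
    if n < 0:
        return -sum(phi(T(x, k)) for k in range(n, 0))
    return 0.0


def cocycle_defect(m, n, x):
    """|phi^(m+n)(x) - phi^(m)(x) - phi^(n)(T^m x)|; zero up to rounding."""
    return abs(cocycle(m + n, x) - cocycle(m, x) - cocycle(n, T(x, m)))
```

Calling `cocycle_defect(3, -5, 0.3)` should return a value at the level of floating-point rounding, which is the numerical counterpart of the 1-cocycle property mentioned in the entry.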
© Springer Science+Business Media, LLC, part of Springer Nature 2023
C. E. Silva, A. I. Danilenko (eds.), Ergodic Theory,
https://doi.org/10.1007/978-1-0716-2388-6_511
Originally published in
R. A. Meyers (ed.), Encyclopedia of Complexity and Systems Science, © Springer Science+Business Media LLC 2022
https://doi.org/10.1007/978-3-642-27737-5_511-2
ability Borel space $(X, \mathcal{B}, \mu)$. The Koopman representation $U = U_{\mathcal{T}}$ of $\mathcal{T}$ in $L^2(X, \mathcal{B}, \mu)$ is defined as the unitary representation $a \mapsto U_{T_a} \in U(L^2(X, \mathcal{B}, \mu))$, where $U_{T_a}(f) = f \circ T_a$.

Markov operator: A linear operator $J : L^2(X, \mathcal{B}, \mu) \to L^2(Y, \mathcal{C}, \nu)$ is called Markov if it sends non-negative functions to non-negative functions and $J1 = J^*1 = 1$.

Maximal spectral type and the multiplicity function of $U$: The maximal spectral type $\sigma_U$ of $U$ is the type of $\sigma_{x_1}$ in any spectral decomposition of $H$; the multiplicity function $M_U : \widehat{\mathbb{A}} \to \{1, 2, \ldots\} \cup \{+\infty\}$ is defined $\sigma_U$-a.e. and $M_U(\chi) = \sum_{i=1}^{\infty} 1_{Y_i}(\chi)$, where $Y_1 = \widehat{\mathbb{A}}$ and $Y_i = \operatorname{supp} \frac{d\sigma_{x_i}}{d\sigma_{x_1}}$ for $i \ge 2$. A representation $U$ is said to have simple spectrum if $H$ is reduced to a single cyclic space. The multiplicity is uniform if there is only one essential value of $M_U$. The essential supremum of $M_U$ is called the maximal spectral multiplicity. $U$ is said to have discrete spectrum if $H$ has an orthonormal basis consisting of eigenvectors of $U$; $U$ has singular (Haar, absolutely continuous) spectrum if the maximal spectral type of $U$ is singular with respect to (equivalent to, absolutely continuous with) a Haar measure of $\widehat{\mathbb{A}}$.

SCS property: We say that a Borel measure $\sigma$ on $\widehat{\mathbb{A}}$ satisfies the strong convolution singularity property (SCS property) if, for each $n \ge 1$, in the disintegration (given by the map $(\chi_1, \ldots, \chi_n) \mapsto \chi_1 \cdot \ldots \cdot \chi_n$) $\sigma^{\otimes n} = \int_{\widehat{\mathbb{A}}} \nu_\chi \, d\sigma^{(n)}(\chi)$ the conditional measures $\nu_\chi$ are atomic with exactly $n!$ atoms ($\sigma^{(n)}$ stands for the $n$-th convolution $\sigma * \ldots * \sigma$). An $\mathbb{A}$-action $\mathcal{T}$ satisfies the SCS property if the maximal spectral type of $U_{\mathcal{T}}$ on $L^2_0$ is a type of an SCS measure.

Special flow: Given an ergodic automorphism $T$ on a standard probability Borel space $(X, \mathcal{B}, \mu)$ and a positive integrable function $f : X \to \mathbb{R}^+$, we put $X^f = \{(x, t) \in X \times \mathbb{R} : 0 \le t < f(x)\}$, $\mathcal{B}^f = \mathcal{B}$ […] speed, and once it reaches the graph of $f$, it is identified with $(Tx, 0)$.

Spectral decomposition of a unitary representation: If $U = (U_a)_{a \in \mathbb{A}}$ is a continuous unitary representation of a locally compact second countable (l.c.s.c.) Abelian group $\mathbb{A}$ in a separable Hilbert space $H$, then a decomposition $H = \bigoplus_{i=1}^{\infty} \mathbb{A}(x_i)$ is called spectral if $\sigma_{x_1} \gg \sigma_{x_2} \gg \ldots$ (such a sequence of measures is also called spectral); here $\mathbb{A}(x) := \overline{\operatorname{span}}\{U_a x : a \in \mathbb{A}\}$ is called the cyclic space generated by $x \in H$, and $\sigma_x$ stands for the spectral measure of $x$.

Spectral disjointness: Two $\mathbb{A}$-actions $\mathcal{S}$ and $\mathcal{T}$ are called spectrally disjoint if the maximal spectral types of their Koopman representations $U_{\mathcal{T}}$ and $U_{\mathcal{S}}$ on the corresponding $L^2_0$-spaces are mutually singular.

Time change: Let $R = (R_t)_{t \in \mathbb{R}}$ be a flow on $(X, \mathcal{B}, \mu)$ and let $v \in L^1(X, \mathcal{B}, \mu)$ be a positive function. The function $v$ determines a cocycle over $R$ given by the formula $v(t, x) := \int_0^t v(R_s x)\, ds$. Then for a.e. $x \in X$ and all $t \in \mathbb{R}$, there exists a unique $u = u(t, x)$ such that $\int_0^u v(R_s x)\, ds = t$. Now, we can define the flow $\widetilde{R}_t(x) := R_{u(t,x)}(x)$. The new flow $\widetilde{R} = (\widetilde{R}_t)_{t \in \mathbb{R}}$ has the same orbits as the original flow, and it preserves the measure $\widetilde{\mu} \ll \mu$ (hence, it is ergodic if $R$ was), where $\frac{d\widetilde{\mu}}{d\mu} = v / \int_X v \, d\mu$.

Unitary actions on Fock spaces: If $H$ is a separable Hilbert space, then by $H^{\odot n}$, we denote the subspace of $n$-tensors of $H^{\otimes n}$ symmetric under all permutations of coordinates, $n \ge 1$; then the Hilbert space $F(H) := \bigoplus_{n=0}^{\infty} H^{\odot n}$ is called a symmetric Fock space. If $V \in U(H)$, then $F(V) := \bigoplus_{n=0}^{\infty} V^{\odot n} \in U(F(H))$, where $V^{\odot n} = V^{\otimes n}|_{H^{\odot n}}$.

Weighted operator: Let $T$ be an ergodic automorphism of $(X, \mathcal{B}, \mu)$ and $\xi : X \to \mathbb{T}$ be a measurable function. The (unitary) operator $V = V_{\xi,T}$
acting on $L^2(X, \mathcal{B}, \mu)$ by the formula $V_{\xi,T}(f)(x) = \xi(x) f(Tx)$ is called a weighted operator.

[…] $V^{\mu}_a(f)(\chi) = i(a)(\chi) f(\chi) = \chi(a) f(\chi)$, $\chi \in \widehat{\mathbb{A}}$, […]
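To make the definition of a weighted operator concrete, here is a small finite toy model, offered only as a sketch. The space $X$ is replaced by the cyclic group $\mathbb{Z}/N$ with the uniform probability measure, $T$ by the shift $x \mapsto x + 1$, and $\xi$ by a list of complex numbers of modulus one; these choices, the constant `N`, and all names are assumptions for illustration and are not taken from the article. The code applies $V_{\xi,T} f(x) = \xi(x) f(Tx)$ and prepares two quantities to compare: the $L^2$ norms of $f$ and $V_{\xi,T} f$, and the two sides of the identity $V_{\xi,T} = M_\eta U_T M_\eta^{-1}$ that holds when $\xi(x) = \eta(x)/\eta(Tx)$ (here $M_\eta$ denotes multiplication by $\eta$).

```python
import cmath
import math

N = 12  # size of the toy space Z/N (assumption)


def shift(x):
    """Toy automorphism T: x -> x + 1 on Z/N."""
    return (x + 1) % N


def weighted_operator(xi, f):
    """Apply V_{xi,T}: (V f)(x) = xi(x) * f(T x); functions are stored as lists."""
    return [xi[x] * f[shift(x)] for x in range(N)]


def l2_norm(f):
    """L^2 norm with respect to the uniform probability measure on Z/N."""
    return math.sqrt(sum(abs(v) ** 2 for v in f) / N)


def coboundary(eta):
    """xi(x) = eta(x) / eta(T x) for a unimodular eta."""
    return [eta[x] / eta[shift(x)] for x in range(N)]


# Sample data: a unimodular eta and an arbitrary test function f (both hypothetical).
eta = [cmath.exp(2j * math.pi * (x * x) / 7.0) for x in range(N)]
f = [complex(x, 1 - x) for x in range(N)]
xi = coboundary(eta)

lhs = weighted_operator(xi, f)            # V_{xi,T} f
g = [f[x] / eta[x] for x in range(N)]     # M_eta^{-1} f
koop = weighted_operator([1.0] * N, g)    # U_T applied to M_eta^{-1} f
rhs = [eta[x] * koop[x] for x in range(N)]  # M_eta U_T M_eta^{-1} f
```

In this toy setting `lhs` and `rhs` agree up to rounding, and `l2_norm(lhs)` equals `l2_norm(f)`, reflecting that a weighted operator with unimodular weight is unitary and that a coboundary weight gives an operator unitarily equivalent to the Koopman operator of $T$.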
$([\lambda], M \equiv 1)$; see section “Remarks on the Banach Problem.” Another important problem is to give a complete spectral classification in some classes of dynamical systems (classically, it was done in the theory of Kolmogorov and Gaussian dynamical systems). We will also see how spectral properties of dynamical systems can determine their statistical (ergodic) properties; a culmination is given by the fact that a spectral isomorphism may imply measure-theoretic similitude (discrete spectrum case, Gaussian-Kronecker case). An old conjecture is that a dynamical system $\mathcal{T}$ either is spectrally determined or there are uncountably many pairwise non-isomorphic systems spectrally isomorphic to $\mathcal{T}$.

We could also consider Koopman representations in $L^p$ for $1 \le p \ne 2$. However, whenever $W : L^p(X, \mathcal{B}, \mu) \to L^p(Y, \mathcal{C}, \nu)$ is a surjective isometry and $W \circ U_{T_a} = U_{S_a} \circ W$ for each $a \in \mathbb{A}$, then by the Lamperti theorem (e.g., Royden 1968), the isometry $W$ has to come from a non-singular pointwise map $R : Y \to X$, and, by ergodicity, $R$ “preserves” the measure $\nu$ and hence establishes a measure-theoretic isomorphism (Kachurovskii 1990) (see also Lemańczyk (1996)). Thus, spectral classification of such Koopman representations goes back to the measure-theoretic classification of dynamical systems, so it looks hardly interesting. This does not mean that there are no interesting dynamical questions for $p \ne 2$. Let us mention Thouvenot’s still open question (formulated in the 1980s) for $\mathbb{Z}$-actions: For every ergodic $T$ acting on $(X, \mathcal{B}, \mu)$, does there exist $f \in L^1(X, \mathcal{B}, \mu)$ such that the closed linear span of $f \circ T^n$, $n \in \mathbb{Z}$, equals $L^1(X, \mathcal{B}, \mu)$?

Iwanik (1991, 1992) proved that if $T$ is a system with positive entropy, then its $L^p$-multiplicity is $\infty$ for all $p > 1$. Moreover, Iwanik and de Sam Lazaro (1991) proved that for Gaussian systems (they will be considered in section “Spectral Theory of Dynamical Systems of Probabilistic Origin”), the $L^p$-multiplicities are the same for all $p > 1$ (see also Lemańczyk and de Sam Lazaro (1997)).

Markov Operators, Joinings and Koopman Representations, Disjointness and Spectral Disjointness, and Entropy

We would like to emphasize that spectral theory is closely related to the theory of joinings (see de la Rue’s article (de la Rue 2009) for needed definitions). The elements $\rho$ of the set $J(\mathcal{S}, \mathcal{T})$ of joinings of two $\mathbb{A}$-actions $\mathcal{S}$ and $\mathcal{T}$ are in a 1–1 correspondence with Markov operators $J = J_\rho$ between the $L^2$-spaces equivariant with the corresponding Koopman representations (see section “Glossary and Notation” and de la Rue (2009)). The set of ergodic self-joinings of an ergodic $\mathbb{A}$-action $\mathcal{T}$ is denoted by $J_2^e(\mathcal{T})$.

Each Koopman representation $U_{\mathcal{T}}$ consists of Markov operators (indeed, $U_{T_a}$ is clearly a Markov operator). In fact, if $U \in U(L^2(X, \mathcal{B}, \mu))$ is Markov, then it is of the form $U_R$, where $R \in \operatorname{Aut}(X, \mathcal{B}, \mu)$ (Lemańczyk and Parreau 2012). This allows us to see Koopman representations as unitary Markov representations, but since a spectral isomorphism does not “preserve” the set of Markov operators, spectrally isomorphic systems can have drastically different sets of self-joinings.

We will touch here only on some aspects of the interactions (clearly, far from completeness) between the spectral theory and the theory of joinings.

To see an example of such interactions, let us recall that the simplicity of eigenvalues for ergodic systems yields a short “joining” proof of the classical isomorphism theorem of Halmos-von Neumann in the discrete spectrum case: Assume that $\mathcal{S} = (S_a)_{a \in \mathbb{A}}$ and $\mathcal{T} = (T_a)_{a \in \mathbb{A}}$ are ergodic $\mathbb{A}$-actions on $(X, \mathcal{B}, \mu)$ and $(Y, \mathcal{C}, \nu)$, respectively. Assume that both Koopman representations have purely discrete spectrum and that their groups of eigenvalues are the same. Then $\mathcal{S}$ and $\mathcal{T}$ are measure-theoretically isomorphic. Indeed, each ergodic joining of $\mathcal{T}$ and $\mathcal{S}$ is the graph of an isomorphism of these two systems (see Lemańczyk (1996); see also Goodson’s proof in Goodson (1999)). Another example of such interactions appears in the study of Rokhlin’s multiple mixing problem and its relation with the pairwise independence
property (PID) for joinings of higher order. We will not deal with this subject here, referring the reader to de la Rue (2009) (see however section “Lifting Mixing Properties”).

Following Furstenberg (1967), two $\mathbb{A}$-actions $\mathcal{S}$ and $\mathcal{T}$ are called disjoint provided the product measure is the only element in $J(\mathcal{S}, \mathcal{T})$ (if they are disjoint, one of these actions has to be ergodic). It was already noticed in Hahn and Parry (1968) that spectrally disjoint systems are disjoint in the Furstenberg sense; indeed, $\operatorname{Im}\left(J_\rho|_{L^2_0}\right) = \{0\}$ since $\sigma_{\mathcal{T}, J_\rho f} \ll \sigma_{\mathcal{S}, f}$.

Notice that whenever we take $\rho \in J_2^e(\mathcal{T})$, we obtain a new ergodic $\mathbb{A}$-action $(T_a \times T_a)_{a \in \mathbb{A}}$ defined on the probability space $(X \times X, \rho)$. One can now ask how close spectrally this new action is to $\mathcal{T}$. It turns out that, apart from the obvious fact that the marginal $\sigma$-algebras are factors, $(\mathcal{T} \times \mathcal{T}, \rho)$ can have other factors spectrally disjoint from $\mathcal{T}$: the most striking phenomenon here is a result of Smorodinsky and Thouvenot (1979) (see also Danilenko and Park (2002)) saying that each zero entropy system is a factor of an ergodic self-joining system of a fixed Bernoulli system (Bernoulli systems themselves have countable Haar spectrum). The situation changes if $\rho = \mu \otimes \mu$. In this case, for $f, g \in L^2(X, \mathcal{B}, \mu)$, the spectral measure of $f \otimes g$ is equal to $\sigma_{\mathcal{T}, f} * \sigma_{\mathcal{T}, g}$. A consequence of this observation is that an ergodic action $\mathcal{T} = (T_a)_{a \in \mathbb{A}}$ is weakly mixing (see section “Glossary and Notation”) if and only if the product measure $\mu \otimes \mu$ is an ergodic self-joining of $\mathcal{T}$.

The entropy, which is a basic measure-theoretic invariant, does not appear when we deal with spectral properties. We will not give here any formal definition of entropy for amenable group actions, referring the reader to Ornstein and Weiss (1987). Assume that $\mathbb{A}$ is countable and discrete. We always assume that $\mathbb{A}$ is Abelian; hence, it is amenable. For each dynamical system $\mathcal{T} = (T_a)_{a \in \mathbb{A}}$ acting on $(X, \mathcal{B}, \mu)$, we can find a largest invariant sub-$\sigma$-field $\mathcal{P} \subset \mathcal{B}$, called the Pinsker $\sigma$-algebra, such that the entropy of the corresponding quotient system is zero. Generalizing the classical Rokhlin-Sinai theorem (see also Kamiński (1981) for $\mathbb{Z}^d$-actions), Thouvenot (unpublished) and independently Dooley and Golodets (2002) proved this theorem for groups even more general than those considered here: The spectrum of $U_{\mathcal{T}}$ on $L^2(X, \mathcal{B}, \mu) \ominus L^2(\mathcal{P})$ is Haar with uniform infinite multiplicity. This general result is quite intricate and based on methods introduced to entropy theory by Rudolph and Weiss (2000) with a very surprising use of Dye’s theorem on orbital equivalence of all ergodic systems. For $\mathbb{A}$ which is not countable, the same result was proved in Arni (2005) in the case of unimodular amenable groups which are not an increasing union of compact subgroups. It follows that spectral theory of dynamical systems essentially reduces to the zero entropy case.

Maximal Spectral Type of a Koopman Representation: Alexeyev’s Theorem

Only a few general properties of maximal spectral types of Koopman representations are known. The fact that a Koopman representation preserves the space of real functions implies that its maximal spectral type is the type of a symmetric (invariant under the map $\chi \mapsto \overline{\chi}$) measure.

Recall that the Gelfand spectrum $\sigma(U)$ of a representation $U = (U_a)_{a \in \mathbb{A}}$ is defined as the set of approximative eigenvalues of $U$, that is, $\sigma(U) \ni \chi \in \widehat{\mathbb{A}}$ if for a sequence $(x_n)$ bounded and bounded away from zero, $\| U_a x_n - \chi(a) x_n \| \to 0$ for each $a \in \mathbb{A}$. The spectrum is a closed subset in the topology of pointwise convergence, hence in the compact-open topology of $\widehat{\mathbb{A}}$. In the case of $\mathbb{A} = \mathbb{Z}$, the above set $\sigma(U)$ is equal to $\{ z \in \mathbb{C} : U - z \cdot \mathrm{Id} \text{ is not invertible} \}$.

Assume now that $\mathbb{A}$ is countable and discrete (and Abelian). Then there exists a Fölner sequence $(B_n)_{n \ge 1}$ whose elements tile $\mathbb{A}$ (Ornstein and Weiss 1987). Take a free and ergodic action $\mathcal{T} = (T_a)_{a \in \mathbb{A}}$ on $(X, \mathcal{B}, \mu)$. By Ornstein and Weiss (1987), for each $\varepsilon > 0$, we can find a set $Y_n \in \mathcal{B}$ such that the sets $T_b Y_n$ are pairwise disjoint for $b \in B_n$ and $\mu\left(\bigcup_{b \in B_n} T_b Y_n\right) > 1 - \varepsilon$. For each $\chi \in \widehat{\mathbb{A}}$, by considering functions of the form $f_n = \sum_{b \in B_n} \chi(b) 1_{T_b Y_n}$, we obtain that $\chi \in \sigma(U_{\mathcal{T}})$. It follows that the topological support of the maximal spectral type of the Koopman representation of a free and ergodic
action is full (Katok and Thouvenot 2006; Lemańczyk 1996; Nadkarni 1998). The theory of Gaussian systems shows in particular that there are symmetric measures on the circle whose topological support is the whole circle but which cannot be maximal spectral types of Koopman representations.

A well-known open question is whether an absolutely continuous measure $\rho$ is the maximal spectral type of a Koopman representation if and only if $\rho$ is equivalent to a Haar measure of $\widehat{\mathbb{A}}$ (this is unknown for $\mathbb{A} = \mathbb{Z}$). Another interesting question was raised by A. Katok (see Katok and Lemańczyk (2009)): Is it true that the topological supports of all measures in a spectral sequence of a Koopman representation are full? If the answer to this question is positive, then, for example, the essential supremum of $M_{U_{\mathcal{T}}}$ is the same on all balls of $\widehat{\mathbb{A}}$.

Notice that the fact that all spectral measures in a spectral sequence are symmetric means that $U_{\mathcal{T}}$ is isomorphic to $U_{\mathcal{T}^{-1}}$. A. del Junco (1981) showed that generically for $\mathbb{Z}$-actions, $T$ and its inverse are not measure-theoretically isomorphic (in fact, he proved disjointness).

Let $\mathcal{T}$ be an $\mathbb{A}$-action on $(X, \mathcal{B}, \mu)$. One can ask whether a “good” function can realize the maximal spectral type of $U_{\mathcal{T}}$. In particular, can we find a function $f \in L^\infty(X, \mathcal{B}, \mu)$ that realizes the maximal spectral type? The answer is given in the following general theorem (see Lemańczyk and Wasieczko (2006)).

Theorem 3 (Alexeyev’s Theorem) Assume that $U = (U_a)_{a \in \mathbb{A}}$ is a unitary representation of $\mathbb{A}$ in a separable Hilbert space $H$. Assume that $F \subset H$ is a dense linear subspace. Assume moreover that with some $F$-norm $|\cdot|$, stronger than the norm $\|\cdot\|$ given by the scalar product, $F$ becomes a Fréchet space. Then, for each spectral measure $\sigma$ ($\ll \sigma_U$), there exists $y \in F$ such that $\sigma_y \gg \sigma$. In particular, there exists $y \in F$ that realizes the maximal spectral type.

By taking $H = L^2(X, \mathcal{B}, \mu)$ and $F = L^\infty(X, \mathcal{B}, \mu)$, we obtain the positive answer to the original question. Alexeyev (1958) proved the above theorem for $\mathbb{A} = \mathbb{Z}$ using analytic functions. Refining Alexeyev’s original proof, Frączek (1997) showed the existence of a sufficiently regular function realizing the maximal spectral type depending only on the “regularity” of the underlying probability space, e.g., when $X$ is a compact metric space (compact smooth manifold), then one can find a continuous (smooth) function realizing the maximal spectral type.

By the theory of systems of probabilistic origin (see section “Spectral Theory of Dynamical Systems of Probabilistic Origin”), in the case of simplicity of the spectrum, one can easily point out spectral measures whose types are not realized by (essentially) bounded functions. However, it is still an open question whether for each Koopman representation $U_{\mathcal{T}}$ there exists a sequence $(f_i)_{i \ge 1} \subset L^\infty(X, \mathcal{B}, \mu)$ such that the sequence $(\sigma_{f_i})_{i \ge 1}$ is a spectral sequence for $U_{\mathcal{T}}$. For $\mathbb{A} = \mathbb{Z}$, it is unknown whether the maximal spectral type of a Koopman representation can be realized by a characteristic function.

The group $\operatorname{Aut}(X, \mathcal{B}, \mu)$ considered with the weak operator topology is closed in $U(L^2(X, \mathcal{B}, \mu))$, hence becoming a Polish group. (If we choose $\{A_i : i \ge 1\}$ a dense subset in $\mathcal{B}$ (considered modulo null sets), then the weak operator topology is metrizable with the metric

$$d(T_1, T_2) := \sum_{i \ge 1} \frac{1}{2^i} \left( \mu\left(T_1(A_i) \,\triangle\, T_2(A_i)\right) + \mu\left(T_1^{-1}(A_i) \,\triangle\, T_2^{-1}(A_i)\right) \right).)$$

One can then ask what are “typical” (largeness is understood as a residual subset) properties of an automorphism of $(X, \mathcal{B}, \mu)$. It is classical (Halmos) that typically an automorphism is weakly mixing and rigid and has a simple spectrum. Some other typical properties will be discussed later. While Halmos already noticed that in the weak operator topology mixing automorphisms form a meager set, in Tikhonov (2012), S. Tikhonov considers a special (Polish) topology on the set of mixing automorphisms. In fact, this topology was introduced by Alpern (1985), and Tikhonov disproves a conjecture by Alpern by showing that a generic mixing transformation has simple singular spectrum and is mixing of arbitrary order; moreover, all its powers are disjoint. In Tikhonov (2013), the topology is extended to mixing actions of infinite
countable groups $H$; it is given by the metric $d_m$, where for two $H$-actions $\mathcal{T}_i$ and $H \ni h \mapsto |h| \in \mathbb{N}$ so that $\sum_{h \in H} 1/2^{|h|} < +\infty$, we have

$$d_m(\mathcal{T}_1, \mathcal{T}_2) := \sum_{h \in H} \frac{1}{2^{|h|}}\, d(T_{1,h}, T_{2,h}) + \sup_{h \in H} \sum_{i,j \ge 1} \frac{1}{2^{i+j}} \left| \mu\left(T_{1,h} A_i \cap A_j\right) - \mu\left(T_{2,h} A_i \cap A_j\right) \right|.$$

Bashtanov (2013, 2016) proved that the conjugacy classes (of mixing automorphisms) are dense in this topology. Hence, properties like having trivial centralizer and no (non-trivial) factors are typical in this topology.

[…] is a $U_{T_\varphi}$-invariant (clearly, closed) subspace. Moreover, the map $f \otimes \chi \mapsto f$ settles a unitary isomorphism of $U_{T_\varphi}|_{L_\chi}$ with the operator $V_{\chi \circ \varphi, T}$. Therefore, the spectral analysis of such Koopman representations reduces to the spectral analysis of weighted operators $V_{\chi \circ \varphi, T}$ for all $\chi \in \widehat{G}$.

Maximal Spectral Type of Weighted Operators over Rotations

The spectral analysis of weighted operators $V_{\xi,T}$ is especially well developed in the case of rotations, i.e., when we additionally assume that $T$ is an ergodic rotation on a compact monothetic group $X$: $Tx = x + x_0$, where $x_0$ is a topologically cyclic element of $X$ (and $\mu$ will stand for Haar measure $\lambda_X$ of $X$). In this case, Helson’s analysis (Helson 1986) applies (see also Gromov (1991), Iwanik et al. (1993), Lemańczyk (1996), and Queffélec (1988)), leading to the following conclusions:

[…] In the same paper, it is noticed that if we drop the assumption on the derivative, then the maximal spectral type of $V_{\xi,T}$ is a Rajchman measure
(i.e., its Fourier transform vanishes at infinity). It manifold ℝ3/ℤ3 on which we define the nilrotation
is still an open question whether one can find x S((x, y, z) ℤ3) ¼ (a, b, 0) (x, y, z) ℤ3, where a,
absolutely continuous with non-zero degree and b, and 1 are rationally independent. It can be
such that Vx,T has singular spectrum. “Below” shown that S is isomorphic to the skew product
absolute continuity, topological properties of the defined on ½0, 1Þ2 by
cocycle seem to stop playing any role – in Iwanik
et al. (1993), a continuous, degree 1 cocycle x of T ’ : ðx, y, zÞ 7! x þ a, y þ b, z e2pi’ðx,yÞ
bounded variation is constructed such that x(x) ¼
(x)/(Tx) for a measurable : ½0, 1Þ ! (i.e., x ¼ ðx þ a, y þ b, z þ ayÞ ℤ3 ,
is a coboundary), and therefore Vx,T has purely
discrete spectrum (it is isomorphic to UT). For where ’(x, y) ¼ ay c(x + a, y + b) + c(x, y) with
other results about Lebesgue spectrum for Anzai c(x, y) ¼ x[y]. Since nil-cocycles can be consid-
skew products, see also Choe (1990), Frączek ered as a certain analog of affine cocycles for one-
(2000), and Iwanik (1997); in Frączek (2000), dimensional rotations, it would be nice to explain
ℤd-actions of rotations and so-called winding to what kind of perturbations the Lebesgue spec-
numbers instead of topological degree are consid- trum property is stable.
ered. For recent generalizations, see Cecchi and Tiedra de Aldecoa (2016) and Tiedra de Aldecoa (2015a).

When the cocycle is still smooth but its degree is zero, the situation drastically changes. Given an absolutely continuous function $f : [0, 1) \to \mathbb{R}$, M. Herman (1979), using the Denjoy-Koksma inequality (see, e.g., Kuipers and Niederreiter (1974)), showed that $f_0^{(q_n)} \to 0$ uniformly (here $f_0 = f - \int_0^1 f \, d\lambda_{[0,1)}$ and $(q_n)$ stands for the sequence of denominators of $\alpha$). It follows that $T_{e^{2\pi i f}}$ is rigid and hence has a singular spectrum. B. Fayad (2002a) shows that this result is no longer true if one-dimensional rotation is replaced by a multi-dimensional rotation (his counterexample is in the analytic class). See also Lemańczyk and Mauduit (1994) for the singularity of spectrum for functions f whose Fourier transform satisfies the $o(1/|n|)$ condition, and Iwanik et al. (1999), where it is shown that sufficiently small variation implies singularity of the spectrum.

A natural class of weighted operators arises when we consider Koopman operators of rotations on nil-manifolds. We only look at the particular example of such a rotation on a quotient of the Heisenberg group $(\mathbb{R}^3, \ast)$ (a general spectral theory of nil-actions was mainly developed by W. Parry (1970) – these actions have countable Lebesgue spectrum in the orthocomplement of the subspace of eigenfunctions), that is, take the nil-

Yet another interesting problem which is related to the spectral theory of extensions given by so-called Rokhlin cocycles (see section “Rokhlin Cocycles”) arises when, given $f : [0, 1) \to \mathbb{R}$, we want to describe spectrally the one-parameter set of weighted operators $W_c := V_{e^{2\pi i c f}, T}$; here T is a fixed irrational rotation by $\alpha$. As proved by quite sophisticated arguments in Iwanik et al. (1999), if we take $f(x) = x$, then for all noninteger $c \in \mathbb{R}$, the spectrum of $W_c$ is continuous and singular (see also Gromov (1991) and Medina (1994), where some special $\alpha$’s are considered). It has been open for some time if at all one can find $f : [0, 1) \to \mathbb{R}$ such that for each $c \neq 0$, the operator $W_c$ has a Lebesgue spectrum. The positive answer is given in Wysokinska (2004): For example, if $f(x) = x^{-(2+\varepsilon)}$ ($\varepsilon > 0$) and $\alpha$ has bounded partial quotients, then $W_c$ has a Lebesgue spectrum for all $c \neq 0$. All functions with such a property considered in Wysokinska (2004) are non-integrable. It would be interesting to find an integrable f with the above property.

We refer to Goodson (1999) and the references therein for further results especially for transformations of the form $(x, y) \mapsto (x + \alpha, 1_{[0,\beta)}(x) + y)$ on $[0, 1) \times \mathbb{Z}/2\mathbb{Z}$. Recall however that earlier Katok and Stepin (1967) used this kind of transformations to give a first counterexample to the Kolmogorov group property (see section “Glossary and Notation”) for the spectrum.
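The two-point extensions $(x, y) \mapsto (x + \alpha, 1_{[0,\beta)}(x) + y)$ on $[0, 1) \times \mathbb{Z}/2\mathbb{Z}$ mentioned above can be iterated numerically. The following is only a minimal sketch: the values ALPHA and BETA, the starting point, and all function names are illustrative assumptions, and floating-point arithmetic merely approximates an irrational rotation.

```python
import math


def two_point_extension(alpha, beta):
    """Return the map (x, y) -> (x + alpha mod 1, y + 1_[0,beta)(x) mod 2)."""
    def step(x, y):
        jump = 1 if 0.0 <= x < beta else 0   # value of the indicator cocycle at x
        return (x + alpha) % 1.0, (y + jump) % 2
    return step


def orbit(step, x0, y0, n):
    """List the points (x0, y0), step(x0, y0), ... up to the n-th iterate."""
    pts = [(x0, y0)]
    for _ in range(n):
        pts.append(step(*pts[-1]))
    return pts


ALPHA = math.sqrt(2) - 1   # assumed rotation number (hypothetical choice)
BETA = 0.5                 # assumed end point of the interval [0, beta)
step = two_point_extension(ALPHA, BETA)
pts = orbit(step, 0.1, 0, 1000)
```

The second coordinate after n steps equals the parity of the number of visits of the base orbit to $[0, \beta)$, which is the cocycle sum of the indicator function along the rotation.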
The Multiplicity Problem for Weighted Operators over Rotations

In case of perturbations of affine cocycles, this problem was already raised by Kushnirenko (1974). Some significant results were obtained by M. Guenais. Before we state her results, let us recall a useful criterion to find an upper bound for the multiplicity: If there exist $c > 0$ and a sequence $(F_n)_{n \ge 1}$ of cyclic subspaces of H such that for each $y \in H$, $\|y\| = 1$, we have $\liminf_{n \to \infty} \|\mathrm{proj}_{F_n} y\|^2 \ge c$, then $\operatorname{esssup}(M_U) \le 1/c$, which follows from a well-known lemma of Chacon (1970; Cornfeld et al. 1982; King 1988; Lemańczyk 1996). Using this and a technique which is close to the idea of local rank one (see Ferenczi (1984) and King (1988)), M. Guenais (1998) proved a series of results on multiplicity which we now list.

Theorem 5 Assume that $Tx = x + \alpha$ and let $\xi : [0, 1) \to \mathbb{T}$ be a cocycle.

(i) If $\xi(x) = e^{2\pi i c x}$, then $M_{V_{\xi,T}}$ is bounded by $|c| + 1$.
(ii) If $\xi$ is absolutely continuous and $\xi$ is of topological degree zero, then $V_{\xi,T}$ has a simple spectrum.
(iii) If $\xi$ is of bounded variation, then $M_{V_{\xi,T}} \le \max(2, 2\pi \mathrm{Var}(\xi)/3)$.

Remarks on the Banach Problem

We already mentioned in section “Introduction” the Banach problem in ergodic theory, which is simply the question whether there exists a Koopman representation for $\mathbb{A} = \mathbb{Z}$ with simple Lebesgue spectrum. In fact no example of a Koopman representation with Lebesgue spectrum of finite multiplicity is known. Helson and Parry (1978) asked for the validity of a still weaker version: Can one construct T such that $U_T$ has a Lebesgue component in its spectrum whose multiplicity is finite? Quite surprisingly in Helson and Parry (1978), they give a general construction yielding for each ergodic T a cocycle $\varphi : X \to \mathbb{Z}/2\mathbb{Z}$ such that the unitary operator $U_{T_\varphi}$ has a Lebesgue spectrum in the orthocomplement of functions depending only on the X-coordinate. Then Mathew and Nadkarni (1984) gave examples of cocycles over so-called dyadic adding machine for which the multiplicity of the Lebesgue component was equal to 2. In Lemańczyk (1988), this was generalized to so-called Toeplitz $\mathbb{Z}/2\mathbb{Z}$-extensions of adding machines: For each even number k, we can find a two-point extension of an adding machine so that the multiplicity of the Lebesgue component is k. Moreover, it was shown that Mathew and Nadkarni’s constructions from Mathew and Nadkarni (1984) in fact are close to systems arising from number theory (like the famous Rudin-Shapiro sequence, e.g., Queffélec (1988)), relating the result about multiplicity of the Lebesgue component to results by Kamae and Queffélec (1988). Independently of Lemańczyk (1988), Ageev (1988) showed that one can construct two-point extensions of the Chacon transformation realizing (by taking powers of the extension) each even number as the multiplicity of the Lebesgue component. Contrary to all previous examples, Ageev’s constructions are weakly mixing.

Still an open question remains whether for $\mathbb{A} = \mathbb{Z}$ one can find a Koopman representation with the Lebesgue component of multiplicity 1 (or even odd).

In Guenais (1999), M. Guenais studies the problem of Lebesgue spectrum in the classical case of Morse sequences (see Keane (1968) as well as Kwiatkowski (1981), where the problem of spectral classification in this class is studied). All dynamical systems arising from Morse sequences have simple spectra (Kwiatkowski 1981). It follows that if a Lebesgue component appears in a Morse dynamical system, it has multiplicity 1. Guenais (1999) using a Riesz product technique relates the Lebesgue spectrum problem with the still open problem (a variation of the classical Littlewood problem) of whether a construction of so-called $L^1$-ultraflat trigonometric polynomials with coefficients $\pm 1$ is possible (in the very recent preprint (Balister et al. 2019), the Littlewood problem of existence of uniformly flat trigonometric polynomials has been solved, but it is unclear whether it yields the ultraflatness condition). However, it is proved in Guenais (1999) that such a construction can be carried
out on some compact Abelian groups and it leads, for an Abelian countable torsion group $\mathbb{A}$, to a construction of an ergodic action of $\mathbb{A}$ with simple spectrum and a Haar component in its spectrum.

In Prikhodko (2013), A. Prikhodko published a construction of a rank one flow (see section “Rank-1 and Related Systems”) having Lebesgue spectrum. As rank one implies simple spectrum, the result yields a solution of the Banach problem for $\mathbb{A} = \mathbb{R}$. To carry out the construction, Prikhodko proved the following $L^1$-ultraflat version of the Littlewood conjecture: For all $0 < a < b$ and $n \ge 1$, there are polynomials $P_n(t) = \sum_{j=0}^{n-1} e^{2\pi i w_j^{(n)} t}$ for some real numbers $w_j^{(n)}$, so that $\|P_n\|_{L^1([a,b])} / \sqrt{n} \to 1$ when $n \to \infty$. It seems however that some of the arguments in the paper are written too briefly and no further clarifying presentation of methods/results/ideas from Prikhodko (2013) has appeared so far. It would also be extremely nice to explain the status of (El Abdalaoui 2015) by H. El Abdalaoui, first posted on arXiv in 2015, which states the solution of the original Banach problem (i.e., in the conservative infinite measure-preserving category).

Lifting Mixing Properties

We now give one more example of interactions between spectral theory and joinings (see section “Introduction”) that gives rise to a quick proof of the fact that r-fold mixing property of T ($r \ge 2$) lifts to a weakly mixing compact group extension $T_\varphi$ (the original proof of this fact is due to D. Rudolph (1985)). Indeed, to prove r-fold mixing for a mixing (= 2-mixing) transformation S (acting on $(Y, \mathcal{C}, \nu)$), one has to prove that each weak limit of off-diagonal self-joinings (given by powers of S, see de la Rue (2009)) of order r is simply the product measure $\nu^{\otimes r}$. We need also a Furstenberg’s lemma (Furstenberg 1981) about relative unique ergodicity (RUE) of compact group extensions $T_\varphi$: If $\mu \otimes \lambda_G$ is an ergodic measure for $T_\varphi$, then it is the only (ergodic) invariant measure for $T_\varphi$ whose projection on the first coordinate is $\mu$. Now, the result about lifting r-fold mixing to compact group extensions follows directly from the fact that whenever $T_\varphi$ is weakly mixing, $(\mu \otimes \lambda_G)^{\otimes r}$ is an ergodic measure (this approach was shown to the second author by A. del Junco). In particular, if T is mixing and $T_\varphi$ is weakly mixing, then for each $\chi \in \widehat{G} \setminus \{1\}$, the maximal spectral type of $V_{\chi \circ \varphi, T}$ is Rajchman.

See section “Rokhlin Cocycles” for a generalization of the lifting result to Rokhlin cocycle extensions.

The Multiplicity Function

In this section, only $\mathbb{A} = \mathbb{Z}$ is considered. For other groups, even for $\mathbb{R}$, much less is known. Clearly, given an automorphism T, by inducing its Koopman $\mathbb{Z}$-representation, we obtain a one-parameter group $(V_t)_{t \in \mathbb{R}}$ of unitary operators, which has precisely the same properties as the original one, except that we added the eigenvalues $n \in \mathbb{R}$. Moreover, classically, the induced Koopman representation is given by the suspension of T, i.e., by the special flow $T^f$ (see section “Glossary and Notation”), where $f = 1$, whence it is also Koopman but is never weakly mixing. See Danilenko and Lemańczyk (2013) and Danilenko and Solomko (2010), where some of the results below proved for $\mathbb{A} = \mathbb{Z}$ have been extended to (weakly mixing) flows. See also the case of so-called product $\mathbb{Z}^d$-actions (Filipowicz 1997) and (Solomko 2012) for general countable Abelian group actions.

Contrary to the case of maximal spectral type, it is rather commonly believed that there are no restrictions for the set of essential values of Koopman representations. In fact, if we drop the assumption that we consider the finite measure-preserving case and let ourselves consider $\mu$ $\sigma$-finite and infinite, Danilenko and Ryzhikov (2010, 2011) proved that all subsets of $\{1, 2, \ldots\} \cup \{\infty\}$ are Koopman realizable (in the weak mixing and mixing class, respectively).

Cocycle Approach

We will only concentrate on some results of the last four decades. In 1983, refining an earlier idea of Oseledets from 1960, E.A. Robinson (1983) proved that for each $n \ge 1$, there exists an ergodic
transformation whose maximal spectral multiplicity is n. Another important result was proved in Robinson Jr (1986) (see also Katok (2003a)), where it is shown that given a finite set $M \subset \mathbb{N}$ containing 1 and closed under the least common multiple, one can find (even a weakly mixing) T so that the set of essential values of the multiplicity function equals M. This result was then extended in Goodson et al. (1992) to infinite sets and finally in Kwiatkowski Jr and Lemańczyk (1995) (see also Ageev (2001)) to all subsets $M \subset \mathbb{N}$ containing 1. In fact, as we have already noticed in the previous section, the spectral theory for compact Abelian group extensions is reduced to a study of weighted operators and then to comparing maximal spectral types for such operators. This leads to sets of the form

$M(G, v, H) = \{ \#(\{\chi \circ v^i : i \in \mathbb{Z}\} \cap \mathrm{anih}(H)) : \chi \in \mathrm{anih}(H) \}$

proved that for a typical automorphism T, the set of the values of the multiplicity function for $U_{T^{\times k}}$ equals $\{k, k(k-1), \ldots, k!\}$ and then he just passes to “natural” factors for the Cartesian products by taking sets invariant under a fixed subgroup of permutations of coordinates. In particular, he obtains all sets of the form $\{2, 3, \ldots, n\}$ on $L^2_0$. He also shows that such sets of multiplicities are realizable in the category of mixing transformations. See also Ryzhikov (2009) for a realization of sets of the form $\{k, l, kl\}$, $\{k, l, m, kl, km, lm, klm\}$, etc.

Further progress was made in 2009–2012, when first Katok and Lemańczyk (2009) proved that each finite subset $M \subset \{1, 2, \ldots\} \cup \{\infty\}$ containing 2 can be realized as the set of essential values of an ergodic automorphism, which was then, by overcoming some algebraic difficulties, extended by Danilenko (2010, 2012) (in the mixing category) to all subsets containing 2.
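For finite groups the sets $M(G, v, H)$ introduced above can be computed by brute force. The sketch below is only an illustration under stated assumptions: it takes G to be the finite group $(\mathbb{Z}/m)^d$ standing in for a compact Abelian group, v a group automorphism given by a matrix over $\mathbb{Z}/m$, and H the subgroup generated by the listed vectors; characters are identified with vectors k via $\chi_k(x) = e^{2\pi i \langle k, x\rangle/m}$, so that $\chi_k \circ v = \chi_{v^{T} k}$. The function name and the examples are hypothetical.

```python
from itertools import product


def multiplicity_set(m, v, gens):
    """Brute-force M(G, v, H) for G = (Z/m)^d, v a d x d matrix mod m,
    and H the subgroup generated by the vectors in gens (illustrative)."""
    d = len(v)

    def dual_step(k):
        # chi_k o v = chi_{v^T k}: apply the transpose of v to the index k
        return tuple(sum(v[i][j] * k[i] for i in range(d)) % m for j in range(d))

    def in_annihilator(k):
        # chi_k is trivial on H iff <k, h> = 0 mod m for every generator h
        return all(sum(a * b for a, b in zip(k, h)) % m == 0 for h in gens)

    values = set()
    for k in product(range(m), repeat=d):
        if not in_annihilator(k):
            continue
        seen, cur = set(), k
        while cur not in seen:          # v invertible: forward orbit = full orbit
            seen.add(cur)
            cur = dual_step(cur)
        values.add(sum(1 for c in seen if in_annihilator(c)))
    return values


# Cyclic shift of coordinates on (Z/2)^3, with H generated by (1, 0, 0)
SHIFT = ((0, 0, 1), (1, 0, 0), (0, 1, 0))
example_shift = multiplicity_set(2, SHIFT, [(1, 0, 0)])
# Multiplication by 2 on Z/7 with trivial H (annihilator is the whole dual)
example_cyclic = multiplicity_set(7, ((2,),), [])
```

In the first example the orbit of a character under the shift is only partly contained in the annihilator of H, which is how values other than full orbit lengths arise.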
The latter shows that Proposition 6.4.4 (multiplicative nature of $M_T$ in the Gaussian case) claimed in the book by Katok and Thouvenot (2006) and also in Robinson (1986) is not true.

In the unpublished preprint (Ryzhikov 2014), Ryzhikov shows that all subsets containing 1 are Gaussian “realizable” (even in the mixing category).

Rokhlin’s Uniform Multiplicity Problem

The Rokhlin multiplicity problem (see the book by Anosov (2003)) was, given $n \ge 2$, to construct an ergodic transformation with uniform multiplicity n on $L^2_0$. A solution for $n = 2$ was independently given by Ageev (1999) and Ryzhikov (1999) (see also Anosov (2003) and Goodson (1999)), and in fact it consists in showing that for some T (actually, any T with simple spectrum for which $\frac{1}{2}(\mathrm{Id} + U_T)$ is in the weak operator closure of the powers of $U_T$ will do), the multiplicity of $T \times T$ is uniform and equal to 2 (see also section “Future Directions”). In Tikhonov (2011) and Ryzhikov and Troitskaya (2016), the case $n = 2$ is solved in case of mixing automorphisms (flows).

In Ageev (2005), Ageev proposed a new approach which consists in considering actions of “slightly non-Abelian” groups and showing that for a “typical” action of such a group, a fixed “direction” automorphism has a uniform multiplicity. Shortly after publication of Ageev (2005), Danilenko (2006), following Ageev’s approach, considerably simplified the original proof. We will present Danilenko’s arguments.

Fix $n \ge 1$. Denote $e_i = (0, \ldots, 1, \ldots, 0) \in \mathbb{Z}^n$, $i = 1, \ldots, n$. We define an automorphism L of $\mathbb{Z}^n$ setting $L(e_i) = e_{i+1}$, $i = 1, \ldots, n-1$, and $L(e_n) = e_1$. Using L we define a semi-direct product $G := \mathbb{Z}^n \rtimes \mathbb{Z}$ defining multiplication as $(u, k) \cdot (w, l) = (u + L^k w, k + l)$. Put $\tilde{e}_0 = (0, 1)$, $\tilde{e}_i = (e_i, 0)$, $i = 1, \ldots, n$ (and $L\tilde{e}_i = (Le_i, 0)$). Moreover, denote $\tilde{e}_{n+1} = \tilde{e}_0^{\,n} = (0, n)$. Notice that $\tilde{e}_0 \cdot \tilde{e}_i \cdot \tilde{e}_0^{-1} = L\tilde{e}_i$ for $i = 1, \ldots, n$ ($L(\tilde{e}_{n+1}) = \tilde{e}_{n+1}$).

Theorem 6 (Ageev, Danilenko) For every unitary representation U of G in a separable Hilbert space H, for which $U_{\tilde{e}_1 - L^r \tilde{e}_1}$ has no non-trivial fixed points for $1 \le r < n$, the essential values of the multiplicity function for $U_{\tilde{e}_{n+1}}$ are contained in the set of multiples of n. If, in addition, $U_{\tilde{e}_0}$ has a simple spectrum, then $U_{\tilde{e}_{n+1}}$ has uniform multiplicity n.

It then takes some work to show that the assumption of the second part of the theorem is satisfied for a typical action of the group G. Using a special (C, F)-construction with all the cut-and-stack parameters explicit, Danilenko (2006) was also able to show that each set of the form $k \cdot M$, where $k \ge 1$ and M is an arbitrary subset of natural numbers containing 1, is realizable as the set of essential values of a Koopman representation.

Tikhonov (2011) proved the existence of a mixing automorphism of uniform multiplicity n on $L^2_0$ for all $n \ge 1$.

Rokhlin Cocycles

We consider now a certain class of extensions which should be viewed as a generalization of the concept of compact group extensions. We will focus on $\mathbb{Z}$-actions only.

Assume that T is an ergodic automorphism of $(X, \mathcal{B}, \mu)$. Let G be an l.c.s.c. Abelian group. Assume that this group acts on $(Y, \mathcal{C}, \nu)$, that is, we have a G-action $S = (S_g)_{g \in G}$ on $(Y, \mathcal{C}, \nu)$. Let $\varphi : X \to G$ be a cocycle. We then define an automorphism $T_{\varphi,S}$ of the space $(X \times Y, \mathcal{B} \otimes \mathcal{C}, \mu \otimes \nu)$ by

$T_{\varphi,S}(x, y) = (Tx, S_{\varphi(x)}(y)).$

Such an extension is called a Rokhlin cocycle extension (the map $x \mapsto S_{\varphi(x)}$ is called a Rokhlin cocycle). Such an operation generalizes the case of compact group extensions; indeed, when G is compact, the action of G on itself by rotations preserves Haar measure. (It is quite surprising that when only we admit G non-Abelian, then, as shown in Danilenko and Lemańczyk (2005),
each ergodic extension of T has a form of a Rokhlin cocycle extension.) Ergodic and spectral properties of such extensions are examined in several papers: Glasner (1994), Glasner and Weiss (1989), Lemańczyk and Lesigne (2001), Lemańczyk et al. (2003), Lemańczyk and Parreau (2003, 2012), Robinson Jr (1992), and Rudolph (1986). Since in these papers rather joining aspects are studied (among other things in Lemańczyk and Lesigne (2001), Furstenberg’s RUE lemma is generalized to this new context), we will mention here only few results, mainly spectral, following Lemańczyk and Lesigne (2001) and Lemańczyk and Parreau (2012). We will constantly assume that G is non-compact. As $\varphi : X \to G$ is then a cocycle with values in a non-compact group, the theory of such cocycles is much more complicated (see, e.g., Schmidt (1977)), and in fact the theory of Rokhlin cocycle extensions leads to interesting interactions between classical ergodic theory, the theory of cocycles, and the theory of non-singular actions arising from cocycles taking values in non-compact groups – especially, the Mackey action associated with $\varphi$ plays a crucial role here (see the problem of invariant measures for $T_{\varphi,S}$ in Lemańczyk and Parreau (2003) and Danilenko and Lemańczyk (2005); see also monographs by Aaronson (1979, 1997), Katok (2001, 2003a), and Schmidt (1977)). Especially, two Borel subgroups of $\widehat{G}$ are important here:

$\Sigma_\varphi = \{\chi \in \widehat{G} : \chi \circ \varphi = c \cdot \xi / \xi \circ T \text{ for a measurable } \xi : X \to \mathbb{T} \text{ and } c \in \mathbb{T}\}$

and its subgroup $\Lambda_\varphi$ given by $c = 1$. $\Lambda_\varphi$ turns out to be the group of $L^\infty$-eigenvalues of the Mackey action (of G) associated with the cocycle $\varphi$. This action is the quotient action of the natural action of G (by translations on the second coordinate) on the space of ergodic components of the skew product $T_\varphi$ – the Mackey action is (in general) not measure-preserving; it is however non-singular. We refer the reader to Aaronson and Nadkarni (1987), Host et al. (1991), and Nadkarni (1998) for other properties of those subgroups.

Theorem 7 (Lemańczyk and Parreau (2003, 2012))

(i) $\sigma_{T_{\varphi,S}}|_{L^2(X \times Y, \mu \otimes \nu) \ominus L^2(X, \mu)} = \int_{\widehat{G}} \sigma_{V_{\chi \circ \varphi, T}} \, d\sigma_S(\chi)$.
(ii) $T_{\varphi,S}$ is ergodic if and only if T is ergodic and $\sigma_S(\Lambda_\varphi) = 0$.
(iii) $T_{\varphi,S}$ is weakly mixing if and only if T is weakly mixing and S has no eigenvalues in $\Sigma_\varphi$.
(iv) If T is mixing, S is mildly mixing, and $\varphi$ is recurrent and not cohomologous to a cocycle with values in a compact subgroup of G, then $T_{\varphi,S}$ remains mixing.
(v) If T is r-fold mixing, $\varphi$ is recurrent, and $T_{\varphi,S}$ is mildly mixing, then $T_{\varphi,S}$ is also r-fold mixing.
(vi) If T and R are disjoint, the cocycle $\varphi$ is ergodic, and S is mildly mixing, then $T_{\varphi,S}$ remains disjoint from R.

Let us recall (Furstenberg and Weiss 1978; Schmidt and Walters 1982) that an $\mathbb{A}$-action $S = (S_a)_{a \in \mathbb{A}}$ is mildly mixing (see section “Glossary and Notation”) if and only if the $\mathbb{A}$-action $(S_a \times \tau_a)_{a \in \mathbb{A}}$ remains ergodic for every properly ergodic non-singular $\mathbb{A}$-action $\tau = (\tau_a)_{a \in \mathbb{A}}$.

Coming back to Smorodinsky-Thouvenot’s result about factors of ergodic self-joinings of a Bernoulli automorphism, we would like to emphasize here that the disjointness result (vi) above was used in Lemańczyk and Parreau (2003) to give an example of an automorphism which is disjoint from all weakly mixing transformations but which has an ergodic self-joining whose associated automorphism has a non-trivial weakly mixing factor. In a sense, this is opposed to Smorodinsky-Thouvenot’s result as here from self-joinings, we produced a “more complicated” system (namely, the weakly mixing factor) than the original system.

It would be interesting to develop the theory of spectral multiplicity for Rokhlin cocycle extensions as it was done in the case of compact group extensions. However, a difficulty is that in the compact group extension case, we deal with a countable direct sum of representations of the form $V_{\chi \circ \varphi, T}$, while in the non-compact case, we have to consider an integral of such representations.
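The definition of a Rokhlin cocycle extension $T_{\varphi,S}(x, y) = (Tx, S_{\varphi(x)}(y))$ used in this section can be sketched in a toy case. Everything concrete below is an assumption made for illustration: G is taken to be $\mathbb{Z}$ (non-compact), T and S are circle rotations, the cocycle takes the values $\pm 1$, and rational rotation numbers replace irrational ones so that the arithmetic is exact (so the toy base map is not ergodic). The sketch checks the standard iteration formula $T_{\varphi,S}^n(x, y) = (T^n x, S_{\varphi^{(n)}(x)}(y))$ with $\varphi^{(n)}(x) = \varphi(x) + \varphi(Tx) + \cdots + \varphi(T^{n-1}x)$.

```python
from fractions import Fraction

ALPHA = Fraction(3, 10)   # assumed base rotation (rational stand-in)
BETA = Fraction(2, 7)     # assumed generator of the Z-action S (stand-in)


def T(x):
    """Base rotation x -> x + ALPHA mod 1."""
    return (x + ALPHA) % 1


def phi(x):
    """Toy Z-valued cocycle: +1 on [0, 1/2), -1 on [1/2, 1)."""
    return 1 if x < Fraction(1, 2) else -1


def S(g, y):
    """Z-action on the circle: S_g(y) = y + g * BETA mod 1."""
    return (y + g * BETA) % 1


def rokhlin_step(x, y):
    """One application of T_{phi,S}: (x, y) -> (T x, S_{phi(x)}(y))."""
    return T(x), S(phi(x), y)


def cocycle_sum(x, n):
    """phi^{(n)}(x) = phi(x) + phi(Tx) + ... + phi(T^{n-1} x)."""
    total = 0
    for _ in range(n):
        total += phi(x)
        x = T(x)
    return total


X0, Y0, N = Fraction(1, 10), Fraction(0), 25
x, y = X0, Y0
for _ in range(N):
    x, y = rokhlin_step(x, y)
```

After N steps the fibre coordinate depends on the base orbit only through the cocycle sum, which is why cohomological properties of $\varphi$ govern the behaviour of the extension.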
Rank-1 and Related Systems

Although properties like mixing, weak (and mild) mixing, as well as ergodicity are clearly spectral properties, they have “good” measure-theoretic formulations (expressed by a certain behavior on sets). Simple spectrum property is another example of a spectral property, and it was a popular question in the 1980s whether simple spectrum property of a Koopman representation can be expressed in a more “measure-theoretic” way. We now recall the rank-1 concept, which can be seen as a notion close to Katok’s and Stepin’s theory of cyclic approximation (Katok and Stepin 1967) (see also Cornfeld et al. (1982)).

Assume that T is an automorphism of a standard probability Borel space $(X, \mathcal{B}, \mu)$. T is said to have the rank one property if there exists an increasing sequence of Rokhlin towers tending to the partition into points (a Rokhlin tower is a family $\{F_n, TF_n, \ldots, T^{h_n - 1}F_n\}$ of pairwise disjoint sets, while “tending to the partition into points” means that we can approximate every set in $\mathcal{B}$ by unions of levels of towers in the sequence). Hence, basically, rank one automorphism is given by two sequences of parameters: $r_n$, $n \ge 1$, which is the number of subcolumns on which we divide the nth tower given by $F_n$, and $s_{n,j}$, $n \ge 1$, $j = 0, 1, \ldots, r_n - 1$, the sequence of spacers put over consecutive subcolumns. A “typical” automorphism of a standard probability Borel space has the rank one property. The “typicality” of rank one is still true in the Alpern-Tikhonov topology we mentioned in section “Maximal Spectral Type of a Koopman Representation: Alexeyev’s Theorem” by Bashtanov (2013).

Baxter (1971) showed that the maximal spectral type of such a T is realized by a characteristic function. Since the cyclic space generated by the characteristic function of the base contains characteristic functions of all levels of the tower, by the definition of rank one, the increasing sequence of cyclic spaces tends to the whole $L^2$-space; therefore, rank one property implies simplicity of the spectrum for the Koopman representation. It was a question for some time whether rank-1 is just a characterization of simplicity of the spectrum, which was disproved by del Junco (1977). We refer the reader to Ferenczi (1997) as a good source for basic properties of rank-1 transformations.

Similar to the rank one property, one can define finite rank automorphisms (simply by requiring that an approximation is given by a sequence of a fixed number of towers) – see, e.g., Ornstein et al. (1982), or even a more general property, namely, the local rank one property, can be defined, just by requiring that the approximating sequence of single towers fills up a fixed fraction of the space (see Ferenczi (1984) and King (1988)). Local rank one property implies finite multiplicity (King 1988), and the maximal spectral multiplicity is always bounded by rank. Mentzen (1988) showed that for each $n \ge 1$, one can construct an automorphism with simple spectrum and having rank n, and later Kwiatkowski and Lacroix (1997) showed that for each pair (m, r) with $m \le r$, one can construct a rank r automorphism whose maximal spectral multiplicity is m. In Lemańczyk and Sikorski (1987), there is an example of a simple spectrum automorphism which is not of local rank one. Ferenczi (1985) deals with the notion of funny rank one (approximating towers are Rokhlin towers with “holes”) – the concept that has been introduced by Thouvenot. Funny rank one also implies simplicity of the spectrum. An example is given in Ferenczi (1985) which is even not loosely Bernoulli (see section “Inducing and Spectral Theory”; we recall that local rank one property implies loose Bernoullicity (Ferenczi 1984)).

The notion of AT (see section “Glossary and Notation”) has been introduced by Connes and Woods (1985). They proved that AT property implies zero entropy. They also proved that funny rank one automorphisms are AT. In Dooley and Quas (2005), it is proved that the system induced by the classical Morse-Thue system is AT (it is an open question by S. Ferenczi whether this system has funny rank one property). A question by Dooley and Quas is whether AT implies funny rank one property. AT property implies “simplicity of the spectrum in $L^1$” which we already considered in section “Introduction” (a “generic” proof of this fact is due to J.-P. Thouvenot).

A persistent question was formulated in the 1980s whether rank one itself is a spectral
property. In Ferenczi and Lemańczyk (1991), the authors maintained that this is not the case, based on an unpublished preprint of the first named author of Ferenczi and Lemańczyk (1991) in which there was a construction of a Gaussian-Kronecker automorphism (see section “Spectral Theory of Dynamical Systems of Probabilistic Origin”) having rank-1 property. This latter construction turned out to be false. In fact, de la Rue (1998a) proved that no Gaussian automorphism can be of local rank one. Therefore, the question whether rank one is a spectral property remains one of the interesting open questions in that theory. Downarowicz and Kwiatkowski (2000) proved that rank-1 is a spectral property in the class of systems generated by generalized Morse sequences.

One of the most beautiful theorems about rank-1 automorphisms is the following result of J. King (1986) (for a different proof, see Ryzhikov (1992)).

Theorem 8 (WCT) If T is of rank one, then for each element S of the centralizer C(T) of T, there exists a sequence $(n_k)$ such that $U_T^{n_k} \to U_S$ strongly.

A conjecture of J. King is that in fact for rank-1 automorphisms, each indecomposable Markov operator $J = J_\rho$ ($\rho \in J_2^e(T)$) is a weak limit of powers of $U_T$ (see King (2001) and also Ryzhikov (1992)). To which extent the WCT remains true for actions of other groups is not clear. In Zeitz (1993), the WCT is proved in case of rank one flows; however, the main argument seems to be based on the fact that a rank one flow has a non-zero time automorphism $T_{t_0}$ which is of rank one, which is not true. After the proof of the WCT by Ryzhikov in Ryzhikov (1992), there is a remark that the rank one flow version of the theorem can be proved by a word-for-word repetition of the arguments. He also proves that if the flow $(T_t)_{t \in \mathbb{R}}$ is mixing, then $T_1$ does not have finite rank. On the other hand, for $\mathbb{A} = \mathbb{Z}^2$, Downarowicz and Kwiatkowski (2002) gave a counterexample to the WCT. But see also Janvresse et al. (2012).

Even though it looks as if rank one construction is not complicated, mixing in this class is possible; historically the first mixing constructions were given by D. Ornstein (1970), using probability type arguments for a choice of spacers. Once mixing was shown, the question arose whether absolutely continuous spectrum is also possible, as this would give automatically the positive answer to the Banach problem. However, Bourgain (Bourgain 1993), relating spectral measures of rank one automorphisms with some classical constructions of Riesz product measures, proved that a certain subclass of Ornstein’s class consists of automorphisms with singular spectrum (see also El Abdalaoui (2007) and El Abdalaoui et al. (2006)). Since in Ornstein’s class spacers are chosen in a certain “non-constructive” way, quite a lot of attention was devoted to the rank one automorphism defined by cutting a tower at the n-th step into $r_n = n$ subcolumns of equal “width” and placing i spacers over the i-th subcolumn. The mixing property conjectured by M. Smorodinsky was proved by Adams (1998) (in fact Adams proved a general result on mixing of a class of staircase transformations). Spectral properties of rank-1 transformations are also studied in Klemes and Reinhold (1997), where the authors proved that whenever $\sum_{n=1}^{\infty} r_n^{-2} = +\infty$, then the spectrum is automatically singular (see also the more recent Creutz and Silva (2010)). H. El Abdalaoui (2007) gives a criterion for singularity of the spectrum of a rank one transformation; his proof uses a central limit theorem. It seems that still the question whether rank one implies singularity of the spectrum remains the most important question of this theory.

We have already seen in section “Spectral Theory of Weighted Operators” that for a special class of rank one systems, namely, those with discrete spectra (del Junco 1976), we have a nice theory for weighted operators. It would be extremely interesting to find a rank one automorphism with continuous spectrum for which a substitute of Helson’s analysis exists.

B. Fayad (2005) constructs a rank one differentiable flow, as a special flow over a two-dimensional rotation. In Fayad (2006), he gives
new constructions of smooth flows with singular spectra which are mixing (with a new criterion for a Rajchman measure to be singular). In Fayad (2001a), a certain smooth change of time for an irrational flow on the 3-torus is given, so that the corresponding flow is partially mixing and has the local rank one property.

Motivated by Sarnak’s conjecture on Möbius disjointness (see Kułaga-Przymus and Lemańczyk (2019)), a certain recent activity was to study spectral disjointness of powers for rank one automorphisms. Let $\sigma$ be a probability measure on the additive circle [0, 1). Given a real number $a > 0$, we denote by $\sigma_a$ the image of $\sigma$ under the map $x \mapsto ax \bmod 1$. If $r \ge 1$ is an integer, then by $\sigma^{(r)}$, we will denote the measure which is obtained first by taking the image of $\sigma$ under the map $x \mapsto \frac{1}{r}x$, i.e., the measure $\sigma_{1/r}$, and then repeating this new measure periodically in intervals $[\frac{j}{r}, \frac{j+1}{r})$. The following holds (e.g., in the unpublished notes by H. El Abdalaoui, J. Kułaga-Przymus, M. Lemańczyk, and T. de la Rue):

if $(r, s) = 1$ then $\sigma_r \perp \sigma_s$ if and only if $\sigma^{(s)} \perp \sigma^{(r)}$.   (2)

In Bourgain (2013), Bourgain used Riesz product technique to show that for the class of so-called rank one automorphisms with bounded parameters (both $(r_n)$ and $(s_{n,j})$ are bounded and no spacer over the last column), we have $\sigma^{(r)} \perp \sigma^{(s)}$ for $r \neq s$ prime. In view of (2), it follows that different prime powers are spectrally disjoint. In El Abdalaoui et al. (2014), a much larger class of rank one automorphisms is considered. No boundedness assumption on $(r_n)$ is made, but a certain bounded recurrence is required on the sequence of spacers. Spectral disjointness of different powers (for the continuous part of the maximal spectral type) is derived from the existence, in the weak closure of powers, of sufficiently many analytic functions of the Koopman operator $U_T$.

For a spectral disjointness of the continuous part of the maximal spectral type for powers of automorphisms like the substitutional system given by the Thue-Morse sequence and related (rank two systems), see El Abdalaoui et al. (2016). Weak closure of powers for Chacon automorphism is described in Janvresse et al. (2015).

Spectral Theory of Dynamical Systems of Probabilistic Origin

Let us just recall that when $(Y_n)_{n=-\infty}^{\infty}$ is a stationary process, then its distribution $\mu$ on $\mathbb{R}^{\mathbb{Z}}$ is invariant under the shift S on $\mathbb{R}^{\mathbb{Z}}$: $S((x_n)_{n \in \mathbb{Z}}) = (y_n)_{n \in \mathbb{Z}}$, where $y_n = x_{n+1}$, $n \in \mathbb{Z}$. In this way, we obtain an automorphism S defined on $(\mathbb{R}^{\mathbb{Z}}, \mathcal{B}(\mathbb{R}^{\mathbb{Z}}), \mu)$. For each automorphism T, we can find $f : X \to \mathbb{R}$ measurable such that the smallest $\sigma$-algebra making the stationary process $(f \circ T^n)_{n \in \mathbb{Z}}$ measurable is equal to $\mathcal{B}$; therefore, for the purpose of this entry, by a system of probabilistic origin, we will mean $(S, \mu)$ obtained from a stationary infinitely divisible process (see, e.g., Maruyama (1970) and Sato (1999)). In particular, the theory of Gaussian dynamical systems is indeed a classical part of ergodic theory (e.g., Newton 1966; Newton and Parry 1966; Vershik 1962a, b). If $(X_n)_{n \in \mathbb{Z}}$ is a stationary real centered Gaussian process and $\sigma$ denotes the spectral measure of the process, i.e., $\widehat{\sigma}(n) = E(X_n X_0)$, $n \in \mathbb{Z}$, then by $S = S_\sigma$, we denote the corresponding Gaussian system on the shift space (recall also that for each symmetric measure $\sigma$ on $\mathbb{T}$, there is exactly one stationary real centered Gaussian process whose spectral measure is $\sigma$). Notice that if $\sigma$ has an atom, then in the cyclic space generated by $X_0$, there exists an eigenfunction Y for $S_\sigma$ – if now $S_\sigma$ were ergodic, $|Y|$ would be a constant function which is not possible by the nature of elements in $\mathbb{Z}(X_0)$. In what follows, we assume that $\sigma$ is continuous.

It follows that $U_{S_\sigma}$ restricted to $\mathbb{Z}(X_0)$ is spectrally the same as $V = V_\sigma$ acting on $L^2(\mathbb{T}, \sigma)$, and we obtain that $(U_{S_\sigma}, L^2(\mathbb{R}^{\mathbb{Z}}, \mu_\sigma))$ can be represented as the symmetric Fock space built over $H = L^2(\mathbb{T}, \sigma)$ and $U_{S_\sigma} = F(V)$ – see section “Glossary and Notation” ($H^{\odot n}$ is called the n-th chaos). In other words, the spectral theory of Gaussian dynamical systems is reduced to the spectral theory of special tensor products unitary
operators. Classical results (see Cornfeld et al. (1982)) which can be obtained from this point of view are, for example, the following:

(A) Ergodicity implies weak mixing.
(B) The multiplicity function is either 1 or is unbounded.
(C) The maximal spectral type of $U_{S_\sigma}$ is equal to $\exp(\sigma)$; hence, Gaussian systems enjoy the Kolmogorov group property.

However, we can also look at a Gaussian system in a different way, simply by noticing that the variables $e^{2\pi i f}$ ($f$ is a real variable), where $f \in \mathbb{Z}(X_0)$, generate $L^2(\mathbb{R}^{\mathbb{Z}}, \mu_\sigma)$. Now, calculating the spectral measure of $e^{2\pi i f}$ is not difficult and we obtain easily (C). Moreover, integrals of type

$$\int e^{2\pi i f_0}\, e^{2\pi i f_1 \circ S_\sigma^{n}}\, e^{2\pi i f_2 \circ S_\sigma^{n+m}}\, d\mu_\sigma$$

can also be calculated; whence, in particular, we easily obtain Leonov’s theorem on the multiple mixing property of Gaussian systems (Leonov 1960).

One of the most beautiful parts of the theory of Gaussian systems concerns ergodic properties of $S_\sigma$ when $\sigma$ is concentrated on a thin Borel set. Recall that a closed subset $K \subset \mathbb{T}$ is said to be a Kronecker set if each $f \in C(K)$ is a uniform limit of characters (restricted to $K$). Each Kronecker set has no rational relations. Gaussian-Kronecker automorphisms are, by definition, those Gaussian systems for which the measure $\sigma$ (always assumed to be continuous) is concentrated on $K \cup \overline{K}$, $K$ a Kronecker set. The following theorem has been proved in Foiaş and Stratila (1968) (see also Cornfeld et al. (1982)).

Theorem 9 (Foiaş-Stratila Theorem) If $T$ is an ergodic automorphism and $f$ is a real-valued element of $L^2_0$ such that the spectral measure $\sigma_f$ is concentrated on $K \cup \overline{K}$, where $K$ is a Kronecker set, then the process $(f \circ T^n)_{n \in \mathbb{Z}}$ is Gaussian.

This theorem is indeed striking as it gives examples of weakly mixing automorphisms which are spectrally determined (like rotations). A relative version of the Foiaş-Stratila theorem has been proved in Lemańczyk and Lesigne (2001).

The Foiaş-Stratila theorem implies that whenever a spectral measure $\sigma$ is Kronecker, it has no realization of the form $\sigma_f$ with $f$ bounded. We will see however in section “Future Directions” that for some automorphisms $T$ (having the SCS property), the maximal spectral type $\sigma_T$ has the property that $S_{\sigma_T}$ has a simple spectrum.

Gaussian-Kronecker automorphisms are examples of automorphisms with simple spectra. In fact, whenever $\sigma$ is concentrated on a set without rational relations, then $S_\sigma$ has a simple spectrum (see Cornfeld et al. (1982)). Examples of mixing automorphisms with simple spectra are known (Newton 1966); however, it is still unknown (Thouvenot’s question) whether the Foiaş-Stratila property may hold in the mixing class. F. Parreau (2000), using independent Helson sets, gave an example of a mildly mixing Gaussian system with the Foiaş-Stratila property.

Joining theory of a class of Gaussian systems, called GAG, is developed in Lemańczyk et al. (2000). A Gaussian automorphism $S_\sigma$ with the Gaussian space $H \subset L^2_0(\mathbb{R}^{\mathbb{Z}}, \mu_\sigma)$ is called a GAG if for each ergodic self-joining $\rho \in J_2^e(S_\sigma)$ and arbitrary $f, g \in H$ the variable

$$(\mathbb{R}^{\mathbb{Z}} \times \mathbb{R}^{\mathbb{Z}}, \rho) \ni (x, y) \mapsto f(x) + g(y)$$

is Gaussian. For GAG systems one can describe the centralizer and factors; they turn out to be objects close to the probability structure of the system. One of the crucial observations in Lemańczyk et al. (2000) was that all Gaussian systems with simple spectrum are GAG.

It is conjectured (J.P. Thouvenot) that in the class of zero entropy Gaussian systems, the PID property holds true.

For the spectral theory of classical factors of a Gaussian system, see Lemańczyk and de Sam Lazaro (1997); also spectrally they share basic spectral properties of Gaussian systems. Recall that historically one of the classical factors, namely, the $\sigma$-algebra of sets invariant for the map
We also mention here another problem which will be taken up in section “Special Flows, Flows on Surfaces, and Interval Exchange Transformations”: Is it true that flows of probabilistic origin are disjoint from smooth flows on surfaces?

Yet one more (joining) property seems to be characteristic in the class of systems of probabilistic origin, namely, they satisfy the so-called ELF property (see Derriennic et al. (2008) and de la Rue’s article (de la Rue (2009))). Vershik asked whether the ELF property is spectral; however, the example of a cocycle from Wysokińska (2004) together with Theorem 7 (i) yields a certain Rokhlin extension of a rotation which is ELF and has countable Lebesgue spectrum in the orthocomplement of the eigenfunctions (see Wysokińska (2007)); on the other hand, any affine extension of that rotation is spectrally the same, while it cannot have the ELF property.

Prikhodko and Thouvenot (private communication) have constructed weakly mixing and non-mixing rank one automorphisms which enjoy the ELF property.

Inducing and Spectral Theory

Assume that $T$ is an ergodic automorphism of a standard probability Borel space $(X, \mathcal{B}, \mu)$. Whether “all” dynamics can be obtained by inducing (see section “Glossary and Notation”) from one fixed automorphism was a natural question from the very beginning of ergodic theory. Because of Abramov’s formula for entropy $h(T_A) = h(T)/\mu(A)$, it is clear that positive entropy transformations cannot be obtained from inducing on a zero entropy automorphism. However, here we are interested in spectral questions, and thus we ask how many spectral types we obtain when we induce. It is proved in Friedman and Ornstein (1973) that the family of $A \in \mathcal{B}$ for which $T_A$ is mixing is dense for the (pseudo) metric $d(A_1, A_2) = \mu(A_1 \triangle A_2)$. De la Rue (1998b) proves the following result: For each ergodic transformation $T$ of a standard probability space $(X, \mathcal{B}, \mu)$, the set of $A \in \mathcal{B}$ for which the maximal spectral type of $U_{T_A}$ is Lebesgue is dense in $\mathcal{B}$. The multiplicity function is not determined in that paper. Recall (without giving a formal definition, see Ornstein et al. (1982)) that a zero entropy automorphism is loosely Bernoulli (LB for short) if and only if it can be induced from an irrational rotation (see also Feldman (1976) and Katok (1977)). The LB theory shows that not all dynamical systems can be obtained by inducing from an ergodic rotation. However, an open question remained whether LB systems exhaust spectrally all Koopman representations. An interesting question of M. Ratner (Ratner 1978) is whether from every ergodic automorphism $T$, one can induce an automorphism which has countable Lebesgue spectrum (Ratner in Ratner (1978) shows that this can be done if $T$ is an irrational rotation).

In a deep paper (de la Rue 1996), de la Rue studies the LB property in the class of Gaussian-Kronecker automorphisms; in particular, he constructs $S$ which is not LB. Suppose now that $T$ is LB and for some $A \in \mathcal{B}$, $U_{T_A}$ is isomorphic to $U_S$. Then by the Foiaş-Stratila theorem, $T_A$ is isomorphic to $S$, and hence $T_A$ is not LB. However, an induced automorphism from an LB automorphism is LB, a contradiction.

Another fruitful source of non-LB systems comes from taking Cartesian products of some natural LB systems. In Ornstein et al. (1982), it is proved that there exists a rank one (and hence LB) system whose Cartesian square is not LB. Moreover, in Ratner (1979), it was shown that the square of the horocycle flow is not LB (the horocycle flow itself being LB (Ratner 1978)). Recently, in Kanigowski and de la Rue (2019), the authors showed that there are staircase rank one transformations whose Cartesian product is not LB.

Rigid Sequences

Recall (see section “Glossary and Notation”) that an automorphism $T$ of a standard probability Borel space $(X, \mathcal{B}, \mu)$ is called rigid if there exists a strictly increasing sequence $q_n \to \infty$ such that $\mu(T^{q_n} A \triangle A) \to 0$ as $n \to \infty$, for each $A \in \mathcal{B}$. (In fact, to have a global rigidity sequence, as
observed by Thouvenot, we only need to know that for each $A \in \mathcal{B}$ there is a sequence $(q_{n,A})$ so that $\mu(T^{q_{n,A}} A \triangle A) \to 0$.) Equivalently, for each $f \in L^2(X, \mathcal{B}, \mu)$, $U_T^{q_n} f \to f$ in $L^2(X, \mathcal{B}, \mu)$ (it is not hard to see that the latter is equivalent to $\int_0^1 e^{2\pi i q_n x}\, d\sigma_f(x) \to 1$ for any $\sigma_f$ representing the maximal spectral type of $T$, $\|f\| = 1$). We call $(q_n)$ a rigidity sequence of $T$. Rigidity is one of the fundamental (purely spectral) phenomena in ergodic theory. Assuming that $T$ is aperiodic, it is not hard to see that for any rigidity sequence $(q_n)$, we must have $q_{n+1} - q_n \to \infty$. A typical automorphism is rigid and weakly mixing, but since weak mixing implies $U_T^n \to 0$ weakly on $L^2_0(X, \mathcal{B}, \mu)$ along a sequence of $n$ of full density, there is not much “room” left for rigidity sequences. So positive density sequences cannot be rigid, but beyond that, in the class of zero density sequences, there can be other, for example algebraic in nature, obstructions for rigidity. For example, as noticed in Bergelson et al. (2014) and Eisner and Grivaux (2011), if $P \in \mathbb{Q}[x]$ is any non-zero polynomial taking integer values on $\mathbb{Z}$, then the sequence $(P(n))$ cannot be rigid for any ergodic automorphism. It is also easy to see that $(2^n)$ is a rigidity sequence while $(2^n + 1)$ is not.

A systematic study of sequences which can be rigidity sequences was originated in Bergelson et al. (2014) and Eisner and Grivaux (2011). Both papers use a harmonic analysis approach to construct rigid sequences (via the standard Gaussian functor). Other constructions are also presented in Bergelson et al. (2014): rank one constructions, weighted operators, and Poisson suspensions, while Eisner and Grivaux (2011) rather concentrates on so-called linear dynamical systems and studies rigidity for weakly mixing automorphisms. One of the results in Bergelson et al. (2014) and Eisner and Grivaux (2011) states that if either $q_{n+1}/q_n \to \infty$ or $q_{n+1}/q_n$ is an integer, then $(q_n)$ is a rigidity sequence (for a weakly mixing automorphism). On the other hand, Eisner and Grivaux in Eisner and Grivaux (2011) give an example of a rigid sequence $(q_n)$, for a weakly mixing automorphism, such that $q_{n+1}/q_n \to 1$. As a matter of fact, both Bergelson et al. (2014) and Eisner and Grivaux (2011) deal with the case of denominators $(q_n)$ of an irrational $\alpha \in [0, 1)$ (which are obviously rigidity sequences for the corresponding irrational rotations $Tx = x + \alpha$ on the additive circle) to show that such sequences are rigid for some weakly mixing automorphisms. Let us also mention Aaronson’s result (Aaronson 1979): Given any sequence $(r_n)$ of density 0, there is a sequence $(q_n)$ such that $q_n < r_n$, $n \geq 1$, and $(q_n)$ is rigid for some weakly mixing automorphisms.

Moreover, in Bergelson et al. (2014), two basic questions have been formulated: Given any sequence rigid for some $T$ with discrete spectrum, must it be rigid for some weakly mixing automorphism? What about the converse?

The positive answer to the first question was given by Adams (2015) and Fayad and Thouvenot (2014). On the other hand, surprisingly, Fayad and Kanigowski (2015) answered negatively the second question: There are rigidity sequences (for weakly mixing automorphisms) which are not rigidity sequences for any rotation. For a strengthening of this result (the existence of a rigidity sequence which, as a subset, is dense in the Bohr topology on $\mathbb{Z}$), see Griesmer (2019).

One can also consider a notion stronger than rigidity, called IP-rigidity (see, e.g., Bergelson et al. (2014)): $(q_n)$ is an IP-rigidity sequence for an automorphism $T$ acting on $(X, \mathcal{B}, \mu)$ if $T^{\xi} \to \mathrm{Id}$ (in the strong topology of $L^2(X, \mathcal{B}, \mu)$) in the IP sense, that is, when $\xi \to \infty$, where $\xi = q_{m_1} + \cdots + q_{m_k}$ and we require that the smallest element $q_{m_1}$ is going to $\infty$. This notion is studied in Aaronson et al. (2014) relating it to non-singular ergodic theory (more precisely, to groups of so-called $L^\infty$-eigenvalues of non-singular automorphisms). As proved in Aaronson et al. (2014), in this category, the answer to the second question (above) from Bergelson et al. (2014) turns out to be positive. Moreover, the paper provides an example of a super-lacunary sequence (which must be a rigidity sequence by Bergelson et al. (2014) and Eisner and Grivaux (2011)) which is not IP-rigid.

In the recent preprint (Badea et al. 2018), rigidity sequences are compared to other classical notions in harmonic analysis. It is proved that rigidity sequences $(q_n)$ are nullpotent, i.e., there exists a topology $\tau$ on $\mathbb{Z}$ making it a topological
group such that $q_n \to 0$, but they are never Kazhdan. (A subset $B \subset \mathbb{Z}$ is called Kazhdan if there exists $\varepsilon > 0$ such that each unitary operator $U$ on a separable Hilbert space $H$ having a unit vector $x$ with $\sup_{n \in B} \|U^n x - x\| < \varepsilon$ has a non-zero fixed point.) We find also there a rather surprising result that the family of all rigidity sequences considered as a subset in $\mathbb{Z}^{\mathbb{N}}$ is Borel.

Spectral Theory of Parabolic Dynamical Systems

We say a system is algebraic if it is a $\mathbb{Z}$ (or $\mathbb{R}$) translation on a quotient of a Lie group by a lattice. Spectral theory (and mixing properties) of algebraic systems is by now well understood. The two main classes are actions on quotients of semi-simple and nilpotent Lie groups. In the first case, the two main examples are horocycle and geodesic flows on quotients of $SL(2, \mathbb{R})$. More generally, one can talk about quasi-unipotent and partially hyperbolic actions. Recall that in the setting of algebraic actions, being quasi-unipotent is equivalent to zero entropy, while being partially hyperbolic is equivalent to positive entropy.

It is known that in both cases, the spectrum is countable Lebesgue (we refer the reader to Katok and Thouvenot (2006) for a nice description of spectral theory of horocycle and geodesic flows). Actions on quotients of nilpotent Lie groups are also known to have countable Lebesgue spectrum in the orthocomplement of the eigenspace (we refer to Parry (1970) for details). Quantitative mixing (and higher-order mixing) of algebraic systems is also well understood (we refer the reader to a recent paper, Björklund et al. (2020), for general results on decay of correlations for algebraic systems on semi-simple Lie groups).

Much less is known in spectral theory of parabolic systems beyond the algebraic world. There is no strict definition for a system to be parabolic. However, characteristic features of parabolic systems are zero entropy, polynomial orbit growth, strong mixing, and equidistribution properties. We describe some classes of (non-algebraic) parabolic systems below. One of the main difficulties in studying non-algebraic systems is a lack of many tools from representation theory, which are available in the algebraic setting. Below, we focus on known results and questions in spectral theory of non-algebraic parabolic systems.

Time-Changes of Algebraic Systems

Perhaps the simplest class of non-algebraic parabolic dynamical systems is given by time-changes (or reparametrizations) of algebraic systems. As for algebraic systems, it is natural to consider separately the cases of time-changes of unipotent systems and nilpotent systems. We do it in two paragraphs below.

Time-changes of unipotent systems In recent years, we witnessed substantial development in understanding the theory of time-changes of unipotent flows. The first (and most studied) case is that of smooth time changes of horocycle flows. Recall first that M. Ratner (1987) established measure and joinings rigidity phenomena for time-changes of horocycle flows that are analogous to Ratner’s theory in the algebraic setting. In particular, in Ratner (1986), Ratner proved the H-property for all (sufficiently smooth) time-changes. It is not known if Ratner’s joinings and measure rigidity also hold for time-changes of general unipotent flows.

Mixing for smooth time-changes of horocycle flows was established by Marcus (1977), generalizing earlier work of Kushnirenko (1974) who required additionally small derivative in the geodesic direction. A crucial result for the theory is by L. Flaminio and G. Forni (2003), where the authors classify all invariant distributions and as a consequence show that a typical time-change is not a (measurable) quasi-coboundary (and hence the time-change is not trivially isomorphic to the original flow). A. Katok and J.-P. Thouvenot conjectured (Katok and Thouvenot 2006) that every sufficiently smooth time change of the horocycle flow has countable Lebesgue spectrum. A partial answer to this conjecture was given by G. Forni and C. Ulcigrai (2012), where the authors show that the maximal spectral type of the time-changed flow remains Lebesgue (see also a result of R. Tiedra de Aldecoa (2012), where the absolute continuity of the spectrum is proven, and
Tiedra de Aldecoa (2015b, 2017) for further applications of the commutator method in ergodic theory). A full solution of the Katok-Thouvenot conjecture (i.e., countable Lebesgue spectrum) was recently given by G. Forni, B. Fayad, and A. Kanigowski (2016). Generalizing the approach from Forni and Ulcigrai (2012), L. Simonelli (2018) showed that the spectrum of smooth time-changes of general unipotent flows remains Lebesgue. It is not known if the multiplicity is infinite, but it seems that the approach from Fayad et al. (2016) has the potential of being applicable in this setting.

Recall that Ratner’s work (Ratner 1983) allows one to classify joinings between horocycle flows. Recently, there was progress in understanding joinings for time-changes of horocycle flows. In Kanigowski et al. (2018) (see also a result in Flaminio and Forni (2019)), the authors show that there is a strong dichotomy for two smooth time-changes of horocycle flows: Either the time-change functions are cohomologous or the resulting time-changed flows are disjoint.

Even though quantitative mixing for time-changes of unipotent flows is now well understood, not much is known for quantitative higher-order correlations:

Question 1 Is the decay of higher correlations for non-trivial time-changes of horocycle flows (or more generally, unipotent flows) polynomial?

For trivial time-changes, i.e., for the horocycle flow, decay of higher correlations is indeed polynomial by a recent result of M. Björklund, M. Einsiedler, and A. Gorodnik (2020) (in fact this applies to general unipotent flows).

Time-changes of nilpotent systems Recall that nilpotent flows are never weakly mixing since they always have a non-trivial Kronecker factor. An interesting question is therefore whether one can improve mixing properties of the system by a time-change. The first result in this direction by A. Avila, G. Forni, and C. Ulcigrai (2011) is that there exists a dense set of smooth functions on the Heisenberg nilmanifold such that the resulting time-changed Heisenberg flow is mixing. This result was strengthened by D. Ravotti in Ravotti (2019) to quasi-Abelian nilflows and recently to all nilflows by Avila, Forni, Ravotti, and Ulcigrai (2019). It is important to mention that the mixing mechanism is nonquantitative. Therefore, two questions are natural to ask: What are mixing properties of general time-changes of nilflows and can one obtain some quantitative mixing results? The only case in which some progress has been recently made is that of time-changes of Heisenberg nilflows. In Forni and Kanigowski (2017), the authors show stretched polynomial decay of correlations for smooth time-changes of a full measure set of Heisenberg nilflows (parameterized by the frequency of the Kronecker factor). In the case the flow is of bounded type, the authors prove polynomial speed of decay of correlations. Moreover, in Forni and Kanigowski (2020), the authors show that for time-changes of bounded type Heisenberg nilflows, every non-trivial time-change enjoys the R-property and as a consequence is mildly mixing. Moreover, in the above setting, it also follows that every mixing time-change is mixing of all orders.

Mixing and spectral properties of time-changes of general nilflows are poorly understood. In particular, the following question seems interesting:

Question 2 Are all non-trivial smooth time-changes of general nilflows mixing?

The mixing mechanism from Avila et al. (2011) (see also Ravotti (2019) and Avila et al. (2019)) is nonquantitative. Therefore, the following question seems to be far more challenging:

Question 3 Does there exist a smooth time change of a general nilflow with AC (Lebesgue) spectrum?

Special Flows, Flows on Surfaces, and Interval Exchange Transformations

In this section, we will describe spectral results for special flows over interval exchange transformations (IETs) (irrational rotations being a particular case). As described below, such special flows arise as representations of smooth locally Hamiltonian flows on surfaces.
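
To make concrete the remark that rotations are a particular case of IETs, here is a minimal sketch in Python. It assumes the standard definition of an IET recalled in the subsection “Interval Exchange Transformations”: interval $i$ is translated by $\beta_i^{\pi} - \beta_i$, where $\beta_i$ and $\beta_i^{\pi}$ are its left endpoints before and after the exchange. The function name `iet`, the 0-based encoding of the permutation, and the use of exact rational arithmetic are choices made only for this illustration. A rational value of $\alpha$ is used so that the comparison is exact; the identity $T(x) = x + \alpha \bmod 1$ itself does not depend on $\alpha$ being irrational.

```python
from fractions import Fraction


def iet(lengths, perm):
    """Return the interval exchange map T on [0, 1).

    lengths -- positive interval lengths summing to 1 (exact Fractions)
    perm    -- perm[i] is the position of interval i after the exchange (0-based)
    """
    m = len(lengths)
    # beta[i]: left endpoint of interval i before the exchange
    beta = [sum(lengths[:i], Fraction(0)) for i in range(m)]
    # beta_pi[i]: left endpoint of interval i after the exchange, i.e. the
    # total length of the intervals that are placed before it
    beta_pi = [
        sum((lengths[j] for j in range(m) if perm[j] < perm[i]), Fraction(0))
        for i in range(m)
    ]

    def T(x):
        for i in range(m):
            if beta[i] <= x < beta[i] + lengths[i]:
                return x + beta_pi[i] - beta[i]
        raise ValueError("x must lie in [0, 1)")

    return T


# Exchanging the two intervals [0, 1 - alpha) and [1 - alpha, 1)
# gives the rotation x -> x + alpha (mod 1).
alpha = Fraction(2, 7)
T = iet([1 - alpha, alpha], [1, 0])
xs = [Fraction(k, 20) for k in range(20)]
print(all(T(x) == (x + alpha) % 1 for x in xs))
```

Replacing `[1, 0]` by a permutation of three or more symbols gives a genuine exchange of several intervals with the same code.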
Interval Exchange Transformations

To define an interval exchange transformation (IET) of $m$ intervals, we need a permutation $\pi$ of $\{1, \ldots, m\}$ and a probability vector $\lambda = (\lambda_1, \ldots, \lambda_m)$ (with positive entries). Then, we define $T = T_{\lambda,\pi}$ of $[0, 1)$ by putting

$$T_{\lambda,\pi}(x) = x + \beta_i^{\pi} - \beta_i \quad \text{for } x \in [\beta_i, \beta_{i+1}),$$

where $\beta_i = \sum_{j<i} \lambda_j$, $\beta_i^{\pi} = \sum_{\pi j < \pi i} \lambda_j$. Obviously, each IET preserves Lebesgue measure. One possible approach to study ergodic properties of IETs is an “a.e.” approach “seen” in the definition of $T_{\lambda,\pi}$. It is based on the fundamental fact that the induced transformation on a subinterval of $[0, 1)$ is also an IET (see Cornfeld et al. (1982)). This leads to very delicate and deep mathematics based on Rauzy induction, which is a way of inducing on special intervals, considering only irreducible permutations whose set is partitioned into orbits of some maps (any such orbit is called a Rauzy class). If now $R$ is a Rauzy class of permutations and $\lambda$ lies in the standard simplex $\Delta_{m-1}$, then the Rauzy induction together with a natural renormalization leads to a map $P : R \times \Delta_{m-1} \to R \times \Delta_{m-1}$. For a better understanding of the dynamics of the Rauzy map, Veech (1982b) introduced the space of zippered rectangles. A zippered rectangle associated with the Rauzy class $R$ is a quadruple $(\lambda, h, a, \pi)$, where $\lambda \in \mathbb{R}^m_+$, $h \in \mathbb{R}^m_+$, $a \in \mathbb{R}^m_+$, $\pi \in R$ and the vectors $h$ and $a$ satisfy some equations and inequalities. Every zippered rectangle $(\lambda, h, a, \pi)$ determines a Riemann structure on a compact connected surface. Denote by $\Omega(R)$ the space of all zippered rectangles, corresponding to a given Rauzy class $R$ and satisfying the condition $\langle \lambda, h \rangle = 1$. In Veech (1982b), Veech defined a flow $(P^t)_{t \in \mathbb{R}}$ on the space $\Omega(R)$ putting

$$P^t(\lambda, h, a, \pi) = (e^{t}\lambda, e^{-t}h, e^{-t}a, \pi)$$

and extended the Rauzy map. On the so-called Veech moduli space of zippered rectangles, the flow $(P^t)$ becomes the Teichmüller flow, and it preserves a natural Lebesgue measure class; by passing to a transversal and projecting the measure on the space of IETs $R \times \Delta_{m-1}$, Veech has proved the following fundamental theorem (Veech 1982b; see also Masur (1982)) which is a generalization of the fact that the Gauss measure $\frac{1}{\ln 2}\,\frac{1}{1+x}\,dx$ is invariant for the Gauss map which sends $t \in (0, 1)$ into the fractional part of its inverse.

Theorem 10 (Veech and Masur) (1982) Assume that $R$ is a Rauzy class. There exists a $\sigma$-finite measure $\mu_R$ on $R \times \Delta_{m-1}$ which is $P$-invariant, equivalent to “Lebesgue” measure, conservative and ergodic.

In Veech (1982b), it is proved that a.e. (in the above sense) IET is then of rank one (and hence is ergodic and has a simple spectrum). He also formulated as an open problem whether we can achieve the weak mixing property a.e. This has been answered in the positive by A. Avila and G. Forni (2007) (for $\pi$ which is not a rotation). Katok (1980) proved that IETs have no mixing factors (in fact his proof shows more: The IET class is disjoint from the class of mixing transformations). By their nature, IETs are of finite rank (see Cornfeld et al. (1982)), so they are of finite multiplicity. They need not be of simple spectrum (see remarks in Katok and Thouvenot (2006), pp. 88–90). It remains an open question whether an IET can have a non-singular spectrum. Joining properties in the class of exchanges of 3 and more intervals are studied in Ferenczi et al. (2004, 2005). In Chaika and Eskin (2018), the authors show that a.e. 3-IET is not simple. This answers a special case of a question of Veech (1982a) whether a.e. IET is simple. The case of $d$-IETs with $d > 3$ is still widely open.

Smooth Flows on Surfaces and Their Special Representations

We consider a closed, connected, smooth, and orientable surface $S$ of genus $g \geq 1$. Let $X : S \to TS$ be a smooth vector field with finitely many fixed points and such that the corresponding flow $\varphi_t^X$ preserves a smooth area form $\omega$. The flow $\varphi_t^X$ is called a locally Hamiltonian flow; it is locally given by a smooth Hamiltonian $H$ (up to an additive constant), so that $\varphi_t^X$ is a solution to
particular, all such flows have singular spectra. Moreover, in Frączek and Lemańczyk (2004), it is proved that whenever the Fourier transform of the roof function $f$ is of order $O(1/n)$, then $T^f$ is disjoint from all mixing flows (see also Frączek and Lemańczyk (2003)). In fact in the papers (Frączek and Lemańczyk 2003, 2004, 2005, 2006), the authors discuss the problem of disjointness of those special flows with all ELF-flows, conjecturing that no flow of probabilistic origin has a smooth realization on a surface. In Lemańczyk and Wysokińska (2007), the analytic case is considered, leading to a “generic” result on disjointness with the ELF class generalizing the classical Shklover’s result on the weak mixing property (Shklover 1967). A. Katok (1980) proved the absence of mixing for special flows over IETs when the roof function is of bounded variation (see also Ryzhikov (1994a)). Katok’s theorem was strengthened in Frączek and Lemańczyk (2005) to the disjointness theorem with the class of mixing flows. A. Avila and G. Forni (Avila and Forni 2007) proved that a.e. translation flow on a surface (of genus at least two) is weakly mixing (which is a drastic difference with the linear flow case of the torus, where the spectrum is always discrete).

One important property in the class of special flows over rotations and IETs is Ratner’s property (R-property). This property may be viewed as a particular way of divergence of orbits of close points; it was shown to hold for horocycle flows by M. Ratner (1983). We refer the reader to Ratner (1983) and the survey article in Thouvenot (1995) for the formal definitions and basic consequences of R-property. In particular, R-property implies “rigidity” of joinings and it also implies the PID property; hence, mixing and R-property imply mixing of all orders. In Frączek and Lemańczyk (2006) and Frączek et al. (2007), a version of R-property is shown for the class of von Neumann special flows (however, $\alpha$ is assumed to have bounded partial quotients). This allowed one to prove there that such flows are even mildly mixing (mixing is excluded by a result of Kochergin). The eigenvalue problem (mainly how many frequencies the group of eigenvalues can have) for special flows over irrational rotations is studied in Fayad et al. (2001), Fayad and Windsor (2007), and Guenais and Parreau (2005).

It follows from Frączek and Lemańczyk (2004) that von Neumann’s flows have singular spectrum. However, nothing is known about their multiplicity.

Question 4 What is the spectral multiplicity of von Neumann flows?

Symmetric Logarithmic Singularities

Kochergin (Kochergin 1976a) proved the absence of mixing for flows, where the roof function has finitely many singularities; however, some Diophantine restriction is put on $\alpha$. In Lemańczyk (2000), where also the absence of mixing is considered for the symmetric logarithmic case, it was conjectured (and proved for arbitrary rotation) that a necessary condition for mixing of a special flow $T^f$ (with arbitrary $T$ and $f$) is the condition that the sequence of distributions $\left(f_0^{(n)}\right)_{n}$ tends to $\delta_\infty$ in the space of probability measures on $\overline{\mathbb{R}}$. K. Schmidt (Schmidt 2002) proved it using the theory of cocycles and extending a result from Aaronson and Weiss (2000) on tightness of cocycles. Ulcigrai (2011) showed that for every $d \geq 2$ and a.e. IET of $d$ intervals, the corresponding special flow $T^f$ is not mixing. However, Chaika and Wright (2019) proved existence of an IET $T$ such that $T^f$ is mixing. Notice that by Frączek and Lemańczyk (2004), it also follows that for a.e. irrational rotation $T^f$ has purely singular spectral type. A recent result (Chaika et al. 2019) shows that one can also prove singularity of the spectrum for symmetric IETs in the base. This in particular shows that minimal flows on genus 2 surfaces (with two isometric saddles) have purely singular spectral type. In Kanigowski and Kułaga-Przymus (2016), the authors showed that if $T$ is an IET of bounded type, then $T^f$ is mildly mixing (and has the R-property). For symmetric logarithmic singularities, the following two questions are open:
Question 5 What is the maximal spectral type of $T^f$ for general IETs?

Nothing is known about multiplicity of the spectrum in this setting.

Asymmetric Logarithmic Singularities

In this case, mixing properties are different. Khanin and Sinai (1992) showed that $T^f$ is mixing for a.e. irrational rotation $T$. This was strengthened by Ulcigrai to a.e. IET (Ulcigrai 2007). Moreover, in Ravotti (2017), Ravotti obtained quantitative mixing estimates (with sub-logarithmic speed of decay of correlations). Not much was known about multiple mixing for $T^f$. This changed recently: In Fayad and Kanigowski (2016), the authors showed that for a.e. irrational rotation, $T^f$ enjoys a variant of the R-property and hence is multiple mixing. This was strengthened to a.e. IETs in Kanigowski et al. (2019). The following two questions seem to be natural (the first one already stated as Questions 34, 35 in Fayad and Krikorian (2018)):

Question 6 What is the maximal spectral type of $T^f$?

Moreover, one can ask about quantitative higher-order decay:

Question 7 Is the decay of higher-order correlations sub-logarithmic?

Both of the above questions are open even for rotations in the base. As in the symmetric case, nothing is known about multiplicity of the spectrum.

Power Singularities

In case $f$ has power singularities, $T^f$ was shown to be mixing by Kochergin (1975) (for any irrational rotation and IETs). In Fayad and Kanigowski (2016), the authors show that if $T$ is a rotation of bounded type, then $T^f$ is multiple mixing (it enjoys a variant of the R-property). Polynomial decay for some flows with power singularities was obtained by B. Fayad (2002b). In Fayad et al. (2016), the authors considered the spectrum of $T^f$. They showed that if $f$ has sufficiently strong power singularity (of the form $x^{-1+\eta}$ for small $\eta > 0$), then $T^f$ has countable Lebesgue spectrum for a.e. irrational rotation. To the best of our knowledge, this is the only result dealing with multiplicity for smooth surface flows. The following problems are natural (see Question 34 in Fayad and Krikorian (2018)):

Question 8 What is the maximal spectral type of $T^f$ when $f$ has power singularities? What is the multiplicity?

To answer this, one needs to consider general IETs in the base, as well as functions with weaker power singularity than in Fayad et al. (2016). The following question is still open (Question 38 in Fayad and Krikorian (2018)):

Question 9 Are all mixing surface flows mixing of all orders?

Finally, it may also be useful to show that smooth flows on surfaces are disjoint from flows of probabilistic origin; see del Junco and Lemańczyk (1999, 2005), Lemańczyk et al. (2011), Ryzhikov and Thouvenot (2006), and Thouvenot (2000).

B. Fayad (2006) gives a criterion that implies singularity of the maximal spectral type for a dynamical system on a Riemannian manifold. As an application, he gives a class of smooth mixing flows (with singular spectra) on $\mathbb{T}^3$ obtained from linear flows by a time change (again this is a drastic difference with dimension two, where a smooth time change of a linear flow leads to non-mixing flows (Cornfeld et al. 1982)).

We mention at the end that if we drop here (and in other problems) the assumption of regularity of $f$, then the answers will be always positive because of the LB theory; in particular, there is a section of any horocycle flow (it has the LB property (Ratner 1978)) such that in the corresponding special representation $T^f$, the map $T$ is an irrational rotation.
Using a result of Kochergin (Kochergin 1976b) on cohomology (see also Katok (2003a) and Rudolph (1986)), the L1-function f is cohomologous to a positive function g which is even continuous; thus, T^f is isomorphic to T^g.

Spectral Theory for Locally Compact Groups of Type I

This section has been written by A. Danilenko.

Groups of Type I
The spectral theory presented here for Abelian group actions extends potentially to probability-preserving actions of non-Abelian locally compact groups of type I. We now provide the definition of type I. Let G be a locally compact second countable group, H a separable Hilbert space, and π : G ∋ g ↦ π(g) a (weakly) continuous unitary representation of G in H. We say that π is of type I if there is a subset A ⊂ {1, 2, . . ., +∞} such that π is unitarily equivalent to the orthogonal sum $\bigoplus_{k\in A} U_k \otimes I_k$, where U_k is a unitary representation of G with a simple spectrum and I_k is the trivial representation in the Hilbert space of dimension k.

Definition 1 If every unitary representation of G is of type I, then G is called of type I.

Denote by Ĝ the unitary dual of G, i.e., the set of unitary equivalence classes of all irreducible unitary representations of G. If G is Abelian, then every irreducible representation of G is one dimensional. Hence, Ĝ is identified naturally with the group of characters of G. In the general case, let Irr_n(G) stand for the set of all irreducible unitary representations of G in the n-dimensional separable Hilbert space K_n, 1 ≤ n ≤ +∞. Endow it with the natural Borel structure, i.e., the smallest one in which the mapping π ↦ ⟨π(g)f, h⟩ is Borel for every g ∈ G and f, h ∈ K_n. It is standard. Let Ĝ_n be the quotient of Irr_n(G) by the unitary equivalence relation. Endow Ĝ_n with the quotient Borel structure. Since

$$\widehat{G} = \bigsqcup_{n=1}^{\infty} \widehat{G}_n \sqcup \widehat{G}_{\infty},$$

we obtain a Borel structure on Ĝ. It is called the Mackey-Borel structure on Ĝ. By the Glimm theorem, the Mackey-Borel structure is standard if and only if G is of type I (Glimm 1960).
For n ∈ ℕ ∪ {∞}, denote by I_n the identity operator on K_n. Then for each unitary representation π of G in H, there are a measure λ on Ĝ, a measurable field Ĝ ∋ ω ↦ H_ω of Hilbert spaces, a measurable field Ĝ ∋ ω ↦ V_ω of irreducible unitary G-representations such that V_ω ∈ ω and H_ω is the space of V_ω, and a measurable map m : Ĝ → ℕ ∪ {+∞} such that

$$H = \int_{\widehat{G}}^{\oplus} H_\omega \otimes K_{m(\omega)}\, d\lambda(\omega) \quad\text{and}\quad \pi(g) = \int_{\widehat{G}}^{\oplus} V_\omega(g) \otimes I_{m(\omega)}\, d\lambda(\omega).$$

It turns out that if G is of type I, then the equivalence class of λ is defined uniquely by π, and the function m is defined up to a λ-zero subset. We call the class of λ the maximal spectral type of π, and we call m the spectral multiplicity of π. If we have a probability-preserving action T = (T_g)_{g∈G} of G, then we can consider the corresponding Koopman unitary representation π of G. The maximal spectral type of π and the spectral multiplicity of π are called the maximal spectral type of T and the spectral multiplicity of T, respectively. If G is Abelian, then these concepts coincide with their classic counterparts considered above in the survey.
All compact groups, Abelian groups, connected semi-simple Lie groups, and nilpotent Lie groups (or, more generally, exponential Lie groups) are of type I. Each subgroup of GL(n, ℝ) determined by a system of algebraic equations is also of type I. Solvable Lie groups may or may not be of type I. If G is a countable (discrete) group, then G is of type I if and only if it is virtually Abelian, i.e., it contains an Abelian subgroup of finite index (Thoma 1964).
Even if we know that a non-Abelian group G is of type I, it is usually not an easy problem to
For simplicity, we will denote the matrix

$$\begin{pmatrix} 1 & a & c \\ 0 & 1 & b \\ 0 & 0 & 1 \end{pmatrix}$$

by [a, b, c]. The unitary dual of H3(ℝ) is identified with ℝ² ⊔ ℝ* endowed with the natural standard Borel structure. Every irreducible unitary representation of H3(ℝ) is unitarily equivalent to either a one-dimensional π_{α,β} with (α, β) ∈ ℝ² or an infinite dimensional π_γ in L²(ℝ, Leb), with γ ∈ ℝ* ≔ ℝ\{0}, such that

$$\pi_{\alpha,\beta}[a,b,c] := e^{2\pi i(\alpha a + \beta b)},$$
$$\pi_{\gamma}[a,b,c]f(x) := e^{2\pi i \gamma (c + bx)} f(x+a), \qquad f \in L^2(\mathbb{R}, \mathrm{Leb}).$$

Now, given a measure-preserving H3(ℝ)-action on a standard probability space (X, μ), let U denote the corresponding Koopman representation in L²(X, μ). Then there are a probability measure σ_{1,2} on ℝ², a function l_{1,2} : ℝ² → ℕ, a probability measure σ_3 on ℝ*, and a function l_3 : ℝ* → ℕ such that

1. σ_{1,2}(ℝ²\{(0, 0)}) = 0, i.e., there are no non-trivial one-dimensional representations in the spectral decomposition of U. The maximal spectral type of T equals the maximal spectral type of the restriction of T to the center of H3(ℝ) (modulo the natural identification);
2. T is mixing (see also Ryzhikov (1994b, 1996));
3. the weak closure of the group {T_g : g ∈ H3(ℝ)} in Aut(X, μ) is the union of {T_g : g ∈ H3(ℝ)} and the weak closure of {T_{[0,0,t]} : t ∈ ℝ};
4. if T is rigid, then (T_{[0,0,t]})_{t∈ℝ} is rigid.

In Danilenko (2014), explicit examples were constructed of mixing of all orders rank one (and hence zero entropy) actions T of the Heisenberg group.
The concept of simplicity for ergodic H3(ℝ)-actions is defined in a similar way as for the Abelian actions. A simple H3(ℝ)-action T has MSJ if the centralizer of the action is {T_{[0,0,t]} : t ∈ ℝ}. It was shown in Danilenko (2014) that the examples of mixing H3(ℝ)-actions constructed there satisfy also the following:
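As a numerical aside on the conventions fixed above for H3(ℝ), the following minimal sketch checks that the group law induced by matrix multiplication on triples [a, b, c] is compatible with the formulas for π_{α,β} and π_γ, i.e., that both are homomorphisms at sample points. The helper names (`heis_mul`, `pi_one_dim`, `pi_gamma`, `sample_function`) and the sample values are hypothetical choices made only for illustration, and the factor γ in the exponent of π_γ is assumed as in the standard Schrödinger-type formula.

```python
import cmath
import math


def heis_mul(g, h):
    """Product [a, b, c] * [a2, b2, c2] of upper unitriangular 3x3 matrices."""
    a, b, c = g
    a2, b2, c2 = h
    return (a + a2, b + b2, c + c2 + a * b2)


def pi_one_dim(alpha, beta, g):
    """One-dimensional representation: exp(2*pi*i*(alpha*a + beta*b))."""
    a, b, _ = g
    return cmath.exp(2j * math.pi * (alpha * a + beta * b))


def pi_gamma(gamma, g, f):
    """Return the function x -> exp(2*pi*i*gamma*(c + b*x)) * f(x + a)."""
    a, b, c = g
    return lambda x: cmath.exp(2j * math.pi * gamma * (c + b * x)) * f(x + a)


def sample_function(x):
    """A Gaussian, used only as a sample element of L^2(R, Leb)."""
    return math.exp(-x * x)


# Illustrative sample data (assumptions, not taken from the text).
g = (0.3, -1.2, 0.7)
h = (1.1, 0.4, -0.5)
gamma = 0.8
x0 = 0.25

# pi_gamma(g) pi_gamma(h) f versus pi_gamma(g*h) f, evaluated at x0.
lhs = pi_gamma(gamma, g, pi_gamma(gamma, h, sample_function))(x0)
rhs = pi_gamma(gamma, heis_mul(g, h), sample_function)(x0)
hom_error = abs(lhs - rhs)

# The same check for a one-dimensional representation.
one_dim_error = abs(
    pi_one_dim(0.5, -0.3, g) * pi_one_dim(0.5, -0.3, h)
    - pi_one_dim(0.5, -0.3, heis_mul(g, h))
)

# A central element [0, 0, t] acts under pi_gamma as the scalar exp(2*pi*i*gamma*t).
center = (0.0, 0.0, 0.6)
central_scalar = pi_gamma(gamma, center, sample_function)(x0) / sample_function(x0)
```

Both errors should be at the level of floating-point rounding. The last quantity illustrates why the center {[0, 0, t] : t ∈ ℝ} plays a distinguished role: under π_γ it acts by scalars, while under π_{α,β} it acts trivially.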
As a corollary, we obtain examples of mixing Poisson and mixing Gaussian (probability preserving) actions of H3(ℝ).

Heisenberg Odometers
In Danilenko and Lemańczyk (2016), the authors isolated a special class of ergodic H3(ℝ)-actions, called the odometer actions. Namely, let Γ_1 ⊃ Γ_2 ⊃ ⋯ be a sequence of lattices in H3(ℝ). Then, we can associate a sequence of homogeneous H3(ℝ)-spaces intertwined with H3(ℝ)-equivariant maps:

$$H_3(\mathbb{R})/\Gamma_1 \leftarrow H_3(\mathbb{R})/\Gamma_2 \leftarrow \cdots .$$

Denote by X the projective limit of this sequence. Then X is a compact Polish G-space. Endow each space H3(ℝ)/Γ_n with the Haar probability measure. The projective limit of the sequence of these measures is an H3(ℝ)-invariant probability measure μ. Of course, the H3(ℝ)-action on (X, μ) is ergodic. It is called the Heisenberg odometer associated with (Γ_n)_{n=1}^∞. We consider the Heisenberg odometers as non-commutative counterparts of the ergodic ℤ-actions and ℝ-actions with pure point rational spectrum.
A complete spectral decomposition of the Heisenberg odometers is found in Danilenko and Lemańczyk (2016). Denote by p : H3(ℝ) → ℝ² the homomorphism [a, b, c] ↦ (a, b). The kernel of this homomorphism is the center of the Heisenberg group. Given a lattice Γ in H3(ℝ), we denote by ξ_Γ a positive real such that Γ ∩ Ker p = {[0, 0, nξ_Γ] | n ∈ ℤ}. Of course, p(Γ) is a lattice in ℝ². If p(Γ) = A(ℤ²) for some matrix A ∈ GL(2, ℝ), then we denote by p(Γ)* the dual lattice (A*)^{-1}ℤ² in ℝ².

Theorem 12 Let U stand for the Koopman unitary representation of the Heisenberg odometer associated with a sequence of lattices Γ_1 ⊃ Γ_2 ⊃ ⋯. If $\bigcup_{n=1}^{\infty} p(\Gamma_n)^*$ is not closed in ℝ², then

Danilenko and Lemańczyk (2016) for details). Thus, we see that the maximal spectral type of Heisenberg odometers is purely atomic.
For a decreasing sequence Γ = (Γ_n)_{n=1}^∞ of lattices in H3(ℝ), we let

$$S(\Gamma) := \bigcup_{n=1}^{\infty} p(\Gamma_n)^* \quad\text{and}\quad \xi_\Gamma := \bigcup_{n=1}^{\infty} \xi_{\Gamma_n}^{-1}\mathbb{Z}.$$

The following theorem (except for the first claim) and the remarks below demonstrate a drastic difference between H3(ℝ)-odometers and ℤ-odometers.

Theorem 13 Two Heisenberg odometers T and T′ associated with decreasing sequences of lattices Γ and Γ′, respectively, are unitarily equivalent if and only if S(Γ) = S(Γ′) and ξ_Γ = ξ_{Γ′}. The direct product T × T′ ≔ (T_g × T′_g)_{g∈G} is not spectrally equivalent to any Heisenberg odometer. T × T′ is ergodic if and only if S(Γ) ∩ S(Γ′) = {0}. T × T′ is ergodic and has discrete maximal spectral type if and only if S(Γ) ∩ S(Γ′) = {0} and ξ_Γ ∩ ξ_{Γ′} = {0}.

It was shown in Danilenko and Lemańczyk (2016) that Heisenberg odometers are not isospectral, i.e., unitary equivalence does not, in general, imply isomorphism for the underlying H3(ℝ)-actions. It was also shown in Danilenko and Lemańczyk (2016) that Heisenberg odometers are not spectrally determined: a Heisenberg odometer is constructed which is unitarily equivalent to an H3(ℝ)-action which is not isomorphic to any Heisenberg odometer.

On the “Finitely Dimensional” Part of the Spectrum
Suppose that G is an arbitrary locally compact second countable group. If G is not of type I, then Ĝ furnished with the Mackey-Borel structure is a “bad” (not standard) Borel space. Then a
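Returning to the dual lattices p(Γ)* = (A*)^{-1}ℤ² used in the Heisenberg odometer discussion above, the following minimal sketch computes a generating matrix of the dual lattice in exact rational arithmetic and checks the defining pairing property ⟨x, y⟩ ∈ ℤ for x ∈ A(ℤ²), y ∈ (A*)^{-1}ℤ². It assumes that A* denotes the transpose of the real matrix A. The helper names and the sample matrices, including the family A_n = 2^n·I, are hypothetical and chosen only for illustration; whether a given A_n arises as p(Γ_n) for a decreasing sequence of lattices is not checked here.

```python
from fractions import Fraction


def transpose(m):
    """Transpose of a 2x2 matrix given as nested lists."""
    return [[m[0][0], m[1][0]], [m[0][1], m[1][1]]]


def inverse(m):
    """Inverse of a 2x2 matrix with Fraction entries."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]


def dual_generator(a):
    """Matrix D = (A^T)^{-1}; the dual lattice of A(Z^2) is D(Z^2)."""
    return inverse(transpose(a))


def apply(m, v):
    """Matrix-vector product for a 2x2 matrix and a pair."""
    return (m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1])


def dot(u, v):
    """Standard inner product on R^2."""
    return u[0] * v[0] + u[1] * v[1]


# Illustrative lattice A(Z^2) (an assumption, not taken from the text).
A = [[Fraction(2), Fraction(1)], [Fraction(0), Fraction(3)]]
D = dual_generator(A)

# Every lattice vector pairs integrally with every dual-lattice vector.
pairings_are_integers = all(
    dot(apply(A, (u1, u2)), apply(D, (w1, w2))).denominator == 1
    for u1 in range(-2, 3) for u2 in range(-2, 3)
    for w1 in range(-2, 3) for w2 in range(-2, 3)
)

# For the shrinking lattices 2^n Z^2 the duals 2^{-n} Z^2 grow with n.
dyadic_duals = [
    dual_generator([[Fraction(2 ** n), Fraction(0)],
                    [Fraction(0), Fraction(2 ** n)]])
    for n in range(1, 5)
]
```

In the illustrative dyadic family the dual lattices increase with n, and their union is the set of pairs of dyadic rationals, which is dense and hence not closed in ℝ². This is the kind of situation addressed by the closedness hypothesis in Theorem 12.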
space. Assume that there exists a non-trivial automorphism S with a singular spectrum which is not disjoint from T. Then T has a non-trivial factor which is disjoint from any automorphism with a Lebesgue spectrum.
The problem of spectral multiplicity of Cartesian products for a “typical” transformation, studied by Katok (2003a), and its solution in Ageev (2008), which we already considered in section “The Multiplicity Function”, lead to a study of those T for which

$$(\mathrm{CS})\qquad \sigma^{(m)} \perp \sigma^{(n)} \quad\text{whenever } m \neq n,$$

where σ = σ_T just stands for the reduced maximal spectral type of U_T (which is constantly assumed to be a continuous measure); see also Stepin’s article (Stepin 1986).
The usefulness of the above property (CS) in ergodic theory was already shown in del Junco and Lemańczyk (1992), where a spectral counterexample machinery was presented using the following observation: If A is a T^{×∞}-invariant sub-σ-algebra such that the maximal spectral type on L²(A) is absolutely continuous with respect to σ_T, then A is contained in one of the coordinate sub-σ-algebras B. Based on that, in del Junco and Lemańczyk (1992), it is shown how to construct two weakly isomorphic actions which are not isomorphic or how to construct two non-disjoint automorphisms which have no common non-trivial factors (such constructions were previously known for so-called minimal self-joining automorphisms (Rudolph 1979)). See also Tikhonov (2002) for extensions of those results to ℤ^d-actions.
Prikhodko and Ryzhikov (2000) proved that the classical Chacon transformation enjoys the (CS) property. The SCS property defined in section “Glossary and Notation” is stronger than the (CS) condition above; the SCS property implies that the corresponding Gaussian system S_{σ_T} has a simple spectrum. Ageev (2000) shows that the Chacon transformation satisfies the SCS property; moreover, in Ageev (2008), he shows that the SCS property is satisfied generically and he gives a construction of a rank one mixing SCS-system (see also Ryzhikov (2007)). In Lemańczyk and Parreau (2007), it is proved that some special flows considered in section “Special Flows, Flows on Surfaces, and Interval Exchange Transformations” (including the von Neumann class, however, with α having unbounded partial quotients) have the SCS property. It is quite plausible that the SCS property is commonly seen for smooth flows on surfaces.
A classical open problem is whether each ergodic automorphism has a smooth model. While this problem stays open even for the so-called dyadic adding machine (ergodic, discrete spectrum automorphism having roots of unity of degree 2^n, n ≥ 1, as eigenvalues), A. Katok suggested many years ago that one can construct a Kronecker measure so that the corresponding Gaussian system (ℤ-action (!)) has a smooth representation on the torus. No “written” proof of this fact has yet appeared.
In Vershik (2011), A.M. Vershik sketches a proof of the fact (claimed by himself for decades) that the Pascal adic transformation is weakly mixing. Earlier, X. Méla in his PhD showed that this transformation is ergodic and has zero entropy. Moreover, in Janvresse and de la Rue (2004), Janvresse and de la Rue proved the LB property of this map. No complete proof of weak mixing of the Pascal adic transformation seems to exist in the published literature.
In Vershik (2005), Vershik proposed to study a new equivalence between measure-preserving automorphisms, called quasi-similarity. Two automorphisms T on (X, B, μ) and S on (Y, C, ν) are quasi-similar if there are Markov operators J : L²(X, B, μ) → L²(Y, C, ν) and K : L²(Y, C, ν) → L²(X, B, μ), both with dense ranges, intertwining the corresponding Koopman operators U_T and U_S. This new equivalence is strictly stronger than spectral isomorphism but strictly weaker than (weak) isomorphism, as shown in Frączek and Lemańczyk (2010), answering one of the questions by Vershik. However, possible invariants for this new equivalence are not well understood. For example, in Frączek and Lemańczyk (2010), it is shown that an automorphism quasi-similar to
Avila A, Forni G, Ravotti D, Ulcigrai C (2019) Mixing for smooth time-changes of general nilflows, preprint, arXiv:1905.11628
Badea C, Grivaux S, Matheron E (2018) Rigidity sequences, Kazhdan sets and group topologies on the integers, arXiv:1812.09014
Balister P, Bollobás B, Morris R, Sahasrabudhe J, Tiba M (2019) Flat Littlewood polynomials exist, arXiv:1907.09464
Bashtanov AI (2013) Generic mixing transformations are rank 1. Math Notes 93:209–216
Bashtanov AI (2016) Conjugacy classes are dense in the space of mixing Z^d-actions. Math Notes 99:9–23
Baxter JR (1971) A class of ergodic transformations having simple spectrum. Proc Am Math Soc 27:275–279
Bergelson V, del Junco A, Lemańczyk M, Rosenblatt J (2014) Rigidity and non-recurrence along sequences. Ergodic Theory Dyn Syst 34:1464–1502
Björklund M, Einsiedler M, Gorodnik A (2020) Quantitative multiple mixing. J Eur Math Soc (JEMS) 22(5):1475–1529
Bourgain J (1993) On the spectral type of Ornstein’s class of transformations. Isr J Math 84:53–63
Bourgain J (2013) On the correlation of the Moebius function with rank-one systems. J Anal Math 120:105–130
Cambanis S, Podgórski K, Weron A (1995) Chaotic behaviour of infinitely divisible processes. Stud Math 115:109–127
Cecchi PA, Tiedra de Aldecoa R (2016) Furstenberg transformations on Cartesian products of infinite-dimensional tori. Potent Anal 44:43–51
Chacon RV (1970) Approximation and spectral multiplicity. In: Dold A, Eckmann B (eds) Contributions to ergodic theory and probability. Springer, Berlin, pp 18–27
Chaika J, Eskin A (2018) Self-joinings for 3-IET’s, arXiv:1805.11167v2
Chaika J, Wright A (2019) A smooth mixing flow on a surface with nondegenerate fixed points. J Am Math Soc 32:81–117
Chaika J, Frączek K, Kanigowski A, Ulcigrai C (2019) Singularity of the spectrum for smooth area-preserving flows in genus two, preprint
Choe GH (1990) Products of operators with singular continuous spectra. In: Operator theory: operator algebras and applications, Part 2 (Durham, NH, 1988). Proceedings of symposia in pure mathematics, vol 51. American Mathematical Society, Providence, pp 65–68
Connes A, Woods E (1985) Approximately transitive flows and ITPFI factors. Ergodic Theory Dyn Syst 5:203–236
Cornfeld IP, Fomin SV, Sinai YG (1982) Ergodic theory. Springer, New York
Creutz D, Silva C (2010) Mixing on rank-one transformations. Stud Math 199:43–72
Danilenko AI (2006) Explicit solution of Rokhlin’s problem on homogeneous spectrum and applications. Ergodic Theory Dyn Syst 26:1467–1490
Danilenko AI (2010) On new spectral multiplicities for ergodic maps. Stud Math 197:57–68
Danilenko AI (2012) New spectral multiplicities for mixing transformations. Ergodic Theory Dyn Syst 32:517–534
Danilenko AI (2013) A survey on spectral multiplicities of ergodic actions. Ergodic Theory Dyn Syst 33:81–117
Danilenko AI (2014) Mixing actions of the Heisenberg group. Ergodic Theory Dyn Syst 34:1142–1167
Danilenko AI, Lemańczyk M (2005) A class of multipliers for W⊥. Isr J Math 148:137–168
Danilenko AI, Lemańczyk M (2013) Spectral multiplicities for ergodic flows. Discrete Contin Dyn Syst 33:4271–4289
Danilenko AI, Lemańczyk M (2016) Odometer actions of the Heisenberg group. J Anal Math 128:107–157
Danilenko AI, Park KK (2002) Generators and Bernoullian factors for amenable actions and cocycles on their orbits. Ergodic Theory Dyn Syst 22:1715–1745
Danilenko AI, Ryzhikov VV (2010) Spectral multiplicities for infinite measure preserving transformations. Funct Anal Appl 44:161–170
Danilenko AI, Ryzhikov VV (2011) Mixing constructions with infinite invariant measure and spectral multiplicities. Ergodic Theory Dyn Syst 31:853–873
Danilenko AI, Solomko AV (2010) Ergodic Abelian actions with homogeneous spectrum. Contemporary mathematics, vol 532. American Mathematical Society, Providence, pp 137–148
de la Rue T (1996) Systèmes dynamiques gaussiens d’entropie nulle, lâchement et non lâchement Bernoulli. Ergodic Theory Dyn Syst 16:379–404
de la Rue T (1998a) Rang des systèmes dynamiques gaussiens. Isr J Math 104:261–283
de la Rue T (1998b) L’ergodicité induit un type spectral maximal équivalent à la mesure de Lebesgue. Ann Inst Henri Poincaré Probab Statist 34:249–263
de la Rue T (1998c) L’induction ne donne pas toutes les mesures spectrales. Ergodic Theory Dyn Syst 18:1447–1466
de la Rue T (2004) An extension which is relatively twofold mixing but not threefold mixing. Colloq Math 101:271–277
de la Rue T (2009) Joinings in ergodic theory. In: Encyclopedia of complexity and system science. Springer, New York
del Junco A (1976) Transformations with discrete spectrum are stacking transformations. Can J Math 28:836–839
del Junco A (1977) A transformation with simple spectrum which is not of rank one. Can J Math 29:655–663
del Junco A (1981) Disjointness of measure-preserving transformations, minimal self-joinings and category. Prog Math 10:81–89
del Junco A, Lemańczyk M (1992) Generic spectral properties of measure-preserving maps and applications. Proc Am Math Soc 115:725–736
del Junco A, Lemańczyk M (1999) Simple systems are disjoint with Gaussian systems. Stud Math 133:249–256
del Junco A, Lemańczyk M (2005) Joinings of distally simple systems, preprint
del Junco A, Rudolph D (1987) On ergodic actions whose self-joinings are graphs. Ergodic Theory Dyn Syst 7:531–557
Derriennic Y, Frączek K, Lemańczyk M, Parreau F (2008) Ergodic automorphisms whose weak closure of off-diagonal measures consists of ergodic self-joinings. Colloq Math 110:81–115
Dooley A, Golodets VY (2002) The spectrum of completely positive entropy actions of countable amenable groups. J Funct Anal 196:1–18
Dooley A, Quas A (2005) Approximate transitivity for zero-entropy systems. Ergodic Theory Dyn Syst 25:443–453
Downarowicz T, Kwiatkowski J (2000) Spectral isomorphism of Morse flows. Fundam Math 163:193–213
Downarowicz T, Kwiatkowski J (2002) Weak Closure theorem fails for Z^d-actions. Stud Math 153:115–125
Eisner T, Grivaux S (2011) Hilbertian Jamison sequences and rigid dynamical systems. J Funct Anal 261:2013–2052
El Abdalaoui H (2000) On the spectrum of the powers of Ornstein transformations. Sankhya Ser A 62:291–306. Ergodic theory and harmonic analysis (Mumbai, 1999)
El Abdalaoui H (2007) A new class of rank 1 transformations with singular spectrum. Ergodic Theory Dyn Syst 27:1541
El Abdalaoui H (2015) Ergodic Banach problem on simple Lebesgue spectrum and flat polynomials, arXiv:1508.06439v4
El Abdalaoui H, Parreau F, Prikhodko A (2006) A new class of Ornstein transformations with singular spectrum. Ann Inst Henri Poincaré Probab Stat 42:671–681
El Abdalaoui H, Lemańczyk M, de la Rue T (2014) On spectral disjointness of powers for rank-one transformations and Möbius orthogonality. J Funct Anal 266:284–317
El Abdalaoui H, Kasjan S, Lemańczyk M (2016) 0-1 sequences of the Thue-Morse type and Sarnak’s conjecture. Proc Am Math Soc 144:161–176
Fayad B (2001a) Partially mixing and locally rank 1 smooth transformations and flows on the torus T^d, d ≥ 3. J Lond Math Soc 64(2):637–654
Fayad B (2001b) Polynomial decay of correlations for a class of smooth flows on the two torus. Bull Soc Math Fr 129:487–503
Fayad B (2002a) Skew products over translations on T^d, d ≥ 2. Proc Am Math Soc 130:103–109
Fayad B (2002b) Analytic mixing reparametrizations of irrational flows. Ergodic Theory Dyn Syst 22:437–468
Fayad B (2005) Rank one and mixing differentiable flows. Invent Math 160:305–340
Fayad B (2006) Smooth mixing flows with purely singular spectra. Duke Math J 132:371–391
Fayad B, Kanigowski A (2015) Rigidity times for a weakly mixing dynamical system which are not rigidity times for any irrational rotation. Ergodic Theory Dyn Syst 35:2529–2534
Fayad B, Kanigowski A (2016) Multiple mixing for a class of conservative surface flows. Invent Math 203(2):555–614
Fayad B, Krikorian R (2018) Some questions around quasi-periodic dynamics, arXiv:1809.10375
Fayad B, Thouvenot J-P (2014) On the convergence to 0 of m_n x mod 1. Acta Arith 165:327–332
Fayad B, Windsor A (2007) A dichotomy between discrete and continuous spectrum for a class of special flows over rotations. J Modern Dyn 1:107–122
Fayad B, Katok AB, Windsor A (2001) Mixed spectrum reparameterizations of linear flows on T^2, Dedicated to the memory of I. G. Petrovskii on the occasion of his 100th anniversary. Mosc Math J 1:521–537
Fayad B, Forni G, Kanigowski A (2016) Lebesgue spectrum of countable multiplicity for conservative flows on the torus, submitted
Feldman J (1976) New K-automorphisms and a problem of Kakutani. Isr J Math 24:16–38
Ferenczi S (1984) Systèmes localement de rang un. Ann Inst Henri Poincaré Probab Statist 20:35–51
Ferenczi S (1985) Systèmes de rang un gauche. Ann Inst Henri Poincaré Probab Statist 21:177–186
Ferenczi S (1997) Systems of finite rank. Colloq Math 73:35–65
Ferenczi S, Lemańczyk M (1991) Rank is not a spectral invariant. Stud Math 98:227–230
Ferenczi S, Holton C, Zamboni LQ (2004) Structure of three-interval exchange transformations III: ergodic and spectral properties. J Anal Math 93:103–138
Ferenczi S, Holton C, Zamboni LQ (2005) Joinings of three-interval exchange transformations. Ergodic Theory Dyn Syst 25:483–502
Filipowicz I (1997) Product Z^d-actions on a Lebesgue space and their applications. Stud Math 122:289–298
Flaminio L, Forni G (2003) Invariant distributions and time averages for horocycle flows. Duke Math J 119(3):465–526
Flaminio L, Forni G (2019) Orthogonal powers and Möbius conjecture for smooth time changes of horocycle flows. Electron Res Announc Math Sci 26:16–23
Foiaş C, Stratila S (1968) Ensembles de Kronecker dans la théorie ergodique. C R Acad Sci Paris, Ser A-B 267:A166–A168
Forni G, Kanigowski A (2017) Time-changes of Heisenberg nilflows, preprint, arXiv:1711.05543
Forni G, Kanigowski A (2020) Multiple mixing and disjointness for time changes of bounded-type Heisenberg nilflows. J Éc polytech Math 7:63–91
Forni G, Ulcigrai C (2012) Time-changes of horocycle flows. J Modern Dyn 6(2):251–273
Frączek K (1997) On a function that realizes the maximal spectral type. Stud Math 124:1–7
Frączek K (2000) Circle extensions of Z^d-rotations on the d-dimensional torus. J Lond Math Soc 61(2):139–162
Frączek K, Lemańczyk M (2003) On symmetric logarithm and some old examples in smooth ergodic theory. Fundam Math 180:241–255
Frączek K, Lemańczyk M (2004) A class of special flows over irrational rotations which is disjoint from mixing flows. Ergodic Theory Dyn Syst 24:1083–1095
Frączek K, Lemańczyk M (2005) On disjointness properties of some smooth flows. Fundam Math 185:117–142
Frączek K, Lemańczyk M (2006) On mild mixing of special flows over irrational rotations under piecewise smooth functions. Ergodic Theory Dyn Syst 26:719–738
Frączek K, Lemańczyk M (2010) A note on quasi-similarity of Koopman operators. J Lond Math Soc 82(2):361–375
Frączek K, Lemańczyk M, Lesigne E (2007) Mild mixing property for special flows under piecewise constant functions. Discrete Contin Dyn Syst 19:691–710
Friedman NA, Ornstein DS (1973) Ergodic transformations induce mixing transformations. Adv Math 10:147–163
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem of Diophantine approximation. Math Syst Theory 1:1–49
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton University Press, Princeton
Furstenberg H, Weiss B (1978) The finite multipliers of infinite ergodic transformations. Lect Notes Math 668:127–132
Glasner E (1994) On the class of multipliers for W⊥. Ergodic Theory Dyn Syst 14:129–140
Glasner E (2003) Ergodic theory via joinings. Mathematical surveys and monographs, vol 101. AMS, Providence
Glasner E, Weiss B (1989) Processes disjoint from weak mixing. Trans Am Math Soc 316:689–703
Glimm JG (1960) On a certain class of operator algebras. Trans Am Math Soc 95:318–340
Goodson GR (1999) A survey of recent results in the spectral theory of ergodic dynamical systems. J Dyn Control Syst 5:173–226
Goodson GR, Kwiatkowski J, Lemańczyk M, Liardet P (1992) On the multiplicity function of ergodic group extensions of rotations. Stud Math 102:157–174
Griesmer J (2019) Recurrence, rigidity, and popular differences. Ergodic Theory Dyn Syst 39:1299 (to appear)
Gromov AL (1991) Spectral classification of some types of unitary weighted shift operators. Algebra Anal (Russ) 3:62–87; translation in St. Petersburg Math J 3 (1992):997–1021
Guenais M (1998) Une majoration de la multiplicité spectrale d’opérateurs associés à des cocycles réguliers. Isr J Math 105:263–284
Guenais M (1999) Morse cocycles and simple Lebesgue spectrum. Ergodic Theory Dyn Syst 19:437–446
Guenais M, Parreau F (2005) Valeurs propres de transformations liées aux rotations irrationnelles et aux fonctions en escalier, preprint
Haase M, Moriakov N (2018) On systems with quasi-discrete spectrum. Stud Math 241:173–199
Hahn F, Parry W (1968) Some characteristic properties of dynamical systems with quasi-discrete spectra. Math Syst Theory 2:179–190
Hasselblatt B, Katok AB (2002) Principal Structures. Handbook of dynamical systems, vol 1A. North-Holland, Amsterdam, pp 1–203
Helson H (1986) Cocycles on the circle. J Oper Theory 16:189–199
Helson H, Parry W (1978) Cocycles and spectra. Arkiv Math 16:195–206
Herman M (1979) Sur la conjugaison différentiable des difféomorphismes du cercle à des rotations. Publ Math IHES 49:5–234
Host B (1991) Mixing of all orders and pairwise independent joinings of systems with singular spectrum. Isr J Math 76:289–298
Host B, Méla F, Parreau F (1991) Non-singular transformations and spectral analysis of measures. Bull Soc Math Fr 119:33–90
Indukaev FK (2007) The twisted Burnside theory for the discrete Heisenberg group and for the wreath products of some groups. Mosc Univ Math Bull 62:219–227
Iwanik A (1991) The problem of L^p-simple spectrum for ergodic group automorphisms. Bull Soc Math Fr 119:91–96
Iwanik A (1992) Positive entropy implies infinite L^p-multiplicity for p>1. Ergodic theory and related topics, III (Güstrow, 1990). Lecture notes in mathematics, vol 1514. Springer, Berlin, pp 124–127
Iwanik A (1997) Anzai skew products with Lebesgue component of infinite multiplicity. Bull Lond Math Soc 29:195–199
Iwanik A, de Sam Lazaro J (1991) Sur la multiplicité L^p d’un automorphisme gaussien. C R Acad Sci Paris Sér I 312:875–876
Iwanik A, Lemańczyk M, Rudolph D (1993) Absolutely continuous cocycles over irrational rotations. Isr J Math 83:73–95
Iwanik A, Lemańczyk M, de Sam Lazaro J, de la Rue T (1997) Quelques remarques sur les facteurs des systèmes dynamiques gaussiens. Stud Math 125:247–254
Iwanik A, Lemańczyk M, Mauduit C (1999) Piecewise absolutely continuous cocycles over irrational rotations. J Lond Math Soc 59(2):171–187
Janicki A, Weron A (1994) Simulation and chaotic behavior of α-stable stochastic processes. Monographs and textbooks in pure and applied mathematics, vol 178. Marcel Dekker, New York
Janvresse E, de la Rue T (2004) The Pascal adic transformation is loosely Bernoulli. Ann Inst Henri Poincaré Probab Statist 40:133–139
Janvresse E, de la Rue T, Ryzhikov VV (2012) Around King’s rank one theorems: flows and Z^n-actions. In: Dynamical systems and group actions. Contemporary mathematics, vol 567. American Mathematical Society, Providence, pp 143–161
Janvresse E, Prikhod’ko AA, de la Rue T, Ryzhikov VV (2015) Weak limits of powers of Chacon’s automorphism. Ergodic Theory Dyn Syst 35:128–141
Janvresse E, Roy E, de la Rue T (2017) Poisson suspensions and Sushis. Ann Sci École Norm Supérieure 50(4):1301–1334
Kachurovskii AG (1990) A property of operators generated by ergodic automorphisms. Optimizatsiya (Russ) 47(64):122–125
Kalikow S (1984) Two fold mixing implies three fold mixing for rank one transformations. Ergodic Theory Dyn Syst 4:237–259
Kamae T. Spectral properties of automata generating sequences, unpublished preprint
Kamiński B (1981) The theory of invariant partitions for Z^d-actions. Bull Acad Polon Sci Ser Sci Math 29:349–362
Kanigowski A, de la Rue T (2019) Product of two staircase rank one transformations that is not loosely Bernoulli, accepted in J. d’Analyse Math., arXiv:1812.08027
Kanigowski A, Kułaga-Przymus J (2016) Ratner’s property and mild mixing for smooth flows on surfaces. Ergodic Theory Dyn Syst 36:2512–2537
Kanigowski A, Kułaga-Przymus J, Ulcigrai C (2019) Multiple mixing and parabolic divergence in smooth area-preserving flows on higher genus surfaces. J Eur Math Soc (JEMS) 21(12):3797–3855
Kanigowski A, Lemańczyk M, Ulcigrai C (2018) On disjointness properties of some parabolic flows, preprint, arXiv:1810.11576v1
Katok AB (1977) Monotone equivalence in ergodic theory. Izv Akad Nauk SSSR Ser Math (Russ) 41:104–157
Katok AB (1980) Interval exchange transformations and some special flows are not mixing. Isr J Math 35:301–310
Katok AB (2001) Cocycles, cohomology and combinatorial constructions in ergodic theory. In: Collaboration with Robinson EA Jr, Proceedings of symposium on pure mathematics, vol 69. Smooth ergodic theory and its applications, Seattle, 1999. American Mathematical Society, Providence, pp 107–173
Katok AB (2003a) Constructions in ergodic theory. Unpublished lecture notes
Katok AB (2003b) Combinatorial constructions in ergodic theory and dynamics. University lecture series, vol 30. American Mathematical Society, Providence
Katok A, Lemańczyk M (2009) Some new cases of realization of spectral multiplicity function for ergodic transformations. Fundam Math 206:185–215
Katok AB, Stepin AM (1967) Approximations in ergodic theory. Uspekhi Mat Nauk (Russ) 22(137):81–106
Katok A, Thouvenot J-P (2006) Spectral properties and combinatorial constructions in Ergodic theory. In: Handbook of dynamical systems, vol 1B. Elsevier B. V., Amsterdam, pp 649–743
Katok AB, Zemlyakov AN (1975) Topological transitivity of billiards in polygons. Mat Zametki (Russ) 18:291–300
Keane M (1968) Generalized Morse sequences. Z Wahr Verw Gebiete 10:335–353
Keane M (1975) Interval exchange transformations. Math Z 141:25–31
Khanin KM, Sinai YG (1992) Mixing of some classes of special flows over rotations of the circle. Funct Anal Appl 26:155–169
King J (1986) The commutant is the weak closure of the powers, for rank-1 transformations. Ergodic Theory Dyn Syst 6:363–384
King JL (1988) Joining-rank and the structure of finite rank mixing transformations. J Anal Math 51:182–227
King JL (2001) Flat stacks, joining-closure and genericity, preprint
Kirillov AA (2004) Lectures on the orbit method. Series: Graduate studies in mathematics, vol 64. American Mathematical Society, Providence
Klemes I (1996) The spectral type of the staircase transformation. Tohoku Math J 48:247–248
Klemes I, Reinhold K (1997) Rank one transformations with singular spectral type. Isr J Math 98:1–14
Kočergin AV (1972) On the absence of mixing in special flows over the rotation of a circle and in flows on a two-dimensional torus. Dokl Akad Nauk SSSR (Russ) 205:949–952
Kochergin AV (1975) Mixing in special flows over rearrangement of segments and in smooth flows on surfaces. Math-USSR Acad Sci 25:441–469
Kochergin AV (1976a) Non-degenerate saddles and absence of mixing. Mat Zametki (Russ) 19:453–468
Kochergin AV (1976b) On the homology of function over dynamical systems. Dokl Akad Nauk SSSR 231:795–798
Kochergin AV (2002) A mixing special flow over a rotation of the circle with an almost Lipschitz function. Sb Math 193:359–385
Kochergin AV (2004) Nondegenerate fixed points and mixing in flows on a two-dimensional torus. II. Mat Sb (Russ) 195:15–46
Koopman BO (1931) Hamiltonian systems and transformations in Hilbert space. Proc Natl Acad Sci USA 17:315–318
Kuipers L, Niederreiter H (1974) Uniform distribution of sequences. Wiley, New York
Kułaga J, Parreau F (2012) Disjointness properties for Cartesian products of weakly mixing systems. Colloq Math 128:153–177
Kułaga-Przymus J, Lemańczyk M (2019) Sarnak’s conjecture from ergodic theory point of view. In: Encyclopedia of complexity and system science
Kushnirenko AG (1974) Spectral properties of some dynamical systems with polynomial divergence of orbits. Vestn Mosk Univ 1–3:101–108
Kwiatkowski J (1981) Spectral isomorphism of Morse dynamical systems. Bull Acad Polon Sci Sér Sci Math 29:105–114
Kwiatkowski J Jr, Lemańczyk M (1995) On the multiplicity function of ergodic group extensions. II. Stud Math 116:207–215
Kwiatkowski J, Lacroix Y (1997) Multiplicity, rank pairs. J Anal Math 71:205–235
Lemańczyk M (1988) Toeplitz Z_2-extensions. Ann Inst Henri Poincaré 24:1–43
Lemańczyk M (1996) Introduction to ergodic theory from Maruyama G (1970) Infinitely divisible processes. Theory
the point of view of the spectral theory. Lecture notes of Probab Appl 15(1):1–22
the tenth KAIST mathematics workshop, Taejon, Masur H (1982) Interval exchange transformations and
pp 1–153 measured foliations. Ann Math 115:169–200
Lemańczyk M (2000) Sur l’absence de mélange pour des Mathew J, Nadkarni MG (1984) Measure-preserving trans-
flots spéciaux au dessus d’une rotation irrationnelle. formation whose spectrum has Lebesgue component of
Colloq Math 84(85):29–41 multiplicity two. Bull Lond Math Soc 16:402–406
Lemańczyk M (2009) Spectral theory of dynamical sys- Medina H (1994) Spectral types of unitary operators aris-
tems. In: Encyclopedia of complexity and system sci- ing from irrational rotations on the circle group. Mich
ence. Springer, New York, pp 8554–8575 Math J 41:39–49
Lemańczyk M, de Sam Lazaro J (1997) Spectral analysis of Mentzen MK (1988) Some examples of automorphisms
certain compact factors for Gaussian dynamical sys- with rank r and simple spectrum. Bull Pol Acad Sci
tems. Isr J Math 98:307–328 7–8:417–424
Lemańczyk M, Lesigne E (2001) Ergodicity of Rokhlin Nadkarni MG (1998) Spectral theory of dynamical sys-
cocycles. J Anal Math 85:43–86 tems. Hindustan Book Agency, New Delhi
Lemańczyk M, Mauduit C (1994) Ergodicity of a class of Newton D (1966) On Gaussian processes with simple
cocycles over irrational rotations. J Lond Math Soc spectrum. Z Wahr Verw Gebiete 5:207–209
49:124–132 Newton D, Parry W (1966) On a factor automorphism of a
Lemańczyk M, Parreau F (2003) Rokhlin extensions and normal dynamical system. Ann Math Statist
lifting disjointness. Ergodic Theory Dyn Syst 37:1528–1533
23:1525–1550 Ornstein D (1970) On the root problem in ergodic theory.
Lemańczyk M, Parreau F (2007) Special flows over irratio- In: Proceedings of 6th Berkeley symposium on mathe-
nal rotation with simple convolution property, preprint matical statistics and probability. University California
Lemańczyk M, Parreau F (2012) Lifting mixing properties Press, Berkeley, pp 348–356
by Rokhlin cocycles. Ergodic Theory Dyn Syst Ornstein D, Weiss B (1987) Entropy and isomorphism
32:763–784 theorems for actions of amenable groups. J Anal Math
Lemańczyk M, Sikorski A (1987) A class of not local rank 48:1–141
one automorphisms arising from continuous substitu- Ornstein D, Rudolph D, Weiss B (1982) Equivalence of
tions. Probab Theory Relat Fields 76:421–428 measure preserving transformations. Mem Am Math
Lemańczyk M, Wasieczko M (2006) A new proof of Soc 37(262):1–116
Alexeyev’s theorem, preprint Parreau F (2000) On the Foiaş and Stratila theorem. In:
Lemańczyk M, Wysokińska M (2007) On analytic flows Proceedings of the conference on ergodic theory, Toruń
on the torus which are disjoint from systems of proba- Parreau F, Roy E (2015) Prime Poisson suspensions. Ergo-
bilistic origin. Fundam Math 195:97–124 dic Theory Dyn Syst 35:2216–2230
Lemańczyk M, Parreau F, Thouvenot J-P (2000) Gaussian Parry W (1970) Spectral analysis of G-extensions of
automorphisms whose ergodic self–joinings are Gauss- dynamical systems. Topology 9:217–224
ian. Fundam Math 164:253–293 Parry W (1981) Topics in ergodic theory. Cambridge tracts
Lemańczyk M, Thouvenot J-P, Weiss B (2002) Relative in mathematics, vol 75. Cambridge University Press,
discrete spectrum and joinings. Monatsh Math Cambridge/New York
137:57–75 Petersen K (1983) Ergodic theory. Cambridge University
Lemańczyk M, Mentzen MK, Nakada H (2003) Semi- Press, Cambridge
simple extensions of irrational rotations. Stud Math Prikhodko AA (2013) Littlewood polynomials and their
156:31–57 applications to the spectral theory of dynamical sys-
Lemańczyk M, Parreau F, Roy E (2011) Systems with tems. Mat Sb 204:135–160, translation in Sb Math
simple convolutions, distal simplicity and disjointness 204 (2013)
with infinitely divisible systems. Proc Am Math Soc Prikhodko AA, Ryzhikov VV (2000) Disjointness of the
139:185–199 convolutions for Chacon’s automorphism. Dedicated to
Leonov VP (1960) The use of the characteristic functional the memory of Anzelm Iwanik. Colloq Math
and semi–invariants in the ergodic theory of stationary 84(85):67–74
processes. Dokl Akad Nauk SSSR 133:523–526. Sov Quefelec M (1988) Substitution dynamical systems – spec-
Math 1 (1960):878–881 tral analysis. Lecture notes in mathematics, vol 1294.
Lightwood S, Sahin A, Ugarcovici I (2014) The structure Springer, Berlin
and the spectrum of Heisenberg odometers. Proc Am Ratner M (1978) Horocycle flows are loosely Bernoulli. Isr
Math Soc 142:2429–2443 J Math 31:122–132
Mackey GW (1964) Ergodic transformation groups with a Ratner M (1979) The Cartesian square of the horocycle
pure point spectrum. Ill J Math 8:593–600 flow is not loosely Bernoulli. Isr J Math 34(1–2):72–96
Marcus B (1977) Ergodic properties of horocycle flows for Ratner M (1983) Horocycle flows, joinings and rigidity of
surfaces of negative curvature. Ann Math 105(2):81–105 products. Ann Math 118:277–313
Ratner M (1986) Rigidity of time changes for horocycle Ryzhikov VV (1994a) The absence of mixing in special
flows. Acta Math 156:1–32 flows over rearrangements of segments. Mat Zametki
Ratner M (1987) Rigid reparametrizations and cohomol- (Russ) 55:146–149
ogy for horocycle flows. Invent Math 88(2):341–374 Ryzhikov VV (1994b) Skew products and multiple mixing
Rauzy G (1979) Echanges d’intervalles et transformations of dynamical systems. Russ Math Surv 49:170–171
induites. Acta Arith 34:315–328 Ryzhikov VV (1996) Stochastic intertwinings and multiple
Ravotti D (2017) Quantitative mixing for locally Hamilto- mixing of dynamical systems. J Dyn Control Syst
nian flows with saddle loops on compact surfaces. Ann 2:1–19
Henri Poincaré 18(12):3815–3861 Ryzhikov VV (1999) Transformations having homoge-
Ravotti D (2019) Mixing for suspension flows over skew- neous spectra. J Dyn Control Syst 5:145–148
translations and time-changes of quasi-abelian filiform Ryzhikov VV (2000) The Rokhlin problem on multiple
nilflows. Ergodic Theory Dyn Syst, published online at mixing in the class of actions of positive local rank.
https://doi.org/10.1017/etds.2018.19 Funktsional Anal Prilozhen (Russ) 34:90–93
Robinson EA Jr (1983) Ergodic measure preserving trans- Ryzhikov VV (2007) Weak limits of powers, simple spec-
formations with arbitrary finite spectral multiplicities. trum symmetric products and mixing rank one con-
Invent Math 72:299–314 structions. Math Sb 198:733–754
Robinson EA Jr (1986) Transformations with highly non- Ryzhikov VV (2009) Spectral multiplicities and asymp-
homogeneous spectrum of finite multiplicity. Isr J Math totic operator properties of actions with invariant mea-
56:75–88 sure. Sb Math 200:1833–1845
Robinson EA Jr (1988) Nonabelian extensions have non- Ryzhikov VV (2014) On spectral multiplicities of Gauss-
simple spectrum. Compos Math 65:155–170 ian actions, arXiv 1406.3321
Robinson EA Jr (1992) A general condition for lifting Ryzhikov VV, Thouvenot J-P (2006) Disjointness, divisi-
theorems. Trans Am Math Soc 330:725–755 bility, and quasi-simplicity of measure-preserving
Rosiński J, Żak T (1996) Simple condition for mixing of actions. Funktsional Anal Prilozhen (Russ) 40:85–89
infinitely divisible processes. Stoch Process Appl Ryzhikov VV, Troitskaya AE (2016) Mixing flows with a
61:277–288 homogeneous spectrum of multiplicity 2. Fundam Prikl
Rosiński J, Żak T (1997) The equivalence of ergodicity and Mat (Russ) 21:191–197
weak mixing for infinitely divisible processes. J Theor Sato K-I (1999) Lévy processes and infinitely divisible
Probab 10:73–86 distributions. Cambridge University Press, Cambridge
Roy E (2005) Mesures de Poisson, infinie divisibilité et Schmidt K (1977) Cocycles of ergodic transformation
propriétés ergodiques. Thèse de doctorat de l’Université groups. Lecture notes in math, vol 1. Mac Millan of
Paris 6 India, New Delhi
Roy E (2007) Ergodic properties of Poissonian ID pro- Schmidt K (2002) Dispersing cocycles and mixing flows
cesses. Ann Probab 35:551–576 under functions. Fundam Math 173:191–199
Royden HL (1968) Real analysis. MacMillan, New York Schmidt K, Walters P (1982) Mildly mixing actions of
Rudin W (1962) Fourier analysis on groups. In: locally compact groups. Proc Lond Math Soc
Interscience tracts in pure and applied mathematics, 45(3):506–518
vol 12. Interscience Publishers (a division of John Shklover M (1967) Classical dynamical systems on the
Wiley and Sons), New York/London torus with continuous spectrum. Izv Vys Ucebn Zaved
Rudolph DJ (1979) An example of a measure-preserving Mat (Russ) 10(65):113–124
map with minimal self-joinings and applications. Simonelli LD (2018) Absolutely continuous spectrum for
J Anal Math 35:97–122 parabolic flows/maps. Discrete Contin Dyn Syst
Rudolph D (1985) k–fold mixing lifts to weakly mixing 38(1):263–292
isometric extensions. Ergodic Theory Dyn Syst Sinai YG (1994) Topics in ergodic theory. Princeton Uni-
5:445–447 versity Press, Princeton
Rudolph D (1986) Zn and Rn cocycle extension and comple- Smorodinsky M, Thouvenot J-P (1979) Bernoulli factors
mentary algebras. Ergodic Theory Dyn Syst 6:583–599 that span a transformation. Isr J Math 32:39–43
Rudolph D (1990) Fundamentals of measurable dynamics. Solomko AV (2012) New spectral multiplicities for ergodic
Oxford Science Publications, Oxford actions. Stud Math 208:229–247
Rudolph D (2004) Pointwise and L1 mixing relative to a Stepin AM (1986) Spectral properties of generic dynamical
sub-sigma algebra. Ill J Math 48:505–517 systems. Izv Akad Nauk SSSR Ser Math (Russ)
Rudolph D, Weiss B (2000) Entropy and mixing for ame- 50:801–834
nable group actions. Ann Math 151(2):1119–1150 Thoma E (1964) Über unitäre Darstellungen abzählbarer,
Ryzhikov VV (1991) Joinings of dynamical systems. diskreter Gruppen. Math Ann 153:111–138
Approximations and mixing. Uspekhi Mat Nauk Thouvenot J-P (1995) Some properties and applications of
(Russ) 46(5(281)):177–178 joinings in ergodic theory. In: Ergodic theory and its
Ryzhikov VV (1992) Mixing, rank and minimal self- connections with harmonic analysis. London Mathe-
joining of actions with invariant measure. Mat Sb matical Society. Cambridge University Press, Cam-
(Russ) 183:133–160 bridge, pp 207–235
Thouvenot J-P (2000) Les systèmes simples sont disjoints Veech WA (1982a) A criterion for a process to be prime.
de ceux qui sont infiniment divisibles et plongeables Monatsh Math 94:335–341
dans un flot. Colloq Math 84(85):481–483 Veech W (1982b) Gauss measures for transformations on
Tiedra de Aldecoa R (2012) Spectral analysis of time- the space of interval exchange maps. Ann Math
changes of the horocycle flow. J Modern Dyn 115(2):201–242
6(2):275–285 Veech W (1984) The metric theory of interval exchange
Tiedra de Aldecoa R (2015a) The absolute continuous transformations. I. Generic spectral properties. Am
spectrum of skew products of compact lie groups. Isr J Math 106:1331–1359
J Math 208:323–350 Vershik AM (1962a) On the theory of normal dynamic
Tiedra de Aldecoa R (2015b) Commutator methods for the systems. Math Sov Dokl 144:625–628
spectral analysis of uniquely ergodic dynamical sys- Vershik AM (1962b) Spectral and metric isomorphism of
tems. Ergodic Theory Dyn Syst 35:944–967 some normal dynamical systems. Math Sov Dokl
Tiedra de Aldecoa R (2017) Commutator criteria for strong 144:693–696
mixing. Ergodic Theory Dyn Syst 37:308–323 Vershik AM (2005) Polymorphisms, Markov processes,
Tikhonov SV (2002) On a relation between the metric and and quasi-similarity. Discrete Contin Dyn Syst
spectral properties of Zd-actions. Fundam Prikl Mat 13:1305–1324
8:1179–1192 Vershik AM (2011) The Pascal automorphism has a con-
Tikhonov SV (2011) Mixing transformations with homo- tinuous spectrum. Funktsional Anal Prilozhen (Russ)
geneous spectrum. Mat Sb 202(8):139–160; English 45:16–33; translation in Funct Anal Appl
transl in Sb Math 202(8) (2011):1231–1252 45 (2011):173–186
Tikhonov SV (2012) Genericity of a multiple mixing. von Neumann J (1932) Zur Operatorenmethode in der
Uspekhi Mat Nauk (Russ) 67(4(406)):187–188; klassischen Mechanik. Ann Math 33:587–642
translation in Russian Math Surv 67(4) (2012): Walters P (1982) An introduction to ergodic theory.
779–780 Springer, New York
Tikhonov SV (2013) Complete metric on mixing actions of Wysokinska M (2004) A class of real cocycles over an
general groups. J Dyn Control Syst 19:17–31 irrational rotation for which Rokhlin cocycle exten-
Todd JA (1950) On a conjecture in group theory. J Lond sions have Lebesgue component in the spectrum.
Math Soc 25:246 Topol Methods Nonlinear Anal 24:387–407
Ulcigrai C (2007) Mixing of asymmetric logarithmic sus- Wysokińska M (2007) Ergodic properties of skew products
pension flows over interval exchange transformations. and analytic special flows over rotations. PhD thesis,
Ergodic Theory Dyn Syst 27:991–1035 Toruń
Ulcigrai C (2011) Absence of mixing in area-preserving Yassawi R (2003) Multiple mixing and local rank group
flows on surfaces. Ann Math 173(3):1743–1778 actions. Ergodic Theory Dyn Syst 23:1275–1304
Veech W (1978) Interval exchange transformations. J Anal Zeitz P (1993) The centralizer of a rank one flow. Isr J Math
Math 33:222–272 84:129–145
Joinings in Ergodic Theory

Thierry de la Rue
Laboratoire de Mathématiques Raphaël Salem, CNRS – Université de Rouen, Saint Étienne du Rouvray, France

Measure-preserving dynamical system · Joining · Disjointness · Factor · Minimal self-joining

Glossary

Disjoint Measure-Preserving Systems
The two measure-preserving dynamical systems $(X, \mathcal{A}, \mu, T)$ and $(Y, \mathcal{B}, \nu, S)$ are said to be disjoint if their only joining is the product measure $\mu \otimes \nu$.

Joining
Let I be a finite or countable set, and for each $i \in I$, let $(X_i, \mathcal{A}_i, \mu_i, T_i)$ be a measure-preserving dynamical system. A joining of these systems is a probability measure on the Cartesian product $\prod_{i \in I} X_i$, which has the $\mu_i$'s as marginals and which is invariant under the product transformation $\bigotimes_{i \in I} T_i$.

Marginal of a Probability Measure on a Product Space
Let $\lambda$ be a probability measure on the Cartesian product of a finite or countable collection of measurable spaces $\left(\prod_{i \in I} X_i, \bigotimes_{i \in I} \mathcal{A}_i\right)$, and let $J = \{j_1, \ldots, j_k\}$ be a finite subset of I. The k-fold marginal of $\lambda$ on $X_{j_1}, \ldots, X_{j_k}$ is the probability measure $\mu$ defined by:

$$\forall A_1 \in \mathcal{A}_{j_1}, \ldots, A_k \in \mathcal{A}_{j_k}, \quad \mu(A_1 \times \cdots \times A_k) := \lambda\left(\left\{ x \in \prod_{i \in I} X_i : x_{j_1} \in A_1, \ldots, x_{j_k} \in A_k \right\}\right).$$

Markov Intertwining
Let $(X, \mathcal{A}, \mu, T)$ and $(Y, \mathcal{B}, \nu, S)$ be two measure-preserving dynamical systems. A Markov intertwining of T and S is an operator $P : L^2(X, \mu) \to L^2(Y, \nu)$ with the following properties:
• $P U_T = U_S P$, where $U_T$ and $U_S$ are the unitary operators on $L^2(X, \mu)$ and $L^2(Y, \nu)$ associated, respectively, with T and S (i.e., $U_T f(x) = f(Tx)$, and $U_S g(y) = g(Sy)$).
• $P \mathbb{1}_X = \mathbb{1}_Y$.
• $f \ge 0$ implies $Pf \ge 0$, and $g \ge 0$ implies $P^* g \ge 0$, where $P^*$ is the adjoint operator of P.

Minimal Self-Joinings
Let $k \ge 2$ be an integer. The ergodic measure-preserving dynamical system T has k-fold minimal self-joinings if, for any ergodic joining $\lambda$ of k copies of T, we can partition the set $\{1, \ldots, k\}$ of coordinates into subsets $J_1, \ldots, J_\ell$ such that:
1. For $j_1$ and $j_2$ belonging to the same $J_i$, the marginal of $\lambda$ on the coordinates $j_1$ and $j_2$ is supported on the graph of $T^n$ for some integer n (depending on $j_1$ and $j_2$).
2. For $j_1 \in J_1, \ldots, j_\ell \in J_\ell$, the coordinates $j_1, \ldots, j_\ell$ are independent.
We say that T has minimal self-joinings if T has k-fold minimal self-joinings for every $k \ge 2$.

Off-Diagonal Self-Joinings
Let $(X, \mathcal{A}, \mu, T)$ be a measure-preserving dynamical system and S be an invertible measure-preserving transformation of $(X, \mathcal{A}, \mu)$ commuting with T. Then the probability measure $\Delta_S$ defined on $X \times X$ by

$$\Delta_S(A \times B) := \mu\left(A \cap S^{-1}B\right) \tag{1}$$

is a twofold self-joining of T supported on the graph of S. We call it an off-diagonal self-joining of T.

Process in a Measure-Preserving Dynamical System
Let $(X, \mathcal{A}, \mu, T)$ be a measure-preserving dynamical system, and let $(E, \mathcal{B}(E))$ be a measurable space (which may be a finite or countable set, or $\mathbb{R}^d$, or $\mathbb{C}^d$, ...). For any E-valued random variable $\xi$ defined on the probability space $(X, \mathcal{A}, \mu)$, we can consider the stochastic process $(\xi_i)_{i \in \mathbb{Z}}$ defined by

$$\xi_i := \xi \circ T^i.$$

Since T preserves the probability measure $\mu$, $(\xi_i)_{i \in \mathbb{Z}}$ is a stationary process: For any $\ell$ and n, the distribution of $(\xi_0, \ldots, \xi_\ell)$ is the same as the probability distribution of $(\xi_n, \ldots, \xi_{n+\ell})$.

Self-Joining
Let T be a measure-preserving dynamical system. A self-joining of T is a joining of a family $(X_i, \mathcal{A}_i, \mu_i, T_i)_{i \in I}$ of systems where each $T_i$ is a copy of T. If I is finite and has cardinal k, we speak of a k-fold self-joining of T.

Simplicity
For $k \ge 2$, we say that the ergodic measure-preserving dynamical system T is k-fold simple if, for any ergodic joining $\lambda$ of k copies of T, we can partition the set $\{1, \ldots, k\}$ of coordinates into subsets $J_1, \ldots, J_\ell$ such that:
1. For $j_1$ and $j_2$ belonging to the same $J_i$, the marginal of $\lambda$ on the coordinates $j_1$ and $j_2$ is supported on the graph of some $S \in C(T)$ (depending on $j_1$ and $j_2$).
2. For $j_1 \in J_1, \ldots, j_\ell \in J_\ell$, the coordinates $j_1, \ldots, j_\ell$ are independent.
We say that T is simple if T is k-fold simple for every $k \ge 2$.

Definition of the Subject

The word joining can be considered as the counterpart in ergodic theory of the notion of coupling in probability theory (see, e.g., Thorisson 2000): Given two or more processes defined on different spaces, what are the possibilities of embedding them together in the same space? There always exists the solution of making them independent of each other, but interesting cases arise when we can do this in other ways. The notion of joining originates in ergodic theory from pioneering works of Furstenberg (1967), who introduced the fundamental notion of disjointness, and Rudolph, who laid the basis of joining theory in his article on minimal self-joinings (Rudolph 1979). It has today become an essential tool in the classification of measure-preserving dynamical systems and in the study of their intrinsic properties.

Introduction

A central question in ergodic theory is to tell when two measure-preserving dynamical systems are essentially the same, i.e., when they are isomorphic. When this is not the case, a finer analysis consists in asking what these two systems could share in common: For example, do there exist stationary processes which can be observed in both systems? This latter question can also be asked in the following equivalent way: Do these two systems have a common factor? The arithmetical flavor of this question is not fortuitous: There are deep analogies between the arithmetic of integers and the classification of measure-preserving dynamical systems, and these analogies were at the starting point of the study of joinings in ergodic theory.
In the seminal paper (Furstenberg 1967) which introduced the concept of joinings in ergodic theory, Furstenberg observed that two operations can be done with dynamical systems: We can consider the product of two dynamical systems, and we can also take a factor of a given system. Like the multiplication of integers, the product of dynamical systems is commutative and associative, it possesses a neutral element (the trivial single-point system), and the systems S and T are both factors of their product $S \times T$. It was then natural to introduce the property for two measure-preserving systems to be relatively prime. As far as integers are concerned, there are two equivalent ways of characterizing relative primeness: First, the integers a and b are relatively prime if their only positive common factor is 1. Second, a and b are relatively prime if, each time both a and b are factors of an integer c, their product ab is also a factor of c. It is a well-known theorem in number theory that these two properties are equivalent, but this was not clear for their analogs in ergodic theory. Furstenberg reckoned that the second way of defining relative primeness was the most interesting property in ergodic theory and called it disjointness of measure-preserving systems (we will discuss precisely in section “From Disjointness to Isomorphy” what the correct analog is in the setting of ergodic theory). He also asked whether the nonexistence of a nontrivial common factor between two systems was equivalent to their disjointness. He was able to prove that disjointness implies the impossibility of a nontrivial common factor, but not the converse. And in fact, the converse turns out to be false: In 1979, Rudolph exhibited a counterexample in his paper introducing the important notion of minimal self-joinings. The relationships between disjointness and the lack of a common factor will be presented in detail in section “Joinings and Factors.”

Given two measure-preserving dynamical systems S and T, the study of their disjointness naturally leads one to consider all the possible ways these two systems can both be seen as factors of a third system. As we shall see, this is precisely the study of their joinings. The concept of joining turns out to be related to many important questions in ergodic theory, and a large number of deep results can be stated and proved inside the theory of joinings. For example, the fact that the dynamical systems S and T are isomorphic is equivalent to the existence of a special joining between S and T, and this can be used to give a joining proof of Krieger’s finite generator theorem, as well as Ornstein’s isomorphism theorem (see section “Joinings Proofs of Ornstein’s and Krieger’s Theorems”). As it already appears in Furstenberg’s article, joinings provide a powerful tool in the classification of measure-preserving dynamical systems: Many classes of systems can be characterized in terms of their disjointness with other systems. Joinings are also strongly connected with difficult questions arising in the study of almost everywhere convergence of nonconventional averages (see section “Joinings and Multiple Ergodic Averages”).

Amazingly, a situation in which the study of joinings leads to most interesting results consists in considering two or more identical systems. We then speak of the self-joinings of the dynamical system T. Again, the study of self-joinings is closely related to many ergodic properties of the system: its mixing properties, the structure of its factors, the transformations which commute with T, and so on. We already mentioned minimal self-joinings, and we will see in section “Minimal Self-Joinings” how this property may be used to get many interesting examples, such as a transformation with no root, or a process with no nontrivial factor. In the same section, we will also discuss a very interesting generalization of minimal self-joinings: the property of being simple.

The range of applications of joinings in ergodic theory is very large; only some of them will be given in section “Some Applications and Future Directions”: the use of joinings in proving Krieger’s and Ornstein’s theorems, the links between joinings and some questions of pointwise convergence, and the strong connections between the study of self-joinings and Rohlin’s famous question on multifold mixing, which has been open since 1949 (Rohlin 1949).
In parallel with the concept of joinings of measure-preserving systems, Furstenberg also introduced in Furstenberg (1967) the notion of topological joinings, concerning topological dynamical systems (i.e., systems given by a continuous transformation of a compact metric space). In the present article, we have restricted ourselves to the measure-preserving setting. In addition to Furstenberg’s paper, applications of topological joinings can be found, for example, in Furstenberg et al. (1973), King (1990), and Weiss (1998).

Joinings of Two or More Dynamical Systems

In the following, we are given a finite or countable family $(X_i, \mathcal{A}_i, \mu_i, T_i)_{i \in I}$ of measure-preserving dynamical systems: $T_i$ is an invertible measure-preserving transformation of the standard Borel probability space $(X_i, \mathcal{A}_i, \mu_i)$. When it is not ambiguous, we shall often use the symbol $T_i$ to denote both the transformation and the system.

A joining $\lambda$ of the $T_i$'s (see the definition in the “Glossary”) defines a new measure-preserving dynamical system: the product transformation

$$\bigotimes_{i \in I} T_i : (x_i)_{i \in I} \mapsto (T_i x_i)_{i \in I}$$

acting on the Cartesian product $\prod_{i \in I} X_i$ and preserving the probability measure $\lambda$. We will denote this big system by $\left(\bigotimes_{i \in I} T_i\right)_\lambda$. Since all marginals of $\lambda$ are given by the original probabilities $\mu_i$, observing only the coordinate i in the big system is the same as observing only the system $T_i$. Thus, each system $T_i$ is a factor of $\left(\bigotimes_{i \in I} T_i\right)_\lambda$, via the homomorphism $\pi_i$ which maps any point in the Cartesian product to its i-th coordinate.

Conversely, if we are given a measure-preserving dynamical system $(Z, \mathcal{C}, \rho, R)$ admitting each $T_i$ as a factor via some homomorphism $\varphi_i : Z \to X_i$, then we can construct the map $\varphi : Z \to \prod_{i \in I} X_i$ sending z to $(\varphi_i(z))_{i \in I}$. We can easily check that the image of the probability measure $\rho$ is then a joining of the $T_i$'s. Therefore, studying the joinings of a family of measure-preserving dynamical systems amounts to studying all the possible ways these systems can be seen together as factors in another big system.

The Set of Joinings
The set of all joinings of the $T_i$'s will be denoted by $J(T_i, i \in I)$. Before anything else, we have to observe that this set is never empty. Indeed, whatever the systems are, the product measure $\bigotimes_{i \in I} \mu_i$ always belongs to this set. Note also that any convex combination of joinings is a joining: $J(T_i, i \in I)$ is a convex set.

The set of joinings is turned into a compact metrizable space, equipped with the topology defined by the following notion of convergence: $\lambda_n \to \lambda$ as $n \to \infty$ if and only if, for every family of measurable subsets $(A_i)_{i \in I} \in \prod_{i \in I} \mathcal{A}_i$, finitely many of them being different from $X_i$, we have

$$\lambda_n\left(\prod_{i \in I} A_i\right) \xrightarrow[n \to \infty]{} \lambda\left(\prod_{i \in I} A_i\right). \tag{2}$$

We can easily construct a distance defining this topology by observing that it is enough to check (2) when each of the $A_i$'s is chosen in some countable algebra $\mathcal{C}_i$ generating the $\sigma$-algebra $\mathcal{A}_i$. We can also point out that, when the $X_i$'s are themselves compact metric spaces, this topology on the set of joinings is nothing but the restriction to $J(T_i, i \in I)$ of the usual weak* topology.

It is particularly interesting to study ergodic joinings of the $T_i$'s, whose set will be denoted by $J_e(T_i, i \in I)$. Since any factor of an ergodic system is itself ergodic, a necessary condition for $J_e(T_i, i \in I)$ not to be empty is that all the $T_i$'s be themselves ergodic. Conversely, if all the $T_i$'s are ergodic, we can prove by considering the ergodic decomposition of the product measure $\bigotimes_{i \in I} \mu_i$ that ergodic joinings do exist: Any ergodic measure appearing in the ergodic decomposition of some joining has to be itself a joining. This result can also be stated in the following way:
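
The basic facts about the set of joinings recalled in this section (it contains the product measure, it is convex, and a measure carried by the graph of a transformation commuting with T is a self-joining) can be checked by hand on a finite system. The following Python sketch is an illustration that is not taken from the article: it assumes the rotation by one step on Z/4 with the uniform measure, and the helper names `product_measure`, `graph_measure`, `mix`, and `is_joining` are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Assumed toy system (not from the article): rotation by one step on Z/4
# with the uniform probability measure.
n = 4
T = [(x + 1) % n for x in range(n)]
T2 = [T[T[x]] for x in range(n)]          # T^2 commutes with T
mu = [Fraction(1, n)] * n


def product_measure(mu, nu):
    """Point masses of the product measure of mu and nu."""
    return {(x, y): mu[x] * nu[y]
            for x, y in product(range(len(mu)), range(len(nu)))}


def graph_measure(S, mu):
    """Point masses of Delta_S(A x B) = mu(A ∩ S^{-1}B), carried by the graph of S."""
    m = len(mu)
    return {(x, y): (mu[x] if S[x] == y else Fraction(0))
            for x, y in product(range(m), repeat=2)}


def mix(lam1, lam2, t):
    """Convex combination t*lam1 + (1 - t)*lam2 of two measures."""
    return {key: t * lam1[key] + (1 - t) * lam2[key] for key in lam1}


def is_joining(lam, T, S, mu, nu):
    """True if lam has marginals mu and nu and is invariant under T x S."""
    X, Y = range(len(mu)), range(len(nu))
    marginal_x = [sum(lam[(x, y)] for y in Y) for x in X]
    marginal_y = [sum(lam[(x, y)] for x in X) for y in Y]
    # T and S are bijections of finite sets, so invariance can be tested on point masses.
    invariant = all(lam[(T[x], S[y])] == lam[(x, y)] for x, y in product(X, Y))
    return marginal_x == list(mu) and marginal_y == list(nu) and invariant


independent = product_measure(mu, mu)
off_diagonal = graph_measure(T2, mu)
mixture = mix(independent, off_diagonal, Fraction(1, 3))
swap = [1, 0, 2, 3]                        # preserves mu but does not commute with T
not_invariant = graph_measure(swap, mu)

for name, lam in [("product", independent), ("graph of T^2", off_diagonal),
                  ("mixture", mixture), ("graph of swap", not_invariant)]:
    print(name, is_joining(lam, T, T, mu, mu))
```

In this toy setting the first three measures pass the test and the last one fails: the graph of `swap` has the right marginals, but it is not invariant under the product transformation because `swap` does not commute with the rotation. Exact rational arithmetic (`Fraction`) is used so that the equality tests are not affected by rounding.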
$$\{X, \emptyset\} \otimes \mathcal{B} \overset{\Delta_\pi}{\subset} \mathcal{A} \otimes \{Y, \emptyset\}. \tag{5}$$

The existence of a joining satisfying (5) is a criterion for S being a factor of T.

For more details on the results stated in this section, we refer the reader to de la Rue (2006b).

Joinings and Factors

The purpose of this section is to investigate the relationships between the disjointness of two systems S and T and the lack of a common factor. The crucial fact which was pointed out by Furstenberg is that the existence of a common factor enables one to construct a very special joining of S and T: the relatively independent joining over this factor.

Let us assume that our systems S and T share a common factor $(Z, \mathcal{C}, \rho, R)$, which means that we have measurable onto maps $\pi_X : X \to Z$ and $\pi_Y : Y \to Z$, respectively, sending $\mu$ and $\nu$ to $\rho$ and satisfying $\pi_X \circ T = R \circ \pi_X$ and $\pi_Y \circ S = R \circ \pi_Y$. We can then consider the joinings supported on their graphs $\Delta_{\pi_X} \in J(T, R)$ and $\Delta_{\pi_Y} \in J(S, R)$, as defined in the preceding section. Next, we construct a joining $\lambda$ of the three systems S, T, and R. Heuristically, $\lambda$ is the probability distribution of the triple (x, y, z) when we first pick z according to the probability distribution $\rho$, then x and y according to their conditional distribution knowing z in the respective joinings $\Delta_{\pi_X}$ and $\Delta_{\pi_Y}$, but independently of each other (Fig. 2). More precisely, $\lambda$ is defined by setting, for all $A \in \mathcal{A}$, $B \in \mathcal{B}$, and $C \in \mathcal{C}$,

$$\lambda(A \times B \times C) := \int_C \mathbb{E}_{\Delta_{\pi_X}}\left[\mathbb{1}_{x \in A} \mid z\right] \, \mathbb{E}_{\Delta_{\pi_Y}}\left[\mathbb{1}_{y \in B} \mid z\right] \, d\rho(z). \tag{6}$$

Joinings in Ergodic Theory, Fig. 2 The relatively independent joining $\mu \otimes_R \nu$ and its disintegration over z

In other words, we have identified in the two systems T and S the projections on their common factor R. The twofold marginal of $\lambda$ on $X \times Y$ is itself a joining of T and S, which we call the relatively independent joining over the common factor R. This joining will be denoted by $\mu \otimes_R \nu$. (Be careful: The projections $\pi_X$ and $\pi_Y$ are hidden in this notation, but we have to know them to define this joining.) From (6), we immediately get the formula defining $\mu \otimes_R \nu$:

$$\forall A \in \mathcal{A}, \ \forall B \in \mathcal{B}, \quad \mu \otimes_R \nu(A \times B) := \int_Z \mathbb{E}_{\Delta_{\pi_X}}\left[\mathbb{1}_{x \in A} \mid z\right] \, \mathbb{E}_{\Delta_{\pi_Y}}\left[\mathbb{1}_{y \in B} \mid z\right] \, d\rho(z). \tag{7}$$

This definition of the relatively independent joining over a common factor can easily be extended to a finite or countable family of systems sharing the same common factor.

Note that $\mu \otimes_R \nu$ coincides with the product measure $\mu \otimes \nu$ if and only if the common factor is the trivial one-point system. We therefore get the following result:
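
On finite spaces, formula (7) reduces to a finite sum, which makes the relatively independent joining easy to compute explicitly. The Python sketch below is an illustration added here under stated assumptions, not a construction from the article: T, S, and R are taken to be the rotations by one step on Z/4, Z/6, and Z/2 with uniform measures, the factor maps are reduction mod 2, and the function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Assumed toy systems (not from the article): rotations by one step on Z/4 (T),
# Z/6 (S) and Z/2 (R), each with its uniform measure; the factor maps reduce mod 2.
mu = [Fraction(1, 4)] * 4
nu = [Fraction(1, 6)] * 6
rho = [Fraction(1, 2)] * 2
T = [(x + 1) % 4 for x in range(4)]
S = [(y + 1) % 6 for y in range(6)]
pi_X = [x % 2 for x in range(4)]
pi_Y = [y % 2 for y in range(6)]


def relatively_independent_joining(mu, nu, rho, pi_X, pi_Y):
    """Point masses of the relatively independent joining, following formula (7)."""
    lam = {}
    for x, y in product(range(len(mu)), range(len(nu))):
        mass = Fraction(0)
        for z in range(len(rho)):
            # Conditional probabilities of {x} and of {y} given that the factor value is z.
            p_x = mu[x] / rho[z] if pi_X[x] == z else Fraction(0)
            p_y = nu[y] / rho[z] if pi_Y[y] == z else Fraction(0)
            mass += p_x * p_y * rho[z]
        lam[(x, y)] = mass
    return lam


def is_joining(lam, T, S, mu, nu):
    """True if lam has marginals mu and nu and is invariant under T x S."""
    X, Y = range(len(mu)), range(len(nu))
    marginal_x = [sum(lam[(x, y)] for y in Y) for x in X]
    marginal_y = [sum(lam[(x, y)] for x in X) for y in Y]
    # T and S are bijections of finite sets, so invariance can be tested on point masses.
    invariant = all(lam[(T[x], S[y])] == lam[(x, y)] for x, y in product(X, Y))
    return marginal_x == list(mu) and marginal_y == list(nu) and invariant


lam = relatively_independent_joining(mu, nu, rho, pi_X, pi_Y)
# With the trivial one-point factor, the same formula yields the product measure.
trivial = relatively_independent_joining(mu, nu, [Fraction(1)], [0] * 4, [0] * 6)
product_measure = {(x, y): mu[x] * nu[y] for x, y in product(range(4), range(6))}

print(is_joining(lam, T, S, mu, nu), lam == product_measure, trivial == product_measure)
```

Here the joining charges only the pairs (x, y) with the same parity, so it is a joining of T and S that differs from the product measure, while the trivial factor gives back the product measure, in line with the remark above.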
As we already said in the introduction, Rudolph exhibited in Rudolph (1979) a counterexample showing that the converse is not true. There exists however an important result, which was published in Glasner et al. (2000) and Lemańczyk et al. (2000), allowing us to derive some information on factors from the non-disjointness of two systems.

Theorem 5
If T and S are not disjoint, then S has a nontrivial common factor with some joining of a countable family of copies of T.

This result leads to the introduction of a special class of factors when some dynamical system T is given: For any other dynamical system S, call T-factor of S any common factor of S with a joining of countably many copies of T. If $(Z, \mathcal{C}, \rho, R)$ is a T-factor of S and $\pi : Y \to Z$ is a factor map, we say that the $\sigma$-algebra $\pi^{-1}(\mathcal{C})$ is a T-factor $\sigma$-algebra of S. Another way to state Theorem 5 is then the following: If S and T are not disjoint, then S has a nontrivial T-factor. In fact, an even more precise result can be derived from the proof of Theorem 5: For any joining $\lambda$ of S and T, for any bounded measurable function f on X, the factor $\sigma$-algebra of S generated by the function $\mathbb{E}_\lambda[f(x) \mid y]$ is a T-factor $\sigma$-algebra of S.

With the notion of T-factor, Theorem 5 has been extended in Lesigne et al. (2003) in the following way, showing the existence of a special T-factor $\sigma$-algebra of S comprising anything in S which could lead to a nontrivial joining between T and S.

Theorem 6
Given two measure-preserving dynamical systems $(X, \mathcal{A}, \mu, T)$ and $(Y, \mathcal{B}, \nu, S)$, there always exists a maximum T-factor $\sigma$-algebra of S, denoted by $\mathcal{F}_T$. Under any joining $\lambda$ of T and S, the $\sigma$-algebras $\mathcal{A} \otimes \{\emptyset, Y\}$ and $\{\emptyset, X\} \otimes \mathcal{B}$ are independent con- […]

[…] dynamical systems which are stable under the operations of taking joinings and factors. We will call these properties stable properties. This is, e.g., the case of the zero-entropy property: We know that any factor of a zero-entropy system still has zero entropy and that any joining of zero-entropy systems also has zero entropy. In other words, if T has zero entropy, then any T-factor has zero entropy. But the property of S being a K-system is precisely characterized by the fact that any nontrivial factor of S has positive entropy. Hence, a K-system S cannot have a nontrivial T-factor if T has zero entropy and is therefore disjoint from T. The converse is a consequence of Theorem 4: If S is not a K-system, then it possesses a nontrivial zero-entropy factor, and therefore there exists some zero-entropy system from which it is not disjoint.

The same argument also applies to the disjointness of discrete-spectrum systems with weakly mixing systems, since discrete spectrum is a stable property, and weakly mixing systems are characterized by the fact that they do not have any discrete-spectrum factor.

Markov Intertwinings and Composition of Joinings
There is another way of defining joinings of two measure-preserving dynamical systems involving operators on $L^2$ spaces, mainly brought to light by Ryzhikov (see Ryzhikov 1993b): Observe that for any joining $\lambda \in J(T, S)$, we can consider the operator $P_\lambda : L^2(X, \mu) \to L^2(Y, \nu)$ defined by

$$P_\lambda(f) := \mathbb{E}_\lambda[f(x) \mid y].$$

It is easily checked that $P_\lambda$ is a Markov intertwining of T and S. Conversely, given any Markov intertwining P of T and S, it can be shown that the measure $\lambda_P$ defined on $X \times Y$ by
with spectral properties of the transformations (see, e.g., Goodson 2000). It also provides us with a convenient setting to introduce the composition of joinings: If we are given three dynamical systems $(X, \mathcal{A}, \mu, T)$, $(Y, \mathcal{B}, \nu, S)$, and $(Z, \mathcal{C}, \rho, R)$, a joining $\lambda \in J(T, S)$, and a joining $\lambda' \in J(S, R)$, the composition of the Markov intertwinings $P_\lambda$ and $P_{\lambda'}$ is easily seen to give a third Markov intertwining, which itself corresponds to a joining of $T$ and $R$ denoted by $\lambda \circ \lambda'$. When $R = S = T$, i.e., when we are speaking of twofold self-joinings of a single system $T$ (cf. next section), this operation turns $J(T, T) = J_2(T)$ into a semigroup. Ahn and Lemańczyk (2003) have shown that the subset $J_2^e(T)$ of ergodic twofold self-joinings is a sub-semigroup if and only if $T$ is semisimple (see section “Simple Systems”).

Self-Joinings

We now turn to the case where the measure-preserving dynamical systems we want to join together are all copies of a single system $T$. For $k \ge 2$, any joining of $k$ copies of $T$ is called a $k$-fold self-joining of $T$. We denote by $J_k(T)$ the set of all $k$-fold self-joinings of $T$ and by $J_k^e(T)$ the subset of ergodic $k$-fold self-joinings.

Self-Joinings and Commuting Transformations
As soon as $T$ is not the trivial single-point system, $T$ is never disjoint from itself: Since $T$ is obviously isomorphic to itself, we can always find a twofold self-joining of $T$ which is not the product measure by considering self-joinings supported on graphs of isomorphisms (see section “Joinings and Isomorphism”). The simplest of them is obtained by taking the identity map as an isomorphism, and we get that $J_2(T)$ always contains the diagonal measure $\Delta_0 := \Delta_{\mathrm{Id}}$.
In general, an isomorphism of $T$ with itself is an invertible measure-preserving transformation $S$ of $(X, \mathcal{A}, \mu)$ which commutes with $T$. We call commutant of $T$ the set of all such transformations (it is a subgroup of the group of automorphisms of $(X, \mathcal{A}, \mu)$) and denote it by $C(T)$. It always contains, at least, all the powers $T^n$, $n \in \mathbb{Z}$.
Each element $S$ of $C(T)$ gives rise to a twofold self-joining $\Delta_S$ supported on the graph of $S$. Such self-joinings are called off-diagonal self-joinings. They also belong to $J_2^e(T)$ if $T$ is ergodic.
It follows that properties of the commutant of an ergodic $T$ can be seen in its ergodic joinings. As an example of application, we can cite Ryzhikov’s proof of King’s weak closure theorem for rank-one transformations (an introduction to finite-rank transformations can be found, e.g., in Nadkarni (1998a); we also refer the reader to the quite complete survey by Ferenczi (1997)). Rank-one measure-preserving transformations form a very important class of zero-entropy, ergodic measure-preserving transformations. They have many remarkable properties, among which is the fact that their commutant is reduced to the weak limits of powers of $T$. In other words, if $T$ is rank one, for any $S \in C(T)$, there exists a subsequence of integers $(n_k)$ such that

$$\forall A \in \mathcal{A}, \quad \mu\left(T^{-n_k}A \,\triangle\, S^{-1}A\right) \xrightarrow[k \to \infty]{} 0. \tag{8}$$

King proved this result in 1986 (King 1986), using a very intricate coding argument. Observing that (8) was equivalent to the convergence, in $J_2(T)$, of $\Delta_{T^{n_k}}$ to $\Delta_S$, Ryzhikov showed in Ryzhikov (1992b) that King’s theorem could be seen as a consequence of the following general result concerning twofold self-joinings of rank-one systems:

Theorem 7
Let $T$ be a rank-one measure-preserving transformation, and $\lambda \in J_2^e(T)$. Then there exist $t \ge 1/2$, a subsequence of integers $(n_k)$, and another twofold self-joining $\lambda'$ of $T$ such that

$$\Delta_{T^{n_k}} \xrightarrow[k \to \infty]{} t\lambda + (1 - t)\lambda'.$$

Minimal Self-Joinings
For any measure-preserving dynamical system $T$, the set of twofold self-joinings of $T$ contains at least the product measure $\mu \otimes \mu$, the off-diagonal
joinings $\Delta_{T^n}$ for each $n \in \mathbb{Z}$, and any convex combination of these. Rudolph (1979) discovered in 1979 that we can find systems for which there are no other twofold self-joinings than these obvious ones. When this is the case, we say that $T$ has twofold minimal self-joinings, or for short: $T \in \mathrm{MSJ}(2)$. It can be shown (see, e.g., Rudolph 1990) that, as soon as the underlying probability space is not atomic (which we henceforth assume), twofold minimal self-joinings imply that $T$ is weakly mixing and therefore that $\mu \otimes \mu$ and $\Delta_{T^n}$, $n \in \mathbb{Z}$, are ergodic twofold self-joinings of $T$. That is why twofold minimal self-joinings are often defined by the following:

$$T \in \mathrm{MSJ}(2) \iff J_2^e(T) = \{\mu \otimes \mu\} \cup \{\Delta_{T^n},\ n \in \mathbb{Z}\}. \tag{9}$$

Systems with twofold minimal self-joinings have very interesting properties. First, since for any $S$ in $C(T)$, $\Delta_S$ belongs to $J_2^e(T)$, we immediately see that the commutant of $T$ is reduced to the powers of $T$. In particular, it is impossible to find a square root of $T$, i.e., a measure-preserving $S$ such that $S \circ S = T$. Second, the existence of a nontrivial factor $\sigma$-algebra of $T$ would lead, via the relatively independent self-joining over this factor, to some ergodic twofold self-joining of $T$ which is not in the list prescribed by (9). Therefore, any factor $\sigma$-algebra of a system with twofold minimal self-joinings must be either the trivial $\sigma$-algebra $\{\emptyset, X\}$ or the whole $\sigma$-algebra $\mathcal{A}$. This has the remarkable consequence that if $\xi$ is any random variable on the underlying probability space which is not almost-surely constant, then the process $(\xi \circ T^n)_{n \in \mathbb{Z}}$ always generates the whole $\sigma$-algebra $\mathcal{A}$. This also implies that $T$ has zero entropy, since positive-entropy systems have many nontrivial factors.
The notion of twofold minimal self-joinings extends for any integer $k \ge 2$ to $k$-fold minimal self-joinings, which roughly means that there are no other $k$-fold ergodic self-joinings than the “obvious” ones: those for which the $k$ coordinates are either independent or just translated by some power of $T$ (see the glossary for a more precise definition). We denote in this case: $T \in \mathrm{MSJ}(k)$. If $T$ has $k$-fold minimal self-joinings for all $k \ge 2$, we simply say that $T$ has minimal self-joinings.
Rudolph’s construction of a system with twofold minimal self-joinings (Rudolph 1979) was inspired by a famous work of Ornstein (1972), giving the first example of a transformation with no roots. It turned out that Ornstein’s example is a mixing rank-one system, and all mixing rank-one systems were later proved by King (1988) to have twofold minimal self-joinings. This can also be viewed as a consequence of Ryzhikov’s Theorem 7. Indeed, in the language of joinings, the mixing property of $T$ translates as follows:

$$T \text{ is mixing} \iff \Delta_{T^n} \xrightarrow[|n| \to \infty]{} \mu \otimes \mu. \tag{10}$$

Therefore, if in Theorem 7 we further assume that $T$ is mixing, then either the sequence $(n_k)$ we get in the conclusion is bounded, and then $\lambda$ is some $\Delta_{T^n}$, or it is unbounded and then $\lambda = \mu \otimes \mu$.
$T \in \mathrm{MSJ}(k)$ obviously implies $T \in \mathrm{MSJ}(k')$ for any $2 \le k' \le k$, but the converse is not known. The question whether twofold minimal self-joinings implies $k$-fold minimal self-joinings for all $k$ is related to the important open problem of pairwise-independent joinings (see section “Pairwise-Independent Joinings”). But the latter problem is solved for some special classes of systems, in particular in the category of mixing rank-one transformations. It follows that, if $T$ is mixing and rank one, then $T$ has minimal self-joinings.
In 1980, Del Junco, Rahe, and Swanson proved that Chacon’s transformation also has minimal self-joinings (del Junco et al. 1980). This well-known transformation is also a rank-one system, but it is not mixing (it had been introduced by R.V. Chacon in 1969 (Chacon 1969) as the first explicit example of a weakly mixing transformation which is not mixing). For another example of a transformation with twofold minimal self-joinings, constructed as an exchange map on three intervals, we refer to del Junco (1983).
The existence of a transformation with minimal self-joinings has been used by Rudolph as a wonderful tool to construct a large variety of striking counterexamples, such as:
• A transformation $T$ which has no roots, while $T^2$ has roots of any order
• A transformation with a cubic root but no square root
• Two measure-preserving dynamical systems which are weakly isomorphic (each one is a factor of the other) but not isomorphic

Let us now sketch the argument showing that we can find two systems with no nontrivial common factor but which are not disjoint: We start with a system $T$ with minimal self-joinings. Consider the direct product of $T$ with an independent copy $T'$ of itself, and take the symmetric factor $S$ of $T \times T'$, that is to say the factor we get if we only look at the nonordered pair of coordinates $\{x, x'\}$ in the Cartesian product. Then $S$ is surely not disjoint from $T$, since the pair $\{x, x'\}$ is not independent of $x$. However, if $S$ and $T$ had a nontrivial common factor, then this factor should be isomorphic to $T$ itself (because $T$ has minimal self-joinings). Therefore we could find in the direct product $T \times T'$ a third copy $\tilde{T}$ of $T$, which is measurable with respect to the symmetric factor. In particular, $\tilde{T}$ is invariant by the flip map $(x, x') \mapsto (x', x)$, and this prevents $\tilde{T}$ from being measurable with respect to only one coordinate. Then, since $T \in \mathrm{MSJ}(3)$, the systems $T$, $T'$, and $\tilde{T}$ have no choice but to be independent. But this contradicts the fact that $\tilde{T}$ is measurable with respect to the $\sigma$-algebra generated by $x$ and $x'$. Hence, $T$ and $S$ have no nontrivial common factor.
We can also cite the example given by Glasner and Weiss (1983) of a pair of horocycle transformations which have no nontrivial common factor, yet are not disjoint. Their construction relies on the deep work by Ratner (1983), which describes the structure of joinings of horocycle flows.

Simple Systems
An important generalization of twofold minimal self-joinings has been proposed by William A. Veech in 1982 (Veech 1982). We say that the measure-preserving dynamical system $T$ is twofold simple if it has no other ergodic twofold self-joinings than the product measure $\mu \otimes \mu$ and joinings supported on the graph of a transformation $S \in C(T)$. (The difference with $\mathrm{MSJ}(2)$ lies in the fact that $C(T)$ may contain other transformations than the powers of $T$.) It turns out that simple systems may have nontrivial factors, but the structure of these factors can be explicitly described: They are always associated with some compact subgroup of $C(T)$. More precisely, if $K$ is a compact subgroup of $C(T)$, we can consider the factor $\sigma$-algebra

$$\mathcal{F}_K := \{A \in \mathcal{A} : \forall S \in K,\ A = S(A)\},$$

and the corresponding factor transformation $T|_{\mathcal{F}_K}$ (called a group factor). Then Veech proved the following theorem concerning the structure of factors of a twofold simple system.

Theorem 8
If the dynamical system $T$ is twofold simple, and if $\mathcal{F} \subset \mathcal{A}$ is a nontrivial factor $\sigma$-algebra of $T$, then there exists a compact subgroup $K$ of the group $C(T)$ such that $\mathcal{F} = \mathcal{F}_K$.

There is a natural generalization of Veech’s property to the case of $k$-fold self-joinings, which has been introduced by Del Junco and Rudolph in 1987 (del Junco and Rudolph 1987) (see the precise definition of simple systems in the glossary). In their work, important results concerning the structure of factors and joinings of simple systems are proved. In particular, they are able to completely describe the structure of the ergodic joinings between a given simple system and any ergodic system (see also Glasner 2003 and Thouvenot 1995). Recall that, for any $r \ge 1$, the symmetric factor of $T^{\times r}$ is the system we get if we observe the $r$ coordinates of the point in $X^r$ and forget their order. This is a special case of group factor, associated with the subgroup of $C(T^{\times r})$ consisting of all permutations of the coordinates. We denote this symmetric factor by $T^{\langle r \rangle}$.

Theorem 9
Let $T$ be a simple system and $S$ an ergodic system. Assume that $\lambda$ is an ergodic joining of $T$ and $S$ which is different from the product measure. Then there exist a compact subgroup $K$ of $C(T)$ and an integer $r \ge 1$ such that:
• $T|_{\mathcal{F}_K}^{\langle r \rangle}$ is a factor of $S$.
• $\lambda$ is the projection on $X \times Y$ of the relatively independent joining of $T^{\times r}$ and $S$ over their common factor $T|_{\mathcal{F}_K}^{\langle r \rangle}$.

If we further assume that the second system is also simple, then in the conclusion we can take $r = 1$. In other words, ergodic joinings of simple systems $S$ and $T$ are either the product measure or relatively independent joinings over a common group factor. This leads to the following corollary:

over $\mathcal{F}$ (recall that weak mixing is itself characterized by the ergodicity of the direct product $T \times T$).
For more details on this subject, we refer the reader to Lemańczyk et al. (2002). We also wish to mention the generalization of simplicity called semisimplicity proposed by del Junco et al. (1995), which is precisely characterized by the fact that, for any $\lambda \in J_2^e(T)$, the system $(T \times T)_\lambda$ is a relatively weakly mixing extension of $T$.
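The objects used in the preceding sections (off-diagonal self-joinings $\Delta_S$, the Markov operator $P_\lambda$, and the composition of joinings) can be checked by hand on a finite toy system. The sketch below is only an illustration under assumptions that are not in the text: the space is $\mathbb{Z}/n\mathbb{Z}$ with the uniform measure, $T$ is the rotation $x \mapsto x + 1$, and the helper names (`off_diagonal`, `product_joining`, `is_joining`, `markov_operator`, `compose`) are hypothetical. Since this space is atomic, the statements on minimal self-joinings, which assume a non-atomic space, do not apply to it.

```python
from fractions import Fraction

# Toy model (an assumption for illustration): X = Z/nZ, uniform measure,
# T(x) = x + 1 mod n. A twofold self-joining is an n x n matrix lam with
# lam[x][y] = mass of the point (x, y).

def off_diagonal(n, a):
    """Joining supported on the graph of S = T^a, i.e. y = x + a mod n."""
    return [[Fraction(1, n) if (x + a) % n == y else Fraction(0)
             for y in range(n)] for x in range(n)]

def product_joining(n):
    """Product measure mu x mu."""
    return [[Fraction(1, n * n) for _ in range(n)] for _ in range(n)]

def is_joining(lam):
    """Check both marginals are uniform and lam is invariant under T x T."""
    n = len(lam)
    rows = all(sum(lam[x]) == Fraction(1, n) for x in range(n))
    cols = all(sum(lam[x][y] for x in range(n)) == Fraction(1, n)
               for y in range(n))
    inv = all(lam[(x + 1) % n][(y + 1) % n] == lam[x][y]
              for x in range(n) for y in range(n))
    return rows and cols and inv

def markov_operator(lam, f):
    """P_lam f : y -> E_lam[f(x) | y]; dividing by nu(y) = 1/n is the factor n."""
    n = len(lam)
    return [n * sum(lam[x][y] * f[x] for x in range(n)) for y in range(n)]

def compose(lam1, lam2):
    """Joining whose Markov operator is P_lam2 applied after P_lam1:
    (x, z) -> sum over y of lam1[x][y] * lam2[y][z] / nu(y)."""
    n = len(lam1)
    return [[n * sum(lam1[x][y] * lam2[y][z] for y in range(n))
             for z in range(n)] for x in range(n)]
```

In this model, `compose(off_diagonal(6, 2), off_diagonal(6, 3))` equals `off_diagonal(6, 5)`, which mirrors $\Delta_{T^2} \circ \Delta_{T^3} = \Delta_{T^5}$, and composing any joining with the product measure returns the product measure.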
Joinings Proofs of Ornstein’s and Krieger’s Theorems
We have already seen that joinings could be used to prove isomorphisms between systems. This fact found a nice application in the proofs of two major theorems in ergodic theory: Ornstein’s isomorphism theorem (Ornstein 1970), stating that two Bernoulli shifts with the same entropy are isomorphic, and Krieger’s finite generator theorem (Krieger 1970), which says that any dynamical system with finite entropy is isomorphic to the shift transformation on a finite-valued stationary process. The idea of this joining approach to the proofs of Krieger’s and Ornstein’s theorems was originally due to Burton and Rothstein, who circulated a preliminary report on the subject which was never published (Burton and Rothstein 1977). The first published and fully detailed exposition of these proofs can be found in Rudolph’s book (Rudolph 1990) (see also Glasner’s book (Glasner 2003)).
In fact, Ornstein’s theorem goes far beyond the isomorphism of two given Bernoulli shifts: It also gives a powerful tool for showing that a specific dynamical system is isomorphic to a Bernoulli shift. In particular, Ornstein introduced the property for an ergodic stationary process to be finitely determined. We shall not give here the precise definition of this property (for a complete exposition of Ornstein’s theory, we refer the reader to Ornstein (1974)), but simply point out that Bernoulli shifts and mixing Markov chains are examples of finitely determined processes. Rudolph’s argument to show Ornstein’s theorem via joinings makes use of Theorem 3 and of the topology of $J(T, S)$.

Theorem 11
(Ornstein’s isomorphism theorem). Let $T$ and $S$ be two ergodic dynamical systems with the same entropy, and both generated by finitely determined stationary processes. Then the set of joinings of $T$ and $S$ which are supported on graphs of isomorphisms forms a dense $G_\delta$ in $J^e(T, S)$.

Krieger’s theorem is not as easily stated in terms of joinings, because it does not refer to the isomorphism of two specific systems, but rather to the isomorphism of one given system with some other system which has to be found. We have therefore to introduce a larger set of joinings: Given an integer $n$, we denote by $Y_n$ the set of double-sided sequences taking values in $\{1, \ldots, n\}$. We consider on $Y_n$ the shift transformation $S$, but we do not determine yet the invariant measure. Now, for a specific measure-preserving dynamical system $T$, consider the set $J(n, T)$ of all possible joinings of $T$ with some system $(Y_n, \nu, S)$, when $\nu$ ranges over all possible shift-invariant probability measures on $Y_n$. $J(n, T)$ can also be equipped with a topology which turns it into a compact convex metric space, and as soon as $T$ is ergodic, the set $J^e(n, T)$ of ergodic elements of $J(n, T)$ is not empty. In this setting, Krieger’s theorem can be stated as follows:

Theorem 12
(Krieger’s finite generator theorem). Let $T$ be an ergodic dynamical system with entropy $h(T) < \log_2 n$. Then the set of $\lambda \in J(n, T)$ which are supported on graphs of isomorphisms between $T$ and some system $(Y_n, \nu, S)$ forms a dense $G_\delta$ in $J^e(n, T)$.

Since any system of the form $(Y_n, \nu, S)$ obviously has an $n$-valued generating process, we obtain as a corollary that $T$ itself is generated by an $n$-valued process.
As another nice example of how joinings can be used to describe the isomorphisms between two transformations, we can also cite the paper (Foreman et al. 2011) by Foreman, Rudolph, and Weiss. With a clever construction of a family of transformations for which they can completely describe the joinings between $T$ and its inverse, they are able to prove that the isomorphism relation between measure-preserving transformations of the unit interval is not Borel.

Joinings and Rohlin’s Multifold-Mixing Question
We have already seen that the property of $T$ being mixing could be expressed in terms of twofold
$$\frac{1}{n} \sum_{k=0}^{n-1} f(T^k x)\, g(S^k y), \tag{13}$$

which can be viewed as the integral of the function $f \otimes g$ with respect to the empirical distribution

$$\delta_n(x, y) := \frac{1}{n} \sum_{k=0}^{n-1} \delta_{(T^k x,\, S^k y)}.$$

We can always assume that $T$ and $S$ are continuous transformations of compact metric spaces (indeed, any measure-preserving dynamical system is isomorphic to such a transformation on a compact metric space: see, e.g., Furstenberg 1981). Then the set of probability measures on $X \times Y$ equipped with the topology of weak convergence is metric compact. Now, here is the crucial point where joinings appear: If $T$ and $S$ are ergodic, we can easily find subsets $X_0 \subset X$ and $Y_0 \subset Y$ with $\mu(X_0) = \nu(Y_0) = 1$, such that for all $(x, y) \in X_0 \times Y_0$, any cluster point of the sequence $(\delta_n(x, y))_{n > 0}$ is automatically a joining of $T$ and $S$. (We just have to pick $x$ and $y$ among the “good points” for the ergodic theorem in their respective spaces.) When $T$ and $S$ are disjoint, the only cluster point of the sequence $(\delta_n(x, y))$ is therefore $\mu \otimes \nu$. This ensures that, for continuous $f$ and $g$, (13) converges to the product of the integrals of $f$ and $g$ as soon as $(x, y)$ is picked in $X_0 \times Y_0$. The subspace of continuous functions being dense in $L^2$, the classical ergodic maximal inequality (see Garsia 1970) ensures that, for any $f$ and $g$ in $L^2(\mu)$, (13) converges for any $(x, y)$ in a rectangle of full measure $X_0 \times Y_0$.
Coming back to the original question where the spaces on which $T$ and $S$ act are identified, we observe that with probability one, $x$ belongs both to $X_0$ and $Y_0$, and therefore the sequence (12) converges.
The existence of a rectangle of full measure in which the sequence of empirical distributions $(\delta_n(x, y))_{n > 0}$ always converges to some joining has been studied in Lesigne et al. (2003) as a natural generalization of the notion of disjointness. This property was called weak disjointness of $S$ and $T$, and it is indeed strictly weaker than disjointness, since there are examples of transformations which are weakly disjoint from themselves.
Recent works use joinings to study the pointwise convergence of multiple ergodic averages. We mention in particular Huang et al. (2014), where the case of successive powers of an ergodic distal transformation is treated, and Donoso and Sun (2016) where this result is extended to the case of $d$ commuting transformations generating a distal action of $\mathbb{Z}^d$, using some of Austin’s ideas.
Also, let us observe that a strong connection with Question 1 on pairwise-independent joinings has been established in Gutman et al. (2018). Indeed, using a result of Bourgain according to which the almost-sure convergence of (11) holds when $d = 2$ and $T_1$, $T_2$ are powers of a single transformation, the authors prove that if $T$ is PID and weakly mixing, then (11) converges almost surely for $T_i = T^i$, $i = 1, \ldots, d$.

Joinings and Conjectures in Number Theory
Joinings have recently proved to be a relevant tool in the study of a conjecture by Sarnak, which lies at the interface between number theory and dynamical systems. Sarnak’s conjecture deals with the famous arithmetic Möbius function $\boldsymbol{\mu}$, which is defined for each integer $n > 0$ by

$$\boldsymbol{\mu}(n) := \begin{cases} 0 & \text{if there exists a prime } p \text{ such that } p^2 \mid n, \\ (-1)^k & \text{if } n \text{ is the product of } k \text{ distinct prime numbers } (k \ge 0). \end{cases}$$

A famous heuristic called the Möbius randomness principle states that the patterns of symbols $-1$, $1$, and $0$ in the sequence $\boldsymbol{\mu}$ behave so chaotically that $\boldsymbol{\mu}$ has no correlation with any reasonably simple sequence $\xi$. Sarnak proposed a precise interpretation of this principle in the context of dynamical systems:

Conjecture 1
(Sarnak’s conjecture). For any topological dynamical system $(X, T)$ with zero topological entropy, any continuous real-valued function $f$ defined on $X$, and any $x \in X$, we have
$$\frac{1}{n} \sum_{k=0}^{n-1} f(T^k x)\, \boldsymbol{\mu}(k) \xrightarrow[n \to \infty]{} 0. \tag{14}$$

Sarnak pointed out that this conjecture was supported by an older conjecture of Chowla which says that $\boldsymbol{\mu}$ has no autocorrelation of any order. More precisely, it can be stated as follows:

Conjecture 2
(Chowla’s conjecture). For any integers $r \ge 0$, $i_0, i_1, \ldots, i_r \ge 0$ with at least one odd $i_j$, we have

$$\frac{1}{n} \sum_{k=0}^{n-1} \boldsymbol{\mu}(k)^{i_0}\, \boldsymbol{\mu}(k+1)^{i_1} \cdots \boldsymbol{\mu}(k+r)^{i_r} \xrightarrow[n \to \infty]{} 0.$$

As pointed out by Sarnak, that the Chowla conjecture implies the Sarnak conjecture can be proved via ergodic theory arguments. Indeed, an interpretation of the Chowla conjecture is that $\boldsymbol{\mu}$ is the product of $\boldsymbol{\mu}^2$ (the characteristic function of the set of square-free numbers) with a sequence $\pi \in \{-1, 1\}^{\mathbb{N}}$ behaving as a typical output of an infinite sequence of balanced coin tosses. Now, with the assumptions of the statement of the Sarnak conjecture, the sequence $f(T^k x)\, \boldsymbol{\mu}^2(k)$ is produced by a zero-entropy system, whereas the sequence $\pi$ comes from a K-system. Hence (14) can be viewed as a consequence of the disjointness between K-systems and systems of zero entropy. (See, e.g., El Abdalaoui et al. (2017) for details.)
Another reason why joinings are used in the study of the Sarnak conjecture comes from the following criterion introduced by Bourgain et al. (2013), which provides a sufficient condition for a bounded sequence $(a_n)$ to be orthogonal to any bounded multiplicative function $\nu$. Recall that the Möbius function is itself multiplicative, which means that $\boldsymbol{\mu}(mn) = \boldsymbol{\mu}(m)\, \boldsymbol{\mu}(n)$ whenever $m$, $n$ are coprime integers.

$$\limsup_{\substack{r, s \to \infty \\ r, s \text{ different primes}}}\ \limsup_{n \to \infty} \left| \frac{1}{n} \sum_{k=0}^{n-1} a_{kr}\, \overline{a_{ks}} \right| = 0. \tag{15}$$

Then, for each bounded multiplicative function $\nu : \mathbb{N} \to \mathbb{C}$, we have

$$\lim_{n \to \infty} \frac{1}{n} \sum_{k=0}^{n-1} a_k\, \nu(k) = 0.$$

To apply the above lemma in the context of the Sarnak conjecture, we have to consider the sequence $a_k = f(T^k x)$; hence (15) becomes an assumption on this kind of multiple ergodic average

$$\frac{1}{n} \sum_{k=0}^{n-1} f\left((T^r)^k x\right) f\left((T^s)^k x\right).$$

Now, let us assume for simplification that the topological dynamical system $(X, T)$ has a unique invariant probability measure $\mu$. Then arguments similar to those exposed in section “Joinings and Multiple Ergodic Averages” lead to considering joinings of different prime powers $T^r$ and $T^s$ of the measure-preserving system $(X, T, \mu)$. For example, as explained in Bourgain et al. (2013), disjointness of different prime powers of $T$ implies the validity of the Sarnak conjecture for the system. But Lemma 1 can also be applied in a larger class of systems, which is defined below and for which we can control all possible joinings of different prime powers of $T$.
The ergodic measure-preserving system $(X, \mu, T)$ is said to have Asymptotically Orthogonal Powers (AOP) if for each given $f$ and $g$ in $L^2(\mu)$ with $\int_X f\, d\mu = \int_X g\, d\mu = 0$, we have

$$\lim_{\substack{r, s \to \infty \\ r, s \text{ different primes}}}\ \sup_{\kappa \in J^e(T^r, T^s)} \left| \int_{X \times X} f \otimes g\, d\kappa \right| = 0.$$
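The Möbius function and the averages in (14) are easy to experiment with numerically. The following is a minimal sketch, not part of the original text, under stated assumptions: the system is the rotation $x \mapsto x + \alpha \pmod 1$ of the circle with $\alpha = \sqrt{2}$ (a zero-entropy system for which the convergence in (14) is classically known), the observable is $f(x) = \cos(2\pi x)$, and the function names `mobius_sieve` and `sarnak_average` are hypothetical. The value at index 0 is set to 0 by convention so that the sum can start at $k = 0$ as in (14).

```python
import math

def mobius_sieve(limit):
    """Return a list mu with mu[n] = Moebius function of n for 1 <= n <= limit.
    mu[0] is set to 0 by convention (an assumption made for indexing)."""
    mu = [1] * (limit + 1)
    mu[0] = 0
    is_prime = [True] * (limit + 1)
    for p in range(2, limit + 1):
        if not is_prime[p]:
            continue
        # Each prime divisor flips the sign once.
        for m in range(p, limit + 1, p):
            if m > p:
                is_prime[m] = False
            mu[m] = -mu[m]
        # Any multiple of p^2 is not square-free.
        for m in range(p * p, limit + 1, p * p):
            mu[m] = 0
    return mu

def sarnak_average(n, alpha, x0=0.0):
    """Cesaro average (1/n) * sum_{k=0}^{n-1} f(T^k x0) * mu(k) for the
    rotation T x = x + alpha mod 1 and f(x) = cos(2 pi x)."""
    mu = mobius_sieve(n)
    total = sum(math.cos(2.0 * math.pi * (x0 + k * alpha)) * mu[k]
                for k in range(n))
    return total / n
```

Calling `sarnak_average(n, math.sqrt(2))` for increasing `n` gives a rough feeling for the decay predicted by (14); a numerical experiment of this kind is of course no evidence for the conjecture in general.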
Ferenczi et al. (2017) and references therein. In particular, the Sarnak conjecture holds for any uniquely ergodic model of an AOP system, with uniform convergence of (14) with respect to $x$.

Future Directions
A lot of important open questions in ergodic theory involve joinings, and we already have cited several of them: joinings are a natural tool when we want to deal with some problems of pointwise convergence involving several transformations (see section “Joinings and Multiple Ergodic Averages”). It can therefore be assumed that they will play an important role in future progress on pointwise convergence of multiple ergodic averages. Their use is also fundamental in the study of Rohlin’s question on multifold mixing. As far as this latter problem is concerned, we may mention a recent approach to Question 1: start with a transformation for which some special pairwise-independent self-joining exists, and see what this assumption entails. In particular, we can ask under which conditions there exists a pairwise-independent threefold self-joining of $T$ under which the third coordinate is a function of the two others. It has already been proved in Janvresse and de la Rue (2007) that if this function is sufficiently regular (continuous for some topology), then $T$ is periodic or has positive entropy. And there is strong evidence leading to the conjecture that, when $T$ is weakly mixing, such a situation can only arise when $T$ is a Bernoulli shift of entropy $\log n$ for some integer $n \ge 2$. A question in the same spirit was raised by Ryzhikov, who asked in Ryzhikov (1992a) under which conditions we can find a factor of the direct product $T \times T$ which is independent of both coordinates.
There is also a lot of work to do with joinings in order to understand the structure of factors of some dynamical systems and how different classes of systems are related. An example of such a work is given in the class of Gaussian dynamical systems, i.e., dynamical systems constructed from the shift on a stationary Gaussian process: For some of them (which are called GAG, from the French Gaussien à Autocouplages Gaussiens), it can be proved that any ergodic self-joining is itself a Gaussian system (see Lemańczyk et al. 2000 and Thouvenot 1987), and this gives a complete description of the structure of their factors. This kind of analysis is expected to be applicable to other classes of dynamical systems. In particular, Gaussian joinings find a nice generalization in the notion of infinitely divisible joinings, studied by Roy (2009). These ID joinings concern a wider class of dynamical systems of probabilistic origin, among which we can also find Poisson suspensions. The counterpart of Gaussian joinings in this latter class is Poisson joinings, which have been introduced by Derriennic et al. (2008). As far as Poisson suspensions are concerned, the analog of the GAG property in the Gaussian class can also be considered, and examples of Poisson suspensions for which the only ergodic self-joinings are Poisson joinings have been given in Janvresse et al. (2017) and Parreau and Roy (2007). In Derriennic et al. (2008) a general joining property is described: $T$ satisfies the ELF property (from the French Ergodicité des Limites Faibles) if any joining which can be obtained as a limit of off-diagonal joinings $\Delta_{T^{n_k}}$ is automatically ergodic. It turns out that this property is satisfied by any system arising from an infinitely divisible stationary process (see Derriennic et al. 2008 and Roy 2009). It is proved in Derriennic et al. (2008) that the ELF property implies disjointness with any system which is twofold simple and weakly mixing but not mixing. The ELF property is expected to give a useful tool to prove disjointness between dynamical systems of probabilistic origin and other classes of systems (see, e.g., Frączek and Lemańczyk (2004), in the case of $\mathbb{R}$-actions, for disjointness between ELF systems and a class of special flows over irrational rotations).

Bibliography
Ahn Y-H, Lemańczyk M (2003) An algebraic property of joinings. Proc Am Math Soc 131(6):1711–1716. (Electronic)
Austin T (2010) Multiple recurrence and the structure of probability-preserving systems. arXiv:1006.0491
Bourgain J, Sarnak P, Ziegler T (2013) Disjointness of Mobius from horocycle flows. In: From Fourier analysis and number theory to Radon transforms and
geometry, Developments in mathematics, vol 28. Springer, New York, pp 67–83
Bulatek W, Lemańczyk M, Lesigne E (2005) IEEE Trans Inform Theory 51(10):3586–3593
Burton R, Rothstein A (1977) Isomorphism theorems in ergodic theory. Technical report, Oregon State University
Chacon RV (1969) Weakly mixing transformations which are not strongly mixing. Proc Am Math Soc 22:559–562
de la Rue T (2006a) 2-fold and 3-fold mixing: why 3-dot-type counterexamples are impossible in one dimension. Bull Braz Math Soc (N.S.) 37(4):503–521
de la Rue T (2006b) An introduction to joinings in ergodic theory. Discret Contin Dyn Syst 15(1):121–142
de la Rue T (2009) Notes on Austin’s multiple ergodic theorem, hal-00400975
del Junco A (1983) A family of counterexamples in ergodic theory. Israel J Math 44(2):160–188
del Junco A, Rudolph DJ (1987) On ergodic actions whose self-joinings are graphs. Ergodic Theory Dyn Syst 7(4):531–557
del Junco A, Rahe M, Swanson L (1980) Chacon’s automorphism has minimal self-joinings. J Anal Math 37:276–284
del Junco A, Lemańczyk M, Mentzen MK (1995) Semisimplicity, joinings and group extensions. Stud Math 112(2):141–164
Derriennic Y, Fraczek K, Lemańczyk M, Parreau F. (2008) Ergodic automorphisms whose weak closure of off-diagonal measures consists of ergodic self-joinings. To appear in Colloquium Mathematicum 110, No. 1, 81–115
Donoso S, Sun W (2016) Pointwise convergence of some multiple ergodic averages. arXiv:1609.02529
El Abdalaoui H, Kulaga-Przymus J, Lemańczyk M, de la Rue T (2017) The Chowla and the Sarnak conjectures from ergodic theory point of view. Discret Continuous Dyn Syst 37(6):2899–2944
Ferenczi S (1997) Systems of finite rank. Colloq Math 73(1):35–65
Ferenczi S, Kulaga-Przymus J, Lemańczyk M (2017) Sarnak’s conjecture – what’s new. arXiv:1710.04039
Foreman M, Rudolph DJ, Weiss B (2011) The conjugacy problem in ergodic theory. Ann Math 173(3):1529–1586. (English)
Frączek K, Lemańczyk M (2004) A class of special flows over irrational rotations which is disjoint from mixing flows. Ergodic Theory Dyn Syst 24(4):1083–1095
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Syst Theory 1:1–49
Furstenberg H, Peres Y, Weiss B (1995) Perfect filtering and double disjointness. Ann Inst Henri Poincaré Probab Stat 31(3):453–465
Garbit R (2011) A note on Furstenberg’s filtering problem. Isr J Math 182:333–336
Garsia AM (1970) Topics in almost everywhere convergence, Lectures in advanced mathematics, vol 4. Markham Publishing Co., Chicago
Glasner E (2003) Ergodic theory via joinings, Mathematical surveys and monographs, vol 101. American Mathematical Society, Providence
Glasner S, Weiss B (1983) Minimal transformations with no common factor need not be disjoint. Isr J Math 45(1):1–8
Glasner E, Host B, Rudolph DJ (1992) Simple systems and their higher order self-joinings. Isr J Math 78(1):131–142
Glasner E, Thouvenot J-P, Weiss B (2000) Entropy theory without a past. Ergodic Theory Dyn Syst 20(5):1355–1370
Goodson GR (2000) Joining properties of ergodic dynamical systems having simple spectrum. Sankhyā Ser A 62(3):307–317. Ergodic theory and harmonic analysis (Mumbai, 1999)
Gutman Y, Huang W, Shao S, Ye X (2018) Almost sure convergence of the multiple ergodic average for certain weakly mixing systems. Acta Math Sin Engl Ser 34(1):79–90
Host B (1991) Mixing of all orders and pairwise independent joinings of systems with singular spectrum. Isr J Math 76(3):289–298
Host B, Kra B (2005) Nonconventional ergodic averages and nil-manifolds. Ann Math 161(1):397–488
Huang W, Shao S, Ye X (2014) Pointwise convergence of multiple ergodic averages and strictly ergodic models. arXiv:1406.5930
Janvresse É, de la Rue T (2007) On a class of pairwise-independent joinings, Preprint. arXiv:0704.3358v2 [math.PR]
Janvresse É, Roy E, de la Rue T (2017) Poisson suspensions and Sushis. Ann Sci Éc Norm Super 50(6):1301–1334
Kalikow SA (1984) Twofold mixing implies threefold mixing for rank one transformations. Ergodic Theory Dyn Syst 4(2):237–259
King J (1986) The commutant is the weak closure of the powers, for rank-1 transformations. Ergodic Theory Dyn Syst 6(3):363–384
King J (1988) Joining-rank and the structure of finite rank mixing transformations. J Anal Math 51:182–227
King JL (1990) A map with topological minimal self-joinings in the sense of del Junco. Ergodic Theory Dyn Syst 10(4):745–761. (English)
Furstenberg H (1981) Recurrence in ergodic theory and Krieger W (1970) On entropy and generators of measure-
combinatorial number theory, M. B. Porter Lectures. preserving transformations. Trans Am Math Soc
Princeton University Press, Princeton 149:453–464
Furstenberg H, Keynes HB, Shapiro L (1973) Prime flows Lemańczyk M, Parreau F, Thouvenot J-P (2000) Gaussian
in topological dynamics. Isr J Math 14:26–38. automorphisms whose ergodic self-joinings are Gauss-
(English) ian. Fundam Math 164(3):253–293
168 Joinings in Ergodic Theory
Lemańczyk M, Thouvenot J-P, Weiss B (2002) Relative Science Publications. The Clarendon Press/Oxford
discrete spectrum and joinings. Monatsh Math University Press, New York
137(1):57–75 Ryzhikov VV (1992a) Stochastic wreath products and
Lesigne E, Rittaud B, de la Rue T (2003) Weak disjointness joinings of dynamical systems. Mat Zametki
of measure preserving dynamical systems. Ergodic 52(3):130–140. 160
Theory Dyn Syst 23(4):1173–1198 Ryzhikov VV (1992b) Mixing, rank and minimal self-
Nadkarni MG (1998a) Basic ergodic theory, Birkhäuser joining of actions with invariant measure. Mat Sb
advanced texts: Basler Lehrbücher. [Birkhäuser 183(3):133–160
advanced texts: Basel textbooks], 2nd edn. Birkhäuser Ryzhikov VV (1993a) Joinings and multiple mixing of the
Verlag, Basel actions of finite rank. Funkt- sional Anal i Prilozhen
Nadkarni MG (1998b) Spectral theory of dynamical sys- 27(2):63–78. 96
tems, Birkhäuser advanced texts: Basler Lehrbüacher. Ryzhikov VV (1993b) Joinings, wreath products, factors
[Birkhäuser advanced texts: Basel textbooks]. and mixing properties of dynamical systems. Izv Ross
Birkhäuser Verlag, Basel Akad Nauk Ser Mat 57(1):102–128
Ornstein DS (1970) Bernoulli shifts with the same entropy Sarnak P (2012) Mobius randomness and dynamics. Not S
are isomorphic. Adv Math 4:337–352. 1970 Afr Math Soc 43(2):89–97
Ornstein DS (1972) On the root problem in ergodic theory. Tao T (2008) Norm convergence of multiple ergodic aver-
In: Proceedings of the sixth Berkeley symposium on ages for commuting transformations. Ergodic Theory
mathematical statistics and probability (University of Dyn Syst 28(2):657–688
California, Berkeley, 1970/1971), vol II, probability Thorisson H (2000) Coupling, stationarity, and regenera-
theory. University of California Press, Berkeley, tion, Probability and its applications. Springer,
pp 347–356 New York
Ornstein DS (1974) Ergodic theory, randomness, and Thouvenot J-P (1987) The metrical structure of some
dynamical systems, James K. Whittemore lectures in Gaussian processes. In: Proceedings of the conference
mathematics given at Yale University, Yale mathemat- on ergodic theory and related topics, II (Georgenthal,
ical monographs, vol 5. Yale University Press, New 1986), Teubner-Texte zur mathematik, vol 94. Teubner,
Haven Leipzig, pp 195–198
Parreau F, Roy E (2007) Poisson suspensions with a min- Thouvenot J-P (1995) Some properties and applications of
imal set of self-joinings, Preprint joinings in ergodic theory. In: Ergodic theory and its
Ratner M (1983) Horocycle flows, joinings and rigidity of connections with harmonic analysis (Alexandria,
products. Ann Math 118(2):277–313 1993), London mathematical society lecture note
Rohlin VA (1949) On endomorphisms of compact com- series, vol 205. Cambridge University Press, Cam-
mutative groups. Izvestiya Akad Nauk SSSR Ser Mat bridge, pp 207–235
13:329–340 Veech WA (1982) A criterion for a process to be prime.
Roy E (2009) Poisson suspensions and infinite ergodic Monatsh Math 94(4):335–341
theory. Ergodic Theory Dyn Syst 29(2):667–683 Weiss B (1998) Multiple recurrence and doubly minimal
Rudolph DJ (1979) An example of a measure preserving systems, topological dynamics and applications.
map with minimal self-joinings, and applications. A volume in honor of Robert Ellis. In: Proceedings of
J Anal Math 35:97–122 a conference in honor of the retirement of Robert Ellis,
Rudolph DJ (1990) Fundamentals of measurable dynam- Minneapolis, 5–6 Apr 1995. American Mathematical
ics: ergodic theory on Lebesgue spaces, Oxford Society, Providence, pp 189–196. (English)
Entropy in Ergodic Theory

Jonathan L. F. King¹ and Kyewon Koh Park²
¹ University of Florida, Gainesville, FL, USA
² Korea Institute for Advanced Study, Seoul, South Korea

Article Outline

Glossary
Definition of the Subject
Entropy Example: How Many Questions?
Distribution Entropy
A Glance at Shannon’s Noisy Channel Theorem
The Information Function
Entropy of a Transformation
Determinism and Zero-Entropy
The Pinsker-Field and K-Automorphisms
Ornstein Theory
Skew Product and Random Walk in Random Scenery
Topological Entropy
Entropy of a Flow (ℝ-Action)
Entropy of ℤ^d-Actions
Finitely Observable Invariant
Recent Progress
Exodos
Bibliography

Glossary

This entry includes some easy proofs and auxiliary results which illustrate how the ideas evolved in the field. This can be also useful for persons not in the field of ergodic theory. However, due to the page constraint, we jump to the brief presentation of recent developments of entropy theory, skipping many important intermediate advances.
Some of the following definitions refer to the “Notation” paragraph immediately below. Use mpt for “measure-preserving transformation.”

Almost everywhere (a.e.) A measure-theoretic statement holds almost everywhere, abbreviated a.e., if it holds off of a nullset. (Eugene Gutkin once remarked to us that the problem with Measure Theory is. . . that you have to say “almost everywhere,” almost everywhere.) For example, B ⊃^{a.e.} A means that μ(A\B) is zero. The a.e. will usually be implicit.
Factor map A factor map ψ : (X, 𝒳, μ, T) → (Y, 𝒴, ν, S) is a measure-preserving map ψ : X → Y which intertwines the transformations, ψ ∘ T = S ∘ ψ after deleting a nullset (a mass-zero set) in each space. And ψ is an isomorphism if this ψ is a bijection and ψ^{−1} is also a factor map. If X and Y are topological spaces, then we require ψ to be continuous and onto.
Fonts We use the font ℋ, h, I for distribution-entropy, entropy, and the information function. In contrast, the script font 𝒜, ℬ, 𝒞 . . . will be used for collections of sets; usually subfields of 𝒳. Use 𝔼(·) for the (conditional) expectation operator.
Measure-preserving map A measure-preserving map ψ : (X, 𝒳, μ) → (Y, 𝒴, ν) is a map ψ : X → Y such that the inverse image of each B ∈ 𝒴 is in 𝒳, and μ(ψ^{−1}(B)) = ν(B). A (measure-preserving) transformation is a measure-preserving map T : (X, 𝒳, μ) → (X, 𝒳, μ). Condense this notation to (X, 𝒳, μ, T) or (X, μ, T).
Measure space A measure space (X, 𝒳, μ) is a set X, a field (i.e., a σ-algebra) 𝒳 of subsets of X, and a countably-additive measure μ : 𝒳 → [0, ∞]. (We often just write (X, μ), with the field implicit.) For a collection 𝒞 ⊂ 𝒳, use Fld(𝒞) ⊃ 𝒞 for the smallest field including 𝒞. The number “μ(B) is the μ-mass of B.”
Notation ℤ = integers, ℤ+ = positive integers, and ℕ = natural numbers = 0, 1, 2, . . .. (Some well-meaning folk use ℕ for ℤ+, saying “Nothing could be more natural than the positive integers.” And this is why 0 ∈ ℕ.) Use ⌈·⌉ and ⌊·⌋ for the ceiling and floor functions; ⌊·⌋ is also called the “greatest-integer function.” For an interval
J ≔ [a, b) ⊂ [−∞, +∞], let [a. . .b) denote the interval of integers J ∩ ℤ (with a similar convention for closed and open intervals). For example, (e. . .π] = (e. . .π) = {3}. For subsets A and B of the same space, Ω, use A ⊂ B for inclusion and A ⊊ B for proper inclusion. The difference set B\A is {ω ∈ B | ω ∉ A}. Employ A^c for the complement Ω\A. Since we work in a probability space, if we let x ≔ μ(A), then a convenient convention is to have x^c denote 1 − x, since then μ(A^c) equals x^c. Use A△B for the symmetric difference [A\B] ∪ [B\A]. For a collection 𝒞 = {E_j}_j of sets in Ω, let the disjoint union ⊔_j E_j or ⊔(𝒞) represent the union ∪_j E_j and also assert that the sets are pairwise disjoint. Use “∀large n” to mean: “∃n0 such that ∀n > n0.” To refer to left hand side of an (20), use LhS(20); do analogously for RhS(20), the right hand side.
Partition A partition P = (A1, A2, . . .) splits X into pairwise disjoint subsets Ai ∈ 𝒳 so that the disjoint union ⊔_i Ai is all of X. Each Ai is an atom of P. Use |P| or #P for the number of atoms. When P partitions a probability space, then it yields a probability vector v⃗, where vj ≔ μ(Aj). Lastly, use P⟨x⟩ to denote the P-atom that owns x.
Probability space A probability space is a measure space (X, μ) with μ(X) = 1; this μ is a probability measure. All our maps/transformations in this entry are on probability spaces.
Probability vector A probability vector v⃗ = (v1, v2, . . .) is a list of nonnegative reals whose sum is 1. We generally assume that probability vectors and partitions (see below) have finitely many components. Write “countable probability vector/partition,” when finitely or denumerably many components are considered.
Topological map A topological space (X, 𝒱, T) is a compact set X with an open cover 𝒱 = {Ui}. Each Ui is open and ∪Ui = X. A transformation T is a continuous map from X to itself.

Definition of the Subject

The word “entropy” (originally German, Entropie) was coined by Rudolf Julius Emanuel Clausius circa 1865 (Clausius 1864, 1867), taken from the Greek ἐντροπία, “a turning towards.” This entry thus begins (Prolegomenon, “introduction”) and ends (Exodos (This is the Greek spelling.), “the path out”) in Greek.
Clausius, in his coinage, was referring to the thermodynamic notion in physics. Our focus in this entry, however, will be the concept in measurable and topological dynamics. (Entropy in differentiable dynamics would require an article by itself. For instance, see Katok and Hasselblatt 1995; Ledrappier and Young 1985; Mane 1987; Pesin 1977.) Shannon’s 1948 paper (Shannon 1948) on Information Theory, then Kolmogorov’s (1958) and Sinai’s (1959) generalization to dynamical systems, will be our starting point.
Our discussion will be of mostly the one-dimensional case, where the acting-group is ℤ. We will briefly mention continuous time flow, ℝ-action. Later we will discuss actions of group ℤ^d which is a stepping stone to amenable group actions.

    My greatest concern was what to call it. I thought of calling it ‘information’, but the word was overly used, so I decided to call it ‘uncertainty’. John von Neumann had a better idea, he told me, ‘You should call it entropy, for two reasons. In the first place your uncertainty function goes by that name in statistical mechanics. In the second place, and more important, nobody knows what entropy really is, so in a debate you will always have the advantage.’
    (Shannon as quoted in Tribus and McIrvine 1971)

Entropy Example: How Many Questions?

Imagine a dartboard, Fig. 1,¹ split in five regions A, . . ., E with known probabilities. Blindfolded, you throw a dart at the board. What is the expected number V of Yes/No questions needed to ascertain the region in which the dart landed?

Entropy in Ergodic Theory, Fig. 1 This dartboard is a probability space with a 5-set partition. The atoms have probabilities 1/2, 1/8, 1/8, 1/8, 1/8. This probability distribution will be used later in Meshalkin’s example

¹ In this entry, unmarked logs will be to base-2. In entropy theory, it does not matter much what base is used, but base-2 is convenient for computing entropy for messages described in bits. When using the natural logarithm, some people refer to the unit of information as a nat. In this entry, we have picked bits rather than nats. This holds when each probability p is a reciprocal power of two. For general probabilities, the “expected number of questions” interpretation holds in a weaker sense: Throw N darts independently at N copies of the dartboard. Efficiently ask Yes/No questions to determine where all N darts landed. Dividing by N, then sending N → ∞, will be the p log(1/p) sum of (1).

Solve this by always dividing the remaining probability in half. “Is it A?” if Yes, then V = 1. Else: “Is it B or C?” – if Yes, then “Is it B?” – if No, then the dart landed in C, and V = 3 was the number of questions. Evidently V = 3 also for regions B, D, E. Using “log” to denote base-2 logarithm, the expected number of questions is thus

\[ \mathbb{E}(V) = \tfrac12\cdot 1 + \tfrac18\cdot 3 + \tfrac18\cdot 3 + \tfrac18\cdot 3 + \tfrac18\cdot 3 \overset{\text{note}}{=} \sum_{j=0}^{4} p_j \log\frac{1}{p_j} = 2. \tag{1} \]

Letting v⃗ ≔ (1/2, 1/8, 1/8, 1/8, 1/8) be the probability vector, we can write this expectation as

\[ \mathbb{E}(V) = \sum_{x \in \vec v} h(x). \]

Here, h: [0, 1] → [0, ∞) is the important function (There does not seem to be a standard name for this function. We use h, since an uppercase h looks like an H, which is the letter that Shannon used to denote what we are calling distribution-entropy.)

Distribution Entropy

Given a probability vector v⃗, define its distribution entropy as

\[ \mathcal{H}(\vec v) := \sum_{x \in \vec v} h(x). \tag{3} \]

This entry will use the term distropy for “distribution entropy,” reserving the word entropy for the corresponding dynamical concept, when there is a notion of time involved. Getting ahead of ourselves, the entropy of a stationary process is the asymptotic average value that its distropy decays to, as we look at larger and larger finite portions of the process.
An equi-probable vector v⃗ ≔ (1/K, . . ., 1/K), with K entries, evidently has ℋ(v⃗) = log(K). On a probability space, the “distropy of partition P,” written ℋ(P) or ℋ(A1, A2, . . .) shall mean the distropy of probability vector v⃗ = (μ(A1), μ(A2), . . .).
A (finite) partition necessarily has finite distropy. A countable partition can have finite distropy, for example, ℋ(1/2, 1/4, 1/8, . . .) = 2. One could also have infinite distropy: Consider a piece B ⊂ X of mass 1/2^N. Splitting B into 2^k many equal-mass atoms gives an h-sum of 2^k·(k + N)/(2^k 2^N). Setting k = k_N ≔ 2^N − N makes this h-sum equal 1; so splitting the pieces of X = ⊔_{N=1}^{∞} B_N, with μ(B_N) = 1/2^N, yields an ∞-distropy partition.

Function h
The h(x) = x log(1/x) function has vertical tangent at x = 0, maximum at 1/e and, when graphed in nats slope −1 at x = 1. (Curiosity: Just in this paragraph we compute distropy in nats, that is, using natural logarithm. Given a small probability p ∈ [0, 1] and setting x ≔ 1/p, note that h(p) =
log(x)/x ≈ 1/π(x), where π(x) denotes the number of prime numbers less-equal x. (This approximation is a weak form of the Prime Number Theorem.) Is there any actual connection between the “approximate distropy” function ℋ_π(p⃗) ≔ Σ_{p ∈ p⃗} 1/π(1/p) and Number Theory, other than a coincidence of growth rate?)
Consider partitions P and Q on the same space (X, μ). Their join, written P ∨ Q, has atoms A ∩ B, for each pair A ∈ P and B ∈ Q. They are independent, written P ⊥ Q if μ(A ∩ B) = μ(A)μ(B) for each A, B pair. We write P ≽ Q and say that “P refines Q,” if each P-atom is a subset of some Q-atom. Consequently, each Q-atom is a union of P-atoms. Recall, for δ a real number, our convention that δ^c means 1 − δ, in analogy with μ(B^c) equaling 1 − μ(B) on a probability space.

Distropy Fact
For partitions P, Q, R on probability space (X, μ):

(a) ℋ(P) ≤ log(#P), with equality IFF P is an equi-mass partition.
(b) ℋ(Q ∨ R) ≤ ℋ(Q) + ℋ(R), with equality IFF Q ⊥ R.
(c) For δ ∈ [0, 1/2], the function δ ↦ ℋ(δ, δ^c) is strictly increasing.
(d) R ≼ P implies ℋ(R) ≤ ℋ(P), with equality IFF R =^{a.e.} P.

Proof Use the strict concavity of h(), together with Jensen’s inequality (Figs. 2 and 3).

Remark 1 Although we will not discuss it in this entry, most distropy statements remain true with “partition” replaced by “countable partition of finite distropy.”

Binomial Coefficients
The dartboard gave an example where distropy arises in a natural way. Here is a second example. For a small δ > 0, one might guess that the binomial co-efficient \binom{n}{δn} grows asymptotically (as n → ∞) like 2^{ϵn}, some small ϵ. But what is the correct relation between ϵ and δ? Well, Stirling’s formula n! ≈ [n/e]^n gives
Entropy in Ergodic Theory, Fig. 2 Using natural log, here are the graphs of: h(x) in solid red, ℋ(x, x^c) in dashed green, 1 − x in dotted blue. Both h(x) and ℋ(x, x^c) are strictly convex-down. The 1 − x line is tangent to h(x) at x = 1

Entropy in Ergodic Theory, Fig. 3 Using natural log: The graph of ℋ(x1, x2, x3) in barycentric coordinates; a slice has been removed, between z = 0.745 and z = 0.821. The three arches are copies of the distropy curve from Fig. 2
\[ \binom{n}{\delta n} \approx \frac{1}{\delta^{\delta n}\,[\delta^c]^{\delta^c n}} . \]

Thus, (1/n) log \binom{n}{δn} ≈ ℋ(δ, δ^c). But by means of the above distropy inequalities, we get an inequality true for all n, not just asymptotically.

Lemma 2 (Binomial Lemma) Fix a δ ∈ [0, 1/2] and let H ≔ ℋ(δ, δ^c). Then for each n ∈ ℤ+:

making use of (a), (b) in “Distropy Fact.” And #X equals LhS(4).

A Glance at Shannon’s Noisy Channel Theorem

We can restate the Binomial lemma using the Hamming metric on {0, 1}^n,

\[ \mathrm{Dist}(x, y) := \#\{\, i \in [1 \ldots n] \mid x_i \ne y_i \,\}. \]
\[ \mathcal{H}(P|B) = \sum_{A \in P} \frac{\mu(A \cap B)}{\mu(B)} \log\frac{1}{\mu(A \cap B)/\mu(B)} . \]

So conditioning P on a partition Q gives conditional distropy

\[ \mathcal{H}(P|Q) = \sum_{B \in Q} \mu(B)\,\mathcal{H}(P|B) = \sum_{A \in P,\, B \in Q} \mu(A \cap B) \log\frac{1}{\mu(A \cap B)/\mu(B)} . \tag{9} \]

A “dartboard” interpretation of ℋ(P|Q) is

Its integral

\[ \mathcal{H}(P|\mathcal{F}) := \int I_{P|\mathcal{F}} \, d\mu, \]

is the conditional distropy of P on ℱ.
When ℱ is the field of unions of atoms from some partition Q, then the number ℋ(P|ℱ) equals the ℋ(P|Q) from (9).
Write 𝒢_j ↗ ℱ to indicate that fields 𝒢_1 ⊂ 𝒢_2 ⊂ . . . are nested, and that Fld(∪_1^∞ 𝒢_j) = ℱ, a.e. The Martingale Convergence Theorem (p. 103 in (Petersen 1983)) gives (c) below.
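To make formula (9) concrete, the following Python sketch evaluates ℋ(P|Q) from a table of masses μ(A ∩ B). The example is our own, not the entry's: P is the five-region dartboard partition with masses 1/2, 1/8, 1/8, 1/8, 1/8, and Q is a hypothetical coarser partition ({A}, {B, C}, {D, E}). The comparison with ℋ(P ∨ Q) − ℋ(Q) uses the standard chain rule, which is assumed here rather than taken from the text above.

```python
from math import log2

def distropy(masses):
    """Distribution entropy (bits) of a list of atom masses summing to 1."""
    return sum(m * log2(1.0 / m) for m in masses if m > 0)

def conditional_distropy(joint):
    """H(P|Q) via formula (9); joint[i][j] is the mass of A_i ∩ B_j."""
    total = 0.0
    for j in range(len(joint[0])):
        mass_b = sum(row[j] for row in joint)   # mass of the Q-atom B_j
        for row in joint:
            mass_ab = row[j]
            if mass_ab > 0:
                total += mass_ab * log2(mass_b / mass_ab)
    return total

# Rows: dartboard regions A..E; columns: Q-atoms {A}, {B,C}, {D,E}.
joint = [
    [1/2, 0,   0],
    [0,   1/8, 0],
    [0,   1/8, 0],
    [0,   0,   1/8],
    [0,   0,   1/8],
]
H_P_given_Q = conditional_distropy(joint)
H_join = distropy([m for row in joint for m in row])   # here P v Q = P
H_Q = distropy([sum(row[j] for row in joint) for j in range(3)])
print(H_P_given_Q, H_join - H_Q)
```

In this example, once Q is known at most one further Yes/No question is needed, and it is needed exactly half of the time.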
Certainly h_n = c_n + ℋ(P_1 ∨ ⋯ ∨ P_{n−1}) = c_n + h_{n−1}, since T is measure-preserving. Induction gives h_n = Σ_{j=1}^{n} c_j. So the Cesàro averages (1/n)h_n converge to the entropy.

Theorem 5 The entropy of process (X, P, μ, T) equals

\[ h(T, P) := \lim_{n\to\infty} \frac{1}{n}\,\mathcal{H}(P_0 \vee \cdots \vee P_{n-1}) = \lim_{n\to\infty} \mathcal{H}\Big(P_0 \,\Big|\, \bigvee_{1}^{n} P_j\Big) = \mathcal{H}\Big(P_0 \,\Big|\, \bigvee_{1}^{\infty} P_j\Big). \]

Both limits are nonincreasing. The entropy h(T, P) ≥ 0, with equality IFF P ⊂ ∨_1^∞ P_j. And h(T, P) ≤ ℋ(P), with equality IFF T, P is an independent process.

Generators

. . . x_{−2} x_{−1} x_0 x_1 x_2 x_3 . . . ,

where x_n is P⟨T^n(x)⟩, the P-letter owning T^n(x).

The time-zero partition P separates points, under the action of the shift. This L-atom time-zero partition has P⟨x⟩ = P⟨y⟩ IFF x_0 = y_0. So no matter what shift-invariant measure is put on X, the time-zero partition will generate under the action of T. It is Kolmogorov who formulated measure-preserving transformation (X, 𝒳, μ, T) into the frame of stationary process, using a generating partition.

Time Reversibility
A transformation need not be isomorphic to its inverse. Nonetheless, the average-distropy-numbers show that h(T^{−1}, P) = h(T, P), although this is not obvious from the conditioning-definition of entropy. Alternatively,

\[ \mathcal{H}\Big(P_0 \,\Big|\, \bigvee_{1}^{n} P_j\Big) = \mathcal{H}(P_0 \vee \cdots \vee P_n) - \mathcal{H}(P_1 \vee \cdots \vee P_n) = \mathcal{H}(P_{-n} \vee \cdots \vee P_0) - \mathcal{H}(P_{-n} \vee \cdots \vee P_{-1}) = \mathcal{H}\Big(P_0 \,\Big|\, \bigvee_{-n}^{-1} P_j\Big). \]

² We are now at liberty to reveal that our X has always been a Lebesgue space, that is, measure-isomorphic to an interval, [0, 1].
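The two limits in Theorem 5 can be watched numerically. The Python sketch below is an illustration under our own assumptions: the process is a hypothetical two-state stationary Markov chain (transition matrix M and stationary vector PI chosen by us), h_n is computed by brute-force enumeration of all length-n names, and c_n is taken to be the increment h_n − h_{n−1}, in line with h_n = c_n + h_{n−1} above.

```python
from itertools import product
from math import log2

# Hypothetical two-state chain; PI is stationary for M (PI @ M == PI).
M = [[0.9, 0.1],
     [0.5, 0.5]]
PI = [5 / 6, 1 / 6]

def word_mass(word):
    """Mass of the cylinder set of points whose name starts with `word`."""
    mass = PI[word[0]]
    for a, b in zip(word, word[1:]):
        mass *= M[a][b]
    return mass

def block_distropy(n):
    """h_n = distropy of the join P_0 v ... v P_{n-1}, in bits."""
    masses = (word_mass(w) for w in product(range(2), repeat=n))
    return sum(m * log2(1.0 / m) for m in masses if m > 0)

hs = [block_distropy(n) for n in range(1, 11)]            # h_1 .. h_10
averages = [h_n / n for n, h_n in enumerate(hs, start=1)]  # (1/n) h_n
increments = [hs[0]] + [hs[k] - hs[k - 1] for k in range(1, len(hs))]  # c_n

for n, (avg, inc) in enumerate(zip(averages, increments), start=1):
    print(n, round(avg, 6), round(inc, 6))
```

For this chain the increments are constant from n = 2 onward, while the Cesàro averages decrease more slowly toward the same value.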
Proof (a) Evidently Q′ ∨ R = Q′ ∨ Q = Q ∨ R. So ℋ(Q′) ≤ ℋ(Q ∨ R) ≤ ℋ(Q) + δ.
(b) As above,

\[ \mathcal{H}\Big(\bigvee_{1}^{N} Q'_j\Big) \le \mathcal{H}\Big(\bigvee_{1}^{N} Q_j\Big) + \mathcal{H}\Big(\bigvee_{1}^{N} R_j\Big). \]

(c) Let K ≔ |Q|. Then there is a sequence of K-set partitions Q^{(L)} → Q with Q^{(L)} ≼ ∨_{−L}^{L} P_ℓ. By above, h(T, Q^{(L)}) → h(T, Q), so showing that

\[ h\Big(T, \bigvee_{-L}^{L} P_\ell\Big) \overset{?}{\le} h(T, P) \]

\[ h_N := \mathcal{H}\Big(\bigvee_{n=0}^{N-1} T^{n}\Big(\bigvee_{-L}^{L} P_\ell\Big)\Big) = \mathcal{H}\Big(\bigvee_{j=-L}^{N-1+L} P_j\Big). \]

So

Indeed, the failure happens already for process-entropy with respect to a fixed partition. A Bernoulli process T, P has positive entropy. Take mpts S_n → T, each isomorphic to an irrational rotation. Then each h(S_n, P) is zero, as shown in the later section “Determinism and Zero-Entropy.”

section “Glossary”) of T. Transformations T and S are weakly isomorphic if each is isomorphic to a factor of the other.
The foregoing entropy tools make short shrift of the following.

Lemma 9 (Entropy Lemma) Consider T-invariant sub-fields 𝒢_j and ℱ.

In particular, 𝒢 ⊂ ℱ implies that h(T↾𝒢) ≤ h(T↾ℱ), so entropy is an invariant of weak-isomorphism.
(b) h(T↾𝒢_1 ∨ 𝒢_2 ∨ ⋯) ≤ Σ_j h(T↾𝒢_j)
We denote S by B(1/4, 1/4, 1/4, 1/4) and T by B(1/2, 1/8, 1/8, 1/8, 1/8). After deleting invariant null-sets from X and Y, the construction will produce a measure-preserving isomorphism ψ : X → Y so that T ∘ ψ = ψ ∘ S.

The Code
In X, consider this point x:

. . . 0 0 0 −1 0 0 +1 +2 −1 +1 0 . . .

Regard each 0 as a left-parenthesis, and each nonzero as a right-parenthesis. Link them according to the legal way of matching parentheses, as shown in the top row, below:

the odd digit +1 or −1, as this D is linked to Positive or Negative.

Markov Shifts
A Bernoulli process T, P has independence P_{(−∞...0]} ⊥ P_1, whereas a Markov process is a bit less aloof:
The infinite Past P_{(−∞...0]} doesn’t provide any more information about Tomorrow than Today did.
That is, the conditional distribution P_1 | P_{(−∞...0]} equals P_1 | P_0. Equivalently,

\[ \mathcal{H}(P_1 | P_0) = \mathcal{H}\big(P_1 \,\big|\, P_{(-\infty \ldots 0]}\big) \overset{\text{note}}{=} h(T, P). \tag{13} \]

Entropy in Ergodic Theory, Fig. 4 Call the transition probabilities s ≔ Prob(a → a) for stay, and c ≔ Prob(a → b) for change. These are nonnegative reals, and s + c = 1
\[ \mu_s(baaaba) = p_b\, m_{ba}\, m_{aa}\, m_{aa}\, m_{ab}\, m_{ba} = \frac{c}{1+c} \cdot 1 \cdot s \cdot s \cdot c \cdot 1 . \]

The subscript on μ_s indicates the dependence on the transition probabilities; let’s also mark the mpt and call it T_s. Using (13), the entropy of our Markov map is

2n intervals. Thus, ℋ(P_0 ∨ ⋯ ∨ P_{n−1}) ≤ log(2n). And (1/n) log(2n) → 0.
Alternatively, the Shannon-McMillan-Breiman Theorem implies, for an ergodic process T, P, that the number of length-n names is approximately 2^{h(T,P)n}; this, after discarding small mass from the space. But the growth of n ↦ 2n is linear and so, for our rotation, h(T, P) must be zero.
the introduction of King (1988). (See Chap. 6 in Friedman (1970) as well as Ferenczi (1997) and Shields (1973) for examples of stacking constructions.)
A rank-1 transformation (X, μ, T) admits a generating partition P and a sequence of Rokhlin stacks S_n ⊂ X, with heights going to ∞, and with μ(S_n) → 1. Moreover, each of these Rokhlin stacks is P-monochromatic, that is, each level of the stack lies entirely in some atom of P.
Taking a stack of some height 2n, let B = B_n be the union of the bottom n levels of the stack. There are at most n many length-n names starting in B_n, by monochromaticity. Finally, μ(B_n) is almost 1/2, so is certainly larger than δ ≔ 1/3. Thus, Eq. (17) shows that our rank-1 T has zero entropy.

Restricting the random variables to be countably-valued, how much of the example survives? Joint work with Kalikow (Kalikow and King 1994) produced a countably-valued stationary V which is nonconsecutively independent as well as deterministic. (Strong determinism is ruled out, due to cardinality considerations.) A side-effect of the construction is that V’s time-reversal n ↦ V_{−n} is not deterministic.

The Pinsker-Field and K-Automorphisms

Consider the collection of zero-entropy sets,

\[ \mathcal{Z} = \mathcal{Z}_T := \{\, A \in \mathcal{X} \mid h(T, (A, A^c)) = 0 \,\}. \tag{18} \]
the past.

Unsurprisingly, the Pinsker factor T↾𝒵 has zero entropy, that is, h(T↾𝒵) = 0. A transformation
initial conditions”: For each fixed length L, the distant future

\[ \bigvee_{j \in (G \ldots G+L]} P_j \]

becomes more and more independent of

\[ \bigvee_{j \in (-\infty \ldots 0]} P_j , \]

as the gap G → ∞.
A transformation T is a Kolmogorov automorphism if it possesses a generating partition P for which T, P is a K-process.
Here is a theorem that relates the “asymptotic forgetfulness” of Tail(T, P) = ∅, to the lack of determinism implied by having no zero-entropy factors (see Walters (1981), p. 113. Related results appear in Berg (1975)).

Theorem 11 (Pinsker-Algebra Theorem) Suppose P is a generating partition for an ergodic T. Then Tail(T, P) equals 𝒵_T.

Since 𝒵_T does not depend on P, this means that all generating partitions have the same tail field, and therefore K-ness of T can be detected from any generator.
Another nonevident fact follows from the above. The future field of T, P is defined to be Tail(T^{−1}, P). It is not obvious that if the present is independent of the distant past, then it is automatically independent of the distant future. (Indeed, the precise definitions are important; witness section “Cautions on Determinism’s Relation to Zero-Entropy.”) But since the entropy of a process, h(T, (A, A^c)), equals the entropy of the time-reversed process h(T^{−1}, (A, A^c)), it follows that 𝒵_T equals 𝒵_{T^{−1}}.

complete isomorphism-invariant of Bernoulli transformations; that is, that two independent processes with same entropy necessarily have the same underlying transformation. Earlier, Sinai (1962) had shown that two such Bernoulli maps were weakly isomorphic, that is, each isomorphic to a factor of the other.
Ornstein introduced the notion of a process being finitely determined, see Ornstein (1970) for a definition, proved that a transformation T was Bernoulli IFF it had a finitely determined generator IFF every partition was finitely determined with respect to T and showed that entropy completely classified the finitely determined processes up to isomorphism.
Later he developed the equivalent property, “very weak Bernoulli,” which is much easier to check and hence used often to prove the Bernoullicity (Ornstein and Weiss 1974). Moreover, the definition distinguishes Bernoullicity from Kolmogorov automorphisms. To define these properties, he introduced d̄-metric between two processes, (X, P, μ, T) and (Y, Q, ν, S), where |P| = |Q|. If two processes are d̄-close, then there exists φ : X → Y such that for all large n, the average Hamming metric between n-names of φ(P) and Q are close except on a set of small measure.
This seminal result and the ideas in the proof led to a vast machinery for proving transformations to be Bernoulli, as well as classification and structure theorems (Ornstein 1974; Shields 1973; Thouvenot 1975). Showing that the class of K-automorphisms far exceeds the Bernoulli maps, Ornstein and Shields produced in Ornstein and Shields (1973) an uncountable family of nonisomorphic K-automorphisms all with the same entropy.

Skew Product and Random Walk in Random Scenery
Main idea of the proof is that {T, T1}- of the n-step past and n-step future of P . Since
transformation does not have the property of (T, P) is an independent process,
¼ 0:
V _ W≔fV \ W j V V and W W g;
Hence, the mutual information comes from T V≔ T 1 ðU Þj U V
only the scenery part. This makes it possible for and V ½0...nÞ ≔V 0 _ V 1 _ _ V n1 where V i ¼ T i V 0 ; W V,
us to approximate (recognize) the scenery entropy if each W patch is a subset of some V patch:
if n is sufficiently large.
The second is that for sufficiently small num- The T, V entropy is
ber d, the number of d-balls in d, Bd,n ðAÞ ¼
Bjd An1
0 , B0
n1
< d, B n1 i
i¼0 T P also giv hðT, V Þ ¼ htop ðT, V Þ
es nice to almost the entropy. (The second is the 1 ð20Þ
generalization of Shannon-McMillan Theorem.) ≔ lim sup ℋtop V ½0...nÞ :
n!1 n
Proof (c) Let 𝒞 ≼ 𝒱 be a min-cardinality subcover. Then T𝒞 is a subcover of T𝒱. So Card(T𝒱) ≤ |T𝒞| = |𝒞|. As for entropy, inequality (b) and the foregoing give ℋ(𝒱_{[0...n)}) ≤ nℋ(𝒱).
(d) Set s_n ≔ ℋ(𝒱_{[0...n)}). Then

\[ s_{k+l} \le s_k + \mathcal{H}\big(T^{k}(\mathcal{V}_{[0 \ldots l)})\big) \le s_k + s_l , \]

by (b) and (c), and so the Subadditive Lemma 14, applies. (g) WLOG, ℓ = 3. Given 𝒱 a cover, triple it to 𝒱̂ ≔ 𝒱 ∩ T𝒱 ∩ T²𝒱; so

\[ \bigvee_{j \in [0 \ldots N)} (T^{3})^{j}(\hat{\mathcal{V}}) = \bigvee_{i \in [0 \ldots 3N)} T^{i}(\mathcal{V}). \]

Thus, ℋ(T³, 𝒱̂, N) = ℋ(T, 𝒱, 3N), extending notation. Part (d) and sending N → ∞, gives h(T³, 𝒱̂) = 3h(T, 𝒱). Lastly, take covers such that

h(T³, 𝒞^{(k)}) → h_top(T³) and h(T, 𝒟^{(k)}) → h_top(T),

as k → ∞. Define 𝒱^{(k)} ≔ 𝒞^{(k)} ∨ 𝒟^{(k)}. Apply the above to 𝒱^{(k)}, then send k → ∞.

Metric Preliminaries
An ϵ-ball-cover comprises finitely many balls, all of radius ϵ. Since our space is compact, every cover 𝒱 has a Lebesgue number ϵ > 0. I.e., for each z ∈ X, the Bal(z, ϵ) lies entirely inside at least one 𝒱-patch. (In particular, there is an ϵ-ball-cover which refines 𝒱.) Let LEB(𝒱) be the supremum of the Lebesgue numbers. Courtesy of Lemma 15f we can fix a “universal” list 𝒱^{(1)} ≼ 𝒱^{(2)} ≼ . . ., with 𝒱^{(k)} a (1/k)-ball cover: For every T : X → X, then, the lim_k h(T, 𝒱^{(k)}) computes h_top(T).

An ϵ-Microscope Three notions are useful in examining a metric space (X, d) at scale ϵ. Subset A ⊂ X is an ϵ-separated-set, if d(z, z′) ≥ ϵ for all distinct z, z′ ∈ A. Subset F ⊂ X is ϵ-spanning if ∀x ∈ X, ∃z ∈ F with d(x, z) < ϵ. Lastly, a cover 𝒱 is ϵ-small if Diam(U) < ϵ, for each U ∈ 𝒱.

You Take the High Road and I’ll Take the Low Road
There are several routes to computing top-ent, some via maximization, others, minimization. Our foregoing discussion computed h_top(T) by a family of sizes f_k(n) = f_k^T(n), depending on a
parameter k which specifies the fineness of scale. (In section “Metric Preliminaries,” this k is an integer; in the original definition, an open cover.) Define two numbers:

\[ \overline{L}^{f}(k) := \limsup_{n\to\infty} \frac{1}{n} \log f_k(n) \quad\text{and}\quad \underline{L}^{f}(k) := \liminf_{n\to\infty} \frac{1}{n} \log f_k(n). \tag{22} \]

Finally, let h^f(T) ≔ sup_k \overline{L}^f(k). If the limit exists in (22) then agree to write L^f(k) for the common value. The A-K-M definition used the size f_𝒱(n) ≔ Card(𝒱_{[0...n)}), where

Card(𝒲) ≔ Minimum cardinality of a subcover from 𝒲.

Proof (a) Take F ⊂ X, a min-cardinality d_n-ϵ-spanning set. So ∪_{z∈F} D_z = X, where

\[ D_z := d_n\text{-}\mathrm{Bal}(z, \epsilon) \overset{\text{note}}{=} \bigcap_{j=0}^{n-1} T^{-j}\big(\mathrm{Bal}(T^{j} z, \epsilon)\big). \]

This 𝒟 ≔ {D_z}_z is a cover, and it is d_n-2ϵ-small. Thus, Cov(n, 2ϵ) ≤ |𝒟| = |F|. For any metric, a maximal ϵ-separated-set is automatically ϵ-spanning; adjoin a putative unspanned point to get a larger separated set.
Let A be a max-cardinality d_n-ϵ-separated set. Take 𝒞, a min-cardinality subcover of 𝒲_{[0...n)}. For each z ∈ A, pick a 𝒞-patch C_z ∋ z. Could some pair x, y ∈ A pick the same C? Well, write C = ∩_{j=0}^{n−1} T^{−j}(W_j), with each W_j ∈ 𝒲. For every j ∈ [0. . .n), then,

\[ d\big(T^{j}(x), T^{j}(y)\big) \le \mathrm{Diam}(W_j) < \epsilon . \]

Here are three metric-space sizes f(n):
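The notions of d_n-ϵ-separated and ϵ-spanning sets can be explored numerically. The Python sketch below is an illustration under our own assumptions: the space is the circle ℝ/ℤ sampled on a finite grid, d_n(x, y) is taken to be the maximum of d(T^j x, T^j y) over j ∈ [0. . .n), and a greedy pass builds a maximal ϵ-separated subset of the grid, which is then automatically ϵ-spanning for the grid, as in the proof above. The maps, grid, and names are hypothetical choices, not the entry's.

```python
def circle_dist(x, y):
    """Distance on the circle R/Z."""
    d = abs(x - y) % 1.0
    return min(d, 1.0 - d)

def bowen_dist(T, x, y, n):
    """d_n(x, y) = max over j in [0..n) of d(T^j x, T^j y)."""
    best = 0.0
    for _ in range(n):
        best = max(best, circle_dist(x, y))
        x, y = T(x), T(y)
    return best

def greedy_separated(points, dist, eps):
    """Greedily pick a maximal eps-separated subset of `points`."""
    chosen = []
    for p in points:
        if all(dist(p, q) >= eps for q in chosen):
            chosen.append(p)
    return chosen

ALPHA = 0.5 ** 0.5                      # an irrational rotation angle
rotation = lambda x: (x + ALPHA) % 1.0  # an isometry of the circle
doubling = lambda x: (2.0 * x) % 1.0    # an expanding map
GRID = [k / 200 for k in range(200)]
EPS = 0.123

def sep_size(T, n):
    """Size of a greedy d_n-EPS-separated subset of the grid."""
    return len(greedy_separated(GRID, lambda x, y: bowen_dist(T, x, y, n), EPS))

for n in (1, 2, 4, 6):
    print(n, sep_size(rotation, n), sep_size(doubling, n))
```

For the rotation the count does not change with n, whereas for the doubling map it grows, which is the contrast that the sizes f(n) are designed to measure.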
limit exists, in arguments that subsequently send $\epsilon \searrow 0$. Ditto for $L^{Spn}(\epsilon)$.

This will be used during the proof of the Variational Principle. But first, here are two entropy computations which illustrate the efficacy in having several characterizations of topological entropy.

$h_{top}(\text{Isometry}) = 0$
Suppose $(T : X, d)$ is a distance-preserving map of a compact metric-space. Fixing ε, a set is $d_n$-ε-separated IFF it is $d$-ε-separated. Thus, $\mathrm{Sep}(n, \epsilon)$ does not grow with n. So each $L^{Sep}(\epsilon)$ is zero.

Topological Markov Shifts
Imagine ourselves back in the days when computer data is stored on large reels of fast-moving magnetic tape. One strategy to maximize the density of binary data stored is to not put timing-marks (which take up space) on the tape. This has the defect that when the tape-writer writes, say, 577 consecutive 1-bits, then the tape-reader may erroneously count 578 copies of 1. We sidestep this flaw by first encoding our data so as to avoid the $\underbrace{11\cdots1}_{577}$ word, then writing to tape.

Generalize this to a finite alphabet Q and a finite list F of disallowed Q-words. Extend each word to a common length $K + 1$; now $F \subset Q^{K+1}$

set of points $x \in X$ is the set of doubly-∞ paths through the graph. Once trivialities removed, this X is a Cantor set and the shift $T : X \to X$ is a homeomorphism (see Chaps. 2 and 6 in Lind and Marcus 1996).

The Golden Shift
As the simplest example, suppose our magnetic-tape is constrained by the Markov graph, Fig. 5 that we studied measure-theoretically in Fig. 4. We want to store the text of The Declaration of Independence on our magnetic tape. Imagining that English is a stationary process, we'd like to encode English into this Golden TMS as efficiently as possible. We seek a shift-invariant

Entropy in Ergodic Theory, Fig. 5 Ignoring the labels on the edges, for the moment, the Golden shift, T, acts on the space of doubly-infinite paths through this graph. The space can be represented as a subset $X_{Gold} \subset \{a, b\}^{\mathbb{Z}}$, namely, the set of sequences with no two consecutive b letters
But recall, $h(T) = \mathcal{H}(P_1 \mid P_{(-\infty \ldots 0]}) \le \mathcal{H}(P_1 \mid P_0)$. So among all measures that make the conditional distribution $P \mid a$ equal $(s, c)$, the unique one maximizing entropy is the $(s, c)$-Markov process. Its entropy, derived in (14), is

$$f(s) := \frac{1}{2-s}\,\mathcal{H}(s, 1-s) = \frac{-1}{2-s}\,\bigl[s \log(s) + (1-s)\log(1-s)\bigr]. \tag{25}$$

Certainly $f(0) = f(1) = 0$, so f's maximum occurs at the (it turns out) unique point $\hat{s}$ where the derivative $f'(\hat{s})$ equals zero. This $\hat{s} = (-1 + \sqrt{5})/2$. Plugging in, the maximum entropy supportable by the Golden Shift is

$$\mathrm{MaxEnt} = \frac{2}{5-\sqrt{5}} \left[ \frac{-1+\sqrt{5}}{2} \log\frac{2}{-1+\sqrt{5}} + \frac{3-\sqrt{5}}{2} \log\frac{2}{3-\sqrt{5}} \right]. \tag{26}$$

Exponentiating, the number of μ-typical n-names grows like $G^n$, where

$$G := \left[\frac{2}{-1+\sqrt{5}}\right]^{\frac{-1+\sqrt{5}}{5-\sqrt{5}}} \cdot \left[\frac{2}{3-\sqrt{5}}\right]^{\frac{3-\sqrt{5}}{5-\sqrt{5}}}. \tag{27}$$

This expression looks unpleasant to simplify – it is not even obviously an algebraic number – and yet topological entropy will reveal its familiar nature. This is because the Variational Principle (proved in the next section) says that the top-ent of a system is the supremum of measure-entropies supportable by the system3.

$$d(x, x') := \frac{1}{1 + |m|}, \tag{28}$$

for the smallest $|m|$ with $x_m \ne x'_m$.

Lemma 17 Consider a subshift X. Then the

$$\lim_{n\to\infty} \frac{1}{n} \log(\mathrm{Names}_X(n))$$

exists in $[0, \infty]$, and equals $h_{top}(X)$.

Proof With $\epsilon \in (0, 1)$ fixed, two n-names are $d_n$-ε-separated IFF they are not the same name. Hence, $\mathrm{Sep}(n, \epsilon) = \mathrm{Names}_X(n)$.

To compute $h_{top}(X_{Gold})$, declare that a word is "golden" if it appears in some $x \in X_{Gold}$. Each [n + 1]-golden word ending in a has form wa, where w is n-golden. An [n + 1]-golden word ending in b, must end in ab and so has form wab, where w is [n − 1]-golden. Summing up, $\mathrm{Names}_{X_{Gold}}(n+1) = \mathrm{Names}_{X_{Gold}}(n) + \mathrm{Names}_{X_{Gold}}(n-1)$. This is the Fibonacci recurrence, and indeed, these are the Fibonacci numbers, since $\mathrm{Names}_{X_{Gold}}(0) = 1$ and $\mathrm{Names}_{X_{Gold}}(1) = 2$. Consequently, we have that

$$\mathrm{Names}_{X_{Gold}}(n) \approx \mathrm{Const} \cdot \lambda^n,$$

where $\lambda = \frac{1+\sqrt{5}}{2}$ is the Golden Ratio. So the sesquipedalian number G from (27) is simply λ, and $h_{top}(X_{Gold}) = \log(\lambda)$. Since $\log(\lambda) \approx 0.694$, each thousand bits written on tape (subject to the "no

3 A popular computer-algebra-system was not, at least under our in-expert tutelage, able to simplify this. However, once top-ent gave the correct answer, the software was able to detect the equality.
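The Golden Shift computation above can be checked numerically. The following is a minimal sketch, not part of the original text: the function names `count_golden_words` and `markov_entropy` are hypothetical, logarithms are assumed to be base 2 (which matches the value 0.694), and the critical point is taken to be (−1 + √5)/2. It counts golden words via the Fibonacci-type recurrence, estimates (1/n) log Names(n), and evaluates the number G of (27) as 2 raised to the maximal value of formula (25).

```python
import math

def count_golden_words(n):
    """Count words of length n over {a, b} that contain no 'bb'."""
    if n == 0:
        return 1  # the empty word
    ends_a, ends_b = 1, 1  # golden words of length 1, split by last letter
    for _ in range(n - 1):
        # appending 'a' is always allowed; appending 'b' only after an 'a'
        ends_a, ends_b = ends_a + ends_b, ends_a
    return ends_a + ends_b

def markov_entropy(s):
    """Formula (25) in bits: entropy of the (s, 1 - s)-Markov measure."""
    h = -(s * math.log2(s) + (1 - s) * math.log2(1 - s))
    return h / (2 - s)

golden_ratio = (1 + math.sqrt(5)) / 2
s_hat = (-1 + math.sqrt(5)) / 2  # assumed critical point of formula (25)

counts = [count_golden_words(n) for n in range(8)]
growth_rate = math.log2(count_golden_words(400)) / 400  # (1/n) log Names(n)
G = 2 ** markov_entropy(s_hat)  # the number G of (27)

print(counts)
print(growth_rate, math.log2(golden_ratio))
print(G, golden_ratio)
```

Under these assumptions the word counts follow the Fibonacci pattern, the empirical growth rate approaches log(λ), and G agrees with λ up to floating-point error, which is the equality that the Variational Principle predicts.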
This right hand side, the spectral radius of A, means the maximum of the absolute values of A's eigenvalues. So the top-ent of a TMS is thus

The Variational Principle
Let $M := M(X, d)$ be the set of Borel probability measures, and $M(T) := M(X, d, T)$ the set of the T-invariant $\mu \in M$. Assign
If L is finite then there is no measure μ of maximal entropy; for μ must give mass to some $Y_k$; this pulls the entropy below L, since there are no compensatory components with entropy exceeding L.

In contrast, when $L = \infty$ then there is a maximal-entropy measure (put mass $1/2^j$ on some component $Y_{k_j}$, where $k_j \nearrow \infty$ swiftly); indeed, there are continuum-many maximal-entropy measures. But there is no ergodic measure of maximal entropy. (The ergodic measures are the extreme points of $M(T)$; call them $M_{Erg}(T)$. This $M(T)$ is the set of barycenters obtained from Borel probability measures on $M_{Erg}(T)$ (see Krein-Milman theorem, Choquet_theory in Wikipedia). In this instance, what explains the failure to have an ergodic maximal-entropy measure? Let $\mu_k$ be an invariant ergodic measure on $Y_k$. These measures do converge to the one-point (ergodic) probability measure $\mu_\infty$ on $Y_\infty$. But the map $\mu \mapsto h_\mu(T)$ is not continuous at $\mu_\infty$.)

For a concrete $L = \infty$ example, let $S_k$ be the shift on $[1 \ldots k]^{\mathbb{Z}}$ (see Buzzi and Ruette 2006; Denker 1976; Misiurewicz 1973 for measures of maximal entropy.)

μ-nice too, then $U(A^c) \le \mu(A^c)$. Thus, $\lim_L \alpha_L(A)$ exists and equals $\mu(A)$. Because $C := \overline{A}$ is closed, the continuous functions $f_N \searrow 1_C$ pointwise, where

$$f_N(x) := 1 - \mathrm{Min}(N \cdot d(x, C), 1).$$

By the Monotone Convergence theorem, then,

$$\int f_N \, d\mu \xrightarrow{\;N\;} \mu(C).$$

And $\mu(C) = \mu(A)$, since A is nice. Fixing N, then, it suffices to establish $U(A) \le \int f_N \, d\mu$. But $f_N$ is continuous, so

$$\int f_N \, d\mu = \limsup_{L\to\infty} \int f_N \, d\alpha_L \ge \limsup_{L\to\infty} \int 1_A \, d\alpha_L = U(A).$$

Corollary 20 Suppose $\alpha_L \to \mu$, and partition P is μ-nice. Then $\mathcal{H}_{\alpha_L}(P) \to \mathcal{H}_\mu(P)$.

The diameter of partition P is $\mathrm{Max}_{A \in P}\, \mathrm{Diam}(A)$.
tℋm ðRÞ þ ð1 tÞℋn ðRÞ ℋtmþð1tÞn ðRÞ: Remark 24 The idea in the following proof is to
mostly fill interval [0 . . . L) with N-blocks,
Proof of the Variational Principle As usual we starting with a offset K [0. . N). Averaging
divide the proof into 2 parts. EntSup(T) htop(T) over the offset will create a Cesàro average over
and EntSup(T) htop(T). each N-block. Averaging over the N-blocks will
Strategy for EntSup(T) htop(T). Choose an allow us to compute distropy with respect to the
e > 0. For L ¼ 1, 2, 3, . . ., take a maximal averaged measure, aL.
(L, e)-separated-set FL X, then define
Proof (of (32)) Since L is fixed, agree to use ’ for
1 the ’L probability measure. Our dL - ϵ-separated
F ¼ Fe ≔ lim sup log ðjFL jÞ:
L!1 L set FL has at most one point in any given atom of
Q[0. . .L), thereupon
Let ’L() be the equi-probable measure on FL;
each point has weight 1/ j FLj. The desired invari- log ðjFL jÞ ¼ ℋ’ Q½0...LÞ :
ant measure m will come from the Cesàro averages
1
aL ≔ T ‘ ’L , Regardless of the “offset” K [0 . . . N), we
L
‘ ½0...LÞ can always fit C≔ LN N many N-blocks into
[0. . .L). Denote by G ðK Þ≔½K . . . K þ CN Þ, this
which get more and more invariant. union of N-blocks, the good set of indices. Let
ℬðK Þ≔½0 . . . LÞ∖G ðKÞ be the bad index-set.
Lemma 23 Let m be any weak- accumulation Therefore,
point of the above faL g11 . (Automatically, m is
T-invariant.) Then hm(T) F. Indeed, if Q is any Bad ðK Þ
m-nice partition with Diam (Q) < e, then hm(T,
Q) F. ℋ’ Q½0...LÞ ℋ’ V Qj þ
jℬðK Þ
1 1?
log ðjFL jÞ d þ ℋaL ðPÞ, ð32Þ This is less than d, since L is large. Applying
L N
1
NL K ½0...N Þ to (28) now produces
since this and Corollary 20 will prove (31): Push-
ing L ! 1 along the sequence that produced m 1 1
log ðjFL jÞ d þ Good ðK Þ: ð34Þ
essentially sends LhS(32) to F, courtesy (24). And L NL K
RhS(32) goes to d þ N1 ℋm ðPÞ, by Corollary 20,
since P is m-nice. Descending d ↘ 0, hands us the Note
needed (31).
GoodðK Þ ℋ’ T KþcN P :
c
Entropy of a Flow(ℝ-Action)
This latter, by definition, equals
c ℋT KþcN
ð’Þ ðPÞ. We conclude that Historically study of dynamical systems started
with the continuous ℝ-action governed by New-
1
GoodðK Þ
1
ℋT KþcN ’ ðPÞ
ton’s law. We denote the system by ðO, G, n, T t Þ
NL K
NL K c where T t2 ðT t1 ðxÞÞ ¼ T t2 þt1 ðxÞ. The geodesic flows
1
ℋT ‘ ’ ðPÞ, and horocycle flows in homogeneous dynamics
NL
‘ ½0...LÞ are well-known example of ℝ-actions. For each
by adjoining a few translates of P, given t0 , ðO, G, n, T t0 Þ gives rise to a ℤ-action.
1 We define h(Tt) ¼ h(T1), entropy of time 1 map. It
ℋaL ðPÞ,
N is not hard to see that h(Tct) ¼ ch(Tt) for every
by the Distropy Averaging Lemma 22,
c ℝ. In particular, If c is an integer, then h-
1 ‘
(Tct) ¼ h(Tc) ¼ ch(T1) ¼ ch(Tt) (see Lemma 9d).
since aL is the average L ‘T ’ . Thus, (34) Ambrose (1941) has shown that every ergodic
implies (32), our goal. flow ðO, G, n, T t Þ can be represented as a flow
with a transformation on X and a ceiling function
Proof of EntSup(T) htop(T). Fix a T-invariant f(x), which is measurable with respect to X. When
measure m. For partition Q ¼ (B1, . . . , BK), we denote the base by ðX, X , m, T Þ, then the flow Tt
choose a compact set Ak Bk with m(Bk\Ak) acts as follows: each point o ¼ (x, t) moves
small. (This can be done, since m is automatically straight up at unit speed until it hits (x, f(x))
a regular measure (Rudin 1973).) Letting which is identified as (Tx, 0). Then it continues
D ≔ [ iAi]c and P ≔ (D, A1, . . . , AK), we can up at unit speed. The base ðX, X , m, T Þ is called a
have made ℋ(P| Q) as small as desired. Courtesy cross section of the flow. It is clear that the flow
of Lemma 8b, then, we only need consider parti- has positive entropy IFF the base ðX, X , m, T Þ has
tions of the form that P has. positive entropy. Unpredictability of the base is
Open-cover V ¼ ðU 1 , . . ., U K Þ has patches inherited by the flow.
Uk ≔ D [ Ak. What atoms of, say, P[0,1,2], can the The flow has the following structure (Fig. 6).
intersection U9 \ T1(U2) \ T2(U5) touch? Only
the eight atoms
f (x)
ðD or A9 Þ \ T 1 ðD or A2 Þ \ T 2 ðD or A5 Þ:
(x, f (x))
Thus, ♯ P½0...nÞ 2n:♯ V ½0...nÞ . Here ♯() counts
the number of nonvoid atoms/patches.) So
1 1 (x, t)
ℋ P 1 þ log ♯ V ½0...nÞ
n m ½0...nÞ n
x (X, X , µ, T ) Tx
1 þ 1 þ htop ðT Þ;
this last inequality, when n is large. The upshot: Entropy in Ergodic Theory, Fig. 6 In the representation
hm(T) 2 + htop(T). of ðO, G, n, T t0 Þ, (x, f(x)) is identified as (Tx, 0)
where P ¼ V(i, j) ≺ (0, 0)s(i, j)P and “≺” is a Hence, h(T1) ¼ 1. Similar reasoning for topo-
lexicographic order. logical entropy gives that if htop(s) > 0, then
In more general amenable groups, the htop(s(i, j)) ¼ 1 for all (i, j) ℤ2.
sequence of square Rn is replaced by a sequence
of Følner sets. Existence of Følner sequence Complexity of Zero Entropy Actions
guarantees the existence of invariant probability As the study of dynamical systems expands to big-
measure under the action (Ornstein and Weiss ger group actions, entropy zero systems arise more
1987). naturally with diverse growth rates of orbits. We
If h(s, P) > 0, then ℋ Vði,jÞ Rn sði,jÞ P is in consider the following ℤ2-action s ¼ {T1, T2}
the order of n2. We have the following theorem. where h(s) ¼ 0.
Theorem 26 If h(s) > 0, then h T i1 T 2j ¼ 1. Let s ¼ {T1, T2} be an action where T1 and T2
2 are irrational rotations by a1 and a2 respec-
1 8ði, jÞ ℤ ∖fð0, 0Þg.
tively on a circle. Then h(T1) ¼ h(T2) ¼ 0.
Proof For simplicity we will prove for the case Hence, h(s) ¼ 0.
(i, j) ¼ (1, 0). Let P be a generating partition under 2. We have the following ℤ2-subshift called three
s. We note that a generating partition P under s is dot model.
not necessarily a generating partition for Let X ¼ fðxði,jÞ : xiþ1,j þ xi,j þ xi,jþ1
T1 ¼ T(1,0). Let Pm denote the partition 2
k
0 ðmod 2Þg f0, 1gℤ . It is not hard to see
Vmj¼k T 2j P and let X m ¼ V1 i m
1 T 1 Pm . Note that
that lim m!1 X m ↗X .
1
(a) lim n!1 2n ℋtop Vði,jÞ Rn sði,jÞ P ¼ 1.
hðT 1 Þ ¼ h T , Pm
m Hence, htop(s) ¼ 0.
¼ lim ℋ Pm i m
1 (b) For each Pmm ,
m!1 m j V T 1 Pm
i¼1
m 1 k1 1
¼ lim ℋ T k2 Pj V T i Pm _ V T j P
m!1
k¼m
i¼1 1 m j¼m 2 ℋtop Pm i m
m j V T 1 Pm
i¼1
m 1 k1 1
ℋ Pj T k V T i m
V T j
¼ lim
m!1 2 i¼1 1 Pm _ j¼m 2 P ¼ ℋtop T 2 Pj V T i
m
Pm VPm1 ¼ 1:
k¼m i¼1 1 m m
Since P ¼ V(i, j) ≺ (0, 0)s(i, j)P is finer algebra Hence, htop(T1) ¼ 1. Likewise htop(T2) ¼ 1.
j
than T k
2 V1 i m k1
i¼1 T 1 Pm _ V j¼m T 2 P ,
by Conditional-Distropy Fact (c), we have There exists an obvious invariant measure m on
X such that each atom of V(i, j) [0, m) [0, n)
j s(i, j)P has measure 2(m+n). With respect to m,
ℋ PjT k
2 V1 i m k1
i¼1 T 1 Pm _ V j¼m T 2 P we have h(T1) ¼ h(T2) ¼ 1. Hence, h(s) ¼ 0.
> hðs, PÞ More generally, if Y is a shift space with the left
shift T and f is a continuous map from Y to itself,
for each m. Hence then it is well known that f is a block code since f
is uniformly continuous. If f is not invertible,
hðT 1 Þ ℋ Pm 1 i m then we use the “natural extension method” to
m jVi¼1 T 1 Pm
m extend the space Y to X {0, 1}ℤ as in the
hðs, PÞ above example (2). If Y has an invariant measure
k¼m and f is measure-preserving, then X carries an
¼ ð2m þ 1Þ hðs, PÞ for all m invariant measure.
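The growth rate claimed for the three dot model in example (2) can be checked by brute force for small squares. The sketch below is an illustration only, under the following assumptions: the rule is read as x(i, j+1) = x(i, j) + x(i+1, j) mod 2, so that an n-by-n square is determined by a bottom row of length 2n − 1, and every such bottom row extends to a point of X. The function name `square_patterns` is hypothetical.

```python
from itertools import product

def square_patterns(n):
    """Return the set of n-by-n patterns occurring in the three dot model."""
    patterns = set()
    # A bottom row of length 2n - 1 determines the whole n-by-n square.
    for bottom in product((0, 1), repeat=2 * n - 1):
        rows = [bottom]
        for _ in range(n - 1):
            prev = rows[-1]
            # three dot rule: x(i, j+1) = x(i, j) + x(i+1, j) mod 2
            rows.append(tuple((prev[i] + prev[i + 1]) % 2
                              for i in range(len(prev) - 1)))
        # keep only the leftmost n columns of each of the n rows
        patterns.add(tuple(row[:n] for row in rows))
    return patterns

pattern_counts = {n: len(square_patterns(n)) for n in range(1, 6)}
print(pattern_counts)
```

With this reading, the number of n-by-n patterns is 2^(2n−1), so the logarithm of the count grows linearly in n. Dividing by 2n gives a limit of 1, while dividing by n² gives 0, which is consistent with zero entropy for the ℤ²-action.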
To investigate the complexity of entropy zero noncocompact subgroup action of ℤ2. To under-
ℤ -actions, J. Milnor (1988) introduced the notion
2
stand entropy zero ℤd-actions (d > 2), we need to
! !
of directional entropy, h v for v ¼ ðx, yÞ ℝ2 investigate the complexity of noncocompact sub-
group actions of ℤi (0 i d-1) in addition to
as follows. Let P be a partition of X. We denote the
directions.
partition s(i, j)P by P(i, j)
In the case of ℤ2-action generated by a shift
and a Cellular Automaton map (a block code),
! 1 J. Milnor asked if the directional entropy function
h v , P ¼ lim m!1 lim t!1 ℋ V Pði,jÞ
t !
ði, jÞ P ðm, v , tÞ is continuous. We have the following answer
(Park 1999).
!
where P m, v , t ¼ fði, jÞ: 0 j ½ty, m j
! !
x
i m þ j xyg and if y ¼ 0, then P m, v , t ¼ Theorem 27 Directional entropy h v is con-
y
!
fði, jÞ : m j m, 0 i ½txg. That is, tinuous on k v k¼ 1 for the action s generated by
! a shift T and a Cellular Automaton map f.
P m, v , t is a parallelogram with width [m,
!
m] along the vector v . Both limsups in the defini- !
!
Sinai (2017) has shown that h v is upper
tion of h v , P are equal to limits. We define !
semi-continuous on k v k¼ 1. In fact, the conti-
! !
h v ¼ supP h v , P . If P is a generating parti- nuity is guaranteed for more general s. Topolog-
! ! !
tion under s, then h v ¼ h v, P since ical directional entropy htop v can be defined
!
X m ¼ V1t¼1 Vði,jÞ P ðm,!
v ,tÞ P
ði,jÞ
↗X . analogously. It is not yet known if htop v is
Directional entropy has the following continuous in this restrictive class of ℤ -actions.
2
2
properties. Let ðX, X , m, sÞ be a subshift of f0, 1gℤ .
As was remarked via the examples mentioned
! ! above, entropy zero systems exhibit diverse
(i) h c v ¼ ch v :
behavior. For the growth rate of orbits of zero
! !
(ii) h T i1 T 2j ¼ h v where v ¼ ði, jÞ ℤ2 entropy systems, several notions, slow entropy
! (Katok and Thouvenot 1997) and entropy dimen-
(iii) h v is not continuous in general for sion (Dou et al. 2019; Jung et al. 2017), have been
!
k v k¼ 1. introduced to understand the complexity. They
intend to measure the intermediate growth rate
We note that which is bigger than polynomial and less than
exponential. Although the motivation is mainly
! ! from the study of bigger group actions, investiga-
(a) Example (1) satisfies that h v ¼ 08 v S1 .
tion of the growth rate of orbits in entropy zero
! !
(b) Example (2) satisfies that h v > 08 v S1 . ℤ-actions is necessary for more complete
understanding.
However, an example with the following prop-
erties is known.
Finitely Observable Invariant
(i) h(s(1,0)) > 0.
Having given the basic properties of entropy, we
(ii) h(s(i, j)) ¼ 0 8 (i, j) 6¼ (1, 0)
would like to make a remark on its characteristic
! !
of finitely observable invariant in ergodic sys-
If there exists a direction v such that h v > tems. In a landmark paper (Ornstein and Weiss
0, then the growth rate of orbits along Rn is at least 2007), Ornstein and Weiss show that all “finitely
in the order of 2n. Directional entropy can be observable” properties of ergodic processes are
regarded as a generalization of the entropy of secretly entropy; indeed, they are continuous
$$n \mapsto S_n(x_1, x_2, \ldots, x_n) \tag{35}$$

converges in Ω. And on a particular process $\mathcal{X}$, say that S converges, if S converges on a.e. $\vec{x}$ in $\mathcal{X}$.

A function $J : \mathcal{C} \to \Omega$ is isomorphism invariant if, whenever the underlying transformations of two processes $\mathcal{X}, \mathcal{X}' \in \mathcal{C}$ are isomorphic, then $J(\mathcal{X}) = J(\mathcal{X}')$. Lastly, say that S "converges to J," if for each $\mathcal{X} \in \mathcal{C}$, scheme S converges to the value $J(\mathcal{X})$.

The work of David Bailey (1976) produced an observation scheme for entropy. The Lempel-Ziv algorithm (Ziv and Lempel 1977) was another entropy observer, with practical application (Shields 1996).

Ornstein and Weiss provided entropy schemes in (Ornstein and Weiss 1993; Ornstein et al. 1990). Their recent paper "Entropy is the only finitely-observable invariant" (Ornstein and Weiss 2007) gives a converse, a uniqueness result.

Recent Progress

In addition to a survey of older results in measure-theoretic entropy and in topological entropy, let us end this entry with a brief discussion of a couple of recent results, chosen from many.

Entropy of Actions of Nonamenable Groups
Consider $(G, \mathcal{G})$, a topological group and its Borel field (σ-algebra). Let $\mathcal{G} \times \mathcal{X}$ be the field on $G \times X$ generated by the two coordinate-subfields. A map

$$\psi : G \times X \to X \tag{36}$$

is measurable if

$$\psi^{-1}(\mathcal{X}) \subset \mathcal{G} \times \mathcal{X}.$$
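To make the idea of an entropy observer from the finitely observable discussion concrete, here is a hedged sketch of a Lempel-Ziv style scheme. It is an illustration, not the scheme of the cited papers: it uses the simpler LZ78 incremental parsing rather than the sliding-window algorithm of Ziv and Lempel (1977), and it assumes the standard estimate c·log2(c)/n, where c is the number of parsed phrases among the first n symbols. The function names are hypothetical.

```python
import math
import random

def lz78_phrase_count(symbols):
    """Number of phrases in the LZ78 incremental parsing of `symbols`."""
    seen = set()
    phrase = ()
    count = 0
    for s in symbols:
        phrase += (s,)
        if phrase not in seen:  # shortest phrase not seen before
            seen.add(phrase)
            count += 1
            phrase = ()
    if phrase:  # a trailing, already-seen phrase counts once
        count += 1
    return count

def lz78_entropy_estimate(symbols):
    """Estimate entropy in bits per symbol from the first len(symbols) outputs."""
    n = len(symbols)
    c = lz78_phrase_count(symbols)
    return c * math.log2(c) / n

rng = random.Random(0)  # fixed seed so the run is reproducible
coin_flips = [rng.randint(0, 1) for _ in range(20000)]
constant = [0] * 20000

print(lz78_entropy_estimate(coin_flips))  # near 1 bit for a fair coin
print(lz78_entropy_estimate(constant))    # near 0 for a constant process
```

The estimate depends only on the observed finite string, which is the sense in which such a scheme observes entropy. Convergence is slow, so for moderate n the fair-coin value is only roughly 1.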
Theorem 30 (L. Bowen) Let G be a finite-rank free group. Then two Bernoulli G-actions are isomorphic IFF they have the same entropy.

The paper introduces an isomorphism invariant, the "f invariant," and shows that, for Bernoulli actions, the f invariant agrees with entropy, that is, with the distropy of the independent generating partition.

It was not long before a new invariant, called sofic entropy, was introduced to extend the result to a much larger class of groups. The sofic entropy is defined via a sofic approximation of a group (Bowen 2012, 2017).

Let G be a countable discrete group. We say G is sofic if there exists $\sigma_n : G \to \mathrm{Symm}(\{0, 1, \ldots, m_n - 1\})$ into permutation groups satisfying

B. Seward (2018) was able to eliminate the "at least 3-states" condition. If G is amenable, the sofic entropy agrees with classical entropy. The sofic entropies, both measure and topological, are known to depend on the sofic approximation, unlike in the case of amenable groups (Bowen 2017, Bowen). However, for Bernoulli shifts of G the sofic entropy is independent of the sofic approximation.

D. Kerr and H. Li (2011) introduced topological sofic entropy on a compact metric space meaning the exponential growth rate of the number of "approximate periodic points (pseudo-periodic points)." This leads them to prove the following variational principle.

4 The notion of sofic group is due to Mikhail Gromov (1999). There is yet no known example of a countable group which is not sofic.
Theorem 33 (Kerr-Li) Let G be a countable nonamenable group acting on $(X, \sigma)$. For a sofic approximation Σ of G,

$$h_{\Sigma, top}(\sigma) = \sup_{\mu} h_{\Sigma}(\sigma, \mu)$$

where μ is an invariant measure on $(X, \sigma)$.

Weak Pinsker Conjecture
Pinsker conjectured that every measure-preserving transformation is a product of a K-automorphism and a 0-entropy transformation (Pinsker 1960). Pesin (1977) showed that some flows on surfaces have the property: Bernoulli × 0-entropy (in fact, a rotation). However, Ornstein's counterexample (Ornstein 1973) shows the conjecture to be false in general. Thouvenot (1977) proposed a weaker version, called the weak Pinsker conjecture: every measure-preserving transformation is relatively Bernoulli over a transformation of arbitrarily small entropy. That is, given any $\epsilon > 0$, $(X, \mathcal{X}, \mu, T)$ is isomorphic to a product Bernoulli × (transformation of entropy $< \epsilon$). Austin (2018) proves the conjecture in the affirmative. In fact, he proves a little more.

Theorem 34 (Austin) If $(X, \mathcal{X}, \mu, T)$ has a factor $(Y, \mathcal{Y}, \nu, S)$ and $h(T) > h(S)$ then for a given $\epsilon > 0$, there exists an extension $(\tilde{Y}, \tilde{\mathcal{Y}}, \tilde{\nu}, \tilde{S})$ of $(Y, \mathcal{Y}, \nu, S)$ such that the following diagram holds:

$$(X, \mathcal{X}, \mu, T) \longrightarrow (\tilde{Y}, \tilde{\mathcal{Y}}, \tilde{\nu}, \tilde{S}) \longrightarrow (Y, \mathcal{Y}, \nu, S)$$

where
(i) $h(\tilde{S}) < h(S) + \epsilon$.
(ii) $(X, \mathcal{X}, \mu, T)$ is isomorphic to Bernoulli × $(\tilde{Y}, \tilde{\mathcal{Y}}, \tilde{\nu}, \tilde{S})$.

Exodos

Ever since the pioneering work of Shannon, and of Kolmogorov and Sinai, entropy has been front and center as a major tool in Ergodic Theory and Topological Dynamics. Simply mentioning all the substantial results in entropy theory would dwarf the length of this encyclopedia entry many times over. And as the recent results mentioned above (cherry-picked out of many) show, Entropy shows no sign of fading away. . .

Bibliography

Adler RL, Marcus B (1979) Topological entropy and equivalence of dynamical systems. Memoirs of the American Mathematical Society, vol 219. American Mathematical Society, Providence
Adler RL, Weiss B (1967) Entropy, a complete metric invariant for automorphisms of the torus. Proc Natl Acad Sci USA 57(6):1573
Adler RL, Konheim AG, McAndrew MH (1965) Topological entropy. Trans Am Math Soc 114(2):309–319
Ambrose W (1941) Representation of ergodic flows. Ann Math 42:723–739
Austin T (2014) Scenery entropy as an invariant of RWRS processes. arXiv preprint arXiv:1405.1468
Austin T (2018) Measure concentration and the weak pinsker property. Publ Math IHÉS 128(1):1–119
Bailey DH (1976) Sequential schemes for classifying and predicting ergodic processes. PhD thesis, Stanford University
Berg KR (1975) Independence and additive entropy. Proc Am Math Soc 51(2):366–370
Bowen R (1971) Entropy for group endomorphisms and homogeneous spaces. Trans Am Math Soc 153:401–414
Bowen R (1973) Topological entropy for noncompact sets. Trans Am Math Soc 184:125–136
Bowen L (2010) A new measure conjugacy invariant for actions of free groups. Ann Math 171(2):1387–1400
Bowen L (2012) Every countably infinite group is almost Ornstein. In: Dynamical systems and group actions, vol 567. American Mathematical Society, Providence, pp 67–78
Bowen L (2017) Examples in the entropy theory of countable group actions. arXiv:1704.06349 [math.DS], pp 1–131
Brin M, Stuck G (2002) Introduction to dynamical systems. Cambridge University Press, Cambridge
Buzzi J, Ruette S (2006) Large entropy implies existence of a maximal entropy measure for interval maps. Discrete Contin Dynam Syst A 14(4):673–688
Clausius R (1864) Abhandlungen über die mechanische Wärmetheorie, vol 1. F. Vieweg, Braunschweig
Clausius R (1867) Abhandlungen über die mechanische Wärmetheorie, vol 2. F. Vieweg, Braunschweig
Connes A, Feldman J, Weiss B (1981) An amenable equivalence relation is generated by a single transformation. Ergodic Theory Dynam Syst 1(4):431–450
Cornfel'd I, Fomin S, Sinai YG (1982) Ergodic theory. Grundlehren der mathematischen Wissenschaften: a
Rudolph D (1990) Fundamentals of measurable dynamics: ergodic theory on Lebesgue spaces. Oxford science publications. Clarendon Press, Oxford
Rudolph DJ, Weiss B (2000) Entropy and mixing for amenable group actions. Ann Math 151(3):1119–1150
Seward B (2018) Bernoulli shifts with bases of equal entropy are isomorphic. arXiv preprint arXiv:1805.08279
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27(3):379–423
Shields PC (1973) The theory of Bernoulli shifts. University of Chicago Press, Chicago
Shields PC (1996) The ergodic theory of discrete sample paths, vol 13. American Mathematical Society, Providence
Sinai YG (1959) On the concept of entropy for a dynamic system. Dokl Akad Nauk SSSR 124(4):768–771
Sinai YG (1962) A weak isomorphism of transformations with invariant measure. Dokl Akad Nauk 147(4):177–207
Sinai YG (2017) Topics in ergodic theory, vol 44. Princeton University Press, Princeton
Thouvenot J-P (1975) Quelques propriétés des systemes dynamiques qui se décomposent en un produit de deux systemes dont l'un est un schéma de Bernoulli. Isr J Math 21(2–3):177–207
Thouvenot J-P (1977) On the stability of the weak pinsker property. Isr J Math 27(2):150–162
Tribus M, McIrvine EC (1971) Energy and information. Sci Am 225(3):179–188
Walters P (1981) An introduction to ergodic theory. Graduate texts in mathematics. Springer, New York
Ward T, Zhang Q (1992) The Abramov-Rokhlin entropy addition formula for amenable group actions. Monatsh Math 114(3):317–329
Young L-S (1982) Dimension, entropy and lyapunov exponents. Ergodic Theory Dynam Syst 2(1):109–124
Ziv J, Lempel A (1977) A universal algorithm for sequential data compression. IEEE Trans Inf Theory 23(3):337–343

Articles, Papers, and Books of Interest
Bowen L. https://warwick.ac.uk/fac/sci/maths/research/events/2018-19/metc/abstracts
Boyle M, Downarowicz T (2004) The entropy theory of symbolic extensions. Invent Math 156(1):119–161
Downarowicz T (2011) Entropy in dynamical systems. Cambridge University Press, Cambridge
Downarowicz T, Serafin J (2003) Possible entropy functions. Isr J Math 135(1):221–250
Greven A, Keller G, Warnecke G (2003) Entropy. Princeton series in applied mathematics. Princeton University Press, Princeton
Hassner M (1980) A non-probabilistic source and channel coding theory. PhD thesis, UCLA
Katok A (2007) Fifty years of entropy in dynamics: 1958–2007. J Modern Dyn 1(4):545–596
Katok A, Sinai YG, Stepin AM (1977) Theory of dynamical systems and general transformation groups with invariant measure. J Sov Math 7(6):974–1065
McMillan B (1953) The basic theorems of information theory. Ann Math Statist 24(2):196–219
Newhouse SE (1989) Continuity properties of entropy. Ann Math 129(1):215–235
Parry W (1969) Entropy and generators in ergodic theory. Mathematics lecture note series. W. A. Benjamin, New York
Wikipedia. http://en.wikipedia.org/wiki/Spectral_radius, http://en.wikipedia.org/wiki/Information_entropy
Isomorphism Theory in Ergodic Theory

Christopher Hoffman
Department of Mathematics, University of Washington, Seattle, WA, USA

Article Outline

Glossary
Definition of the Subject
Introduction
Basic Transformations
Basic Isomorphism Invariants
Basic Tools
Isomorphism of Bernoulli Shifts
Transformations Isomorphic to Bernoulli Shifts
Transformations Not Isomorphic to Bernoulli Shifts
Classifying the Invariant Measures of Algebraic Actions
Finitary Isomorphisms
Flows
Other Equivalence Relations
Non-invertible Transformations
Factors of a Transformation
Actions of Amenable Groups
Future Directions
Bibliography

Glossary

Almost everywhere A property is said to hold almost everywhere (a.e.) if the set on which the property does not hold has measure 0.
Bernoulli shift A Bernoulli shift is a stochastic process such that all outputs of the process are independent.
Conditional measure For any measure space $(X, \mathcal{B}, \mu)$ and σ-algebra $\mathcal{C} \subset \mathcal{B}$ the conditional measure is a $\mathcal{C}$-measurable function g such that $\mu(C) = \int_C g \, d\mu$ for all $C \in \mathcal{C}$.
Coupling of two measure spaces A coupling of two measure spaces $(X, \mu, \mathcal{B})$ and $(Y, \nu, \mathcal{C})$ is a measure γ on $X \times Y$ such that $\gamma(B \times Y) = \mu(B)$ for all $B \in \mathcal{B}$ and $\gamma(X \times C) = \nu(C)$ for all $C \in \mathcal{C}$.
Ergodic measure preserving transformation A measure preserving transformation is ergodic if the only invariant sets ($\mu(A \,\triangle\, T^{-1}(A)) = 0$) have measure 0 or 1.
Ergodic theorem The pointwise ergodic theorem says that for any measure preserving transformation $(X, \mathcal{B}, \mu)$ and T and any $L^1$ function f the time average $\lim_{n\to\infty} \frac{1}{n} \sum_{i=1}^{n} f(T^i(x))$ converges a.e. If the transformation is ergodic then the limit is the space average, $\int f \, d\mu$ a.e.
Geodesic A geodesic on a Riemannian manifold is a distance minimizing path between points.
Horocycle A horocycle is a circle in the hyperbolic disk which intersects the boundary of the disk in exactly one point.
Invariant measure Likewise a measure μ is said to be invariant with respect to (X, T) provided that $\mu(T^{-1}(A)) = \mu(A)$ for all measurable $A \in \mathcal{B}$.
Joining of two measure preserving transformations A joining of two measure preserving transformations (X, T) and (Y, S) is a coupling of X and Y which is invariant under $T \times S$.
Markov shift A Markov shift is a stochastic process such that the conditional distribution of the future outputs ($\{x_n\}_{n>0}$) of the process conditioned on the last output ($x_0$) is the same as the distribution conditioned on all of the past outputs of the process ($\{x_n\}_{n \le 0}$).
Measure preserving transformation A measure preserving transformation consists of a probability space $(X, \mathcal{B}, \mu)$ and a measurable function $T : X \to X$ such that $\mu(T^{-1}(A)) = \mu(A)$ for all $A \in \mathcal{B}$.
Measure theoretic entropy A numerical invariant of measure preserving transformations that measures the growth in complexity of
Our main goal in this article is to consider when two measure preserving transformations are in some sense different presentations of the same underlying object. To make this precise we say two measure preserving maps (X, T) and (Y, S) are isomorphic if there exists a measurable map $f : X \to Y$ such that

1. f is measure preserving,
2. f is invertible almost everywhere and
3. f(T(x)) = S(f(x)) for almost every x.

The main goal of the subject is to construct a collection of invariants of a transformation such that a necessary condition for two transformations to be isomorphic is that the invariant be the same for both transformations. Another goal of the subject is to solve the much more difficult

Bernoulli shifts Some of the most fundamental transformations are the Bernoulli shifts. A probability vector is a vector $\{p_i\}_{i=1}^n$ such that $\sum_{i=1}^n p_i = 1$ and $p_i \ge 0$ for all i. Let $p = \{p_i\}_{i=1}^n$ be a probability vector. The Bernoulli shift corresponding to p has state space $\{1, 2, \ldots, n\}^{\mathbb{Z}}$, the shift operator $T(x)_i = x_{i+1}$. To specify the measure we only need to specify it on cylinder sets

$$A = \{x \in X : x_i = a_i \ \forall\, i \in \{m, \ldots, k\}\}$$

for some $m \le k \in \mathbb{Z}$ and a sequence $a_m, \ldots, a_k \in \{1, \ldots, n\}$. The measure on cylinder sets is defined by

$$\mu\{x \in X : x_i = a_i \text{ for all } i \text{ such that } m \le i \le k\} = \prod_{i=m}^{k} p_{a_i}.$$
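The product formula for cylinder sets of a Bernoulli shift can be made concrete with a few lines of code. This is a minimal sketch with a hypothetical probability vector p and a hypothetical function name `cylinder_measure`; exact fractions are used so that the checks involve no rounding. Since the measure of a cylinder depends only on the word and not on its position m, invariance under the shift reduces to the identity that summing over one extra leading symbol leaves the measure unchanged.

```python
from fractions import Fraction
from itertools import product

def cylinder_measure(p, word):
    """Measure of the cylinder set fixing consecutive coordinates to `word`."""
    result = Fraction(1)
    for a in word:
        result *= p[a]  # independent coordinates: multiply the weights
    return result

# a hypothetical probability vector on the states {1, 2, 3}
p = {1: Fraction(1, 2), 2: Fraction(1, 3), 3: Fraction(1, 6)}

# cylinders of a fixed length partition the space, so the measures sum to 1
total = sum(cylinder_measure(p, w) for w in product(p, repeat=3))

# the shifted cylinder is a disjoint union over the possible extra symbol
word = (2, 1, 3)
shifted = sum(cylinder_measure(p, (a,) + word) for a in p)

print(total, cylinder_measure(p, word), shifted)
```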
mfx1 ¼ a1 j x0 ¼ a0 g
Matrix actions Another natural class of actions
¼ mfx1 ¼ a1 j x0 ¼ a0 , x1 ¼ a1 , x2 ¼ a2 , .. .g is given by the action of matrices on tori. Let M
be an invertible n n integer valued matrix.
for all choices of ai, i 1. Let m ¼ fmðiÞgni¼1 We define TM : [0, 1)n ! [0, 1)n by TM(x)i = M
be a vector such that Mm = m. Then an invari- (x)i mod 1 for all i, 1 i n. It is easy to check
ant measure is defined by setting the that if M is surjective then Lebesgue measure is
measure on cylinder sets to be A = {x : x0 = invariant under TM. TM is a j det (M)j to one
a0, x1 = a1, . . ., xn = an} is given by map. If n = 1 then M is an integer and we refer
mðAÞ ¼ mða0 Þ ni¼1 Mðai1 , ai Þ. to the map as times M.
Shift maps More generally the shift map s is The [T, T1] transformations Let (X, T) be any
the map s : ℕℤ ! ℕℤ where s(x)i = xi+1 for all invertible measure preserving transformation.
x ℕℤ and i ℤ. We also let s designate the Let s be the shift operator on Y = {1, 1}ℤ.
shift map on ℕℕ. For each measure that is The space Y comes equipped with the
invariant under the shift map there is a Bernoulli (1/2, 1/2) product measure n.
corresponding measure defined on ℕℕ that is The [T, T1] transformation is a map on Y X
invariant under the shift map. Let m be an which preserves n m. It is defined by
invariant measure under the shift map. For
any measurable set of A ℕℕ we define A on T, T 1 ðy, xÞ ¼ ðSðyÞ, T y0 ðxÞÞ:
ℕℤ by
Induced transformations Let (X, T) be a mea-
A ¼ f. . . , x1 , x0 , x1 , . . . : x0 , x1 , . . . Ag:
sure preserving transformation and let A X
with 0 < m(A) < 1. The transformation
Then it is easy to check that m defined by
induced by A, (A, TA, mA), is defined as fol-
m A ¼ mðAÞ is invariant. If the original trans- lows. For any x A
formation was a Markov or Bernoulli shift then
refer to the resulting transformations as a one T A ðxÞ ¼ T nðxÞ ðxÞ
sided Markov shifts or one sided Bernoulli
shift respectively. where n(x) = inf {m > 0 : T m(x) A}. For any
Rational maps of the Riemann sphere We say B A we have that mA(B) = m(B)/m(A).
that f(z) = g(z)/h(z) is a rational function of
degree d 2 if both g(z) and h(z) are poly-
nomials with max(deg(g(z)), deg(h(z))) = d. Basic Isomorphism Invariants
Then f induces a natural action on the Riemann
sphere Tf : z ! f(z) which is a d to one map The main purpose of isomorphism theory is to
(counting with multiplicity). In section “Ratio- classify which pairs of measure preserving trans-
nal Maps” we shall see that for every rational formation are isomorphic and which are not
function f there is a canonical measure mf such isomorphic. One of the main ways that we can
that Tf is a measure preserving transformation. show that two measure preserving transformation
Horocycle flows The horocycle flow acts on SL are not isomorphic is using isomorphism invari-
(2, ℝ)/Γ where Γ is a discrete subgroup of SL ants. An isomorphism invariant is a function
(2, ℝ) such that SL(2, ℝ)/Γ has finite Haar f defined on measure preserving transformations
measure. For any g SL(2, ℝ) and t ℝ such that if (X, T ) is isomorphic to (Y, S) then
we define the horocycle flow by f((X, T)) ¼ f((Y, S)).
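The induced transformation defined earlier on this page can be made concrete on a finite toy system. The following is a minimal sketch only: the cyclic map x → x + k mod N with uniform measure, the set A, and the helper name `induced_map` are illustrative assumptions and do not come from the text. The last line checks Kac's lemma, a standard fact not stated above, which for this single-orbit map says that the return times over A sum to N.

```python
from math import gcd

def induced_map(T, A):
    """First-return map T_A and return time n(x) for each x in A.

    Assumes T is a bijection of a finite set, so every point of A returns.
    """
    A = set(A)
    T_A, n = {}, {}
    for x in A:
        y, steps = T(x), 1
        while y not in A:          # iterate until the orbit re-enters A
            y, steps = T(y), steps + 1
        T_A[x], n[x] = y, steps    # T_A(x) = T^{n(x)}(x)
    return T_A, n

N, k = 12, 5                       # hypothetical toy parameters
assert gcd(N, k) == 1              # one orbit, so the map is ergodic
T = lambda x: (x + k) % N
A = {0, 1, 2, 7}

T_A, n = induced_map(T, A)
print(T_A, n, sum(n.values()))
```

The induced map is again a bijection of A, and the normalized measure of a subset B of A is simply |B|/|A|, matching the formula for the induced measure.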
204 Isomorphism Theory in Ergodic Theory
concept to information theory. Consider a process that generates a string of data of length n. The entropy of the process is the smallest number h such that you can condense the data to a string of zeroes and ones of length hn and with high probability you can reconstruct the original data from the string of zeroes and ones. Thus the entropy of a process is the average amount of information transmitted per symbol of the process.

Kolmogorov and Sinai introduced the concept of entropy to ergodic theory in the following way (Kolmogorov 1958, 1959). They defined the entropy of a partition Q to be

$$H(Q) = -\sum_{i=1}^{k} \mu(Q_i) \log_2 \mu(Q_i).$$

The measure-theoretic entropy of a dynamical system (X, T) with respect to a partition $Q : X \to \{1, \ldots, k\}$ is then defined as

$$h(X, T, Q) = \lim_{n \to \infty} \frac{1}{n} H\left(\bigvee_{i=0}^{n-1} T^{-i} Q\right).$$

Finally, the measure-theoretic entropy of a dynamical system (X, T) is defined as

$$h((X, T)) = \sup_{Q} h(X, T, Q)$$

where the supremum is taken over all finite measurable partitions. A theorem of Sinai showed that if Q is a generator of (X, T) then h(T) = h(T, Q) (Sinaĭ 1959).

This shows that for every measure preserving transformation (X, T) there is an associated entropy $h(T) \in [0, \infty]$. It is easy to show from the definition that entropy is an isomorphism invariant.

We say that (Y, S) is a factor of (X, T) if there exists a map $f : X \to Y$ such that

1. f is measure preserving and
2. f(T(x)) = S(f(x)) for almost every x.

Each factor (Y, S) can be associated with $f^{-1}(\mathcal{C})$, which is an invariant sub σ-algebra of ℬ. We say that (Y, S) is trivial if Y consists of only one point. We say that a transformation (X, T) has completely positive entropy if every non-trivial factor of (X, T) has positive entropy.

Isomorphism of Bernoulli Shifts

Kolmogorov–Sinai
A long standing open question was for which p and q the shifts Bernoulli$_p$ and Bernoulli$_q$ are isomorphic. In particular, are the Bernoulli 2 shift and the Bernoulli 3 shift isomorphic? Both of these transformations have completely positive entropy and all other isomorphism invariants which were known at the time are the same for the two transformations. The first application of the Kolmogorov–Sinai entropy was to show that the answer to this question is no.

Fix a probability vector p. The transformation Bernoulli$_p$ has $Q_p : x \to x_0$ as a generating partition. By Sinai’s theorem

$$H(\mathrm{Bernoulli}_p) = H(Q_p) = -\sum_{i=1}^{n} p_i \log_2(p_i).$$

Thus the Bernoulli 2 shift (with entropy 1) is not isomorphic to the Bernoulli 3 shift (with entropy $\log_2(3)$).

Sinai also made significant progress toward showing that Bernoulli shifts with the same entropy are isomorphic by proving the following theorem.

Theorem 1 (Sinaĭ 1962) If (X, T) is a measure preserving system of entropy h and (Y, S) is a Bernoulli shift of entropy $h' \le h$ then (Y, S) is a factor of (X, T).

This theorem implies that if p and q are probability vectors and H(p) = H(q) then Bernoulli$_p$ is a factor of Bernoulli$_q$ and Bernoulli$_q$ is a factor of Bernoulli$_p$. Thus we say that Bernoulli$_p$ and Bernoulli$_q$ are weakly isomorphic.

Explicit Isomorphisms
The other early progress on proving that Bernoulli shifts with the same entropy are isomorphic came
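The entropy comparison between the Bernoulli 2 shift and the Bernoulli 3 shift made on this page can be reproduced numerically. This is a minimal sketch, assuming base-2 logarithms as in the formula for the entropy of Bernoulli$_p$; the function name `entropy_bits` is an illustrative choice, not notation from the text.

```python
import math

def entropy_bits(p):
    """Entropy -sum p_i log2 p_i of a probability vector, in bits."""
    assert abs(sum(p) - 1.0) < 1e-12, "p must sum to 1"
    return -sum(pi * math.log2(pi) for pi in p if pi > 0)

h2 = entropy_bits([1/2, 1/2])        # Bernoulli 2 shift
h3 = entropy_bits([1/3, 1/3, 1/3])   # Bernoulli 3 shift
print(h2, h3)
```

Since entropy is an isomorphism invariant and the two values differ, the two shifts cannot be isomorphic.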
ϵ > 0 the set of joinings γ such that $P \subset_{\gamma,\epsilon} \mathcal{C}$ is an open and dense set. Thus by the Baire category theorem there exists γ such that $P \subset_{\gamma} \mathcal{C}$. This reproves Theorem 1.

Moreover if (X, T) and (Y, S) are finitely determined, h((X, T)) = h((Y, S)) and P and Q are generating partitions of X and Y then by the Baire category theorem there exists γ such that $P \subset_{\gamma} \mathcal{C}$ and $Q \subset_{\gamma}$ ℬ. Then the map f that sends $y \to x$ where $P(T^i x) = P(S^i y)$ for all i is an isomorphism.

Properties of Bernoullis
Now we define the very weak Bernoulli property which is the most effective property for showing that a measure preserving transformation is isomorphic to a Bernoulli shift.

Given X and a partition P define the past of x by

$$P_{\mathrm{past}}(x) = \{x' : T^{-i}P(x') = T^{-i}P(x)\ \ \forall i \in ℕ\}$$

1. (X, T) is finitely determined,
2. (X, T) is very weak Bernoulli and
3. (X, T) is isomorphic to a Bernoulli shift.

Using the fact that a transformation being finitely determined or very weak Bernoulli is equivalent to it being isomorphic to a Bernoulli shift we can prove the following theorem.

Theorem 5 (Ornstein 1970a)
1. If $(X, T^n)$ is isomorphic to a Bernoulli shift then (X, T) is isomorphic to a Bernoulli shift.
2. If (X, T) is a factor of a Bernoulli shift then (X, T) is isomorphic to a Bernoulli shift.
3. If (X, T) is isomorphic to a Bernoulli shift then there exists a measure preserving transformation (Y, S) such that $(Y, S^n)$ is isomorphic to (X, T).
proof of many of the consequences listed in section “Properties of Bernoullis”. However if one wants to show a particular transformation is isomorphic to a Bernoulli shift then the very weak Bernoulli property is more useful. There have been many classes of transformations that have been proven to be isomorphic to a Bernoulli shift. Here we mention two.

The first class is the Markov chains. Friedman and Ornstein proved that if a Markov chain is mixing then it is isomorphic to a Bernoulli shift (Friedman and Ornstein 1970). The second is the automorphisms of $[0, 1)^n$. Let M be any $n \times n$ matrix with integer coefficients and $|\det(M)| = 1$. If none of the eigenvalues $\{\lambda_i\}_{i=1}^{n}$ of M are roots of unity then Katznelson proved that $T_M$ is isomorphic to the Bernoulli shift with entropy $\sum_{i=1}^{n} \max(0, \log|\lambda_i|)$ (Katznelson 1971).

entropy it is natural to ask if (X, T) is not isomorphic to a Bernoulli shift then is there any reasonable condition we can put on (Y, S) that implies the two transformations are isomorphic. For example if $(X, T^2)$ and $(Y, S^2)$ are completely positive entropy transformations which are isomorphic does that necessarily imply that (X, T) and (Y, S) are isomorphic? The answer turns out to be no (Rudolph 1976). We could also ask if (X, T) and (Y, S) are completely positive entropy transformations which are weakly isomorphic does that imply that (X, T) and (Y, S) are isomorphic? Again the answer is no (Hoffman 1999a). The key insight to answering questions like this is due to Rudolph who showed that such questions about the isomorphism of transformations can be reduced to questions about conjugacy of permutations.
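As a worked instance of Katznelson's entropy formula for toral automorphisms, the sketch below evaluates the sum of max(0, log|λ|) over the eigenvalues of a 2 × 2 integer matrix with determinant ±1. It is a minimal illustration under stated assumptions: logarithms are taken base 2 to match the entropy units used for Bernoulli shifts elsewhere in the article, the function name `toral_entropy_2x2` is hypothetical, and the two test matrices are standard examples chosen here, not taken from the text.

```python
import cmath
import math

def toral_entropy_2x2(a, b, c, d):
    """Sum of max(0, log2|lambda|) over the eigenvalues of [[a, b], [c, d]]."""
    tr, det = a + d, a * d - b * c
    assert abs(det) == 1, "the formula is stated for |det M| = 1"
    disc = cmath.sqrt(tr * tr - 4 * det)     # may be complex
    eigs = [(tr + disc) / 2, (tr - disc) / 2]
    return sum(max(0.0, math.log2(abs(lam))) for lam in eigs)

# Hyperbolic example [[2, 1], [1, 1]]: eigenvalues (3 ± sqrt(5)) / 2.
h_hyperbolic = toral_entropy_2x2(2, 1, 1, 1)

# Rotation by a quarter turn: eigenvalues ±i are roots of unity, so the
# hypothesis of the theorem fails and the sum is 0.
h_rotation = toral_entropy_2x2(0, -1, 1, 0)
print(h_hyperbolic, h_rotation)
```

Only the eigenvalue of modulus greater than one contributes, which is why the first value equals log2((3 + √5)/2).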
isomorphic to $T_{\pi_2}$ if and only if the permutations π1 and π2 are conjugate.

There are two permutations on two elements, the flip π1 = (12) and the identity π2 = (1)(2). For both permutations, the square of the permutation is the identity. Thus there are two distinct permutations whose square is the same. Rudolph showed that this fact can be used to generate two transformations which are mixing that are not isomorphic but their squares are isomorphic. The following theorem gives more examples of the power of this technique.

Theorem 9 (Rudolph 1979)
1. There exist measure preserving transformations (X, T) and (Y, S) which are weakly isomorphic but not isomorphic.
2. There exist measure preserving transformations (X, T) and (Y, S) which are not isomorphic but $(X, T^k, \mu)$ is isomorphic to $(Y, S^k, \nu)$ for every k > 1, and
3. There exists a mixing transformation with no non trivial factors.

If (X, T) has minimal self joinings then it has zero entropy. However Hoffman constructed a transformation with completely positive entropy that shares many of the properties of transformations with minimal self joinings listed above.

transformations have little inherent interest outside of their ergodic theory properties. This led many people to search for a “natural” example of such a transformation. The most natural examples are the $[T, T^{-1}]$ transformation and many other transformations derived from it. It is easy to show that the $[T, T^{-1}]$ transformation has completely positive entropy (Meilijson 1974). Kalikow proved that for many T the corresponding $[T, T^{-1}]$ transformation is not isomorphic to a Bernoulli shift.

Theorem 11 (Kalikow 1982) If h(T) > 0 then the $[T, T^{-1}]$ transformation is not isomorphic to a Bernoulli shift.

The basic idea of Kalikow’s proof has been used by many others. Katok and Rudolph used the proof to construct smooth measure preserving transformations on infinitely differentiable manifolds which have completely positive entropy but are not isomorphic to Bernoulli shifts (Katok 1980; Rudolph 1988). Den Hollander and Steif did a thorough study of the ergodic theory properties of $[T, T^{-1}]$ transformations where T is simple random walk on a wide family of graphs (Den Hollander and Steif 1997).

Classifying the Invariant Measures of Algebraic Actions
Conjecture 1 (Furstenberg 1967) The only nonatomic measure on [0, 1) which is invariant under times 2 and times 3 is Lebesgue measure.

Rudolph improved on the work of Lyons (1988) to provide the following partial answer to this conjecture.

Theorem 12 (Rudolph 1990) The only measure on [0, 1) which is invariant under multiplication by 2 and by 3 and has positive entropy under multiplication by 2 is Lebesgue measure.

Johnson then proved that any relatively prime p and q can be substituted for 2 and 3 in the theorem above.

This problem can be generalized to higher dimensions by studying the actions of commuting integer matrices of determinant greater than one on tori. Katok and Spatzier (1996) and Einsiedler and Lindenstrauss (2003) obtained results similar to Rudolph’s for actions of commuting matrices.

Keane and Smorodinsky proved the following strengthening of Ornstein’s isomorphism theorem.

Theorem 13 (Keane and Smorodinsky 1979) If p and q are probability vectors with H(p) = H(q) then the Bernoulli shifts Bernoulli$_p$ and Bernoulli$_q$ are isomorphic and there exists an isomorphism f such that f and $f^{-1}$ are both finitary.

The nicest that we could hope f to be is if both f and $f^{-1}$ are finitary and have finite expected coding length. Schmidt proved that this happens only in the trivial case that p and q are rearrangements of each other.

Theorem 14 (Schmidt 1984) If p and q are probability vectors and the Bernoulli shifts Bernoulli$_p$ and Bernoulli$_q$ are isomorphic and there exists an isomorphism f such that f and $f^{-1}$ are both finitary and have finite expected coding time then p is a rearrangement of q.
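The gap between Theorem 13 and Theorem 14 shows up already for small probability vectors: two vectors can have equal entropy without being rearrangements of each other. The sketch below checks this for one pair. The particular vectors are a classical example usually attributed to Meshalkin and are an assumption of this illustration, as are the helper names `entropy_bits` and `is_rearrangement`.

```python
import math
from fractions import Fraction

def entropy_bits(p):
    """Entropy -sum p_i log2 p_i of a probability vector, in bits."""
    return -sum(float(pi) * math.log2(float(pi)) for pi in p if pi > 0)

def is_rearrangement(p, q):
    """True when q is a permutation of the entries of p."""
    return sorted(p) == sorted(q)

p = [Fraction(1, 4)] * 4
q = [Fraction(1, 2)] + [Fraction(1, 8)] * 4

same_entropy = math.isclose(entropy_bits(p), entropy_bits(q))
rearranged = is_rearrangement(p, q)
print(entropy_bits(p), entropy_bits(q), rearranged)
```

For this pair the entropies agree, so Theorem 13 gives a finitary isomorphism, while Theorem 14 rules out one whose coding times in both directions have finite expectation.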
We say that m(x) has finite t-th moment if $\int m(x)^t \, d\mu < \infty$ and that f has finite expected coding length if the first moment of m(x) is finite.

For any flow $(X, \{T_t\}_{t \in ℝ})$ and any cross section C we define the return time map $R : C \to C$ for C as follows. For any $x \in C$ define $t(x) = \inf\{t > 0 : T_t(x) \in C\}$,
then set $R(x) = T_{t(x)}(x)$. There is a standard method to project the probability measure $\mu$ on X to an invariant probability measure $\mu_C$ on C as well as the σ-algebra ℬ on X to a σ-algebra ℬ$_C$ on C such that R is a measure preserving transformation of $(C, \mu_C,$ ℬ$_C)$.

First we show that there is a natural analog of Bernoulli shifts for flows.

Theorem 16 (Ornstein 1970b) There exists a flow $(X, \{T_t\}_{t \in ℝ})$ such that for every t > 0 the map $(X, T_t)$ is isomorphic to a Bernoulli shift. Moreover for any $h \in (0, \infty]$ there exists $(X, \{T_t\}_{t \in ℝ})$ such that $h(T_1) = h$.

We say that such a flow $(X, \{T_t\}_{t \in ℝ}, \mu,$ ℬ$)$ is a Bernoulli flow.

This next version of Ornstein’s isomorphism theorem shows that up to isomorphism and a change in time (considering the flow $\{T_{ct}\}$ instead of $\{T_t\}$) there are only two Bernoulli flows, one with positive but finite entropy and one with infinite entropy.

Theorem 17 (Ornstein 1970b) If $(X, \{T_t\}_{t \in ℝ})$ and $(Y, \{S_t\}_{t \in ℝ})$ are Bernoulli flows and $h(T_1) = h(S_1)$ then they are isomorphic.

As in the case of actions of ℤ there are many natural examples of flows that are isomorphic to the Bernoulli flow. The first is for geodesic flows. In the 1930s Hopf proved that geodesic flows on compact surfaces of constant negative curvature are ergodic (Hopf 1971). Ornstein and Weiss extended Hopf's proof to show that the geodesic flow is also Bernoulli (Ornstein and Weiss 1973).

The second class of flows comes from billiards on a square table with one circular bumper. The state space X consists of all positions and velocities for a fixed speed. The flow $T_t$ is frictionless movement for time t with elastic collisions. This flow is also isomorphic to the Bernoulli flow (Gallavotti and Ornstein 1974).

In this section we will discuss a number of equivalence relations between transformations that are weaker than isomorphism. All of these equivalence relations have a theory that is parallel to Ornstein’s theory.

Kakutani Equivalence
We say that two transformations (X, T) and (Y, S) are Kakutani equivalent if there exist subsets $A \subset X$ and $B \subset Y$ such that $(A, T_A, \mu_A)$ and $(B, S_B, \nu_B)$ are isomorphic. This is equivalent to the existence of a flow with cross sections C and C′ such that the return time maps of C and C′ are isomorphic to (X, T) and (Y, S) respectively.

Using the properties of entropy of the induced map we have that if (X, T) and (Y, S) are Kakutani equivalent then either h((X, T)) = h((Y, S)) = 0, or 0 < h((X, T)), h((Y, S)) < ∞, or h((X, T)) = h((Y, S)) = ∞.

In general the answer to the question of which pairs of measure preserving transformations are isomorphic is quite poorly understood. But if one of the transformations is a Bernoulli shift then Ornstein’s theory gives a fairly complete answer to the question. A similar situation exists for Kakutani equivalence. In general the answer to the question of which pairs of measure preserving transformations are Kakutani equivalent is also quite poorly understood. But the more specialized question of which transformations are Kakutani equivalent to a Bernoulli shift has a more satisfactory answer.

Feldman constructed a transformation (X, T) which has completely positive entropy but is not Kakutani equivalent to a Bernoulli shift. Ornstein, Rudolph and Weiss extended Feldman’s work to construct a complete theory of the transformations that are Kakutani equivalent to a Bernoulli shift (Ornstein et al. 1982) for positive entropy transformations and a theory of the transformations that are Kakutani equivalent to an irrational rotation (Ornstein et al. 1982) for zero entropy transformations. (The zero entropy version of this theorem had been developed independently (and earlier) by Katok (1975).)
They defined two classes of transformations called loosely Bernoulli and finitely fixed. The definitions of these properties are the same as the definitions of very weak Bernoulli and finitely determined except that the $\bar{d}$ metric is replaced by the $\bar{f}$ metric. For $x, y \in ℕ^ℤ$ we define

$$\bar{f}_n(x, y) = 1 - \frac{k}{n}$$

where k is the largest number such that there exist sequences $1 \le i_1 < i_2 < \ldots < i_k \le n$ and $1 \le j_1 < j_2 < \ldots < j_k \le n$ with $x_{i_l} = y_{j_l}$ for all l, $1 \le l \le k$. (In computer science this metric is commonly referred to as the edit distance.) Note that $\bar{d}_n(x, y) \ge \bar{f}_n(x, y)$.

They proved the following analog of Theorem 5.

Theorem 18 For transformations (X, T) with h((X, T)) > 0 the following conditions are equivalent:

1. (X, T) is finitely fixed,
2. (X, T) is loosely Bernoulli,
3. (X, T) is Kakutani equivalent to a Bernoulli shift and
4. There exists a Bernoulli flow $(Y, \{F_t\}_{t \in ℝ})$ and a cross section C such that the return time map for C is isomorphic to (X, T).

Restricted Orbit Equivalence
Using the $\bar{d}$ metric we got a theory of which transformations are isomorphic to a Bernoulli shift. Using the $\bar{f}$ metric we got a strikingly similar theory of which transformations are Kakutani equivalent to a Bernoulli shift. Rudolph showed that it is possible to replace the $\bar{d}$ metric (or the $\bar{f}$ metric) with a wide number of other metrics and produce parallel theories for other equivalence relations. For instance, for each of these theories we get a version of Theorem 5. This collection of theories is called restricted orbit equivalence (Rudolph 1985).

to be quite different from the same question for invertible transformations. In one sense it is easier because of an additional isomorphism invariant. For any measure preserving transformation (X, T) the probability measure $\mu|_{T^{-1}(x)}$ on $T^{-1}(x)$ is defined for almost every $x \in X$. (If (X, T) is invertible then this measure is trivial as $|\{T^{-1}(x)\}| = 1$ and $\mu|_{T^{-1}(x)}(T^{-1}(x)) = 1$ for almost every x.) It is easy to check that if f is an isomorphism from (X, T) to (Y, S) then for almost every x and $x' \in T^{-1}(x)$ we have

$$\mu|_{T^{-1}(x)}(x') = \nu|_{S^{-1}(f(x))}(f(x')).$$

From this we can easily see that if $p = \{p_i\}_{i=1}^{n}$ and $q = \{q_i\}_{i=1}^{m}$ are probability vectors then the corresponding one sided Bernoulli shifts are isomorphic only if m = n and there is a permutation $\pi \in S_n$ such that $p_{\pi(i)} = q_i$ for all i. (In this case we say p is a rearrangement of q.) If p is a rearrangement of q then it is easy to construct an isomorphism between the corresponding Bernoulli shifts. Thus the analog of Ornstein’s theorem for Bernoulli endomorphisms is trivial. However we will see that there still is an analogous theory classifying the class of endomorphisms that are isomorphic to Bernoulli endomorphisms.

We say that an endomorphism is uniformly d to 1 if for almost every x we have that $|\{T^{-1}(x)\}| = d$ and $\mu|_{T^{-1}(x)}(y) = 1/d$ for all $y \in T^{-1}(x)$. Hoffman and Rudolph defined two classes of noninvertible transformations called tree very weak Bernoulli and tree finitely determined and proved the following theorem.

Theorem 19 The following three conditions are equivalent for uniformly d to 1 endomorphisms.

1. (X, T) is tree very weak Bernoulli,
2. (X, T) is tree finitely determined, and
3. (X, T) is isomorphic to the one sided Bernoulli d shift.
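The relation between the two metrics on this page can be seen on short words. The sketch below computes the f-bar distance as 1 − k/n, where k is the length of a longest common subsequence, and compares it with the d-bar distance. The definition of d-bar as the fraction of positions where the two words differ is the standard one and is assumed here, since it is not restated on this page; the function names and the sample words are illustrative.

```python
def d_bar(x, y):
    """Fraction of positions at which two equal-length words differ."""
    assert len(x) == len(y)
    return sum(a != b for a, b in zip(x, y)) / len(x)

def f_bar(x, y):
    """1 - k/n, where k is the length of a longest common subsequence."""
    assert len(x) == len(y)
    n = len(x)
    prev = [0] * (n + 1)               # LCS lengths for the previous row
    for a in x:
        cur = [0]
        for j, b in enumerate(y, start=1):
            if a == b:
                cur.append(prev[j - 1] + 1)
            else:
                cur.append(max(prev[j], cur[j - 1]))
        prev = cur
    return 1 - prev[n] / n

x = "0101010101"
y = "1010101010"                       # x shifted by one symbol
print(d_bar(x, y), f_bar(x, y))
```

The two words disagree in every position, yet deleting one symbol from each aligns them, so the f-bar distance is small while the d-bar distance is maximal. This insensitivity to small time shifts is what makes the f-bar metric suited to Kakutani equivalence.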
tree finitely determined and if and only if it is tree very weak Bernoulli (Jong 2003).

Markov Shifts
We saw that mixing Markov chains are isomorphic if they have the same entropy. As we have seen there are additional isomorphism invariants for noninvertible transformations. Ashley, Marcus and Tuncel managed to classify all one sided mixing Markov chains up to isomorphism (Ashley et al. 1997).

Rational Maps
Rational maps are the main object of study in complex dynamics. For every rational function f(z) = p(z)/q(z) there is a nonempty compact set $J_f$ which is called the Julia set. Roughly speaking this is the set of points for which every neighborhood acts “chaotically” under repeated iterations of f.

In order to consider rational maps as measure preserving transformations we need to specify an invariant measure. The following theorem of Gromov shows that for every rational map there is one canonical measure to consider.

Theorem 20 (Gromov 2003) For every rational function f of degree d there exists a unique invariant measure $\mu_f$ of maximal entropy. We have that $h(\mu_f) = \log_2 d$ and $\mu_f(J_f) = 1$.

The properties of this measure were studied by Freire, Lopes and Mañé (1983). Mañé used that analysis to prove the following theorem.

Theorem 21 (Mañé 1985) For every rational function f of degree d there exists n such that $(ℂ, f^n, \mu_f)$ (where $f^n(z) = f(f(f \ldots (z)))$ is composition) is isomorphic to the one sided Bernoulli $d^n$ shift.

Heicklen and Hoffman used the tree very weak Bernoulli condition to show that we can always take n to be one.

Theorem 22 (Heicklen and Hoffman 2002) For every rational function f of degree $d \ge 2$ the corresponding map $((ℂ, \mu_f,$ ℬ$), T_f)$ is isomorphic to the one sided Bernoulli d shift.

Differences with Ornstein’s Theory
Unlike Kakutani equivalence and restricted orbit equivalence which are very close parallels to Ornstein’s theory, the theory of which endomorphisms are isomorphic to a Bernoulli endomorphism contains some significant differences. One of the principal results of Ornstein’s isomorphism theory is that if (X, T) is an invertible transformation and $(X, T^2)$ is isomorphic to a Bernoulli shift then (X, T) is also isomorphic to a Bernoulli shift. There is no corresponding result for noninvertible transformations.

Theorem 23 (Hoffman 2004) There is a uniformly two to one endomorphism (X, T) which is not isomorphic to the one sided Bernoulli 2 shift but $(X, T^2)$ is isomorphic to the one sided Bernoulli 4 shift.

Factors of a Transformation

In this section we study the relationship between a transformation and its factors. There is a natural way to associate a factor of (X, T) with a sub σ-algebra of ℬ. Let (Y, S) be a factor of (X, T) with factor map $f : X \to Y$. Then the σ-algebra associated with (Y, S) is ℬ$_Y = f^{-1}(\mathcal{C})$. Thus the study of factors of a transformation is the study of its sub σ-algebras.

Almost every property that we have discussed above has an analog in the study of factors of a transformation. We give three such examples. We say that two factors $\mathcal{C}$ and $\mathcal{D}$ of (X, T) are relatively isomorphic if there exists an isomorphism $\psi : X \to X$ of (X, T) with itself such that $\psi(\mathcal{C}) = \mathcal{D}$. We say that (X, T) has relatively completely positive entropy with respect to $\mathcal{C}$ if every factor $\mathcal{D}$ which contains $\mathcal{C}$ has $h(\mathcal{D}) > h(\mathcal{C})$. We say that $\mathcal{C}$ is relatively Bernoulli if there exists a second factor $\mathcal{D}$ which is independent of $\mathcal{C}$ and ℬ $= \mathcal{C} \vee \mathcal{D}$.

Thouvenot defined properties of factors called relatively very weak Bernoulli and relatively finitely determined. Then he proved an analog
of Theorem 3. This says that a factor being relatively Bernoulli is equivalent to it being relatively finitely determined (and also equivalent to it being relatively very weak Bernoulli).

The Pinsker algebra is the maximal σ-algebra $\mathcal{P}$ such that $h(\mathcal{P}) = 0$. The Pinsker conjecture was that for every measure preserving transformation (X, T) there exists a factor $\mathcal{C}$ such that

1. $\mathcal{C}$ is independent of the Pinsker algebra $\mathcal{P}$
2. ℬ $= \mathcal{C} \vee \mathcal{P}$ and
3. $(X, \mathcal{C})$ has completely positive entropy.

Ornstein found a counterexample to the Pinsker conjecture (Ornstein 1973c). After Thouvenot developed the relative isomorphism theory he came up with the following question which is referred to as the weak Pinsker conjecture.

Conjecture 2 For every measure preserving transformation (X, T) and every ϵ > 0 there exist invariant σ-algebras $\mathcal{C}, \mathcal{D} \subset$ ℬ such that

1. $\mathcal{C}$ is independent of $\mathcal{D}$
2. ℬ $= \mathcal{C} \vee \mathcal{D}$
3. $(X, T, \mu, \mathcal{D})$ is isomorphic to a Bernoulli shift
4. $h((X, T, \mu, \mathcal{C})) < \epsilon$.

results discussed above we can ask what is the largest class of groups such that an analogous result is true. It turns out that for most of the properties described above the right class of groups is discrete amenable groups.

A Følner sequence $F_n$ in a group G is a sequence of finite subsets $F_n$ of G such that for all $g \in G$ we have that

$$\lim_{n \to \infty} \frac{|g(F_n) \,\triangle\, F_n|}{|F_n|} = 0.$$

A countable group is amenable if and only if it has a Følner sequence.

For nonamenable groups it is much more difficult to generalize Birkhoff’s ergodic theorem (Nevo 1994; Nevo and Stein 1994). Lindenstrauss proved that for every discrete amenable group there is an analog of the ergodic theorem (Lindenstrauss 2001). For every amenable group G and every probability vector p we can define a Bernoulli action of G. There are also analogs of Rokhlin’s lemma and the Shannon–McMillan–Breiman theorem for actions of all discrete amenable groups (Krieger 1970; Ornstein and Weiss 1983, 1987). Thus we have all of the ingredients to prove a version of Ornstein’s isomorphism theorem.

Theorem 24 If p and q are probability vectors and H(p) = H(q) then the Bernoulli actions of G corresponding to p and q are isomorphic.
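The Følner condition, in its standard form $|gF_n \triangle F_n| / |F_n| \to 0$, can be checked by hand for the group ℤ. The sketch below is a minimal illustration under the assumption that $F_n = \{0, 1, \ldots, n-1\}$ and that g acts by translation; the function name `folner_ratio` is hypothetical. For this choice the symmetric difference has exactly 2|g| elements once n ≥ |g|, so the ratio is 2|g|/n.

```python
def folner_ratio(g, n):
    """|(g + F_n) symmetric-difference F_n| / |F_n| for F_n = {0, ..., n-1} in Z."""
    F = set(range(n))
    gF = {g + x for x in F}            # translate of F_n by g
    return len(gF ^ F) / len(F)

ratios = [folner_ratio(3, n) for n in (10, 100, 1000)]
print(ratios)
```

The ratios shrink like 1/n for every fixed g, so the intervals form a Følner sequence and ℤ is amenable, consistent with ℤ actions being covered by the theory described above.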
which is a Markov random field and is mixing but has zero entropy (Ledrappier 1978). Even more surprising, even though it is mixing, it is not mixing of all orders. The existence of a ℤ action which is mixing but not mixing of all orders is one of the longest standing open questions in ergodic theory (Halmos 1950).

Even if we try to strengthen the hypothesis of Friedman and Ornstein’s theorem to assume that the Markov random field has completely positive entropy we will not succeed as there exists a Markov random field which has completely positive entropy but is not isomorphic to a Bernoulli shift (Hoffman 1999b).

Future Directions

In the future we can expect to see progress of isomorphism theory in a variety of different directions. One possible direction for future research is to better understand the properties of finitary isomorphisms between various transformations and Bernoulli shifts described in section “Finitary Isomorphisms”. Another possible direction would be to find a theory of equivalence relations for Bernoulli endomorphisms analogous to the one for invertible Bernoulli transformations described in section “Other Equivalence Relations”.

As the subject matures the focus of research in isomorphism theory will likely shift to connections to other fields. Already there are deep connections between isomorphism theory and both number theory and statistical physics. Finally one hopes to see progress made on the two dominant outstanding conjectures in the field: Thouvenot’s weak Pinsker conjecture (Conjecture 2) and Furstenberg’s conjecture (Conjecture 1) about measures on the circle invariant under both the times 2 and times 3 maps. Progress on either of these conjectures would invariably lead the field in exciting new directions.

Bibliography

Ashley J, Marcus B, Tuncel S (1997) The classification of one-sided Markov chains. Ergod Theory Dynam Syst 17(2):269–295
Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci U S A 17:656–660
Breiman L (1957) The individual ergodic theorem of information theory. Ann Math Stat 28:809–811
Den Hollander F, Steif J (1997) Mixing properties of the generalized $T,T^{-1}$-process. J Anal Math 72:165–202
Einsiedler M, Lindenstrauss E (2003) Rigidity properties of $Z^d$-actions on tori and solenoids. Electron Res Announc Am Math Soc 9:99–110
Einsiedler M, Katok A, Lindenstrauss E (2006) Invariant measures and the set of exceptions to Littlewood’s conjecture. Ann Math (2) 164(2):513–560
Feldman J (1976) New K-automorphisms and a problem of Kakutani. Isr J Math 24(1):16–38
Freire A, Lopes A, Mañé R (1983) An invariant measure for rational maps. Bol Soc Brasil Mat 14(1):45–62
Friedman NA, Ornstein DS (1970) On isomorphism of weak Bernoulli transformations. Adv Math 5:365–394
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Syst Theory 1:1–49
Gallavotti G, Ornstein DS (1974) Billiards and Bernoulli schemes. Commun Math Phys 38:83–101
Gromov M (2003) On the entropy of holomorphic maps. Enseign Math (2) 49(3–4):217–235
Halmos PR (1950) Measure theory. D Van Nostrand, New York
Halmos PR (1960) Lectures on ergodic theory. Chelsea Publishing, New York
Harvey N, Peres Y. An invariant of finitary codes with finite expected square root coding length. Ergod Theory Dynam Syst, to appear
Heicklen D (1998) Bernoullis are standard when entropy is not an obstruction. Isr J Math 107:141–155
Heicklen D, Hoffman C (2002) Rational maps are d-adic Bernoulli. Ann Math (2) 156(1):103–114
Hoffman C (1999a) A K counterexample machine. Trans Am Math Soc 351(10):4263–4280
Hoffman C (1999b) A Markov random field which is K but not Bernoulli. Isr J Math 112:249–269
Hoffman C (2003) The scenery factor of the $[T,T^{-1}]$ transformation is not loosely Bernoulli. Proc Am Math Soc 131(12):3731–3735
Hoffman C (2004) An endomorphism whose square is Bernoulli. Ergod Theory Dynam Syst 24(2):477–494
Hoffman C, Rudolph D (2002) Uniform endomorphisms which are isomorphic to a Bernoulli shift. Ann Math (2) 156(1):79–101
Hopf E (1971) Ergodic theory and the geodesic flow on surfaces of constant negative curvature. Bull Am Math Soc 77:863–877
Jong P (2003) On the isomorphism problem of p-endomorphisms. PhD thesis, University of Toronto
Kalikow SA (1982) $T,T^{-1}$ transformation is not loosely Bernoulli. Ann Math (2) 115(2):393–409
Kammeyer JW, Rudolph DJ (2002) Restricted orbit equivalence for actions of discrete amenable groups. Cambridge tracts in mathematics, vol 146. Cambridge University Press, Cambridge
Katok AB (1975) Time change, monotone equivalence, and standard dynamical systems. Dokl Akad Nauk SSSR 223(4):789–792. in Russian
Katok A (1980) Smooth non-Bernoulli K-automorphisms. Invent Math 61(3):291–299
Katok A, Spatzier RJ (1996) Invariant measures for higher-rank hyperbolic abelian actions. Ergod Theory Dynam Syst 16(4):751–778
Katznelson Y (1971) Ergodic automorphisms of $T^n$ are Bernoulli shifts. Isr J Math 10:186–195
Keane M, Smorodinsky M (1979) Bernoulli schemes of the same entropy are finitarily isomorphic. Ann Math (2) 109(2):397–406
Kolmogorov AN (1958) A new metric invariant of transient dynamical systems and automorphisms in Lebesgue spaces. Dokl Akad Nauk SSSR (NS) 119:861–864. in Russian
Kolmogorov AN (1959) Entropy per unit time as a metric invariant of automorphisms. Dokl Akad Nauk SSSR 124:754–755. in Russian
Krieger W (1970) On entropy and generators of measure-preserving transformations. Trans Am Math Soc 149:453–464
Ledrappier F (1978) Un champ markovien peut être d’entropie nulle et mélangeant. CR Acad Sci Paris Sér A–B 287(7):A561–A563. in French
Lindenstrauss E (2001) Pointwise theorems for amenable groups. Invent Math 146(2):259–295
Lyons R (1988) On measures simultaneously 2- and 3-invariant. Isr J Math 61(2):219–224
Mañé R (1983) On the uniqueness of the maximizing measure for rational maps. Bol Soc Brasil Mat 14(1):27–43
Mañé R (1985) On the Bernoulli property for rational maps. Ergod Theory Dynam Syst 5(1):71–88
Meilijson I (1974) Mixing properties of a class of skew-products. Isr J Math 19:266–270
Meshalkin LD (1959) A case of isomorphism of Bernoulli schemes. Dokl Akad Nauk SSSR 128:41–44. in Russian
Nevo A (1994) Pointwise ergodic theorems for radial averages on simple Lie groups. I. Duke Math J 76(1):113–140
Nevo A, Stein EM (1994) A generalization of Birkhoff’s pointwise ergodic theorem. Acta Math 173(1):135–154
Ornstein D (1970a) Factors of Bernoulli shifts are Bernoulli shifts. Adv Math 5:349–364
Ornstein D (1970b) Two Bernoulli shifts with infinite entropy are isomorphic. Adv Math 5:339–348
Ornstein D (1970c) Bernoulli shifts with the same entropy are isomorphic. Adv Math 4:337–352
Ornstein DS (1973a) A K automorphism with no square root and Pinsker’s conjecture. Adv Math 10:89–102
Ornstein DS (1973b) An example of a Kolmogorov automorphism that is not a Bernoulli shift. Adv Math 10:49–62
Ornstein DS (1973c) A mixing transformation for which Pinsker’s conjecture fails. Adv Math 10:103–123
Ornstein DS, Weiss B (1987) Entropy and isomorphism theorems for actions of amenable groups. J Anal Math 48:1–141
Ornstein DS, Rudolph DJ, Weiss B (1982) Equivalence of measure preserving transformations. Mem Am Math Soc 37(262) American Mathematical Society
Parry W (1969) Entropy and generators in ergodic theory. WA Benjamin, New York
Parry W (1981) Topics in ergodic theory. Cambridge tracts in mathematics, vol 75. Cambridge University Press, Cambridge
Petersen K (1989) Ergodic theory. Cambridge studies in advanced mathematics, vol 2. Cambridge University Press, Cambridge
Pinsker MS (1960) Dynamical systems with completely positive or zero entropy. Dokl Akad Nauk SSSR 133:1025–1026. in Russian
Ratner M (1978) Horocycle flows are loosely Bernoulli. Isr J Math 31(2):122–132
Ratner M (1982) Rigidity of horocycle flows. Ann Math (2) 115(3):597–614
Ratner M (1983) Horocycle flows, joinings and rigidity of products. Ann Math (2) 118(2):277–313
Ratner M (1991) On Raghunathan’s measure conjecture. Ann Math (2) 134(3):545–607
Rudolph DJ (1976) Two nonisomorphic K-automorphisms with isomorphic squares. Isr J Math 23(3–4):274–287
Rudolph DJ (1979) An example of a measure preserving map with minimal self-joinings, and applications. J Anal Math 35:97–122
Rudolph DJ (1983) An isomorphism theory for Bernoulli free Z-skew-compact group actions. Adv Math 47(3):241–257
Rudolph DJ (1985) Restricted orbit equivalence. Mem Am Math Soc 54(323) American Mathematical Society
Rudolph DJ (1988) Asymptotically Brownian skew products give non-loosely Bernoulli K-automorphisms. Invent Math 91(1):105–128
Rudolph DJ (1990) 2 and 3 invariant measures and entropy. Ergod Theory Dynam Syst 10(2):395–406
Schmidt K (1984) Invariants for finitary isomorphisms with finite expected code lengths. Invent Math 76(1):33–40
Shields P (1973) The theory of Bernoulli shifts. Chicago lectures in mathematics. University of Chicago Press, Chicago
Sinaĭ J (1959) On the concept of entropy for a dynamic system. Dokl Akad Nauk SSSR 124:768–771. in Russian
Sinaĭ JG (1962) A weak isomorphism of transformations with invariant measure. Dokl Akad Nauk SSSR 147:797–800. in Russian
Thouvenot J-P (1975a) Quelques propriétés des systèmes
Ornstein DS, Shields PC (1973) An uncountable family of dynamiques qui se décomposent en un produit de deux
K-automorphisms. Adv Math 10:63–88 systèmes dont l'un est un schéma de Bernoulli. Confer-
Ornstein DS, Weiss B (1973) Geodesic flows are ence on ergodic theory and topological dynamics, Kib-
Bernoullian. Isr J Math 14:184–198 butz, Lavi, 1974. Isr J Math 21(2–3):177–207. in
Ornstein DS, Weiss B (1974) Finitely determined implies French
very weak Bernoulli. Isr J Math 17:94–104 Thouvenot J-P (1975b) Une classe de systèmes pour
Ornstein D, Weiss B (1983) The Shannon–McMillan– lesquels la conjecture de Pinsker est vraie. Conference
Breiman theorem for a class of amenable groups. Isr on ergodic theory and topological dynamics, Kibbutz
J Math 44(1):53–60 Lavi, 1974. Isr J Math 21(2–3):208–214. in French
Dynamical Systems of Probabilistic Origin: Gaussian and Poisson Systems

Élise Janvresse¹, Emmanuel Roy² and Thierry De La Rue³
¹ Laboratoire Amiénois de Mathématique Fondamentale et Appliquée, CNRS-UMR 7352, Université de Picardie Jules Verne, Amiens, France
² Laboratoire Analyse, Géométrie et Applications, Université Paris 13 Institut Galilée, Villetaneuse, France
³ Laboratoire de Mathématiques Raphaël Salem, CNRS – Université de Rouen Normandie, Saint Étienne du Rouvray, France

© Springer Science+Business Media, LLC, part of Springer Nature 2023
C. E. Silva, A. I. Danilenko (eds.), Ergodic Theory, https://doi.org/10.1007/978-1-0716-2388-6_725
Originally published in R. A. Meyers (ed.), Encyclopedia of Complexity and Systems Science, © Springer Science+Business Media LLC 2019, https://doi.org/10.1007/978-3-642-27737-5_725-1

Article Outline

Glossary
Definition of the Subject
Introduction
From Probabilistic Objects to Dynamical Systems
Spectral Theory
Basic Ergodic Properties
Joinings, Factors, and Centralizer
GAGs and PAPs
Future Directions
Bibliography

Glossary

Centralizer. The centralizer of an invertible measure-preserving transformation $T$ is the set $C(T)$ of all invertible measure-preserving transformations on the same measure space which commute with $T$.

(Simple) Counting measure. A counting measure on a measurable space $(X, \mathcal{A})$ is a measure of the form $\sum_{i \in I} \delta_{x_i}$ where $(x_i)_{i \in I}$ is a countable family of elements of $X$. The counting measure is said to be simple if $x_i \neq x_j$ whenever $i \neq j$.

Gaussian process, Gaussian space. A Gaussian process is a family of real-valued random variables defined on a probability space $(\Omega, \mathbb{P})$, such that any linear combination of finitely many of these random variables is either 0 or normally distributed.
A real linear subspace of $L^2(\mathbb{P})$ is a Gaussian space if any nonzero random variable it contains is normally distributed. The closure of the real linear subspace spanned by a Gaussian process is a Gaussian space.

Infinite divisibility. Let $(G, \mathcal{G}, +)$ be a measurable Abelian semigroup, i.e., the addition

$$(G \times G, \mathcal{G} \otimes \mathcal{G}) \to (G, \mathcal{G}), \qquad (g_1, g_2) \mapsto g_1 + g_2$$

is commutative and measurable. The convolution $\nu * \rho$ of probability measures $\nu$ and $\rho$ on $(G, \mathcal{G})$ is well defined as the image of $\nu \otimes \rho$ by the addition.
A probability measure $\nu$ on $(G, \mathcal{G})$ is infinitely divisible if for any $k \geq 1$, there exists a probability measure $\nu_k$ on $(G, \mathcal{G})$ such that $\nu = (\nu_k)^{*k}$.

Kronecker subset of the unidimensional torus. A subset $K$ of the unidimensional torus $\mathbb{T} = \mathbb{R}/\mathbb{Z}$ is a Kronecker set if any continuous function $f : K \to S^1$ is a uniform limit of characters: there exists a sequence $(k_n) \subset \mathbb{Z}$ such that

$$\max_{t \in K} \left| f(t) - e^{i 2\pi k_n t} \right| \xrightarrow[n \to \infty]{} 0.$$

Any finite set of rationally independent elements of $\mathbb{T}$ is a Kronecker set, but there also exist perfect Kronecker subsets of $\mathbb{T}$ (see for example (Cornfeld et al. 1982, Appendix 4)).

Point process. A point process $N$ on $(X, \mathcal{A})$ is a random variable taking values in the space $X^*$ of counting measures on $(X, \mathcal{A})$. It is said to be simple if $N$ is almost surely a simple counting measure.
The measure $A \mapsto \mathbb{E}[N(A)]$ on $(X, \mathcal{A})$ is called the intensity of $N$.
A point process of intensity $\mu$ is said to have moment of order $k \geq 1$ if, for all $A \in \mathcal{A}$ with $0 < \mu(A) < \infty$,

$$\mathbb{E}\left[ N(A)^k \right] < \infty.$$

unidimensional torus, called the spectral measure of $h$, satisfying for all $k \in \mathbb{Z}$
The classical theory of Gaussian systems studies the above-mentioned standard Gaussian systems. But following (Lemańczyk et al. 2000), it is useful to introduce also the generalized Gaussian systems as probability-preserving dynamical systems $(X, \mathcal{B}, \mu, T)$ satisfying the following more general property: there exists a closed real subspace $H$ of $L^2(\mu)$ such that

• $H$ is a Gaussian space.
• $H$ is invariant by $U_T$ (in particular, the stationary process $(\xi \circ T^n)_{n \in \mathbb{Z}}$ is Gaussian).
• The sigma-algebra generated by $H$ is $\mathcal{B}$.

Geometric interpretation. A geometric model for a standard Gaussian dynamical system is proposed in de la Rue (1995) as a transformation of a complex Brownian motion path. More precisely, let $B = (B_t)_{0 \leq t \leq 1}$ be a complex Brownian motion with $B_0 = 0$. For any given probability measure $\gamma$ on $\mathbb{T}$ (identified here with $[0, 1[$), let us define, for $0 \leq s < 1$,

$$\theta(s) := \inf \{ x \in [0, 1[ \, : \, \gamma([0, x]) \geq s \}.$$

symmetrized of $\gamma$, defined for each measurable subset $A$ of $\mathbb{T}$ by

$$\sigma(A) = \frac{\gamma(A) + \gamma(-A)}{2}.$$

Moreover, if $\sigma$ is continuous, $(\xi_p)_{p \in \mathbb{Z}}$ generates the same sigma-algebra as $(B_t)_{0 \leq t \leq 1}$, and the measure-preserving system $(C_0([0, 1], \mathbb{C}), \mu_W, T_\gamma)$ is a Gaussian dynamical system isomorphic to $X_\sigma$.

Poisson Suspensions

Poisson point process. Let $(X, \mathcal{A})$ be a standard Borel space, and let $(X^*, \mathcal{A}^*)$ be the canonical space of point processes on $(X, \mathcal{A})$, where

• $X^*$ is the set of counting measures on $X$,
• $\mathcal{A}^*$ is the sigma-algebra generated by the maps $(N_A)_{A \in \mathcal{A}}$, where

$$N_A : X^* \to \mathbb{N} \cup \{+\infty\}, \qquad \omega \mapsto \omega(A).$$
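The canonical space of point processes can be made concrete on a toy example. The following is only a minimal sketch, assuming finite counting measures stored as multiplicity tables and measurable sets given as predicates; the helper names (`counting_measure`, `evaluate`, `is_simple`, `push_forward`) are hypothetical and do not come from the article. The last helper assumes the usual convention that a transformation of the base acts on counting measures by moving each point.

```python
from collections import Counter

def counting_measure(points):
    """Represent the finite counting measure sum_i delta_{x_i} by point multiplicities."""
    return Counter(points)

def evaluate(omega, indicator):
    """Evaluation map N_A: omega -> omega(A), where A is given by its indicator predicate."""
    return sum(mult for x, mult in omega.items() if indicator(x))

def is_simple(omega):
    """A counting measure is simple when no point is charged more than once."""
    return all(mult == 1 for mult in omega.values())

def push_forward(omega, transformation):
    """Image of omega when every point x is moved to transformation(x) (assumed convention)."""
    image = Counter()
    for x, mult in omega.items():
        image[transformation(x)] += mult
    return image

# Usage: four points in [0, 1), one of them charged twice.
omega = counting_measure([0.1, 0.4, 0.4, 0.9])
mass_of_left_half = evaluate(omega, lambda x: x < 0.5)  # omega([0, 1/2)) = 3
```

A multiplicity table cannot represent the value $+\infty$ allowed for $N_A$, so this sketch only covers counting measures with finitely many points.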
system $(X^*, \mathcal{A}^*, \mu^*, T_*)$ is called the Poisson suspension over the base $(X, \mathcal{A}, \mu, T)$.
Of course the properties of the Poisson suspensions depend on those of the base system, and this makes it possible to establish strong connections between infinite-measure-preserving systems and probability-preserving systems.

Spectral Theory

Basics of Spectral Theory
Let us recall some basic notions of spectral theory (see e.g., (Lemańczyk 2011)).
Let $U$ be a unitary operator acting on a separable Hilbert space $H$, and let $h \in H$. We denote by $\sigma_h$ the spectral measure of $h$ (see (1)). Let $C(h)$ be the cyclic space of $h$ under $U$, that is, the closure of the linear span of the vectors $U^n h$, $n \in \mathbb{Z}$. The linear map between $C(h)$ and $L^2(\sigma_h)$ that maps $U^n h$ to $e^{i 2\pi n \cdot}$ extends to an isometry and intertwines the operator $U$ with the unitary operator

$$V : f \mapsto e^{i 2\pi \cdot} f, \quad f \in L^2(\sigma_h). \qquad (4)$$

The maximal spectral type of $U$ is the equivalence class of $\sigma_{h_{\max}}$ for some $h_{\max}$ satisfying:

$$\forall g \in H, \quad \sigma_g \ll \sigma_{h_{\max}}.$$

Fock Space
We describe here an algebraic construction that plays a crucial role in the study of our objects.
Let once again $H$ be a Hilbert space and denote by $H^{\odot n}$ the vector space of symmetric elements of the $n$-th tensor product $H^{\otimes n}$. When $H$ is $L^2(X, \mu)$ for some sigma-finite measure $\mu$ on $X$, then $H^{\odot n}$ can be identified with the subspace $L^2_{\mathrm{perm}}(X^n, \mu^{\otimes n})$ of $L^2(X^n, \mu^{\otimes n})$ consisting of functions which are invariant by coordinate permutations.
With the convention $H^{\odot 0} := \mathbb{C}$, we can consider the vector space $\bigoplus_{n \geq 0} H^{\odot n}$ which is the formal direct sum whose elements are finite sums of vectors of $H^{\odot n}$, $n \geq 1$. The space $\bigoplus_{n \geq 0} H^{\odot n}$ can be equipped with a scalar product by considering the direct sum as orthogonal and by endowing each $H^{\odot n}$ with the scalar product $\langle \cdot, \cdot \rangle_{H^{\otimes n}}$. The (Boson) Fock space $F(H)$ of $H$ is the Hilbert space obtained as the completion of $\bigoplus_{n \geq 0} H^{\odot n}$ with respect to the norm of the scalar product we just set up.

Operators on a Fock Space
Whenever $\Phi$ is an operator on $H$ of norm less than or equal to 1, it extends naturally to an operator $\widetilde{\Phi}$ on the Fock space $F(H)$ by acting on $H^{\odot n}$ as $\Phi^{\otimes n}$, that is

$$\forall v \in H, \quad \widetilde{\Phi}(v \otimes \cdots \otimes v) := \Phi v \otimes \cdots \otimes \Phi v.$$

This operator $\widetilde{\Phi}$ is called the second quantization of $\Phi$ (see Attal n.d., Chap. 8, p. 16).
The following proposition considers the second quantization $\widetilde{U}$ of a unitary operator $U$.

Proposition 4.1 If $U$ is unitary on $H$ and $\sigma_{\max}$ is in the equivalence class of its maximal spectral type, then

Application to Gaussian and Poisson Chaos

Fock Space Structure of $L^2$ for Gaussian Dynamical Systems and Poisson Suspensions
In the case of the standard Gaussian dynamical system $X_\sigma$, generated by the Gaussian process $(\xi_n)_{n \in \mathbb{Z}}$, we denote by $H_1^r$ the real Gaussian
subspace of $L^2(\mu_\sigma)$ spanned by the random variables $\xi_n$, $n \in \mathbb{Z}$, and by $H_1 := H_1^r + i H_1^r$ the complex subspace spanned by the process. Then $H_1$ is isometric to $L^2(\mathbb{T}, \sigma)$, with an isometry extending the correspondence $\xi_n \leftrightarrow e^{i 2\pi n \cdot}$ ($n \in \mathbb{Z}$).
In the case of the Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$, we denote by $H_1$ the subspace of $L^2(\mu^*)$ spanned by the random variables of the form $N_A - \mu(A)$, $A \in \mathcal{A}$, $\mu(A) < \infty$. In this case $H_1$ is isometric to $L^2(X, \mu)$, with an isometry extending the correspondence $N_A - \mu(A) \leftrightarrow \mathbb{1}_A$ ($A \in \mathcal{A}$, $\mu(A) < \infty$).
In both cases, $H_0$ denotes the subspace of constant functions. Then for each $n \geq 2$, we define inductively the subspace $H_n$ as the orthocomplement of $\bigoplus_{0 \leq j < n} H_j$ in the space of all polynomials of degree at most $n$ in variables in $H_1$ (note that elements of $H_1$ always have moments of any order). In the classical probabilist terminology, the subspace $H_n$ is called the $n$-th chaos.
It turns out that $H_n$ is, in the Gaussian case, isometric to $L^2_{\mathrm{perm}}(\mathbb{T}^n, \sigma^{\otimes n})$, whereas in the Poisson case it is isometric to $L^2_{\mathrm{perm}}(X^n, \mu^{\otimes n})$. We therefore get the following description of $L^2$ as a Fock space, which in the Gaussian case takes the form

$$L^2(\mu_\sigma) = \bigoplus_{n \geq 0} H_n \simeq F\left( L^2(\mathbb{T}, \sigma) \right),$$

and in the Poisson case

$$L^2(\mu^*) = \bigoplus_{n \geq 0} H_n \simeq F\left( L^2(X, \mu) \right).$$

In the case of the standard Gaussian system $X_\sigma$, the action of $U_S$ on the first chaos $H_1$ corresponds to the multiplication by $e^{i 2\pi \cdot}$ in $L^2(\mathbb{T}, \sigma)$. In general, the action of $U_S$ on the $n$-th chaos $H_n$ corresponds to the multiplication by the function $(t_1, \ldots, t_n) \mapsto e^{i 2\pi (t_1 + \cdots + t_n)}$ in $L^2_{\mathrm{perm}}(\mathbb{T}^n, \sigma^{\otimes n})$.
In the case of the Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$, the action of $U_{T_*}$ on $H_1$ corresponds to the action of $U_T$ on $L^2(\mu)$. In general, the action of $U_{T_*}$ on the $n$-th chaos $H_n$ corresponds to the action of $U_T^{\otimes n}$ on $L^2_{\mathrm{perm}}(X^n, \mu^{\otimes n})$.

Basic Ergodic Properties

Ergodicity and Mixing
Ergodic properties as spectral properties. We list some classical ergodic properties that are spectral by nature. We start with the probability-preserving case, where it is customary to consider $U_T : f \mapsto f \circ T$ acting on $L^2_0(\mu) = L^2(\mu) \ominus \mathbb{C}$, the orthocomplement of the constant functions, as $U_T$ acts trivially on the latter: $U_T(\alpha \mathbb{1}_X) = \alpha \mathbb{1}_X$. In that case, the reduced maximal spectral type denotes the maximal spectral type of $U_T$ acting on $L^2_0(\mu)$.

Theorem 5.1 Let $(X, \mathcal{A}, \mu, T)$ be a dynamical system with $\mu(X) = 1$, $U_T$ its Koopman operator acting on $L^2_0(\mu)$ and $\rho^0_{\max}$ a finite measure in the measure class of the reduced maximal spectral type.
operator acting on $L^2(\mu)$ and $\rho_{\max}$ a finite measure in the measure class of the maximal spectral type.

• $(X, \mathcal{A}, \mu, T)$ has no $T$-invariant set of finite positive measure if and only if $\rho_{\max}$ is continuous.
• $(X, \mathcal{A}, \mu, T)$ is of zero type (i.e., $\mu(A \cap T^n B) \to 0$ as $n$ tends to infinity for all finite $\mu$-measure sets $A$ and $B$ in $\mathcal{A}$) if and only if $\rho_{\max}$ is Rajchman.

Theorem 5.3 The Gaussian dynamical system $X_\sigma$ is ergodic if and only if the spectral measure $\sigma$ is continuous, that is $\sigma(\{t\}) = 0$ for each $t \in \mathbb{T}$. In this case it is also weakly mixing.
The system $X_\sigma$ is (strongly) mixing if and only if $\sigma$ is Rajchman.

In particular, by the Riemann-Lebesgue Lemma, if $\sigma$ is absolutely continuous, then the associated Gaussian system is mixing, which was first proved by Ito (1944). But there also exist singular spectral measures whose Fourier coefficients vanish at infinity (Menchoff 1916).
The same kinds of arguments apply to Poisson suspensions: together with Theorem 5.2, this yields the following characterizations (proved by Marchat (1978) and Grabinsky (1984)).

Theorem 5.4 The Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$ is ergodic if and only if the base system $(X, \mathcal{A}, \mu, T)$ has no invariant set of finite positive measure. In this case it is also weakly mixing.
The Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$ is (strongly) mixing if and only if the base system $(X, \mathcal{A}, \mu, T)$ is of zero type.

In the case where its spectral measure is the Lebesgue measure $\lambda$ on $\mathbb{T}$, the stationary Gaussian process $(\xi_n)_{n \in \mathbb{Z}}$ is composed of orthogonal, hence independent, random variables. Thus $X_\lambda$ is a Bernoulli shift of infinite entropy. Now if $\sigma \ll \lambda$, we will see in Theorem 6.1 that $X_\sigma$ is a factor of a Bernoulli shift. By Ornstein theory (Ornstein 1974), $X_\sigma$ is itself isomorphic to a Bernoulli shift, and since it also has infinite entropy by Theorem 5.5, it is in fact isomorphic to $X_\lambda$.
In the general case, decomposing the spectral measure as the sum of its singular and absolutely continuous parts and assuming that both are nonzero, we get by Theorem 6.1 below that $X_\sigma$ is the direct product of a Bernoulli shift of infinite
entropy with a zero-entropy Gaussian system (corresponding to the singular part of the spectral measure). In particular, any standard Gaussian system satisfies the Pinsker property (it is a product of a zero-entropy system with a Bernoulli shift).
As far as Poisson suspensions are concerned, anything can happen with the entropy, which can be 0, positive finite or infinite. Together with the fact that, when the base $(X, \mathcal{A}, \mu, T)$ is a probability space, the entropy of the Poisson suspension $T_*$ is just the same as the entropy of $T$, this suggests viewing the entropy of the Poisson suspension $T_*$ as a possible way to define the entropy of $T$ when $T$ is an infinite measure-preserving transformation, as proposed in Roy (2005). Other ways to define the entropy of an infinite measure-preserving transformation, also generalizing the finite-measure case, had been proposed by Krengel (1967) and Parry (1969), and it is proved in Janvresse et al. (2010) that these three notions of entropy coincide in many cases. However, an example of a transformation with zero Krengel entropy, but whose Poisson suspension has positive entropy, is presented in Janvresse and De La Rue (2012).
Assume that there exists in the base system $(X, \mathcal{A}, \mu, T)$ a wandering set $A$ (i.e., whose images $T^n A$, $n \in \mathbb{Z}$, are pairwise disjoint), and such that $X = \bigcup_{n \in \mathbb{Z}} T^n A$ (the system is called dissipative in this case). Then, obviously, the base is of zero type and the Poisson suspension is mixing. But thanks to the independence property of the Poisson process, the Poisson suspension is in fact Bernoulli. However, there also exist Bernoulli examples of Poisson suspensions where the base is conservative (i.e., there is no wandering set of positive measure). Such examples are provided by Poisson suspensions over null-recurrent Markov chains: we consider a countable set $S$, an irreducible null-recurrent stochastic matrix $P = (p_{i,j})_{i,j \in S}$, and let $q = (q_i)_{i \in S}$ be a nonzero measure which is stationary with respect to $P$. We can then form the associated Markov shift $(X, \mathcal{A}, \mu, T)$ where $X = S^{\mathbb{Z}}$, $T$ is the shift transformation, and $\mu$ is the shift-invariant infinite measure given on cylinder sets by the formula

$$\mu([i_0, i_1, \ldots, i_k]) := q_{i_0} \, p_{i_0, i_1} \cdots p_{i_{k-1}, i_k}.$$

It has been proved by Kalikow (1981) and Grabinsky (1984) that the associated Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$ is Bernoulli. Moreover, as shown in Janvresse et al. (2010), its entropy is given by the following formula generalizing the entropy for positive-recurrent Markov chains:

$$h(T_*) = - \sum_{i \in S} q_i \sum_{j \in S} p_{i,j} \log p_{i,j}.$$

(Examples where the above entropy is finite are provided by Krengel (1967).)
The structure of a general Poisson suspension is not known. We can ask in particular whether there exists a Poisson suspension which is K but not Bernoulli.

Joinings, Factors, and Centralizer

Gaussian Factors and Centralizer
Gaussian factors of a Gaussian system. In the standard Gaussian system $X_\sigma$ generated by the Gaussian process $(\xi_n)_{n \in \mathbb{Z}}$, let us take a nonzero element $\zeta \in H_1^r$, and set for each $n \in \mathbb{Z}$, $\zeta_n := \zeta \circ S^n \in H_1^r$. Then $(\zeta_n)_{n \in \mathbb{Z}}$ is a stationary Gaussian process, and generates a particular factor sigma-algebra of $X_\sigma$ which we call a Gaussian factor. This factor can be more precisely described by the following analysis.
Spectral theory provides an isometry between the closed real subspace $H_1^r$ of $L^2(\mu_\sigma)$ generated by the Gaussian process $(\xi_n)$ and the following real subspace of $L^2(\mathbb{T}, \sigma)$:

$$L^2_{\mathrm{sym}}(\mathbb{T}, \sigma) := \left\{ f \in L^2(\mathbb{T}, \sigma) : f(-t) = \overline{f(t)} \text{ for } \sigma\text{-almost every } t \right\}.$$

The element $\zeta$ of $H_1^r$ corresponds to some function $\varphi \in L^2_{\mathrm{sym}}(\mathbb{T}, \sigma)$, and the spectral measure $\sigma_1$ of the process $(\zeta_n)_{n \in \mathbb{Z}}$ is absolutely continuous with respect to $\sigma$, with density $\frac{d\sigma_1}{d\sigma} = |\varphi|^2$.
Observe conversely that any symmetric positive finite measure $\sigma_1 \ll \sigma$ can be realized in this
way, for example, by taking the real-valued function $\varphi := \sqrt{\frac{d\sigma_1}{d\sigma}} \in L^2_{\mathrm{sym}}(\mathbb{T}, \sigma)$.
The Gaussian factor generated by $(\zeta_n)_{n \in \mathbb{Z}}$ only depends on the equivalence class of $\sigma_1$, so we can denote it by $\mathcal{F}_{\sigma_1}$. The action of $S$ on $\mathcal{F}_{\sigma_1}$ is isomorphic to $X_{\sigma_1}$.
If $\sigma_1$ is equivalent to $\sigma$, then $\mathcal{F}_{\sigma_1}$ coincides with the Borel sigma-algebra of $X_\sigma$, and $X_{\sigma_1}$ is isomorphic to $X_\sigma$. On the other hand, if $\sigma_1$ is not equivalent to $\sigma$, we can find another symmetric positive finite measure $\sigma_2$ on $\mathbb{T}$ such that

• $\sigma_2$ is singular with respect to $\sigma_1$.
• $\sigma$ is equivalent to $\sigma_1 + \sigma_2$.

Using the fact that orthogonal subspaces of $H_1^r$ correspond to independent families of Gaussian random variables, we then get another Gaussian factor $\mathcal{F}_{\sigma_2}$ which is independent of $\mathcal{F}_{\sigma_1}$, and such that $\mathcal{F}_{\sigma_1}$ and $\mathcal{F}_{\sigma_2}$ together generate the same sigma-algebra as the original Gaussian process $(\xi_n)$. This shows that any strict Gaussian factor $\mathcal{F}_{\sigma_1}$ of $X_\sigma$ admits an independent complement. In the case when $X_\sigma$ is ergodic, this independent complement is itself weakly mixing, therefore the corresponding extension $X_\sigma \to X_{\sigma_1}$ is relatively weakly mixing.
We can summarize some of the above results through the following theorem.

Theorem 6.1 Let $\sigma$, $\sigma_1$, $\sigma_2$ be symmetric positive finite measures on $\mathbb{T}$.

• If $\sigma_1$ and $\sigma_2$ are equivalent, then $X_{\sigma_1}$ and $X_{\sigma_2}$ are isomorphic.
• If $\sigma_1 \ll \sigma$, then $X_{\sigma_1}$ is a factor of $X_\sigma$ and corresponds to a Gaussian factor $\mathcal{F}_{\sigma_1}$ of $X_\sigma$.
• If $\sigma$ is equivalent to $\sigma_1 + \sigma_2$, where $\sigma_1$ and $\sigma_2$ are mutually singular, then $X_\sigma$ is isomorphic to the direct product $X_{\sigma_1} \times X_{\sigma_2}$.

Gaussian centralizer. The geometric interpretation of a standard Gaussian dynamical system as a transformation of the Brownian motion path given in section “Gaussian Systems” makes it possible to show that the centralizer of a Gaussian system is always rich. First, if in Formula (3) we replace $e^{i 2\pi \theta(s)}$ by $e^{i 2\pi u \theta(s)}$ for $u \in \mathbb{R}$, we get a measure-preserving $\mathbb{R}$-action $(T_\gamma^u)_{u \in \mathbb{R}}$. Hence any standard Gaussian dynamical system can be embedded in an $\mathbb{R}$-action. Moreover each $T_\gamma^u$ gives rise to a generalized Gaussian system (as the Gaussian subspace $H_1^r$ is stable by $T_\gamma^u$).
Observe also that if we take two probability measures $\gamma_1$ and $\gamma_2$ on $\mathbb{T}$, then $T_{\gamma_1}$ and $T_{\gamma_2}$ always commute. Hence the centralizer of a Gaussian dynamical system always contains transformations isomorphic to any other standard Gaussian dynamical system.
The above examples all belong to the so-called Gaussian centralizer, whose elements are constructed as follows. We start by defining the multiplicative group

$$G_\sigma := \left\{ \chi \in L^2_{\mathrm{sym}}(\mathbb{T}, \sigma) : |\chi| = 1 \ (\sigma\text{-a.e.}) \right\}.$$

For each $\chi$ in $G_\sigma$, the multiplication by $\chi$ in $L^2_{\mathrm{sym}}(\mathbb{T}, \sigma)$ is a unitary operator, which corresponds in $H_1^r$ to a unitary operator $U_\chi$. Moreover, the process $(U_\chi(\xi_n))_{n \in \mathbb{Z}}$ has the same distribution as the generating Gaussian process $(\xi_n)_{n \in \mathbb{Z}}$, hence $U_\chi$ comes from a measure-preserving transformation, which we denote here by $T_\chi$, commuting with $S$ on $X_\sigma$.
Note that $U_\chi$ can be naturally extended to a unitary operator on the complex space $H_1$. The Koopman operator associated to $T_\chi$ is then nothing but the second quantization of $U_\chi$.
The set of transformations

$$C_g(S) := \left\{ T_\chi : \chi \in G_\sigma \right\}$$

is a subgroup of the centralizer of $S$ which is called the Gaussian centralizer.
Compact and classical factors of a Gaussian dynamical system. Newton and Parry (1966) (see also Maruyama (1967)) introduced another type of factor of the Gaussian system $X_\sigma$ by considering the sigma-algebra of subsets of $\mathbb{R}^{\mathbb{Z}}$ which are invariant by the transformation $(\xi_n)_{n \in \mathbb{Z}} \mapsto (-\xi_n)_{n \in \mathbb{Z}}$ (called the even factor). We can see that this factor is not a Gaussian factor, as the
$(\mathbb{R}^{\mathbb{Z}} \times \mathbb{R}^{\mathbb{Z}}, \mathcal{B}(\mathbb{R}^{\mathbb{Z}}) \otimes \mathcal{B}(\mathbb{R}^{\mathbb{Z}}), \nu, S \times S)$, the two natural projections on $\mathbb{R}^{\mathbb{Z}}$ provide two copies

$$(\xi'_n)_{n \in \mathbb{Z}} = \left( \xi'_0 \circ (S \times S)^n \right)_{n \in \mathbb{Z}} \quad \text{and} \quad (\xi''_n)_{n \in \mathbb{Z}} = \left( \xi''_0 \circ (S \times S)^n \right)_{n \in \mathbb{Z}}$$

of the original Gaussian process. The self-joining $\nu$ is said to be a Gaussian self-joining if the real subspace of $L^2(\nu)$ spanned by the $\xi'_n$ and the $\xi''_n$, $n \in \mathbb{Z}$, is a Gaussian space.

Example 6.4 The product self-joinings, and more generally the relatively independent product over a Gaussian factor, are Gaussian self-joinings. The graph self-joining associated to a transformation in the Gaussian centralizer is also a Gaussian self-joining. The ergodic components of the relatively independent product over a compact factor are Gaussian self-joinings.

For the Poisson counterpart, the aforementioned family of self-joinings is easier to introduce by describing their structure explicitly.
Let $(X^*, \mathcal{A}^*, \mu^*, T_*)$ be a Poisson suspension. Start with a “sub-self-joining” of the base, namely a system $(X \times X, \mathcal{A} \otimes \mathcal{A}, m, T \times T)$ where $m$ is a $T \times T$-invariant measure whose projections satisfy

$(X^*, \mathcal{A}^*, (\mu - m_1)^*, T_*)$, $(X^* \times X^*, \mathcal{A}^* \otimes \mathcal{A}^*, m^*, T_* \times T_*)$, and $(X^*, \mathcal{A}^*, (\mu - m_2)^*, T_*)$.

On $X^* \times X^*$, we define $\widetilde{m}$ as the pushforward measure of $(\mu - m_1)^* \otimes m^* \otimes (\mu - m_2)^*$ by the factor map

$$\left( \omega'_0, (\omega_1, \omega_2), \omega''_0 \right) \mapsto \left( \omega'_0 + \omega_1, \ \omega_2 + \omega''_0 \right).$$

We get a self-joining $(X^* \times X^*, \mathcal{A}^* \otimes \mathcal{A}^*, \widetilde{m}, T_* \times T_*)$ of the original Poisson suspension.

Definition 6.5 We call Poisson self-joining of a Poisson suspension any self-joining which is obtained as explained above.

Example 6.6 The product self-joining corresponds to the case where $m$ is the null measure.
If $S$ is in the centralizer of $T$ and $m := \Delta_S$ is the corresponding graph measure, then $\widetilde{m} = \Delta_{S_*}$, i.e., the graph self-joining associated to $S_*$.
If $\mathcal{C} \subset \mathcal{A}$ is a sigma-finite factor and $m := \mu \otimes_{\mathcal{C}} \mu$ is the relatively independent joining over $\mathcal{C}$, then $\widetilde{m} = \mu^* \otimes_{\mathcal{C}^*} \mu^*$, i.e., the relatively independent joining over the Poisson factor $\mathcal{C}^*$.

Despite the different nature of their structure, it is possible to give a unified characterization of Poisson and Gaussian self-joinings (see Roy 2005).
Similarly, a self-joining of a Poisson suspension $(X^*, \mathcal{A}^*, \mu^*, T_*)$ is Poisson if and only if the associated Markov operator $\Phi$ acting on $L^2(\mu^*)$ is the second quantization of an operator $\varphi$ on $L^2(\mu)$ corresponding to a sub-self-joining of $(X, \mathcal{A}, \mu, T)$ (a sub-Markov operator commuting with $T$).

The following property is central in this theory (see (Lemańczyk et al. 2000) for Gaussian, (Roy 2005) and (Derriennic et al. 2008) for Poisson).

Proposition 6.9 A Gaussian (resp. Poisson) self-joining of an ergodic Gaussian system (resp. Poisson suspension) is ergodic.

GAGs and PAPs

(which means that the spectral measure $\sigma$ satisfies the same assumptions as in Theorem 7.1). Let $\nu$ be a self-joining of $X_\sigma$, and denote by $\xi'$ and $\xi''$ the two copies of the original Gaussian process with spectral measure $\sigma$ in the probability-preserving dynamical system

$$\left( \mathbb{R}^{\mathbb{Z}} \times \mathbb{R}^{\mathbb{Z}}, \mathcal{B}(\mathbb{R}^{\mathbb{Z}} \times \mathbb{R}^{\mathbb{Z}}), \nu, S \times S \right)$$

(as in Definition 6.3). In $L^2(\nu)$, let $H'$ and $H''$ be the real subspaces spanned, respectively, by $\xi'$ and $\xi''$, and $H := H' + H''$. For any $h \in H$, the spectral measure of the stationary process $(h \circ (S \times S)^n)_{n \in \mathbb{Z}}$ is absolutely continuous with respect to $\sigma$. Hence, if we further assume that $\nu$ is ergodic, Theorem 7.1 implies that $H$ is a Gaussian subspace.
We formalize the above by the following definition and theorem.
joinings. Since these self-joinings can tell a lot about the factors and centralizer of the corresponding transformation, it is no surprise that we are able to control the factors and centralizer of GAGs and PAPs.
We have seen in section “Gaussian Factors and Centralizer” that in the centralizer of any Gaussian system we can find transformations isomorphic to any other Gaussian system. It turns out that for GAGs, there is no other element in the centralizer.

Theorem 7.8 (Centralizer of a GAG (Lemańczyk et al. 2000)) Let $X_\sigma$ be a GAG, and let $T \in C(S)$. Then for each $\zeta$ in the real Gaussian subspace $H_1^r$ of $L^2(\mu_\sigma)$ spanned by the generating Gaussian process, we have $\zeta \circ T \in H_1^r$. In particular the system $(\mathbb{R}^{\mathbb{Z}}, \mathcal{B}(\mathbb{R}^{\mathbb{Z}}), \mu_\sigma, T)$ is a generalized Gaussian system.

Theorem 7.11 (Factors of a PAP (Janvresse et al. 2017b)) Any nontrivial factor of the PAP $(X^*, \mathcal{A}^*, \mu^*, T_*)$ contains a nontrivial Poisson factor. Any factor over which $T_*$ is relatively weakly mixing is a Poisson factor.

In particular, when $T$ is in the family (FS), $T_*$ has no nontrivial factor, as $T$ itself has no nontrivial factor (Janvresse et al. 2018). Note that the primeness and the triviality of the centralizer of $T_*$ in this case are striking differences compared to Gaussian systems, which always possess a lot of factors and a large centralizer. The dissimilarity can be pushed even further, as it is proved (see (Janvresse et al. 2017b) and (Janvresse et al. 2020)) that if $T$ is in the family (FS), $T_*$ is in fact disjoint from any standard Gaussian system.
For Poisson suspensions, the entropy theory is much richer and the situation where all three notions of entropy of a sigma-finite measure-preserving system do not coincide remains rather mysterious. For example, is it true that the Poisson entropy always dominates the Krengel entropy of the system?
It is worth mentioning that Poisson suspensions can also be defined over non-singular transformations, provided some integrability condition is satisfied. This is a potentially rich area to explore. (There is ongoing work on this topic by A. Danilenko, Z. Kosloff, and the second author of this entry.)
Finally, we point out that this presentation focused on $\mathbb{Z}$-actions, even though the notions of Gaussian systems and Poisson suspensions extend naturally to more general group actions. For flows ($\mathbb{R}$-actions), Gaussian systems and Poisson suspensions provide interesting examples of flows for which the self-similarity set

$$I(T) := \left\{ s \in \mathbb{R}^* : (T_{st})_{t \in \mathbb{R}} \text{ is isomorphic to } (T_t)_{t \in \mathbb{R}} \right\}$$

can be fully described (see (Danilenko and Ryzhikov 2012; Fraczek and Lemańczyk 2009; Fraczek et al. 2013)). For other groups, the generalization of some topics presented in the present survey is not always obvious. The entropy of Gaussian actions of Abelian groups is proved to be 0 or $\infty$ in Lemańczyk (1998); however, it is unknown if the same holds for countable amenable groups. We can also ask for which group actions we have a Foias-Stratila theory.

Bibliography

Attal S. Lectures in quantum noise theory. http://math.univ-lyon1.fr/attal/chapters.html
Cornfeld IP, Fomin SV, Sinai YG (1982) Ergodic theory. Grundlehren der Mathematischen Wissenschaften [Fundamental principles of mathematical sciences], vol 245. Springer, New York
Danilenko AI, Ryzhikov VV (2012) On self-similarities of ergodic flows. Proc Lond Math Soc 104(3):431–454. (English)
de la Rue T (1993) Entropie d’un système dynamique gaussien: cas d’une action de ℤ^d. C R Acad Sci Paris Sér I 317(2):191–194. (French)
de la Rue T (1995) Mouvement moyen et système dynamique gaussien. Probab Theory Relat Fields 102(1):45–56. (French)
de la Rue T (1996) Systèmes dynamiques gaussiens d’entropie nulle, lâchement et non lâchement Bernoulli. Ergodic Theory Dyn Syst 16(2):379–404. (French)
de la Rue T (1998a) L’induction ne donne pas toutes les mesures spectrales. Ergodic Theory Dyn Syst 18(6):1447–1466. (French)
de la Rue T (1998b) Rang des systèmes dynamiques gaussiens. Isr J Math 104:261–283. (French)
Derriennic Y, Fraczek K, Lemańczyk M, Parreau F (2008) Ergodic automorphisms whose weak closure of off-diagonal measures consists of ergodic self-joinings. Colloq Math 110(1):81–115. (English)
Dobrushin RL (1956) On the Poisson law for the distribution of particles in space. Ukr Mat Zh 8:127–134. (Russian)
Doob JL (1953) Stochastic processes. Wiley, New York. 654 S. 1953 (English)
Foias C, Stratila S (1967) Ensembles de Kronecker dans la théorie ergodique. C R Acad Sci Paris 267:166–168
Fomin S (1950) On dynamical systems in a space of functions. Ukr Mat Zh 2(2):25–47. (Russian)
Fraczek K, Lemańczyk M (2009) On the self-similarity problem for ergodic flows. Proc Lond Math Soc 99(3):658–696. (English)
Fraczek K, Kulaga J, Lemańczyk M (2013) On the self-similarity problem for Gaussian-Kronecker flows. Proc Am Math Soc 141(12):4275–4291. (English)
Girsanov IV (1958) Spectra of dynamical systems generated by stationary Gaussian processes. Dokl Akad Nauk SSSR 119:851–853. (Russian)
Goldstein S, Lebowitz JL, Aizenman M (1975) Ergodic properties of infinite systems. Dyn Syst Theory Appl Battelle Seattle 1974 Renc, Lect Notes Phys 38:112–143
Grabinsky G (1984) Poisson process over σ-finite Markov chains. Pac J Math 111:301–315. (English)
Ito K (1944) On the ergodicity of a certain stationary process. Proc Imp Acad Tokyo 20:54–55. (English)
Iwanik A, Lemańczyk M, de la Rue T, de Sam Lazaro J (1997) Quelques remarques sur les facteurs des systèmes dynamiques gaussiens. Stud Math 125(3):247–254. (French)
Janvresse É, De La Rue T (2012) Zero Krengel entropy does not kill Poisson entropy. Ann Inst Henri Poincaré, Probab Stat 48(2):368–376. (English)
Janvresse É, Meyerovitch T, Roy E, de la Rue T (2010) Poisson suspensions and entropy for infinite transformations. Trans Am Math Soc 362(6):3069–3094. (English)
Janvresse É, Roy E, de la Rue T (2017a) Nearly finite Chacon transformation, hal-01586869. Annales Henri Lebesgue 2(2019):369–414
Janvresse É, Roy E, de la Rue T (2017b) Poisson suspensions and SuShis. Ann Scient Éc Norm Sup 50(6):1301–1334
Janvresse É, Roy E, de la Rue T (2018) Invariant measures for Cartesian powers of Chacon infinite transformation. Israel J Math 224:1–37
Janvresse É, Roy E, de la Rue T (2020) Ergodic Poisson splittings, preprint. Ann Probab 48(3):1266–1285
Kalikow S (1981) A Poisson random walk is Bernoulli. Commun Math Phys 81:495–499. (English)
Krengel U (1967) Entropy of conservative transformations. Z Wahrscheinlichkeitstheorie Verw Gebiete 7:161–181. MR MR0218522 (36 #1608)
Last G, Penrose M (2018) Lectures on the Poisson process, vol 7. Cambridge University Press, Cambridge. (English)
Lemańczyk M (1998) Entropy of Gaussian actions for countable Abelian groups. Fundam Math 157(2–3):277–286. (English)
Lemańczyk M (2011) Spectral theory of dynamical systems. Springer New York, New York, pp 1618–1638
Lemańczyk M, Parreau F, Thouvenot J-P (2000) Gaussian automorphisms whose ergodic self-joinings are Gaussian. Fundam Math 164(3):253–293
Lemańczyk M, Parreau F, Roy E (2011) Joining primeness and disjointness from infinitely divisible systems. Proc Am Math Soc 139(1):185–199. (English)
Leonov VP (1960) The use of the characteristic functional and semi-invariants in the ergodic theory of stationary processes. Sov Math Dokl 1:878–881. (English)
Marchat FA (1978) A class of measure-preserving transformations arising by the Poisson process, Ph. D. Thesis, Berkeley, Dec 1978
Maruyama G (1949) The harmonic analysis of stationary stochastic processes. Mem Fac Sci Kyūsyū Univ Ser A 4:45–106
Maruyama G (1967) A singular flow with countable Lebesgue spectrum. J Math Soc Japan 19:359–365. (English)
Maruyama G (1970) Infinitely divisible processes. Theory Probab Appl 15(1):1–22
Menchoff D (1916) Sur l'unicité du développement trigonométrique. C R Acad Sci Paris 163:433–436. (French)
Neretin YA (1996) Categories of symmetries and infinite-dimensional groups. Transl. from the Russian by G. G. Gould. Clarendon Press, Oxford. (English)
Newton D (1966) On Gaussian processes with simple spectrum. Z Wahrscheinlichkeitstheor Verw Geb 5:207–209. (English)
Newton D (1971) Coalescence and spectrum of automorphisms of a Lebesgue space. Z Wahrscheinlichkeitstheor Verw Geb 19:117–122. (English)
Newton D, Parry W (1966) On a factor automorphism of a normal dynamical system. Ann Math Stat 37:1528–1533. (English)
Ornstein DS (1974) Ergodic theory, randomness, and dynamical systems. Yale University Press, New Haven. (English)
Parry W (1969) Entropy and generators in ergodic theory. W. A. Benjamin, Inc., New York-Amsterdam. MR MR0262464 (41 #7071)
Pinsker MS (1960) Dynamical systems with completely positive or zero entropy. Sov Math Dokl 1:937–938. (English)
Roy E (2005) Mesures de Poisson, infinie divisibilité et propriétés ergodiques. PhD thesis
Roy E (2009) Poisson suspensions and infinite ergodic theory. Ergodic Theory Dyn Syst 29(2):667–683
Sinai YG (1963) On higher order spectral measures of ergodic stationary processes. Theory Probab Appl 8:429–436. (English)
Thouvenot J-P (1987) The metrical structure of some Gaussian processes, Ergodic theory and related topics II. In: Proceedings of the conference on Georgenthal/GDR 1986, Teubner-Texte Math vol 94, pp 195–198
Thouvenot J-P (1996) Utilisation des processus gaussiens en théorie ergodique. Hommage à P. A. Meyer et J. Neveu. Société Mathématique de France, Paris, pp 303–308. (French)
Totoki H (1964) The mixing property of Gaussian flows. Mem Fac Sci Kyushu Univ Ser A 18:136–139. (English)
Vershik AM, Gel'fand IM, Graev MI (1975) Representations of the group of diffeomorphisms. Russ Math Surv 30(6):1–50. (English)
Ergodic Theory: Nonsingular Transformations

Alexandre I. Danilenko¹ and Cesar E. Silva²
¹B. Verkin Institute for Low Temperature Physics and Engineering of the NAS of Ukraine, Kharkiv, Ukraine
²Department of Mathematics, Williams College, Williamstown, MA, USA

© Springer Science+Business Media, LLC, part of Springer Nature 2023
C. E. Silva, A. I. Danilenko (eds.), Ergodic Theory, https://doi.org/10.1007/978-1-0716-2388-6_183
Originally published in R. A. Meyers (ed.), Encyclopedia of Complexity and Systems Science, © Springer Science+Business Media LLC 2020, https://doi.org/10.1007/978-3-642-27737-5_183-2

Article Outline

Glossary
Definition of the Subject
Introduction and Basic Results
Panorama of Examples
Topological Groups Aut(X, μ), Aut₂(X, μ) and Aut₁(X, μ)
Orbit Theory
Mixing Notions and Multiple Recurrence
Dynamical Properties of IDPFT Systems
Dynamical Properties of Nonsingular Bernoulli and Markov Shifts
Dynamical Properties of Nonsingular Poisson Suspensions and Nonsingular Gaussian Transformations
Spectral Theory for Nonsingular Systems
Entropy and Other Invariants
Nonsingular Joinings and Factors
Smooth Nonsingular Transformations
Miscellaneous Topics
Applications. Connections with Other Fields
Further Directions
Bibliography

Glossary

Conservativity T is conservative if for all sets A of positive measure there exists an integer n > 0 such that $\mu(A \cap T^{-n}A) > 0$.

Ergodicity T is ergodic if every measurable subset A of X that is invariant under T (i.e., $T^{-1}A = A$) is either μ-null or μ-conull. Equivalently, every Borel function f : X → ℝ such that f ∘ T = f is constant a.e.

Nonsingular dynamical system Let (X, ℬ, μ) be a standard Borel space equipped with a σ-finite measure. A Borel map T : X → X is a nonsingular transformation of X if for any N ∈ ℬ, $\mu(T^{-1}N) = 0$ if and only if μ(N) = 0. In this case, the measure μ is called quasi-invariant for T, and the quadruple (X, ℬ, μ, T) is called a nonsingular dynamical system. If $\mu(A) = \mu(T^{-1}A)$ for all A ∈ ℬ, then μ is said to be invariant under T or, equivalently, T is measure-preserving.

Types II, II₁, II∞, and III Suppose that μ is nonatomic and T is invertible and ergodic (and hence conservative). If there exists a σ-finite measure ν on ℬ which is equivalent to μ and invariant under T then T is said to be of type II. It is easy to see that ν is unique up to scaling. If ν is finite, then T is of type II₁. If ν is infinite, then T is of type II∞. If T is not of type II, then T is said to be of type III.

Definition of the Subject

An abstract measurable dynamical system consists of a set X (phase space) with a transformation T : X → X (evolution law or time) and a finite or σ-finite measure μ on X that specifies a class of negligible subsets. Nonsingular ergodic theory studies systems where T respects μ in a weak sense: The transformation preserves only the class of negligible subsets, but it may not preserve μ. This survey is about dynamics and invariants of nonsingular systems. Such systems model "non-equilibrium" situations in which events that are impossible at some time remain impossible at any other time. Of course, the first question that arises is whether it is possible to find an equivalent invariant measure, i.e., pass to a hidden equilibrium without changing the negligible subsets. It turns out that there exist systems which do not admit an equivalent invariant finite or even σ-finite measure. They are of our
primary interest here. In a way (Baire category), most systems are like that.

Nonsingular dynamical systems arise naturally in various fields of mathematics: topological and smooth dynamics, probability theory, random walks, theory of numbers, von Neumann algebras, unitary representations of groups, mathematical physics, and so on. They also can appear in the study of probability preserving systems: some criteria of mild mixing and distality, a problem of Furstenberg on disjointness, etc. We briefly discuss this in §15. Nonsingular ergodic theory studies all of them from a general point of view:

• What is the qualitative nature of the dynamics?
• What are the orbits?
• Which properties are typical within a class of systems?
• How do we find computable invariants to compare or distinguish various systems?

Typically there are two kinds of results: Some are extensions to nonsingular systems of theorems for finite measure-preserving transformations (for instance, §2, §4, §12), and the others are about new properly "nonsingular" phenomena (see §5 §9). Philosophically speaking, the dynamics of nonsingular systems is more diverse than that of their finite measure-preserving counterparts. That is why it is usually easier to construct counterexamples than to develop a general theory. While infinite measure-preserving transformations are not the main subject of this survey, we cover them partially as they are also nonsingular systems and arise often as natural examples or counterexamples in the nonsingular setting. Because of shortage of space, we concentrate mainly on invertible transformations, and we have not included as many references as we had wished. General group or semigroup actions are practically not considered here (with some exceptions in §15 devoted to applications). A number of open problems are scattered through the entire text.

We thank J. Aaronson, J. R. Choksi, V. Ya. Golodets, M. Lemańczyk, F. Parreau, and E. Roy for useful remarks on the first edition of this survey. Many new results related to nonsingular dynamical systems have appeared since the release of the first edition. The second edition is substantially enlarged to cover (partially) this progress. In particular, we added new Sections 7 and 9 and totally rewrote Section 8. More than 100 new references have been added. We are grateful to J. Aaronson, N. Avraham-Re'em, J. Hawkins, and Z. Kosloff for their remarks on the second edition of the survey.

Introduction and Basic Results

This section includes the basic results involving conservativity and ergodicity as well as some direct nonsingular counterparts of the basic machinery from classic ergodic theory: mean and pointwise ergodic theorems, Rokhlin lemma, ergodic decomposition, generators, Glimm-Effros theorem, and special representation of nonsingular flows. The historically first example of a transformation of type III (due to Ornstein) is also given here with full proof.

Nonsingular Transformations
In this survey, we will consider mainly invertible nonsingular transformations, i.e., those which are bijections when restricted to an invariant Borel subset of full measure. Thus when we refer to a nonsingular dynamical system (X, ℬ, μ, T), we shall assume that T is an invertible nonsingular transformation (unless the contrary is specified explicitly). Of course, each measure ν on ℬ which is equivalent to μ, i.e., μ and ν have the same null sets, is also quasi-invariant under T. In particular, since μ is σ-finite, T admits an equivalent quasi-invariant probability measure. For each i ∈ ℤ, we denote by $\omega_i^\mu$ or $\omega_i$ the Radon-Nikodym derivative $d(\mu \circ T^i)/d\mu \in L^1(X, \mu)$. The derivatives satisfy the cocycle equation $\omega_{i+j}(x) = \omega_i(x)\,\omega_j(T^i x)$ for a.e. x and all i, j ∈ ℤ.

Basic Properties of Conservativity and Ergodicity
A measurable set W is said to be wandering if for all i, j ≥ 0 with i ≠ j, $T^{-i}W \cap T^{-j}W = \emptyset$. Clearly, if T has a wandering set of positive measure, then it
cannot be conservative. A nonsingular transformation T is incompressible if whenever $T^{-1}C \subset C$, then $\mu(C \setminus T^{-1}C) = 0$.

Proposition 2.1 (see Krengel 1985) Let (X, ℬ, μ, T) be a nonsingular dynamical system. The following are equivalent:

(i) T is conservative.
(ii) For every measurable set A, $\mu\big(A \setminus \bigcup_{n=1}^{\infty} T^{-n}A\big) = 0$.
(iii) T is incompressible.
(iv) Every wandering set for T is null.
(v) $\sum_{i=0}^{+\infty} \omega_i(x) = \infty$ at a.e. x (provided that μ(X) < ∞).

Since any finite measure-preserving transformation is incompressible, we deduce that it is conservative. This is the statement of the classical Poincaré recurrence lemma. If T is a conservative nonsingular transformation of (X, ℬ, μ) and A ∈ ℬ a subset of positive measure, we can define an induced transformation $T_A$ of the space (A, ℬ ∩ A, μ↾A) by setting $T_A x := T^n x$ if n = n(x)

$$C = \Big\{x : \sum_{i=0}^{+\infty} f(T^i x)\,\omega_i(x) = \infty \ \text{a.e.}\Big\} \quad\text{and}\quad D = \Big\{x : \sum_{i=0}^{+\infty} f(T^i x)\,\omega_i(x) < \infty \ \text{a.e.}\Big\}.$$

The set C is called the conservative part of T, and D is called the dissipative part of T. If D is of positive measure, we call T dissipative. If D is of full measure, we call T totally dissipative. If T is ergodic and μ is nonatomic, then T is automatically conservative. The translation by 1 on the group ℤ furnished with the counting measure is an example of an ergodic nonconservative (infinite measure-preserving) transformation.

Proposition 2.3 Let (X, ℬ, μ, T) be a nonsingular dynamical system. The following are equivalent:

(i) T is conservative and ergodic.
(ii) For every set A of positive measure, $\mu\big(X \setminus \bigcup_{n=1}^{\infty} T^{-n}A\big) = 0$. (In this case, we will say A sweeps out.)
(iii) For every measurable set A of positive measure and for a.e. x ∈ X, there exists an integer n > 0 such that $T^n x \in A$.
(iv) For all sets A and B of positive measure, there exists an integer n > 0 such that $\mu(T^{-n}A \cap B) > 0$.
(v) If A is such that $T^{-1}A \subset A$, then μ(A) = 0 or $\mu(A^c) = 0$.

A set W of positive measure is said to be weakly

{0, 1, …, n}, $\nu_n(0) = 0.5$ and $\nu_n(i) = 1/(2n)$ for 0 < i ≤ n and all n ∈ ℕ. Denote by (X, μ) the infinite product probability space $\bigotimes_{n=1}^{\infty}(A_n, \nu_n)$. Of course, μ is nonatomic. A point of X is an infinite sequence $x = (x_n)_{n=1}^{\infty}$ with $x_n \in A_n$ for all n. Given $a_1 \in A_1, \dots, a_n \in A_n$, we denote the cylinder $\{x = (x_i)_{i=1}^{\infty} \in X : x_1 = a_1, \dots, x_n = a_n\}$ by $[a_1, \dots, a_n]$. Define a Borel map T : X → X by setting
measure ν equivalent to μ. Let φ ≔ dμ/dν. Then

$$\omega_i^\mu(x) = \varphi(x)^{-1}\,\varphi(T^i x) \quad\text{for a.a. } x \in X \text{ and all } i \in \mathbb{Z}. \qquad (2)$$

Fix a real C > 1 such that the set $E_C := \varphi^{-1}([C^{-1}, C]) \subset X$ is of positive measure. By a standard approximation argument, for each sufficiently large n, there is a cylinder such that $\mu(E_C \cap [a_1, \dots, a_n]) > 0.9\,\mu([a_1, \dots, a_n])$. Since $\nu_{n+1}(0) = 0.5$, it follows that $\mu(E_C \cap [a_1, \dots, a_n, 0]) > 0.8\,\mu([a_1, \dots, a_n, 0])$. Moreover, by the pigeon hole principle there is 0 < i ≤ n + 1 with $\mu(E_C \cap [a_1, \dots, a_n, i]) > 0.8\,\mu([a_1, \dots, a_n, i])$. Find $N_n > 0$ such that $T^{N_n}[a_1, \dots, a_n, 0] = [a_1, \dots, a_n, i]$. Since $\omega^\mu_{N_n}$ is constant on $[a_1, \dots, a_n, 0]$, there is a subset $E_0 \subset E_C \cap [a_1, \dots, a_n, 0]$ of positive measure such that $T^{N_n}E_0 \subset E_C \cap [a_1, \dots, a_n, i]$. Moreover, $\omega^\mu_{N_n}(x) = \nu_{n+1}(i)/\nu_{n+1}(0) = (n+1)^{-1}$ for a.a. $x \in [a_1, \dots, a_n, 0]$. On the other hand, we deduce from (2) that $\omega^\mu_{N_n}(x) \ge C^{-2}$ for all $x \in E_0$, a contradiction.

Mean and Pointwise Ergodic Theorems. Rokhlin Lemma
Let (X, ℬ, μ, T) be a nonsingular dynamical system. Define a unitary operator $U_T$ of $L^2(X, \mu)$ by setting

$$U_T f := \sqrt{\omega_1}\cdot f \circ T. \qquad (3)$$

topology. Then P ≠ 0 if and only if there is $f \in L^2(X, \mu)$ such that f ≠ 0 and $U_T f = f$. Of course, $U_T|f| = |f|$. We now define a nontrivial finite measure λ ≺ μ by setting $d\lambda/d\mu := |f|^2$. It is straightforward to verify that λ is invariant under T.

Denote by $\mathcal{I}$ the sub-σ-algebra of T-invariant sets. Let $\mathbb{E}_\mu[\,\cdot \mid \mathcal{I}]$ stand for the conditional expectation with respect to $\mathcal{I}$. Note that if T is ergodic, then $\mathbb{E}_\mu[f \mid \mathcal{I}] = \int f\,d\mu$. Now we state a nonsingular analogue of Birkhoff's pointwise ergodic theorem, due to Hurewicz (1944) and in the form stated by Halmos (1946).

Theorem 2.6 (Hurewicz pointwise Ergodic Theorem). If T is conservative, μ(X) = 1, $f, g \in L^1(X, \mu)$ and g > 0, then

$$\frac{\sum_{i=0}^{n-1} f(T^i x)\,\omega_i(x)}{\sum_{i=0}^{n-1} g(T^i x)\,\omega_i(x)} \to \frac{\mathbb{E}_\mu[f \mid \mathcal{I}](x)}{\mathbb{E}_\mu[g \mid \mathcal{I}](x)} \quad\text{as } n \to \infty \text{ for a.e. } x.$$

There is a nonsingular version (Silva and Thieullen 1991) of the subadditive ergodic theorem of Kingman. Let T be a nonsingular transformation, and let $(\omega_n)$ be its sequence of Radon-Nikodym derivatives. A sequence of functions $(f_n)$ is said to be subadditive if $f_{n+m} \le f_m + (f_n \circ T^m)\,\omega_m$ for all n,
m ≥ 0. Since $(f_n)$ is subadditive, one can verify that the following limit

$$\mathbb{E}_\mu[(f_n)](x) := \lim_{n\to\infty} \frac{1}{n}\,\mathbb{E}_\mu[f_n \mid \mathcal{I}](x)$$

exists almost everywhere.

Theorem 2.7 (Nonsingular subadditive Ergodic Theorem). If T is conservative, μ(X) = 1, $(f_n)$ is a subadditive sequence of integrable functions, $g \in L^1(X, \mu)$ and g > 0, then

$$\frac{f_n(x)}{\sum_{i=0}^{n-1} g(T^i x)\,\omega_i(x)} \to \frac{\mathbb{E}_\mu[(f_n)](x)}{\mathbb{E}_\mu[g \mid \mathcal{I}](x)} \quad\text{as } n \to \infty \text{ for a.e. } x.$$

Of course, Theorem 2.6 follows from Theorem 2.7 if we set $f_n(x) := \sum_{i=0}^{n-1} f(T^i x)\,\omega_i(x)$ for all n > 0 and x ∈ X.

A transformation T is aperiodic if the T-orbit of a.e. point from X is infinite. The following classical statement can be deduced easily from Proposition 2.1.

Lemma 2.8 (Rokhlin's lemma (Friedman 1970)). Let T be an aperiodic nonsingular transformation of a standard probability space (X, μ). For each ε > 0 and integer N > 1, there exists a measurable set A such that the sets $A, TA, \dots, T^{N-1}A$ are disjoint and $\mu(A \cup TA \cup \dots \cup T^{N-1}A) > 1 - \varepsilon$.

This lemma was refined later (for ergodic transformations) by Lehrer and Weiss as follows.

Theorem 2.9 (ϵ-free Rokhlin lemma (Lehrer and Weiss 1982)). Let T be ergodic and μ nonatomic. Then for a subset B ⊂ X and any N for which $\bigcup_{k=0}^{\infty} T^{-kN}(X \setminus B) = X$, there is a set A such that the sets $A, TA, \dots, T^{N-1}A$ are disjoint and $A \cup TA \cup \dots \cup T^{N-1}A \supset B$.

The condition $\bigcup_{k=0}^{\infty} T^{-kN}(X \setminus B) = X$ holds of course for each B ≠ X if T is totally ergodic, i.e., $T^p$ is ergodic for any p, or if N is prime. We now state a nonsingular version of Alpern's lemma which is a generalization of Lemma 2.8.

Theorem 2.10 (Alpern's lemma (Alpern and Prasad 1990)). Let T be an aperiodic nonsingular transformation of a standard probability space (X, μ). Let $\pi = (\pi_1, \pi_2, \dots)$ be a probability vector such that $\{k \mid \pi_k > 0\}$ is a relatively prime set of integers. Then there is a measurable partition $P = \{P_{k,i} \mid k > 0,\ i = 1, \dots, k\}$ of X satisfying

(a) $TP_{k,i} = P_{k,i+1}$ for each k and every i < k.
(b) $\sum_{i=1}^{k} \mu(P_{k,i}) = \pi_k$ for each k.

Ergodic Decomposition
A proof of the following theorem may be found in Aaronson (1997, 2.2.8) and Aaronson (1987, §6).

Theorem 2.11 (Ergodic Decomposition Theorem). Let T be a conservative nonsingular transformation on a standard probability space (X, ℬ, μ). There exists a standard probability space $(Y, \nu, \mathcal{A})$ and a family of probability measures $\mu_y$ on (X, ℬ), for y ∈ Y, such that

(i) For each A ∈ ℬ, the map $y \mapsto \mu_y(A)$ is Borel and for each A ∈ ℬ
$$\mu(A) = \int \mu_y(A)\,d\nu(y).$$
(ii) For y, y′ ∈ Y, the measures $\mu_y$ and $\mu_{y'}$ are mutually singular.
(iii) For each y ∈ Y, the transformation T is nonsingular and conservative, ergodic on $(X, ℬ, \mu_y)$.
(iv) For each y ∈ Y, $\omega_1^{\mu_y} = \omega_1^{\mu}$ $\mu_y$-a.e.
(v) (Uniqueness) If there exists another probability space $(Y', \nu', \mathcal{A}')$ and a family of probability measures $\mu'_{y'}$ on (X, ℬ), for y′ ∈ Y′, satisfying (i)–(iv), then there exists a measure-preserving isomorphism θ : Y → Y′ such that $\mu_y = \mu'_{\theta y}$ for ν-a.e. y.

It follows that if T preserves an equivalent σ-finite measure, then the system $(X, ℬ, \mu_y, T)$ is
of type II for ν-a.a. y. The space $(Y, \nu, \mathcal{A})$ is called the space of T-ergodic components.

Generators
A σ-algebra $\mathcal{F}$ is called a generator for a nonsingular transformation T on a standard probability space (X, ℬ, μ), if $\bigvee_{n=-\infty}^{\infty} T^n\mathcal{F} = ℬ$. It was shown in Rokhlin (1965) and Parry (1966) that T has a countable generator, i.e., a countable partition P of X so that the σ-algebra of P-measurable sets is a generator for T. It was refined by Krengel (1970): If T is of type II∞ or III, then there exists P consisting of two sets only. Moreover, given a sub-σ-algebra $\mathcal{F} \subset ℬ$ such that $\mathcal{F} \subset T\mathcal{F}$ and $\bigcup_{k>0} T^k\mathcal{F} = ℬ$, the set $\{A \in \mathcal{F} \mid (A, X \setminus A) \text{ is a generator of } T\}$ is dense in $\mathcal{F}$. It follows, in particular, that T is isomorphic to the shift on $\{0, 1\}^{\mathbb{Z}}$ equipped with a quasi-invariant probability measure. For a version of the Krengel 2-sets generator theorem in the Borel category, we refer to Hochman (2019).

The Glimm-Effros Theorem
The classical Bogolyubov-Krylov theorem states that each homeomorphism of a compact space admits an ergodic invariant probability measure (Cornfeld et al. 1982). The following statement by Glimm (1961) and Effros (1965) is a "nonsingular" analogue of that theorem. (We consider here only a particular case of ℤ-actions.)

Theorem 2.12 Let X be a Polish space and T : X → X an aperiodic homeomorphism. Then

(ii) There is an orbit of T which is not locally closed.
(iii) There is no Borel set which intersects each orbit of T exactly once.
(iv) There is a continuous probability Borel measure μ on X such that (X, μ, T) is an ergodic nonsingular system.

A natural question arises: Under the conditions of the theorem, how many such μ can exist? It turns out that there is a wealth of such measures. To state a corresponding result, we first write an important definition.

Definition 2.13 Two nonsingular systems (X, ℬ, μ, T) and (X′, ℬ′, μ′, T′) are called orbit equivalent if there is a one-to-one bimeasurable map φ : X → X′ with μ′ ∘ φ ∼ μ and such that φ maps the T-orbit of x onto the T′-orbit of φ(x) for a.a. x ∈ X.

The following theorem was proved in Katznelson and Weiss (1972), Krieger (1976a), and Schmidt (1977b).

Theorem 2.14 Let (X, T) be as in Theorem 2.12. Then for each ergodic dynamical system $(Y, \mathcal{C}, \nu, S)$ of type II∞ or III, there exist uncountably many mutually disjoint Borel measures μ on X such that (X, T, ℬ, μ) is orbit equivalent to $(Y, \mathcal{C}, \nu, S)$.

On the other hand, T may not have any finite invariant measure. The first such example appeared in Eigen et al. (1998b). We present a simpler one.

Example 2.15 Let T be an irrational rotation on the circle 𝕋, and let K be a nowhere dense closed subset of 𝕋 of positive Lebesgue measure. Let X be the complement of the T-orbit $\bigcup_{n \in \mathbb{Z}} T^n K$ of K. Then X is a T-invariant $G_\delta$-subset of zero Lebesgue measure. Hence X is Polish in the induced topology and T↾X is an aperiodic homeomorphism of X. Since T is minimal, X is dense in 𝕋 and the (T↾X)-orbit of each point

X. If it is invariant under T↾X, then λ can be considered also as a finite T-invariant measure on 𝕋. Since T is uniquely ergodic, λ is the Lebesgue measure. However X is of zero Lebesgue measure, a contradiction.

Let T be an aperiodic Borel transformation of a standard Borel space X. Denote by M(T) the set of all ergodic T-nonsingular continuous measures on X. Given μ ∈ M(T), let N(μ) denote the family of all Borel μ-null subsets. Shelah and
Weiss showed (Shelah and Weiss 1982) that $\bigcap_{\mu \in M(T)} N(\mu)$ coincides with the collection of all Borel T-wandering sets.

be the restriction of the product measure μ × Leb on X × ℝ to $X^f$ and define, for t ≥ 0,
shifts, nonsingular Poisson suspensions and Gaussian transformations, IDPFT systems and natural extensions of nonsingular endomorphisms.

atomic. Given $a_1 \in A_1, \dots, a_n \in A_n$, we denote by $[a_1, \dots, a_n]$ the cylinder $\{x = (x_i)_{i>0} \mid x_1 = a_1, \dots, x_n = a_n\}$. If $x \ne (m_1 - 1, m_2 - 1, \dots)$, we let l(x) be the smallest number l such that the l-th coordinate of x is not $m_l - 1$. We define a Borel map T : X → X by (1) if $x \ne (m_1 - 1, m_2 - 1, \dots)$ and put Tx ≔ (0, 0, …) if $x = (m_1 - 1, m_2 - 1, \dots)$. Of course, T is isomorphic to a rotation on a compact monothetic totally disconnected Abelian group. It is easy to check that T is μ-nonsingular and

$$\omega_1^\mu(x) = \prod_{n=1}^{\infty} \frac{\nu_n((Tx)_n)}{\nu_n(x_n)} = \frac{\nu_{l(x)}(x_{l(x)} + 1)}{\nu_{l(x)}(x_{l(x)})} \prod_{n=1}^{l(x)-1} \frac{\nu_n(0)}{\nu_n(m_n - 1)}$$

for a.a. $x = (x_n)_{n>0} \in X$. It is also easy to verify that T is ergodic. It is called the nonsingular product odometer associated to $(m_n, \nu_n)_{n=1}^{\infty}$. We note that Ornstein's transformation (Example 2.4) is a nonsingular product odometer.

Markov Odometers
We define Markov odometers as in Dooley and Hamachi (2003a). An ordered Bratteli diagram B (Herman et al. 1992) consists of

(i) A vertex set V which is a disjoint union of finite sets $V^{(n)}$, n ≥ 0, $V^{(0)}$ is a singleton.
(ii) An edge set E which is a disjoint union of finite sets $E^{(n)}$, n > 0;
(iii) Source mappings $s_n : E^{(n)} \to V^{(n-1)}$ and range mappings $r_n : E^{(n)} \to V^{(n)}$ such that $s_n^{-1}(v) \ne \emptyset$ for all $v \in V^{(n-1)}$ and $r_n^{-1}(v) \ne \emptyset$ for all $v \in V^{(n)}$, n > 0;
(iv) A partial order on E so that e, e′ ∈ E are comparable if and only if $e, e' \in E^{(n)}$ for some n and $r_n(e) = r_n(e')$.

will assume always that the diagram is essentially simple, i.e., there is only one infinite path $x_{\max} = (x_n)_{n>0}$ with $x_n$ maximal for all n and only one $x_{\min} = (x_n)_{n>0}$ with $x_n$ minimal for all n. The Bratteli-Vershik map $T_B : X_B \to X_B$ is defined as follows: $T_B x_{\max} = x_{\min}$. If $x = (x_n)_{n>0} \ne x_{\max}$, then let k be the smallest number such that $x_k$ is not maximal. Let $y_k$ be a successor of $x_k$. Let $(y_1, \dots, y_k)$ be the unique path such that $y_1, \dots, y_{k-1}$ are all minimal. Then we let $T_B x := (y_1, \dots, y_k, x_{k+1}, x_{k+2}, \dots)$. It is easy to see that $T_B$ is a homeomorphism of $X_B$. Suppose that we are given a sequence $P^{(n)} = \big(P^{(n)}_{v,e}\big)_{(v,e) \in V^{(n-1)} \times E^{(n)}}$ of stochastic matrices, i.e.,

(i) $P^{(n)}_{v,e} > 0$ if and only if $v = s_n(e)$.
(ii) $\sum_{\{e \in E^{(n)} \mid s_n(e) = v\}} P^{(n)}_{v,e} = 1$ for each $v \in V^{(n-1)}$.

For $e_1 \in E^{(1)}, \dots, e_n \in E^{(n)}$, let $[e_1, \dots, e_n]$ denote the cylinder $\{x = (x_j)_{j>0} \mid x_1 = e_1, \dots, x_n = e_n\}$. Then we define a Markov measure on $X_B$ by setting

$$\mu_P([e_1, \dots, e_n]) = P^{(1)}_{s_1(e_1), e_1}\, P^{(2)}_{s_2(e_2), e_2} \cdots P^{(n)}_{s_n(e_n), e_n}$$

for each cylinder $[e_1, \dots, e_n]$. The dynamical system $(X_B, \mu_P, T_B)$ is called a Markov odometer. It is easy to see that every nonsingular product odometer is a Markov odometer where the corresponding $V^{(n)}$ are all singletons.

Tower Transformations
This construction is a discrete analogue of flow under a function. Given a nonsingular dynamical system (X, μ, T) and a measurable map f : X → ℕ,
we define a new dynamical system $(X^f, \mu^f, T^f)$ by setting

$I_{n,0}[j], \dots, I_{n,h_n-1}[j]$ followed by the spacers $S_{n,0}[j], \dots, S_{n,s_n(j)-1}[j]$.
measurable map R : X → X such that μ(A) = 0 if and only if $\mu(R^{-1}A) = 0$. Suppose that μ is σ-finite on $R^{-1}ℬ$. We define the Radon-Nikodym derivative $\omega_1^\mu$ of R by setting $\omega_1^\mu = \left(\frac{d\mu \circ R^{-1}}{d\mu}\right)^{-1} \circ R$. It was shown in (Silva 1988; Silva and Thieullen 1995) that there exists a σ-finite standard measure space $(\tilde X, \tilde ℬ, \tilde\mu)$, an invertible $\tilde\mu$-nonsingular transformation $\tilde R$, and a Borel map $\pi : \tilde X \to X$ such that the following hold: $\tilde\mu \circ \pi^{-1} = \mu$, $\pi\tilde R = R\pi$, $\omega_1^{\tilde\mu}$ is $\pi^{-1}(ℬ)$-measurable and $\bigvee_{n>0} \tilde R^n \pi^{-1}(ℬ) = \tilde ℬ$. The dynamical system $(\tilde X, \tilde ℬ, \tilde R, \tilde\mu)$ is defined uniquely (up to a natural isomorphism) and called the natural extension of R. It coincides with the standard Rokhlin definition of the natural extension in the case where R preserves μ and μ is finite.

Theorem 3.3 $\tilde R$ is conservative if and only if R is μ-recurrent, i.e.,

$$\sum_{i \ge 0} (h \circ R^i)\,\omega_i = +\infty \ \text{a.e.,} \quad\text{where } \omega_i = \prod_{j=0}^{i-1} \omega_1^\mu \circ R^j,$$

for each integrable function h > 0. Moreover, if R is μ-recurrent, then $\tilde R$ is ergodic if and only if R is ergodic (Silva 1988; Silva and Thieullen 1995).

Let R be a nonsingular one-sided Bernoulli shift on $(X, \mu) = \bigotimes_{n=1}^{\infty}(A, \mu_n)$. Then the natural extension of R is isomorphic to the two-sided nonsingular Bernoulli shift T on $(\tilde X, \tilde\mu) = \bigotimes_{n=-\infty}^{\infty}(A, \tilde\mu_n)$, where $\tilde\mu_n = \mu_n$ if n > 0 and $\tilde\mu_n = \mu_1$ if n ≤ 0. The corresponding projection $\pi : \tilde X \to X$ is the natural projection, i.e., $\pi(\dots, a_{-1}, a_0, a_1, a_2, \dots) := (a_1, a_2, \dots)$.

Topological Groups Aut(X, μ), Aut₂(X, μ) and Aut₁(X, μ)

4.1. Let (X, ℬ, μ) be a standard probability space, and let Aut(X, μ) denote the group of all nonsingular transformations of X. Let ν be a finite or σ-finite measure equivalent to μ; the subgroup of the ν-preserving transformations is denoted by Aut₀(X, ν). Then Aut(X, μ) is a simple group (Eigen 1981), and it has no outer automorphisms (Eigen 1982). Ryzhikov showed (Ryzhikov 1993) that every element of this group is a product of three involutions (i.e., transformations of order 2). Moreover, a nonsingular transformation is a product of two involutions if and only if it is conjugate to its inverse by an involution.

Inspired by (Halmos 1956), Ionescu Tulcea (Ionescu Tulcea 1965) and Chacon and Friedman (Chacon and Friedman 1965) introduced the weak and the uniform topologies, respectively, on Aut(X, μ). The weak one (we denote it by $d_w$) is induced from the weak operator topology on the group of unitary operators in $L^2(X, \mu)$ by the embedding $T \mapsto U_T$ (see §2.3). Then (Aut(X, μ), $d_w$) is a Polish topological group and Aut₀(X, ν) is a closed subgroup of Aut(X, μ). This topology will not be affected if we replace μ with any equivalent measure. We note that $T_n$ weakly converges to T if and only if $\mu(T_n^{-1}A \,\triangle\, T^{-1}A) \to 0$ for each A ∈ ℬ and $d(\mu \circ T_n)/d\mu \to d(\mu \circ T)/d\mu$ in $L^1(X, \mu)$. For each p ≥ 1, one can also embed Aut(X, μ) into the isometry group of $L^p(X, \mu)$ via a formula similar to (3) but with another power of the Radon-Nikodym derivative in it. The strong operator topology on the isometry group induces the very same weak topology on Aut(X, μ) for all p ≥ 1 (Choksi and Kakutani 1979). Danilenko showed in Danilenko (1995) that (Aut(X, μ), $d_w$) is contractible. It follows easily from the Rokhlin lemma that periodic transformations are dense in Aut(X, μ).

It is natural to ask which properties of nonsingular transformations are typical in the sense of Baire category. The following technical lemma (see Friedman 1970; Choksi and Kakutani 1979) is an indispensable tool when considering such problems.

Lemma 4.1 The conjugacy class of each aperiodic transformation T is dense in Aut(X, μ) endowed with the weak topology.

It follows that Aut(X, μ) has the Rokhlin property, i.e., there is an element of this group whose conjugacy class is dense in the group. Using Lemma 4.1 and the Hurewicz ergodic theorem, Choksi and Kakutani (Choksi and Kakutani 1979) proved that the ergodic transformations form a dense $G_\delta$ in Aut(X, μ). The same holds for the
subgroup Aut₀(X, ν) (Sachdeva 1971; Choksi and Kakutani 1979). Combined with (Ionescu Tulcea 1965), the above implies that the ergodic transformations of type III form a dense $G_\delta$ in Aut(X, μ). For further refinement of this statement, we refer to section "Orbit Theory".

Since the map $T \mapsto T \times \cdots \times T$ (p times) from Aut(X, μ) to $\mathrm{Aut}(X^p, \mu^p)$ is continuous for each p > 0, we deduce that the set $\mathcal{E}_\infty$ of transformations with infinite ergodic index (which means that $T \times \cdots \times T$ (p times) is ergodic for each p > 0) is a $G_\delta$ in Aut(X, μ). It is nonempty by (Kakutani and Parry 1963). Since this $\mathcal{E}_\infty$ is invariant under conjugacy, it is dense in Aut(X, μ) by Lemma 4.1. Thus, we obtain that $\mathcal{E}_\infty$ is a dense $G_\delta$. In a similar way, one can show that $\mathcal{E}_\infty \cap \mathrm{Aut}_0(X, \nu)$ is a dense $G_\delta$ in Aut₀(X, ν) (see also (Sachdeva 1971; Choksi and Kakutani 1979; Choksi and Nadkarni 2000) for original proofs of these claims).

A nonsingular transformation T is called rigid if $T^{n_k} \to \mathrm{Id}$ weakly for some sequence $n_k \to \infty$. The rigid transformations form a dense $G_\delta$ in Aut(X, μ). It follows that the set of multiply recurrent nonsingular transformations is residual (Ageev and Silva 2001). A finer result was established in Danilenko and Silva (2004): The set of polynomially recurrent transformations in Aut₀(X, ν) is residual in Aut₀(X, ν). For the definition of multiple and polynomial recurrence, we refer to §6.5 below.

Given T ∈ Aut(X, μ), we denote the centralizer {S ∈ Aut(X, μ) | ST = TS} of T by C(T). Of course, C(T) is a closed subgroup of Aut(X, μ) and $C(T) \supset \{T^n \mid n \in \mathbb{Z}\}$. In a similar way, if T ∈ Aut₀(X, ν), the measure-preserving centralizer $C_0(T) := \mathrm{Aut}_0(X, \nu) \cap C(T)$ of T is a weakly closed subgroup of Aut₀(X, ν). The following problems, solved (by several authors) for probability-preserving systems, are still open in the nonsingular case. Are the properties

(i) T has square root.
(ii) T embeds into a flow.
(iii) T has nontrivial invariant sub-σ-algebra.
(iv) C(T) contains a torus of arbitrary dimension.

typical (residual) in Aut(X, μ) or Aut₀(X, ν)?

The uniform topology on Aut(X, μ), finer than $d_w$, is defined by the metric

$$d_u(T, S) = \mu(\{x : Tx \ne Sx\}) + \mu(\{x : T^{-1}x \ne S^{-1}x\}).$$

This topology is also complete metric. It depends only on the measure class of μ. However, the uniform topology is not separable, and that is why it is of less importance in ergodic theory. We refer to (Chacon and Friedman 1965; Friedman 1970; Choksi and Kakutani 1979; Choksi and Prasad 1983) for the properties of $d_u$.

4.2. Suppose now that μ(X) = ∞ but μ is σ-finite. We now let
dm∘T
Aut2 ðX, mÞ ≔ T AutðX, mÞ j 1 L2 ðmÞ and
dm
dm∘T
Aut1 ðX, mÞ ≔ T AutðX, mÞ j 1 L1 ðmÞ :
dm
that a sequence ðT n Þ1
n¼1 converges to T in d2 if
Theorem 4.2 (Danilenko et al. 2022a)
dm ∘ T n dm ∘ T
Tn ! T weakly and dm dm !0
2
as n ! 1. In a similar way, one can define a • (Aut2(X, m), d2) is a Polish group.
topology d1 on Aut1(X, m): A sequence ðT n Þ1
n¼1
• (Aut1(X, m), d1) is a Polish group.
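The uniform metric du from this section can be made concrete on a toy model. The sketch below is only an illustration under stated assumptions: the space is a finite set of atoms with weights `mu` (the spaces in the text are nonatomic), transformations are permutations stored as lists, and the names `inverse`, `d_u`, `swap_23`, and `cycle` are hypothetical helpers, not notation from the article.

```python
from fractions import Fraction

def inverse(perm):
    """Return the inverse of a permutation given as a list: perm[x] is the image of x."""
    inv = [0] * len(perm)
    for x, y in enumerate(perm):
        inv[y] = x
    return inv

def d_u(T, S, mu):
    """Toy uniform distance: mu{x : Tx != Sx} + mu{x : T^-1 x != S^-1 x}."""
    T_inv, S_inv = inverse(T), inverse(S)
    forward = sum(mu[x] for x in range(len(mu)) if T[x] != S[x])
    backward = sum(mu[x] for x in range(len(mu)) if T_inv[x] != S_inv[x])
    return forward + backward

# Hypothetical example: four atoms with unequal weights (an assumption for illustration).
mu = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 8), Fraction(1, 8)]
identity = [0, 1, 2, 3]
swap_23 = [0, 1, 3, 2]   # exchanges the two lightest atoms
cycle = [1, 2, 3, 0]     # moves every atom

print(d_u(identity, swap_23, mu))  # 1/2
print(d_u(identity, cycle, mu))    # 2
print(d_u(swap_23, cycle, mu))     # 7/4
```

In this toy setting two maps are close exactly when they differ only on a set of small weight, which is much more restrictive than weak closeness; the three printed values also satisfy the triangle inequality, as a metric must.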
246 Ergodic Theory: Nonsingular Transformations
Now we state the main result of this section – Krieger’s theorem on orbit classification for ergodic transformations of type III. It is a far-reaching generalization of the basic result by H. Dye: Any two ergodic probability-preserving transformations are orbit equivalent (Dye 1959).
Theorem 5.4 (Orbit equivalence for type III systems (Krieger 1969, 1970, 1972, 1976a, b)). Two ergodic transformations of type III are orbit equivalent if and only if their associated flows are isomorphic. In particular, for a fixed 0 < λ ≤ 1, any two ergodic transformations of type IIIλ are orbit equivalent.
The original proof of this theorem is rather complicated. Simpler treatments of it can be found in Hamachi and Osikawa (1981) and Katznelson and Weiss (1991).
We also note that every free ergodic flow can be realized as the associated flow of a type III0 transformation. However, it is somewhat easier to construct a ℤ²-action of type III0 whose associated flow is the given one. For this, we take an ergodic nonsingular transformation Q on a probability space (Z, ℬ, λ) and a measure-preserving transformation R of an infinite σ-finite measure space (Y, ℱ, ν) such that there is a continuous homomorphism π : ℝ → C(R) with (dν ∘ π(t)/dν)(y) = exp(t) for a.a. y (for instance, take a type III1 transformation T and put R ≔ T̃ and π(t) ≔ S_t). Let φ : Z → ℝ be a Borel map with inf_Z φ > 0. Define two transformations R_0 and Q_0 of (Z × Y, λ × ν) by setting:
\[ R_0(x, y) := (x, Ry), \qquad Q_0(x, y) = (Qx, U_x y), \]
where U_x = π(φ(x) − log(dλ ∘ Q/dλ)(x)). Notice that R_0 and Q_0 commute. The corresponding ℤ²-action generated by these transformations is ergodic. Take any transformation V ∈ Aut(Z × Y, λ × ν) whose orbits coincide with the orbits of the ℤ²-action. (According to Connes et al. (1981), any ergodic nonsingular action of any countable amenable group is orbit equivalent to a single transformation.) It is now easy to verify that the associated flow of V is the special flow built under φ ∘ Q⁻¹ with the base transformation Q⁻¹. Then V is of type III0. Since Q and φ are arbitrary, we deduce the following from Theorem 2.17.
Theorem 5.5 Every nontransitive ergodic flow is an associated flow of an ergodic transformation of type III0.
In Krieger (1976b), Krieger introduced a map F as follows. Let T be an ergodic transformation of type III0. Then the associated flow of T is a flow built under a function with a base transformation F(T). We note that the orbit equivalence class of F(T) is well defined by the orbit equivalence class of T. If F^n(T) fails to be of type III0 for some 1 ≤ n < ∞, then T is said to belong to Krieger’s hierarchy. For instance, the transformation constructed in Example 5.3 belongs to Krieger’s hierarchy. Connes gave in Connes (1975) an example of T such that F(T) is orbit equivalent to T (see also Hamachi and Osikawa 1981; Giordano and Skandalis 1985a). Hence T is not in Krieger’s hierarchy.
Almost Continuous Orbit Equivalence
In this subsection, by a dynamical system we mean a quadruple (X, τ, μ, T), where (X, τ) is a Polish space, μ is a nonatomic Borel measure of full support, and T is a nonsingular ergodic homeomorphism of X such that the function ω1 : X → ℝ is continuous (has a continuous version).
Definition 5.6 Two dynamical systems (X, τ, μ, T) and (X′, τ′, μ′, T′) are almost continuously orbit equivalent if there are dense invariant Gδ subsets X0 ⊂ X and X0′ ⊂ X′ of full measure and a homeomorphism φ : X0 → X0′ such that
• φ({T^n x | n ∈ ℤ}) = {(T′)^n φ(x) | n ∈ ℤ} at every x ∈ X0.
• μ ∘ φ⁻¹ ∼ μ′ and the Radon-Nikodym derivative d(μ ∘ φ⁻¹)/dμ′ is (can be chosen) continuous.
• letting S ≔ φ⁻¹T′φ, we have Tx = S^{n(x)}x and Sx = T^{m(x)}x, where n and m are continuous on X0.
We note that in the case where X and X′ are infinite product spaces, T and T′ preserve μ and μ′, respectively, and we omit the requirement that X0 and X0′ are Gδ, then the above definition of φ is equivalent to the “finitary” equivalence from the
celebrated work of Keane and Smorodinsky (1979). It was shown by del Junco and Şahin (2009) that any two ergodic probability-preserving homeomorphisms of Polish spaces are almost continuously orbit equivalent. The same is true for any ergodic homeomorphisms preserving infinite σ-finite local measures (del Junco and Şahin 2009). In Danilenko and del Junco (2011), a topological analogue rtop(T) of r(T) was introduced. It is a closed subgroup of ℝ which contains r(T), and it is invariant under the almost continuous orbit equivalence. In Danilenko and del Junco (2011), two type III homeomorphisms were constructed which are measure-theoretically orbit equivalent but not almost continuously orbit equivalent (their rtop-invariants are different).
Theorem 5.7 Let (X, τ, μ, T) and (X′, τ′, μ′, T′) be ergodic nonsingular homeomorphisms of Polish spaces. If the two systems are either
(i) of type IIIλ with 0 < λ < 1 and rtop(T) = rtop(T′) = log λ · ℤ or
(ii) of type III1,
then they are almost continuously orbit equivalent (Danilenko and del Junco 2011).
Characterization of almost continuous orbit equivalence for homeomorphisms of type III0 remains an open problem.
Normalizer of the Full Group. Outer Conjugacy Problem
Let
\[ N[T] = \{ R \in \mathrm{Aut}(X, \mu) \mid R[T]R^{-1} = [T] \}, \]
i.e., N[T] is the normalizer of the full group [T] in Aut(X, μ). We note that a transformation R belongs to N[T] if and only if R(Orb_T(x)) = Orb_T(Rx) for a.a. x. To define a topology on N[T], consider the T-orbit equivalence relation ℛ_T ⊂ X × X and a σ-finite measure μ_{ℛ_T} on ℛ_T given by
\[ \mu_{\mathcal{R}_T} = \int_X \sum_{y \in \mathrm{Orb}_T(x)} \delta_{(x,y)} \, d\mu(x). \]
For R ∈ N[T], we define a transformation i(R) ∈ Aut(ℛ_T, μ_{ℛ_T}) by setting i(R)(x, y) ≔ (Rx, Ry). Then the map R ↦ i(R) is an embedding of N[T] into Aut(ℛ_T, μ_{ℛ_T}). Denote by τ the topology on N[T] induced by the weak topology on Aut(ℛ_T, μ_{ℛ_T}) via i (Danilenko 1995). Then (N[T], τ) is a Polish group. A sequence R_n converges to R in (N[T], τ) if R_n → R weakly (in Aut(X, μ)) and R_n T R_n⁻¹ → R T R⁻¹ uniformly (in [T]).
Given R ∈ N[T], denote by R̃ the Maharam extension of R. Then R̃ ∈ N[T̃] and it commutes with (S_t)_{t∈ℝ}. Hence it defines a nonsingular transformation mod R on the space (Z, ν) of the associated flow W = (W_t)_{t∈ℝ} of T. Moreover, mod R belongs to the centralizer C(W) of W in Aut(Z, ν). Note that C(W) is a closed subgroup of (Aut(Z, ν), dw).
Let T be of type II∞, and let μ′ be the invariant σ-finite measure equivalent to μ. If R ∈ N[T], then it is easy to see that the Radon-Nikodym derivative dμ′ ∘ R/dμ′ is invariant under T. Hence it is constant, say c. Then mod R = log c.
Theorem 5.8 If T is of type III, then the map mod : N[T] → C(W) is a continuous onto homomorphism. The kernel of this homomorphism is the τ-closure of [T]. Hence the quotient group of N[T] by the τ-closure of [T] is (topologically) isomorphic to C(W). In particular, the τ-closure of [T] is cocompact in N[T] if and only if W is a finite measure-preserving flow with a pure point spectrum (Hamachi and Osikawa 1981; Hamachi 1981a).
The following theorem describes the homotopical structure of normalizers.
Theorem 5.9 Let T be of type II or IIIλ, 0 ≤ λ < 1. The τ-closure of [T] is contractible. N[T] is homotopically equivalent to C(W). In particular, N[T] is contractible if T is of type II. If T is of type IIIλ with 0 < λ < 1, then π1(N[T]) = ℤ (Danilenko 1995).
The outer period p(R) of R ∈ N[T] is the smallest positive integer n such that R^n ∈ [T]. We write p(R) = 0 if no such n exists.
Two transformations R and R′ in N[T] are called outer conjugate if there are transformations V ∈ N[T] and S ∈ [T] such that V R V⁻¹ = R′S. The following theorem provides convenient (for
Since T_φ commutes with the action of G on X × G by inverted right translations along the second coordinate, this action induces an ergodic G-action W_φ = (W_φ(g))_{g∈G} on the space (Z, ν) of T_φ-ergodic components. It is called the Mackey range (or Poincaré flow) of φ (Mackey 1966; Feldman and Moore 1977; Schmidt 1977a). We note that φ is regular (and cobounds with dense range into H ⊂ G) if and only if W_φ is transitive (and H is the stabilizer of a point z ∈ Z, i.e., H = {g ∈ G | W_φ(g)z = z}). Hence every cocycle taking values in a compact group is regular.
It is often useful to consider the double cocycle φ0 ≔ φ × ω1 instead of φ. It takes values in the group G × ℝ+. Since T_{φ0} is exactly the Maharam extension of T_φ, it follows from Maharam (1964) that φ0 is transient or recurrent if and only if φ is transient or recurrent, respectively.
Theorem 5.11 (Orbit classification of cocycles (Golodets and Sinel’shchikov 1994)). Let φ, φ′ : X → G be two recurrent cocycles of an ergodic transformation T. They are weakly equivalent if and only if their Mackey ranges W_{φ0} and W_{φ′0} are isomorphic.
Another proof of this theorem was presented in Fedorov (1985).
Theorem 5.12 Let T be an ergodic nonsingular transformation. Then there is a cocycle of T with dense range in G if and only if G is amenable.
It follows that if G is amenable then the subset of cocycles of T with dense range in G is a dense Gδ in M(X, G) (just adapt the argument following Example 5.3). The “only if” part of Theorem 5.12 was established in Zimmer (1978). The “if” part was considered by many authors in particular cases: G is compact (Zimmer 1977), G is solvable or amenable almost connected (Golodets and Sinel’shchikov 1985), etc. The general case was proved in Golodets and Sinel’shchikov (1983) and Herman (1979a) (see also a refinement in (Aaronson and Weiss 2004)).
We note that the “if” part in Theorem 5.12 can be refined in the case where G is a compactly generated Abelian group.
Theorem 5.13 Let T be an ergodic nonsingular transformation. If G is a compactly generated [FIA]-group, then there is a bounded ergodic cocycle φ of T with values in G (Danilenko 2021).
We recall that a locally compact Polish group G is called a [FIA]-group if the group of inner automorphisms of G is relatively compact in the group of all automorphisms of G furnished with the natural topology (Grosser and Moskowitz 1971). Of course, each Abelian group is [FIA]. The cocycle φ is bounded if there is a compact subset K in G such that φ takes values in K.
Theorem 5.5 is a particular case of the following result.
Theorem 5.14 Let G be amenable. Let V be an ergodic nonsingular action of G × ℝ+. Then there is an ergodic nonsingular transformation T and a recurrent cocycle φ of T with values in G such that V is isomorphic to the Mackey range of the double cocycle φ0 (Golodets and Sinel’shchikov 1990; Fedorov 1985; Adams et al. 1994).
Given a cocycle φ ∈ M(X, G) of T, we say that a transformation R ∈ N[T] is compatible with φ if the cocycles α_φ and α_φ ∘ (R × R) of ℛ_T are cohomologous. Denote by D(T, φ) the group of all such R. It has a natural Polish topology which is stronger than τ (Danilenko and Golodets 1996). Since [T] is a normal subgroup in D(T, φ), one can consider the outer conjugacy equivalence relation inside D(T, φ). It is called φ-outer conjugacy. Suppose that G is Abelian. Then an analogue of Theorem 5.10 for the φ-outer conjugacy is established in Danilenko and Golodets (1996). Also, the cocycles φ with D(T, φ) = N[T] are described there.
ITPFI Transformations and AT-Flows
A nonsingular transformation T is called ITPFI (an abbreviation of “infinite tensor product of factors of type I”, a term that came from the theory of von Neumann algebras) if it is orbit equivalent to a nonsingular product odometer (associated to a sequence (m_n, ν_n)_{n=1}^∞, see §3.1). If the sequence m_n can be chosen bounded, then T is called ITPFI of bounded type. If m_n = 2 for all n, then T is called ITPFI2. By Giordano and Skandalis (1985b),
every ITPFI-transformation of bounded type is ITPFI2. In view of Theorem 5.4 and Example 5.1, every ergodic transformation of type II or IIIλ with 0 < λ ≤ 1 is ITPFI2.
A remarkable characterization of ITPFI transformations in terms of their associated flows was obtained by Connes and Woods (1985). We first single out a class of ergodic flows. A nonsingular flow V = (V_t)_{t∈ℝ} on a space (Ω, ν) is called approximately transitive (AT) if given ϵ > 0 and f_1, …, f_n ∈ L^1_+(Ω, ν), there exists f ∈ L^1_+(Ω, ν) and λ_1, …, λ_n ∈ L^1_+(ℝ, dt) such that
\[ \Bigl\| f_j - \int_{\mathbb{R}} f \circ V_t \, \frac{d\nu \circ V_t}{d\nu} \, \lambda_j(t) \, dt \Bigr\|_1 < \epsilon \]
for all 1 ≤ j ≤ n. A flow built under a constant ceiling function with a funny rank-one (Ferenczi 1985) probability-preserving base transformation is AT (Connes and Woods 1985). In particular, each ergodic finite measure-preserving flow with a pure point spectrum is AT.
Theorem 5.15 An ergodic nonsingular transformation is ITPFI if and only if its associated flow is AT (Connes and Woods 1985).
The original proof of this theorem was given in the framework of von Neumann algebras theory. A simpler, purely measure-theoretical proof was given later in Hawkins (1990b) (the “only if” part) and Hamachi (1992) (the “if” part). It follows from Theorem 5.15 that every ergodic flow with pure point spectrum is the associated flow of an ITPFI transformation. This was refined recently in Berendschot and Vaes (2022b): Every ergodic flow with a pure point spectrum is the associated flow of an ITPFI2 transformation. This fact was proved earlier in Hamachi and Osikawa (1986) only for flows whose spectrum is θΓ, where Γ is a subgroup of ℚ and θ ∈ ℝ. The existence of ITPFI transformations which are not of bounded type was shown in Krieger (1972).
Krieger introduced an invariant for the orbit equivalence, called property A, and showed that each product odometer satisfies property A. He also constructed an ergodic nonsingular transformation which does not satisfy this property (Krieger 1972). Hence this transformation is not ITPFI. Though not every ergodic transformation is orbit equivalent to a nonsingular product odometer, a “weaker” form of this statement holds.
Theorem 5.16 Each ergodic nonsingular transformation is orbit equivalent to a Markov odometer (see §3.2) (Dooley and Hamachi 2003a).
In Dooley and Hamachi (2003b), an explicit example of a non-ITPFI ergodic Markov odometer (not satisfying property A) was constructed. Later, Munteanu (2012) exhibited an ergodic non-ITPFI transformation satisfying property A. In Joita and Munteanu (2014), an explicit example was constructed of a non-AT nonsingular flow W built under a function and over a nonsingular product odometer. Hence every nonsingular ergodic transformation whose associated flow is isomorphic to W is non-ITPFI.
Mixing Notions and Multiple Recurrence
Mixing and multiple recurrence are central topics in classical ergodic theory (Cornfeld et al. 1982; Furstenberg 1981). Unfortunately, these notions are considerably less “smooth” in the world of nonsingular systems. The very concepts of any kind of mixing and multiple recurrence are not well understood in view of their ambiguity. Below we discuss nonsingular systems possessing a surprising diversity of such properties that seem equivalent but are in fact different.
Weak Mixing
Let T be an ergodic conservative nonsingular transformation. A number λ ∈ ℂ is an L∞-eigenvalue for T if there exists a nonzero f ∈ L∞ so that f ∘ T = λf a.e. It follows that |λ| = 1 and f has constant modulus, which we assume to be 1. Denote by e(T) the set of all L∞-eigenvalues of T. T is said to be weakly mixing if e(T) = {1}. We refer to (Aaronson 1997, Theorem 2.7.1) for proof of the following Keane’s ergodic multiplier
shifts; see Aaronson (1997). More recently, it was shown in Dai et al. (2015) and Bozgan et al. (2015) that rank-one (infinite measure-preserving) transformations are subsequence boundedly rationally ergodic. The first version of Bozgan et al. (2015) has a proof that the rank-one transformations are subsequence weakly rationally ergodic; a simpler proof was found in Danilenko (2016b), where this property is also established for the class of funny rank-one transformations and the class of ergodic transformations of balanced finite rank. (A transformation is called of balanced finite rank if it is of finite rank and the bases of the Rokhlin towers on the n-th step of the cutting-and-stacking inductive construction have asymptotically comparable measures as n → ∞.) Therefore all these transformations are nonsquashable in view of Theorem 6.2.
The rank-one transformations for which the sequence of cuts (r_n)_{n=1}^∞ is bounded are boundedly rationally ergodic (Aaronson et al. 2017; Dai et al. 2015; Bozgan et al. 2015). As for the examples of rationally weakly mixing transformations, Aaronson (2013) shows that Markov shifts with certain conditions on their associated renewal sequences are rationally weakly mixing, and Dai et al. (2015) give rank-one examples. Subsequence rational weak mixing and rational weak mixing for products of powers have been studied in Aaronson (2013) and Adams (2015).
We have the following implications for rational weak mixing.
Theorem 6.3 If a transformation is sequentially rationally weakly mixing, then it is weakly mixing (Aaronson 2013).
Theorem 6.4 If a transformation is rationally weakly mixing, then it is weakly doubly ergodic (Bozgan et al. 2015).
It is an open problem whether weak double ergodicity implies rational weak mixing. Aaronson (2013) asked if weak rational ergodicity and weak mixing imply rational weak mixing. This was answered in the negative in (Dai et al. 2015), where an example was constructed of a weakly mixing rationally ergodic rank-one transformation that is not rationally weakly mixing. We also mention an example of a weakly mixing, rationally ergodic, and Koopman mixing (or zero type, see §6.3 for the definition) transformation that is not subsequence rationally weakly mixing (Aaronson 2016).
The set of transformations that are subsequence rationally weakly mixing is residual (Aaronson 2013), while the set of rationally weakly mixing transformations is meagre (Aaronson 2013). Since the set of power weakly mixing rank-one transformations is residual, and the rank-one transformations are subsequence boundedly rationally ergodic, there exist rank-one transformations that are power weakly mixing and subsequence boundedly rationally ergodic but not rationally weakly mixing.
Mixing, Zero-Type
We now consider several attempts to define (strong) mixing for nonsingular maps. Probably the first notion of mixing for infinite measure-preserving systems was proposed by Hopf (1937). The idea was to show an asymptotic rate for the sequence μ(A ∩ T^n B) for a large class of finite measure sets A, B. More precisely, a transformation T is mixing for a ring R (called now Krickeberg mixing), where R is a ring of sets of finite measure that is invariant under T and generates the entire σ-algebra, if there is a sequence (ρ_n)_{n=1}^∞ such that for all A, B ∈ R we have
\[ \lim_{n \to \infty} \rho_n \, \mu(A \cap T^n B) = \mu(A)\mu(B). \]
Hopf proved such a property for an infinite measure-preserving transformation defined on ℝ+ × [0, 1] that is now called an infinite random walk, with R being the ring of Riemann measurable subsets. If R is the ring of all subsets of finite measure, then there are no Krickeberg R-mixing transformations because of the existence of weakly wandering sets. We note that the above (purely measure theoretical) definition of R-mixing is due to Friedman (1978), who extended Krickeberg’s one (Krickeberg 1967) given for continuous transformations of topological spaces endowed with a measure. Recently, there have
been several works showing this version of mixing and computing mixing rates for several transformations. Melbourne and Terhesiu (2012) have verified mixing for a large class of maps including AFN maps with indifferent fixed points; these methods were extended to invertible transformations by Melbourne (2015) and to additional maps by Gouëzel (2011). Recently, Dolgopyat and Nándori (2019) have shown Krickeberg mixing for a class of special flows; other recent work appeared in Bruin et al. (2019).
Another approach to mixing was proposed by Krengel and Sucheston (1969) for nonsingular maps. Given a sequence of measurable sets {A_n}, let σ_k({A_n}) denote the σ-algebra generated by A_k, A_{k+1}, …. A sequence {A_n} is said to be remotely trivial if ∩_{k=0}^∞ σ_k({A_n}) = {∅, X} mod μ, and it is semiremotely trivial if every subsequence contains a further subsequence that is remotely trivial. A nonsingular transformation T of a σ-finite measure space is called mixing if for every set A of finite measure the sequence {T^n A} is semiremotely trivial, and completely mixing if {T^n A} is semiremotely trivial for all measurable sets A. Krengel and Sucheston show that T is completely mixing if and only if it is type II1 and mixing for the equivalent finite invariant measure. Thus, there are no type III and II∞ completely mixing nonsingular transformations on probability spaces. We note that this definition of mixing in infinite measure spaces depends on the choice of measure inside the equivalence class (but it is independent if we replace the measure by an equivalent measure with the same collection of sets of finite measure).
Hajian and Kakutani (1964) showed that an ergodic infinite measure-preserving transformation T is either of zero type: lim_{n→∞} μ(T^n A ∩ A) = 0 for all sets A of finite measure, or of positive type: lim sup_{n→∞} μ(T^n A ∩ A) > 0 for all subsets A of finite positive measure. It turns out that T is mixing if and only if it is of zero type (Krengel and Sucheston 1969). We note that in infinite measure, mixing implies mixing of all orders, i.e., if a measure-preserving T is of zero type, then μ(T^{n_1} A_1 ∩ ⋯ ∩ T^{n_k} A_k) → 0 for each k and all subsets A_1, …, A_k of finite measure whenever |n_i − n_j| → ∞ if i ≠ j.
For 0 ≤ α ≤ 1, Kakutani suggested a related definition of α-type: An infinite measure-preserving transformation is of α-type if lim sup_{n→∞} μ(A ∩ T^n A) = αμ(A) for every subset A of finite measure. In (Osikawa and Hamachi 1971), examples of ergodic transformations of any α-type and a transformation of not any type were constructed.
It was shown in Danilenko (2016a) and Loh et al. (2018) that for each pair k ≤ n, there exists a mixing rank-one infinite measure-preserving transformation of ergodic index k and conservative index n. Rigid infinite measure-preserving rank-one transformations of arbitrary ergodic index were constructed in Danilenko (2016a). Of course, rigidity implies infinite conservative index.
We now isolate an important class of concrete rank-one transformations and examine mixing properties within this class. Let T be a rank-one transformation associated with a sequence (r_n, w_n, s_n)_{n=1}^∞. If w_n(0) = w_n(1) = ⋯ = w_n(r_n − 1) and s_n(j) = z_n + j for j = 0, …, r_n − 1, then T is called a high staircase (also called a tower staircase in Bowles et al. (2001)). It was shown in Bowles et al. (2001) that each high staircase is weakly doubly ergodic and hence weakly mixing. However, there exist high staircases whose Cartesian square is not ergodic (Bowles et al. 2001). As for the mixing of the high staircases, the following theorem was proved in (Danilenko and Ryzhikov 2011). It is an infinite analogue of the Adams solution (Adams 1998) of the Smorodinsky conjecture.
Theorem 6.5 If
\[ \lim_{n \to \infty} \frac{r_n^2}{r_1 \cdots r_{n-1}} = 0 \quad \text{and} \quad \sum_{n=1}^{\infty} \frac{z_n}{h_n} = \infty, \]
then the associated high staircase is infinite measure-preserving and mixing.
Mixing high staircases which are power weakly mixing were constructed in (Danilenko and Ryzhikov 2011).
We note that mixing (zero type) does not imply either ergodicity or conservativity in the category of infinite measure-preserving transformations. Indeed, a translation on ℝ endowed with the
Lebesgue measure is nonergodic, totally dissipative but of zero type. It may seem that mixing and ergodicity together are stronger than any kind of nonsingular weak mixing considered above. However, it is not the case: If T is a weakly mixing infinite measure-preserving transformation of zero type and S is an ergodic probability-preserving transformation, then T × S is ergodic and of zero type. On the other hand, the L∞-spectrum e(T × S) is nontrivial, i.e., T × S is not weakly mixing, whenever S is not weakly mixing.
We also note that there exist rank-one infinite measure-preserving transformations T of zero type such that T × T is not conservative (hence not ergodic) (Adams et al. 1997). In contrast to that, if T is of positive type, then all of its finite Cartesian products are conservative (Aaronson and Nakada 2000). Another result that suggests that there is no good definition of mixing in the infinite measure-preserving case was proved in (James et al. 2008). It is shown there that while the mixing finite measure-preserving transformations are measurably sensitive, there exists no infinite measure-preserving system that is measurably sensitive. (Measurable sensitivity is a measurable version of the strong sensitive dependence on initial conditions – a concept from topological theory of chaos.)
The Krengel-Sucheston concept of mixing […] maps that is motivated by statistical mechanics and uses global observables. The definition is with respect to a collection of sets, global observables, and local observables. We choose a family 𝒱 of measurable sets of finite measure so that it contains sets V_1 ⊂ V_2 ⊂ ⋯ such that ∪_i V_i = X. We also have a subspace 𝒢 of L∞ functions (called global observables), and a subspace ℒ of L¹ functions (called local observables). There is also a condition on the growth rate of the measure of 𝒱-elements under iteration by T. Then Lenci defines an infinite volume average for elements F of 𝒢 by
\[ \overline{\mu}(F) = \lim_{V \nearrow X} \frac{1}{\mu(V)} \int_V F \, d\mu. \]
By this limit, we mean that for every neighborhood of \overline{\mu}(F), there is a number M > 0 so that when μ(V) > M for a set V in 𝒱, then (1/μ(V)) ∫_V F dμ is in the neighborhood. He shows that under the above conditions, \overline{\mu}(F ∘ T^n) = \overline{\mu}(F). Then he defines several notions of what he calls infinite volume mixing (Lenci 2010); we mention three here. The transformation T is said to be global-local mixing-1 if for all F in 𝒢 and all g in ℒ with ∫ g dμ = 0, we have
\[ \lim_{n \to \infty} \int (F \circ T^n) \, g \, d\mu = 0. \]
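The infinite volume average, read as the limit of (1/μ(V)) ∫_V F dμ along an exhausting family, can be approximated numerically in a simple case. The sketch below is an illustration only, under assumptions that are not in the text: X = ℝ with Lebesgue measure, the exhausting sets are V_k = [−k, k], T is the translation x ↦ x + 1, and the global observable is F(x) = cos²x. The helper names `volume_average` and `shifted` are hypothetical.

```python
import math

def volume_average(F, k, steps_per_unit=1000):
    """Midpoint-rule approximation of (1/mu(V_k)) * integral of F over V_k = [-k, k]."""
    n = 2 * k * steps_per_unit       # number of midpoint cells in [-k, k]
    h = 1.0 / steps_per_unit         # width of each cell
    total = sum(F(-k + (i + 0.5) * h) for i in range(n))
    return total * h / (2 * k)

def F(x):
    """Assumed bounded global observable."""
    return math.cos(x) ** 2

def shifted(F, n):
    """The observable F composed with the n-th power of the translation x -> x + 1."""
    return lambda x: F(x + n)

# Averages over growing windows, for F and for F composed with T^3.
for k in (1, 10, 100):
    print(k, volume_average(F, k), volume_average(shifted(F, 3), k))
```

For this choice the exact window average is 1/2 + sin(2k)/(4k), so both columns approach 1/2 as k grows, in line with the invariance of the infinite volume average under T.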
union of p-wandering sets, T ↾ C_p is p-recurrent, and
\[ \sum_{k=1}^{\infty} \mu\bigl(B \cap T^{-k}B \cap \cdots \cap T^{-dk}B\bigr) = \infty \]
for every B ⊂ C_p.
Let T be an infinite measure-preserving transformation and let ℱ be a σ-finite factor (i.e., invariant subalgebra) of T. Inoue (2004) showed that for each p > 0, if T ↾ ℱ is p-recurrent, then so is T provided that the extension T → T ↾ ℱ is isometric. It is unknown yet whether the latter assumption can be dropped. However, partial progress was achieved in (Meyerovitch 2007): If T ↾ ℱ is multiply recurrent, then so is T.
Let P ≔ {q ∈ ℚ[t] | q(ℤ) ⊂ ℤ and q(0) = 0}. An ergodic conservative nonsingular transformation T is called p-polynomially recurrent if for every q_1, …, q_p ∈ P and every subset B of positive measure there exists k ∈ ℕ with
\[ \mu\bigl(B \cap T^{q_1(k)}B \cap \cdots \cap T^{q_p(k)}B\bigr) > 0. \]
If T is p-polynomially recurrent for every p ∈ ℕ, then it is called polynomially recurrent. Furstenberg’s theorem on multiple recurrence was significantly strengthened in (Bergelson and Leibman 1996), where it was shown that every finite measure-preserving transformation is polynomially recurrent. However, Danilenko and Silva (2004) constructed […]
[…] equivalent invariant probability measure for each n ∈ ℕ. Assume that the transformation T = ⊗_{n=1}^∞ T_n is nonsingular with respect to the infinite product measure μ ≔ ⊗_{n=1}^∞ μ_n. In other words, (X, μ, T) is an IDPFT dynamical system.
Theorem 7.1 The transformation T is either conservative or totally dissipative. Moreover, if S is an ergodic conservative nonsingular transformation, then the direct product T × S is either conservative or totally dissipative (Danilenko and Lemańczyk 2019).
Theorem 7.2 Let (X_n, ν_n, T_n) be mildly mixing (see §15.1 for the definition) for each n > 0. If T is μ-conservative, then T is sharply weak mixing (Danilenko and Lemańczyk 2019).
Examples of rigid ergodic but not weakly mixing IDPFT transformations of Krieger type IIIλ, for each λ ∈ (0, 1), were constructed in (Danilenko and Lemańczyk 2019). Some families of 0-type IDPFT transformations of type III1 appeared in (Danilenko and Lemańczyk 2019) and of all possible Krieger types in (Danilenko and Kosloff 2022).
Theorem 7.3 Let K ∈ {IIIλ | 0 ≤ λ ≤ 1} ⊔ {II∞}. Then there is a 0-type weakly mixing IDPFT transformation of type K (Danilenko and Kosloff 2022).
ergodic conservative and not of type II1. Krengel class is never of type IIIl, 0 l < 1. In (Vaes
conjectured that the shift is of type III indeed. In and Wahl 2018), Vaes and Wahl, answering a
(Hamachi 1981b), Hamachi showed that question from (Danilenko and Lemańczyk
Krengel’s class contains ergodic conservative 2019), found a convenient condition for a non-
nonsingular Bernoulli shifts of type III. This was singular Bernoulli shift from the generalized
further refined by Kosloff who constructed type Krengel class to be conservative. Utilizing that
III1 ergodic conservative shifts belonging to condition, they constructed, for each l (0, 1),
Krengel’s class (Kosloff 2011). In (Kosloff an explicit example of power weakly mixing non-
2013), Kosloff constructed a nonsingular singular Bernoulli shift of type III1 with m1(0) ¼ l.
Bernoulli shift of type III1 (and belonging to the Krengel class) which is power weakly mixing. Weiss asked about possible Krieger’s types for the nonsingular Bernoulli shifts. Answering his question, Kosloff proved in a subsequent paper (Kosloff 2014) that each conservative Bernoulli shift from the Krengel class is ergodic and either of type II1 or of type III1. In particular, the non-type-II1 conservative Bernoulli shifts constructed in (Krengel 1970; Hamachi 1981b; Kosloff 2011) are all of type III1 indeed.

Generalized Krengel Class
Kosloff’s result from (Kosloff 2014) was further extended in (Danilenko and Lemańczyk 2019). We say that a nonsingular Bernoulli shift belongs to the generalized Krengel class if $A = \{0, 1\}$ and $\mu_n = \mu_1$ for each $n \le 0$. We note that these transformations are the natural extension of the one-sided nonsingular Bernoulli shifts defined on $(A^{\mathbb{N}}, \bigotimes_{n>0} \mu_n)$. Every shift from the generalized Krengel class is a K-automorphism.

Theorem 8.1 (On types of nonsingular Bernoulli shifts from the generalized Krengel class (Kosloff 2014; Danilenko and Lemańczyk 2019)). Let $A = \{0, 1\}$ and let T be a nonsingular Bernoulli shift on $(A^{\mathbb{Z}}, \bigotimes_{n\in\mathbb{Z}} \mu_n)$ from the generalized Krengel class.

(i) If $\sum_{n>0} (\mu_n(0) - \mu_1(0))^2 < \infty$, then $\mu$ is equivalent to $\bigotimes_{n\in\mathbb{Z}} \mu_1$ and hence T is of type II1.
(ii) If $\sum_{n>0} (\mu_n(0) - \mu_1(0))^2 = \infty$ and T is conservative, then T is ergodic of type III1. Moreover, the Maharam extension of T is a weakly mixing K-automorphism.

Thus, Krieger’s type of each nonsingular Bernoulli shift from the generalized Krengel

We note that the previously known Bernoulli shifts of type III1 were constructed via involved inductive procedures. Vaes and Wahl also provided in (Vaes and Wahl 2018) a family of type III1-examples of Bernoulli shifts that contains examples with finite ergodic index (less than 73). Analyzing that family more thoroughly, Kosloff and Soo showed that it contains type III1 nonsingular Bernoulli shifts of arbitrary ergodic index.

Theorem 8.2 Let $A = \{0, 1\}$ and $c > 0$. Let $\mu_n^c(0) := \frac{1}{2} + \frac{c}{\sqrt{n}}\, 1_{\{n\in\mathbb{N} \mid \sqrt{n} > 2c\}}$ for each $n \in \mathbb{Z}$. There exists $D > \frac{1}{6}$ such that the Bernoulli shift T on $\bigotimes_{n\in\mathbb{Z}} (A, \mu_n^c)$ is ergodic of type III1 for all $c < D$ and totally dissipative for all $c > D$. In addition, if $k \in \mathbb{N}$ and $\frac{D}{\sqrt{k+1}} < c < \frac{D}{\sqrt{k}}$ then T is of ergodic index k (Kosloff and Soo 2022).

General Nonsingular Bernoulli Shifts
The study of general nonsingular Bernoulli shifts was initiated by Kosloff in (2013).

Theorem 8.3 (Mixing of nonsingular Bernoulli shifts (Kosloff 2013)). If $\#A = 2$ and $(\mu_n)_{n\in\mathbb{Z}}$ is a sequence of probabilities on A such that (4) holds, then T is either of type II1 and mixing (with respect to the equivalent invariant probability measure) or of zero type.

In (Kosloff 2019), Kosloff noticed that under some natural conditions, conservativity of Bernoulli shifts implies ergodicity. His proof was based essentially on the Hurewicz ergodic theorem and properties of the tail equivalence relation on $A^{\mathbb{Z}}$. Danilenko (Danilenko 2019a) refined his results by exploiting the interplay between T and the measurable equivalence relation on $A^{\mathbb{Z}}$ generated by the finite permutations of coordinates.
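The quantities in Theorems 8.1 and 8.2 can be explored numerically. The following Python sketch evaluates the marginals from Theorem 8.2, accumulates the squared deviations that appear in Theorem 8.1, and reads off the ergodic index from the inequality $D/\sqrt{k+1} < c < D/\sqrt{k}$. The theorem only asserts that a threshold D exists, so the value `D_HYPOTHETICAL` below is a made-up placeholder, and all function names are illustrative assumptions.

```python
import math

# Placeholder only: Theorem 8.2 asserts existence of D, not its value.
D_HYPOTHETICAL = 1.0


def mu_n_of_zero(n: int, c: float) -> float:
    """mu_n^c(0) = 1/2 + c/sqrt(n) when n >= 1 and sqrt(n) > 2c, else 1/2."""
    if n >= 1 and math.sqrt(n) > 2 * c:
        return 0.5 + c / math.sqrt(n)
    return 0.5


def squared_deviation_sum(c: float, upper: int) -> float:
    """Partial sum of (mu_n^c(0) - 1/2)^2 over 1 <= n <= upper.

    For this family the terms behave like c^2 / n, so the partial sums
    keep growing (roughly like c^2 * log(upper)).
    """
    return sum((mu_n_of_zero(n, c) - 0.5) ** 2 for n in range(1, upper + 1))


def ergodic_index(c: float, d: float = D_HYPOTHETICAL):
    """Return k with d/sqrt(k+1) < c < d/sqrt(k), or None if no such k.

    The inequality is equivalent to k < (d/c)^2 < k + 1.
    """
    if not 0 < c < d:
        return None  # c >= d: outside the ergodic range of the theorem
    ratio = (d / c) ** 2
    k = math.floor(ratio)
    if k == ratio:
        return None  # boundary case, not covered by the strict inequalities
    return k


if __name__ == "__main__":
    print(mu_n_of_zero(100, 1.0))
    print(squared_deviation_sum(0.25, 10), squared_deviation_sum(0.25, 1000))
    print(ergodic_index(0.4))
```

With the placeholder threshold, `ergodic_index(0.4)` returns 6 because $1/\sqrt{7} < 0.4 < 1/\sqrt{6}$; a different D shifts all the bands accordingly.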
Theorem 8.4 (Weak mixing of conservative nonsingular Bernoulli shifts). Let A be finite.

(i) If $\inf_{n\in\mathbb{Z}} \min_{a\in A} \mu_n(a) > 0$ and T is conservative, then T is weakly mixing (see Kosloff 2019; Danilenko 2019a).
(ii) If $\#A = 2$ and $\inf_{n\in\mathbb{Z}} \min_{a\in A} \log\left|\frac{\mu_n(a)}{\mu_{n+1}(a)}\right| >$

each $\lambda \in (0, 1)$, an example of type IIIλ Bernoulli shift was constructed with $\mu_n \sim \mathrm{Leb}$ for each n. Examples of nonsingular Bernoulli shifts of each possible Krieger’s type were given in a later paper (Berendschot and Vaes 2022a). An alternative proof of this result appeared in a recent work (Danilenko and Kosloff 2022).
Definition 8.8 Let $V = (V_t)_{t\in\mathbb{R}}$ be a nonsingular flow on a standard probability space $(X, \nu)$. Given $n > 1$, consider two mutually commuting nonsingular flows $U = (U_t)_{t\in\mathbb{R}}$ and $D = (D_t)_{t\in\mathbb{R}}$ on the product space $(X, \nu)^n$:

• Whether each Poisson flow is an AT-flow associated with an ITPFI2-transformation.
• Whether each infinitely divisible flow is Poisson, whether each infinitely divisible AT-flow is Poisson.
such that all the entries of $M^n$ are strictly positive. Let $(X_M, T, \mu)$ be a nonsingular Markov shift, and let $\mu$ be generated by a sequence $(\pi_n, P_n)_{n\in\mathbb{Z}}$ as in §3.6. Suppose that $\mu$ is nonatomic and that $\pi_n$ is fully supported on A for each n. If $\inf\{P_n(a, b) \mid n \in \mathbb{Z},\ M(a, b) = 1\} > 0$ and T is conservative, then T is weakly mixing.

We isolate a class of nonsingular Markov shifts for which $P_n = P_1$ and $\pi_n = \pi_1$ for all $n \le 0$ and call it the Markov-Krengel class. Each shift from this class is the natural extension of the corresponding one-sided nonsingular Markov shift (Danilenko and Lemańczyk 2019). There is an analog of Theorem 8.1 for the Markov-Krengel shifts.

Theorem 8.14 (Danilenko and Lemańczyk 2019). Let $M = \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}$, $P_n$ be a bistochastic matrix for each $n \in \mathbb{Z}$, $P_k = \begin{pmatrix} 0.5 & 0.5 \\ 0.5 & 0.5 \end{pmatrix}$ and $\pi_k = (0.5, 0.5)$ for each $k \le 0$. Let the corresponding Markov-Krengel shift $(X_M, T, \mu)$ be nonsingular and conservative. Then either T is of type II1 (if $\sum_{n>0} |P_n(0, 0) - 0.5| < \infty$) or III1 (otherwise). In the latter case, the Maharam extension of T is a weakly mixing K-automorphism. Moreover, if $\mu$ is equivalent to a Bernoulli (i.e., infinite product) measure, then T is of type II1.

Concrete examples of Markov-Krengel shifts $(X_M, \mu, T)$ of type III1 such that $\mu$ is not equivalent to a Bernoulli measure were constructed in (Danilenko and Lemańczyk 2019; Kosloff 2021).

Recently, Avraham-Re’em extended and refined the aforementioned results on Markov shifts (Avraham-Re’em 2022). To state his results, we introduce some notation. Given $n \in \mathbb{Z}$ and $a, b \in A$, we let

$$\widehat{P}_n(a, b) = \begin{cases} \dfrac{\pi_{n-1}(b)}{\pi_n(a)}\, P_{n-1}(b, a) & \text{if } \pi_n(a) \ne 0, \\ 0 & \text{otherwise.} \end{cases}$$

For a stochastic $A \times A$-matrix Q, let $\lambda$ be the distribution on A such that $\lambda Q = \lambda$. We let

$$\widehat{Q}(a, b) := \frac{\lambda(b)}{\lambda(a)}\, Q(b, a).$$

Theorem 8.15 Let A be finite and let $M = (M(a, b))_{a,b\in A}$ be a primitive 0–1-valued $A \times A$-matrix. Let a measure $\mu$ on $X_M$ be generated by a sequence $(\pi_n, P_n)_{n\in\mathbb{Z}}$ as in §3.6 and $\inf\{P_n(a, b) \mid n \in \mathbb{Z},\ M(a, b) = 1\} > 0$. Let the Markov shift $(X_M, T, \mu)$ be nonsingular and conservative (Avraham-Re’em 2022).

(i) If $\lim_{n\to\infty} P_n$ does not exist, then T is of type III1.
(ii) If there exists the limit $P^+ := \lim_{n\to+\infty} P_n$ and $P^- := \lim_{n\to-\infty} P_n$, then $P^+ = P^-$.
(iii) If $A = \{0, 1\}$ and there exists the limit $Q := \lim_{n\to\infty} P_n$, then T is either of type II1 or III1. More precisely, T is of type II1 if and only if

$$\sum_{n\ge 1}\ \sum_{a, b, a', b' \in A} \left( P_n(a, b)\,\widehat{P}_n(a', b') - Q(a, b)\,\widehat{Q}(a', b') \right)^2 < \infty.$$

The corresponding equivalent invariant probability measure (if it exists) is the Markov measure defined by Q and the distribution $\lambda$ on A satisfying $\lambda Q = \lambda$.

It was also shown in (Avraham-Re’em 2022) that an analogue of Theorem 8.15(iii) holds also for the golden mean Markov shift for which $A = \{0, 1, 2\}$ and $M = \begin{pmatrix} 1 & 0 & 1 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}$.

Dynamical Properties of Nonsingular Poisson Suspensions and Nonsingular Gaussian Transformations

Nonsingular Poisson Suspensions
Let $(X, \mathcal{B}, \mu)$ be a σ-finite infinite standard measure space and let $T \in \mathrm{Aut}_2(X, \mu)$. Then the Poisson suspension $(X^*, \mu^*, T_*)$ is well defined by Theorem 3.1. The first problem to consider is to find out when $T_*$ admits an absolutely continuous invariant probability measure. A satisfactory solution of this problem is obtained in (Danilenko et al. 2022a).
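The auxiliary matrices introduced above for the statement of Theorem 8.15 are built from the marginals and the transition matrices by a simple formula, and a small computation makes their structure visible. The Python sketch below is a hedged illustration only: it assumes the compatibility relation $\pi_n = \pi_{n-1} P_{n-1}$ between consecutive marginals (the standing convention of §3.6 is not reproduced here), uses exact rational arithmetic, and all function names are hypothetical.

```python
from fractions import Fraction as F


def push_forward(pi_prev, p_prev):
    """Next marginal, assuming pi_n(b) = sum_a pi_{n-1}(a) * P_{n-1}(a, b)."""
    size = len(pi_prev)
    return [sum(pi_prev[a] * p_prev[a][b] for a in range(size)) for b in range(size)]


def reversed_matrix(pi_prev, p_prev, pi_n):
    """Entry (a, b) is pi_{n-1}(b) * P_{n-1}(b, a) / pi_n(a), or 0 if pi_n(a) = 0."""
    size = len(pi_n)
    return [
        [
            pi_prev[b] * p_prev[b][a] / pi_n[a] if pi_n[a] != 0 else F(0)
            for b in range(size)
        ]
        for a in range(size)
    ]


if __name__ == "__main__":
    p_prev = [[F(1, 2), F(1, 2)], [F(1, 4), F(3, 4)]]
    pi_prev = [F(1, 2), F(1, 2)]
    pi_n = push_forward(pi_prev, p_prev)
    p_hat = reversed_matrix(pi_prev, p_prev, pi_n)
    print(pi_n)
    print(p_hat)
    # Under the compatibility assumption every row sums to 1.
    print([sum(row) for row in p_hat])
```

Under the stated assumption each row of the resulting matrix sums to 1, so it is again stochastic; when the marginal is stationary and the chain is reversible the construction returns the original matrix.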
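Theorem 9.1 The following are equivalent:
• There exists a $T_*$-invariant probability measure $\rho \prec \mu^*$,
• $\sup_{n\in\mathbb{Z}} \left\| \sqrt{\frac{d\mu\circ T^n}{d\mu}} - 1 \right\|_2 < \infty$,

Theorem 9.4 The set
$\{T \in \mathrm{Aut}_2(X, \mu) \mid T \text{ and } T_* \text{ are both ergodic and of type III1}\}$
is a dense $G_\delta$ in $(\mathrm{Aut}_2(X, \mu), d_2)$. The set
$\{T \in \operatorname{Ker}\chi \mid T \text{ and } T_* \text{ are both ergodic and of type III1}\}$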
Theorem 9.3 If T is of 0-type and there is no T-invariant measure $\kappa \prec \mu$ such that $\sqrt{\frac{d\kappa}{d\mu}} - 1 \in L^2(\mu)$, then $T_*$ is of 0-type (Danilenko et al. 2022a).

The following theorem is proved via Baire category tools.

Theorem 9.7 Let V be an AT-flow associated with an ITPFI2-transformation. Then there is a totally dissipative transformation $T \in \mathrm{Aut}_2(X, \mu)$ such that $T_*$ is weakly mixing and the associated flow of $T_*$ is isomorphic to V (Berendschot and Vaes 2022b).
spectrum which may be regarded as an analogue of the discrete spectrum. We also include results on computation of the maximal spectral type of the “nonsingular” Koopman operator for rank-one nonsingular transformations.

$L^\infty$-spectrum and Groups of Quasi-invariance
Let T be an ergodic nonsingular transformation of $(X, \mathcal{B}, \mu)$. A number $\lambda \in \mathbb{T}$ belongs to the $L^\infty$-spectrum e(T) of T if there is a function $f \in L^\infty(X, \mu)$ with $f \circ T = \lambda f$. Such an f is called an $L^\infty$-eigenfunction of T corresponding to $\lambda$. Denote by E(T) the group of all $L^\infty$-eigenfunctions of absolute value 1. It is a Polish group when endowed with the topology of convergence in measure. If T is of type II1, then the $L^\infty$-eigenfunctions are $L^2(\mu_0)$-eigenfunctions of T, where $\mu_0$ is an equivalent invariant probability measure. Hence e(T) is countable. Osikawa constructed in (Osikawa 1977) the first examples of ergodic nonsingular transformations with uncountable e(T).

We state now a nonsingular version of the von Neumann-Halmos discrete spectrum theorem. Let $Q \subset \mathbb{T}$ be a countable infinite subgroup. Let K be a compact dual of $Q_d$, where $Q_d$ denotes Q with the discrete topology. Let $k_0 \in K$ be the element defined by $k_0(q) = q$ for all $q \in Q$. Let $R : K \to K$ be defined by $Rk = k + k_0$. The system (K, R) is called a compact group rotation. The following theorem was proved in (Aaronson and Nadkarni 1987).

Theorem 10.1 Assume that the $L^\infty$-eigenfunctions of T generate the entire σ-algebra $\mathcal{B}$. Then T is isomorphic to a compact group rotation equipped with an ergodic quasi-invariant measure.

$\psi_\lambda \circ T = \lambda\psi_\lambda$ for each $\lambda$. Moreover, e(T) is of Lebesgue measure 0 and it can have an arbitrary Hausdorff dimension (Aaronson 1997, 1983; Moore and Schmidt 1980).

A proper Borel subgroup E of $\mathbb{T}$ is called

(i) Weak Dirichlet if $\limsup_{n\to\infty} \widehat{\lambda}(n) = 1$ for each finite complex measure $\lambda$ supported on E
(ii) Saturated if $\limsup_{n\to\infty} |\widehat{\lambda}(n)| \ge |\lambda(E)|$ for each finite complex measure $\lambda$ on $\mathbb{T}$,

where $\widehat{\lambda}(n)$ denotes the n-th Fourier coefficient of $\lambda$. Every countable subgroup of $\mathbb{T}$ is saturated.

Theorem 10.3 e(T) is σ-compact in the usual topology on $\mathbb{T}$ (Host et al. 1991) and saturated (Méla 1983; Host et al. 1991).

It follows that e(T) is weak Dirichlet (this fact was established earlier in (Schmidt 1982)).

It is not known if every Polish group continuously embedded in $\mathbb{T}$ as a σ-compact saturated group is the eigenvalue group of some ergodic nonsingular transformation. This is the case for the so-called $H_2$-groups and the groups of quasi-invariance of measures on $\mathbb{T}$ (see below). Given a sequence $n_j$ of positive integers and a sequence $a_j \ge 0$, the set of all $z \in \mathbb{T}$ such that $\sum_{j=1}^{\infty} a_j |1 - z^{n_j}|^2 < \infty$ is a group. It is called an $H_2$-group. Every $H_2$-group is Polish in an intrinsic topology stronger than the usual circle topology.

Theorem 10.4
(i) Every $H_2$-group is a saturated (and hence weak Dirichlet) σ-compact subset of $\mathbb{T}$ (Host et al. 1991).
(ii) If $\sum_{j=0}^{\infty} a_j = +\infty$, then the corresponding
subset L is disjoint from e(T), then there is an $H_2$-group containing e(T) and disjoint from L.

Example 10.5 ((Aaronson and Nadkarni 1987), see also (Osikawa 1977)). Let $(X, \mu, T)$ be the nonsingular product odometer associated to a sequence $(2, \nu_j)_{j=1}^{\infty}$. Let $n_j$ be a sequence of positive integers such that $n_j > \sum_{i<j} n_i$ for all j. For $x \in X$, we put $h(x) := n_{l(x)} - \sum_{j<l(x)} n_j$. Then h is a Borel map from X to the positive integers. Let S be the tower over T with height function h (see §3.3). Then e(S) is the $H_2$-group of all $z \in \mathbb{T}$ with

$$\sum_{j=1}^{\infty} \nu_j(0)\,\nu_j(1)\,|1 - z^{n_j}|^2 < \infty.$$

It was later shown in (Host et al. 1991) that if $\sum_{j=1}^{\infty} \nu_j(0)\,\nu_j(1)\,(n_j/n_{j+1})^2 < \infty$, then the $L^\infty$-eigenfunctions of S generate the entire σ-algebra, i.e., S is isomorphic (measure theoretically) to a nonsingular compact group rotation.

Let $\mu$ be a finite measure on $\mathbb{T}$. Let $H(\mu) := \{z \in \mathbb{T} \mid \delta_z * \mu \sim \mu\}$, where $*$ means the convolution of measures. Then $H(\mu)$ is a group called the group of quasi-invariance of $\mu$. It has a Polish topology whose Borel sets agree with the Borel sets which $H(\mu)$ inherits from $\mathbb{T}$, and the injection map of $H(\mu)$ into $\mathbb{T}$ is continuous. This topology is induced by the weak operator topology on the unitary group in the Hilbert space $L^2(\mathbb{T}, \mu)$ via the map $H(\mu) \ni z \mapsto U_z$, $(U_z f)(x) = (d(\delta_z * \mu)/d\mu)(x)\, f(xz)$ for $f \in L^2(\mathbb{T}, \mu)$. Moreover, $H(\mu)$ is saturated (Host et al. 1991). If $\mu(H(\mu)) > 0$, then either $H(\mu)$ is countable or $\mu$ is equivalent to $\lambda_{\mathbb{T}}$ (Mandrekar and Nadkarni 1969).

Theorem 10.6 Let $\mu$ be ergodic with respect to the $H(\mu)$-action by translations on $\mathbb{T}$. Then there is a compact group rotation (K, R) and a finite measure on K quasi-invariant and ergodic under R such that $e(R) = H(\mu)$. Moreover, there is a continuous one-to-one homomorphism $\psi : e(R) \to E(R)$ such that $\psi_\lambda \circ R = \lambda\psi_\lambda$ for all $\lambda \in e(R)$ (Aaronson and Nadkarni 1987).

It was shown by Aaronson and Nadkarni (1987) that if $n_1 = 1$ and $n_j = a_j a_{j-1} \cdots a_1$ for positive integers $a_j \ge 2$ with $\sum_{j=1}^{\infty} a_j^{-1} < \infty$, then the transformation S from Example 10.5 does not admit a continuous homomorphism $\psi : e(S) \to E(S)$ with $\psi_\lambda \circ T = \lambda\psi_\lambda$ for all $\lambda \in e(S)$. Hence $e(S) \ne H(\mu)$ for any measure $\mu$ satisfying the conditions of Theorem 10.6.

Assume that T is an ergodic nonsingular compact group rotation. Let $\mathcal{B}_0$ be the σ-algebra generated by a subcollection of eigenfunctions. Then $\mathcal{B}_0$ is invariant under T and hence a factor (see §12) of T. It is not known if every factor of T is of this form. It is not even known whether every factor of T must have nontrivial eigenvalues.

Koopman Unitary Operator for a Nonsingular System
Let $(X, \mathcal{B}, \mu, T)$ be a nonsingular dynamical system. In this subsection, we consider spectral properties of the Koopman operator $U_T$ defined by (3). First, we note that the spectrum of T is the entire circle $\mathbb{T}$ (Nadkarni 1979). Next, if $U_T$ has an eigenvector, then T is of type II1. Indeed, if there are $\lambda \in \mathbb{T}$ and $0 \ne f \in L^2(X, \mu)$ with $U_T f = \lambda f$, then the measure $\nu$, $d\nu(x) := |f(x)|^2 d\mu(x)$, is finite, T-invariant, and equivalent to $\mu$. Hence if T is of type III or II∞, then the maximal spectral type $\sigma_T$ of $U_T$ is continuous. Another “restriction” on $\sigma_T$ was found in (Roy 2009): No Foïaş-Strătilă measure is absolutely continuous with respect to $\sigma_T$ if T is of type II∞. We recall that a symmetric measure $\sigma$ on $\mathbb{T}$ possesses the Foïaş-Strătilă property if for each ergodic probability-preserving system $(Y, \nu, S)$ and $f \in L^2(Y, \nu)$, if $\sigma$ is the spectral measure of f, then f is a Gaussian random variable (Lemańczyk et al. 2000). For instance, measures supported on Kronecker sets possess this property.

As we have noted in §6, mixing (0-type) is an $L^2$-spectral property for nonsingular transformations. Also, if T is infinite measure-preserving, then T is mixing if and only if $n^{-1} \sum_{i=0}^{n-1} U_T^{k_i} \to 0$ in the strong operator topology for each strictly increasing sequence $k_1 < k_2 < \cdots$ (Krengel and Sucheston 1969). This generalizes a well-known theorem of Blum and Hanson for probability-preserving maps. For comparison, we note that ergodicity is not an $L^2$-spectral property of infinite measure-preserving systems.
representation W of $\mathrm{Aff}_{\mathbb{R}}(L^2(\mu))$ is well defined in $F(L^2(\mu))$ via the formula

$$W(f, V)\,\mathcal{E}(h) := e^{-\frac{1}{2}\|f\|^2 - \langle f, Vh\rangle}\, \mathcal{E}(f + Vh), \quad f \in L^2(\mu).$$

Given $T \in \mathrm{Aut}_2(X, \mu)$, we have that

$$\left( \sqrt{\frac{d\mu\circ T}{d\mu}} - 1,\ U_T \right) \in \mathrm{Aff}_{\mathbb{R}}(L^2(\mu)).$$

Theorem 10.8 (Koopman operator associated with $T_*$ (Danilenko et al. 2022a)). If $T \in \mathrm{Aut}_2(X, \mu)$, then under the canonical identification of $L^2(\mu^*)$ and $F(L^2(\mu))$,

$$U_{T_*} = W\left( \sqrt{\frac{d\mu\circ T}{d\mu}} - 1,\ U_T \right).$$

Koopman Unitary Operators Associated with Nonsingular Gaussian Transformations
Let $\mathcal{H}$ be a separable infinite dimensional real Hilbert space. Denote by $(X, \mu)$ the probability space where the nonsingular Gaussian transformations $G_{h,O}$ for all $(h, O) \in \mathrm{Aff}\,\mathcal{H}$ are defined (see §3.8). It is well known that there is a canonical isometry between $L^2(X, \mu)$ and the symmetric Fock space $F(\mathcal{H})$.

Theorem 10.9 (Koopman operator associated with $G_{h,O}$ (Danilenko and Lemańczyk 2022)). If $(h, O) \in \mathrm{Aff}\,\mathcal{H}$, then under the canonical identification of $L^2(X, \mu)$ and $F(\mathcal{H})$,

$$U_{G_{h,O}} = W\left( \tfrac{1}{2} h,\ O \right).$$

It follows from Theorems 10.8 and 10.9 that each nonsingular Poisson transformation $T_*$ is spectrally equivalent to the nonsingular Gaussian transformation $G_{h,O}$ with $h = 2\left(\sqrt{\frac{d\mu\circ T}{d\mu}} - 1\right)$ and $O = U_T$.

$\sum_{P\in\mathcal{P}} \mu(P) \log \mu(P)$. In the study of measure-preserving systems, the classical (Kolmogorov-Sinai) entropy proved to be a very useful invariant for isomorphism (Cornfeld et al. 1982). The key fact of the theory is that if $\mu \circ T = \mu$ then the limit $\lim_{n\to\infty} n^{-1} H\left(\bigvee_{i=1}^{n} T^{-i}\mathcal{P}\right)$ exists for every $\mathcal{P}$. However, if T does not preserve $\mu$, the limit may no longer exist. Some efforts have been made to extend the use of entropy and similar invariants to the nonsingular domain. These include Krengel’s entropy of conservative measure-preserving maps and its extension to nonsingular maps, Parry’s entropy and Parry’s nonsingular version of the Shannon-McMillan-Breiman theorem, Poisson entropy, critical dimension by Mortiss and Dooley, etc. Unfortunately, these invariants are less informative than their classical counterparts and they are more difficult to compute.

Krengel and Parry’s Entropies
Let S be a conservative measure-preserving transformation of a σ-finite measure space $(Y, \mathcal{E}, \nu)$. The Krengel entropy (Krengel 1967) of S is defined by

$$h_{Kr}(S) = \sup\{\nu(E)\, h(S_E) \mid 0 < \nu(E) < +\infty\},$$

where $h(S_E)$ is the Kolmogorov-Sinai entropy of $S_E$. It follows from Abramov’s formula for the entropy of induced transformation that $h_{Kr}(S) = \nu(E)\, h(S_E)$ whenever E sweeps out, i.e., $\bigcup_{i\ge 0} S^i E = Y$. A generic transformation from $\mathrm{Aut}_0(X, \mu)$ has entropy 0. Krengel raised a question in (Krengel 1967): Do there exist a zero entropy infinite measure-preserving S and a zero entropy finite measure-preserving R such that $h_{Kr}(S \times R) > 0$? This problem was solved in (Danilenko and Rudolph 2009) (a special case was announced by Silva and Thieullen in an October 1995 AMS conference (unpublished)):
with another transformation R such that $\nu \circ R = c\nu$ for a constant $c \ne 1$, then $h_{Kr}(S)$ is either 0 or $\infty$ (Silva and Thieullen 1995).

Now let T be a type III ergodic transformation of $(X, \mathcal{B}, \mu)$. Silva and Thieullen define an entropy $h^*(T)$ of T by setting $h^*(T) := h_{Kr}(\widetilde{T})$, where $\widetilde{T}$ is the Maharam extension of T (see §5.2). Since $\widetilde{T}$ commutes with transformations which “multiply” the $\widetilde{T}$-invariant measure, it follows that $h^*(T)$ is either 0 or $\infty$.

Let T be the standard IIIλ-odometer from Example 5.1 (i). Then $h^*(T) = 0$. The same is true for a so-called ternary product odometer associated with the sequence $(3, \nu_n)_{n=1}^{\infty}$, where $\nu_n(0) = \nu_n(2) = \lambda/(1 + 2\lambda)$ and $\nu_n(1) = 1/(1 + 2\lambda)$ (Silva and Thieullen 1995). It is not known however whether every ergodic nonsingular product odometer has zero entropy. On the other hand, it was shown in (Silva and Thieullen 1995) that $h^*(T) = \infty$ for every K-automorphism.

The Parry entropy (Parry 1969) of S is defined by

$$h_{Pa}(S) := \sup\{H(S^{-1}\mathcal{F} \mid \mathcal{F}) \mid \mathcal{F} \text{ is a σ-finite subalgebra of } \mathcal{B} \text{ such that } \mathcal{F} \subset S^{-1}\mathcal{F}\}.$$

Parry showed (1969) that $h_{Pa}(S) \le h_{Kr}(S)$. It is still an open question whether the two entropies coincide. This is the case when S is of rank one (since $h_{Kr}(S) = 0$) and when S is quasi-finite (Parry 1969). The transformation S is called quasi-finite if there exists a subset of finite measure $A \subset Y$ such that the first return time partition $(A_n)_{n>0}$ of A has finite entropy. We recall that $x \in A_n$ if and only if n is the smallest positive integer such that $S^n x \in A$. An example of non-quasi-finite ergodic infinite

LLB. This result was extended in (Janvresse et al. 2010) in the following way.

Theorem 11.1 Let T be an ergodic quasi-finite transformation. Then either there is the Krengel-Pinsker factor of T which is also the Parry-Pinsker and the Poisson-Pinsker (see the next subsection below) factor of T or T is remotely infinite, i.e., there exists a sub-σ-algebra $\mathcal{F} \subset \mathcal{B}$ such that $T^{-1}\mathcal{F} \supset \mathcal{F}$, $\bigvee_{n>0} T^{-n}\mathcal{F} = \mathcal{B}$ and the subalgebra $\bigwedge_{n>0} T^{n}\mathcal{F}$ does not contain subsets of positive finite measure.

Poisson Entropy
Poisson entropy for infinite measure-preserving transformations was introduced in (Roy 2005). Let $(X, \mu)$ be an infinite σ-finite space, and let T be a $\mu$-preserving invertible transformation of X. The Poisson suspension $T_*$ of T is well defined on a probability space $(X^*, \mu^*)$ and $\mu^* \circ T_* = \mu^*$ (see §3.7). It is ergodic if and only if T has no invariant sets of finite positive measure. It follows from Theorem 10.8 that $U_{T_*}$ is the “exponent” of $U_T$. Hence, the maximal spectral type of $U_{T_*}$ is $\sum_{n\ge 0} (n!)^{-1} (\sigma_T)^{*n}$, where $\sigma_T$ is a measure of the maximal spectral type of $U_T$.

Now the Poisson entropy $h_{Po}(T)$ of T is $h(T_*)$. The main question is whether $h_{Po}(T)$ coincides with $h_{Pa}(T)$ or $h_{Kr}(T)$. It was shown in (Janvresse et al. 2010) that $h_{Pa}(T) \le h_{Po}(T)$. If T is quasifinite or rank one, then the three entropies of T coincide (Janvresse et al. 2010). If T is the infinite Markov shift associated with a pair $(P, \pi)$ for recurrent and irreducible P (see §3.6), then

$$h_{Kr}(T) = h_{Pa}(T) = h_{Po}(T) = -\sum_{a\in A} \pi(a) \sum_{b\in A} P(a, b) \log P(a, b).$$
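entropies coincide on $T \restriction \mathcal{F}$ for some $\mathcal{F}$, then $h_{Kr}(T) = h_{Po}(T)$. On the other hand, Janvresse and de la Rue constructed an ergodic conservative infinite measure-preserving transformation T such that $h_{Kr}(T) = 0$ but $h_{Po}(T) > 0$ (Janvresse and de la Rue 2012).

Definition 11.2 An ergodic measure-preserving transformation T of a σ-finite measure space $(X, \mathcal{B}, \mu)$ is said to have totally positive Poisson entropy if for each σ-finite T-invariant sub-σ-algebra $\mathcal{F} \subset \mathcal{B}$, the Poisson entropy of the system $(X, \mathcal{F}, \mu \restriction \mathcal{F}, T)$ is strictly positive.

We note that the Poisson suspension of the system $(X, \mathcal{F}, \mu \restriction \mathcal{F}, T)$ from the above definition is canonically a factor of $(X^*, \mathcal{B}^*, \mu^*, T_*)$. Such factors of $T_*$ are called Poissonian. Roy showed in (Roy 2010) that if T has totally positive Poisson entropy, then T is of zero type.

contains x. We put $\omega_{-1} = 0$. Parry shows in (Parry 1963) that

$$\frac{\sum_{j=0}^{n} \log \mu\left(C_{n-j}(T^j x)\right) \left(\omega_j(x) - \omega_{j-1}(x)\right)}{\sum_{i=0}^{n} \omega_i(x)} \to -H\left(\mathcal{P} \,\Big|\, \bigvee_{i=1}^{\infty} T^{-i}\mathcal{P}\right) - \int_X \log E\left(\frac{d\mu\circ T}{d\mu} \,\Big|\, \bigvee_{i=0}^{\infty} T^{-i}\mathcal{P}\right) d\mu$$

for a.a. x. Parry also shows that under the aforementioned conditions on T,

$$\frac{1}{n} \left( \sum_{j=0}^{n} H\left(\bigvee_{i=0}^{j} T^{-i}\mathcal{P}\right) - \sum_{j=0}^{n-1} H\left(\bigvee_{i=1}^{j+1} T^{-i}\mathcal{P}\right) \right) \to H\left(\mathcal{P} \,\Big|\, \bigvee_{i=1}^{\infty} T^{-i}\mathcal{P}\right).$$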
invariants for isomorphism of nonsingular systems. Notice also that

$$\alpha(T) = \liminf_{n\to\infty} \frac{\log\left(\sum_{i=1}^{n} \omega_i(x)\right)}{\log n} \quad \text{and} \quad \beta(T) = \limsup_{n\to\infty} \frac{\log\left(\sum_{i=1}^{n} \omega_i(x)\right)}{\log n}.$$

Moreover, $0 \le \alpha(T) \le \beta(T) \le 1$. If T is of type II1, then $\alpha(T) = \beta(T) = 1$. If T is the standard IIIλ-odometer from Example 5.1, then $\alpha(T) = \beta(T) = \log(1 + \lambda) - \frac{\lambda}{1+\lambda} \log \lambda$.

Theorem 11.5 (i) For every $\lambda \in [0, 1]$ and every $c \in [0, 1]$, there exists a nonsingular product odometer of type IIIλ with critical dimension equal to c (Mortiss 2002).
(ii) For every $c \in [0, 1]$, there exists a nonsingular product odometer of type II∞ with critical dimension equal to c (Dooley and Mortiss 2009).

Let T be the nonsingular product odometer associated with a sequence $(m_n, \nu_n)_{n=1}^{\infty}$. Let $s(n) = m_1 \cdots m_n$ and let $H(\mathcal{P}_n)$ denote the entropy of the partition of the first n coordinates with respect to $\mu$. We now state a nonsingular version of the Shannon-McMillan-Breiman theorem for T from (Dooley and Mortiss 2009).

Theorem 11.6 Let $m_i$ be bounded from above. Then

(i) $\alpha(T) = \liminf_{n\to\infty} \left( -\frac{\sum_{i=1}^{n} \log \nu_i(x_i)}{\log s(n)} \right) = \liminf_{n\to\infty} \frac{H(\mathcal{P}_n)}{\log s(n)}$ and
(ii) $\beta(T) = \limsup_{n\to\infty} \left( -\frac{\sum_{i=1}^{n} \log \nu_i(x_i)}{\log s(n)} \right) = \limsup_{n\to\infty} \frac{H(\mathcal{P}_n)}{\log s(n)}$

for a.a. $x = (x_i)_{i\ge 1} \in X$.

It follows that in the case when $\alpha(T) = \beta(T)$, the critical dimension coincides with $\lim_{n\to\infty} \frac{H(\mathcal{P}_n)}{\log s(n)}$. In (Mortiss 2002), this expression (when it exists) was called AC-entropy (average coordinate). It also follows from Theorem 11.6 that if T is a product odometer of bounded type, then $\alpha(T^{-1}) = \alpha(T)$ and $\beta(T^{-1}) = \beta(T)$. In (Dooley and Mortiss 2006), Theorem 11.6 was extended to a subclass of Markov odometers. Those results were further extended to so-called G-measures on product spaces (Mansfield and Dooley 2017) and a class of Bratteli-Vershik systems with multiple edges (Dooley and Hagihara 2012). The critical dimensions for nonsingular Bernoulli shifts (see §3.5) were investigated in (Dooley and Mortiss 2007):

Theorem 11.7 For any $\epsilon > 0$, there exists a nonsingular Bernoulli shift S from the Krengel class with $\alpha(S) < \epsilon$ and $\beta(S) > 1 - \epsilon$.

Nonsingular Restricted Orbit Equivalence
In (Mortiss 2000), Mortiss initiated the study of a nonsingular version of Rudolph’s restricted orbit equivalence (Rudolph 1985). This work is still in its early stages and does not yet deal with any form of entropy. However, she introduced nonsingular orderings of orbits, defined sizes, and showed that much of the basic machinery still works in the nonsingular setting.

Nonsingular Joinings and Factors

The theory of joinings is a powerful tool to study probability-preserving systems and to construct striking counterexamples. It is interesting to study what part of this machinery can be extended to the nonsingular case. However, there are some principal obstacles for such extensions:

• There are too many quasi-invariant measures in view of the Glimm-Effros theorem (see Theorem 2.12).
• Ergodic components of a nonergodic joining need not be joinings of the original systems.

There are several ways to bypass these obstacles. The principal idea is to select always an appropriate (rather narrow) class of quasiinvariant measures under consideration or impose some
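The closed formula for the critical dimension of the standard IIIλ-odometer can be cross-checked against the ratio $H(\mathcal{P}_n)/\log s(n)$ from Theorem 11.6. The Python sketch below does this under two assumptions that are not spelled out in this section: that the odometer of Example 5.1 has $m_n = 2$ with $\nu_n(0) = 1/(1+\lambda)$ and $\nu_n(1) = \lambda/(1+\lambda)$, and that the logarithm in the closed formula is taken to base 2 (the ratio itself does not depend on the base). Function names are illustrative.

```python
import math


def critical_dimension_formula(lam: float) -> float:
    """log2(1 + lam) - lam / (1 + lam) * log2(lam) for 0 < lam <= 1 (base 2 assumed)."""
    return math.log2(1 + lam) - lam / (1 + lam) * math.log2(lam)


def ac_entropy_ratio(lam: float, n: int) -> float:
    """H(P_n) / log s(n) for the assumed odometer with m_i = 2.

    The measure is a product, so H(P_n) is n times the entropy of one
    coordinate, and s(n) = 2 ** n gives log s(n) = n * log 2.
    """
    weights = [1 / (1 + lam), lam / (1 + lam)]
    one_coordinate = -sum(w * math.log(w) for w in weights)
    return (n * one_coordinate) / (n * math.log(2))


if __name__ == "__main__":
    for lam in (0.1, 0.5, 1.0):
        print(lam, critical_dimension_formula(lam), ac_entropy_ratio(lam, 10))
```

Under these assumptions both expressions agree for every n, and at λ = 1 they equal 1, matching the value of the critical dimension in the measure-preserving (type II1) case.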
Clearly, product measure, graph-joinings, and the relative products are all rational joinings. Moreover, a rational joining of finite measure-preserving systems is measure-preserving and a rational joining of type II1’s is of type II1 (Rudolph and Silva 1989). Thus we obtain the finite measure-preserving theory as a special case. As for the definition of MSJ, it depends on a class M of equivalent measures. In the finite measure-preserving case, $M = \{\mu\}$. However, in the nonsingular case no particular measure is distinguished. We note also that Definition 12.2 (ii) involves some restrictions on all rational joinings and not only ergodic ones as in the finite measure-preserving case. The reason is that an ergodic component of a nonsingular joining need not be a joining of measures equivalent to the original ones (Aaronson 1987). For finite measure-preserving transformations, MSJ over $\{\mu\}$ is the same as the usual twofold MSJ (del Junco and Rudolph 1987).

A nonsingular transformation T on $(X, \mathcal{B}, \mu)$ is called prime if its only factors are $\mathcal{B}$ and $\{X, \emptyset\}$ mod $\mu$. A (nonempty) class M of probability measures equivalent to $\mu$ is said to be centralizer stable if for each $S \in C(T)$ and $\mu_1 \in M$, the measure $\mu_1 \circ S$ is in M.

Theorem 12.3 Let $(X, \mathcal{B}, \mu, T)$ be an ergodic nonatomic dynamical system such that T has MSJ over a class M that is centralizer stable. Then T is prime and the centralizer of T consists of the powers of T (Rudolph and Silva 1989).

A question that arises is whether such a nonsingular dynamical system (not of type II1) exists. Expanding on Ornstein’s original construction from (Ornstein 1972), Rudolph and Silva construct in (Rudolph and Silva 1989), for each $0 \le \lambda \le 1$, a nonsingular rank-one transformation $T_\lambda$ that is of type IIIλ and that has MSJ over a class M that is centralizer stable. Type II∞ examples with analogous properties were also constructed there. In this connection, it is worth mentioning the example by Aaronson and Nadkarni (Aaronson and Nadkarni 1987) of II∞ ergodic transformations that have no factor algebras on which the invariant measure is σ-finite (except for the entire ones); however, these transformations are not prime.

A more general notion than MSJ, called graph self-joinings (GSJ), was introduced (Silva and Witte 1992): Just replace the words “on $T^j$ for some $j \in \mathbb{Z}$” in Definition 12.2(ii) with “on S for some element $S \in C(T)$.” For finite measure-preserving transformations, GSJ over $\{\mu\}$ is the same as the usual twofold simplicity (del Junco and Rudolph 1987). The famous Veech theorem on factors of twofold simple maps (see del Junco and Rudolph 1987) was extended to nonsingular systems in (Silva and Witte 1992) as follows: If a system $(X, \mathcal{B}, \mu, T)$ has GSJ, then for every nontrivial factor $\mathcal{A}$ of T there exists a locally compact subgroup H in C(T) (equipped with the weak topology) which acts smoothly (i.e., the partition into H-orbits is measurable) and such that $\mathcal{A} = \{B \in \mathcal{B} \mid \mu(hB \,\triangle\, B) = 0 \text{ for all } h \in H\}$. It follows that there is a cocycle $\varphi$ from $(X, \mathcal{A}, \mu \restriction \mathcal{A})$ to H such that T is isomorphic to the $\varphi$-skew product extension $(T \restriction \mathcal{A})_\varphi$ (see §6.4). Of course, the ergodic nonsingular product odometers and, more generally, ergodic nonsingular compact group rotations (see §10.1) have GSJ. However, except for this trivial case (the Cartesian square is nonergodic) plus the systems with MSJ from (Rudolph and Silva 1989), no examples of type III systems with GSJ are known. In particular, no smooth examples have been constructed so far. This is in sharp contrast with the finite measure-preserving case where an abundance of simple (or close to simple) systems is known (see del Junco and Rudolph 1987; Thouvenot 1995; Danilenko 2007).

Nonsingular Coding and Factors of Cartesian Products of Nonsingular Maps
As we have already noticed above, the nonsingular MSJ theory was developed in (Rudolph and Silva 1989) only for twofold self-joinings. The reasons for this were technical problems with extending the notion of rational joinings from twofold to n-fold self-joinings. However, while the twofold nonsingular MSJ or GSJ properties of T are sufficient to control the centralizer and the factors of T, it is not clear whether it
implies anything about the factors or centralizer of $T \times T$. Indeed, to control them one needs to know the fourfold joinings of T. However, even in the finite measure-preserving case, it is a long-standing open question whether twofold MSJ implies n-fold MSJ. That is why del Junco and Silva (2003) apply alternative techniques, namely nonsingular coding, to classify the factors of Cartesian products of nonsingular Chacón maps. The techniques were originally used in (del Junco 1978) to show that the classical Chacón map is prime and has trivial centralizer. They were extended to nonsingular systems in (del Junco and Silva 1995).

For each $0 < \lambda < 1$, we denote by $T_\lambda$ the Chacón map (see §3.4) corresponding to the sequence of probability vectors $\omega_n = (\lambda/(1 + 2\lambda),\ 1/(1 + 2\lambda),\ \lambda/(1 + 2\lambda))$ for all $n > 0$. One can verify that the maps $T_\lambda$ are of type IIIλ. (The classical Chacón map corresponds to $\lambda = 1$.) All of these transformations are defined on the same standard Borel space $(X, \mathcal{B})$. These transformations were shown to be power weakly mixing in (Adams et al. 2001). The centralizer of any finite Cartesian product of nonsingular Chacón maps is computed in the following theorem.

Theorem 12.4 Let $0 < \lambda_1 < \ldots < \lambda_k \le 1$ and $n_1, \ldots, n_k$ be positive integers. Then the centralizer of the Cartesian product $T_{\lambda_1}^{\times n_1} \times \ldots \times T_{\lambda_k}^{\times n_k}$ is generated by maps of the form $U_1 \times \ldots \times U_k$, where each $U_i$, acting on the $n_i$-dimensional product space $X^{n_i}$, is a Cartesian product of powers of $T_{\lambda_i}$ or a coordinate permutation on $X^{n_i}$ (del Junco and Silva 2003).

Let $\pi$ denote the permutation on $X \times X$ defined by $\pi(x, y) = (y, x)$, and let $\mathcal{B}^{\odot 2}$ denote the symmetric factor, i.e., $\mathcal{B}^{\odot 2} = \{A \in \mathcal{B} \otimes \mathcal{B} \mid \pi(A) = A\}$. The following theorem classifies the factors of the Cartesian product of any two nonsingular type IIIλ, $0 < \lambda < 1$, or type II1 Chacón maps.

Theorem 12.5 Let $T_{\lambda_1}$ and $T_{\lambda_2}$ be two nonsingular Chacón systems. Let $\mathcal{F}$ be a factor algebra of $T_{\lambda_1} \times T_{\lambda_2}$ (del Junco and Silva 2003).

(i) If $\lambda_1 \ne \lambda_2$, then $\mathcal{F}$ is equal mod 0 to one of the four algebras $\mathcal{B} \otimes \mathcal{B}$, $\mathcal{B} \otimes \mathcal{N}$, $\mathcal{N} \otimes \mathcal{B}$, or $\mathcal{N} \otimes \mathcal{N}$, where $\mathcal{N} = \{\emptyset, X\}$.
(ii) If $\lambda_1 = \lambda_2$, then $\mathcal{F}$ is equal mod 0 to one of the following algebras $\mathcal{B} \otimes \mathcal{B}$, $\mathcal{B} \otimes \mathcal{N}$, $\mathcal{N} \otimes \mathcal{B}$, $\mathcal{N} \otimes \mathcal{N}$, or $(T^m \times \mathrm{Id})\mathcal{B}^{\odot 2}$ for some integer m.

It is not hard to obtain type III1 examples of Chacón maps for which the previous two theorems hold. However, the construction of type II∞ and type III0 nonsingular Chacón transformations is more subtle as it needs the choice of $\omega_n$ to vary with n. In (Hamachi and Silva 2000), Hamachi and Silva construct type III0 and type II∞ examples; however, the only property proved for these maps is ergodicity of their Cartesian square. More recently, Danilenko (2004) has shown that all of them (in fact, a wider class of nonsingular Chacón maps of all types) are power weakly mixing.

In (Choksi et al. 1989), Choksi, Eigen, and Prasad asked whether there exists a zero entropy, finite measure-preserving mixing automorphism S, and a nonsingular type III automorphism T, such that $T \times S$ has no Bernoulli factors. Theorem 12.5 provides a partial answer to this question (with S only mildly mixing instead of mixing): If S is the finite measure-preserving Chacón map and T is a nonsingular Chacón map as above, the factors of $T \times S$ are only the trivial ones, so $T \times S$ has no Bernoulli factors.

Joinings and MSJ for Infinite Measure-Preserving Systems
Adams, Friedman, and Silva introduced in (Adams et al. 1997) an infinite version of the Chacón map T as a rank-one transformation associated with $(r_n, \omega_n, s_n)_{n=1}^{\infty}$ such that $r_n = 3$, $\omega_n(0) = \omega_n(1) = \omega_n(2)$, $s_n(0) = 0$, $s_n(1) = 1$ and $s_n(2) = 3h_n + 1$ for each $n > 0$. This is called the infinite Chacón transformation. Let $(X, \mu)$ be the space of T. Of course, $\mu(X) = \infty$. This transformation has infinite ergodic index (Adams et al. 1997), is not power weakly mixing and not multiply recurrent (Gruher et al. 2003), and has trivial centralizer (Janvresse et al. 2018). For each $d > 0$, Janvresse,
The limit exists and does not depend on the choice of x and f. It is obvious that T is nonsingular with respect to Lebesgue measure λ_𝕋. Moreover, if T ∈ Diff^r_+(𝕋) and ρ(T) is irrational, then the dynamical system (𝕋, λ_𝕋, T) is ergodic (Cornfeld et al. 1982). It is interesting to ask: which Krieger’s type can such systems have?
Katznelson showed in (Katznelson 1977) that the subset of type III C^∞-diffeomorphisms and the subset of type II_∞ C^∞-diffeomorphisms are dense in Diff^∞_+(𝕋). Hawkins and Schmidt refined the idea of Katznelson from (Katznelson 1977) to construct, for every irrational number α ∈ [0, 1) which is not of constant type (i.e., in whose continued fraction expansion the denominators are not bounded) a transformation T ∈ Diff^2_+(𝕋) which is of type III_1 and ρ(T) = α (Hawkins and Schmidt 1982). It should be mentioned that class C^2 in the construction is essential, since it follows from a remarkable result of Herman that if T ∈ Diff^3_+(𝕋), then under some condition on α (which determines a set of full Lebesgue measure), T is measure theoretically (and topologically) conjugate to a rotation by ρ(T) (Herman 1979b). Hence T is of type II_1.
In (Hawkins 1983), Hawkins shows that every smooth paracompact manifold of dimension ≥ 3 admits a type III_λ diffeomorphism for every λ ∈ [0, 1]. This extends a result of Herman (Herman 1979a) on the existence of type III_1 diffeomorphisms in the same circumstances.
It is also of interest to ask: Which free ergodic flows are associated with smooth dynamical systems of type III_0? Hawkins proved that any free ergodic C^∞-flow on a smooth, connected, paracompact manifold is the associated flow for a C^∞-diffeomorphism on another manifold (of higher dimension) (Hawkins 1990a).
A nice result was obtained in (Katznelson 1979; Hawkins and Woods 1984). If T ∈ Diff^2_+(𝕋) and the rotation number of T has unbounded continued fraction coefficients, then (𝕋, λ_𝕋, T) is ITPFI. Moreover, a converse also holds: Given a nonsingular product odometer R, the set of orientation-preserving C^∞-diffeomorphisms of the circle which are orbit equivalent to R is C^∞-dense in the Polish set of all C^∞-orientation-preserving diffeomorphisms with irrational rotation numbers. In contrast to that, Hawkins constructs in (Hawkins 1982) a type III_0 C^∞-diffeomorphism of the 4-dimensional torus which is not ITPFI.
Examples of n-to-1 conservative ergodic nonsingular C^∞-endomorphisms on the 2-torus, not admitting an equivalent σ-finite invariant measure, were constructed in (Hawkins and Silva 1991). In (Avila and Bochi 2007), it is shown that a C^1 generic expanding map of 𝕋 has no absolutely continuous σ-finite invariant measure. Kosloff in (2021) showed that 𝕋^2 admits a C^1 Anosov diffeomorphism of type III_1 with respect to Lebesgue measure. We recall that this phenomenon is impossible in the class of conservative C^{1+α} Anosov diffeomorphisms because by a theorem of Gurevich and Oseledets, every such transformation is of type II_1 (with respect to Lebesgue measure). In a later work (Kosloff 2018), he extended this result to 𝕋^d for every d > 3. The case d = 3 remains open.

Miscellaneous Topics

Let T be an ergodic measure-preserving transformation of an infinite σ-finite nonatomic measure space (X, ℬ, μ).

On Normalizing Constants for Ergodic Theorem
Replacing μ with an equivalent probability measure, one can deduce from the Hurewicz ergodic theorem that the average (1/n) Σ_{i=0}^{n−1} f(T^i x) converges to 0 a.e. for each function f ∈ L^1(X, μ). In view of that, a natural question arises: Is there a sequence of positive numbers (a_n)_{n=1}^∞ such that

$$\frac{1}{a_n}\sum_{i=0}^{n-1} f(T^i x) \to \int f\,d\mu \quad \text{a.e.} \qquad (9)$$

for each f ∈ L^1(X, μ)? Aaronson answered this question negatively in (Aaronson 1977b). He showed that if there is a sequence of positive numbers (a_n)_{n=1}^∞ and a single integrable function f ≥ 0 with ∫ f dμ > 0 such that (9) holds, then μ(X) < ∞.
Thus, no normalizing constants in the ergodic theorem for infinite measure-preserving transformations exist. Since then, other forms of convergence of ergodic averages for a given sequence have been studied, for which the reader may refer to (Aaronson 1981, 1997; Aaronson and Zweimüller 2014; Thaler 1998; Thaler and Zweimüller 2006) and the references therein. The situation is somewhat different in the case of symmetric Birkhoff sums, i.e., when we replace the sum Σ_{i=0}^{n−1} in (9) with Σ_{|i|<n}. It is shown in (Aaronson et al. 2017) that though (9) with Σ_{|i|<n} instead of Σ_{i=0}^{n−1} does not hold for all f ∈ L^1(X, μ), there are examples of ergodic conservative infinite measure-preserving transformations (X, ℬ, μ, T) admitting sequences (a_n)_n with a_n → +∞ and such that

$$\limsup_{n\to\infty}\frac{1}{2a_n}\sum_{|i|<n} f(T^i x) = \int f\,d\mu \quad\text{and}\quad \liminf_{n\to\infty}\frac{1}{2a_n}\sum_{|i|<n} f(T^i x) = \frac{1}{2}\int f\,d\mu \quad \text{a.e.}$$

for each f ∈ L^1_+(X, μ). See also (Kosloff 2017).

Around King’s Weak Closure Theorem
We recall that if S is a probability preserving rank-one map, then C(S) is the weak closure of the set {S^n | n ∈ ℤ} (King theorem, (King 1986)). It is still unclear whether this theorem extends to the infinite measure-preserving rank-one transformations. However, there are some classes of infinite rank-one maps for which it is true: zero-type maps and partially bounded maps.

Definition 14.1 A σ-finite self-joining (of order 2) of T is a σ-finite T × T-invariant measure λ on (X × X, ℬ ⊗ ℬ) such that λ(A × X) = λ(X × A) = μ(A) for all A ∈ ℬ of finite measure. If for each ergodic σ-finite self-joining λ of T, there is n ∈ ℤ such that λ(A × B) = μ(A ∩ T^n B), then T is said to have minimal σ-finite self-joinings (of order 2).
The above concept of MSJ permits us to control the μ-preserving centralizer C_0(T) of T. If T has MSJ, then C_0(T) = {T^n | n ∈ ℤ}. It was shown by Ryzhikov and Thouvenot (Ryzhikov and Thouvenot 2015) that each zero-type transformation of rank one has σ-finite MSJ. Since the rank-one transformations are nonsquashable, it follows that the centralizer of each zero-type rank-one transformation is just its powers.

Definition 14.2 Let T be a rank-one transformation associated with (r_n, ω_n, s_n)_{n=1}^∞. It is called partially bounded if there is L > 0 such that r_n ≤ L, ω_n(0) = ⋯ = ω_n(r_n − 1), max_{0≤i<j<r_n−1} |s_n(i) − s_n(j)| < L, s_n(r_n − 1) = 0 and min_{0≤i<r_n−1} s_n(i) ≥ h_n for each n > 0 (Gaebler et al. 2018).
It was shown in (Gaebler et al. 2018) that for each partially bounded transformation, the centralizer consists of just the powers. Of course, the family of partially bounded transformations does not intersect the set of zero-type rank-one maps.

Asymmetry and Bergelson’s Question

We say that T is asymmetric if T is not isomorphic to T^{-1}. Explicit examples of asymmetric infinite rank-one transformations are constructed in (Danilenko and Ryzhikov 2012; Ryzhikov 2014) (see there for asymmetric maps which embed into a flow). It was shown in (Gaebler et al. 2018) that if T is a partially bounded rank-one transformation, then T is isomorphic to T^{-1} if and only if s_n(i) = s_n(r_n − 2 − i) for all i = 0, …, r_n − 2 eventually in n.
Bergelson asked: Is there T of infinite ergodic index such that T × T^{-1} is not ergodic? Of course, such a T is asymmetric. The question is still open. However, some partial progress was achieved in (Clancy et al. 2016; Danilenko 2016a). In (Clancy et al. 2016), an example of a rank-one T was constructed such that T × T is ergodic but T × T^{-1} is not. Similar examples appeared also in (Danilenko 2016a). However, they do not answer Bergelson’s question because T has ergodic index 2 in these examples. It was also shown in (Danilenko 2016a) that within the class of infinite Markov shifts, the answer to Bergelson’s question is negative. As for the rank-one transformations, it was shown in (Clancy et al. 2016) that T × T^{-1} is always conservative.
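The combinatorial conditions in Definition 14.2 and the symmetry criterion of (Gaebler et al. 2018) can be tested on finitely many stages of a concrete construction. The following Python sketch does this under two assumptions that are not stated in the text above: the column heights obey the usual cutting-and-stacking recursion h_{n+1} = r_n h_n + Σ_i s_n(i), and the initial height is 1. The function names and the two spacer families are hypothetical illustrations, not examples taken from the cited papers. Since the definition concerns all n and the criterion all sufficiently large n, a finite scan can only refute them or give evidence for them; it proves nothing.

```python
from typing import Callable, List, Sequence, Tuple

# A spacer rule maps the current height h_n to [s_n(0), ..., s_n(r_n - 1)].
Spacers = Callable[[int], List[int]]


def next_height(h: int, s: Sequence[int]) -> int:
    """Assumed recursion: h_{n+1} = r_n * h_n + sum_i s_n(i), with r_n = len(s)."""
    return len(s) * h + sum(s)


def partially_bounded_at(h: int, s: Sequence[int], L: int) -> bool:
    """Stage-n conditions of Definition 14.2. The equal-weights condition on
    omega_n is taken for granted (assumption: measure-preserving case)."""
    inner = s[:-1]  # s_n(0), ..., s_n(r_n - 2)
    return (
        len(s) <= L
        and s[-1] == 0
        and max(inner) - min(inner) < L
        and min(inner) >= h
    )


def symmetric_at(s: Sequence[int]) -> bool:
    """Check s_n(i) == s_n(r_n - 2 - i) for i = 0, ..., r_n - 2."""
    inner = list(s[:-1])
    return inner == inner[::-1]


def scan(spacers: Spacers, L: int, stages: int, h0: int = 1) -> Tuple[List[int], bool, bool]:
    """Return (heights, every stage partially bounded?, every stage symmetric?)."""
    h, heights, bounded, symmetric = h0, [h0], True, True
    for _ in range(stages):
        s = spacers(h)
        bounded = bounded and partially_bounded_at(h, s, L)
        symmetric = symmetric and symmetric_at(s)
        h = next_height(h, s)
        heights.append(h)
    return heights, bounded, symmetric


# Two hypothetical spacer families with r_n = 3.
def palindromic(h: int) -> List[int]:
    return [h, h, 0]


def skewed(h: int) -> List[int]:
    return [h, h + 1, 0]
```

For the hypothetical family `skewed`, every scanned stage satisfies the partial boundedness conditions with L = 4 but violates the symmetry condition. If that pattern persisted for all n, the criterion quoted above would give that T is not isomorphic to T^{-1}.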
systems: T is mildly mixing if and only if for each ergodic nonsingular transformation S, the product T × S is ergodic (Furstenberg and Weiss 1978). Furthermore, T is mildly mixing if and only if for each ergodic nonsingular transformation S, the product T × S is orbit equivalent to S (Hawkins and Silva 1997/98). In particular, the associated flows of S and T × S are isomorphic. Moreover, if R is a nonsingular transformation such that R × S is ergodic for any ergodic nonsingular S, then R is of type II_1 (and mildly mixing) (Schmidt and Walters 1982). In this context, we note that for every ergodic infinite measure-preserving transformation T there is an ergodic Markov shift S such that T × S is not conservative, hence not ergodic (Aaronson et al. 1979); also that S can be chosen to be rank-one and rigid (Elyze et al. 2018).

Ergodicity of Gaussian Cocycles
Let T be an ergodic (equivalently, weakly mixing) measure-preserving Gaussian transformation on a standard probability space (X, ℬ, μ), and let H be the corresponding invariant Gaussian subspace of the real Hilbert space L^2_0(X, μ) ≔ L^2(X, μ) ⊖ ℝ. The following conjecture was stated in (Lemańczyk et al. 2001): For each function f ∈ H, either f is a T-coboundary (equivalently, a Gaussian coboundary, i.e., the transfer function belongs to H), or the skew product transformation T_f acting on the product space (X × ℝ, μ × Leb) is ergodic. We note that T_f preserves the infinite measure μ × Leb. The affirmative answer was obtained in (Danilenko and Lemańczyk 2022) (and independently in (Marrakchi and Vaes 2022)) under an assumption that T is mildly mixing. Some examples of ergodic T_f with rigid weakly mixing T were constructed in (Marrakchi and Vaes 2022). However, in the general setting, the problem remains open.

Disjointness and Furstenberg’s Class W^⊥
Two probability-preserving systems (X, μ, T) and (Y, ν, S) are called disjoint if μ × ν is the only T × S-invariant probability measure on X × Y whose coordinate projections are μ and ν, respectively. Furstenberg in (1967) initiated studying the class W^⊥ of transformations disjoint from all weakly mixing ones. Let D denote the class of distal transformations and M(W^⊥) the class of multipliers of W^⊥ (for the definitions, see (Glasner 1994)). Then D ⊂ M(W^⊥) ⊂ W^⊥. In (Danilenko and Lemańczyk 2005; Lemańczyk and Parreau 2003), it was shown by constructing explicit examples that these inclusions are strict. We record this fact here because nonsingular ergodic theory was the key ingredient of the arguments in the two papers pertaining to the theory of probability-preserving systems. The examples are of the form T_{φ,S}(x, y) = (Tx, S_{φ(x)} y), where T is an ergodic rotation on (X, μ), (S_g)_{g∈G} a mildly mixing action of a locally compact group G on Y and φ : X → G a measurable map. Let W_φ denote the Mackey action of G associated with φ, and let (Z, κ) be the space of this action. The key observation is that there exists an affine isomorphism of the simplex of T_{φ,S}-invariant probability measures whose pullback on X is μ and the simplex of W_φ × S quasi-invariant probability measures whose pullback on Z is κ and whose Radon-Nikodym cocycle is measurable with respect to Z. This is a far-reaching generalization of Furstenberg theorem on relative unique ergodicity of ergodic compact group extensions.

Symmetric Stable and Infinitely Divisible Stationary Processes
Rosinsky in (1995) established a remarkable connection between structural studies of stationary stochastic processes and ergodic theory of nonsingular transformations (and flows). For simplicity, we consider only real processes in discrete time. Let X = (X_n)_{n∈ℤ} be a measurable stationary symmetric α-stable (SαS) process, 0 < α < 2. This means that any linear combination Σ_{k=1}^n a_k X_{j_k}, j_k ∈ ℤ, a_k ∈ ℝ has an SαS-distribution. (The case α = 2 corresponds to Gaussian processes.) Then the process admits a spectral representation

$$X_n = \int_Y f_n(y)\,M(dy), \qquad n \in \mathbb{Z}, \qquad (10)$$
where f_n ∈ L^α(Y, μ) for a standard σ-finite measure space (Y, ℬ, μ) and M is an independently scattered random measure on ℬ such that E exp(iuM(A)) = exp(−|u|^α μ(A)) for every A ∈ ℬ of finite measure. By (Rosinsky 1995), one can choose the kernel (f_n)_{n∈ℤ} in a special way: There are a μ-nonsingular transformation T and measurable maps φ : Y → {−1, 1} and f ∈ L^α(Y, μ) such that f_n = U^n f, n ∈ ℤ, where U is the isometry of L^α(Y, μ) given by Ug = φ · (dμ∘T/dμ)^{1/α} · g∘T. If, in addition, the smallest T-invariant σ-algebra containing f^{-1}(ℬ_ℝ) coincides with ℬ and Supp{f∘T^n : n ∈ ℤ} = Y, then the pair (T, φ) is called minimal. It turns out that minimal pairs always exist. Moreover, two minimal pairs (T, φ) and (T′, φ′) representing the same SαS process are equivalent in some natural sense (Rosinsky 1995). Then one can relate ergodic-theoretical properties of (T, φ) to probabilistic properties of (X_n)_{n∈ℤ}. For instance, let Y = C ⊔ D be the Hopf decomposition of Y (see Theorem 2.2). We let X^D_n ≔ ∫_D f_n(y) M(dy) and X^C_n ≔ ∫_C f_n(y) M(dy). Then we obtain a unique (in distribution) decomposition of X into the sum X^D + X^C of two independent stationary SαS-processes.
Another kind of decomposition was considered in (Samorodnitsky 2005). Let P be the largest invariant subset of Y such that T↾P has a finite […] measure Q of X. Then Q is a shift invariant σ-finite measure on ℝ^ℤ. Decomposing the dynamical system (ℝ^ℤ, τ, Q) in various natural ways (Hopf decomposition, 0-type and positive type, so-called “rigidity free” part, and its complement), he obtains corresponding decompositions for the process X. Here τ stands for the shift on ℝ^ℤ.

Poisson Suspensions of Infinite Measure-Preserving Transformations
Poisson suspensions over infinite measure-preserving transformations (see (Roy 2005, 2009)) are widely used in statistical mechanics to model ideal gas, Lorentz gas, etc. (see (Cornfeld et al. 1982)). Being a particular case of the Poisson suspensions over nonsingular transformations (see §3.7), they are always probability preserving. Together with the Gaussian dynamical systems, they are also an important source of examples and counterexamples in ergodic theory. Due to a close similarity with the well-studied Gaussian systems, a natural question arises: Are there ergodic Poisson suspensions whose ergodic self-joinings are Poisson? Such suspensions are called PAP. They are an analogue of GAG in the theory of Gaussian systems (Lemańczyk et al. 2000). Janvresse, de la Rue, and Roy constructed a PAP suspension in (Janvresse et al. 2017) (see also (Parreau and Roy 2008)). The example of an infinite measure-preserving T with “minimal self-joinings” from (Janvresse et al. 2019) plays a crucial role in their construction. We also mention a result of Meyerovitch (Meyerovitch 2013) related to weak mixing of infinite measure-preserving systems and Poisson suspensions: If T is a conservative ergodic infinite measure-preserving transformation, […]

[…] other words, (Y_m)_{m≥1} is the random walk associated with the (nonstationary) process (f∘T^n)_{n≥0}. Let us call this random walk recurrent if the cocycle f of T is recurrent (see §5.5). It was shown in (Schmidt 1984) that in the case μ∘T = μ, i.e., the process is stationary, this definition is equivalent to the standard one.
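To make the notion of a recurrent random walk concrete, the following Python sketch follows the partial sums of a cocycle along a single orbit. It rests on assumptions that are not in the text above: the walk is taken to be Y_m = f(x) + f(Tx) + ⋯ + f(T^{m−1}x) with Y_0 = 0, T is an irrational rotation of the circle (so μ∘T = μ and the process is stationary), and f equals +1 on one half-circle and −1 on the other. All names are hypothetical. Because f is integrable with zero mean, the theorem of Atkinson (Atkinson 1976) gives recurrence of this cocycle; the simulation only illustrates the returns to 0 along one floating-point orbit and proves nothing.

```python
import math
from typing import List


def walk(x0: float, alpha: float, steps: int) -> List[int]:
    """Partial sums Y_0 = 0, Y_m = f(x0) + f(T x0) + ... + f(T^(m-1) x0),
    where T x = x + alpha (mod 1) and f = +1 on [0, 1/2), -1 on [1/2, 1)."""
    x, y, path = x0 % 1.0, 0, [0]
    for _ in range(steps):
        y += 1 if x < 0.5 else -1  # add f at the current orbit point
        path.append(y)
        x = (x + alpha) % 1.0      # move to the next orbit point
    return path


# Assumption for illustration: golden-mean rotation, starting point 0.1.
alpha = (math.sqrt(5) - 1) / 2
path = walk(x0=0.1, alpha=alpha, steps=10_000)

# Number of times m >= 1 with Y_m = 0 along this orbit segment.
returns_to_zero = sum(1 for y in path[1:] if y == 0)
```

Replacing f by a function with nonzero mean makes the partial sums drift linearly, and the count of returns then stays bounded as the number of steps grows.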
analyzing the set of all ergodic quasi-invariant (or just σ-finite invariant) measures because this set is wildly huge (see §2.6). The situation changes if we impose some restrictions on the measures. For instance, if the system under question is a homeomorphism (or a topological flow) defined on a locally compact Polish space, then it is natural to consider the class of (σ-finite) invariant Radon measures, i.e., measures taking finite values on the compact subsets. We give two examples.
First, the seminal results of Giordano, Putnam, and Skau on the topological orbit equivalence of compact Cantor minimal systems were extended to locally compact Cantor minimal (l.c.c.m.) systems in (Danilenko 2001b; Matui 2002). Given a l.c.c.m. system X, we denote by M(X) and M_1(X) the set of invariant Radon measures and the set of invariant probability measures on X. Notice that M_1(X) may be empty (Danilenko 2001b). It was shown in (Matui 2002) that two systems X and X′ are topologically orbit equivalent if and only if there is a homeomorphism of X onto X′ which maps bijectively M(X) onto M(X′) and M_1(X) onto M_1(X′). Thus M(X) retains important information on the system – it is “responsible” for the topological orbit equivalence of the underlying systems. Uniquely ergodic l.c.c.m. systems (with unique up to scaling infinite invariant Radon measure) were constructed in (Danilenko 2001b).
The second example is related to the study of the smooth horocycle flows on tangent bundles of hyperbolic surfaces. Let 𝔻 be the open disk equipped with the hyperbolic metric |dz|/(1 − |z|^2), and let Möb(𝔻) denote the group of Möbius transformations of 𝔻. A hyperbolic surface can be written in the form M ≔ Γ∖Möb(𝔻), where Γ is a torsion-free discrete subgroup of Möb(𝔻). Suppose that Γ is a nontrivial normal subgroup of a lattice Γ_0 in Möb(𝔻). Then M is a regular cover of the finite volume surface M_0 ≔ Γ_0∖Möb(𝔻). The group of deck transformations G = Γ_0/Γ is finitely generated. The horocycle flow (h_t)_{t∈ℝ} and the geodesic flow (g_t)_{t∈ℝ} defined on the unit tangent bundle T^1(𝔻) descend naturally to flows, say h and g, on T^1(M). We consider the problem of classification of the h-invariant Radon measures on M. According to Ratner, h has no finite invariant measures on M if G is infinite (except for measures supported on closed orbits). However, there are infinite invariant Radon measures, for instance, the volume measure. In the case when G is free Abelian and Γ_0 is cocompact, every homomorphism φ : G → ℝ determines a unique up to scaling ergodic invariant Radon measure (e.i.r.m.) m on T^1(M) such that m ∘ dD = exp(φ(D)) m for all D ∈ G (Babillot and Ledrappier 1998) and every e.i.r.m. arises this way (Sarig 2004). Moreover, all these measures are quasi-invariant under g. In the general case, an interesting bijection is established in (Ledrappier and Sarig 2007) between the e.i.r.m. which are quasi-invariant under g and the “non-trivial minimal” positive eigenfunctions of the hyperbolic Laplacian on M. This result was extended in the recent work (Landesberg and Lindenstrauss 2022).

Von Neumann Algebras
There is a fascinating and productive interplay between nonsingular ergodic theory and von Neumann algebras. The two theories alternately influenced development of each other. Let (X, ℬ, μ, T) be a nonsingular dynamical system. Given φ ∈ L^∞(X, μ) and j ∈ ℤ, we define operators A_φ and U_j on the Hilbert space L^2(X × ℤ, μ × ν) by setting

$$(A_\varphi f)(x, i) := \varphi(T^i x)\, f(x, i), \qquad (U_j f)(x, i) := f(x, i - j).$$

Then U_j^* A_φ U_j = A_{φ∘T^j}. Denote by ℳ the von Neumann algebra (i.e., the weak closure of the *-algebra) generated by A_φ, φ ∈ L^∞(X, μ) and U_j, j ∈ ℤ. If T is ergodic and aperiodic, then ℳ is a factor, i.e., ℳ ∩ ℳ′ = ℂ1, where ℳ′ denotes the algebra of bounded operators commuting with ℳ. It is called a Krieger’s factor. Murray-von Neumann-Connes’ type of ℳ is exactly the Krieger’s type of T. The flow of weights of ℳ is isomorphic to the associated flow of T. Two Krieger’s factors are isomorphic if and only if the underlying dynamical systems are orbit equivalent (Krieger 1976b). Moreover, a number of important problems in the theory of
von Neumann algebras such as classification of subfactors, computation of the flow of weights and Connes’ invariants, outer conjugacy for automorphisms, etc. are intimately related to the corresponding problems in nonsingular orbit theory. We refer to (Moore 1982; Feldman and Moore 1977; Giordano and Skandalis 1985a, b; Hamachi and Kosaki 1993; Danilenko and Hamachi 2000) for details.

Representations of CAR
Representations of canonical anticommutation relations (CAR) form one of the most elegant and useful chapters of mathematical physics, providing a natural language for many body quantum physics and quantum field theory. By a representation of CAR, we mean a sequence of bounded linear operators a_1, a_2, … in a separable Hilbert space K such that a_j a_k + a_k a_j = 0 and a_j a_k^* + a_k^* a_j = δ_{j,k}.
Consider {0, 1} as a group with addition mod 2. Then X = {0, 1}^ℕ is a compact Abelian group. Let Γ ≔ {x = (x_1, x_2, …) : lim_{n→∞} x_n = 0}. Then Γ is a dense countable subgroup of X. It is generated by elements δ_k whose k-coordinate is 1 and the other ones are 0. Γ acts on X by translations. Let μ be an ergodic Γ-quasi-invariant measure on X. Let (C_k)_{k≥1} be Borel maps from X to the group of unitary operators in a Hilbert space ℋ satisfying C_k^*(x) = C_k(x + δ_k), C_k(x) C_l(x + δ_l) = C_l(x) C_k(x + δ_k), k ≠ l for a.a. x. In other words, (C_k)_{k≥1} defines a cocycle of the Γ-action. We now put ℋ̃ ≔ L^2(X, μ) ⊗ ℋ and define operators a_k in ℋ̃ by setting

$$(a_k f)(x) = (-1)^{x_1 + \cdots + x_{k-1}} (1 - x_k)\, C_k(x) \sqrt{\frac{d\mu\circ\delta_k}{d\mu}(x)}\; f(x + \delta_k),$$

where f : X → ℋ is an element of ℋ̃ and x = (x_1, x_2, …) ∈ X. It is easy to verify that a_1, a_2, … is a representation of CAR. The converse was established in (Gårding and Wightman 1954; Golodets 1969): Every factor-representation (this means that the von Neumann algebra generated by all a_k is a factor) of CAR can be represented as above for some ergodic measure μ, Hilbert space ℋ, and a Γ-cocycle (C_k)_{k≥1}. Moreover, using nonsingular ergodic theory Golodets (Golodets 1969) constructed for each k = 2, 3, …, ∞, an irreducible representation of CAR such that dim ℋ = k. This answered a question of Gårding and Wightman (Gårding and Wightman 1954) who considered only the case k = 1.

Unitary Representations of Locally Compact Groups
Nonsingular actions appear in a systematic way in the theory of unitary representations of groups. Let G be a locally compact second countable group and H a closed normal subgroup of G. Suppose that H is commutative (or, more generally, of type I, see (Dixmier 1969)). Then the natural action of G by conjugation on H induces a Borel G-action, say α, on the dual space Ĥ – the set of unitary equivalence classes of irreducible unitary representations of H. If now U = (U_g)_{g∈G} is a unitary representation of G in a separable Hilbert space, then by applying Stone decomposition theorem to U↾H one can deduce that α is nonsingular with respect to a measure μ of the “maximal spectral type” for U↾H on Ĥ. Moreover, if U is irreducible, then α is ergodic. Whenever μ is fixed, we obtain a one-to-one correspondence between the set of cohomology classes of irreducible cocycles for α with values in the unitary group on a Hilbert space ℋ and the subset of Ĝ consisting of classes of those unitary representations V for which the measure associated to V↾H is equivalent to μ. This correspondence is used in both directions. From information about cocycles, we can deduce facts about representations and vice versa (Kirillov 1978; Dixmier 1969).

Further Directions

While some of the results that we have cited for nonsingular ℤ-actions extend to actions of locally compact Polish groups (or subclasses of Abelian or amenable ones), many natural questions remain open in the general setting. For instance, what is the Rokhlin lemma, or the pointwise ergodic theorem (for some obstacles toward extension of the
ratio ergodic theorem to nonsingular actions of arbitrary amenable groups, see (Hochman 2013); a weak version of this theorem was proved recently in (Danilenko 2019a)), or the definition of entropy for nonsingular actions of general countable amenable groups? The theory of abstract nonsingular equivalence relations (Feldman and Moore 1977) or, more generally, nonsingular groupoids (Ramsay 1971) and polymorphisms (Vershik 1983) is also a beautiful part of nonsingular ergodic theory that has nice applications: description of semifinite traces of AF-algebras, classification of factor representations of the infinite symmetric group (Vershik and Kerov 1985), path groups (Albeverio et al. 1983), etc. Nonsingular ergodic theory needs different tools for the most part when we pass from ℤ-actions to noninvertible endomorphisms or, more generally, semigroup actions. Several concrete open problems are mentioned throughout the entry.

Bibliography

Aaronson J (1977a) Rational ergodicity and a metric invariant for Markov shifts. Israel J Math 27:93–123
Aaronson J (1977b) On the ergodic theory of non-integrable functions and infinite measure spaces. Israel J Math 27(2):163–173
Aaronson J (1979) Rational ergodicity, bounded rational ergodicity and some continuous measures on the circle. Israel J Math 33:181–197
Aaronson J (1981) The asymptotic distributional behaviour of transformations preserving infinite measures. J Analyse Math 39:203–234
Aaronson J (1983) The eigenvalues of nonsingular transformations. Israel J Math 45:297–312
Aaronson J (1987) The intrinsic normalizing constants of transformations preserving infinite measures. J Analyse Math 49:239–270
Aaronson J (1997) An introduction to infinite ergodic theory. American Mathematical Society, Providence, RI, USA
Aaronson J (2013) Rational weak mixing in infinite measure spaces. Ergodic Theory Dyn Syst 33:1611–1643
Aaronson J (2016) Conditions for rational weak mixing. Stoch Dyn 16:1660004
Aaronson J, Lemańczyk M (2005) Exactness of Rokhlin endomorphisms and weak mixing of Poisson boundaries, algebraic and topological dynamics. In: Contemporary mathematics, 385. American Mathematical Society, Providence, pp 77–88
Aaronson J, Nadkarni M (1987) L^∞ eigenvalues and L^2 spectra of nonsingular transformations. Proc Lond Math Soc 55(3):538–570
Aaronson J, Nakada H (2000) Multiple recurrence of Markov shifts and other infinite measure-preserving transformations. Israel J Math 117:285–310
Aaronson J, Park KK (2009) Predictability, entropy and information of infinite transformations. Fundam Math 206:1–21
Aaronson J, Weiss B (2004) On Herman’s theorem for ergodic, amenable group extensions of endomorphisms. Ergodic Theory Dyn Syst 24(5):1283–1293
Aaronson J, Weiss B (2018) Distributional limits of positive, ergodic stationary processes and infinite ergodic transformations. Ann Inst Henri Poincaré Probab Stat 54:879–906
Aaronson J, Zweimüller R (2014) Limit theory for some positive stationary processes with infinite mean. Ann Inst Henri Poincaré Probab Stat 50(1):256–284
Aaronson J, Lin M, Weiss B (1979) Mixing properties of Markov operators and ergodic transformations, and ergodicity of Cartesian products. Israel J Math 33:198–224
Aaronson J, Kosloff Z, Weiss B (2017) Symmetric Birkhoff sums in infinite ergodic theory. Ergodic Theory Dyn Syst 37:2394–2416
Abdalaoui EH el, Nadkarni MG (2016) A non-singular transformation whose spectrum has Lebesgue component of multiplicity one. Ergodic Theory Dyn Syst 36:671–681
Adams TM (1998) Smorodinsky’s conjecture on rank-one mixing. Proc Am Math Soc 126:739–744
Adams TM (2015) Rigidity sequences of power rationally weakly mixing transformations, preprint, arXiv:1503.05806v1
Adams T, Silva CE (2015) On infinite transformations with maximal control of ergodic two-fold product powers. Israel J Math 209:929–948
Adams T, Silva CE (2016) Weak rational ergodicity does not imply rational ergodicity. Israel J Math 214:491–506
Adams T, Silva CE (2018) Weak mixing for infinite measure invertible transformations. In: Ergodic theory and dynamical systems in their interactions with arithmetics and combinatorics, Lecture notes in Mathematics, 2213, Springer, Cham, pp 327–349
Adams S, Elliott GA, Giordano T (1994) Amenable actions of groups. Trans Am Math Soc 344:803–822
Adams T, Friedman N, Silva CE (1997) Rank-one weak mixing for nonsingular transformations. Israel J Math 102:269–281
Adams T, Friedman N, Silva CE (2001) Rank one power weak mixing for nonsingular transformations. Ergodic Theory Dyn Syst 21:1321–1332
Ageev ON, Silva CE (2001) Genericity of rigid and multiply recurrent infinite measure-preserving and nonsingular transformations, Proceedings of the 16th
summer conference on general topology and its applications. Topology Proc 26(2):357–365, 2001/02
Albeverio S, Hoegh-Krohn R, Testard D, Vershik AM (1983) Factorial representations of Path groups. J Funct Anal 51:115–231
Alpern S, Prasad VS (1990) Return times for nonsingular measurable transformations. J Math Anal Appl 152:470–487
Araki H, Woods EJ (1968) A classification of factors. Pub RIMS, Ser A 3:51–130
Arano Y, Isono Y, Marrakchi M (2021) Ergodic theory of affine isometric actions on Hilbert spaces. Geom Funct Anal 31(5):1013–1094
Atkinson G (1976) Recurrence of co-cycles and random walks. J Lond Math Soc 13:486–488
Avila A, Bochi J (2007) Generic expanding maps without absolutely continuous invariant σ-finite measure. Math Res Lett 14(5):721–730
Avraham-Re’em N (2022) On absolutely continuous invariant measures and Krieger-type of Markov subshifts. J d’Anal Math 147:201–253
Babillot M, Ledrappier F (1998) Geodesic paths and horocycle flow on abelian covers. In: Lie groups and ergodic theory (Mumbai, 1996), Tata Institute of Fundamental Research studies in Mathematics, 14, Tata Institute of Fundamental Research, Bombay, pp 1–32
Berendschot T, Vaes S (2022a) Nonsingular Bernoulli actions of arbitrary Krieger type. Anal PDE 15:1313–1373
Berendschot T, Vaes S (2022b) Bernoulli actions of type III_0 with prescribed associated flow. Journal of the Institute of Mathematics of Jussieu, to appear
Bergelson V, Leibman A (1996) Polynomial extensions of van der Waerden’s and Szemerédi’s theorems. J Am Math Soc 9:725–753
Bezuglyi SI, Golodets VY (1985) Groups of measure space transformations and invariants of outer conjugation for automorphisms from normalizers of type III full groups. J Funct Anal 60(3):341–369
Bezuglyi SI, Golodets VY (1991) Weak equivalence and the structures of cocycles of an ergodic automorphism. Publ Res Inst Math Sci 27(4):577–625
Björklund M, Kosloff Z, Vaes S (2021) Ergodicity and type of nonsingular Bernoulli actions. Invent Math 224(2):573–625
Bonanno C, Giulietti P, Lenci M (2018) Infinite mixing for one-dimensional maps with an indifferent fixed point. Nonlinearity 31:5180–5213
Bowles A, Fidkowski L, Marinello A, Silva CE (2001) Double ergodicity of nonsingular transformations and infinite measure-preserving staircase transformations. Ill J Math 45(3):999–1019
Bozgan F, Sanchez A, Silva CE, Stevens D, Wang J (2015) Subsequence bounded rational ergodicity of rank-one transformations. Dynamical Syst 30:70–84
Bruin H, Melbourne I, Terhesiu D (2019) Rates of mixing for non-Markov infinite measure semi-flows. Trans Am Math Soc 371:7343–7386
Chacon RV, Friedman NA (1965) Approximation and invariant measures. Z Wahrscheinlichkeitstheorie und Verw Gebiete 3:286–295
Choksi JR, Kakutani S (1979) Residuality of ergodic measurable transformations and of ergodic transformations which preserve an infinite measure. Ind Univ Math J 28:453–469
Choksi JR, Nadkarni MG (1994) The maximal spectral type of a rank one transformation. Can Math Bull 37(1):29–36
Choksi JR, Nadkarni MG (2000) Genericity of nonsingular transformations with infinite ergodic index. Colloq Math 84/85:195–201
Choksi JR, Prasad VS (1983) Approximation and Baire category theorems in ergodic theory. In: Measure theory and its applications (Sherbrooke, Que., 1982), Lecture notes in mathematics 1033. Springer, Berlin, pp 94–113
Choksi JR, Hawkins JM, Prasad VS (1987) Abelian cocycles for nonsingular ergodic transformations and the genericity of type III_1 transformations. Monatsh Math 103:187–205
Choksi J, Eigen S, Prasad V (1989) Ergodic theory on homogeneous measure algebras revisited, Measure and measurable dynamics (Rochester, NY, 1987). In: Contemporary Mathematics 94. American Mathematical Society, Providence, pp 73–85
Clancy J, Friedberg R, Kasmalkar I, Loh I, Padurariu T, Silva CE, Vasudevan S (2016) Ergodicity and conservativity of products of infinite transformations and their inverses. Colloq Math 143:271–291
Connes A (1975) On the hierarchy of W. Krieger. Ill J Math 19:428–432
Connes A, Krieger W (1977) Measure space automorphisms, the normalizers of their full groups, and approximate finiteness. J Funct Anal 24(4):336–352
Connes A, Woods EJ (1985) Approximately transitive flows and ITPFI factors. Ergodic Theory Dyn Syst 5(2):203–236
Connes A, Woods EJ (1989) Hyperfinite von Neumann algebras and Poisson boundaries of time dependent random walks. Pacific J Math 37:225–243
Connes A, Feldman J, Weiss B (1981) An amenable equivalence relation is generated by a single transformation. Ergodic Theory Dyn Syst 1:431–450
Cornfeld IP, Fomin SV, Sinaĭ YG (1982) Ergodic theory, Grundlehren der Mathematischen Wissenschaften, 245. Springer–Verlag, New York
Dai I, Garcia X, Padurariu T, Silva CE (2015) On rationally ergodic and rationally weakly mixing rank-one transformations. J Ergodic Theory Dyn Syst 35:1141–1164
Danilenko AI (1995) The topological structure of Polish groups and groupoids of measure space transformations. Publ Res Inst Math Sci 31(5):913–940
Danilenko AI (1998) Quasinormal subrelations of ergodic equivalence relations. Proc Am Math Soc 126(11):3361–3370
Danilenko AI (2001a) Funny rank one weak mixing for nonsingular Abelian actions. Israel J Math 121:29–54
Danilenko AI (2001b) Strong orbit equivalence of locally compact Cantor minimal systems. Int J Math 12:113–123
Danilenko AI (2004) Infinite rank one actions and nonsingular Chacon transformations. Ill J Math 48(3):769–786
Ergodic Theory: Nonsingular Transformations 287
Danilenko AI (2007) (C, F)-actions in ergodic theory, in Danilenko AI, Ryzhikov VV (2010) Spectral multiplicities
“Geometry and Dynamics of Groups and Spaces”. of infinite measure preserving transformations. Funct
Progr Math 265:325–351 Anal Appl 44:161–170
Danilenko AI (2013) A survey on spectral multiplicities Danilenko AI, Ryzhikov VV (2011) Mixing constructions
of ergodic actions. Ergodic Theory Dyn Syst 33: with infinite invariant measure and spectral multiplici-
81–117 ties. Ergodic Theory Dyn Syst 31:853–873
Danilenko AI (2016a) Finite ergodic index and asymmetry Danilenko AI, Ryzhikov VV (2012) On self-similarities of
for infinite measure preserving actions. Proc Am Math ergodic flows. Proc Lond Math Soc 104:431–454
Soc 144:2521–2532 Danilenko AI, Silva CE (2004) Multiple and polynomial
Danilenko AI (2016b) Actions of finite rank: weak rational recurrence for Abelian actions in infinite measure.
ergodicity and partial rigidity. J Ergodic Theory Dyn J Lond Math Soc 69(2):183–200
Syst 36:2138–2171 Danilenko AI, Solomko AV (2009) Infinite measure pre-
Danilenko AI (2017) Directional recurrence and direc- serving flows with infinite ergodic index. Colloq Math
tional rigidity for infinite measure preserving actions 115:13–19
of nilpotent lattices. Ergodic Theory Dyn Syst 37: Danilenko AI, Kosloff Z, Roy E (2022a) Nonsingular
1841–1861 Poisson suspensions. J d’Anal Math, 146:741–790
Danilenko AI (2018) Infinite measure preserving transfor- Danilenko AI, Kosloff Z, Roy E (2022b) Generic non-
mations with Radon MSJ. Israel J Math 228:21–51 singular Poisson suspension is of type III1. Ergodic
Danilenko AI (2019a) Weak mixing for nonsingular Theory Dyn Syst 42(4):1415–1445
Bernoulli actions of countable amenable groups. Proc Day S, Grivna B, McCartney E, Silva CE (1999) Power
Am Math Soc 147:4439–4450 weakly mixing infinite transformations. New York
Danilenko AI (2019b) Rank-one actions, their (C,F)- J Math 5:17–24
models and constructions with bounded parameters. del Junco A (1978) A simple measure-preserving transfor-
J d’Anal Math 139(2):697–749 mation with trivial centralizer. Pac J Math 79:357–362
Danilenko AI (2021) On the bounded cohomology for del Junco A, Rudolph DJ (1987) On ergodic actions whose
ergodic nonsingular actions of amenable groups. Israel self-joinings are graphs. Ergodic Theory Dyn Syst 7:
J Math 243(1):421–436 531–557
Danilenko AI (2022) Krieger’s type for ergodic non- del Junco A, Şahin A (2009) Dyesś theorem in the almost
singular Poisson actions of non-(T) locally compact continuous category. Israel J Math 173:235–251
groups. Ergodic Theory Dyn Systems, to appear del Junco A, Silva CE (1995) Prime type IIIl automor-
Danilenko AI, del Junco A (2011) Almost continuous orbit phisms: an instance of coding techniques applied to
equivalence for non-singular homeomorphisms. Israel nonsingular maps. In: Takahashi Y (ed) Fractals and
J Math 183:165–188 dynamics (Okayama/Kyoto, 1992). Plenum, New York,
Danilenko AI, Golodets VY (1996) On extension of pp 101–115
cocycles to normalizer elements, outer conjugacy, and del Junco A, Silva CE (2003) On factors of nonsingular
related problems. Trans Am Math Soc 348(12): Cartesian products. Ergodic Theory Dyn Syst 23(5):
4857–4882 1445–1465
Danilenko AI, Hamachi T (2000) On measure theoretical Derriennic Y, Frączek K, Lemańczyk M, Parreau F (2008)
analogues of the Takesaki structure theorem for type III Ergodic automorphisms whose weak closure of off-
factors. Colloq Math 84/85:485–493 diagonal measures consists of ergodic self-joinings.
Danilenko AI, Kosloff Z (2022) Krieger’s type of non- Colloq Math 110:81–115
singular Poisson suspensions and IDPFT systems. Dixmier J (1969) Les C*-algebres et leurs représentations.
Proc Am Math Soc 150(4):1541–1557 Gauthier–Villars Editeur, Paris
Danilenko AI, Lemańczyk M (2005) A class of multipliers Dolgopyat D, Nándori P (2019) Infinite measure renewal
for W ⊥. Isr J Undergrad Math 148:137–168 theorem and related results. Bull Lond Math Soc 51:
Danilenko AI, Lemańczyk M (2019) K-property for 145–167
Maharam extensions of non-singular Bernoulli and Dooley AN, Hagihara R (2012) Computing the critical
Markov shifts. Ergodic Theory Dyn Syst 39(12): dimensions of Bratteli-Vershik systems with multiple
3292–3321 edges. Ergodic Theory Dyn Syst 32:103–117
Danilenko AI, Lemańczyk M (2022) Ergodic cocycles of Dooley AH, Hamachi T (2003a) Nonsingular dynamical
IDPFT systems and non-singular Gaussian actions. systems, Bratteli diagrams and Markov odometers.
Ergodic Theory Dyn Syst 42(5):1624–1654 Israel J Math 138:93–123
Danilenko AI, Park KK (2011) Rank-one flows of trans- Dooley AH, Hamachi T (2003b) Markov odometer actions
formations with infinite ergodic index. Proc Am Math not of product type. Ergodic Theory Dyn Syst 23:
Soc 139:201–207 813–829
Danilenko AI, Rudolph DJ (2009) Conditional entropy Dooley AH, Mortiss G (2006) On the critical dimension
theory in infinite measure and a question of Krengel. and AC entropy for Markov odometers. Monatsh Math
Israel J Math 172:93–117 149:193–213
288 Ergodic Theory: Nonsingular Transformations
Dooley AH, Mortiss G (2007) The critical dimensions of Gaebler J, Kastner A, Silva CE, Xu X, Zhou Z (2018)
Hamachi shifts. Tohoku Math J. 59(2):57–66 Partially bounded transformations have trivial central-
Dooley AH, Mortiss G (2009) On the critical dimension of izers. Proc Am Math Soc 146:5113–5127
product odometers. Ergodic Theory Dyn Syst 29: Gårding L, Wightman AS (1954) Representation of anti-
475–485 commutation relations. Proc Natl Acad Sci U S A 40:
Dooley AH, Klemes I, Quas AN (1998) Product and Mar- 617–621
kov measures of type III. J Aust Math Soc Ser A 65(1): Giordano T, Skandalis G (1985a) Krieger factors isomor-
84–110 phic to their tensor square and pure point spectrum
Dye H (1959) On groups of measure-preserving transfor- flows. J Funct Anal 64(2):209–226
mations I. Am J Math 81:119–159, and II, Am J Math Giordano T, Skandalis G (1985b) On infinite tensor prod-
85 (1963), 551–576 ucts of factors of type I2. Ergodic Theory Dyn Syst 5:
Effros EG (1965) Transformation groups and C*-algebras. 565–586
Ann Math 81(2):38–55 Glasner E (1994) On the multipliers of W ⊥ . Ergodic
Eigen SJ (1981) On the simplicity of the full group of Theory Dyn Syst 14:129–140
ergodic transformations. Israel J Math Glasner E, Weiss B (2016) Weak mixing properties for
40(3–4):345–349 non-singular actions. Ergodic Theory Dyn Syst 36:
Eigen SJ (1982) The group of measure preserving trans- 2203–2217
formations of [0,1] has no outer automorphisms. Math Glimm J (1961) Locally compact transformation groups.
Ann 259:259–270 Trans Am Math Soc 101:124–138
Eigen S, Hajian A, Halverson K (1998a) Multiple recur- Golodets VY (1969) A description of the representations of
rence and infinite measure preserving odometers. Israel anticommutation relations. Uspehi Matemat Nauk
J Math 108:37–44 24(4):43–64
Eigen S, Hajian A, Weiss B (1998b) Borel automorphisms Golodets VY, Sinel’shchikov SD (1983) Existence and
with no finite invariant measure. Proc Am Math Soc uniqueness of cocycles of ergodic automorphism with
126:3619–3623 dense range in amenable groups. Preprint FTINT AN
Elyze M, Kastner A, Ortiz Rhoton J, Semenov V, Silva CE USSR, pp 19–83
(2018) On conservative sequences and their application Golodets VY, Sinel’shchikov SD (1985) Locally compact
to ergodic multiplier problems. Colloq Math 151: groups appearing as ranges of cocycles of ergodic
123–145 Z-actions. Ergodic Theory Dyn Syst 5:47–57
Fedorov A (1985) Krieger’s theorem for cocycles, preprint Golodets VY, Sinel’shchikov SD (1990) Amenable ergodic
Feldman J, Moore CC (1977) Ergodic equivalence rela- actions of groups and images of cocycles. (Russian).
tions, cohomology, and von Neumann algebras. Dokl Akad Nauk SSSR 312(6):1296–1299
I. Trans Am Math Soc 234:289–324 Golodets VY, Sinel’shchikov SD (1994) Classification and
Ferenczi S (1985) Systèmes de rang un gauche. Ann Inst structure of cocycles of amenable ergodic equivalence
H Poincare’ Probab Statist 21:177–186 relations. J Funct Anal 121:455–485
Friedman NA (1970) Introduction to Ergodic Theory. Van Gouëzel S (2011) Correlation asymptotics from large devi-
Nostrand ations in dynamical systems with infinite measure.
Friedman NA (1978) Mixing transformations in an infinite Colloq Math 125:193–212
measure space, Studies in probability and ergodic the- Grosser S, Moskowitz M (1971) Compactness conditions
ory. Adv Math Suppl Stud 2:167–184 in topological groups. J Reine Angew Math 246:1–40
Furman A (2002) Random walks on groups and random Gruher K, Hines F, Patel D, Silva CE, Waelder R (2003)
transformations. In: Handbook of dynamical systems, Power weak mixing does not imply multiple recurrence
vol 1A. North-Holland, Amsterdam, pp 931–1014 in infinite measure and other counterexamples.
Furstenberg H (1967) Disjointness in ergodic theory, min- New York J Math 9:1–22
imal sets and diophantine approximation. Math Syst Haddock B, Leng J, Silva CE (2022) Nonsingular trans-
Theory 1:1–49 formations that are ergodic with isometric coefficients
Furstenberg H (1981) Recurrence in ergodic theory and and not weakly doubly ergodic. Indagationes
combinatorial number theory. Princeton University Mathematicae 33: 1297–1311
Press, Princeton Hajian AB, Kakutani S (1964) Weakly wandering sets and
Furstenberg H, Glasner E (2010) Stationary dynamical invariant measures. Trans Am Math Soc 110:136–151
systems. In: Dynamical numbers interplay between Halmos PR (1946) An ergodic theorem. Proc Natl Acad
dynamical systems and number theory, Contemporary Sci U S A 32:156–161
mathematics, vol 532. American Mathematical Society, Halmos PR (1956) Lectures on ergodic theory. Publication
pp 1–28 of the Mathematical Society of Japan 3, Tokyo.
Furstenberg H, Weiss B (1978) The finite multipliers of Reprinted Chelsea Publishing Co., New York, 1960
infinite ergodic transformations. In: The structure of Hamachi T (1981a) The normalizer group of an ergodic
attractors in dynamical systems, Lecture notes in math- automorphism of type III and the commutant of an
ematics 668. Springer, Berlin, pp 127–132 ergodic flow. J Funct Anal 40:387–403
Ergodic Theory: Nonsingular Transformations 289
Hamachi T (1981b) On a Bernoulli shift with nonidentical Host B, Méla J-F, Parreau F (1991) Nonsingular transfor-
factor measures. Ergodic Theory Dyn Syst 1:273–283 mations and spectral analysis of measures. Bull Soc
Hamachi T (1992) A measure theoretical proof of the Math France 119:33–90
Connes-Woods theorem on AT-flows. Pac J Math 154: Hurewicz W (1944) Ergodic theorem without invariant
67–85 measure. Ann Math 45:192–206
Hamachi T, Kosaki H (1993) Orbital factor map. Ergodic Inoue K (2004) Isometric extensions and multiple recur-
Theory Dyn Syst 13:515–532 rence of infinite measure preserving systems. Israel
Hamachi T, Osikawa M (1981) Ergodic groups of auto- J Math 140:245–252
morphisms and Krieger’s theorems. Seminar on Math- Ionescu Tulcea A (1965) On the category of certain classes
ematical Science, Keio University, 3 of transformations in ergodic theory. Trans Am Math
Hamachi T, Osikawa M (1986) Computation of the asso- Soc 114:261–279
ciated flows of ITPFI2 factors of type III0, Geometric James J, Koberda T, Lindsey K, Silva CE, Speh P (2008)
methods in operator algebras (Kyoto, 1983), Pitman Measurable sensitivity. Proc Am Math Soc 136:
research notes in mathematics series 123. Longman 3549–3559
Scientific and Technical, Harlow, pp 196–210 Janvresse E, de la Rue T (2012) Zero Krengel entropy does
Hamachi T, Silva CE (2000) On nonsingular Chacon trans- not kill Poisson entropy. Annales de l’I.H.P, Pro-
formations. Ill J Math 44:868–883 babilit’es et statistiques 48:368–376
Hawkins JM (1982) Non-ITPFI diffeomorphisms. Israel Janvresse E, Meyerovitch T, de la Rue T, Roy E (2010)
J Math 42:117–131 Poisson suspensions and entropy of infinite transfor-
Hawkins JM (1983) Smooth type III diffeomorphisms of mations. Trans Am Math Soc 362:3069–3094
manifolds. Trans Am Math Soc 276:625–643 Janvresse E, de la Rue T, Roy E (2017) Poisson suspen-
Hawkins JM (1990a) Diffeomorphisms of manifolds with sions and SuShis. Annales Scientifiques de l’ École
nonsingular Poincare flows. J Math Anal Appl 145(2): Normale Supérieure 50:1301–1334
419–430 Janvresse E, de la Rue T, Roy E (2018) Invariant measures
Hawkins JM (1990b) Properties of ergodic flows associ- for Cartesian powers of Chacon infinite transformation.
ated to product odometers. Pac J Math 141:287–294 Israel J Math 224:1–37
Hawkins J, Schmidt K (1982) On C2-diffeomorphisms of Janvresse E, de la Rue T, Roy E (2019) Nearly finite
the circle which are of type III1. Invent Math 66(3): Chacon transformation. Ann H Lebesgue 2:369–414
511–518 Jaworski W (1994) Strongly approximatively transitive
Hawkins J, Silva CE (1991) Noninvertible transformations actions, the Choquet-Deny theorem, and polynomial
admitting no absolutely continuous s-finite invariant growth. Pac J Math 165:115–129
measure. Proc Am Math Soc 111(2):455–463 Johnson ASA, Şahin AA (2015) Directional recurrence for
Hawkins J, Silva CE (1997/98) Characterizing mildly infinite measure preserving Zd-actions. Ergodic Theory
mixing actions by orbit equivalence of products. Dyn Syst 35:2138–2150
New York J Math, 3A. Proceedings of the New York Joita M, Munteanu RB (2014) A property of ergodic flows.
Journal of Mathematics Conference, June 9–13, Stud Math 225:249–258
(1997), 99–115 Kabanov JM, Lipcer RS, Sirjaev AN (1977) On the ques-
Hawkins J, Woods EJ (1984) Approximately transitive tion of absolute continuity and singularity of probabil-
diffeomorphisms of the circle. Proc Am Math Soc ity measures. Math USSR Sbornik 33:203–221
90(2):258–262 Kaimanovich VA, Vershik AM (1983) Random walks on
Herman M (1979a) Construction de difféomorphismes groups: boundary and entropy. Ann Probab 11:
ergodiques, preprint 457–490
Herman M-R (1979b) Sur la conjugaison differentiable des Kakutani S (1948) On equivalence of infinite product mea-
diffeomorphismes du cercle a des rotations. (French). sures. Ann Math 49:214–224
Institute Hautes Etudes Science Publication Mathemat- Kakutani S, Parry W (1963) Infinite measure preserving
ics, No. 49, 5–233 transformations with “mixing”. Bull Am Math Soc 69:
Herman RH, Putnam IF, Skau CF (1992) Ordered Bratteli 752–756
diagrams, dimension groups and topological dynamics. Katznelson Y (1977) Sigma-finite invariant measures for
Int J Math 3(6):827–864 smooth mappings of the circle. J Analyse Math 31:
Hochman M (2013) On the ratio ergodic theorem for group 1–18
actions. J Lond Math Soc 88:465–482 Katznelson Y (1979) The action of diffeomorphism of the
Hochman M (2019) Every Borel automorphism without circle on the Lebesgue measure. J Analyse Math 36:
finite invariant measures admits a two-set generator. 156–166
J Eur Math Soc (JEMS) 21:271–317 Katznelson Y, Weiss B (1972) The construction of quasi-
Hopf E (1937) Ergodentheorie, Ergebnisse der Mathematik invariant measures. Israel J Math 12:1–4
und ihrer Grenzgebiete, vol 3. Springer, Berlin, p 5 Katznelson Y, Weiss B (1991) The classification of non-
Host B, Méla J-F, Parreau J-F (1986) Analyse harmonique singular actions, revisited. Ergodic Theory Dyn Syst
des mesures. Astérisque, No. 135–136 11:333–348
290 Ergodic Theory: Nonsingular Transformations
Keane M, Smorodinsky M (1979) Bernoulli schemes of the (Proceedings of the conference on Ohio State Univer-
same entropy are finitarily isomorphic. Ann Math 109: sity, Columbus, 1970). Lecture notes in mathematics,
397–406 vol 160. Springer, Berlin, pp 158–177
King JL (1986) The commutant is the weak closure of the Krieger W (1972) On the infinite product construction of
powers, for rank-1 transformations. Ergodic Theory nonsingular transformations of a measure space. Invent
Dyn Syst 6:363–384 Math 15:144–163; and erratum in 26 (1974), 323–328
Kirillov AA (1978) Elements of the theory of representa- Krieger W (1976a) On Borel automorphisms and their
tions. Nauka, Moscow quasi-invariant measures. Math Z 151:19–24
Kosloff Z (2011) On a type III1 Bernoulli shift. Ergodic Krieger W (1976b) On ergodic flows and isomorphism of
Theory Dyn Syst 31:1727–1743 factors. Math Ann 223:19–70
Kosloff Z (2013) The zero-type property and mixing of Kubo I (1969) Quasi-flows. Nagoya Math J 35:1–30
Bernoulli shifts. Ergodic Theory Dyn Syst 33:549–559 Landesberg O, Lindenstrauss E (2022) On Radon measures
Kosloff Z (2014) On the K property for Maharam exten- invariant under horospherical flows on geometrically
sions of Bernoulli shifts and a question of Krengel. infinite quotients. Int Math Res Not IMRN:11602–11641
Israel J Math 199:485–506 Ledrappier F, Sarig O (2007) Invariant measures for the
Kosloff Z (2017) A universal divergence rate for symmet- horocycle flow on periodic hyperbolic surfaces. Israel
ric Birkhoff sums in infinite ergodic theory. Trans Am J Math 160:281–315
Math Soc 369:6373–6388 Lehrer E, Weiss B (1982) An ϵ-free Rokhlin lemma. Ergo-
Kosloff Z (2018) On manifolds admitting stable type III1 dic Theory Dyn Syst 2:45–48
Anosov diffeomorphisms. J Modern Dyn 13:251–270 Lemańczyk M, Parreau F (2003) Rokhlin extensions and
Kosloff Z (2019) Proving ergodicity via divergence of lifting disjointness. Ergodic Theory Dyn Syst 23:
ergodic sums. Stud Math 248:191–215 1525–1550
Kosloff Z (2021) Conservative Anosov diffeomorphisms of Lemańczyk M, Parreau F, Thouvenot J-P (2000) Gaussian
the two torus without an absolutely continuous invariant automorphisms whose ergodic self-joinings are Gauss-
measure. Ann Sci Ec Norm Supér 54(4):69–131 ian. Fundam Math 164:253–293
Kosloff Z, Soo T (2021) The orbital equivalence of Lemańczyk M, Lesigne E, Skrenty D (2001) Multiplicative
Bernoulli actions and their Sinai factors. J Mod Dyn Gaussian cocycles. Aequationes Math 61:162–178
17:145–182 Lenci M (2010) On infinite-volume mixing. Commun
Kosloff Z, Soo T (2022) Some factors of nonsingular Math Phys 298:485–514
Bernoulli shifts. Stud Math 262(1):23–43 Lenci M (2013) Exactness, K-property and infinite mixing.
Krengel U (1967) Entropy of conservative transforma- Publ Mat Urug 14:159–170
tions. Z Wahrscheinlichkeitstheorie und Verw Gebiete Lenci M (2017) Uniformly expanding Markov maps of the
7:161–181 real line: exactness and infinite mixing. Discrete Contin
Krengel U (1969) Darstellungssätze für Strömungen und Dyn Syst 37:3867–3903
Halbströmungen. II. Math Ann 182:1–39 Loh I, Silva CE (2017) Strict doubly ergodic infinite trans-
Krengel U (1970) Transformations without finite invariant formations. Dyn Syst 32(4):519–543
measure have finite strong generators. In: Contributions Loh I, Silva CE, Athiwaratkun B (2018) Infinite symmetric
to Ergodic theory and probability (Proceedings of the ergodic index and related examples in infinite measure.
Conference on Ohio State University, Columbus, Stud Math 243:101–115
1970). Springer, Berlin, pp 133–157 Mackey GW (1966) Ergodic theory and virtual group.
Krengel U (1976) On Rudolph’s representation of aperiodic Math Ann 166:187–207
flows. Ann Inst H Poincaré Sect B (NS) 12(4):319–338 Maharam D (1964) Incompressible transformations. Fund
Krengel U (1985) Ergodic theorems. de Gruyter Studies in Math LVI:35–50
Mathematics, Berlin Mandrekar V, Nadkarni M (1969) On ergodic quasi-
Krengel U, Sucheston L (1969) On mixing in infinite invariant measures on the circle group. J Funct Anal
measure spaces. Z. Wahrscheinlichkeitstheorie und 3:157–163
Verw. Gebiete 13:150–164 Mansfield DF, Dooley AN (2017) The critical dimension
Krickeberg K (1967) Strong mixing properties of Markov for G-measures. Ergodic Theory Dyn Syst 37:824–836
chains with infinite invariant measure. In: Proceedings Marrakchi A, Vaes S (2022) Nonsingular Gaussian actions:
of the Fifth Berkeley symposium on mathematical sta- beyond the mixing case. Adv Math 397:108190, 62 p
tistics and probability (Berkeley, 1965/66), Vol. II: Matui H (2002) Topological orbit equivalence of locally
Contributions to probability theory, Part 2. University compact Cantor minimal systems. Ergodic Theory Dyn
of California Press, Berkeley, pp 431–446 Syst 22:1871–1903
Krieger W (1969) On nonsingular transformations of a Méla J-F (1983) Groupes de valeurs propres des systèmes
measure space, I, II. Z Wahrscheinlichkeitstheorie und dynamiques et sous-groupes saturés du cercle. C R
Verw Gebiete 11:83–119 Acad Sci Paris Se’r I Math 296(10):419–422
Krieger W (1970) On the Araki-Woods asymptotic ratio set Melbourne I (2015) Mixing for invertible dynamical sys-
and nonsingular transformations of a measure space. In: tems with infinite measure. Stoch Dyn 15(2):
Contributions to Ergodic theory and probability 1550012, 25
Ergodic Theory: Nonsingular Transformations 291
Melbourne I, Terhesiu D (2012) Operator renewal theory Parry W (1966) Generators and strong generators in ergo-
and mixing rates for dynamical systems with infinite dic theory. Bull Am Math Soc 72:294–296
measure. Invent Math 189:61–110 Parry W (1969) Entropy and generators in ergodic theory.
Meyerovitch T (2007) On multiple and polynomial recur- W. A. Benjamin, Inc., New York/Amsterdam
rent extensions of infinite measure preserving transfor- Parthasarathy KR, Schmidt K (1977) On the cohomology
mations, preprint. ArXiv: http://arxiv.org/abs/math/ of a hyperfinite action. Monatsh Math 84(1):37–48
0703914 Ramsay A (1971) Virtual groups and group actions. Adv
Meyerovitch T (2013) Ergodicity of Poisson products and Math 6:243–322
applications. Ann Probab 41:3181–3200 Rokhlin VA (1965) Generators in ergodic theory. II.
Milnor J (1988) On the entropy geometry of cellular (Russian. English summary). Vestnik Leningrad Univ
automata. Complex Syst 2:357–385 20(13):68–72
Moore CC (1967) Invariant measures on product spaces. Rosinsky J (1995) On the structure of stationary stable
In: Proceedings of the Fifth Berkeley symposium, processes. Ann Probab 23:1163–1187
pp 447–459 Roy E (2005) Mesures de Poisson, infinie divisibilité et
Moore CC (1982) Ergodic theory and von Neumann alge- propriétés ergodiques, Thèse de doctorat de
bras. In: Proceedings of the symposium on pure math- l’Université Paris 6
ematics, 38. American Mathematical Society, Roy E (2007) Ergodic properties of Poissonian ID pro-
Providence, pp 179–226 cesses. Ann Probab 35:551–576
Moore CC, Schmidt K (1980) Coboundaries and homo- Roy E (2009) Poisson suspensions and infinite ergodic
morphisms for nonsingular actions and a problem of theory. Ergodic Theory Dyn Syst 29:667–683
H. Helson. Proc Lond Math Soc 40(3):443–475 Roy E (2010) Poisson-Pinsker factor and infinite measure
Mortiss G (2000) A non-singular inverse Vitali lemma with preserving group actions. Proc Am Math Soc 138:
applications. Ergodic Theory Dyn Syst 20:1215–1229 2087–2094
Mortiss G (2002) Average co-ordinate entropy. J Aust Rudolph DJ (1985) Restricted orbit equivalence. Mem Am
Math Soc 73:171–186 Math Soc 323
Mortiss G (2003) An invariant for nonsingular isomor- Rudolph D, Silva CE (1989) Minimal self-joinings for
phism. Ergodic Theory Dyn Syst 23:885–893 nonsingular transformations. Ergodic Theory Dyn
Munteanu RB (2012) A non-product type non-singular Syst 9:759–800
transformation which satisfies Krieger’s Property Ryzhikov VV (1993) Factorization of an automorphism of
A. Israel J Math 190:307–324 a full Boolean algebra into the product of three involu-
Nadkarni MG (1979) On spectra of nonsingular transfor- tions. (Russian). Mat Zametki 54(2):79–84, 159; trans-
mations and flows. Sankhya Ser A 41(1–2):59–66 lation in Math. Notes 54 (1993), no. 1–2,
Nadkarni MG (1998) Spectral theory of dynamical sys- 821–824 (1994)
tems, Birkhäuser Advanced Texts: Basler Lehrbücher. Ryzhikov VV (2014) On the asymmetry of multiple
Birkhäuser Verlag, Basel asymptotic properties of ergodic actions. Math Notes
Neretin YA (1996) Categories of symmetries and infinite- 96:416–422
dimensional groups. Oxford University Press, New York Ryzhikov VV, Thouvenot J-P (2015) On the centralizer of
Ornstein D (1960) On invariant measures. Bull Am Math an infinite mixing rank-one transformation. Funct Anal
Soc 66:297–300 Appl 49:230–233
Ornstein D (1972) On the root problem in ergodic theory. Sachdeva U (1971) On category of mixing in infinite
In: Proceedings of the Sixth Berkeley symposium on measure spaces. Math Syst Theory 5:319–330
mathematical statistics and probability. University of Samorodnitsky G (2005) Null flows, positive flows and the
California Press, pp 347–356 structure of stationary symmetric stable processes. Ann
Osikawa M (1977/78) Point spectra of nonsingular flows. Probab 33:1782–1803
Publ Res Inst Math Sci 13:167–172 Sarig O (2004) Invariant measures for the horocycle flows
Osikawa M (1988) Ergodic properties of product type on Abelian covers. Invent Math 157:519–551
odometers. Springer Lect Notes Math 1299:404–414 Schmidt K (1977a) Cocycles on ergodic transformation
Osikawa M, Hamachi T (1971) On zero type and positive groups. Macmillan lectures in mathematics,
type transformations with infinite invariant measures. vol 1. Macmillan Company of India, Ltd., Delhi
Mem Facul Sci, Kyushu Univ 25:280–295 Schmidt K (1977b) Infinite invariant measures in the circle.
Parreau F, Roy E (2008) Poisson joinings of Poisson sus- Symp Math 21:37–43
pensions, preprint Schmidt K (1982) Spectra of ergodic group actions. Israel
Parry W (1963) An ergodic theorem of information theory J Math 41(1–2):151–153
without invariant measure. Proc Lond Math Soc 13(3): Schmidt K (1984) On recurrence. Z Wahrscheinlich-
605–612 keitstheorie verw Gebiete 68:75–95
Parry W (1965) Ergodic and spectral analysis of certain Schmidt K, Walters P (1982) Mildly mixing actions of
infinite measure preserving transformations. Proc Am locally compact groups. Proc Lond Math Soc 45:
Math Soc 16:960–966 506–518
292 Ergodic Theory: Nonsingular Transformations
Shelah S, Weiss B (1982) Measurable recurrence and Series, 205. Cambridge University Press, Cambridge,
quasi-invariant measures. Israel J Math 43:154–160 pp 207–235
Silva CE (1988) On m-recurrent nonsingular endomor- Ullman D (1987) A generalization of a theorem of Atkinson
phisms. Israel J Math 61:1–13 to non-invariant measures. Pac J Math 130:187–193
Silva CE, Thieullen P (1991) The subadditive ergodic Vaes S, Wahl J (2018) Bernoulli actions of type III1 and L2-
theorem and recurrence properties of Markovian trans- cohomology. Geom Funct Anal 28:518–562
formations. J Math Anal Appl 154(1):83–99 Vershik AM (1983) Many valued mappings with invariant
Silva CE, Thieullen P (1995) A skew product entropy for measure (polymorphisms) and Markov processes.
nonsingular transformations. J Lond Math Soc 52(2): J Sov Math 23:2243–2266
497–516 Vershik AM, Kerov SV (1985) Locally semisimple alge-
Silva CE, Witte D (1992) On quotients of nonsingular actions bras. Combinatorial theory and K0-functor. Modern
whose self-joinings are graphs. Int J Math 5:219–237 Prob Math 26:3–56
Thaler M (1998) The Dynkin-Lamperti arc-sine laws for Yuasa H (2013) Uniform sets for infinite measure-
measure preserving transformations. Trans Am Math preserving systems. J d’Anal Math 120:333–356
Soc 350(11):4593–4607 Yuasa H (2020) A relative, strictly ergodic model theorem
Thaler M, Zweimüller R (2006) Distributional limit theo- for infinite measure-preserving systems. J d’Anal Math
rems in infinite ergodic theory. Probab Theory Relat 140:591–616
Fields 135(1):15–52 Zimmer RJ (1977) Random walks on compact groups and
Thouvenot J-P (1995) Some properties and applications of the existence of cocycles. Israel J Math 26:84–90
joinings in ergodic theory. In: Ergodic theory and its Zimmer RJ (1978) Amenable ergodic group actions and an
connections with harmonic analysis (Alexandria, application to Poisson boundaries of random walks.
1993), London Mathematical Society, Lecture Note J Funct Anal 27:350–372
Sarnak's Conjecture from the Ergodic Theory Point of View

Joanna Kułaga-Przymus and Mariusz Lemańczyk
Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland

Article Outline

Glossary
Definition of the Subject
Introduction
Chowla Conjecture
Sarnak's Conjecture
Arithmetic Properties of the Möbius Function
Future Directions
Bibliography

Glossary

Aperiodicity We say that $u : \mathbb{N} \to \mathbb{C}$ is aperiodic whenever $u$ has a mean, equal to zero, along each arithmetic progression: $\lim_{N \to \infty} \frac{1}{N} \sum_{n \le N} u(an + b) = 0$. Many classical multiplicative functions are aperiodic, including $\mu$ and $\lambda$.
For a parameter $N \ge 1$, a distance between $u, v : \mathbb{N} \to \mathbb{D}$ (the unit disc) is defined as
$$\mathbb{D}(u, v; N) := \Big( \sum_{p \in \mathbb{P},\, p \le N} \frac{1 - \operatorname{Re}\big(u(p)\overline{v(p)}\big)}{p} \Big)^{1/2}.$$
We say that $u : \mathbb{N} \to \mathbb{D}$ is strongly aperiodic (Matomäki et al. 2015), whenever $M(u \cdot \chi; N) := \min_{|t| \le N} \mathbb{D}(u \cdot \chi, n^{it}; N)^2 \to \infty$ as $N \to \infty$ for every Dirichlet character $\chi$ (i.e., for every periodic, completely multiplicative function). Strong aperiodicity implies aperiodicity. The converse is not in general true (see Theorem B.1 in (Matomäki et al. 2015)), but it is true for (bounded) real valued multiplicative functions (see Appendix C in (Matomäki et al. 2015)). In particular, $\mu$ and $\lambda$ are strongly aperiodic.

Arithmetic function A sequence of complex numbers is usually denoted by $u = (u_n)$. But if such a sequence is, in some sense, important from number theory point of view, one speaks about an arithmetic function and rather writes $u : \mathbb{N} \to \mathbb{C}$, $u = (u(n))$.
An arithmetic function $u$ is said to be multiplicative, whenever $u(1) = 1$ and $u(m \cdot n) = u(m) \cdot u(n)$ for any choice of coprime $m, n \in \mathbb{N}$. The prominent examples of multiplicative functions are the Möbius function $\mu$ and the Liouville function $\lambda$. The Möbius function $\mu : \mathbb{N} \to \{-1, 0, 1\}$ is defined by $\mu(p_1 \cdots p_k) = (-1)^k$ for different prime numbers $p_1, \ldots, p_k$ (in what follows, the set of primes is denoted by $\mathbb{P}$), $\mu(1) = 1$ and $\mu(n) = 0$ for all non-square-free numbers. The Liouville function $\lambda : \mathbb{N} \to \{-1, 1\}$ is given by $\lambda(n) = (-1)^{i_1 + \ldots + i_k}$ for $n = p_1^{i_1} \cdots p_k^{i_k}$ with $p_1, \ldots, p_k \in \mathbb{P}$ and $i_1, \ldots, i_k \in \mathbb{N}$. Clearly $\mu = \lambda \cdot \mu^2$, where $\mu^2$ is nothing but the characteristic function of the set $S$ of square-free numbers. In fact, $\lambda$ is completely multiplicative, that is, $\lambda(m \cdot n) = \lambda(m) \cdot \lambda(n)$ for any choice of $m, n \in \mathbb{N}$. We extend both, $\mu$ and $\lambda$, to negative coordinates symmetrically.

Completely deterministic point We say that point $x \in X$ is completely deterministic (Weiss 1971) (see also (Kamae 1973)) if for any $\nu \in V(x)$, we have $h(T, X, \mathcal{B}(X), \nu) = 0$. By the variational principle, $h_{\mathrm{top}}(T, X) = 0$ if and only if all points of $X$ are completely deterministic.

Entropy There are two basic notions of entropy: topological and measure-theoretic. We skip the definitions and refer the reader, for example, to (Downarowicz 2011). For a topological dynamical system $(T, X)$ its topological entropy will be denoted by $h_{\mathrm{top}}(T, X)$ and for a measure-theoretic dynamical system $(T, X, \mathcal{B}, \nu)$ the corresponding measure-theoretic entropy will
be denoted by $h(T, X, \mathcal{B}, \nu)$. The basic connection between them is the variational principle:
$$h_{\mathrm{top}}(T, X) = \sup_{\nu \in M(T, X)} h(T, X, \mathcal{B}(X), \nu) = \sup_{\nu \in M^e(T, X)} h(T, X, \mathcal{B}(X), \nu).$$

Furstenberg system Let $u \in \mathbb{D}^{\mathbb{Z}}$. For each $\nu \in V(u)$, the system $(S, X_u, \mathcal{B}(X_u), \nu)$ is called a Furstenberg system of $u$. For each $\nu \in V^{\log}(u)$, the system $(S, X_u, \mathcal{B}(X_u), \nu)$ is called a logarithmic Furstenberg system of $u$.
For each $u \in \mathbb{D}^{\mathbb{Z}}$, one can consider $|u| \in [0,1]^{\mathbb{Z}}$. Then $(S, X_{|u|})$ is a topological factor of $(S, X_u)$ (the map $\pi : X_u \to X_{|u|}$ given by $\pi(x) = |x|$, understood coordinatewise, is equivariant with $S$). For $\nu \in V(u)$, we have $\pi_*(\nu) \in V(|u|)$, where $\pi_*(\nu)$ stands for the image of $\nu$ via $\pi$. Moreover, the Furstenberg system $(S, X_u, \mathcal{B}(X_u), \nu)$ is an extension of $(S, X_{|u|}, \mathcal{B}(X_{|u|}), \pi_*(\nu))$. In particular, if $V(|u|)$ is a singleton ($|u|$ is a generic point), then all Furstenberg systems of $u$ have $(S, X_{|u|}, \mathcal{B}(X_{|u|}), \nu)$ (where $\nu$ is the unique member of $V(|u|)$) as their factor.

T-invariant measures is denoted by $M(T, X)$. The subset of ergodic measures (which is always non-empty) is denoted by $M^e(T, X)$. Each $\nu \in M(T, X)$ gives rise to a measure-theoretic dynamical system $(T, X, \mathcal{B}(X), \nu)$, where $\mathcal{B}(X)$ stands for the $\sigma$-algebra of Borel subsets of $X$. With the weak-$*$-topology, $M(T, X)$ becomes a compact metrizable space. $(T, X)$ is called uniquely ergodic if $|M(T, X)| = 1$.

Joinings of measure-theoretic dynamical systems Assume that $(R_i, Z_i, \mathcal{C}_i, \kappa_i)$ is a measure-theoretic dynamical system, $i = 1, 2$. Each $R_1 \times R_2$-invariant measure $\rho$ on $\mathcal{C}_1 \otimes \mathcal{C}_2$ projecting on $\kappa_1$ and $\kappa_2$, respectively, is called a joining of the automorphisms $R_1$ and $R_2$. The set of joinings between $R_1$ and $R_2$ is denoted by $J(R_1, R_2)$ and each $\rho \in J(R_1, R_2)$ yields a (new) measure-theoretic dynamical system $(R_1 \times R_2, Z_1 \times Z_2, \mathcal{C}_1 \otimes \mathcal{C}_2, \rho)$. When $R_1$ and $R_2$ are both ergodic, then the set $J^e(R_1, R_2)$ of ergodic joinings between $R_1$ and $R_2$ is non-empty. If $J(R_1, R_2) = \{\kappa_1 \otimes \kappa_2\}$, then $R_1$ and $R_2$ are called disjoint (in the sense of Furstenberg).
Generic point Assume that (T, X) is a topologi- Let (T, X) be a topological dynamical system. Fix
cal dynamical system. We say that x X is a x X and let u be an arithmetic function
generic point for a Borel measure n on X, bounded by 1, that is, u : ℕ ! : Each accumu-
whenever the ergodic theorem holds for T at lation point r of N1 nN dðT x,S uÞ is a (T S)-
n n
nN
Invariant measure Given a topological dynam-
ical system (T, X), the set of probability Borel for each f C (X) and x X.
Sarnak’s Conjecture from the Ergodic Theory Point of View 295
$$V^{\log}(x) \cap M^e(T,X) \subset V(x). \tag{1}$$
Moreover, using an idea from Tao (n.d.), it has been proved in Gomilko et al.
(2020) that if $V^{\log}(x) = \{\nu\}$ and ν is ergodic, then
$$\lim_{k\to\infty}\frac{1}{N_k}\sum_{n\le N_k}\delta_{T^n x} = \nu, \tag{2}$$
for a subset
Sarnak in 2010 formulated a now celebrated conjecture on the Möbius
orthogonality. It states that each topological dynamical system (T, X) of
zero entropy is Möbius orthogonal, that is, whenever $h_{top}(T, X) = 0$,
$$\lim_{N\to\infty}\frac{1}{N}\sum_{n\le N} f(T^n x)\mu(n) = 0 \tag{4}$$
of topological systems in which Möbius disjointness has been shown. We
concentrate on main ideas, and describe some general results.
Introduction
The entry is organized as follows. In sections “Chowla Conjecture” and
“Sarnak’s Conjecture,” respectively, we discuss Chowla and Sarnak’s
conjectures from the ergodic-theoretic viewpoint, including dynamical
interpretations of purely number-theoretic statements and the main strategies
used to attack Sarnak’s conjecture. Section “Arithmetic Properties of the
Möbius Function” is a survey of results on Sarnak’s conjecture, arranged with
respect to the properties of the Möbius function that come into play in the
proof. Finally, in section “Future Directions,” we state some open problems.
the point $\mu \in \{-1,0,1\}^{\mathbb{Z}}$ is generic for the relatively
independent extension $\hat{\nu}_S$ of $\nu_S$, (5)
via the natural map
$\{-1,0,1\}^{\mathbb{Z}} \ni (x_n)_{n\ge 1} \mapsto (x_n^2)_{n\ge 1} \in \{0,1\}^{\mathbb{Z}}$.
The measure $\hat{\nu}_S$ is given by the following condition: for each block
C over the alphabet {−1, 0, 1}, we have
$\hat{\nu}_S(C) = 2^{-k}\,\nu_S(C^2)$, where $C^2$ is obtained from C by
taking the square (or, equivalently, absolute value) at each coordinate and k
is the cardinality of the support of C. To see this, we use the following:
Remark 4.1 The span of the family of continuous functions
$\{F^{j_0}\circ S^{k_0}\cdot F^{j_1}\circ S^{k_1}\cdot\ldots\cdot F^{j_\ell}\circ S^{k_\ell} : j_i \ge 0,\ k_i \in \mathbb{Z}\}$
forms an algebra that distinguishes points. It follows directly from the
Stone-Weierstrass theorem that the values of integrals
$\int F^{k_1}\circ S^{r_1}\cdot F^{k_2}\circ S^{r_2}\cdot\ldots\cdot F^{k_\ell}\circ S^{r_\ell}\,d\kappa$
determine a measure κ.
= 0,
if for some 0 ≤ j ≤ k, $u_j$ is strongly aperiodic. This conjecture was
stated first in Elliott (1992) and Elliott (1994), and in its original
version turned out to be false, see (Matomäki et al. 2015) for details. Also,
a logarithmically averaged version of Elliott conjecture appears in the
literature. For the details, see Tao and Teräväinen (2019a, b) and the
references therein.
Even though Sarnak’s Conjecture is defined in terms of topological dynamics,
it can be translated to ergodic-theoretic language. Namely
$$\frac{1}{N}\sum_{n\le N} f(T^n x)\mu(n) = \int_{X\times X_{\mu}} f\otimes F\ d\Big(\frac{1}{N}\sum_{n\le N}\delta_{(T^n x,\,S^n\mu)}\Big).$$
Thus, we need to study the properties of joinings given by the limit points of
crete and cocompact subgroup Γ. Let p : ℤ → G be any of its polynomial
sequences: $p(n) = a_1^{p_1(n)}\cdots a_k^{p_k(n)}$, where
$p_j : \mathbb{N} \to \mathbb{N}$ is a polynomial, j = 1, . . . , k and
f : G/Γ → ℝ a Lipschitz function. Then
$$\sum_{n\le N} f(p(n)\Gamma)\mu(n) = O_{f,G,\Gamma,A}\Big(\frac{N}{\log^{A} N}\Big)$$
for all A > 0.
tor $(S, X_{\mu^2}, \nu_S)$. To complete the proof, we use the orthogonality
of F to $L^2(X_{\mu^2}, \nu_S)$.
Remark 5.5 It still remains open whether (S) implies (C), see however Remark
5.16.
In Huang et al. (2019a), Möbius orthogonality for low complexity systems is
discussed. Following (Ferenczi 1997), we say that the measure-complexity of
μ ∈ M(T, X) is weaker than $a = (a_n)_{n\ge 1}$ if
for each ε > 0 (here
$d_n(y, z) = \frac{1}{n}\sum_{j=1}^{n} d(T^j y, T^j z)$).
Theorem 5.6 (Huang et al., 2019a) Suppose that (C) holds for correlations of
order 2 (i.e., for r = 1). Then (T, X) is Möbius orthogonal whenever all
invariant measures for (T, X) are of complexity weaker than n.
To obtain a non-conditional result, Huang, Wang, and Ye used a difficult
estimate of Matomäki, Radziwiłł, and Tao (namely, “Truncated Elliott on the
average,” applied to μ) from (Matomäki et al. 2015). The cost to be paid is a
further strengthening of the assumptions on the complexity of (T, X).
Theorem 5.7 (Huang et al. 2019a) Suppose that all invariant measures of
(T, X) are of sub-polynomial complexity, that is, their complexity is weaker
than $(n^{\tau})_{n\ge 1}$ for each τ > 0. Then (T, X) is Möbius orthogonal.
See Huang et al. (n.d.) for the most recent application of this result.
Finally, let us point out a consequence of the result on correlations of μ of
order 2. Directly from Corollary 4.12, we have:
Corollary 5.8 All topological dynamical systems whose all invariant measures
yield systems with discrete spectrum are Möbius orthogonal.
In the uniquely ergodic case, an earlier and independent proof of this fact
was given by Huang, Wang, and Zhang (Huang et al. 2019b) (for the totally
uniquely ergodic case, see El Abdalaoui et al. (2017b)). The result also
follows from Huang et al. (2019a).
Strong MOMO Property
Given an arithmetic function u, following El Abdalaoui et al. (2018), we say
that (T, X) satisfies the strong u-OMO property if, for any increasing
sequence of integers $0 = b_0 < b_1 < b_2 < \ldots$ with
$b_{k+1} - b_k \to \infty$, for any sequence $(x_k)$ of points in X, and any
f ∈ C(X), we have
$$\frac{1}{b_K}\sum_{k<K}\Big|\sum_{b_k\le n<b_{k+1}} f(T^{n-b_k}x_k)\,u(n)\Big| \xrightarrow[K\to\infty]{} 0. \tag{8}$$
If u = μ, we speak about the strong MOMO property. (The acronym MOMO stands
for Möbius Orthogonality of Moving Orbits.) Strong MOMO property was
introduced in El Abdalaoui et al. (2018) to deal with Möbius orthogonality of
uniquely ergodic models of a given measure-theoretic dynamical system.
Moreover, we have:
Theorem 5.9 (El Abdalaoui et al. 2018) The following conditions are
equivalent:
(i) All zero entropy systems are Möbius orthogonal, that is, Sarnak’s
conjecture holds.
(ii) For each zero entropy system (T, X), we have
$\frac{1}{N}\sum_{n\le N} f(T^n x)\mu(n) \to 0$ when N → ∞, for each
f ∈ C(X), uniformly in x ∈ X, that is, uniform Sarnak’s conjecture holds.
(iii) All zero entropy systems enjoy the strong MOMO property.
By taking f = 1, we obtain that strong u-OMO implies the following:
$$\frac{1}{b_K}\sum_{k<K}\Big|\sum_{b_k\le n<b_{k+1}} u(n)\Big| \xrightarrow[K\to\infty]{} 0 \tag{9}$$
for every sequence $0 = b_0 < b_1 < b_2 < \ldots$ with
$b_{k+1} - b_k \to \infty$. In particular,
$\frac{1}{N}\sum_{n\le N} u(n) \xrightarrow[N\to\infty]{} 0$. In a similar way
(by considering finite rotations), one can deduce
$\frac{1}{N}\sum_{n\le N} u(an+b) \xrightarrow[N\to\infty]{} 0$. Thus, (8) can
be seen as a form of aperiodicity. A further analysis reveals that, in fact,
we deal with a special behaviour of u on a typical short interval. All
strongly aperiodic multiplicative functions satisfy (9) (this follows from
Theorem A.1 (Matomäki et al. 2015)), hence condition (9) is satisfied both
for μ and λ, cf. section “Behaviour on Short Intervals.”
Recently, in Gomilko et al. (2020), the strong u-OMO property was rephrased
in the language of functional analysis; and it is equivalent to
$$\lim_{K\to\infty}\frac{1}{b_{K+1}}\sum_{k\le K}\Big\|\sum_{b_k\le n<b_{k+1}} u(n)\,f\circ T^n\Big\|_{C(X)} = 0 \quad \text{for all } f\in C(X),\ (b_k) \text{ as above.}$$
Usefulness of the strong MOMO concept is seen in the following result:
Proposition 5.10 (El Abdalaoui et al. 2018) If $(R, Z, \mathcal{D}, \kappa)$
is an ergodic (measure-theoretic) dynamical system and (T, X) is its uniquely
ergodic model satisfying the strong MOMO property, then all uniquely ergodic
models of $(R, Z, \mathcal{D}, \kappa)$ are Möbius orthogonal. In fact, the
strong MOMO holds in all of them.
Möbius orthogonality of Positive Entropy Systems
If we take a positive entropy system (T, X), it is natural to expect that it
is not Möbius orthogonal. Indeed, trivially, the full shift on
$\{0, 1\}^{\mathbb{Z}}$ is not, and more generally subshifts of finite type
are not, see Karagulyan (2017). One can also show that the subshift
$(S, X_{\mu^2})$ (which is of positive entropy, see Peckner (2015)) is not
Möbius orthogonal, despite the fact that μ² itself is a completely
deterministic point and Möbius orthogonality holds at it:
$\lim_{N\to\infty}\frac{1}{N}\sum_{n\le N} f(S^n\mu^2)\mu(n) = 0$ for each
$f \in C(X_{\mu^2})$, see Ferenczi et al. (2018). However, Sarnak’s conjecture
does not exclude a possibility that some positive entropy system is also
Möbius orthogonal. (It is mentioned in Sarnak (n.d.) that Bourgain
(unpublished) had such a construction.) Downarowicz and Serafin proved the
following general result:
Theorem 5.11 (Downarowicz and Serafin 2019a) Fix an integer N ≥ 2. Let u be
any bounded, real, aperiodic sequence. Then, there exists a subshift (S, X)
over N symbols of entropy arbitrarily close to log N, uncorrelated to u:
$\lim_{N\to\infty}\frac{1}{N}\sum_{n\le N} f(S^n x)u(n) = 0$ for each
f ∈ C(X) and x ∈ X.
Even more surprisingly, they proved a uniform version of the above result:
Theorem 5.12 (Downarowicz and Serafin 2019b) Under the same assumption on u,
given N ≥ 2, there exists a strictly ergodic subshift over N symbols, of
entropy arbitrarily close to log N, uniformly uncorrelated to u.
Realizing that, one might be anxious what finally is the class of systems
which are Möbius orthogonal, and in particular, why zero entropy should play
a special role. As we will see, however, positive entropy systems are not
expected to enjoy the strong MOMO property, cf. Theorem 5.9. Indeed, the
following has been proved in El Abdalaoui et al. (2018) (the result has been
proved in El Abdalaoui et al. (2018) for the Liouville function but it can
also be proved for μ):
Theorem 5.13 (El Abdalaoui et al. 2018) Let
$u \in \{-1, 0, 1\}^{\mathbb{Z}}$ be a generic point for the measure
$\nu_{\mu^2}$. Then the following conditions are equivalent:
(i) (T, X) satisfies strong u-OMO property.
(ii) (T, X) is of zero entropy.
As an immediate consequence, we have the following:
Corollary 5.14 If Chowla conjecture holds, then a system (T, X) has the
strong MOMO property if and only if it has zero entropy.
($S_{\log}$) vs. ($C_{\log}$)
The logarithmic version of Sarnak’s conjecture was formulated in Tao (2016)
along with ($C_{\log}$) and it postulates that
$$\lim_{N\to\infty}\frac{1}{\log N}\sum_{n\le N}\frac{1}{n}\,f(T^n x)\mu(n) = 0 \tag{$S_{\log}$}$$
(with all parameters as in (4)). In Tao (2017), Tao showed the following:
Theorem 5.15 ($S_{\log}$) is equivalent to ($C_{\log}$).
Remark 5.16 Combining Theorem 5.15 with Corollary 4.3, we obtain that
($S_{\log}$) implies (C) along a subsequence of logarithmic density 1. In
particular, (S) implies (C) along a subsequence of full logarithmic density.
Let us here recall one more “logarithmic conjecture” from (Tao 2017) which
confirms a special role played by nil-systems in dynamics. Let
$(T_{g_0}, G/\Gamma)$ be a nilrotation. Let f ∈ C(G/Γ) be
Lipschitz continuous and $x_0 \in G$. Then (for H ≤ N with H → ∞)
= o(H log N). ($S^{nil}_{\log}$)
Theorem 5.17 (Tao 2017) ($S^{nil}_{\log}$) is equivalent to ($S_{\log}$) (and
($C_{\log}$)).
Finally, as a consequence of the result on logarithmic correlations of μ of
order 2 (using Corollary 4.7), we obtain:
Corollary 5.18 All topological dynamical systems whose all invariant measures
yield systems with singular spectrum are logarithmically Möbius orthogonal.
namely so-called logarithmic strong MOMO property (cf. section “Strong MOMO
Property”):
Equivalently, for all increasing sequences $(b_k) \subset \mathbb{N}$ with
$b_{k+1} - b_k \to \infty$, all $(x_k) \subset X$ and f ∈ C(X),
$$\lim_{K\to\infty}\frac{1}{\log b_{K+1}}\sum_{k\le K}\Big|\sum_{b_k\le n<b_{k+1}}\frac{1}{n}\,f(T^{n-b_k}x_k)\mu(n)\Big| = 0.$$
Theorem 5.20 (Gomilko et al. 2020) Assume that a topological system (T, X)
satisfies the logarithmic strong MOMO property. Then there exists a sequence
A = A(T, X) ⊂ ℕ with full logarithmic density such that, for each f ∈ C(X),
could be used to prove orthogonality from μ, namely, internal disjointness.
Here, a priori, one does not use any other property of μ than
multiplicativity and boundedness.
B. As we have seen, (S) is intimately related to (C), and therefore one
cannot expect to confirm (S) without using further number-theoretic
properties of μ. This directs attention to aperiodicity and behavior on the
so-called short intervals. It extends further to studying Furstenberg systems
of μ (including the logarithmic ones) and trying to interpret arithmetic
properties of μ as ergodic properties of the corresponding dynamical systems.
One can hope finally to deduce (some kind of) Furstenberg (!) disjointness of
Furstenberg systems of μ with a wide subclass of zero entropy systems
(hopefully, with all such systems).
As we will see, these two approaches often intertwine, proving once again
that number theory and ergodic theory should not be studied separately from
each other.
Arithmetic Properties of the Möbius Function
Multiplicativity
Internal disjointness. Joinings (introduced in a seminal paper of Furstenberg
(Furstenberg 1967)) have been present in ergodic theory for over 50 years.
Disjointness (absence of nontrivial joinings), as a form of an extremal
non-isomorphism and a measure-theoretic invariant, has always played a
crucial role in classification problems. (Recall also that different powers
for a typical automorphism of a standard Borel space are pairwise disjoint
(del Junco 1981). See also more recent Kanigowski et al. (2020).) It
appeared, however, in many other contexts, including homogenous dynamics,
with applications in number theory. Sarnak’s conjecture gave yet a new
impetus, in particular for studying (approximate) disjointness for different
sub-actions.
A basic method to prove orthogonality with a multiplicative function comes
from the multiplicative orthogonality criterion (MOC):
Theorem 6.1 (Bourgain et al. 2013; Daboussi, Kátai 1986) Assume that
$(f_n) \subset \mathbb{C}$ is a bounded sequence. Assume that for all
(sufficiently large) prime numbers p ≠ q,
$$\lim_{N\to\infty}\frac{1}{N}\sum_{n\le N} f_{pn}\,\overline{f_{qn}} = 0. \tag{10}$$
Then, for each bounded multiplicative function u, we have
$\lim_{N\to\infty}\frac{1}{N}\sum_{n\le N} f_n\,u(n) = 0$. In particular,
$(f_n)$ is Möbius orthogonal.
Remark 6.2 Notice that Theorem 6.1 does not require from u anything but
multiplicativity and boundedness.
In the dynamical context (T, X) the simplest way to use Theorem 6.1 is to
take $f_n = f(T^n x)$. In this form, MOC appeared for the first time in
Bourgain et al. (2013) and was used to prove that the horocycle flows are
Möbius orthogonal. To see how MOC is used and how it is related to
Furstenberg’s disjointness theory (Furstenberg 1967), assume that
M(T, X) = {μ}, $\int_X f\,d\mu = 0$, and the corresponding measure-theoretic
system is totally ergodic. Then, any measure ρ ∈ V((x, x)) (considered in the
topological dynamical system $(T^p \times T^q, X \times X)$) is a joining of
$T^p$ and $T^q$. If we now assume that $(T^p, X, \mu)$ and $(T^q, X, \mu)$
are disjoint for sufficiently large primes p ≠ q, then ρ = μ ⊗ μ and, as a
result, the limit in (10) equals
$\int_{X\times X} f\otimes\overline{f}\,d\rho = 0$, that is, the assumptions
of MOC are satisfied.
In general, a use of MOC is not that simple. Consider an irrational rotation
Tx = x + α on the circle X = ℝ/ℤ. To see that (10) holds for all characters,
one uses the Weyl criterion on uniform distribution. However, there are
continuous zero mean functions for which (10) fails (Kułaga-Przymus and
Lemańczyk 2015), which shows clearly, that in general we can only expect (10)
to hold for a linearly dense set of continuous functions.
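The rotation example can be illustrated numerically. For a character $f(x) = e^{2\pi i m x}$ and $f_n = f(T^n x_0)$, the average in (10) is a geometric sum with ratio $e^{2\pi i m(p-q)\alpha}$, so its modulus is at most $1/(N\,|\sin(\pi m (p-q)\alpha)|)$ and tends to zero when α is irrational. The sketch below is only an illustration under stated assumptions: the names `moc_average` and `geometric_bound` and the choice α = √2 are hypothetical and do not come from the entry.

```python
import cmath
import math


def moc_average(alpha, x0, m, p, q, N):
    """(1/N) * sum_{n<=N} f(T^{pn} x0) * conj(f(T^{qn} x0))
    for the rotation T x = x + alpha and the character f(x) = exp(2*pi*i*m*x)."""
    total = 0j
    for n in range(1, N + 1):
        f_pn = cmath.exp(2j * math.pi * m * (x0 + p * n * alpha))
        f_qn = cmath.exp(2j * math.pi * m * (x0 + q * n * alpha))
        total += f_pn * f_qn.conjugate()
    return total / N


def geometric_bound(alpha, m, p, q, N):
    """Bound 1 / (N * |sin(pi*m*(p-q)*alpha)|) on the modulus of moc_average.
    Assumes m*(p-q)*alpha is not an integer (true for irrational alpha, m != 0, p != q)."""
    return 1.0 / (N * abs(math.sin(math.pi * m * (p - q) * alpha)))


if __name__ == "__main__":
    alpha = math.sqrt(2)  # assumed irrational rotation angle
    for N in (100, 1000, 10000):
        avg = moc_average(alpha, 0.3, 1, 3, 5, N)
        print(N, abs(avg), geometric_bound(alpha, 1, 3, 5, N))
```

The computation covers characters only; as the text notes, (10) may fail for other continuous zero mean functions, so this check says nothing about general f.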
In some cases, MOC cannot be applied directly (e.g., when the systems under
consideration fail to be weakly mixing) and the spectral approach can help.
Examples can be found in El Abdalaoui et al. (2014, 2016) and Bourgain
(2013).
AOP property. The following ergodic counterpart of MOC was developed in El
Abdalaoui et al. (2017b): an ergodic automorphism $(T, X, \mathcal{B}, \mu)$
is said to have asymptotically orthogonal powers (AOP) if for each
$f, g \in L^2_0(X, \mathcal{B}, \mu)$, we have
$$\lim_{\mathbb{P}\ni p,q\to\infty,\ p\ne q}\ \sup_{\kappa\in J^e(T^p,T^q)}\Big|\int_{X\times X} f\otimes g\,d\kappa\Big| = 0. \tag{11}$$
Clearly, if the powers of T are pairwise disjoint, then T enjoys the AOP
property. However, this condition is not necessary, the powers of T having
AOP property may even be isomorphic. Moreover, AOP implies total ergodicity
and zero entropy (El Abdalaoui et al., 2017b). The relation between strong
MOMO and AOP properties is described by the following result:
Theorem 6.3 (El Abdalaoui et al. 2017a, b) Let u be a bounded multiplicative
function. Suppose that $(R, Z, \mathcal{C}, \kappa)$ satisfies AOP. Then the
following are equivalent:
• u satisfies (9);
• The strong u-OMO property is satisfied in each uniquely ergodic model
(T, X) of R.
In particular, if the above holds, for each f ∈ C(X), we have
$$\frac{1}{N}\sum_{n\le N} f(T^n x)\,u(n) \xrightarrow[N\to\infty]{} 0 \quad \text{uniformly on } X.$$
Aperiodicity
As all periodic sequences are orthogonal to μ, one can expect that sequences
with some properties similar to periodicity will also be Möbius orthogonal.
Notice also that Möbius orthogonality of periodic sequences ((S) for
rotations on finite groups) corresponds to μ being aperiodic. This is the
simplest situation where some additional properties of μ (other than
multiplicativity) begin to play a significant role.
Zero entropy continuous interval maps. In Karagulyan (2015), (S) for zero
entropy continuous interval maps and orientation-preserving circle
homeomorphisms is established. The starting point for developing the main
tools is the result of Davenport (7), which shows clearly that the examples
under consideration are indeed “relatives” of irrational rotations.
Additionally, in order to treat the case of interval maps one studies
ω-limit sets and it turns out that, in fact, one deals with an odometer.
Synchronized automata. In Deshouillers et al. (2015), Deshouillers, Drmota,
and Müllner prove that (S) is true for automatic sequences generated by
synchronizing automata (the inputs are read with the most significant digit
first). In fact, they prove orthogonality of such sequences from any bounded
function u that is aperiodic.
Almost periodic sequences. We say that a sequence is Weyl rationally almost
periodic (WRAP) whenever it can be approximated arbitrarily well by periodic
sequences in Weyl pseudo-metric $d_W$ given by
$d_W(x, y) = \limsup_{N\to\infty}\sup_{\ell\ge 1}\frac{1}{N}\big|\{\ell \le n \le \ell + N : x(n) \ne y(n)\}\big|$.
It is proved in Bergelson et al. (2019) that each subshift $(S, X_x)$ given by
a Weyl almost periodic sequence is Möbius orthogonal (in fact, we have
orthogonality to any bounded aperiodic arithmetic function u).
Behaviour on Short Intervals
During the last four years, an enormous progress concerning the short
interval behavior of strongly aperiodic multiplicative functions has been
made due to the breakthrough result of Matomäki and Radziwiłł. The main
result of Matomäki and Radziwiłł (2016), for μ, in its simplified form can be
written as
$$\lim_{\substack{M,H\to\infty \\ H=o(M)}}\frac{1}{M}\sum_{1\le m\le M}\Big|\frac{1}{H}\sum_{m\le h<m+H}\mu(h)\Big| = 0. \tag{12}$$
This gave an impetus to study convergence on short intervals in ergodic
theory and it has become
a new, crucial player from the point of view of Sarnak’s conjecture.
Condition (12) can be also reformulated in the following way: for each
$(b_n) \subset \mathbb{N}$ with $b_{n+1} - b_n \to \infty$, we have
$$\lim_{K\to\infty}\frac{1}{b_{K+1}}\sum_{k\le K}\Big|\sum_{b_k\le n<b_{k+1}}\mu(n)\Big| = 0, \tag{13}$$
cf. Section “Strong MOMO Property” 5.3.
Almost periodic sequences. In Bergelson et al. (2019), in case of WRAP x, the
authors also ask about the behavior of averages of the form
$$\frac{1}{H}\sum_{m\le h<m+H} f(S^h z)\,\mu(h) \tag{14}$$
(where $z \in X_x$) for large values of H and arbitrary m ∈ ℕ. Under (C),
convergence to zero uniformly in m does not take place; however, it is shown
in Bergelson et al. (2019) that for a “typical” m ∈ ℕ the averages in (14)
are small. The key argument in the proof comes from a result of Matomäki,
Radziwiłł, and Tao:
Moreover, it is shown that all synchronized automata yield WRAP sequences.
Thus, the above theorem strengthens the aforementioned result by
Deshouillers, Drmota, and Müllner in Deshouillers et al. (2015).
Rigid systems. In Kanigowski et al. (n.d.-a), Kanigowski, Lemańczyk, and
Radziwiłł study rigid systems. (A measure-theoretic system
$(R, Z, \mathcal{C}, \kappa)$ is rigid if, for some increasing sequence
$(q_n)$ of natural numbers, we have $f \circ R^{q_n} \to f$ in
$L^2(Z, \kappa)$ for each $f \in L^2(Z, \kappa)$. Rigid systems are of zero
entropy. Moreover, the typical measure-theoretic automorphism is rigid and
weakly mixing.) To formulate their results, we need some definitions and
facts. Given a natural number q, the sum
$\sum_{\mathbb{P}\ni p\mid q} 1/p$ is called the prime volume of q. The prime
volume grows slowly with q:
$$\sum_{p\mid q}\frac{1}{p} \le \log\log\log q + O(1).$$
However “most” of the time, the prime volume of q stays bounded: if we set
(16)
They prove the following:
Theorem 6.6 (a) Assume the (T, X) is a good topological system. Then (T, X)
is Möbius orthogonal.
(b) Suppose that each ergodic invariant measure of (T, X) yields either BPV
rigidity or PR rigidity and $M^e(T, X)$ is countable then (T, X) is Möbius
orthogonal.
A key tool here is a strengthening of the main result of Matomäki and
Radziwiłł (Matomäki & Radziwiłł 2016) (cf. (12)) to short interval behaviour
along arithmetic progressions:
Theorem 6.7 (Kanigowski et al. n.d.-a) For each ε > 0, there exists $L_0$
such that for each $L \ge L_0$ and q ≥ 1 satisfying
$\sum_{p\mid q} 1/p \le (1-\varepsilon)\sum_{p\le L} 1/p$ we can find
$M_0 = M_0(q, L)$ such that for all $M \ge M_0$, we have
$$\sum_{j=0}^{M/Lq}\ \sum_{a=0}^{q-1}\Big|\sum_{\substack{m\in[z+jLq,\ z+(j+1)Lq) \\ m\equiv a \bmod q}}\mu(m)\Big| < \varepsilon M$$
for some 0 ≤ z < Lq.
Remark 6.8 Despite the fact that PR rigidity does not seem to be stable under
different (uniquely ergodic) models of a measure-preserving transformation,
assuming (T, X) is uniquely ergodic, it is proved that (T, X) satisfies the
strong MOMO property whenever its unique invariant measure yields either BPV
or PR rigidity. Via Proposition 5.10, we obtain that if in a model PR
rigidity holds, all of the models are Möbius orthogonal. This, in particular,
applies to all ergodic transformations with discrete spectrum. Moreover, it
is shown in Kanigowski et al. (n.d.-a) that for a.e. IET (of d ≥ 3 intervals)
BPV rigidity holds, so a.e. IET (and all their uniquely ergodic models) is
Möbius orthogonal. This is to be compared with previously known results for
3-IETs (Bourgain 2013; Chaika and Eskin 2019; Ferenczi and Mauduit 2018;
Karagulyan 2020). Other applications are given for $C^{2+\varepsilon}$ Anzai
skew products and for some so-called Rokhlin extensions of rotations.
One more of the consequences is the following result:
Corollary 6.9 (Kanigowski et al. n.d.-a) No Furstenberg system of the Möbius
function μ is either BPV or PR rigid. The same holds for the Liouville
function λ.
Logarithmic Furstenberg Systems
Frantzikinakis and Host study logarithmic Furstenberg systems associated to μ
(and λ). They prove the following remarkable result:
Theorem 6.10 (Frantzikinakis and Host 2018) Each zero entropy topological
system (T, X) with only countably many ergodic measures is logarithmically
Möbius orthogonal.
In particular, uniquely ergodic systems of zero topological entropy satisfy
($S_{\log}$). The key argument in the proof of Theorem 6.10 is the following
structural result on the logarithmic Furstenberg systems of μ and λ:
Theorem 6.11 (Frantzikinakis and Host 2018) Each logarithmic Furstenberg
system of μ or λ is a factor of a system that:
• has no irrational spectrum,
• has ergodic components isomorphic to direct products of infinite-step
nilsystems and Bernoulli systems.
The starting point for the proof of the above theorem, resulting in a
reduction of the problem to purely ergodic context, is an identity of Tao
(implicit in (Tao 2016)) showing that self-correlations of μ (and λ) are
averages of its dilated self-correlations with prime dilates. Frantzikinakis
and Host also prove that logarithmic Furstenberg systems of μ (and λ) are
“almost
For μ we need to take into account that its Furstenberg systems have the
discrete spectrum factor given by the Mirsky measure of μ².
Furstenberg Disjointness in Non-ergodic Case
As Host and Frantzikinakis’ analysis shows, if the (potential) logarithmic
Furstenberg systems of λ or μ are non-ergodic, then they are very
non-ergodic. One of open questions by Frantzikinakis is whether the system
$\mathbb{T}^2 \ni (x, y) \mapsto (x, x + y) \in \mathbb{T}^2$ considered with
Lebesgue measure can be a Furstenberg system of λ. Of course this example is
a measure-theoretic system which is Furstenberg disjoint from all ergodic
systems. It seems to be a problem of independent interest to fully understand
the class of transformations disjoint from all ergodic transformations.
Bibliography
Bergelson V, Kułaga-Przymus J, Lemańczyk M, Richter FK (2019) Rationally
almost periodic sequences, polynomial multiple recurrence and symbolic
dynamics. Ergodic Theory Dyn Syst 39(9):2332–2383
Bourgain J (2013) On the correlation of the Moebius function with rank-one
systems. J Anal Math 120:105–130
Bourgain J, Sarnak P, Ziegler T (2013) Disjointness of Möbius from horocycle
flows. In: From Fourier analysis and number theory to Radon transforms and
geometry, volume 28 of Dev. Math., pp 67–83. Springer, New York
Chaika J, Eskin A (2019) Möbius disjointness for interval exchange
transformations on three intervals. Journal of Modern Dynamics 14(1):55–86
Chowla S (1965) The Riemann hypothesis and Hilbert’s tenth problem.
Mathematics and its applications, vol 4. Gordon and Breach Science
Publishers, New York
Collectif (1975) Fonctions multiplicatives presque périodiques B. In Journées
arithmétiques de Bordeaux, number 24–25 in Astérisque, pp 321–324. Société
mathématique de France
Daboussi H, Delange H (1974) Quelques propriétés des fonctions
multiplicatives de module au plus égal à 1. C R Acad Sci Paris Sér A
278:657–660
Daboussi H, Delange H (1982) On multiplicative arithmetical functions whose
modulus does not exceed one. J Lond Math Soc 26(2):245–264
Davenport H (1937) On some infinite series involving arithmetical functions.
II. Quart J Math Oxford 8:313–320
del Junco A (1981) Disjointness of measure-preserving transformations,
minimal selfjoinings and category. In: Ergodic theory and dynamical systems,
I (College Park, Md., 1979–80), volume 10 of Progress in mathematics,
pp 81–89. Birkhäuser, Boston
Deshouillers J-M, Drmota M, Müllner C (2015) Automatic sequences generated by
synchronizing automata fulfill the Sarnak conjecture. Studia Math
231(1):83–95
Downarowicz T (2011) Entropy in dynamical systems, volume 18 of New
Mathematical Monographs. Cambridge University Press, Cambridge
Downarowicz T, Serafin J (2019a) Almost full entropy subshifts uncorrelated
to the Möbius function. Int Math Res Not 2019(11):3459–3472
Downarowicz T, Serafin J (2019b) A strictly ergodic, positive entropy
subshift uniformly uncorrelated to the Möbius function. Studia Math
251(2020):195–206. Published online (Online First)
El Abdalaoui EH, Lemańczyk M, de la Rue T (2014) On spectral disjointness of
powers for rank-one transformations and Möbius orthogonality. J Funct Anal
266(1):284–317
El Abdalaoui EH, Kasjan S, Lemańczyk M (2016) 0-1 sequences of the Thue-Morse
type and Sarnak’s conjecture. Proc Am Math Soc 144(1):161–176
El Abdalaoui EH, Kułaga-Przymus J, Lemańczyk M, de la Rue T (2017a) The
Chowla and the Sarnak conjectures from ergodic theory point of view. Discrete
Contin Dyn Syst 37(6):2899–2944
El Abdalaoui EH, Lemańczyk M, de la Rue T (2017b) Automorphisms with
Quasidiscrete spectrum, multiplicative functions and average orthogonality
along short intervals. Int Math Res Not 2017(14):4350–4368
El Abdalaoui EH, Kułaga-Przymus J, Lemańczyk M, de la Rue T (2018) Möbius
disjointness for models of an ergodic system and beyond. Israel J Math
228(2):707–751
Elliott PDTA (1990) Multiplicative functions |g| ≤ 1 and their convolutions:
an overview. In: Séminaire de Théorie des Nombres, Paris 1987–88, volume 81
of Progress in mathematics, pp 63–75. Birkhäuser Boston, Boston
Elliott PDTA (1992) On the correlation of multiplicative functions. Notas Soc
Mat Chile 11(1):1–11
Elliott PDTA (1994) On the correlation of multiplicative and the sum of
additive arithmetic functions. Mem Am Math Soc 112(538):viii+88
Ferenczi S (1997) Measure-theoretic complexity of ergodic systems. Isr J Math
100(1):189–207
Ferenczi S, Kułaga-Przymus J, Lemańczyk M (2018) Sarnak’s conjecture: what’s
new. In: Ferenczi S, Kułaga-Przymus J, Lemańczyk M (eds) Ergodic theory and
dynamical systems in their interactions with arithmetics and combinatorics,
volume 2213 of Lecture Notes in Math. Springer, Cham, pp 163–235
Ferenczi S, Mauduit C (2018) On Sarnak’s conjecture and Veech’s question for
interval exchanges. J Anal Math 134(2):545–573
Frantzikinakis N (2004) The structure of strongly stationary systems. J Anal
Math 93(1):359–388
Frantzikinakis N (2017) Ergodicity of the Liouville system implies the Chowla
conjecture. Discrete Anal, Paper no. 19, 41pp
Frantzikinakis N, Host B (2018) The logarithmic Sarnak conjecture for ergodic
weights. Ann of Math 187(3):869–931
Frantzikinakis N, Host B (2019) Furstenberg systems of bounded multiplicative
functions and applications. Int Math Res Not.
https://doi.org/10.1093/imrn/rnz037
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a
problem in Diophantine approximation. Math Syst Theory 1:1–49
Gomilko A, Kwietniak D, Lemańczyk M (2018) Sarnak’s conjecture implies the
Chowla conjecture along a subsequence. In: Ferenczi S, Kułaga-Przymus J,
Lemańczyk M (eds) Ergodic theory and dynamical systems in their interactions
with arithmetics and combinatorics, volume 2213 of Lecture Notes in
Mathematics. Springer, Cham, pp 237–247
Gomilko A, Lemańczyk M, de la Rue T (2020) Möbius orthogonality in density
for zero entropy dynamical systems. Pure and Applied Functional Analysis
5:1357–1376
Green B, Tao T (2012) The Möbius function is strongly orthogonal to
nilsequences. Ann Math 175(2):541–566
Huang W, Wang Z, Ye X (2019a) Measure complexity and Möbius disjointness. Adv
Math 347:827–858
Huang W, Wang Z, Zhang G (2019b) Möbius disjointness for topological models
of ergodic systems with discrete spectrum. J Mod Dyn 14:277–290
Huang W, Liu J, Wang K (n.d.) Möbius disjointness for skew products on a
circle and a nilmanifold. Preprint. https://arxiv.org/abs/1907.01735
Jenvey E (1997) Strong stationarity and de Finetti’s theorem. J Anal Math
73(1):1–18
Kamae T (1973) Subsequences of normal sequences. Isr
Kułaga-Przymus J, Lemańczyk M (2015) The Möbius function and continuous
extensions of rotations. Monatsh Math 178(4):553–582
Lemańczyk M, Müllner C (n.d.) Automatic sequences are orthogonal to aperiodic
multiplicative functions. Published online in Discrete Continuous Dynamical
Systems (2020). https://doi.org/10.3934/dcds.2020260
Matomäki K, Radziwiłł M (2016) Multiplicative functions in short intervals.
Ann Math 183(3):1015–1056
Matomäki K, Radziwiłł M, Tao T (2015) An averaged form of Chowla’s
conjecture. Algebra Number Theory 9(9):2167–2196
Müllner C (2017) Automatic sequences fulfill the Sarnak conjecture. Duke Math
J 166(17):3219–3290
Peckner R (2015) Uniqueness of the measure of maximal entropy for the
squarefree flow. Isr J Math 210(1):335–357
Ram Murty M, Vatwani A (2018) A remark on a conjecture of Chowla.
J Ramanujan Math Soc 33(2):111–123
Ramaré O (2018) Chowla’s conjecture: from the Liouville function to the
Moebius function. In: Ferenczi S, Kułaga-Przymus J, Lemańczyk M (eds) Ergodic
theory and dynamical systems in their interactions with arithmetics and
combinatorics, volume 2213 of Lecture notes in mathematics. Springer, Cham,
pp 317–323
Sarnak P (n.d.) Three lectures on the Möbius function, randomness and
dynamics. Webpage. http://publications.ias.edu/sarnak/
Tao T (2016) The logarithmically averaged Chowla and Elliott conjectures for
two-point correlations. Forum Math Pi 4:e8, 36
Tao T (2017) Equivalence of the logarithmically averaged Chowla and Sarnak
conjectures. In: Number theory – Diophantine problems, uniform distribution
and applications. Springer, Cham, pp 391–421
Tao T (n.d.) The logarithmically averaged and non-logarithmically averaged
Chowla conjectures.
J Math 16:121–149 Webpage, https://terrytao.wordpress.com/2017/10/20/
Kanigowski A, Lemańczyk M, Radziwiłł M. (n.d.-a) Rigidity thelogarithmically-averaged-and-non-logarithmically-
in dynamics and Möbius disjointness, to appear in averaged-chowlaconjectures/
Fundamenta Math. https://arxiv.org/abs/1905.13256 Tao T, Teräväinen J (2018) Odd order cases of the loga-
Kanigowski A, Lemańczyk M, Ulcigrai C. (2020) On rithmically averaged Chowla conjecture. J Théor
disjointness properties of some parabolic flows. Invent Nombres Bordeaux 30(3):997–1015
Math 221(1):1–111. https://arxiv.org/abs/1810.11576 Tao T, Teräväinen J (2019a) The structure of correlations of
Karagulyan D (2015) On Möbius orthogonality for interval multiplicative functions at almost all scales, with appli-
maps of zero entropy and orientation-preserving circle cations to the Chowla and Elliott conjectures. Algebra
homeomorphisms. Ark Mat 53(2):317–327 Number Theory 13(9):2103–2150
Karagulyan D (2017) On Möbius orthogonality for sub- Tao T, Teräväinen J (2019b) The structure of logarithmi-
shifts of finite type with positive topological entropy. cally averaged correlations of multiplicative functions,
Stud Math 237(3):277–282 with applications to the Chowla and Elliott conjectures.
Karagulyan D (2020) Hausdorff dimension of a class of Duke Math J 168(11):1977–2027
three-interval exchange maps. Discrete and Contin Weiss B (1971) Normal sequences as collectives. In: Pro-
Dynam Syst 40(3):1257–1281 ceedings of the symposium on topological dynamics
Kátai I (1986) A remark on a theorem of H. Daboussi. Acta and ergodic theory. University of Kentucky
Math Hungar 47(1–2):223–225
310 Sarnak’s Conjecture from the Ergodic Theory Point of View
Mauduit C, Rivat J (2015) Prime numbers along Rudin– Sarnak P (2012) Mobius randomness and dynamics. Not S
Shapiro sequences. J Eur Math Soc (JEMS) 17 Afr Math Soc 43(2):89–97
(10):2595–2642 Sawin W. Dynamical models for Liouville and obstruc-
McNamara R. Sarnak’s conjecture for sequences of almost tions to further progress on sign patterns. Preprint,
quadratic word growth. Preprint, https://arxiv.org/abs/ https://arxiv.org/abs/1809.03280
1901.06460 Soundararajan K (2017) The Liouville function in short
Mentzen MK (2017) Automorphisms of subshifts defined intervals. Astérisque, 2015/2016(390):Exp. No. 1119,
by B-free sets of integers. Colloq Math 147(1):87–94 453–479. Séminaire Bourbaki. Vol. 2015/2016.
Mirsky L (1947) Note on an asymptotic formula connected Exposés 1104–1119
with r-free integers. Quart J Math Oxford Ser 18:178–182 Sun W. Sarnak’s conjecture for nilsequences on arbitrary
Mirsky L (1949) Arithmetical pattern problems relating to number fields and applications. Preprint, https://arxiv.
divisibility by rth powers. Proc London Math Soc (2) org/abs/1902.09712
50:497–508 Tao T. Furstenberg limits of the Liouville function.
Peckner R (2018) Möbius disjointness for homogeneous Webpage, https://terrytao.wordpress.com/2017/03/05/
dynamics. Duke Math J 167(14):2745–2792 furstenberg-limits-of-the-liouville-function/
Pleasants PAB, Huck C (2013) Entropy and diffraction of Teräväinen J (2018) On binary correlations of multiplica-
the k-free points in n-dimensional lattices. Discrete tive functions. Forum Math Sigma 6:e10, 41
Comput Geom 50(1):39–68 Veech WA (2017) Möbius orthogonality for generalized
Ryzhikov VV (2013) Bounded ergodic constructions, Morse-Kakutani flows. Am J Math 139(5):1157–1203
disjointness, and weak limits of powers. Trans Moscow Wang Z (2017) Möbius disjointness for analytic skew
Math Soc 165–171 products. Invent Math 209(1):175–196
Smooth Ergodic Theory

Amie Wilkinson
Northwestern University, Evanston, IL, USA

Article Outline

Glossary
Definition of the Subject
Introduction
The Volume Class
The Fundamental Questions
Lebesgue Measure and Local Properties of Volume
Ergodicity of the Basic Examples
Hyperbolic Systems
Beyond Uniform Hyperbolicity
The Presence of Critical Points and Other Singularities
Future Directions
Bibliography

Glossary

Conservative, dissipative: Conservative dynamical systems (on a compact phase space) are those that preserve a finite measure equivalent to volume. Hamiltonian dynamical systems are important examples of conservative systems. Systems that are not conservative are called dissipative. Finding physically meaningful invariant measures for dissipative maps is a central object of study in smooth ergodic theory.

Distortion estimate: A key technique in smooth ergodic theory, a distortion estimate for a smooth map $f$ gives a bound on the variation of the jacobian of $f^n$ in a given region, for $n$ arbitrarily large. The jacobian of a smooth map at a point $x$ is the absolute value of the determinant of the derivative at $x$, measured in a fixed Riemannian metric. The jacobian measures the distortion of volume under $f$ in that metric.

Hopf argument: A technique developed by Eberhard Hopf for proving that a conservative diffeomorphism or flow is ergodic. The argument relies on the Ergodic Theorem for invertible transformations, the density of continuous functions among integrable functions, and the existence of stable and unstable foliations for the system. The argument has been used, with various modifications, to establish ergodicity for hyperbolic, partially hyperbolic and nonuniformly hyperbolic systems.

Hyperbolic: A compact invariant set $\Lambda \subset M$ for a diffeomorphism $f\colon M \to M$ is hyperbolic if, at every point in $\Lambda$, the tangent space splits into two subspaces, one that is uniformly contracted by the derivative of $f$, and another that is uniformly expanded. Expanding maps and Anosov diffeomorphisms are examples of globally hyperbolic maps. Hyperbolic diffeomorphisms and flows are the archetypical smooth systems displaying chaotic behavior, and their dynamical properties are well-understood. Nonuniform hyperbolicity and partial hyperbolicity are two generalizations of hyperbolicity that encompass a broader class of systems and display many of the chaotic features of hyperbolic systems.

Sinai–Ruelle–Bowen (SRB) measure: The concept of SRB measure is a rigorous formulation of what it means for an invariant measure to be “physically meaningful”. An SRB measure attracts a large set of orbits into its support, and its statistical features are reflected in the behavior of these attracted orbits.

Definition of the Subject

Smooth ergodic theory is the study of the statistical and geometric properties of measures invariant under a smooth transformation or flow. The study of smooth ergodic theory is as old as the study of
abstract ergodic theory, having its origins in Boltzmann’s Ergodic Hypothesis in the late nineteenth century. As a response to Boltzmann’s hypothesis, which was formulated in the context of Hamiltonian Mechanics, Birkhoff and von Neumann defined ergodicity in the 1930s and proved their foundational ergodic theorems. The study of ergodic properties of smooth systems saw an advance in the work of Hadamard and E. Hopf in the 1930s, in their study of geodesic flows for negatively curved surfaces. Beginning in the 1950s, Kolmogorov, Arnold and Moser developed a perturbative theory producing obstructions to ergodicity in Hamiltonian systems, known as Kolmogorov–Arnold–Moser (KAM) Theory. Beginning in the 1960s with the work of Anosov and Sinai on hyperbolic systems, the study of smooth ergodic theory has seen intense activity. This activity continues today, as the ergodic properties of systems displaying weak forms of hyperbolicity are further understood, and KAM theory is applied in increasingly broader contexts.

Introduction

This entry focuses on the basic arguments and principles in smooth ergodic theory, illustrating with simple and straightforward examples. The classic texts (Arnol’d and Avez 1986; Mañé 1987) are a good supplement.

The discussion here sidesteps the topic of Kolmogorov–Arnold–Moser (KAM) Theory, which has played an important role in the development of smooth ergodic theory. For reasons of space, detailed discussion of several active areas in smooth ergodic theory is omitted, including: higher mixing properties (Kolmogorov, Bernoulli, etc.), finer statistical properties (fast decay of correlations, Central Limit Theorem, large deviations), smooth thermodynamic formalism (transfer operators, pressure, dynamical zeta functions, etc.), the smooth ergodic theory of random dynamical systems, as well as any mention of infinite invariant measures. The text (Baladi 2000) covers many of these topics, and the texts (Kifer 1986, 1988; Liu and Qian 1995) treat random smooth ergodic theory in depth. An excellent discussion of many of the recent developments in the field of smooth ergodic theory is (Bonatti et al. 2005).

This entry assumes knowledge of the basic concepts in ergodic theory and of basic differential topology. The texts (Cornfeld et al. 1982) and (Hirsch 1979) contain the necessary background.

The Volume Class

For simplicity, assume that $M$ is a compact, boundaryless $C^\infty$ Riemannian manifold, and that $f\colon M \to M$ is an orientation-preserving, $C^1$ map satisfying $m(D_x f) > 0$, for all $x \in M$, where

$$ m(D_x f) = \inf_{v \in T_x M,\ \|v\| = 1} \|D_x f(v)\| . $$

If $f$ is a diffeomorphism, then this assumption is automatically satisfied, since in that case $m(D_x f) = \|D_{f(x)} f^{-1}\|^{-1} > 0$. For non-invertible maps, this assumption is essential in much of the following discussion. The Inverse Function Theorem implies that any map $f$ satisfying these hypotheses is a covering map of positive degree $d \geq 1$.

These assumptions will avoid the issues of infinite measures and the behavior of $f$ near critical points and singularities of the derivative. For most results discussed in this entry, this assumption is not too restrictive. The existence of critical points and other singularities is, however, a complication that cannot be avoided in many important applications. The ergodic-theoretic analysis of such examples can be considerably more involved, but contains many of the elements discussed in this entry. The discussion in section “Beyond Uniform Hyperbolicity” indicates how some of these additional technicalities arise and can be overcome. For simplicity, the discussion here is confined almost exclusively to discrete time evolution. Many, though not all, of the results mentioned here carry over to flows and semiflows using, for example, a cross-section construction (see Chap. 1 in (Mañé 1987)).

Every smooth map $f\colon M \to M$ satisfying these hypotheses preserves a natural measure class, the
measure class of a finite, smooth Riemannian volume on $M$. Fix such a volume $\nu$ on $M$. Then there exists a continuous, positive jacobian function $x \mapsto \mathrm{jac}_x f$ on $M$, with the property that for every sufficiently small ball $B \subset M$, and every measurable set $A \subset B$ one has:

$$ \nu(f(A)) = \int_A \mathrm{jac}_x f \, d\nu(x) . $$

The jacobian of $f$ at $x$ is none other than the absolute value of the determinant of the derivative $D_x f$ (measured in the given Riemannian metric). To see that the measure class of $\nu$ is preserved by $f$, observe that the Radon–Nikodym derivative $\frac{d f_* \nu}{d\nu}(x)$ at $x$ is equal to $\sum_{y \in f^{-1}(x)} (\mathrm{jac}_y f)^{-1} > 0$. Hence $f_* \nu$ is equivalent to $\nu$, and $f$ preserves the measure class of $\nu$.

In many contexts, the map $f$ has a natural invariant measure in the measure class of volume. In this case, $f$ is said to be conservative. One setting in which a natural invariant smooth measure appears is Hamiltonian dynamics. Any solution to Hamilton’s equations preserves a smooth volume called the Liouville measure. Furthermore, along the invariant, constant energy hypermanifolds of a Hamiltonian flow, the Liouville measure decomposes smoothly into invariant measures, each of which is equivalent to the induced Riemannian volume. In this way, many systems of physical or geometric origin, such as billiards, geodesic flows, hard sphere gases, and evolution of the n-body problem give rise to smooth conservative dynamical systems. See Dynamics of Hamiltonian Systems.

Note that even though $f$ preserves a smooth measure class, it might not preserve any measure in that measure class. Consider, for example, a diffeomorphism $f\colon S^1 \to S^1$ of the circle with exactly two fixed points, $p$ and $q$, with $f'(p) > 1 > f'(q) > 0$. Let $\mu$ be an $f$-invariant probability measure, and suppose that $\mu$ lies in the measure class of volume. Let $I$ be a small interval neighborhood of $p$ whose closure does not contain $q$. Then $\bigcap_{n=1}^{\infty} f^{-n}(I) = \{p\}$, but on the other hand, $\mu(f^{-n}(I)) = \mu(I) > 0$, for all $n$. This implies that $\mu(\{p\}) > 0$, which is impossible for a measure in the measure class of volume, and so $\mu$ does not lie in the measure class of volume. This is an example of a dissipative map. A map $f$ is called dissipative if every $f$-invariant measure with full support has a singular part with respect to volume. As was just seen, if a diffeomorphism $f$ has a periodic sink, then $f$ is dissipative; more generally, if a diffeomorphism $f$ has a periodic point $p$ of period $k$ such that $\mathrm{jac}_p f^k \neq 1$, then $f$ is dissipative.

The Fundamental Questions

For a given smooth map $f\colon M \to M$, there are the following fundamental questions.

1. Is $f$ conservative? That is, does there exist an invariant measure in the class of volume? If so, is it unique?
2. When $f$ is conservative, what are its statistical properties? Is it ergodic, mixing, a K-system, Bernoulli, etc.? Does it obey a Central Limit Theorem, fast decay of correlations, large deviations estimates, etc.?
3. If $f$ is dissipative, does there exist an invariant measure, not in the class of volume, but (in some sense) natural with respect to volume? What are the statistical properties of such a measure, if it exists?

There are several plausible ways to “answer” these questions. One might fix a given map $f$ of interest and ask these questions for that specific $f$. What tends to happen in the analysis of a single map $f$ is that either:

• the question can be answered using “soft” methods, and so the answer applies not only to $f$ but to perturbations of $f$, or even to generic or typical $f$ inside a class of maps; or,
• the proof requires “hard” analysis or precise asymptotic information and cannot possibly be answered for a specific $f$, but can be answered for a large set of $f_t$ in a typical (or given) parametrized family $\{f_t\}_{t \in (-1, 1)}$ of smooth maps containing $f = f_0$.

Both types of results appear in the discussion that follows.
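The dissipative circle example from the discussion of the volume class can be explored numerically. The Python sketch below uses a hypothetical map $f(x) = x + \frac{\varepsilon}{2\pi}\sin(2\pi x)$ on $S^1 = \mathbb{R}/\mathbb{Z}$; this formula, the value of `EPS`, and the function names are illustrative assumptions, not taken from the entry. The map has a repelling fixed point $p = 0$ and an attracting fixed point $q = 1/2$. Orbits started away from $p$ accumulate at $q$, which is consistent with the conclusion that no invariant probability measure lies in the measure class of volume.

```python
import math

EPS = 0.5  # assumed perturbation size; 0 < EPS < 1 keeps f a diffeomorphism


def circle_map(x: float) -> float:
    """Illustrative circle diffeomorphism with fixed points p = 0 and q = 1/2."""
    return (x + EPS / (2.0 * math.pi) * math.sin(2.0 * math.pi * x)) % 1.0


def derivative(x: float) -> float:
    """f'(x) = 1 + EPS*cos(2*pi*x): equals 1 + EPS at p and 1 - EPS at q."""
    return 1.0 + EPS * math.cos(2.0 * math.pi * x)


def iterate(x: float, n: int) -> float:
    """Apply circle_map n times to x."""
    for _ in range(n):
        x = circle_map(x)
    return x


# Start from a grid of points that avoids the repelling fixed point p = 0.
starts = [k / 100.0 for k in range(1, 100)]
ends = [iterate(x, 200) for x in starts]

# Largest distance from the attracting fixed point q = 1/2 after 200 iterates.
max_dist_to_q = max(abs(x - 0.5) for x in ends)
```

In this sketch every sampled orbit ends up numerically at $q$, so time averages along these orbits see only the point mass at $q$, a measure that is singular with respect to volume.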
Lebesgue Measure and Local Properties of Volume

of $S^1 \setminus A$ by pairwise disjoint elements $\{J_i\}$ of the union $\bigcup_{n=1}^{\infty} \mathcal{P}_n$ with the properties:
conditions on the partitions $\mathcal{P}_n$, then one can draw stronger conclusions.

A very useful theorem in this respect is the Lebesgue Density Theorem. A point $x \in M$ is a Lebesgue density point of a measurable set $X \subseteq M$ if

$$ \lim_{r \to 0} m(X : B_r(x)) = 1 , $$

where $B_r(x)$ is the Riemannian ball of radius $r$ centered at $x$. Notice that the notion of Lebesgue density point depends only on the smooth structure of $M$, because any two Riemannian metrics have the same Lebesgue density points. The Lebesgue Density Theorem states that if $A$ is a measurable set and $\hat{A}$ is the set of Lebesgue density points of $A$, then $m(A \,\Delta\, \hat{A}) = 0$.

Ergodicity of the Basic Examples

This section contains proofs of the ergodicity of two basic examples of conservative smooth maps: irrational rotations on the circle and the doubling map on the circle. See ▶ Ergodic Theory: Basic Examples and Constructions for a more detailed description of these maps. These proofs serve as an elementary illustration of some of the fundamental techniques and principles in smooth ergodic theory.

Rotations on the circle. Denote by $S^1$ the circle $\mathbb{R}/\mathbb{Z}$, which is an additive group, and by $\lambda$ normalized Lebesgue-Haar measure on $S^1$. Fix a real number $\alpha \in \mathbb{R}$. The rotation $R_\alpha\colon S^1 \to S^1$ is the translation defined by $R_\alpha(x) = x + \alpha$. Since translations preserve Lebesgue-Haar measure, the map $R_\alpha$ is conservative. Note that $R_\alpha$ is a diffeomorphism and an isometry with respect to the canonical flat metric (length) on $S^1$.

Proposition 2 If $\alpha \notin \mathbb{Q}$, then the rotation $R_\alpha\colon S^1 \to S^1$ is ergodic with respect to Lebesgue measure.

Proof Let $A$ be an $R_\alpha$-invariant set in $S^1$, and suppose that $0 < \lambda(A) < 1$. Denote by $A^c$ the complement of $A$ in $S^1$. Fix $\varepsilon > 0$. Proposition 1 implies that there exists an interval $I \subset S^1$ such that the density of $A$ in $I$ is large: $\lambda(A : I) > 1 - \varepsilon$. Similarly, one may choose an interval $J$ such that $\lambda(A^c : J) > 1 - \varepsilon$. Without loss of generality, one may choose $I$ and $J$ to have the same length. Since $\alpha$ is irrational, $R_\alpha$ has a dense orbit, which meets the interval $I$. Since $R_\alpha$ is an isometry, this implies that there is an integer $n$ such that $\lambda(R_\alpha^n(I) \,\Delta\, J) < \varepsilon \lambda(I)$. Since $\lambda(I) = \lambda(J)$, this readily implies that $|\lambda(A : R_\alpha^n(I)) - \lambda(A : J)| < \varepsilon$. Also, since $A$ is invariant, and $R_\alpha$ is invertible and preserves measure, one has:

$$ \lambda(A : R_\alpha^n(I)) = \lambda(R_\alpha^n(A) : R_\alpha^n(I)) = \lambda(A : I) > 1 - \varepsilon . $$

But for $\varepsilon$ sufficiently small, this contradicts the facts that $\lambda(A : J) = 1 - \lambda(A^c : J) < \varepsilon$ and $|\lambda(A : R_\alpha^n(I)) - \lambda(A : J)| < \varepsilon$. □

Note that this is not a proof of the strongest possible statement about $R_\alpha$ (namely, minimality and unique ergodicity). The point here is to show how “soft” arguments are often sufficient to establish ergodicity; this proof uses no more about $R_\alpha$ than the fact that it is a transitive isometry. Hence the same argument shows:

Theorem 1 Let $f\colon M \to M$ be a transitive isometry of a Riemannian manifold $M$. Then $f$ is ergodic with respect to Riemannian volume.

One can isolate from this proof a useful principle:

Fundamental Principle # 2: Isometries preserve Lebesgue density at all scales, for arbitrarily many iterates.

This principle implies, for example, that a smooth action by a compact Lie group on $M$ is ergodic along typical (nonsingular) orbits. This principle is also useful in studying area-preserving flows on surfaces and, in a refined form, unipotent flows on homogeneous spaces. In the case of surface flows, ergodicity questions can be reduced to a study of interval exchange transformations. See the entry ▶ Ergodic Theory: Basic Examples and Constructions for a detailed discussion of interval exchange transformations and flows on
surfaces. ▶ Introduction to Ergodic Theory contains detailed information on unipotent flows.

Doubling map on the circle. Let $T_2\colon S^1 \to S^1$ be the doubling map defined by $T_2(x) = 2x$. Then $T_2$ is a degree-2 covering map and endomorphism of $S^1$ with constant jacobian $\mathrm{jac}_x T_2 \equiv 2$. Since $\frac{d (T_2)_* \lambda}{d\lambda} = \frac{1}{2} + \frac{1}{2} = 1$, $T_2$ preserves Lebesgue-Haar measure. The doubling map is the simplest example of a hyperbolic dynamical system, a topic treated in depth in the next section.

As with the previous example, the focus here is on the property of ergodicity. It is again possible to prove much stronger results about $T_2$, such as Bernoullicity, by other methods. Instead, here is a soft proof of ergodicity that will generalize readily to other contexts.

Proposition 3 The doubling map $T_2\colon S^1 \to S^1$ is ergodic with respect to Lebesgue measure.

Proof Let $A$ be a $T_2$-invariant set in $S^1$ with $\lambda(A) > 0$. Let $p \in S^1$ be the fixed point of $T_2$, so that $T_2(p) = p$. For each $n \in \mathbb{N}$, the preimages of $p$ under $T_2^n$ define a (mod 0) partition $\mathcal{P}_n$ into $2^n$ open intervals of length $2^{-n}$; the elements of $\mathcal{P}_n$ are the connected components of $S^1 \setminus T_2^{-n}(\{p\})$. Note that the sequence of partitions $\mathcal{P}_1, \mathcal{P}_2, \ldots$ is nested, in the sense of Proposition 1. Restricted to any interval $J \in \mathcal{P}_n$, the map $T_2^n$ is a diffeomorphism onto $S^1 \setminus \{p\}$ with constant jacobian $\mathrm{jac}_x T_2^n = (T_2^n)'(x) = 2^n$.

Since $A$ is invariant, it follows that $T_2^{-n}(A) = A$. Fix $\varepsilon > 0$. Proposition 1 implies that there exists an $n \in \mathbb{N}$ and an interval $J \in \mathcal{P}_n$ such that $\lambda(A : J) > 1 - \varepsilon$. Note that $T_2^n(A \cap J) \subset A$. But then

$$
\begin{aligned}
\lambda(A) &\geq \lambda(T_2^n(A \cap J)) \\
&= \int_{A \cap J} \mathrm{jac}_x T_2^n \, d\lambda(x) \\
&= 2^n \lambda(A \cap J) \\
&= 2^n \lambda(A : J)\,\lambda(J) \\
&> 2^n (1 - \varepsilon)\,\lambda(J) = 1 - \varepsilon .
\end{aligned}
$$

Since $\varepsilon$ was arbitrary, one obtains that $\lambda(A) = 1$. □

In this proof, the facts that the intervals in $\mathcal{P}_n$ have constant length $2^{-n}$ and that the jacobian of $T_2^n$ restricted to such an interval is constant and equal to $2^n$ are not essential. The key fact really used in this proof is the assertion that the ratio:

$$ \frac{\lambda(T_2^n(A \cap J) : T_2^n(J))}{\lambda(A : J)} $$

is bounded, independently of $n$. In this case, the ratio is 1 for all $n$ because $T_2$ has constant jacobian.

It is tempting to try to extend this proof to other expanding maps on the circle, for example, a $C^1$, $\lambda$-preserving map $f\colon S^1 \to S^1$ with $d_{C^1}(f, T_2)$ small. Many of the aspects of this proof carry through mutatis mutandis for such an $f$, save for one. A $C^1$-small perturbation of $T_2$ will in general no longer have constant jacobian, and the variation of the jacobian of $f^n$ on a small interval can be (and often is) unbounded. The reason for this unboundedness is a lack of control of the modulus of continuity of $f'$. Hence this argument can fail for $C^1$ perturbations of $T_2$. On the other hand, the argument still works for $C^2$ perturbations of $T_2$, even when the jacobian is not constant.

The principle behind this fact can be loosely summarized:

Fundamental Principle # 3: On controlled scales, iterates of $C^2$ expanding maps distort Lebesgue density in a controlled way.

This principle requires further explanation and justification, which will come in the following section. The $C^2$ hypothesis in this principle accounts for the fact that almost all results in smooth ergodic theory assume a $C^2$ hypothesis (or something slightly weaker).

Hyperbolic Systems

One of the most developed areas of smooth ergodic theory is in the study of hyperbolic maps and attractors. This section defines hyperbolic maps and attractors, provides examples, and investigates their ergodic properties. See (Robinson
1995; Katok and Hasselblatt 1995) and Hyperbolic Dynamical Systems for a thorough discussion of the topological and smooth properties of hyperbolic systems.

A hyperbolic structure on a compact $f$-invariant set $\Lambda \subset M$ is given by a $Df$-invariant splitting $T_\Lambda M = E^u \oplus E^s$ of the tangent bundle over $\Lambda$ and constants $C, \mu > 1$ such that, for every $x \in \Lambda$ and $n \in \mathbb{N}$:

$$ v \in E^u(x) \implies \|D_x f^n(v)\| \geq C^{-1} \mu^n \|v\| , $$

$$ \text{and} \quad v \in E^s(x) \implies \|D_x f^n(v)\| \leq C \mu^{-n} \|v\| . $$

A hyperbolic attractor for a map $f\colon M \to M$ is given by an open set $U \subset M$ such that: $f(U) \subset U$, and such that the set $\Lambda = \bigcap_{n \geq 0} f^n(U)$ carries a hyperbolic structure. The set $\Lambda$ is called the attractor, and $U$ is an attracting region. A map $f\colon M \to M$ is hyperbolic if $M$ decomposes (mod 0) into a finite union of attracting regions for hyperbolic attractors. Typically one assumes as well that the restriction of $f$ to each attractor $\Lambda_i$ is topologically transitive.

Every point $p$ in a hyperbolic set $\Lambda$ has smooth stable manifold $\mathcal{W}^s(p)$ and unstable manifold $\mathcal{W}^u(p)$, tangent, respectively, to the subspaces $E^s(p)$ and $E^u(p)$. The set $\mathcal{W}^s(p)$ is precisely the set of $q \in M$ such that $d(f^n(p), f^n(q))$ tends to 0 as $n \to \infty$, and it follows that $f(\mathcal{W}^s(p)) = \mathcal{W}^s(f(p))$. When $f$ is a diffeomorphism, the unstable manifold $\mathcal{W}^u(p)$ is uniquely defined and is the stable manifold of $f^{-1}$. When $f$ is not invertible, local unstable manifolds exist, but generally are not unique. If $\Lambda$ is a transitive hyperbolic attractor, then every unstable manifold of every point $p \in \Lambda$ is dense in $\Lambda$.

Examples of Hyperbolic Maps and Attractors

Expanding Maps

The previous section mentioned briefly the $C^r$ perturbations of the doubling map $T_2$. Such perturbations (as well as $T_2$ itself) are examples of expanding maps. A map $f\colon M \to M$ is expanding if there exist constants $\mu > 1$ and $C > 0$ such that, for every $x \in M$, and every nonzero vector $v \in T_x M$:

$$ \|D_x f^n(v)\| \geq C \mu^n \|v\| , $$

with respect to some (any) Riemannian metric on $M$. An expanding map is clearly hyperbolic, with $U = M$, $E^s$ the trivial bundle, and $E^u = TM$. Any disk in $M$ is a local unstable manifold for $f$.

Anosov Diffeomorphisms

A diffeomorphism $f\colon M \to M$ is called Anosov if the tangent bundle splits as a direct sum $TM = E^u \oplus E^s$ of two $Df$-invariant subbundles, such that $E^u$ is uniformly expanded and $E^s$ is uniformly contracted by $Df$. Similarly, a flow $\varphi_t\colon M \to M$ is called Anosov if the tangent bundle splits as a direct sum $TM = E^u \oplus E^0 \oplus E^s$ of three $D\varphi_t$-invariant subbundles, such that $E^0$ is generated by $\dot{\varphi}$, $E^u$ is uniformly expanded and $E^s$ is uniformly contracted by $D\varphi_t$. Like expanding maps, an Anosov diffeomorphism is an Anosov attractor with $\Lambda = U = M$.

A simple example of a conservative Anosov diffeomorphism is a hyperbolic linear automorphism of the torus. Any matrix $A \in SL(n, \mathbb{Z})$ induces an automorphism of $\mathbb{R}^n$ preserving the integer lattice $\mathbb{Z}^n$, and so descends to an automorphism $f_A\colon \mathbb{T}^n \to \mathbb{T}^n$ of the $n$-torus $\mathbb{T}^n = \mathbb{R}^n/\mathbb{Z}^n$. Since the determinant of $A$ is 1, the diffeomorphism $f_A$ preserves Lebesgue-Haar measure on $\mathbb{T}^n$. In the case where none of the eigenvalues of $A$ have modulus 1, the resulting diffeomorphism $f_A$ is Anosov. The stable bundle $E^s$ at $x \in \mathbb{T}^n$ is the parallel translate to $x$ of the sum of the contracted generalized eigenspaces of $A$, and the unstable bundle $E^u$ at $x$ is the translated sum of expanded eigenspaces.

In general, the invariant subbundles $E^u$ and $E^s$ of an Anosov diffeomorphism are integrable and tangent to a transverse pair of foliations $\mathcal{W}^u$ and $\mathcal{W}^s$, respectively (see, e.g., (Hirsch et al. 1977) for a proof of this). The leaves of $\mathcal{W}^s$ are uniformly contracted by $f$, and the leaves of $\mathcal{W}^u$ are uniformly contracted by $f^{-1}$. The leaves of these foliations are as smooth as $f$, but the tangent bundles to the leaves do not vary smoothly in the manifold. The regularity properties of these foliations play an important role in the ergodic properties of Anosov diffeomorphisms.
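The criterion for a linear torus automorphism to be Anosov (determinant 1 and no eigenvalue of modulus 1) is easy to check by hand in dimension two. The following Python sketch does so for a few integer matrices; the function names and the sample matrices, including the familiar matrix with rows (2, 1) and (1, 1), are illustrative choices and are not taken from the entry.

```python
import cmath


def eigenvalues_2x2(a: int, b: int, c: int, d: int):
    """Eigenvalues of [[a, b], [c, d]] from the characteristic polynomial."""
    trace, det = a + d, a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return (trace + root) / 2, (trace - root) / 2


def is_hyperbolic_automorphism(a: int, b: int, c: int, d: int,
                               tol: float = 1e-12) -> bool:
    """True if the matrix has determinant 1 and no eigenvalue of modulus 1."""
    if a * d - b * c != 1:
        return False
    return all(abs(abs(ev) - 1.0) > tol for ev in eigenvalues_2x2(a, b, c, d))


def torus_map(a: int, b: int, c: int, d: int, point):
    """Action of the matrix on a point of the 2-torus R^2/Z^2."""
    x, y = point
    return ((a * x + b * y) % 1.0, (c * x + d * y) % 1.0)


cat = (2, 1, 1, 1)        # eigenvalues (3 +/- sqrt(5))/2: hyperbolic
rotation = (0, -1, 1, 0)  # eigenvalues +/- i, modulus 1: not hyperbolic
shear = (1, 1, 0, 1)      # repeated eigenvalue 1: not hyperbolic

expanding, contracting = eigenvalues_2x2(*cat)
```

For the first matrix the two eigenvalues are real, with one of modulus greater than 1 and one of modulus less than 1; their eigenlines give the unstable and stable directions of the induced torus map.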
The first Anosov flows to be studied extensively were the geodesic flows for manifolds of negative sectional curvatures. As these flows are Hamiltonian, they are conservative. Eberhard Hopf showed in the 1930s that such geodesic flows for surfaces are ergodic with respect to Liouville measure (Hopf 1939); it was not until the 1960s that ergodicity of all such flows was proved by Anosov (Anosov 1967). The next section describes, in the context of Anosov diffeomorphisms, Hopf’s method and important refinements due to Anosov and Sinai.

DA Attractors

A simple way to produce a non-Anosov hyperbolic attractor on the torus is to start with an Anosov diffeomorphism, such as a linear hyperbolic automorphism, and deform it in a neighborhood of a fixed point, turning a saddle fixed point into a source, while preserving the stable foliation. If this procedure is carried out carefully enough, the resulting diffeomorphism is a dissipative hyperbolic diffeomorphism, called a derived from Anosov (DA) attractor. Other examples of hyperbolic attractors are the Plykin attractor and the solenoid. See (Robinson 1995).

Distortion Estimates

Before describing the ergodic properties of hyperbolic systems, it is useful to pause for a brief discussion of distortion estimates. Distortion estimates are behind almost every result in smooth ergodic theory. In the hyperbolic setting, distortion estimates are applied to the action of $f$ on unstable manifolds to show that the volume distortion of $f$ along unstable manifolds can be controlled for arbitrarily many iterates.

The example mentioned at the end of the previous section illustrates the ideas in a distortion estimate. Suppose that $f\colon S^1 \to S^1$ is a $C^2$ expanding map, such as a $C^2$ small perturbation of $T_2$. Then there exist constants $\mu > 1$ and $C > 0$ such that $(f^n)'(x) > C\mu^n$ for all $x$ and $n$.

Let $d$ be the degree of $f$. If $I$ is a sufficiently small open interval in $S^1$, then for each $n$, $f^{-n}(I)$ is a union of $d^n$ disjoint intervals. Furthermore, each of these intervals has diameter at most $C^{-1}\mu^{-n}$ times the diameter of $I$. It is now possible to justify the assertion in Fundamental Principle # 3 in this context.

Lemma There exists a constant $K \geq 1$ such that, for all $n \in \mathbb{N}$, and for all $x, y$ in the same connected component of $f^{-n}(I)$, one has:

$$ K^{-1} \leq \frac{(f^n)'(x)}{(f^n)'(y)} \leq K . $$

Proof Since $f$ is $C^2$ and $f'$ is bounded away from 0, the function $\alpha(x) = \log(f'(x))$ is $C^1$. In particular, $\alpha$ is Lipschitz continuous: there exists a constant $L > 0$ such that, for all $x, y \in S^1$, $|\alpha(x) - \alpha(y)| \leq L\, d(x, y)$. For $n \geq 0$, let $\alpha_n(x) = \log((f^n)'(x))$. The Chain Rule implies that

$$ \alpha_n(x) = \sum_{i=0}^{n-1} \alpha(f^i(x)) . $$

The expanding hypothesis on $f$ implies that for all $x, y$ in the same connected component of $f^{-n}(I)$ and for $i = 0, \ldots, n$, one has $d(f^i(x), f^i(y)) \leq C^{-1}\mu^{i-n} d(f^n(x), f^n(y)) \leq C^{-1}\mu^{i-n}$. Hence

$$
\begin{aligned}
|\alpha_n(x) - \alpha_n(y)| &\leq \sum_{i=0}^{n-1} |\alpha(f^i(x)) - \alpha(f^i(y))| \\
&\leq L \sum_{i=0}^{n-1} d(f^i(x), f^i(y)) \\
&\leq L C^{-1} \sum_{i=0}^{n-1} \mu^{i-n} \\
&< L C^{-1} \mu^{-1} (1 - \mu^{-1})^{-1} .
\end{aligned}
$$

Setting $K = \exp(L C^{-1} \mu^{-1} (1 - \mu^{-1})^{-1})$, one now sees that $(f^n)'(x)/(f^n)'(y)$ lies in the interval $[K^{-1}, K]$, proving the claim. □

In this distortion estimate, the function $\alpha\colon M \to \mathbb{R}$ is called a cocycle. The same argument applies to any Lipschitz continuous (or even Hölder continuous) cocycle.

Ergodicity of Expanding Maps

The ergodic properties of $C^2$ expanding maps are completely understood. In particular, every conservative expanding map is ergodic, and every expanding map is conservative. The proofs of these facts use Fundamental Principles # 1 and 3 in a fairly direct way.
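The distortion estimate of the preceding subsection can be checked numerically on a concrete example. The Python sketch below uses a hypothetical $C^2$ expanding map $f(x) = 2x + \frac{\varepsilon}{2\pi}\sin(2\pi x) \bmod 1$; the map, the constants `EPS` and `ELL`, and all function names are assumptions made for illustration. For this map one may take $C = 1$, $\mu = 2 - \varepsilon$, and $L = 2\pi\varepsilon/(2 - \varepsilon)$ as a Lipschitz constant for $\alpha = \log f'$. The code compares the log-derivatives of $f^n$ at two points whose images under $f^n$ stay within distance `ELL`, and records the largest discrepancy over several values of $n$.

```python
import math

EPS = 0.3                        # assumed size of the perturbation of x -> 2x
MU = 2.0 - EPS                   # lower bound for f', so (f^n)' >= MU**n (C = 1)
LIP = 2.0 * math.pi * EPS / MU   # Lipschitz bound for alpha = log f'
ELL = 0.1                        # bound on the distance |f^n(x) - f^n(y)|


def f(x: float) -> float:
    """Illustrative C^2 expanding circle map, a perturbation of x -> 2x mod 1."""
    return (2.0 * x + EPS / (2.0 * math.pi) * math.sin(2.0 * math.pi * x)) % 1.0


def alpha(x: float) -> float:
    """The cocycle alpha(x) = log f'(x)."""
    return math.log(2.0 + EPS * math.cos(2.0 * math.pi * x))


def log_derivative(x: float, n: int) -> float:
    """log (f^n)'(x), computed via the chain rule as a sum of alpha along the orbit."""
    total = 0.0
    for _ in range(n):
        total += alpha(x)
        x = f(x)
    return total


def log_distortion(x: float, n: int) -> float:
    """|log((f^n)'(x) / (f^n)'(y))| for a nearby y with |f^n(x) - f^n(y)| <= ELL."""
    # f' <= 2 + EPS, so an initial gap of ELL * (2 + EPS)**(-n) grows to at most ELL.
    y = x + ELL * (2.0 + EPS) ** (-n)
    return abs(log_derivative(x, n) - log_derivative(y, n))


# Bound from the proof with C = 1 and image length ELL: log K = LIP * ELL / (MU - 1).
LOG_K = LIP * ELL / (MU - 1.0)
worst = max(log_distortion(k / 37.0, n) for n in (5, 10, 20) for k in range(37))
```

The point of the sketch is that `worst` stays below `LOG_K` for every $n$ tried: the bound on the distortion does not grow with the number of iterates, which is the content of Fundamental Principle # 3 in this setting.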
Every $C^2$ conservative expanding map is ergodic with respect to volume. The proof is a straightforward adaptation of the proof of Proposition 3 (see, e.g., (Mañé 1987)). Here is a description of the proof for $M = S^1$. As remarked earlier, the proof of Proposition 3 adapts easily to a general expanding map $f : S^1 \to S^1$ once one shows that for every f-invariant set A, and every connected component J of $f^{-n}(S^1 \setminus \{p\})$, the quantity

$$\frac{\lambda(f^n(A \cap J) : f^n(J))}{\lambda(A : J)}$$

is bounded independently of n. This is a fairly direct consequence of the distortion estimate in Lemma 6.1 and is left as an exercise.
The same distortion estimates show that every $C^2$ expanding map is conservative, preserving a probability measure $\nu$ in the measure class of volume. Here is a sketch of the proof for the case $M = S^1$. To prove this, consider the push-forward $\lambda_n = f^n_* \lambda$. Then $\lambda_n$ is equivalent to Lebesgue, and its Radon–Nikodym derivative $d\lambda_n / d\lambda$ is the density function

$$\rho_n(x) = \sum_{y \in f^{-n}(x)} \frac{1}{\mathrm{jac}_y f^n}.$$

Since $f^n_* \lambda$ is a probability measure, it follows that $\int_{S^1} \rho_n \, d\lambda = 1$. A simple argument using the distortion estimate above (and summing up over all $d^n$ branches of $f^{-n}$ at x) shows that there exists a constant $c \geq 1$ such that for all $x, y \in S^1$,

$$c^{-1} \leq \frac{\rho_n(x)}{\rho_n(y)} \leq c.$$

Since the integral of $\rho_n$ is 1, the functions $\rho_n$ are uniformly bounded away from 0 and $\infty$. It is easy to see that the measure $\nu_n = \frac{1}{n} \sum_{i=1}^{n} f^i_* \lambda$ has density $\frac{1}{n} \sum_{i=1}^{n} \rho_i$. Let $\nu$ be any subsequential weak* limit of $\nu_n$; then $\nu$ is absolutely continuous, with density $\rho$ bounded away from 0 and $\infty$. With a little more care, one can show that $\rho$ is actually Lipschitz continuous.
As a passing comment, the ergodicity of $\nu$ and positivity of $\rho$ imply that $\nu$ is the unique f-invariant measure absolutely continuous with respect to $\lambda$. With more work, one can show that $\nu$ is exact. See (Mañé 1987) for details.

Ergodicity of Conservative Anosov Diffeomorphisms
Like conservative $C^2$ expanding maps, conservative $C^2$ Anosov diffeomorphisms are ergodic. This subsection outlines a proof of this fact. Unlike expanding maps, however, Anosov diffeomorphisms need not be conservative. The subsection following this one describes a type of invariant measure that is "natural" with respect to volume, called a Sinai-Ruelle-Bowen (or SRB) measure. The central result for hyperbolic systems states that every hyperbolic attractor carries an SRB measure.

The Hopf Argument
In the 1930s Hopf (Hopf 1939) proved that the geodesic flow for a compact, negatively-curved surface is ergodic. His method was to study the Birkhoff averages of continuous functions along leaves of the stable and unstable foliations of the flow. This type of argument has been used since then in increasingly general contexts, and has come to be known as the Hopf Argument.
The core of the Hopf Argument is very simple. To any $f : M \to M$ one can associate the stable equivalence relation $\sim_s$, where $x \sim_s y$ iff $\lim_{n \to \infty} d(f^n(x), f^n(y)) = 0$. Denote by $W^s(x)$ the stable equivalence class containing x. When f is invertible, one defines the unstable equivalence relation to be the stable equivalence relation for $f^{-1}$, and one denotes by $W^u(x)$ the unstable equivalence class containing x.
The first step in the Hopf Argument is to show that Birkhoff averages for continuous functions are constant along stable and unstable equivalence classes. Let $\varphi : M \to \mathbb{R}$ be an integrable function, and let

$$\varphi_f = \limsup_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \varphi \circ f^i. \qquad (1)$$

Observe that if $\varphi$ is continuous, then for every $x \in M$ and $x' \in W^s(x)$, $\lim_{n \to \infty} |\varphi(f^n(x))$
Theorem 2 (Anosov) Let f be a $C^2$, conservative Anosov diffeomorphism. Then f is ergodic.

Proof By the Hopf Argument, it suffices to show that if $\psi^s$ and $\psi^u$ are $L^2$ functions with the following properties:

1. $\psi^s$ is constant along leaves of $W^s$,
2. $\psi^u$ is constant along leaves of $W^u$, and
3. $\psi^s = \psi^u$ a.e.,

then $\psi^s$ (and so $\psi^u$ as well) is constant a.e.
This is proved using the absolute continuity of $W^u$ and $W^s$. Since M is connected, one may argue this locally. Let G be the full measure set of $p \in M$

physically observable measures for hyperbolic attractors (Sinai 1972; Ruelle 1976; Bowen 1975). Such measures are now known as Sinai-Ruelle-Bowen (SRB) measures, and have been shown to exist for non-hyperbolic maps with some hyperbolic features. This subsection describes the construction of SRB measures for hyperbolic attractors.
An f-invariant probability measure $\mu$ is called an SRB (or physical) measure if there exists an open set $U \subset M$ containing the support of $\mu$ such that, for every continuous function $\varphi : M \to \mathbb{R}$ and $\lambda$-a.e. $x \in U$,
Bernoulli shift. This map sends the SRB measure to a Gibbs state for a mixing Markov shift (see ▶ Pressure and Equilibrium States in Ergodic Theory). A result that subsumes all of the results in this section is:

Theorem 3 (Sinai, Ruelle, Bowen) Let $\Lambda \subset M$ be a transitive hyperbolic attractor for a $C^2$ map $f : M \to M$. Then f has an ergodic SRB measure $\mu$ supported on $\Lambda$. Moreover: the disintegration of $\mu$ along unstable manifolds of $\Lambda$ is equivalent to the induced Riemannian volume, the Lyapunov exponents of $\mu$ are all positive, and $\mu$ is Bernoulli.

Beyond Uniform Hyperbolicity

The methods developed in the smooth ergodic theory of hyperbolic maps have been extended beyond the hyperbolic context. Two natural generalizations of hyperbolicity are:

• partial hyperbolicity, which requires uniform expansion of $E^u$ and uniform contraction of $E^s$, but allows central directions at each point, in which the expansion and contraction is dominated by the behavior in the hyperbolic directions; and,
• nonuniform hyperbolicity, which requires hyperbolicity along almost every orbit, but allows the expansion of $E^u$ and the contraction of $E^s$ to weaken near the exceptional set where there is no hyperbolicity.

This section discusses both generalizations.

is uniformly expanded, and $E^c$ is dominated, meaning that for some $n \geq 1$ and for all $x \in M$:

$$\|D_x f^n|_{E^s}\| < m(D_x f^n|_{E^c}) \leq \|D_x f^n|_{E^c}\| < m(D_x f^n|_{E^u}).$$

Partial hyperbolicity is a $C^1$-open condition: any diffeomorphism sufficiently $C^1$-close to a partially hyperbolic diffeomorphism is itself partially hyperbolic. For an extensive discussion of examples of partially hyperbolic dynamical systems, see the survey article (Burns et al. 2001) and the book (Pesin 2004). Among these examples are: the time-1 map of an Anosov flow, the frame flow for a compact manifold of negative sectional curvature, and many affine transformations of compact homogeneous spaces. All of these examples preserve the volume induced by a Riemannian metric on M.
As in the Anosov case, the stable and unstable bundles $E^s$ and $E^u$ of a partially hyperbolic diffeomorphism are tangent to foliations, again denoted by $W^s$ and $W^u$ respectively (Brin and Pesin 1974). Brin-Pesin and Pugh-Shub proved that these foliations are absolutely continuous.
A partially hyperbolic diffeomorphism $f : M \to M$ is accessible if any point in M can be reached from any other along an su-path, which is a concatenation of finitely many subpaths, each of which lies entirely in a single leaf of $W^s$ or a single leaf of $W^u$. Accessibility is a global, topological property of the foliations $W^u$ and $W^s$ that is the analogue of transversality of $W^u$ and $W^s$ for Anosov diffeomorphisms. In fact, the transversality of these foliations in the Anosov case immediately implies that every Anosov
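The domination condition in the definition of partial hyperbolicity can be made concrete on a linear example. The sketch below is an illustration under stated assumptions rather than anything taken from the text: it uses the automorphism of the 3-torus induced by a hypothetical matrix `A` (a cat map on the first two coordinates and the identity on the third), and it reads $m(\cdot)$ as the conorm, that is, the minimum expansion of a linear map. Because each invariant subspace here is one-dimensional, the norm and the conorm of $A^n$ on it coincide and equal the growth factor of any nonzero vector in it.

```python
import math

# Hypothetical example matrix: cat map on (x, y), identity on z.
A = [[2.0, 1.0, 0.0],
     [1.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]


def apply(mat, v):
    """Matrix-vector product for 3x3 lists."""
    return [sum(mat[i][j] * v[j] for j in range(3)) for i in range(3)]


def norm(v):
    return math.sqrt(sum(c * c for c in v))


def growth(v, n):
    """Return |A^n v| / |v|. On a 1-dimensional invariant subspace this is
    both the norm and the conorm of A^n restricted to that subspace."""
    w = v
    for _ in range(n):
        w = apply(A, w)
    return norm(w) / norm(v)


golden = (1.0 + math.sqrt(5.0)) / 2.0
E_u = [golden, 1.0, 0.0]    # eigenvector for the eigenvalue (3 + sqrt(5))/2
E_s = [1.0, -golden, 0.0]   # eigenvector for the eigenvalue (3 - sqrt(5))/2
E_c = [0.0, 0.0, 1.0]       # eigenvector for the eigenvalue 1

n = 1
s, c, u = growth(E_s, n), growth(E_c, n), growth(E_u, n)
print(s, c, u)
```

With n = 1 the chain of inequalities already holds: contraction on the stable direction, no growth on the central direction, and expansion on the unstable direction.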
can in examples be countably infinite; nonuniform hyperbolicity alone does not imply ergodicity.

The Dissipative Case
As mentioned above, establishing the existence of a nonsingular hyperbolic measure is a difficult problem in general. In systems with some global form of hyperbolicity, such as partial hyperbolicity, it is sometimes possible to "borrow" the expansion from the unstable direction and lend it to the central direction, via a small perturbation. Nonuniformly hyperbolic attractors have been constructed in this way (Viana 1997). This method is also behind the construction of a $C^1$ open set of nonuniformly hyperbolic diffeomorphisms in (Shub and Wilkinson 2000).
For a given system of interest, it is sometimes possible to prove that a given invariant measure is hyperbolic by establishing an approximate form of hyperbolicity. The idea, due to Wojtkowski and called the cone method, is to isolate a measurable bundle of cones in TM defined over the support of the measure, such that the cone at a point x is mapped by $D_x f$ into the cone at f(x). Intersecting the images of these cones under all iterates of Df, one obtains an invariant subbundle of TM over the support of f that is nonuniformly expanded.
Lai-Sang Young has developed a very general method (Young 1998) for proving the existence of SRB measures with strong mixing properties in systems that display "some hyperbolicity". The idea is to isolate a region X in the manifold where the first return map is hyperbolic and distortion estimates hold. If this can be done, then the map carries a mixing, hyperbolic SRB measure. The precise rate of mixing is determined by the properties of the return-time function to X; the longer the return times, the slower the rate of mixing.
More results on the existence of hyperbolic measures are discussed in the next section.
An important subject in smooth ergodic theory is the relationship between entropy, Lyapunov exponents, and dimension of invariant measures of a smooth map. Significant results in this area include the Pesin entropy formula (Pesin 1976), the Ruelle entropy inequality (Ruelle 1978), the entropy-exponents-dimension formula of Ledrappier and Young (1985a, 1985b), and the proof by Barreira-Pesin-Schmeling that hyperbolic measures have a well-defined dimension (Barreira et al. 1999). Hyperbolic Dynamical Systems contains a discussion of these results; see that entry for further information.

The Presence of Critical Points and Other Singularities

Now for a discussion of the aforementioned technical difficulties that arise in the presence of singularities and critical points for the derivative. Singularities, that is, points where Df (or even f) fails to be defined, arise naturally in the study of billiards and hard sphere gases. The first subsection discusses some progress made in smooth ergodic theory in the presence of singularities. Critical points, that is, points where Df fails to be invertible, appear inescapably in the study of noninvertible maps. This type of complication already shows up for noninvertible maps in dimension 1, in the study of unimodal maps of the interval. The second subsection discusses the technique of parameter exclusion, developed by Jakobson, which allows for an ergodic analysis of a parametrized family of maps with criticalities. The technical advances used to overcome these issues in the interval have turned out to have applications to dissipative, nonhyperbolic, diffeomorphisms in higher dimension, where the derivative is "nearly critical" in places. The last subsection describes extensions of the parameter exclusion technique to these near-critical maps.

Hyperbolic Billiards and Hard Sphere Gases
In the 1870s the physicist Ludwig Boltzmann hypothesized that in a mechanical system with many interacting particles, physical measurements (observables), averaged over time, will converge to their expected value as time approaches infinity. The underlying dynamical system in this statement is a Hamiltonian system with many degrees of freedom, and the "expected value" is with respect to Liouville measure. Loosely phrased in modern terms, Boltzmann's hypothesis states that a generic Hamiltonian
system of this form will be ergodic on constant energy submanifolds. Reasoning that the time scales involved in measurement of an observable in such a system are much larger than the rate of evolution of the system, Boltzmann's hypothesis allowed him to assume that physical quantities associated to such a system behave like constants.
In 1963, Sinai revived and formalized this ergodic hypothesis, stating it in a concrete formulation known as the Boltzmann-Sinai Ergodic Hypothesis. In Sinai's formulation, the particles were replaced by N hard, elastic spheres, and to compactify the problem, he situated the spheres on a k-torus, k = 2, 3. The Boltzmann-Sinai Ergodic Hypothesis is the conjecture that the induced Hamiltonian system on the 2kN-dimensional configuration space is ergodic on constant energy manifolds, for any $N \geq 2$.
Sinai verified this conjecture for N = 2 by reducing the problem to a billiard map in the plane. As background for Sinai's result, a brief discussion of planar billiard maps follows.
Let $D \subset \mathbb{R}^2$ be a connected region whose boundary $\partial D$ is a collection of closed, piecewise smooth simple curves in the plane. The billiard map is a map defined (almost everywhere) on $\partial D \times [-\pi, \pi]$. To define this map, one identifies each point $(x, \theta) \in \partial D \times [-\pi, \pi]$ with an inward-pointing tangent vector at x in the plane, so that the normal vector to $\partial D$ at x corresponds to the pair $(x, \pi/2)$. This can be done in a unique way on the smooth components of $\partial D$. Then $f(x, \theta)$ is obtained by following the ray originating at $(x, \theta)$ until it strikes the boundary $\partial D$ for the first time at $(x', \theta')$. Reflecting this vector about the normal at $x'$, define $f(x, \theta) = (x', \pi - \theta')$.
It is not hard to see that the billiard map is conservative. The billiard map is piecewise smooth, but not in general smooth: the degree of smoothness of f is one less than the degree of smoothness of $\partial D$. In addition to singularities arising from the corners of the table, there are singularities arising in the second derivative of f at the tangent vectors to the boundary.
In the billiards studied by Sinai, the boundary $\partial D$ consists of a union of concave circular arcs and straight line segments. Similar billiards, but with convex circular arcs, were first studied by Bunimovich (1974). Sinai and Bunimovich proved that these billiards are ergodic and nonuniformly hyperbolic. For the Boltzmann-Sinai problem with $N \geq 3$, the relevant associated dynamical system is a higher dimensional billiard table in Euclidean space, with circular arcs replaced by cylindrical boundary components.
In a planar billiard table with circular/flat boundary, the behavior of vectors encountering a flat segment of boundary is easily understood, as is the behavior of vectors meeting a circular segment in a neighborhood of the normal vector. If the billiard map is ergodic, however, every open set of vectors will meet the singularities in the table infinitely many times. To establish the nonuniform hyperbolicity of such billiard tables via cone fields, it is therefore necessary to understand precisely the fraction of time orbits spend near these singularities. Furthermore, to use the Hopf argument to establish ergodicity, one must avoid the singularities in the second derivative, where distortion estimates break down. The techniques for overcoming these obstacles involve imposing restrictions on the geometry of the table (even more so for higher dimensional tables), and are well beyond the scope of this paper.
The study of hyperbolic billiards and hard sphere gases has a long and involved history. See the articles (Szász 2000) and (Chernov and Markarian 2003) for a survey of some of the results and techniques in the area. A discussion of methods in singular smooth ergodic theory, with particular applications to the Lorenz attractor, can be found in (Araújo and Pacifico 2007). Another, more classical, reference is (Katok et al. 1986), which contains a formulation of properties on a critical set, due to Katok–Strelcyn, that are useful in establishing ergodicity of systems with singularities.

Interval Maps and Parameter Exclusion
The logistic family of maps $f_t : x \mapsto tx(1 - x)$ defined on the interval [0, 1] is very simple to define but exhibits an astonishing variety of dynamical features as the parameter t varies. For small positive values of t, almost every point in I is attracted under the map $f_t$ to the sink at 0. For values of t > 4, the map has a repelling hyperbolic
Cantor set. As the value of t increases between 0 and 4, the map $f_t$ undergoes a cascade of period-doubling bifurcations, in which a periodic sink of period $2^n$ becomes repelling and a new sink of period $2^{n+1}$ is born. The period doublings accumulate at t ≈ 3.57; at t ≈ 3.83, a periodic point of period 3 appears, forcing the existence of periodic points of all periods. The dynamics of $f_t$ for t close to 4 has been the subject of intense inquiry in the last 20 years.
The map $f_t$, for t close to 4, shares some of the features of the doubling map $T_2$; it is 2-to-1, except at the critical point 1/2, and it is uniformly expanding in the complement of a neighborhood of this critical point. Because this neighborhood of the critical point is not invariant, however, the only invariant sets on which $f_t$ is uniformly hyperbolic have measure zero. Furthermore, the derivative of $f_t$ vanishes at the critical point, making it impossible to control distortion for orbits that spend too much time near the critical point.
Despite these serious obstacles, Michael Jakobson (1981) found a method for constructing absolutely continuous invariant measures for maps in the logistic family. The method has come to be known as parameter exclusion and has seen application far beyond the logistic family. As with billiards, it is possible to formulate geometric conditions on the map $f_t$ that control both expansion (hyperbolicity) and distortion on a positive measure set. As these conditions involve understanding infinitely many iterates of $f_t$, they are impossible to verify for a given parameter value t.
Using an inductive formulation of this condition, Jakobson showed that the set of parameters t near 4 that fail to satisfy the condition at iterate n has exponentially small measure (in n). He thereby showed that for a positive Lebesgue measure set of parameter values t, the map $f_t$ has an absolutely continuous invariant measure (Jakobson 1981). This measure is ergodic (mixing) and has a positive Lyapunov exponent. The delicacy of Jakobson's approach is confirmed by the fact that for an open and dense set of parameter values, almost every orbit is attracted to a periodic sink, and so $f_t$ has no absolutely continuous invariant measure (Graczyk and Swiatek 1997; Lyubich 1999). Jakobson's method applies not only to the logistic family but to a very general class of $C^3$ one-parameter families of maps on the interval.

Near-Critical Diffeomorphisms
Jakobson's method in one dimension proved to extend to certain highly dissipative diffeomorphisms. The seminal paper in this extension is due to Benedicks and Carleson; the method has since been extended in a series of papers (Mora and Viana 1993; Benedicks and Young 1993; Benedicks and Viana 2001) and has been formulated in an abstract setting (Wang and Young 2008).
This extension turns out to be highly nontrivial, but it is possible to describe informally the similarities between the logistic family and higher-dimensional "near critical" diffeomorphisms. The diffeomorphisms to which this method applies are crudely hyperbolic with a one dimensional unstable direction. Roughly this means that in some invariant region of the manifold, the image of a small ball under f will be stretched significantly in one direction and shrunk in all other directions. The directions of stretching and contraction are transverse in a large proportion of the invariant region, but there are isolated "near critical" subregions where expanding and contracting directions are nearly tangent.
The dynamics of such a diffeomorphism are very close to 1-dimensional if the contraction is strong enough, and the diffeomorphism resembles an interval map with isolated critical points, the critical points corresponding to the critical regions where stable and unstable directions are tangent. An illustration of this type of dynamics is the Hénon family of maps $f_{a,b} : (x, y) \mapsto (1 - ax^2 + by, x)$, the original object of study in the work of Benedicks and Carleson. When the parameter b is set to 0, the map $f_{a,b}$ is no longer a diffeomorphism, and indeed is precisely a projection composed with the logistic map. For small values of b and appropriate values of a, the Hénon map is strongly dissipative and displays the near critical behavior described in the previous paragraph. In analogy
to Jakobson's result, there is a positive measure set of parameters near b = 0 where $f_{a,b}$ has a mixing, hyperbolic SRB measure.
See (Viana and Lutstsatto 2003) for a detailed exposition of the parameter exclusion method for Hénon-like maps.

Future Directions

In addition to the open problems discussed in the previous sections, there are several general questions and problems worth mentioning:

interest, including the infinite dimensional systems that arise in the study of partial differential equations.
• Carry the methods of smooth ergodic theory further into the study of smooth actions of discrete groups (other than the integers) on manifolds. When do such actions admit (possibly non-invariant) "physical" measures?

There are other interesting open areas of future inquiry, but this gives a good sample of the range of possibilities.
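Returning to the logistic family from the subsection on interval maps and parameter exclusion, the contrast between parameters with a periodic sink and parameters with a positive Lyapunov exponent can be observed numerically. The following is a rough sketch under stated assumptions, not a rigorous computation: the function names `logistic` and `lyapunov_exponent`, the starting point, and the iteration counts are all illustrative choices, and a floating-point orbit only approximates a true orbit. The two parameters are chosen because their exponents are known in closed form: at t = 3.2 the attracting orbit of period 2 has multiplier $4 + 2t - t^2 = 0.16$, and at t = 4 the map is conjugate to the tent map, whose absolutely continuous invariant measure has exponent $\log 2$.

```python
import math


def logistic(t, x):
    """One step of the logistic map f_t(x) = t*x*(1 - x)."""
    return t * x * (1.0 - x)


def lyapunov_exponent(t, x0=0.2, burn_in=1000, steps=100000):
    """Average of log|f_t'(x)| along a numerical orbit of x0."""
    x = x0
    for _ in range(burn_in):
        x = logistic(t, x)
    total = 0.0
    for _ in range(steps):
        # f_t'(x) = t*(1 - 2x); the floor guards against log(0) at x = 1/2
        total += math.log(max(abs(t * (1.0 - 2.0 * x)), 1e-300))
        x = logistic(t, x)
    return total / steps


for t in (3.2, 4.0):
    print(t, lyapunov_exponent(t))
```

A negative estimate signals an attracting periodic orbit, while a positive one is consistent with the hyperbolic behavior that parameter exclusion establishes for a positive measure set of parameters; the estimate itself proves nothing about any particular parameter.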
Bonatti C, Dìaz LJ, Viana M (2005) Dynamics beyond uniform hyperbolicity. A global geometric and probabilistic perspective. Encyclopaedia of mathematical sciences, 102. Mathematical Physics, III. Springer, Berlin
Bowen R (1970) Markov partitions for Axiom A diffeomorphisms. Am J Math 92:725–747
Bowen R (1975) Equilibrium states and the ergodic theory of Anosov diffeomorphisms. Lecture Notes in Mathematics, vol 470. Springer, Berlin/New York
Brin MI, Pesin JB (1974) Partially hyperbolic dynamical systems. Izv Akad Nauk SSSR Ser Mat 38:170–212. (Russian)
Brin M, Stuck G (2002) Introduction to dynamical systems. Cambridge University Press, Cambridge
Bunimovic LA (1974) The ergodic properties of certain billiards. (Russian) Funkcional Anal i Priložen 8(3):73–74
Burns K, Wilkinson A, On the ergodicity of partially hyperbolic systems. Ann of Math. To appear
Burns K, Pugh C, Shub M, Wilkinson A (2001) Recent results about stable ergodicity. Smooth ergodic theory and its applications. (Seattle, 1999), 327–366. Proc Sympos Pure Math, 69, Amer Math Soc, Providence
Burns K, Dolgopyat D, Pesin Y, Pollicott M. Stable ergodicity for partially hyperbolic attractors with negative central exponents. Preprint
Chernov N, Markarian R (2003) Introduction to the ergodic theory of chaotic billiards. Second edition. Publicações Matemáticas do IMPA. IMPA Mathematical Publications 24 Colóquio Brasileiro de Matemática. 26th Brazilian Mathematics Colloquium Instituto de Matemática Pura e Aplicada (IMPA), Rio de Janeiro
Cornfeld IP, Fomin SV, Sinai YG (1982) Ergodic theory. Translated from the Russian by A. B. Sosinskii. Grundlehren der mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol 245. Springer, New York
Dolgopyat D (2004) On differentiability of SRB states for partially hyperbolic systems. Invent Math 155(2):389–449
Dolgopyat D, Pesin Y (2002) Every compact manifold carries a completely hyperbolic diffeomorphism. Ergod Theor Dynam Syst 22(2):409–435
Dolgopyat D, Wilkinson A (2003) Stable accessibility is C1 dense. Geometric methods in dynamics. II. Astérisque No. 287
Graczyk J, Swiatek G (1997) Generic hyperbolicity in the logistic family. Ann Math 146(2):1–52
Grayson M, Pugh C, Shub M (1994) Stably ergodic diffeomorphisms. Ann of Math 140(2):295–329
Hirsch MW (1979) Differential topology. Graduate Texts in Mathematics, No. 33. Springer, New York/Heidelberg
Hirsch MW, Pugh CC, Shub M (1977) Invariant manifolds. Lecture Notes in Mathematics, vol 583. Springer, Berlin/New York
Hopf E (1939) Statistik der geodätischen Linien in Mannigfaltigkeiten negativer Krümmung. Ber Verh Sachs Akad Wiss Leipzig 91:261–304. (German)
Jakobson MV (1981) Absolutely continuous invariant measures for one-parameter families of one-dimensional maps. Commun Math Phys 81(1):39–88
Katok A (1979) Bernoulli diffeomorphisms on surfaces. Ann Math 110(2):529–547
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems. With a supplementary chapter by Katok and Leonardo Mendoza. Encyclopedia of Mathematics and its Applications, vol 54. Cambridge University Press, Cambridge
Katok A, Strelcyn JM, Ledrappier F, Przytycki F (1986) Invariant manifolds, entropy and billiards; smooth maps with singularities. Lecture notes in mathematics, 1222. Springer, Berlin
Kifer Y (1986) Ergodic theory of random transformations. Progress in probability and statistics, vol 10. Birkhäuser, Boston
Kifer Y (1988) Random perturbations of dynamical systems. Progress in probability and statistics, 16. Birkhäuser, Boston
Ledrappier F, Young LS (1985a) The metric entropy of diffeomorphisms. I. Characterization of measures satisfying Pesin's entropy formula. Ann Math 122(2):509–539
Ledrappier F, Young LS (1985b) The metric entropy of diffeomorphisms. II. Relations between entropy, exponents and dimension. Ann Math 122(2):540–574
Liu PD, Qian M (1995) Smooth ergodic theory of random dynamical systems, Lecture notes in mathematics, vol 1606. Springer, Berlin
Lyubich M (1999) Feigenbaum–Coullet–Tresser universality and Milnor's hairiness conjecture. Ann Math 149(2):319–420
Mañé R (1987) Ergodic theory and differentiable dynamics. Translated from the Portuguese by Silvio L. Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol 8. Springer, Berlin
Mora L, Viana M (1993) Abundance of strange attractors. Acta Math 171(1):1–71
Palis J (2005) A global perspective for non-conservative dynamics. Ann Inst H Poincaré Anal Non Linéaire 22(4):485–507
Pesin JB (1976) Characteristic Ljapunov exponents, and ergodic properties of smooth dynamical systems with invariant measure. Dokl Akad Nauk SSSR 226. (Russian)
Pesin JB (1977) Characteristic Ljapunov exponents, and smooth ergodic theory. Uspehi Mat Nauk 32(196):55–112. 287 (Russian)
Pesin YB (2004) Lectures on partial hyperbolicity and stable ergodicity. Zurich lectures in advanced mathematics. European Mathematical Society (EMS), Zürich
Pesin YB, Sinai YG (1983) Gibbs measures for partially hyperbolic attractors. Ergod Theor Dynam Syst 2(3–4):417–438
Pugh C, Shub M (1972) Ergodicity of Anosov actions. Invent Math 15:1–23
Pugh C, Shub M (1989) Ergodic attractors. Trans Am Math Soc 312(1):1–54
Pugh C, Shub M (1996) Stable ergodicity and partial hyperbolicity. International conference on dynamical systems (Montevideo, 1995), 182–187, Pitman Res Notes Math Ser, 362, Longman, Harlow
Robinson C (1995) Dynamical systems. Stability, symbolic dynamics, and chaos. Studies in Advanced Mathematics. CRC Press, Boca Raton
Rodríguez HA, Rodríguez HF, Ures R (2008) Partially hyperbolic systems with 1D-center bundle. Invent Math 172(2)
Ruelle D (1976) A measure associated with axiom-A attractors. Am J Math 98(3):619–654
Ruelle D (1978) An inequality for the entropy of differentiable maps. Bol Soc Brasil Mat 9(1):83–87
Shub M, Wilkinson A (2000) Pathological foliations and removable zero exponents. Invent Math 139(3):495–508
Sinai JG (1961) Geodesic flows on compact surfaces of negative curvature. Dokl Akad Nauk SSSR 136:549–552 (Russian); translated as Soviet Math Dokl 2:106–109
Sinai JG (1972) Gibbs measures in ergodic theory. (Russian) Uspehi Mat Nauk 27 no. 4(166):21–64
Szász D (2000) Boltzmann's ergodic hypothesis, a conjecture for centuries? Hard ball systems and the Lorentz gas, Encycl Math Sci, vol 101. Springer, Berlin, pp 421–448
Tsujii M (2005) Physical measures for partially hyperbolic surface endomorphisms. Acta Math 194(1):37–132
Viana M (1997) Multidimensional nonhyperbolic attractors. Inst Hautes Études Sci Publ Math 85:63–96
Viana M, Lutstsatto S (2003) Exclusions of parameter values in Hénon-type systems. (Russian) Uspekhi Mat Nauk 58 (2003), no. 6(354), 3–44; translation in Russian Math. Surveys 58(6):1053–1092
Wang QD, Young LS (2008) Toward a theory of rank one attractors. Ann Math 167(2)
Young LS (1998) Statistical properties of dynamical systems with some hyperbolicity. Ann Math 147(2):585–650
Young LS (2002) What are SRB measures, and which dynamical systems have them? Dedicated to David Ruelle and Yasha Sinai on the occasion of their 65th birthdays. J Stat Phys 108(5–6):733–754
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces

Krzysztof Frączek¹ and Corinna Ulcigrai²
¹Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland
²Institut für Mathematik, Universität Zürich, Zürich, Switzerland

Article Outline

Glossary
Definition of the Subject
Introduction
Examples
Locally Hamiltonian flows
Background and Tools
Invariant Measures and (Unique) Ergodicity
Behavior of Ergodic Averages
Mixing Properties
Spectral Properties
Disjointness Results
Open Directions
Bibliography

Glossary

Borel flow is a family $\varphi_\mathbb{R} = (\varphi_t)_{t \in \mathbb{R}}$ of Borel maps on a topological space X such that $(t, x) \mapsto \varphi_t x$ is also Borel, $\varphi_0 = \mathrm{Id}_X$ and $\varphi_{t_1 + t_2} = \varphi_{t_1} \circ \varphi_{t_2}$ for all $t_1, t_2 \in \mathbb{R}$. The flow preserves a Borel probability measure $\mu$ on X if $\mu(\varphi_t(A)) = \mu(A)$ for any Borel

preserving smooth flow. Such flows are also called multi-valued or locally Hamiltonian.
We say that two flows $\varphi_\mathbb{R}$ on $(X, \mu)$ and $\psi_\mathbb{R}$ on $(Y, \nu)$ are isomorphic as measure-preserving flows if there exists an isomorphism $F : X \to Y$, i.e., a bimeasurable map which transports the measure $\mu$ to $\nu$ (i.e., $\mu(F^{-1}(A)) = \nu(A)$ for any Borel set $A \subset Y$) and commutes with the dynamics, i.e., $F(\varphi_t(x)) = \psi_t(F(x))$ for $\mu$-almost every (a.e.) $x \in X$. If additionally $\varphi_\mathbb{R}$, $\psi_\mathbb{R}$ and F are smooth, then the flows $\varphi_\mathbb{R}$ and $\psi_\mathbb{R}$ are smoothly isomorphic.
By a joining between two flows $\varphi_\mathbb{R}$ on $(X, \mu)$ and $\psi_\mathbb{R}$ on $(Y, \nu)$ we mean any probability $(\varphi_t \times \psi_t)_{t \in \mathbb{R}}$-invariant measure on $X \times Y$ whose projections on X and Y are equal to $\mu$ and $\nu$, respectively. Two flows $\varphi_\mathbb{R}$ and $\psi_\mathbb{R}$ are called disjoint (in the sense of Furstenberg) if their only common joining is the product joining $\mu \otimes \nu$.
A $\mu$-preserving flow $\varphi_\mathbb{R}$ is ergodic if any invariant Borel set $A \subset X$ (i.e., $\mu(\varphi_t A \,\Delta\, A) = 0$ for all $t \in \mathbb{R}$) has measure zero or its complement $X \setminus A$ has measure zero.
A $\mu$-preserving flow $\varphi_\mathbb{R}$ is weakly mixing if for any pair $f, g \in L^2_0(X, \mu)$ ($L^2_0(X, \mu)$ is the subspace of zero mean functions),

$$\lim_{T \to \infty} \frac{1}{T} \int_0^T \left| \int_X (f \circ \varphi_t) \cdot g \, d\mu \right| dt = 0.$$

A $\mu$-preserving flow $\varphi_\mathbb{R}$ is mixing if for any pair $f, g \in L^2_0(X, \mu)$,

$$\lim_{t \to \infty} \int_X (f \circ \varphi_t) \cdot g \, d\mu = 0.$$
$$\liminf_{t \to +\infty} \| f - f \circ \varphi_t \|_{L^2(X, \mu)} > 0.$$

A $\mu$-preserving flow $\varphi_\mathbb{R}$ is mixing of all orders if for any $n \geq 2$ and any n-tuple $A_0, \ldots, A_{n-1}$ of Borel sets,

$$\mu\left(A_0 \cap \varphi_{t_1}(A_1) \cap \cdots \cap \varphi_{t_1 + \cdots + t_{n-1}}(A_{n-1})\right) \to \mu(A_0) \cdots \mu(A_{n-1}) \quad \text{as } t_1, t_2, \ldots, t_{n-1} \to \infty.$$

A Borel measure $\sigma_f$ on $\mathbb{R}$ is the spectral measure of $f \in L^2(X, \mu)$ if

$$\int_\mathbb{R} e^{its} \, d\sigma_f(t) = \int_X (f \circ \varphi_s) \cdot \overline{f} \, d\mu \quad \text{for all } s \in \mathbb{R}.$$

The spectral type of a $\mu$-preserving flow $\varphi_\mathbb{R}$ is an equivalence class of a Borel measure $\sigma$ on $\mathbb{R}$ such that for every $f \in L^2_0(X, \mu)$, $\sigma_f$ is absolutely continuous with respect to $\sigma$ and there exists $f_0 \in L^2_0(X, \mu)$ such that $\sigma_{f_0} = \sigma$.

state physics, or statistical mechanics models. Despite being low-dimensional systems of zero topological entropy, they present a rich display of fine chaotic properties.
We define below two classes of area-preserving flows, linear flows on translation surfaces and locally Hamiltonian flows, whose ergodic and spectral properties have been the object of intense research activity in the last decades. Area-preserving flows on surfaces also provide one of the fundamental classes of parabolic, or slowly chaotic, dynamical systems (see also the survey Ulcigrai (2021)). Contrary to hyperbolic systems, which display sensitive dependence on initial conditions (i.e., divergence in time of nearby initial conditions, the so-called butterfly effect) which happens (infinitesimally) at exponential speed, parabolic systems display a slow form of sensitive divergence, with speed of divergence which is sub-exponential (and actually polynomial or sub-polynomial in all known examples).
connection with problems arising in solid-state physics as well as in pseudo-periodic topology (see, e.g., the survey by Zorich (1999)). Indeed, Novikov (1982) and his school in the 1990s advocated the study of locally Hamiltonian flows as model to describe the motion of an electron in a metal under a magnetic field in the semi-classical approximation (the surface appears here as Fermi energy level surface). Novikov made some conjectures (known as Novikov problem) on the asymptotic behavior of trajectories of electrons. At the same time, Arnold (1991) made a conjecture on mixing for the flows we call today Arnold flows. This conjecture has been the motivation for a lot of the work on the mixing properties of locally Hamiltonian flows.
The current century has seen a lot of advances in our understanding of the chaotic properties of smooth area-preserving flows (a class which includes locally Hamiltonian flows), in particular exploiting Teichmüller dynamics tools, but also under the influence of the work of Marina Ratner in homogeneous dynamics. The study of linear flows on infinite translation surfaces has only begun in the last decade, in particular motivated by the study of the periodic Ehrenfest model and it is still a widely open research direction.

Examples

Linear flows on the torus The basic example of an area-preserving flow is the linear flow on the torus $\mathbb{T}^2 := \mathbb{R}^2/\mathbb{Z}^2$ (see Fig. 1) given by solutions $(x(t), y(t)) \in \mathbb{T}^2$ to

$$(x'(t), y'(t)) = (\cos\theta, \sin\theta), \qquad \theta \in S^1, \qquad (1)$$

which moves points at unit speed along (the image by the projection $p : \mathbb{R}^2 \to \mathbb{R}^2/\mathbb{Z}^2$ of) Euclidean lines in direction $\theta$ (i.e., lines making an angle $\theta$ with the horizontal axes); see Fig. 1a.

Linear flows on translation surface Linear flows (also called translation flows) can be defined more in general on translation surfaces, namely surfaces which are locally Euclidean outside a finite number of conical singularities. A translation surface can be defined as a quotient

$$S := (P_1 \cup P_2 \cup \cdots \cup P_n)/\sim$$

where $P_i \subset \mathbb{R}^2$ for $1 \le i \le n$ are disjoint polygons in $\mathbb{R}^2$, with clockwise oriented boundary, with the property that their edges can be paired into couples $(e, e')$ where $e$ and $e'$ are parallel and isometric and have opposite orientation (see an example in Fig. 2) and the equivalence relation $\sim$ identifies the edges of the pair $(e, e')$ by the (unique) translation of $\mathbb{R}^2$ which maps $e$ to $e'$. The request that $e$ and $e'$ have opposite orientations is needed to guarantee that, after gluing, one obtains a translation surface, in which all sides are identified by translations, and not a half-translation surface, where identifications by a central symmetry composed with a translation are allowed. Translation surfaces can be equivalently defined as the datum of an Abelian differential on a Riemann surface, while half-translation surfaces correspond to quadratic differentials.
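The gluing condition on polygon edges can be checked mechanically. The following is a minimal sketch in Python, not a construction taken from the text: the function names `edge_vectors` and `translation_pairing` and the example polygons are hypothetical. It assumes each polygon is given by its vertices in boundary order with exact (integer or rational) coordinates. An edge is recorded as a vector along the oriented boundary, so two edges are parallel, isometric, and oppositely oriented exactly when their vectors are negatives of each other. The pairing is found greedily; when several edges share the same vector, other pairings (and hence other surfaces) are possible.

```python
def edge_vectors(vertices):
    """Edge vectors of a polygon, following the boundary orientation."""
    n = len(vertices)
    return [(vertices[(i + 1) % n][0] - vertices[i][0],
             vertices[(i + 1) % n][1] - vertices[i][1]) for i in range(n)]


def translation_pairing(vertices):
    """Pair each edge with an edge of opposite vector (greedy choice).

    Returns a list of index pairs (i, j), or None if some edge has no
    partner, in which case the edges cannot all be glued by translations.
    """
    vecs = edge_vectors(vertices)
    unpaired = list(range(len(vecs)))
    pairs = []
    while unpaired:
        i = unpaired.pop(0)
        vx, vy = vecs[i]
        # Look for a remaining edge that is parallel, isometric and
        # oppositely oriented: its vector must equal (-vx, -vy).
        match = next((j for j in unpaired if vecs[j] == (-vx, -vy)), None)
        if match is None:
            return None
        unpaired.remove(match)
        pairs.append((i, match))
    return pairs


# Unit square, clockwise boundary: opposite sides are glued (a torus).
square = [(0, 0), (0, 1), (1, 1), (1, 0)]
# L-shaped polygon with its long sides subdivided into unit edges.
l_shape = [(0, 0), (1, 0), (2, 0), (2, 1), (1, 1), (1, 2), (0, 2), (0, 1)]
# A triangle has no pair of opposite edges.
triangle = [(0, 0), (1, 0), (0, 1)]

print(translation_pairing(square))
print(translation_pairing(l_shape))
print(translation_pairing(triangle))
```

Exact coordinates are used so that the equality test on edge vectors is reliable; with floating-point vertices one would compare up to a tolerance instead.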
The resulting space $S$ is a compact, oriented topological surface which is endowed with a flat, Euclidean metric outside a finite set $\Sigma = \Sigma(S)$, the image (after gluings) of the set of vertices of the polygons. Points in $\Sigma$ are known as singularities; each of these points has a neighborhood where the metric has a conical singularity with a total cone angle $2\pi k$, $k \in \mathbb{N}$ (i.e., it is locally of the form $ds^2 = dr^2 + (k\, r\, d\theta)^2$). Translation surfaces can be equivalently defined as a pair $(X, \omega)$ where $X$ is a compact Riemann surface and $\omega$ is an Abelian differential; we refer to the surveys (Masur and Tabachnikov 2002; Yoccoz 2010; Forni and Matheus 2014).
On a translation surface $S$, the notion of a direction $\theta \in S^1$ (in particular the notion of horizontal and vertical direction) is well defined not only locally, in each polygon, but globally (since the identifications are performed by translations). Thus, for each $\theta \in S^1$, we can globally define a flow $\varphi^{\theta}_{\mathbb{R}} := (\varphi^{\theta}_t)_{t \in \mathbb{R}}$ given locally by solutions to (1). As in the case of the torus, $\varphi^{\theta}_{\mathbb{R}}$ moves points along the (quotient to $S$) of lines in direction $\theta$ with unit speed; see Fig. 2. Notice that at a singularity $v \in \Sigma$ with cone angle $2\pi k$, there is not a unique but $k$ lines in direction $\theta$ and $v$ is a saddle point of the linear flow $\varphi^{\theta}_{\mathbb{R}}$ with $2k$-prongs, of the type shown in Fig. 4c for $k = 3$. Notice furthermore that these flows preserve a Euclidean area, but are discontinuous flows, since singularities are reached in finite time.

Locally Hamiltonian flows

Smooth (in particular continuous) flows on $S$ preserving a smooth measure can be defined as follows. Let $\omega$ be a fixed smooth area form (locally given in coordinates $(x, y)$ by $f(x, y)\,dx \wedge dy$ where $f$ is a smooth positive function). Thus, equivalently, the pair $(S, \omega)$ is a two-dimensional symplectic manifold. We consider smooth flows $\varphi_{\mathbb{R}}$ on $S$ which preserve a measure $\mu$ given by integrating a smooth density with respect to $\omega$. We assume that the area is normalized so that $\mu(S) = 1$. It turns out that such smooth area-preserving flows on $S$ are in one-to-one correspondence with smooth closed real-valued differential 1-forms as follows. Given a smooth, closed, real-valued differential 1-form $\eta$, let $X$ be the vector field determined by $\eta = i_X \omega$ where $i_X$ denotes the contraction operator, i.e., $i_X \omega = \omega(X, \cdot)$, and consider the flow $\varphi_{\mathbb{R}}$ on $S$ given by $X$. Since $\eta$ is closed, the transformations $\varphi_t$, $t \in \mathbb{R}$, are area-preserving. Conversely, every smooth area-preserving flow can be obtained in this way.
The flow $\varphi_{\mathbb{R}}$ is known as the multi-valued Hamiltonian flow associated to $\eta$. Indeed, the flow $\varphi_{\mathbb{R}}$ is locally Hamiltonian, i.e., locally one can find coordinates $(x, y)$ on $S$ in which $\varphi_{\mathbb{R}}$ is given by the solution to the equations

$$\dot{x} = \partial H/\partial y, \qquad \dot{y} = -\partial H/\partial x$$

for some smooth real-valued Hamiltonian function $H$. A global Hamiltonian $H$ cannot be in general defined (see Nikolaev and Zhuzhoma (1999), Section 1.3.4), but one can think of $\varphi_{\mathbb{R}}$ as globally given by a multi-valued Hamiltonian function.
Locally Hamiltonian flows necessarily have fixed points or singularities. Singularities, as shown in Fig. 4, can be either centers (Fig. 4a),
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces 337
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 3 Examples of flows
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 4 Type of singularities
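The local Hamiltonian equations for these flows can be explored numerically. The sketch below is only an illustration under stated assumptions: the particular function `hamiltonian`, the constants `a` and `b`, and the Runge-Kutta integrator are choices made here, not taken from the text. On the torus $\mathbb{R}^2/\mathbb{Z}^2$ it uses $H(x,y) = ax + by + \sin(2\pi x)\sin(2\pi y)$, which is not periodic (so $H$ is multi-valued on the torus) while its differential $dH$ is a periodic closed 1-form. The trajectory is integrated in the plane, where $H$ should stay constant along orbits.

```python
import math

TWO_PI = 2.0 * math.pi


def hamiltonian(x, y, a=0.3, b=0.5):
    """Multi-valued Hamiltonian on the torus: linear part plus periodic part."""
    return a * x + b * y + math.sin(TWO_PI * x) * math.sin(TWO_PI * y)


def vector_field(x, y, a=0.3, b=0.5):
    """Vector field (x', y') = (dH/dy, -dH/dx); periodic in x and y."""
    dH_dx = a + TWO_PI * math.cos(TWO_PI * x) * math.sin(TWO_PI * y)
    dH_dy = b + TWO_PI * math.sin(TWO_PI * x) * math.cos(TWO_PI * y)
    return dH_dy, -dH_dx


def rk4_step(x, y, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    k1x, k1y = vector_field(x, y)
    k2x, k2y = vector_field(x + 0.5 * h * k1x, y + 0.5 * h * k1y)
    k3x, k3y = vector_field(x + 0.5 * h * k2x, y + 0.5 * h * k2y)
    k4x, k4y = vector_field(x + h * k3x, y + h * k3y)
    return (x + h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0,
            y + h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0)


def flow(x, y, t, steps=4000):
    """Approximate the time-t map of the flow, computed on the plane."""
    h = t / steps
    for _ in range(steps):
        x, y = rk4_step(x, y, h)
    return x, y


x0, y0 = 0.1, 0.2
x1, y1 = flow(x0, y0, 1.0)
# H is conserved along the lifted trajectory, up to integration error.
print(abs(hamiltonian(x1, y1) - hamiltonian(x0, y0)))
# H itself jumps by a under x -> x + 1, while the vector field does not change.
print(hamiltonian(x0 + 1.0, y0) - hamiltonian(x0, y0))
```

Setting `a = b = 0` gives a genuinely (globally) Hamiltonian flow on the torus; the linear part is what makes the Hamiltonian only locally defined.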
Time-changes New flows can be obtained by perturbation starting from the examples in the previous section. The simplest perturbation is a time-change, also known as a time-reparametrization, which produces flows that move points along the same orbits, but with different speed. The flow $\tilde{\varphi}_{\mathbb{R}}$ is a time-change (or reparametrization) of a flow $\varphi_{\mathbb{R}}$ on $S$ if there exists a measurable function $\tau : S \times \mathbb{R} \to \mathbb{R}$ such that for every $x \in S$ the map $t \mapsto \tau(x, t)$ is continuous and strictly increasing, and for all $x \in S$ and $t \in \mathbb{R}$ we have $\tilde{\varphi}_{\tau(x,t)}(x) = \varphi_t(x)$. Since $\tilde{\varphi}_{\mathbb{R}}$ is assumed to be a flow, the function $\tau$ is an additive cocycle over the flow $\varphi_{\mathbb{R}}$, that is, it satisfies the cocycle identity:

$$\tau(x, s + t) = \tau(\varphi_s(x), t) + \tau(x, s)$$

for all $x \in S$ and all $s, t \in \mathbb{R}$. If $\varphi_{\mathbb{R}}$ is a smooth flow, we will say that $\tilde{\varphi}_{\mathbb{R}}$ is a smooth reparametrization if the cocycle $\tau$ is a smooth function such that the maps $\tau(x, \cdot)$ are homeomorphisms. Even these simple perturbations can produce a genuinely new flow, i.e., a flow which is not measurably conjugated to the unperturbed flow (see below). More drastic (non-smooth) time-changes, when $\tau$ is allowed to have singularities where it blows up, can introduce singularities in the time-changed flow. The simplest type of singularity is a stopping point (also known as fake saddle point), i.e., a fixed point which has a neighborhood foliated by flow trajectories.
There is an obvious way to produce measurably (or even smoothly) conjugated flows using reparametrizations, associated to solutions to the so-called cohomological equation as follows. Two additive cocycles $\tau_1(x, t)$, $\tau_2(x, t)$ are said to be measurably (respectively smoothly) cohomologous if their difference $\tau_1 - \tau_2$ is a measurable (respectively smooth) coboundary, i.e., if there exists a measurable (respectively smooth) function $u : S \to \mathbb{R}$, called the transfer function, such that

$$\tau_1(x, t) - \tau_2(x, t) = u(x) - u \circ \varphi_t(x) \qquad (2)$$

for all $(x, t) \in S \times \mathbb{R}$. An elementary, but fundamental, result establishes that time-changes given by cohomologous cocycles are isomorphic (see, e.g., Avila et al. (2021a), Lemma 2.1, or Katok (2001), §9, for related results). The time-change $\tilde{\varphi}_{\mathbb{R}}$ given by the cocycle $\tau$ is trivial if (2) with $\tau_1 = \tau$ and $\tau_2 = \mathrm{id}$ admits a solution. In this case, $\varphi_{\mathbb{R}}$ and $\tilde{\varphi}_{\mathbb{R}}$ are conjugated by the conjugacy $x \mapsto \tilde{\varphi}_{u(x)}(x)$ and hence the time-change is isomorphic to the original flow. Equation (2) is an example of a cohomological equation; thus, trivial time-changes are described by solutions to cohomological equations. A key feature of area-preserving flows is the existence of obstructions (invariant distributions) to solve cohomological equations; see Forni (1997). As a consequence, among smooth time-changes, smoothly trivial time-changes are rare (i.e., form a finite or countable codimension subspace) and time-changes can display essentially different spectral and ergodic properties compared to the original flow. Nevertheless, certain ergodic properties, like ergodicity and cohomological properties, which only depend on the orbit structure and hence are independent of the time-change, persist in a time-change.

Background and Tools

Minimality and minimal components From the topological dynamics point of view, one is interested in the qualitative behavior of every trajectory. An example where all orbits can be understood are linear flows on tori (e.g., given by (1)), which satisfy a well-known dichotomy: either all orbits are periodic, namely there exists a $t_0 > 0$ such that $\varphi_{t_0 + t}(x) = \varphi_t(x)$ for every $x \in S$ and $t \in \mathbb{R}$ (this happens exactly when the slope $\theta$ in (1) is rational), or $\varphi_{\mathbb{R}}$ is minimal, i.e., every (forward) trajectory $\{\varphi_t(x),\ t \ge 0\}$, for any $x \in S$, is dense.
In presence of fixed points (see Fig. 4), which are unavoidable in higher genus, the definition of minimality should be adjusted as follows. We say that the trajectory of $x \in S$ is regular if the whole orbit $\{\varphi_t(x),\ t \in \mathbb{R}\}$ is well defined and the limits $\lim_{t \to \pm\infty} \varphi_t(x)$ do not exist. A flow $\varphi_{\mathbb{R}} : S \to S$ is called quasi-minimal (or simply minimal, by
abusing the terminology) if every regular trajectory is dense. If $\varphi_{\mathbb{R}}$ has a fixed point that is a center (Fig. 4a), then since a neighborhood of the center is foliated by periodic orbits, $\varphi_{\mathbb{R}}$ cannot be (quasi-)minimal.
Let us say that a segment of a trajectory, of the form $\{\varphi_t(x),\ a < t < b\}$ (where possibly $a = -\infty$ or $b = +\infty$) is a saddle connection if it starts and ends in a saddle point (when $a$ or $b$ is $\pm\infty$, by this we mean that $\lim_{t \to \pm\infty} \varphi_t(x)$ are saddle points). A saddle connection is called a saddle loop if the initial and final saddles are the same (see Fig. 6a).
A classical result on flows on surfaces (see, e.g., Yoccoz (2010)) is that if $\varphi_{\mathbb{R}}$ has no saddle connections, then it is (quasi-)minimal. The corresponding result at the level of IETs, proved by Keane (1975), is that an IET $T$ with an irreducible combinatorial datum $\pi$ such that the orbits of the discontinuities of $T$ are all infinite and distinct (a condition called I.D.O.C. by Keane (1975) and nowadays known as Keane's condition in the literature; see, e.g., Yoccoz (2010)) is minimal.
Notice that if $\varphi_{\mathbb{R}}$ has a fixed point which is a center, one can show that the center is contained in a disk filled with closed (i.e., periodic) trajectories and bounded by a saddle loop or a union of saddle connections, called an island of periodic orbits; see Fig. 6a. Hence, in presence of centers, the flow $\varphi_{\mathbb{R}}$ is never minimal (since orbits in the complement of the island avoid the island and hence cannot be dense).
Mayer (1943), Levitt (1983), and Zorich (1999) proved independently that each smooth area-preserving flow can be decomposed into a finite number of subsurfaces with boundary $S_i$, $i = 1, \ldots, N$, such that for each $i$ the restriction of $\varphi_{\mathbb{R}}$ to $S_i$ is a periodic component, i.e., the interior of $S_i$ is foliated into closed orbits of $\varphi_{\mathbb{R}}$, or $S_i$ is such that the restriction of $\varphi_{\mathbb{R}}$ to $S_i$ is (quasi-)minimal; the latter are called minimal components. Periodic components are either islands (as in Fig. 6a) or cylinders filled by periodic orbits and bounded by saddle loops, as in Fig. 6b. Minimal components (see an example in Fig. 7a), by topological reasons, cannot be more than $g$ (the genus of $S$). The flows in Fig. 3, for example, can be decomposed, in the case of 3a, into three periodic components, two islands and one cylinder filled by closed orbits, and two minimal components (one of genus one and one of genus two), while, in the case of the flow on the torus in Fig. 3b, there is one island and one minimal component (the so-called Arnold flow).

Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 6 Periodic components

One can show (see below) that minimal components of a locally Hamiltonian flow (and in particular minimal such flows, for which $S$ is in itself a minimal component) are time-reparametrizations (see the previous section) of linear flows on translation surfaces (although the time-change in this case is singular), so in particular, they have the same orbits and the same topological behavior as linear flows (see, e.g.,
Zorich (1999)). This was in part one of the original motivations (in addition to the unfolding of rational billiards in the West) that sparked the interest of mathematicians such as Zorich in the ergodic theory of linear flows.

Interval exchange maps and Poincaré sections A central idea introduced by Poincaré was that the study of a surface flow can be often reduced to the study of a one-dimensional discrete dynamical system, by taking what we nowadays call a Poincaré section and considering the Poincaré first return map of the flow to the section (when and where it is defined).
In genus one, if we consider the linear flow (1) on $\mathbb{T}^2$ and take the horizontal side $[0, 1] \times \{0\} \subset \mathbb{T}^2$, the Poincaré map is the (rigid) rotation $R_\alpha : [0, 1] \to [0, 1]$ given by $x \mapsto R_\alpha(x) = x + \alpha \mod 1$ (where $\alpha = \cot\theta$, see (1)). More generally, if we start from a flow $\varphi_{\mathbb{R}} := (\varphi_t)_{t \in \mathbb{R}}$ on a torus, i.e., on a compact, orientable surface $S$ of genus one, and assume that it does not have fixed points or closed orbits (or more generally Reeb components; see Nikolaev and Zhuzhoma (1999)), there is a (global) section given by a closed transverse curve and the Poincaré first return map to it is a diffeomorphism $f : S^1 \to S^1$ of the circle $S^1 \cong \mathbb{R}/\mathbb{Z}$.

Interval exchange transformations (IETs) As in the case of genus one, an essential tool to study a higher genus flow is to consider a (local) transversal $I \subset S$ to the flow and the Poincaré first return map $T$ of the flow on $I$ (when it is defined, for example, almost everywhere when the flow preserves a finite measure with full support; see more generally (Nikolaev and Zhuzhoma 1999)).
Consider first a linear flow on a translation surface. In this case, Poincaré maps are piecewise isometries of an interval, known as interval exchange transformations (or for short, IETs): a one-to-one map $T : I \to I$ of $I = [0, 1)$ is a (standard) interval exchange transformation of $d \ge 2$ intervals, or $d$-IET for short, if one can partition $I$ into intervals $I_1, \ldots, I_d$ so that the restriction $T_i$ of $T$ to $I_i$, for each $1 \le i \le d$, is a translation. Thus, for any $I_i$ there exists $\delta_i \in [-1, 1]$ such that $T(x) = x + \delta_i$ for any $x \in I_i$. Notice that a rotation $R_\alpha$ can be seen as a 2-IET, which exchanges the intervals $[0, 1-\alpha)$ and $[1-\alpha, 1)$. Thus IETs provide a natural generalization of circle rotations and play for linear flows in higher genus an analogous role to rotations for linear flows on tori. In general, the images $T(I_1), \ldots, T(I_d)$ are exchanged intervals which form a new partition of $[0, 1)$. A $d$-IET is completely determined by two data, namely a combinatorial datum $\pi$ which determines the order of the exchanged intervals in $[0, 1)$ (this can be a permutation of $\{1, \ldots, d\}$ or a pair of permutations of an alphabet of cardinality $d$ - see Yoccoz (2010) - a convention which is often useful to study fine chaotic properties of IETs) and a length vector $\lambda = (\lambda_1, \ldots, \lambda_d)$ which belongs to the simplex

$$\Delta_d := \Big\{ \lambda \in \mathbb{R}^d_+,\ \lambda_i \ge 0 \text{ for all } 1 \le i \le d,\ \sum_{i=1}^{d} \lambda_i = 1 \Big\}. \qquad (3)$$

Then the corresponding IET $T = T_{(\pi,\lambda)}$ exchanges the intervals

$$I_i = [l_i, r_i) = \Big[ \sum_{0 \le j < i} \lambda_j,\ \sum_{0 \le j \le i} \lambda_j \Big) \quad \text{for } i = 1, \ldots, d$$

according to the permutation $\pi$ (Fig. 8b). We say that the combinatorial datum is irreducible if the indexes $\{1, \ldots, d\}$ of subintervals cannot be decomposed into two subsets $\{1, \ldots, k\}$ and $\{k + 1, \ldots, d\}$ with $1 \le k < d$ which are exchanged by themselves.
More generally, given any (smooth) flow $\varphi_{\mathbb{R}}$ on $S$, not necessarily preserving a smooth invariant measure, first return maps $T : I \to I$ are one-to-one piecewise diffeomorphisms known as generalized interval exchange transformations: a map $T : I \to I$ is a generalized interval exchange transformation or, for short, a GIET (Fig. 8a), if one can partition $I$ into intervals $I_1, \ldots, I_d$ (finitely many since we are assuming that $\varphi_{\mathbb{R}}$ has finitely many fixed points) so that the restriction $T_i$ of $T$ to
Special flows representations Linear flows and minimal components of area-preserving flows can be conveniently described using special flows. These representations provide a concrete tool to work with them and study their fine ergodic and spectral properties. Given a Poincaré map of a flow to a section and the additional information of the return time to the section, special flows allow reconstructing (a measurable conjugated version of) the flow. We recall the construction for our special setting.
Let $T : I \to I$ be an (ergodic) IET and let $r : I \to \mathbb{R}_{>0} \cup \{+\infty\}$ be an integrable function such that $\underline{r} = \inf_{x \in I} r(x) > 0$. The special flow

so that $\varphi^{T,r}_t(x, s) = \big(T^n x,\ s + t - r^{(n)}(x)\big)$, where $r^{(n)}(x)$ denotes the Birkhoff sums cocycle, i.e., the additive cocycle defined by

$$r^{(n)}(x) := \sum_{0 \le k < n} r(T^k x) \quad \text{if } n \ge 0,$$
$$r^{(n)}(x) := -\sum_{n \le k < 0} r(T^k(x)) \quad \text{if } n < 0,$$

associated to $r$ and $n$ is the unique integer number with $r^{(n)}(x) \le s + t < r^{(n+1)}(x)$. It describes the motion of a point in $(x, s) \in I^r \subset I \times \mathbb{R}$ along vertical trajectories, modulo the identification of each point $(x, r(x))$, $x \in I$, with the point $(Tx, 0)$; see Fig. 9.
Special representations of linear flows Linear flows on translation surfaces can be represented as special flows over IETs, under roof functions which are piecewise constant: more precisely, the return time $r : I \to \mathbb{R}_{>0}$ of a linear flow on a translation surface $S$ to a transverse section $I \subset S$ is constant on each continuity interval $I_i$ of the IET $T = T_{(\pi,\lambda)} : I \to I$ induced as Poincaré map; see Fig. 9. Then the roof function can be identified with a vector $r \in \mathbb{R}^d_{>0}$. As noted by Veech (1982b), the vector $r$ is in a $2g$-dimensional subspace $H(\pi) \subset \mathbb{R}^d$, where $g$ is the genus of the surface $S$.
Smooth time-changes of the linear flow produce flows which can still be represented as special flows over $T$, under a roof $r$ which is piecewise smooth on each $I_i$ and extends smoothly to its closure; see Fig. 10a.

Special representation of Arnold flows It is well known that (minimal components of) locally Hamiltonian flows can be represented as special flows over IETs, but in this case the function $r$ is singular, i.e., blows up at (some) endpoints of continuity intervals (see, e.g., Ulcigrai (2011), Ravotti (2017), Conze and Frączek (2011), and Frączek and Ulcigrai (2012)). The nature of the singularities of $r$ turns out to depend crucially on the nature and type of fixed points of the surface flow. The first to remark on the importance of such representation and the nature of the singularities for the study of chaotic properties (in particular mixing) was Arnold. In Arnold (1991) he states that what we nowadays call Arnold flow, i.e., a minimal component of a locally Hamiltonian flow on a torus with a center and simple saddle, can be represented as a special flow over a rotation $R_\alpha : [0, 1] \to [0, 1]$ under a roof function of the form

$$r(x) = C\,|\log(x)| + 2C\,|\log(1 - x)| \qquad (4)$$

for some $C > 0$. This function is said to have logarithmic singularities at the endpoints of $[0, 1]$; see Fig. 10b. The singularities are furthermore called asymmetric, since the constants $2C \neq C$ are different. The singularities at $x = 0$ and $x = 1$ correspond to the trajectory which is a separatrix, i.e., ends in the saddle. Because of the nature of the Hamiltonian local parametrization of a saddle, it takes infinite time to reach the saddle and the motion along trajectories which are close to the separatrix is slowed down logarithmically; see, e.g., (Frączek and Ulcigrai 2012, Appendix A) for a calculation. The fact that the constant $2C$ is exactly the double of the other constant $C$ can be explained because trajectories on one side of the saddle loop pass twice near the saddle, while on the other side only once.

Special representation of locally Hamiltonian flows with simple saddles Singularities which are simple saddles (standard saddles with
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 10 Special representation of flows
4-prongs, as in Fig. 4b) produce singularities of the roof function as several singularities which, like in the case of Arnold flows, have logarithmic nature in the following sense. We say that a function $r : I \to \mathbb{R}$ for an IET $T_{(\pi,\lambda)}$ has logarithmic singularities if there exist constants $C_i^+, C_i^- \in \mathbb{R}$, $i = 1, \ldots, d$, and a function $g_r$ absolutely continuous on the interior of each interval $I_i$, $i = 1, \ldots, d$ (i.e., with the notation that we will introduce later, a function $g_r \in \mathrm{AC}(\sqcup_{i=1}^{d} I_i)$) such that

$$r(x) = -\sum_{i=1}^{d} C_i^+ \log\big(|I|\,\{(x - l_i)/|I|\}\big) - \sum_{i=1}^{d} C_i^- \log\big(|I|\,\{(r_i - x)/|I|\}\big) + g_r(x). \qquad (5)$$

See Fig. 11 for an example. Consider now either a minimal locally Hamiltonian flow $\varphi_{\mathbb{R}}$ on $S$ or the restriction of a locally Hamiltonian flow on $S$ to a minimal component $S' \subset S$. Let $\eta$ be the associated closed 1-form and assume that $\eta$ is Morse (the corresponding local Hamiltonian is a Morse function), so all saddles of the flow $\varphi_{\mathbb{R}}$ are simple. Then $\varphi_{\mathbb{R}}$ can be shown to be (measure theoretically) isomorphic to a special flow $T^r : I^r \to I^r$ over an interval exchange transformation $T : I \to I$ of $d \ge 1$ intervals and under a roof $r$ with logarithmic singularities. The number of exchanged intervals is $d = 2g + s - 1$ in the case when $\varphi_{\mathbb{R}}$ is minimal and $s$ is the number of simple saddles, or, for a minimal component $S'$, $d = 2g' + s' - 1$, where $g'$ is the genus of $S'$ and $s'$ is the number of (simple) saddles in the closure of $S'$.
We say that the logarithmic singularities are of geometric type if at least one among $C_d^-$ and $C^-_{\pi^{-1}(d)}$ is zero and at least one among $C_1^+$ or $C^+_{\pi^{-1}(1)}$ is zero (as shown in the examples in Fig. 11). We denote by $\mathrm{LG}(\sqcup_{i=1}^{d} I_i)$ the space of functions with logarithmic singularities of geometric type. One can furthermore show that the roof functions arising from suitably chosen special representations (namely when the section is standard, i.e., both endpoints belong to saddle separatrices) are of geometric type. This notion plays a crucial role in some results on locally Hamiltonian flows; see, e.g., (Frączek and Ulcigrai 2012, 2021).

Symmetric and asymmetric logarithmic singularities When all the saddles are simple, and hence the roof has logarithmic singularities, it is crucial, for understanding finer chaotic features, to distinguish between symmetric or asymmetric singularities, in the following sense. Let $\mathrm{LSG}(\sqcup_{i=1}^{d} I_i)$ be the subspace of functions with geometric type logarithmic singularities $\mathrm{LG}(\sqcup_{i=1}^{d} I_i)$ which in addition satisfy the symmetry condition

$$\sum_{i=1}^{d} C_i^- - \sum_{i=1}^{d} C_i^+ = 0. \qquad (6)$$
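The symmetry condition (6) is a statement about the constants in front of the logarithms only, so it is easy to test for a given roof. The sketch below is a minimal illustration in Python under several assumptions that are not from the text: the names `log_roof` and `asymmetry` are hypothetical, the base interval is $I = [0, 1)$ so that $|I| = 1$ in (5), the absolutely continuous part $g_r$ defaults to zero, and the Arnold roof (4) is encoded by treating $[0, 1)$ as a single interval with $C^+ = C$ at the left endpoint and $C^- = 2C$ at the right endpoint.

```python
import math


def log_roof(x, left, right, c_plus, c_minus, g=lambda x: 0.0):
    """Roof with logarithmic singularities as in (5), for I = [0, 1).

    left[i], right[i] are the endpoints l_i, r_i; c_plus[i], c_minus[i] are
    the constants C_i^+, C_i^-. The function is undefined (it blows up)
    when x coincides with an endpoint carrying a non-zero constant.
    """
    frac = lambda y: y - math.floor(y)  # fractional part {y}
    total = g(x)
    for l, r_end, cp, cm in zip(left, right, c_plus, c_minus):
        if cp:
            total -= cp * math.log(frac(x - l))
        if cm:
            total -= cm * math.log(frac(r_end - x))
    return total


def asymmetry(c_plus, c_minus):
    """Left-hand side of the symmetry condition (6); zero means symmetric."""
    return sum(c_minus) - sum(c_plus)


# Arnold-type roof (4) with C = 1: singularities at 0 and 1, constants 1 and 2.
C = 1.0
print(log_roof(0.25, [0.0], [1.0], [C], [2 * C]))
print(asymmetry([C], [2 * C]))            # non-zero: asymmetric
# A choice of constants on two intervals whose sums balance.
print(asymmetry([1.0, 0.5], [0.5, 1.0]))  # zero: symmetric
```

For the Arnold roof the asymmetry equals $C$, which is the quantitative form of the statement that the two constants $C$ and $2C$ in (4) differ.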
representation of an Arnold flow, while minimal flows with only nondegenerate saddles give roofs with symmetric logarithmic singularities; cf. also Blokhin flows.

Degenerate saddles and power-singularities Consider now degenerate saddles (i.e., saddles with $2k$ prongs, $k > 2$; see Fig. 4c for $k = 3$). These saddles produce stronger, power-type singularities of the roof function; see Kočergin (1975). We say that $l_i$ or $r_i$ is a power-like singularity of power-type $0 < \gamma < 1$ if there exists a constant $C_i^{\pm}$ such that

$$\lim_{x \to l_i^+} (x - l_i)^{\gamma}\, r(x - l_i) = C_i^+; \qquad \lim_{x \to r_i^-} (r_i - x)^{\gamma}\, r(r_i - x) = C_i^-. \qquad (7)$$

Thus $r$ behaves like the functions $C_i^{\pm}/x^{\gamma}$ in a neighborhood of $l_i$ or $r_i$. As a degenerate special case, a stopping point (fake saddle) introduces a single symmetric power-singularity of the roof function.

Time-changes through special representations One can show that two special flows over the same transformation $T$ under two different roof functions are one a time-change of the other. Thus, comparing the special representations results recalled in this section for linear flows and minimal (components of) locally Hamiltonian flows, one can show that the latter are time-changes of translation flows via a singular reparametrization (which amounts to changing an $r$ which is piecewise constant to one which has singularities).

Von Neumann flows Another class of special flows over IETs which has been studied in the literature are so-called von Neumann flows. The name is used to denote special flows over rotations or IETs under a function $r$ which is piecewise absolutely continuous (and continuous on each continuity interval of the IET) with non-zero sum of jumps, i.e., $\int r'(x)\,dx \neq 0$; see Frączek and Lemańczyk (2009a). These were first studied by von Neumann (1932) in the case of rotations in the base, hence the name. Von Neumann flows also admit an interpretation as special representations of area-preserving surface flows. Consider a locally Hamiltonian flow which is non-minimal: assume that there are saddle connections which separate the surface into subsurfaces with boundary, each of which is a periodic or a minimal component. Note that due to the presence of fixed points, the transition times for the flow along the saddle connections are infinite. Using a reparametrization of the flow that results in significant acceleration around fixed points, described in Frączek and Lemańczyk (2009b), we obtain a flow for which all transition times along saddle connections are finite. Then, each minimal component of such reparametrization admits a representation as a von Neumann flow. Furthermore, the sum of jumps of the roof is related to the sum of transition times (summed according to orientation) along the saddle connections forming the boundary of the minimal component (cf. Conze and Frączek (2011)).

Open sets and typical behavior In order to describe the dynamical behavior of a generic or typical (area-preserving) surface flow, one can introduce the following topologies and measures or measure classes (see below) on the space of IETs, linear flows, and locally Hamiltonian flows and distinguish between open sets with different typical dynamical properties.

Almost every IET and almost every linear flow Translation surfaces and their linear flows, as well as their Poincaré sections, IETs, are described by finite-dimensional spaces. We say that a result holds for almost every $d$-IET on a unit interval if it holds for all irreducible data $\pi$ on $d$ symbols and almost every length vector $\lambda$ in the simplex $\Delta_d$; see (3). Since linear flows can be represented as special flows over IETs under piecewise constant roof functions, it is sufficient to add $2g$ data (related to the values of the return time on each continuity interval). A result holds for almost every linear flow if it holds for a.e. IET in the base and a.e. choice of $r \in \mathbb{R}^d_{>0} \cap H(\pi)$. Translation surfaces (up to cut and paste
The open sets $\mathcal{U}_{\min}$ and $\mathcal{U}_{\neg\min}$ To classify chaotic behavior in locally Hamiltonian flows it is crucial to distinguish between two (complementary, up to measure zero) open sets. Remark that if the flow $\varphi_{\mathbb{R}}$ given by a closed 1-form $\eta$ has a saddle loop homologous to zero (i.e., the saddle loop is a separating curve on the surface), then the saddle loop is persistent under small perturbations (see Sect. 2.1 in Zorich (1999) or Lemma 2.4 in Ravotti (2017)). In particular, the set of locally Hamiltonian flows which have at least one saddle loop is open and gives the set denoted $\mathcal{U}_{\neg\min}$ above. Flows in the open set $\mathcal{U}_{\neg\min}$ admit a non-trivial decomposition into periodic components and minimal components. The second open set, which we call $\mathcal{U}_{\min}$, is given by the interior (which one can show to be non-empty) of the complement of $\mathcal{U}_{\neg\min}$, i.e., the set of locally Hamiltonian flows without saddle loops homologous to zero. One can show that saddle loops non-homologous to zero (and saddle connections) vanish after arbitrarily small perturbations and neither the set of 1-forms with saddle loops non-homologous to zero (or saddle connections) nor its complement is open (see Ravotti (2017) for details).

Full measure of minimality In view of Keane's criterion for minimality (Keane 1975), one can show that the set of non-minimal IETs has measure zero and, furthermore, Hausdorff codimension 1. In view of the definition of measure class on locally Hamiltonian flows, we therefore deduce that:

Theorem 1 In $\mathcal{U}_{\min}$, the typical flow is minimal (in particular there are no centers and there is a unique minimal component) and the typical flow on $\mathcal{U}_{\neg\min}$ is minimal when restricted to each component which is not a periodic component (bounded by saddle loops homologous to zero).

Special flows representations in $\mathcal{U}_{\min}$ and $\mathcal{U}_{\neg\min}$

The typical locally Hamiltonian flow in $\mathcal{U}_{\min}$ admits a representation as a special flow over a (minimal) IET under a roof with logarithmic singularities (see (5)) and the roof satisfies the symmetry condition (6). For a typical flow in $\mathcal{U}_{\neg\min}$, the restriction of the flow on each minimal component admits a representation as a special flow over a minimal IET under a roof with asymmetric logarithmic singularities (like in the case of the Arnold flow; see (4)). See, for example, (Ravotti 2017).

Invariant Measures and (Unique) Ergodicity

Let us discuss the existence and finiteness of ergodic measures, (unique) ergodicity, and non-uniquely ergodic examples.

Invariant measures: existence and finiteness Given a (topological) flow $\varphi_{\mathbb{R}}$ on $S$, since we are assuming that $S$ is compact, the existence of a finite (probability) invariant measure is guaranteed by the Krylov-Bogolyubov theorem. Katok shows in (Katok 1973) that any topologically transitive surface flow with non-degenerate (Morse) saddles has a probability invariant measure with non-trivial support and no atoms, positive on open sets.
When $S$ is endowed with a reference smooth area form $\omega$ and $\varphi_{\mathbb{R}}$ is assumed to be smooth (at least outside the singularity set), it is natural to ask about the existence of a (finite) invariant measure which is absolutely continuous with respect to $\omega$ and in particular of measures which have a smooth (or at least differentiable) density. If the flow can be linearized, i.e., conjugated to a linear flow, via a differentiable or smooth conjugacy, such measures can be obtained by pull-back of the Lebesgue measure via the conjugacy.

Linearization questions and results Questions about linearization and regularity of the conjugacy are hard and largely open. Marmi et al. showed in Marmi et al. (2012) that for almost every linear flow $\varphi_{\mathbb{R}}$ on $S$, for any $r \ge 2$, among $C^{r+3}$-perturbations supported outside the singularity set $\Sigma \subset S$, those which are linearizable with a $C^r$ conjugacy form a submanifold of finite
codimension (g − 1)(2r + 1) + s, where g denotes the genus of the surface and s denotes the number of singularities. In Marmi et al. (2012), it is conjectured that for r = 1 those which are C^1-linearizable form a submanifold of codimension 3g + s − 3. This was proved by Ghazouani (2021) for a measure-zero special case (hyperbolic periodic-type IETs).

In very recent work, Ghazouani and Ulcigrai (2021) have proved a rigidity theorem for foliations associated to smooth flows with Morse saddles on surfaces (at the moment only of genus 2): for almost all (a.a.) smooth flows with respect to the Katok fundamental class, if the corresponding foliation is topologically conjugate to an orientable measured foliation of that fundamental class, then it is C^1 conjugate. This result confirms (in genus 2) another conjecture of Marmi et al. (2012).

Bounds on the number of ergodic invariant measures. It is a general fact that finite (resp. probability) invariant measures form a cone (resp. a simplex) generated by ergodic measures. In our setting, the number of independent ergodic probability measures turns out to be finite: indeed, as observed by Oseledec (1966), the number of independent ergodic invariant measures for a d-interval exchange transformation T is bounded by d (see, e.g., Yoccoz (2010) for a proof). In fact, Oseledets estimated the maximal spectral multiplicity of any d-IET from above by d. This can be seen by showing that a d-IET has rank d.

Let us assume that φ_ℝ preserves a smooth invariant probability measure and is minimal (or restrict it to a minimal component S_0 ⊂ S). By considering the (minimal) interval exchange transformations which appear as Poincaré maps of φ_ℝ to sections (after linearizing the GIET exploiting the invariant measure induced by the smooth invariant measure for φ_ℝ), one can deduce from the upper bound for IETs that the number of independent ergodic invariant probability measures for φ_ℝ is also at most d, where d is the number of exchanged intervals and hence can be chosen (by taking the section to be normal, i.e., with endpoints on separatrices) to be d = 2g + s − 1, where g is the genus of S and s is the cardinality of fixed points.

A sharper bound was proved by Katok (1973), who showed that the number of invariant measures is at most g ≤ [d/2], where g is the genus of S (or of the minimal component S_0). The proof of the dimension bound uses the idea of Katok fundamental class and the symplectic structure on the absolute homology H_1(S, ℝ) given by the intersection form; see also Forni (2002) and McMullen (2020).

First non-uniquely ergodic examples. Katok's upper bound is optimal, as was shown by Sataev (1975), who constructed examples of flows on S with a cone of invariant measures of arbitrary dimension less than or equal to the genus g. An example of a minimal non-uniquely ergodic IET with d = 4 was discovered by Keane (1977) as a counterexample to an early form of the so-called Keane conjecture (see below), which stated that all minimal interval exchanges are (uniquely) ergodic. Earlier examples also appeared in the work of Katok and Stepin (1966) and an example with d = 5 was built by Keynes and Newton (1976) building on Veech (1969). A modern revisitation of Keane's counterexample (which was produced with an ad-hoc self-induction procedure) can be obtained using the tools given by Rauzy-Veech induction (which were developed only later). Exploiting this point of view, in the lecture notes (Yoccoz 2010), examples of minimal non-uniquely ergodic IETs are produced in any Rauzy class.

Another question which has been investigated recently for non-uniquely ergodic IETs is genericity, i.e., the existence of an (a fortiori non-uniquely ergodic) IET T and a point x ∈ I whose orbit under T equidistributes for a measure that is not ergodic. A d-IET with these properties has been constructed by Chaika and Masur (2015).

Keane conjecture and typical unique ergodicity. Keane conjectured in Keane (1977) that almost every IET is uniquely ergodic. The Keane conjecture (as it became known) was proved in 1982 at the same time by Masur (1982) and Veech (1982b). Both proofs exploit renormalization and introduced seminal ideas and techniques and are considered early milestones of the successful
application of Teichmüller dynamics to the study of IETs and translation surfaces; see, e.g., the surveys Chaika and Weiss (2022) or Zorich (2006). While Veech works directly on IETs, developing an induction algorithm known as Rauzy-Veech induction, Masur shows that almost every linear flow (and hence a.e. IET) is uniquely ergodic by using geometric arguments and the Teichmüller geodesic flow on the moduli space of translation surfaces. Later, in 1992, Masur proves in Masur (1992) the celebrated Masur criterion for unique ergodicity, which shows that unique ergodicity follows from the assumption that the Teichmüller geodesic starting from the given translation surface is not divergent in moduli space.

Ergodicity everywhere, in almost every direction. An important strengthening of the results of Masur and Veech was later proved by Kerckhoff et al. (1986):

Theorem 2. For every translation surface, the linear flow in almost every direction is uniquely ergodic.

This result covers in particular billiards in rational polygons, which can be unfolded (according to the Zemljakov & Katok construction in Zemljakov and Katok (1975); see also Fox and Kershner (1936)) to linear flows on (a measure zero set of) translation surfaces.

Consequences for locally Hamiltonian flows. The first examples of smooth flows on surfaces which are ergodic were the so-called Blokhin examples (see Blohin (1972)). These are measure zero examples since they are essentially glued out of genus one subsurfaces. Since ergodicity of a (minimal component of a) locally Hamiltonian flow is equivalent to ergodicity of one (and hence any) interval exchange transformation which appears as the Poincaré map, a result on ergodicity of the typical locally Hamiltonian flow can be deduced from the proof of Keane's conjecture by Masur and Veech and the relation between the two notions of full measure. We obtain:

Theorem 3. Almost every locally Hamiltonian flow in U_min is not only minimal, but also ergodic. For almost every flow in U_¬min, the restriction of the flow on each minimal component is ergodic.

Due to the presence of saddle fixed points, locally Hamiltonian flows (their restrictions to minimal components) on higher genus surfaces are never uniquely ergodic.

Hausdorff dimension of non-uniquely ergodic directions. Both Masur (1992) and Masur and Smillie (1991) study the exceptional set NU(S) of directions on a given translation surface S which are minimal but fail to be uniquely ergodic. They show not only that this set has measure zero by the proof of Keane's conjecture, but also that it is small from the point of view of Hausdorff dimension:

Theorem 4. For every translation surface S, the Hausdorff dimension of the set NU(S) of non-uniquely ergodic directions does not exceed 1/2. Furthermore, for any connected component C in the moduli space of translation surfaces with g > 1 there is a constant c > 0, called the Masur-Smillie constant of component C, such that for almost every translation surface S ∈ C the Hausdorff dimension of NU(S) is exactly c.

Sporadic examples of non-uniquely ergodic interval exchanges were found, as already mentioned, by several authors before the proof of Keane's conjecture (Keane (1977); Keynes and Newton (1976)).

The slit-tori g = 2 example. A much studied example of a translation surface where one can produce and study the exceptional set NU of non-uniquely ergodic directions is the surface of genus two obtained by gluing two identical flat tori along a slit; see Fig. 12. This surface, introduced by Masur and Smillie, can be used to give a geometric presentation related to previous work of Veech (1969) on skew products over rotations. Cheung showed in Cheung (2003) that the upper bound of 1/2 for the Hausdorff dimension of NU is achieved in these examples. A full dichotomy was later proved by Cheung et al. (2011) that
i.e., linear functionals D_i, 1 ≤ i ≤ g, on Sobolev observables. By obstruction one means here that if D_i(f) ≠ 0 for some 1 ≤ i ≤ g, then f cannot be a coboundary. The value of the first distribution is simply the integral ∫_S f dm, so in genus g = 1 this is the only condition and is automatically satisfied for mean-zero observables. Furthermore, in Forni (2021) he shows that for a.e. smooth minimal area-preserving flow, when f belongs to the kernel of all D_i, 1 ≤ i ≤ g, f is indeed a coboundary.

The cohomological equation for IETs. Inspired by the work of Forni (1997), Marmi et al. (2005) considered the cohomological equation over an IET T, rediscovered the obstructions in this setting, and described a full measure condition (that they called Roth-type condition) that guarantees that after removing the obstructions, the cohomological equation can be solved. More precisely, they show that for a Roth-type IET T : I → I, given an observable v : I → ℝ, absolutely continuous on each continuity interval I_i and with the derivative of bounded variation and ∫_I v′(x) dx = 0 (zero sum of jumps), one can find w : I → ℝ piecewise constant on each I_i and a continuous u : I → ℝ such that

\[ u \circ T - u = v - w. \qquad (9) \]

The function w, which belongs to a d-dimensional space, provides the obstruction for v to be a coboundary. In Marmi and Yoccoz (2016), it was later shown that u is Hölder continuous. Further generalizations of this result were proved by Marmi et al. (2020) (who show how to modify the proof to assume a weaker condition, called absolute Roth-type); Forni et al. (2017) (who deal with the case of some zero Lyapunov exponents); and Lanneau et al. (2021) (who considered the case of T linear involution).

Power deviations of ergodic averages. Zorich in the 1990s discovered experimentally (by considering IETs and Birkhoff sums of characteristic functions of continuity intervals) that for almost every linear flow in higher genus (g ≥ 2), the integrals I_T(f, x) (in the special case of observables corresponding to cohomology classes) display polynomial oscillations, i.e., for almost every initial point |I_T(f, x)| ∼ O(T^ν) in the sense that

\[ \limsup_{T \to \infty} \frac{\log |I_T(f, x)|}{\log T} = \nu \qquad (10) \]

for some exponent 0 < ν < 1. This phenomenon, known as polynomial deviations of ergodic averages, was soon after explained in seminal work by Kontsevich (1997) and Zorich (1997), who related power deviations to Lyapunov exponents of renormalization and proposed a finer description of polynomial deviations which became known as the Kontsevich-Zorich conjecture.

The Kontsevich-Zorich conjecture. Kontsevich & Zorich conjectured in Kontsevich and Zorich (1997) that the phenomenon of polynomial deviations holds for the ergodic integrals I_T(f, x) of any smooth observable (and not only those coming from cohomology classes, which can be reduced to the study of Birkhoff sums over the IET obtained as Poincaré section of the flow for an observable that is a characteristic function of a continuity interval, i.e., the setting originally considered in Zorich (1997)). They conjectured furthermore that ergodic integrals display a power spectrum of oscillatory behaviors, i.e., there are exactly g positive exponents 0 < ν_g ≤ ⋯ ≤ ν_2 < ν_1 ≔ 1 (which correspond to the positive Lyapunov exponents of renormalization) and, for each, a subspace of finite codimension of smooth observables that present polynomial deviations as above with exponent ν = ν_i. Forni (2002) could prove the bulk of this conjecture for linear flows on translation surfaces and for integrals of sufficiently regular functions, by showing there are indeed g positive exponents. His results also apply to locally Hamiltonian flows in U_min and to sufficiently regular observables which vanish at the singularities. The simplicity of the spectrum, namely that the g exponents are all distinct, was later proved in Avila and Viana (2007).

Bufetov functionals and limit shapes. A finer analysis of the behavior of Birkhoff sums or integrals, beyond the size of oscillations, appears in
Bufetov (2014). Bufetov (2014) shows in particular that (for typical translation flows and sufficiently regular observables) the asymptotic behavior of ergodic integrals can be described in terms of g (where g is the genus of the surface) cocycles F_i(t, x), 1 ≤ i ≤ g (also called Bufetov functionals): each F_i : ℝ × S → ℝ is a cocycle over the flow φ_ℝ (in the sense that F_i(t + s, x) = F_i(t, x) + F_i(s, φ_t(x)) for any x ∈ S and t ∈ ℝ), F_1(T, x) = T and each F_i has power deviations |F_i(T, x)| ∼ O(T^{ν_i}) with exponent ν_i. Together, the cocycles encode the asymptotic behavior of the ergodic integrals up to sub-polynomial behavior, in the sense that for some constants c_i = c_i(f),

\[ \int_0^T f(\varphi_t(x))\, dt = c_1 T + c_2 F_2(T, x) + \dots + c_g F_g(T, x) + \mathrm{Err}(f, T, x), \qquad (11) \]

where for every x ∈ S with regular orbit, the quantity Err(f, T, x) is an error term which grows subpolynomially, i.e., for any ϵ > 0 there exists C_ϵ > 0 such that

\[ |\mathrm{Err}(f, T, x)| \le C_\epsilon T^{\epsilon}. \]

Using these results, Bufetov could also prove some limit theorems for translation flows, in particular the convergence along (exponentially sparse) subsequences of the distribution of ergodic integrals of regular observables, when the initial point x ∈ S is randomized (see Bufetov (2014) for statements).

Deviations phenomena due to singularities. Frączek and Ulcigrai (2021) gave a new proof of the existence of a power deviation spectrum and asymptotic cocycles for smooth observables over locally Hamiltonian flows with Morse singularities, which extends the Bufetov-Forni results to smooth observables which do not vanish at singularities as well as to flows in U_¬min. Their approach, inspired by Marmi et al. (2005), provides also a description of the full measure set of locally Hamiltonian flows in terms of a Diophantine-like condition. Frączek and Kim (2021) pushed a similar approach to treat the case of degenerate saddles and discovered the existence of a new type of power deviations associated to smooth observables that do not vanish at some degenerate saddles. The new exponents are determined by the jet of the observable at the degenerate singularity.

Mixing Properties

Finer chaotic properties such as mixing or weak mixing (as well as, more generally, spectral properties) crucially depend not only on the orbits of the flow, but also on the speed of motion along the orbits (i.e., the time-parametrization). In particular, they are very different for linear flows and their time-changes and locally Hamiltonian flows. For the latter, mixing or its absence depends crucially on the type of singularities.

Weak mixing in linear flows and IETs. Weak mixing is an important and much investigated question for linear flows and IETs, in view of the lack of mixing in this setting.

Absence of mixing. Katok proved already in the 1980s (Katok (1980)) that interval exchange transformations are never mixing (generalizing earlier work with Stepin; see Katok and Stepin (1967), [Remark 8.1] in the special case of 3-IETs). In the proof, Katok shows (using a simple combinatorial argument which exploits that the induced map of any d-IET on a subinterval is again an IET of at most d + 2 subintervals) that any d-IET is partially rigid, i.e., there exists 0 < α < 1 (depending on d only) and a diverging sequence of times (n_k)_k such that for every measurable set A ⊂ I,

\[ \mathrm{Leb}(A \cap T^{n_k} A) \ge \alpha\, \mathrm{Leb}(A), \quad \forall k \in \mathbb{N}. \qquad (12) \]

From this it follows also that IETs have no mixing factor. In Katok (1980) it is furthermore shown, exploiting partial rigidity of IETs, that any special flow over an IET under a roof of bounded variation is not mixing. This implies in particular that no linear flow, nor any smooth time-change of a linear flow, can be mixing. In the special case of special flows over rotations under smooth roofs
(or equivalently smooth time-changes of flows on 𝕋²) absence of mixing was proved earlier, in Kočergin (1972).

Full measure of weak mixing. Recall that the ergodic theoretic property of weak mixing, by a classical result of Halmos (1960), is topologically generic while mixing is not. In Veech (1984) it was conjectured that almost every IET should be weak mixing, unless it is essentially a rotation (since rotations have eigenvalues and a discrete spectrum), i.e., if the combinatorial datum π is not that of a rotation with fake singularities. These IETs are called of rotation type.

Veech (1984) introduced a condition on the combinatorial datum π of an IET (called W-condition) under which he could prove full measure of weak mixing. As an example, when d is odd, the π which exchanges the order, i.e., maps i ↦ d − i + 1 for 1 ≤ i ≤ d, is of type W. Geometrically, the type W condition is essentially equivalent to asking that the IET is a Poincaré section of a flow on a (minimal component of a) surface which has more than one saddle point.

Boshernitzan and Nogueira (2004), [Theorem 5.3] showed that all linearly recurrent IETs (or equivalently, bounded-type IETs, which have measure zero) with combinatorial datum of type W are weakly mixing. Notice that when π is not of type W, bounded-type IETs do not have to be weak mixing; see, for example, the examples of 4-IETs produced in Hmili (2010): these examples have π which is not of type W and are not weak mixing, since they have a nonconstant eigenfunction. One can verify furthermore that they are linearly recurrent when the eigenvalue is badly approximable.

Nogueira and Rudolph could prove already in the 1990s that almost every IET with π not of rotation type is topologically weak mixing, i.e., has no nonconstant continuous eigenfunction (see Nogueira and Rudolph 1997) (in contrast with weak mixing, which is equivalent to the absence of nonconstant measurable eigenfunctions). An unrestricted proof of almost everywhere weak mixing for IETs not of rotation type, both for almost every linear flow and for almost every IET, was given 10 years later by Avila and Forni (2007). It exploits deep results from Teichmüller dynamics, in particular positivity of the second Lyapunov exponent of the Teichmüller flow (Forni 2002) and a parameter exclusion argument, which is very delicate for the case of IETs, but much simpler in the case of linear flows (see the Appendix of Avila and Forni (2007)) on a.e. translation surface.

Weak mixing explicit examples. The proof by Avila and Forni (2007) is not constructive. Explicit examples of weak mixing IETs (of periodic type, i.e., self similar) were constructed in Sinai and Ulcigrai (2005) and Ferenczi and Zamboni (2011). Linearly recurrent (or bounded-type) IETs are also explicit (and include in particular periodic-type IETs); therefore also linearly recurrent IETs with combinatorial datum π of type W provide explicit examples of weak mixing IETs by Boshernitzan and Nogueira (2004).

Translation surfaces obtained by unfolding of a rational billiard (which constitute a measure zero subset) on which the linear flow is weak mixing for a.e. direction were constructed by Avila and Delecroix (2016) (in the class of Veech surfaces).

Hausdorff dimension of non-weak mixing. Once it has been shown that weak mixing IETs or linear flows have full measure, one can study the Hausdorff dimension of the complement. The parameter exclusion argument in Avila and Forni (2007) shows some estimates on the Hausdorff dimension of translation surfaces for which the vertical linear flow fails to be weak mixing. Avila and Leguil (2018) studied the Hausdorff dimension of the set of non-weakly mixing IETs. They showed that its Hausdorff dimension is strictly less than d − 1, where d is the number of exchanged intervals. It follows from Chaika and Masur (2020) that the Hausdorff dimension is at least d − 3/2. Combining the work of Chaika and Masur (2020) and Al-Saqban et al. (2021), one can show that the set of non-weakly mixing IETs with permutation of type W has dimension d − 3/2 (see Chaika and Masur (2020)).
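To make the objects in this discussion concrete, the following is a minimal sketch of a d-IET with the order-reversing combinatorial datum i ↦ d − i + 1 (which is of type W when d is odd), together with a Birkhoff sum along an orbit. The function names `make_iet`, `reversing_perm` and `birkhoff_sum`, the 0-based indexing, and the sample lengths are illustrative assumptions and do not come from the works cited above; with floating-point lengths the orbit is only approximate, so the printed values should be read as an experiment rather than a result.

```python
from bisect import bisect_right
from fractions import Fraction
from itertools import accumulate
import math


def reversing_perm(d):
    """0-based version of the order-reversing datum i -> d - i + 1."""
    return [d - 1 - i for i in range(d)]


def make_iet(lengths, perm):
    """Build the IET on [0, sum(lengths)) that translates interval i
    (of length lengths[i]) to position perm[i] in the new order.
    Returns the map T and the left endpoints before/after the exchange."""
    d = len(lengths)
    left = [0] + list(accumulate(lengths))[:-1]       # left endpoints before
    order = sorted(range(d), key=lambda i: perm[i])   # intervals in image order
    new_left = [0] * d
    pos = 0
    for i in order:
        new_left[i] = pos
        pos += lengths[i]

    def T(x):
        i = bisect_right(left, x) - 1                 # interval containing x
        return x - left[i] + new_left[i]              # piecewise translation

    return T, left, new_left


def birkhoff_sum(T, f, x, n):
    """Sum of f along the first n points of the T-orbit of x."""
    total = 0
    for _ in range(n):
        total += f(x)
        x = T(x)
    return total


if __name__ == "__main__":
    # Hypothetical sample data: three lengths proportional to 1, sqrt(2), sqrt(3).
    raw = [1.0, math.sqrt(2), math.sqrt(3)]
    lengths = [r / sum(raw) for r in raw]
    T, _, _ = make_iet(lengths, reversing_perm(3))
    indicator = lambda x: 1.0 if x < lengths[0] else 0.0
    for n in (10, 100, 1000, 10000):
        # Deviation of the Birkhoff sum of the indicator of the first
        # interval from its expected value n * lengths[0].
        print(n, birkhoff_sum(T, indicator, 0.1, n) - n * lengths[0])
```

Passing `Fraction` lengths instead of floats makes the map exact (at the price of periodic orbits), which is convenient for checking that the images of the exchanged intervals tile the base interval.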
Mild mixing in IETs and von Neumann flows. Mild mixing, which can be defined as the absence of rigid factors, is an intermediate property between weak mixing and mixing. Thus, it is a natural property to explore for systems which are known to be weak mixing but not mixing.

Mild mixing IETs. As shown by Veech (1984), a typical IET is rigid, so it cannot be mildly mixing. The existence of mild mixing IETs was first shown among 3-IETs: Ferenczi et al. (2005), [Theorem 4.1] showed that linearly recurrent 3-IETs are mild mixing (since they have a property known as minimal self-joinings; see Kanigowski and Lemańczyk (2020)). Robertson (2019) has considered IETs with combinatorics π of type W (the condition under which Veech first proved weak mixing) and shown that when linearly recurrent (a measure zero but full Hausdorff dimension condition), they are mild mixing.

Von Neumann flows. Von Neumann flows in genus one (i.e., special flows over rotations or IETs under a piecewise absolutely continuous function with non-zero sum of jumps) were introduced by von Neumann (with rotations as the base) in 1936 to produce examples of weak mixing systems. He proved that such flows are weakly mixing for each irrational rotation if the roof function is piecewise C^1. The weak mixing property was then proved for the von Neumann class of functions but over ergodic interval exchange transformations by Katok (2001). Frączek and Lemańczyk (2006) showed that von Neumann flows over rotations (with a roof with non-zero sum of jumps) can be furthermore mild mixing (but for rotation numbers α of bounded type, which form a measure zero set). This result was extended in Frączek et al. (2007) to roof functions with zero sum of jumps. Both results exploit versions of the Ratner properties (see below).

Mildly mixing linear flows. As for IETs, the Veech result (Veech 1984) says that the linear flow on a typical translation surface is rigid, so it also cannot be mildly mixing. Frączek et al. (2007) gave a criterion for a piecewise constant roof function over an irrational rotation with bounded partial quotients to be mild mixing. Using this criterion, Frączek (2009) has shown that when the genus is at least two, the set of Abelian differentials for which the vertical flow is mildly mixing is dense in every stratum of moduli space. He used an approximation of any translation surface by surfaces having many vertical saddle connections. Vertical flows on such surfaces have special representations over rotations.

Mixing properties of smooth flows. In smooth area-preserving flows, due to the presence of Hamiltonian saddles, the closer a trajectory is to a saddle singularity, the more motion along the trajectory is slowed down. This generates a shearing phenomenon that can create mixing. The presence or absence of mixing depends both on the strength of the singularity (in particular on whether it is degenerate or nondegenerate) and on the presence of traps (or more generally saddle loops homologous to zero), i.e., the symmetry or asymmetry of the roof in case of logarithmic singularities. For generic flows given by Morse forms one has indeed the following dichotomy:

Theorem 5. Inside the open set U_min in which the typical flow is minimal, almost every locally Hamiltonian flow is weakly mixing, but it is not mixing. On the other hand, for a full measure set of flows in U_¬min, the restriction to each of the minimal components is mixing.

This statement summarizes a number of results (in particular Ulcigrai (2009, 2011) and Ravotti (2017)) which we now describe in detail.

Diophantine-type conditions. Results on mixing properties require the introduction of Diophantine-like conditions, which describe the full measure set of locally Hamiltonian flows for which the results hold. For g = 1 these can be expressed as properties of the entries of the continued fraction expansion of the rotation number. In higher genus, these are expressed in terms of the renormalization given by the Teichmüller flow or renormalization algorithms for IETs such as Rauzy-Veech induction. For a survey of these
Diophantine-like conditions, we refer to the ICM proceedings by Ulcigrai (2022).

Mixing of Kocergin flows and related results. The first examples of mixing smooth flows on higher genus surfaces were given already in the 1970s (in Kočergin (1975)) exploiting degenerate saddles (i.e., multi-saddles with k ≥ 6 prongs, as in Fig. 4c) since in this case the saddles have a much stronger slowing down effect. Kochergin showed that special flows over ergodic IETs under roofs with polynomial singularities (see (7)) are mixing for all ergodic IETs on the base (in particular for all irrational rotations α in the base in the g = 1 case). This implies that a typical locally Hamiltonian flow is mixing if it has degenerate saddles (as in Fig. 4c) which all give rise to polynomial singularities of the roof function (e.g., for local Hamiltonians given by ℑ(z^k) for k > 2).

As a special case, the result of Kočergin (1975) applies to flows on 𝕋² with a stopping point (which can be seen as a fake degenerate saddle) that admit a representation as special flows over rotations under a roof with a symmetric power singularity. For this special case, polynomial mixing estimates were proven in Fayad (2001) for special observables when the power singularity is sufficiently strong.

While smooth time-changes of linear flows on 𝕋² are never mixing by Katok (1980), a mechanism similar to the one exploited by Kočergin (1975) was used by Fayad (2002) to show that there exist smooth (actually analytic) time-changes of a flow on 𝕋³ which are mixing. Time-changes of flows on 𝕋³ can be seen as special flows over a two-dimensional rotation R_α : 𝕋² → 𝕋², given by (x_1, x_2) ↦ (x_1 + α_1, x_2 + α_2) ∈ 𝕋². For mixing smooth time-changes to exist, α should be a Liouville vector and the frequencies α_1, α_2 should have sequences of continued fraction denominators that are intercalated with respect to each other. Notice that these are measure zero examples.

Arnold conjecture and mixing in the asymmetric case. Since the presence of a degenerate saddle is not generic, Kočergin's work motivated the question of mixing in locally Hamiltonian flows with only nondegenerate saddles (i.e., simple saddles). Arnold in the 1990s noticed a geometric phenomenon which could produce mixing in the ergodic minimal components of locally Hamiltonian flows on the torus known as Arnold flows (Arnold 1991). His intuition was proved to be correct shortly after by Sinai and Khanin (1992), who showed mixing under an arithmetic condition of full measure on the rotation number α. The full measure assumption on the rotation number α is that the entries a_n of the continued fraction expansion of α do not grow too fast, namely there exists a power 1 < τ < 2 and C > 0 such that |a_n| ≤ C n^τ. The condition was later improved in Kochergin (2004).

The question of whether mixing is typical also for flows on higher genus is more delicate and requires a generalization of the Diophantine-like condition in higher genus, which was introduced in Ulcigrai (2007) and proved to be of full measure by Avila et al. (2006). Ulcigrai proved (Ulcigrai 2007) that for a.e. IET, the special flow under an asymmetric roof with one logarithmic singularity is mixing. Later, Ravotti (2017) showed that the same full measure condition can also be used to extend the result to roofs with several asymmetric logarithmic singularities, as in (4). This implies (see Ravotti (2017)) that in the open set U_¬min, the restriction of the typical locally Hamiltonian flow φ_ℝ on each of its minimal components is mixing.

In Ravotti (2017), quantitative results on the speed of mixing are also proved: for a typical φ_ℝ in U_¬min, restricted to a minimal component, the speed of decay of correlations (also sometimes called speed of mixing) is sub-polynomial and actually logarithmic, namely for every pair f, g of smooth observables there exist constants c > 0, α > 0 such that

\[ \left| \int (f \circ \varphi_t)\, g \, dm - \int f \, dm \int g \, dm \right| \le \frac{c}{(\log t)^{\alpha}}. \]

The role of shearing. A crucial ingredient in the proofs of these mixing results (and more generally of mixing in parabolic flows) is played by a
geometric shearing phenomenon: if we consider a small arc γ transversal to the trajectories of the flow φ_ℝ, so that when flowing it, φ_t(γ) passes nearby a saddle separatrix without hitting the saddle point (as shown in Fig. 13), the different deceleration rates of points cause φ_t(γ) to shear in the direction of the flow (see Fig. 13a). Shearing allows one to deduce mixing from ergodicity: by a Fubini-type argument, given a measurable set A ⊂ X, for every large t one can cover an arbitrarily large proportion A_t ⊂ A of A with a collection of short transversal segments γ, each of which, after time t, shadows a long trajectory of φ_ℝ and, therefore, by (unique) ergodicity is (close to) equidistributed. Furthermore, the speed of mixing depends on the speed of shearing.

The shearing accumulated can be later destroyed when the segment φ_t(γ) passes near the other side of a saddle (see Fig. 13b). The presence of a saddle loop, though, (as in Fig. 14a) typically creates an asymmetry (this was the key intuition of Arnold that had motivated his conjecture on mixing) by producing stronger shearing on one side and hence, in this case, the accumulation of shearing predominantly in one direction produces global shearing.

Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 13 Shearing mechanism

Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces, Fig. 14 Mixing mechanism

Ratner's properties. Striking consequences of shearing (such as measure and joining rigidity) were proved for another famous class of parabolic flows, namely horocycle flows on hyperbolic surfaces and their time-changes, by exploiting a quantitative shearing property introduced by Marina Ratner and nowadays known as Ratner property (or RP). The difficulty in proving Ratner-type properties for smooth flows on higher genus surfaces is given by the presence of singularities (which are unavoidable when g ≥ 2), which introduce discontinuities and destroy the slow form of divergence à la Ratner: as soon as two nearby trajectories are separated by hitting a saddle, indeed, one drastically loses control of the divergence. A first generalization of the Ratner property (the finite Ratner property) was introduced by Frączek and Lemańczyk (2006) and proved for a special class of von Neumann flows (special flows under a piecewise absolutely continuous roof with non-zero sum of jumps over an irrational rotation of bounded type). The finite Ratner property was also proved for flows when the sum of jumps is zero by Frączek et al. (2007). Notice that von Neumann flows are not (globally) smooth. For other examples of this type of Ratner property, see also Frączek and Lemańczyk (2010). A version of the Ratner property is also a key ingredient in the proof in Kanigowski (2015).

The Ratner property in its classical form as well as the weaker versions defined in Frączek and Lemańczyk (2006, 2010) is expected to fail for smooth area-preserving flows with non-degenerate fixed points. The failure of the classical Ratner property was formally proved in a special case in Fayad and Kanigowski (2016), [Theorem 1] (for a class of Kochergin flows), and this result gives reasons to believe that, for similar reasons, the classical Ratner property should indeed always fail in presence of singularities.

A new variant of the RP which has the same dynamical consequences, called Switchable Ratner Property (or SRP for short), was
introduced by Fayad and Kanigowski (2016) and proved to hold for almost every (minimal
component of) an Arnold flow in U_¬min when g = 1 (as well as for a measure zero set of
Kochergin flows). According to this variation, it is sufficient to see the Ratner divergence
of orbits for most pairs of initial conditions (x, y) either in the future (for t > 0) or in
the past (for t < 0), depending on the pair of initial points. Thus, if one pair of nearby
trajectories is separated by hitting a singularity, and hence their distance explodes in an
uncontrolled manner, one can still hope to switch the direction of time (hence the name
switchable Ratner property) and be able to prove the Ratner slow form of divergence when
flowing backward in time. Kanigowski et al. (2019) generalize the result by Fayad and
Kanigowski to higher genus and show that for almost every φ_ℝ ∈ U_¬min, the restriction of
φ_ℝ to any minimal component satisfies the Switchable Ratner Property for any genus g ≥ 1.
This also covers as a special case minimal components of typical flows in g = 1 with more
than one simple saddle. In Fayad and Kanigowski (2016) the case of more saddles is also
considered, but under a condition on the relative position of the saddles which has
measure zero.
Multiple mixing Applying joining techniques introduced by Host (1991) and developed by
Ryzhikov and Tuveno (2006), the Switchable Ratner property can be used in particular to
show that mixing can be upgraded to mixing of all orders. In particular, the results in
Fayad and Kanigowski (2016), combined with Sinai and Khanin (1992), imply that typical
Arnold flows are mixing of all orders (as well as a measure zero set of Kochergin flows in
genus one). Similarly, the result of Kanigowski et al. (2019) combined with Ravotti (2017)
shows that for a full measure set of locally Hamiltonian flows in U_¬min, each restriction
to a minimal component is mixing of all orders.
Absence of mixing in the symmetric case For minimal locally Hamiltonian flows in U_min,
which have only simple saddles, since there are no saddle loops homologous to zero and
hence no asymmetry which produces global shearing, one can expect that the effect of
shearing on two different sides of the same saddles compensates and cancels out.
Symmetric logarithms over rotations Consider a special flow over an irrational rotation
R_α under a roof with a symmetric logarithmic singularity, i.e.,
\[ r(x) = C\,|\log(x)| + C\,|\log(1 - x)|. \tag{13} \]
Already in the 1970s, Kočergin (1976) proved that these flows are not mixing for a full
measure set of rotation numbers α and, much more recently, extended this result to all
irrational rotation numbers (Kochergin 2007). The criterion for the absence of mixing
exploited in Kočergin (1976) can be seen as a generalization of the absence of mixing result
of Kočergin (1972) and shows that (at least for typical) locally Hamiltonian flows mixing
via shearing is essentially the only possible way of achieving mixing. Indeed it was
conjectured in Lemańczyk (2000) and proved by Schmidt (2002) that mixing in special flows
over a rotation is only possible if the distribution of the sequence of Birkhoff sums of
the roof function is not tight, i.e., for every K > 0, Leb({x : |r^(n)(x)| > K}) tends to 1
as n tends to infinity.
Symmetric logarithms over IETs While the key example studied by Kochergin (special flows
over rotations under a symmetric logarithmic singularity) constitutes a prototype result
for the absence of mixing, these special flows do not arise as representations of typical
locally Hamiltonian flows, since special representations of typical minimal flows with only
simple saddles always yield special flows under roofs with symmetric logarithmic
singularities over IETs with d ≥ 4. Ulcigrai (2011) proved the absence of mixing for such
flows for almost all interval exchange transformations in the base. Thus, a.e. flow in
U_min is not mixing. A special case of the absence of mixing result for surfaces with g = 2
and two isometric saddles was proved in Scheglov (2009); see also the recent work by Chaika
et al. (2021) for a more geometric proof. The proof in Scheglov (2009) exploits a
combinatorial analysis
of the substitutions induced by Rauzy-Veech induction to show that orbits satisfy a certain
symmetry. In Chaika et al. (2021), this symmetry is deduced geometrically from the
existence of a hyperelliptic involution.
We remark that in U_min there exist nevertheless exceptional mixing flows (contrary to the
case of the torus in view of Kochergin (2007)), as shown in Chaika and Wright (2019), where
the authors could produce sporadic mixing examples in g = 5. Also for this example,
shearing is still at the base of mixing, but it is not produced by asymmetry of the
singularities, but rather by an asymmetric equidistribution, so that trajectories, at
different time scales, spend much more time on one side of a saddle than another. The IETs
considered are finite covers of rotations (for which Diophantine-type conditions can be
given in terms of continued fraction entries) which are only barely uniquely ergodic and
for which orbits equidistribute very slowly and in a very asymmetric way.
More recently, Kanigowski and Kułaga-Przymus (2016) showed (exploiting the former work of
Kułaga (2012)) that special flows with symmetric logarithmic singularities over IETs of
bounded type are mild mixing (the assumption on the IET used in Kanigowski and
Kułaga-Przymus (2016) is not explicitly that the IET is of bounded type, but a condition on
orbits of discontinuities, which can a posteriori be shown to be equivalent to bounded type
or linearly recurrent). The main component of the proof is to show that the flow satisfies
the Switchable Ratner Property.
Weak mixing via logarithmic singularities Frączek and Lemańczyk (2003) showed that flows
over all irrational rotations under any roof functions with one symmetric logarithm are
weakly mixing. Generalizing this result, Ulcigrai (2009) also proved weak mixing for
special flows with logarithmic singularities (not necessarily symmetric, although in the
asymmetric case mixing is already known; see Ulcigrai (2007) and Ravotti (2017)) for almost
every IET on the base. The proof exploits the partial rigidity of IETs proved by Katok
(1980) as well as the presence of logarithmic singularities. Thus, a.e. flow in U_min is
weakly mixing but not mixing.

Spectral Properties

We move now to the properties of the spectrum of Koopman unitary operators and their
spectral measures. We recall that the Koopman unitary operator associated to a dynamical
system is given by f ↦ f ∘ T for f ∈ L²(I, dx) when T is an IET or f ↦ f ∘ φ_t for
f ∈ L²(S, μ) when φ_ℝ is a surface flow preserving μ. We refer the reader to the entry by
Kanigowski and Lemańczyk (2020) or Katok and Thouvenot (2006) for a survey of spectral
theory of dynamical systems.
Spectral measures of IETs and linear flows The first general spectral result about IETs is
Oseledets' theorem in Oseledec (1966) stating that the maximal spectral multiplicity of any
d-interval exchange transformation T is bounded by d. In 1984, Veech proved in Veech
(1984), [Theorem 1.4] that a.e. IET is rigid, which implies by standard arguments (see,
e.g., Katok and Thouvenot (2006)) that T has singular spectrum. It follows furthermore from
the work of Avila and Forni (2007) on weak mixing that a.e. IET, as well as a.e. linear
flow, has continuous spectrum. Exotic examples of IETs with eigenvalues with various
properties were constructed by Ferenczi and Zamboni (2011); see also Hmili (2010).
Let χ = χ_A be the characteristic function of a continuity interval A = I_i of T (or, more
generally, of a floor of a cutting and stacking presentation of T, as those given by
Rauzy-Veech induction). A condition for the spectral measure σ_χ to be continuous at a
point ω ∈ S¹ was given by Bufetov et al. (2006) and used to produce explicit examples of
(periodic-type) IETs with continuous spectrum in Sinai and Ulcigrai (2005).
Sinai raised the question of finding a modulus of continuity for the spectral measures of
translation flows. In Bufetov and Solomyak (2018, 2020), Bufetov and Solomyak developed an
approach to this problem and succeeded in obtaining Hölder estimates for spectral measures
in the case of
surfaces of genus 2. The final result for surfaces of higher genus, independently proved
in Bufetov and Solomyak (2021) and Forni (2022), says that for every stratum in the moduli
space there are γ > 0 and β > 0 such that for a.e. translation surface in the stratum there
exists C > 0 such that for every f : S → ℝ smooth enough
\[ \sigma_f([\lambda - r, \lambda + r]) \le C \,\|f\| \left( \frac{1 + |\lambda|}{|\lambda|} \right)^{\beta} r^{\gamma} \tag{14} \]
for every λ ≠ 0 and r > 0. A similar result for a.e. IETs is harder to prove; Avila, Forni,
and Safaee estimated moduli of continuity of spectral measures for a.e. IETs in Avila et
al. (2021b).
Quantitative weak mixing The main tool leading to the proof of (14) is quantitative
estimates of so-called twisted ergodic integrals of the form
\[ \left| \frac{1}{T} \int_0^T e^{2\pi i \lambda t} f(\varphi_t x)\, dt \right| \le C \,\|f\| \left( \frac{1 + |\lambda|}{|\lambda|} \right)^{\beta} T^{-\gamma} \]
proved (in different forms) in Forni (2022), Bufetov and Solomyak (2021), and Avila et al.
(2021b). On the other hand, quantitative estimates of twisted integrals lead to an
effective version of weak mixing
\[ \frac{1}{T} \int_0^T \left| \langle f \circ \varphi_t, g \rangle \right|^2 dt \le C \,\|f\| \,\|g\| \, T^{-\gamma} \]
for f, g smooth enough zero mean functions; see Forni (2022) (and Avila et al. (2021b) for
IETs).
Spectra of locally Hamiltonian flows While the classification of mixing properties of
locally Hamiltonian flows is essentially complete (see Theorem 5), there are few results on
their spectral properties for g ≥ 2.
surfaces of any genus g ≥ 2 with singular continuous spectrum (see Frączek and Lemańczyk
(2003), [Theorem 1]) using Blohin (1972) construction. Since these are essentially built
gluing genus one flows and the resulting flow has a lot of saddle connections, they are
highly non-typical.
Lebesgue spectrum via a stopping point Perhaps the most surprising result for minimal
flows on the torus with one stopping point or fake singularity (see Fig. 15) was proved
recently by Forni, Fayad, and Kanigowski. In Fayad et al. (2021), they proved that if the
degenerate singularity is sufficiently strong, the spectrum is countable Lebesgue. As shown
in Kočergin (1975), Kochergin flows admit a special flow representation where the roof has
power-type singularities. In genus one, for a flow with one degenerate singularity, one has
a special flow over a rotation, under the roof r(x) = c0/x^γ + c1/(1 − x)^γ for some
0 < γ < 1. The assumption in Fayad et al. (2021) is that γ is sufficiently close to 1. The
absolute continuity of the spectrum, taking as a starting point an idea from Forni and
Ulcigrai (2012), is proved from an estimate on the (polynomial) speed of decay for natural
observables, by showing that the decay is square-summable. Forni and Ulcigrai exploited
square-summable estimates on the decay of correlations to prove in Forni and Ulcigrai
(2012) that smooth time-changes of horocycle flows have Lebesgue spectrum, as conjectured
by Katok and Thouvenot (2006). An extra difficulty in this setting is given by the symmetry
of the power singularity, which creates cancellations and parts of space where there is no
shearing. The authors
also introduce a criterion, suited for parabolic settings, to show that the spectrum has
countable multiplicity. The criterion was also used in Fayad et al. (2021) to complete the
proof of the Katok-Thouvenot conjecture on smooth time-changes of horocycle flows, by
showing countable multiplicity of the Lebesgue spectrum.
Singular spectrum in genus 2 In the opposite direction, Chaika et al. proved in Chaika et
al. (2021) that a typical minimal locally Hamiltonian flow on a genus two surface with two
isomorphic simple saddles (Fig. 16) has purely singular spectrum. The result, inspired by
the techniques used to prove the singularity result (for special flows over rotations) in
Frączek and Lemańczyk (2003, 2004), deduces singularity of the spectrum from absence of
mixing and rigidity, exploiting the geometric symmetries given by the hyperelliptic
involution. It also provides an independent proof of absence of mixing for typical flows in
the same class (g = 2, two isomorphic saddles, see Fig. 16) proved in Scheglov (2009). As
in Scheglov (2009), the assumption that the saddles are isometric is crucial to guarantee
that the underlying surface has an inner symmetry, which plays a crucial role in the proof.
More precisely, the linear flow of which the locally Hamiltonian flow is a time-change is a
flow on a translation surface S which admits a hyperelliptic involution, i.e., an affine
automorphism Φ : S → S which is an involution, i.e., Φ² = Id.

Disjointness Results

The notion of disjointness was introduced in the 1960s by Furstenberg (see in particular
Furstenberg (1967)). Recall that disjoint flows are not isomorphic and, moreover, their
factors are also not isomorphic. We refer to the survey de la Rue (2020).
Disjointness from mixing systems For all systems for which absence of mixing is known, one
can try to strengthen the result by proving disjointness from all mixing flows. The proof
that no IET is mixing in Katok (1980) already implies (even if this is not explicitly
stated) that IETs are disjoint from all mixing systems; see also Ryzhikov (1994). In fact,
partial rigidity (see (12)) implies disjointness from mixing systems. The proof by Katok
also covers the case of special flows over IETs under piecewise constant roofs, which are
also partially rigid. This strategy does not work for all roof functions of bounded
variation. Frączek and Lemańczyk (2009a), [Appendix A] showed that von Neumann flows over
IETs are never partially rigid.
Frączek and Lemańczyk strengthened Kočergin's results in Kočergin (1972, 1976) showing
absence of mixing for flows over rotations (for smooth roofs and symmetric logarithmic
singularities respectively) by proving in Frączek and Lemańczyk (2004) that for every
irrational α, the special flows over R_α under a smooth roof which either has bounded
variation or whose Fourier coefficients decay as O(1/n) (so in particular for a roof with
one symmetric logarithmic singularity) are disjoint from all mixing flows. Spectral
disjointness was also proved in Frączek and Lemańczyk (2003) for a.e. rotation and the roof
function with one symmetric logarithmic singularity. All these results are based on
tightness of the distribution of Birkhoff sums of the roof function, and the study of weak
limits of joinings (via Markov operators) determined by the graphs of
φ_t in S × S. This approach was strengthened in Frączek and Lemańczyk (2005), where
disjointness from all mixing systems was proved also for all special flows over ergodic
IETs and under bounded variation roofs (in particular for all von Neumann flows over IETs)
and for special flows over a.e. rotation and under any roof function with symmetric
logarithmic singularities (in particular for Blokhin flows in any genus). Notice that the
tightness proved by Ulcigrai (2011) is sufficient to apply the techniques developed in
Frączek and Lemańczyk (2005). Combining these two results one has that a.e. locally
Hamiltonian flow in U_min is disjoint from all mixing flows.
Joining properties of IETs As we have already mentioned, every IET is partially rigid, so
is disjoint from all mixing systems. Moreover, a.e. IET is rigid; see Veech (1984). Chaika
(2012) proved a much stronger result about rigidity of a.e. IET, saying that if (q_n)_{n∈ℕ}
is any increasing sequence of natural numbers, then a.e. IET is rigid along a subsequence
of (q_n)_{n∈ℕ}. This implies the amazing result that any ergodic system is disjoint from
a.e. IET; see Chaika (2012), [Theorem 1].
Veech asked, already in the 1980s (see Veech (1982a)), whether a.e. IET is simple, i.e.,
the only (ergodic) self-joinings are graphs and the product measure. This question was
motivated by the article by del Junco (1983), where he found a one-parameter family of
simple 3-IETs. Further examples of simple d-IETs with d ≥ 3 were produced in Ferenczi and
Zamboni (2011). A surprisingly negative response to Veech's question was given by Chaika
and Eskin (2021) where they studied the set of joinings of a.a. 3-IETs and proved that it
is rather large. The set of self-joinings is a Poulsen simplex, i.e., the set of ergodic
joinings is dense.
Non-reversibility A measure-preserving flow φ_ℝ = (φ_t)_{t∈ℝ} is reversible if it is
isomorphic to its inverse flow φ_ℝ^{−1} = (φ_{−t})_{t∈ℝ}. It is also sometimes assumed that
the map giving the isomorphism is an involution (its second iteration is the identity).
Reversibility is often observed for dynamical systems of physical origin. In contrast, from
the point of view of ergodic theory, this is a rare property. As shown by del Junco (1981)
(for automorphisms) and by Danilenko and Ryzhikov (2012) (for flows), typically we have
even disjointness with the inverse action. Although non-reversibility is a typical
property, it is not easy to prove in concrete examples, especially for flows on surfaces.
In general, proving the non-isomorphism of zero entropy and spectrally equivalent systems
(a flow is always spectrally isomorphic with its inverse) is a difficult task. One of the
standard techniques here is disjointness, for which proving techniques are better
developed.
The first result for surface flows was proved in Frączek and Lemańczyk (2009a), where,
using a shearing mechanism and Ratner's techniques, the authors showed that for every
rotation R_α of bounded type, every von Neumann flow is disjoint from its inverse. This
result was extended to a.a. rotations in Frączek et al. (2014) and to a.e. IET in Berk and
Frączek (2015), but in both papers only the non-isomorphism of the von Neumann flow and its
inverse is proved. The methods developed in Frączek et al. (2014) and Berk and Frączek
(2015) (which do not rely on Ratner's techniques) are inspired by Ryzhikov's results and
involve the study of weak limits of graph 3-joinings. They were further creatively
developed in Berk et al. (2020), where they made it possible to prove disjointness of von
Neumann flows with their inverse for a.a. IETs.
The problem of reversibility in a class of linear flows on translation surfaces is more
complicated. Here the answer depends on the connected component C in the moduli space. If
the connected component C is hyperelliptic, then every linear flow on any translation
surface in C is isomorphic to its inverse via the hyperelliptic involution. On the other
hand, for a typical (in the topological sense) translation surface in any non-hyperelliptic
component C, the linear flow is disjoint from its inverse; see Berk et al. (2020).
Disjointness of rescalings A property which seems to be common among parabolic flows
(apart from some well-known exceptions in the homogeneous world) is disjointness of
rescalings, defined as follows. Given a real number k > 0, by
the k-rescaling of φ_ℝ = (φ_t)_{t∈ℝ}, we simply mean the flow φ^k_ℝ ≔ (φ_{kt})_{t∈ℝ} (in
which the time is rescaled by the factor k). Thus, a rescaling is a special type of
time-reparametrization of a flow, given by a linear time-change. We say that φ_ℝ has
disjoint rescalings if for all (or all but finitely many) p, q > 0 with p ≠ q, the
rescalings φ^p_ℝ and φ^q_ℝ are disjoint (in the sense of Furstenberg). Disjointness of
rescalings has played a key role in proving some of the first instances of Sarnak's
conjecture on the Möbius function (see the survey Kułaga-Przymus and Lemańczyk (2020)).
Rescalings of Arnold flows Kanigowski, Lemańczyk, and Ulcigrai recently proved in
Kanigowski et al. (2020) that disjointness of rescalings is typical among Arnold flows (see
Fig. 3b):
Theorem 6 Let φ_ℝ be the (Arnold) special flow over a.e. rotation R_α under a roof with
asymmetric logarithmic singularities with constants C0 ≠ C1 given by
r(x) = C0 |log x| + C1 |log(1 − x)| + h(x) where h is smooth. Then there exist only two
values of the form q, 1/q such that φ_ℝ and φ^p_ℝ are disjoint for any positive
p ∉ {1, q, 1/q}, where q = C0/C1.
The proof exploits a new criterion for disjointness based on the switchable Ratner
property. The criterion was devised and formulated so that it can be applied to prove
disjointness of two flows which both have the switchable Ratner property when in both one
can observe a controlled form of divergence of nearby trajectories (e.g., polynomial
divergence), but the speed of divergence for the two flows is different (e.g., for one flow
it is linear, and in the other, quadratic).
Disjointness from other parabolic flows The criterion is used in Kanigowski et al. (2020)
also to show that a typical Arnold flow is disjoint from any smooth time-change of the
horocycle flow (and in particular from the classical horocycle flow itself), thus showing
that these two classes of parabolic flows are truly distinct.
Rescalings of von Neumann flows Using simple joining arguments Frączek and Lemańczyk
(2009a) showed that every von Neumann flow over any ergodic IET is not isomorphic to any of
its rescalings. Disjointness of rescalings for von Neumann flows was studied by Berk and
Kanigowski (2021). They considered von Neumann flows over IETs and showed disjointness
(even spectral disjointness) of rational rescalings, i.e., rescalings φ^k_ℝ where k ∈ ℚ,
for a.e. IET. A similar proof also allows dealing with special flows over an IET T under a
piecewise constant function with a singularity in the interior of a continuity interval of
the IET. Spectral disjointness of rational rescalings is also proved in Berk and Kanigowski
(2021) for a.e. IET.
Von Neumann flows over rotations The new criterion for disjointness introduced by
Kanigowski et al. (2020) has also been used by Dong and Kanigowski (2020) to study von
Neumann flows in genus one. They proved that for any irrational rotation R_α, any real
rescaling of a special flow φ_ℝ over R_α under a piecewise absolutely continuous function
with only one discontinuity is disjoint from φ_ℝ.
Symmetric log over rotations Berk and Kanigowski (2021) also studied the case of a roof
with one symmetric logarithmic singularity over rotations (Kochergin's prototype example of
the absence of mixing) and also proved that in this case one has (spectral) disjointness of
rational rescalings for a.e. rotation number. The techniques in this symmetric setting
(where no form of Ratner property is known) are very different and based on a refinement of
the techniques used to prove the absence of mixing and disjointness from mixing flows in
Frączek and Lemańczyk (2004).

Open Directions

Despite the surge of activity and many results on ergodic, mixing, and spectral properties
which have been proved in the last decades on area-preserving flows on surfaces, both on
IETs and linear flows exploiting Teichmüller dynamics and on smooth or locally Hamiltonian
flows, many
questions remain open. We indicate here only some questions and directions which arise
naturally from the results surveyed in this entry.
Linear flows and IETs Even though the ergodic theory of typical IETs and translation flows
is much studied and well understood, some questions remain open. Weak mixing is known for
a.e. IET and a.e. linear flow but is not explicit and while explicit examples of weak
mixing (and its lack) were constructed, a classification of all translation surfaces on
which linear flows are weak mixing for a.e. direction is still out of reach. Another open
question is the exact computation of the Hausdorff dimension of the set of non-weakly
mixing IETs (see Question 1.6 in Chaika and Masur (2020)).
Another Hausdorff dimension computation question concerns non-uniquely ergodic IETs,
namely what is the Hausdorff dimension of the set of d-IETs with precisely k ergodic
invariant probability measures, for a given 1 < k ≤ g ≤ d/2 (listed as Question 1.8 in
Chaika and Masur (2020)). While a.e. IET has singular spectrum, whether there can exist
exceptional IETs with an absolutely continuous component in the spectrum is not known.
The simplicity conjecture in Veech (1982a) on simplicity of d-IETs for d ≥ 4 is still wide
open, together with an understanding of joinings and self-joinings. It also remains open
whether almost every 3-IET is prime; it was a question suggested by Veech. A
measure-preserving system is called prime if it has no non-trivial factors.
While upper bounds on the Hausdorff dimension of non-uniquely ergodic IETs are now known, a
finer understanding of the set of non-uniquely ergodic directions NU(S) on a given surface
S, in the spirit of the full dichotomy for the slit-tori example, is probably out of reach.
Even whether there exists a surface for which the Hausdorff dimension of NU(S) is strictly
between 0 and 1/2 is not known (see also Question 1.9 in Chaika and Masur (2020)).

Locally Hamiltonian Flows

Exceptional mixing results Nevertheless, many questions about exceptional behavior can be
further investigated. While the typical flow in U_min is weakly mixing but not mixing (see
Theorem 5), we do not know if weak mixing holds everywhere when g ≥ 1, and no examples
other than the rather special examples of mixing flows in g = 5 in Chaika and Wright (2019)
were constructed. Within U_¬min, mixing holds a.e. on every minimal component (see Theorem
5), but one can wonder if there exist exceptional examples without mixing for g ≥ 2. They
are known not to exist in g = 1 in view of Kochergin's work (Kochergin 2004), where mixing
is proved for every irrational rotation and the roof function with logarithmic
singularities of strongly asymmetric type.
As we discussed, a.e. flow in U_¬min has minimal components which are not only mixing, but
also mixing of all orders. Proving that any mixing minimal component of a flow in U_¬min,
as well as any mixing Kochergin flow, is also mixing of all orders is an open problem (see
Question 38 in Fayad and Krikorian (2018) ICM proceedings). Showing it would confirm the
famous (still open) Rokhlin conjecture (i.e., that mixing implies mixing of all orders; see
Quas (2017)) in the setting of area-preserving surface flows.
Finally, all examples where mild mixing is known (for special flows over rotations or IETs
under symmetric logarithmic singularities, but also for von Neumann flows and for IETs
themselves) are over rotations or IETs which are linearly recurrent (or of bounded type).
Perhaps mild mixing is not possible otherwise.
Nature of the spectrum The spectral properties of locally Hamiltonian flows are a natural
question, which has been lingering for decades (see, e.g., Katok and Thouvenot (2006),
[Section 5]).
The recent result by Fayad et al. (2021) for flows in g = 1 with stopping points suggests
that it may be possible to prove that the spectrum is countable Lebesgue also in higher
genus in the presence of degenerate, sufficiently strong (multi-saddle) singularities. It
is not clear what to expect when the degenerate singularity is not sufficiently strong. In
Fayad et al. (2021), the power γ of the singularity of the roof of the special flow
representation (see (7)) is close to 1. One might hope
that absolute continuity could hold for all powers γ ≥ 1/2, but this is out of reach with
the current techniques. If γ < 1/2, the approach which uses square-integrability of the
decay of correlations estimates fails completely.
In the nondegenerate case, the singularity result proved in Chaika et al. (2021) for
typical flows in g = 2 with simple (isomorphic) saddles indicates that the spectrum could
be purely singular also for a.e. flow in U_min. In the open set U_¬min, which consists of
flows with non-degenerate singularities that are not minimal, but have several minimal
components, the nature of the spectrum (for the restriction of a typical flow to a minimal
component) is unclear. These flows are indeed mixing, but with sub-polynomial rate (see
Ravotti (2017), which provides logarithmic upper bounds), and it is not clear whether to
expect singularity or absolute continuity of the spectrum. The nature of the spectrum is an
open problem even in g = 1, for Arnold flows.
Disjointness of rescalings We believe disjointness of rescalings should also hold for
typical locally Hamiltonian flows in higher genus, but this is currently an open problem.
Preliminary work seems to indicate that, despite some additional technical difficulties,
the techniques used to prove Theorem 6 should allow proving disjointness of rescalings for
all minimal components of typical flows in the open set U_¬min.
The proof by Berk and Kanigowski (2021) of disjointness of (rational) rescalings for a
symmetric logarithmic singularity over a full measure set of rotations (Kochergin's
prototype example of absence of mixing) gives a good indication that disjointness of
rescalings could also hold for typical flows (under a suitable full measure
Diophantine-type condition) in the complementary set U_min.
linear flows, as well as infinite extensions of locally Hamiltonian flows, has been an
active recent area of research. The latter extensions, as well as the study of translation
surfaces which are Abelian covers of compact translation surfaces, is related to the study
of their Poincaré maps, which are skew-product extensions of IETs, which provide important
infinite ergodic theory examples and generalize the much studied theory of piecewise smooth
cocycles over rotations. A skew-product extension of T : I → I over a group G given by the
cocycle φ : I → G is the map (x, g) ↦ (Tx, g + φ(x)). Such skew products where G = ℤ^d and
φ is piecewise constant appear as Poincaré maps of Abelian covers of translation surfaces;
ℝ-extensions of locally Hamiltonian flows provide cocycles φ which are piecewise continuous
or have (logarithmic or power-like) singularities.
Many results on the ergodicity of such infinite measure-preserving flows and maps were
proved for special families of examples; see, e.g., Hubert and Weiss (2013), Hooper (2015),
Málaga Sabogal and Troubetzkoy (2016), Ralston and Troubetzkoy (2017), or Chaika and
Robertson (2019) for linear flows and IETs with infinite invariant measures and Fayad and
Lemańczyk (2006), Conze and Frączek (2011), or Frączek and Ulcigrai (2012, 2021) for
extensions of locally Hamiltonian flows. Some typical non-ergodicity results were proved as
well; see Frączek and Ulcigrai (2014) and Frączek and Hubert (2018). The problem of
ergodicity is nevertheless still widely open, both in the context of linear flows on
Abelian covers of translation surfaces and for extensions of locally Hamiltonian flows.
Other possible directions of investigation are the study of Radon invariant measures and
the existence of limit theorems.
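The skew-product construction (x, g) ↦ (Tx, g + φ(x)) recalled above can be made concrete with a small numerical sketch. The following is only an illustration under simplifying assumptions that are not taken from the entry: the base map is a rotation (the simplest IET), the group is G = ℤ, and the cocycle is the hypothetical step function equal to +1 on [0, 1/2) and −1 on [1/2, 1). The function names `rotation`, `step_cocycle`, `skew_product`, and `orbit` are chosen for this example.

```python
import math


def rotation(alpha):
    """Base map T(x) = x + alpha mod 1 (a rotation, i.e. a 2-interval IET)."""
    def T(x):
        return (x + alpha) % 1.0
    return T


def step_cocycle(x):
    """Illustrative piecewise constant Z-valued cocycle (an assumption of
    this sketch): +1 on [0, 1/2) and -1 on [1/2, 1)."""
    return 1 if x < 0.5 else -1


def skew_product(T, phi):
    """Return the skew-product map (x, g) -> (T x, g + phi(x))."""
    def F(point):
        x, g = point
        return (T(x), g + phi(x))
    return F


def orbit(F, point, n):
    """Return the list [point, F(point), ..., F^n(point)]."""
    points = [point]
    for _ in range(n):
        point = F(point)
        points.append(point)
    return points


if __name__ == "__main__":
    # Irrational rotation number (golden mean), up to floating point.
    alpha = (math.sqrt(5) - 1) / 2
    F = skew_product(rotation(alpha), step_cocycle)
    for x, g in orbit(F, (0.1, 0), 10):
        print(f"x = {x:.4f}, g = {g}")
```

The fibre coordinate after n steps is the Birkhoff sum of the cocycle along the base orbit, so following g along a long orbit gives a rough picture of how far the lifted trajectory wanders in the ℤ-direction. Floating-point rotations are only an approximation of the true dynamics and say nothing rigorous about ergodicity.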
Dong C, Kanigowski A (2020) Rigidity of a class of Frączek K, Lemańczyk M (2005) On disjointness proper-
smooth singular flows on 2 J Mod Dyn 16:37–57 ties of some smooth flows. Fundam Math 185(2):
Fayad B (2001) Polynomial decay of correlations for a 117–142
class of smooth flows on the two torus. Bull Soc Math Frączek K, Lemańczyk M (2006) On mild mixing of spe-
France 129(4):487–503 cial flows over irrational rotations under piecewise
Fayad B, Forni G, Kanigowski A (2021) Lebesgue spec- smooth functions. Ergod Theory Dyn Syst 26(3):
trum of countable multiplicity for conservative flows 719–738
on the torus. J Am Math Soc 34(3):747–813 Frączek K, Lemańczyk M (2010) Ratner’s property and
Fayad B, Kanigowski A (2016) Multiple mixing for a class mild mixing for special flows over two-dimensional
of conservative surface flows. Invent Math 203(2): rotations. J Mod Dyn 4:609–635
555–614 Frączek K (2009) Density of mild mixing property for
Fayad B, Krikorian R (2018) Some questions around qua- vertical flows of abelian differentials. Proc Am Math
siperiodic dynamics. In: Proceedings of the Interna- Soc 137(12):4229–4142
tional Congress of Mathematicians—Rio de Janeiro Frączek K, Hubert P (2018) Recurrence and non-ergodicity
2018, Invited lectures, vol III. World Sci. Publ, Hack- in generalized wind-tree models. Math Nachr
ensack, NJ, pp 1909–1932 291(11–12):1686–1711
Fayad B, Lemańczyk M (2006) On the ergodicity of cylin- Frączek K, Kim M (2021) New phenomena in deviation of
drical transformations given by the logarithm. Mosc Birkhoff integrals for locally Hamiltonian flows.
Math J 6(4):657–672, 771–772 arXiv:2112.13030
Fayad BR (2002) Analytic mixing reparametrizations of Frączek K, Kułaga-Przymus J, Lemańczyk M (2014) Non-
irrational flows. Ergod Theory Dyn Syst 22(2):437–468 reversibility and self-joinings of higher orders for ergo-
Ferenczi S, Holton C, Zamboni LQ (2005) Joinings of dic flows. J Anal Math 122:163–227
three-interval exchange transformations. Ergod Theory Frączek K, Lemańczyk M (2009a) On the self-similarity
Dyn Syst 25(2):483–502 problem for ergodic flows. Proc Lond Math Soc 99(3):
Ferenczi S, Zamboni LQ (2011) Eigenvalues and simplic- 658–696
ity of interval exchange transformations. Ann Sci Éc Frączek K, Lemańczyk M (2009b) Smooth singular flows
Norm Supér (4) 44(3):361–392 in dimension 2 with the minimal self-joining property.
Forni G (1997) Solutions of the cohomological equation Monatsh Math 156(1):11–45
for area-preserving flows on compact surfaces of higher Frączek K, Lemańczyk M, Lesigne E (2007) Mild mixing
genus. Ann Math 146(2):295–344 property for special flows under piecewise constant
Forni G (2002) Deviation of ergodic averages for area- functions. Discrete Contin Dyn Syst 19(4):691–710
preserving flows on surfaces of higher genus. Ann Frączek K, Ulcigrai C (2012) Ergodic properties of infinite
Math 155(1):1–103 extensions of area-preserving flows. Math Ann 354(4):
Forni G (2021) Sobolev regularity of solutions of the 1289–1367
cohomological equation. Ergod Theory Dyn Syst Frączek K, Ulcigrai C (2014) Non-ergodic ℤ-periodic bil-
41(3):685–789 liards and infinite translation surfaces. Invent Math
Forni G (2022) Twisted translation flows and effective 197(2):241–298
weak mixing. J Eur Math Soc (JEMS) 24(2022), no. Frączek K, Ulcigrai C (2021) On the asymptotic growth of
12, 4225–4276 Birkhoff integrals for locally Hamiltonian flows and
Forni G, Marmi S, Matheus C (2017) Cohomological ergodicity of their extensions. arXiv:2112.05939
equation and local conjugacy class of diophantine inter- Furstenberg H (1967) Disjointness in ergodic theory, min-
val exchange maps to appear in Proc. Amer. Math. imal sets, and a problem in Diophantine approximation.
Soc., https://doi.org/10.1090/proc/14538 Math Syst Theory 1:1–49
Forni G, Matheus C (2014) Introduction to Teichmüller Ghazouani S (2021) Local rigidity for periodic generalised
theory and its applications to dynamics of interval interval exchange transformations. Invent Math 226(2):
exchange transformations, flows on surfaces and bil- 467–520
liards. J Mod Dyn 8(3–4):271–436 Ghazouani S, Ulcigrai C (2021) A priori bounds for GIETs,
Forni G, Ulcigrai C (2012) Time-changes of horocycle affine shadows and rigidity of foliations in genus 2. Pre-
flows. J Mod Dyn 6(2):251–273 print arXiv:2106.03529
Fox RH, Kershner RB (1936) Concerning the transitive Halmos PR (1960) Lectures on ergodic theory. Chelsea
properties of geodesics on a rational polyhedron. Duke Publishing Co., New York
Math J 2(1):147–150 Hmili H (2010) Non topologically weakly mixing interval
Frączek K, Lemańczyk M (2003) On symmetric logarithm exchanges. Discrete Contin Dyn Syst 27(3):1079–1091
and some old examples in smooth ergodic theory. Hooper WP (2015) The invariant measures of some infinite
Fundam Math 180(3):241–255 interval exchange maps. Geom Topol 19(4):1895–2038
Frączek K, Lemańczyk M (2004) A class of special flows Host B (1991) Mixing of all orders and pairwise indepen-
over irrational rotations which is disjoint from mixing dent joinings of systems with singular spectrum. Israel
flows. Ergod Theory Dyn Syst 24:1083–1095 J Math 76(3):289–298
366 Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces
Hubert P, Weiss B (2013) Ergodicity for infinite periodic Kochergin AV (2007) Nondegenerate saddles, and absence
translation surfaces. Compos Math 149(8):1364–1380 of mixing. II. Mat Zametki 81(1):145–148
Kanigowski A (2015) Ratner’s property for special flows Kontsevich M (1997) Lyapunov exponents and Hodge
over irrational rotations under functions of bounded theory. In: The mathematical beauty of physics
variation. Ergod Theory Dyn Syst 35(3):915–934 (Saclay, 1996), Advanced Series in Mathematical
Kanigowski A, Kułaga-Przymus J (2016) Ratner’s prop- Physics, vol 24. World Scientific Publishing, River
erty and mild mixing for smooth flows on surfaces. Edge, NJ, pp 318–332
Ergod Theory Dyn Syst 36(8):2512–2537 Kontsevich M, Zorich A (1997) Lyapunov exponents and
Kanigowski A, Kułaga-Przymus J, Ulcigrai C (2019) Mul- Hodge theory. arXiv:hep-th/9701164
tiple mixing and parabolic divergence in smooth area- Kočergin AV (1976) Nondegenerate saddles, and the
preserving flows on higher genus surfaces. J Eur Math absence of mixing. Mat Zametki 19(3):453–468
Soc (JEMS) 21(12):3797–3855 Kułaga J (2012) On the self-similarity problem for smooth
Kanigowski A, Lemańczyk M (2020) Spectral theory of flows on orientable surfaces. Ergod Theory Dyn Syst
dynamical systems. Springer Berlin Heidelberg, Berlin, 32(5):1615–1660
Heidelberg, pp 1–40 Kułaga-Przymus J, Lemańczyk M (2020) Sarnak’s conjec-
Kanigowski A, Lemańczyk M, Ulcigrai C (2020) On ture from the ergodic theory point of view. Springer
disjointness properties of some parabolic flows. Invent Berlin Heidelberg, Berlin, Heidelberg, pp 1–19
Math 221(1):1–111 Lanneau E, Marmi S, Skripchenko A (2021) Cohomolog-
Katok A (2001) Cocycles, cohomology and combinatorial ical equations for linear involutions. Dyn Syst 36(2):
constructions in ergodic theory. In: Smooth ergodic 292–304
theory and its applications (Seattle, WA, 1999), Pro- Lemańczyk M (2000) Sur l’absence de mélange pour des
ceedings of Symposia in Pure Mathematics, vol 69. flots spéciaux au-dessus d’une rotation irrationnelle.
American Mathematical Society, Providence, RI, Colloq Math 84–85:29–41
pp 107–173. In collaboration with E. A. Robinson, Jr Levitt G (1983) Feuillettages des surfaces. Thèse
Katok A, Thouvenot J-P (2006) Spectral properties and Málaga Sabogal A, Troubetzkoy S (2016) Ergodicity of the
combinatorial constructions in ergodic theory. In: Ehrenfest wind-tree model. C R Math Acad Sci Paris
Handbook of dynamical systems, vol 1B. Elsevier 354(10):1032–1036
B. V., Amsterdam, pp 649–743 Marmi S, Moussa P, Yoccoz J-C (2005) The cohomologi-
Katok AB (1973) Invariant measures of flows on oriented cal equation for Roth-type interval exchange maps.
surfaces. Sov Math Dokl 14:1104–1108 J Am Math Soc 18(4):823–872. (electronic)
Katok AB (1980) Interval exchange transformations and Marmi S, Moussa P, Yoccoz J-C (2012) Linearization of
some special flows are not mixing. Israel J Math 35(4): generalized interval exchange maps. Ann Math
301–310 (2) 176(3):1583–1646
Katok AB, Stepin AM (1966) Approximation of ergodic Marmi S, Ulcigrai C, Yoccoz J-C (2020) On Roth type
dynamical systems by periodic transformations. Dokl conditions, duality and central Birkhoff sums for
Akad Nauk SSSR 171:1268–1271 I.E.M. Astérisque, (416, Quelques aspects de la théorie
Katok AB, Stepin AM (1967) Approximations in ergodic des systèmes dynamiques: un hommage à Jean-
theory. Uspehi Mat Nauk 137(5):81–106 Christophe Yoccoz.II), pp. 65–132
Keane M (1975) Interval exchange transformations. Math Marmi S, Yoccoz J-C (2016) Hölder regularity of the
Z 141:25–31 solutions of the cohomological equation for Roth type
Keane M (1977) Non-ergodic interval exchange transfor- interval exchange maps. Comm Math Phys 344(1):
mations. Israel J Math 26(2):188–196 117–139
Kerckhoff S, Masur H, Smillie J (1986) Ergodicity of Masur H (1982) Interval exchange transformations and
billiard flows and quadratic differentials. Ann Math measured foliations. Ann Math 115:169–200
(2) 124((2):293–311 Masur H (1992) Hausdorff dimension of the set of non-
Keynes HB, Newton D (1976) A “minimal”, nonuniquely ergodic foliations of a quadratic differential. Duke
ergodic interval exchange transformation. Math Math J 66(3):387–442
Z 148(2):101–105 Masur H, Smillie J (1991) Hausdorff dimension of sets of
Kočergin AV (1972) The absence of mixing in special flows nonergodic measured foliations. Ann Math (2) 134(3):
over a rotation of the circle and in flows on a two-dimen- 455–543
sional torus. Dokl Akad Nauk SSSR 205:512–518. Masur H, Tabachnikov S (2002) Rational billiards and flat
(Translated in: Sov Math Dokl 13:949–952, 1972) structures. In: Handbook of dynamical systems, vol 1A.
Kočergin AV (1975) Mixing in special flows over a shifting North-Holland, Amsterdam, pp 1015–1089
of segments and in smooth flows on surfaces. Mat Sb Mayer A (1943) Trajectories on the closed orientable sur-
96(138):471–502 faces. Rec Math [Mat Sbornik] NS 12(54):71–84
Kochergin AV (2004) Some generalizations of theorems on McMullen CT (2020) Teichmüller dynamics and unique
mixing flows with nondegenerate saddles on a two- ergodicity via currents and Hodge theory. J Reine
dimensional torus. Mat Sb 195(9):19–36 Angew Math 768:39–54
Ergodic and Spectral Theory of Area-Preserving Flows on Surfaces 367
Nikolaev I, Zhuzhoma E (1999) Flows on 2-dimensional Ulcigrai C (2022) Dynamics and ‘arithmetics’ of higher
manifolds. In: Lecture notes in mathematics, vol 1705. genus surface flows. EMS Press, ICM 2022
Springer Proceedings
Nogueira A, Rudolph D (1997) Topological weak-mixing Ulcigrai C (2007) Mixing of asymmetric logarithmic sus-
of interval exchange maps. Ergod Theory Dyn Syst pension flows over interval exchange transformations.
17(5):1183–1209 Ergod Theory Dyn Syst 27(3):991–1035
Novikov SP (1982) The Hamiltonian formalism and a Ulcigrai C (2009) Weak mixing for logarithmic flows over
multivalued analogue of Morse theory. (Russian) interval exchange transformations. J Mod Dyn 3(1):
Uspekhi Matematicheskikh Nauk 37(5):3–49. 35–49
(Translated in: Russ Math Surv 37(5):1–56, 1982) Ulcigrai C (2011) Absence of mixing in area-preserving
Oseledec VI (1966) The spectrum of ergodic automor- flows on surfaces. Ann Math (2) 173(3):1743–1778
phisms. Dokl Akad Nauk SSSR 168:1009–1011 Ulcigrai C (2021) Slow chaos in surface flows. Boll
Poincaré H (1987) Les méthodes nouvelles de la Unione Mat Ital 14(1):231–255
mécanique céleste. In: Les Grands Classiques Veech WA (1969) Strict ergodicity in zero dimensional
Gauthier-Villars, Librairie Scientifique et Technique dynamical systems and the Kronecker-Weyl theorem
Albert Blanchard, Paris mod 2. Trans Am Math Soc 140:1–33
Quas A (2017) Ergodicity and mixing properties. Springer Veech WA (1982a) A criterion for a process to be prime.
Berlin Heidelberg, Berlin, Heidelberg, pp 1–20 Monatsh Math 94(4):335–341
Ralston D, Troubetzkoy S (2017) Residual generic ergo- Veech WA (1982b) Gauss measures for transformations on
dicity of periodic group extensions over translation the space of interval exchange maps. Ann Math 115:
surfaces. Geom Dedicata 187:219–239 201–242
Ravotti D (2017) Quantitative mixing for locally Hamilto- Veech WA (1984) The metric theory of interval exchange
nian flows with saddle loops on compact surfaces. Ann transformations I. generic spectral properties. Am
Henri Poincaré 18(12):3815–3861 J Math 107(6):1331–1359
Robertson D (2019) Mild mixing of certain interval- Viana M (n.d.) Dynamics of interval exchange transforma-
exchange transformations. Ergod Theory Dyn Syst tions and Teichmüller flows. Available from http://w3.
39(1):248–256 impa.br/viana. Lecture Notes
Ryzhikov VV (1994) The absence of mixing in special von Neumann J (1932) Zur Operatorenmethode in der
flows over rearrangements of segments. Mat Zametki klassischen Mechanik. Ann Math (2) 33(3):587–642
55(6):146–149. (Translated in: Math Notes Yoccoz J-C (2010) Interval exchange maps and translation
55(5–6):648–650, 1994) surfaces. In: Homogeneous flows, moduli spaces and
Ryzhikov VV, Tuveno Z-P (2006) Disjointness, divisibil- arithmetic, Clay Mathematics Proceedings, vol 10. Amer-
ity, and quasi-simplicity of measure-preserving actions. ican Mathematical Society, Providence, RI, pp 1–69
Funktsional Anal i Prilozhen 40(3):85–89 Zemljakov AN, Katok AB (1975) Topological transitivity
Sataev EA (1975) The number of invariant measures for of billiards in polygons. Mat Zametki 18(2):291–300
flows on orientable surfaces. Izv Akad Nauk SSSR Ser Zorich A (1984) S. P. Novikov’s problem of the semiclas-
Mat 39(4):860–878 sical motion of an electron in a homogeneous magnetic
Scheglov D (2009) Absence of mixing for smooth flows on field that is close to rational. Uspekhi Mat Nauk
genus two surfaces. J Mod Dyn 3(1):13–34 39(5(239)):235–236
Schmidt K (2002) Dispersing cocycles and mixing flows Zorich A (1997) Deviation for interval exchange transfor-
under functions. Fundam Math 173(2):191–199 mations. Ergod Theory Dyn Syst 17(6):1477–1499
Sinai YG, Khanin KM (1992) Mixing for some classes of Zorich A (1999) How do the leaves of a closed 1-form
special flows over rotations of the circle. wind around a surface? In: Pseudoperiodic topology,
Funktsional’nyi Analiz i Ego Prilozheniya 26(3): American Mathematical Society Translations: Series 2,
1–21. (Translated in: Funct Anal Appl 26(3)155–169, vol 197. American Mathematical Society, Providence,
1992) RI, pp 135–178
Sinai YG, Ulcigrai C (2005) Weak mixing in interval Zorich A (2006) Flat surfaces. In: Frontiers in number
exchange transformations of periodic type. Lett Math theory, physics, and geometry, I. Springer, Berlin,
Phys 74(2):111–133 pp 437–583
Pressure and Equilibrium States in Ergodic Theory

Jean-René Chazottes¹ and Gerhard Keller²
¹Centre de Physique Théorique, CNRS/IP Paris, Palaiseau, France
²Department Mathematik, Universität Erlangen-Nürnberg, Erlangen, Germany

Article Outline

Glossary
Definition of the Subject
Introduction
Warming Up: Thermodynamic Formalism for Finite Systems
Shift Spaces, Invariant Measures, and Entropy
The Variational Principle: A Global Characterization of Equilibrium
The Gibbs Property: A Local Characterization of Equilibrium
Examples on Shift Spaces
Examples from Differentiable Dynamics
Nonequilibrium Steady States and Entropy Production
Some Ongoing Developments and Future Directions
Bibliography

Glossary

Dynamical system In this entry: a continuous transformation $T$ of a compact metric space $X$. For each $x \in X$, the transformation $T$ generates a trajectory $(x, Tx, T^2x, \ldots)$.

Entropy In this entry: the maximal rate of information gain per time that can be achieved by coarse-grained observations on a measure-preserving dynamical system. This quantity is often denoted $h(\mu)$.

Equilibrium state In general, a given dynamical system $T : X \to X$ admits a huge number of invariant measures. Given some continuous $\varphi : X \to \mathbb{R}$ (“potential”), those invariant measures that maximize a functional of the form $F(\mu) = h(\mu) + \langle \varphi, \mu \rangle$ are called “equilibrium states” for $\varphi$.

Ergodic theory Ergodic theory is the mathematical theory of measure-preserving dynamical systems.

Gibbs state In many cases, equilibrium states have a local structure that is determined by the local properties of the potential $\varphi$. They are called “Gibbs states.”

Invariant measure In this entry: a probability measure $\mu$ on $X$ which is invariant under the transformation $T$, that is, for which $\langle f \circ T, \mu \rangle = \langle f, \mu \rangle$ for each continuous $f : X \to \mathbb{R}$. Here $\langle f, \mu \rangle$ is a short-hand notation for $\int_X f \, d\mu$. The triple $(X, T, \mu)$ is called a measure-preserving dynamical system.

Pressure The maximum of the functional $F(\mu)$ is denoted by $P(\varphi)$ and called the “topological pressure” of $\varphi$, or simply the “pressure” of $\varphi$.

Sinai-Ruelle-Bowen measure Special equilibrium or Gibbs states that describe the statistics of the attractor of certain smooth dynamical systems.

Definition of the Subject

Gibbs and equilibrium states of one-dimensional lattice models in statistical physics play a prominent role in the statistical theory of chaotic dynamics. They first appear in the ergodic theory of certain differentiable dynamical systems, called “uniformly hyperbolic systems,” mainly Anosov and Axiom A diffeomorphisms (and flows). The central idea is to “code” the orbits of these systems into (infinite) symbolic sequences of symbols by following their history on a finite partition of their phase space. This defines a nice shift dynamical system called a subshift of finite type or a topological Markov chain. Then the construction of their “natural” invariant measures and the study of their properties are carried out at the symbolic
© Springer Science+Business Media, LLC, part of Springer Nature 2023
C. E. Silva, A. I. Danilenko (eds.), Ergodic Theory,
https://doi.org/10.1007/978-1-0716-2388-6_414
Originally published in
R. A. Meyers (ed.), Encyclopedia of Complexity and Systems Science, © Springer-Verlag 2009
https://doi.org/10.1007/978-3-642-27737-5_414-3
between one-dimensional lattice systems and dynamical systems is made by symbolic dynamics. Informally, symbolic dynamics consists of replacing each orbit of the original system by its history on a finite partition of its phase space labeled by the elements of the “alphabet” $A$. Therefore, each orbit of the original system is replaced by an infinite sequence of symbols, that is, by an element of the set $A^{\mathbb{Z}}$ or $A^{\mathbb{N}}$, depending on whether the map describing the dynamics is invertible or not. The action of the map on an initial condition is then easily seen to correspond to the translation (or shift) of its associated symbolic sequence. In general there is no reason to get all sequences of $A^{\mathbb{Z}}$ or $A^{\mathbb{N}}$. Instead one gets a closed invariant subset $X$ (a subshift) which can be very complicated. For a certain class of dynamical systems the partition can be successfully chosen so as to form a Markov partition. In this case, the dynamical system under consideration can be coded by a subshift of finite type (also called a topological Markov chain) which is a very nice symbolic dynamical system. Then one can play the game of statistical physics: for a given continuous, real-valued function (a “potential”) on $X$, construct the corresponding Gibbs states and equilibrium states. If the potential is regular enough, one expects uniqueness of the Gibbs state and that it is also the unique equilibrium state for this potential. This circle of ideas – ranging from Gibbs states on finite systems over invariant measures on symbolic systems and their (Shannon-)entropy with a digression to Kolmogorov-Chaitin complexity to equilibrium states and Gibbs states on subshifts of finite type – is presented in the next four sections.

At this point it should be remembered that the objects which can actually be observed are not equilibrium states (they are measures on $X$) but individual symbol sequences in $X$, which reflect more or less the statistical properties of an equilibrium state. Indeed, most sequences reflect these properties very well, but there are also rare sequences that look quite different. Their properties are described by large deviation principles, which are not discussed in the present article. We shall indicate some references along the way.

In sections “Examples on Shift Spaces” and “Examples from Differentiable Dynamics” we present a selection of important examples: measure of maximal entropy, Markov measures and Hofbauer’s example of nonuniqueness of equilibrium state; uniformly expanding Markov maps of the interval, interval maps with an indifferent fixed point, Anosov diffeomorphisms and Axiom A attractors with Sinai-Ruelle-Bowen measures, and Bowen’s formula for the Hausdorff dimension of conformal repellers. As we shall see, Sinai-Ruelle-Bowen measures are the only physically observable measures and they appear naturally in the context of nonuniformly hyperbolic diffeomorphisms (Young 2002).

A revival of interest in Anosov and Axiom A systems occurred in statistical mechanics in the 1990s. Several physical phenomena of nonequilibrium origin, like entropy production and chaotic scattering, were modeled with the help of those systems (by G. Gallavotti, P. Gaspard, D. Ruelle, and others). This new interest led to new results about old Anosov and Axiom A systems, see, for example, Chernov (2002) for a survey and references. In section “Nonequilibrium Steady States and Entropy Production,” we give a very brief account of entropy production in the context of Anosov systems which highlights the role of relative entropy.

This entry is a brief introduction to a vast subject in which we have tried to put forward some aspects not previously described in other expository texts. For readers willing to deepen their understanding of equilibrium and Gibbs states, there are the classic monographs by Bowen (2017) and by Ruelle (2004), the monograph by one of us (Keller 1998), and the survey article by Chernov (2002) (where Anosov and Axiom A flows are reviewed). Those texts are really complementary.

Warming Up: Thermodynamic Formalism for Finite Systems

We introduce the thermodynamic formalism in an elementary context, following Jaynes (1989). In this view, entropy, in the sense of information theory, is the central concept.

Incomplete knowledge about a system is conveniently described in terms of probability
distributions on the set of its possible states. This is particularly simple if the set of states, call it $X$, is finite. Then the equidistribution on $X$ describes complete lack of knowledge, whereas a probability vector that assigns probability 1 to one single state and probability 0 to all others represents maximal information about the system. A well-established measure of the amount of uncertainty represented by a probability distribution $\nu = (\nu(x))_{x \in X}$ is its entropy

\[ H(\nu) := -\sum_{x \in X} \nu(x) \log \nu(x), \]

which is zero if the probability is concentrated in one state and which attains its maximum value $\log |X|$ if $\nu$ is the equidistribution on $X$, that is, if $\nu(x) = |X|^{-1}$ for all $x \in X$. In this completely elementary context we will explore two concepts whose generalizations are central to the theory of equilibrium states in ergodic theory:

• Equilibrium distributions – defined in terms of a variational problem
• The Gibbs property of equilibrium distributions

The only mathematical prerequisites for this section are calculus and some elements of probability theory.

Equilibrium Distributions and the Gibbs Property

Suppose that a finite system can be observed through a function $U : X \to \mathbb{R}$ (an “observable”), and that we are looking for a probability distribution $\mu$ which maximizes entropy among all distributions $\nu$ with a prescribed expected value $\langle U, \nu \rangle := \sum_{x \in X} \nu(x) U(x)$ for the observable $U$. This means we have to solve a variational problem under constraints:

\[ H(\mu) = \max \{ H(\nu) : \langle U, \nu \rangle = E \}. \tag{1} \]

As the function $\nu \mapsto H(\nu)$ is strictly concave, there is a unique maximizing probability distribution $\mu$ provided the value $E$ can be attained at all by some $\langle U, \nu \rangle$. In order to derive an explicit formula for this $\mu$ we introduce a Lagrange multiplier $\beta \in \mathbb{R}$ and study, for each $\beta$, the unconstrained problem

\[ H(\mu_\beta) + \langle \beta U, \mu_\beta \rangle = p(\beta U) := \max_{\nu} \bigl( H(\nu) + \langle \beta U, \nu \rangle \bigr). \tag{2} \]

In analogy to the convention in ergodic theory we call $p(\beta U)$ the pressure of $\beta U$ and the maximizer $\mu_\beta$ the corresponding equilibrium distribution (synonymously equilibrium state).

The equilibrium distribution $\mu_\beta$ satisfies

\[ \mu_\beta(x) = \exp\bigl( -p(\beta U) + \beta U(x) \bigr) \quad \text{for all } x \in X \tag{3} \]

as an elementary calculation using Jensen’s inequality for the strictly convex function $t \mapsto t \log t$ shows:

\[ H(\nu) + \langle \beta U, \nu \rangle = \sum_{x \in X} \nu(x) \log \frac{e^{\beta U(x)}}{\nu(x)} \le \log \sum_{x \in X} \nu(x) \frac{e^{\beta U(x)}}{\nu(x)} = \log \sum_{x \in X} e^{\beta U(x)}, \]

with equality if and only if $e^{\beta U}$ is a constant multiple of $\nu$. The observation that $\nu = \mu_\beta$ is a maximizer proves at the same time that $p(\beta U) = \log \sum_{x \in X} e^{\beta U(x)}$.

The equality expressed in (3) is called the Gibbs property of $\mu_\beta$, and we say that $\mu_\beta$ is a Gibbs distribution if we want to stress this property.

In order to solve the constrained problem (1) it remains to show that there is a unique multiplier $\beta = \beta(E)$ such that $\langle U, \mu_\beta \rangle = E$. This follows from the fact that the map $\beta \mapsto \langle U, \mu_\beta \rangle$ maps the real line monotonically onto the interval $(\min U, \max U)$ which, in turn, is a direct consequence of the formulas for the first and second derivative of $p(\beta U)$ w.r.t. $\beta$:
the first identity in (4) can be rephrased as: $\mu_\beta$ is the gradient at $\beta U$ of the function $p$.

A similar analysis can be performed for an $\mathbb{R}^d$-valued observable $U$. In that case a vector $\beta \in \mathbb{R}^d$ of Lagrange multipliers is needed to satisfy the $d$ linear constraints.

Systems on a Finite Lattice

We now assume that the system has a lattice structure, modeling its extension in space, for instance. The system can be in different states at different positions. More specifically, let $\mathbb{L}_n = \{0, 1, \ldots, n-1\}$ be a set of $n$ positions in space, let $A$ be a finite set of states that can be attained by the system at each of its sites, and denote by $X := A^{\mathbb{L}_n}$ the set of all configurations of states from $A$ at positions of $\mathbb{L}_n$. It is helpful to think of $X$ as the set of all words of length $n$ over the alphabet $A$. We focus on observables $U_n$ which are sums of many local contributions in the sense that $U_n(a_0 \ldots a_{n-1}) = \sum_{i=0}^{n-1} \varphi(a_i \ldots a_{i+r-1})$ for some “local observable” $\varphi : A^r \to \mathbb{R}$. (The index $i + r - 1$ has to be taken modulo $n$.) In terms of $\varphi$ the maximizing measure can be written as

\[ \mu_\beta(a_0 \ldots a_{n-1}) = \exp\Bigl( -n P(\beta\varphi) + \beta \sum_{i=0}^{n-1} \varphi(a_i \ldots a_{i+r-1}) \Bigr), \tag{5} \]

where $P(\beta\varphi) := n^{-1} p(\beta U_n)$. A first immediate consequence of (5) is the invariance of $\mu_\beta$ under a cyclic shift of its argument, namely $\mu_\beta(a_1 \ldots a_{n-1} a_0) = \mu_\beta(a_0 \ldots a_{n-1})$. Therefore, we can restrict the maximizations in (1) and (2) to probability distributions $\nu$

\[ \mu_\beta(a_0 \ldots a_{n-1}) = \prod_{i=0}^{n-1} \exp\bigl( -P(\beta\varphi) + \beta\varphi(a_i) \bigr). \]

Indeed, comparison with (3) shows that $\mu_\beta$ is the $n$-fold product of the probability distribution $\mu_\beta^{\mathrm{loc}}$ on $A$ that maximizes $H(\nu) + \beta\nu(\varphi)$ among all distributions $\nu$ on $A$. It follows that $n^{-1} H(\mu_\beta) = H(\mu_\beta^{\mathrm{loc}})$ so that (6) implies $P(\beta\varphi) = p(\beta\varphi)$ for observables $\varphi$ that depend only on one coordinate.

Shift Spaces, Invariant Measures, and Entropy

We now turn to shift dynamical systems over a finite alphabet $A$.

Symbolic Dynamics

We start by fixing some notation. Let $\mathbb{N}$ denote the set $\{0, 1, 2, \ldots\}$. In the sequel we need

• A finite set $A$ (the “alphabet”),
• The set $A^{\mathbb{N}}$ of all infinite sequences over $A$, that is, the set of all $x = x_0 x_1 \ldots$ with $x_n \in A$ for all $n \in \mathbb{N}$,
• The translation (or shift) $\sigma : A^{\mathbb{N}} \to A^{\mathbb{N}}$, $(\sigma x)_n = x_{n+1}$, for all $n \in \mathbb{N}$,
• A shift invariant subset $X = \sigma(X)$ of $A^{\mathbb{N}}$. With a slight abuse of notation we denote the restriction of $\sigma$ to $X$ by $\sigma$ again.

We mention two interpretations of the dynamics of $\sigma$: it can describe the evolution of a system with state space $X$ in discrete time steps (this is the prevalent interpretation if $\sigma : X \to X$ is obtained as a symbolic representation of another dynamical system), or it can be the spatial translation of the
configuration of a system on an infinite lattice (generalizing the point of view from subsection “Systems on a Finite Lattice” above). In the latter case one usually looks at the shift on the two-sided shift space $A^{\mathbb{Z}}$, for which the theory is nearly identical.

On $A^{\mathbb{N}}$ one can define a metric $d$ by

\[ d(x, y) := 2^{-N(x, y)} \quad \text{where } N(x, y) := \min\{ k \in \mathbb{N} : x_k \neq y_k \}. \tag{7} \]

Hence $d(x, y) = 1$ if and only if $x_0 \neq y_0$, and $d(x, x) = 0$ upon agreeing that $N(x, x) = \infty$ and $2^{-\infty} = 0$. Equipped with this metric, $A^{\mathbb{N}}$ becomes a compact metric space and $\sigma$ is easily seen to be a continuous surjection of $A^{\mathbb{N}}$. Finally, if $X$ is a closed subset of $A^{\mathbb{N}}$, we call the restriction $\sigma : X \to X$, which is again a continuous surjection, a shift dynamical system. We remark that $d$ generates on $A^{\mathbb{N}}$ the product topology of the discrete topology on $A$, just as many variants of $d$ do. For more details see ▶ “Symbolic Dynamics.” As usual, $C(X)$ denotes the space of real-valued continuous functions on $X$ equipped with the supremum norm $\|\cdot\|_\infty$.

Invariant Measures

A probability distribution $\nu$ (or simply distribution) on $X$ is a Borel probability measure on $X$. It is unambiguously specified by its values $\nu[a_0 \ldots a_{n-1}]$ ($n \in \mathbb{N}$, $a_i \in A$) on cylinder sets

\[ [a_0 \ldots a_{n-1}] := \{ x \in X : x_i = a_i \text{ for all } i = 0, \ldots, n-1 \}. \]

Any bounded and measurable $f : X \to \mathbb{R}$ (in particular any $f \in C(X)$) can be integrated by any distribution $\nu$. To stress the linearity of the integral in both the integrand and the integrator, we use the notation

\[ \langle f, \nu \rangle := \int_X f \, d\nu. \]

In probabilistic terms, $\langle f, \nu \rangle$ is the expectation of the observable $f$ under $\nu$. The set $\mathcal{M}(X)$ of all probability distributions is compact in the weak topology, the coarsest topology on $\mathcal{M}(X)$ for which $\nu \mapsto \langle f, \nu \rangle$ is continuous for all $f \in C(X)$; see “Measure Preserving Systems,” subsection “Existence of Invariant Measures.” (Note that in functional analysis this is called the weak-* topology.) Henceforth we will use both terms, “measure” and “distribution,” if we talk about probability distributions.

A measure $\nu$ on $X$ is invariant if expectations of observables are unchanged under the shift, that is, if

\[ \langle f \circ \sigma, \nu \rangle = \langle f, \nu \rangle \quad \text{for all bounded measurable } f : X \to \mathbb{R}. \]

The set of all invariant measures is denoted by $\mathcal{M}_\sigma(X)$. As a closed subset of $\mathcal{M}(X)$ it is compact in the weak topology. Of special importance among all invariant measures $\nu$ are the ergodic ones which can be characterized by the property that, for all bounded measurable $f : X \to \mathbb{R}$,

\[ \lim_{n \to \infty} \frac{1}{n} \sum_{k=0}^{n-1} f(\sigma^k x) = \langle f, \nu \rangle \quad \text{for } \nu\text{-a.e. (almost every) } x, \tag{8} \]

that is, for a set of $x$ of $\nu$-measure one. They are the indecomposable “building blocks” of all other measures in $\mathcal{M}_\sigma(X)$ (see “Measure Preserving Systems” or ▶ “Ergodic Theorems”). The almost everywhere convergence in (8) is Birkhoff’s ergodic theorem (see ▶ “Ergodic Theorems”); the constant limit characterizes the ergodicity of $\nu$.

Entropy of Invariant Measures

We give a brief account of the definition and basic properties of the entropy of an invariant measure $\nu$. For details and the generalization of this concept to general dynamical systems, we refer to ▶ “Entropy in Ergodic Theory” or Katok and Hasselblatt (1995), and to Katok (2007) for a historical account.

Let $\nu \in \mathcal{M}_\sigma(X)$. For each $n > 0$ the cylinder probabilities $\nu[a_0 \ldots a_{n-1}]$ give rise to a probability distribution on the finite set $A^n$, see section “Warming Up: Thermodynamic Formalism for Finite Systems,” so
The Variational Principle: A Global Characterization of Equilibrium

Usually, a dynamical systems model of a “physical” system consists of a state space and a map (or a differential equation) describing the dynamics. An invariant measure for the system is rarely given a priori. Indeed, many (if not most) dynamical systems arising in this way have uncountably many ergodic invariant measures. This limits considerably the “practical value” of Birkhoff’s ergodic theorem (8) or the Shannon-McMillan-Breiman theorem (10): not only do the limits in these theorems depend on the invariant measure $\nu$, but also the sets of points for which the theorems guarantee almost everywhere convergence are practically disjoint for different $\nu$ and $\nu'$ in $\mathcal{M}_\sigma(X)$. Therefore, a choice of $\nu$ has to be made which reflects the original modeling intentions. We will argue in this and the next sections that a variational principle with a judiciously chosen “observable” may be a useful guideline – generalizing the observations for finite systems collected in the corresponding section above. As announced earlier we restrict again to shift dynamical systems, because they are rather universal models for many other systems.

Equilibrium States

We define the pressure of an observable $\phi \in C(X)$ as

$$P(\phi) := \sup\{h(\nu) + \langle \phi, \nu \rangle : \nu \in \mathcal{M}_\sigma(X)\}. \qquad (12)$$

Since $\mathcal{M}_\sigma(X)$ is compact and the functional $\nu \mapsto h(\nu) + \langle \phi, \nu \rangle$ is upper semicontinuous, the supremum is attained – not necessarily at a unique measure as we will see (which is remarkably different from what happens in finite systems). Each measure $\nu$ for which the supremum is attained is called an equilibrium state for $\phi$. Here the word “state” is used synonymously with “distribution” or “measure” – a reflection of the fact that in “well-behaved cases,” as we will see in the next section, this measure is uniquely determined by the constraint(s) under which it maximizes entropy, and that means by the macroscopic state of the system. (In contrast, the word “state” was used in the above section on finite systems to designate microscopic states.)

As, for each $\nu \in \mathcal{M}_\sigma(X)$, the functional $\phi \mapsto h(\nu) + \langle \phi, \nu \rangle$ is affine on $C(X)$, the pressure functional $P : C(X) \to \mathbb{R}$, which, by definition, is the pointwise supremum of these functionals, is convex. It is therefore instructive to fit equilibrium states into the abstract framework of convex analysis (Israel 1979; Keller 1998; Moulin-Ollagnier 1985; Walters 1992). To this end recall the identities in (4) that identify, for finite systems, equilibrium states as gradients of the pressure function $p : \mathbb{R}^{|A|} \to \mathbb{R}$ and guarantee that $p$ is twice differentiable and strictly convex. In the present setting where $P$ is defined on the Banach space $C(X)$, differentiability and strict convexity are no more guaranteed, but one can show:

Equilibrium States as (Sub)-gradients

$\mu \in \mathcal{M}_\sigma(X)$ is an equilibrium state for $\phi$ if and only if $\mu$ is a subgradient (or tangent functional) for $P$ at $\phi$, i.e., if $P(\phi + \psi) - P(\phi) \ge \langle \psi, \mu \rangle$ for all $\psi \in C(X)$. In particular, $\phi$ has a unique equilibrium state $\mu$ if and only if $P$ is differentiable at $\phi$ with gradient $\mu$, i.e., if,

$$\lim_{t \to 0} \frac{1}{t}\bigl(P(\phi + t\psi) - P(\phi)\bigr) = \langle \psi, \mu \rangle \quad \text{for all } \psi \in C(X). \qquad (13)$$

Let us see how equilibrium states on $X = A^{\mathbb{N}}$ can directly be obtained from the corresponding equilibrium distributions on finite sets $A^n$ introduced in subsection “Systems on a Finite Lattice.” Define $\phi^{(n)} : A^n \to \mathbb{R}$ by $\phi^{(n)}(a_0 \ldots a_{n-1}) := \phi(a_0 \ldots a_{n-1} a_0 \ldots a_{n-1} \ldots)$, denote by $U_n$ the corresponding global observable on $A^n$, and let $\mu_n$ be the equilibrium distribution on $A^n$ that maximizes $H(\mu) + \langle U_n, \mu \rangle$. Then all weak limit points of the “approximative equilibrium distributions” $\mu_n$ on $A^n$ are equilibrium states on $A^{\mathbb{N}}$.

This can be seen as follows: Let the measure $\mu$ on $A^{\mathbb{N}}$ be any weak limit point of the $\mu_n$. Then, given $\epsilon > 0$ there exists $k \in \mathbb{N}$ such that
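The identification of equilibrium states with (sub)gradients of the pressure can be checked numerically in the finite setting. The following is a minimal sketch, assuming the standard finite-system pressure $p(u) = \log \sum_a e^{u_a}$ and equilibrium distribution $\mu(a) = e^{u_a - p(u)}$; the function names and the sample vectors `u` and `c` are illustrative and not taken from the text.

```python
import math

def pressure(u):
    """Finite-system pressure p(u) = log sum_a exp(u_a), computed stably."""
    m = max(u)
    return m + math.log(sum(math.exp(x - m) for x in u))

def equilibrium(u):
    """Distribution mu(a) = exp(u_a - p(u)), the maximizer of H(mu) + <u, mu>."""
    p = pressure(u)
    return [math.exp(x - p) for x in u]

def free_energy(u, mu):
    """H(mu) + <u, mu> for a probability vector mu."""
    entropy = -sum(q * math.log(q) for q in mu if q > 0)
    return entropy + sum(x * q for x, q in zip(u, mu))

def directional_derivative(u, c, t=1e-6):
    """Central difference approximation of d/dt p(u + t c) at t = 0."""
    up = [x + t * y for x, y in zip(u, c)]
    dn = [x - t * y for x, y in zip(u, c)]
    return (pressure(up) - pressure(dn)) / (2 * t)

u = [0.3, -1.0, 2.0]    # hypothetical observable on a 3-point set
c = [1.0, 0.5, -2.0]    # hypothetical perturbation direction
mu = equilibrium(u)
pairing = sum(ci * mi for ci, mi in zip(c, mu))   # <c, mu>
```

In this sketch `free_energy(u, mu)` agrees with `pressure(u)`, the directional derivative agrees with `pairing` as in (13), and `pressure(u + c) - pressure(u)` is at least `pairing`, which is the subgradient inequality.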
acterized in terms of the logarithms of the normalizing constants of these approximating distributions. Let $S_n\phi(x) := \phi(x) + \phi(\sigma x) + \ldots + \phi(\sigma^{n-1} x)$. From each cylinder set $[a_0 \ldots a_{n-1}]$ we can pick a point $z$ such that $S_n\phi(z)$ is the maximal value of $S_n\phi$ on this set. We denote the collection of the $|A|^n$ points we obtain in this way by $E_n$. Observe that $E_n$ is not unambiguously defined, but any choice we make will do.

limits of these measures are obviously shift invariant, and a more involved estimate we do not present here shows that each such weak limit $\mu$ satisfies $h(\mu) + \langle \phi, \mu \rangle \ge P(\phi)$.

We note that the same arguments work for any other sequence of sets $E_n$ which contain exactly one point from each cylinder. So there are many ways to approximate equilibrium states, and if there are more than one equilibrium state, there is generally no guarantee that the limit is always the same.
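The construction of $E_n$ can be illustrated by brute force. The sketch below assumes the standard fact that $\frac{1}{n} \log \sum_{z \in E_n} \exp S_n\phi(z)$ converges to $P(\phi)$, and uses the hypothetical two-coordinate observable $\phi(x) = \beta x_0 x_1$ on $\{-1, +1\}^{\mathbb{N}}$, whose pressure $\log(2\cosh\beta)$ is known from the $2 \times 2$ transfer matrix. All function names are illustrative.

```python
import itertools
import math

def birkhoff_sum_max(word, beta):
    """Maximum of S_n(phi) over the cylinder [word] for phi(x) = beta*x0*x1.

    S_n(phi)(x) = beta * sum_{i<n} x_i x_{i+1}. Only the last term depends on
    the free coordinate x_n, and its maximum over x_n in {-1, +1} is |beta|.
    """
    inner = sum(word[i] * word[i + 1] for i in range(len(word) - 1))
    return beta * inner + abs(beta)

def pressure_estimate(n, beta):
    """(1/n) log of the sum of exp(S_n phi(z)) over one maximizing point z per n-cylinder."""
    total = sum(math.exp(birkhoff_sum_max(w, beta))
                for w in itertools.product((-1, 1), repeat=n))
    return math.log(total) / n

def exact_pressure(beta):
    """log of the leading eigenvalue 2*cosh(beta) of the transfer matrix."""
    return math.log(2.0 * math.cosh(beta))
```

For this observable the estimate overshoots the exact value by a term of order $1/n$, so `pressure_estimate(12, 0.7)` is already close to `exact_pressure(0.7)`.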
Nonuniqueness of Equilibrium States: An Example

Before we turn to sufficient conditions for the uniqueness of equilibrium states in the next section, we present one of the simplest nontrivial examples for nonuniqueness of equilibrium states. Motivated by the so-called Fisher-Felderhof droplet model of condensation in statistical mechanics (Fisher and Felderhof 1970; Fisher 1967), Hofbauer (1977) studies an observable $\phi$ on $X = \{0, 1\}^{\mathbb{N}}$ defined as follows: Let $(a_k)$ be a sequence of negative real numbers with $\lim_{k \to \infty} a_k = 0$. Set $s_k := a_0 + \ldots + a_k$. For $k \ge 1$ denote $M_k := \{x \in X : x_0 = \ldots = x_{k-1} = 1, x_k = 0\}$ and $M_0 := \{x \in X : x_0 = 0\}$, and define

$$\phi(x) := a_k \text{ for } x \in M_k \quad \text{and} \quad \phi(11\ldots) = 0.$$

Then $\phi : X \to \mathbb{R}$ is continuous, so that there exists at least one equilibrium state for $\phi$. Hofbauer proves that there is more than one equilibrium state if and only if $\sum_{k=0}^{\infty} e^{s_k} = 1$ and $\sum_{k=0}^{\infty} (k + 1) e^{s_k} < \infty$. In that case $P(\phi) = 0$, so one of these equilibrium states is the unit mass $\delta_{11\ldots}$, and we denote the other equilibrium state by $\mu_1$, so $h(\mu_1) + \langle \phi, \mu_1 \rangle = 0$. In view of (13) the pressure function is not differentiable at $\phi$.

What does the pressure function $\beta \mapsto P(\beta\phi)$ look like? As $h(\delta_{11\ldots}) + \langle \beta\phi, \delta_{11\ldots} \rangle = 0$ for all $\beta$, $P(\beta\phi) \ge 0$ for all $\beta$. Observe now that $\phi(x) \le 0$ with equality only for $x = 11\ldots$ This implies that $\langle \phi, \mu \rangle < 0$ for all $\mu \in \mathcal{M}_\sigma(X)$ different from $\delta_{11\ldots}$. From this we can conclude:

• $P(\beta\phi) \le P(\phi) = 0$ for $\beta > 1$, so $P(\beta\phi) = 0$ for $\beta \ge 1$.
• $P(\beta\phi) \ge h(\mu_1) + \langle \beta\phi, \mu_1 \rangle = h(\mu_1) + \langle \phi, \mu_1 \rangle - (1 - \beta)\langle \phi, \mu_1 \rangle = -(1 - \beta)\langle \phi, \mu_1 \rangle$.

It follows that, at $\beta = 1$, the derivative from the right of $P(\beta\phi)$ is zero, whereas the derivative from the left is at most $\langle \phi, \mu_1 \rangle < 0$.

More on Equilibrium States

In more general dynamical systems the entropy function is not necessarily upper semicontinuous and hence equilibrium states need not exist, that is, the supremum in (12) need not be attained by any invariant measure. A well-known sufficient property that guarantees the upper semicontinuity of the entropy function is the expansiveness of the system, see, for example, Ruelle (1973): a continuous transformation $T$ of a compact metric space is positively expansive, if there is a constant $\gamma > 0$ such that for any two distinct points $x$ and $y$ from the space there is some $n \in \mathbb{N}$ such that $T^n x$ and $T^n y$ are at least a distance $\gamma$ apart. If $T$ is a homeomorphism one says it is expansive, if the same holds for some $n \in \mathbb{Z}$. The previous results carry over without changes (although at the expense of more complicated proofs) to general expansive systems. The variational principle (14) holds in the very general context where $T$ is a continuous action of $\mathbb{Z}^d_+$ on a compact Hausdorff space $X$. This was proved in Misiurewicz (1976) in a simple and elegant way. In the monograph (Moulin-Ollagnier 1985) it is extended to amenable group actions.

The Gibbs Property: A Local Characterization of Equilibrium

In this section we are going to see that, for a sufficiently regular potential $\phi$ on a topologically mixing subshift of finite type, one has a unique equilibrium state which has the “Gibbs property.” This property generalizes formula (5) that we derived for finite lattices. Subshifts of finite type are the symbolic models for Axiom A diffeomorphisms, as we shall see later on.

Subshifts of Finite Type

We start by recalling what is a subshift of finite type and refer the reader to ▶ “Symbolic Dynamics” or Lind and Marcus (1995) for more details. Given a “transition matrix” $M = (M_{ab})_{a,b \in A}$ whose entries are 0’s or 1’s, one can define a subshift $X_M$ as the set of all sequences $x \in A^{\mathbb{N}}$ such that $M_{x_i x_{i+1}} = 1$ for all $i \in \mathbb{N}$. This is called a subshift of finite type or a topological Markov chain. We assume that there exists some integer $p_0$ such that $M^p$ has strictly positive entries for all
$p \ge p_0$. This means that $M$ is irreducible and aperiodic. This property is equivalent to the property that the subshift of finite type is topologically mixing. A general subshift of finite type admits a decomposition into a finite union of transitive sets, each of which being a union of cyclically permuted sets on which the appropriate iterate is topologically mixing. In other words, topologically mixing subshifts of finite type are the building blocks of subshifts of finite type.

The Gibbs Property for a Class of Regular Potentials

The class of regular potentials we consider is that of “summable variations.” We denote by $\mathrm{var}_k(\phi)$ the modulus of continuity of $\phi$ on cylinders of length $k \ge 1$, that is,

$$\mathrm{var}_k(\phi) := \sup\{|\phi(x) - \phi(y)| : x \in [y_0 \ldots y_{k-1}]\}.$$

If $\mathrm{var}_k(\phi) \to 0$ as $k \to \infty$, this means that $\phi$ is (uniformly) continuous with respect to the distance (7). We impose the stronger condition

$$\sum_{k=1}^{\infty} \mathrm{var}_k(\phi) < \infty. \qquad (15)$$

We can now state the main result of this section.

The Gibbs state of a summable potential. Let $X_M$ be a topologically mixing subshift of finite type. Given a potential $\phi : X_M \to \mathbb{R}$ satisfying the summability condition (15), there is a (probability) measure $\mu_\phi$ supported on $X_M$, that we call a Gibbs state. It is the unique $\sigma$-invariant measure which satisfies the following property: There exists a constant $C > 0$ such that, for all $x \in X_M$ and for all $n \ge 1$,

$$C^{-1} \le \frac{\mu_\phi[x_0 \ldots x_{n-1}]}{\exp\bigl(S_n\phi(x) - nP(\phi)\bigr)} \le C. \qquad \text{(“Gibbs property”)} \quad (16)$$

Moreover, the Gibbs state $\mu_\phi$ is ergodic and is also the unique equilibrium state of $\phi$, that is, the unique invariant measure for which the supremum in (12) is attained.

We now make several comments on this theorem.

• The Gibbs property (16) gives a uniform control of the measure of all cylinders in terms of their “energy.” This strengthens considerably the asymptotic equipartition property (11) that we recover if we restrict (16) to the set of $\mu_\phi$ measure 1 where Birkhoff’s ergodic Theorem (8) applies, and use the identity $\langle \phi, \mu_\phi \rangle - P(\phi) = -h(\mu_\phi)$.
• Gibbs measures on topologically mixing subshifts of finite type are ergodic (and actually mixing in a strong sense) as can be inferred from Ruelle’s Perron-Frobenius Theorem (see the next subsection).
• Suppose that there is another invariant measure $\mu'$ satisfying (16), possibly with a constant $C'$ different from $C$. It is easy to verify that $\mu' = f\mu$ for some $\mu$-integrable function $f$ by using (16) and the Radon-Nikodym Theorem. Shift invariance imposes that, $\mu$-a.e., $f = f \circ \sigma$. Then the ergodicity of $\mu$ implies that $f$ is a constant $\mu$-a.e., thus $\mu' = \mu$; see Bowen (2017).
• One could define a Gibbs state by saying that it is an invariant measure $\mu$ satisfying (16) for a given continuous potential $\phi$. If one does so, it is simple to verify that such a $\mu$ must also be an equilibrium state. Indeed, using (16), one can deduce that $\langle \phi, \mu \rangle + h(\mu) \ge P(\phi)$. The converse need not be true in general, see subsection “More on Hofbauer’s Example” below. But the summability condition (15) is indeed sufficient for the coincidence of Gibbs and equilibrium states. A proof of this fact can be found in Ruelle (2004) or Keller (1998).

Ruelle’s Perron-Frobenius Theorem

The powerful tool behind the theorem in the previous subsection is a far-reaching generalization of the classical Perron-Frobenius theorem for irreducible matrices. Instead of a matrix, one introduces the so-called transfer operator, also called the “Perron-Frobenius operator” or “Ruelle’s operator,” which acts on a suitable Banach space of observables. It is D. Ruelle (1968) who first introduced this operator in the context of one-dimensional lattice gases with exponentially decaying interactions. In our context, this
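The Gibbs property (16) and the role of the classical Perron-Frobenius theorem can be made concrete in the simplest case, where the potential depends on two coordinates only and the transfer operator reduces to a matrix. The sketch below is an illustration under assumptions not taken from the text: the golden mean shift as $X_M$, a hypothetical potential table `g`, and the standard facts that $P(\phi)$ is the logarithm of the leading eigenvalue of the matrix $L_{ab} = M_{ab} e^{g_{ab}}$ and that the Gibbs state is the Markov measure built from its left and right eigenvectors.

```python
import itertools
import math

M = [[1, 1], [1, 0]]               # golden mean shift: the word "11" is forbidden
g = [[0.5, -0.3], [1.2, 0.0]]      # hypothetical potential phi(x) = g[x0][x1]

# Transfer matrix L_ab = M_ab * exp(g_ab)
L = [[M[a][b] * math.exp(g[a][b]) for b in range(2)] for a in range(2)]

def perron(mat, iters=500):
    """Power iteration: leading eigenvalue and positive eigenvector (entries sum to 1)."""
    v = [1.0] * len(mat)
    lam = 1.0
    for _ in range(iters):
        w = [sum(mat[a][b] * v[b] for b in range(len(v))) for a in range(len(v))]
        lam = sum(w)
        v = [x / lam for x in w]
    return lam, v

lam, r = perron(L)                                               # L r = lam r
_, l = perron([[L[b][a] for b in range(2)] for a in range(2)])   # l L = lam l
norm = sum(l[a] * r[a] for a in range(2))
pi = [l[a] * r[a] / norm for a in range(2)]                      # stationary vector
P = [[L[a][b] * r[b] / (lam * r[a]) for b in range(2)] for a in range(2)]
pressure = math.log(lam)

def gibbs_ratios(n):
    """Ratios mu[x_0..x_{n-1}] / exp(S_n phi(x) - n P(phi)) over admissible words x_0..x_n."""
    out = []
    for w in itertools.product(range(2), repeat=n + 1):
        if any(M[w[i]][w[i + 1]] == 0 for i in range(n)):
            continue
        mu = pi[w[0]]
        for i in range(n - 1):
            mu *= P[w[i]][w[i + 1]]
        s = sum(g[w[i]][w[i + 1]] for i in range(n))
        out.append(mu / math.exp(s - n * pressure))
    return out
```

In this setting the ratio depends only on $x_0$, $x_{n-1}$ and $x_n$, so `min(gibbs_ratios(n))` and `max(gibbs_ratios(n))` do not change with `n` once `n >= 3`; they give explicit values for $C^{-1}$ and $C$ in (16).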
that is, given $\mu_\phi$, the relative entropy $h(\cdot|\mu_\phi)$, as a function on $\mathcal{M}_\sigma(X_M)$, attains its minimum only at $\mu_\phi$.

Indeed, by (18) we have $h(\nu|\mu_\phi) = P(\phi) - \langle \phi, \nu \rangle - h(\nu)$. We now use (12) and the fact that $\mu$

It is not difficult to verify that the total number of periodic sequences of period $n$ equals the trace of the matrix $M^n$, that is, we have the formula
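The statement that periodic sequences of period $n$ are counted by the trace of $M^n$ can be verified by brute force for small $n$. The following sketch uses the golden mean transition matrix as a hypothetical example; "period $n$" is read as $\sigma^n x = x$, so a periodic sequence corresponds to a word $x_0 \ldots x_{n-1}$ whose cyclic transitions are all allowed.

```python
import itertools

def mat_mul(A, B):
    """Product of two square integer matrices of the same size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace_of_power(M, n):
    """Trace of M^n, computed by repeated multiplication."""
    P = [[int(i == j) for j in range(len(M))] for i in range(len(M))]
    for _ in range(n):
        P = mat_mul(P, M)
    return sum(P[i][i] for i in range(len(M)))

def count_periodic(M, n):
    """Number of words x_0..x_{n-1} with all transitions x_i -> x_{(i+1) mod n} allowed."""
    count = 0
    for w in itertools.product(range(len(M)), repeat=n):
        if all(M[w[i]][w[(i + 1) % n]] == 1 for i in range(n)):
            count += 1
    return count

GOLDEN = [[1, 1], [1, 0]]   # hypothetical example: the word "11" is forbidden
```

For `GOLDEN` both functions return the Lucas numbers 1, 3, 4, 7, 11 for `n = 1, ..., 5`.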
Indeed, as $\mu$ is a Gibbs measure, there are $\beta_{ab} \in \mathbb{R}$ ($a, b \in A$) and constants $P \in \mathbb{R}$, $C > 0$ such that

$$C^{-1} \le \frac{\mu[x_0 \ldots x_{n-1}]}{\exp\Bigl(\sum_{a,b \in A} \beta_{ab} \phi^{ab}_n(x) - nP\Bigr)} \le C \qquad (20)$$

for all $x \in A^{\mathbb{N}}$ and all $n \in \mathbb{N}$. Let $r_{ab} := \exp\bigl(\beta_{ab} - \sum_{b' \in A} \beta_{ab'} q_{ab'} - P\bigr)$. Then the denominator in (20) equals $r_{x_0 x_1} \ldots r_{x_{n-2} x_{n-1}}$, and it follows that $\mu$ is equivalent to the stationary Markov measure defined by the (nonstochastic) matrix $(r_{ab})_{a,b \in A}$. As $\mu$ is ergodic, $\mu$ is this Markov measure, and as $\mu$ satisfies the linear constraints $\mu[ab] = \mu[a] q_{ab}$, we conclude that $\mu = \nu_Q$.

The Ising Chain

Here the task is to characterize all “spin chains” in $x \in \{-1, +1\}^{\mathbb{N}}$ (or, more commonly, $\{-1, +1\}^{\mathbb{Z}}$) which are as random as possible with the constraint that two adjacent spins have a prescribed probability $p \ne \frac{1}{2}$ to be identical. With $\phi(x) := x_0 x_1$ this is equivalent to requiring that $x$ is typical for a Gibbs distribution $\mu_{\beta\phi}$ where $\beta = \beta(p)$ is such that $\langle \phi, \mu_{\beta\phi} \rangle = 2p - 1$. It follows that there is a constant $C > 0$ such that for each $n \in \mathbb{N}$ and any two “spin patterns” $a = a_0 \ldots a_{n-1}$ and $b = b_0 \ldots b_{n-1}$

$$\left| \log \frac{\mu_{\beta\phi}[a_0 \ldots a_{n-1}]}{\mu_{\beta\phi}[b_0 \ldots b_{n-1}]} - \beta (N_a - N_b) \right| \le C,$$

where $N_a$ and $N_b$ are the numbers of identical adjacent spins in $a$ and $b$, respectively.

More on Hofbauer’s Example

We come back to the example described in subsection “Nonuniqueness of Equilibrium States: An Example.” It is easy to verify that in that example $\mathrm{var}_{k+1}(\phi) = |a_k|$. For instance, if $a_k = -1/(k + 1)^2$ there is a unique Gibbs/equilibrium state. If $a_k = -3 \log((k + 1)/k)$ for $k \ge 1$ and $a_0 = -\log \sum_{j=1}^{\infty} j^{-3}$, then from Hofbauer (1977) we know that $\phi$ admits more than one equilibrium state, one of them being $\delta_{11\ldots}$, which cannot be a Gibbs state for any continuous $\phi$.

Examples from Differentiable Dynamics

In this section we present a number of examples to which the general theory developed above does not apply directly but only after a transfer of the theory from a symbolic space to a manifold. We restrict to examples where the results can be transferred because those aspects of the smooth dynamics we focus on can be studied as well on a shift dynamical system that is obtained from the original one via symbolic coding. (We do not discuss the coding process itself which is sometimes far from trivial, but we focus on the application of the Gibbs and equilibrium theory.) There are alternative approaches where instead of the results the concepts and (partly) the strategies of proofs are transferred to the smooth dynamical systems. This has led both to an extension of the range of possible applications of the theory and to a number of refined results (because some special features of smooth systems necessarily get lost by transferring the analysis to a completely disconnected metric space).

In the following examples, $T$ denotes a (possibly piecewise) differentiable map of a compact smooth manifold $M$. Points on the manifold are denoted by $u$ and $v$. In all examples there is a Hölder continuous coding map $\pi : X \to M$ from a subshift of finite type $X$ onto the manifold which respects the dynamics, that is, $T \circ \pi = \pi \circ \sigma$. This factor map $\pi$ is “nearly” invertible in the sense that the set of points in $M$ with more than one preimage under $\pi$ has measure zero for all $T$-invariant measures we are interested in. Hence such measures $\tilde{\mu}$ on $M$ correspond unambiguously to shift invariant measures $\mu$ via $\tilde{\mu} = \mu \circ \pi^{-1}$. Similarly observables $\tilde{\phi}$ on $M$ and $\phi = \tilde{\phi} \circ \pi$ on $X$ are related.

Uniformly Expanding Markov Maps of the Interval

A transformation $T$ on $M := [0, 1]$ is called a Markov map, if there are $0 = u_0 < u_1 < \ldots < u_N = 1$
such that each restriction $T|_{(u_{i-1}, u_i)}$ is strictly monotone, $C^{1+r}$ for some $r > 0$, and maps $(u_{i-1}, u_i)$ onto a union of some of these $N$ monotonicity intervals. It is called uniformly expanding if there is some $k \in \mathbb{N}$ such that $\lambda := \inf_x |(T^k)'(x)| > 1$. It is not difficult to verify that the symbolic coding of such a system leads to a topological Markov chain over the alphabet $A = \{1, \ldots, N\}$. To simplify the discussion we assume that the transition matrix $M$ of this topological Markov chain is irreducible and aperiodic.

Our goal is to find a $T$-invariant measure $\tilde{\mu}$ represented by $\mu \in \mathcal{M}_\sigma(X_M)$ which minimizes the relative entropy to Lebesgue measure $m$ on $[0, 1]$

$$h(\tilde{\mu}|m) := \lim_{n \to \infty} \frac{1}{n} \sum_{a_0, \ldots, a_{n-1} \in \{1, \ldots, N\}} \mu[a_0 \ldots a_{n-1}] \log \frac{\mu[a_0 \ldots a_{n-1}]}{\nu_n[a_0 \ldots a_{n-1}]},$$

where $\nu_n[a_0 \ldots a_{n-1}] := |I_{a_0 \ldots a_{n-1}}|$. (Recall that, without insisting on invariance, this would just be the Lebesgue measure itself.) The existence of the limit will be justified below – observe that $m$ is not a Gibbs state as $\nu$ is in Eq. (17). The argument rests on the simple observation (implied by the uniform expansion and the piecewise Hölder-continuity of $T'$) that $T$ has bounded distortion, that is, that there is a constant $C > 0$ such that for all $n \in \mathbb{N}$, $a_0 \ldots a_{n-1} \in \{1, \ldots, N\}^n$ and $u \in I_{a_0 \ldots a_{n-1}}$ the following holds:

$$C^{-1} \le |I_{a_0 \ldots a_{n-1}}| \cdot |(T^n)'(u)| \le C, \quad \text{or, equivalently,} \quad C^{-1} \le \frac{|I_{a_0 \ldots a_{n-1}}|}{\exp S_n\tilde{\phi}(u)} \le C, \qquad (21)$$

where $\tilde{\phi}(u) := -\log |T'(u)|$. (Observe the similarity between this property and the Gibbs property (16).) Assuming bounded distortion we have at once

$$h(\tilde{\mu}|m) = \lim_{n \to \infty} \frac{1}{n} \Bigl( -H_n(\mu) - \sum_{k=0}^{n-1} \langle \phi \circ \sigma^k, \mu \rangle \Bigr) = -h(\mu) - \langle \phi, \mu \rangle,$$

and minimizing this relative entropy just amounts to maximizing $h(\mu) + \langle \phi, \mu \rangle$ for $\phi = -\log |T'| \circ \pi$. As the results on Gibbs distributions from section “The Gibbs Property: A Local Characterization of Equilibrium” apply, we conclude that

$$C^{-1} \le \frac{\mu[a_0 \ldots a_{n-1}]}{|I_{a_0 \ldots a_{n-1}}|} \le C$$

for some $C > 0$. So the unique $T$-invariant measure $\tilde{\mu}$ that minimizes the relative entropy $h(\tilde{\mu}|m)$ is equivalent to Lebesgue measure $m$. (The existence of an invariant probability measure equivalent to $m$ is well known, also without invoking entropy theory. It is guaranteed by a “Folklore Theorem” (Jakobson 2002).)

Interval Maps with an Indifferent Fixed Point

The presence of just one point $x \in [0, 1]$ such that $T'(x) = 1$ dramatically changes the properties of the system. A canonical example is the map $T_\alpha : x \mapsto x(1 + 2^\alpha x^\alpha)$ if $x \in [0, 1/2[$ and $x \mapsto 2x - 1$ if $x \in [1/2, 1]$. We have $T'(0) = 1$, that is, 0 is an indifferent fixed point. For $\alpha \in [0, 1[$ this map admits an absolutely continuous invariant probability measure $d\mu(x) = h(x)dx$, where $h(x) \sim x^{-\alpha}$ when $x \to 0$ (Thaler 1980). In the physics literature, this type of map is known as the “Manneville-Pomeau” map. It was introduced as a model of transition from laminar to intermittent behavior (Pomeau and Manneville 1980). In Gaspard and Wang (1988) the authors construct a piecewise affine version of this map to study the complexity of trajectories (in the sense of subsection “A Short Digression on Complexity”). This gives rise to a countable state Markov chain. In Wang (1989) the close connection to the Fisher-Felderhof model and Hofbauer’s example (see subsection “Nonuniqueness of Equilibrium States: An Example”) was realized. We refer to Sarig (2001) for recent developments and a list of references.

Axiom A Diffeomorphisms, Anosov Diffeomorphisms, Sinai-Ruelle-Bowen Measures

The first spectacular application of the theory of Gibbs measures to differentiable dynamical
systems was Sinai’s approach to Anosov diffeomorphisms via Markov partitions (Sinai 1968) that allowed one to code the dynamics of these maps into a subshift of finite type and to study their invariant measures by methods from equilibrium statistical mechanics (Sinai 1972) that had been developed previously by Dobrushin, Lanford, and Ruelle (Dobrushin 1968a, b, c, 1969; Lanford and Ruelle 1969). Not much later this approach was extended by Bowen (1970) to Smale’s Axiom A diffeomorphisms (and to Axiom A flows by Bowen and Ruelle (1975)); see also Ruelle (1976). The interested reader can consult, for example, Young (2002) for a survey, and either (Bowen 2017) or (Chernov 2002) for details.

Both types of diffeomorphisms act on a smooth compact Riemannian manifold $M$ and are characterized by the existence of a compact $T$-invariant hyperbolic set $\Lambda \subseteq M$. Their basic properties are described in detail in the contribution ▶ “Ergodic Theory: Basic Examples and Constructions.” Very briefly, the tangent bundle over $\Lambda$ splits into two invariant subbundles – a stable one and an unstable one. Correspondingly, through each point of $\Lambda$ there passes a local stable and a local unstable manifold which are both tangent to the respective subspaces of the local tangent space. The unstable derivative of $T$, that is, the derivative $DT$ restricted to the unstable subbundle, is uniformly expanding. Its Jacobian determinant, denoted by $J^{(u)}$, is Hölder continuous as a function on $\Lambda$. Hence the observable $\phi^{(u)} := -\log |J^{(u)}| \circ \pi$ is Hölder continuous, and the Gibbs and equilibrium theory apply (via the symbolic coding) to the diffeomorphism $T$ (modulo possibly a decomposition of the hyperbolic set into irreducible and aperiodic components, called basic sets, that can be modeled by topologically mixing subshifts of finite type). The main results are:

Characterization of attractors. The following assertions are equivalent for a basic set $\Omega \subseteq \Lambda$:

(i) $\Omega$ is an attractor, that is, there are arbitrarily small neighborhoods $U \subseteq M$ of $\Omega$ such that $TU \subset U$.
(ii) The union of all stable manifolds through points of $\Omega$ is a subset of $M$ with positive volume.
(iii) The pressure $P_{T|\Omega}(\phi^{(u)}) = 0$.

In this case the unique equilibrium and Gibbs state $\mu^+$ of $T|_\Omega$ is called the Sinai-Ruelle-Bowen (SRB) measure of $T|_\Omega$. It is uniquely characterized by the identity $h_{T|\Omega}(\mu^+) = -\langle \phi^{(u)}, \mu^+ \rangle$. (For all other $T$-invariant measures on $\Omega$ one has “<” instead of “=.”)

Further properties of SRB measures. Suppose $P_{T|\Omega}(\phi^{(u)}) = 0$ and let $\mu^+$ be the SRB measure.

(a) For a set of points $u \in M$ of positive volume we have:

$$\lim_{n \to \infty} \frac{1}{n} \sum_{k=0}^{n-1} f(T^k u) = \langle f, \mu^+ \rangle.$$

(Indeed, because of (ii) of the above characterization, this holds for almost all points of the union of the stable manifolds through points of $\Omega$.)
(b) Conditioned on unstable manifolds, $\mu^+$ is absolutely continuous to the volume measure on unstable manifolds.

In the special case of transitive Anosov diffeomorphisms, the whole manifold is a hyperbolic set and $\Omega = M$. Because of transitivity, property (ii) from the characterization of attractors is trivially satisfied, so there is always a unique SRB measure $\mu^+$. As $T^{-1}$ is an Anosov diffeomorphism as well – only the roles of stable and unstable manifolds are interchanged – $T^{-1}$ has a unique SRB measure $\mu^-$ which is the unique equilibrium state of $T^{-1}$ (and hence also of $T$) for $\phi^{(s)} := \log |J^{(s)}|$. One can show:

SRB measures for Anosov diffeomorphisms. The following assertions are equivalent:

1. $\mu^+ = \mu^-$.
2. $\mu^+$ or $\mu^-$ is absolutely continuous w.r.t. the volume measure on $M$.
3. For each periodic point $u = T^n u \in M$, $|J(u)| = 1$, where $J$ denotes the determinant of $DT^n$.
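A standard example in which all three assertions hold is a hyperbolic toral automorphism. The sketch below uses the hypothetical choice $T(u) = Au \bmod 1$ on the 2-torus with $A = \begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}$ (the "cat map"), which is not discussed in the text. It checks that the determinant of $DT^n = A^n$ equals 1 for every $n$, and that the exponential growth rate of the number of periodic points equals $\log \lambda_u$, where $\lambda_u$ is the expanding eigenvalue, that is, the unstable Jacobian.

```python
import math

A = [[2, 1], [1, 1]]   # derivative of the hypothetical example T(u) = A u mod 1

def mat_mul(X, Y):
    """Product of two 2x2 integer matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_pow(X, n):
    """X^n by repeated multiplication (exact integer arithmetic)."""
    R = [[1, 0], [0, 1]]
    for _ in range(n):
        R = mat_mul(R, X)
    return R

def det2(X):
    """Determinant of a 2x2 matrix."""
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]

def periodic_point_count(n):
    """Number of solutions of T^n u = u on the torus, given by |det(A^n - I)|."""
    An = mat_pow(A, n)
    return abs(det2([[An[0][0] - 1, An[0][1]], [An[1][0], An[1][1] - 1]]))

LAMBDA_U = (3 + math.sqrt(5)) / 2   # expanding eigenvalue of A
```

Since this map preserves Lebesgue measure, whose entropy is $\log \lambda_u$, the identity $h(\mu^+) = -\langle \phi^{(u)}, \mu^+ \rangle$ holds with $\phi^{(u)} \equiv -\log \lambda_u$ and $\mu^+ = \mu^-$ equal to Lebesgue measure.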
We remark that, similarly to the case of Markov interval maps, the unstable Jacobian of $T^n$ at $u$ is asymptotically equivalent to the volume of the “$n$-cylinder” of the Markov partition around $u$. So the maximization of $h(\mu) + \langle \phi^{(u)}, \mu \rangle$ by the SRB measure $\mu^+$ can again be interpreted as the minimization of the relative entropy of invariant measures with respect to the normalized volume, and the fact that $P(\phi^{(u)}) = 0$ in the Anosov (or more generally attractor) case means that $\mu^+$ is as close to being absolutely continuous as it is possible for a singular measure. This is reflected by the above properties (a) and (b).

We emphasize the meaning of property (a) above: it tells us that the SRB measure $\mu^+$ is the only physically observable measure. Indeed, in numerical experiments with physical models, one picks an initial point $u \in M$ “at random” (i.e., with respect to the volume or Lebesgue measure) and follows its orbit $T^k u$, $k \ge 0$.

Bowen’s Formula for the Hausdorff Dimension of Conformal Repellers

Just as nearby orbits converge towards an attractor, they diverge away from a repeller. Conformal repellers form a nice class of systems which can be coded by a subshift of finite type. The construction of their Markov partitions is much simpler than that of Anosov diffeomorphisms, see, for example, Zinsmeister (2000).

Let us recall the definition of a conformal repeller before giving a fundamental example. Given a holomorphic map $T : V \to \mathbb{C}$ where $V \subset \mathbb{C}$ is open and $J$ a compact subset of $\mathbb{C}$, one says that $(J, V, T)$ is a conformal repeller if

1. There exist $C > 0$, $\alpha > 1$ such that $|(T^n)'(z)| \ge C\alpha^n$ for all $z \in J$, $n \ge 1$.
2. $J = \bigcap_{n \ge 1} T^{-n}(V)$.
3. for any open set $U$ such that $U \cap J \ne \emptyset$, there

From the definition it follows that $T(J) = J$ and $T^{-1}(J) = J$.

A fundamental example is the map $T : z \to z^2 + c$, $c \in \mathbb{C}$ being a parameter. It can be shown that for $|c| < \frac{1}{4}$ there exists a compact set $J$, called a (hyperbolic) Julia set, such that $(J, \mathbb{C}, T)$ is a conformal repeller.

Conformal repellers $J$ are in general fractal sets and one can measure their “degree of fractality” by means of their Hausdorff dimension, $\dim_H(J)$. Roughly speaking, one computes this dimension by covering the set $J$ by balls with radius less than or equal to $\delta$. If $N_\delta(J)$ denotes the cardinality of the smallest such covering, then we expect that

$$N_\delta(J) \sim \delta^{-\dim_H(J)}, \quad \text{as } \delta \to 0.$$

We refer the reader to ▶ “Ergodic Theory: Fractal Geometry” or Falconer (2003), Pesin (1997) for a rigorous definition (based on Carathéodory’s construction) and for more information on fractal geometry.

Bowen’s formula relates $\dim_H(J)$ to the unique zero of the pressure function $\beta \mapsto P(\beta\phi)$ where $\phi := -(\log |T'|)|_J$. It is not difficult to see that indeed this map has a unique zero for some positive $\beta$.

By property (1), $S_n\phi \le \mathrm{const} - n \log \alpha$, which implies (by (13)) that $\frac{d}{d\beta} P(\beta\phi) = \langle \phi, \mu_\beta \rangle \le -\log \alpha < 0$. As $P(0)$ equals the topological entropy of $J$, that is, the logarithm of the largest eigenvalue of the matrix $M$ associated to the Markov partition, $P(0)$ is strictly positive. Therefore, (recall that the pressure function is continuous) there exists a unique number $\beta_0 > 0$ such that $P(\beta_0\phi) = 0$.

It turns out that this unique zero is precisely $\dim_H(J)$:

Bowen’s formula. The Hausdorff dimension of $J$ is the unique solution of the equation $P(\beta\phi) = 0$, $\beta \in \mathbb{R}$; in particular

$$P\bigl(\dim_H(J)\,\phi\bigr) = 0.$$

This formula was proven in Ruelle (1982) for a general class of conformal repellers after the seminal paper (Bowen 1979). The main tool is a distortion estimate very similar to (21). A simple exposition can be found in Zinsmeister (2000).
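Bowen’s formula becomes fully explicit for piecewise linear repellers, and this gives a simple way to see it at work. The sketch below assumes a hypothetical repeller generated by finitely many full linear branches with slopes $s_i$ (with $|s_i| > 1$ and at least two branches), for which the pressure reduces to $P(\beta\phi) = \log \sum_i |s_i|^{-\beta}$; the zero is then found by bisection. The function names are illustrative.

```python
import math

def pressure(beta, slopes):
    """P(beta*phi) = log sum_i |s_i|^(-beta) for phi = -log|T'| on a full-branch linear repeller."""
    return math.log(sum(abs(s) ** (-beta) for s in slopes))

def bowen_dimension(slopes, tol=1e-12):
    """Unique zero of beta -> P(beta*phi), located by bisection.

    Assumes at least two branches (so P(0) > 0) and all |slopes| > 1
    (so the pressure is strictly decreasing in beta).
    """
    lo, hi = 0.0, 1.0
    while pressure(hi, slopes) > 0:      # enlarge the bracket until P(hi) <= 0
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if pressure(mid, slopes) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For two branches of slope 3 the repeller is the middle-third Cantor set, and `bowen_dimension([3, 3])` recovers $\log 2 / \log 3$; here `pressure(0, [3, 3])` equals $\log 2$, the topological entropy.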
SRB measures for Anosov diffeomorphisms and Axiom A attractors have been accepted recently as conceptual models for nonequilibrium steady states in nonequilibrium statistical mechanics. Let us point out that the word “equilibrium” is used in physics in a much more restricted sense than in ergodic theory. Only diffeomorphisms preserving the natural volume of the manifold (or a measure equivalent to the volume) would be considered as appropriate toy models of physical equilibrium situations. In the case of Anosov diffeomorphisms this is precisely the case if the “forward” and “backward” SRB measures $\mu^+$ and $\mu^-$ coincide. Otherwise, the diffeomorphism models a situation out of equilibrium, and the difference between $\mu^+$ and $\mu^-$ can be related to entropy production and irreversibility.

Gallavotti and Cohen (1995; Gallavotti 1996) introduced SRB measures as idealized models of nonequilibrium steady states around 1995. In order to have as firm a mathematical basis as possible they made the “chaotic hypothesis” that the systems they studied behave like transitive Anosov systems. Ruelle (1996) extended this approach to more general (even nonuniformly) hyperbolic dynamics; see also his reviews (Ruelle 1998, 2003) for more recent accounts discussing also a number of related problems, and Rondoni and Mejía-Monasterio (2007). The importance of the Gibbs property of SRB measures for the discussion of entropy production was also highlighted in Jiang et al. (2000), where it is shown that for transitive Anosov diffeomorphisms the relative entropy $h(\mu^+|\mu^-)$ equals the average entropy production rate $-\langle \log |J|, \mu^+ \rangle$ of $\mu^+$ where $J$ denotes again the Jacobian determinant of the diffeomorphism. In particular, the entropy production rate is zero if, and only if, $h(\mu^+|\mu^-) = 0$, that is, using coding and (19), if, and only if, $\mu^+ = \mu^-$. According to subsection “Axiom A Diffeomorphisms, Anosov Diffeomorphisms, Sinai-Ruelle-Bowen Measures,” this is also equivalent to $\mu^+$ or $\mu^-$ being absolutely continuous with respect to the volume measure.

As we saw, many dynamical systems with uniform hyperbolic structure (e.g., Anosov maps, axiom A diffeomorphisms) can be modeled by subshifts of finite type over a finite alphabet. We already mentioned in subsection “Interval Maps with an Indifferent Fixed Point” the typical example of a map of the interval with an indifferent fixed point, whose symbolic model is still a subshift of finite type, but with a countable alphabet. The thermodynamic formalism for such systems is by now well developed (Fiebig et al. 2002; Gurevich and Savchenko 1998; Sarig 1999, 2001, 2003) and used, for example, for multidimensional piecewise expanding maps (Buzzi and Sarig 2003). An active line of research is related to systems admitting representations by symbolic models called “towers” constructed by using “inducing schemes.” The fundamental example is the class of one-dimensional unimodal maps satisfying the “Collet-Eckmann condition.” A first attempt to develop thermodynamic formalism for such systems was made in Bruin and Keller (1998) where existence and uniqueness of equilibrium measures for the potential function $\phi_\beta(u) = -\beta \log |T'(u)|$ with $\beta$ close to 1 was established. Very recently, new developments in this direction appeared, see, for example, (Bruin and Todd 2008, 2009; Pesin and Senti 2008).

A largely open field of research concerns a new branch of nonequilibrium statistical mechanics, the so-called chaotic scattering theory, namely the analysis of chaotic systems with various openings or holes in phase space, and the corresponding repellers on which interesting invariant measures exist. We refer the reader to Chernov (2002) for a brief account and references to the physics literature. The existence of (generalized) steady states on repellers and the so-called escape rate formula have been observed numerically in a number of models. So far, little has been proven mathematically, except for Anosov diffeomorphisms with special holes (Chernov 2002) and for certain nonuniformly hyperbolic systems (Bruin et al. 2010).
Lanford OE, Ruelle D (1969) Observables at infinity and states with short range correlations in statistical mechanics. Commun Math Phys 13:194–215
Lewis JT, Pfister C-E (1995) Thermodynamic probability theory: some aspects of large deviations. Russ Math Surv 50:279–317
Lind D, Marcus B (1995) An introduction to symbolic dynamics and coding. Cambridge University Press, Cambridge
Misiurewicz M (1976) A short proof of the variational principle for a $\mathbb{Z}^N_+$ action on a compact space. In: International conference on dynamical systems in mathematical physics (Rennes, 1975), Astérisque, No. 40, Soc Math France, pp 147–157
Moulin-Ollagnier J (1985) Ergodic theory and statistical mechanics. In: Lecture notes in mathematics, vol 1115. Springer, Berlin
Pesin Y (1997) Dimension theory in dynamical systems. Contemporary views and applications. University of Chicago Press, Chicago
Pesin Y, Senti S (2008) Equilibrium measures for maps with inducing schemes. J Mod Dyn 2(3):397–430. https://doi.org/10.3934/jmd.2008.2.397
Pesin Y, Weiss (1997) The multifractal analysis of Gibbs measures: motivation, mathematical foundation, and examples. Chaos 7(1):89–106
Pollicott M (2000) Rates of mixing for potentials of summable variation. Trans Am Math Soc 352(2):843–853
Pomeau Y, Manneville P (1980) Intermittent transition to turbulence in dissipative dynamical systems. Commun Math Phys 74(2):189–197
Rondoni L, Mejía-Monasterio C (2007) Fluctuations in nonequilibrium statistical mechanics: models, mathematical theory, physical mechanisms. Nonlinearity 20(10):R1–R37
Ruelle D (1968) Statistical mechanics of a one-dimensional lattice gas. Commun Math Phys 9:267–278
Ruelle D (1973) Statistical mechanics on a compact set with $\mathbb{Z}^\nu$ action satisfying expansiveness and specification. Trans Am Math Soc 185:237–251
Ruelle D (1976) A measure associated with Axiom A attractors. Am J Math 98:619–654
Ruelle D (1982) Repellers for real analytic maps. Ergod Theory Dyn Syst 2(1):99–107
Ruelle D (1996) Positivity of entropy production in nonequilibrium statistical mechanics. J Stat Phys 85:1–23
Ruelle D (1998) Smooth dynamics and new theoretical ideas in nonequilibrium statistical mechanics. J Stat Phys 95:393–468
Ruelle D (2003) Extending the definition of entropy to nonequilibrium steady states. Proc Nat Acad Sci USA 100(6):3054–3058
Ruelle D (2004) Thermodynamic formalism: the mathematical structures of equilibrium statistical mechanics, Cambridge Mathematical Library, 2nd edn. Cambridge University Press, Cambridge
Sarig O (1999) Thermodynamic formalism for countable Markov shifts. Ergod Theory Dyn Syst 19(6):1565–1593
Sarig O (2001) Phase transitions for countable Markov shifts. Commun Math Phys 217(3):555–577
Sarig O (2003) Existence of Gibbs measures for countable Markov shifts. Proc Am Math Soc 131(6):1751–1758
Seneta E (2006) Non-negative matrices and Markov chains. In: Springer series in statistics. Springer
Sinai JG (1968) Markov partitions and C-diffeomorphisms. Funct Anal Appl 2:61–82
Sinai JG (1972) Gibbs measures in ergodic theory. Russ Math Surv 27(4):21–69
Thaler M (1980) Estimates of the invariant densities of endomorphisms with indifferent fixed points. Israel J Math 37(4):303–314
Walters P (1975) Ruelle’s operator theorem and g-measures. Trans Am Math Soc 214:375–387
Walters P (1992) Differentiability properties of the pressure of a continuous transformation on a compact metric space. J Lond Math Soc (2) 46(3):471–481
Wang X-J (1989) Statistical physics of temporal intermittency. Phys Rev A 40(11):6647–6661
Young L-S (1990) Large deviations in dynamical systems. Trans Am Math Soc 318:525–543
Young L-S (2002) What are SRB measures, and which dynamical systems have them? Dedicated to David Ruelle and Yasha Sinai on the occasion of their 65th birthdays. J Stat Phys 108(5–6):733–754
Zinsmeister M (2000) Thermodynamic formalism and holomorphic dynamical systems. SMF/AMS texts and monographs, vol 2. American Mathematical Society
Parallels Between Topological Dynamics and Ergodic Theory

Wen Huang, Song Shao and Xiangdong Ye
CAS Wu Wen-Tsun Key Laboratory of Mathematics, and Department of Mathematics, University of Science and Technology of China, Hefei, Anhui, China

Article Outline

Glossary
Definition of the Subject
Introduction and History
Recurrence and Other Dynamical Properties
Entropy Theory
Structure Theorems and Multiple Ergodic Averages
Further Directions
References

Glossary

Ergodicity A measure-preserving system is ergodic if it is essentially indecomposable, in the sense that given any invariant measurable set, either the set or its complement has measure 0.
Measure-preserving transformation A map from a measure space to itself such that each measurable subset of the space has the same measure as its inverse image under the map. Such a measure-preserving map on a measure space is called a measure-preserving system.
Measure-theoretic and topological entropy A nonnegative (possibly infinite) real number which describes the complexity of a measure-preserving transformation and of a topological dynamical system, respectively.
(Multiple) Ergodic average Let $(X,\mathcal{X},\mu,T)$ be a measure-preserving system and $f$ be a real-valued (or complex-valued) function on $X$. Then $A_N(f,x) = \frac{1}{N}\sum_{n=0}^{N-1} f(T^n x)$ is called the ergodic average. The multiple ergodic averages (also called “nonconventional averages”) are the following ones
$$\frac{1}{N}\sum_{n=0}^{N-1} f_1\big(T_1^{p_1(n)}x\big)\, f_2\big(T_2^{p_2(n)}x\big)\cdots f_d\big(T_d^{p_d(n)}x\big),$$
where $T_1, T_2, \ldots, T_d$ are invertible and act on a probability space $(X,\mathcal{X},\mu)$, $f_1,\ldots,f_d$ are functions on $X$, and $p_1,\ldots,p_d$ are integral polynomials.
Recurrent point A point is recurrent if it is in its own future.
Topological dynamics The study of the asymptotic behaviors of a homeomorphism from a topological space to itself. Such a self-homeomorphism of a topological space is called a topological dynamical system.
Transitive point A point is a transitive point when every point is in its future.
Transitivity and minimality A system is transitive when it contains at least one transitive point. A system is minimal when every point is a transitive point.

Definition of the Subject

Topological dynamics is “the study of transformation groups with respect to those topological properties whose prototype occurred in classical dynamics” (Gottschalk and Hedlund 1955). If in this definition the adjective “topological” is replaced by “measure-theoretic,” then one obtains a description of measurable dynamics, also called ergodic theory.
In this entry we try to present theorems illustrating the analogy between topological and measurable dynamics, and theorems showing drastic contrast between them.
shifts. In the 1970s, Furstenberg showed how to translate questions in combinatorial number theory into ergodic theory. This inspired a new line of research, which ultimately led to stunning recent results in combinatorial number theory.

For simplicity, in this article we only consider $\mathbb{Z}$-actions. Thus, by a topological dynamical system (t.d.s. for short) we mean a pair $(X,T)$, where $X$ is a compact metrizable space with metric $\rho$ and $T : X \to X$ is a homeomorphism. By a measure-preserving system (m.p.s. for short) we mean a quadruple $(X,\mathcal{X},\mu,T)$, where $(X,\mathcal{X},\mu)$ is a Borel probability space and $T, T^{-1} : X \to X$ are both measurable and measure preserving, that is, $T^{-1}\mathcal{X} = \mathcal{X} = T\mathcal{X}$ and $\mu(A) = \mu(T^{-1}A)$ for each $A \in \mathcal{X}$.

There are two types of problems in topological dynamics and ergodic theory. The first type can be viewed as the internal problems, which are concerned with understanding the homeomorphisms or measure-preserving transformations and trying to decide when two of them are conjugate or isomorphic. The second type is the application of the theories to other branches of mathematics or to physics. For the internal problems, the usual way is to look for conjugacy or isomorphism invariants. Such an invariant could be a property (e.g., minimality, ergodicity), or an assignment of some object (e.g., a number like entropy, a group or a structure). For more details, see (Walters 1982).

Let us now introduce the notions of conjugacy and isomorphism. Let $(X,T)$ and $(Y,S)$ be two t.d.s. A continuous map $\pi : X \to Y$ is called a homomorphism or factor map between $(X,T)$ and $(Y,S)$ if it is onto and $\pi \circ T = S \circ \pi$. In this case we say $(X,T)$ is an extension of $(Y,S)$ or $(Y,S)$ is a factor of $(X,T)$. When $\pi$ is a homeomorphism, we say that $(X,T)$ and $(Y,S)$ are conjugate, and in this case we regard the two systems as the same.

To define the same notion in ergodic theory, we keep in mind that a null set is negligible. Suppose $(X_i,\mathcal{X}_i,\mu_i)$ is a probability space and $T_i : X_i \to X_i$ is measure preserving, $i = 1, 2$. We say that $T_2$ is a factor of $T_1$ or $T_1$ is an extension of $T_2$ if there exist $M_i \in \mathcal{X}_i$ with $\mu_i(M_i) = 1$ and $T_i(M_i) \subseteq M_i$ $(i = 1, 2)$ and there exists a measure-preserving transformation $\phi : M_1 \to M_2$ with $\phi T_1(x) = T_2\phi(x)$ for any $x \in M_1$. If $\phi$ is 1-1 and $\phi^{-1}$ is also measure preserving, then we say $\phi$ is an isomorphism.

Let $(X,T)$ be a t.d.s. By the Krylov-Bogolioubov theorem, there is at least one invariant probability measure $\mu$ on the Borel $\sigma$-algebra $\mathcal{B}_X$. In such a way, $(X,\mathcal{B}_X,\mu,T)$ can be viewed as an m.p.s., which also explains why some topological results can be proved via ergodic theory.

On the other hand, for any m.p.s. $(X,\mathcal{X},\mu,T)$ one can find some t.d.s. $(\hat{X},\hat{T})$ and an invariant measure $\hat{\mu}$ on the Borel $\sigma$-algebra $\mathcal{B}_{\hat{X}}$ such that $(X,\mathcal{X},\mu,T)$ is isomorphic to $(\hat{X},\mathcal{B}_{\hat{X}},\hat{\mu},\hat{T})$ (Furstenberg 1981). $(\hat{X},\hat{T})$ is called a topological model of $(X,\mathcal{X},\mu,T)$. This makes it possible to use topological methods in the study of ergodic theory.

We try our best to exhibit the similarity or parallels between the two theories, from the notations to the results. There is a very nice survey by Glasner and Weiss (2005) on the same subject; here we will review the subject from some new angles and present some new progress.

Due to our personal interest, knowledge, and the space available, we can only choose some aspects to do so. Thus, the materials we choose are restricted, and are the ones we are familiar with. In the first part of the entry we will discuss dynamical properties, where most of the results are classical. In the second part we focus on the study of entropy, particularly on the so-called local entropy theory, which is relatively new. In the third part we will consider the structure theorems and their applications to combinatorial number theory, which are active topics of current research and are developing very fast. We hope in such a way we could cover certain old and new results, and some materials from the internal problems to the applications of the theories.

The authors would like to thank B. Kra, B. Host, and M. Lemańczyk for valuable comments which improved the writing of the survey significantly.

Recurrence and Other Dynamical Properties

Given a dynamical system, we are interested in those points that can be observed repeatedly. This leads to the notion of recurrence. The basic fact is
that any t.d.s. has a recurrent point and for any measurable set with positive measure there are points returning to the original set infinitely many times. We are particularly interested in the systems that are “non-decomposable” in some sense. Such systems are minimal or transitive systems in topological dynamics and ergodic systems in ergodic theory. Below we will see that when studying dynamical properties, some interesting subsets of $\mathbb{N}$, $\mathbb{Z}_+ = \mathbb{N} \cup \{0\}$ or $\mathbb{Z}$ will be involved. This indicates that dynamical systems have a close relation with combinatorial number theory. We will see much evidence of this in the sequel.

Since many statements of the paper are better stated using the notion of a family, we now give the definition. See Akin (1997), Furstenberg (1967, 1981) for more details.

We define a family on $\mathbb{Z}_+$, and the definition works for $\mathbb{N}$, $\mathbb{Z}$, etc. A collection $\mathcal{F}$ of subsets of $\mathbb{Z}_+$ is a family if it is hereditary upward, that is, $F_1 \subseteq F_2$ and $F_1 \in \mathcal{F}$ imply $F_2 \in \mathcal{F}$. A family $\mathcal{F}$ is called proper if it is neither empty nor the entire power set of $\mathbb{Z}_+$, or, equivalently, if $\mathbb{Z}_+ \in \mathcal{F}$ and $\emptyset \notin \mathcal{F}$. Any nonempty collection $\mathcal{A}$ of subsets of $\mathbb{Z}_+$ generates a family, namely the collection of all subsets of $\mathbb{Z}_+$ that contain some member of $\mathcal{A}$. The dual of a family $\mathcal{F}$ is the family
$$\mathcal{F}^* = \{F \subseteq \mathbb{Z}_+ : \mathbb{Z}_+ \setminus F \notin \mathcal{F}\}.$$
It is not hard to see that if $\mathcal{F}$ is a family, then
$$(\mathcal{F}^*)^* = \mathcal{F}.$$
If a family $\mathcal{F}$ is closed under finite intersections and is proper, then it is called a filter. A family $\mathcal{F}$ has the Ramsey property if $A = A_1 \cup A_2 \in \mathcal{F}$ implies that $A_1 \in \mathcal{F}$ or $A_2 \in \mathcal{F}$. It is well known that a proper family has the Ramsey property if and only if its dual $\mathcal{F}^*$ is a filter (Furstenberg 1981).

Minimality Versus Ergodicity

Minimality and Transitivity
The first recurrence property we will discuss is minimality. Recall that we only consider homeomorphisms. A t.d.s. is minimal if the orbit of any point is dense in the space, where the orbit of $x$ is the set $\{T^n x : n \in \mathbb{Z}_+\}$. Given a t.d.s. $(X, T)$, we say $(Y, T)$ is a subsystem if $Y$ is a closed subset of $X$ and is invariant under $T$, that is, $TY = Y$. Minimality is equivalent to the statement that there is no non-empty proper invariant closed subset. We say $x$ is a periodic point if there is $n \in \mathbb{N}$ such that $T^n(x) = x$. If $x$ is a periodic point, then it is easy to see that the orbit of $x$ is minimal. Basic examples of minimal t.d.s. can be found in the books (Auslander 1988; Ellis 1969; Kurka 2003; de Vries 1993), etc.

Now we consider a weaker notion than minimality, namely transitivity. A t.d.s. $(X,T)$ is transitive (for a survey, see Kolyada and Snoha (1997)) if there is some point $x$ with dense orbit. We have

Theorem 2.1 Let $(X,T)$ be a t.d.s. Then the following statements are equivalent:

Let $(X,T)$ be a t.d.s. A point $x \in X$ is a (positively) recurrent point if there is a sequence $n_i \to +\infty$ such that $T^{n_i} x \to x$ as $i \to \infty$. So, if $x$ is recurrent, then we can observe its motion near $x$ infinitely many times. There are several ways to show the following basic fact.

Theorem 2.2 (Birkhoff recurrence theorem). Any t.d.s. has a recurrent point.

This theorem has an important generalization, namely the multiple topological recurrence theorem (Furstenberg 1981). We mention that it is equivalent to the well-known van der Waerden’s
theorem (van der Waerden 1927; Furstenberg 1981).

When $x$ is recurrent, the restriction of $T$ to its orbit closure is onto and thus this orbit closure is a transitive subsystem. A point is a transitive point if its orbit closure is $X$, that is, its orbit is dense in $X$. We say $x$ is a minimal point if the orbit closure of $x$ is a minimal subset. We have

Theorem 2.3 Let $(X,T)$ be a transitive t.d.s. Then the set of transitive points is a $G_\delta$ subset of $X$. If $(X,T)$ is transitive and not minimal, then the set of non-transitive points is dense in $X$.

We remark that there exist transitive t.d.s. whose non-transitive points are periodic points (Downarowicz and Ye 2002).

The following well-known fact due to Furstenberg (1981) tells us how recurrence is related to IP-sets. Let $\{p_i\}_{i=1}^{\infty}$ be a sequence in $\mathbb{N}$. Define
$$FS(\{p_i\}) = \Big\{ \sum_{j=1}^{k} p_{i_j} : 1 \le i_1 < \ldots < i_k,\ k \in \mathbb{N} \Big\}.$$
A subset $F \subseteq \mathbb{N}$ is called an IP-set if it contains some $FS(\{p_i\}_{i=1}^{\infty})$. Denote the family of all IP-sets by $\mathcal{F}_{ip}$. The well-known Hindman’s theorem (Hindman 1974) states that if $N_1 \cup \ldots \cup N_k$ is a partition of $\mathbb{N}$, then one of the cells contains an IP-set. This is equivalent to saying that $\mathcal{F}_{ip}$ has the Ramsey property, and thus the dual family $\mathcal{F}_{ip}^*$ is a filter. Any set in $\mathcal{F}_{ip}^*$ is called an IP*-set.

For a t.d.s. $(X, T)$, $x \in X$ and $U \subseteq X$ let
$$N(x, U) = \{n \in \mathbb{Z}_+ : T^n x \in U\}.$$
t.d.s. $(X, T)$ such that there is a recurrent point $x \in X$ and an open neighborhood $U$ with $R \cup \{0\} \subseteq N(x, U)$ (Furstenberg 1981).

The minimality property is related to syndetic subsets. A subset $S$ of $\mathbb{Z}_+$ is syndetic if it has a bound on the size of the gaps, that is, there is $N \in \mathbb{N}$ such that $\{i, i+1, \ldots, i+N\} \cap S \neq \emptyset$ for every $i \in \mathbb{Z}_+$; $S$ is thick if it contains arbitrarily long runs of positive integers, that is, for every $n \in \mathbb{N}$ there exists some $a_n \in \mathbb{Z}_+$ such that $\{a_n, a_n + 1, \ldots, a_n + n\} \subseteq S$. It is easy to show that the dual family of syndetic subsets is the family of thick subsets, and vice versa. The following fact is due to Gottschalk (n.d.) (see also Gottschalk and Hedlund (1955)).

Theorem 2.5 For any t.d.s. $(X,T)$, $x \in X$ is a minimal point if and only if $N(x, U)$ is syndetic for each neighborhood $U$ of $x$.

The fact that any t.d.s. has a minimal subset follows from Zorn’s lemma, and one can find a constructive proof in Weiss (2000).

The minimality of some systems is important in applications. For example, the minimality of $(N_d(X), \langle T \times T \times \ldots \times T,\ T \times T^2 \times \ldots \times T^d \rangle)$ has an application to a simple proof of van der Waerden’s theorem (Glasner 2003), where $\langle T \times T \times \ldots \times T,\ T \times T^2 \times \ldots \times T^d \rangle$ is the group generated by $T \times T \times \ldots \times T$ and $T \times T^2 \times \ldots \times T^d$, and $N_d(X)$ is the orbit closure of $(x, \ldots, x)$ under the two actions (see section “Structure Theorems and Multiple Ergodic Averages”). The minimality of $(x, \ldots, x)$ under the face group actions (see section “Structure Theorems and Multiple Ergodic Averages”) has an application to the proof of the structure theorem of a minimal t.d.s. (Shao and Ye 2012).

Related to recurrence we may define a recurrence set. A set $A \subseteq \mathbb{N}$ is a recurrence set if for any t.d.s. $(X, T)$ there are a point $x \in X$ and $\{n_i\} \subseteq A$ with $n_i \to \infty$ such that $T^{n_i} x \to x$. Equivalently, $A$ is a recurrence set if and only if for any minimal t.d.s. $(X, T)$ and non-empty open set $U$ there is some $n \in A$ such that $U \cap T^{-n} U \neq \emptyset$. We have the following

A deep open question related to recurrence is the following: if $A$ is a recurrence set for rotations on a torus of arbitrary dimension, is it a recurrence set? For the related research see (Weiss 2000; Ellis and Keynes 1972; Følner 1954; Katznelson 2001; Huang and Ye 2012; Huang et al. 2016; Glasner
and the map such that $x \mapsto \int f \, d\mu_x$ is in $L^1(X,\mathcal{X},\mu)$;
b) for every $f \in L^1(X,\mathcal{X},\mu)$, $\mathbb{E}_\mu(f \mid \mathcal{I}(T))(x) = \int f \, d\mu_x$ for $\mu$-a.e. $x \in X$, where $\mathcal{I}(T) = \{B \in \mathcal{B}_X : \mu(T^{-1}B \,\Delta\, B) = 0\}$ is the $\sigma$-algebra of $T$-invariant sets (see Glasner (2003)).

We note that in the topological setting we do not have such a decomposition except for distal transformations defined below. For some discussion about this topic see Glasner (1994).

Equicontinuity Versus Measurable Kronecker

Kronecker Systems
To each m.p.s. $(X,\mathcal{X},\mu,T)$ we associate a Koopman operator $U : L^2(\mu) \to L^2(\mu)$ such that $Uf(x) = f(Tx)$ for each $f \in L^2(\mu)$. A complex number $\lambda$ is called an eigenvalue of $T$ if there is $0 \neq f \in L^2(\mu)$ such that $Uf = \lambda f$. The function $f$ is called an eigenfunction of $T$ corresponding to the eigenvalue $\lambda$. It is immediate that $T$ is ergodic if and only if 1 is a simple eigenvalue of $T$.

An ergodic m.p.s. has discrete spectrum if there is an orthonormal basis for $L^2(\mu)$ which consists of eigenfunctions of $T$. Each ergodic m.p.s. admits a maximal factor with discrete spectrum, called the Kronecker factor. Its higher order generalization involving rotations on nilmanifolds will be discussed in the later sections.

If $f \in L^2(\mu)$ is an eigenfunction, then $\mathrm{cl}\{U^n f : n \in \mathbb{Z}\}$ is a compact subset of $L^2(\mu)$. Generally, we say $f$ is almost periodic if $\mathrm{cl}\{U^n f : n \in \mathbb{Z}\}$ is compact in $L^2(\mu)$. It is well known that the set of all almost periodic functions (denoted by $H_c$) is spanned by the set of eigenfunctions, and there exists a $T$-invariant sub-$\sigma$-algebra $\mathcal{K}_\mu$ of $\mathcal{B}$ such that $H_c = L^2(X, \mathcal{K}_\mu, \mu)$ (see (Hulse 1982; Zimmer 1976a)). We call $\mathcal{K}_\mu$ the Kronecker algebra of $(X,\mathcal{X},\mu,T)$, and $(X, \mathcal{K}_\mu, \mu, T)$ is the Kronecker factor of $(X,\mathcal{X},\mu,T)$ (Kronecker n.d.). Thus $T$ has discrete spectrum if $H_c = L^2(X,\mathcal{X},\mu)$ or equivalently $\mathcal{K}_\mu = \mathcal{X}$.

By the following theorem, an ergodic m.p.s. with discrete spectrum can be viewed as the simplest ergodic system.

Theorem 2.10 (Halmos-von Neumann Theorem). (Halmos and von Neumann 1942) An ergodic system has discrete spectrum if and only if it is isomorphic to a rotation of a compact monothetic group.

Equicontinuity
Let $(X, T)$ be a t.d.s. and let $C(X)$ be the vector space of all continuous $\mathbb{C}$-valued functions on $X$. Then $(C(X), \|\cdot\|_\infty)$ is a complex Banach space. The Koopman operator $\phi_T : C(X) \to C(X)$ is defined by $\phi_T(f) = f \circ T$. We say $0 \neq f \in C(X)$ is an eigenfunction for $T$ if there exists $\lambda \in \mathbb{C}$ such that $\phi_T f = \lambda f$. Call $\lambda$ the eigenvalue for $T$ corresponding to the eigenfunction $f$.

We say $(X, T)$ has topological discrete spectrum if the smallest closed linear subspace of $C(X)$ containing the eigenfunctions of $T$ is $C(X)$, that is, the eigenfunctions span $C(X)$. A transitive t.d.s. $(X, T)$ has topological discrete spectrum if and only if it is minimal and equicontinuous, that is, for each $\epsilon > 0$, there is $\delta > 0$ such that $\rho(x, y) < \delta$ implies that $\rho(T^n x, T^n y) < \epsilon$ for any $n \in \mathbb{Z}$ (Walters 1982). Note that the property of being equicontinuous does not depend on the choice of the metric on $X$. An important fact is that each minimal t.d.s. admits a maximal equicontinuous factor $(X_{eq}, T_{eq})$ (Ellis 1969; Ellis and Gottschalk 1960). A generalization of this fact to rotations on nilmanifolds will be discussed in a later section.

By the following theorem, an equicontinuous system can be viewed as the simplest t.d.s.

Theorem 2.11 (Halmos-von Neumann Theorem). (Walters 1982) Let $(X, T)$ be a t.d.s. Then $(X, T)$ is minimal and equicontinuous if and only if $(X, T)$ is topologically conjugate to a minimal rotation on a compact abelian metric group.

In the study of topological dynamics, one of the first problems was to find the smallest closed invariant equivalence relation $R(X)$ on $(X, T)$ such that $(X/R(X), T)$ is equicontinuous. A natural candidate for $R(X)$ is the so-called regionally proximal relation $RP(X)$ introduced by Ellis and Gottschalk (1960). It is a difficult problem to
find conditions under which $RP(X)$ is an equivalence relation. Starting with Veech (1968), various authors, including MacMahon (1978), Ellis and Keynes (1971), Bronstein (1979), etc., came up with various sufficient conditions for $RP(X)$ to be an equivalence relation. The generalization of the regionally proximal relation to higher orders will be discussed in section “Structure Theorems and Multiple Ergodic Averages.”

Topological Mixing Versus Measurable Mixing

Measurable Mixing
We say an m.p.s. $(X,\mathcal{X},\mu,T)$ is strongly mixing if for any $A, B \in \mathcal{X}$, $\lim_{n \to \infty} \mu(A \cap T^{-n}B) = \mu(A)\mu(B)$. The stronger notions are the K-system (which will be discussed later) and the Bernoulli system, which we will not touch; see (Glasner 2003).

We remark that a long-standing open question is Rohlin’s conjecture: if $(X,\mathcal{X},\mu,T)$ is strongly mixing, is it true that for all $k \in \mathbb{N}$ and $A_0, A_1, \ldots, A_k \in \mathcal{X}$

Topological Weak Mixing Versus Measurable Weak Mixing

Topological Weak Mixing
Transitivity and strong mixing are two basic notions of recurrence. There is a notion between them which was introduced by Furstenberg and is important for understanding the dynamical properties. We say a t.d.s. $(X, T)$ is weakly mixing if $(X \times X, T \times T)$ is transitive. Furstenberg (1967) showed that $(X, T)$ is weakly mixing if and only if $\{N(U, V) : U, V \subseteq X \text{ non-empty open subsets}\}$ is a filter if and only if $N(U, V)$ is thick for all non-empty open subsets $U, V$ of $X$.

For a minimal system we can say more. Define the upper density of $A$ by
$$\overline{d}(A) = \limsup_{N \to \infty} \frac{|A \cap \{0, 1, \ldots, N-1\}|}{N}.$$
Similarly we define the lower density $\underline{d}(A)$. If $\overline{d}(A) = \underline{d}(A) = a$, we then say that the density of $A$ is $a$.
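The density just defined can be explored numerically. The following is a minimal sketch, not part of the original entry: it evaluates the ratio $|A \cap \{0, \ldots, N-1\}|/N$ at finite $N$ for two illustrative sets. The helper names `density_ratio` and `in_blocks`, and the choice of the block set $\bigcup_k [4^k, 2 \cdot 4^k)$, are assumptions made only for this example.

```python
from fractions import Fraction

def density_ratio(indicator, N):
    """Return |A ∩ {0, ..., N-1}| / N for the set A = {n : indicator(n)}."""
    return Fraction(sum(1 for n in range(N) if indicator(n)), N)

def in_blocks(n):
    """Membership in the union of the blocks [4^k, 2*4^k), k >= 0 (hypothetical example set)."""
    if n < 1:
        return False
    k = (n.bit_length() - 1) // 2  # largest k with 4^k <= n
    return n < 2 * 4 ** k

# Multiples of 3: the ratio settles at 1/3, so this set has density 1/3.
multiples_of_3 = density_ratio(lambda n: n % 3 == 0, 3000)

# Block set: the ratio is sampled at the end of a block (N = 2*4^5)
# and at the end of the following gap (N = 4^6).
ratios_blocks = [density_ratio(in_blocks, N) for N in (2 * 4 ** 5, 4 ** 6)]

print(multiples_of_3, [float(r) for r in ratios_blocks])
```

For the block set the ratio is close to 2/3 at the end of each block and close to 1/3 at the end of each gap, which suggests that its upper and lower densities differ, so it has no density in the sense above.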
where $A$ is thick and $B$ is dynamical syndetic.

Now we turn to the notion of weak disjointness. We say two systems are weakly disjoint if the product system is transitive in the topological setting or ergodic in the measure-theoretic setting. Weak disjointness is very useful for defining new dynamical properties. We have defined mild mixing in the previous sections in such a way.

2008). We have:

Theorem 2.24 (Huang et al. 2011a) Let $(X, T)$ be a t.d.s. and $\mu$ be an ergodic measure on $(X, T)$. Then the ergodic m.p.s. $(X, \mathcal{B}_X, \mu, T)$ is either sensitive for $\mu$ or has discrete spectrum.

Sensitivity is only one aspect of chaotic behavior. Nowadays, there are many definitions of chaos, for example, Li-Yorke’s chaos (Li and Yorke 1975), Devaney’s chaos (Devaney 1989),
The pioneering work on topological models was done by Jewett (1969/1970) and Krieger (1970/1971). That is

Theorem 2.27 (Jewett-Krieger). Let $(X,\mathcal{X},\mu,T)$ be an ergodic m.p.s. Then there is a minimal uniquely ergodic t.d.s. $(Y, S)$ with invariant measure $\nu$ such that $(X,\mathcal{X},\mu,T)$ is isomorphic to $(Y, \mathcal{B}_Y, \nu, S)$.

We note that one can add some additional properties to the topological model. For example, Lehrer (1987) showed that the strictly ergodic model can be required to be a topologically (strongly) mixing system in addition. An ergodic system has a doubly minimal model (i.e., the orbit of any point $(x, y) \in X^2$ is dense provided $x$ and $y$ are not in the same orbit) if and only if it has zero entropy (Weiss 1995); and an ergodic system has a strictly ergodic, UPE (uniform positive entropy) model if and only if it has positive entropy (Glasner and Weiss 1994). Note that not every dynamical property can be added to the uniquely ergodic models. For example, Lindenstrauss showed that every ergodic measurable distal system $(X,\mathcal{X},\mu,T)$ has a minimal topologically distal model (Lindenstrauss 1999). This topological model need not, in general, be uniquely ergodic. Weiss (1985) generalized the theorem of Jewett-Krieger to the relative case, which will be discussed below. For more information we refer to Glasner and Weiss (2005).

Entropy Theory

Entropy is one of the measurements of the complexity of dynamical systems. It is an important conjugacy or isomorphism invariant. The concept of entropy was invented by Clausius in 1854, and Shannon introduced it into information theory in 1948. In 1958 Kolmogorov carried it over to ergodic theory. One can find a brief history of entropy in the recent book (Downarowicz 2011). There are lots of excellent books introducing the classical theory of entropy such as (Downarowicz 2011; Glasner 2003; Rudolph 1990; Walters 1982). In this entry we mainly focus on the so-called local entropy theory, starting from the pioneering work of Blanchard in the early 1990s. In such a theory one studies the local properties of tuples in the product space and it can be used to investigate the global properties of the systems. For example, by using the properties of entropy pairs, one obtains the existence of the maximal zero entropy factor of any t.d.s. (topological Pinsker factor). For related surveys, see (Glasner and Ye 2009; Oprocha and Zhang 2014).

Entropy: Topological Versus Measurable

In 1958 Kolmogorov first introduced the entropy of an m.p.s. via generating partitions (Kolmogorov 1958, 1959). Sinai subsequently found a natural way to make this notion better-behaved by observing that generators maximize entropy relative to a partition among all partitions with finite entropy, and formulated the definition that has become the standard one (Sinai 1959). Later the topological entropy was introduced by Adler et al., who formulated it in terms of open covers (Adler et al. 1965). An equivalent definition based on separated and spanning sets was independently given by Bowen (1971) and Dinaburg (1971).

Now we define entropy. Let $(X,T)$ be a t.d.s. A cover of $X$ is a family of subsets of $X$ whose union is $X$. A partition of $X$ is a cover of $X$ whose elements are pairwise disjoint. Given two covers $\mathcal{U}, \mathcal{V}$ of $X$, $\mathcal{U}$ is said to be finer than $\mathcal{V}$ (denoted by $\mathcal{U} \succeq \mathcal{V}$ or $\mathcal{V} \preceq \mathcal{U}$) if each element of $\mathcal{U}$ is contained in some element of $\mathcal{V}$; set $\mathcal{U} \vee \mathcal{V} = \{U \cap V : U \in \mathcal{U}, V \in \mathcal{V}\}$ and $T^{-i}\mathcal{U} = \{T^{-i}U : U \in \mathcal{U}\}$ for $i \in \mathbb{Z}_+$. Denote by $N(\mathcal{U})$ the minimal cardinality among all cardinalities of subcovers of $\mathcal{U}$.

Definition 5 Let $(X,T)$ be a t.d.s. and $\mathcal{U}$ be a finite open cover of $X$. The topological entropy of $\mathcal{U}$ is defined by
$$h_{top}(T, \mathcal{U}) = \lim_{n \to +\infty} \frac{1}{n} \log N\Big(\bigvee_{i=0}^{n-1} T^{-i}\mathcal{U}\Big).$$
The topological entropy of $(X,T)$ is $h_{top}(T) = \sup_{\mathcal{U}} h_{top}(T, \mathcal{U})$, where the supremum is taken over all finite open covers of $X$.

Note that $\big\{\log N\big(\bigvee_{i=0}^{n-1} T^{-i}\mathcal{U}\big)\big\}_{n=1}^{\infty}$ is a sub-additive sequence and hence $h_{top}(T, \mathcal{U})$ is well defined.
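The limit in the definition of $h_{top}(T, \mathcal{U})$ can be made concrete on a subshift of finite type. The following is a minimal sketch under stated assumptions that are not taken from the entry: the system is the golden mean shift (two-sided sequences of 0 and 1 with no two consecutive 1s), and $\mathcal{U}$ is the cover by the two time-zero cylinders, which are clopen and disjoint. For this cover, $N\big(\bigvee_{i=0}^{n-1} T^{-i}\mathcal{U}\big)$ is the number of admissible words of length $n$. The function name `count_words` is hypothetical.

```python
import math

def count_words(adjacency, n):
    """Number of admissible words of length n for a 0/1 transition matrix."""
    size = len(adjacency)
    counts = [1] * size  # counts[j] = number of words of the current length ending in symbol j
    for _ in range(n - 1):
        counts = [sum(counts[i] for i in range(size) if adjacency[i][j])
                  for j in range(size)]
    return sum(counts)

golden_mean = [[1, 1], [1, 0]]  # the transition 1 -> 1 is forbidden

word_counts = [count_words(golden_mean, n) for n in range(1, 11)]
a = [math.log(c) for c in word_counts]  # a[n-1] = log N(join of the first n preimages)

# Sub-additivity a_{m+n} <= a_m + a_n, checked for small m and n.
subadditive = all(a[m + n - 1] <= a[m - 1] + a[n - 1] + 1e-12
                  for m in range(1, 6) for n in range(1, 6))

estimate = math.log(count_words(golden_mean, 200)) / 200
limit = math.log((1 + math.sqrt(5)) / 2)  # log of the spectral radius of the matrix

print(word_counts, subadditive, estimate, limit)
```

The word counts are Fibonacci numbers, and the finite-$n$ ratios decrease toward the logarithm of the golden ratio, which is the value one expects for $h_{top}(T, \mathcal{U})$ for this particular cover.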
Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. and $\mathcal{P}_X$ be the set of finite measurable partitions of $X$. Suppose $\xi \in \mathcal{P}_X$ and $\mathcal{A} \subseteq \mathcal{X}$ is a $\sigma$-algebra. The entropy of $\xi$ (with respect to $\mathcal{A}$), written $H_\mu(\xi)$ (resp. $H_\mu(\xi \mid \mathcal{A})$), is defined by the formula
$$H_\mu(\xi) = -\sum_{A \in \xi} \mu(A) \log \mu(A) \quad \Big(\text{resp. } H_\mu(\xi \mid \mathcal{A}) = -\sum_{A \in \xi} \int_X \mathbb{E}(1_A \mid \mathcal{A}) \log \mathbb{E}(1_A \mid \mathcal{A}) \, d\mu \Big).$$

Definition 6 Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. and $\xi \in \mathcal{P}_X$. The entropy of $\xi$ is defined by
$$h_\mu(T, \xi) = \limsup_{n \to +\infty} \frac{1}{n} H_\mu\Big(\bigvee_{i=0}^{n-1} T^{-i}\xi\Big).$$
And the entropy of $(X,\mathcal{X},\mu,T)$ is
$$h_\mu(T) = \sup_{\xi \in \mathcal{P}_X} h_\mu(T, \xi).$$

It is well known that $h_{top}(T)$ and $h_\mu(T)$ are conjugacy and isomorphism invariants respectively, and they have been the most successful invariants so far for t.d.s. and m.p.s. In 1958 Kolmogorov asked if entropy is a complete isomorphism invariant on the collection of Bernoulli shifts, see (Glasner 2003). This was answered affirmatively by Ornstein in 1970 (Ornstein 1970).

The basic relationship between topological entropy and measure-theoretic entropy is given by the variational principle (Goodwyn 1969; Goodman 1971; Misiurewicz 1976).

Theorem 3.1 (The variational principle). Let $(X,T)$ be a t.d.s. Then
$$h_{top}(T) = \sup\{h_\mu(T) : \mu \in M(X, T)\} = \sup\{h_\mu(T) : \mu \in M^e(X, T)\}.$$

Measurable and Topological K-Systems
In this subsection we discuss an important class of m.p.s. in ergodic theory, the K-systems, and their topological analogue. Introduced by Kolmogorov (1958), it is an isomorphism invariant version of earlier regularity notions for random processes: the present becomes asymptotically independent of all sufficiently long past.

Definition 7 An m.p.s. $(X,\mathcal{X},\mu,T)$ is called a Kolmogorov system, or a K-system, if there is a sub-$\sigma$-algebra $\mathcal{K} \subseteq \mathcal{X}$ with the following properties:

A K-system completely differs from zero entropy systems, see Theorem 2.20. It can be viewed as the most complicated system in the language of entropy. A fundamental result is that an m.p.s. is a K-system if and only if it has completely positive entropy (each nontrivial factor has positive entropy) if and only if every measurable partition by two nontrivial elements has positive entropy if and only if every measurable partition by finitely many nontrivial elements has positive entropy (Pinsker 1960; Rohlin 1967).

Now we consider t.d.s. Using the first two conditions above, Blanchard (1992) introduced the notions of c.p.e. and u.p.e. in t.d.s. as analogues of the K-system in ergodic theory.

Definition 8 A t.d.s. $(X,T)$ has completely positive entropy (c.p.e. for short) if each nontrivial factor has positive entropy, and uniform positive entropy (u.p.e. for short) if every cover by two nontrivial open sets has positive entropy.

Blanchard (1992) showed that u.p.e. implies weak mixing and c.p.e. implies the existence of an invariant measure with full support. The relative notions of c.p.e. and u.p.e. are treated in Glasner and Weiss (1995b) and Huang et al. (2007).

Definition 9 (Huang and Ye 2006) Let $(X,T)$ be a t.d.s. $(X,T)$ has uniform positive entropy of order $n$ (u.p.e. of order $n$, for short), if any cover of $X$ by $n$ non-dense open sets has positive topological entropy. We say $(X,T)$ has u.p.e. of all orders or it is topological K if it has u.p.e. of order $n$ for every $n \ge 2$.
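The partition entropy $H_\mu(\xi)$ and the quantity $\frac{1}{n} H_\mu\big(\bigvee_{i=0}^{n-1} T^{-i}\xi\big)$ from this section can be computed directly in a simple case. The following is a minimal sketch under assumptions not taken from the entry: the system is the full shift on two symbols with a Bernoulli $(p, 1-p)$ product measure, and $\xi$ is the time-zero partition, so the atoms of the join are the cylinders of length $n$. The function names are hypothetical.

```python
import math
from itertools import product

def partition_entropy(masses):
    """H(xi) = -sum of mu(A) log mu(A) over atoms A with positive mass."""
    return -sum(m * math.log(m) for m in masses if m > 0)

def cylinder_masses(p, n):
    """Masses of the length-n cylinders under the Bernoulli(p, 1-p) product measure."""
    return [math.prod(p if s == 0 else 1 - p for s in word)
            for word in product((0, 1), repeat=n)]

def entropy_rate(p, n):
    """(1/n) * H of the join of the first n preimages of the time-zero partition."""
    return partition_entropy(cylinder_masses(p, n)) / n

h_fair = entropy_rate(0.5, 8)    # uniform Bernoulli measure
h_biased = entropy_rate(0.2, 8)  # biased Bernoulli measure

print(h_fair, math.log(2), h_biased)
```

For a product measure the ratio does not depend on $n$ and equals $H_\mu(\xi)$. Assuming the standard fact that the time-zero partition is generating, the uniform measure gives $\log 2$, which is the topological entropy of the full shift on two symbols, while the biased measure gives a strictly smaller value. This is consistent with the variational principle of Theorem 3.1, though it only samples two measures and is not a proof.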
Clearly, u.p.e. of order 2 is just u.p.e. It is shown that a u.p.e. system is mildly mixing (Huang and Ye 2006); and any minimal topological K system is strongly mixing (Huang et al. 2005).

Huang and Ye (2006) answered several open questions concerning the nature of u.p.e. and c.p.e. Namely, they showed that u.p.e. of order $n$ does not imply u.p.e. of order $n + 1$ for each $n \ge 2$ (answering a question by Host (Glasner and Weiss 1995b)); there is a transitive diagonal system which does not have u.p.e. (of order 2) (Blanchard 1993, Question 1); there is a u.p.e. (of order 2) system having no ergodic measure with full support (Blanchard 1992, Question 2).

Glasner and Weiss (1994) showed that if a t.d.s. admits a K-measure with full support, then it has u.p.e.; and there is a minimal u.p.e. system that is universal for any ergodic m.p.s. with positive entropy. The above results were extended to u.p.e. of all orders in Huang and Ye (2006). In particular, the authors proved that if a t.d.s. admits an invariant K-measure with full support, then it has u.p.e. of all orders, and a t.d.s. $(X,T)$ has u.p.e. of all orders if and only if there is an invariant measure $\mu \in M(X, T)$ such that for each partition $\alpha$ of $X$ by finitely many non-dense Borel sets one has $h_\mu(T, \alpha) > 0$.

Measurable and Topological Entropy Tuples
Now we consider entropy tuples, which began with Blanchard’s work on entropy pairs (Blanchard 1993).

Definition 10 (Glasner and Weiss 1994; Huang and Ye 2006). Let $(X,T)$ be a t.d.s. An $n$-tuple $(x_i)_1^n \in X^n$, $n \ge 2$, is called an entropy $n$-tuple if $x_i \neq x_j$ for some $1 \le i \neq j \le n$, and for any admissible open cover $\mathcal{U}$ with respect to $(x_i)_1^n$, $h_{top}(T, \mathcal{U}) > 0$. Entropy 2-tuples are called entropy pairs.

We denote by $E_n(X,T)$ the set of entropy $n$-tuples. Then, following the ideas of Blanchard (1993), we have

1. If $\mathcal{U} = \{U_1, \ldots, U_n\}$ is an open cover of $X$ with $h_{top}(T, \mathcal{U}) > 0$, then for all $1 \le i \le n$ there exists $x_i \in U_i^c$ such that $(x_i)_1^n$ is an entropy $n$-tuple.
2. $E_n(X, T) \cup \Delta_n(X)$ is a closed $T^{(n)}$-invariant subset of $X^n$.
3. Let $\pi : (X, T) \to (Y, S)$ be a factor map. Then $\pi^{(n)}(E_n(X, T) \cup \Delta_n(X)) = E_n(Y, S) \cup \Delta_n(Y)$.

It follows from (1) that $(X,T)$ has positive entropy if and only if $E_2(X,T) \neq \emptyset$; $(X,T)$ is u.p.e. of order $n$ if and only if every point $(x_i)_1^n \in X^n$ not on the diagonal $\Delta_n(X)$ is an entropy $n$-tuple.

Definition 11 Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. The Pinsker $\sigma$-algebra is $P_\mu = \{A \in \mathcal{X} : h_\mu(T, \{A, X \setminus A\}) = 0\}$. The corresponding factor is called the Pinsker factor of $(X,\mathcal{X},\mu,T)$.

The Pinsker factor of $(X,\mathcal{X},\mu,T)$ is the largest zero entropy factor. By Rohlin-Sinai’s theorem, an m.p.s. $(X,\mathcal{X},\mu,T)$ is a K-system if and only if the Pinsker factor is trivial, that is, $P_\mu = \{X, \emptyset\}$.
In Blanchard et al. (1995) the authors introduced the notion of entropy pairs for a measure, and their notion cannot be directly generalized to n-tuples when $n > 2$. Here we will give a definition of entropy n-tuples for a measure, which was introduced by Huang and Ye in Huang and Ye (2006) and is the same as the notion of entropy pairs for a measure when $n = 2$.

Definition 13 Let $(X,T)$ be a t.d.s. and $\mu \in M(X,T)$. An n-tuple $(x_i)_1^n \in X^{(n)}$, $n \ge 2$, is called an entropy n-tuple for $\mu$ if for some $1 \le i \neq j \le n$, $x_i \neq x_j$, and for any admissible Borel partition $\alpha$ with respect to $(x_i)_1^n$, $h_\mu(T,\alpha) > 0$.

Let $\mu \in M(X,T)$ and $\mathcal{P}_\mu$ be the Pinsker σ-algebra of $(X,\mathcal{B}_X,\mu,T)$. Define the conditional independent joining $\lambda_n(\mu)$ on $(X^{(n)}, \mathcal{B}_X^{n}, T^{(n)})$ by

$$\lambda_n(\mu)\Big(\prod_{i=1}^{n} A_i\Big) = \int_X \prod_{i=1}^{n} \mathbb{E}(1_{A_i} \mid \mathcal{P}_\mu)\, d\mu,$$

where $A_i \in \mathcal{B}_X$, $i = 1, \dots, n$. The following result shows that the set of entropy n-tuples for an invariant measure is in the support of $\lambda_n(\mu)$; we remark that the case $n = 2$ was proved in Glasner (1997) and the general case was proved in Huang and Ye (2006).

Theorem 3.2 Let $(X,T)$ be a t.d.s., $\mu \in M(X,T)$ and $n \ge 2$. Then

$$E_n^{\mu}(X,T) = \operatorname{supp}(\lambda_n(\mu)) \setminus \Delta_n(X).$$

By Theorem 3.2 it is clear that $h_\mu(T) = 0$ if and only if $E_2^{\mu}(X,T) = \emptyset$; $E_n^{\mu}(X,T) \cup \Delta_n(X)$ is a

Theorem 3.3 (Blanchard et al. 1997) For a given t.d.s. $(X,T)$ and a finite open cover $\mathcal{U}$ of $X$ there is an invariant measure $\mu \in M(X,T)$ with $\inf_\alpha h_\mu(T,\alpha) \ge h_{top}(T,\mathcal{U})$, where the infimum is taken over all finite Borel partitions of $X$ which are finer than $\mathcal{U}$.

In Huang and Ye (2006) Huang and Ye showed that, if $\mu \in M(X,T)$ and $h_\mu(T,\alpha) > 0$ for each finite Borel partition $\alpha$ of $X$ which is finer than $\mathcal{U}$, then $\inf_\alpha h_\mu(T,\alpha) > 0$ and $h_{top}(T,\mathcal{U}) > 0$, providing some kind of converse statement of Theorem 3.3.

To study the question whether $\inf_\alpha h_\mu(T,\alpha) = h_{top}(T,\mathcal{U})$ for a given finite open cover $\mathcal{U}$, Romagnoli (2003) introduced the following entropies for covers:

$$h_\mu^{+}(T,\mathcal{U}) = \inf_{\alpha \succeq \mathcal{U}} h_\mu(T,\alpha) \quad\text{and}\quad h_\mu^{-}(T,\mathcal{U}) = \lim_{n\to+\infty} \frac{1}{n} \inf_{\alpha \succeq \bigvee_{i=0}^{n-1} T^{-i}\mathcal{U}} H_\mu(\alpha),$$

where $\alpha$ is a finite Borel partition of $X$. In fact they are the same: $h_\mu^{+}(T,\mathcal{U}) = h_\mu^{-}(T,\mathcal{U})$ (Huang et al. 2006).

Theorem 3.4 (The local variational principle). (Romagnoli 2003; Glasner and Weiss 2005) Let $(X,T)$ be a t.d.s. and $\mathcal{U}$ be a finite open cover of $X$. Then

$$\max_{\mu \in M(X,T)} h_\mu^{+}(T,\mathcal{U}) = \max_{\mu \in M(X,T)} h_\mu^{-}(T,\mathcal{U}) = h_{top}(T,\mathcal{U}).$$

Note that the classical variational principle follows from the local ones by noting that $h_\mu(T) = \sup_{\mathcal{U}} h_\mu(T,\mathcal{U})$, where supremum is taken over all
Theorem 3.5 (Huang and Ye 2006) Let $(X,T)$ be a t.d.s. If $\mu \in M(X,T)$, then $E_n(X,T) \supset E_n^{\mu}(X,T)$ for each $n \ge 2$, and there exists $\mu \in M(X,T)$ such that $E_n(X,T) = E_n^{\mu}(X,T)$ for each $n \ge 2$.

We remark that Blanchard et al. (1997) constructed a t.d.s. and an entropy pair for that system which is not a metric entropy pair for any ergodic measure. Also, the property that the product of u.p.e. of order n (resp. of all orders) systems is again u.p.e. of order n (resp. of all orders) was proved by Glasner (1997) for $n = 2$ and by Huang and Ye (2006) for the general case.

The Ergodic Decomposition

Let $(X,T)$ be a t.d.s. Let $\mu \in M(X,T)$ and $\mu = \int_X \mu_x\, d\mu(x)$ be its ergodic decomposition. The ergodic decomposition of $\mu$ also gives an ergodic decomposition of the $\mu$-entropy of $\alpha \in \mathcal{P}_X$: $h_\mu(T,\alpha) = \int_X h_{\mu_x}(T,\alpha)\, d\mu(x)$ (Denker et al. 1976). This property also holds for $h_\mu(T,\mathcal{U})$ for any finite Borel cover $\mathcal{U}$ of $X$.

Theorem 3.6 (Huang and Ye 2006) Let $(X,T)$ be a t.d.s., $\mu \in M(X,T)$ and $\mathcal{U}$ be a finite Borel cover of $X$. If $\mu = \int_X \mu_x\, d\mu(x)$ is the ergodic decomposition of $\mu$ then

$$h_\mu(T,\mathcal{U}) = \int_X h_{\mu_x}(T,\mathcal{U})\, d\mu(x).$$

Given a finite open cover $\mathcal{U}$ of a t.d.s. $(X,T)$, we know that there exists $\nu \in M^e(X,T)$ with $h_\nu(T,\mathcal{U}) = h_{top}(T,\mathcal{U})$ by Theorem 3.6 and the local variational principle. Moreover, it was shown that the entropy map $\mu \in M(X,T) \mapsto h_\mu(T,\mathcal{U})$ is upper semicontinuous (Huang et al. 2011b).

The following result discloses the relation between entropy tuples for an invariant measure and entropy tuples for ergodic measures in its ergodic decomposition. It was proven for $n = 2$ in Blanchard et al. (1997) and in general in Huang and Ye (2006).

Theorem 3.7 Let $(X,T)$ be a t.d.s. and $\mu \in M(X,T)$ with $\mu = \int_X \mu_x\, d\mu(x)$ the ergodic decomposition of $\mu$. Then

1. for $\mu$-a.e. $x \in X$, $E_n^{\mu_x}(X,T) \subset E_n^{\mu}(X,T)$ for each $n \ge 2$.
2. if $(x_i)_1^n \in E_n^{\mu}(X,T)$, then for every neighborhood $V$ of $(x_i)_1^n$,

$$\mu\big(\{x \in X : V \cap E_n^{\mu_x}(X,T) \neq \emptyset\}\big) > 0.$$

Thus we can choose $X_0 \in \mathcal{B}_X$ such that $\mu(X_0) = 1$ and

$$\overline{\bigcup\{E_n^{\mu_x}(X,T) : x \in X_0\}} \setminus \Delta_n(X) = E_n^{\mu}(X,T).$$

Weak Horseshoe

In Huang and Ye (2006) Huang and Ye obtained the following equivalent characterization of topological entropy n-tuples. Let $(X,T)$ be a t.d.s. and $n \ge 2$. Then $(x_1, \dots, x_n) \in E_n(X,T)$ if and only if for any neighborhood $U_1 \times \dots \times U_n$ of $(x_1, \dots, x_n)$, there exists a positive density subset $S = \{s_1 < s_2 < \cdots\}$ of $\mathbb{Z}_+$ such that $\bigcap_{i=1}^{\infty} T^{-s_i} U_{t(i)} \neq \emptyset$ for any $t \in \{1, 2, \dots, n\}^S$ (see also (Kerr and Li 2007, 2009)).

Motivated by this, for a subset $J$ of $\mathbb{Z}_+$ we say that $(X,T)$ has a weak horseshoe with an interpolating set $J$ if there exist two disjoint closed subsets $U_0, U_1$ of $X$ such that for any $t \in \{0,1\}^J$, $\bigcap_{j \in J} T^{-j} U_{t(j)} \neq \emptyset$, that is, there exists $x_t \in X$ such that $T^j(x_t) \in U_{t(j)}$ for any $j \in J$ (Huang and Lu 2017). If the subset $J$ has positive density, then we say that $(X,T)$ has a weak horseshoe.

Theorem 3.8 (Huang and Ye 2006) A t.d.s. has positive topological entropy if and only if it has a weak horseshoe.

This result has many applications. Note that the result was proved by Glasner and Weiss in Glasner and Weiss (1995a) for symbolic dynamics, and was extended to countable amenable group actions by Kerr and Li in Kerr and Li (2007). Recently, in Huang and Lu (2017) Huang and Lu studied the complicated dynamics of infinite dimensional random dynamical systems and showed that in this setting positive topological entropy also implies the existence of weak horseshoes.
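As a minimal illustration of the weak horseshoe definition (an example chosen here, not taken from the cited papers), consider the golden mean shift: the sequences in $\{0,1\}^{\mathbb{Z}_+}$ with no two consecutive 1's, with $T$ the left shift. Take $U_0$ and $U_1$ to be the cylinders of points whose zeroth coordinate is 0, respectively 1; these are disjoint closed sets. For $J$ the set of even non-negative integers and any $t \in \{0,1\}^J$, the point that equals $t$ on $J$ and 0 elsewhere is admissible, so $J$ is an interpolating set of density 1/2. The Python sketch below checks this on finite windows only, and all function names are hypothetical.

```python
from itertools import product


def is_admissible(word):
    """Golden mean shift: a 0/1 word is admissible iff it never contains '11'."""
    return all(not (a == 1 and b == 1) for a, b in zip(word, word[1:]))


def interpolate(pattern, length):
    """Build a finite word x with x[2i] = pattern[i] and 0 at every other position."""
    word = [0] * length
    for i, symbol in enumerate(pattern):
        word[2 * i] = symbol
    return word


def check_even_positions(n):
    """Check that every t in {0,1}^J, J = {0, 2, ..., 2(n-1)}, is realised.

    T^j x lies in the cylinder U_{t(j)} exactly when x[j] == t(j), so it is
    enough to exhibit an admissible word with the prescribed symbols on J.
    """
    positions = [2 * i for i in range(n)]
    for t in product((0, 1), repeat=n):
        x = interpolate(t, 2 * n)
        if not is_admissible(x):
            return False
        if any(x[j] != t[i] for i, j in enumerate(positions)):
            return False
    return True
```

This is consistent with Theorem 3.8, since the golden mean shift is known to have positive topological entropy. The finite check is only a sanity test; the infinite statement follows from the same construction applied to an arbitrary $t \in \{0,1\}^J$.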
Sequence Entropy: Topological Versus Measurable

The sequence entropy of an m.p.s. for a sequence of $\mathbb{Z}_+$ was introduced by Kushnirenko in 1967 (Kushnirenko 1967). Later, the topological sequence entropy was investigated by Goodman in 1974 (Goodman 1974).

Now we define sequence entropy. Denote by $\mathcal{F}_{inf}$ the set of all increasing sequences of $\mathbb{Z}_+$. Let $S = \{0 \le t_1 < t_2 < \cdots\} \in \mathcal{F}_{inf}$ and $\mathcal{U}$ be a finite open cover of $X$. The topological sequence entropy of $\mathcal{U}$ with respect to $(X,T)$ along $S$ is defined by

$$h_{top}^{S}(T,\mathcal{U}) = \limsup_{n\to+\infty} \frac{1}{n} \log N\Big(\bigvee_{i=1}^{n} T^{-t_i}\mathcal{U}\Big).$$

The topological sequence entropy of $(X,T)$ along the sequence $S$ is $h_{top}^{S}(T) = \sup_{\mathcal{U}} h_{top}^{S}(T,\mathcal{U})$, where supremum is taken over all finite open covers of $X$. If $S = \mathbb{Z}_+$, we recover standard topological entropy. In this case we omit the superscript $\mathbb{Z}_+$.

For an m.p.s. $(X,\mathcal{X},\mu,T)$, a sequence $S \subset \mathbb{Z}_+$ and a partition $\xi \in \mathcal{P}_X$, the sequence entropy $h_\mu^{S}(T,\xi)$ and $h_\mu^{S}(T)$ can be defined similarly. $h_{top}^{S}(T)$ and $h_\mu^{S}(T)$ are conjugacy and isomorphism invariants respectively.

Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. If $h_\mu(T) > 0$, then $h_\mu^{S}(T) = K(S)\, h_\mu(T)$, where $K(S)$ is a number that does not depend on $T$ (Krug and Newton 1972). This result implies that sequence entropy is uninteresting as a new invariant in case $T$ has positive entropy. However, little is known in case $h_\mu(T) = 0$.

For a t.d.s. $(X,T)$, Goodman (1974) showed (with a restriction that can be removed (Eberlein 1975), see also (Huang and Ye 2009)) that for any $S \in \mathcal{F}_{inf}$

$$h_{top}^{S}(T) \ge \sup_{\mu \in M(X,T)} h_\mu^{S}(T),$$

with equality in case $h_{top}(T) > 0$. If $h_{top}(T) = 0$, then the variational principle for topological sequence entropy need not hold. Goodman gives an example with $h_{top}^{S}(T) = \log 2$ but $\sup_{\mu \in M(X,T)} h_\mu^{S}(T) = 0$ (Goodman 1974).

Note that for an m.p.s. $(X,\mathcal{X},\mu,T)$, if $h_\mu(T) > 0$, then for all $S \in \mathcal{F}_{inf}$, $h_\mu^{S}(T) > 0$ (Saleski 1977). Similarly for a t.d.s. $(X,T)$, if $h_{top}(T) > 0$, then for all $S \in \mathcal{F}_{inf}$, $h_{top}^{S}(T) > 0$ (Huang et al. 2005). If $X$ is a countable compact metric space, it is known that $h_{top}(T) = 0$ for any t.d.s. $(X,T)$, and this is not the case for the topological sequence entropy (Ye and Zhang 2008).

Similar to the entropy tuples, one can define sequence entropy n-tuples and sequence entropy n-tuples for $\mu$. They share many properties with entropy tuples. But since there is no variational principle for sequence entropy, we do not have local variational principles for sequence entropy tuples. We will not discuss sequence entropy tuples in detail, and we refer interested readers to Huang et al. (2003, 2004), Huang and Ye (2009), Kerr and Li (2007), and Maass and Shao (2007), etc.

Characterizing different mixing properties using sequence entropy along suitable sequences has proved effective and fruitful. Here we just give an example to make the point. For more about this topic, we refer to Coronel et al. (2009), Hulse (1982, 1986), Huang et al. (2005), Saleski (1977), and Zhang (1992, 1993).

Theorem 3.9 (Huang et al. 2005) Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. Then the following statements are equivalent:

1. $(X,\mathcal{X},\mu,T)$ is mildly mixing;
2. For any $\alpha \in \mathcal{P}_X$ by two nontrivial elements and IP-set $F$, there exists an infinite sequence $A \subset F$ such that $h_\mu^{A}(T,\alpha) > 0$;
3. For any $\alpha \in \mathcal{P}_X$ by finitely many nontrivial elements and IP-set $F$, there exists an infinite sequence $A \subset F$ such that $h_\mu^{A}(T,\alpha) > 0$.

In the above theorem a measurable set $C \subset X$ is nontrivial if $0 < \mu(C) < 1$. We have a similar result for weakly mixing systems. In the topological case we have:
Theorem 3.10 (Huang et al. 2005) Let $(X,T)$ be a t.d.s. Then the following statements are equivalent:

1. $(X,T)$ is topologically mildly mixing.
2. For any cover $\mathcal{U}$ of $X$ by two non-dense open sets and IP-set $F$, there exists an infinite sequence $A \subset F$ such that $h_{top}^{A}(T,\mathcal{U}) > 0$.
3. For any cover $\mathcal{U}$ of $X$ by finitely many non-dense open sets and IP-set $F$, there exists an infinite sequence $A \subset F$ such that $h_{top}^{A}(T,\mathcal{U}) > 0$.

We also have a similar result for topologically weakly mixing systems.

Null Systems: Measurable Versus Topological

An m.p.s. $(X,\mathcal{X},\mu,T)$ is called null if $h_\mu^{S}(T) = 0$ for any $S \in \mathcal{F}_{inf}$. The following theorem of Kushnirenko characterizes discrete spectrum via sequence entropy.

Theorem 3.11 (Kushnirenko 1967) An m.p.s. $(X,\mathcal{X},\mu,T)$ has discrete spectrum if and only if it is null.

In fact, for an m.p.s. $(X,\mathcal{X},\mu,T)$ and $\alpha \in \mathcal{P}_X$

$$\max_{S \in \mathcal{F}_{inf}} h_\mu^{S}(T,\alpha) = H_\mu(\alpha \mid \mathcal{K}_\mu),$$

where $\mathcal{K}_\mu$ is the Kronecker algebra (Huang et al. 2004). As shown in Huang et al. (2004, 2005), for $B \in \mathcal{B}$ and $R \in \mathcal{F}_{inf}$, $\operatorname{cl}(\{U^n 1_B : n \in R\})$ is a compact set of $L^2(\mu)$ if and only if $h_\mu^{S}(T,\{B, X\setminus B\}) = 0$ for each infinite sequence $S \subset R$. In particular, $B \in \mathcal{K}_\mu$ if and only if $h_\mu^{S}(T,\{B, X\setminus B\}) = 0$ for any $S \in \mathcal{F}_{inf}$.

A t.d.s. $(X,T)$ is null if $h_{top}^{S}(T) = 0$ for any $S \in \mathcal{F}_{inf}$. Note that it is easy to show that an equicontinuous system is null. It is natural to conjecture that if a minimal system is null then it is also equicontinuous. Unfortunately this is not the case, see for example (Goodman 1974). But for a minimal system Kushnirenko's statement remains true modulo an almost one-to-one extension. Note that $\pi : (X,T) \to (Y,S)$ is almost one-to-one if $\{x \in X : |\pi^{-1}(\pi(x))| = 1\}$ is a dense $G_\delta$ subset.

Theorem 3.12 (Huang et al. 2003) If a minimal t.d.s. $(X,T)$ is null, then it is an almost one-to-one extension of its maximal equicontinuous factor $(X_{eq},T_{eq})$. Moreover, it is uniquely ergodic and has a discrete spectrum with respect to the unique measure.

Recently, this result was improved by Fuhrmann et al. (2018). They showed that if a minimal t.d.s. $(X,T)$ is null, then it is a regular extension of $(X_{eq},T_{eq})$, that is, if $\pi : (X,T) \to (X_{eq},T_{eq})$ is the factor map and $\mu \in M(X,T)$ is the unique measure on $X$, then $\mu(\{x \in X : |\pi^{-1}(\pi(x))| = 1\}) = 1$. Note that for a minimal system, whether $\pi : X \to X_{eq}$ is regular can be interpreted by the so-called diam mean equicontinuity (García-Ramos et al. 2019).

Note that nullness in non-minimal systems was studied in Qiu and Zhao (n.d.). Moreover, the so-called tame systems have been extensively studied, see (Fuhrmann et al. 2018; Glasner 2018; Huang 2006; Kerr and Li 2007). Note that a minimal null system is tame (Kerr and Li 2007).

It is an open problem (see (Huang et al. 2003)): Does a transitive non-minimal null system exist?

Maximal Pattern Entropy

The notion of maximal pattern entropy was introduced in Huang and Ye (2009). For a t.d.s. $(X,T)$, $n \in \mathbb{N}$ and a finite open cover $\mathcal{U}$ let

$$p^{*}_{X,\mathcal{U}}(n) = \max_{(t_1 < t_2 < \cdots < t_n) \in \mathbb{Z}_+^n} N\Big(\bigvee_{i=1}^{n} T^{-t_i}\mathcal{U}\Big).$$

The maximal pattern entropy of $T$ with respect to $\mathcal{U}$ is defined by

$$h^{*}_{top}(T,\mathcal{U}) = \lim_{n\to+\infty} \frac{1}{n} \log p^{*}_{X,\mathcal{U}}(n).$$

The maximal pattern entropy of $(X,T)$ is $h^{*}_{top}(T) = \sup_{\mathcal{U}} h^{*}_{top}(T,\mathcal{U})$, where supremum is taken over all finite open covers of $X$.

Analogously, given an m.p.s. $(X,\mathcal{X},\mu,T)$ we can define the maximal pattern entropy $h^{*}_{\mu}(T)$. We remark that one of the equivalent definitions is that the maximal pattern entropy is the supremum of sequence entropies, that is,
Theorem 3.13 (Huang and Ye 2009) For a t.d.s. $(X,T)$ we have that $h^{*}_{top}(T) = \sup_{S \in \mathcal{F}_{inf}} h_{top}^{S}(T)$, and for an m.p.s. $(X,\mathcal{X},\mu,T)$ we have $h^{*}_{\mu}(T) = \sup_{S \in \mathcal{F}_{inf}} h_\mu^{S}(T)$.

Thus a t.d.s. $(X,T)$ is null if and only if $h^{*}_{top}(T) = 0$; an m.p.s. $(X,\mathcal{X},\mu,T)$ has discrete spectrum if and only if $h^{*}_{\mu}(T) = 0$. The maximal pattern entropy has some interesting properties. For example, it is a conjugacy (respectively isomorphism) invariant. Moreover, for all $k \in \mathbb{N}\setminus\{0\}$, $h^{*}_{top}(T^k) = h^{*}_{top}(T)$ and $h^{*}_{\mu}(T^k) = h^{*}_{\mu}(T)$. An interesting result about the value of the maximal pattern entropy is as follows:

Theorem 3.14

1. For a t.d.s. $(X,T)$, $h^{*}_{top}(T) \in \{\log k : k \in \mathbb{N}\} \cup \{\infty\}$ (Huang and Ye 2009).
2. For an ergodic m.p.s. $(X,\mathcal{X},\mu,T)$, $h^{*}_{\mu}(T) \in \{\log k : k \in \mathbb{N}\} \cup \{\infty\}$ (Saleski 1977).

In fact, by introducing sequence entropy tuples, one can show that for a t.d.s. $(X,T)$, $h^{*}_{top}(T)$ is $\log k$ for some $k \in \mathbb{N} \cup \{\infty\}$, with $k$ the maximal length of intrinsic sequence entropy tuples. For an m.p.s. $(X,\mathcal{X},\mu,T)$, if $h^{*}_{\mu}(T) > \log(n-1)$ for $n \ge 2$, then there is an intrinsic $\mu$-sequence entropy tuple of length $n$; conversely, if there is an intrinsic $\mu$-sequence entropy tuple of length $n$ and $\mu$ is ergodic, then $h^{*}_{\mu}(T) \ge \log n$. See (Huang and Ye 2009) for details.

Theorem 3.12 says that if a minimal system $(X,T)$ is null, that is, $h^{*}_{top}(T) = 0$, then $(X,T)$ is an almost one-to-one extension of its maximal equicontinuous factor and is uniquely ergodic. There is a natural question: if a minimal system $(X,T)$ is bounded, that is, $h^{*}_{top}(T) < \infty$, then what will happen? Recently Huang et al. showed that each bounded minimal system is an almost finite-to-one extension of its maximal equicontinuous factor, and it has only finitely many ergodic measures (Huang et al. 2019b).

For a compact metric space $X$, let $S(X)$ be the set of the values $h^{*}_{top}(T)$ for all continuous self-maps $T$ on $X$. It is clear that $\{0\} \subset S(X) \subset \{0, \log 2, \log 3, \dots\} \cup \{\infty\}$ by Theorem 3.14. It was shown in Cánovas (2004), Tan et al. (2010), and Tan (2011) that $S(X) = \{0, \log 2, \infty\}$ when $X$ is a finite graph. Moreover, it was shown that $S(X) = \{0, \log 2, \log 3, \dots\} \cup \{\infty\}$ when $X$ is a zero-dimensional space with infinite derived sets (Tan et al. 2010). Recently, Snoha et al. (2018) obtained the following:

Theorem 3.15 For every set $\{0\} \subset A \subset \{0, \log 2, \log 3, \dots\} \cup \{\infty\}$ there exists a one-dimensional continuum $X_A \subset \mathbb{R}^3$ with $S(X_A) = A$.

For the related work, see (Cánovas 2004; Tan et al. 2010; Tan 2011).

Other Tuples

We want to mention that the idea of studying the local properties of tuples can be applied in many other situations. For example, the notion of weakly mixing tuples can be defined and used in the study of weakly mixing systems and null systems (Huang et al. 2003). Tame systems are an important class of zero entropy systems. The notion of IT-tuples is a main tool to study their properties (Huang 2006; Kerr and Li 2007). The notion of sensitive tuples is also a useful tool to study sensitivity both in topological dynamics and ergodic theory (Huang et al. 2011a). The notion of $\mathcal{F}_{fip}$-pairs and its application can be found in Dong et al. (2013). Moreover, the notion of entropy tuples can be generalized to entropy points, sequences, and sets (Dou et al. 2006; Ye and Zhang 2007).

We believe that such ideas can be applied to more situations.

Structure Theorems and Multiple Ergodic Averages

To answer the question whether a minimal distal t.d.s. is an equicontinuous one, Furstenberg obtained the structure theorem for a minimal distal system. Then the structure theorem for the general minimal t.d.s. was built. Almost at the same time, Furstenberg and Zimmer obtained the structure theorem for an ergodic m.p.s. It should be noted that the theorem has an important role to get an
ergodic proof of the well-known Szemerédi's theorem. To solve the problem whether the multiple ergodic averages converge in $L^2$-norm, Host-Kra obtained a structure theorem of an ergodic m.p.s. involving nilsystems. The corresponding structure theorem of a minimal t.d.s. involving nilsystems was built by Host-Kra-Maass and Shao-Ye. Here we will review those results. Moreover, we will show how topological methods can be used to prove the pointwise convergence of multiple ergodic averages for distal systems.

Structure Theorems in Topological Dynamics

The structure theory of minimal systems originated in Furstenberg's seminal work (Furstenberg 1963). It was mainly developed for group actions on Hausdorff spaces. Here we assume for the rest of the paper that $T$ is a homeomorphism, and we will also restrict to metrizable systems. We refer the reader to Auslander (1988), Bronstein (1979), Ellis (1969), Glasner (1976), Veech (1977), and de Vries (1993) for details.

We first recall definitions of extensions. Let $\pi : (X,T) \to (Y,S)$ be an extension of t.d.s. Write

$$R_\pi = \{(x_1, x_2) \in X^2 : \pi(x_1) = \pi(x_2)\}.$$

We say that $\pi$ is proximal if $R_\pi \subset P(X,T)$, where $P(X,T)$ is the set of all proximal pairs. An extension $\pi$ is an equicontinuous extension if for every $\epsilon > 0$ there is $\delta > 0$ such that $(x,y) \in R_\pi$ and $\rho(x,y) < \delta$ imply $\rho(T^n x, T^n y) < \epsilon$ for every $n \in \mathbb{Z}$. An equicontinuous extension is also called an isometric or almost periodic extension. An extension $\pi$ is a (topologically) weakly mixing extension if $(R_\pi, T \times T)$ as a subsystem of the product system $(X \times X, T \times T)$ is transitive.

Definition 14 We say that a minimal system $(X,T)$ is a strictly PI system (PI means proximal-isometric) if there is an ordinal $\eta$ (which is countable since $X$ is metrizable) and a family of systems $\{(W_\iota, w_\iota)\}_{\iota \le \eta}$ such that

1. $W_0$ is the trivial system,
2. for every $\iota < \eta$ there exists an extension $\phi_\iota : W_{\iota+1} \to W_\iota$ which is either proximal or equicontinuous,
3. for a limit ordinal $\nu \le \eta$ the system $W_\nu$ is the inverse limit of the systems $\{W_\iota\}_{\iota < \nu}$,
4. $W_\eta = X$.

We say that $(X,T)$ is a PI-system if there exists a strictly PI system $\tilde{X}$ and a proximal extension $\theta : \tilde{X} \to X$.

If in the definition of PI-systems we replace proximal extensions by almost one-to-one extensions, we get the notion of HPI-systems. If we replace the proximal extensions by trivial extensions (i.e., we do not allow proximal extensions at all), we have I-systems.

In this terminology Furstenberg's structure theorem for a distal system and the Veech-Ellis structure theorem for a point distal system can be stated as follows:

Theorem 4.1 (Furstenberg 1963, Furstenberg) A minimal t.d.s. is distal if and only if it is an I-system.

Theorem 4.2 (Veech 1970; Ellis 1973, Veech-Ellis) A minimal t.d.s. is point distal if and only if it is an HPI-system.

Finally we state the structure theorem for general minimal systems, which was built in Ellis et al. (1975), McMahon (1976), Veech (1977), and Glasner (1976). Roughly speaking, the class of minimal flows is the smallest class of flows containing the trivial flow and closed under (a) homomorphisms, (b) inverse limits, and (c) three "building blocks" which are equicontinuous extensions, proximal extensions, and topologically weakly mixing extensions. To be precise, we have the following structure theorem for minimal systems.

Theorem 4.3 (Structure theorem for minimal systems). Let $(X,T)$ be a minimal t.d.s. Then we have the following diagram:
where $\alpha : Y \to G$ is a measurable map, called the cocycle defining the extension. We denote $X'$ by $Y \times_\alpha G/H$, and $T'$ by $T_\alpha$.

Theorem 4.4 (Zimmer). (Zimmer 1976b) Let $(X,\mathcal{X},\mu,T)$ be an ergodic m.p.s. Then $(X,\mathcal{X},\mu,T)$ is measurable distal if and only if there exists a countable ordinal $\eta$ and a directed family of factors $(X_\theta, \mathcal{X}_\theta, \mu_\theta, T)$, $\theta \le \eta$, such that

1. $X_0 = \{pt\}$ is the trivial system and $X_\eta = X$.
2. For $\theta < \eta$ the extension $\pi_\theta : X_{\theta+1} \to X_\theta$ is isometric and nontrivial (i.e., not an isomorphism).
3. For a limit ordinal $\lambda \le \eta$, $X_\lambda = \varprojlim_{\theta < \lambda} X_\theta$ (i.e. $\mathcal{X}_\lambda = \bigvee_{\theta < \lambda} \mathcal{X}_\theta$).

Here is the Furstenberg-Zimmer structure theorem for $\mathbb{Z}$-actions. For general locally compact group actions, we refer to Furstenberg (1981), Glasner (2003), Zimmer (1976a, b).

Theorem 4.5 (Furstenberg-Zimmer's structure theorem). Each ergodic m.p.s. $(X,\mathcal{X},\mu,T)$ is a weakly mixing extension of an ergodic distal system $X_\eta$.

$$\underbrace{X_0 \xleftarrow{\pi_0} X_1 \xleftarrow{\pi_1} \cdots \xleftarrow{\pi_{n-1}} X_n \xleftarrow{\pi_n} X_{n+1} \xleftarrow{\pi_{n+1}} \cdots \leftarrow X_\eta}_{\text{distal factor}} \xleftarrow{\pi} X.$$

Structure theorems are very useful in the study of combinatorial number theory; we refer to Bergelson (2006), Bergelson and Leibman (1996), Furstenberg (1977, 1981, 1991), Frantzikinakis (2017), and Kra (2006), etc. for more details. Here we mention an example of the applications of the structure theorems in the theory of chaos. The notion of Li-Yorke chaos was introduced in Li and Yorke (1975). Let $(X,T)$ be a t.d.s. A pair $(x,y) \in X \times X$ is called a Li-Yorke pair if

$$\liminf_{n\to+\infty} \rho(T^n x, T^n y) = 0 \quad\text{and}\quad \limsup_{n\to+\infty} \rho(T^n x, T^n y) > 0.$$

A subset $C$ of $X$ is called scrambled if any two distinct points $x, y \in C$ form a Li-Yorke (scrambled) pair. A t.d.s. $(X,T)$ is said to be Li-Yorke chaotic if there is an uncountable scrambled set in $X$. Using the Furstenberg-Zimmer structure theorem, Blanchard et al. (2002a) showed that positive entropy implies Li-Yorke chaos. In fact, they showed that if for a t.d.s. $(X,T)$ there is some ergodic measure $\mu$ such that $(X,\mathcal{B}_X,\mu,T)$ is not measurable distal, then $(X,T)$ is Li-Yorke chaotic. Note that Kerr and Li gave an alternative proof of this fact using a combinatorial argument (Kerr and Li 2007). Using the structure of minimal systems, Akin et al. showed that if a minimal system $(X,T)$ is not PI, then it is Li-Yorke chaotic (Akin et al. 2010). It is known that if each pair not in the diagonal is (positively) Li-Yorke, then the system has zero entropy (Blanchard et al. 2002b). It is an open question whether this is still true if we replace positively Li-Yorke by positively or negatively Li-Yorke.

We refer the readers to Blanchard and Huang (2008), Downarowicz (2014), Huang (2008), Huang and Jin (2016), Huang et al. (2014, 2015, 2017a), Huang and Lu (2017), Li and Qiao (2018), Wang et al. (2019), Zhang (2006), Zhou et al. (2017), and Liu (2019) for more study of chaotic phenomena appearing in positive entropy systems by using the variational principle and the structure theorems.

Multiple Ergodic Averages

Ramsey theory is best defined by example, and the classic example of a Ramsey type theorem is the result of van der Waerden (1927) in the 1920s: if the integers are partitioned into finitely many subsets, at least one of the subsets contains arbitrarily long arithmetic progressions. Erdős and Turán (1936) conjectured that a weaker assumption suffices: if $E$ is a set of integers whose upper density $\bar{d}(E)$ is positive, then $E$ contains arbitrarily long arithmetic progressions. This conjecture was verified by Szemerédi in 1975 (Szemerédi 1975), and was reproved by Furstenberg using ergodic theoretic methods in 1976 (Furstenberg 1977). Green and Tao showed that the primes contain arbitrarily long arithmetic progressions (Green and Tao
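The finite form of van der Waerden's theorem can be checked by brute force in the smallest nontrivial case. The Python sketch below is an illustration added here, with hypothetical function names: it enumerates all colourings of $\{1, \dots, n\}$ and tests each for a monochromatic arithmetic progression. It relies on the known value $W(2;3) = 9$, the least $n$ for which every 2-colouring of $\{1, \dots, n\}$ contains a monochromatic 3-term progression.

```python
from itertools import product


def has_mono_ap(coloring, length):
    """Return True if some colour class contains an arithmetic progression
    with the given number of terms. coloring[i] is the colour of i + 1."""
    n = len(coloring)
    for start in range(n):
        for step in range(1, n):
            last = start + (length - 1) * step
            if last >= n:
                break  # larger steps only push the last term further out
            terms = {coloring[start + i * step] for i in range(length)}
            if len(terms) == 1:
                return True
    return False


def every_coloring_has_ap(n, colors, length):
    """Exhaustively check all colourings of {1, ..., n} with `colors` colours."""
    return all(
        has_mono_ap(c, length) for c in product(range(colors), repeat=n)
    )
```

For instance, `every_coloring_has_ap(9, 2, 3)` examines all 512 colourings of $\{1, \dots, 9\}$. The search is exponential in $n$, so it is practical only for very small parameters.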
interesting, but very difficult, problem is to find a way to give structural results that control these more general averages. Such results are not yet available in the general setting of nilpotent groups of transformations, nor even just for commuting transformations. We refer to Furstenberg (2010), Kra (2006), and Host and Kra (2018) for more details on this topic.

Characteristic Factors in Ergodic Theory

In the study of multiple ergodic averages, the idea of characteristic factors plays a very important role. This idea was suggested by Furstenberg in Furstenberg (1977), and the notion of "characteristic factors" was first introduced in a paper by Furstenberg and Weiss (1996). Let $(X,\mathcal{X},\mu,T)$ be an m.p.s. and $(Y,\mathcal{Y},\mu,T)$ be a factor of $X$. Let $\{p_1, \dots, p_d\}$ be a family of integer valued polynomials, $d \in \mathbb{N}$. We say that $Y$ is a characteristic factor of $X$ for the scheme $\{p_1, \dots, p_d\}$ if for all $f_1, \dots, f_d \in L^\infty(X,\mathcal{X},\mu)$,

$$\frac{1}{N}\sum_{n=0}^{N-1} T^{p_1(n)} f_1\, T^{p_2(n)} f_2 \cdots T^{p_d(n)} f_d - \frac{1}{N}\sum_{n=0}^{N-1} T^{p_1(n)}\mathbb{E}(f_1 \mid \mathcal{Y})\, T^{p_2(n)}\mathbb{E}(f_2 \mid \mathcal{Y}) \cdots T^{p_d(n)}\mathbb{E}(f_d \mid \mathcal{Y}) \xrightarrow{L^2} 0 \quad (N \to \infty).$$

Finding a characteristic factor often gives a reduction of the problem of evaluating limit behavior of multiple averages to special systems. In the rest of the entry, we mainly consider the scheme $\{n, 2n, \dots, dn\}$. That is, we consider the average

$$\frac{1}{N}\sum_{n=0}^{N-1} f_1(T^n x)\, f_2(T^{2n} x) \cdots f_d(T^{dn} x). \qquad (4)$$

To study (4), it suffices to consider ergodic systems, and we will restrict to this case below. Furstenberg (1977) proved that the distal factor is a characteristic factor for (4), and this observation was a main step in the proof of the ergodic version of Szemerédi's theorem, see (1). But this fact is not enough to show the mean convergence of (4). There always exists a minimal characteristic factor $Z$, meaning that $Z$ is characteristic and that every proper factor of $Z$ is not characteristic. To describe this minimal characteristic factor, we need to introduce some notions.

Definition 16 Let $G$ be a group. For $A, B \subset G$, we write $[A,B]$ for the subgroup spanned by $\{[a,b] = aba^{-1}b^{-1} : a \in A, b \in B\}$. The commutator subgroups $G_j$, $j \ge 1$, are defined inductively by setting $G_1 = G$ and $G_{j+1} = [G_j, G]$. Let $d \ge 1$ be an integer. We say that $G$ is d-step nilpotent if $G_{d+1}$ is the trivial subgroup.

Let $G$ be a d-step nilpotent Lie group and $\Gamma$ be a discrete cocompact subgroup of $G$. The compact manifold $X = G/\Gamma$ is called a d-step nilmanifold. The group $G$ acts on $X$ by left translations and we write this action as $(g,x) \mapsto gx$. The Haar measure $\mu$ of $X$ is the unique probability measure on $X$ invariant under this action. Let $\tau \in G$ and $T$ be the transformation $x \mapsto \tau x$ of $X$. Then $(X,\mu,T)$ is called a d-step nilsystem.

Conze and Lesigne had shown (Conze and Lesigne 1984, 1987, 1988) that an inverse limit of 2-step nilsystems is the characteristic factor for the 3-term multiple ergodic averages. The general case was confirmed by constructing such factors in Host and Kra (2005) (see also (Ziegler 2007)).

Now we outline Host-Kra's methods and results, and we need some notation first. Let $X$ be a set, and let $d \ge 1$ be an integer. We view an element of $\{0,1\}^d$ as a sequence $\epsilon = \epsilon_1 \dots \epsilon_d$ of 0's and 1's, and let $|\epsilon| = \epsilon_1 + \epsilon_2 + \dots + \epsilon_d$. We denote $X^{2^d}$ by $X^{[d]}$. A point $x \in X^{[d]}$ can be decomposed as $x = (x', x'')$ with $x', x'' \in X^{[d-1]}$, where $x' = (x_{\epsilon 0} : \epsilon \in \{0,1\}^{d-1})$ and $x'' = (x_{\epsilon 1} : \epsilon \in \{0,1\}^{d-1})$, which induces a natural identification of $X^{[d]}$ and $X^{[d-1]} \times X^{[d-1]}$. As examples, points in $X^{[2]}$ are like $(x_{00}, x_{10}, x_{01}, x_{11})$.

Let $(X_1,\mathcal{X}_1,\mu_1,T_1)$, $(X_2,\mathcal{X}_2,\mu_2,T_2)$ be two m.p.s. and let $(Y,\mathcal{Y},\nu,S)$ be a common factor with $\pi_i : X_i \to Y$ for $i = 1, 2$ the factor maps. Let $\mu_i = \int \mu_{i,y}\, d\nu(y)$ represent the disintegration of $\mu_i$ with respect to $Y$. Let $\mu_1 \times_Y \mu_2$ denote the measure defined by
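$$\mu_1 \times_Y \mu_2(A) = \int_Y (\mu_{1,y} \times \mu_{2,y})(A)\, d\nu(y),$$

for all $A \in \mathcal{X}_1 \times \mathcal{X}_2$. The system $(X_1 \times X_2, \mathcal{X}_1 \times \mathcal{X}_2, \mu_1 \times_Y \mu_2, T_1 \times T_2)$ is called the relative product of $X_1$ and $X_2$ with respect to $Y$ and is denoted $X_1 \times_Y X_2$. $\mu_1 \times_Y \mu_2$ is also called the relatively independent joining of $X_1$ and $X_2$ over $Y$.

For an m.p.s. $(X,\mathcal{X},\mu,T)$ we write $\mathcal{I}(T)$ for the σ-algebra $\{A \in \mathcal{X} : T^{-1}A = A\}$ of invariant sets.

Let $(X,\mathcal{X},\mu,T)$ be an ergodic system and $k \in \mathbb{N}$. We define a measure $\mu^{[k]}$ on $X^{[k]}$ invariant under $T^{[k]} = T \times T \times \dots \times T$ ($2^k$ times), by $\mu^{[1]} = \mu \times_{\mathcal{I}(T)} \mu = \mu \times \mu$; and for $k \ge 1$,

$$\mu^{[k+1]} = \mu^{[k]} \times_{\mathcal{I}(T^{[k]})} \mu^{[k]}.$$

For an integer $k \ge 1$, let $(\Omega_k, P_k)$ be the system corresponding to the σ-algebra $\mathcal{I}^{[k]}$ and let

$$\mu^{[k]} = \int_{\Omega_k} \mu^{[k]}_\omega\, dP_k(\omega)$$

denote the ergodic decomposition of $\mu^{[k]}$ under $T^{[k]}$. Then by definition

$$\mu^{[k+1]} = \int_{\Omega_k} \mu^{[k]}_\omega \times \mu^{[k]}_\omega\, dP_k(\omega).$$

If $(X,\mu,T)$ is weakly mixing, then by induction $\mathcal{I}(T^{[k]})$ is trivial and $\mu^{[k]}$ is the $2^k$ Cartesian power $\mu^{2^k}$ of $\mu$ for $k \ge 1$.

For $f \in L^\infty(\mu)$, the integral $\int_{X^{[k]}} \prod_{\epsilon \in \{0,1\}^k} C^{|\epsilon|} f(x_\epsilon)\, d\mu^{[k]}(x)$, where $C$ denotes complex conjugation, is real and nonnegative. Therefore, we can define a seminorm $|\!|\!|\cdot|\!|\!|_k$ on $L^\infty(\mu)$ by

$$|\!|\!| f |\!|\!|_k = \Big(\int_{X^{[k]}} \prod_{\epsilon \in \{0,1\}^k} C^{|\epsilon|} f(x_\epsilon)\, d\mu^{[k]}(x)\Big)^{1/2^k}.$$

That $|\!|\!|\cdot|\!|\!|_k$ is a seminorm can be proved as in Host and Kra (2005), and it is called the Host-Kra seminorm. The equation below follows immediately from the definition of the measures and the Ergodic Theorem, and can be considered as an alternate definition of the seminorms. For every integer $k \ge 0$ and every $f \in L^\infty(\mu)$, one has

$$|\!|\!| f |\!|\!|_{k+1} = \Big(\lim_{N\to\infty} \frac{1}{N}\sum_{n=0}^{N-1} |\!|\!| f \cdot T^n \bar{f} |\!|\!|_k^{2^k}\Big)^{1/2^{k+1}}.$$

Then by the Host-Kra seminorm we can define factors $(Z_{d-1}, \mathcal{Z}_{d-1}, \mu_{d-1}, T)$.

Definition 17 Let $(X,\mathcal{X},\mu,T)$ be an ergodic m.p.s. For $d \in \mathbb{N}$, there exists a $T$-invariant σ-algebra $\mathcal{Z}_{d-1}$ of $X$ such that for $f \in L^\infty(\mu)$,

$$|\!|\!| f |\!|\!|_d = 0 \text{ if and only if } \mathbb{E}(f \mid \mathcal{Z}_{d-1}) = 0.$$

Let $(Z_{d-1}, \mathcal{Z}_{d-1}, \mu_{d-1}, T)$ be the factor of $X$ associated to the sub σ-algebra $\mathcal{Z}_{d-1}$. If $X = Z_{d-1}$, then $X$ is called a system of order $d-1$.

Using the van der Corput Lemma, Host and Kra showed that the original averages along arithmetic progressions are controlled by the seminorms.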
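The recursive formula for the Host-Kra seminorms can be evaluated exactly in a toy case. The Python sketch below is an illustration under stated assumptions, not the authors' construction: it takes the finite rotation $x \mapsto x + 1$ on $\mathbb{Z}_m$ with normalized counting measure (an ergodic system), represents a function $f$ as a list of its $m$ values, uses $|\!|\!| g |\!|\!|_1 = |\int g\, d\mu|$ (which follows from $\mu^{[1]} = \mu \times \mu$), and replaces the Cesàro limit by an average over one period. All function names are hypothetical.

```python
import cmath


def seminorm1(g):
    """|||g|||_1 = |integral of g| for an ergodic system (mu^[1] = mu x mu)."""
    return abs(sum(g) / len(g))


def seminorm2(f):
    """|||f|||_2 on Z_m with T x = x + 1, via the recursive formula with k = 1.

    The sequence n -> |||f . T^n conj(f)|||_1^2 has period m, so its Cesaro
    limit equals the average over n = 0, ..., m - 1.
    """
    m = len(f)
    total = 0.0
    for n in range(m):
        g = [f[x] * f[(x + n) % m].conjugate() for x in range(m)]
        total += seminorm1(g) ** 2
    return (total / m) ** 0.25


def character(m, k):
    """The eigenfunction x -> exp(2 pi i k x / m) of the rotation on Z_m."""
    return [cmath.exp(2j * cmath.pi * k * x / m) for x in range(m)]
```

For a nontrivial character the first seminorm vanishes while the second equals 1, which matches the statement that $Z_0$ is the trivial factor whereas $Z_1$ is the Kronecker factor: eigenfunctions are invisible to $|\!|\!|\cdot|\!|\!|_1$ but not to $|\!|\!|\cdot|\!|\!|_2$.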
work, and also the most technical portion, is given by Glasner in Glasner (1994). Let (X, T) be a
devoted to the description of these factors. The t.d.s. and d ℕ. Let sd ¼ T T2 . . . Td.
structure theorem states the system (Y, T) is said to be an topological characteristic
$(Z_{d-1}, \mathcal{Z}_{d-1}, \mu_{d-1}, T)$ is a (measure theoretic) inverse limit of $(d-1)$-step nilsystems.

We isolate the first coordinate, writing $X^{[d]}_* = X^{2^d-1}$ and then writing a point $x \in X^{[d]}$ as $x = (x_0, x_*)$, where $x_* = (x_\epsilon : \epsilon \neq 0) \in X^{[d]}_*$ and $0 = 00\ldots0 \in \{0,1\}^d$.

Theorem 4.9 (Host-Kra). (Host and Kra 2005) Let $(X, \mathcal{X}, \mu, T)$ be an ergodic system and $d \in \mathbb{N}$. Then the following properties are equivalent:

1. $X$ is a system of order $d-1$, that is, $(X, \mathcal{X}, \mu, T) = (Z_{d-1}, \mathcal{Z}_{d-1}, \mu_{d-1}, T)$.
2. The system $(X, \mathcal{X}, \mu, T)$ is a (measure theoretic) inverse limit of $(d-1)$-step nilsystems.
3. $|||\cdot|||_d$ is a norm on $L^\infty(\mu)$, equivalently, $|||f|||_d = 0$ implies that $f = 0$.
4. There exists a measurable map $J : X^{[d]}_* \to X$ such that $x_0 = J(x_\epsilon : 0 \neq \epsilon \in \{0,1\}^d)$ for $\mu^{[d]}$-almost every $x = (x_\epsilon : \epsilon \in \{0,1\}^d) \in X^{[d]}$.

$Z_0$ is the trivial factor, and $Z_1$ is the Kronecker factor (i.e. $\mathcal{Z}_1 = \mathcal{K}_\mu$) and more generally, $Z_k$ is a compact abelian group extension of $Z_{k-1}$. Furthermore, the sequence of factors is increasing

$$\{pt\} = Z_0 \leftarrow Z_1 \leftarrow \cdots \leftarrow Z_n \leftarrow Z_{n+1} \leftarrow \cdots \leftarrow X,$$

and if $T$ is weakly mixing, then $Z_k$ is the trivial factor for all $k$.

Convergence of the linear multiple ergodic average then follows easily from the general properties of nilmanifolds proved by Leibman (2005) (which was proved by Lesigne for connected groups earlier (Lesigne 1989)). See also (Bergelson et al. 2005; Ziegler 2005). For more details on this topic, we refer to the recent book (Host and Kra 2018).

factor of order $d$ if there exists a dense $G_\delta$ set $\Omega$ of $X$ such that for each $x \in \Omega$ the orbit closure $L = \overline{\mathcal{O}}(x^d, \sigma_d)$ is $\pi \times \cdots \times \pi$ ($d$ times) saturated, where $x^d = (x, \ldots, x)$ ($d$ times) and $\pi : X \to Y$ is the corresponding factor map. That is, $(x_1, x_2, \ldots, x_d) \in L$ if and only if $(x_1', x_2', \ldots, x_d') \in L$ whenever for all $1 \le i \le d$, $\pi(x_i) = \pi(x_i')$. In Glasner (1994), it was shown that if $(X, T)$ is a distal minimal system, then its largest $d$-step distal factor (in Furstenberg's tower of a minimal distal system) is a topological characteristic factor of order $d$; if $(X, T)$ is a weakly mixing system, then the trivial system is its topological characteristic factor. It is a deep open problem whether for a minimal distal system one can replace the largest $d$-step distal factor in Glasner's theorem by the maximal $d$-step pro-nilfactor.

The second way was given by Host, Kra, and Maass in Host et al. (2010). They obtained a topological structure theory involving nilsystems for all minimal distal systems, which can be viewed as an analog of the purely ergodic structure theory of (Host and Kra 2005) and a refinement of Furstenberg's structure theorem for minimal distal systems. In Host et al. (2010), a certain generalization of the regionally proximal relation, namely $RP^{[d]}$ (the regionally proximal relation of order $d$), was introduced and used to produce the maximal pro-nilfactors, which can be seen as the characteristic factors of the minimal system $(X, T)$. In the following we will give some details of this approach.

If $n = (n_1, \ldots, n_d) \in \mathbb{Z}^d$ and $\epsilon \in \{0,1\}^d$, we define $n \cdot \epsilon = \sum_{i=1}^{d} n_i \epsilon_i$. Let $(X, T)$ be a t.d.s. and let $d \ge 1$ be an integer. We define $Q^{[d]}(X)$ to be the closure in $X^{[d]}$ of elements of the form

$$\left(T^{n \cdot \epsilon} x = T^{n_1\epsilon_1 + \cdots + n_d\epsilon_d} x : \epsilon = \epsilon_1 \epsilon_2 \ldots \epsilon_d \in \{0,1\}^d\right),$$

where $n = (n_1, \ldots, n_d) \in \mathbb{Z}^d$ and $x \in X$.
As examples, $Q^{[2]}$ is the closure in $X^{[2]} = X^4$ of the set $\{(x, T^m x, T^n x, T^{n+m} x) : x \in X,\ m, n \in \mathbb{Z}\}$. $Q^{[d]}$ may be viewed as the topological correspondence of $\mu^{[d]}$.

Face transformations $T_j^{[d]} : X^{[d]} \to X^{[d]}$, $j = 1, \ldots, d$, are defined as follows: for every $x = (x_\epsilon)_{\epsilon \in \{0,1\}^d} \in X^{[d]}$,

$$\left(T_j^{[d]} x\right)_\epsilon = \begin{cases} T x_\epsilon & \text{if } \epsilon_j = 1; \\ x_\epsilon & \text{if } \epsilon_j = 0. \end{cases}$$

The face group of dimension $d$ is the group $\mathcal{F}^{[d]}(X)$ of transformations of $X^{[d]}$ spanned by the face transformations. The cube group or parallelepiped group of dimension $d$ is the group $\mathcal{G}^{[d]}(X)$ spanned by the diagonal transformation and the face transformations. We often write $\mathcal{F}^{[d]}$ and $\mathcal{G}^{[d]}$ instead of $\mathcal{F}^{[d]}(X)$ and $\mathcal{G}^{[d]}(X)$, respectively. It is easy to verify that $Q^{[d]}$ is the closure in $X^{[d]}$ of $\{S x^{[d]} : S \in \mathcal{F}^{[d]},\ x \in X\}$. If $x$ is a transitive point of $X$, then $Q^{[d]}$ is the orbit closure of $x^{[d]}$ under the group $\mathcal{G}^{[d]}$. Here, for $x \in X$, we write $x^{[d]} = (x, x, \ldots, x) \in X^{[d]}$.

Theorem 4.10 If $(X, T)$ is minimal and $d \in \mathbb{N}$, then $(Q^{[d]}, \mathcal{G}^{[d]})$ is minimal (Host et al. 2010). And for all $x \in X$, $(\overline{\mathcal{O}}(x^{[d]}, \mathcal{F}^{[d]}), \mathcal{F}^{[d]})$ is minimal (Shao and Ye 2012).

Theorem 4.10 can be viewed as a topological analogue of the following ergodic theorem.

Theorem 4.11 (Host and Kra 2005) If $(X, \mathcal{X}, \mu, T)$ is an ergodic m.p.s. and $d \in \mathbb{N}$, then $(X^{[d]}, \mu^{[d]}, \mathcal{G}^{[d]})$ is ergodic. And $(\Omega_d, P_d)$ is ergodic under the action of the group $\mathcal{F}^{[d]}$.

Compared with Theorem 4.9, the following structure theorem characterizes the inverse limits of nilsystems using dynamical parallelepipeds.

Theorem 4.12 (Host-Kra-Maass). (Host et al. 2010) Assume that $(X, T)$ is a minimal t.d.s. and let $d \ge 2$ be an integer. The following properties are equivalent:

1. $X$ is a (topological) inverse limit of $(d-1)$-step minimal nilsystems.
2. If $x, y \in Q^{[d]}$ have $2^d - 1$ coordinates in common, then $x = y$.
3. If $x, y \in X$ are such that $(x, y, \ldots, y) \in Q^{[d]}$, then $x = y$.

A minimal system satisfying one of the equivalent properties above is called a (topological) system of order $(d-1)$ or a (topological) $(d-1)$-step pro-nilsystem.

Note that any system of order $d$ is isomorphic in the measure theoretic sense to a topological system of order $d$ (Host et al. 2010). Let $(X, T)$ be a system of order $d$; then the maximal measurable and topological factors of order $j$ coincide, where $j \le d$ (Dong et al. 2013).

Definition 18 Let $(X, T)$ be a t.d.s. and let $d \in \mathbb{N}$. The points $x, y \in X$ are said to be regionally proximal of order $d$ if for any $\delta > 0$, there exist $x', y' \in X$ and a vector $n = (n_1, \ldots, n_d) \in \mathbb{Z}^d$ such that $\rho(x, x') < \delta$, $\rho(y, y') < \delta$, and

$$\rho(T^{n \cdot \epsilon} x', T^{n \cdot \epsilon} y') < \delta \quad \text{for any } \epsilon \in \{0,1\}^d \setminus \{0\}.$$

The set of regionally proximal pairs of order $d$ is denoted by $RP^{[d]}$ (or by $RP^{[d]}(X, T)$ in case of ambiguity), and is called the regionally proximal relation of order $d$.

The above definition was introduced in Host and Maass (2007) and Host et al. (2010). The authors showed (Host et al. 2010) that if a system is minimal and distal, then $RP^{[d]}$ is an equivalence relation, and proved the very deep result that $(X/RP^{[d]}, T)$ is the maximal $d$-step pro-nilfactor of the system. Using the theory of the Ellis semigroup, Shao and Ye (2012) showed that all these results in fact hold for arbitrary minimal systems of abelian group actions. In a recent paper by Glasner et al. (2018), the same question is considered for a general group action, and similar results are proved.

Theorem 4.13 (Shao and Ye 2012; Host et al. 2010) Let $(X, T)$ be a minimal t.d.s. and $d \in \mathbb{N}$. Then

1. $RP^{[d]}(X)$ is an equivalence relation.
418 Parallels Between Topological Dynamics and Ergodic Theory
2. $(X_d = X/RP^{[d]}, T)$ is the maximal $d$-step pro-nilfactor of $(X, T)$.

Note that $RP^{[1]}$ is the classical regionally proximal relation and $X_1 = X_{eq}$ is the maximal equicontinuous factor of the system. Furthermore, the sequence of factors is increasing

$$\{pt\} = X_0 \leftarrow X_1 \leftarrow \cdots \leftarrow X_n \leftarrow X_{n+1} \leftarrow \cdots \leftarrow X,$$

and if $(X, T)$ is topologically weakly mixing, then $X_n$ is the trivial factor for all $n$.

There are other ways to study the counterpart of characteristic factors in a t.d.s. For example, in Cai and Shao (2019), the topological characteristic factors along cubes of minimal systems are studied; and in Glasner et al. (2019), the authors considered the regionally proximal relation of order $d$ along arithmetic progressions.

Further developments of the theory of cubes in an abstract setting, calling these structures nilspaces, were given by Host and Kra (2008), Camarena and Szegedy (Cai and Shao 2019; Szegedy 2012), Gutman et al. (2019a, b, 2020), Candela (2017a, b), etc.

Topological Methods in the Study of the Multiple Ergodic Averages

Though results on the convergence of multiple ergodic averages in $L^2$ norm are now very rich, there are only a few about pointwise (almost everywhere) convergence. In 1989 Bourgain (1989) showed that $\lim_{N \to \infty} \frac{1}{N} \sum_{n=0}^{N-1} f(T^{p(n)} x)$ exists a.e. for every polynomial $p(n) \in \mathbb{Z}[n]$ and every $f \in L^p(X, \mathcal{X}, \mu)$ with exponent $p > 1$. Note that this result may fail when $p = 1$ (Buczolich and Mauldin 2010). And in 1990 Bourgain (1990) proved that $\lim_{N \to \infty} \frac{1}{N} \sum_{n=0}^{N-1} f_1(T^{a_1 n} x) f_2(T^{a_2 n} x)$ exists a.e. for all $f_1, f_2$ in $L^\infty(X, \mathcal{X}, \mu)$. Recently the authors of the current paper found that one may use topological methods to study almost everywhere convergence of multiple ergodic averages.

Let $(X, T)$ be a t.d.s. and $d \in \mathbb{N}$. Set $\tau_d = T \times \cdots \times T$ ($d$ times) and $\sigma_d = T \times T^2 \times \cdots \times T^d$. Let $\langle \tau_d, \sigma_d \rangle$ be the group generated by $\tau_d$ and $\sigma_d$. For any $x \in X$, let $N_d(X, x) = \overline{\mathcal{O}}((x, \ldots, x), \langle \tau_d, \sigma_d \rangle)$, the orbit closure of $(x, \ldots, x)$ ($d$ times) under the action of the group $\langle \tau_d, \sigma_d \rangle$. We remark that if $(X, T)$ is minimal, then all $N_d(X, x)$ coincide, and the common set will be denoted by $N_d(X)$. It was shown by Glasner (1994) that if $(X, T)$ is minimal, then $(N_d(X), \langle \tau_d, \sigma_d \rangle)$ is minimal. Hence if $(N_d(X), \langle \tau_d, \sigma_d \rangle)$ is uniquely ergodic, then it is strictly ergodic. In fact, we can give a nice model for any ergodic m.p.s.:

Theorem 4.14 (Huang et al. 2019d) Let $(X, \mathcal{X}, \mu, T)$ be an ergodic m.p.s. and $d \in \mathbb{N}$. Then it has a strictly ergodic model $(\hat{X}, \hat{T})$ such that $(N_d(\hat{X}), \langle \tau_d, \sigma_d \rangle)$ is strictly ergodic.

Note that every uniquely ergodic t.d.s. has a very nice convergence property: a t.d.s. $(X, G)$ with $G = \mathbb{Z}^d$ is uniquely ergodic if and only if for every continuous function $f \in C(X)$ the sequence of functions $\frac{1}{N^d} \sum_{g \in [0, N-1]^d} f(gx)$ converges uniformly to a constant function. Thus as an application of Theorem 4.14 we can show the following:

Theorem 4.15 (Huang et al. 2019d) Let $(X, \mathcal{X}, \mu, T)$ be an ergodic m.p.s. and $d \in \mathbb{N}$. Then for $f_1, \ldots, f_d \in L^\infty(\mu)$ the averages

$$\frac{1}{N^2} \sum_{(n,m) \in [0, N-1]^2} f_1(T^n x) f_2(T^{n+m} x) \cdots f_d(T^{n+(d-1)m} x)$$

converge to a constant $\mu$-a.e.

To prove Theorem 4.14, we found that not every strictly ergodic model is the one we need, and that the Jewett-Krieger Theorem is not enough for our purpose. Fortunately, Weiss's Theorem is the right tool.

We say that $\hat{\pi} : \hat{X} \to \hat{Y}$ is a topological model for a factor map $\pi : (X, \mathcal{X}, \mu, T) \to (Y, \mathcal{Y}, \nu, S)$ if $\hat{\pi}$ is a topological factor map and there exist measure theoretical isomorphisms $\phi$ and $\psi$ such that the diagram

$$\begin{array}{ccc} X & \xrightarrow{\ \phi\ } & \hat{X} \\ \pi \downarrow & & \downarrow \hat{\pi} \\ Y & \xrightarrow{\ \psi\ } & \hat{Y} \end{array}$$

is commutative, that is, $\hat{\pi} \phi = \psi \pi$. Weiss (1985) generalized the theorem of Jewett-Krieger to the relative case. Namely, he proved that if $\pi : (X, \mathcal{X}, \mu, T) \to (Y, \mathcal{Y}, \nu, T)$ is a factor map with $(X, \mathcal{X}, \mu, T)$ ergodic and $(\hat{Y}, \hat{\mathcal{Y}}, \hat{\nu}, \hat{T})$ is a uniquely ergodic model for $(Y, \mathcal{Y}, \nu, T)$, then there is a uniquely ergodic model $(\hat{X}, \hat{\mathcal{X}}, \hat{\mu}, \hat{T})$ for $(X, \mathcal{X}, \mu, T)$ and a factor map $\hat{\pi} : \hat{X} \to \hat{Y}$ which is a model for $\pi : X \to Y$.

For $d \ge 3$ let $\pi_{d-2} : X \to Z_{d-2}$ be the factor map from $X$ to its $(d-2)$-step nilfactor $Z_{d-2}$. By the results of Host-Kra-Maass in Host et al. (2010), $Z_{d-2}$ may be regarded as a topological system in the natural way. Using Weiss's Theorem there is a uniquely ergodic model $(\hat{X}, \hat{\mathcal{X}}, \hat{\mu}, \hat{T})$ for $(X, \mathcal{X}, \mu, T)$ and a factor map $\hat{\pi}_{d-2} : \hat{X} \to Z_{d-2}$ which is a model for $\pi_{d-2} : X \to Z_{d-2}$. We then show that $(\hat{X}, \hat{T})$ is what we need, that is, $(N_d(\hat{X}), \langle \tau_d, \sigma_d \rangle)$ is uniquely ergodic.

Using some results developed when proving Theorem 4.15, one has

Theorem 4.17 (Huang et al. 2019d) Let $(X, \mathcal{X}, \mu, T)$ be an ergodic distal system, and $d \in \mathbb{N}$. Then for all $f_1, \ldots, f_d \in L^\infty(\mu)$

$$\frac{1}{N} \sum_{n=0}^{N-1} f_1(T^n x) \cdots f_d(T^{dn} x)$$

converge $\mu$-a.e.

Note that by Furstenberg-Zimmer's structure theorem and Theorem 4.17 the open question (whether (4) converges pointwise) is reduced to dealing with weakly mixing extensions. Even for weakly mixing systems, the question of the pointwise convergence of (4) still remains open. A partial answer to this question was obtained by Assani (1998), who showed that if $(X, \mathcal{X}, \mu, T)$ is a weakly mixing system such that the restriction of $T$ to its Pinsker algebra has spectral type singular w.r.t. the Lebesgue measure, then the limit of (4) exists a.e. Also in Derrien and Lesigne (1996), it was shown that the limit of (4) exists a.e. for a K-system.

Now we describe some results for a class of weakly mixing m.p.s. obtained in Gutman et al. (2018). A self-joining is pairwise independent if each of its two-dimensional marginals is the product measure $\mu \times \mu$, and it is independent if it is the product measure. A system $(X, \mathcal{X}, \mu, T)$ is said to be pairwise independently determined (PID) if all pairwise independent $d$-self joinings ($d \ge 3$) are independent. Each weakly mixing m.p.s. with spectral type singular w.r.t. Lebesgue measure is PID (Host 1991); and so is each finite-rank mixing transformation (Ryzhikov 1993).

Using topological models, one can show the following result.

Theorem 4.18 (Gutman et al. 2018) Let $(X, \mathcal{X}, \mu, T)$ be a weakly mixing and pairwise independently determined (PID) m.p.s. Then for all $d \in \mathbb{N}$ and all $f_1, \ldots, f_d \in L^\infty(X, \mathcal{X}, \mu)$, (4) exists a.s. Moreover, for a generic m.p.t. (4) exists a.s.

Now we discuss the convergence along the cube groups, for which Host and Kra showed the following.

Theorem 4.19 (Host and Kra 2005) Let $(X, \mathcal{X}, \mu, T)$ be an m.p.s., and $d \in \mathbb{N}$. Then for functions $f_\epsilon \in L^\infty(\mu)$, $\epsilon \in \{0,1\}^d$, $\epsilon \neq (0, \ldots, 0)$, the averages

$$\prod_{i=1}^{d} \frac{1}{N_i - M_i} \cdot \sum_{n \in [M_1, N_1) \times \cdots \times [M_d, N_d)} \ \prod_{(0, \ldots, 0) \neq \epsilon \in \{0,1\}^d} f_\epsilon(T^{n \cdot \epsilon} x) \qquad (5)$$

converge in $L^2(X)$ as $N_1 - M_1, N_2 - M_2, \ldots, N_d - M_d$ tend to $+\infty$.

Let $(X, \mathcal{X}, \mu, T)$ be an ergodic m.p.s. and $d \in \mathbb{N}$. Using Weiss's theorem we can prove that it has a strictly ergodic model $(\hat{X}, \hat{T})$ such that $(Q^{[d]}(\hat{X}), \mathcal{G}^{[d]})$ is strictly ergodic. Then we can reprove that (5) converges $\mu$-a.e. (Huang et al. 2017b). This result for the averages along $[0, N-1]^d$ was first established by Assani (2010), and Chu and Frantzikinakis (2012) extended the result to a very general case.
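The cubic averages (5) can be explored numerically in the simplest case. The following Python sketch is only an illustration under stated assumptions: it takes $d = 2$, $M_1 = M_2 = 0$, $N_1 = N_2 = N$, lets $T$ be the circle rotation by $\alpha = \sqrt{2} - 1$, and uses the characters $e(t) = e^{2\pi i t}$ as test functions. The function names are hypothetical, and a finite computation of this kind proves nothing about convergence.

```python
import cmath
import math


def e(t):
    """The character e(t) = exp(2*pi*i*t) on the circle R/Z."""
    return cmath.exp(2j * math.pi * t)


def cubic_average_d2(f10, f01, f11, x, alpha, N):
    """Average (5) for d = 2 over [0, N)^2 for the rotation T(x) = x + alpha (mod 1):
    (1/N^2) * sum_{n1, n2} f10(T^{n1} x) * f01(T^{n2} x) * f11(T^{n1+n2} x)."""
    total = 0j
    for n1 in range(N):
        for n2 in range(N):
            total += (f10((x + n1 * alpha) % 1)
                      * f01((x + n2 * alpha) % 1)
                      * f11((x + (n1 + n2) * alpha) % 1))
    return total / N ** 2


alpha = math.sqrt(2) - 1  # an irrational rotation number (illustrative choice)
x = 0.3

# The frequencies cancel along the cube: each summand equals e(x), so the
# average equals e(x) up to rounding error.
resonant = cubic_average_d2(e, e, lambda t: e(-t), x, alpha, 200)

# No cancellation: the average equals e(3x) * ((1/N) * sum_n e(2*n*alpha))^2,
# which is small for large N.
non_resonant = cubic_average_d2(e, e, e, x, alpha, 200)
```

In the first case the limit is the non-constant function $e(x)$; in the second case the averages tend to $0$, in line with the equidistribution of $n\alpha$ modulo 1.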
It should be noted that these methods were further developed in Donoso and Sun (2016, 2018a, b).

Further Directions

Pointwise Convergence of the Multiple Averages
In contrast to the progress on mean convergence, the problem of pointwise convergence for multiple averages along arithmetic progressions remains open. At present, for a single transformation, the best-known results are due to Bourgain (1990) and Huang et al. (2019d). Donoso and Sun extended the results in Huang et al. (2019d) to commuting transformations in Donoso and Sun (2018a).

Analogue Results
It is always an interesting question to get analogue results in topological dynamics and ergodic theory. We mention two of them:
We say $(X, T)$ is totally minimal if $(X, T^n)$ is minimal for all $n \neq 0$. A deep open question is the so-called "odd recurrent problem" (for the ergodic version, see (Frantzikinakis 2004)). Let $(X, T)$ be totally minimal. For given $d \in \mathbb{N}$ and $j \in \mathbb{Z}_+$ with $0 \le j \le d - 1$, does there exist a sequence $\{n_i\}$ with $n_i \pmod d = j$ and $x \in X$ such that

$$T^{n_i} x \to x,\ T^{2 n_i} x \to x,\ \ldots,\ T^{d n_i} x \to x, \quad i \to \infty?$$

A related question is whether, if $(X, T)$ is totally minimal, there is $x \in X$ such that $\{T^{n^2} x : n \in \mathbb{Z}\}$ is dense in $X$. For a partial solution and the ergodic versions, see (Huang et al. 2019c; Furstenberg 1981; Bourgain 1989).

Ramsey Type Theorems
An important question in Ramsey theory is to determine which algebraic equations, or systems of equations, are partition regular over the natural numbers. Partition regularity of the equation $p(x_1, x_2, \ldots, x_n) = 0$ ($n \ge 2$) amounts to saying that, for any partition of $\mathbb{N}$ into finitely many cells, some cell contains distinct $x_1, x_2, \ldots, x_n$ that satisfy the equation. The case where the polynomial $p$ is linear was completely solved by Rado (1933). A notorious old question of Erdös and Graham (1980) is whether the equation $x^2 + y^2 = z^2$ is partition regular.
Hindman-Graham in 1979 asked the following question: for any finite coloring of $\mathbb{N}$, does there exist $x, y \in \mathbb{N}$ such that $\{x, y, xy, x + y\}$ is monochromatic? Moreira (2017) solved a weaker form by showing that for any finite coloring of $\mathbb{N}$, there exist $x, y \in \mathbb{N}$ such that $\{x, xy, x + y\}$ is monochromatic.

Sarnak Conjecture
The Möbius function $\mu : \mathbb{N} \to \{-1, 0, 1\}$ is defined by $\mu(1) = 1$ and

$$\mu(n) = \begin{cases} (-1)^k & \text{if } n \text{ is a product of } k \text{ distinct primes;} \\ 0 & \text{otherwise.} \end{cases}$$

Let $(X, T)$ be a t.d.s. We say a sequence $\xi$ is realized in $(X, T)$ if there is an $f \in C(X)$ and an $x \in X$ such that $\xi(n) = f(T^n x)$ for any $n \in \mathbb{N}$. A sequence $\xi$ is called deterministic if it is realized in a system with zero topological entropy. Here is the conjecture by Sarnak (2009):
The Möbius function $\mu$ is linearly disjoint from any deterministic sequence $\xi$. That is,

$$\lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} \mu(n) \xi(n) = 0.$$

Though there are many papers which settle the conjecture for some special classes of zero entropy systems (see (Green and Tao 2010; Ferenczi et al. 2018; Kułaga-Przymus and Lemańczyk 2019) and the references therein), it seems that there is a long way to go to settle the conjecture completely.

References

Adler RL, Konheim AG, McAndrew MH (1965) Topological entropy. Trans Am Math Soc 114:309–319
Akin E (1997) Recurrence in topological dynamics, Furstenberg families and Ellis actions. Plenum Press, New York/London
Akin E, Kolyada S (2003) Li-Yorke sensitivity. Nonlinearity 16:1421–1433
Akin E, Auslander J, Berg K (1993) When is a transitive map chaotic? In: Convergence in ergodic theory and probability (Columbus, OH, 1993), 25–40. Ohio State University Mathematical Research Institute Publications, 5. de Gruyter, Berlin, 1996
Akin E, Glasner E, Huang W, Shao S, Ye X (2010) Sufficient conditions under which a transitive system is chaotic. Ergodic Theory Dyn Syst 30:1277–1310
Assani I (1998) Multiple recurrence and almost sure convergence for weakly mixing dynamical systems. Israel J Math 103:111–124
Assani I (2010) Pointwise convergence of ergodic averages along cubes. J Anal Math 110:241–269
Auslander J (1988) Minimal flows and their extensions. North-Holland mathematics studies, vol 153. North-Holland, Amsterdam
Auslander J, Furstenberg H (1994) Product recurrence and distal points. Trans Am Math Soc 343:221–232
Auslander J, Yorke J (1980) Interval maps, factors of maps, and chaos. Tôhoku Math J 32(2):177–188
Bergelson V (2006) Combinatorial and diophantine applications of ergodic theory. Appendix A by A. Leibman and Appendix B by Anthony Quas and Máté Wierdl. In: Handbook of dynamical systems, vol 1B. Elsevier B. V., Amsterdam, pp 745–869
Bergelson V, Leibman A (1996) Polynomial extensions of van der Waerden’s and Szemerédi’s theorems. J AMS 9:725–753
Bergelson V, Leibman A (2002) A nilpotent Roth theorem. Invent Math 147:429–470
Bergelson V, Host B, Kra B (2005) Multiple recurrence and nilsequences. With an appendix by Imre Ruzsa. Invent Math 160:261–303
Billingsley P (1965) Ergodic theory and information. Wiley, New York/London/Sydney
Birkhoff GD (1927) Dynamical systems, vol 9. American Mathematical Society Colloquium Publications, New York
Blanchard F (1992) Fully positive topological entropy and topological mixing. Symbolic dynamics and its applications. AMS Contemp Math 135:95–105
Blanchard F (1993) A disjointness theorem involving topological entropy. Bull Soc Math France 121:465–478
Blanchard F, Huang W (2008) Entropy sets, weakly mixing sets and entropy capacity. Discrete Contin Dyn Syst 20(2):275–311
Blanchard F, Lacroix Y (1993) Zero-entropy factors of topological flows. Proc Am Math Soc 119:985–992
Blanchard F, Host B, Maass A, Martinez S, Rudolph DJ (1995) Entropy pairs for a measure. Ergodic Theory Dyn Syst 15(4):621–632
Blanchard F, Glasner E, Host B (1997) A variation on the variational principle and applications to entropy pairs. Ergodic Theory Dyn Syst 17:29–43
Blanchard F, Host B, Maass A (2000) Topological complexity. Ergodic Theory Dyn Syst 20:641–662
Blanchard F, Glasner E, Kolyada S, Maass A (2002a) On Li-Yorke pairs. J Reine Angew Math 547:51–68
Blanchard F, Host B, Ruette S (2002b) Asymptotic pairs in positive-entropy systems. Ergodic Theory Dyn Syst 22:671–686
Bourgain J (1989) Pointwise ergodic theorems for arithmetic sets. With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein. Inst Hautes Études Sci Publ Math 69:5–45
Bourgain J (1990) Double recurrence and almost sure convergence. J Reine Angew Math 404:140–161
Bowen R (1971) Entropy for group endomorphisms and homogeneous spaces. Trans Am Math Soc 153:401–414
Bronstein IU (1979) Extensions of minimal transformation groups. Martinus Nijhoff Publications, The Hague
Buczolich Z, Mauldin RD (2010) Divergent square averages. Ann of Math 171(2):1479–1530
Cadre B, Jacob P (2005) On pairwise sensitivity. J Math Anal Appl 309:375–382
Cai F, Shao S (2019) Topological characteristic factors along cubes of minimal systems. Discrete Contin Dyn Syst 39(9):5301–5317
Candela P (2017a) Notes on nilspaces: algebraic aspects. Discrete Anal 15
Candela P (2017b) Notes on compact nilspaces. Discrete Anal 16
Cánovas JS (2004) Topological sequence entropy of interval maps. Nonlinearity 17(1):49–56
Chu Q, Frantzikinakis N (2012) Pointwise convergence for cubic and polynomial ergodic averages of non-commuting transformations. Ergodic Theory Dyn Syst 32:877–897
Conze J, Lesigne E (1984) Théorèmes ergodiques pour des mesures diagonales. (French) [Ergodic theorems for diagonal measures]. Bull Soc Math France 112:143–175
Conze J, Lesigne E (1987) Sur un théorème ergodique pour des mesures diagonales. (French) [On an ergodic theorem for diagonal measures]. Probabilités, 1–31, Publ. Inst. Rech. Math. Rennes, 1987-1, Université Rennes I, Rennes, 1988
Conze J, Lesigne E (1988) Sur un théorème ergodique pour des mesures diagonales. (French) [On an ergodic theorem for diagonal measures]. C R Acad Sci Paris Sér I Math 306:491–493
Cornfeld IP, Fomin SV, Sinaĭ YG (1982) Ergodic theory. Fundamental principles of mathematical sciences, vol 245. Springer, New York, p x, 486
Coronel A, Maass A, Shao S (2009) Sequence entropy and rigid σ-algebras. Stud Math 194:207–230
de la Rue T (2012) Joinings in ergodic theory. Mathematics of complexity and dynamical systems, vol 1–3. Springer, New York, pp 796–809
de Vries J (1993) Elements of topological dynamics. Kluwer, Dordrecht
Denker M, Grillenberger C, Sigmund K (1976) Ergodic theory on compact spaces. Springer lecture notes in mathematics, vol 527. Springer, Berlin
Derrien J, Lesigne E (1996) Un théorème ergodique polynomial ponctuel pour les endomorphismes exacts et les K-systèmes. (French) [A pointwise polynomial ergodic theorem for exact endomorphisms and K-systems]. Ann Inst H Poincaré Probab Stat 32:765–778
Devaney RL (1989) An introduction to chaotic dynamical systems. Addison-Wesley Publishing Company Advanced Book Program, Redwood City
Dinaburg EI (1971) A connection between various entropy characterizations of dynamical systems. (Russian) Izv Akad Nauk SSSR Ser Mat 35:324–366
Dong P, Shao S, Ye X (2012) Product recurrent properties, disjointness and weak disjointness. Israel J Math 188:463–507
Dong P, Donoso S, Maass A, Shao S, Ye X (2013) Infinite-step nilsystems, independence and complexity. Ergodic Theory Dyn Syst 33:118–143
Donoso S, Shao S (2017) Uniformly rigid models for rigid actions. Stud Math 236:13–31
Donoso S, Sun W (2016) A pointwise cubic average for two commuting transformations. Israel J Math 216:657–678
Donoso S, Sun W (2018a) Pointwise multiple averages for systems with two commuting transformations. Ergodic Theory Dyn Syst 38(6):2132–2157
Donoso S, Sun W (2018b) Pointwise convergence of some multiple ergodic averages. Adv Math 330:946–996
Dou D, Ye X, Zhang G (2006) Entropy sequences and maximal entropy sets. Nonlinearity 19:53–74
Downarowicz T (2011) Entropy in dynamical systems. New mathematical monographs, vol 18. Cambridge University Press, Cambridge
Downarowicz T (2014) Positive topological entropy implies chaos DC2. Proc Am Math Soc 142(1):137–149
Downarowicz T, Ye X (2002) When every point is either transitive or periodic. Colloq Math 93:137–150
Eberlein E (1975) On topological entropy of semigroups of commuting transformations. In: International conference on dynamical systems in mathematical physics, Rennes, 1975, pp 17–62
Ellis R (1969) Lectures on topological dynamics. W. A. Benjamin, Inc., New York
Ellis R (1973) The Veech structure theorem. Trans Am Math Soc 186:203–218
Ellis R, Gottschalk W (1960) Homomorphisms of transformation groups. Trans Am Math Soc 94:258–271
Ellis R, Keynes H (1971) A characterization of the equicontinuous structure relation. Trans Am Math Soc 161:171–181
Ellis R, Keynes H (1972) Bohr compactifications and a result of Følner. Israel J Math 12:314–330
Ellis R, Nerurkar M (1988) Enveloping semigroup in ergodic theory and a proof of Moore’s ergodicity theorem. Dynamical systems (College Park, MD, 1986–87). Lecture notes in mathematics, 1342. Springer, Berlin, pp 172–179
Ellis R, Nerurkar M (1989) Weakly almost periodic flows. Trans Am Math Soc 313:103–119
Ellis R, Glasner S, Shapiro L (1975) Proximal-isometric flows. Adv Math 17:213–260
Erdös P, Graham RL (1980) Old and new problems and results in combinatorial number theory, Monographies de L’Enseignement Mathématique [Monographs of L’Enseignement Mathématique], vol 28. Université de Genève, L’Enseignement Mathématique, Geneva
Erdös P, Turán P (1936) On some sequences of integers. J Lond Math Soc 11:261–264
Fayad B, Kanigowski A (2016) Multiple mixing for a class of conservative surface flows. Invent Math 203(2):555–614
Ferenczi S, Kułaga-Przymus J, Lemańczyk M (2018) Sarnak’s conjecture: what’s new. Lecture Notes Math 2213:163–235
Følner E (1954) Generalization of a theorem of Bogoliuboff to topological abelian groups. Math Scand 2:5–19
Frantzikinakis N (2004) The structure of strongly stationary systems. J Anal Math 93:359–388
Frantzikinakis N (2017) Ergodicity of the Liouville system implies the Chowla conjecture. Discrete analysis paper no. 19, 41 pp
Frantzikinakis N, McCutcheon R (2012) Ergodic theory: recurrence. Mathematics of complexity and dynamical systems, vol 1–3. Springer, New York, pp 357–368
Fuhrmann G, Glasner E, Jäger T, Oertel C (2018) Irregular model sets and tame dynamics. arXiv:1811.06283
Furstenberg H (1963) The structure of distal flows. Am J Math 85:477–515
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Syst Theory 1:1–49
Furstenberg H (1977) Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J Anal Math 31:204–256
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. M. B. Porter lectures. Princeton University Press, Princeton
Furstenberg H (1982) IP-systems in ergodic theory. Conference in modern analysis and probability (New Haven, Conn., 1982). Contemporary Mathematics 26, American Mathematical Society, Providence, 1984, pp 131–148
Furstenberg H (1988) Nonconventional ergodic averages. The legacy of John von Neumann (Hempstead, NY, 1988). In: Proceedings of Symposia in Pure Mathematics, vol 50, American Mathematical Society, Providence, 1990, pp 43–56
Furstenberg H (1991) Recurrent ergodic structures and Ramsey theory. In: Proceedings of the International Congress of Mathematicians, vol I, II (Kyoto, 1990). Mathematical Society of Japan, Tokyo, 1991, pp 1057–1069
Furstenberg H (2010) Ergodic structures and non-conventional ergodic theorems. International Congress of Mathematicians
Furstenberg H, Weiss B (1977) The finite multipliers of infinite ergodic transformations. The structure of attractors in dynamical systems. In: Proceedings of the conference on North Dakota State University, Fargo, N.D., 1977). Lecture notes in mathematics, 668. Springer, Berlin, 1978, pp 127–132
Furstenberg H, Weiss B (1978) Topological dynamics and combinatorial number theory. J Anal Math 34:61–85
Furstenberg H, Weiss B (1996) A mean ergodic theorem for $\frac{1}{N} \sum_{n=1}^{N} f(T^n x) g(T^{n^2} x)$. In: Convergence in ergodic theory and probability (Columbus, OH, 1993). Ohio State University Mathematical Research Institute Publications, 5. de Gruyter, Berlin, pp 193–227
García-Ramos F (2017) Weak forms of topological and measure theoretical equicontinuity: relationships with discrete spectrum and sequence entropy. Ergodic Theory Dyn Syst 37:1211–1237
García-Ramos F, Jäger T, Ye X (2019) Mean equicontinuity, almost automorphy and regularity, preprint
Glasner S (1976) Proximal flows. Lecture notes in mathematics, vol 517. Springer, Berlin/New York
Glasner E (1994) Topological ergodic decompositions and applications to products of powers of a minimal transformation. J Anal Math 64:241–262
Glasner E (1996) Structure theory as a tool in topological dynamics. In: Descriptive set theory and dynamical systems (Marseille-Luminy, 1996), 173–209. London Mathematical Society lecture note series, vol 277. Cambridge University Press, Cambridge, 2000
Glasner E (1997) A simple characterization of the set of μ-entropy pairs and applications. Israel J Math 102:13–27
Glasner E (1998) On minimal actions of Polish groups. Topology Appl 117:259–272
Glasner E (2003) Ergodic theory via Joinings. Mathematical surveys and monographs, 101. American Mathematical Society, Providence
Glasner E (2018) The structure of tame minimal dynamical systems for general groups. Invent Math 211(1):213–244
Glasner S, Maon D (1989) Rigidity in topological dynamics. Ergodic Theory Dyn Syst 9:309–320
Glasner S, Weiss B (1983) Minimal transformations with no common factor need not be disjoint. Israel J Math 45:1–8
Glasner E, Weiss B (1994) Strictly ergodic, uniform positive entropy models. Bull Soc Math France 122(3):399–412
Glasner E, Weiss B (1995a) Quasi-factors of zero-entropy systems. J Am Math Soc 8(3):665–686
Glasner E, Weiss B (1995b) Topological entropy of extensions. In: Ergodic theory and its connections with harmonic analysis (Alexandria, 1993). London Mathematical Society lecture note series, vol 205. Cambridge University Press, Cambridge, pp 299–307
Glasner E, Weiss B (2005) On the interplay between measurable and topological dynamics. In: Hasselblatt B, Katok A (eds) Handbook of dynamical systems, vol 1B. North-Holland, Amsterdam, pp 597–648
Glasner E, Weiss B (2015) On doubly minimal systems and a question regarding product recurrence. arXiv:1508.02817
Glasner E, Ye X (2009) Local entropy theory. Ergodic Theory Dyn Syst 29:321–356
Glasner E, Gutman Y, Ye X (2018) Higher order regionally proximal equivalence relations for general minimal group actions. Adv Math 333:1004–1041
Glasner E, Huang W, Shao S, Ye X (2019) Regionally proximal relation of order d along arithmetic progressions and nilsystems, in preparation
Goodman TNT (1971) Relating topological entropy and measure entropy. Bull Lond Math Soc 3:176–180
Goodman TNT (1974) Topological sequence entropy. Proc Lond Math Soc 29:331–350
Goodwyn L (1969) Topological entropy bounds measure-theoretic entropy. Proc Am Math Soc 23:679–688
Gottschalk W (n.d.) Orbit closure decompositions and almost periodic properties. Bull Am Math Soc 50:915–919
Gottschalk W, Hedlund G (1955) Topological dynamics. American Mathematical Society Colloquium publications, vol 36. American Mathematical Society, Providence
Green B, Tao T (2008) The primes contain arbitrarily long arithmetic progressions. Ann Math 167(2):481–547
Green B, Tao T (2010) The Möbius function is strongly orthogonal to nilsequences. Ann Math 175(2):541–566
Gutman Y, Huang Y, Shao S, Ye X (2018) Almost sure convergence of the multiple ergodic average for certain weakly mixing systems. Acta Math Sinica 34:79–90
Gutman Y, Manners F, Varjú P (2019a) The structure theory of Nilspaces III: inverse limit representations and topological dynamics. http://arxiv.org/abs/1605.08950
Gutman Y, Manners F, Varjú P (2019b) The structure theory of Nilspaces II: representation as nilmanifolds. Trans Am Math Soc 371:4951–4992
Gutman Y, Manners F, Varjú P (2020) The structure theory of Nilspaces I. J Anal Math 140:299–369
Haddad K, Ott W (2008) Recurrence in pairs. Ergodic Theory Dyn Syst 28:1135–1143
Halmos PR (1960) Lectures on Ergodic theory. Chelsea Publishing Co, New York, p vii, 101
Halmos P, von Neumann J (1942) Operator methods in classical mechanics II. Ann Math 43:332–350
Hindman N (1974) Finite sums from sequences within cells of a partition of ℕ. J Combin Theory Ser A 17:1–11
Hochman M (2012) On notions of determinism in topological dynamics. Ergodic Theory Dyn Syst 32:119–140
Host B (1991) Mixing of all orders and pairwise independent joinings of systems with singular spectrum. Israel J Math 76:289–298
Host B, Kra B (2001) Convergence of Conze-Lesigne averages. Ergodic Theory Dyn Syst 21:493–509
Host B, Kra B (2002) An odd Furstenberg-Szemerédi theorem and quasi-affine systems. J Anal Math 86:183–220
Host B, Kra B (2005) Nonconventional averages and nilmanifolds. Ann Math 161:398–488
Host B, Kra B (2008) Parallelepipeds, nilpotent groups and Gowers norms. Bull Soc Math France 136(3):405–437
Host B, Kra B (2018) Nilpotent structures in Ergodic theory. Mathematical surveys and monographs, 236. American Mathematical Society, Providence
Host B, Maass A (2007) Nilsystèmes d’ordre deux et parallélépipèdes. Bull Soc Math France 135:367–405
Host B, Kra B, Maass A (2010) Nilsequences and a structure theory for topological dynamical systems. Adv Math 224:103–129
Host B, Kra B, Maass A (2016) Variations on topological recurrence. Monatsh Math 179:57–89
Huang W (2006) Tame systems and scrambled pairs under an abelian group action. Ergodic Theory Dyn Syst 26(5):1549–1567
Huang W (2008) Stable sets and ε-stable sets in positive-entropy systems. Commun Math Phys 279(2):535–557
Huang W, Jin L (2016) Stable sets and mean Li-Yorke chaos in positive entropy actions of bi-orderable amenable groups. Ergodic Theory Dyn Syst 36(8):2482–2497
Huang W, Lu K (2017) Entropy, chaos, and weak horseshoe for infinite-dimensional random dynamical systems. Commun Pure Appl Math 70(10):1987–2036
Huang W, Ye X (2002a) Devaney’s chaos or 2-scattering implies Li-Yorke’s chaos. Topology Appl 117:259–272
Huang W, Ye X (2002b) An explicit scattering, non-weakly mixing example and weak disjointness. Nonlinearity 15:1–14
Huang W, Ye X (2004) Topological complexity, return times and weak disjointness. Ergodic Theory Dyn Syst 24(3):825–846
Huang W, Ye X (2005) Dynamical systems disjoint from any minimal system. Trans Am Math Soc 357(2):669–694
Huang W, Ye X (2006) A local variational relation and applications. Israel J Math 151:237–280
Huang W, Ye X (2009) Combinatorial lemmas and applications to dynamics. Adv Math 220(6):1689–1716
Huang W, Xu L, Yi YF (2015) Asymptotic pairs, stable sets and chaos in positive entropy systems. J Funct Anal 268(4):824–846
Huang W, Shao S, Ye X (2016) Nil Bohr-sets and almost automorphy of higher order. Memoirs of the American Mathematical Society. American Mathematical Society, Providence
Huang W, Li J, Ye X, Zhou XY (2017a) Positive topological entropy and Δ-weakly mixing sets. Adv Math 306:653–683
Huang W, Shao S, Ye X (2017b) Strictly ergodic models and pointwise ergodic averages for cubes. Comm Math Stat 5:93–122
Huang W, Li J, Thouvenot J, Xu L, Ye X (2019a) Bounded complexity, mean equicontinuity and discrete spectrum. Ergodic Theory Dyn Syst. arXiv:1806.02980
Huang W, Lian Z, Shao S, Ye X (2019b) Minimal systems with finitely many ergodic measures, preprint
Huang W, Shao S, Ye X (2019c) Topological correspondence of multiple ergodic averages of nilpotent group actions. J Anal Math. arXiv:1604.07113
Huang W, Shao S, Ye X (2019d) Pointwise convergence of multiple ergodic averages and strictly ergodic models. J Anal Math. arXiv:1406.5930
Huang W, Shao S, Ye X (2019e) An answer to Furstenberg’s problem on topological disjointness. Ergodic Theory Dyn Syst. arXiv:1807.10155
Huang W, Wang Z, Ye X (2019f) Measure complexity and Möbius disjointness. Adv Math 347:827–858
Hulse P (1982) Sequence entropy and subsequence generators. J Lond Math Soc 26:441–450
Hulse P (1986) Sequence entropy relative to an invariant σ-algebra. J Lond Math Soc 33:59–72
Huang W, Ye X (2012) Generic eigenvalues, generic
James J, Koberda T, Lindsey K, Silva CE, Speh P (2008) Measurable sensitivity. Proc Am Math Soc
homomorphisms and weak disjointness. Contemp 136:3549–3559
Math 567:119–143 Jewett RI (1969/1970) The prevalence of uniquely ergodic
Huang W, Li S, Shao S, Ye X (2003) Null systems and systems. J Math Mech 19:717–729
sequence entropy pairs. Ergodic Theory Dyn Syst Kalikow S (1984) Twofold mixing implies threefold
23(5):1505–1523 mixing for rank one transformations. Ergodic Theory
Huang W, Maass A, Ye X (2004) Sequence entropy pairs Dyn Syst 4:237–259
and complexity pairs for a measure. Ann Inst Fourier Kanigowski A, Kułaga-Przymus J, Ulcigrai C (2017) Mul-
(Grenoble) 54(4):1005–1028 tiple mixing and parabolic divergence in smooth area-
Huang W, Shao S, Ye X (2005) Mixing via sequence preserving flows on higher genus surfaces. J Eur Math
entropy. Algebraic and topological dynamics. Contemp Soc. arXiv: 1606.09189
Math 385:101–122 Katok A (1980) Lyapunov exponents, entropy and the
Huang W, Ye X, Zhang G (2006) A local variational periodic orbits for diffeomorphisms. Inst Hautes tudes
principle for conditional entropy. Ergodic Theory Dyn Sci Publ Math 51:137–173
Syst 26(1):219–245 Katok A, Hasselblatt B (1995) Introduction to the modern
Huang W, Ye X, Zhang G (2007) Relative entropy tuples, theory of dynamical systems. In: Encyclopedia of
relative U.P.E. and C.P.E. extensions. Israel J Math mathematics and its applications, vol 54. Cambridge
158:249–283 University Press, Cambridge, p xviii,802
Huang W, Lu P, Ye X (2011a) Measure-theoretical sensi- Katznelson Y (2001) Chromatic numbers of Cayley graphs
tivity and equicontinuity. Israel J Math 183:233–283 on ℤ and recurrence. Combinatorica 21:211–219
Huang W, Ye X, Zhang G (2011b) Local entropy theory for Kerr D, Li H (2007) Independence in topological and C*-
a countable discrete amenable group action. J Funct dynamics. Math Ann 338(4):869–926
Anal 261(4):1028–1082 Kerr D, Li H (2009) Combinatorial independence in mea-
Huang W, Li J, Ye X (2014) Stable sets and mean Li-Yorke surable dynamics. J Funct Anal 256(5):1341–1386
chaos in positive entropy systems. J Funct Anal Kolmogorov AN (1958) A new metric invariant of tran-
266(6):3377–3394 sient dynamical systems and automorphisms in
Parallels Between Topological Dynamics and Ergodic Theory 425
Lebesgue spaces. (Russian) Dokl Akad Nauk SSSR Maass A, Shao S (2007) Structure of bounded topological-
(NS) 119:861–864 sequence-entropy minimal systems. J Lond Math Soc
Kolmogorov AN (1959) Entropy per unit time as a metric 76(3):702–718
invariant of automorphisms. (Russian) Dokl Akad McMahon DC (1976) Weak mixing and a note on a struc-
Nauk SSSR 124:754–755 ture theorem for minimal transformation groups. Ill
Kolyada S, Snoha L (1997) Some aspects of topological J Math 20:186–197
transitivity ł a survey. In: Iteration theory (ECIT 94) McMahon DC (1978) Relativized weak disjointness and
(Opava), Grazer mathematische Berichte, vol 334. relatively invariant measures. Trans Am Math Soc
Karl-Franzens-Universität Graz, Graz, p 3C35 236:225–237
Korner TW (1987) Recurrence without uniform recur- Misiurewicz M (1976) A short proof of the variational
rence. Ergodic Theory Dyn Syst 7:559–566 principle for a ℤnþ action on a compact space.
Kra B (2006) From combinatorics to ergodic theory and back Astérisque 40:227–262
again. International Congress of Mathematicians, vol III. Moothathu TKS (2010) Diagonal points having dense
European Mathematical Society, Zürich, pp 57–76 orbit. Colloq Math 120:127–138
Krieger W (1970/1971) On unique ergodicity. In: Proceed- Moreira J (2017) Monochromatic sums and products in ℕ.
ings of the sixth Berkeley symposium on mathematical Ann Math 185:1069–1090
statistics and probability, vol II: probability theory. Uni- Nemytskiǐ VV (1949) Topological problems in the theory
versity of California Press, Berkeley, 1972, pp 327–346 of dynamical systems, Uspekhi Matematicheskikh
Kronecker L. (n.d.) Naherrungsweise ganzzahlige Nauk, 5. American Mathematical Society translation,
Auflosunglinear Gleichungen, S.-B. Preuss, Akad. no. 103, 1954
Wiss., 1179-93, 1271-99. Werke III (1), pp 47–109 Nemytskiǐ VV, Stepanov VV (1949) Qualitative theory of
Krug E, Newton D (1972) On sequence entropy of automor- differential equations. GITTL Moscow (Russian).
phisms of a Lebesgue space. Z Wahrscheinlicht- Princeton University Press, Princeton, 1960
keitstheorie Verw Gebiete 24:211–214 Nicol M, Petersen K (2012) Ergodic theory: basic exam-
Kułaga-Przymus J, Lemańczyk M (2019) Sarnak’s conjec- ples and constructions. Mathematics of complexity and
ture from the ergodic theory point of view. This volume dynamical systems, vol 1–3. Springer, New York,
Kurka P (2003) Topological and symbolic dynamics. pp 264–287
Cours Spécialisés specialized courses, 11. Société Oprocha P (2010) Weak mixing and product recurrence.
Mathématique de France, Paris Ann Inst Fourier (Grenoble) 60(4):1233–1257
Kushnirenko AG (1967) On metric invariants of entropy Oprocha P (2019) Double minimality, entropy and
type. Russ Math Surv 22:53–61 disjointness with all minimal systems. Discrete Contin
Kwietniak D, Oprocha P (2012) On weak mixing, mini- Dyn Syst 39:263–275
mality and weak disjointness of all iterates. Ergodic Oprocha P, Zhang G (2013) On weak product recurrence and
Theory Dyn Syst 32:1661–1672 synchronization of return times. Adv Math 244:395–412
Lehrer E (1987) Topological mixing and uniquely ergodic Oprocha P, Zhang G (2014) Topological aspects of dynam-
systems. Israel J Math 57:239–255 ics of pairs, tuples and sets. In: Recent progress in
Leibman A (2005) Pointwise convergence of ergodic aver- general topology. III. Atlantis Press, Paris, pp 665–709
ages for polynomial sequences of translations on a Ornstein D (1970) Bernoulli shifts with the same entropy
nilmanifold. Ergodic Theory Dyn Syst 25:201–213 are isomorphic. Adv Math 4:337–352
Lesigne E (1989) Théeorèemes ergodiques pour une trans- Park KK, Siemaszko A (2001) Relative topological Pinsker
lation sur un nilvariété. (French) [Ergodic theorems for factors and entropy pairs. Monatsh Math 134:67–79
a translation on a nilmanifold]. Ergodic Theory Dyn Parry W (1967) Zero entropy of distal and related trans-
Syst 9:115–126 formations. In: Auslander J, Gottschalk W (eds) Topo-
Li J, Qiao YX (2018) Mean Li-Yorke chaos along some logical dynamics. Benjamin, New York
good sequences. Monatsh Math 186(1):153–173 Pinsker MS (1960) Dynamical systems with completely
Li J, Ye X (2016) Recent development of chaos theory in positive or zero entropy. Dokl. Akad. Nauk SSSR
topological dynamics. Acta Math Sinica 32:83–114 133:1025–1026 (Russian); translated as Soviet Math.
Li TY, Yorke J (1975) Period three implies chaos. Am Math Dokl. 1:937–938
Mon 82:985–992 Qiu J, Zhao J (2020) A note on mean equicontinuity. J Dyn
Li J, Tu S, Ye X (2015a) Mean equicontinuity and mean Diff Equat 32:101–116
sensitivity. Ergodic Theory Dyn Syst 35:2587–2612 Qiu J, Zhao J. (n.d.) Null systems in the non-minimal case.
Li J, Yan K, Ye X (2015b) Recurrence properties and Ergodic Theory Dyn Syst. https://www.cambridge.org/
disjointness on the induced spaces. Discrete Contin core/journals/ergodic-theory-and-dynamical-systems/
Dyn Syst 35:1059–1073 article/null-systems-in-the-nonminimal-case/F8AD15
Lindenstrauss E (1995) Lowering topological entropy. 10DBAE43CA94CC0CB5A3DD22DD
J Anal Math 67:231–267 Rado R (1933) Studien zur Kombinatorik (German). Math
Lindenstrauss E (1999) Measurable distal and topological Z 36:424C470
distal systems. Ergodic Theory Dyn Syst 19:1063–1076 Rohlin, VA (1967) Lectures on the entropy theory of trans-
Liu K (2019) D-weakly mixing subset in positive entropy formations with invariant measure. (Russian) Uspehi
actions of a nilpotent group. J Differ Equ 267:525–546 Mat. Nauk 22. 137(5):3–56
426 Parallels Between Topological Dynamics and Ergodic Theory
Rohlin VA, Sinai YG (1961) Construction and properties Veech WA (1970) Point-distal flows. Am J Math
of invariant measurable partitions. Dokl Akad Nauk 92:205–242
SSSR 141:1038–1041. [In Russian] Veech WA (1977) Topological systems. Bull Am Math Soc
Romagnoli PP (2003) A local variational principle for the 83:775–830
topological entropy. Ergodic Theory Dyn Syst Walsh M (2012) Norm convergence of nilpotent ergodic
23:1601–1610 averages. Ann Math 175:1667–1688
Rudolph D (1979) An example of a measure preserving Walters P (1982) An introduction to ergodic theory. Grad-
map with minimal self-joinings, and applications. uate texts in mathematics, 79. Springer, New York
J Anal Math 35:97–122 Wang YP, Chen EC, Zhou XY (2019) Mean Li-Yorke
Rudolph DJ (1990) Fundamentals of measurable dynam- chaos for random dynamical systems. J Diff Equat
ics. Oxford Science publications. The Clarendon Press 267(4):2239–2260
Oxford University Press, New York, p x,168 Ward T (2012) Ergodic theory: interactions with combina-
Ryzhikov VV (1993) Joinings and multiple mixing of the torics and number theory. Mathematics of complexity
actions of finite rank. (Russian) Funktsional. Anal i and dynamical systems, vol 1–3. Springer, New York,
Prilozhen 27: 63–78, 96; translation in Funct. Anal pp 313–326
Appl 27:128–140 Weiss B (1985) Strictly ergodic models for dynamical
Saleski A (1977) Sequence entropy and mixing. J Math systems. Bull Amer Math Soc (NS) 13:143–146
Anal Appl 60:58–66 Weiss B (1989) Countable generators in dynamics – uni-
Sarnak P (2009) Three lectures on the Mobius function versal minimal models. Contemp Math 94:321–326
randomness and dynamics. http://publications.ias.edu/ Weiss B (1995) Multiple recurrence and doubly minimal
sarnak/paper/512 systems. Topological dynamics and applications
Schweizer B, Smital J (1994) Measures of chaos and a (Minneapolis, MN, 1995). Contemporary mathematics,
spectral decomposition of dynamical systems on the 215. American Mathematical Society, Providence,
interval. Trans Am Math Soc 344:737–754 199, pp 189–196
Serafin J (2013) Non-existence of a universal zero-entropy Weiss B (2000) Single orbit dynamics. CBMS regional
system. Israel J Math 194:349–358 conference series in mathematics, 95. American Math-
Shao S, Ye X (2012) Regionally proximal relation of order ematical Society, Providence
d is an equivalence one for minimal systems and a Ye X, Zhang G (2007) Entropy points and applications.
combinatorial consequence. Adv Math 231:1786–1817 Trans Am Math Soc 359:6167–6186
Sinai YJ (1959) On the concept of entropy for a dynamic Ye X, Zhang R (2008) Countable compacta admitting
system. (Russian) Dokl Akad Nauk SSSR homeomorphisms with positive sequence entropy.
124:768–771 J Dyn Diff Equat 20(4):867–882
Snoha L, Ye X, Zhang R (2018) Topology and topological Yu T (2019) Measure-theoretic mean equicontinuity and
sequence entropy. Sci China Math. arXiv:1810.00497 bounded complexity. J Diff Equat 267:6152–6170
Szegedy B (2012) On higher order Fourier analysis. arXiv: Zhang Q (1992) Sequence entropy and mild mixing. Can
1203.2260 J Math 44:215–224
Szemerédi E (1975) On sets of integers containing no Zhang Q (1993) Conditional sequence entropy and mild
k elements in arithmetic progression. Acta Arith mixing extensions. Can J Math 45:429–448
27:199–245 Zhang G (2006) Relative entropy, asymptotic pairs and
Tan F (2011) The set of sequence entropies for graph maps. chaos. J Lond Math Soc 73(1):157–172
Topology Appl 158(3):533–541 Zhou XM, Chen ER, Zhou XY (2017) Relative entropy
Tan F, Ye X, Zhang R (2010) The set of sequence entropies and mean Li-Yorke chaos. Adv Math (China)
for a given space. Nonlinearity 23(1):159–178 46:407–414
Tao T (2008) Norm convergence of multiple ergodic aver- Ziegler T (2005) A non-conventional ergodic theorem for a
ages for commuting transformations. Ergodic Theory nilsystem. Ergodic Theory Dyn Syst 25:1357–1370
Dyn Syst 28:657–688 Ziegler T (2007) Universal characteristic factors and
van derWaerden BL (1927) Beweis einer Baudetschen Furstenberg averages. J Am Math Soc 20:53–97
Vermutung. Nieuw Arch Wisk 15:212–216 Zimmer RJ (1976a) Extensions of ergodic group actions.
Veech WA (1968) The equicontinuous structure relation Ill J Math 20:373–409
for minimal Abelian transformation groups. Am J Math Zimmer RJ (1976b) Ergodic actions with generalized dis-
90:723–732 crete spectrum. Ill J Math 20:555–588
Symbolic Dynamics

Brian Marcus
Department of Mathematics, University of British Columbia, Vancouver, BC, Canada

Article Outline

Glossary
Definition of the Subject
Introduction
Origins of Symbolic Dynamics: Modeling of Dynamical Systems
Shift Spaces and Sliding Block Codes
Shifts of Finite Type and Sofic Shifts
Entropy and Periodic Points
The Conjugacy Problem
Other Coding Problems
Coding for Data Recording Channels
Connections with Information Theory and Ergodic Theory
Higher Dimensional Shift Spaces
Future Directions
Addendum to the Second Edition
Bibliography

Glossary

Almost conjugacy (Section “Other Coding Problems”) A common extension of two shift spaces given by factor codes that are one-to-one almost everywhere.

Automorphism (Section “The Conjugacy Problem”) An invertible sliding block code from a shift space to itself; equivalently, a shift-commuting homeomorphism from a shift space to itself; equivalently, a topological conjugacy from a shift space to itself.

Dimension group (Section “The Conjugacy Problem”) A particular group associated to a shift of finite type. This group, together with a distinguished sub-semigroup and an automorphism, captures many invariants of topological conjugacy for shifts of finite type.

Embedding (Section “Shift Spaces and Sliding Block Codes”) A one-to-one sliding block code from one shift space to another; equivalently, a one-to-one continuous shift-commuting mapping from one shift space to another.

Factor map (Section “Shift Spaces and Sliding Block Codes”) An onto sliding block code from one shift space to another; equivalently, an onto continuous shift-commuting mapping from one shift space to another. Sometimes called Factor Code.

Finite equivalence (Section “Other Coding Problems”) A common extension of two shift spaces given by finite-to-one factor codes.

Full shift (Section “Shift Spaces and Sliding Block Codes”) The set of all bi-infinite sequences over an alphabet (together with the shift mapping). Typically, the alphabet is finite.

Higher dimensional shift space (Section “Higher Dimensional Shift Spaces”) A set of bi-infinite arrays of a given dimension, determined by a collection of finite forbidden arrays. Typically, the alphabet is finite.

Markov partition (Section “Origins of Symbolic Dynamics: Modeling of Dynamical Systems”) A finite cover of the underlying phase space of a dynamical system, which allows the system to be modeled by a shift of finite type. The elements of the cover are closed sets, which are allowed to intersect only on their boundaries.

Measure of maximal entropy (Section “Connections with Information Theory and Ergodic Theory”) A shift-invariant measure of maximal measure-theoretic entropy on a shift space. Its measure-theoretic entropy coincides with the topological entropy of the shift space.

Road problem (Section “Other Coding Problems”) A recently-solved classical problem in symbolic dynamics, graph theory and automata theory.
Systems” reviews the roots of symbolic dynamics in modeling of dynamical systems. Section “Shift Spaces and Sliding Block Codes” lays the foundation by defining the kinds of spaces and mappings considered in the subject. Section “Shifts of Finite Type and Sofic Shifts” focuses on distinguished special classes of spaces, known as shifts of finite type and sofic shifts. Section “Entropy and Periodic Points” introduces the most fundamental invariants, periodic points and topological entropy. Sections “The Conjugacy Problem” and “Other Coding Problems” survey progress on the conjugacy problem and other classification/coding problems for shifts of finite type and sofic shifts. In section “Coding for Data Recording Channels,” we present applications to coding for data recording. Section “Connections with Information Theory and Ergodic Theory” provides a link with information theory and ergodic theory. Finally, section “Higher Dimensional Shift Spaces” treats higher dimensional symbolic dynamics.

While this article covers many of the most important topics in the subject, others have been omitted or treated lightly, due to space limitations. These include one-sided shift spaces, countable state symbolic systems, orbit equivalence, flow equivalence, the automorphism group, cellular automata, and substitution systems. References to work in these subareas can be found in the sources mentioned below.

For introductory reading on symbolic dynamics and its applications, beyond this article, one can consult the textbooks Kitchens (1998) and Lind and Marcus (1995). There are also excellent introductory survey articles, such as Boyle (1993), Lind and Schmidt (2002), and S. Williams (2004a). In addition, there are very good expositions which focus on other aspects of the subject. These include Béal (1993), which focuses on connections between symbolic dynamics and automata theory, the lecture notes Marcus-Roth-Siegel (1998), which focus on constrained coding applications, and Immink (2004a), which focuses on applications to data storage. There are also several excellent collections of articles on special areas of the subject, such as Blanchard et al. (2000), Walters (1992), Williams (2004b). The book (1985) by Berstel and Perrin treats the subject of variable length codes and contains many ideas related to symbolic dynamics. Finally, there is an excellent recent survey by Boyle on Open Problems in Symbolic Dynamics (Boyle 2007).

Origins of Symbolic Dynamics: Modeling of Dynamical Systems

Symbolic dynamics began as an effort to model dynamical systems using sequences of symbols. A dynamical system is a pair (X, T) where X is a set and T is a transformation from X to itself. For definiteness, we assume that T is invertible, although this is not a necessary restriction. Since T maps X to itself, we can iterate T: $T^2 = T \circ T$, $T^3 = T \circ T \circ T$, etc. The orbit of a point $x \in X$ is the sequence of points: $\ldots, T^{-2}(x), T^{-1}(x), x, T(x), T^2(x), \ldots$ In the theory of dynamical systems, one asks questions about orbits such as the following: Are there periodic orbits (i.e., x such that $T^n(x) = x$ for some n > 0)? Are there dense orbits (the orbit of x is dense if for any point y in X, $T^n(x)$ is “close” to y for some n)? How does the behavior of an orbit vary with x? How can we describe the collection of all orbits of the dynamical system? When is the dynamical system “chaotic”? For more information on dynamical systems, we refer the reader to (Blanchard et al. 2004; Devaney 1987; Hasselblatt and Katok 1995).

The subject of dynamical systems has its roots in Classical Mechanics; in that setting, X is the set of all possible states of a system (e.g., the positions and momenta of all particles in a physical system), and the transformation T is the time evolution map, which maps the state of the system at one time to the state of the system one time unit later.

Symbolic dynamics provides a model for the orbits of a dynamical system (X, T) via a space of sequences. This is done by “quantizing” X into cells, associating symbols to the cells and representing points as bi-infinite sequences of symbols. For instance, in Fig. 1, X is a square, and T is some transformation of the square.
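As a small illustration of orbits and of “quantizing” a phase space into cells, the following sketch uses a circle rotation $T(x) = x + \alpha \pmod 1$ with two cells, $[0, 1/2)$ and $[1/2, 1)$. The choice of map, the partition, and the function names are assumptions made here for illustration only; they are not the system of Fig. 1.

```python
from fractions import Fraction

def rotate(x, alpha, n=1):
    """Apply T^n for the circle rotation T(x) = x + alpha (mod 1).
    T is invertible, so n may be negative."""
    return (x + n * alpha) % 1

def symbol(x):
    """Quantize [0, 1) into two cells: symbol 0 for [0, 1/2), symbol 1 for [1/2, 1)."""
    return 0 if x < Fraction(1, 2) else 1

def itinerary(x, alpha, k):
    """Symbols of T^n(x) for n = -k, ..., k: a finite window of the
    bi-infinite symbol sequence that represents the point x."""
    return [symbol(rotate(x, alpha, n)) for n in range(-k, k + 1)]

alpha = Fraction(1, 4)   # rational rotation, so every orbit is periodic
x = Fraction(1, 8)
window = itinerary(x, alpha, 4)
```

With these (assumed) parameters the point x satisfies $T^4(x) = x$, so its symbol sequence is periodic; an irrational $\alpha$ would instead give orbits with no periodic points.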
problem in dynamical systems leads to a coding problem between constrained sets of sequences.

We have described dynamical systems as the discrete-time iteration of a single mapping. However, continuous-time iterations have been studied since the inception of the subject. These are known as continuous-time flows, with the main example being the set of solutions to a system of ordinary differential equations. Indeed, the work of Hedlund and Morse mentioned above was done in this context.

Shift Spaces and Sliding Block Codes

Let $\mathcal{A}$ be an alphabet of symbols, which we assume to be finite. The principal objects of study in symbolic dynamics are certain kinds of collections of sequences of symbols from $\mathcal{A}$. Typically, these sequences are infinite, $x = x_0x_1x_2\ldots$, but it is often more convenient to deal with bi-infinite sequences $x = \ldots x_{-2}x_{-1}x_0x_1x_2\ldots$ For some problems, the results are similar in the infinite and bi-infinite categories, while for other problems, they are quite different. In this article, we focus on the bi-infinite setting.

The symbol $x_i$ is the ith coordinate of x. When writing a specific sequence, we need to specify which is the 0th coordinate. As suggested in section “Origins of Symbolic Dynamics: Modeling of Dynamical Systems,” this is done with a decimal point to separate the $x_i$ with $i \ge 0$ from those with $i < 0$: $x = \ldots x_{-2}x_{-1}.x_0x_1x_2\ldots$ A block or word over $\mathcal{A}$ is a finite sequence of symbols from $\mathcal{A}$. A block of length N is called an N-block. For blocks u, v, the block uv is the concatenation of u and v, and for a block w, the concatenation of N copies of w is denoted $w^N$.

The full $\mathcal{A}$-shift $\mathcal{A}^{\mathbb{Z}}$ is the set of all bi-infinite sequences of symbols from $\mathcal{A}$. The full r-shift is the full shift over the alphabet $\{0, 1, \ldots, r-1\}$. The shift map $\sigma$ on a full shift maps a point x to the point $y = \sigma(x)$ whose ith coordinate is $y_i = x_{i+1}$.

The orbit of a point in a full shift is its orbit under the shift map. The full shift contains many different types of orbits. For instance, it contains a dense orbit (namely, any sequence which contains every block in the alphabet) and periodic orbits (namely, any sequence which is periodic).

We are interested in sets that can be specified by a list (finite or infinite) of forbidden blocks. Namely, given a collection $\mathcal{F}$ of “forbidden blocks” over $\mathcal{A}$, the subset X consisting of all sequences in $\mathcal{A}^{\mathbb{Z}}$, none of whose subwords belong to $\mathcal{F}$, is called a shift space (or simply shift), and we write $X = X_{\mathcal{F}}$. When a shift space X is contained in a shift space Y, we say that X is a subshift of Y.

Example 1 X is the set of all binary sequences with no two 1’s next to each other. Here $X = X_{\mathcal{F}}$, where $\mathcal{F} = \{11\}$. This shift is called the golden mean shift, for reasons that will become apparent later.

Example 2 X is the set of all binary sequences so that between any two 1’s there are an even number of 0’s. We can take for $\mathcal{F}$ the collection

$\{10^{2n+1}1 : n \ge 0\}.$

This example is naturally called the even shift.

Example 3 X is the set of all binary sequences such that between any two successive 1’s, the number of 0’s is prime. We can take for $\mathcal{F}$ the collection

$\{10^n1 : n \text{ is composite}\}.$

This example is naturally called the prime shift.

Alternatively (and equivalently), shift spaces can be defined as closed, shift-invariant subsets of full shifts. Here, “closed” means with respect to a metric, for which two points are close if they agree in a large “central block”; one such metric is $\rho(x, y) = 2^{-k}$ if $x \ne y$, with k maximal such that $x_{[-k,k]} = y_{[-k,k]}$ (with the conventions that $\rho(x, y) = 0$ if $x = y$ and $\rho(x, y) = 2$ if $x_0 \ne y_0$).

Let X be a subset of a full shift, and let $\mathcal{B}_N(X)$ denote the set of all N-blocks that occur in elements of X. The language of X is $\mathcal{B}(X) = \bigcup_N \mathcal{B}_N(X)$. It can be shown that the language of a shift space determines the shift space uniquely, and so we can equally well describe a shift space by specifying the “occurring” or “allowed” blocks, rather than the forbidden blocks. For
example, the golden mean shift is specified by the language of blocks in which 1’s are isolated. This establishes a connection with automata theory (Aho et al. 1974; Béal 1993), which studies collections of blocks, rather than infinite or bi-infinite sequences. The languages that occur in symbolic dynamics, i.e., as $\mathcal{B}(X)$ for some

$(\beta_N(x))_i = x_{[i,i+N-1]}.$

Then the Nth higher block shift or higher block presentation of a shift space X is the image $X^{[N]} = \beta_N(X)$ in the full shift over $\mathcal{A}^{(N)}$. Similarly, define the Nth higher power code $\gamma_N : X \to (\mathcal{A}^{(N)})^{\mathbb{Z}}$ by
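The forbidden-block description of a shift space, and the higher block map $\beta_N$, can be explored on finite words. The sketch below enumerates the N-blocks of the golden mean shift and the even shift by brute force and applies a finite-window version of $\beta_N$. The function names are hypothetical. It also assumes that every finite word avoiding the forbidden blocks extends to a bi-infinite point; that holds for these two examples (pad with 0’s) but not for arbitrary forbidden lists.

```python
from itertools import product

def avoids(word, forbidden):
    """True if no block in `forbidden` occurs as a subword of `word`."""
    return not any(block in word for block in forbidden)

def blocks(n, forbidden, alphabet="01"):
    """All words of length n over `alphabet` that avoid every forbidden block."""
    words = ("".join(w) for w in product(alphabet, repeat=n))
    return [w for w in words if avoids(w, forbidden)]

def even_shift_forbidden(n):
    """Forbidden blocks 1 0^(2k+1) 1 of length at most n; longer ones
    cannot occur inside a word of length n, so they can be ignored."""
    return ["1" + "0" * (2 * k + 1) + "1" for k in range(n) if 2 * k + 3 <= n]

def higher_block(word, N):
    """Finite-window version of beta_N: the i-th output symbol is the
    N-block of `word` starting at position i."""
    return [word[i:i + N] for i in range(len(word) - N + 1)]

golden = ["11"]                                   # Example 1
golden_counts = [len(blocks(n, golden)) for n in range(1, 7)]
even_3 = blocks(3, even_shift_forbidden(3))       # Example 2, 3-blocks
recoded = higher_block("01001", 2)
```

For the golden mean shift the counts of N-blocks come out as consecutive Fibonacci numbers, which is one concrete way to see the language described above.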
Let X be a shift space over $\mathcal{A}$, and $\Phi : \mathcal{B}_{m+n+1}(X) \to \mathcal{C}$ be a block map. Then the map $\phi : X \to \mathcal{C}^{\mathbb{Z}}$ defined by $y = \phi(x)$, with $y_i$ given by $\Phi$ above, is called the sliding block code with memory m and anticipation n induced by $\Phi$. We will denote the formation of $\phi$ from $\Phi$ by $\phi = \Phi_\infty^{[-m,n]}$, or more simply by $\phi = \Phi_\infty$ if the memory and anticipation of $\phi$ are understood. If not specified, the memory is taken to be 0. If Y is a shift space contained in $\mathcal{C}^{\mathbb{Z}}$ and $\phi(X) \subseteq Y$, we write $\phi : X \to Y$.

In analogy with the characterization of shift spaces as closed shift-invariant sets, sliding block codes can be characterized in a topological manner: namely, as the maps between shift spaces that are continuous and commute with the shift. This result is known as the Curtis-Hedlund-Lyndon theorem (Hedlund 1969).

Example 5 Let $\mathcal{A} = \{0, 1\} = \mathcal{C}$, $X = \mathcal{A}^{\mathbb{Z}}$, m = 0, n = 1, and $\Phi(a_0a_1) = a_0 + a_1 \pmod 2$. Let $\phi = \Phi_\infty : X \to X$.

Example 6 The sliding block code, generated by $\Phi(00) = 1$, $\Phi(01) = 0 = \Phi(10)$, maps the golden mean shift onto the even shift.

Example 7 There is a trivial sliding block code from the full 2-shift into the full 3-shift, generated by $\Phi(0) = 0$, $\Phi(1) = 1$.

If a sliding block code $\phi : X \to Y$ is onto, then $\phi$ is called a factor code or factor map, and Y is a factor of X. If $\phi : X \to Y$ is one-to-one, then $\phi$ is called an embedding of X into Y. The sliding block code in Example 7 is an embedding but not a factor code, while the codes in Examples 5 and 6 are factor maps, but not embeddings.

A major (and unrealistic) goal of symbolic dynamics is to classify in an explicit way shift spaces up to the following natural notion of equivalence. A sliding block code $\phi : X \to Y$ is a conjugacy (or topological conjugacy) if it is invertible with sliding block inverse. Equivalently, a conjugacy is a bijective sliding block code and therefore simultaneously a factor code and an embedding. If there is a conjugacy from one shift space X to another Y, we say that X and Y are conjugate, denoted $X \cong Y$.

As an example, the higher block map $\beta_N$ is a conjugacy between a shift space X and its higher block shift $X^{[N]}$. Via this code, we can “re-code” any sliding block code as a 1-block code (though typically a conjugacy and its inverse cannot, by this artifice, be simultaneously re-coded to 1-block codes).

In this section, we have given examples of relatively simple sliding block codes. But the typical conjugacy, as well as factor code and embedding, can be much more complicated.

Shifts of Finite Type and Sofic Shifts

A shift of finite type (SFT) is a shift space that can be described by a finite set of forbidden blocks, i.e., a shift space X having the form $X_{\mathcal{F}}$ for some finite set $\mathcal{F}$ of blocks. The terminology shift of finite type (or subshift of finite type) comes from dynamical systems (Smale 1967).

An SFT is M-step (or has memory M) if it can be described by a collection of forbidden blocks all of which have length M + 1. It is easy to see that any SFT is M-step for some M.

Since any shift space can be defined by many different collections of forbidden blocks, it is useful to have the following equivalent condition expressed in terms of allowed blocks: an SFT is M-step if and only if whenever u is an allowed block of length at least M, $u'$ is the suffix of u with length M and a is a symbol, then ua is allowed if and only if $u'a$ is allowed. In other words, in order to tell whether a symbol can be allowably concatenated to the end of an allowed word u, one need only look at the last M symbols of u. This is analogous to the “finite memory” property of M-step Markov chains.

The golden mean shift X is a 1-step SFT, since it was defined by a forbidden list consisting of exactly one block: $\mathcal{F} = \{11\}$. Equivalently, it is only the last symbol of an allowed block that determines whether a given symbol can be concatenated at the end. In contrast, the even shift is not an SFT: for any M, the symbol 1 can
be concatenated to the end of exactly one of the (allowed) words $10^M$ and $10^{M+1}$.

Recall that the higher block code $\beta_M$ is a conjugacy from $X$ to $X^{[M]}$. Via this code, any SFT can be recoded to a 1-step SFT. And so, any sliding block code on a shift space can be recoded to a 1-block code on a 1-step SFT. It is useful to have a concrete description of 1-step SFT’s. In fact, these are precisely the shift spaces consisting of all bi-infinite sequences of vertices along paths on a finite directed graph. These are called vertex shifts. We find it more convenient to work with sequences of edges instead. To be precise:

Let $G$ be a finite directed graph (or simply graph) with vertices (or states) $\mathcal{V} = \mathcal{V}(G)$ and edges $\mathcal{E} = \mathcal{E}(G)$. For an edge $e$, $i(e)$ denotes the initial state and $t(e)$ the terminal state. A path in $G$ is a finite sequence of edges in $G$ such that the terminal state of an edge coincides with the initial state of the following edge; a cycle in $G$ is a path that begins and ends at the same state. We will assume that $G$ is essential, i.e., that every state has at least one outgoing edge and one incoming edge. The adjacency matrix $A = A(G)$ is the matrix indexed by $\mathcal{V}$ with $A_{IJ}$ equal to the number of edges in $G$ with initial state $I$ and terminal state $J$. Since a graph and its adjacency matrix essentially determine the same information, we will frequently associate a graph $G$ with its adjacency matrix $A$ and a nonnegative integer matrix $A$ with a graph $G$.

The edge shift $X_G$ or $X_A$ is the shift space over the alphabet $\mathcal{A} = \mathcal{E}$ defined by

$$X_G = X_A = \{ x = (x_i)_{i \in \mathbb{Z}} \in \mathcal{E}^{\mathbb{Z}} : \text{each } x_{i+1} \text{ follows } x_i \}.$$

It can be readily verified that edge shifts are 1-step SFT’s. While edge shifts do not include all 1-step SFT’s, any 1-step SFT can be recoded to an edge shift, and, compared with vertex shifts, edge shifts offer the advantage of a more compact description.

For many purposes, one can study a general shift space $X$ by breaking it into smaller, more well-behaved pieces. A shift space is irreducible if whenever $u$ and $w$ are allowed blocks, there is a “connecting” block $v$ such that $uvw$ is allowed. While shift spaces do not always decompose into disjoint unions of irreducible shifts, every SFT can be written as a finite disjoint union of irreducible SFT’s $X_i$ together with “transient” one-way connections from one $X_i$ to another. And irreducible edge shifts can be characterized in a particularly concrete form: namely, $X_G$ is irreducible if and only if $G$ is irreducible, i.e., for every ordered pair of vertices $I$ and $J$ there is a path in $G$ starting at $I$ and ending at $J$.

There is a stronger notion which is defined by a uniformity condition on the length of the connecting block. A shift space is mixing if whenever $u$ and $w$ are allowed blocks, there is an $N$, possibly depending on $u$ and $w$, such that for all $n \ge N$, there is a block $v$ of length $n$ such that $uvw$ is allowed. And an edge shift $X_G$ is mixing if and only if $G$ is primitive, i.e., there is an integer $N$ such that for any $n \ge N$ and any ordered pair of vertices $I$ and $J$, there is a path in $G$ of length $n$ starting at $I$ and terminating at $J$. It follows that for SFT’s in the definition of mixing, the uniform connecting length $N$ can be chosen independent of the allowed blocks $u$ and $w$.

It can be shown that, in some sense, any irreducible SFT $X$ can be broken down into a union of disjoint maximal mixing shifts; namely, $X$ can be written as the disjoint union of finitely many sets $X_i$, $i = 0, \ldots, p-1$, such that $\sigma(X_i) = X_{i+1 \bmod p}$ and for each $i$, $\sigma^p$ restricted to $X_i$ can be regarded as a mixing SFT. This is a consequence of Perron-Frobenius theory, upon which symbolic dynamics relies heavily; see Seneta (1980) for an introduction to this theory.

A sofic shift is the set of bi-infinite sequences obtained from a finite labeled directed graph $\mathcal{G} = (G, \mathcal{L})$; here, $G$ is a finite directed graph and $\mathcal{L}$ is a labeling of the edges of $G$. The labeled graph is often called a presentation of the sofic shift. The golden mean shift and even shift are sofic, with presentations given in Figs. 3 and 4. SFT’s are sofic because any $M$-step SFT can be presented by a graph whose states are allowed $M$-blocks. Note also that any sofic shift is a factor of an SFT, namely, via a (1-block) factor code $\mathcal{L}_\infty$ on the edge shift $X_G$ based on a presentation $(G, \mathcal{L})$. In fact, the converse is true, and so, the
Morse shift with the full 2-shift) with positive entropy but no periodic points at all.

However, for irreducible SFT’s and sofic shifts, the entropy $h(X)$ can be recovered from the sequence $p_n(X)$. The key to understanding this is the fact that for an edge shift $X = X_A$, $p_n(X) = \operatorname{tr}(A^n)$ and thus is the sum of the $n$th powers of the (non-zero) eigenvalues of $A$; in the case that $A$ is primitive, the largest eigenvalue $\lambda_A$ strictly dominates the other eigenvalues, and thus for large $n$,

$$\log p_n(X) = \log \operatorname{tr}(A^n) \sim n \log \lambda_A \sim n\,h(X).$$

This shows that for a mixing SFT, the entropy equals the growth rate of numbers of periodic points. In fact, this result applies to all SFT’s and sofic shifts.

Theorem 4 For a sofic shift $X$,

$$\limsup_{n \to \infty} \frac{1}{n} \log p_n(X) = h(X).$$

The limsup turns out to be a limit in the case that $X$ is a mixing sofic shift.

Most of what we have stated for $p_n(X)$ here applies equally well to $q_n(X)$, the number of points of least period $n$ in $X$. This follows from the fact that “most” periodic points of period $n$ have least period $n$.

The periodic point information can be conveniently combined into a single invariant, known as the zeta function. For a shift space $X$,

$$\zeta_X(t) = \exp\left( \sum_{n=1}^{\infty} \frac{p_n(X)}{n}\, t^n \right).$$

For an edge shift $X_A$, one computes the zeta function to be the reciprocal of a polynomial:

$$\zeta_{X_A}(t) = \frac{1}{t^r \chi_A(t^{-1})} = \frac{1}{\det(I - tA)},$$

which is completely determined by the non-zero eigenvalues (with multiplicity) of $A$ (Bowen and Lanford 1970). As an example, the zeta function of the golden mean shift is $1/(1 - t - t^2)$.

The discussion above applies equally well to SFT’s since they are conjugate to edge shifts. For a sofic shift, the zeta function turns out to be a rational function, i.e., quotient of two polynomials. This can be shown by analyzing properties of a right resolving presentation of the sofic shift. From this it turns out that the zeta function of the even shift is $(1 + t)/(1 - t - t^2)$. The technique for computing zeta functions of sofic shifts was developed by Manning (1971) (actually, Manning developed the technique to compute zeta functions of hyperbolic dynamical systems).

So, for sofic shifts, all of the periodic point information is determined by a finite collection of complex numbers, namely, the zeros and poles of the zeta function.

Finally, we mention another simple invariant obtained from the periodic points. The period, $\operatorname{per}(X)$, of a shift space $X$ is the gcd of lengths of periodic points in $X$, i.e., the gcd of the set of $n$ such that $p_n(X) \ne 0$. If $X = X_G$ is an edge shift and $G$ is irreducible, then $\operatorname{per}(X_G) = \operatorname{per}(G)$, which is defined to be the gcd of cycle lengths in $G$ and coincides with the gcd of the lengths of all cycles in $G$ based at any given state. For an irreducible graph with period $p$ and any state $I$, the number of cycles of length $N$, a multiple of $p$, at $I$ grows like $\lambda_{A(G)}^N = 2^{N h(X_G)}$. Also, an irreducible graph $G$ is primitive if $\operatorname{per}(G) = 1$.

The Conjugacy Problem

The conjugacy problem for SFT’s and sofic shifts is a major open problem. After much effort, it remains unsolved today. Much of what we know goes back to R. Williams (1973/74) in the 1970s. One of Williams’ main results was that any conjugacy can be decomposed into simple building blocks, as follows.

Let $A$ and $B$ be nonnegative integral matrices, with associated graphs $G$ and $H$. An elementary equivalence from $A$ to $B$ is a pair $(R, S)$ of rectangular nonnegative integral matrices satisfying
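The relations $p_n(X_A) = \operatorname{tr}(A^n)$, $h(X_A) = \log \lambda_A$, and $\zeta_{X_A}(t) = 1/\det(I - tA)$ from the section on entropy and periodic points can be checked numerically. The sketch below assumes NumPy and takes the matrix $A$ with rows $(1, 1)$ and $(1, 0)$ as a presentation of the golden mean shift (an assumption made for illustration; the function names are hypothetical). Logarithms are taken base 2, matching the growth rate $2^{N h(X_G)}$ used in the text.

```python
import numpy as np

# Assumed presentation of the golden mean shift by its adjacency matrix.
A = np.array([[1, 1],
              [1, 0]])

def periodic_point_counts(A, n_max):
    """Return [p_1, ..., p_n_max], where p_n = tr(A^n)."""
    counts = []
    P = np.eye(A.shape[0], dtype=np.int64)
    for _ in range(n_max):
        P = P @ A
        counts.append(int(np.trace(P)))
    return counts

def entropy(A):
    """h(X_A) = log2 of the largest eigenvalue modulus of A."""
    return float(np.log2(np.max(np.abs(np.linalg.eigvals(A)))))

def zeta_series(A, t, n_max=60):
    """Truncation of exp(sum_n p_n t^n / n); valid only for small |t|."""
    p = periodic_point_counts(A, n_max)
    return float(np.exp(sum(p[n - 1] * t**n / n for n in range(1, n_max + 1))))

def zeta_closed_form(A, t):
    """1 / det(I - tA)."""
    return float(1.0 / np.linalg.det(np.eye(A.shape[0]) - t * A))

print(periodic_point_counts(A, 6))                   # tr(A^n) for n = 1..6
print(entropy(A))                                    # log2 of the golden mean
print(zeta_series(A, 0.1), zeta_closed_form(A, 0.1))
```

At $t = 0.1$ both zeta evaluations agree with $1/(1 - t - t^2)$ up to rounding, since the series converges for $|t| < 1/\lambda_A$.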
In fact, Williams showed that any conjugacy can be decomposed into a composition of

Symbolic Dynamics, Fig. 5 A state splitting

Theorem 6 R. Williams (1973/74) Every conjugacy from one edge shift to another is the
composition of splitting codes and amalgamation codes.

This classification for edge shifts naturally extends to SFT’s since every SFT is conjugate to an edge shift. It also extends to sofic shifts, and we describe this in the context of irreducible sofic shifts.

Recall that an irreducible sofic shift has a unique minimal right resolving presentation. Any labeled graph can be completely described by a symbolic adjacency matrix, which records the transitions (edges) in the underlying physical graph, as well as the labels of the edges. Namely, the symbolic adjacency matrix is indexed by the states of the underlying graph and the $(I, J)$-entry is the formal sum of the labels of edges from $I$ to $J$. It turns out that the notions of elementary equivalence, and hence strong shift equivalence, can be extended to more general categories, in particular to symbolic adjacency matrices.

Theorem 7 Krieger (1984), Nasu (1986) Let $X$ and $Y$ be irreducible sofic shifts. Let $A$ and $B$ be the symbolic adjacency matrices of the minimal right-resolving presentations of $X$ and $Y$, respectively. Then $X$ and $Y$ are conjugate if and only if $A$ and $B$ are strong shift equivalent.

The classification, provided by these results, would be of limited use if the story ended here. Fortunately, Williams showed that strong shift equivalence yields a strong, delicate, and somewhat computable necessary condition for conjugacy.

Let $A$ and $B$ be nonnegative integral matrices and $\ell \ge 1$. A shift equivalence of lag $\ell$ is a pair $(R, S)$ of rectangular nonnegative integral matrices satisfying the shift equivalence equations

$$AR = RB, \quad SA = BS, \quad A^\ell = RS, \quad B^\ell = SR.$$

We denote this situation by $(R, S) : A \sim B$ (lag $\ell$). We say that $A$ is shift equivalent to $B$, written $A \sim B$, if there is a shift equivalence from $A$ to $B$ of some lag. It is not hard to see that an elementary equivalence is a shift equivalence of lag 1 and that shift equivalence is an equivalence relation. It follows that:

Theorem 8 Williams (1973/74) Strong shift equivalence implies shift equivalence. More precisely, if $A \approx B$ (lag $\ell$), then $A \sim B$ (lag $\ell$).

Recall from section “Entropy and Periodic Points” that for an edge shift $X_A$, the set of nonzero eigenvalues, with multiplicity, of $A$ determines the zeta function and hence this set is an invariant of conjugacy. Using the shift equivalence equations, one can show more: the entire Jordan form corresponding to the nonzero eigenvalues is an invariant. This information depends only on properties of the adjacency matrix considered as a linear transformation (over $\mathbb{R}$ or $\mathbb{Q}$). However, $A$ is a nonnegative, integral matrix and both nonnegativity and integrality provide substantially more information. One such invariant that follows from shift equivalence and makes use of integrality is the Bowen-Franks group (Bowen and Franks 1977), $BF(A) = \mathbb{Z}^r / \mathbb{Z}^r (I - A)$.

Until recently all of the information contained in known conjugacy invariants, such as those above, was subsumed in shift equivalence. And Kim and Roush showed that shift equivalence is decidable (Kim and Roush 1979, 1988), meaning that there is a finite decision procedure via a Turing machine that decides whether the adjacency matrices of two given edge shifts, and therefore of two given SFT’s, are shift equivalent (they also showed that a notion of shift equivalence for sofic shifts, formulated by Boyle and Krieger (1986), is decidable (Kim and Roush 1990)). So, a central focus of the subject was the question: is shift equivalence a complete invariant of conjugacy? The answer turns out to be No, as proven by Kim and Roush (1999); see also the survey article Wagoner (1992). However, it is a complete invariant of a weaker form of conjugacy; we say that $X_A$ and $X_B$ are eventually conjugate if all sufficiently large powers are conjugate. It is not hard to show that if $A \sim B$, then $X_A$ and $X_B$ are eventually conjugate, and the converse is true as well:
Theorem 9 Kim and Roush (1979), Williams (1973/74) Edge shifts $X_A$ and $X_B$ are eventually conjugate if and only if $A$ and $B$ are shift equivalent.

Also the Kim-Roush counterexamples and subsequent work do not bear on a special case of the conjugacy problem: if $A$ is shift equivalent to the $1 \times 1$ matrix $[n]$, is $X_A$ conjugate to the full $n$-shift? This question, known as the little shift equivalence problem, is particularly intriguing because in this case the condition $A \sim [n]$ is simply the statement that $A$ has exactly one non-zero eigenvalue, namely, $n$.

Shift equivalence can be characterized in another way that has turned out to be very useful. Let $A$ be an $r \times r$ integral matrix. Let $\mathcal{R}_A$ denote the real eventual range of $A$, i.e., $\mathcal{R}_A = \mathbb{R}^r A^r$. The dimension group of $A$ is defined:

$$\Delta_A = \{ v \in \mathcal{R}_A : vA^k \in \mathbb{Z}^r \text{ for some } k \ge 0 \}.$$

The dimension group automorphism $\delta_A$ of $A$ is the restriction of $A$ to $\Delta_A$, so that $\delta_A(v) = vA$ for $v \in \Delta_A$. The dimension pair of $A$ is $(\Delta_A, \delta_A)$.

If $A$ is also nonnegative, then we define the dimension semigroup of $A$ to be

$$\Delta_A^+ = \{ v \in \mathcal{R}_A : vA^k \in (\mathbb{Z}^+)^r \text{ for some } k \ge 0 \}.$$

The dimension triple of $A$ is $(\Delta_A, \Delta_A^+, \delta_A)$.

It can be shown that the dimension triple completely characterizes shift equivalence, i.e., two nonnegative integral matrices are shift equivalent if and only if their dimension groups are isomorphic by an isomorphism that preserves the dimension semigroup and intertwines the dimension group automorphisms. Also, by associating equivalence classes of certain subsets of the shift space $X_A$ to elements of $\Delta_A^+$, one can interpret the dimension triple in terms of the action of the shift map on $X_A$ (Boyle et al. 1987; Krieger 1980a). And the dimension triple arises prominently in the study of the automorphism group of an SFT, which we now briefly describe. The dimension group for SFT’s was developed by Krieger (1980a, b).

In many areas of mathematics, objects are studied by means of their symmetries. This holds true in symbolic dynamics, where symmetries are expressed by automorphisms. An automorphism of a shift space $X$ is a conjugacy from $X$ to itself. The set of all automorphisms of a shift space $X$ is a group under composition and is naturally called the automorphism group, denoted $\operatorname{aut}(X)$.

The goals are to understand $\operatorname{aut}(X)$ as a group (What kinds of subgroups does it contain? How “big” is it?) and how it acts on $X$, e.g., given shift-invariant subsets $U$, $V$, such as finite sets of periodic points, when is there an automorphism of $X$ that maps $U$ to $V$? One might hope that the automorphism group would shed new light on the conjugacy problem for SFT’s. Indeed, tools developed to study the automorphism group eventually paved the way for Kim and Roush to find examples of shift equivalent matrices that are not strong shift equivalent. On the other hand, the automorphism group cannot tell the entire story. For instance, $\operatorname{aut}(X_A)$ and $\operatorname{aut}(X_{A^\top})$ are isomorphic, since any automorphism read backwards can be viewed as an automorphism of the transposed shift, yet $X_A$ and $X_{A^\top}$ may fail to be conjugate (for an example due to Kollmer, see p. 81 of Parry and Tuncel (1982)). It is not even known if the automorphism groups of the full 2-shift and the full 3-shift are isomorphic.

A good deal of our understanding of the action of the automorphism group on an SFT comes from understanding its induced representation as an action of the dimension group; this action is known as the dimension representation. For a much more thorough exposition on $\operatorname{aut}(X)$, we refer the reader to Wagoner (1992, 2004).

Other Coding Problems

The difficulties encountered in attempts to solve the conjugacy problem motivated the formulation and study of weaker, but meaningful, notions of equivalence. For instance, we might say that two shift spaces are equivalent if one can be invertibly encoded to the other by some kind of “finite-state machine.” A precise version of this is as follows.
Shift spaces $X$ and $Y$ are finitely equivalent if there is an SFT $W$ together with finite-to-one factor codes $\phi_X : W \to X$ and $\phi_Y : W \to Y$. We call $W$ a common extension, and $\phi_X$, $\phi_Y$ the legs. Here, by “finite-to-one” we mean merely that each point has a finite number of inverse images. It can be shown that any finite-to-one factor code from one shift space to another must preserve entropy, and so entropy is an invariant of finite equivalence. For irreducible sofic shifts, entropy is a complete invariant:

Theorem 10 Parry (1977) Two irreducible sofic shifts are finitely equivalent if and only if they have the same entropy.

Note that from this result and the fact that finite-to-one codes between general shift spaces preserve entropy, for irreducible sofic shifts, we could have just as well defined finite equivalence with the common extension $W$ merely being a shift space. However, if $W$ is an SFT, we get a more concrete coding interpretation as follows. First, we recode $W$ to an edge shift $X_G$ and recode the legs, $\phi_X = (\Phi_X)_\infty$ and $\phi_Y = (\Phi_Y)_\infty$, to one-block codes, and (with a bit more argument) we can assume that $G$ is irreducible. In this set-up, the finite-to-one condition translates to the so-called “no-diamond” condition, which means that for any given pair of states $I$, $J$ and finite sequence $w$, there is at most one path from $I$ to $J$ with label $w$ (Coven and Paul 1975, 1977). Since, for any fixed state $I$, the number of cycles of length $n$, a multiple of $p = \operatorname{per}(G)$, at $I$ grows like $2^{nh(W)}$, we have, in this set-up, a means to invertibly encode a “large” set of allowed blocks in $X$ to allowed blocks in $Y$: namely, fix state $I$, and a large $n$, which is a multiple of $p$; for any cycle $\gamma$ of length $n$ at state $I$ encode the $\Phi_X$-label of $\gamma$ to the $\Phi_Y$-label of $\gamma$.

For encoding and decoding, one can dispense with state information if the legs are “almost invertible.” A factor code $\phi$ is almost invertible if it is one-to-one on sequences that are typical in the following sense: $x$ is typical if every allowed block appears infinitely often in $x$ both to the left and the right. We then say that shift spaces $X$ and $Y$ are almost conjugate if there is an SFT $W$ and almost invertible factor codes $\phi_X : W \to X$, $\phi_Y : W \to Y$. We call $(W, \phi_X, \phi_Y)$ an almost conjugacy between $X$ and $Y$.

For irreducible sofic shifts, it can be shown that any almost invertible factor code is finite-to-one, and so almost conjugacy implies finite equivalence. Thus, entropy is again an invariant, and together with a second very mild invariant, it is complete:

Theorem 11 Adler-Marcus (1979) Let $X$ and $Y$ be irreducible sofic shifts with minimal right resolving presentations $(G, \mathcal{L})$ and $(H, \mathcal{M})$. Then $X$ and $Y$ are almost conjugate if and only if $h(X) = h(Y)$ and $\operatorname{per}(G) = \operatorname{per}(H)$.

In particular, if $X$ and $Y$ are mixing, then $\operatorname{per}(G) = 1 = \operatorname{per}(H)$ and entropy itself is a complete invariant.

Thus, with an almost conjugacy, one can invertibly encode most sequences in $X$ to those of $Y$ without the need for auxiliary state information. Moreover, if $X$ and $Y$ are almost conjugate, then there is an almost conjugacy of $X$ and $Y$ in which one leg is right-resolving and the other leg is left-resolving (and the common extension is irreducible (Adler and Marcus 1979)). This gives an even more concrete interpretation to the encoding.

The proofs of Theorems 10 and 11 are actually quite constructive. For illustration, we consider a very special, but historically important, case.

Let $G$ be a graph with constant out-degree $n$. A road coloring $\Phi$ is a labeling of $G$ such that at each state of $G$, each symbol $0, \ldots, n-1$ appears exactly once as the label of an outgoing edge. An $n$-ary word $w$ is synchronizing if all paths that are labeled $w$ end at the same state. Figure 6 gives examples of road-colorings, with $n = 2$.

For a road-coloring, a binary word may be viewed as a sequence of instructions given to drivers starting at each of the states. A synchronizing word is a word that drives everybody to the same state. For instance, in Fig. 6a, the word 11 drives everybody to the state in the lower-left corner. But Fig. 6b does not have a synchronizing word because whenever a driver takes a “0” road he stays where he is and whenever a driver
takes a “1” road he rotates by 120°. The road-coloring in Fig. 6c is essentially the only road-coloring of its underlying graph, and there is no synchronizing word because drivers must always oscillate between the two states.

Now, let $X$ be the full $n$-shift and $Y$ be an irreducible SFT with entropy $\log n$. Suppose that we could find a presentation $(G, \mathcal{L})$ of $Y$ with $G$ having constant out-degree $n$ and $\mathcal{L}_\infty$ finite-to-one. Then, define $\Phi$ to be any road coloring of $G$. Then $\Phi_\infty$ would be a finite-to-one (in fact, right resolving!) factor code from $X_G$ to $X$. And we would obtain a finite equivalence with $W = X_G$, $\phi_Y = \mathcal{L}_\infty$ and $\phi_X = \Phi_\infty$. If, moreover, we could choose $\mathcal{L}$ and $\Phi$ such that $\phi_Y$ and $\phi_X$ are almost invertible, then we would have an almost conjugacy.

It turned out that this could be arranged for $\phi_Y$ via a construction related to state splitting (Adler et al. 1977). And if $Y$ were mixing, then $G$ could be chosen to be primitive and $\phi_Y$ almost invertible. In this setting, $\phi_X = \Phi_\infty$ would be almost invertible iff $\Phi$ has a synchronizing word; the sufficiency follows from the fact that every bi-infinite $n$-ary sequence which contains $w$ infinitely often to the left would be the label of exactly one bi-infinite sequence of edges (to see this, use the synchronizing and road-coloring properties).

The construction of such a labeling $\Phi$ became known as the Road Problem, which remained open for 30 years. In the meantime, a weaker version of the road problem was solved and applied to yield this special case of Theorem 11 (see Adler et al. 1977). Nevertheless, the problem remained an important problem in graph/automata theory and was only recently solved:

Theorem 12 Road Theorem (Trachtman 2007) If $G$ is a finite directed primitive graph with constant out-degree $n$, there is a road-coloring of $G$ which has a synchronizing word.

Trachtman’s approach relies heavily on earlier work of Friedman (1990) and Kari (2001).

The primitivity assumption above is close to necessary. Clearly some kind of connectivity is required and in the presence of irreducibility, primitivity would be necessary since otherwise there would be a “phase” introduced in the graph that would never allow a word to synchronize, as in Fig. 6c.

So far, we have focused on equivalences between symbolic systems. There has also been considerable attention paid to problems of embedding one system into another and factoring one onto another.

One of the most striking results of this type is the Krieger embedding theorem. It is not hard to show that any proper subshift of an irreducible SFT must have strictly smaller entropy. Thus, a necessary condition for a proper embedding of a shift space into an irreducible SFT is that it have strictly smaller entropy. This condition, together with a trivially necessary condition on periodic points, turns out to be sufficient. Recall that $q_n(X)$ denotes the number of points of least period $n$ in $X$.

Theorem 13 Embedding Theorem (Krieger 1982) Let $X$ and $Y$ be irreducible shifts of finite type. Then there is a proper embedding of $X$ into $Y$ if and only if $h(X) < h(Y)$ and for each $n \ge 1$, $q_n(X) \le q_n(Y)$.
In fact, Krieger’s theorem shows that these conditions are necessary and sufficient for a proper embedding of any shift space into a mixing SFT. The analogous problems for embedding into irreducible or mixing sofic shifts are still open, though there are partial results (Boyle 1983).

Using the embedding theorem and other tools from symbolic dynamics, Boyle and Handelman (1991) obtained a stunning application to linear algebra: namely, a complete characterization of the non-zero spectra of primitive matrices over $\mathbb{R}$. In fact, they obtained characterizations of non-zero spectra for primitive matrices over many other subrings of $\mathbb{R}$. While they did not obtain a complete characterization over $\mathbb{Z}$, they formulated a conjecture for $\mathbb{Z}$ and obtained many partial results towards that conjecture, which was later proven using other tools. The result, stated below, shows that three simple necessary conditions on a set of nonzero complex numbers are actually sufficient. In order to state these conditions, we need the following notation:

Let $\Lambda = \{\lambda_1, \ldots, \lambda_k\}$ be a list of nonzero complex numbers (with multiplicity). Let $f_\Lambda(t) = \prod_{i=1}^{k} (t - \lambda_i)$, and $\operatorname{tr}_n(\Lambda) = \sum_{d \mid n} \mu(n/d) \sum_{i=1}^{k} \lambda_i^d$, with $\mu$ being the Möbius function.

1. Integrality Condition: $f_\Lambda(t)$ is a monic polynomial (with integer coefficients).
2. Perron Condition: There is a positive entry in $\Lambda$, occurring just once, that strictly dominates in absolute value all other entries. We denote this entry by $\lambda_\Lambda$.
3. Net Trace Condition: $\operatorname{tr}_n(\Lambda) \ge 0$ for all $n \ge 1$.

Theorem 14 Kim-Ormes-Roush (2000) Let $\Lambda$ be a list of nonzero complex numbers satisfying the Integrality, Perron, and Net Trace Conditions. Then there is a primitive integral matrix $A$ for which $\Lambda$ is the non-zero spectrum of $A$.

These conditions are all indeed necessary for $\Lambda$ to be the non-zero spectrum of a primitive integral matrix. The integrality condition is that $\Lambda$ forms a complete set of algebraic conjugates; the Perron condition states that $\Lambda$ must satisfy the conditions of the Perron-Frobenius theorem for primitive matrices; and the Net Trace condition assures that the number of periodic points of least period $n$ would be nonnegative.

We now turn from embeddings to factors. One special case, which is somewhat related to the Road Problem above and also important for data recording applications (Section “Coding for Data Recording Channels”) is:

Theorem 15 Adler et al. (1983a), Marcus (1979) An SFT $X$ factors onto the full $n$-shift iff $h(X) \ge \log(n)$.

While this special case treats both the equal entropy case ($h(X) = \log(n)$) and unequal entropy case ($h(X) > \log(n)$), in general, the factor problem naturally divides into two cases: lower entropy factors and equal entropy factors. In either case, a trivial necessary condition for $Y$ to be a factor of $X$ is that whenever $q_n(X) \ne 0$, there exists a $d \mid n$ such that $q_d(Y) \ne 0$. This condition is denoted $P(X) \searrow P(Y)$. Building on ideas from Krieger’s embedding theorem, this necessary condition was shown to be sufficient in the lower entropy case.

Theorem 16 Lower Entropy Factor Theorem (Boyle 1983) Let $X$ and $Y$ be irreducible SFT’s with $h(X) > h(Y)$. Then there is a factor code from $X$ to $Y$ if and only if $P(X) \searrow P(Y)$.

As with the embedding theorem, the lower entropy factors problem for irreducible sofic shifts is still open.

The equal entropy factors problem for SFT’s is quite different. Clearly, $P(X) \searrow P(Y)$ is a necessary condition. A second necessary condition involves the dimension group and is simplest to state in the case of mixing edge shifts $X_A$ and $X_B$. We say that a subgroup $\Delta$ of the dimension group $\Delta_A$ is pure if whenever an integer multiple of an element $v \in \Delta_A$ is in $\Delta$, then so is $v$; intuitively $\Delta$ does not have any “rational holes” in $\Delta_A$. The condition is that there is a pure $\delta_A$-invariant subgroup $\Delta$ of $\Delta_A$ such that $(\Delta_B, \delta_B)$ is a quotient of $(\Delta, \delta_A|_\Delta)$.

In the equal entropy case, this condition and the trivial periodic point condition, $P(X) \searrow P(Y)$, subsume all known necessary conditions for the
stationary measure $\mu$ on $G$, we define a stationary measure $\nu = \phi(\mu)$ on $X_H$ by transporting $\mu$ to $X_H$: for a measurable set $A$ in $X_H$, define

$$\nu(A) = \mu(\phi^{-1}(A)).$$

Then $\phi$ defines a measure-preserving homomorphism from the MPT defined by $\mu$ to the MPT defined by $\nu$. Since measure-preserving homomorphisms between MPT’s cannot increase measure-theoretic entropy, we have $h(\nu) \le h(\mu)$.

Suppose that now $\phi$ is actually a conjugacy. Then it defines a measure-preserving isomorphism, and so $h(\mu) = h(\nu)$. If $\mu = \mu_G$, then by uniqueness, we have $\nu = \mu_H$. Thus, $\phi$ defines a measure-theoretic isomorphism between the MPT defined by $\mu_G$ and the MPT defined by $\mu_H$. In fact, this holds whenever $\phi$ is merely an almost invertible factor code. This establishes the following result:

Theorem 20 Let $G$, $H$ be irreducible graphs, and let $\mu_G$, $\mu_H$ be the stationary Markov chains of maximal entropy on $G$, $H$. If $X_G$, $X_H$ are almost conjugate (in particular, if they are conjugate), then the measure-preserving transformations defined by $\mu_G$ and $\mu_H$ are isomorphic.

Hence, conjugacies and almost conjugacies yield isomorphisms between measure-preserving transformations defined by stationary Markov chains of maximal entropy. In fact, the isomorphisms obtained in this way have some very desirable properties compared to the run-of-the-mill isomorphism. For instance, an isomorphism $\phi$ between stationary processes typically has an infinite window; i.e., to know $\phi(x)_0$, you typically need to know all of $x$, not just a central block $x_{[-n, n]}$ (these are the kinds of isomorphisms that appear in general ergodic theory and in particular in Ornstein’s celebrated isomorphism theory (Ornstein 1970)). In contrast, by definition, a conjugacy always has a finite window of uniform size. It turns out that an isomorphism obtained from an almost conjugacy, as well as its inverse, has finite expected coding length in the sense that to know $\phi(x)_0$, you need to know only a central block $x_{[-n(x), n(x)]}$, where the function $n(x)$ has finite expectation (Adler and Marcus 1979). In particular, by Theorem 11, whenever $X_G$ and $X_H$ are mixing edge shifts with the same entropy, the measure-preserving transformations defined by $\mu_G$ and $\mu_H$ are isomorphic via an isomorphism with finite expected coding length.

The notions of conjugacy, finite equivalence, almost conjugacy, embedding, factor code, and so on can all be generalized to the context of stationary measures, in particular to stationary Markov chains. For instance, a conjugacy between two stationary measures is a map that is simultaneously a conjugacy of the underlying shift spaces and an isomorphism of the associated measure-preserving transformations. Many results in symbolic dynamics have been generalized to the context of stationary Markov chains. There is a substantial literature on this, in particular on finitary isomorphisms with finite expected coding time, e.g., (Krieger 1983; Marcus and Tuncel 1990; Mouat and Tuncel 2002; Parry 1979; Schmidt 1984). The expositions (Parry 1991; Parry and Tuncel 1982) give a nice introduction to the subject of strong finitary codings between stationary Markov chains. See also the research papers (Gomez 2003; Marcus and Tuncel 1991, 1993; Parry and Schmidt 1984; Parry and Tuncel 1981; Tuncel 1981, 1983).

Higher Dimensional Shift Spaces

In this section, we introduce higher dimensional shift spaces. For a more thorough introduction, we refer the reader to Lind (2004). For the related subject of tiling systems, see Robinson (2004), Radin (1996), and Mozes (1989, 1992).

The $d$-dimensional full $\mathcal{A}$-shift is defined to be $\mathcal{A}^{\mathbb{Z}^d}$. Ordinarily, $\mathcal{A}$ is a finite alphabet, and here we restrict ourselves to this case. An element $x$ of the full shift may be regarded as a function $x : \mathbb{Z}^d \to \mathcal{A}$, or, more informally, as a “configuration” of alphabet choices at the sites of the integer lattice $\mathbb{Z}^d$.

For $x \in \mathcal{A}^{\mathbb{Z}^d}$ and $F \subset \mathbb{Z}^d$, let $x_F$ denote the restriction of $x$ to $F$. The usual metric on the one-dimensional full shift naturally generalizes to a
$$h(X) = \lim_{n \to \infty} \frac{1}{n^2} \log \left| X_{[0,n-1] \times [0,n-1]} \right|,$$

$$x_{i,j} = B \;\Rightarrow\; x_{i,j+1} = T.$$
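The two-dimensional entropy formula, $h(X) = \lim_{n \to \infty} \frac{1}{n^2} \log |X_{[0,n-1] \times [0,n-1]}|$, can be illustrated by counting square patterns for a small example. The sketch below uses the two-dimensional hard square shift (binary configurations with no two horizontally or vertically adjacent 1s); this example is an assumption introduced for illustration and is not taken from the surrounding text. For this particular shift every locally admissible square pattern extends to a full configuration by padding with 0s, so the count equals $|X_{[0,n-1] \times [0,n-1]}|$; that shortcut does not hold for general two-dimensional SFT’s. The function names are hypothetical, and logarithms are base 2.

```python
from itertools import product
from math import log2

def count_hard_square_patterns(n):
    """Count n x n binary arrays with no two horizontally or vertically adjacent 1s."""
    # Rows of length n with no two adjacent 1s.
    rows = [r for r in product((0, 1), repeat=n)
            if all(not (r[i] and r[i + 1]) for i in range(n - 1))]
    # Row s may sit directly below row r iff no column carries a 1 in both.
    compatible = {r: [s for s in rows if all(not (a and b) for a, b in zip(r, s))]
                  for r in rows}
    counts = {r: 1 for r in rows}            # patterns of height 1, keyed by last row
    for _ in range(n - 1):                   # add one row at a time
        new = {r: 0 for r in rows}
        for r, c in counts.items():
            for s in compatible[r]:
                new[s] += c
        counts = new
    return sum(counts.values())

def entropy_estimate(n):
    """Finite-n value of (1/n^2) log2 |X_{[0,n-1] x [0,n-1]}|."""
    return log2(count_hard_square_patterns(n)) / n ** 2

for n in range(1, 7):
    print(n, count_hard_square_patterns(n), round(entropy_estimate(n), 4))
```

The row-by-row count is a transfer-matrix style computation; it is exponential in $n$ and is meant only to show how the finite-$n$ quantities in the limit behave for small squares.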
least $R$, any occurring configurations on $F$ and $F'$ can be combined to form an occurring configuration on $F \cup F'$ (Burton and Steif 1994; Ward 1994). For one-dimensional SFT’s, this is equivalent to mixing, but much stronger than mixing for two-dimensional SFT’s.

Just as in one dimension, we have the notion of sliding block code for higher dimensional shifts. For finite alphabets $\mathcal{A}$, $\mathcal{B}$, a cube $F \subset \mathbb{Z}^d$ and a function $\Phi : \mathcal{A}^F \to \mathcal{B}$, the mapping $\phi = \Phi_\infty : \mathcal{A}^{\mathbb{Z}^d} \to \mathcal{B}^{\mathbb{Z}^d}$ defined by

$$\Phi_\infty(x)_n = \Phi(x_{n+F})$$

is called a sliding block code. By restriction, we have the notion of sliding block code from one $d$-dimensional shift space to another. As expected, for $d$-dimensional shifts $X$ and $Y$, the sliding block codes $\phi : X \to Y$ coincide exactly with the continuous translation-commuting maps from $X$ to $Y$, i.e., the maps which are continuous with respect to the metric $\rho$, defined above, and which satisfy $\phi \circ \sigma^n = \sigma^n \circ \phi$ for all $n \in \mathbb{Z}^d$. Thus, it makes sense to consider the various coding problems, in particular the conjugacy, factor, and embedding problems, in the higher dimensional setting, but these are very difficult.

Even the question of determining when a higher dimensional SFT of entropy at least $\log n$ factors onto the full $n$-shift seems very difficult (in contrast to Theorem 15). However, there are some positive results for strongly irreducible SFT’s. For instance, it is known that any strongly irreducible SFT of entropy strictly larger than $\log n$ factors onto the full $n$-shift (Desai 2009; Johnson and Madden 2005). In fact, that result requires only a weaker assumption than strong irreducibility, but still much stronger than mixing; recent examples (Boyle et al. 2010) show, among other things, that one cannot weaken that assumption to mere mixing.

There are results on other coding problems as well. For instance, a version of the Embedding theorem in one dimension (Theorem 13) holds for SFT’s in two dimensions with a strong mixing property (Lightwood 2003/04); however, it is required that the shift to be embedded contains no points that are periodic in any single direction. And some other results on entropy of proper subshifts of strongly irreducible SFT’s carry over from one dimension to higher dimensions (Pavlov 2011; Quas and Trow 2000). But in many cases where versions of the result carry over, the proofs are much different from those in one dimension.

Measures of maximal entropy for two-dimensional SFT’s behave very differently from the one-dimensional case. For instance, even with very strong mixing properties, such as strong irreducibility, there can be more than one measure of maximal entropy (Burton and Steif 1994), and the relationships among entropy-preserving, finite-to-one, and almost invertibility for factor codes discussed in section “Other Coding Problems” can be very different in higher dimensions (Meester and Steif 2001). Other differences with respect to entropy can be found in Quas and Sahin (2003).

There is also the natural notion of higher dimensional sofic shifts, which can be defined as those shift spaces that are factors of SFT’s. Recall that every one-dimensional sofic shift is a right-resolving, and hence entropy-preserving, factor of an SFT. It is not known if there is an analogue to this fact in higher dimensions, although recently there has been some progress: every sofic shift $Y$ is a factor of an SFT with entropy arbitrarily close to $h(Y)$ (Desai 2006).

Finally, there is a subclass of $d$-dimensional SFT’s which is somewhat tractable, namely, $d$-dimensional shifts with group structure in the following sense. Let $\mathcal{A}$ be a (finite) group. Then the full $d$-dimensional shift over $\mathcal{A}$ is also a group with respect to the coordinate-wise group structure. A (higher-dimensional) group shift is a subshift of $\mathcal{A}^{\mathbb{Z}^d}$ which is also a subgroup. For a survey on results for this class, we refer the reader to Lind and Schmidt (2002).

Future Directions

The future directions of the subject will likely be determined by progress on solutions to open problems. In the course of describing topics in this
entry, we have mentioned many open problems along the way. For a much more complete list on a wealth of subareas of symbolic dynamics, we refer the reader to the article (Boyle 2007). While it is difficult to single out the most important challenges, certainly the problem of understanding multidimensional shift spaces, especially of finite type, is one of the most important.

Addendum to the Second Edition

Since the original publication of this article, there has been an enormous amount of activity and progress in symbolic dynamics. While some major problems remain unsolved, with little or no progress, others have been solved, partially or completely, and vast new areas of exploration have been carved out. It would be impossible in this brief update to summarize the developments in symbolic dynamics within the past 10 years. Here, we focus mainly on developments within subareas of symbolic dynamics most closely related to the content of the original article. We have also included some major results known at the time of the original article but not included in the article.

The symbolic modeling approach, introduced in the section on the “Origins of Symbolic Dynamics: Modelling of Dynamical Systems,” continues to be an essential tool in modelling smooth dynamical systems. Markov partitions, such as those for the main examples, hyperbolic toral automorphisms and Smale’s horseshoe, have finitely many “rectangles” corresponding to a finite alphabet, upon which a shift space model is built. One can also consider shift spaces over countably infinite alphabets instead. Recently, Markov partitions with countably many rectangles were constructed by Sarig (2013) to model surface diffeomorphisms by shift spaces over countably infinite alphabets. This enabled him to prove a fundamental conjecture of Katok (2007, Problem 7.4) on the growth rate of periodic points for arbitrary surface diffeomorphisms with positive entropy.

The conjugacy problem for shifts of finite type (SFTs), considered in the section “The Conjugacy Problem,” is one of the most fundamental problems in the subject: given two SFTs X and Y, how can you tell whether X and Y are conjugate? While many sensitive invariants of conjugacy are known, it is still not known if this problem is even decidable by a finite algorithm.

As described in the section “The Conjugacy Problem,” conjugacy between a pair of SFTs is equivalent to strong shift equivalence (SSE) of the corresponding adjacency matrices. SSE implies shift equivalence (SE), which is known to be decidable and, for a long time, was conjectured to be equivalent to SSE. But as mentioned in that section, Kim and Roush (1999) showed that this is false even for irreducible matrices. Since then there has been very little progress on the conjugacy problem per se.

SSE and SE are purely algebraic notions for matrices over the semi-ring $\mathbb{Z}_+$. However, they can be defined as meaningful relations over other categories, in particular rings. It is known that over $\mathbb{Z}$, SSE and SE are equivalent. But there are rings over which SSE and SE are not equivalent. Boyle and Schmieding showed that the extent to which they differ is captured by invariants from algebraic K-theory (Boyle and Schmieding 2019).

The section on “Other Coding Problems” began with notions of equivalence of SFTs weaker than conjugacy, namely, finite equivalence and almost conjugacy, for which one can effectively decide equivalence by simple invariants, namely, entropy and period (Theorems 10 and 11). A version of Theorem 11 has been extended to the setting of a certain class of shift spaces over countably infinite alphabets by Boyle, Buzzi, and Gomez-Aiza (2006).

Theorems 10 and 11 are intimately related to finite-to-one factor maps (also called finite-to-one factor codes). Such a map has a “degree,” defined as the “typical” number of preimages of any given point. Allahbakhshi and Quas (2013) introduced the analogous notion of class degree for infinite-to-one factor maps. They used this notion to obtain bounds on the number of maximal relative entropy ergodic lifts of a given ergodic measure on the range. The class degree has also turned out to be key to understanding the structure of infinite-to-one factor codes. This was further developed
by Allahbakhshi, Hong, and Jung (2015) who established dynamical properties for infinite-to-one factor maps analogous to those of the fibers of a finite-to-one factor map.

The Embedding Theorem (Theorem 13) and Lower Entropy Factor Theorem (Theorem 16) for irreducible SFTs have been generalized by Thomsen (2004) for special classes of mixing sofic shifts. He also developed structure to explain why his results cannot hold for all mixing sofic shifts. A recent preprint of Krieger (2018) stated a complete necessary, sufficient, and decidable set of conditions for the Lower Entropy Factor Theorem for all mixing sofic shifts.

Lind (1984) characterized the set of numbers that can occur as entropies of SFTs in terms of a set of algebraic integers, known as Perron numbers; specifically, the entropies of SFTs are exactly the logs of roots of Perron numbers. These are positive, in particular real, algebraic integers. Thurston (1990) introduced a generalization known as complex Perron numbers. As shown by Kenyon (1996), these complex numbers are precisely the expansion factors $\lambda$ for a collection of self-similar tilings of the plane; these are certain tilings by translates of a finite set of basic tiles such that for each copy T of a basic tile in the tiling, $\lambda T$ is tiled by translates of the same set of basic tiles. For further work in this area, see Thurston (2014, especially Theorem 1.3).

The characterization of the collection of zeta functions of SFTs (equivalently, numbers of periodic orbits of all periods) given in Theorem 14 was conjectured by Boyle and Handelman (1991) and proven by Kim, Ormes, and Roush (Kim et al. 2000). In particular, this result characterized the possible multi-sets of non-zero eigenvalues of primitive square matrices over $\mathbb{Z}$. One can ask for similar characterizations if one replaces $\mathbb{Z}$ by other rings. A more general spectral conjecture, which pertains to a wide class of subrings S of $\mathbb{R}$, remains open (Boyle 2007, Conjecture 6.1). The case $S = \mathbb{R}$ was already obtained in Boyle and Handelman (1991). For more recent work on this subject, see Boyle and Schmieding (2016).

To every SFT, there is an associated continuous suspension flow, and Franks completely classified irreducible SFTs up to a notion of flow conjugacy by simple invariants (Franks 1984). This classification has been extended to a large class of irreducible sofic shifts by Boyle, Carlsen, and Eilers (2018).

The state-splitting algorithm for constructing sliding block decodable encoders, introduced in the section on “Coding for Data Recording Channels,” was used widely in practice in the 1980s and 1990s. Variants of the algorithm were used to aid in the construction of codes which became standards in the recording industry, including the (1,7) code (Adler et al. 1983b) for hard disk drives, EFMPlus for the DVD (Immink 1994) (see also Immink 2004b, Section 11.5.2), and the code used in the first generation of Linear Tape Open (Ashley et al. 1999). Today, the algorithm is used mainly as a proof of concept in new schemes such as codes for radio frequency identification (Barbero et al. 2014), weakly constrained codes (Elishco et al. 2016), and parity-preserving codes (Roth and Siegel 2018).

A major development over the past two decades has been the extension of symbolic dynamics to the multidimensional setting. The set-up was described in the section on “Higher Dimensional Shift Spaces.” Here, bi-infinite sequences are replaced by arrays of symbols on the points of the integer lattice $\mathbb{Z}^d$. As mentioned in that section, Hochman and Meyerovitch (2007) characterized the set of numbers which can occur as the entropy of a $\mathbb{Z}^d$ SFT, $d \geq 2$, as the set of all right recursively enumerable numbers (this is very different from Lind’s characterization in $d = 1$ (Lind 1984)). This result led to a development that has strongly tied symbolic dynamics to computability theory. Many of the central results in this development are summarized in Jeandel (2016).

One central concept in this development is the notion of an effective shift space X, which means a shift space for which there exists a list $\mathcal{F}$ of forbidden patterns that defines X and can be produced by a Turing machine. Of course, any $\mathbb{Z}^d$-SFT or sofic shift is effective. But even for $d = 1$, there are many effective shifts that are not sofic. Nevertheless, any effective $\mathbb{Z}^d$-shift space X can be “simulated” by a $\mathbb{Z}^{d+1}$-SFT Y in the sense
that X is a $\mathbb{Z}^d$-subaction of a factor of Y (Aubrun and Sablik 2013).

Using essentially the same argument to show that the entropy of a $\mathbb{Z}^d$-SFT is right recursively enumerable (rre), one can show that the entropy of an effective $\mathbb{Z}^d$-shift space is rre. Since the collection of sofic shift spaces lies in between the SFTs and the effective shift spaces, it follows that the entropies of the sofic shift spaces are exactly the rre numbers. So, for every $\mathbb{Z}^d$ sofic shift there is a $\mathbb{Z}^d$ SFT with the same entropy. As mentioned in the original article, a major open problem is whether for every $\mathbb{Z}^d$ sofic shift X there is a $\mathbb{Z}^d$ SFT which factors onto X and has the same entropy. This problem remains unsolved.

The extension from $\mathbb{Z}$ to $\mathbb{Z}^d$ symbolic dynamics can be carried further. Namely, for an arbitrary countable group G and a finite alphabet $\mathcal{A}$, a G-shift space is a closed subset of the full shift space $\mathcal{A}^G$ that is invariant under the action of G on $\mathcal{A}^G$: for $g \in G$, define $g : \mathcal{A}^G \to \mathcal{A}^G$ by

$$(g(x))_h = x_{hg}.$$

Much of the classical theory for $\mathbb{Z}$ and $\mathbb{Z}^d$ symbolic dynamics has been generalized to large classes of amenable groups G. More recently, a notion of sofic entropy for dynamics of certain non-amenable groups has played a prominent role (Bowen 2012, 2018).

There are many other subareas of symbolic dynamics that are very active and of great importance today. For instance, in algebraic symbolic dynamics one endows the shift space with a group structure such that the shift map is an automorphism. One can characterize many symbolic dynamical properties of such shift spaces in terms of algebraic objects (Schmidt 2012). In cellular automata, one studies the iteration of sliding block codes from one shift space to another as dynamical systems in their own right. These dynamical systems exhibit a rich variety of dynamical behavior (Ceccherini-Silberstein and Coornaert 2010). Thermodynamic formalism studies the notion of topological pressure, a generalization of topological entropy, and equilibrium states, a generalization of measures of maximal entropy (Keller 1998; Sarig 2009). This is intimately related to the theory of Gibbs states in statistical mechanical systems (Ruelle 2004).

As in the original article, for a list of major open problems in symbolic dynamics and progress on their solutions, we refer the reader to Boyle (2007).

Bibliography

Adler R, Marcus B (1979) Topological entropy and equivalence of dynamical systems. Memoirs of the American Mathematical Society, vol 219. AMS, Providence
Adler R, Weiss B (1970) Similarity of automorphisms of the torus. Memoirs of the American Mathematical Society, vol 98. AMS, Providence
Adler R, Konheim A, McAndrew M (1965) Topological entropy. Trans Am Math Soc 114:309–319
Adler RL, Goodwyn LW, Weiss B (1977) Equivalence of topological Markov shifts. Isr J Math 27:48–63
Adler RL, Coppersmith D, Hassner M (1983a) Algorithms for sliding block codes – an application of symbolic dynamics to information theory. IEEE Trans Inf Theory 29:5–22
Adler RL, Hassner M, Moussouris JP (1983b, November 1) Method and apparatus for generating a noiseless sliding block code for a (1,7) channel with rate 2/3. US patent 4,413,251
Aho AV, Hopcroft JE, Ullman JD (1974) The design and analysis of computer algorithms. Addison-Wesley, Reading
Allahbakhshi M, Quas A (2013) Class degree and relative maximal entropy. Trans Am Math Soc 365:1347–1368
Allahbakhshi M, Hong S, Jung U (2015) Class closing factor codes and constant-to-one factor codes from shifts of finite type. Dyn Syst 30:485–500
Ashley J (1988) A linear bound for sliding block decoder window size. IEEE Trans Inf Theory 34:389–399
Ashley J (1991) Resolving factor maps for shifts of finite type with equal entropy. Ergod Theory Dyn Syst 11:219–240
Ashley J (1996) A linear bound for sliding block decoder window size, II. IEEE Trans Inf Theory 42:1913–1924
Ashley J, Jaquette G, Marcus B, Seger P (1999) Runlength limited encoding/decoding with robust resync. US patent 5,969,649
Aubrun N, Sablik M (2013) Simulation of effective subshifts by two-dimensional subshifts of finite type. Acta Appl Math 126:35–63
Barbero A, Rosnes E, Yang G, Ytrehus O (2014) Near-field passive RFID communication: channel model and code design. IEEE Trans Commun 62:1716–1726
Béal M-P (1990) The method of poles: a coding method for constrained channels. IEEE Trans Inf Theory 36:763–772
Béal M-P (1993) Codage symbolique. Masson, Paris
Béal M-P (2003) Extensions of the method of poles for code construction. IEEE Trans Inf Theory 49:1516–1523
Bedford T (1986) Generating special Markov partitions for hyperbolic toral automorphisms using fractals. Ergod Theory Dyn Syst 6:325–333
Berger R (1966) The undecidability of the domino problem. Memoirs of the American Mathematical Society. AMS, Providence
Berstel J, Perrin D (1985) Theory of codes. Academic, New York
Blanchard F, Maass A, Nogueira A (2000) In: Blanchard F, Maass A, Nogueira A (eds) Topics in symbolic dynamics and applications. LMS lecture notes, vol 279. Cambridge University Press, Cambridge
Blanchard P, Devaney R, Keen L (2004) Complex dynamics and symbolic dynamics. In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics. AMS, Providence, pp 37–59
Bowen R (1970) Markov partitions for Axiom A diffeomorphisms. Am J Math 92:725–747
Bowen R (1973) Symbolic dynamics for hyperbolic flows. Am J Math 95:429–460
Bowen L (2012) Sofic entropy and amenable groups. Ergod Theory Dyn Syst 32:427–466
Bowen L (2018) A brief introduction to sofic entropy theory. arXiv:1711.02062v2
Bowen R, Franks J (1977) Homology for zero-dimensional basic sets. Ann Math 106:73–92
Bowen R, Lanford OE (1970) Zeta functions of restrictions of the shift transformation. Proceedings of symposia in pure mathematics, vol 14. AMS, Providence, pp 43–50
Boyle M (1983) Lower entropy factors of sofic systems. Ergod Theory Dyn Syst 3:541–557
Boyle M (1993) Symbolic dynamics and matrices. In: Brualdi R et al (eds) Combinatorial and graph theoretic problems in linear algebra. IMA volumes in mathematics and its applications, vol 50. Springer, New York, pp 1–38
Boyle M (2007) Open problems in symbolic dynamics. Contemporary Math; for solutions and updates, see http://www.math.umd.edu/mmb
Boyle M, Handelman D (1991) The spectra of nonnegative matrices via symbolic dynamics. Ann Math 133:249–316
Boyle M, Krieger W (1986) Almost Markov and shift equivalent sofic systems. In: Alexander J (ed) Proceedings of Maryland special year in dynamics 1986–87. Lecture notes in mathematics, vol 1342. Springer, Berlin, pp 33–93
Boyle M, Schmieding S (2016) Strong shift equivalence and the generalized spectral conjecture for nonnegative matrices. Linear Algebra Appl 498:231–243
Boyle M, Schmieding S (2019) Strong shift equivalence and algebraic K-theory. J Reine Angew Math 752:63–104
Boyle M, Marcus BH, Trow P (1987) Resolving maps and the dimension group for shifts of finite type. Memoirs of the American Mathematical Society, vol 377. AMS, Providence
Boyle M, Buzzi J, Gomez-Aiza R (2006) Almost isomorphism for countable state Markov shifts. J Reine Angew Math 592:23–47
Boyle M, Pavlov R, Schraudner M (2010) Multidimensional sofic shifts without separation and their factors. Trans Am Math Soc 362:4617–4653
Boyle M, Carlsen TM, Eilers S (2018) Flow equivalence of sofic shifts. Isr J Math 225:111–146
Burton R, Steif J (1994) Nonuniqueness of measures of maximal entropy for subshifts of finite type. Ergod Theory Dyn Syst 14(2):213–235
Calkin N, Wilf H (1998) The number of independent sets in a grid graph. SIAM J Discret Math 11:54–60
Ceccherini-Silberstein T, Coornaert M (2010) Cellular automata and groups. Springer, Berlin
Cidecyian R, Evangelos E, Marcus B, Modha M (2001) Maximum transition run codes for generalized partial response channels. IEEE J Sel Area Commun 19:619–634
Coven E, Paul M (1974) Endomorphisms of irreducible shifts of finite type. Math Syst Theory 8:167–175
Coven E, Paul M (1975) Sofic systems. Isr J Math 20:165–177
Coven E, Paul M (1977) Finite procedures for sofic systems. Monats Math 83:265–278
Cover T, Thomas J (1991) Elements of information theory. Wiley, New York
Desai A (2006) Subsystem entropy for $\mathbb{Z}^d$ sofic shifts. Indag Math 17:353–360
Desai A (2009) A class of $\mathbb{Z}^d$-subshifts which factor onto lower entropy full shifts. Proc Am Math Soc 137:2613–2621
Devaney R (1987) An introduction to chaotic dynamical systems. Addison-Wesley, Reading
Elishco O, Meyerovitch T, Schwartz M (2016) Semiconstrained systems. IEEE Trans Inf Theory 62:1688–1702
Fischer R (1975a) Sofic systems and graphs. Monats Math 80:179–186
Fischer R (1975b) Graphs and symbolic dynamics. Colloq Math Soc János Bólyai: Top Inf Theory 16:229–243
Franaszek PA (1968) Sequence-state coding for digital transmission. Bell Syst Tech J 47:143–155
Franaszek PA (1982) Construction of bounded delay codes for discrete noiseless channels. IBM J Res Dev 26:506–514
Franaszek PA (1989) Coding for constrained channels: a comparison of two approaches. IBM J Res Dev 33:602–607
Franks J (1984) Flow equivalence of subshifts of finite type. Ergod Theory Dyn Syst 4:53–66
Friedman J (1990) On the road coloring problem. Proc Am Math Soc 110:1133–1135
Gomez R (2003) Positive K-theory for finitary isomorphisms of Markov chains. Ergod Theory Dyn Syst 23:1485–1504
Hadamard J (1898) Les surfaces à courbures opposées et leurs lignes géodésiques. J Math Pure Appl 4:27–73
Hasselblatt B, Katok A (1995) Introduction to the modern theory of dynamical systems. Cambridge University Press, Cambridge
Hedlund GA (1939) The dynamics of geodesic flows. Bull Am Math Soc 45:241–260
Hedlund GA (1944) Sturmian minimal sets. Am J Math 66:605–620
Hedlund GA (1969) Endomorphisms and automorphisms of the shift dynamical system. Math Syst Theory 3:320–375
Hochman M (2009) On the dynamics and recursive properties of multidimensional symbolic systems. Invent Math 176:131–167
Hochman M, Meyerovitch T (2007) A characterization of the entropies of multidimensional shifts of finite type. Ann Math 171(2010):2011–2038
Hollmann HDL (1995) On the construction of bounded-delay encodable codes for constrained systems. IEEE Trans Inf Theory 41:1354–1378
Immink KAS (1994) EFMPlus, 8–16 modulation code. US patent 5,696,505
Immink KAS (2004a) Codes for mass data storage, 2nd edn. Shannon Foundation Press, Eindhoven
Immink KAS (2004b) Codes for mass data storage systems, 2nd edn. Shannon Foundation Publishers, Eindhoven
Jeandel E (2016) Computability in symbolic dynamics. CiE, Paris, pp 124–131
Johnson A, Madden K (2005) Factoring higher-dimensional shifts of finite type onto full shifts. Ergod Theory Dyn Syst 25:811–822
Karabed R, Siegel P, Soljanin E (1999) Constrained coding for binary channels with high intersymbol interference. IEEE Trans Inf Theory 45:1777–1797
Kari J (2001) Synchronizing finite automata on Eulerian digraphs. Springer Lect Notes Comput Sci 2136:432–438
Kasteleyn PW (1961) The statistics of dimers on a lattice. Physica A 27:1209–1225
Katok A (2007) Fifty years of entropy in dynamics 1958–2007. J Modern Dyn 4:545–596
Keller G (1998) Equilibrium states in ergodic theory. Cambridge University Press, Cambridge
Kenyon R (1996) The construction of self-similar tilings. Geom Funct Anal 6:471–488
Kenyon R (2008) Lectures on dimers. http://www.math.brown.edu/~rkenyon/papers/dimerlecturenotes.pdf
Kim KH, Roush FW (1979) Some results on decidability of shift equivalence. J Comb Inf Syst Sci 4:123–146
Kim KH, Roush FW (1988) Decidability of shift equivalence. In: Alexander J (ed) Proceedings of Maryland special year in dynamics 1986–87. Lecture notes in mathematics, vol 1342. Springer, Berlin, pp 374–424
Kim KH, Roush FW (1990) An algorithm for sofic shift equivalence. Ergod Theory Dyn Syst 10:381–393
Kim KH, Roush FW (1999) Williams conjecture is false for irreducible subshifts. Ann Math 149:545–558
Kim KH, Ormes N, Roush F (2000) The spectra of nonnegative integer matrices via formal power series. J Am Math Soc 13:773–806
Kitchens B (1998) Symbolic dynamics: one-sided, two-sided and countable state Markov chains. Springer, Berlin
Kitchens B, Schmidt K (1988) Periodic points, decidability and Markov subgroups. In: Alexander JC (ed) Dynamical systems: proceedings of the special year. Springer lecture notes in mathematics, vol 1342. Springer, Berlin/Heidelberg, pp 440–454
Kitchens B, Marcus B, Trow P (1991) Eventual factor maps and compositions of closing maps. Ergod Theory Dyn Syst 11:85–113
Krieger W (1980a) On a dimension for a class of homeomorphism groups. Math Ann 252:87–95
Krieger W (1980b) On dimension functions and topological Markov chains. Invent Math 56:239–250
Krieger W (1982) On the subsystems of topological Markov chains. Ergod Theory Dyn Syst 2:195–202
Krieger W (1983) On the finitary isomorphisms of Markov shifts that have finite expected coding time. Wahrsch Z 65:323–328
Krieger W (1984) On sofic systems I. Isr J Math 48:305–330
Krieger W (2018) On images of sofic systems. arXiv:1101.1750
Lightwood S (2003/04) Morphisms from non-periodic $\mathbb{Z}^2$ subshifts I and II. Ergod Theory Dyn Syst 23:587–609, 24:1227–1260
Lind D (1984) The entropies of topological Markov shifts and a related class of algebraic integers. Ergod Theory Dyn Syst 4:283–300
Lind D (1989) Perturbations of shifts of finite type. SIAM J Discret Math 2:350–365
Lind D (1996) A zeta function for $\mathbb{Z}^d$-actions. In: Pollicott M, Schmidt K (eds) Proceedings of Warwick symposium on $\mathbb{Z}^d$-actions. LMS lecture notes, vol 228. Cambridge University Press, Cambridge, pp 433–450
Lind D (2004) Multi-dimensional symbolic dynamics. In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics, vol 60. AMS, Providence, pp 81–120
Lind D, Marcus B (1995) An introduction to symbolic dynamics and coding. Cambridge University Press, Cambridge
Lind D, Schmidt K (2002) Symbolic and algebraic dynamical systems. In: Hasselblatt B, Katok A (eds) Handbook of dynamical systems. Elsevier, Amsterdam, pp 765–812
Manning A (1971) Axiom A diffeomorphisms have rational zeta functions. Bull Lond Math Soc 3:215–220
Marcus B (1979) Factors and extensions of full shifts. Monats Math 88:239–247
Marcus BH, Roth RM (1991) Bounds on the number of states in encoder graphs for input-constrained channels. IEEE Trans Inf Theory 37:742–758
Marcus B, Tuncel S (1990) Entropy at a weight-per-symbol and embeddings of Markov chains. Invent Math 102:235–266
Marcus B, Tuncel S (1991) The weight-per-symbol polytope and scaffolds of invariants associated with Markov chains. Ergod Theory Dyn Syst 11:129–180
Marcus B, Tuncel S (1993) Matrices of polynomials, positivity, and finite equivalence of Markov chains. J Am Math Soc 6:131–147
Marcus BH, Roth RM, Siegel PH (1998) Constrained systems and coding for recording channels. In: Brualdi R, Huffman C, Pless V (eds) Handbook on coding theory. Elsevier, New York; updated version at http://www.math.ubc.ca/~marcus/Handbook/
Markley N, Paul M (1981a) Matrix subshifts for $\mathbb{Z}^\nu$ symbolic dynamics. Proc Lond Math Soc 43:251–272
Markley N, Paul M (1981b) Maximal measures and entropy for $\mathbb{Z}^\nu$ subshifts of finite type. In: Devaney R, Nitecki Z (eds) Classical mechanics and dynamical systems. Dekker notes, vol 70. Dekker, New York, pp 135–157
Meester R, Steif J (2001) Higher-dimensional subshifts of finite type, factor maps and measures of maximal entropy. Pac J Math 200:497–510
Morse M (1921) Recurrent geodesics on a surface of negative curvature. Trans Am Math Soc 22:84–100
Morse M, Hedlund GA (1938) Symbolic dynamics. Am J Math 60:815–866
Morse M, Hedlund GA (1940) Symbolic dynamics II, Sturmian trajectories. Am J Math 62:1–42
Mouat R, Tuncel S (2002) Constructing finitary isomorphisms with finite expected coding time. Isr J Math 132:359–372
Mozes S (1989) Tilings, substitutions and the dynamical systems generated by them. J Anal Math 53:139–186
Mozes S (1992) A zero entropy, mixing of all orders tiling system. In: Walters P (ed) Symbolic dynamics and its applications. Contemporary mathematics, vol 135. AMS, Providence, pp 319–326
Nagy Z, Zeger K (2000) Capacity bounds for the three-dimensional (0, 1) run length limited channel. IEEE Trans Inf Theory 46:1030–1033
Nasu M (1986) Topological conjugacy for sofic systems. Ergod Theory Dyn Syst 6:265–280
Ornstein D (1970) Bernoulli shifts with the same entropy are isomorphic. Adv Math 4:337–352
Parry W (1964) Intrinsic Markov chains. Trans Am Math Soc 112:55–66
Parry W (1977) A finitary classification of topological Markov chains and sofic systems. Bull Lond Math Soc 9:86–92
Parry W (1979) Finitary isomorphisms with finite expected code-lengths. Bull Lond Math Soc 11:170–176
Parry W (1991) Notes on coding problems for finite state processes. Bull Lond Math Soc 23:1–33
Parry W, Schmidt K (1984) Natural coefficients and invariants for Markov shifts. Invent Math 76:15–32
Parry W, Tuncel S (1981) On the classification of Markov chains by finite equivalence. Ergod Theory Dyn Syst 1:303–335
Parry W, Tuncel S (1982) Classification problems in ergodic theory. LMS lecture notes, vol 67. Cambridge University Press, Cambridge
Pavlov R (2011) Perturbations of multidimensional shifts of finite type. Ergod Theory Dyn Syst 31:483–526
Petersen K (1989) Ergodic theory. Cambridge University Press, Cambridge
Quas A, Sahin A (2003) Entropy gaps and locally maximal entropy in $\mathbb{Z}^d$-subshifts. Ergod Theory Dyn Syst 23:1227–1245
Quas A, Trow P (2000) Subshifts of multidimensional shifts of finite type. Ergod Theory Dyn Syst 20:859–874
Radin C (1996) Miles of tiles. In: Pollicott M, Schmidt K (eds) Ergodic theory of $\mathbb{Z}^d$-actions. LMS lecture notes, vol 228. Cambridge University Press, Cambridge, pp 237–258
Robinson RM (1971) Undecidability and nonperiodicity for tilings of the plane. Invent Math 12:177–209
Robinson EA (2004) Symbolic dynamics and tilings of $\mathbb{R}^d$. In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics, vol 60. AMS, Providence, pp 81–120
Roth R, Siegel P (2018) On parity-preserving constrained coding. In: Proceedings of international symposium on information theory, pp 1804–1808
Rudolph D (1990) Fundamentals of measurable dynamics. Oxford University Press, Oxford
Ruelle D (2004) Thermodynamic formalism, 2nd edn. Cambridge University Press, Cambridge
Sarig O (2009) Lecture notes on thermodynamic formalism for topological Markov shifts. http://www.weizmann.ac.il/math/sarigo/sites/math.sarigo/files/uploads/tdfnotes.pdf
Sarig O (2013) Symbolic dynamics for surface diffeomorphisms with positive entropy. J Am Math Soc 26:341–426
Schmidt K (1984) Invariants for finitary isomorphisms with finite expected code lengths. Invent Math 76:33–40
Schmidt K (1990) Algebraic ideas in ergodic theory. AMS-CBMS reg conference, vol 76. AMS, Providence
Schmidt K (1995) Dynamical systems of algebraic origin. Birkhäuser, Basel
Schmidt K (2012) Dynamical systems of algebraic origin. Birkhäuser (reprint of 1995 edition)
Schwartz M, Bruck S (2008) Constrained codes as networks of relations. IEEE Trans Inf Theory 54:2179–2195
Seneta E (1980) Non-negative matrices and Markov chains, 2nd edn. Springer, Berlin
Shannon C (1948) A mathematical theory of communication. Bell Syst Tech J 27:379–423, 623–656
Sinai YG (1968) Markov partitions and C-diffeomorphisms. Funct Anal Appl 2:64–89
Smale S (1967) Differentiable dynamical systems. Bull Am Math Soc 73:747–817
Thomsen K (2004) On the structure of a sofic shift space. Trans Am Math Soc 356:3557–3619
Thurston WP (1990) Groups, tilings, and finite state automata. Lecture notes, AMS colloquium lectures, American Mathematical Society
Thurston WP (2014) Entropy in dimension one. Frontiers in complex dynamics, Princeton mathematical series, vol 51. Princeton University Press, Princeton
Trachtman A (2007) The road coloring problem. Isr J Math 172:51–60
Tuncel S (1981) Conditional pressure and coding. Isr J Math 39:101–112
Tuncel S (1983) A dimension, dimension modules and Markov chains. Proc Lond Math Soc 46:100–116
Wagoner J (1992) Classification of subshifts of finite type revisited. In: Walters P (ed) Symbolic dynamics and its applications. Contemporary mathematics, vol 135. AMS, Providence, pp 423–444
Wagoner J (2004) Strong shift equivalence theory. In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics, vol 60. AMS, Providence, pp 121–154
Walters P (1982) An introduction to ergodic theory. Springer graduate texts in mathematics, vol 79. Springer, Berlin
Walters P (1992) In: Walters P (ed) Symbolic dynamics and its applications. Contemporary mathematics, vol 135. AMS, Providence
Ward T (1994) Automorphisms of $\mathbb{Z}^d$-subshifts of finite type. Indag Math 5:495–504
Weiss B (1973) Subshifts of finite type and sofic systems. Monats Math 77:462–474
Williams RF (1973/74) Classification of subshifts of finite type. Ann Math 98:120–153; Erratum: Ann Math 99:380–381
Williams S (2004a) Introduction to symbolic dynamics. In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics, vol 60. AMS, Providence, pp 1–12
Williams S (2004b) In: Williams S (ed) Symbolic dynamics and its applications. Proceedings of symposia in applied mathematics, vol 60. AMS, Providence
Operator Ergodic Theory

Guy Cohen¹ and Michael Lin²
¹School of Electrical Engineering, Ben-Gurion University, Beer-Sheva, Israel
²Department of Mathematics, Ben-Gurion University, Beer-Sheva, Israel

Article Outline

Glossary
Definition of the Subject
Introduction
The Mean Ergodic Theorem
Rates of Convergence
Uniform Ergodic Theorems
Strong Cesàro Convergence
Weak Stability and Mixing
Stability
Averaging Along Subsequences
General Averaging Methods
Modulated Ergodic Theorems
Resolvent Conditions and Growth of Powers
Continuous Time (C0-semigroups)
Bibliography

Glossary

Blum-Hanson property: Norm convergence to zero of all averages $\frac{1}{n}\sum_{j=1}^{n} T^{k_j}x$ along any increasing $\{k_j\} \subset \mathbb{N}$ whenever $T^n x \to 0$ weakly.

Cesàro bounded operator: An operator $T$ with $\sup_n \left\|\frac{1}{n}\sum_{k=0}^{n-1} T^k\right\| < \infty$.

Coboundary (of an operator): A vector $y \in (I - T)X$, i.e., such that $y = x - Tx$ for some $x \in X$.

Contraction: An operator $T$ with $\|T\| \le 1$.

C0-semigroup/one-parameter semigroup: A family of bounded operators $\{T(t)\}_{t \ge 0}$ defined on $X$ such that $T(t + s) = T(t)T(s)$ for $t, s \ge 0$, $T(0) = I$, and $T(t)x$ is continuous on $[0, \infty)$ for every $x \in X$.

Ergodic measure-preserving transformation: A mpt such that the only sets $A$ with $\theta^{-1}A = A$ a.e. are null sets or complements of null sets.

Infinitesimal generator of a C0-semigroup: The operator $Ax := \lim_{t \to 0^+} t^{-1}(T(t)x - x)$ with domain $D(A) := \{x \in X : Ax \text{ exists}\}$.

Koopman operator: The operator on $L^p(\Omega, S, m)$, induced by a measure-preserving transformation $\theta$, defined by $Tf = f \circ \theta$.

Markov-Feller operator: A Markov operator on bounded measurable functions of a compact Hausdorff space which preserves continuity.

Markov operator: The operator induced on bounded measurable functions on $(\Omega, S)$ by a transition probability $P(t, A)$, defined by $Pf(t) = \int f(s)P(t, ds)$.

Measure (probability)-preserving transformation (mpt): A measurable mapping $\theta$ of a measure (probability) space $(\Omega, S, m)$ satisfying $m(\theta^{-1}A) = m(A)$ for any $A \in S$.

Minimal topological dynamical system: A topological dynamical system $(K, \tau)$ such that for every $s \in K$ the orbit $\{\tau^n s\}$ is dense.

Positive operator (on a Banach lattice): An operator which maps the positive cone of a Banach lattice into itself.

Power-bounded operator: An operator $T$ with $\sup_n \|T^n\| < \infty$.

Operator: A bounded linear operator on a (real or complex) Banach space $X$.

Operator semigroup: A family $\mathcal{S}$ of operators such that $TS \in \mathcal{S}$ for $T, S \in \mathcal{S}$.

Orbit of a vector under an operator semigroup $\mathcal{S}$: The set $\{Tx : T \in \mathcal{S}\}$.

(Semi)flow: A family of transformations $\{\theta_t\}_{t \in \mathbb{R}}$ ($\{\theta_t\}_{t \ge 0}$) such that $\theta_{t+s} = \theta_t \circ \theta_s$, with $\theta_0$ the identity map.

Stolz region: The closed convex hull (in $\mathbb{C}$) of the point 1 and a disk of radius $r < 1$.

Transition probability: A function $P(t, A)$ on $\Omega \times S$ (where $(\Omega, S)$ is a measurable space), such that $P(t, \cdot)$ is a probability on $S$ for $t \in \Omega$ and $P(\cdot, A)$ is measurable for every $A \in S$.
Transformation: A mapping of a set into itself.

Topological dynamical system: A compact Hausdorff space $K$ with a continuous map $\tau$ of $K$ to itself.

Uniform ergodicity: Convergence of the averages $\frac{1}{n}\sum_{k=0}^{n-1} T^k$ in the operator norm (uniformly on the unit ball of the Banach space).

Uniquely ergodic topological dynamical system: A topological dynamical system $(K, \tau)$ with a unique probability measure $m$ satisfying $m \circ \tau^{-1} = m$.

(Weak) mean ergodic theorem: (Weak) convergence of the averages $\frac{1}{n}\sum_{k=0}^{n-1} T^k x$ for every $x$ in the Banach space $X$ on which the operator $T$ is defined.

(Weak) stability: (Weak) convergence of $T^n x$ for every $x \in X$.

(Weakly) almost periodic operator: An operator $T$ such that for every $x \in X$ the orbit $\{T^n x\}$ is (weakly) conditionally compact.

(Weakly) quasi-compact operator: An operator $T$ such that for some $n > 0$ there exists a (weakly) compact operator $Q$ satisfying $\|T^n - Q\| < 1$.

Definition of the Subject

Operator ergodic theory deals with the asymptotic behavior of the powers of a power-bounded operator, or a bounded one-parameter semigroup of operators, defined on a (real or complex) Banach space. We will focus on the theory for powers of a single operator, and on the growth of the norms of certain operators which are only Cesàro bounded. The asymptotic behavior is studied in different operator topologies and in various modes of convergence. However, almost everywhere convergence of positive operators on $L^p$ spaces is not discussed methodically in this chapter, since this is done in other chapters of this volume.

The topic of ergodic theorems emerged from the mean ergodic theorem for unitary operators in complex Hilbert spaces of von Neumann (1932) and the pointwise ergodic theorem of G.D. Birkhoff (1931) for measure-preserving transformations. We quote from the survey of Halmos (1949a): “It was very quickly recognized that the proper general framework for von Neumann’s mean ergodic theorem lay in the direction of Hilbert spaces and Banach spaces, whereas the extent of generality suitable to Birkhoff’s theorem was to be found in the concept of a measure space.” Thus, the origin of operator ergodic theory is in the mean ergodic theorem, although the uniform distribution theorem of Weyl (1916) can be restated as an ergodic theorem. The ergodic theory of measure-preserving transformations and pointwise limit theorems for some classes of positive operators are discussed in other chapters of this volume.

Birkhoff and von Neumann were both motivated by a problem in statistical mechanics, Boltzmann’s “ergodic hypothesis,” so accordingly their theorems were formulated and proved in continuous time, i.e., for a flow of invertible measure-preserving transformations rather than for the iterates of one such transformation (discrete time). See Petersen (1996) for comparison of the significance in scientific applications of these two theorems.

Operator ergodic theory has applications in the study of measure-preserving transformations, topological dynamics, and Markov operators, but it is also inspired by results first obtained in these areas. Results for C0-semigroups (continuous time) have applications also in partial differential equations.

Introduction

Let $\mathbb{T} := \{z \in \mathbb{C} : |z| = 1\}$ be the unit circle in the complex plane, and let $\alpha \in [0, 1)$ be irrational. The transformation $\theta z := e^{2\pi i\alpha} z$, $z \in \mathbb{T}$, corresponds to a rotation of the circle by the angle $2\pi\alpha$. In 1884, Kronecker proved that for each $z$ the orbit $\{\theta^n z\}$ is dense in $\mathbb{T}$. Weyl (1910) extended it in his famous uniform distribution theorem, which, using Weyl (1916), can be stated as a mean ergodic theorem in $C(\mathbb{T})$.

Theorem 1 Let $\alpha \in [0, 1)$ be irrational, and for $f$ continuous on $\mathbb{T}$ define $Tf(z) = f(e^{2\pi i\alpha}z)$. Then
$\frac{1}{n}\sum_{k=1}^{n} T^k f(z)$ converges uniformly to the constant $\int f\,dm$, where $m$ is the normalized Lebesgue measure on $\mathbb{T}$.

The result is immediate for trigonometric polynomials, which by Fejér’s theorem are dense in $C(\mathbb{T})$; this proves the theorem. Moreover, trigonometric polynomials are dense in $L^2(\mathbb{T}, m)$, and we obtain the following.

Corollary 2 Let $\alpha \in [0, 1)$ be irrational, and let $Tf(z) = f(e^{2\pi i\alpha}z)$ be the Koopman operator on $L^2(\mathbb{T}, m)$. Then for every $f \in L^2(\mathbb{T}, m)$ we have $\left\|\frac{1}{n}\sum_{k=1}^{n} T^k f - \int f\,dm\right\|_2 \to 0$.

Now let $\theta$ be any invertible measure-preserving transformation of a probability space $(\Omega, S, m)$. Then the Koopman operator $Tf = f \circ \theta$ is unitary on $L^2(\Omega, S, m)$, an observation which led von Neumann (1932) to use the spectral theorem to prove the following mean ergodic theorem.

but $T^n x$ need not converge weakly to 0. Hopf (1932) characterized (1) when $T$ is the Koopman operator of an ergodic probability-preserving transformation $\tau$, by ergodicity of $(\tau \times \tau)(u, v) := (\tau u, \tau v)$ on $(\Omega \times \Omega, m \times m)$; in Hopf (1937), he called such $\tau$ weak mixing, while $\tau$ was called mixing if $T^n x \to Ex$ weakly.

By a strange coincidence, Banach’s book (1932) was published in the same year, and it was realized that von Neumann’s theorem can be generalized “to more general Banach spaces and to more general classes of operators,” quoting Kakutani (1950).

In Hopf (1937), F. Riesz proved elementarily that when $T$ is a contraction of a Hilbert space, $Tx = x$ if and only if $T^*x = x$, and extended Theorem 3 to contractions. Visser (1938) proved weak mean ergodicity of power-bounded operators in Hilbert space. G. Birkhoff (1939) proved the mean ergodic theorem for contractions in uniformly convex spaces, extending the $L^p$ result of F. Riesz (1938).

In their study of Markov chains, Kryloff and Bogoliouboff (1937) presented the following.

Theorem 4 Let $P(t, A)$ be a transition probability on $(\Omega, S)$ and $T$ the Markov operator defined on the space of bounded measurable functions by $Tf(t) = \int f(s)P(t, ds)$. If there exist $n \ge 1$ and a compact operator $Q$, such that $\|T^n - Q\| < 1$, then $T$ is uniformly ergodic: the averages $\frac{1}{n}\sum_{k=1}^{n} T^k$ converge in operator norm.

subsequence $\{n_j\}$ we have $M_{n_j}x \to y$ weakly, then $Ty = y$.

Proposition 6 Let $T$ be Cesàro bounded on $X$ satisfying $\frac{1}{n}T^n x \to 0$ for every $x \in X$. Then the following are equivalent for $x \in X$:

(i) $\|M_n x\| \to 0$.
(ii) $M_{n_j}x \to 0$ weakly for some increasing subsequence $\{n_j\}$.
(iii) $f(x) = 0$ for every functional $f \in F(T^*) \subset X^*$.
(iv) $x \in \overline{(I - T)X}$.
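The uniform convergence asserted in Theorem 1 can be observed numerically. The following is a minimal Python sketch, not part of the original text: the names `rotation_average` and `trig_poly`, the rotation number $\sqrt{2} - 1$, and the test polynomial are all assumptions chosen for illustration (and a floating-point number only approximates an irrational $\alpha$). For a trigonometric polynomial the integral $\int f\,dm$ is its constant term, so the averages should approach 3 at every starting point $z$.

```python
import cmath
import math


def rotation_average(f, alpha, z, n):
    """Return (1/n) * sum_{k=1}^{n} f(e^{2*pi*i*alpha*k} * z)."""
    step = cmath.exp(2j * math.pi * alpha)  # one rotation by the angle 2*pi*alpha
    w = z
    total = 0j
    for _ in range(n):
        w *= step          # after the k-th pass, w is the k-th rotate of z
        total += f(w)
    return total / n


def trig_poly(z):
    """Hypothetical test function; its integral over the circle is the constant term 3."""
    return 3 + z + 2 * z**2 + 1 / z


ALPHA = math.sqrt(2) - 1  # assumed rotation number, irrational up to rounding

if __name__ == "__main__":
    for n in (10, 100, 10_000):
        for z in (1, 1j, cmath.exp(0.7j)):
            err = abs(rotation_average(trig_poly, ALPHA, z, n) - 3)
            print(n, z, err)
    # Rational rotation for contrast: z**4 is invariant under rotation by 1/4,
    # so its averages stay equal to z**4 instead of tending to the integral 0.
    print(abs(rotation_average(lambda z: z**4, 0.25, 1, 1000)))
```

For this polynomial the error is bounded by a constant multiple of $1/n$, uniformly in $z$, because each nonconstant monomial averages to a geometric sum. The last line shows why irrationality of $\alpha$ is needed.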
The proof that (iii) implies (iv) uses the Hahn-Banach theorem. (i) $\Rightarrow$ (ii) $\Rightarrow$ (iii) are obvious, and (iv) $\Rightarrow$ (i) is easy. Proposition 6 is part of the proof in Yosida (1938) of the following mean ergodic theorem, proved independently in Kakutani (1938).

An operator $T$ on $X$ is called weakly almost periodic if for every $x \in X$ the orbit $\{T^n x\}_n$ is weakly conditionally compact.

Theorem 7 Let $T$ be a weakly almost periodic operator on $X$. Then $T$ is mean ergodic.

Since the Krein-Šmulian theorem appeared only in 1940, Yosida assumed that $T$ is power-bounded and that $\{M_n x\}$ is weakly compact. Both assumptions follow from weak almost periodicity. The operator in Theorem 1 is weakly almost periodic on $C(\mathbb{T})$. Weak almost periodicity is not necessary for mean ergodicity of a power-bounded operator; Sine (1976) has a mean ergodic positive contraction $T$ on $C(K)$ such that $T^2$ is not mean ergodic, hence $T$ is not weakly almost periodic. Several other examples are presented by Gerlach and Glück (2019). However, if $T$ is a positive contraction on a Banach lattice with order continuous norm, mean ergodicity of $T$ implies that of $T^2$, by Derriennic and Krengel (1981). Komorník (1993) proved that a positive contraction on $L^1$ is weakly almost periodic if (and only if) it is mean ergodic.

Corollary 8 Every power-bounded operator on a reflexive Banach space is mean ergodic.

Corollary 8 was proved by Lorch (1939), independently of Kakutani and of Yosida. Kakutani and Yosida both saw that proving Corollary 8 really requires only weak almost periodicity of the operator. The results of von Neumann, Riesz, Visser, and Birkhoff mentioned in the introduction are special cases of Corollary 8.

Eberlein (1949) extended the mean ergodic theorem to general operator semigroups $\mathcal{S}$, replacing the Cesàro averages by what he called invariant integrals: a family $\{M_\alpha\}$ of operators on the space $X$, indexed by $\alpha$ in a directed set, such that for any $x \in X$, $M_\alpha x$ is in the closed convex hull of the orbit $\{Tx : T \in \mathcal{S}\}$ of $x$, $\sup_\alpha \|M_\alpha\| < \infty$, and $\lim_\alpha M_\alpha(I - T)x = \lim_\alpha (I - T)M_\alpha x = 0$ for any $T$ in the semigroup and $x \in X$. Eberlein proved a general ergodic theorem on the convergence of $M_\alpha x$, from which he deduced Theorem 7.

The following was proved by Krotkov and Halperin (1953).

Proposition 9 Let $T$ be Cesàro bounded on a reflexive Banach space $X$. If $\frac{1}{n}T^n x \to 0$ (weakly) for every $x \in X$, then $T$ is (weakly) mean ergodic.

Combining Lemma 5 and Proposition 6, we obtain that a power-bounded operator is mean ergodic if (and only if) it is weakly mean ergodic. However, this is false in general; Derriennic (2000) has an example of a weakly mean ergodic operator on a Hilbert space which is not mean ergodic (so $\limsup \frac{1}{n}\|T^n\| > 0$). A general method for obtaining examples like Derriennic’s was given by Tomilov and Zemánek (2004).

Hille (1945) constructed, on $L^1[0, 1]$, the first example of a mean ergodic operator which is not power-bounded, with $\|T^n\|$ of order $O(n^{1/4})$. Kornfeld and Kosek (2003) constructed, for any $\gamma \in (0, 1)$, a positive mean ergodic operator on $L^1[0, 1]$ satisfying $\lim_n n^{\gamma - 1}\|T^n\| = \infty$. Kosek (2011) constructed a mean ergodic operator on $L^1$ with $\limsup n^{-1}\|T^n\| > 0$. Cesàro boundedness alone does not imply mean ergodicity, even if $X$ is finite-dimensional, by a simple example of Assani (1986). However, Émilion (1985) proved that any Cesàro-bounded positive operator on a reflexive Banach lattice is mean ergodic ($n^{-1}T^n \to 0$ strongly is not assumed, but follows).

Lemma 5 and Proposition 6 yield the following:

Theorem 10 Let $T$ be weakly mean ergodic on $X$. Then

$X = F(T) \oplus \overline{(I - T)X}$, (2)

and the limit operator $E$ is the projection on $F(T)$ corresponding to (2).
Conversely, if $T$ is Cesàro bounded and $\frac{1}{n}T^n x \to 0$ (weakly) for every $x \in X$ and (2) holds, then $T$ is (weakly) mean ergodic.
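The remark that Cesàro boundedness alone does not give mean ergodicity, even in finite dimensions, can be checked on a small matrix. The sketch below is an illustration under stated assumptions: the matrix `T` is a standard $2 \times 2$ choice and is not claimed to be the example of Assani (1986), the helper names are hypothetical, and the averages are taken as $M_n = \frac{1}{n}\sum_{k=0}^{n-1} T^k$ as in the glossary. Here $T^k = (-1)^k \begin{pmatrix} 1 & -2k \\ 0 & 1 \end{pmatrix}$, so $\frac{1}{n}T^n$ does not tend to 0.

```python
from fractions import Fraction


def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def cesaro_average(t, n):
    """Return M_n = (1/n) * sum_{k=0}^{n-1} t^k with exact rational entries."""
    power = [[1, 0], [0, 1]]            # t^0
    total = [[0, 0], [0, 0]]
    for _ in range(n):
        total = [[total[i][j] + power[i][j] for j in range(2)] for i in range(2)]
        power = mat_mul(power, t)
    return [[Fraction(total[i][j], n) for j in range(2)] for i in range(2)]


# Hypothetical illustration: T^k = (-1)^k * [[1, -2k], [0, 1]].
T = [[-1, 2], [0, -1]]

if __name__ == "__main__":
    for n in (10, 11, 1000, 1001):
        print(n, cesaro_average(T, n))
```

The averages stay bounded, but the upper-right entry equals 1 for even $n$ and $-(n-1)/n$ for odd $n$, so $M_n$ does not converge.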
For $T$ as in Proposition 6, it is easy to show that the set of points $x \in X$ such that $(M_n x)$ converges is a closed subspace, which equals the right-hand side of (2).

Using the decomposition (2), we obtain an easy proof, due to Satō (1979), of the following theorem of Sine (1970), originally proved for contractions.

Proposition 6 yields easily that for $T$ as in Theorem 11, the fixed points of $T^*$ always separate those of $T$, so if $X$ is reflexive, we obtain Corollary 8 from Theorem 11.

Corollary 12 Let $(K, \tau)$ be a uniquely ergodic topological dynamical system. Then the operator $Tf = f \circ \tau$ on $C(K)$ is mean ergodic.

The corollary is due to Oxtoby (1952). Theorem 1 follows from Theorem 7 or from Corollary 12.

Theorem 13 Let $0 \le r_n \to 0$, and let $T$ be mean ergodic, with $Ex = \lim M_n x$. If for every $x \in X$ there is $c_x$ such that $\|M_n x - Ex\| \le c_x r_n$, then $\|M_n - E\| \to 0$ ($T$ is uniformly ergodic).

Theorem 13 is a consequence of the uniform boundedness principle. When $T$ acts on a complex Banach space, the decomposition (2) yields that when $T$ is uniformly ergodic, 1 is an isolated point in the spectrum of $T$ (see the section Uniform ergodic theorems). Since, by A. Ionescu Tulcea (1963) and Foiaş (1964), the spectrum of the $L^2$ Koopman operator $T$ of an invertible ergodic probability-preserving transformation of a Lebesgue space is the whole unit circle $\mathbb{T}$, $T$ has no uniform rate of convergence in the mean ergodic theorem. A constructive proof for this was given by Krengel (1978); see also Kakutani and Petersen (1981).

Motivated by Euler’s formal approach to Fourier series, Wintner (1945) studied the existence of solutions $g \in L^1$ to the coboundary equation $f = g - g \circ \theta$, where $\theta$ is as in Theorem 1 and $f \in L^2$ (or $L^1$); Anosov (1973) proved that there exists $f \in C(\mathbb{T})$ such that $f = g - g \circ \theta$ with $g$ non-integrable. Gottschalk and Hedlund (1955) proved that if $(K, \tau)$ is a minimal topological dynamical system and $Tf = f \circ \tau$ on $C(K)$, then $f \in (I - T)C(K)$ if (and only if) $\sup_n \left\|\sum_{k=0}^{n-1} T^k f\right\|_\infty < \infty$. Inspired by this result, Browder (1958) proved the following:

$y \in (I - T)X$ if and only if $\sup_n \left\|\sum_{k=0}^{n} T^k y\right\| < \infty$. (3)

Put $S_n := \sum_{k=0}^{n-1} T^k$. The seemingly weaker condition $\sup_N \frac{1}{N}\sum_{n=1}^{N} \|S_n y\|^2 < \infty$ also implies $y \in (I - T)X$. This condition was shown in Robinson (1960) to imply $y \in (I - T)X$ when $T$ is unitary; see also Kozma and Lev (2011). Cuny and Weber (2018) and Volný (2018) proved that if $T$ is a contraction on a Hilbert space, then $\sum_{k=0}^{\infty} \|T^k y\|^2 < \infty$ implies $y \in (I - T)X$. Iterative methods for solving Poisson’s equation $(I - T)x = y$ were studied by Reich (1973).

Lin and Sine (1983) extended Browder’s result to dual operators, proving (3) when $X = Y^*$ and $T = S^*$, with $S$ power-bounded on $Y$; see also Devys (2012). They also proved (3) for contractions on $L^1$. See Aaronson and Weiss (2000) for related results. Kornfeld and Lin (1997) proved (3) for irreducible Markov-Feller operators on $C(K)$, thus extending the topological dynamics result of Gottschalk and Hedlund (1955). Lin and Suciu (2015) studied the equation $(I - T)x = y$ when $T$ is weakly mean ergodic.

Square functions were originally used in the study of a.e. convergence of orthogonal expansions; see the survey of Stein (1982). Motivated by a square function inequality by Jones, Ostrovskii, and Rosenblatt (1996) for contractions in Hilbert space, and by its extension by Jones et al. (1998) to certain isometries in $L^p$, Avigad and Rute (2015) obtained the following p-variation theorem in uniformly convex spaces.
Theorem 15 Let $X$ be isomorphic to a uniformly convex Banach space. Then there exists $p \ge 2$ such that for every power-bounded $T$ satisfying $\inf_n \|T^n x\| \ge B\|x\|$ for some $B > 0$ and every $x \in X$, there exists $C$ (depending on $X$, $p$, and $T$) such that for every increasing sequence $\{n_k\}$ we have

$\sum_{k=1}^{\infty} \left\|M_{n_{k+1}}x - M_{n_k}x\right\|^p \le C\|x\|^p \quad \forall x \in X.$

Kohlenbach and Leuştean (2009) used proof theory to obtain a quantitative proof of the mean ergodic theorem of G. Birkhoff (1939).

Let $T$ be invertible on $X$ reflexive with $\sup_{n \in \mathbb{Z}} \|T^n\| < \infty$. Then $\left\|\frac{1}{n}\sum_{k=\ell}^{\ell+n} T^k x - Ex\right\| \to 0$ for any $\ell \in \mathbb{Z}$. Motivated by Cotlar’s (1955) a.e. convergence of the ergodic Hilbert transform of a probability-preserving transformation and by Petersen (1983), Campbell (1986) used the spectral theorem and the unitary dilation theorem to prove the following.

Theorem 16 Let $T$ be a contraction on a complex Hilbert space $\mathcal{H}$. Then for every $x \in \mathcal{H}$, the limit $Hx = \lim_{n \to \infty} \sum_{k=1}^{n} \frac{1}{k}\left(T^k x - T^{*k} x\right)$ exists, and $H$ is a bounded linear operator on $\mathcal{H}$.

Berkson and Gillespie (1987) proved that if $T$ is invertible on $L^p$, $1 < p < \infty$, with $\sup_{n \in \mathbb{Z}} \|T^n\| < \infty$, then $\sum_{k=1}^{n} \frac{1}{k}\left(T^k x - T^{-k} x\right)$ converges for every $x \in L^p$.

Sucheston (1976) asked whether a Banach space, in which every contraction (or every power-bounded operator) is mean ergodic, must be reflexive. Building upon a partial result of Zaharopol (1986), Emel’yanov (1997) proved that a Banach lattice is reflexive if (and only if) every power-bounded operator is mean ergodic. Fonf et al. (2001) proved that if a Banach space has a basis and all power-bounded operators are mean ergodic, then the space must be reflexive; additional equivalent conditions were obtained by Farkas and Kreidler (2021). Since a basis is assumed, this result does not include Emel’yanov’s. An analogue for C0-semigroups was proved by Mugnolo (2004).

A nonreflexive Banach space with all contractions mean ergodic was exhibited by Fonf et al. (2010).

For any power-bounded $T$ we have (using Proposition 6 for the second inclusion)

$(I - T)X \subset \left\{x \in X : \sup_n \left\|\sum_{k=0}^{n-1} T^k x\right\| < \infty\right\} \subset \overline{(I - T)X}$, (4)

Theorem 14 yields equality in the first inclusion when $X$ is reflexive. Fonf et al. (2011) proved that if $X$ has a basis and for every power-bounded $T$ there is equality in the first inclusion of (4), then $X$ is reflexive.

Bermúdez et al. (2000) studied operators such that $T^n x/n \to 0$ implies that $M_n(T)x$ converges (a “local” mean ergodicity) and proved that if $T^k$ has this property, so does $T$. Radjavi et al. (2003) studied local ergodic properties for $T$ which is not Cesàro bounded. Their main result is:

Theorem 17 Let $T$ be a compact operator on a (real or complex) Banach space $X$, and let $x \in X$. (i) If $n^{-1}\|T^n x\| \to 0$ and $\sup_n \|M_n x\| < \infty$, then $M_n x$ converges in norm to a fixed point of $T$. (ii) If for some subsequence $\sup_k \|T^{n_k} x\| < \infty$, then $M_n x$ converges.

Rates of Convergence

Let $T$ be mean ergodic. When $T$ is uniformly ergodic, (2) yields that $S$, the restriction of $T$ to $Y := \overline{(I - T)X}$, satisfies $\|M_n(S)\| \to 0$. For large $n$, we have that $I - M_n(S)$ is invertible on $Y$, so $I - S$ is invertible, i.e., $I - T$ is invertible on $\overline{(I - T)X}$, so $(I - T)X$ is closed, and $\|M_n - E\| = O\left(\frac{1}{n}\right)$. When $T$ is not uniformly ergodic, by Theorem 13 there is no uniform rate of convergence, so we may find $x \in X$ with $\|M_n x - Ex\| \to 0$ arbitrarily slowly. However, Butzer and Westphal (1971) observed that $\|M_n x\| = o\left(\frac{1}{n}\right)$ implies $x = 0$, so the fastest possible rate for $x \in \overline{(I - T)X}$ is $O\left(\frac{1}{n}\right)$, when $x \in (I - T)X$; by Theorem 14, if $X$ is reflexive, this rate is attained only when $x \in (I - T)X$.
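The $O(1/n)$ rate for coboundaries comes from a telescoping sum: if $y = x - Tx$, then $\sum_{k=0}^{n-1} T^k y = x - T^n x$, which is bounded by $2\sup_k\|T^k\|\,\|x\|$. The Python sketch below illustrates this for a planar rotation. The operator, the angle `PHI`, the vector `X0`, and all function names are assumptions for illustration, and the normalization $M_n y = \frac{1}{n}\sum_{k=0}^{n-1} T^k y$ is assumed as well.

```python
import math


def rotate(v, phi):
    """Apply the planar rotation T by the angle phi (an isometry of R^2)."""
    c, s = math.cos(phi), math.sin(phi)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])


def partial_sum(y, phi, n):
    """Return S_n y = sum_{k=0}^{n-1} T^k y."""
    total = (0.0, 0.0)
    v = y
    for _ in range(n):
        total = (total[0] + v[0], total[1] + v[1])
        v = rotate(v, phi)
    return total


def norm(v):
    """Euclidean norm of a vector in R^2."""
    return math.hypot(v[0], v[1])


PHI = 1.0            # hypothetical rotation angle (not a multiple of 2*pi)
X0 = (3.0, 4.0)      # hypothetical vector x, with norm 5
TX0 = rotate(X0, PHI)
Y0 = (X0[0] - TX0[0], X0[1] - TX0[1])   # the coboundary y = x - Tx

if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        s = partial_sum(Y0, PHI, n)
        # norm(s) stays below 2 * norm(x) = 10, so norm(M_n y) = norm(s) / n = O(1/n)
        print(n, norm(s), norm(s) / n)
```

The bounded partial sums are also the condition appearing in (3) and in the middle set of (4).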
j¼1 j : Cuny
Ty
Theorem 18 Let f ℋ with spectral measure sf (2009) proved for positive contractions of a fixed
defined on(π, π], and assume Ef ¼ 0. Then for Lp, p > 1, that norm convergence of Hf implies its
any a (0, π] and for any n, we have almost everywhere convergence. Additional
properties of H and its domain of definition were
2 given by Cohen et al. (2010) and by Haase and
sinða=2Þ a a
kM n f k2 sf , , Tomilov (2010). Using a functional calculus,
a=2 n n
Gomilko et al. (2011) proved:
where the constant is best possible. Put Sk ¼
sf ak, ak and ΔSk ¼ Sk Skþ1; then Theorem 20 If T is power-bounded and the
series 1
j
4 1
n1 o(1/ log n); this rate is optimal.
S kM n f k2 Sn þ ðk þ 1Þ2 DSk : Halmos (1949b) proved that if T is the
p2 n n k¼1
Koopman operator of an invertible ergodic mea-
sure preserving transformation of a nonatomic
Let α [0, 2). Then the rate kMn f k2 Bnα
holds if and only if for some A > 0 we have probability space, there exists f ðI T ÞL2 for
1 Tj f
sf(δ, δ] Aδα for every δ (0, π]. which j¼1 j does not converge in L2-norm.
Butzer and Westphal (1972) obtained for T Cuny (2009) proved that if that series converges
power-bounded mean ergodic, but not uniformly in norm (even without invertibility of the transfor-
ergodic, the following approximation estimate of mation), then it converges a.e.
Let T be invertible on L p, 1 < p < 1, with
Mny for y ðI T ÞX : There exists B > 0 such that
supn ℤkT nk < 1; Cuny (2010) used the spectral
1
kMn yk B inf x ðIT ÞX k y x k þ nþ1 ðI T Þ1 x :
integration of Berkson and Gillespie (1987) to
Nasri-Roudsari et al. (1995) proved the optimality obtain some conditions for the Lp convergence
of this type of estimates. of 1 1
j¼1 j1a T y, 0 a < 1.
j
Let T be power-bounded on a Banach space
X (so T is a contraction in the equivalent norm
jkxkj ≔ supn0kT nx k). For 0 < α < 1, Derriennic
Uniform Ergodic Theorems
and Lin (2001) used the power series ð1 tÞa ¼
ðaÞ
1 1 j¼1 aj t , 0 t 1, to define the fractional
j The convergence of the averages of a mean ergo-
ðaÞ j
power ðI T Þa ¼ I 1 j¼1 aj T (fractional dic operator need not be in operator norm, even in
powers of generators of continuous one-parameter reflexive spaces; this is the case of the L2
semigroups had been defined differently more Koopman operator of an ergodic invertible
than 40 years earlier). They proved that probability-preserving transformation on [0, 1]
with Lebesgue measure. When the averages $M_n(T)$ converge in operator norm, we say that $T$ is uniformly ergodic.

Fonf et al. (2001) proved that if $X$ is a Banach space with basis such that every power-bounded operator is uniformly ergodic, then $X$ is finite-dimensional. Thus conditions for uniform ergodicity of an operator, with or without prior assumption of mean ergodicity, are of interest.

We have already noted that if $T$ is uniformly ergodic, then $Y := (I - T)X$ is closed and $I - T$ is invertible on $Y$. When $X$ is over $\mathbb{C}$, this yields that for $\lambda$ near 1 the restriction of $\lambda I - T$ to $Y$ is invertible. The decomposition (2) yields that those $\lambda$ are in the resolvent $\rho(T)$; thus, if $1 \in \sigma(T)$, then it is isolated in the spectrum. See Theorem 22 below for additional information.

In their study of Markov chains, Kryloff and Bogoliouboff (1937) introduced a new concept, which applies also to general operators. An operator $T$ on a Banach space is called quasi-compact if there exist $n \ge 1$ and a compact operator $Q$, such that $\|T^n - Q\| < 1$. Yosida and Kakutani (1938) proved that a power-bounded weakly quasi-compact operator ($Q$ weakly compact) is mean ergodic. Yosida and Kakutani (1941) observed that $T$ is (weakly) quasi-compact if and only if there exists a sequence of (weakly) compact operators $Q_n$ such that $\|T^n - Q_n\| \to 0$; they showed that the condition of Doeblin (1937) on Markov operators implies quasi-compactness and proved the following theorem, of which Theorem 4 is a special case.

Theorem 21 Let $T$ be a power-bounded quasi-compact operator on a complex Banach space $X$. Then $T$ is uniformly ergodic. More precisely, $T$ has only finitely many eigenvalues of modulus one (if any), say $\lambda_j$, $j = 1, \ldots, k$, each of finite multiplicity, and there exist projections $T_j$, $j = 1, \ldots, k$ and a quasi-compact $S$ with $\|S^n\| \le \frac{M}{(1 + \varepsilon)^n}$ for some $\varepsilon > 0$, such that

$T^n = \sum_{j=1}^{k} \lambda_j^n T_j + S^n, \quad n = 1, 2, \ldots$

with $TT_j = T_jT = \lambda_j T_j$, $T_jT_i = 0$ for $i \ne j$, and $T_jS = ST_j = 0$.

When $T$ is as in Theorem 21, it is implied in the proof that its peripheral spectrum $\sigma(T) \cap \mathbb{T}$ consists only of (finitely many) eigenvalues; this follows from Theorem 22 below, since $\lambda T$ is quasi-compact, hence uniformly ergodic, for every $\lambda \in \mathbb{T}$.

Brunel and Revuz (1974) observed that $T$ is quasi-compact if and only if $T = S + Q$, with spectral radius $r(S) < 1$ and $Q$ with finite-dimensional range.

It follows from the work of Eberlein (1949) that if $T$ is power-bounded and a convex power-series $S := \sum_{k=1}^{\infty} a_k T^k$ ($a_k \ge 0$, $\sum_k a_k = 1$), is (weakly) quasi-compact, then $T$ is mean ergodic, uniformly ergodic when $S$ is quasi-compact; this extends the results of Yosida and Kakutani (1938, 1941).

When $T$ is uniformly ergodic, $(I - T)X$ is closed, so when $T$ is power-bounded we have $\|M_n - E\| = O(1/n)$. Yoshimoto (1993) studied the uniform rate of convergence when $T$ is quasi-compact but not power-bounded, and usual averages are replaced by $C(\alpha)$ averages; in Yoshimoto (1996), he studied the power-bounded case.

As an application of his operational calculus, Dunford (1943a) obtained the following.

Theorem 22 Let $T$ on a complex Banach space $X$ satisfy $n^{-1}\|T^n\| \to 0$. Then the following are equivalent:

(i) $T$ is uniformly ergodic.
(ii) Either $1 \in \rho(T)$, or 1 is a simple pole of the resolvent $R(\lambda, T) := (\lambda I - T)^{-1}$.
(iii) $(I - T)^2X$ is closed.

Note that even for $T$ power-bounded, 1 isolated in $\sigma(T)$ does not imply that 1 is a pole, e.g., $T = I - V$ on $L^2[0, 1]$ ($V$ the Volterra operator), which is power-bounded by Allan (1997).

Burlando (1997) proved that given $j \in \mathbb{N}$, if $T$ on a complex $X$ satisfies $n^{-j}\|T^n\| \to 0$ and 1 is a pole of the resolvent, then $n^{-j}\sum_{k=0}^{n-1} T^k$ converges in operator norm, and identified the limit; other conditions are also presented.
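The decomposition in Theorem 21 can be made concrete for the Markov operator of a two-state chain, where the only unimodular eigenvalue is $\lambda_1 = 1$. The Python sketch below is an illustration under stated assumptions: the transition probabilities `A` and `B`, the matrices, and the helper names are hypothetical, and the space is finite-dimensional, so quasi-compactness holds trivially. It checks numerically that $P^n = T_1 + S^n$, that $T_1 S = 0$, and that $S^n$ decays geometrically.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_pow(a, n):
    """n-th power of a 2x2 matrix by repeated multiplication."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n):
        result = mat_mul(result, a)
    return result


A, B = 0.3, 0.1                       # hypothetical transition probabilities
P = [[1 - A, A], [B, 1 - B]]          # Markov operator acting on functions of two states
LAM = 1 - A - B                       # the second eigenvalue, here 0.6
# Projection T_1 onto the constants: both rows are the stationary distribution.
T1 = [[B / (A + B), A / (A + B)], [B / (A + B), A / (A + B)]]
S = [[P[i][j] - T1[i][j] for j in range(2)] for i in range(2)]


def max_abs_diff(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    for n in (1, 5, 20):
        s_n = mat_pow(S, n)
        predicted = [[T1[i][j] + s_n[i][j] for j in range(2)] for i in range(2)]
        print(n, max_abs_diff(mat_pow(P, n), predicted), LAM ** n)
```

In this example $S = 0.6\,(I - T_1)$, so in the maximum-row-sum norm $\|S^n\| = 1.5 \cdot 0.6^n$, which matches the bound $M/(1+\varepsilon)^n$ with $M = 1.5$ and $\varepsilon = 2/3$.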
When $X$ is a complex Banach space, we would like to extend the characterization of condition (i) of Theorem 28, given in (6) for unitary operators on Hilbert spaces. Extending the work of Jacobs (1957), deLeeuw and Glicksberg (1961) studied weakly almost periodic semigroups of operators (the image of every vector under the semigroup is weakly conditionally compact), which are necessarily bounded. A very special case of their results is the following decomposition.

Theorem 29 Let $T$ be weakly almost periodic on a complex Banach space $X$. Then

$X = \overline{\mathrm{span}}\{y : Ty = \lambda y,\ |\lambda| = 1\} \oplus \{z : 0 \in \text{weak-closure}\{T^n z\}_{n \ge 0}\}$. (7)

Moreover, $0 \in \text{weak-closure}\{T^n x\}_{n \ge 0}$ if and only if $\langle x, \varphi\rangle = 0$ for every $\varphi \in X^*$ with $T^*\varphi = \lambda\varphi$, $|\lambda| = 1$.

Since, for $T$ power-bounded, $\|T^n z\| \to 0$ if (and only if) $\|T^{n_k} z\| \to 0$ along some subsequence $\{n_k\}$, we obtain:

Corollary 30 Let $T$ be almost periodic on a complex Banach space $X$. Then

$X = \overline{\mathrm{span}}\{y : Ty = \lambda y,\ |\lambda| = 1\} \oplus \{z : \|T^n z\| \to 0\}$.

Proposition 31 Let $T$ be weakly almost periodic on a complex Banach space $X$. Then

$\sup_{\|\varphi\| \le 1} \frac{1}{n}\sum_{k=1}^{n} \left|\langle T^k x, \varphi\rangle\right| \to 0$ if and only if $\langle x, \psi\rangle = 0$ whenever $T^*\psi = \lambda\psi$, $|\lambda| = 1$. (8)

L. Jones and Lin (1980) proved (8) for $T$ power-bounded on complex $X$, assuming only that $\{T^n x\}$ is weak-* sequentially compact in $X^{**}$ (e.g., $X$ does not contain an isomorphic copy of $\ell^1$). Their proof uses the result for isometries in Hilbert spaces, proved in Krengel (1972), but does not use the DeLeeuw-Glicksberg theory. It is also noted there that for general contractions the “if” part of (8) need not hold.

In connection with Theorem 28(iii), Eisner and Müller (2021) studied particular subsequences along which the averages of any weakly almost periodic operator, which has no unimodular eigenvalues, converge in norm.

DeLaubenfels and Vũ (1996) proved the following (see also Theorem 45 below).

Theorem 32 Let $T$ be power-bounded on $X$ over $\mathbb{C}$, with $\sigma(T) \cap \mathbb{T}$ countable. Then $\|T^n x\| \to 0$ if (and only if) $\langle x, \psi\rangle = 0$ whenever $T^*\psi = \lambda\psi$, $|\lambda| = 1$.

Extending the result of Krengel (1972) for unitary operators, Müller and Tomilov (2010) proved:

Theorem 33 Let $T$ be power-bounded on a complex Hilbert space $\mathcal{H}$ with $\sigma(T) \cap \mathbb{T}$ infinite and no unimodular eigenvalues. Then there exists a dense set of vectors $x \in \mathcal{H}$ such that for some increasing $\{n_k\}$ we have $\langle T^{n_k} x, T^{n_j} x\rangle = 0$ when $j \ne k$.

When $T$ is power-bounded on $X$ separable (over $\mathbb{C}$), the set of unimodular eigenvalues $\sigma_p(T) \cap \mathbb{T}$ is countable, by Jamison (1965), but $\sigma_p(T)$ may be uncountable (as the left shift on $\ell^2$ shows).

Weak Stability and Mixing

Recall that an ergodic probability-preserving transformation is called mixing if its Koopman operator $T$ on $L^2$ satisfies $T^n \to E$ in the weak operator topology. We therefore call a power-bounded $T$ on a Banach space $X$ weakly stable if for some $E$ (necessarily a projection) $T^n \to E$ in the weak operator topology. Weak stability implies power-boundedness and mean ergodicity, so when $X$ is over $\mathbb{C}$ the spectral radius of $T$ does not exceed 1.

Theorem 34 (Foguel (1963)). Let $T$ be a contraction on a Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$. Then $T^n x \to 0$ weakly if and only if $\langle T^n x, x\rangle \to 0$. Hence $T^n x \to 0$ weakly if and only if $T^{*n} x \to 0$ weakly.
The following result of Foguel (1963) shows every increasing {kj}; the converse implication
that weak stability of contractions in ℋ is a prob- always holds. Theorem 37 says that any contrac-
lem only for unitary operators. tion on a Hilbert space has the BH property.
A weakly stable T on X, with T n ! E in the
Theorem 35 Let T be a contraction on a Hilbert weak operator topology, has the BH property if
space ℋ and let K ≔fy ℋ: kT nyk ¼ kT nyk ¼ 1 n kj
Ex ! 0 for every increasing {kj}
j¼1 T x
k y k 8 n 1}. Then K is invariant under T and T ,
n
the restriction T jK is unitary, and limhT nz, zi ¼ 0 and every x X. Theorem 37 raises the question
for any z ⊥ K . whether any weakly stable operator on a Banach
Horowitz (1969) proved that if in Theorem 35 space has the BH property.
T is normal, then kT nzk ! 0 for z ⊥ K . Akcoglu and Sucheston (1972) proved that
In the following, Badea and Müller (2009) weakly stable contractions on L1 have the BH
showed that in general the convergence to 0 of property. Millet (1976) proved that a weakly
hT nx, xi can be arbitrarily slow. stable positive operator on L1 has the BH prop-
erty. On the other hand, Akcoglu et al. (1974)
Theorem 36 Let T on a complex Hilbert space constructed a topological dynamical system
$\mathcal{H}$ satisfy $T^n x \to 0$ weakly for every $x \in \mathcal{H}$ (so $T$ is power-bounded), and assume $1 \in \sigma(T)$. Then for every positive sequence $r_n \to 0$ with $\sup_n r_n < 1$ there exists $x \in \mathcal{H}$ with $\|x\| = 1$ such that $\operatorname{Re}\langle T^n x, x\rangle > r_n$.
The following theorem, proved independently by Jones and Kuftinec (1971) and by Akcoglu and Sucheston (1972), was first proved by Blum and Hanson (1960) for the $L_2$ Koopman operator of a mixing probability-preserving transformation.

Theorem 37 Let $T$ be a contraction in a Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$. Then $T^n x \to 0$ weakly if and only if

$$\Big\| \frac{1}{n} \sum_{j=1}^{n} T^{k_j} x \Big\| \to 0 \qquad (9)$$

for every increasing $\{k_j\}$.

Krengel (1971) and Conze (1973) constructed increasing sequences $\{k_j\}$ such that for every ergodic mpt $\theta$ there exists $f \in L_\infty$ for which $\frac{1}{n} \sum_{j=1}^{n} f \circ \theta^{k_j}$ does not converge a.e.
Berend and Bergelson (1986) abstracted Theorem 37, by proving that for $\{x_n\} \subset \mathcal{H}$ bounded, $x_n \to 0$ weakly if and only if $\big\| \frac{1}{n} \sum_{j=1}^{n} x_{k_j} \big\| \to 0$ for every increasing $\{k_j\}$.
We say that a power-bounded $T$ on a Banach space $X$ has the Blum-Hanson (BH) property if $T^n x \to 0$ weakly implies $\big\| \frac{1}{n} \sum_{j=1}^{n} T^{k_j} x \big\| \to 0$ for every increasing $\{k_j\}$.

$(K, \tau)$ such that its Koopman operator $T$ on $C(K)$ is weakly stable, but for some $x \in C(K)$ (9) fails. Lefèvre and Matheron (2016) proved that when $K$ is a compact metric space, all contractions have the BH property if and only if $K$ has only finitely many accumulation points.
Akcoglu and Sucheston (1975), and then Bellow (1975), proved that any weakly stable positive contraction on $L_p$, $1 < p < \infty$, has the BH property. Müller and Tomilov (2007) proved that when $T$ is a contraction of $\ell_p$, $1 \le p < \infty$, the equivalence (9) holds; they also showed that a power-bounded $T$ on a Hilbert space may be weakly stable without possessing the BH property. Satō (1980) proved that if $T$ is positive and weakly stable on $L_p$ of a finite measure space, $1 < p < \infty$, and for some $p < p_1 < \infty$ $T$ is also power-bounded in $L_{p_1}$, then $T$ has the BH property on $L_p$.
Lefèvre et al. (2016) studied the BH property in terms of general sequences $\{x_n\} \subset X$ converging weakly to zero. They have additional examples of real Banach spaces in which every contraction has the BH property; see also Grivaux (2019). Azizov and Chilin (2017) proved the BH property for contractions on certain separable Banach lattices.
Eisner (2010) proved the following discrete analogue of a result of Chill and Tomilov (2007); when its condition (10) holds for any $x \in X$ and $\varphi \in X^*$, we obtain weak stability.
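The Blum-Hanson Theorem 37 can be checked by hand in one simple case. The sketch below assumes the right shift $S$ on $\ell_2(\mathbb{N})$ as the contraction (an isometry for which $S^n x \to 0$ weakly but not in norm) and takes $k_j = j^2$; the helper names `shift_power`, `average_along`, and `norm` are hypothetical and only serve this illustration.

```python
import math

def shift_power(x, k):
    """Apply S^k (right shift by k places) to a finitely supported
    vector of l_2(N), stored as a dict {index: coefficient}."""
    return {i + k: c for i, c in x.items()}

def average_along(x, ks):
    """Return (1/n) * sum_j S^{k_j} x for the exponents k_j listed in ks."""
    n = len(ks)
    out = {}
    for k in ks:
        for i, c in shift_power(x, k).items():
            out[i] = out.get(i, 0.0) + c / n
    return out

def norm(x):
    """l_2 norm of a finitely supported vector."""
    return math.sqrt(sum(abs(c) ** 2 for c in x.values()))

e0 = {0: 1.0}
for n in (100, 400, 1600):
    ks = [j * j for j in range(1, n + 1)]      # increasing sequence k_j = j^2
    # S^{k_j} e0 are orthonormal, so the average has norm 1/sqrt(n)
    print(n, norm(average_along(e0, ks)), 1 / math.sqrt(n))

# The powers themselves do not tend to 0 in norm: S is an isometry.
print(norm(shift_power(e0, 10 ** 6)))
```

Any other increasing sequence gives the same norms for this particular vector, since the shifted copies of `e0` are always orthonormal.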
boundedness (or even mean ergodicity); Léka (2010) has an example with $\sigma(T) = \{1\}$. For additional information and new proofs see the survey of Batty and Seifert (2022).
Allan et al. (1987) proved the following.

Theorem 43 Let $f(z) = \sum_{k=0}^{\infty} a_k z^k$ with $\sum_{k \ge 0} k|a_k| < \infty$, and let $T$ be power-bounded on a complex Banach space $X$. Then $\|T^n f(T)\| \to 0$ if and only if $f(\lambda) = 0$ for every $\lambda \in \sigma(T) \cap \mathbb{T}$.
The conclusion of Theorem 43 was proved by Esterle et al. (1990) for every $f$ in the disk algebra when $T$ is a contraction of a Hilbert space. When $X$ is a Hilbert space, Léka (2009) extended Theorem 43 to $f$ with $\sum_k |a_k| < \infty$. Huang (1995) proved that if $\|T^n f(T)\| \to 0$ for some $f(z)$ with $\sum_k |a_k| < \infty$, then $\sigma(T) \cap \mathbb{T}$ has measure zero.
A continuous time version of the Katznelson-Tzafriri theorem, for bounded $C_0$-semigroups, was proved by Esterle et al. (1992) and by Vũ (1992b).
Following Kalton et al. (2004), see Theorem 67 below, Malinen et al. (2007) proved that for any $T$, either $\liminf_n (n+1)\|T^n(I - T)\| = 0$, or $\liminf_n (n+1)\|T^n(I - T)\| \ge e^{-1}$. Dungey (2008) gave several necessary and sufficient conditions for $\|T^n(I - T)\| = O(n^{-1/2})$ in Theorem 42. The rate $1/n$ is related to the Ritt resolvent condition; see the section Resolvent conditions and growth of powers below. Seifert (2016) obtained a rate in Theorem 42 using the growth of $\|R(e^{it}, T)\|$ as $t \to 0$. Cohen and Lin (2016) studied the rate $1/n^{\alpha}$, $0 < \alpha \le 1$, when $X$ is a Hilbert space. Badea and Seifert (2017) obtained a condition on the numerical range which implies

35 and 34. In Sect. II.6 of their book, Sz-Nagy and Foiaş (1967) proved the following.

Theorem 44 Let $T$ be a completely nonunitary contraction on a complex Hilbert space $\mathcal{H}$. If $\sigma(T) \cap \mathbb{T}$ has zero Haar measure (on $\mathbb{T}$), then $\|T^n x\| \to 0$ and $\|T^{*n} x\| \to 0$ for any $x \in \mathcal{H}$.
Following Kérchy and van Neerven (1997), Mustafayev (2014) studied conditions on the local spectrum which imply local stability. His work yields a partial extension of Theorem 44 to power-bounded operators, by restricting the null-sets containing $\sigma(T) \cap \mathbb{T}$.
For the general case, Arendt and Batty (1988) proved the following; alternate proofs are indicated in Batty (1994). Lyubich and Vũ (1988) proved the continuous case, and their proof was adapted to the discrete case in Eisner (2010). See also Batty and Vũ (1990).

Theorem 45 Let $T$ be a power-bounded operator on a complex Banach space $X$. If $\sigma(T) \cap \mathbb{T}$ is countable and $T^*$ has no unimodular eigenvalues, then $\|T^n x\| \to 0$ for $x \in X$.
Eisner (2010) proved a discrete analogue of a resolvent condition of Tomilov (2001) which yields stability, and obtained as a corollary the following.

Theorem 46 Let $T$ be power-bounded on a complex Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$. Then $\|T^n x\| \to 0$ if and only if

$$\lim_{r \to 1+} (r - 1) \int_0^{2\pi} \big\| R(re^{it}, T)x \big\|^2 \, dt = 0.$$
Following Lasota et al. (1984), Sine (1991) defined a power-bounded operator $T$ on a Banach space $X$ to be constrictive if there exists a compact set $K \subset X$ such that $\operatorname{dist}(T^n x, K) \to 0$ for every $\|x\| \le 1$. When $K = \{0\}$, we have stability. By the definition, if $T$ is constrictive, then for each $x \in X$ the orbit $\{T^n x\}$ is precompact, so $T$ is almost periodic, hence mean ergodic. Sine (1991) proved that a contraction $T$ is constrictive if and only if $T$ is strongly almost periodic and, in the decomposition (7), $\operatorname{span}\{y : Ty = \lambda y, |\lambda| = 1\}$ is finite-dimensional (and on its complement $T$ is stable). Extending the result of Lasota et al. (1984) for $L_1$, Bartoszek (1988) proved the following.

Theorem 48 Let $T$ be a constrictive positive contraction of a Banach lattice. Then there exist $r$ positive unit vectors $y_1, \ldots, y_r$ and $r$ positive functionals $\varphi_1, \ldots, \varphi_r$ such that $T$ permutes the $y_j$, and $T^n\big(x - \sum_{j=1}^{r} \varphi_j(x) y_j\big) \to 0$ for every $x$. Hence, for some $1 \le d \le r!$, $T^d$ is stable.

Averaging Along Subsequences

The Blum-Hanson Theorem 37 and Theorem 28 (iii) raise the question for which increasing subsequences of positive integers $\{k_j\}$ the averages $\frac{1}{n} \sum_{j=1}^{n} T^{k_j}$ converge strongly for contractions, or power-bounded operators, on a Hilbert space $\mathcal{H}$.
Blum and Eisenberg (1974) proved a “uniform distribution” criterion on a sequence $\{k_j\}$ for the strong convergence of $\frac{1}{n} \sum_{j=1}^{n} T^{k_j}$ to the ergodic limit $E(T)$ for every unitary operator; it yields also convergence for any contraction on $\mathcal{H}$, by the unitary dilation theorem. For contractions on $\mathcal{H}$, the following general criterion, implied in Furstenberg (1981) for unitary operators, was presented for the complex case in Rosenblatt (1994), in Rosenblatt and Wierdl (1995), and in Berend et al. (2002). For the real case, one applies the complex case in the complexification of $\mathcal{H}$, as in Michal and Wyman (1941).

Proposition 49 Let $\{k_j\}$ be an increasing sequence of positive integers. The averages $\frac{1}{n} \sum_{j=1}^{n} T^{k_j} x$ converge in norm for every contraction on a (real or complex) Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$ if and only if for every $\lambda \in \mathbb{T}$ the averages $\frac{1}{n} \sum_{j=1}^{n} \lambda^{k_j}$ converge.
Motivated by a result of Brunel and Keane (1969), Bourgain (1988b) and Bourgain et al. (1989) proved the “return times theorem,” which provides increasing sequences $\{k_j\}$, with $\lim k_j/j < \infty$, such that averaging any Koopman operator on any $L_p$ along $\{k_j\}$ converges a.e., hence also in $L_p$ norm ($1 \le p < \infty$). Applying it to all rotations of $\mathbb{T}$, we obtain, by Proposition 49, that for these sequences $\frac{1}{n} \sum_{j=1}^{n} T^{k_j} x$ converges in norm for any contraction $T$ on a Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$.
Extending a result of Furstenberg (1981) for unitary operators (for which Bergelson (1987) gave a nonspectral proof), Kunszenti-Kovács et al. (2011) proved the following.

Theorem 50 Let $q$ be a nonconstant real polynomial with integer coefficients which maps $\mathbb{N}$ to itself. Then the limit $c_q(\lambda) := \lim_{n \to \infty} \frac{1}{n} \sum_{k=1}^{n} \lambda^{q(k)}$ exists for $\lambda \in \mathbb{T}$ and is zero for $\lambda$ not a root of unity.
For every contraction $T$ on a Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$, the averages $\frac{1}{n} \sum_{k=1}^{n} T^{q(k)} x$ converge in norm.
If $\mathcal{H}$ is over $\mathbb{C}$, then the limit is $\sum_{\lambda : |\lambda| = 1} c_q(\lambda) E_\lambda x$, where $E_\lambda x := \lim_n \frac{1}{n} \sum_{k=0}^{n-1} \bar{\lambda}^k T^k x$ is the orthogonal projection on the eigenspace corresponding to $\lambda$.
Given a contraction $T$ on $\mathcal{H}$, the limit is a projection for every $q$ with $q(0) = 0$ if and only if roots of unity different from 1 are not eigenvalues of $T$.
Proposition 49 does not apply to power-bounded operators. Using a different method, ter Elst and Müller (2017) proved:

Theorem 51 Let $q$ be a nonconstant real polynomial with rational coefficients which maps $\mathbb{N}$ to itself. If $T$ is a power-bounded operator on Hilbert space $\mathcal{H}$ over $\mathbb{C}$ such that roots of unity are not eigenvalues, then $\frac{1}{n} \sum_{k=1}^{n} T^{q(k)} x \to 0$ for every $x \in \mathcal{H}$.
Eisner and Müller (2021) extended Theorem 51 by proving that for a general real polynomial $q$ with $[q(k)] \ge 0$ for $k \in \mathbb{N}$, we have convergence of $\frac{1}{n} \sum_{k=1}^{n} T^{[q(k)]} x$ for every $x \in \mathcal{H}$.
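The scalar limit $c_q(\lambda)$ of Theorem 50 is easy to explore numerically. The following is a minimal sketch, assuming $q(k) = k^2$ and writing $\lambda = e^{2\pi i \alpha}$; the function name `poly_average` and the choice of test points are illustrative assumptions. For $\lambda = i$ (a fourth root of unity) the exponent $k^2$ is $0$ or $1$ modulo 4, alternating, so the averages equal $(1 + i)/2$ for even $n$; for $\alpha = \sqrt{2}$ ($\lambda$ not a root of unity) the averages should be small for large $n$.

```python
import cmath
import math

def poly_average(q, alpha, n):
    """Return (1/n) * sum_{k=1}^{n} lam**q(k) for lam = exp(2*pi*i*alpha).

    The phase q(k)*alpha is reduced modulo 1 before exponentiating,
    which keeps the floating-point error small for large q(k)."""
    total = 0j
    for k in range(1, n + 1):
        phase = math.fmod(q(k) * alpha, 1.0)
        total += cmath.exp(2j * math.pi * phase)
    return total / n

def square(k):
    return k * k

# lam = i, a root of unity: the average is (1 + i)/2 for even n
print(poly_average(square, 0.25, 20000))

# lam = exp(2*pi*i*sqrt(2)), not a root of unity: the averages decay
for n in (1000, 10000, 50000):
    print(n, abs(poly_average(square, math.sqrt(2), n)))
```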
Bourgain (1989) proved that for $T$ the Koopman operator on $L_p$ of an invertible probability-preserving transformation, we have a.e. convergence of $\frac{1}{n} \sum_{k=1}^{n} T^{q(k)} f$ for $f \in L_p$, $1 < p < \infty$, and $q$ as in Theorem 51. This yields $L_p$-norm convergence, also in $L_1$. For general real polynomials, he proved a.e. convergence of $\frac{1}{n} \sum_{k=1}^{n} T^{[q(k)]} f$ for $f \in L_\infty$.
Let $\{p_j\}$ be the sequence of prime numbers in increasing order. Extending earlier results of Bourgain (1988a) and of Wierdl (1988), Nair (1993) proved that given $q$ a nonconstant polynomial mapping $\mathbb{N}$ to itself, for the Koopman operator as above on $L_p$, $1 < p < \infty$, we have a.e. convergence of $\frac{1}{n} \sum_{j=1}^{n} T^{q(p_j)} f$ for $f \in L_p$, hence in $L_p$ norm. Applying Nair’s result to all rotations of $\mathbb{T}$ and using Proposition 49, we obtain the following result, proved directly in Eisner and Lin (2018).

Theorem 52 Let $\{p_j\}$ be the sequence of primes in increasing order. Given a nonconstant polynomial $q$ mapping $\mathbb{N}$ to itself, for every contraction on a (real or complex) Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$, the averages $\frac{1}{n} \sum_{j=1}^{n} T^{q(p_j)} x$ converge in norm.
Boshernitzan and Wierdl (1996) studied conditions on certain real sequences $a(t)$, which tend to $+\infty$ as $t \to \infty$, such that for every Koopman operator $T$ of an invertible probability-preserving transformation, the averages $\frac{1}{n} \sum_{k=1}^{n} T^{[a(k)]} f$ converge in $L_2$-norm.

General Averaging Methods

Since the averages of a convergent sequence converge to the same limit, it is natural to apply more general averaging (summability) methods which preserve convergence and limits, when convergence of the arithmetic means fails.
A matrix $A = (a_{n,j})_{n,j \ge 0}$ maps $\ell_\infty$ into itself, by $(Ab)_n = \sum_{j=0}^{\infty} a_{n,j} b_j$ for $b := (b_0, b_1, \ldots) \in \ell_\infty$, if and only if $\sup_n \sum_{j=0}^{\infty} |a_{n,j}| < \infty$; if, in addition, the matrix $A$ satisfies $\lim_{n \to \infty} a_{n,j} = 0$ for every $j$ and $\lim_{n \to \infty} \sum_{j=0}^{\infty} a_{n,j} = 1$, then it maps the space $c$ of convergent sequences into itself, with preservation of the limit (Silverman-Toeplitz theorem). Such a matrix is called a regular summability matrix, or in short a regular matrix. The Cesàro averaging corresponds to the matrix with $a_{n,j} = 1/(n+1)$ for $j \le n$ and 0 for $j > n$. The first to consider averaging of powers of a power-bounded operator by regular matrices was L. Cohen (1940), who proved:

Theorem 53 Let $A = (a_{n,j})_{n,j \ge 0}$ be a regular matrix such that

$$\lim_{k \to \infty} \sum_{j=k}^{\infty} |a_{n,j+1} - a_{n,j}| = 0 \quad \text{uniformly in } n.$$

If $T$ is a power-bounded operator on a Banach space $X$ such that for every $x \in X$ the sequence $B_n x := \sum_{j=0}^{\infty} a_{n,j} T^j x$ is weakly conditionally compact, then for every $x$ the sequence $B_n x$ converges to a fixed point.
In particular, when all the elements of $A$ above are nonnegative with $\sum_{j=0}^{\infty} a_{n,j} = 1$ for every $n$, then the above convergence holds for every weakly almost periodic $T$.

Eberlein (1984) looked at the $B_n$ in Theorem 53 as invariant integrals, in the sense of Eberlein (1949).
Note that when the regular matrix $A$ is triangular (i.e., $a_{n,j} = 0$ for $j > n$), the series $B_n x := \sum_{j=0}^{\infty} a_{n,j} T^j x$ is a finite sum, so is defined for any operator $T$, and one can consider the convergence of $B_n x$ even for $T$ not power-bounded.
Let $\{w_k\}_{k \ge 0}$ be a sequence of nonnegative numbers with $w_0 > 0$ and $\sum_{k=0}^{\infty} w_k = \infty$, and put $W_n = \sum_{k=0}^{n} w_k$. The matrix $A$ defined by $a_{n,j} = w_j/W_n$ for $j \le n$ and zero for $j > n$ is regular; it yields the weighted averages with weights $\{w_k\}$. Lin et al. (1999) proved:

Theorem 54 Let $\{w_k\}$ be nonnegative with $w_0 > 0$ and $\sum_{k=0}^{\infty} w_k = \infty$. The weighted averages $\frac{1}{W_n} \sum_{k=0}^{n} w_k T^k x \to Ex$ for every mean ergodic $T$ on a Banach space $X$ and $x \in X$ if and only if

$$\frac{1}{W_n} \Big( w_n + \sum_{k=0}^{n-1} |w_{k+1} - w_k| \Big) \to 0 \quad \text{as } n \to \infty.$$

The latter condition is satisfied when $\{w_k\}$ is nondecreasing and $w_n/W_n$ tends to 0.
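The condition of Theorem 54 can be evaluated exactly for simple weights. The sketch below is an illustration under stated assumptions: the weights $w_k = k + 1$ and $w_k = 2^k$ are chosen for the example, the operator is multiplication by $-1$ on the scalars (mean ergodic with $E = 0$), and the function names `theorem54_quantity` and `weighted_average` are hypothetical. For $w_k = k + 1$ the quantity in the theorem tends to 0 and the weighted averages tend to 0; for $w_k = 2^k$ the quantity is identically 1 and the weighted averages oscillate near $\pm 1/3$.

```python
from fractions import Fraction

def theorem54_quantity(w, n):
    """Return (w_n + sum_{k<n} |w_{k+1} - w_k|) / W_n exactly,
    where W_n = w_0 + ... + w_n and w is a function k -> w_k."""
    ws = [Fraction(w(k)) for k in range(n + 1)]
    W_n = sum(ws)
    variation = sum(abs(ws[k + 1] - ws[k]) for k in range(n))
    return (ws[n] + variation) / W_n

def weighted_average(w, lam, n):
    """Return (1/W_n) * sum_{k=0}^{n} w_k * lam**k for an integer scalar lam,
    i.e. the weighted average of the powers of multiplication by lam."""
    ws = [Fraction(w(k)) for k in range(n + 1)]
    W_n = sum(ws)
    return sum(ws[k] * lam ** k for k in range(n + 1)) / W_n

def linear(k):
    return k + 1

def geometric(k):
    return 2 ** k

for n in (10, 100, 1000):
    print(n, float(theorem54_quantity(linear, n)),
          float(weighted_average(linear, -1, n)))

for n in (10, 11, 50, 51):
    print(n, float(theorem54_quantity(geometric, n)),
          float(weighted_average(geometric, -1, n)))
```

Exact rational arithmetic is used so that the rapidly growing weights $2^k$ do not introduce rounding effects.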
For $\alpha > 0$, the Cesàro-$\alpha$ averages of an operator $T$, denoted $C_n^{\alpha}(T)$, are defined as follows: For $\beta > -1$, let $A_0^{\beta} = 1$ and $A_n^{\beta} := (\beta + 1) \cdots (\beta + n)/n!$ for $n > 0$, and let $C_n^{\alpha}(T) = \sum_{j=0}^{n} \big(A_{n-j}^{\alpha - 1}/A_n^{\alpha}\big) T^j$. The corresponding matrix $a_{n,j} := A_{n-j}^{\alpha - 1}/A_n^{\alpha}$ for $0 \le j \le n$ and 0 for $j > n$ is regular and satisfies the conditions of Theorem 53. An operator $T$ on $X$ is called (weakly) $(C, \alpha)$ ergodic if $C_n^{\alpha}(T)x$ converges (weakly) as $n \to \infty$, for every $x \in X$. When $\sup_n \|C_n^{\alpha}(T)\| < \infty$, we say that $T$ is $(C, \alpha)$ bounded. Note that $(C, 1)$ (weak) ergodicity is (weak) mean ergodicity.
Li et al. (2008) proved that for every $0 < \alpha < 1$ there exists a positive $T$ on some $L_1$ such that $T$ is $(C, \beta)$ bounded for $\beta > \alpha$, but is not $(C, \alpha)$ bounded.
Let $T$ satisfy $\limsup_{n \to \infty} \|T^n\|^{1/n} \le 1$ (spectral radius $r(T) \le 1$ when $X$ is over $\mathbb{C}$). Then for $0 < r < 1$, the series $\sum_{k=0}^{\infty} r^k T^k$ converges in operator norm, and we call $A_r = A_r(T) := (1 - r) \sum_{k=0}^{\infty} r^k T^k$ the Abel averages of $T$. For any sequence $0 < r_n \uparrow 1$, the matrix $a_{n,j} = (1 - r_n) r_n^j$ is regular and satisfies the assumptions of Theorem 53. An operator $T$ as above is called Abel bounded if $\sup_{0<r<1} \|A_r(T)\| < \infty$, and Abel ergodic if $Ex := \lim_{r \to 1^-} A_r(T)x$ exists for every $x$; in the latter case $Ex \in F(T)$, and we have $X = F(T) \oplus \overline{(I - T)X}$. If for $T$ as above and $\lambda > 1$, we denote $R(\lambda, T) := \sum_{k=0}^{\infty} \lambda^{-(k+1)} T^k$ (the resolvent at $\lambda$ in the complex case), then we have $\lim_{r \to 1^-} A_r(T)x = \lim_{\lambda \to 1^+} (\lambda - 1) R(\lambda, T)x$.
Hille (1945) studied $(C, \alpha)$ and Abel averaging of general sequences in a Banach space. The following is his application of his general results.

Theorem 55 Let $\alpha > 0$. If $T$ is a $(C, \alpha)$ ergodic operator on a Banach space $X$, then $\|T^n x\|/n^{\alpha}$ converges to zero for any $x \in X$, and $T$ is Abel ergodic, with $\lim_{r \to 1^-} A_r(T)x = \lim_{n \to \infty} C_n^{\alpha}(T)x$.
If $T$ is power-bounded and Abel ergodic, then it is $(C, \alpha)$ ergodic.
If $T$ is $(C, \alpha)$ bounded and Abel ergodic, it is $(C, \beta)$ ergodic for any $\beta > \alpha$.
Analogous results for $C_0$-semigroups were proved by Kendall and Reuter (1956), using the abstract ergodic theorems of Eberlein (1949).
Hille (1945) applied his results to the operator norm topology. The converse of the first statement in Theorem 55 was proved for the operator norm topology by Badiozzaman and Thorpe (1992) and reproved by Ed-dari (2003); put together, the following was obtained (proved by Yoshimoto (1998) for $0 < \alpha \le 1$).

Theorem 56 Let $\alpha > 0$. The operator $T$ is uniformly $(C, \alpha)$ ergodic if and only if it is uniformly Abel ergodic and $n^{-\alpha} \|T^n\| \to 0$.
An analogous result for the strong operator topology was proved in Ed-dari (2004).
Lin et al. (2015) proved that $T$ is uniformly Abel ergodic if and only if for some (all) $0 < r < 1$, $(A_r(T))^n$ converges uniformly as $n \to \infty$; Kozitsky et al. (2013) gave a spectral condition for this property.
Li et al. (2008) showed that in any infinite-dimensional Banach space, Abel boundedness does not imply Cesàro boundedness. However, Émilion (1985) proved that any Abel bounded positive operator on a Banach lattice is Cesàro bounded, and when the lattice is reflexive it is mean ergodic.
Improving the extension of Gelfand’s theorem by Mbekhta and Zemánek (1993), Grobler and Huijsmans (1995) proved that if $\sigma(T) = \{1\}$ and both $T$ and $T^{-1}$ are Abel bounded, then $T = I$.
A regular matrix $A$ is called uniformly regular if $\lim_{n \to \infty} \sup_j |a_{n,j}| = 0$. Fong and Sucheston (1974), extending an earlier result of Hanson and Pledger (1969), proved the following extension of the Blum-Hanson Theorem 37.

Theorem 57 Let $T$ be a contraction on a (real or complex) Hilbert space $\mathcal{H}$ and let $x \in \mathcal{H}$. Then $T^n x$ converges weakly if and only if for every uniformly regular matrix $A$ the averages $B_n x = \sum_{j=0}^{\infty} a_{n,j} T^j x$ converge in norm.
They also proved that when $T$ is a contraction of $L_1$, the convergence of $T^n$ in the weak operator topology is equivalent to the convergence in the strong operator topology of $\sum_{j=0}^{\infty} a_{n,j} T^j$ for every uniformly regular matrix $A$. Akcoglu and Sucheston (1975) proved the same equivalence
for positive contractions of $L_p$, $1 < p < \infty$. Lin and Weber (2007) used Theorem 57 to obtain:

Theorem 58 Let $w_j \ge 0$ with $w_0 > 0$ and $W_n := \sum_{j=0}^{n} w_j \to \infty$. Then for every contraction $T$ on a Hilbert space with $T^n \to 0$ weakly, the weighted averages $\frac{1}{W_n} \sum_{j=0}^{n} w_j T^j$ converge strongly if and only if $\frac{1}{W_n^2} \sum_{j=0}^{n} w_j^2 \to 0$.
Dunford and Schwartz (1956) extended the pointwise ergodic theorem, proving that if $T$ is a contraction of $L_1(\mu)$ of a $\sigma$-finite measure, such that $\|Tf\|_\infty \le \|f\|_\infty$ for $f \in L_1 \cap L_\infty$, then for every $f \in L_1$ the averages $\frac{1}{n} \sum_{k=0}^{n-1} T^k f$ converge a.e. Akcoglu (1975) proved that if $T$ is a positive contraction of $L_p(\mu)$, $1 < p < \infty$, then $\frac{1}{n} \sum_{k=0}^{n-1} T^k f$ converges a.e. for every $f \in L_p$. By classical summability theory for numerical sequences (extended in Hille (1945) to vector sequences), in both the above theorems $\lim_{r \to 1^-} A_r(T)f$ exists a.e. This raises the question if, when $\mu$ is finite, $C_n^{\alpha}(T)f$, $0 < \alpha < 1$, converges a.e. for every $f \in L_p$, in either theorem. Irmisch (1980) proved that in Akcoglu’s theorem, the convergence holds when $\alpha > 1/p$, but may fail when $\alpha = 1/p$. Déniel (1989) proved Irmisch’s result for the Koopman operator of a probability-preserving transformation and gave an example that even for such a Koopman operator the a.e. convergence may fail when $\alpha = 1/p$; the norm convergence holds for every $0 < \alpha < 1$ by Theorem 55.

Modulated Ergodic Theorems

Let $T$ be a weakly almost periodic operator on a complex Banach space. Then for any $\lambda \in \mathbb{T}$ the operator $\lambda T$ is also weakly almost periodic, so by Theorem 7 $\frac{1}{n} \sum_{k=0}^{n-1} \lambda^k T^k$ converges in the strong operator topology. It follows that if $a := \{a_k\}_{k \ge 0} \in \ell_\infty$ can be approximated uniformly in $k$ by sequences $p = \{p(k)\}$ of the form $p(k) := \sum_{j=1}^{m} c_j \lambda_j^k$ with $c_j \in \mathbb{C}$ and $\lambda_j \in \mathbb{T}$ (called trigonometric polynomial sequences), then the averages $\frac{1}{n} \sum_{k=0}^{n} a_k T^k$ converge in the strong operator topology.
Let $T$ be the Koopman operator of a probability-preserving $\theta$ on $(\Omega, \Sigma, \mu)$, and fix $\lambda \in \mathbb{T}$. By looking at the product of $\theta$ with the rotation by $\lambda$, acting on $(\Omega \times \mathbb{T})$, it can be deduced from Birkhoff’s theorem that $\frac{1}{n} \sum_{k=0}^{n-1} \lambda^k T^k f$ converges a.e. for any $f \in L_1(\mu)$. Wiener and Wintner (1941) proved that when $\theta$ is ergodic, for $f \in L_1(\mu)$ there is a null set $A \in \Sigma$ such that for $\omega \notin A$, $\frac{1}{n} \sum_{k=0}^{n-1} \lambda^k T^k f(\omega)$ converges for every $\lambda \in \mathbb{T}$. The Wiener-Wintner theorem follows by applying the extension by Rudolph (1994) of the return times theorem to all rotations of $\mathbb{T}$; see the discussion following Proposition 65. See also Assani (2003). The Wiener-Wintner theorem yields that if $a \in \ell_\infty$ can be uniformly approximated by trigonometric polynomials, then for every Koopman operator $T$ and every $f \in L_1(\mu)$ the averages $\frac{1}{n} \sum_{k=0}^{n} a_k T^k f$ converge a.e.
Ryll-Nardzewski (1975) noted that if $k = \{k_j\}_{j \ge 0}$ is an increasing sequence in $\mathbb{N}$ with $\lim_n k_n/n = d > 0$, then averaging along $k$ is equivalent to the modulated average with the zero-one weights $a_m = 1$ if and only if $m \in \{k_j\}$.
For the study of sequences $a = \{a_k\}_{k \ge 0}$ such that the modulated averages $\frac{1}{n} \sum_{k=0}^{n} a_k T^k f$ converge almost everywhere for every Koopman operator of an ergodic mpt and every $f \in L_p$ (some fixed $1 \le p < \infty$), we refer the reader to Bellow and Losert (1985), Bourgain et al. (1989), Rudolph (1994), and to the book of Assani (2003). For some recent results, see Fan (2019).
Let $T$ be mean ergodic on a complex Banach space. A complex sequence $a := \{a_k\}_{k \ge 0}$ will be called a good modulating sequence for $T$ if the modulated averages $M_n(T, a) := \frac{1}{n} \sum_{k=0}^{n-1} a_k T^k$ converge in the strong operator topology; the convergence is called a modulated ergodic theorem (the $a_k$ need not be positive, so we avoid the often used terminology of “weighted” ergodic theorems). If $a$ is a good modulating sequence for the $L_2$-Koopman operators of all rotations of $\mathbb{T}$, then we must have that $c(\lambda) := \lim_n \frac{1}{n} \sum_{k=0}^{n-1} a_k \lambda^k$ exists for every $\lambda \in \mathbb{T}$; such a sequence is called a Hartman sequence, and by Kahane (1961) the set $\{\lambda : c(\lambda) \ne 0\}$ is at most countable. Berend et al. (2002) showed that a second necessary condition for modulating all rotations is that $\sup_n \sup_\lambda \big| \frac{1}{n} \sum_{k=0}^{n-1} a_k \lambda^k \big| < \infty$, and these two conditions together imply that $a$ is a good
modulating sequence for every contraction $T$ of a complex Hilbert space. The limit was identified in Lin et al. (1999):

$$\lim_{n \to \infty} \frac{1}{n} \sum_{k=0}^{n-1} a_k T^k x = \sum_{|\lambda| = 1} c(\lambda) E(\lambda, T)x, \quad x \in \mathcal{H},$$

where $E(\lambda, T)$ is the orthogonal projection on the eigenspace of $\lambda$ (note that also $\{\lambda : E(\lambda, T)x \ne 0\}$ is countable).
For $1 \le p < \infty$, denote $W_p := \big\{ a : \|a\|_{W_p} := \limsup_n \big( \frac{1}{n} \sum_{k=0}^{n-1} |a_k|^p \big)^{1/p} < \infty \big\}$. Then $\|a\|_{W_p}$ is a seminorm, and $W_p$ is complete. We put $W_\infty := \ell_\infty$. Çömez et al. (1998) proved:

Theorem 59 Let $a \in W_p$, $1 < p < \infty$, be Hartman. Then $a$ is a good modulating sequence for every weakly almost periodic operator on a complex Banach space.
By Berend et al. (2002), Theorem 59 may fail when $a \in W_1$. However, the necessity of $a \in W_1$ follows from the next result, due to Lin et al. (1999).

Theorem 60 A sequence $a$ is a good modulating sequence for every almost periodic operator on a complex Banach space if and only if it is Hartman and $a \in W_1$.
Eisner and Lin (2018) proved the following.

Theorem 61 Let $a \in W_1$. Then for every power-bounded $T$ on a Banach space $X$ and $x \in X$ with $T^n x \to 0$ (weakly), we have $\frac{1}{n} \sum_{k=0}^{n-1} a_k T^k x \to 0$ (weakly).
By Berend et al. (2002) there exists a Hartman sequence $a \notin W_1$ which satisfies $\sup_n \sup_\lambda \big| \frac{1}{n} \sum_{k=0}^{n-1} a_k \lambda^k \big| < \infty$, so this $a$ is a good modulating sequence for all contractions in a Hilbert space, but not for all almost periodic operators. Eisner and Lin (2018) proved:

Proposition 62 Let $a \in W_1$. For every contraction $T$ on a Hilbert space $\mathcal{H}$ and $x \in \mathcal{H}$ with $T^n x \to 0$ weakly, we have $\frac{1}{n} \sum_{k=1}^{n} a_k T^k x \to 0$ if and only if $a_n = o(n)$.
For mean ergodic operators, Lin et al. (1999) showed:

Theorem 63 A sequence $a = \{a_k\}$ is a good modulating sequence for every mean ergodic power-bounded operator if and only if it is Hartman, $a \in W_1$, and $\frac{1}{n} \sum_{k=0}^{n-1} |a_{k+1} - a_k| \to 0$.
A bounded sequence $a$ is called (weakly) almost periodic if its orbit in $\ell_\infty$ under the shift $S(\{b_k\}) := \{b_{k+1}\}$ is conditionally (weakly) compact. Then the restriction of $S$ to $\overline{\operatorname{span}}\{S^k a\}$ is (weakly) almost periodic. Eisner (2013) used the Jacobs-deLeeuw-Glicksberg decomposition (Theorem 29) to obtain the following.

Theorem 64 Let $S$ be a weakly almost periodic operator on a complex Banach space $Y$, and fix $y \in Y$ and $f \in Y^*$. Then the sequence $a := \{f(S^k y)\}_{k \ge 0}$ is Hartman (and obviously bounded). Hence a weakly almost periodic sequence is Hartman.
Lin et al. (1999) used the Wiener-Wintner theorem to generate Hartman sequences.

Proposition 65 Let $\tau$ be an ergodic probability-preserving transformation on $(\Omega', \Sigma', \nu)$, and fix $g \in L_p(\nu)$, $1 \le p \le \infty$. Then for a.e. $\omega' \in \Omega'$ the sequence $a := \{g(\tau^k \omega')\}_{k \ge 0}$ is Hartman and in $W_p$.
By Berend et al. (2002), the Hartman sequences $a \in W_p$ defined in Proposition 65 are good modulating sequences for every contraction on a Hilbert space.
Rudolph (1994) extended Bourgain’s return times theorem, showing that for $p \ge 1$, the Hartman sequences $a \in W_p$ defined in Proposition 65 yield a.e. convergence of $\frac{1}{n} \sum_{k=0}^{n-1} a_k f(\theta^k \omega)$ for every probability-preserving $\theta$ on $(\Omega, \Sigma, \mu)$ and $f \in L_q(\mu)$, $q = p/(p - 1)$. Demeter et al. (2008) proved that when $p > 1$, the above sequences $a \in W_p$ yield a.e. convergence of $\frac{1}{n} \sum_{k=0}^{n-1} a_k f(\theta^k \omega)$ for every probability preserving $\theta$ on $(\Omega, \Sigma, \mu)$ and $f \in L_2(\mu)$; Demeter (2012) refined it to $f \in L_q$, $\frac{1}{q} < \frac{3}{2} - \frac{1}{p}$.
A sequence $a$ which for some $1 \le p < \infty$ is in the $W_p$ closure of the trigonometric polynomial sequences is called a p-Besicovitch sequence. The
ness was studied for many years, until Lyubich (1999) and Nagy and Zemánek (1999) proved (independently) the following.

with $c_n \ge 0$ and $\sum_{n \ge 0} c_n = 1$, is Ritt.
The Kreiss condition. Kreiss (1962) presented the following resolvent condition (Kreiss
resolvent condition) for $T$ on a complex Banach space with $r(T) \le 1$:

$$\|R(\lambda, T)\| \le \frac{C}{|\lambda| - 1} \quad \text{for } |\lambda| > 1. \qquad (12)$$

Ritt’s condition (11) clearly implies (12). Operators satisfying (12) are often called Kreiss bounded operators. Power-bounded operators satisfy (12). Kreiss proved that in finite-dimensional spaces (12) implies power-boundedness. Lubich and Nevanlinna (1991) proved that (12) implies $\|T^n\| = O(n)$; by Shields (1978) or Nevanlinna (2001), this is the best estimate. However, Nevanlinna (2001) showed that if $T$ satisfies (12) and its peripheral spectrum $\sigma(T) \cap \mathbb{T}$ has arc-length (Lebesgue) measure zero, then $\|T^n\| = o(n)$. Independently, Cohen et al. (2020) and Bonilla and Müller (2021) proved that in Hilbert spaces, (12) implies $\|T^n\| = O(n/\sqrt{\log n})$. Cuny (2020) proved that a Kreiss bounded $T$ on $L_p(\Omega, \mu)$, $1 < p < \infty$, satisfies $\|T^n\| = O(n/\sqrt{\log n})$; he obtained the estimate $\|T^n\| = O(n/(\log n)^{1/s})$, for some $s \ge 2$, when $T$ is Kreiss bounded, defined on a space in a subclass of the spaces which are uniformly convex in an equivalent norm (i.e., UMD spaces, see Cuny (2020)).
Strikwerda and Wade (1997) proved that (12) is equivalent to $\sup_n \sup_{|\gamma| = 1} \|C_n^{(2)}(\gamma T)\| < \infty$, where $C_n^{(2)}$ is the Cesàro average of order 2. This characterization was extended by Aleman and Suciu (2016), who proved that $T$ satisfies (12) if and only if for some integer $r \ge 2$ we have $\sup_n \sup_{|\gamma| = 1} \|C_n^{(r)}(\gamma T)\| < \infty$. Gomilko and Zemánek (2008) proved that $T$ satisfies (12) if and only if $T^m$ satisfies (12) for some (all) $m > 1$. Suciu and Zemánek (2013) proved that if $T$ satisfies (12), then $C_n^{(2)}(T)x$ converges strongly for $x \in F(T) \oplus \overline{(I - T)X}$; if in addition $\sigma(T) \cap \mathbb{T} = \{1\}$, then $\|M_{n+1}(T) - M_n(T)\| \to 0$. Abadias and Bonilla (2019) proved that if $T$ satisfies (12), then for every $\alpha \ge 2$ we have $\|C_{n+1}^{(\alpha)}(T) - C_n^{(\alpha)}(T)\| \to 0$ (with no assumption on the spectrum). A special case of Suciu (2016) is that when $\sup_n \|C_n^{(2)}(T)\| < \infty$, then $\|R(\lambda, T)\| \le C|\lambda - 1|^2/(|\lambda| - 1)^3$, $|\lambda| > 1$.
Earlier, Kreiss had given a resolvent condition for the generator of a $C_0$-semigroup, inspired by the Hille-Yosida theorem, which in finite-dimensional spaces yields boundedness of the semigroup; however, in contrast to the discrete time, Eisner and Zwart (2006) constructed a $C_0$-semigroup with exponential growth whose generator satisfies Kreiss’ condition.
Van Casteren (1980) proved that if $T$ is power-bounded invertible on a complex Hilbert space with $\sigma(T) \subset \mathbb{T}$, and $T^{-1}$ satisfies (12) (which is equivalent to condition (ii) in van Casteren’s theorem), then also $T^{-1}$ is power-bounded. This extended results of Gokhberg and Krein (1967) and of Stampfli (1972).
McCarthy (1971) gave an example of $T$ invertible on $\ell_2(\mathbb{Z})$ which satisfies the stronger condition (strong Kreiss resolvent condition, sometimes called iterated Kreiss condition):

$$\|R^k(\lambda, T)\| \le \frac{C}{(|\lambda| - 1)^k} \quad \text{whenever } |\lambda| > 1, \ k = 1, 2, \ldots \qquad (13)$$

but is not power-bounded; in the example also $T^{-1}$ satisfies (13). By McCarthy (1971), condition (13) implies that $\|T^n\| = O(\sqrt{n})$. This estimate was proved also by Lubich and Nevanlinna (1991), who showed it is the best possible in general Banach spaces. Cohen et al. (2020) showed that in Hilbert space (13) implies $\|T^n\| = O((\log n)^{\kappa})$ for some $\kappa > 0$ (which depends on $T$). Lyubich (2010) obtained a family of examples in $L_p[0, 1]$ satisfying (12) but not (13). Nevanlinna (1997, 2001) proved that $T$ satisfies (13) if and only if for some $M$ we have

$$\|e^{zT}\| \le M e^{|z|} \quad \forall z \in \mathbb{C}. \qquad (14)$$

Gomilko and Zemánek (2013) proved that $T$ satisfies (13) if and only if $T^m$ satisfies it for some (all) integers $m > 1$.
Montes-Rodríguez et al. (2005) defined the uniform Kreiss resolvent condition by
$$\sup_{n \ge 1} \Big\| \sum_{k=0}^{n} \frac{T^k}{\lambda^{k+1}} \Big\| \le \frac{C}{|\lambda| - 1} \quad \text{whenever } |\lambda| > 1. \qquad (15)$$

They showed that (15) does not imply (13) and proved that (15) holds if and only if there exists $C > 0$ such that

$$\sup_n \Big\| \frac{1}{n} \sum_{k=1}^{n} (\gamma T)^k \Big\| \le C \quad \forall |\gamma| = 1. \qquad (16)$$

The proof that (15) implies (12) is immediate. Strikwerda and Wade (1997) showed that (12) does not imply (16). Gomilko and Zemánek (2008) proved that (13) implies (15), hence (16); together with the McCarthy-Lubich-Nevanlinna estimate $\|T^n\| = O(\sqrt{n})$ under (13), Proposition 9 yields:

Theorem 69 Let $T$ on a reflexive Banach space satisfy the strong Kreiss resolvent condition (13). Then $\gamma T$ is mean ergodic for every $\gamma \in \mathbb{T}$.
If $T$ is power-bounded, then (13) holds (in an equivalent norm, $T$ is a contraction and $C = 1$ in (12)). Bermúdez et al. (2020) proved

Theorem 70 Let $T$ on a Hilbert space satisfy the uniform Kreiss resolvent condition (15). Then $\|T^n\| = o(n)$. Hence $\gamma T$ is mean ergodic for every $\gamma \in \mathbb{T}$.
Theorem 70 was extended by Cuny (2020) to the reflexive $L_p$ spaces (in fact, to the above mentioned UMD spaces). Bonilla and Müller (2021) obtained examples of mean ergodic operators on a complex Hilbert space which do not satisfy the Kreiss resolvent condition (12). Bermúdez et al. (2020) used a strict strengthening of (16) for the following.

Theorem 71 Let $T$ on a Banach space satisfy $\sup_n \frac{1}{n} \sum_{k=0}^{n-1} \|T^k x\| \le C\|x\|$ for some $C > 0$. Then $\|T^n\| = o(n)$. When $X$ is reflexive, $\gamma T$ is mean

boundedness) and the strong Kreiss resolvent condition (13) are independent. They proved also that if $T$ is absolutely Cesàro bounded, then $\|T^n\| = O(n^{1 - \varepsilon})$ for some $\varepsilon > 0$ (which depends on $T$). Cuny (2020) proved that when the Banach space $X$ has type $1 < p \le 2$ (e.g., $L_r$ spaces, $1 < r < \infty$, and more generally spaces which are uniformly convex in an equivalent norm), an absolutely Cesàro bounded $T$ satisfies $\|T^n\| = O(n^{1/p})$.
Cohen et al. (2020) proved that a positive Cesàro bounded operator on a complex Banach lattice satisfies the uniform Kreiss resolvent condition.

Continuous Time (C0-semigroups)

Birkhoff and von Neumann were motivated by a problem in statistical mechanics and therefore proved their ergodic theorems in continuous time. Von Neumann (1932) used in his proof the spectral theorem for unitary representations of $\mathbb{R}$.
A $C_0$-semigroup (or one-parameter semigroup, strongly continuous semigroup) is a family $\{T(t)\}_{t \ge 0}$ of bounded linear operators on a Banach space $X$ such that $T(0) = I$, $T(t + s) = T(t)T(s)$ for $t, s \ge 0$, and $T(t)x$ is continuous on $[0, \infty)$ for every $x \in X$. The limit $\omega_0 := \lim_{t \to \infty} \log \|T(t)\|/t < \infty$ (exists by subadditivity) is the type (or growth bound) of the semigroup, and for any $\delta > \omega_0$ there exists $M_\delta$ such that $\|T(t)\| \le M_\delta e^{\delta t}$ for $t \ge 0$. The generator (infinitesimal generator) of a $C_0$-semigroup $\{T(t)\}_{t \ge 0}$ is the operator $Ax := \lim_{t \to 0+} t^{-1}(T(t)x - x)$ with domain $D(A) := \{x \in X : Ax \text{ exists}\}$. For properties of the generator, we refer the reader to Section VIII.1 of the book by Dunford and Schwartz (1958):

(i) The domain of the generator is dense in $X$, and $A$ is a closed operator.
(ii) For $x \in D(A)$ and $t > 0$, $T(t)x \in D(A)$ and $\frac{d}{dt} T(t)x = AT(t)x = T(t)Ax$.
(iv) If Re l > o0, then l r(A), and A C0-semigroup {T(t)}t0 is called mean
1
Rðl, AÞx ¼ 0 elt T ðtÞxdt for every x X. (uniformly) ergodic if the averages
Ms x≔ 1s 0 T ðtÞxdt
s
converge in the strong
Hille (1952) introduced the following abstract (uniform) operator topology, as s ! 1. The
generalization (the abstract Cauchy problem) of limit is a projection on the common fixed points
the (homogenous) initial value problem of partial of {T(t)}t0.
differential equations. Let A be a linear operator
(not necessarily bounded) mapping a subspace D Theorem 72 A bounded C0-semigroup on a
(not necessarily closed) of a complex Banach reflexive Banach space is mean ergodic.
space X into X, and consider the problem of find- This continuous version of Lorch’s Theorem
ing a differentiable function $y(t)$, defined for $t > 0$ with values in $D$, such that $\frac{d}{dt} y(t) = Ay(t)$ for $t > 0$, and $y(t) \to y_0$ as $t \to 0^+$ for a given $y_0$. When $D = X$ and $A$ is bounded, the operator $e^{tA}$ is a well-defined bounded operator, and $y(t) := e^{tA}y_0$ solves the abstract Cauchy problem. Hille noted that if $A$ is the generator of a $C_0$-semigroup $T(t)$, then $y(t) := T(t)y_0$ solves the abstract Cauchy problem when $y_0 \in D(A)$.

Ergodic theorems for a $C_0$-semigroup $\{T(t)\}$ deal with the asymptotic behavior of $T(t)x$ as $t \to \infty$, in different topologies and different modes of convergence. These yield information on the asymptotic behavior of the solutions of the abstract Cauchy problem when $A$ is the generator of a $C_0$-semigroup; from this perspective, it is important to use assumptions only on the generator of the semigroup, which is the given datum of the problem. The book of van Neerven (1996) studies the different spectral properties of the generator (which is usually only an unbounded closed operator), and their connections to the asymptotic behavior of the semigroup. The paper of Rozendaal and Veraar (2018), which uses growth rates of the resolvent of the generator for obtaining growth rates of the semigroup, contains references to papers which appeared after publication of van Neerven's book.

A nonspectral approach is in the book of Emel'yanov (2007), which emphasizes $C_0$-semigroups of positive operators on Banach lattices, in particular $C_0$-semigroups of positive operators on $L_1$ spaces.

Many results on the asymptotic behavior of $C_0$-semigroups have analogues for discrete time, and sometimes the continuous time version was proved first (see Theorems 25, 38, and 46).

(Corollary 8) follows from the abstract ergodic theorems of Eberlein (1949). Hille (see Section 18.6 of Hille and Phillips (1957), Section V.4 of Engel and Nagel (2000)) proved the following.

Theorem 73 Let $\{T(t)\}_{t \ge 0}$ be a mean ergodic $C_0$-semigroup on $X$, with generator $A$. Then $X = \ker(A) \oplus \overline{\mathrm{Range}(A)}$, and $Ex := \lim_{s \to \infty} M_s x$ is the projection on $\ker(A)$ corresponding to this decomposition.

Lin (1974b) proved the continuous analogue of Theorem 23.

Theorem 74 A $C_0$-semigroup with generator $A$ is uniformly ergodic if and only if $\lim_{t \to \infty} t^{-1} \|T(t)\| = 0$, and the range of $A$ is closed.

Krengel and Lin (1984) proved the continuous analogue of Browder's Theorem 14.

Theorem 75 Let $\{T(t)\}_{t \ge 0}$ be a bounded $C_0$-semigroup on a reflexive Banach space, with generator $A$. Then $y$ is in the range of $A$ if and only if $\sup_{s > 0} \left\| \int_0^s T(t)y \, dt \right\| < \infty$.

Eisner (2010) emphasizes the similarities of the discrete and continuous cases and proves in parallel results for discrete time and their analogues for continuous time. However, not all results in the discrete case have analogues in continuous time, e.g., Eisner and Zwart (2006), and some results for semigroups have no analogue in discrete time, e.g., Gerlach (2013), where it is shown that a convergence result for certain $C_0$-semigroups fails in discrete time under the analogous assumptions, due to some periodic behavior.
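As a small numerical illustration of Theorem 73, the following sketch takes X to be three-dimensional real space and the bounded generator A = [[0, -1, 0], [1, 0, 0], [0, 0, 0]], so that T(t) = e^{tA} rotates the first two coordinates by the angle t and fixes the third. The example, the function names, and the midpoint quadrature are assumptions chosen for illustration and are not taken from the text; M_s x is taken to be the usual Cesàro average (1/s) times the integral of T(t)x over [0, s]. For this A, ker(A) is the third coordinate axis and Range(A) is the plane of the first two coordinates, so the averages should approach the projection of x onto the third axis as s grows.

```python
import math


def semigroup(t, x):
    """T(t)x = exp(tA)x for the illustrative generator
    A = [[0, -1, 0], [1, 0, 0], [0, 0, 0]]:
    a rotation by angle t in the first two coordinates, identity on the third."""
    c, s = math.cos(t), math.sin(t)
    return (c * x[0] - s * x[1], s * x[0] + c * x[1], x[2])


def cesaro_mean(s, x, steps=20000):
    """Approximate M_s x = (1/s) * integral_0^s T(t)x dt with the midpoint rule."""
    h = s / steps
    total = [0.0, 0.0, 0.0]
    for k in range(steps):
        y = semigroup((k + 0.5) * h, x)
        for i in range(3):
            total[i] += y[i] * h
    return tuple(v / s for v in total)


def projection_onto_kernel(x):
    """E x for this particular A: keep only the component along ker(A)."""
    return (0.0, 0.0, x[2])


x0 = (1.0, 2.0, 3.0)
for s in (1.0, 10.0, 100.0, 1000.0):
    m = cesaro_mean(s, x0)
    err = max(abs(a - b) for a, b in zip(m, projection_onto_kernel(x0)))
    print(f"s = {s:7.1f}   distance from M_s x to E x = {err:.2e}")
```

In this example the first two components of M_s x can be integrated by hand and are bounded by a constant multiple of 1/s, which is the rate the printed distances should roughly follow; the same finite-dimensional semigroup is also uniformly ergodic in the sense of Theorem 74, since T(t) is bounded and the range of A is closed.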
Bermúdez T, Bonilla A, Müller V, Peris A (2020) Cesàro bounded operators in Banach spaces. J Anal Math 140:187–206
Besicovitch AS (1954) Almost periodic functions. Dover Publications, New York
Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci U S A 17:656–660
Birkhoff G (1939) The mean ergodic theorem. Duke Math J 5:19–20
Blum JR, Eisenberg B (1974) Generalized summing sequences and the mean ergodic theorem. Proc Am Math Soc 42:423–429
Blum JR, Hanson DL (1960) On the mean ergodic theorem for subsequences. Bull Am Math Soc 66:308–311
Bonilla A, Müller V (2021) Kreiss bounded and uniformly Kreiss bounded operators. Rev Mat Complut 34:469–487
Boshernitzan M, Wierdl M (1996) Ergodic theorems along sequences and Hardy fields. Proc Natl Acad Sci U S A 93:8205–8207
Bourgain J (1988a) An approach to pointwise ergodic theorems. In: Geometric aspects of functional analysis (1986/87), Lecture notes in Mathematics 1317. Springer, Berlin, pp 204–223
Bourgain J (1988b) Temps de retour pour les systèmes dynamiques. C R Acad Sci 306(12):483–485
Bourgain J (1989) Pointwise ergodic theorems for arithmetic sets. Inst Hautes Études Sci Publ Math 69:5–45
Bourgain J, Furstenberg H, Katznelson Y, Ornstein DS (1989) On return-time sequences, Appendix to Bourgain (1989). Inst Hautes Études Sci Publ Math No 69:42–45
Browder F (1958) On the iteration of transformations in noncompact minimal dynamical systems. Proc Am Math Soc 9:773–780
Brunel A, Keane M (1969) Ergodic theorems for operator sequences. Z Wahrscheinlichkeitstheorie und Verw Gebiete 12:231–240
Brunel A, Revuz D (1974) Quelques applications probabilistes de la quasi-compacité. Ann Inst H Poincaré Sect B (NS) 10:301–337
Burlando L (1997) A generalization of the uniform ergodic theorem to poles of arbitrary order. Stud Math 122:75–98
Butzer PL, Westphal U (1971) The mean ergodic theorem and saturation. Indiana Univ Math J 20:1163–1174
Butzer PL, Westphal U (1972) Ein Operatorenkalkül für das approximationstheoretische Verhalten des Ergodensatzes im Mittel. In: Linear operators and approximation, International series of numerical mathematics, vol 20. Birkhäuser, Basel, pp 102–114
Campbell JT (1986) Spectral analysis of the ergodic Hilbert transform. Indiana Univ Math J 35:379–390
Chacon RV (1969) Weakly mixing transformations which are not strongly mixing. Proc Am Math Soc 22:559–562
Chen J-C, Shaw S-Y (2009) Growth order and stability of discrete semigroups. Nonlinear Anal 71(12):e2879–e2882
Chill R, Tomilov Y (2007) Stability of operator semigroups: ideas and results. In: Perspectives in operator theory, Banach Center Publications, vol 75. Polish Academy of Science, Warsaw, pp 71–109
Cohen LW (1940) On the mean ergodic theorem. Ann Math 41(2):505–509
Cohen G, Lin M (2016) Remarks on rates of convergence of powers of contractions. J Math Anal Appl 436:1196–1213
Cohen G, Cuny C, Lin M (2010) The one-sided ergodic Hilbert transform in Banach spaces. Stud Math 196:251–263
Cohen G, Cuny C, Lin M (2014) Almost everywhere convergence of powers of some positive Lp contractions. J Math Anal Appl 420:1129–1153
Cohen G, Cuny C, Eisner T, Lin M (2020) Resolvent conditions and growth of powers of operators. J Math Anal Appl 487:124035, 24 p
Çömez D, Lin M, Olsen J (1998) Weighted ergodic theorems for mean ergodic L1-contractions. Trans Am Math Soc 350:101–117
Conze J-P (1973) Convergence des moyennes ergodiques pour des sous-suites. In: Contributions au calcul des probabilités, Bulletin Society Mathematics, France, Mém No. 35, Society Mathematics, France, pp 7–15
Cotlar M (1955) A unified theory of Hilbert transforms and ergodic theorems. Rev Mat Cuyana 1:105–167
Cuny C (2009) On the a.s. convergence of the one-sided ergodic Hilbert transform. Ergodic Theory Dyn Syst 29:1781–1788
Cuny C (2010) Norm convergence of some power series of operators in Lp with applications in ergodic theory. Stud Math 200:1–29
Cuny C (2020) Resolvent conditions and growth of powers of operators on Lp spaces. Pure Appl Funct Anal 5:1025–1038
Cuny C, Weber M (2018) On Nörlund summation and ergodic theory, with applications to power series of Hilbert contractions. Bull Pol Acad Sci Math 66:69–85
Datko R (1970) Extending a theorem of A. M. Liapunov to Hilbert space. J Math Anal Appl 32:610–616
deLaubenfels R, Vũ Q-P (1996) The discrete Hille-Yosida space and the asymptotic behaviour of individual orbits of linear operators. J Funct Anal 142:539–548
deLeeuw K, Glicksberg I (1961) Applications of almost periodic compactifications. Acta Math 105:63–97
Demeter C (2012) Improved range in the return times theorem. Can Math Bull 55:708–722
Demeter C, Lacey MT, Tao T, Thiele C (2008) Breaking the duality in the return times theorem. Duke Math J 143:281–355
Déniel Y (1989) On the a.s. Cesàro-α convergence for stationary or orthogonal random variables. J Theor Probab 2:475–485
Derriennic Y (1976) Lois “zéro ou deux” pour les processus de Markov. Applications aux marches aléatoires. Ann Inst H Poincaré Sect B (NS) 12:111–129
Derriennic Y (2000) On the mean ergodic theorem for Cesàro bounded operators. Colloq Math 84/85(part 2):443–455
Derriennic Y, Krengel U (1981) Subadditive mean ergodic theorems. Ergodic Theory Dyn Syst 1:33–48
Derriennic Y, Lin M (2001) Fractional Poisson equations and ergodic theorems for fractional coboundaries. Israel J Math 123:93–130
Devys O (2012) Localisation spectrale à l’aide des polynômes de Faber et équation de cobord. PhD thesis, Lille. https://pepite-depot.univ-lille.fr/LIBRE/EDSPI/2012/50376-2012-Devys.pdf
Doeblin W (1937) Sur les propriétés asymptotiques de mouvements régis par certains types de chaînes simples. Bull Soc Math Roumaine 39–1:57–115; 39–2 (1938), 3–61
Drissi D, Zemánek J (2000) Gelfand-Hille theorems for Cesàro means. Quaest Math 23:375–381
Dunford N (1943a) Spectral theory, I. Convergence to projections. Trans Am Math Soc 54:185–217
Dunford N (1943b) Spectral theory. Bull Am Math Soc 49:637–651
Dunford N, Schwartz JT (1956) Convergence almost everywhere of operator averages. J Rational Mech Anal 5:129–178
Dungey N (2008) On time regularity and related conditions for power-bounded operators. Proc Lond Math Soc 97(3):97–116
Dungey N (2011) Subordinated discrete semigroups of operators. Trans Am Math Soc 363:1721–1741
Eberlein WF (1949) Abstract ergodic theorems and weak almost periodic functions. Trans Am Math Soc 67:217–240
Eberlein WF (1984) On retrogression in mean ergodic theory. J Approx Theory 42:293–298
Ed-dari E (2003) On the (C,α) uniform ergodic theorem. Stud Math 156:3–13
Ed-dari E (2004) On the (C,α) Cesàro bounded operators. Stud Math 161:163–175
Eisner T (2010) Stability of operators and operator semigroups, Operator theory: advances and applications, vol 209. Birkhäuser, Basel
Eisner T (2013) Linear sequences and weighted ergodic theorems. Abstr Appl Anal:815726, 5 p
Eisner T, Lin M (2018) On modulated ergodic theorems. J Nonlinear Var Anal 2:131–154
Eisner T, Müller V (2021) Power bounded operators and the mean ergodic theorem for subsequences. J Math Anal Appl 493:124523, 25 p
Eisner T, Serény A (2008) Category theorems for stable operators on Hilbert spaces. Acta Sci Math (Szeged) 74:259–270
Eisner T, Zwart H (2006) Continuous-time Kreiss resolvent condition on infinite-dimensional spaces. Math Comput 75(256):1971–1985
El Abdalaoui EH, El Machkouri M, Nogueira A (2010) A criterion of weak mixing property. Séminaires et Congrès 20:105–111
El-Fallah O, Ransford T (2002) Extremal growth of powers of operators satisfying resolvent conditions of Kreiss-Ritt type. J Funct Anal 196:135–154
Emel’yanov EY (1997) Banach lattices on which every power-bounded operator is mean ergodic. Positivity 1:291–295
Émilion R (1985) Mean-bounded operators and mean ergodic theorems. J Funct Anal 61:1–14
Esterle J (1983) Quasimultipliers, representations of $H^\infty$, and the closed ideal problem for commutative Banach algebras. In: Radical Banach algebras and automatic continuity, Lecture Notes in Mathematics 975. Springer, Berlin, pp 66–162
Esterle J, Strouse E, Zouakia F (1990) Theorems of Katznelson-Tzafriri type for contractions. J Funct Anal 94:273–287
Esterle J, Strouse E, Zouakia F (1992) Stabilité asymptotique de certains semi-groupes d’opérateurs et idéaux primaires de L1(R+). J Operator Theory 28:203–227
Fan A-H (2019) Weighted Birkhoff ergodic theorem with oscillating weights. Ergodic Theory Dyn Syst 39:1275–1289
Farkas B, Kreidler H (2021) Relative compactness of orbits and geometry of Banach spaces. J Math Anal Appl 495:124660
Foguel S (1963) Powers of a contraction in Hilbert space. Pac J Math 13:551–562
Foguel S (1976) More on the “zero-two” law. Proc Am Math Soc 61:262–264
Foguel S, Weiss B (1973) On convex power series of a conservative Markov operator. Proc Am Math Soc 38:325–330
Foiaş C (1964) Sur les mesures spectrales qui interviennent dans la théorie ergodique. J Math Mech 13:639–658
Fonf VP, Lin M, Rubinov A (1996) On the uniform ergodic theorem in Banach spaces that do not contain duals. Stud Math 121:67–85
Fonf VP, Lin M, Wojtaszczyk P (2001) Ergodic characterizations of reflexivity of Banach spaces. J Funct Anal 187:146–162
Fonf VP, Lin M, Wojtaszczyk P (2010) A non-reflexive Banach space with all contractions mean ergodic. Israel J Math 179:479–491
Fonf VP, Lin M, Wojtaszczyk P (2011) Poisson’s equation and characterizations of reflexivity of Banach spaces. Colloq Math 124:225–235
Fong H, Sucheston L (1974) On a mixing property of operators in Lp spaces. Z Wahrscheinlichkeitstheorie und Verw Gebiete 28:165–171
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton University Press, Princeton
Gaposhkin VF (1998) On the rate of decrease of the probabilities of deviations for means of stationary processes. Mat Zametki 64(3):366–372; English translation in Math Notes 64 (1998), 316–321
Gelfand I (1941) Zur Theorie der Charaktere der Abelschen topologischen Gruppen. Mat Sbornik 9:49–50
Gerlach M (2013) On the peripheral point spectrum and the asymptotic behavior of irreducible semigroups of Harris operators. Positivity 17:875–898
Gerlach M, Glück J (2019) Mean ergodicity vs weak almost periodicity. Stud Math 248:45–56
Glück J (2015) On weak decay rates and uniform stability of bounded linear operators. Arch Math 104:347–356
Gokhberg I, Krein MG (1967) Description of contraction operators which are similar to unitary operators. Funct Anal Appl 1:33–52
Goldstein JA (1993) Extremal properties of contraction semigroups on Hilbert and Banach spaces. Bull Lond Math Soc 25:369–376
Goldstein JA, Nagy B (1995) An extremal property of contraction semigroups in Banach spaces. Ill J Math 39:441–449
Gomilko A, Tomilov Y (2018) On discrete subordination of power bounded and Ritt operators. Indiana Univ Math J 67:781–829
Gomilko A, Zemánek J (2008) On the uniform Kreiss resolvent condition. Funct Anal Appl 42:230–233
Gomilko A, Zemánek J (2013) On the strong Kreiss resolvent condition. Complex Anal Oper Theory 7:421–435
Gomilko A, Haase M, Tomilov Y (2011) On rates in mean ergodic theorems. Math Res Lett 18:201–213
Gottschalk W, Hedlund GA (1955) Topological dynamics, American Mathematical Society Colloquium Publications, vol 36. American Mathematical Society, Providence
Grabiner S, Zemánek J (2002) Ascent, descent, and ergodic properties of linear operators. J Operator Theory 48:69–81
Grivaux S (2019) The Blum-Hanson property. Concr Oper 6:92–105
Grobler JJ, Huijsmans CB (1995) Doubly Abel bounded operators with single spectrum. Quaest Math 18:397–406
Groh U (1983) On the peripheral spectrum of uniformly ergodic positive operators on C*-algebras. J Operator Theory 10:31–37
Haase M, Tomilov Y (2010) Domain characterizations of certain functions of power-bounded operators. Stud Math 196:265–288
Halmos PR (1944) In general a measure preserving transformation is mixing. Ann Math 45(2):786–792
Halmos PR (1949a) Measurable transformations. Bull Am Math Soc 55:1015–1034
Halmos PR (1949b) A nonhomogeneous ergodic theorem. Trans Am Math Soc 66:284–288
Hanson DL, Pledger G (1969) On the mean ergodic theorem for weighted averages. Z Wahrscheinlichkeitstheorie und Verw Gebiete 13:141–149
Hardy GH, Littlewood JE (1913) Sur la série de Fourier d’une fonction à carré sommable. C R Acad Sci Paris 156:1307–1309
Hiai F (1978) Weakly mixing properties of semigroups of linear operators. Kodai Math J 1:376–393
Hille E (1945) Remarks on ergodic theorems. Trans Am Math Soc 57:246–269
Hille E (1952) Une généralisation du problème de Cauchy. Ann Inst Fourier (Grenoble) 4:31–48
Hopf E (1932) Proof of Gibbs’ hypothesis on the tendency toward statistical equilibrium. Proc Natl Acad Sci U S A 18:333–340
Hopf E (1937) Ergodentheorie, Ergebnisse der Mathematik und ihrer Grenzgebiete 5, No. 2. Julius Springer
Horowitz S (1968) Some limit theorems for Markov processes. Israel J Math 6:107–118
Horowitz S (1969) Strong ergodic theorems for Markov processes. Proc Am Math Soc 23:328–334
Huang SZ (1995) Stability properties characterizing the spectra of operators on Banach spaces. J Funct Anal 132:361–382
Ionescu Tulcea A (1963) Random series and spectra of measure-preserving transformations. In: Ergodic theory, Proceedings of the International Symposium, Tulane University, New Orleans. Academic Press, Boston, pp 273–292
Irmisch R (1980) Punktweise Ergodensätze für (c,α)-Verfahren, 0<α<1. PhD thesis, Darmstadt
Jacobs K (1957) Fastperiodizitätseigenschaften allgemeiner Halbgruppen in Banach-Räumen. Math Z 67:83–92
Jamison B (1965) Eigenvalues of modulus 1. Proc Am Math Soc 16:375–377
Jones L (1971) A mean ergodic theorem for weakly mixing operators. Adv Math 7:211–216
Jones L, Kuftinec V (1971) A note on the Blum-Hanson theorem. Proc Am Math Soc 30:202–203
Jones L, Lin M (1976) Ergodic theorems of weak mixing type. Proc Am Math Soc 57:50–52
Jones L, Lin M (1980) Unimodular eigenvalues and weak mixing. J Funct Anal 35:42–48
Jones RL, Ostrovskii IV, Rosenblatt JM (1996) Square functions in ergodic theory. Ergodic Theory Dyn Syst 16:267–305
Jones RL, Kaufman R, Rosenblatt JM, Wierdl M (1998) Oscillation in ergodic theory. Ergodic Theory Dyn Syst 18:889–935
Kachurovskiǐ AG, Podvigin IV (2016) Estimates of the rate of convergence in the von Neumann and Birkhoff ergodic theorems. Trans Moscow Math Soc:1–53
Kachurovskiǐ AG, Sedalishchev VV (2010) On the constants in the estimates for the rate of convergence in von Neumann’s ergodic theorem. Mat Zametki 87:756–763 (Russian; English translation in Math Notes 87 (2010), 720–727)
Kahane J-P (1961) Sur les coefficients de Fourier-Bohr. Stud Math 21:103–106
Kakutani S (1938) Iteration of linear operations in complex Banach spaces. Proc Imp Acad Tokyo 14:295–300
Kakutani S (1950) Ergodic theory. In: Proceedings of the 1950 International Congress of Mathematicians, Cambridge, pp 128–142
Kakutani S, Petersen K (1981) The speed of convergence in the ergodic theorem. Monatsh Math 91:11–18
Kalton N, Montgomery-Smith S, Oleszkiewicz K, Tomilov Y (2004) Power-bounded operators and related norm estimates. J Lond Math Soc 70:463–478
Katznelson Y, Tzafriri L (1986) On power bounded operators. J Funct Anal 68:313–328
Kendall DG, Reuter GEH (1956) Some ergodic theorems for one-parameter semigroups of operators. Philos Trans R Soc Lond Ser A 249:151–177
Kérchy L, van Neerven J (1997) Polynomially bounded operators whose spectrum on the unit circle has measure zero. Acta Sci Math (Szeged) 63:551–562
Kohlenbach U, Leuştean L (2009) A quantitative mean ergodic theorem for uniformly convex Banach spaces. Ergodic Theory Dyn Syst 29:1907–1915. Erratum: p 1995
Koliha JJ (1974) Power convergence and pseudoinverses of operators in Banach spaces. J Math Anal Appl 48:446–469
Komorník J (1993) Asymptotic periodicity of Markov and related operators. In: Dynamics reported, Dynamical report expositions dynamics systems (N.S.), 2, Springer, Berlin, pp 31–68
Koopman BO, von Neumann J (1932) Dynamical systems of continuous spectra. Proc Natl Acad Sci U S A 18:255–263
Kornfeld I, Kosek W (2003) Positive L1 operators associated with nonsingular mappings and an example of E. Hille. Colloq Math 98:63–77
Kornfeld I, Lin M (1997) Coboundaries of irreducible Markov operators on C(K). Israel J Math 97:189–202
Kosek W (2011) Example of a mean ergodic L1 operator with the linear rate of growth. Colloq Math 124(1):15–22
Kozitsky Y, Shoikhet D, Zemánek J (2013) Power convergence of Abel averages. Arch Math 100:539–549
Kozma G, Lev N (2011) Exponential Riesz bases, discrepancy of irrational rotations and BMO. J Fourier Anal Appl 17:879–898
Krein MG, Šmulian V (1940) On regularly convex sets in the space conjugate to a Banach space. Ann Math 41(2):556–583
Kreiss H-O (1962) Über die Stabilitätsdefinition für Differenzengleichungen die partielle Differentialgleichungen approximieren. BIT 2:153–181
Krengel U (1971) On the individual ergodic theorem for subsequences. Ann Math Statist 42:1091–1095
Krengel U (1972) Weakly wandering vectors and weakly independent partitions. Trans Am Math Soc 164:199–226
Krengel U (1978) On the speed of convergence in the ergodic theorem. Monatsh Math 86:3–6
Krengel U, Lin M (1984) On the range of the generator of a Markovian semigroup. Math Z 185:553–565
Krotkov V, Halperin I (1953) The ergodic theorem for Banach spaces with convex-compactness. Trans R Soc Canada Sect III 47:17–20
Kryloff N, Bogoliouboff N (1937) Sur les propriétés en chaîne. C R Acad Sci Paris 204:1386–1388
Kunszenti-Kovács D (2015) Almost weak polynomial stability of operators. Houst J Math 41:901–913
Kunszenti-Kovács D, Nittka R, Sauter M (2011) On the limits of Cesàro means of polynomial powers. Math Z 268:771–776
Lasota A, Li TY, Yorke JA (1984) Asymptotic periodicity of the iterates of Markov operators. Trans Am Math Soc 286:751–764
Le Merdy C, Xu Q (2012) Strong q-variation inequalities for analytic semigroups. Ann Inst Fourier 62:2069–2097
Lefèvre P, Matheron É (2016) The Blum-Hanson property for C(K) spaces. Pac J Math 282:203–212
Lefèvre P, Matheron É, Primot A (2016) Smoothness, asymptotic smoothness and the Blum-Hanson property. Israel J Math 211:271–309
Léka Z (2009) A Katznelson-Tzafriri type theorem in Hilbert spaces. Proc Am Math Soc 137:3763–3768
Léka Z (2010) A note on the powers of Cesàro bounded operators. Czechoslov Math J 60(135):1091–1100
Leonov VP (1961) On the dispersion of time-dependent means of a stationary stochastic process. Teor Veroyatn Primen 6:93–101, in Russian; English transl in Theory Probab Appl 6 (1961), 87–93
Li Y-C, Satō R, Shaw S-Y (2008) Boundedness and growth orders of means of discrete and continuous semigroups of operators. Stud Math 187:1–35
Lin M (1974a) On the uniform ergodic theorem. Proc Am Math Soc 43:337–340
Lin M (1974b) On the uniform ergodic theorem II. Proc Am Math Soc 46:217–225
Lin M (1978) Quasi-compactness and uniform ergodicity of positive operators. Israel J Math 29:309–311
Lin M (1998) The uniform zero-two law for positive operators in Banach lattices. Stud Math 131:149–153
Lin M, Sine R (1983) Ergodic theory and the functional equation $(I - T)x = y$. J Operator Theory 10:153–166
Lin M, Suciu L (2015) Poisson’s equation for mean ergodic operators. In: Infinite products of operators and their applications, Contemporary mathematics 636, Proceedings of the American Mathematical Society, Providence, pp 141–148
Lin M, Weber M (2007) Weighted ergodic theorems and strong laws of large numbers. Ergodic Theory Dyn Syst 27:511–543
Lin M, Olsen J, Tempelman A (1999) On modulated ergodic theorems for Dunford-Schwartz operators. Ill J Math 43:542–567
Lin M, Shoikhet D, Suciu L (2015) Remarks on uniform ergodic theorems. Acta Sci Math (Szeged) 81:251–283
Lorch ER (1939) Means of iterated transformations in reflexive vector spaces. Bull Am Math Soc 45:945–947
Lotz H (1968) Über das Spektrum positiver Operatoren. Math Z 108:15–32
Lotz H (1981) Uniform ergodic theorems for Markov operators on C(X). Math Z 178:145–156
Lotz H (1985) Uniform convergence of operators on $L^\infty$ and similar spaces. Math Z 190:207–220
Lubich C, Nevanlinna O (1991) On resolvent conditions and stability estimates. BIT 31:293–313
Luecke G (1977) Norm convergence of $T^n$. Canadian J Math 29:1340–1344
Lyubich Y (1999) Spectral localization, power boundedness and invariant subspaces under Ritt’s type condition. Stud Math 134:153–167
Lyubich Y (2001) The single-point spectrum operators satisfying Ritt’s resolvent condition. Stud Math 145:135–142
Lyubich Y (2010) The power boundedness and resolvent conditions for functions of the classical Volterra operator. Stud Math 196:41–63
Lyubich YI, Vũ QP (1988) Asymptotic stability of linear differential equations in Banach spaces. Stud Math 88:37–42
Malinen J, Nevanlinna O, Turunen V, Yuan Z (2007) A lower bound for the difference of powers of linear operators. Acta Math Sin (Engl Ser) 23:745–748
Malinen J, Nevanlinna O, Yuan Z (2009) On a Tauberian condition for bounded linear operators. Math Proc R Ir Acad 109:101–108
Mbekhta M, Zemánek J (1993) Sur le théorème ergodique uniforme et le spectre. C R Acad Sci Paris Sér I Math 317(12):1155–1158
McCarthy C (1971) A strong resolvent condition does not imply power-boundedness. Chalmers Inst Technol Univ Gothenburg preprint 15
Michal AD, Wyman M (1941) Characterization of complex couple spaces. Ann Math 42(2):247–250
Millet A (1976) Un théorème ergodique en moyenne. C R Acad Sci Paris Sér A-B 283(16):A1103–A1106
Montes-Rodríguez A, Sánchez-Álvarez J, Zemánek J (2005) Uniform Abel-Kreiss boundedness and the extremal behaviour of the Volterra operator. Proc Lond Math Soc 91(3):761–788
Mugnolo D (2004) A semigroup analogue of the Fonf-Lin-Wojtaszczyk ergodic characterization of reflexive Banach spaces with a basis. Stud Math 164:243–251
Müller V, Tomilov Y (2007) Quasisimilarity of power bounded operators and Blum-Hanson property. J Funct Anal 246:385–399
Müller V, Tomilov Y (2010) Weakly wandering vectors and interpolation theorems for power bounded operators. Indiana Univ Math J 59:1121–1144
Mustafayev HS (2014) The behavior of the orbits of power bounded operators. Oper Matrices 8:975–997
Nagel R (1974) Ergodic and mixing properties of linear operators. Proc R Ir Acad 74:245–261
Nagy B, Zemánek J (1999) A resolvent condition implying power boundedness. Stud Math 134:143–151
Nair R (1993) On polynomials in primes and J. Bourgain’s circle method approach to ergodic theorems, II. Stud Math 105:207–233
Nasri-Roudsari D, Nessel RJ, Zeller R (1995) Resonance principles with applications to mean ergodic theorems and projection operators. Acta Math Hungar 68:269–285
Nevanlinna O (1997) On the growth of the resolvent operators for power bounded operators. In: Linear operators, Banach Center Publications, vol 38. IMPAN, Warsaw, pp 247–264
Nevanlinna O (2001) Resolvent conditions and powers of operators. Stud Math 145:113–134
Ng ACS, Seifert D (2020) Optimal rates of decay in the Katznelson-Tzafriri theorem for operators on Hilbert spaces. J Funct Anal 279(12):108799, 21 p
Oxtoby J (1952) Ergodic sets. Bull Am Math Soc 58:116–136
Pazy A (1972) On the applicability of Lyapunov’s theorem in Hilbert space. SIAM J Math Anal 3:291–294
Petersen K (1983) Another proof of the existence of the ergodic Hilbert transform. Proc Am Math Soc 88:39–43
Petersen K (1996) Ergodic theorems and the basis of science. Synthese 108:171–183
Radjavi H, Tam P-K, Tan K-K (2003) Mean ergodicity for compact operators. Stud Math 158:207–217
Reich S (1973) Iterative solution of linear operator equations in Banach spaces. Atti Accad Naz Lincei Rend Cl Sci Fis Mat Nat 54(8):551–554
Riesz F (1938) Some mean ergodic theorems. J Lond Math Soc 13:274–278
Ritt RK (1953) A condition that $\lim_{n \to \infty} n^{-1} T^n = 0$. Proc Am Math Soc 4:898–899
Robinson EA (1960) Sums of stationary random variables. Proc Am Math Soc 11:77–79
Rokhlin VA (1948) A “general” measure-preserving transformation is not mixing (Russian). Doklady Akad Nauk SSSR (NS) 60:349–351
Rokhlin VA (1961) Exact endomorphisms of Lebesgue spaces (Russian). Izv Akad Nauk SSSR Ser Mat 25:499–530; English translation: Am Math Soc Transl 39(2), (1964), 1–36
Rosenblatt JM (1994) Norm convergence in ergodic theory and the behavior of Fourier transforms. Canadian J Math 46:184–199
Rosenblatt JM, Wierdl M (1995) Pointwise ergodic theorems via harmonic analysis. In: Ergodic theory and its connections with harmonic analysis, London Mathematical Society, Lecture notes series, 205. Cambridge University Press, Cambridge, UK, pp 3–151
Rozendaal J, Veraar M (2018) Sharp growth rates for semigroups using resolvent bounds. J Evol Equ 18:1721–1744
Rudolph DJ (1994) A joinings proof of Bourgain’s return time theorem. Ergodic Theory Dyn Syst 14:197–203
Ryll-Nardzewski C (1975) Topics in ergodic theory. In: Probability winter school, Karpacz 1975, Lecture notes in Mathematics, vol 472. Springer, Berlin, pp 131–156
Satō R (1979) The Hahn-Banach theorem implies Sine’s mean ergodic theorem. Proc Am Math Soc 77:426
Satō R (1980) On the Blum-Hanson theorem in Lp. Math J Okayama Univ 22:27–32
Seifert D (2016) Rates of decay in the classical Katznelson-Tzafriri theorem. J Anal Math 130:329–354
Shields A (1978) On Möbius bounded operators. Acta Sci Math (Szeged) 40:371–374
Sine R (1970) A mean ergodic theorem. Proc Am Math Soc 24:438–439
Sine R (1976) A note on the ergodic properties of homeomorphisms. Proc Am Math Soc 57:169–172
Sine R (1991) Constricted systems. Rocky Mountain J Math 21:1373–1383
Stampfli J (1972) A local spectral theory for operators III. Trans Am Math Soc 168:133–151
Stein EM (1982) The development of square functions in the work of A. Zygmund. Bull Am Math Soc (NS) 7:359–376
Strikwerda J, Wade B (1997) A survey of the Kreiss matrix theorem for power bounded families of matrices and its extensions. In: Linear operators, Banach Center Publications, vol 38. IMPAN, Warsaw, pp 339–360
Sucheston L (1976) Problems. In: Probability in Banach spaces (Oberwolfach 1975), Springer lecture notes in Mathematics, vol 526. Springer, Berlin/New York, pp 285–290
Suciu L (2016) Estimations of the operator resolvent by higher order Cesàro means. Results Math 69:457–475
Suciu L, Zemánek J (2013) Growth conditions and Cesàro means of higher order. Acta Sci Math (Szeged) 79:545–581
Sz-Nagy B, Foiaş C (1967) Analyse harmonique des opérateurs de l’espace de Hilbert, Masson, Paris; Akadémiai Kiadó, Budapest. English translation: Harmonic analysis of operators on Hilbert space, North-Holland, Amsterdam and Akadémiai Kiadó, Budapest, 1970. Second edition (with coauthors Bercovici and Kérchy), Universitext, Springer, New York, 2010
Tempelman A (1974) Ergodic theorems for amplitude modulated random fields. Litovsk Mat Sbor 14:221–229 (in Russian). Engl transl: Lithuan Math Transl 14 (1975), 698–704
ter Elst AFM, Müller V (2017) A van der Corput-type lemma for power bounded operators. Math Z 285:143–158
Tomilov Y (2001) A resolvent approach to stability of operator semigroups. J Operator Theory 46:63–98
Tomilov Y, Zemánek J (2004) A new way of constructing examples in operator ergodic theory. Math Proc Camb Philos Soc 137:209–225
Van Casteren J (1980) A problem of Sz. Nagy. Acta Sci Math (Szeged) 42:189–194
van Neerven J (1996) The asymptotic behaviour of semigroups of linear operators, Operator theory: advances and applications, vol 88. Birkhäuser, Basel
Visser C (1938) On the iteration of linear operations in a Hilbert space. Neder Akad Wetensch 41:487–495
Volný D (2018) Martingale-coboundary representation for stationary random fields. Stoch Dyn 18(2):1850011, 18 pp
von Neumann J (1932) Proof of the quasi-ergodic hypothesis. Proc Natl Acad Sci U S A 18:70–82
Vũ QP (1992a) A short proof of the Y. Katznelson’s and L. Tzafriri’s theorem. Proc Am Math Soc 115:1023–1024
Vũ QP (1992b) Theorems of Katznelson-Tzafriri type for semigroups of operators. J Funct Anal 103:74–84
Weiss G (1989) Weakly $\ell^p$-stable linear operators are power stable. Int J Systems Sci 20:2323–2328
Weyl H (1910) Über die Gibbssche Erscheinung und verwandte Konvergenzphänomene. Rend Circ Mat Palermo 30:377–407; reprinted in: Weyl, Gesammelte Abhandlungen, Band I, Springer-Verlag, Berlin/Heidelberg/New York, 1968, pp 321–353
Weyl H (1916) Über die Gleichverteilung von Zahlen mod. Eins. Math Ann 77:313–352; reprinted in: Weyl, Gesammelte Abhandlungen, Band I, Springer-Verlag, Berlin/Heidelberg/New York, 1968, pp 563–599
Wiener N, Wintner A (1941) Harmonic analysis and ergodic theory. Am J Math 63:415–426
Wierdl M (1988) Pointwise ergodic theorem along the prime numbers. Israel J Math 64:315–336
Wintner A (1945) The linear difference equation of first order for angular variables. Duke Math J 12:445–449
Yoshimoto T (1993) On the speed of convergence in the (C,α) uniform ergodic theorem for quasi-compact operators. J Math Anal Appl 176:413–422
Yoshimoto T (1996) On the convergence rate in the uniform ergodic theorem. J Math Anal Appl 200:149–161
Yoshimoto T (1998) Uniform and strong ergodic theorems in Banach spaces. Ill J Math 42:525–543
Yosida K (1938) Mean ergodic theorem in Banach spaces. Proc Imp Acad Tokyo 14:292–294
Yosida K, Kakutani S (1938) Applications of mean ergodic theorems to the problems of Markoff process. Proc Imp Acad Tokyo 14:333–339
Yosida K, Kakutani S (1941) Operator-theoretical treatment of Markoff process and mean ergodic theorem. Ann Math 42:188–228
Zaharopol R (1986) Mean ergodicity of power-bounded operators in countably order complete Banach lattices. Math Z 192:81–88
Zemánek J (1994) On the Gelfand-Hille theorems. In: Functional analysis and operator theory, Banach Center Publication 30, Polish Academy of Sciences, Institute Mathematics, Warsaw, pp 369–385
Zsidó L (2007) Weak mixing properties of vector sequences. In: The extended field of operator theory, Operator theory advances and application, vol 171. Birkhäuser, Basel, pp 361–388

Books
Dunford N, Schwartz JT (1958) Linear operators, part I, Interscience. Wiley, New York
Eisner T, Farkas B, Haase M, Nagel R (2015) Operator theoretic aspects of ergodic theory, Graduate texts in mathematics, vol 272. Springer, Cham
Emel’yanov E (2007) Non-spectral asymptotic analysis of one-parameter operator semigroups, Operator theory: advances and applications, vol 173. Birkhäuser, Basel
Engel KJ, Nagel R (2000) One-parameter semigroups for linear evolution equations, Graduate texts in mathematics, vol 194. Springer, New York
Foguel SR (1969) The ergodic theory of Markov processes. Van-Nostrand, New York
Goldstein J (1985) Semigroups of linear operators and applications, Oxford mathematical monographs. The Clarendon Press, Oxford University Press, New York. Second ed. Dover Publications, Mineola, NY, 2017
Halmos PR (1960) Lectures on ergodic theory. Chelsea Publishing Co., New York
Hille E, Phillips RS (1957) Functional analysis and semigroups, rev edn., vol 31. American Mathematical Society Colloquium Publications, American Mathematical Society, Providence
Krengel U (1985) Ergodic theorems. de Gruyter, Berlin
Riesz F, Sz-Nagy B (1955) Leçons d’analyse fonctionnelle (French), 3rd edn. Gauthier-Villars, Paris; Akadémiai Kiadó, Budapest. English translation: Functional analysis, Dover Publications, Inc., New York, 1990
Schaefer HH (1974) Banach lattices and positive operators, Die Grundlehren der mathematischen Wissenschaften, Band 215. Springer, New York/Heidelberg
Yosida K (1980) Functional analysis, Sixth edition, Grundlehren der Mathematischen Wissenschaften, vol 123. Springer, Berlin/New York
Dynamical Systems and C*-Algebras

T. Giordano and H.-C. Liao
University of Ottawa, Ottawa, ON, Canada

Article Outline

Glossary
Definition of the Subject
Introduction
Topological Orbit Equivalence
Mean Dimension, Small Boundary Property, and Classification of C*-Algebras
Boundary Actions and C*-Simplicity
C*-Simplicity and Unique Trace Property
References

Glossary

AF algebra An inductive limit of finite-dimensional C*-algebras.
AF relation An inductive limit of finite equivalence relations.
Affable equivalence relation An étale equivalence relation that is orbit equivalent to an AF relation.
Almost finite actions An action that admits disjoint towers such that each tower has a shape that is almost invariant under translations and the union of the towers almost covers the entire space.
Bratteli diagram An infinite graph with a sequence of finite sets $V_n$ of vertices and a sequence of finite sets $E_n$ of edges connecting the vertices of $V_{n-1}$ and $V_n$.
Bratteli–Vershik transformation A homeomorphism of the path space of a Bratteli diagram that can be viewed as a generalized odometer.
C*-algebra Algebra of operators on a Hilbert space that is closed under adjoint and closed in the norm topology.
Conjugacy Two actions $G \curvearrowright X_1$ and $G \curvearrowright X_2$ are conjugate if there is a homeomorphism $F : X_1 \to X_2$ such that $F(gx) = gF(x)$ for every $x \in X_1$ and $g \in G$.
Continuous orbit equivalence Two group actions are continuously orbit equivalent if they are topologically orbit equivalent and the associated orbit cocycles are continuous.
Countable amenable group A countable group that admits finite subsets that are almost invariant under translation.
Dimension group An ordered abelian group that is an inductive limit of simplicial groups. It is a complete invariant for AF algebras.
Étale equivalence relation An equivalence relation on a topological space with a topology such that the map $(x, y) \mapsto y$ is a local homeomorphism. For an action $G \curvearrowright X$ the associated equivalence relation is given by $x \sim y \Leftrightarrow gx = y$ for some $g \in G$.
Free action An action $G \curvearrowright X$ whose stabilizer subgroup $\{g \in G : gx = x\}$ is trivial for every $x \in X$.
G-boundary A topological space $X$ equipped with a minimal $G$-action such that the induced action of $G$ on the space of regular Borel $G$-invariant probability measures on $X$ is proximal.
Isomorphism Two actions $G_1 \curvearrowright X_1$ and $G_2 \curvearrowright X_2$ are isomorphic if there is a homeomorphism $F : X_1 \to X_2$ and a group isomorphism $\alpha : G_1 \to G_2$ such that $F(gx) = \alpha(g)F(x)$ for every $x \in X_1$.
Minimal action An action $G \curvearrowright X$ where every orbit $\{gx : g \in G\}$ is dense in $X$.
Proximal action An action $G \curvearrowright X$ where for every pair $x_1, x_2$ in $X$ there is a net $t_i$ in $G$ such that $\lim_i t_i x_1 = \lim_i t_i x_2$.
Reduced group C*-algebras A C*-algebra constructed from the left regular representation of a group.
Shape The finite set $\{g_1, \ldots, g_n\}$ associated to a tower $\bigcup_{i=1}^{n} g_i V$.
pivotal in classifying such actions. Moreover, the classification of AF (approximately finite) relations, which provides the base for classifying more general actions, relies on the classification of AF C*-algebras. This again parallels the influence von Neumann algebras had on the study of orbit equivalence in the measurable case. In this section, we also describe some of the recent results on continuous orbit equivalence rigidity.

In the section “Mean Dimension, Small Boundary Property, and Classification of C*-Algebras,” we discuss recent developments in the classification of amenable C*-algebras and the notion of “regularity.” The main theme here is to connect regularity at the dynamical level to the C*-algebraic level via the crossed product construction. It turns out that certain natural and purely dynamical questions, such as the existence of Kakutani–Rokhlin type decompositions for actions of more general groups, have a direct impact on the study of the associated C*-algebras. Dynamical ideas such as mean dimension and comparing sets by invariant measures are also crucial in this type of connection. At the same time, the study of regularity of crossed product C*-algebras also led to interesting dynamical results, such as a new characterization of the small boundary property in terms of a certain kind of tower decomposition.

Finally, the section “Boundary Actions and C*-Simplicity” is devoted to some of the recent breakthroughs that apply boundary actions to the study of group C*-algebras. More precisely, we review the notion of Furstenberg boundary, discuss how it can be realized by an operator algebraic construction (the so-called Hamana boundary), and outline how this insight led to solutions to many long-standing questions about group C*-algebras.

orbit equivalent to a $\mathbb{Z}$-action. This conjecture was proved by Ornstein and Weiss (1980). The most general case was proved by Connes, Feldman, and Weiss (1982) by establishing that an amenable nonsingular countable equivalence relation $R$ can be generated by a single transformation, or equivalently, is hyperfinite, that is, $R$ is, up to a null set, a countable increasing union of finite equivalence relations.

For the Borel case, Weiss proved that actions of $\mathbb{Z}^n$ are (orbit equivalent to) hyperfinite Borel equivalence relations, whose classification was obtained by Dougherty, Jackson, and Kechris (1994). It is not yet known if an arbitrary Borel action of a discrete amenable group is orbit equivalent to a $\mathbb{Z}$-action. More generally, hyperfiniteness was proved for Borel actions of any finitely generated group with polynomial growth by Jackson–Kechris–Louveau (2002) and of any countable abelian group by Gao and Jackson (2015). In a recent preprint (Conley et al. 2020), this result was extended to Borel actions of polycyclic groups and to free actions of a large class of solvable groups including the Baumslag–Solitar group BS(1,2) and the lamplighter group.

In this chapter, we present some of the developments in topological orbit equivalence, reviewing in particular the classification up to (topological) orbit equivalence of minimal actions of finitely generated abelian groups on the Cantor set. For connected spaces, using a result of Sierpinski (see Kuratowski (1968), Ch. V, §47, III), any orbit equivalence is also an isomorphism. Therefore, we consider only spaces that are totally disconnected. The strategy in the topological case follows the one used both in the measurable and in the Borel case:
2. Consider the rich and tractable class of AF (approximately finite) equivalence relations on the Cantor set. In the subsection “AF-Equivalence Relations and Bratteli–Vershik Transformations,” we recall their definition and that they can always be realized as tail equivalence on a Bratteli diagram. In the subsection “The Bratteli–Vershik Model,” we give the definition of the Bratteli–Vershik system introduced by R. Herman, I. Putnam, and C. Skau (1992) and state their remarkable theorem: any Cantor minimal system is conjugate to a Bratteli–Vershik system. There exist now several detailed presentations and surveys of this very important result (see Remark 4). In the subsection “The Classification up to Isomorphisms of AF-Equivalence Relations: The Bratteli–Elliott–Krieger Theorem,” we recall the classification up to isomorphism of minimal AF relations on the Cantor set.

   In “Orbit Equivalence of AF-Equivalence Relations”, we classify, up to orbit equivalence, minimal AF-equivalence relations. A key technical result for this step is the absorption theorem for minimal AF-relations. This theorem, which we describe in “The Absorption Theorem”, states that a “small” extension of a minimal AF relation is orbit equivalent to the original AF relation.

3. Prove that equivalence relations arising from minimal actions of finitely generated abelian groups are affable, that is, orbit equivalent to an AF-equivalence relation. We describe these results in “Orbit Equivalence of Minimal Actions of a Finitely Generated Abelian Group”.

In “A Topological Krieger Theorem: The Notion of Strong Orbit Equivalence”, we describe links between Cantor minimal systems and the C*-algebras associated to them. We introduce the notion of strong orbit equivalence and describe a topological Krieger theorem. We mention also Jewett–Krieger-type realization results proved by Ormes (1997).

In “Continuous Orbit Equivalence” and “Continuous Orbit Equivalence and C*-algebra”, we review the notion of continuous orbit equivalence which is in a one-to-one correspondence with a strong notion of isomorphism of the associated C*-algebras. This domain of research is very active and we hope our description, even if it is very short, will show its importance. In “$\mathbb{Z}^d$-odometers” and “Cohomology of $\mathbb{Z}^d$-odometers”, we review the results on $\mathbb{Z}^d$-odometers.

In the measurable case, full groups were introduced in 1963 by H. Dye in his study of orbit equivalence (1959, 1963). Several different full groups can be associated to a topological dynamical system (on the Cantor set). In particular, the topological full groups, introduced in Giordano et al. (1999) and Tomiyama (1996) for $\mathbb{Z}$-actions, have been intensively studied and outstanding results obtained. We will not present them here, as several remarkable surveys have recently been written (de Cornulier 2014; Matui 2017; Katzlinger 2019).

Let $Y$ be a compact metric space and $G$ be an infinite countable group represented on $Y$ as a group of self-homeomorphisms. Recall that the topological dynamical system $(Y, G)$ is minimal if any of the following equivalent conditions are satisfied:

(a) The empty set $\emptyset$ and $Y$ are the only $G$-invariant closed subsets of $Y$.
(b) For every $y \in Y$, its $G$-orbit $\mathrm{Orb}_G\{y\} = \{gy \mid g \in G\}$ is dense in $Y$.
(c) For every nonempty open set $U \subset Y$, there exists a finite subset $\{g_1, g_2, \ldots, g_k\} \subset G$ with $\bigcup_{j=1}^{k} g_j U = Y$.

The following example is a key model of minimal homeomorphism of the Cantor set.

Example 1 Let $a = (a_n)_{n \geq 1}$, with $a_n \in \mathbb{N}$, $a_n \geq 2$, and $X$ be the compact abelian group $\prod_{n \geq 1} \{0, 1, \ldots, a_n - 1\}$ (under the operation of addition mod $a_n$ at the $n$-th coordinate, with carry over to the right). Then the $a$-adic adding machine, given by $\varphi(x) = x + (1, 0, 0, \ldots)$, $x \in X$, is a minimal homeomorphism of the Cantor set.
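The adding machine of Example 1 can be explored on a finite truncation. The sketch below is only an illustration under stated assumptions: it keeps the first few coordinates of a point (so it works in a finite cyclic group rather than in the Cantor set), and the names `odometer_step` and `orbit` are hypothetical helpers, not notation from the text. It shows that the truncated map visits every cylinder of the chosen depth, which is the finite shadow of the density of orbits.

```python
from itertools import product

def odometer_step(x, a):
    """Add (1, 0, 0, ...) to x with carry to the right, truncated to len(a) digits."""
    y = list(x)
    for n, base in enumerate(a):
        y[n] += 1
        if y[n] < base:
            return tuple(y)
        y[n] = 0  # digit overflowed: reset it and carry to the next coordinate
    return tuple(y)  # the carry left the truncation: we are back at (0, ..., 0)

def orbit(x, a):
    """Points visited by iterating odometer_step from x until it returns to x."""
    seen = [x]
    y = odometer_step(x, a)
    while y != x:
        seen.append(y)
        y = odometer_step(y, a)
    return seen

a = (2, 3, 2)                      # first three terms of a hypothetical sequence (a_n)
points = orbit((0, 0, 0), a)
all_cylinders = set(product(*(range(base) for base in a)))
print(len(points), set(points) == all_cylinders)
```

Every truncation depth behaves the same way: the orbit has length $a_1 a_2 \cdots a_n$ and meets each cylinder of depth $n$ exactly once.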
Cantor minimal systems form a very rich class of topological dynamical systems. For example, recall that Ellis (1960) proved, for any countable (infinite), discrete group, the existence of a free action on a compact metric space and therefore on the Cantor set by the following $G$-equivariant version of the Alexander–Urysohn result (see Giordano and de la Harpe (1997) for a proof of this result).

Proposition 1 Let $\alpha$ be a continuous action of a countable group $G$ on a metrizable compact space $Y$.
Then there exists a continuous action $a$ of $G$ on the Cantor set $X$ and a factor map $w : X \to Y$ (i.e., an equivariant continuous surjective map).
If, moreover, $Y$ is an infinite set and $\alpha$ is a minimal action, then $a$ can be chosen minimal.

Étale Equivalence Relations
We first recall the definition and the first properties of étale equivalence relations (for more details see, e.g., Renault (1980); Paterson (1999); Putnam (2010)) and recall the notions of isomorphism and orbit equivalence of étale equivalence relations.

Let $X$ be a compact, metric space and $R$ be a countable equivalence relation on $X$, that is, $R \subset X \times X$ is an equivalence relation so that for all $x \in X$, each equivalence class $R[x] = \{y \in X \mid (x, y) \in R\}$ is at most countable.

Recall that $R$ has a natural (principal) groupoid structure, with unit space equal to the diagonal $\Delta = \{(x, x) \in R \mid x \in X\}$. More specifically, if $(x, y), (y, z) \in R$, then $(x, y)(y, z) = (x, z)$ is the product of this composable pair and the inverse of $(x, y) \in R$ is the pair $(y, x)$. The unit space $\Delta$ of $R$ is by definition $\{(x, y)(x, y)^{-1} \mid (x, y) \in R\} = \{(x, x) \mid x \in X\}$. The range map $r : R \to X$ and the source map $s : R \to X$ are defined by $r(x, y) = y$ and $s(x, y) = x$, respectively, where $(x, y) \in R$, and both maps are surjective.

Assume that $R$ is given a Hausdorff locally compact, second countable (hence metrizable) topology $t$, so that the product of composable pairs (with the topology inherited from the product topology on $R \times R$) is continuous. Also, the inverse map on $R$ is a homeomorphism. With this structure, $(R, t)$ is a locally compact (principal) groupoid, cf. Renault (1980).

Definition 1 (Étale equivalence relation). The locally compact principal groupoid $(R, t)$ on the compact metric space $X$ is an étale equivalence relation if $r : R \to X$ is a local homeomorphism, that is, for all $(x, y) \in R$, there exists an open neighborhood $U_{(x, y)} \in t$ of $(x, y)$ so that $r(U_{(x, y)})$ is open in $X$ and $r : U_{(x, y)} \to r(U_{(x, y)})$ is a homeomorphism. In particular, $r$ is an open map.

Example 2 Let $G$ be a countable discrete group acting freely on a locally compact metric space $X$. Let
\[ R_G = \{(x, gx) \mid x \in X,\ g \in G\} \subset X \times X, \]
that is, the $R_G$-equivalence classes are simply the $G$-orbits. Topologize $R_G$ by transferring the product topology on $X \times G$ to $R_G$ via the bijection $(x, g) \to (x, gx)$. As $G$ is discrete, $R_G$ is an étale equivalence relation. (If $G$ does not act freely, we get a bijection between $R_G$ and a closed subset of $X \times G \times X$ by the map $(x, gx) \to (x, g, gx)$ and we transfer the product topology on $X \times G \times X$ to $R_G$.)

Remark 1
1) This definition of étaleness is equivalent to the various definitions of an étale (or $r$-discrete) locally compact groupoid that can be found in the literature (e.g., Renault (1980), Definition 2.6 and Proposition 2.8 and Paterson (1999), Definitions 2.2.1 and 2.2.3). The existence of an (essentially) unique Haar system consisting of counting measures follows from our definition (see Paterson (1999), Proposition 2.2.5).
2) A countable equivalence relation on the Cantor set may be given nonequivalent étale topologies $t_i$, $i = 1, 2$, contrasting with the situation in the countable (standard) Borel equivalence relation setting, where the Borel structure is uniquely determined by the inclusion of $R$ in $X \times X$.
3) A countable equivalence relation $R$ on a compact metric space $X$ cannot always be endowed
with an étale topology. For example, the equivalence relation on $X = [0, 1]$ given by
\[ R = \Delta \cup \{(0, 1), (1, 0)\} \]
cannot be endowed with an étale topology $t$.
If $t$ were an étale topology on $R$, then there would exist an open neighborhood $U$ of $(0, 1)$ in $R$ such that $r : U \to r(U)$ is a local homeomorphism. As the unit space $\Delta$ is closed, $\{(0, 1), (1, 0)\}$ is open in $R$, and therefore $U \cap \{(0, 1), (1, 0)\}$ is also open and the restriction of $r$ to $U \cap \{(0, 1), (1, 0)\}$ is a local homeomorphism. Thus, either $r|_{\{(0,1)\}}$ or $r|_{\{(0,1),(1,0)\}}$ is open in $[0, 1]$. Contradiction!

Example 3 Let $X$ be a compact metrizable space and $(R, t)$ be a compact étale equivalence relation on $X$. Then (see Giordano et al. (2004), Proposition 3.2),

1) $t$ is the relative topology $t_{rel}$ from $R \subset X \times X$.
2) There exists $M \geq 1$ such that, for all $x \in X$, $\#R[x] \leq M$, where $\#R[x]$ is the cardinality of $R[x]$.

Therefore, if $(R, t)$ is an étale equivalence relation and $R$ contains an infinite equivalence class, then $t$ is not the relative topology from $R \subset X \times X$.

The following proposition generalizes the Feldman–Moore characterization of countable (standard) Borel equivalence relations given in Feldman and Moore (1977, Theorem 1):

Proposition 2 (Giordano et al. (2004), Proposition 2.3). Let $(R, t)$ be an equivalence relation on a zero-dimensional set $X$. Then there exists a countable group $G$ of homeomorphisms of $X$ such that $R = R_G$.

Note that it is not always possible to find a group $G$ acting freely on $X$ and such that $R = R_G$ as shown by Hjorth and Molberg (2006).
There are two natural notions of equivalence between étale equivalence relations.

Definition 2 (Isomorphism and Orbit Equivalence). Let $(X_1, R_1, t_1)$ and $(X_2, R_2, t_2)$ be two étale equivalence relations on compact, metrizable spaces $X_1$ and $X_2$. Then

1) $(X_1, R_1, t_1)$ and $(X_2, R_2, t_2)$ are orbit equivalent if there exists a homeomorphism $h : X_1 \to X_2$, called an orbit equivalence, such that
\[ (x, y) \in R_1 \text{ if and only if } (h(x), h(y)) \in R_2. \]
2) If moreover $h : X_1 \to X_2$ can be chosen such that $h \times h : (R_1, t_1) \to (R_2, t_2)$ is a homeomorphism, $(X_1, R_1, t_1)$ and $(X_2, R_2, t_2)$ are isomorphic.

Proposition 3 Let $(X_1, G_1)$ and $(X_2, G_2)$ be two free actions of countable discrete groups $G_1$ and $G_2$ on compact metrizable spaces $X_1$ and $X_2$. Then

1) $(X_1, G_1)$ is orbit equivalent to $(X_2, G_2)$ if and only if $R_{G_1}$ and $R_{G_2}$ are orbit equivalent.
2) If $(X_1, G_1)$ is isomorphic to $(X_2, G_2)$ (i.e., there exist a homeomorphism $h : X_1 \to X_2$ and a group isomorphism $\alpha : G_1 \to G_2$ such that $h(gx) = \alpha(g)(hx)$, for all $g \in G_1$ and $x \in X_1$), then $R_{G_1}$ and $R_{G_2}$ are isomorphic.

Remark 2 The converse of the implication (2) does not hold as shown by H. Dahl (2008), where she constructs free minimal actions of non-isomorphic locally finite groups which give isomorphic equivalence relations.

By a theorem of Sierpinski (Kuratowski (1968), Theorem 6, Ch. V, §47, III), orbit equivalence implies isomorphism in the following case:

Theorem 1 Let $(X_1, G_1)$ and $(X_2, G_2)$ be two free actions of countable discrete groups $G_1$ and $G_2$ on compact metrizable spaces $X_1$ and $X_2$.
If the two actions are orbit equivalent and if $X_1$ is connected, then they are isomorphic.

Let $R \subset X \times X$ be an étale equivalence relation. Then as defined by J. Renault (1980), a Borel measure $m$ on $X$ is $R$-invariant if $m(r(U)) =$
$m(s(U))$, for every bisection $U \subset R$. Recall that $U$ is a bisection of an étale equivalence relation $(R, t)$ if it is open and the restriction of the range and the source map to $U$ are homeomorphisms. The set of all bisections forms a basis for the topology $t$ (see, e.g., Renault (1980)). Let us denote by $M(X, R)$ the compact convex cone of $R$-invariant probability measures on $X$. Using that $R = R_G$ for some countable group $G$, the following proposition follows from a straightforward adaptation of the proof for Cantor minimal systems (see Giordano et al. (1995), p. 80). A direct and detailed proof for étale equivalence relations can be found in Putnam (2010, Theorem 2.8).

Proposition 4 Let $(X_1, R_1)$ and $(X_2, R_2)$ be two orbit equivalent étale equivalence relations on compact, metrizable spaces $X_1$ and $X_2$. Then there exists a homeomorphism $h : X_1 \to X_2$ which implements a bijection between $M(X_1, R_1)$ and $M(X_2, R_2)$.

Invariants of Étale Equivalence Relations
To an étale equivalence relation $(X, R)$ on a totally disconnected, compact, Hausdorff space, we associate two preordered abelian groups $D(X, R)$ and $D_m(X, R)$ which are invariants of, respectively, isomorphism and orbit equivalence.

By a preordered group we mean a countable abelian group $G$ with a subsemigroup $G^+$ containing 0, called the positive cone, such that
\[ \text{(i) } G^+ + G^+ \subset G^+, \qquad \text{(ii) } G = G^+ - G^+. \]
If moreover $G^+ \cap (-G^+) = \{0\}$, then $G$ is an ordered group.
An order unit for $(G, G^+)$ is an element $u \in G^+$ such that for all $g \in G$, there is $n \in \mathbb{N}$ with $nu - g \in G^+$.
Recall that $M(X, R)$ denotes the convex set of all $R$-invariant probability measures on $X$.

Definition 3 Let $(X, R)$ be an étale equivalence relation on a totally disconnected compact metrizable space $X$, and $C(X, \mathbb{Z})$ be the countable abelian group of continuous functions from $X$ to $\mathbb{Z}$.

(1) $B(X, R)$ is the subgroup of $C(X, \mathbb{Z})$ generated by all functions $\chi_{r(U)} - \chi_{s(U)}$, for a compact bisection $U \subset R$.
(2) $B_m(X, R) = \{ f \in C(X, \mathbb{Z}) \mid \int f \, dm = 0,\ \forall m \in M(X, R)\}$.

Remark 3
(a) If $M(X, R) = \emptyset$, then $B_m(X, R) = C(X, \mathbb{Z})$.
(b) If $m \in M(X, R)$, then by definition $m(r(U)) = m(s(U))$ for any compact bisection $U \subset R$. Hence, $B(X, R)$ is a subgroup of $B_m(X, R)$.

Definition 4 Let $(X, R)$ be as above. We define the two preordered abelian groups:

(1) $D(X, R) = C(X, \mathbb{Z})/B(X, R)$, where
\[ D(X, R)^+ = \{[f] \mid \exists\, g \in C(X, \mathbb{Z}),\ g \geq 0 \text{ such that } f - g \in B(X, R)\}. \]
(2) $D_m(X, R) = C(X, \mathbb{Z})/B_m(X, R)$, where
\[ D_m(X, R)^+ = \Big\{[f]_m \;\Big|\; \exists\, g \in C(X, \mathbb{Z}),\ g \geq 0 \text{ such that } \int (f - g) \, dm = 0,\ \forall m \in M(X, R)\Big\}. \]

If $(X, R)$ is an étale equivalence relation on a totally disconnected compact metrizable space $X$, then by Remark 3 (b), there is a canonical positive homomorphism $\pi_m$ from $D(X, R)$ onto $D_m(X, R)$.
By integration, any $m \in M(X, R)$ defines a state $I_m$ on $D(X, R)$, that is, a positive homomorphism from $D(X, R)$ to $\mathbb{R}$ such that $I_m([1]) = 1$, and as any state on $D(X, R)$ is of this form, we have

Proposition 5 Let $X$ be a totally disconnected, compact metrizable space and $R$ be an étale equivalence relation on $X$.
Then there is a bijective correspondence between the set $M(X, R)$ of $R$-invariant probability measures on $X$ and the set $S(D(X, R), [1_X])$ of states on $(D(X, R), D(X, R)^+, [1_X])$.

By Remark 3 (a), if $M(X, R) = \emptyset$, then $D_m(X,$
(X, R)) of $D(X, R)$ is $\{[f] \in D(X, R) \mid t([f]) = 0 \text{ for all } t \in S(D(X, R), D(X, R)^+, [1_X])\}$, we get

Putnam (1989) in his study of the C*-algebra associated to $(X, \varphi)$ showed that $D(X, R_\varphi)$ is a dimension group (see below for the precise definition), and Poon (1989) showed that $D(X, R_\varphi)$ is an ordered group if $\varphi$ is topologically transitive. In a recent prepublication, D. Handelman and M. Boyle show that if $\varphi$ is a homeomorphism of a compact metrizable zero-dimensional space $X$, then $\varphi$ is chain recurrent if and only if $D(X, R_\varphi)$ is an unperforated abelian group.
2. For $N \geq 2$, there are examples of minimal actions $\varphi$ of $\mathbb{Z}^N$ on the Cantor set, such that $D(X, R_\varphi)$ has torsion (see Gähler et al. (2013) or Matui (2008b)). Hence, it is not a torsion free group.

Theorem 2 (Giordano et al. (2004), Theorem 3.8). Let $G$ be a countable group acting minimally and freely on the Cantor set $X$. Then $R_G$ is AF if and only if $G$ is locally finite.

AF-equivalence relations satisfy the following stability properties:

Proposition 8 (Giordano et al. (2004), Proposition 3.12).

1) An inductive limit of a sequence of AF-equivalence relations on $X$ is AF.
2) Let $(R, t)$ be an AF-equivalence relation and $R' \subset R$ be an open subequivalence relation. Then $(R', t|_{R'})$ is AF.

Let us now describe the fundamental example of an AF-equivalence relation. We begin with a Bratteli diagram (see Herman et al. 1992; Effros 1981). It is a locally finite, infinite directed graph which consists of a vertex set $V = \bigcup_{n=0}^{\infty} V_n$ and an edge set $E = \bigcup_{n=1}^{\infty} E_n$, where the $V_n$'s and the $E_n$'s are finite disjoint sets.

The edges in $E_n$ connect vertices in $V_{n-1}$ with vertices in $V_n$. If $e$ connects $v \in V_{n-1}$ with $u \in V_n$, we write $s(e) = v$ and $r(e) = u$, where $s : E_n \to V_{n-1}$ and $r : E_n \to V_n$ are the source and range maps, respectively. We will assume that $s^{-1}(v) \neq \emptyset$ for all $v \in V$ and that $r^{-1}(v) \neq \emptyset$ for all $v \in V \setminus V_0$.

Given a Bratteli diagram $(V, E)$ and a sequence $0 = m_0 < m_1 < m_2 < \cdots$ in $\mathbb{Z}^+$, the telescoping of $(V, E)$ to $\{m_n\}$ is the Bratteli diagram $(V', E')$, where $V'_n = V_{m_n}$ and $E'_n = E_{m_{n-1}+1} \circ \cdots \circ E_{m_n}$, and the source and the range maps are as above.

To a Bratteli diagram $(V, E)$, we associate its infinite path space $X_{(V,E)}$ given by
\[ X_{(V,E)} = \{(e_1, e_2, \ldots) \mid e_i \in E_i,\ r(e_i) = s(e_{i+1})\ \forall i \geq 1\}. \]
Clearly $X_{(V,E)} \subset \prod_{n=1}^{\infty} E_n$, and endowed with the relative topology, $X_{(V,E)}$ is a compact, metrizable, totally disconnected space.

To a finite path $p = (e_1, e_2, \ldots, e_n)$ starting at $v_0 \in V_0$ is associated the cylinder set $U(p) = \{(f_1, f_2, \ldots) \in X_{(V,E)} \mid f_i = e_i,\ i = 1, 2, \ldots, n\}$. The cylinder sets are clopen sets and form a basis for the topology of $X_{(V,E)}$. If $(V, E)$ is simple (see below for the definition), then $X_{(V,E)}$ has no isolated points, and so $X_{(V,E)}$ is a Cantor set. (We will in the sequel disregard the trivial case where the path space $X_{(V,E)}$ is a finite set.)

For each $N \geq 0$, let
\[ R_N = \{(e, f) \in X_{(V,E)} \times X_{(V,E)} \mid e_n = f_n \text{ for all } n > N\}. \]
Endowed with the relative topology of $X_{(V,E)} \times X_{(V,E)}$, $R_N$ is a compact étale equivalence relation (hence each equivalence class is finite), and $R_N$ is an open subset of $R_{N+1}$, for all $N \geq 0$.

Let $R = \bigcup_{N \geq 0} R_N$, and give $R$ the inductive limit topology. This means that a sequence $\{(x_n, y_n)\}$ in $R$ converges to $(x, y)$ in $R$ if and only if $\{x_n\}$ converges to $x$, $\{y_n\}$ converges to $y$ (in $X$), and, for some $N$, $(x_n, y_n)$ is in $R_N$ for all but finitely many $n$. Then $R$ is an étale equivalence relation which we denote by $AF(V, E)$.

If $(V', E')$ is a Bratteli diagram obtained from $(V, E)$ by telescoping, then the étale equivalence relations $AF(V, E)$ and $AF(V', E')$ are naturally isomorphic.

The Bratteli diagram $(V, E)$ is simple if there exists a telescoping of $(V, E)$ such that the resulting Bratteli diagram $(V', E')$ has full connection between all consecutive levels. Then $(V, E)$ is simple if and only if every $AF(V, E)$-equivalence class is dense in $X_{(V,E)}$.

The above example is in fact the general case. Indeed, we have

Theorem 3 (Giordano et al. (2004), Theorem 3.9). Let $R$ be an AF-equivalence relation on a totally disconnected compact metrizable space $X$.
Then, there exists a Bratteli diagram $(V, E)$ such that $R$ is isomorphic to the AF-equivalence relation $AF(V, E)$ on $X_{(V,E)}$.
Moreover, $(X, R)$ is minimal if and only if the Bratteli diagram $(V, E)$ is simple.

Recall that the dimension group $G(V, E)$ associated to a Bratteli diagram $(V, E)$ is the inductive limit of the sequence of the simplicial groups $(\mathbb{Z}^{|V_n|}, (\mathbb{Z}^+)^{|V_n|})$. Dimension groups were introduced by G. A. Elliott (1976) in his classification of AF C*-algebras to denote any ordered group isomorphic to an inductive limit of a sequence of simplicial groups. The following result of Effros, Handelman, and Shen (1980) provides an abstract characterization of dimension groups.

Theorem 4 A countable, ordered abelian group $(G, G^+)$ is a dimension group if and only if
1) It is unperforated, i.e., for every $g \in G$ and every positive integer $n$, if $ng \in G^+$, then $g \in G^+$, and
2) It has the Riesz interpolation property, that is, for any $a_1, a_2, b_1, b_2 \in G$ with $a_1, a_2 \leq b_1, b_2$, there exists $c \in G$ with $a_i \leq c \leq b_j$, for $i, j = 1, 2$.

Note that a dimension group being unperforated is torsion free. Recall that an order ideal in an ordered group $(G, G^+)$ is a subgroup $I$ such that $I = I^+ - I^+$, where $I^+ = I \cap G^+$, and if $0 \leq a \leq b$, with $a \in G$ and $b \in I$, then $a \in I$, and that an ordered group $G$ is simple if its only order ideals are $\{0\}$ and $G$.
We then have (see Herman et al. 1992):

Theorem 5 For a Bratteli diagram $(V, E)$ and the associated AF-equivalence relation $AF(V, E)$ on $X_{(V,E)}$, the group $D(X_{(V,E)}, AF(V, E))$ is the dimension group $G(V, E)$ associated to $(V, E)$. It is simple if and only if $AF(V, E)$ is minimal.

Definition 6 Let $X_{(V,E)}$ be the path space of a properly ordered Bratteli diagram. Then the Bratteli–Vershik transformation $\varphi_{(V,E)}$ on $X_{(V,E)}$ is defined by: if $x = (e_1, e_2, \ldots) \in X_{(V,E)}$ and $x \neq x_{max}$, then $\varphi_{(V,E)}(x)$ is the successor of $x$ in the lexicographic ordering and $\varphi_{(V,E)}(x_{max}) = x_{min}$.

Then they proved the following remarkable result:

Theorem 6 (Herman et al. 1992). Let $(X, \varphi)$ be a Cantor minimal system. Then there exists a properly ordered Bratteli diagram $(V, E, \geq)$ such that the associated Bratteli–Vershik system $(X_{(V,E)}, \varphi_{(V,E)})$ is conjugate to $(X, \varphi)$.

Remark 4
a) Vershik (1981, 1982) used sequences of refining measurable partitions of a measurable space to realize ergodic automorphisms of a measured space.
b) There are several detailed presentations and surveys of the results of Herman et al. (1992), see, for example, Skau (2000); Durand (2010); Putnam (2010); Bezuglyi and Karpel (2016); Putnam (2018) and the monograph (Durand and Perrin 2022).
c) The case of aperiodic Cantor systems was studied in Bezuglyi et al. (2005) and Medynets (2006).

Theorem 8 (Herman et al. (1992), Cor. 6.3). Let $(X, \varphi)$ be a Cantor minimal system. Then $(D(X, \varphi), D(X, \varphi)^+, [1_X])$ is a unital, simple, acyclic (i.e., not order isomorphic to $\mathbb{Z}$) dimension group. Furthermore, if $(G, G^+)$ is any simple, acyclic dimension group with distinguished order unit $u$, there exists a Cantor minimal system $(X, \varphi)$ so that $(G, G^+, u) \cong (D(X, \varphi), D(X, \varphi)^+, [1_X])$.

2. $R$-thin if $m(Y) = 0$ for all $R$-invariant probability measures $m$ on $X$.
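The successor rule behind the Bratteli–Vershik transformation of Definition 6 can be tried out on finite paths. The sketch below is an illustration only, under several assumptions that are not in the text: edges are encoded as triples `(source, range, rank)`, where `rank` orders the edges sharing a range vertex; the diagram is a hypothetical stationary one with two vertices per level; and the helper names (`incoming`, `min_path_to`, `successor`, `paths_ending_at`) are invented for the example. The successor of a non-maximal path is computed in the usual way: find the first edge that is not maximal among the edges with the same range, replace it by the next edge in the order, and precede it by the minimal path to the source of that new edge.

```python
def incoming(edges, v):
    """Edges of one level with range v, listed in increasing rank."""
    return sorted((e for e in edges if e[1] == v), key=lambda e: e[2])

def min_path_to(diagram, k, v):
    """Minimal path (e_1, ..., e_k) from level 0 to vertex v of level k."""
    path = []
    for level in range(k - 1, -1, -1):
        e = incoming(diagram[level], v)[0]  # smallest edge arriving at v
        path.append(e)
        v = e[0]                            # continue from its source
    return path[::-1]

def successor(diagram, path):
    """Next finite path with the same final vertex, or None if path is maximal."""
    for k, e in enumerate(path):
        inc = incoming(diagram[k], e[1])
        i = inc.index(e)
        if i + 1 < len(inc):                # e is not maximal: step to the next edge
            f = inc[i + 1]
            return min_path_to(diagram, k, f[0]) + [f] + list(path[k + 1:])
    return None

def paths_ending_at(diagram, v):
    """All paths of length len(diagram) ending at v, in Vershik order."""
    p = min_path_to(diagram, len(diagram), v)
    out = []
    while p is not None:
        out.append(tuple(p))
        p = successor(diagram, p)
    return out

# Hypothetical diagram: V_0 = {0}; afterwards vertex 0 receives edges from 0 and 1,
# and vertex 1 receives a single edge from 0.
first = [(0, 0, 0), (0, 1, 0)]
later = [(0, 0, 0), (1, 0, 1), (0, 1, 0)]
diagram = [first, later, later, later]

to_0 = paths_ending_at(diagram, 0)
to_1 = paths_ending_at(diagram, 1)
print(len(to_0), len(to_1))
```

Iterating the successor from the minimal path ending at a vertex enumerates every finite path ending there exactly once, so the two counts agree with the entries of the product of the incidence matrices of the four levels.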
$(h \times h)^{-1}$, $\tilde{R}$ becomes an AF-equivalence relation containing $R$ and whose unital dimension group is order isomorphic to $D_m(X_E, R)$.

As a consequence of Theorem 12 and Theorem 9, we obtain the classification up to orbit equivalence of AF-equivalence relations.

Corollary 1 (Putnam (2010), Cor. 3.2). For a minimal AF-equivalence relation $(X, R)$ on the Cantor set $X$, its unital dimension group
\[ (D_m(X, R), D_m(X, R)^+, [1_X]) \]
is a complete invariant of orbit equivalence.

Remark 7
1) The classification of AF-equivalence relations was originally given in Giordano et al. (1995), as a consequence of the classification of minimal Cantor systems. The proof in Giordano et al. (1995) relies on nontrivial results in homological algebra. The new proof given by Ian Putnam is certainly the “right” one, as it avoids completely the use of homological algebra and is very dynamical in nature.
2) In Putnam (2010), the following generalized version of Theorem 12 is in fact proved:
Let $(X, R)$ be a minimal AF-equivalence relation and $H$ be a subgroup of $\mathrm{Inf}(D(X, R))$ such that $D(X, R)/H$ is torsion free.
Then there exists a minimal AF-equivalence relation $\tilde{R}$ on $X$, containing $R$ and orbit equivalent to $R$ and such that
\[ D(X, \tilde{R}) \cong D(X, R)/H. \]

Orbit Equivalence of Minimal Actions of a Finitely Generated Abelian Group
The complete classification of minimal AF-equivalence relations on the Cantor set up to orbit equivalence was given in Corollary 1. In this section, we extend this complete classification to minimal actions of finitely generated abelian groups on the Cantor set by showing that these actions are affable, that is, orbit equivalent to an AF-equivalence relation. The key tool for this step is the absorption theorem for minimal AF relations (Theorem 10).

We first describe the case of minimal $\mathbb{Z}$-actions studied in Giordano et al. (1995). Notice however that the proof of the classification up to orbit equivalence of minimal $\mathbb{Z}$-actions given in Giordano et al. (1995) did not use the absorption theorem (see Remark 7).

We then discuss the affability of minimal actions of $\mathbb{Z}^d$ on the Cantor set. The case $d = 2$ was proved in Giordano et al. (2008) and the general case in Giordano et al. (2010). This proof of the affability is extended to minimal actions of finitely generated abelian groups by using a result that Ø. Johansen proved in his thesis (Johansen 1998).

By Theorem 14, any minimal $\mathbb{Z}$-action on the Cantor set is affable. Therefore, we get

Theorem 13 (Giordano et al. (1995), Theorem 2.2). For $i = 1, 2$, let $(X_i, R_i)$ be two minimal equivalence relations on the Cantor set, which are either AF or induced by a Cantor minimal system. Then the following statements are equivalent:

(1) $(X_1, R_1)$ and $(X_2, R_2)$ are orbit equivalent.
(2) The dimension groups $D_m(X_1, R_1)$ and $D_m(X_2, R_2)$ are order isomorphic by a map preserving the distinguished order units.
(3) There exists a homeomorphism $F : X_1 \to X_2$ carrying $M(X_1, R_1)$ onto $M(X_2, R_2)$.

The equivalence between (1) and (2) is given by Corollary 1, and the implication from (3) to (2) follows directly from the definition of $B_m(X_i, R_i)$ (Definition 3). By Proposition 4, (1) implies (3).
By Proposition 1.16 and Corollary 4.2 of Effros (1981), we then get

Corollary 2 (Giordano et al. (1995), Cor. 2.1). Let $(X_1, \varphi_1)$ and $(X_2, \varphi_2)$ be two uniquely ergodic Cantor minimal systems. Then they are orbit equivalent if and only if $\{m_1(E) \mid E \subset X_1, \text{ clopen}\} = \{m_2(E) \mid E \subset X_2, \text{ clopen}\}$, where $m_i$ is the unique $\varphi_i$-invariant measure on $X_i$ for $i = 1, 2$.
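As an illustration of Corollary 2, consider the $a$-adic adding machines of Example 1. Their unique invariant measure is Haar measure, every clopen set is a finite union of cylinders of some common depth $n$, and each such cylinder has measure $1/(a_1 \cdots a_n)$. The sketch below computes the values taken on clopen sets up to a fixed depth; it is a finite check that only suggests the infinite statement, and the function name `clopen_values` is a hypothetical helper.

```python
from fractions import Fraction
from math import prod

def clopen_values(a_prefix):
    """Measures of unions of cylinders of depth len(a_prefix) for the a-adic odometer."""
    q = prod(a_prefix)                       # number of cylinders of that depth
    return {Fraction(k, q) for k in range(q + 1)}

# Depth-4 cylinders of the 2-adic odometer and depth-2 cylinders of the 4-adic one
# give the same values, namely the multiples of 1/16 in [0, 1].
same = clopen_values((2, 2, 2, 2)) == clopen_values((4, 4))

# 1/3 is the measure of a clopen set for the 3-adic odometer, but it is never of the
# form k / 2**n, so it is not attained for the 2-adic odometer at any depth.
third_in_3adic = Fraction(1, 3) in clopen_values((3,))
third_in_2adic = Fraction(1, 3) in clopen_values((2,) * 12)
print(same, third_in_3adic, third_in_2adic)
```

Under these assumptions the value sets of the 2-adic and 4-adic odometers are both the dyadic rationals in $[0, 1]$, whereas the 2-adic and 3-adic value sets differ, so Corollary 2 separates the latter pair up to orbit equivalence.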
Recall (see Giordano et al. (1995), p. 63 and Putnam et al. (1986) for a survey) that a Denjoy homeomorphism is an aperiodic homeomorphism of the circle S¹, which is not conjugate to a rigid rotation. By a Denjoy system, we mean a Denjoy homeomorphism restricted to its unique invariant Cantor set. A Denjoy system is uniquely ergodic.
Let μ be the invariant probability measure of a uniquely ergodic Cantor minimal system (X, φ), and G be the subgroup {μ(E); E ⊂ X, clopen} + ℤ of ℝ. By Corollary 2,

Ash (2016) studied speedups of Cantor minimal systems and showed that if (Xi, φi) are two minimal Cantor systems, (X2, φ2) is a speedup of (X1, φ1) if and only if there is a unital surjective homomorphism F from Dm(X2, φ2) onto Dm(X1, φ1) such that

This work was continued by Alvin, Ash, and Ormes (2018), with the additional assumption that the speedup function is bounded. They showed that a minimal bounded speedup of an odometer is conjugate to an odometer.
We now discuss the general case of actions of finitely generated abelian groups. Let φ be a free minimal action of ℤ^d on the Cantor set X and let Rφ be the associated étale equivalence relation. The affability of Rφ is proved by constructing sub-equivalence relations of Rφ:

Theorem 15 Any minimal action of a finitely generated abelian group on the Cantor set is affable.

As a consequence of Theorems 13 and 15, we then have
(Dm(X, R), Dm(X, R)+, [1X]) ≅ (Dm(X′, R′), Dm(X′, R′)+, [1X′]).

Then as a corollary of Theorem 8, we have

Corollary 4 For any simple dimension group G with a trivial infinitesimal subgroup, there exists a Cantor minimal system (X, φ) such that Dm(X, φ) = G.

Remark 9 Let d ≥ 2. By Theorem 16 and Corollary 4, for any free minimal action of ℤ^d, Dm(X, R_{ℤ^d}) is a simple, acyclic dimension group with no nontrivial infinitesimal elements, but an exact description of the range of the invariant Dm is not yet known.

However, using results of M.-I. Cortez and S. Petite (2014), S.-M. Høynes (2016) showed that it contains all unital simple dimension groups (G, u), with no nontrivial infinitesimal elements, and a noncyclic rational subgroup ℚ(G, u).

Let (X, φ) be a Cantor minimal system and C*(X, φ) be the associated C*-crossed product C(X) ⋊φ ℤ. Recall (see, e.g., Giordano et al. (1995), p. 60 for the precise references) that

1. C*(X, φ) is a simple, unital, AT-algebra of real rank zero and stable rank one, hence is classified by its Elliott invariant;
2. If Ki(C*(X, φ)) denotes, for i = 0, 1, the K-groups of the crossed product, then K0(C*(X, φ)) is a simple dimension group order isomorphic to D(X, φ), and K1(C*(X, φ)) = ℤ (see [?]).

Hence, we have

Theorem 18 A complete isomorphism invariant for the family of C*(X, φ), where (X, φ) is a Cantor minimal system, is the simple dimension group (D(X, φ), D(X, φ)+) with distinguished order unit [1X].
Remark 10
(1) Boyle and Handelman (1994) showed that all possible (topological) entropies can be realized in the class of Cantor minimal systems (X, φ) such that (D(X, φ), [1]) is order isomorphic to (ℤ[1/2], 1). Therefore,

A Topological Krieger Theorem: The Notion of Strong Orbit Equivalence
In this subsection, (X, φ) will always denote a Cantor minimal system. The motivation of Giordano et al. (1995) was to obtain a topological version of the famous Krieger theorem.
Let (X1, φ1) and (X2, φ2) be two orbit equivalent Cantor minimal systems and F : X1 → X2 be an orbit equivalence. Then F defines two maps m, n : X1 → ℤ given, for x ∈ X1, by

F(φ1(x)) = φ2^{n(x)}(F(x)) and F(φ1^{m(x)}(x)) = φ2(F(x)).

We will call n and m the orbit cocycles associated to F.

Definition 8 Two Cantor minimal systems (X1, φ1) and (X2, φ2) are strongly orbit equivalent (SOE) if there exists an orbit equivalence F : X1 → X2 such that each of the associated orbit cocycles m, n : X1 → ℤ has at most one point of discontinuity.

a) The equivalence between the statements (2) and (3) follows from Theorem 18
b) If (X, φ1) and (X, φ2) are strongly orbit equivalent, then the coboundary subgroups B(X, φ1) and B(X, φ2) are equal and (2) follows

Remark 11 Recall that the famous Jewett–Krieger theorem (see, e.g., Glasner (2003)) states that any ergodic transformation of a nonatomic Lebesgue space has a uniquely ergodic minimal Cantor model.
Ormes (1997) presents Jewett–Krieger-type realization results of an ergodic dynamical system by a Cantor minimal system in a prescribed orbit equivalence class. More precisely, he proves in (Ormes 1997, Theorem 7.2):

Theorem 20 Let (Y, ν, T) be an ergodic probability measure-preserving dynamical system, (X, φ) a Cantor minimal system, and μ ∈ M(X, φ) an ergodic measure.
Then there is a Cantor minimal system (X, ψ), orbit equivalent to (X, φ) such that (Y, ν, T) is measurably conjugate to (X, ψ, μ).

Moreover in Ormes (1997, Theorem 27), he gives necessary and sufficient spectral conditions for a Jewett–Krieger-type realization of an ergodic dynamical system by a Cantor minimal system in a prescribed strong orbit equivalence class.

(1) F(g1x1) = α1(x1, g1)F(x1), x1 ∈ X1, g1 ∈ G1
(2) F^{-1}(g2x2) = α2(x2, g2)F^{-1}(x2), x2 ∈ X2, g2 ∈ G2

Remark 12 If for i = 1 and 2, (Xi, Gi) is topologically free, then α1 and α2 are uniquely determined by the conditions (1) and (2) of Definition 9 and are continuous one-cocycles (see, e.g., Li (2018a)).

For two topological dynamical systems (Xi, Gi), let us recall that they are isomorphic if there is a homeomorphism F : X1 → X2 and a
group isomorphism γ : G1 → G2 such that for all g ∈ G1, x ∈ X1,

F(gx) = γ(g)F(x).

The notion of continuous orbit equivalence was initially studied by M. Boyle in his thesis (Boyle 1983), where he proved:

Proposition 9 For i = 1, 2, let φi be a topologically transitive homeomorphism of a compact metrizable space Xi. If φ1 and φ2 are continuously orbit equivalent, then φ1 and φ2 are flip conjugate, that is, φ1 is conjugate to φ2 or to φ2^{-1}; equivalently, the ℤ-dynamical systems (X1, φ1) and (X2, φ2) are isomorphic.

This proposition is the topological analogue of Belinskaya’s result in the ergodic probability measure-preserving case: the integrability of one of the associated orbit cocycles implies flip conjugacy (in the measurable category).
Boyle and Tomiyama (1998) generalized this result as follows:

Theorem 21 For i = 1, 2, let φi be a topologically free homeomorphism of a compact Hausdorff space Xi. If φ1 and φ2 are continuously orbit equivalent, then there exist open partitions X1 = A1 ⊔ A2 and X2 = B1 ⊔ B2 such that φ1|A1 is conjugate to φ2|B1 and φ1|A2 is conjugate to φ2^{-1}|B2.

Remark 13
1) In measurable dynamics, orbit equivalence rigidity is an important domain of research and several impressive results have been proved (see, e.g., Furman (1999); Ioana (2011); Monod and Shalom (2006); Popa (2007)). In topological dynamics, M. Boyle’s result was the first proved positive rigidity result. Li (2018a) obtains both positive and negative rigidity results.
2) In Li (2018a), the author defines a topologically free dynamical system (X, G) to be continuous orbit equivalence rigid if any topologically free dynamical system continuous orbit equivalent to (X, G) is isomorphic to it. Then he gives several examples of topologically free dynamical systems which are continuous orbit equivalence rigid.
3) Medynets, Sauer and Thom, in their study of Cantor systems and quasi-isometry of groups (Medynets et al. 2017), prove that two finitely generated non-amenable groups are quasi-isometric if and only if they admit continuous orbit equivalent Cantor minimal actions. In particular, free groups of different rank admit continuous orbit equivalent Cantor minimal actions, unlike in the measurable setting.

Li (2018b), by studying a weaker form of continuous orbit equivalence, extends Medynets, Sauer, and Thom’s result and proves in particular that two finitely generated groups are quasi-isometric if and only if there exist Kakutani equivalent topologically free systems of these groups on compact spaces.

Continuous Orbit Equivalence and C*-algebra
Let (X, G) be a topological dynamical system and X ⋊ G be the associated topological transformation groupoid. Then it is an étale groupoid and is a principal groupoid (i.e., X ⋊ G = RG) if the action is free, and X ⋊ G is essentially principal if the action is topologically free. Recall that the reduced C*-algebra C*r(X ⋊ G) of the transformation groupoid X ⋊ G is canonically isomorphic to the C*-crossed product C0(X) ⋊ G (see, e.g., Li (2018a) or Matui (2017)).

Remark 14 If (X, G) is topologically free, then the pair (C0(X) ⋊ G, C0(X)) is a Cartan pair as defined by J. Renault (2008).

Generalizing a classical result of I. Singer (1955) on the interplay between measured dynamical systems and the Murray–von Neumann group measure space construction, X. Li (2018a) proves:

Theorem 22 Let (Xi, Gi) be two topologically free dynamical systems as above. Then the following are equivalent:
(1) (X1, G1) and (X2, G2) are continuously orbit equivalent.
(2) The transformation groupoids X1 ⋊ G1 and X2 ⋊ G2 are isomorphic.
(3) There is a C*-isomorphism F from C0(X1) ⋊ G1 onto C0(X2) ⋊ G2 such that F(C0(X1)) = C0(X2).

Remark 15
1) The equivalence of (2) and (3) is due to J. Renault (2008).
2) J. Tomiyama proved the equivalence of (1) and (3) for topologically free ℤ-actions in Tomiyama (1996).
3) Theorem 22 is extended to more general groupoids in a recent publication of Carlsen, Ruiz, Sims, and Tomforde (2021).

For ℤ-actions, using Theorem 23, M. Boyle and J. Tomiyama (1998) proved the following rigidity result:

Theorem 23 For i = 1, 2, let φi be a topologically free homeomorphism of a compact Hausdorff space Xi. Then the following are equivalent:

If G1 = G2 = G, and if there exists a homeomorphism F : X1 → X2 such that F ∘ φ1(g) = φ2(g) ∘ F for all g ∈ G, we will say that (X1, φ1) and (X2, φ2) are conjugate.
For two topological dynamical systems, the following implications are clear: conjugacy ⇒ isomorphism ⇒ continuous orbit equivalence ⇒ orbit equivalence.
In this section, we characterize (algebraically) these four types of equivalences for ℤ- and ℤ²-odometers. Let us first recall the definition of ℤ^d-odometers.
Let G be a (nontrivial) decreasing sequence (Gn)n≥1 of finite index subgroups of ℤ^d. For n ≥ 1, let qn : ℤ^d/Gn+1 → ℤ^d/Gn be the quotient map and let XG be the corresponding inverse limit. Let ϕG be the profinite action of ℤ^d on XG given by

ϕG(v)(x1, x2, . . .) = (x1 + v, x2 + v, . . .), v ∈ ℤ^d.

Then following Cortez (2006), a ℤ^d-odometer is any ℤ^d-Cantor minimal system conjugate to (XG, ϕG) for some G as above. Note that
ℤ^d to YH, the Pontryagin dual of H/ℤ^d, which is a compact, totally disconnected space. Then let (YH, ψH) be the ℤ^d-dynamical system where ψH denotes the action of ℤ^d on YH given by

ψH^n(x) = x + ρ(n), for n ∈ ℤ^d, x ∈ YH.

Note that

(1) If ℤ^d ⊂ H ⊂ ℚ^d, then (YH, ψH) is free if and only if H is dense in ℚ^d
(2) If ℤ^d ⊂ H1 ⊂ H2 ⊂ ℚ^d, then there is a natural factor map from (YH2, ψH2) to (YH1, ψH1)
(3) If ℤ^d ⊂ H1 ⊂ H2 ⊂ ⋯ ⊂ ℚ^d, then the inverse limit of the ℤ^d-dynamical systems (YHn, ψHn)n≥1 is

(1) If d, d′ ≤ 2, then (YH, ψH) and (YH′, ψH′) are conjugate if and only if d = d′ and H = H′.
(2) If d, d′ ≤ 2, then (YH, ψH) and (YH′, ψH′) are isomorphic if and only if d = d′ and there exists α ∈ GLd(ℤ) such that αH = H′.
(3) If d, d′ ≤ 2, then (YH, ψH) and (YH′, ψH′) are continuously orbit equivalent if and only if d = d′ and there exists α ∈ GLd(ℚ) with det(α) = ±1 such that αH = H′.
(4) The ℤ^d-action (YH, ψH) and the ℤ^d′-action (YH′, ψH′) are orbit equivalent if and only if the superindex [[H : ℤ^d]] of ℤ^d in H is equal to the superindex [[H′ : ℤ^d′]].
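The profinite action that defines an odometer can be sketched at a finite stage for d = 1. The code assumes G_n = q_n ℤ with q_1 | q_2 | ..., so that a point of the inverse limit is a compatible sequence of residues, truncated here to three levels. The helper names and the truncation are assumptions made for illustration only.

```python
def is_compatible(x, moduli):
    """Inverse-limit condition: x[n+1] reduces to x[n] modulo moduli[n]."""
    return all(x[n + 1] % moduli[n] == x[n] for n in range(len(x) - 1))

def act(v, x, moduli):
    """Apply v in Z coordinatewise: (x_1 + v, x_2 + v, ...)."""
    return tuple((xn + v) % q for xn, q in zip(x, moduli))

moduli = (2, 4, 8)       # G_n = 2^n Z, truncated to three levels
x = (1, 3, 3)            # residues of 3 modulo 2, 4 and 8
y = act(1, x, moduli)    # residues of 4 modulo 2, 4 and 8
```

At this truncation the action factors through ℤ/8ℤ, so `act(8, x, moduli)` returns `x`; in the full inverse limit no nonzero v fixes a point, which is the freeness of the odometer.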
Then, the cohomology groups H*(Γ, C(X, ℤ)) are canonically isomorphic to the cohomology groups H*(𝒢φ) associated to the étale groupoid 𝒢φ. The cohomology groups H*(𝒢) of an étale groupoid 𝒢 were introduced by J. Renault (1980) and have been widely studied.

θ(x, m + n) = θ(x, m) + θ(φ(m)(x), n) for all x ∈ X and m, n ∈ ℤ^d.

A cocycle θ is a coboundary if there is h ∈ C(X, ℤ) such that θ(x, m) = h(φ(m)(x)) − h(x) for all x ∈ X and m ∈ ℤ^d.

Let (YH, ψH) be the ℤ^d-odometer associated to a dense subgroup ℤ^d ⊂ H ⊂ ℚ^d. Then using Giordano et al. (2019), Theorem 2.5, and a simple case of Shapiro’s Lemma, we have

Let (X, φ) be a free minimal ℤ^d-Cantor system. Then the following holds:
τ¹μ([θ]) = (τ¹μ(θ)(e1), …, τ¹μ(θ)(ed)).

We then get

Theorem 26 (Giordano et al. (2019), Theorem 4.4). Let 1 ≤ d ≤ 2, let (YH, ψH) be the ℤ^d-odometer associated to a dense subgroup ℤ^d ⊂ H ⊂ ℚ^d, and let μ be its unique invariant probability measure. Then

We use the following notation and terminology throughout this chapter. Let G be a discrete group and X be a compact Hausdorff space. A G-action is a group homomorphism α from G into the homeomorphism group of X. In this case, we also write α : G ↷ X for the action and say that X is a G-space.
Given a G-action α : G ↷ X, there is an induced action of G on the C*-algebra C(X) of continuous functions on X given by
ord(α) ≔ max_{x ∈ X} Σ_{U ∈ α} 1_U(x) − 1,

mdim(([0, 1]^d)^ℤ, σ) = dim [0, 1]^d = d.
2.0.1)). We refer the reader to Giordano et al. (2018, Part I) and Ara et al. (2011) for more on the Cuntz subequivalence relation.

Definition 11 For two positive elements a, b in A+, we say a is Cuntz subequivalent to b, denoted a ≾ b, if there exists a sequence (rn)n in A such that lim_{n→∞} ‖rn b rn* − a‖ = 0. If a ≾ b and b ≾ a, then we write a ∼ b.

Example 9 Let A be the algebra Mn(ℂ) of n × n matrices. For two positive elements a and b in A, we have a ≾ b if and only if rank(a) ≤ rank(b).

Example 10 Let X be a compact Hausdorff space and let f, g ∈ C(X)+. Then f ≾ g if and only if supp(f) ⊆ supp(g), where supp(f) ≔ f^{-1}((0, +∞)) (see, e.g., Ara et al. (2011), p. 6, Proposition 2.5).

Example 11 Let G ↷ X be an action of a countable discrete group G on a Cantor space X, and let A and B be two clopen sets. Suppose there is a clopen partition A = C1 ⊔ C2 ⊔ ⋯ ⊔ Cn and group elements s1, s2, . . ., sn such that the sets s1C1, s2C2, . . ., snCn are pairwise disjoint subsets of B, then the characteristic function χA is Cuntz subequivalent to the characteristic function χB in the crossed product C(X)⋊rG.

To avoid technicalities, from now on we only consider the class of unital stably finite exact C*-algebras. It includes all the crossed products that we are interested in (see, e.g., Blackadar (2006) for relevant C*-algebraic definitions).
Recall that a tracial state on a C*-algebra A is a state (i.e., a positive linear functional with norm one) φ that satisfies φ(ab) = φ(ba) for all a and b in A. Let T(A) denote the set of tracial states on A. By combining a result of Blackadar and Handelman (1982) and a result of Haagerup (2014), we deduce that the set T(A) is nonempty for any C*-algebra in the class we consider.

Example 12 Let G ↷ X be an action of a countable discrete amenable group G on a compact metrizable space X, and let μ be a G-invariant Borel probability measure. Then μ induces a tracial state τμ on the reduced crossed product C(X)⋊rG (see Giordano et al. (2018), Example 10.1.31). In fact, if the action is free, then every tracial state on C(X)⋊rG has this form (see, e.g., Giordano et al. (2018), Proposition 11.1.21).

Given a tracial state τ ∈ T(A), define the rank function dτ : A+ → [0, 1] by

dτ(a) ≔ lim_{n→∞} τ(a^{1/n})

(see Blackadar and Handelman (1982) for more details).

Example 13 If A = C(X) and τ is the tracial state induced by a Borel probability measure μ on X, then dτ(f) = μ(f^{-1}((0, ∞))) for any f in C(X)+.

One can show that if a ≾ b, then dτ(a) ≤ dτ(b) for every τ ∈ T(A) (see Blackadar and Handelman (1982), Theorem II.2.2). The converse is not necessarily true, and the radius of comparison, as introduced by Toms (2006), is designed to measure the failure of this “comparison property.”
In the next definition, we use the notation M∞(A) ≔ ⋃_{n=1}^∞ Mn(A), where each Mn(A) is embedded into Mn+1(A) as the upper-left corner.

Definition 12 Let A be a unital stably finite exact C*-algebra. The radius of comparison rc(A) of A is the infimum of the strictly positive real numbers r such that for any a, b ∈ M∞(A)+, if dτ(a) + r < dτ(b) for all τ ∈ T(A), then a ≾ b.

The case rc(A) = 0, also known as strict comparison, is especially important for the classification of simple nuclear C*-algebras (see section “Classification of Simple Nuclear C*-algebras”).

Example 14 Let X be a finite CW complex. Then by Toms (2006) and Elliott and Niu (2013), rc(C(X)) is essentially half of the covering dimension of X. More precisely, if dim X denotes the covering dimension of X, we have the following by Elliott and Niu (2013, Corollary 1.2):
(dim X)/2 − 2 ≤ rc(C(X)) ≤ (dim X)/2 − 1.

rc(C(X) ⋊r,α ℤ) ≤ (1/2) mdim(X, α).
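The rank function dτ and the matrix case of Cuntz subequivalence (Example 9) can be checked numerically on diagonal matrices. The sketch assumes that τ is the normalized trace on Mn(ℂ) and that a positive diagonal matrix is represented by its list of eigenvalues. The limit in the definition of dτ is approximated with one large value of n, so the output is only approximately rank(a)/n. The function names are illustrative.

```python
def rank_function(eigenvalues, n=10 ** 6):
    """Approximate d_tau(a) = lim tau(a^(1/n)) for a positive diagonal matrix.

    tau is taken to be the normalized trace (an assumption of this sketch).
    """
    size = len(eigenvalues)
    return sum(lam ** (1.0 / n) for lam in eigenvalues) / size

def matrix_rank(eigenvalues):
    """Rank of a positive diagonal matrix: number of nonzero eigenvalues."""
    return sum(1 for lam in eigenvalues if lam > 0)

def cuntz_below(a_eigs, b_eigs):
    """Example 9: a is Cuntz subequivalent to b iff rank(a) <= rank(b)."""
    return matrix_rank(a_eigs) <= matrix_rank(b_eigs)

a = [0.5, 0.0, 0.0]   # rank 1 in M_3(C)
b = [2.0, 0.1, 0.0]   # rank 2 in M_3(C)
d_a = rank_function(a)
d_b = rank_function(b)
```

Here `d_a` is close to 1/3 and `d_b` is close to 2/3, in line with the fact that a ≾ b forces dτ(a) ≤ dτ(b).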
The following classification theorem is due to many hands, including Gong et al. (2020a and 2020b); Kirchberg (1995); Phillips (2000); and Tikuisis et al. (2017) (which in turn depend on a large body of work).

Theorem 30 Unital separable simple nuclear infinite-dimensional C*-algebras that have finite nuclear dimension and satisfy the UCT are classified by the Elliott invariant. (UCT stands for the Universal Coefficient Theorem of Rosenberg and Schochet (1987)).

In Tu (1999), the author proves that if G is an amenable group, then every crossed product C(X)⋊rG satisfies the UCT. It is an open question whether every nuclear C*-algebra satisfies the UCT.

Example 19 Let θ be an irrational number in (0, 1/2) and αθ : 𝕋 → 𝕋 be the rotation by θ on the circle. The crossed product C(𝕋) ⋊αθ,r ℤ is known as the irrational rotation algebra and often denoted by Aθ. It can be shown that Aθ has nuclear dimension one (see Winter and Zacharias (2010), Example 6.1) and belongs to the class of algebras in Theorem 30. Thanks to the work of Pimsner and Voiculescu (1980) and Rieffel (1981), Aθ and Aθ′ are isomorphic if and only if θ = θ′.

Examples and results show that regularity also assumes other forms. One of them is the property of strict comparison (i.e., zero radius of comparison) which already appeared in the section “Mean Dimension and Radius of Comparison” (see Definition 12). Another one is known as 𝒵-stability or Jiang-Su stability, which says that the C*-algebra must remain unchanged (up to isomorphism) after tensoring with the specific C*-algebra 𝒵, called the Jiang-Su algebra (Jiang and Su 1999). One of the important features of 𝒵 is that although it is infinite dimensional as a linear space, it has the same Elliott invariant as the algebra of complex numbers ℂ. Therefore, in some sense the Elliott invariant does not “see” the algebra 𝒵, whence cannot distinguish a C*-algebra A from the tensor product A ⊗ 𝒵.

It is natural to ask how these properties are related to each other. The following conjecture is due to Toms and Winter.

Conjecture 1 For an infinite-dimensional unital separable simple nuclear C*-algebra, the following three conditions are equivalent:

(i) Finite nuclear dimension
(ii) 𝒵-stability
(iii) Strict comparison.

The implication (i) ⇒ (ii) is proved by Winter (2012) and (ii) ⇒ (iii) is due to Rørdam (2004). For the upward implications, many results were obtained based on the groundbreaking work of Matui and Sato (2012, 2014). In another recent breakthrough (Castillejos et al. 2021), the important notion of uniform property Γ was identified and the equivalence between (i) and (ii) is now fully established. Combining all the known implications we arrive at the following theorem:

Theorem 31 Let A be a unital separable simple nuclear infinite-dimensional C*-algebra. The following are equivalent:

(i) A has finite nuclear dimension.
(ii) A is 𝒵-stable.
(iii) A has strict comparison and uniform property Γ.

In particular, the Toms–Winter conjecture holds for any unital separable simple nuclear C*-algebra with uniform property Γ.

At the time of writing, it is an open question whether every unital separable nuclear C*-algebra has uniform property Γ.

Example 20 Let X be an infinite compact metrizable space, and let T : X → X be a minimal homeomorphism. Toms and Winter proved that if X has finite covering dimension, then the reduced crossed product C(X)⋊Tℤ has finite nuclear dimension (and hence is 𝒵-stable) (Toms and Winter 2013). Later it was shown by Elliott and
Niu that C(X)⋊Tℤ is 𝒵-stable whenever (X, T) has mean dimension zero (Elliott and Niu 2017).

Almost Finiteness, Comparison, and the Small Boundary Property
A central problem at the interface of C*-algebras and topological dynamics is to understand when a crossed product C(X)⋊rG is regular, meaning that it satisfies one of the properties appearing in the Toms–Winter conjecture (Conjecture 1). Several dimension theories for topological dynamics, including Rokhlin dimension (e.g., Hirshberg et al. (2015); Szabó (2015); Szabó et al. (2019)), dynamic asymptotic dimension (Guentner et al. 2017), and tower dimension (Kerr 2020), have been introduced and they have direct applications in establishing the finiteness of nuclear dimension. Note that the dynamic asymptotic dimension was originally motivated by the Baum-Connes conjecture. In addition to applications to C*-algebras, they are also interesting dynamical properties in their own right. We are unable to discuss these important developments for reasons of length, and refer interested readers to the notes by Sims et al. (2020, Part III). In this subsection, we focus on the dynamical analogue of 𝒵-stability and strict comparison and discuss their relationship with the small boundary property and the Toms–Winter conjecture.
We start with a dynamical version of strict comparison. The basic idea of comparing sets by their measures appeared in the work of Glasner and Weiss (1995) (and the type of subequivalence relation used there dates back to Hopf (1932)). The following definitions first appeared in talks given by Wilhelm Winter.

Definition 16 Let G be a discrete group and X be a G-space. For two open subsets A and B of X, we say A is subequivalent to B, written as A ≺ B, if for every closed subset C of A there are group elements s1, s2, . . ., sn in G and open sets U1, U2, . . ., Un such that C ⊂ ⋃_{i=1}^n Ui, the sets siUi are pairwise disjoint, and ⊔_{i=1}^n siUi ⊂ B.
It is clear that if A ≺ B, then μ(A) ≤ μ(B) for every G-invariant probability measure μ. Basically, comparison is a property that asserts the converse.

Definition 17 Let G be a discrete group and X be a G-space. We say the action G ↷ X has (dynamical) comparison if, for all open sets A and B, μ(A) < μ(B) for all G-invariant measures μ implies A ≺ B.

Example 21 By Glasner and Weiss (1995, Lemma 2.5), every Cantor minimal system has comparison.
It is worth noting that this result essentially follows from the existence of Kakutani–Rokhlin tower decomposition for Cantor minimal systems (see the proof of Putnam (1989), Lemma 3.1), which gives us a hint to a result below that says almost finiteness implies comparison, since almost finiteness (see Definition 19 below) can be viewed as a generalization of the Kakutani–Rokhlin decomposition.

Example 22 More generally, let G be a countable discrete group of subexponential growth. A result of Downarowicz and Zhang shows that every free action of G on any zero-dimensional compact metrizable space has comparison.

Remark 21 To the authors’ best knowledge, it is an open question whether every free action of a countable discrete amenable group on any compact metrizable space has comparison.
We now turn to almost finiteness, which can be viewed as a topological version of the Ornstein–Weiss tiling theorem and, at the same time, a dynamical analogue of 𝒵-stability:

With the notion of castles introduced in Definition 13, we can now define almost finiteness.

Definition 18 Let G ↷ X be a free action of a countable discrete group G on a compact metric space X. The action G ↷ X is almost finite if for every finite subset K ⊂ G, and every δ > 0, there exist:
(1) A castle {(Vi, Si)}i∈I such that each level is open with diameter at most δ, and each shape
is (K, δ)-invariant (i.e., |gSi Δ Si|/|Si| < δ for all g ∈ K and all i ∈ I)
(2) A set S′i ⊂ Si for each i ∈ I such that |S′i| < δ|Si| and

Remark 22 The term “almost finiteness” was first introduced by Matui. In Matui (2012), Matui defined almost finiteness for étale groupoids whose unit space is compact and totally disconnected and studied the homology groups of these groupoids. It was shown in Kerr (2020) that this definition is equivalent to Definition 18 for transformation groupoids arising from groups acting on compact totally disconnected spaces.

Example 23 The Kakutani-Rokhlin tower decomposition shows that every Cantor minimal system is almost finite. Matui proved in Matui (2012) that the same holds for every free action of ℤ^n on a compact metrizable totally disconnected space.

Example 24 By Kerr and Szabó (2020, Theorem B), the results from the previous example can be extended to actions of ℤ^n on any finite-dimensional compact metrizable spaces.

The next theorem of Kerr relates almost finiteness to comparison and 𝒵-stability of the associated crossed product.

Theorem 32 (Kerr 2020, Theorem 9.2 and Theorem 12.4). Let α : G ↷ X be a free minimal action of a countable discrete amenable group on a compact metrizable space.

Corollary 5 Let α : G ↷ X be a free minimal action of a countable discrete amenable group on a compact metrizable space. If the action is almost finite, then the crossed product C(X)⋊rG is classified by its Elliott Invariant.

Another very natural way to quantify condition (ii) in Definition 18 is through invariant measures. This leads to the following version of almost finiteness, given in Kerr and Szabó (2020).

Definition 19 (Kerr and Szabó (2020), Definition 3.5 and Proposition 3.3) Let G be a discrete group and let X be a compact metrizable G-space. Then, an action G ↷ X is almost finite in measure if it satisfies the condition (i) in Definition 18 and the following condition (ii’):

sup_μ μ(X ∖ ⊔_{i∈I} SiVi) < δ,

where the supremum is taken over all G-invariant Borel probability measures μ.
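The (K, δ)-invariance condition on shapes from Definition 18 can be made concrete for G = ℤ, where a group element g acts on a finite shape S by translation. The sketch below uses hypothetical helper names and an interval as the shape. It only illustrates the Følner-type ratio |gS Δ S|/|S| and does not construct a castle.

```python
def invariance_ratio(shape, g):
    """Return |gS Δ S| / |S| for a finite subset S of Z and g in Z."""
    shape = set(shape)
    translated = {s + g for s in shape}
    return len(translated ^ shape) / len(shape)

def is_invariant(shape, K, delta):
    """Check (K, delta)-invariance: the ratio is below delta for every g in K."""
    return all(invariance_ratio(shape, g) < delta for g in K)

long_interval = range(100)    # S = {0, ..., 99}
short_interval = range(10)    # S = {0, ..., 9}
ratio = invariance_ratio(long_interval, 1)
```

Translating {0, ..., 99} by 1 changes only the two endpoints, so `ratio` is 2/100, and longer intervals become (K, δ)-invariant for any fixed finite K and δ > 0.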
If we allow, in the above definition, the levels to have arbitrary diameter then we obtain precisely the uniform Rokhlin property discussed in the section “Mean Dimension and Radius of Comparison.”
Intuitively, almost finiteness implies almost finiteness in measure, and comparison allows going backward. This turns out to be true (see the proof of Kerr and Szabó (2020), Theorem 6.1), so by Theorem 32 comparison together with almost finiteness in measure implies that the crossed product is 𝒵-stable. To state another characterization of almost finiteness in measure, let us recall the definition of the small boundary property introduced by Lindenstrauss and Weiss (2000).

Definition 20 Let G be a discrete amenable group and X be a G-space. An action G ↷ X has the small boundary property if for every point x ∈ X and every open neighborhood U of x, there is an open set V such that x ∈ V ⊂ U and μ(V̄ ∖ V) = 0 for every G-invariant probability measure μ.

The definition of the small boundary property was first given by Lindenstrauss and Weiss using the notion of orbit capacity (Lindenstrauss and Weiss (2000), Definition 5.1) and was proved to be equivalent to the above definition in Proposition 3.3 of Kerr and Szabó (2020).

Remark 23 Recall that a topological space X has zero (small) inductive dimension if for every x ∈ X and every open set U containing x, there exists an open set V such that x ∈ V ⊂ U and V has empty boundary (see, e.g., Pears (1975), Chap. 4). Therefore, the small boundary property can be viewed as a dynamical analogue of a zero-dimensional space where the smallness of the boundary is captured by the G-invariant measures.

It was shown in Lindenstrauss and Weiss (2000) that the small boundary property implies mean dimension zero. The converse is known to be true for ℤ^n-actions (see Gutman et al. (2016); the case of ℤ-actions was first established in Lindenstrauss (2000)). It is a major open problem whether the equivalence of these two properties still holds for actions of countable amenable groups.
The following theorem is due to Kerr and Szabó.

Theorem 33 (Kerr and Szabó (2020), Theorem 5.6). Let G ↷ X be a free action of a countable discrete amenable group on a compact metrizable space. Then the action is almost finite in measure if and only if it has the small boundary property.

As a consequence of Theorem 33 and Theorem 32, the following are equivalent for a free action of a countable amenable group on a compact metrizable space:

(1) The action is almost finite.
(2) The action has comparison and the small boundary property.

It might be interesting to compare this result to the equivalence between (ii) and (iii) in Theorem 31.
Returning to the connection to the classification of nuclear C*-algebras, we recall that by Theorem 31 the Toms–Winter conjecture holds for any simple crossed product C(X)⋊rG that has uniform property Γ. The next result establishes a formal link between the small boundary property and uniform property Γ.

Theorem 34 (Kerr and Szabó (2020), Theorem 9.4). Let G be an infinite countable discrete amenable group and let X be a compact metrizable G-space. If the action G ↷ X has the small boundary property, then the crossed product C(X)⋊rG has uniform property Γ.

As a consequence, if mean dimension zero of the action turns out to imply strict comparison for the crossed product (note that this is a very special case of Conjecture 1), then every free minimal action G ↷ X with the small boundary property
show that the composition evx ∘ β restricts to an invariant mean for the stabilizer subgroup Gx (see Breuillard et al. (2017, Proposition 2.7)). It follows that for every x ∈ ∂_F G, the stabilizer subgroup Gx is amenable.
Day proved that every group has an amenable normal subgroup Ra(G) that contains all the other amenable normal subgroups of G (Day (1957), Lemma 4.1). The group Ra(G) is called the amenable radical of G. The previous paragraph, together with strong proximality and minimality of ∂_F G, yields the next theorem.
Recall that an action of G on X is a group homomorphism from G into Homeo(X). The kernel of the action is by definition the kernel of this homomorphism.

Theorem 36 (Furman (2003); see also Breuillard et al. (2017), Proposition 2.8). The kernel of the action G ↷ ∂_F G is precisely the amenable radical Ra(G).

In particular, if G is amenable, then ∂_F G is a one-point space. In general, ∂_F G is extremally disconnected, that is, every open subset of ∂_F G has an open closure (Kalantar and Kennedy (2017), Remark 3.16; see also Breuillard et al. (2017), Proposition 2.4).

The Hamana Boundary
Definition 21 A unital self-adjoint subspace of a unital C*-algebra is called an operator system.

Recall that a linear map φ : A → B between two C*-algebras is positive if φ(A+) ⊂ B+, and completely positive (c.p.) if for every n ∈ ℕ, the matrix amplification φ(n) : Mn(A) → Mn(B), φ(n)([aij]) = [φ(aij)] is positive (see, e.g., Paulsen (2002)). Suppose A is a unital C*-algebra and S ⊂ A is an operator system. Then for every n ∈ ℕ, the operator system Mn(S) inherits a norm and order structure from Mn(A). Therefore, we can define c.p. maps between operator systems. An invertible and unital completely positive (u.c.p.) map between two operator systems whose inverse is also c.p. is called a complete order isomorphism (see Blackadar (2006), II.6.9.16). Moreover, a map φ : S → T between two operator systems is said to be completely isometric if its matrix amplification φ(n) : Mn(S) → Mn(T) is an isometry for every n ∈ ℕ.

Definition 22 (Hamana 1985). Let S be an operator system and let G be a discrete group.

(1) A G-action on S is a group homomorphism from G into the group of complete order isomorphisms on S. In this case, S is called a G-operator system.
(2) A map φ : S → T between two G-operator systems is a G-equivariant map if

φ(ga) = gφ(a)

for all a ∈ S and g ∈ G.

Definition 23 (Hamana 1985). An operator system U is said to be injective if for any inclusion S ⊂ T of operator systems and any u.c.p. map φ : S → U, there exists a u.c.p. map φ̃ : T → U such that φ̃(a) = φ(a) for all a ∈ S (i.e., φ̃ is an extension of φ).

Example 26

(1) Arveson’s extension theorem (see, e.g., Paulsen (2002), Chap. 7) shows that B(H) is injective for any Hilbert space H.
(2) If V ⊂ U is an inclusion of operator systems such that U is injective and there is a u.c.p. projection from U onto V, then V is necessarily injective. It follows that an operator system V ⊂ B(H) is injective if and only if there exists a u.c.p. projection from B(H) onto V

Definition 24 Let G be a discrete group. A G-operator system U is said to be G-injective if for any inclusion S ⊂ T of G-operator systems and any G-equivariant u.c.p. map φ : S → U, there exists a G-equivariant u.c.p. map φ̃ : T → U such that φ̃(a) = φ(a) for all a ∈ S (i.e., φ̃ is an extension of φ).

Example 27 Let V be an injective operator system and let G be a discrete group. The operator system ℓ∞(G, V) admits a G-action defined by
522 Dynamical Systems and C-Algebras
Definition 25 Let G be a discrete group and S be a G-operator system. (Kalantar and Kennedy 2017, p. 5).

(1) A G-extension of S is a pair (T, κ) of a G-operator system T and a G-equivariant unital complete isometry κ : S → T.
(2) A G-extension (T, κ) is said to be
• G-injective if T is G-injective;
• G-essential if for every G-equivariant u.c.p. map φ : T → W, φ is a complete isometry whenever φ ∘ κ is.

Example 28 Let S ⊆ B(H) be a G-operator system. Consider the map κ : S → ℓ^∞(G, B(H)) defined by

Then the pair (ℓ^∞(G, B(H)), κ) is a G-extension of S, and by Examples 26 and 27, this extension is G-injective.

Definition 26 Let S be a G-operator system. A G-extension of S that is both G-injective and G-essential is called a G-injective envelope of S.

It follows that I_G(ℂ) is an injective operator system.

By Choi and Effros (1977, Theorem 3.1), the injective envelope I_G(ℂ) becomes a C*-algebra when equipped with the Choi-Effros product

a ∘ b ≔ ψ(ab).

It is clear that the product is commutative, hence by the Gelfand–Naimark theorem I_G(ℂ) is isomorphic to the algebra of continuous functions on some compact Hausdorff space.

Definition 27 Let G be a discrete group. The Hamana boundary ∂_H G is the compact Hausdorff space that satisfies C(∂_H G) = I_G(ℂ).

Proposition 11 (Kalantar and Kennedy (2017), Proposition 3.4 and Proposition 3.7). The G-action on ∂_H G is minimal and strongly proximal, that is, ∂_H G is a G-boundary.

Theorem 38 (Kalantar and Kennedy (2017), Theorem 3.11). ∂_H G = ∂_F G.
τ_λ(a) = ⟨aδ_g, δ_g⟩. We say G has the unique trace property if τ_λ is the only tracial state on C*_r(G).

The following groundbreaking result was first proved by Kalantar and Kennedy (2017, Theorem 6.2) and later reproved by Breuillard, Kalantar, Kennedy, and Ozawa (2017, Theorem 3.1).

Theorem 39 Let G be a discrete group. Then the following are equivalent:

Corollary 6 Every discrete C*-simple group has the unique trace property. (The converse was later shown to be false by Le Boudec (2017).)

Theorem 39 also led to algebraic criteria for C*-simplicity. Following Breuillard et al. (2017) we say a subgroup H of G is normalish if for any n ≥ 1 and t_1, ..., t_n ∈ G, the intersection ⋂_i t_i H t_i^{-1} is infinite.
Theorem 43 (Kennedy (2020), Theorem 5.1). A discrete group G is C*-simple if and only if it has no amenable residually normal subgroups.

The results and techniques obtained in Kalantar and Kennedy (2017) and Breuillard et al. (2017) have led to many further developments and research directions, including, but not limited to, the study of ideal structure for crossed products and groupoid C*-algebras (Borys 2019; Bryder 2017; Kalantar and Scarparo 2021; Kawabe 2017; Kennedy et al. 2021; Kennedy and Schafhauser 2019), amenability of Thompson’s groups (Le Boudec and Bon 2018), and stationary C*-dynamical systems (Hartman and Kalantar 2017).

References

Alvin L, Ash DD, Ormes NS (2018) Bounded topological speedups. Dyn Syst 33(2):303–331
Ara P, Lledó F, Perera F, eds (2011) Aspects of operator algebras and applications, volume 534 of Contemporary Mathematics. American Mathematical Society, Providence; Real Sociedad Matemática Española, Madrid. Papers from the UIMP-RSME Lluís A. Santaló Summer School in Mathematics held at the Universidad Internacional Menéndez Pelayo, Santander, July 21–25, 2008
Arnoux P, Ornstein DS, Weiss B (1985) Cutting and stacking, interval exchanges and geometric models. Israel J Math 50(1–2):160–168
Ash DD (2016) Topological speedups. ProQuest LLC, Ann Arbor, MI, Thesis (Ph.D.), University of Denver
Bezuglyi S, Karpel O (2016) Bratteli diagrams: structure, measures, dynamics. In: Dynamics and numbers, volume 669 of Contemporary Mathematics. American Mathematical Society, Providence, pp 1–36
Bezuglyi S, Dooley AH, Medynets K (2005) The Rokhlin lemma for homeomorphisms of a Cantor set. Proc Am Math Soc 133(10):2957–2964
Blackadar B (2006) Operator algebras, volume 122 of Encyclopaedia of mathematical sciences. Springer, Berlin. Theory of C*-algebras and von Neumann algebras, Operator Algebras and Non-commutative Geometry, III
Blackadar B, Handelman D (1982) Dimension functions and traces on C*-algebras. J Funct Anal 45(3):297–340
Borys C (2019) The Furstenberg boundary of a groupoid. Preprint. arXiv:1904.10062
Boyle MM (1983) Topological orbit equivalence and factor maps in symbolic dynamics. ProQuest LLC, Ann Arbor, MI. Thesis (Ph.D.), University of Washington
Boyle M, Handelman D (1994) Entropy versus orbit equivalence for minimal homeomorphisms. Pac J Math 164(1):1–13
Boyle M, Tomiyama J (1998) Bounded topological orbit equivalence and C*-algebras. J Math Soc Japan 50(2):317–329
Bratteli O (1972) Inductive limits of finite dimensional C*-algebras. Trans Am Math Soc 171:195–234
Breuillard E, Kalantar M, Kennedy M, Ozawa N (2017) C*-simplicity and the unique trace property for discrete groups. Publ Math Inst Hautes Études Sci 126:35–71
Brown NP, Ozawa N (2008) C*-algebras and finite-dimensional approximations. Graduate studies in mathematics, vol 88. American Mathematical Society, Providence
Bryder RS (2017) Injective envelopes and the intersection property. Preprint. arXiv:1704.02723
Carlsen TM, Ruiz E, Sims A, Tomforde M (2021) Reconstruction of groupoids and C*-rigidity of dynamical systems. Adv Math 390: Paper No. 107923, 55
Castillejos J, Evington S, Tikuisis A, White S, Winter W (2021) Nuclear dimension of simple C*-algebras. Invent Math 224(1):245–290
Choi MD, Effros EG (1977) Injectivity and operator spaces. J Funct Anal 24(2):156–209
Conley C, Jackson S, Marks A, Seward B, Tucker-Drob R (2020) Borel asymptotic dimension and hyperfinite equivalence relations. Preprint. arXiv:2009.06721
Connes A (1976) Classification of injective factors. Cases II_1, II_∞, III_λ, λ ≠ 1. Ann Math 104(1):73–115
Connes A, Feldman J, Weiss B (1982) An amenable equivalence relation is generated by a single transformation. Ergodic Theory Dynam Syst 1(4):431–450
Coornaert M (2015) Topological dimension and dynamical systems. Universitext. Springer, Cham. Translated and revised from the 2005 French original
Cortez MI (2006) ℤ^d Toeplitz arrays. Discrete Contin Dyn Syst 15(3):859–881
Cortez MI, Medynets K (2016) Orbit equivalence rigidity of equicontinuous systems. J Lond Math Soc 94(2):545–556
Cortez MI, Petite S (2014) Invariant measures and orbit equivalence for generalized Toeplitz subshifts. Groups Geom Dyn 8(4):1007–1045
Crainic M, Moerdijk I (2000) A homology theory for étale groupoids. J Reine Angew Math 521:25–46
Cuntz J (1978) Dimension functions on simple C*-algebras. Math Ann 233(2):145–153
Dahl H (2008) AF equivalence relations associated to locally finite groups. J Ramanujan Math Soc 23(1):77–95
Day MM (1957) Amenable semigroups. Ill J Math 1:509–544
de Cornulier Y (2014) Groupes pleins-topologiques (d’après Matui, Juschenko, Monod, ...). Astérisque (361):Exp. No. 1064, viii, pp 183–223
de la Harpe P (2007) On simplicity of reduced C*-algebras of groups. Bull Lond Math Soc 39(1):1–26
Dougherty R, Jackson S, Kechris AS (1994) The structure of hyperfinite Borel equivalence relations. Trans Am Math Soc 341(1):193–225
Downarowicz T (2005) Survey of odometers and Toeplitz flows. In: Algebraic and topological dynamics, volume 385 of Contemporary Mathematics. American Mathematical Society, Providence, pp 7–37
Downarowicz T, Zhang G (2017) The comparison property of amenable groups. arXiv:1901.01457
Downarowicz T, Zhang G (2019) Symbolic extensions of amenable group actions and the comparison property. arXiv:1712.05129
Durand F (2010) Combinatorics on Bratteli diagrams and dynamical systems. In: Combinatorics, automata and number theory, volume 135 of Encyclopedia Math. Appl. Cambridge University Press, Cambridge, pp 324–372
Durand F, Perrin D (2022) Dimension groups and dynamical systems: substitutions, Bratteli diagrams and Cantor systems, volume 196 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge
Dye HA (1959) On groups of measure preserving transformations. I. Am J Math 81:119–159
Dye HA (1963) On groups of measure preserving transformations. II. Am J Math 85:551–576
Effros EG (1981) Dimensions and C*-algebras, volume 46 of CBMS Regional conference series in Mathematics. Conference Board of the Mathematical Sciences, Washington, DC
Effros EG, Handelman DE, Shen CL (1980) Dimension groups and their affine representations. Am J Math 102(2):385–407
Elliott GA (1976) On the classification of inductive limits of sequences of semisimple finite-dimensional algebras. J Algebra 38(1):29–44
Elliott GA, Niu Z (2013) On the radius of comparison of a commutative C*-algebra. Can Math Bull 56(4):737–744
Elliott GA, Niu Z (2017) The C*-algebra of a minimal homeomorphism of zero mean dimension. Duke Math J 166(18):3569–3594
Ellis R (1960) Universal minimal sets. Proc Am Math Soc 11:540–543
Feldman J, Moore CC (1977) Ergodic equivalence relations, cohomology, and von Neumann algebras. I. Trans Am Math Soc 234(2):289–324
Forrest AH, Hunton J (1999) The cohomology and K-theory of commuting homeomorphisms of the Cantor set. Ergodic Theory Dynam Syst 19(3):611–625
Furman A (1999) Orbit equivalence rigidity. Ann Math 150(3):1083–1108
Furman A (2003) On minimal strongly proximal actions of locally compact groups. Israel J Math 136:173–187
Furstenberg H (1973) Boundary theory and stochastic processes on homogeneous spaces. In: Harmonic analysis on homogeneous spaces (Proc. Sympos. Pure Math., Vol. XXVI, Williams Coll., Williamstown, MA., 1972), pp 193–229
Gaboriau D (2000) Coût des relations d’équivalence et des groupes. Invent Math 139(1):41–98
Gähler F, Hunton J, Kellendonk J (2013) Integral cohomology of rational projection method patterns. Algebr Geom Topol 13(3):1661–1708
Gao S, Jackson S (2015) Countable abelian group actions and hyperfinite equivalence relations. Invent Math 201(1):309–383
Ghys É, de la Harpe P, eds (1988) Sur les groupes hyperboliques d’après Mikhael Gromov, volume 83 of Progress in Mathematics. Birkhäuser Boston, Inc., Boston, MA, 1990. Papers from the Swiss Seminar on Hyperbolic Groups held in Bern
Giol J, Kerr D (2010) Subshifts and perforation. J Reine Angew Math 639:107–119
Giordano T, de la Harpe P (1997) Moyennabilité des groupes dénombrables et actions sur les espaces de Cantor. C R Acad Sci 324(11):1255–1258
Giordano T, Putnam IF, Skau CF (1995) Topological orbit equivalence and C*-crossed products. J Reine Angew Math 469:51–111
Giordano T, Putnam IF, Skau CF (1999) Full groups of Cantor minimal systems. Israel J Math 111:285–320
Giordano T, Putnam I, Skau C (2004) Affable equivalence relations and orbit structure of Cantor dynamical systems. Ergodic Theory Dynam Syst 24(2):441–475
Giordano T, Matui H, Putnam IF, Skau CF (2008) Orbit equivalence for Cantor minimal ℤ^2-systems. J Am Math Soc 21(3):863–892
Giordano T, Matui H, Putnam IF, Skau CF (2010) Orbit equivalence for Cantor minimal ℤ^d-systems. Invent Math 179(1):119–158
Giordano T, Kerr D, Phillips NC, Toms A (2018) Crossed products of C*-algebras, topological dynamics, and classification. Advanced Courses in Mathematics. CRM Barcelona. Birkhäuser/Springer, Cham. Lecture notes based on the course held at the Centre de Recerca Matemàtica (CRM) Barcelona, June 14–23, 2011. Edited by Francesc Perera
Giordano T, Putnam IF, Skau CF (2019) ℤ^d-odometers and cohomology. Groups Geom Dyn 13(3):909–938
Glasner S (1976) Proximal flows. Lecture notes in Mathematics, vol 517. Springer, Berlin/New York
Glasner E (2003) Ergodic theory via joinings, volume 101 of Mathematical surveys and monographs. American Mathematical Society, Providence
Glasner E, Weiss B (1995) Weak orbit equivalence of Cantor minimal systems. Int J Math 6(4):559–579
Glasner E, Weiss B (2015) Uniformly recurrent subgroups. In: Recent trends in ergodic theory and dynamical systems, volume 631 of Contemp. Math. American Mathematical Society, Providence, pp 63–75
Gong G, Lin H, Niu Z (2020a) A classification of finite simple amenable 𝒵-stable C*-algebras, I: C*-algebras with generalized tracial rank one. C R Math Acad Sci Soc R Can 42(3):63–450
Gong G, Lin H, Niu Z (2020b) A classification of finite simple amenable 𝒵-stable C*-algebras, II: C*-algebras with rational generalized tracial rank one. C R Math Acad Sci Soc R Can 42(4):451–539
Guentner E, Willett R, Yu G (2017) Dynamic asymptotic dimension: relation to dynamics, topology,
coarse geometry, and C*-algebras. Math Ann 367(1–2):785–829
Gutman Y, Lindenstrauss E, Tsukamoto M (2016) Mean dimension of ℤ^k-actions. Geom Funct Anal 26(3):778–817
Haagerup U (1987) Connes’ bicentralizer problem and uniqueness of the injective factor of type III_1. Acta Math 158(1–2):95–148
Haagerup U (2014) Quasitraces on exact C*-algebras are traces. C R Math Acad Sci Soc R Can 36(2-3):67–92
Haagerup U (2017) A new look at C*-simplicity and the unique trace property of a group. In: Operator algebras and applications: the Abel Symposium 2015, volume 12 of Abel Symp. Springer, Cham, pp 167–176
Hamana M (1985) Injective envelopes of C*-dynamical systems. Tohoku Math J 37(4):463–487
Hartman Y, Kalantar M (2017) Stationary C*-dynamical systems. To appear in J. Eur. Math. Soc. (includes an appendix by Uri Bader, Yair Hartman, and Mehrdad Kalantar). arXiv:2107.03980
Herman RH, Putnam IF, Skau CF (1992) Ordered Bratteli diagrams, dimension groups and topological dynamics. Int J Math 3(6):827–864
Hines T (2015) The radius of comparison and mean dimension. ProQuest LLC, Ann Arbor, MI, Thesis (Ph.D.), Purdue University
Hirshberg I, Winter W, Zacharias J (2015) Rokhlin dimension and C*-dynamics. Commun Math Phys 335(2):637–670
Hjorth G, Molberg M (2006) Free continuous actions on zero-dimensional spaces. Topology Appl 153(7):1116–1131
Hopf E (1932) Theory of measure and invariant integrals. Trans Am Math Soc 34(2):373–393
Høynes S-M (2016) Toeplitz flows and their ordered K-theory. Ergodic Theory Dynam Syst 36(6):1892–1921
Hunton J (2015) Spaces of projection method patterns and their cohomology. In: Mathematics of aperiodic order, volume 309 of Progr. Math. Birkhäuser/Springer, Basel, pp 105–135
Ioana A (2011) W*-superrigidity for Bernoulli actions of property (T) groups. J Am Math Soc 24(4):1175–1226
Ioana A (2013) Classification and rigidity for von Neumann algebras. In: European Congress of Mathematics. European Mathematical Society, Zürich, pp 601–625
Ioana A (2018) Rigidity for von Neumann algebras. In: Proceedings of the International Congress of Mathematicians, Rio de Janeiro 2018. Vol. III. Invited lectures. World Sci. Publ., Hackensack, pp 1639–1672
Jackson S, Kechris AS, Louveau A (2002) Countable Borel equivalence relations. J Math Log 2(1):1–80
Jiang X, Su H (1999) On a simple unital projectionless C*-algebra. Am J Math 121(2):359–413
Johansen Ø (1998) Ordered K-theory and Bratteli diagrams: implications for Cantor minimal systems. Ph.D. thesis, NTNU
Johnson ASA, McClendon DM (2022) Topological speedups of ℤ^d-actions. Dyn Syst 37(2):222–261
Kalantar M, Kennedy M (2017) Boundaries of reduced C*-algebras of discrete groups. J Reine Angew Math 727:247–267
Kalantar M, Scarparo E (2021) Boundary maps and covariant representations. Preprint. arXiv:2106.06382
Katzlinger L (2019) Topological full groups. Preprint. arXiv:1907.07424
Kawabe T (2017) Uniformly recurrent subgroups and the ideal structure of reduced crossed products. Preprint. arXiv:1701.03413
Kennedy M (2020) An intrinsic characterization of C*-simplicity. Ann Sci Éc Norm Supér 53(5):1105–1119
Kennedy M, Schafhauser C (2019) Noncommutative boundaries and the ideal structure of reduced crossed products. Duke Math J 168(17):3215–3260
Kennedy M, Kim S-J, Li X, Raum S, Ursu D (2021) The ideal intersection property for essential groupoid C*-algebras. Preprint. arXiv:2107.03980
Kerr D (2020) Dimension, comparison, and almost finiteness. J Eur Math Soc (JEMS) 22(11):3697–3745
Kerr D, Szabó G (2020) Almost finiteness and the small boundary property. Commun Math Phys 374(1):1–31
Kirchberg E (1995) Exact C*-algebras, tensor products, and the classification of purely infinite algebras. In: Proceedings of the International Congress of Mathematicians, vol. 1, 2 (Zürich, 1994). Birkhäuser, Basel, pp 943–954
Krieger W (1976) On ergodic flows and the isomorphism of factors. Math Ann 223(1):19–70
Krieger W (1979/80) On a dimension for a class of homeomorphism groups. Math Ann 252(2):87–95
Kuratowski K (1968) Topology. Vol. II. Academic Press, New York-London; Państwowe Wydawnictwo Naukowe [Polish Scientific Publishers], Warsaw. New edition, revised and augmented. Translated from the French by A. Kirkor
Laca M, Spielberg J (1996) Purely infinite C*-algebras from boundary actions of discrete groups. J Reine Angew Math 480:125–139
Le Boudec A (2017) C*-simplicity and the amenable radical. Invent Math 209(1):159–174
Le Boudec A, Bon NM (2018) Subgroup dynamics and C*-simplicity of groups of homeomorphisms. Ann Sci Éc Norm Supér 51(3):557–602
Li H (2013) Sofic mean dimension. Adv Math 244:570–604
Li X (2018a) Continuous orbit equivalence rigidity. Ergodic Theory Dynam Syst 38(4):1543–1563
Li X (2018b) Dynamic characterizations of quasi-isometry and applications to cohomology. Algebr Geom Topol 18(6):3477–3535
Lindenstrauss E (2000) Mean dimension, small entropy factors and an embedding theorem. Inst Hautes Études Sci Publ Math (89):227–262 (1999)
Lindenstrauss E, Weiss B (2000) Mean topological dimension. Israel J Math 115:1–24
Matui H (2008a) An absorption theorem for minimal AF equivalence relations on Cantor sets. J Math Soc Japan 60(4):1171–1185
Matui H (2008b) Torsion in coinvariants of certain Cantor minimal ℤ^2-systems. Trans Am Math Soc 360(9):4913–4928
Matui H (2012) Homology and topological full groups of étale groupoids on totally disconnected spaces. Proc Lond Math Soc 104(1):27–56
Matui H (2015) Topological full groups of one-sided shifts of finite type. J Reine Angew Math 705:35–84
Matui H (2016) Étale groupoids arising from products of shifts of finite type. Adv Math 303:502–548
Matui H (2017) Topological full groups of étale groupoids. In: Operator algebras and applications: the Abel Symposium 2015, volume 12 of Abel Symposium. Springer, Cham, pp 203–230
Matui H, Sato Y (2012) Strict comparison and 𝒵-absorption of nuclear C*-algebras. Acta Math 209(1):179–196
Matui H, Sato Y (2014) Decomposition rank of UHF-absorbing C*-algebras. Duke Math J 163(14):2687–2708
Medynets K (2006) Cantor aperiodic systems and Bratteli diagrams. C R Math Acad Sci Paris 342(1):43–46
Medynets K, Sauer R, Thom A (2017) Cantor systems and quasi-isometry of groups. Bull Lond Math Soc 49(4):709–724
Monod N, Shalom Y (2006) Orbit equivalence rigidity and bounded cohomology. Ann Math 164(3):825–878
Murray FJ, von Neumann J (1936) On rings of operators. Ann Math 37(1):116–229
Murray FJ, von Neumann J (1943) On rings of operators. IV. Ann Math 44:716–808
Niu Z (2019) Comparison radius and mean topological dimension: ℤ^d-actions. Preprint. arXiv:1906.09172
Niu Z (2022) Comparison radius and mean topological dimension: Rokhlin property, comparison of open sets, and subhomogeneous C*-algebras. J Anal Math 146(2):595–672
Ormes NS (1997) Strong orbit realization for minimal homeomorphisms. J Anal Math 71:103–133
Ornstein DS, Weiss B (1980) Ergodic theory of amenable group actions. I. The Rohlin lemma. Bull Amer Math Soc (NS) 2(1):161–164
Ornstein DS, Weiss B (1987) Entropy and isomorphism theorems for actions of amenable groups. J Analyse Math 48:1–141
Paterson ALT (1999) Groupoids, inverse semigroups, and their operator algebras, volume 170 of Progress in Mathematics. Birkhäuser Boston, Inc, Boston, MA
Paulsen V (2002) Completely bounded maps and operator algebras, volume 78 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge
Pears AR (1975) Dimension theory of general spaces. Cambridge University Press, Cambridge, UK
Phillips NC (2000) A classification theorem for nuclear purely infinite simple C*-algebras. Doc Math 5:49–114
Phillips NC (2005) Crossed products of the Cantor set by free minimal actions of ℤ^d. Commun Math Phys 256(1):1–42
Phillips NC (2016) The C*-algebra of a minimal homeomorphism with finite mean dimension has finite radius of comparison. Preprint. arXiv:1605.07976
Pimsner M, Voiculescu D (1980) Imbedding the irrational rotation C*-algebra into an AF-algebra. J Oper Theory 4(2):201–210
Poon YT (1989) A K-theoretic invariant for dynamical systems. Trans Am Math Soc 311(2):515–533
Popa S (2007) Cocycle and orbit equivalence superrigidity for malleable actions of w-rigid groups. Invent Math 170(2):243–295
Powers RT (1975) Simplicity of the C*-algebra associated with the free group on two generators. Duke Math J 42:151–156
Putnam IF (1989) The C*-algebras associated with minimal homeomorphisms of the Cantor set. Pac J Math 136(2):329–353
Putnam IF (2010) Orbit equivalence of Cantor minimal systems: a survey and a new proof. Expo Math 28(2):101–131
Putnam IF (2018) Cantor minimal systems, volume 70 of University lecture series. American Mathematical Society, Providence
Putnam I, Schmidt K, Skau C (1986) C*-algebras associated with Denjoy homeomorphisms of the circle. J Oper Theory 16(1):99–126
Renault J (1980) A groupoid approach to C*-algebras, volume 793 of Lecture notes in Mathematics. Springer, Berlin
Renault J (2003) AF equivalence relations and their cocycles. In: Operator algebras and mathematical physics (Constantza, 2001). Theta, Bucharest, pp 365–377
Renault J (2008) Cartan subalgebras in C*-algebras. Irish Math Soc Bull 61:29–63
Rieffel MA (1981) C*-algebras associated with irrational rotations. Pac J Math 93(2):415–429
Rørdam M (2003) A simple C*-algebra with a finite and an infinite projection. Acta Math 191(1):109–142
Rørdam M (2004) The stable and the real rank of 𝒵-absorbing C*-algebras. Int J Math 15(10):1065–1084
Rosenberg J, Schochet C (1987) The Künneth theorem and the universal coefficient theorem for Kasparov’s generalized K-functor. Duke Math J 55(2):431–474
Sims A, Szabó G, Williams D (2020) Operator algebras and dynamics: groupoids, crossed products, and Rokhlin dimension. Advanced courses in Mathematics. CRM Barcelona. Birkhäuser/Springer, Cham. Lecture notes from the Advanced Course held at Centre de Recerca Matemàtica (CRM) Barcelona, March 13–17, 2017. Edited by Francesc Perera
Singer IM (1955) Automorphisms of finite factors. Am J Math 77:117–133
Skau C (2000) Ordered K-theory and minimal symbolic dynamical systems. Colloq Math 84/85:203–227. Dedicated to the memory of Anzelm Iwanik
Sugisaki F (1998) The relationship between entropy and strong orbit equivalence for the minimal homeomorphisms. II. Tokyo J Math 21(2):311–351
Sugisaki F (2003) The relationship between entropy and strong orbit equivalence for the minimal homeomorphisms. I. Int J Math 14(7):735–772
Szabó G (2015) The Rokhlin dimension of topological ℤ^m-actions. Proc Lond Math Soc 110(3):673–694
Szabó G, Wu J, Zacharias J (2019) Rokhlin dimension for actions of residually finite groups. Ergodic Theory Dynam Syst 39(8):2248–2304
Tikuisis A, White S, Winter W (2017) Quasidiagonality of nuclear C*-algebras. Ann Math 185(1):229–284
Tomiyama J (1996) Topological full groups and structure of normalizers in transformation group C*-algebras. Pac J Math 173(2):571–583
Toms AS (2006) Flat dimension growth for C*-algebras. J Funct Anal 238(2):678–708
Toms AS, Winter W (2013) Minimal dynamics and K-theoretic rigidity: Elliott’s conjecture. Geom Funct Anal 23(1):467–481
Tu J-L (1999) La conjecture de Baum-Connes pour les feuilletages moyennables. K-Theory 17(3):215–264
Vershik AM (1981) Uniform algebraic approximation of shift and multiplication operators. Dokl Akad Nauk SSSR 259(3):526–529
Vershik AM (1982) A theorem on Markov periodic approximation in ergodic theory. Zap Nauchn Sem Leningrad Otdel Mat Inst Steklov (LOMI) 115:72–82, 306. Boundary value problems of mathematical physics and related questions in the theory of functions, 14
Villadsen J (1998) Simple C*-algebras with perforation. J Funct Anal 154(1):110–116
von Neumann J (1932) Proof of the quasi-ergodic hypothesis. Proc Natl Acad Sci 18(1):70–82
Winter W (2012) Nuclear dimension and 𝒵-stability of pure C*-algebras. Invent Math 187(2):259–342
Winter W, Zacharias J (2010) The nuclear dimension of C*-algebras. Adv Math 224(2):461–498
The Complexity and the Structure and Classification of Dynamical Systems

Matthew Foreman
University of California at Irvine, Irvine, CA, USA

Article Outline

Glossary
Introduction: What Is a Dynamical System? What Is Structure? What Is a Classification?
Examples of Structure and Classification Results in Dynamical Systems
Descriptive Complexity
Complexity in Structure Theory
Complexity in Classification Theory
Standard Mathematical Objects in Each Region
Placing Dynamical Systems in Each Region
Bibliography

Glossary

Analytic set A subset A of a Polish space X is analytic if there is a Polish space Y and a Borel set B ⊆ X × Y such that A = {x : for some y ∈ Y, (x, y) ∈ B}. Equivalently, A is the projection of B to the X-axis.
Anosov diffeomorphism Definition 13.
Benchmarking An informal term describing the location of a set or an equivalence relation in terms of reducibility to other sets or equivalence relations.
Bernoulli shift Definition 7.
Borel hierarchy Definition 70.
Co-analytic set A subset C of a Polish space X is co-analytic if there is a Polish space Y and a Borel set B ⊆ X × Y such that C = {x : for all y ∈ Y, (x, y) ∈ B}. Equivalently, the complement of C is analytic.
Descriptive complexity This term is used for the placement of a structure or a classification problem among the benchmarks of reducibility.
Distal Topologically distal is Definition 20, measure distal is Definition 27.
=⁺ equivalence relation Introduced and discussed in section “=⁺ and the Friedman-Stanley jump operator.”
E_0 equivalence relation Definition 32.
Factor map: measurable Let X, Y be standard measure spaces and T : X → X, S : Y → Y be measure-preserving transformations. Then S is a factor of T if there is a (not necessarily invertible) measure-preserving π : X → Y such that π ∘ T = S ∘ π almost everywhere. The map π is the factor map.
Factor map: topological Let X, Y be topological spaces and T : X → X, S : Y → Y be homeomorphisms. Then S is a factor of T if there is a continuous surjective map π : X → Y such that π ∘ T = S ∘ π. The map π is the factor map.
Ill-founded, Well-founded If T ⊆ X^{<ℕ} is a tree, then T is well-founded if and only if there is no function f : ℕ → X such that for all n, f ↾ {0, 1, 2, ..., n} ∈ T. Equivalently, T has no infinite paths. If T is not well-founded, then T is ill-founded.
K-automorphisms Definition 10.
Kakutani equivalence Definition 60.
Minimal homeomorphism A homeomorphism h : X → X is minimal if for every x ∈ X, {h^n(x) : n ∈ ℕ} is dense.
Measure conjugacy Let S, T be measure-preserving transformations defined on standard measure spaces (X, ℬ, μ) and (Y, 𝒞, ν). Then S, T are measure conjugate if there is an invertible measure-preserving transformation φ : X → Y such that φ ∘ T = S ∘ φ almost everywhere.
Morse-Smale diffeomorphism Definition 14.
Π^1_1-norms Definition 75.
φ(μ) Let X, Y be Polish spaces, μ a measure on X and φ : X → Y be a measurable map. Then φ and μ induce a measure φ(μ) on the Borel subsets of Y by setting φ(μ)(B) = μ(φ^{-1}(B)).
Polish space A Polish space is a topological space (X, τ) such that there is some complete separable metric d on X inducing the topology τ. The topology τ is called a Polish topology.
Polish group action A Polish group is a topological group with a Polish topology. A Polish group action is a Polish group G acting jointly continuously on a Polish space X. See section “Polish Group Actions.”
Reduction If A and B are subsets of Polish spaces X and Y, then a one-dimensional reduction of A to B is a function f : X → Y such that for all x, x ∈ A if and only if f(x) ∈ B. If A ⊆ X × X and B ⊆ Y × Y, then a two-dimensional reduction is a function f : X → Y such that for all (x_1, x_2) ∈ X × X, (x_1, x_2) ∈ A if and only if (f(x_1), f(x_2)) ∈ B. The reduction is continuous or Borel if the function is continuous or Borel. One- and two-dimensional Borel reductions are notated as ≼^1_ℬ and ≼^2_ℬ (section “What Is a Reduction?”).
Rotation number Definition 2.
Separable and complete measure spaces A measure space (X, ℬ, μ) is separable if there is a countable subset 𝒜 ⊆ ℬ such that for all S ∈ ℬ and ϵ > 0, there is an A ∈ 𝒜 such that μ(A Δ S) < ϵ. A measure space is complete if whenever A ∈ ℬ has μ(A) = 0 and B ⊆ A, then B ∈ ℬ.
Smooth equivalence relation Definition 31.
Smooth manifold A manifold with a C^k structure for some k ≥ 1.

well-ordering of {s ∈ T : s t} such for s1, s2 t, s2Ts1 if and only if s1s2. A path through a tree is a one-to-one function p from an ordinal α whose range is t-upwards closed and if β < γ < α, p(γ)T p(β). A branch through T is a maximal path through T. A tree is well-founded if and only if it has no infinite paths.
Trees of finite sequences Fix a set X. Let X^{<ℕ} be the collection of finite sequences of elements of X. Order X^{<ℕ} by setting s t if and only if s is an initial segment of t. A set T ⊆ X^{<ℕ} is a tree of finite sequences of elements of X if and only if whenever s ∈ T and t is an initial segment of s then t ∈ T. Trees of finite sequences can be viewed as elements of {0, 1}^{X^{<ℕ}}. Putting the discrete topology on {0, 1} and the product topology on {0, 1}^{X^{<ℕ}}, the space of trees is a compact topological space. If X is countable then the space of trees on X is homeomorphic to the Cantor set. Trees are discussed in section “Reducing Ill-Founded Trees.”
Topologically transitive If X is a metric space, a homeomorphism h : X → X is topologically transitive if for some x ∈ X, {h^n(x) : n ∈ ℕ} is dense.
Turbulent Turbulence is a property of a continuous action of a Polish group on a Polish space. It gives a method for showing that an equivalence relation is not the result of an S_∞-action. It is discussed in section “Turbulence” where the notion is formally defined.
Well-founded, Ill-founded If T ⊆ X^{<ℕ} is a tree, then T is well-founded if and only if there is no function f : ℕ → X such that for all n, f ↾ {0, 1, 2, ..., n} ∈ T.
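As a rough illustration of the glossary entries on trees of finite sequences and well-foundedness, the following Python sketch represents a finite tree as a set of tuples. It is not taken from the source: the names `is_tree`, `children`, `branches`, and `follows_path` are hypothetical, and the sample tree is made up for the example. A finite tree is always well-founded, so the last helper works with a membership test instead and only checks an initial stretch of a candidate infinite path.

```python
def is_tree(T):
    """True if the set T of tuples is closed under initial segments."""
    return all(s[:k] in T for s in T for k in range(len(s)))


def children(T, s):
    """Immediate extensions of the sequence s that lie in T."""
    return {t for t in T if len(t) == len(s) + 1 and t[:len(s)] == s}


def branches(T):
    """Sequences in T with no proper extension in T (ends of maximal paths)."""
    return {s for s in T if not children(T, s)}


def follows_path(in_tree, f, depth):
    """Check that (f(0), ..., f(n)) is in the tree for every n < depth.

    in_tree is a membership test on tuples; f is a function on 0, 1, 2, ...
    Passing this check for one depth is evidence of an infinite path,
    not a proof.
    """
    return all(in_tree(tuple(f(i) for i in range(n + 1))) for n in range(depth))


# A small made-up tree on X = {0, 1}.
T = {(), (0,), (1,), (0, 0), (0, 1)}


# Membership test for the tree of all finite sequences of zeros.
def all_zeros(s):
    return all(x == 0 for x in s)


print(is_tree(T))             # closed under initial segments
print(is_tree(T - {(0,)}))    # (0, 0) has lost its initial segment (0,)
print(sorted(branches(T)))
print(follows_path(all_zeros, lambda i: 0, 50))
```

In the second example the constant function f(i) = 0 satisfies the condition of the glossary entry for every n, so the tree of all-zero sequences is ill-founded; the bounded check above merely confirms the first 50 steps.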
Part of the intention is to suggest methods and problems in related areas. It is not meant to be a comprehensive study in any sense and makes no attempt to place results in a historical context. Rather it is intended to suggest classification and anti-classification results that are likely to be common in many areas of dynamical systems. Hence, the focus is on giving particular examples where descriptive set theory techniques have been successful, and on mentioning related work in passing to suggest that the examples given here are not isolated. (Apologies for leaving out important results; this reflects my ignorance and space limitations.)
The target audience for this entry is twofold. In no particular order, one audience consists of researchers whose primary interest is in some area of dynamics who may not be aware of, or who want to learn more about, the methods for studying the complexity of structure and classification offered by descriptive set theory. For these people, the attempt is to give familiar examples from dynamical systems where the complexity is understood relative to known benchmarks. Logical technicalities are avoided as much as possible, and the necessary background is given in the appendix. This is naive descriptive set theory, with the rough meaning that it omits the technicalities of quantifiers. A self-contained source on basic descriptive set theory, requiring very minimal background, is Foreman (2010). There are many excellent, more advanced sources for this material (Kechris 1995; Marker 2002; Moschovakis 2009). The author makes no attempt at historical attributions of the theorems in descriptive set theory proper; these are well covered in Kechris (1995) and Moschovakis (2009).
The second audience consists of those descriptive set theorists who are likely to be very familiar with the language of reductions and quantifier complexity. For these people, the attempt is to give a very high-level overview of some of the known results specifically in dynamical systems. The choices of examples reflect the author's taste and background, but include extremely famous results and the problems left open by them. There is a list of open problems at the end of the entry, and more will appear in a future arXiv submission with several co-authors, in particular F. G. Ramos.
The upshot is that in any given paragraph in the entry, one of the audiences will be questioning the point of what is being presented. I ask the reader’s patience in understanding the dual mission.
What Is the Difference Between a Structure Theory and a Classification? The setting for all of these results is a Polish space X, and we study classes 𝒞 ⊆ X. In some cases 𝒞 = X, in which case the structure question is less cogent.
The distinction between structure and classification is a bit vague. Very roughly, a structure theorem for 𝒞 gives criteria for elements of X to belong to 𝒞. The structure theorem gives more information than the definition of 𝒞.
Often this additional information simply gives a very explicit test for belonging to 𝒞 that may use ideas slightly different from the definition. Another form of a structure theorem is a method for building an element of 𝒞. The form of this type of classification is:
x ∈ 𝒞 if and only if x can be built using a construction with properties 1)-n)
where properties 1)-n) are concrete and explicit.
While a structure theory involves one object at a time and answers the question of whether that object is in 𝒞 ⊆ X, a classification theory involves two objects known to be in 𝒞 and asks whether they are equivalent with respect to the equivalence relation in question. It is usually concerned with assigning invariants to the equivalence classes.
Examples of common equivalence relations include being in the same orbit of a group action, being isomorphic, or having some other property in common.
This entry is concerned with two cogent questions. The first is:
Can the structure or classification theory be done with inherently countable information (is it Borel)?
The second question is:
Where does its complexity sit relative to existing benchmarks? For example: Are there numerical invariants? Can one assign countable structures to elements of the class so that isomorphism is the invariant?
532 The Complexity and the Structure and Classification of Dynamical Systems
Many classical structure theorems were proved in the 1960s and 1970s. They fit well into the rubric suggested by this entry.
What is anti-classification? Reductions are tools that allow the establishment of lower bounds on the complexity of equivalence relations. These lower bounds are often the established benchmarks discussed in this entry. The complexity of some benchmarks is extremely high: often they are not even Borel.
Why is this important? An underlying thesis of this entry is that a general question about a Polish space that is not Borel cannot be solved with inherently countable information (this is not original to this paper). If an equivalence relation is not Borel, a general question of whether x1 and x2 are equivalent requires some uncountable resource, usually an application of the uncountable Axiom of Choice. Thus, a general theme of this entry consists of determining when a question can be answered using inherently countable information: it focusses on the Borel/non-Borel distinction.
This language can be confusing: for example, if an equivalence relation E can be “classified by countable structures,” it sounds like it is “classifiable.” However, even for countable structures, the isomorphism relation may not be Borel. An example of this is the equivalence relation of isomorphism for countable groups (see section “S∞-Actions”).
Dynamical Systems Broadly speaking, a dynamical system is any group action f : G × X → X. However, to be interesting, the group actions are taken to preserve some structure on X. Commonly studied types of structure include topological, measure theoretic, smooth, and complex.
There is a deep and well-developed theory for general groups, both amenable and non-amenable, discrete, topological, and carrying a differentiable or complex structure. Accordingly, the actions can be discrete, continuous, or smooth actions.
Typical groups include ℤ^n, ℝ^n, more general Lie groups, free groups F_n, and groups arising as automorphism groups of natural structures. However, much of the theory is modeled on initial successes with the groups ℤ and ℝ, so that will be the focus in this entry. In general the theory gets progressively more difficult as the groups move from ℤ to ℤ^d to general amenable groups, to free groups, and then to general non-amenable groups. Since part of the story being told here is that classifications can be “impossible,” showing this in the simplest, most concrete situation illustrates the point most dramatically.
The roots of much of the theory of dynamical systems can be traced to the study of vector fields on smooth manifolds, naturally linked to smooth ℝ-actions (see Smale 1963). As argued by Smale (1967), these actions have smooth cross sections that give significant information about the corresponding solutions to ordinary differential equations. The smooth cross-sections are ℤ-actions by diffeomorphisms of the manifold. In short, ℝ-actions give rise to interesting ℤ-actions.
In ergodic theory, a class of ℤ-actions is induced by ℝ-actions using the method of first returns. This is studied via Kakutani equivalence and discussed in section “Kakutani Equivalence.”
Turning this around, ℤ-actions can induce ℝ-actions using the method of suspensions. This technique allows lifting the complexity results from ℤ-actions to ℝ-actions. The upshot is that ℤ- and ℝ-actions are closely related. For this reason Smale (1967) and others argue for studying ℤ-actions. Since the ℤ-actions are determined by their generator, this amounts to studying single transformations.
In summary, this entry will focus on ℤ-actions, which are determined by the generating transformation: in effect we are studying single transformations.
What structure and equivalence relations will be considered? While we will give examples of difficulties with structure theory in other contexts, the structure and classification theory of transformations described in this entry breaks very roughly into the quantitative theory and the qualitative theory. The ergodic theorem gives a framework for studying a function by repeated sampling along an orbit of a transformation T. As the number of samples grows, the averages converge to the average of the function over the whole space provided the given transformation is
ergodic. Hence, it can be viewed as the quantitative theory. The appropriate equivalence relation is conjugacy by measure-preserving transformations.
In (Smale 1967), Smale describes the study of transformations (in particular diffeomorphisms of manifolds) up to topological conjugacy as the qualitative theory. Symbolic shifts are also studied up to homeomorphisms (and up to their automorphism groups).
The understanding of the complexity of classifications in the quantitative theory is better developed than the understanding of the complexity of classifications in the qualitative theory. Ergodic theory classifications are discussed in section “Measure Isomorphism.” The nascent connections with the qualitative theory are discussed in section “Topological Conjugacy”, along with the many open problems.
To repeat, the two equivalence relations this survey will focus on are:
• Measure conjugacy
• Topological conjugacy
The first equivalence relation involves transformations that act on probability measure spaces X, Y, perhaps with other structural restrictions (for example, volume preserving diffeomorphisms of a compact manifold). Two such transformations S, T are measure conjugate (or measure isomorphic) if there is a measure isomorphism f : X → Y such that S ∘ f = f ∘ T almost everywhere.
The second equivalence relation involves homeomorphisms acting on topological spaces X, Y, perhaps with other structural restrictions (S and T might be required to preserve smooth structures on X and Y). A pair S, T are topologically conjugate (or topologically equivalent) if there is a homeomorphism h : X → Y such that S ∘ h = h ∘ T.
Choices of examples A theme of the survey is that many natural questions have intractable complexity, such as being infeasible using inherently countable information. For this to be most convincing, the classes should be taken to be as clear and concrete as possible in each context. In the context of the qualitative behavior of diffeomorphisms, for example, this is the motivation for focusing on single transformations, taking the manifold to be as simple as possible (say the 2-torus) and the diffeomorphisms to be C^∞. In many situations, the results are more general, but that is not the goal.
Homeomorphisms Often the class being given a structure theory consists of homeomorphisms with additional properties. In addition to smooth structure, one can consider minimal homeomorphisms, or topologically transitive homeomorphisms. An important example in section “Natural Classes That Are Not Borel” is the collection of topologically distal transformations. We define the first two classes as they arise in many contexts. Let X be a topological space and h : X → X be a homeomorphism. Then h is minimal if every forward h-orbit of an x ∈ X is dense. A weaker property is topological transitivity, which means that for some x ∈ X the forward h-orbit of x is dense.
What a Classification isn’t. To make sense of what a classification is, it might best be motivated by giving an example of what a classification isn’t. For concreteness, let us consider the space of measure-preserving transformations of the unit interval with the equivalence relation of measure isomorphism (conjugacy).
Because there are only continuum many measure-preserving transformations, there are at most continuum many equivalence classes [T]. By the Axiom of Choice, one can build a one-to-one function
\[ F : \{[T] : T \in \mathrm{MPT}\} \to \mathbb{R} \]
where MPT is the collection of ergodic measure-preserving transformations. By letting C(T) = F([T]), we get a map giving complete numerical invariants to the equivalence relation of measure isomorphism.
This is of course NOT what is meant by a classification, since simply invoking the Axiom of Choice gives no useful information whatsoever.
Requirements to be a structure theory or a classification A structure theory or a classification must be effective or computable in some sense.
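The role of the ergodic theorem in the quantitative theory (averages of samples along an orbit converging to the space average when the transformation is ergodic) can be illustrated numerically. The following is a minimal sketch, not part of the original entry: the irrational circle rotation, the observable f(x) = cos²(2πx), the starting point, and the helper names `rotation` and `birkhoff_average` are all assumptions chosen for illustration.

```python
import math

def rotation(alpha):
    """Return the circle rotation T(x) = x + alpha (mod 1) on [0, 1)."""
    return lambda x: (x + alpha) % 1.0

def birkhoff_average(f, T, x, n):
    """Average of f over the orbit segment x, T(x), ..., T^(n-1)(x)."""
    total = 0.0
    for _ in range(n):
        total += f(x)
        x = T(x)
    return total / n

# Assumed example data: an irrational rotation (ergodic for Lebesgue measure)
# and an observable whose integral over [0, 1) equals 1/2.
T = rotation(math.sqrt(2) - 1)
f = lambda x: math.cos(2 * math.pi * x) ** 2

# Orbit averages for an increasing number of samples.
for n in (10, 1000, 100000):
    print(n, birkhoff_average(f, T, 0.1, n))

# A rational rotation is not ergodic: the orbit of 0.1 under x -> x + 1/2
# visits only two points, so its average need not equal the space average.
print(birkhoff_average(f, rotation(0.5), 0.1, 1000))
```

For the irrational rotation the printed averages should approach the space average 1/2, while the rational rotation settles on a different value. This is only a finite-precision experiment, and it says nothing about the classification questions discussed in the entry.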
There are at least three common versions of effectiveness:
Examples of Structure and Classification Results in Dynamical Systems
Then [r(f)]_1 is called the rotation number of f.
The following facts are necessary for this to make sense:
Theorem 4. (Poincaré) Suppose that f is a homeomorphism of S^1 with a dense orbit. Then f is topologically conjugate to R_θ for an irrational θ.
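The rotation number named above can be estimated numerically. The sketch below is illustrative only and rests on an assumption: it uses the standard description of the rotation number through a lift F : ℝ → ℝ of the circle homeomorphism (with F(x + 1) = F(x) + 1), namely the limit of (F^n(x) − x)/n reduced mod 1, which may differ in presentation from the entry's Definition 2. The function name `rotation_number` and the perturbed example map are hypothetical.

```python
import math

def rotation_number(lift, n=100000, x0=0.0):
    """Estimate lim (F^n(x0) - x0) / n for a lift F, reduced mod 1."""
    x = x0
    for _ in range(n):
        x = lift(x)
    return ((x - x0) / n) % 1.0

theta = math.sqrt(2) - 1

# Lift of the rigid rotation R_theta.
rigid = lambda x: x + theta

# Assumed example: a perturbed lift. Its derivative is 1 + k*cos(2*pi*x),
# which stays positive for |k| < 1, so it is the lift of a homeomorphism.
k = 0.5
perturbed = lambda x: x + theta + (k / (2 * math.pi)) * math.sin(2 * math.pi * x)

print(rotation_number(rigid))       # close to theta
print(rotation_number(perturbed))   # some number in [0, 1)
```

For a lift of a circle homeomorphism the finite-n estimate differs from the limit by an amount on the order of 1/n, so this procedure can suggest, but never certify, whether the rotation number is rational.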
Theorem 5. Let f be an orientation preserving homeomorphism from S^1 to S^1. Then:
1. f has a rational rotation number if and only if f has a periodic point.
2. If r(f) = p/q and (p, q) = 1 then f has periodic points and each periodic point has period q.
For homeomorphisms with periodic points the issue relates to the classification of order preserving homeomorphisms of the unit interval, which is studied in Hjorth (2000).
After this entry was written, joint work of the author and Gorodetski showed that even for diffeomorphisms of S^1, the equivalence relation of conjugacy-by-homeomorphisms turns out to be maximal among equivalence relations reducible to S∞-actions and thus quite complex. The complexity lies in diffeomorphisms with fixed points.
Symbolic Shifts
Let Σ be a finite or countable alphabet (several authors refer to an alphabet as a “language,” a practice dating to Tarski). We let Σ^ℤ stand for the collection of bi-infinite countable sequences ⟨f(n) : n ∈ ℤ⟩ written in the alphabet Σ.
The space Σ^ℤ carries the natural product topology. With respect to this topology, the shift map sh : Σ^ℤ → Σ^ℤ defined by:
\[ \mathrm{sh}(f)(n) = f(n + 1) \]
equal to some w_i.
It is straightforward to verify that the collection of subshifts of finite type is a countable subset of the compact subsets of Σ^ℤ. Hence the structure aspect is easy to understand.
Each shift of finite type is determined by finite information (the forbidden words), and hence it makes sense to ask whether it is possible to determine whether two subshifts are topologically conjugate using inherently finite information. Various invariants exist which are derivable from adjacency matrices, and two subshifts of finite type are conjugate by a homeomorphism if and only if their adjacency matrices are strong shift equivalent. This is an example of a recursive reduction. It reduces the equivalence relation of conjugacy by homeomorphism to the equivalence relation on the collection of adjacency matrices of being strong shift equivalent (see section “What Is a Reduction?”).
However, the question of whether there is an algorithm to determine conjugacy by homeomorphisms for shifts of finite type remains an open problem (see Open Problem 2).
While other invariants exist for aperiodic subshifts, such as topological entropy, these are not complete invariants for conjugacy by homeomorphism. In Camerlo and Gao (2001), it is shown that the classification of homeomorphisms of the Cantor set up to conjugacy has no Borel invariants; in fact it is maximal for S∞-actions (See
from one moment earlier. K automorphisms are characterized by the present being asymptotically independent of the past.
Definition 10. A K automorphism is a measure-preserving transformation T of a standard probability space (X, ℬ, μ) such that there is a sub-σ-algebra 𝒦 ⊆ ℬ with:
1. 𝒦 ⊆ T𝒦,
2. the union of the algebras T^n 𝒦, for n ≥ 0, generates ℬ as a σ-algebra,
3. ⋂_{n=1}^{∞} T^{-n} 𝒦 = {∅, X}.
There is a perfect set of non-isomorphic K automorphisms, as was shown by Ornstein and Shields in (1973). In section “Reducing E0 to Dynamical Systems”, it is shown that the very elegant classification theory for Bernoulli shifts does NOT extend to the K automorphisms, and little is known about classifying the K automorphisms: for example, it is not known if the measure-isomorphism relation restricted to the K automorphisms is Borel (see Open Problem 4).
Symbolic Systems as Models for Measure-Preserving Transformations
Let (X, ℬ, μ) be a standard probability space, and T : X → X be an ergodic measure-preserving transformation. Then a set 𝒜 ⊆ ℬ is a generating set if ℬ is the smallest T-invariant σ-algebra containing 𝒜 (where we identify sets if the measure of their symmetric difference is zero).
If 𝒜 is a countable or finite generating set then we can make the elements of 𝒜 disjoint and keep it a generating set. For this reason, the terminology generating partitions is frequently used. If 𝒜 = ⟨A_n : n ∈ ℕ⟩ is a partition of a set of measure one, then almost every x in X determines a ℤ-sequence of names ⟨A_{n_k} : k ∈ ℤ⟩ where T^k(x) ∈ A_{n_k}. This sequence is called the 𝒜-name of x. If 𝒜 generates, then there is a set S of measure one such that for all x ≠ y in S the 𝒜-name of x is different from the 𝒜-name of y.
Hence, on S there is a one-to-one map f from X to 𝒜^ℤ given by sending x to its 𝒜-name. Letting ν = f(μ) (so copying the measure on X over to a measure on 𝒜^ℤ), we get an isomorphic copy of X as a shift space (𝒜^ℤ, ν).
Rokhlin (1967) proved that there is always a countable generating set, and Krieger (1970, 1972a) proved that if the entropy of T is finite, then there is a finite generating set.
Krieger (1972b) showed the stronger result that if the original transformation T is ergodic and K ⊆ 𝒜^ℤ is the support of ν, then (K, sh) is uniquely ergodic. We have outlined the proof of the following theorem.
Theorem 11. Let (X, ℬ, μ, T) be an ergodic measure-preserving system. Then there is a countable alphabet 𝒜 and a closed shift-invariant set K ⊆ 𝒜^ℤ carrying a unique shift-invariant measure ν such that (K, 𝒞, ν, sh) is isomorphic to (X, ℬ, μ, T). If (X, ℬ, μ, T) has finite entropy then the alphabet 𝒜 can be taken to be finite.
Koopman Operators
Let (X, ℬ, μ, T) be a measure-preserving system. Define a map
\[ U_T : L^2(X) \to L^2(X) \]
by setting U_T([f]) = [f ∘ T]. Then U_T is a well-defined unitary operator on L^2(X), and moreover, if S and T are isomorphic measure-preserving transformations, then U_S and U_T are unitarily equivalent.
Thus, it is possible to assign to each measure-preserving transformation a unitary operator in a way that sends conjugate transformations to unitarily equivalent operators. These operators are called the Koopman operators. The equivalence classes of Koopman operators are thus a collection of invariants, but except for very special cases (such as in the next subsection) they are not complete invariants.
The Translations on Compact Groups
We describe the Halmos-von Neumann Theorem. Let G be a compact group. Then G carries a Haar probability measure that is invariant under left multiplication. Suppose that g ∈ G and Haar measure is ergodic for the map T_g : G → G given by T_g(h) = gh. Then G must be the closure of the powers of g: G is a monothetic abelian group (the material presented in this example is explained in (Foreman 2000)).
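As a concrete instance of both ideas above (names with respect to a partition, and a translation on a compact group), one can compute finite windows of names for a rotation of the circle ℝ/ℤ. The sketch below is illustrative and not taken from the entry: the angle `ALPHA`, the two-element partition A_0 = [0, 1/2), A_1 = [1/2, 1), and the helper names are assumptions, and the code does not check whether this partition is generating.

```python
import math

ALPHA = math.sqrt(2) - 1  # assumed irrational rotation angle

def T_power(x, k):
    """k-th iterate (k may be negative) of T(x) = x + ALPHA (mod 1)."""
    return (x + k * ALPHA) % 1.0

def cell(x):
    """Index of the partition element containing x."""
    return 0 if x < 0.5 else 1

def name(x, ks):
    """Finite window of the name of x: indices n_k with T^k(x) in A_{n_k}."""
    return [cell(T_power(x, k)) for k in ks]

x = 0.1
print(name(x, range(-2, 6)))

# Shift equivariance on a finite window: the name of T(x) is the name of x
# moved one place to the left, which is how the shift map models T.
print(name(T_power(x, 1), range(0, 5)) == name(x, range(1, 6)))
```

The second printed value reflects the identity "name of T(x) = shift of the name of x", checked here only on a finite window and in floating point, away from the partition boundaries.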
rather than being a diffeomorphism, because “differential conjugacy is too fine” to reveal the relevant properties such as structural stability.
Some examples of classes of diffeomorphisms There are many well-studied classes of diffeomorphisms. For the purposes of this entry, we will assume that they are C^∞ and usually assume that the manifolds M are compact. We start with two of the well-known classes of diffeomorphisms.
Suppose f : M → M is a diffeomorphism and that Λ ⊆ M is invariant. Then Λ is hyperbolic for f if there are constants C > 0 and 0 < λ < 1 and a decomposition of the tangent bundle restricted to Λ as:
\[ TM|_{\Lambda} = E^{s} \oplus E^{u}, \qquad Tf(E^{u}) = E^{u}, \qquad Tf(E^{s}) = E^{s} \]
and for v ∈ E^s, n > 0
\[ \|Tf^{n}(x)v\| \le C\lambda^{n}\|v\| \]
and for v ∈ E^u, n ≥ 0
\[ \|Tf^{-n}(x)v\| \le C\lambda^{n}\|v\| \]
Definition 13. A diffeomorphism f : M → M is Anosov if the whole manifold M is hyperbolic for f.
Suppose that x is a periodic point of order m and x, f(x), …, f^{m−1}(x) are hyperbolic points. Then the stable and unstable manifolds of x are:
\[ W^{s} = \{\, y \in M \mid f^{mk}(y) \to x \text{ as } k \to \infty \,\} \]
\[ W^{u} = \{\, y \in M \mid f^{-mk}(y) \to x \text{ as } k \to \infty \,\} \]
These are immersed Euclidean spaces of dimension corresponding to the number of eigenvalues of the differential Df^m(x) that are greater or less than one in absolute value.
Definition 14. A diffeomorphism f : M → M is Morse-Smale if and only if there is a finite collection of periodic orbits P_1, …, P_ℓ such that:
1. P_i is a hyperbolic periodic point for i = 1, …, ℓ,
2. ⋃_{i=1}^{ℓ} W^s(P_i) = M,
3. ⋃_{i=1}^{ℓ} W^u(P_i) = M,
4. W^u(P_i) and W^s(P_j) are transversal for all i, j.
1. (Structure) Both the Anosov and the Morse-Smale diffeomorphisms are examples of structurally stable diffeomorphisms. The structurally stable diffeomorphisms are those f for which there is a C^1-neighborhood U such that every g ∈ U is topologically conjugate to f. Thus
(a) The Anosov diffeomorphisms form an open set
(b) The Morse-Smale diffeomorphisms form an open set
(c) The structurally stable diffeomorphisms form an open set
It follows that there are only countably many classes of Anosov and Morse-Smale diffeomorphisms.
It would be a nice picture if the collection of structurally stable diffeomorphisms formed an open and dense subset of the diffeomorphisms, but this was refuted by Newhouse (Newhouse 1970, 1974).
2. (Classification) Each of the three classes forms an open subset of the space of diffeomorphisms. Each of the open sets decomposes into a union of open subsets corresponding to the equivalence classes. Thus for each class, there is a single countable list of representatives and radii that captures all of the members of the class. The representatives can be taken to be rational (in the appropriate sense) and the radii can be taken to be of the form 1/n. Hence the countable lists can be viewed as members of a Polish space.
The upshot is that for each of the three classes, one can assign complete numerical invariants in a Borel (even continuous) way. The equivalence relation is Borel reducible to the equality relation on a Polish space.
In dimension 2, more intuitive invariants can be assigned in a computable way. For Anosov
diffeomorphisms on the 2-torus these are hyperbolic elements of SL2(ℤ) and for Morse-Smale diffeomorphisms, they are graphs with colored edges (Oshemkov and Sharko 1998; Peixoto 1973). Similar results hold for Morse-Smale diffeomorphisms in dimension 3 (Bonatti et al. 2019).
An alert reader will notice a problem with both the structure and classification results just stated. The program is to classify equivalence classes of diffeomorphisms, and the definitions of Anosov, Morse-Smale, and structurally stable diffeomorphisms are not invariant under topological conjugacy. Ideally, given a diffeomorphism f that is topologically conjugate to an element of one of these classes, there would be a way of constructing a g equivalent to f that actually belongs to the class. This seems to be an open problem (see Open Problem 9).
More Detailed Structure Theory
So far the term “structure theory” has been used solely for the purpose of determining membership in a class. However, in several cases a structure theory gives more information, for example about the factor structure of a member of a class. The Furstenberg structure theorem for topologically distal transformations and the Furstenberg-Zimmer theory for measure-preserving transformations are examples of this. These are discussed in section “Natural Classes That Are Not Borel.”
Summary
The previous sections are intended to give a context for structure theory and classification theory. These are somewhat vague terms, but the intention is the following:
• Structure theory takes place in the context of an ambient Polish space and gives a method for determining membership in a class 𝒞. It can also give information about the factors of a transformation or explicit information about the way the transformation moves elements in the space it acts on.
• Classification theory considers a class 𝒞 and an equivalence relation E on that class. It is concerned with determining when two elements of 𝒞 are E-equivalent.
A structure theorem is concerned with a single transformation, and classification theory is concerned with pairs of transformations.
Descriptive Complexity
For a structure theory or classification theory to be useful, it is important that it be computable in some sense. As the example that used the Axiom of Choice in section “Introduction: What Is a Dynamical System? What Is Structure? What Is a Classification?” illustrated, classifications or structure theorems that are not in some sense computable are likely to be meaningless.
The structure theoretical levels of complexity are easier to describe than the complexity benchmarks for classification, as they amount to a series of yes-no questions. Recall from the discussion in the introduction the three basic questions about how effective a structure theory can be:
1. Is the structure theory computable with realistic resource assumptions?
2. Can it be carried out with inherently finite information?
3. Can it be carried out with inherently countable information?
We will ignore the first question and say little about the second. The main point of the entry is that there are natural examples of dynamical systems that fail even the third question. The third question can be rephrased as asking whether the structure theory shows that the class 𝒞 is Borel. (Of course, once a structure theory has been shown to be Borel, the question arises of determining what level in the Borel hierarchy it lies in. There is such a literature, but it is not addressed in this survey.)
Remark. In this survey, we only consider equivalence relations that are analytic. These include all orbit equivalence relations of Polish groups (see example 73). Some natural equivalence relations are not analytic (such as the equivalence relation on distal flows of having the same distal height). The theory is not nearly as well developed for these, so they are not covered in this survey.
Fill in the blank As the example given earlier showed, without some restrictions on the function f this notion is not meaningful. If B and Y∖B are non-empty then every set A ⊆ X is reducible to B: fix b0 ∈ B and b1 ∈ Y∖B and define f(x) = b0 if x ∈ A and f(x) = b1 if x ∈ X∖A. Then f reduces A to B. Thus, one must fill in the blank with some condition on the concreteness of f.
In the complexity theory world of computer science, the blank is often filled in by some statement such as “linearly,” “polynomially,” “exponentially,” or “recursively.” In the context of this entry, we focus on Borel functions f.
Starting in the 1950s, in the subject then known as Recursion Theory (Soare 1987) and now called Computability Theory, the joke was reversed:
To show a problem B is not solvable, you find an unsolvable problem A and reduce it to B by a _________ function.
If B were solvable, and A is reduced to B, A would also be solvable, yielding a contradiction. To make the definition complete, one has to fill in the blank, usually from one of the three types of functions: feasibly computable, recursively computable or Borel.
Thus the inverse image of B under f is A. Because the definition of Borel measurable implies that the inverse image of a Borel set is Borel, it follows that
if A is not Borel, then B is not Borel.
Notation: We write A ≼¹_ℬ B if A is Borel reducible to B. In this notation, the previous remarks can be written for the record as:
Remark 15. Let X, Y be Polish spaces. Let A ⊆ X and B ⊆ Y, with B Borel. If A ≼¹_ℬ B then A is Borel.
Reductions of Equivalence Relations
For the classification problem a two-dimensional version is more useful. Let X and Y be Polish spaces and E ⊆ X × X, F ⊆ Y × Y be equivalence relations. Then
E is Borel reducible to F
if and only if there is a (unary) Borel function f : X → Y such that for all x1, x2 ∈ X
x1 Ex2 if and only if f ðx1 ÞFf ðx2 Þ: analogue to the Cantor-Schroeder-Bernstein the-
orem fails.) We use the notation AℬB (or C
We use the notation E≼2ℬ F: or...) to mean the equivalence relation of
It is important to recognize how the two defi- bi-reducibility: A≼ℬB and B≼ℬA.
nitions differ. In the second, two-dimensional def- Let ~ be the relation A≼ℬB and B≼ℬA. Then
inition, f has domain X, not X X. So, in essence: ≼B gives a partial ordering of ~ classes of sets or
f assigns an F-equivalence class to each x in X in a
equivalence relations depending on the dimension
manner that two elements of X get assigned the one is working in.
same F-equivalence class if and only if they are in Caveat! There are several diagrams in this entry
the same E-equivalence class. that give the relationship of reducibility between
Viewing the classes of F as invariants, this is various classes of objects. The regions signify
paraphrasing of the statement that f computes the structures that are reducible to the type of object
invariants assigned to members of X. used to label the region. So, for example, an object
Thus for ≼2ℬ , f is a function that assigns values is in the region Polish group actions if it can be
to pairs (x1, x2) by considering each xi separately. reduced to a Polish group action. In particular, this
However viewed as a function from the Polish applies to Figs. 2, 4, 5, and 10. In the diagrams, a
space W = (X × X) to the Polish space Z = (Y × Y) it is also a one-dimensional reduction.

Remark 16. Suppose that E ⊆ X × X and F ⊆ Y × Y. Then E ≼²_ℬ F implies E ≼¹_ℬ F.

Continuous Reductions
An important special case of Borel reducibility is when the function f is continuous. This is stronger than being Borel reducible. Essentially, every general statement below applies to continuous reductions as well as Borel reductions. If A is continuously reducible to B we write A ≼¹_C B and if E is continuously reducible to F we write E ≼²_C F. If continuity is clear from context we omit the subscript C and if the number of variables is clear from context we will omit the superscript in both cases.

The Pre-ordering
For both Borel and continuous reductions and both the one- and two-dimensional cases, the relations ≼_ℬ and ≼_C are transitive. To see this we note that if f : X → Y is a reduction (in the appropriate sense) and g : Y → Z is a reduction (in the same sense), then g ∘ f is a reduction in the same sense.
As with all pre-orderings, we can have bi-reducible sets and bi-reducible equivalence relations. There are examples where A ≼_ℬ B and B ≼_ℬ A but there is no bijection f : X → Y such that f reduces A to B, and f⁻¹ reduces B to A. (The

line segment going upwards means Borel Reducible and a pointed arrow means that the downwards reduction does not hold. Not having an arrow means that bi-reducibility is open. An arrow at both ends indicates that bi-reducibility does hold.

Universality. Given a class 𝒞 of either sets (in the one-dimensional case) or equivalence relations (in the two-dimensional case), an object B ∈ 𝒞 is universal for 𝒞 if every A ∈ 𝒞 is reducible to it. The interpretation of this is that B is the most complex element of 𝒞. This is used for all of the notions: Borel reductions, continuous reductions, and computable reductions.

Terminology warning: Maximal is often used as a synonym of universal. Less frequently complete is also used. We will use maximal when talking about the ≼²_ℬ relation on equivalence relations, the two-dimensional case. We will use complete for ≼¹_ℬ restricted to sets, the one-dimensional case.

Because of the transitivity of the notions of reducibility, if B, C ∈ 𝒞 and B is universal for 𝒞 and B ≼_ℬ C, then C is universal for 𝒞. It follows that all universal sets are bi-reducible to each other.

The upshot. We can view the notions ≼_ℬ and ≼_C as measures of complexity. If A ≼_ℬ B, then B is at least as complex as A. In a given class 𝒞 there may be ≼-maximal elements. They represent the equivalence class of the most complex elements of 𝒞.
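The transitivity argument and Remark 16 can be checked mechanically on finite toy data. The following is only a sketch under strong simplifying assumptions: on finite sets every map is trivially Borel, so all that remains of a reduction is the defining equivalence "x ∈ A if and only if f(x) ∈ B". The helper names `is_reduction`, `is_reduction_2d` and `compose`, and all of the sample sets and maps, are hypothetical and chosen purely for illustration.

```python
def is_reduction(f, domain, A, B):
    """One-dimensional case: x in A  <=>  f(x) in B for every x in domain."""
    return all((x in A) == (f(x) in B) for x in domain)


def is_reduction_2d(f, domain, E, F):
    """Two-dimensional case: x E y  <=>  f(x) F f(y) for all pairs in domain."""
    return all(E(x, y) == F(f(x), f(y)) for x in domain for y in domain)


def compose(g, f):
    """Return the composite map g o f."""
    return lambda x: g(f(x))


# Toy data (all hypothetical): three finite "spaces" with target sets.
X, A = range(6), {0, 2, 4}
Y, B = range(4), {1, 3}
Z, C = range(2), {1}

f = lambda x: (x + 1) % 4   # intended to reduce A to B
g = lambda y: y % 2         # intended to reduce B to C

print(is_reduction(f, X, A, B))               # f reduces A to B
print(is_reduction(g, Y, B, C))               # g reduces B to C
print(is_reduction(compose(g, f), X, A, C))   # hence g o f reduces A to C

# Remark 16 in miniature: a two-dimensional reduction h of E to F gives the
# one-dimensional reduction (x, y) -> (h(x), h(y)) of the set E to the set F.
same_parity = lambda x, y: (x - y) % 2 == 0
equal = lambda a, b: a == b
h = lambda x: x % 2
pairs = [(x, y) for x in X for y in X]
E_set = {(x, y) for (x, y) in pairs if same_parity(x, y)}
F_set = {(a, a) for a in Z}
print(is_reduction_2d(h, X, same_parity, equal))
print(is_reduction(lambda p: (h(p[0]), h(p[1])), pairs, E_set, F_set))
```

A constant map such as `lambda x: 0` fails the check for A and B, which shows that the condition is not vacuous even in this finite setting.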
The Complexity and the Structure and Classification of Dynamical Systems 545
Basic Technique for showing a set is analytic or co-analytic but not Borel. The basic technique for showing that a set C is not Borel is reducing a known non-Borel set to C. To use this effectively, one needs a starting point. The sets IFT or WF play this role for analytic and co-analytic sets.
Let X be a Polish space and C ⊆ X be analytic. Then to show that C is complete analytic, and so not Borel, it suffices to build a reduction

f : Trees → X

such that

T is ill-founded if and only if f(T) ∈ C.

By Theorem 18, it follows that C is not Borel.
Similarly the basic technique for showing that a co-analytic set C is complete co-analytic is to build a reduction

f : Trees → X

such that

T is well-founded if and only if f(T) ∈ C.

The most basic Benchmarks. The first collection of benchmarks is determining whether a classification is computable, Borel or analytic-and-not-Borel. These are the basic levels of descriptive complexity. If an example is Borel, one can also ask what level it lives in in the Borel hierarchy.
Paraphrasing this informally, the question is whether every classification can be done using inherently finite techniques, inherently countable techniques, or whether it is impossible to do the classification using inherently countable techniques. If a classification is complete analytic or co-analytic, then there is no possible classification that is feasible with countable resources.

Complexity in Structure Theory

For our purposes, structure theories for classes C ⊆ X are concerned with two questions:

• Can you determine whether an arbitrary x ∈ X belongs to C?
• How complicated is a given method for building elements of C?

The complexity of the class. The basic question is what level of descriptive complexity the class lies in relative to the basic benchmarks, given above.
An early example: weak mixing vs strong mixing. The first example of the use of descriptive set theory to solve a natural structure problem in dynamical systems is due to Halmos and Rokhlin. A prominent open problem in the 1940s was whether the property of a transformation being weakly mixing implied the property of being strongly mixing.
In 1944, in a paper called "In general a measure-preserving transformation is mixing" (Halmos 1944), Halmos proved that the collection of weakly mixing transformations is a dense G_δ set in the weak topology. In a paper published in 1948 with a title translated as "A general measure-preserving transformation is not mixing" (Rohlin 1948), Rokhlin showed that the collection of strongly mixing transformations is meager (first category). The combination of the two papers shows that there are weakly mixing transformations that are not strongly mixing.
A proof that explicitly uses complexity shows that the collection of strongly mixing transformations is not Π⁰₂, and hence cannot be the same class as the weakly mixing transformations. (In fact the class is Π⁰₃.) In particular, there is a weakly mixing transformation that is not strongly mixing, as it is complete Σ⁰₂.
An explicit example of a weakly mixing transformation that is not strongly mixing, using Gaussian systems, was provided by Maruyama in 1949 (Theorem 11 of (Maruyama 1949)). A very natural example of such a transformation is due to Chacon (1969).

Examples at Three Levels of Complexity
We now give examples of each level beyond recursively computable.
5. The set ... in the space ℕ^ℕ × Diff^r(M) is Borel.

We give the proof that appeared in Foreman and Gorodetski (2022) for items 1–3. The first item is immediate: since M is compact there is a minimum distance between x and f(x). The set of diffeomorphisms whose minimum distance is bigger than 1/n is open, hence the set of diffeomorphisms with no fixed points is open.
To see the second item, that U_1 is Borel, let us choose a countable base for the topology on M, {V_k}_{k∈ℕ}, such that

diam V_1 ≥ diam V_2 ≥ diam V_3 ≥ . . .

and diam V_k → 0 as k → ∞. The set

𝒱_k = {f ∈ Diff^r(M) | f has no fixed points outside of V_k}

is open, and hence the set

U_1 = ⋂_{K≥1} ⋃_{k≥K} 𝒱_k ∖ U_0

is Borel.
To see that U_2 is Borel, notice that each of the sets

𝒱_{k1,k2} = {f ∈ Diff^r(M) | f has no fixed points outside of V_{k1} ∪ V_{k2}}

is open, and hence

U_2 = ⋂_{K≥1} ⋃_{k1,k2≥K} 𝒱_{k1,k2} ∖ (U_0 ∪ U_1)

is Borel. It is clear how to continue for n fixed points by induction.
Example. Artin-Mazur Diffeomorphisms. It follows easily from the proposition that the function giving the exponential rate of growth of the number of periodic points G : Diff^r(M) → ℝ ∪ {∞} defined by

G(f) = lim sup_{n→∞} (1/n) log |Per_n(f)|

is Borel.

Natural Classes that Are Not Borel
In the topological setting. A classical collection, due to Hilbert (see Zippin 2013), is defined as follows.

Definition 20. Let (X, d) be a metric space and T : X → X be a homeomorphism. Then (X, T) is (topologically) distal if for all distinct x, y ∈ X, inf_{n∈ℤ} d(Tⁿx, Tⁿy) > 0.

Clearly, isometries are examples of distal transformations. Moreover, the distal transformations are closed under taking skew products with compact metric groups. Using these two facts one can generate a wide class of distal transformations.
Fact. The collection of distal transformations is a co-analytic set. This is exposited in example 74.

Definition 21. Let (X, d, T) and (Y, d′, S) be compact separable metric spaces and π : X → Y be a factor map. Then X is an isometric extension of Y if there is a real valued function r(x1, x2) defined for all pairs (x1, x2) in the same fibre of X (i.e., whenever π(x1) = π(x2)) and such that

1. r(x1, x2) is continuous as a function on the subset of X × X defined by the condition π(x1) = π(x2).
2. For each y ∈ Y, r(x1, x2) defines a metric on the fiber π⁻¹(y) under which this fiber is isometric to a fixed homogeneous metric space.
3. r(Tx1, Tx2) = r(x1, x2) for all x1, x2 in the same fibre of X over Y.

The next definition captures the process of iterating the isometric extensions transfinitely:

Definition 22. (X, T) is a quasi-isometric extension of (Y, T) if (Y, T) is a factor of (X, T), and there is an ordinal η and a factor (X_ξ, T) of (X, T) for each ξ ≤ η such that:
1. (X_0, T) = (Y, T); (X_η, T) = (X, T).
2. if ξ < ξ′ then (X_ξ, T) is a factor of (X_ξ′, T).
3. (X_{ξ+1}, T) is an isometric extension of (X_ξ, T).
4. if ξ is a limit ordinal then (X_ξ, T) is the inverse limit of ⟨(X_ξ′, T) : ξ′ < ξ⟩.

The Furstenberg Structure Theorem (Furstenberg 1963) for topologically distal flows says:

Theorem 23. (Furstenberg) Let (X, d, T) be a minimal, topologically distal flow. Then (X, d, T) is a quasi-isometric extension of the trivial flow.

It is natural to ask what ordinals η arise in the Furstenberg Structure Theorem. Given a distal flow (X, d, T) define the distal height of the flow as the least ordinal η such that (X, d, T) is topologically equivalent to a quasi-isometric extension of the trivial flow represented as a tower of extensions of height η.
It was shown in Beleznay and Foreman (1995) that the collection of distal heights of minimal distal flows on compact metric spaces is exactly the collection of countable ordinals. The following theorem (Beleznay and Foreman 1995) summarizes these results:

Theorem 24. (Beleznay-Foreman) Let M be the Polish space of minimal flows on compact metric spaces. Then:

1. The collection of distal flows is a complete co-analytic set; in particular, it is non-Borel.
2. The natural function that associates to each distal (X, d, T) its distal height is a Π¹₁-norm.

The first item is proved by showing that there is a Borel reduction from the set of codes for well-orderings, WO, to the set of distal flows as a subset of the space of minimal ℤ-flows.
If (X0, d0, T0), (X1, d1, T1) are minimal distal flows on compact spaces, set (X0, d0, T0) ≤_D (X1, d1, T1) if the distal height of (X0, d0, T0) is less than or equal to the distal height of (X1, d1, T1). Then the theorem shows that ≤_D is a true Π¹₁-norm on the collection of distal flows. It follows that for all countable ordinals η there is a distal flow of distal height η and that the collection of minimal distal homeomorphisms of distal height less than η is a Borel set.
In the setting of measure-preserving transformations. Separate work of Furstenberg and Zimmer (Furstenberg 1981; Zimmer 1976) produced analogues of the topological Furstenberg Structure Theorem that hold for measure-preserving transformations. We describe it here without giving details and with very slight inaccuracies.
The category we are working in consists of measure-preserving transformations of a standard measure space, which we can take to be the unit interval with Lebesgue measure. By Halmos’ theorem, the ergodic transformations form a dense G_δ set. Hence, the weak topology makes the ergodic transformations a Polish space. We take our ambient Polish space EMPT to be the set of ergodic measure-preserving transformations of the unit interval.
Let (Y, 𝒞, ν, S) be an ergodic measure-preserving transformation, and G a compact group. Let H be a compact subgroup of G. Then the quotient space G/H carries a left-invariant Haar measure. A compact co-cycle extension

S_φ : Y × G/H → Y × G/H

of Y is determined by a measurable function φ : Y → G/H and given by the formula:

S_φ(y, [g]_H) = (S(y), [φ(y)g]_H).

Then S_φ preserves the product of ν and Haar measure on G/H.

Definition 25. (See, e.g., Furstenberg 1981). Let (X, ℬ, μ, T) and (Y, 𝒞, ν, S) be ergodic measure-preserving systems. Then:

1. X is a compact extension of Y if X is measure equivalent to a compact co-cycle extension of Y.
relations E that have roots outside of dynamical systems. The most sensitive measures of complexity are the two-dimensional questions about reductions between equivalence relations. The general regions used to classify equivalence relations are given in Fig. 2.
A basic one-dimensional benchmark. If X is a Polish space then X × X is also Polish. For any equivalence relation E ⊆ X × X, one can ask the one-parameter question:

• Is E Borel as a subset of X × X?

If not, then determining whether a pair (x0, x1) ∈ X × X is in the relation E cannot be accomplished with inherently countable resources.
Many important equivalence relations arise out of group actions. If G acts on X then the equivalence relation x ∼ y if and only if there is a g ∈ G such that gx = y is called the orbit equivalence relation.
A Polish group action is an action of a Polish group G on a Polish space X that is jointly continuous in each variable. Section “S∞-Actions” is concerned with actions of the group of permutations of the natural numbers, S∞. When we say G acts on X where G and X are both Polish, we will always be assuming that the action is a Polish group action.
Remark. When we introduce a class of equivalence relations (such as the S∞-actions defined below), we really mean to take ≼²_ℬ downwards closure. So the class we label “S∞-actions” means any equivalence relation reducible to an S∞-action.

with countable classes are referred to as countable equivalence relations.
The orbit equivalence relation of any countable group action has countable classes, so there is a plethora of examples.
Let ⟨f_n : n ∈ ℕ⟩ be a collection of Borel bijections of a Polish space to itself. Let Γ be the group generated by ⟨f_n : n ∈ ℕ⟩. Then each orbit of Γ is countable and the orbit equivalence relation is Borel. Conversely, a theorem of Feldman and Moore (1977) states:

Theorem 29. (Feldman-Moore) Suppose that E ⊆ X × X is a countable Borel equivalence relation on a Polish space X. Then there is a countable group Γ and a Borel action of Γ on X such that E is the orbit equivalence relation of the action.

A ≼²_ℬ maximal countable Borel equivalence relation. Let G be a countable discrete group and f : G × X → X be a Borel action on a Polish space X. Then the orbit equivalence relation is a countable Borel equivalence relation on X.
Following (Jackson et al. 2002) we let s_G be the shift action of G on the space X^G, consisting of functions f : G → X. Let E(G, X) be the orbit equivalence relation of this action. Dougherty et al. (1994) showed that if G is the free group on two generators, F₂, then E(F₂, X) is a maximal Borel countable equivalence relation. The following is immediate from the Dougherty-Jackson-Kechris theorem.

Theorem 30. Let E_∞ be any countable equivalence relation bi-reducible to E(F₂, X) for any Polish space X. Then every countable Borel equivalence relation is reducible to E_∞.
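The passage from a family of bijections to the orbit equivalence relation of the group they generate can be illustrated on a finite set. This is a toy sketch only: the Polish space is replaced by a finite set of points, each bijection is stored as a Python dict, and the function names `orbits` and `same_orbit` are hypothetical. Nothing about Borel structure is modelled; the code only shows how the generators and their inverses determine the classes.

```python
def orbits(points, generators):
    """Partition `points` into orbits of the group generated by `generators`.

    Each generator is a dict encoding a bijection of `points`. Closing under
    the generators and their inverses reaches the whole orbit of a point.
    """
    inverses = [{v: k for k, v in g.items()} for g in generators]
    moves = list(generators) + inverses
    seen, result = set(), []
    for start in points:
        if start in seen:
            continue
        orbit, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for move in moves:
                y = move[x]
                if y not in orbit:
                    orbit.add(y)
                    stack.append(y)
        seen |= orbit
        result.append(frozenset(orbit))
    return result


def same_orbit(x, y, points, generators):
    """The orbit equivalence relation: x ~ y iff some group element maps x to y."""
    return any(x in o and y in o for o in orbits(points, generators))


# Hypothetical example: a transposition and a 3-cycle on {0, ..., 5}.
points = range(6)
swap = {0: 1, 1: 0, 2: 2, 3: 3, 4: 4, 5: 5}
cycle = {0: 0, 1: 1, 2: 3, 3: 4, 4: 2, 5: 5}
classes = orbits(points, [swap, cycle])
print(sorted(sorted(c) for c in classes))
```

In the finite case every orbit is finite; in the setting of the text the analogous statement is that a countable group has countable orbits.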
benchmark of complexity. If Y is a Polish space, we let =_Y ⊆ Y × Y be the equivalence relation of equality on Y.
If Y is a Polish space then there is a Borel injection g : Y → ℝ (See Foreman 2010). Let f be a Borel reduction of any equivalence relation E ⊆ X × X to (Y, =_Y). Composing f with g we see that g ∘ f is a reduction of (X, E) to (ℝ, =_ℝ). Moreover g ∘ f is Borel if f is. Thus, we can assume that Borel reductions to any =_Y can be changed to Borel reductions to equality on the real numbers.

Definition 31. Let E be an equivalence relation on a Polish space X. Then E is smooth if E ≼²_ℬ =_ℝ.

In 1990, Harrington et al. (1990), extending earlier work of Glimm (1961) and Effros (1965), showed a very general dichotomy theorem.

Definition 32. Let E₀ be the equivalence relation on the Cantor set {0, 1}^ℕ defined by setting f E₀ g if and only if there is an N such that for all m > N, f(m) = g(m).

Proposition 33. The equivalence relation = on {0, 1}^ℕ is continuously reducible to E₀.

⊢ Fix a bijection ⟨·, ·⟩ : ℕ × ℕ → ℕ. Define R : {0, 1}^ℕ → {0, 1}^ℕ by setting (This simple, elegant proof was taken from notes from a lecture of Su Gao. See citation in the bibliography)

R(f)(⟨m, n⟩) = f(n).

Since R is well-defined, f = g implies R(f) E₀ R(g). On the other hand suppose for all m > N, R(f)(m) = R(g)(m). Let n ∈ ℕ and choose an M such that ⟨M, n⟩ > N. Then

f(n) = R(f)(⟨M, n⟩)
     = R(g)(⟨M, n⟩)
     = g(n). ⊣

The Harrington-Kechris-Louveau dichotomy is the following theorem.

Theorem 34. (Harrington et al. 1990) Let X be a Polish space and E a Borel equivalence relation on X. Then either:

1. E is smooth or
2. E₀ ≼²_ℬ E.

Corollary 35. = ≼²_ℬ E₀ but E₀ ⋠²_ℬ =. Thus E₀ is strictly more complicated than the identity equivalence relation.

This theorem suggests an orderly stair-step pattern to the benchmarks, but this is misleading, as we shall see.
We prove the easy direction of Theorem 34, because it is most useful in dynamical systems.

1. Put a measure on {0, 1} by weighting each element 1/2. The product measure μ on {0, 1}^ℕ is often called the coin-flipping measure.
2. Let G = Σ_{n∈ℕ} ℤ₂ be the direct sum of infinitely many copies of ℤ₂ (with finite support).
3. The coordinatewise action of G on ({0, 1}^ℕ, ℬ, μ) preserves μ.
4. A standard 0–1 law for this action says that if A ⊆ {0, 1}^ℕ is invariant under the G-action then A either has μ-measure 0 or μ-measure 1.

Theorem 36. Suppose that E is an equivalence relation on an uncountable Polish space X and E₀ ≼²_ℬ E. Then E is not smooth.

⊢ Toward a contradiction, assume that E is Borel reducible to the identity equivalence relation on ℝ by a map S : X → ℝ. Let R : {0, 1}^ℕ → X be a reduction of E₀ to E. By composing, we see that S ∘ R is a reduction of E₀ to the identity relation on [0, 1]. It suffices to show that there can be no such reduction from E₀ to the identity relation = on [0, 1].
Suppose that S is such a reduction. Because G preserves E₀, for each rational number r ∈ ℝ, L_r = {f : S(f) < r} is G invariant, and so has measure zero or one. Let
x = sup{r : L_r has measure one}.

Then I = {f : S(f) = x} has measure one. Hence there are f, g ∈ I such that ¬(f E₀ g), a contradiction.

=⁺ and the Friedman-Stanley Jump Operator
Let S∞ be the group of permutations of the natural numbers (Some authors reserve the notation S∞ for the group of permutations of ℕ that move only finitely many points and use a different symbol for the whole group of permutations. The convention in this paper is different and is becoming universal.). Then S∞ is a G_δ subset of ℕ^ℕ and so carries a Polish topology. This topology is compatible with the group structure and so we view S∞ as a Polish group.
The following operation on analytic equivalence relations was defined by Friedman and Stanley (Friedman and Stanley 1989).
Let E ⊆ X × X be an analytic equivalence relation on a Polish space. Define E⁺ to be the equivalence relation on X^ℕ given by setting ⟨[x_n]_E : n ∈ ℕ⟩ E⁺-equivalent to ⟨[y_n]_E : n ∈ ℕ⟩ if and only if

for all n there is an m with x_n E y_m

and

for all m there is an n with y_m E x_n.

Because the collections of Borel sets (and respectively analytic sets) are closed under countable intersections and unions, the form of the definition makes it clear that if E is Borel, then E⁺ is Borel (and similarly for E being analytic). If X and Y are perfect Polish spaces, then =⁺_X and =⁺_Y are Borel bi-reducible. Moreover, starting with the identity relation = and iterating the “+” operation any countable ordinal number α of times leads to a sequence of equivalence relations ⟨E_β : β < α⟩ such that β < β′ implies E_β ≼_ℬ E_β′ but E_β′ ⋠_ℬ E_β (Friedman and Stanley 1989). Thus this is a strict hierarchy of height ω₁.
If E is Borel then {x⃗ ∈ X^ℕ : [x_n]_E ≠ [x_m]_E for all n ≠ m} is a Borel set. Restricting the “+-operation” to this set at each stage also gives a proper hierarchy in ≼²_ℬ.
The following result is proved with a similar approach to Theorem 36. See (Gao 2022) for a simple elegant proof of the first reduction.

Theorem 37. The following relations hold:

In particular,

S∞-Actions
We now discuss S∞-actions, where S∞ is the Polish group of all permutations of ℕ that was defined in the previous section. The S∞-actions are a fundamental benchmark because their actions are ubiquitous: every non-Archimedean Polish group is isomorphic to a closed subgroup of S∞ (See Becker and Kechris 1996).
Fix an infinite perfect Polish space X and consider P ⊆ X^ℕ defined as the collection of infinite sequences ⟨x_n : n ∈ ℕ⟩ ∈ X^ℕ such that

for all n ≠ m, x_n ≠ x_m.

Then P is a perfect closed subset of X^ℕ, S∞ acts on P coordinatewise and for x⃗, y⃗ ∈ P we have

x⃗ =⁺ y⃗ if and only if there is a φ ∈ S∞ with φ x⃗ = y⃗.

Thus the equivalence relation on P induced by =⁺ is the orbit equivalence relation of an S∞-action.

Remark 38. The equivalence relation =⁺ restricted to P is a Borel equivalence relation. Thus there are faithful S∞-actions inducing Borel equivalence relations.

Remark 38 is immediate: for x⃗ and y⃗ in P, x⃗ =⁺ y⃗ if and only if for all n there is an m with x_n = y_m and for all m there is an n with y_m = x_n.
Notation. The only example of the use of the “+-operation” in this entry will be when E is the
identity equivalence relation, =, on the unit circle. We will restrict =⁺ to one-to-one sequences. In an abuse of notation, we will continue to refer to this relation as =⁺.
Classification by countable structures. We now give an example to illustrate the importance of S∞-actions for creating invariants for equivalence relations.
Let (X, E) be an analytic equivalence relation that has complete invariants consisting of countable groups. This means that a countable group is associated to each element of X by a Borel function in such a way that xEy just in case the group associated with x is isomorphic to the group associated with y. In the language we are using here, the equivalence relation E is being reduced to the equivalence relation on pairs of countable groups given by isomorphism. The next example makes this precise.
Example. Countable groups are determined by the characteristic function of their multiplication. Explicitly, for G an infinite countable group we can assume that its domain is ℕ. Define

w_G(l, m, n) = 1 if l ·_G m = n, and w_G(l, m, n) = 0 otherwise.

Then each w_G ∈ {0, 1}^{ℕ³} and {w_G : G is a countable group} is a G_δ subset of {0, 1}^{ℕ³}, and hence forms a Polish space. Call it CG.
Let S∞ act on CG by setting (φ w_G)(l, m, n) = w_G(φ(l), φ(m), φ(n)). If G, H are isomorphic, let φ : G → H be the isomorphism. Then φ takes the multiplication table of G to the multiplication table of H. Hence φ w_G = w_H. Similarly if φ w_G = w_H, then φ : G → H is an isomorphism.

This example is clearly not specific to groups: it can be adapted to any countable structures. In particular, it applies to countable graphs viewed as elements of {0, 1}^{ℕ²}. Let E_groups be the equivalence relation of isomorphism of countable groups and E_graphs be the equivalence relation of isomorphism of countable graphs.

Theorem 39. Let S∞ act on an arbitrary Polish space X and F be the resulting orbit equivalence relation. Then:

1. (Mekler 1981) F ≼²_ℬ E_groups
2. (Friedman, Stanley, see (Gao 2009)) F ≼²_ℬ E_graphs

Thus both E_groups and E_graphs are ≼²_ℬ-maximal among S∞-actions.

These examples illustrate why equivalence relations that are reducible to S∞-actions are referred to as those that can be classified by countable structures, the name for them given in (Hjorth 2000). We note that there is varying terminology for equivalence relations that are ≼²_ℬ bi-reducible to maximal S∞-actions. They are called maximal or universal S∞-actions and also Borel Complete equivalence relations.
To illustrate the large collection of equivalence relations that arise from S∞-actions and are ≼²_ℬ maximal among S∞-actions we cite results of Camerlo and Gao (2001).
Recall that approximately finite commutative C*-algebras (commutative “AF” C*-algebras) are those that are direct limits of finite dimensional commutative C*-algebras. They have a classification in terms of Bratteli diagrams (See Camerlo and Gao 2001).

Theorem 40. (Camerlo-Gao) The isomorphism relation among commutative AF C*-algebras is ≼²_ℬ-maximal among S∞-actions.

A maximal S∞-action is NOT Borel. One property that makes equivalence relations such as E_graphs and E_groups very useful is that maximal S∞ relations are not Borel. We record this here:

Fact 41. S∞-actions that are ≼²_ℬ-maximal have complete analytic orbit equivalence relations. Hence, they are not Borel.
In particular, isomorphism of countable groups, isomorphism of countable graphs, and isomorphism of AF C*-algebras are not Borel equivalence relations.
For some applications in dynamical systems the particular case of “isomorphism of countable graphs” is a particularly convenient example of an ≼²_ℬ-maximal S∞ orbit equivalence relation.
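The encoding of groups by the tables w_G, and the action of permutations on those tables, can be tried out on finite groups. What follows is a hedged sketch and not the construction of the entry itself: the domain ℕ is replaced by {0, ..., n−1}, S∞ by the finite symmetric group, and the names `table`, `act` and `isomorphic` are hypothetical. One bookkeeping point is visible in the code: with the formula (φ w)(l, m, n) = w(φ(l), φ(m), φ(n)) taken literally, the permutation carrying w_G to w_H is an isomorphism from H onto G, that is, the inverse of an isomorphism from G onto H. Since the permutations form a group, this does not change the orbit equivalence relation.

```python
from itertools import permutations, product


def table(op, n):
    """Characteristic function of the multiplication: w(l, m, k) = 1 iff op(l, m) == k."""
    return {(l, m, k): int(op(l, m) == k) for l, m, k in product(range(n), repeat=3)}


def act(phi, w, n):
    """Permutation action on tables: (phi w)(l, m, k) = w(phi(l), phi(m), phi(k))."""
    return {(l, m, k): w[(phi[l], phi[m], phi[k])]
            for l, m, k in product(range(n), repeat=3)}


def isomorphic(w1, w2, n):
    """Brute-force orbit test: is some permutation of {0..n-1} taking w1 to w2?"""
    return any(act(phi, w1, n) == w2 for phi in permutations(range(n)))


n = 4
w_Z4 = table(lambda a, b: (a + b) % 4, n)   # cyclic group of order 4
w_V4 = table(lambda a, b: a ^ b, n)         # Klein four-group (bitwise xor)

# A relabelled copy H of Z4: sigma is an isomorphism from H onto Z4.
sigma = (2, 0, 3, 1)
sigma_inv = (1, 3, 0, 2)
w_H = table(lambda a, b: sigma_inv[(sigma[a] + sigma[b]) % 4], n)

print(act(sigma, w_Z4, n) == w_H)   # sigma carries the table of Z4 to that of H
print(isomorphic(w_Z4, w_H, n))     # same orbit: isomorphic groups
print(isomorphic(w_Z4, w_V4, n))    # different orbits: non-isomorphic groups
```

The brute-force search over all permutations is feasible only because n is tiny; it is meant to mirror the definition of the orbit equivalence relation, not to suggest an algorithm for the countable case.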
3. If H ⊆ G is a closed subgroup, then the maximal Polish H-action is continuously reducible to the maximal Polish G-action.

Items 1 and 2 are due to Becker and Kechris. Item 3 is essentially due to Mackey in the Borel case and proved by Hjorth in the continuous case. Uspenskiĭ proved that the group of homeomorphisms of the Hilbert cube is a universal Polish group: every Polish group is isomorphic to a closed subgroup of this group. Thus, item 2 follows by applying items 1 and 3 to the group Uspenskiĭ used (Gao 2009; Uspenskiĭ 1986). See Fig. 2 for a diagram of the regions discussed here.

Standard Mathematical Objects in Each Region

The classes with classifications in the lower regions of the diagram are extremely plentiful so we don’t expand on systems with numerical invariants or on equivalence relations with E₀ embedded in them, though we give examples of each for dynamical systems. For S∞ we have already given an example of a Borel action (Remark 38) and ≼²_ℬ-maximal S∞-actions (Theorems 39 and 40).

Countable Equivalence Relations
Recall that an equivalence relation is said to be countable if it has countable classes (see section “Equivalence Relations with Countable Classes”). In section “Countable Equivalence Relations”, we saw that Dougherty et al. (1994) showed that there is a ≼²_ℬ-maximal countable equivalence relation they call E_∞. So, understanding countable equivalence relations is, in part, understanding their relationship to E_∞. Rather than try to cover a large literature, we focus on work of Thomas (2003, 2011) that is related to dynamical systems and gives natural example of a ≼²_ℬ-
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 2 Basic regions of complexity
Thus, it follows from the Mackey-Hjorth theory (Theorem 44) that the isomorphism action of the measure-preserving transformations on the ergodic transformations is reducible to the maximal unitary group action.
Choquet Simplexes. An example of a natural equivalence relation of maximal ≼²_ℬ-complexity among Polish group actions is due to Sabok.
Recall that a Choquet simplex is a compact, separable, affine space X such that any point x ∈ X is the integral of a unique measure concentrated at the extreme points of X. There is a Choquet simplex, the Poulsen simplex, which is universal in the sense that every Choquet simplex can be embedded as a face. This puts a Polish topology on the collection of Choquet simplexes. Let (𝒞, τ) be this Polish space.
Set two Choquet simplexes S_1, S_2 equivalent if there is an affine homeomorphism taking S_1 to S_2. Let AFF(Poulsen) be the group of affine homeomorphisms of the Poulsen simplex. Then AFF(Poulsen) is a Polish group and its natural action on 𝒞 maps faces to faces, so AFF(Poulsen) acts on 𝒞 by a Polish action.
Lindenstrauss et al. (1978) showed that any affine homeomorphism between two proper subfaces of the Poulsen simplex extends to an affine homeomorphism of the whole simplex. It follows that the action of AFF(Poulsen) on 𝒞 coincides with the relation of being affinely homeomorphic. Thus, we can view the equivalence relation of being affinely homeomorphic as the orbit equivalence relation of a Polish group action.
M. Sabok (2016) showed that affine homeomorphism of Choquet simplices is ≼²_ℬ maximal among equivalence relations induced by Polish group actions:

Theorem 45. (Sabok) Every orbit equivalence relation of a Polish group action is ≼²_ℬ-reducible to the equivalence relation of affine homeomorphism of Choquet simplices.

Borel Equivalence Relations that Don’t Arise from Polish Group Actions
Kechris and Louveau defined a Borel equivalence relation E₁ on {0, 1}^{ℕ×ℕ} that is not reducible to any Polish group action and is, moreover, ≼²_ℬ-minimal with these properties.
View {0, 1}^{ℕ×ℕ} as the collection of infinite sequences ⟨x_n : n ∈ ℕ⟩ where x_n ∈ {0, 1}^ℕ. Set ⟨x_n⟩_n E₁ ⟨y_m⟩_m if and only if there is an N such that for all n > N, x_n = y_n. Note that this is an analogue of E₀ where elements of {0, 1} have been replaced by elements of {0, 1}^ℕ and the equivalence relation is eventual equality.

Theorem 46. (Kechris-Louveau, (Kechris and Louveau 1997)) E₁ is Borel and not reducible to a Polish group action.

We now give another example arising in topology. Let X be the collection of compact separable metric spaces with ℕ as a dense subset. Then X carries a natural Polish topology.
We define the notion of bi-Lipschitz equivalence. For compact separable metric spaces K_1, K_2 set K_1 R K_2 if and only if there is a bijection f : K_1 → K_2 such that both f and f⁻¹ are Lipschitz. The following theorem appears in Rosendal (2005).

Theorem 47. (Rosendal) The equivalence relation R ⊆ X × X is Borel but not reducible to a Polish group action.

The Rosendal theorem is proved by reducing E₁ to bi-Lipschitz equivalence and applying Theorem 46 (Kechris and Louveau 1997). It is conjectured that E₁ is ≼²_ℬ-below every Borel equivalence relation that is not reducible to a Polish group action. This is Open Problem 16.

The Maximal Analytic Equivalence Relation
While it was a folklore result that there is a maximal equivalence relation among all analytic equivalence relations, the construction was quite abstract. In Ferenczi et al. (2009), Ferenczi, Louveau, and Rosendal gave a natural example. It is an equivalence relation on the Polish space of separable Banach spaces, which can be viewed as the collection of closed subspaces of C([0, 1]) endowed with the Effros Borel structure.

Theorem 48. (Ferenczi, Louveau, Rosendal) The relation of isomorphism between separable
Banach spaces is maximal among all analytic equivalence relations.

Since there are equivalence relations that are not reducible to Polish group actions, and the Ferenczi, Louveau, Rosendal example is maximal, it follows that the Ferenczi, Louveau, Rosendal example does not arise as a Polish group action.
We can now put these examples into the basic benchmark regions (Fig. 4):

Placing Dynamical Systems in Each Region

Measure Isomorphism
The relationship between measure isomorphism and descriptive complexity is better understood than the relationship between topological conjugacy and descriptive set theory.
Work of Mitin (1998) and Ryzhikov (1985) studies measure-preserving transformations and shows that they encode the theory of Peano Arithmetic. As a result, many questions about them are recursively undecidable. In this section, we will focus on Borel reducibility.

Systems with Numerical Invariants
The clearest example here is the class of Bernoulli Shifts. As noted in section “Rolling Dice: Bernoulli Shifts”, the collection of measure-preserving transformations isomorphic to a Bernoulli shift has a structure theory, and Ornstein’s Theorem (Theorem 9) shows that entropy is a complete numerical invariant.
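To make "complete numerical invariant" concrete, the following sketch computes the entropy of a Bernoulli shift from its probability vector, using the standard formula h(p) = −Σ p_i log p_i, and compares two shifts by that single number. The function names `bernoulli_entropy` and `same_entropy` are hypothetical, and the comparison uses floating-point arithmetic with a tolerance, so it is a numerical heuristic and not a proof of isomorphism.

```python
import math


def bernoulli_entropy(p):
    """Entropy -sum(p_i * log p_i) of the Bernoulli shift with weights p."""
    if abs(sum(p) - 1.0) > 1e-12 or any(q < 0 for q in p):
        raise ValueError("p must be a probability vector")
    return -sum(q * math.log(q) for q in p if q > 0)


def same_entropy(p, q, tol=1e-12):
    """Compare two Bernoulli shifts by their entropy, up to a float tolerance."""
    return math.isclose(bernoulli_entropy(p), bernoulli_entropy(q), abs_tol=tol)


# A classical pair with different alphabets but equal entropy log 4.
four_symbols = (1/4, 1/4, 1/4, 1/4)
five_symbols = (1/2, 1/8, 1/8, 1/8, 1/8)
print(bernoulli_entropy(four_symbols), bernoulli_entropy(five_symbols))
print(same_entropy(four_symbols, five_symbols))

# The 2-shift and the 3-shift have entropies log 2 and log 3.
print(same_entropy((1/2, 1/2), (1/3, 1/3, 1/3)))
```

By Ornstein's Theorem, equal entropy is exactly the condition for two Bernoulli shifts to be isomorphic, so the first pair represents isomorphic systems and the second pair does not.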
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 4 Classes placed in regions of
complexity
Foreman, Rudolph, and Weiss showed the following results (Foreman et al. 2011).

Theorem 54. (Foreman, Rudolph, Weiss) Let G be the group of measure-preserving transformations of [0, 1] with the standard Polish topology and EMPT be the ergodic transformations in G. The conjugacy action on G is complete analytic. In particular, it is not Borel.

Using the method of suspensions one can lift these results to ℝ-actions (Foreman et al. 2011).

Corollary 55. The isomorphism relation between ergodic measure-preserving ℝ-flows is complete analytic, and hence not Borel.

≼²_ℬ below isomorphism of ergodic measure-preserving transformations. By Theorem 54, the relation is strict.

Shortly after this entry was written, B. Weiss was able to show the following result, as yet unpublished:
Theorem. (Weiss) Let G be a countable group. Let X be the space of homomorphisms of G to MPT([0, 1]) such that the action is free and ergodic. If G is of the form H × ℤ, then there is a Borel reduction from the isomorphism relation on X to the isomorphism relation on ℤ-actions.
Corollary. If G = ℤ^d for d ≥ 1 then there is a Borel reduction from free measure-preserving G-actions to measure-preserving ℤ-actions.
Remark. The techniques of Theorem 56 adapt exactly to prove the analogous result for smooth measure-preserving transformations. However, because the isomorphism relation on smooth measure-preserving transformations is not a group action, the technique of turbulence is not directly applicable. Thus it is possible that the measure isomorphism relation on $C^\infty$ transformations on the 2-torus is $\preceq^2_{\mathcal{B}}$ bi-reducible with the maximal $S_\infty$-action. This is the content of Open Problem 11.

The proof of Theorem 59 involves three steps:

1. Identify a class of symbolic shifts (the circular systems) that are realizable using the Anosov-Katok ABC method.
2. Show that the class of measure-preserving transformations containing an odometer has the same factor and isomorphism structure as the class of circular systems. They are "isomorphic" categories by an isomorphism $F$.
3. Build a reduction from the space of ill-founded trees to the odometer systems; use the isomorphism $F$ to transfer the range to the circular systems and then the ABC method to realize the results as diffeomorphisms.

The proofs appear in the three papers: Foreman and Weiss (2019a); Foreman and Weiss (2019b); Foreman and Weiss (2022).

We note that these results give no upper bound on the complexity of the isomorphism relation for ergodic measure-preserving transformations, other than the obvious one, that of the maximal Polish group action. Hence, we ask the question:

Is the maximal Polish group action reducible to isomorphism of ergodic measure-preserving transformations?

This is Open Problem 12.

Kakutani Equivalence
An important relation in ergodic theory is Kakutani equivalence (see Thouvenot (2002) for a nice exposition).

Let $T$ be a measure-preserving transformation on a standard measure space $(X, \mathcal{B}, \mu)$. Let $A \in \mathcal{B}$ be a set of positive measure and define $\mu_A : P(A) \cap \mathcal{B} \to [0, 1]$ by setting

$$\mu_A(B) = \frac{\mu(B)}{\mu(A)}.$$

By Poincaré recurrence, for almost all $x \in A$, there is an $n > 0$ such that $T^n(x) \in A$. Let $n(x)$ be the least such $n$ and $T_A(x) = T^{n(x)}(x)$. Then $T_A : A \to A$ is a measure-preserving transformation. If $T$ is ergodic then $T_A$ is ergodic.

Definition 60. Two ergodic transformations $(X, \mathcal{B}, \mu, T)$ and $(Y, \mathcal{C}, \nu, S)$ are said to be Kakutani equivalent if there are positive measure sets $A \in \mathcal{B}$ and $B \in \mathcal{C}$ such that $T_A$ is isomorphic to $S_B$.

Gerber and Kunde (2021) proved the following theorem, following the general steps of the proof of Theorem 59, but with significant improvements.

Theorem 61. The equivalence relation of Kakutani equivalence on ergodic measure-preserving diffeomorphisms of the 2-torus is complete analytic. Hence, it is not a Borel equivalence relation.

Because Kakutani equivalence for measure-preserving diffeomorphisms is $\preceq^2_{\mathcal{B}}$ reducible to Kakutani equivalence for general measure-preserving transformations, the following is immediate (chronologically, the corollary preceded the theorem):

Corollary 62. The equivalence relation of Kakutani equivalence on the space of ergodic measure-preserving transformations on $[0, 1]$ is complete analytic and hence not Borel.

Gerber and Kunde also proved an $\mathbb{R}$-flow version of Corollary 62.

Definition 63. Let $F = \{f_t : t \in \mathbb{R}\}$ be a measure-preserving $\mathbb{R}$-flow on $(X, \mu)$. A reparametrization of $F$ is a jointly measurable function $\tau : X \times$
$\mathbb{R} \to \mathbb{R}$ such that for almost every $x$ the map $t \mapsto \tau(x, t)$ is a homeomorphism of $\mathbb{R}$ and the map $(x, t) \mapsto f_{\tau(x,t)}(x)$ is a measure-preserving flow on $X$.

If $G = \{g_t : t \in \mathbb{R}\}$ is another measure-preserving flow on $X$, then $F$ and $G$ are isomorphic up to a homeomorphic time change if there is a reparametrization $\tau$ of $F$ and a measure-preserving transformation $\phi : X \to X$ such that for all $t$ and almost every $x$:

$$g_t(\phi(x)) = \phi\left(f_{\tau(x,t)}(x)\right).$$

The Gerber-Kunde techniques extend to show:

Corollary 64. The equivalence relation of isomorphism up to a homeomorphic time change for $C^\infty$ measure-preserving flows is complete analytic and hence not Borel.

The proofs of the Gerber-Kunde theorems do not give information about $\preceq^2_{\mathcal{B}}$ reducibility to any of the benchmark analytic equivalence relations. (This is Open Problem 13.)

The Summary Diagram
Figure 5 summarizes this section. The K automorphisms are not on the diagram because, other than being above $E_0$, their position is not known. For the same reason, the Kakutani equivalence relation is not on the diagram.

Topological Conjugacy
In this section, we discuss the qualitative behavior as proposed by Smale. We do this in both the zero-dimensional and the smooth contexts. "Equivalence" in this context means conjugate by a homeomorphism.

After this entry was finished, in joint work with Gorodetski, the author improved the results for smooth transformations on manifolds with the equivalence relation of topological conjugacy to all dimensions at least one. These will appear in a future paper.

Smooth Transformations with Numerical Invariants
As discussed in section "Smooth Dynamics", the collection of smooth transformations that is structurally stable has complete numerical invariants. These include the classes of Anosov and Morse-Smale diffeomorphisms. It is not clear how far, if at all, these classes can be extended and still have complete numerical invariants.

Diffeomorphisms of the Torus Above $E_0$.
Foreman and Gorodetski showed that there are diffeomorphisms of any manifold of dimension at least 2 that are $\preceq^2_{\mathcal{B}}$ above $E_0$. It follows that topological conjugacy for general diffeomorphisms is not numerically classifiable.

Theorem 65. (Foreman, Gorodetski) Let $M$ be a $C^k$ manifold of dimension at least 2 and $1 \leq k \leq \infty$. Then, there is a continuous reduction $R$ from $E_0$ to the collection of $C^k$ diffeomorphisms with the relation of topological conjugacy.

Remark. In fact, the construction creates a reduction that takes values on the boundary of the Morse-Smale diffeomorphisms of the torus.

We give here a nearly honest proof, which is relatively easy to illustrate with a picture.

Let $M$ be a manifold of dimension at least 2. We take it to have dimension 2 for convenience (see Fig. 6). We work inside a neighborhood diffeomorphic to $\mathbb{R}^2$. Choose two points (the left star and the right star) and four sequences of disks. The disks' diameters go to zero at the same relatively fast rate, and two of the sequences converge to the left point and two of them converge to the right point. There are two lines that separate the region into four parts. Each part contains exactly one of the sequences of disks. One of the lines passes through the left and right points, giving a notion of upper and lower sequences.

1. The support of the diffeomorphism is the closure of the interiors of the disks.
2. The diffeomorphism rotates each interior circle around the center of each disk and each disk contains a subdisk that the diffeomorphism
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 5 Measure-preserving systems
rotates by a constant amount. (The subdisks are indicated by the shading.)
3. Each of the sequences of disks above the horizontal line is identified with a fixed sequence of rotation numbers tending to zero and these are different for the two sequences.
4. There is a fixed sequence of different rotation numbers with $|r_n| < 1/2$ and $\langle r_n : n \in \mathbb{N} \rangle$ tending to zero, with infinitely many positive and infinitely many negative.
5. The lower subdisks have rotation numbers $\langle \pm r_n : n \in \mathbb{N} \rangle$.
6. The lower subdisks then code an infinite sequence of 0's and 1's depending on whether they agree on the sign of the rotation in the $n$th subdisk.

The reduction $R$ maps $\{0, 1\}^{\mathbb{N}}$ to diffeomorphisms built with the two points and the sequences of disks. Given $f : \mathbb{N} \to \{0, 1\}$, $R(f)$ is the diffeomorphism that

• Rotates the $n$th subdisk of the lower right sequence in the positive direction and the $n$th subdisk of the lower left sequence in the negative direction if $f(n) = 1$.
• Rotates the $n$th subdisk of the lower left sequence in the positive direction and the $n$th subdisk of the lower right sequence in the negative direction if $f(n) = 0$.

By construction, on the range of $R$, each conjugating homeomorphism of $\mathbb{R}^2$ must take the sequences of upper disks to themselves, and only swap finitely many of the lower disks. Thus, $R$ is a reduction of $E_0$ to topological conjugacy. ∎

This example is, in many ways, unsatisfying. It is the identity on large portions of the space. This raises myriad questions. A simple one to frame is the following (see Open Problem 14):

Does $E_0$ reduce to the collection of topologically minimal diffeomorphisms of the 2-torus with the relation of topological conjugacy?

A Dynamical Class that Is $\preceq^2_{\mathcal{B}}$ Maximal for Countable Equivalence Relations
A well-studied class of dynamical systems is the collection of finite subshifts. These are closed, shift-invariant subsets of $\Sigma^{\mathbb{Z}}$ for a finite alphabet $\Sigma$. Let $C$ be the class of finite subshifts. If $\Sigma$ is the alphabet then a finite code or block code is a function $c : \Sigma^l \to \Sigma$ for some $l \in \mathbb{N}$. Then $c$ determines a mapping $c : \Sigma^{\mathbb{Z}} \to \Sigma^{\mathbb{Z}}$, where $c(f)$ is achieved by "starting at zero and sliding $c$ along $f$ in both directions." At each location, $f$ restricted to the $l$-sized window is input into $c$, which outputs a value in $\Sigma$.

If two compact subshifts are conjugate by a homeomorphism $h$, then that homeomorphism is given by a pair of finite codes, one for $h$ and one for $h^{-1}$ (see Lind and Marcus 1995). Since there are only countably many finite codes, it follows that the equivalence relation has only countable classes.

Clemens (see Clemens 2009) showed that the general problem of homeomorphism is $\preceq^2_{\mathcal{B}}$
maximal among equivalence relations with countable classes.

Theorem 66. (Clemens) The relation of topological conjugacy between subshifts of $\Sigma^{\mathbb{Z}}$ (with $\Sigma$ finite) is $\preceq^2_{\mathcal{B}}$ maximal among analytic equivalence relations with countable classes.

An Example of a Maximal $S_\infty$-Action from Dynamical Systems
Since every perfect Polish space contains a homeomorphic copy of the Cantor set, it is natural to ask what the complexity is of the collection of homeomorphisms of the Cantor set with the relation $E$ of topological conjugacy. This was answered by Camerlo and Gao (2001).

Theorem 67. (Camerlo-Gao) Topological conjugacy of homeomorphisms of the Cantor set is bi-reducible with a maximal $S_\infty$-action.

As in the case of the smooth examples that exhibit high complexity (Theorems 65 and 68), the transformations in the range of the reduction are far from being minimal. Thus, we have the corresponding question: is topological conjugacy of minimal homeomorphisms of the Cantor set bi-reducible with the maximal $S_\infty$-action? What about topologically transitive homeomorphisms? (See Open Problem 15.)

Topological Conjugacy of Diffeomorphisms Is Not Borel
It might be expected that for objects as concrete as a $C^k$ diffeomorphism of a compact manifold, topological conjugacy would be Borel. However, in dimensions 5 and above, it is known not to be the case.

Because Graph Isomorphism is a $\preceq^2_{\mathcal{B}}$ maximal $S_\infty$-action, it is complete analytic. Hence, the following corollary is immediate.

Corollary 69. Let $M$ be a smooth manifold of dimension at least five. Then the equivalence relation of topological conjugacy on the space of diffeomorphisms of $M$ is complete analytic. In particular it is not Borel.

The reduction used in the proof of Theorem 68 can be explained with pictures (although a rigorous proof requires some work).

Fix a graph $G$ with vertices $\mathbb{N}$. Consider the solid unit ball in $\mathbb{R}^3$ and choose a set $I = \langle p_n : n \in \mathbb{N} \rangle$ of countably many algebraically independent points tending to the South Pole. Draw line segments between each pair of points and between each point and the South Pole. The result is a complete graph on countably many points: the independent points together with the South Pole.

Let $G = (\mathbb{N}, E)$ be an arbitrary countable graph. Start the reduction by identifying $\mathbb{N}$ with the countably many points, but not the South Pole (Fig. 7).

The coding device is to put disjoint five-dimensional "cigars" around each of the lines between the points in $I$. Given a graph $G$ with vertices $\mathbb{N}$, the support of the diffeomorphism $\phi_G$ that will be the image of $G$ under the reduction will be the closure of the cigars. The diffeomorphism $\phi_G$ either flows toward the center of the cigar if the points connected by the cigar are connected in $G$ or flows away from the center if they are not connected (Fig. 8).

The Complexity and the Structure and Classification of Dynamical Systems, Fig. 7 The unit ball with edges through it connecting independent points
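The combinatorial side of the cigar coding can be illustrated with a small finite toy model. This is only a sketch under simplifying assumptions, not the construction itself: finitely many vertices stand in for $\mathbb{N}$, a sign on each pair of vertices stands in for the direction of the flow in the corresponding cigar, and the names `edge_code`, `carries`, and `isomorphisms` are hypothetical. It shows that a relabelling of the vertices matches the two codings exactly when it is a graph isomorphism; it says nothing about building the conjugating homeomorphism.

```python
from itertools import combinations, permutations


def edge_code(n, edges):
    """Assign +1 to each pair of vertices joined in the graph and -1 otherwise.

    Hypothetical finite stand-in for the coding: +1 plays the role of "flow
    toward the center of the cigar", -1 of "flow away from the center".
    """
    edge_set = {frozenset(e) for e in edges}
    return {
        frozenset(pair): (1 if frozenset(pair) in edge_set else -1)
        for pair in combinations(range(n), 2)
    }


def carries(perm, code_g, code_h):
    """True if relabelling vertex v as perm[v] sends the code of G to the code of H."""
    return all(
        code_h[frozenset(perm[v] for v in pair)] == sign
        for pair, sign in code_g.items()
    )


def isomorphisms(n, code_g, code_h):
    """Brute-force list of the vertex permutations that match the two codes."""
    return [p for p in permutations(range(n)) if carries(p, code_g, code_h)]


path_a = edge_code(3, [(0, 1), (1, 2)])            # path whose middle vertex is 1
path_b = edge_code(3, [(0, 1), (0, 2)])            # path whose middle vertex is 0
triangle = edge_code(3, [(0, 1), (1, 2), (0, 2)])  # complete graph on 3 vertices

print(isomorphisms(3, path_a, path_b))    # permutations sending vertex 1 to vertex 0
print(isomorphisms(3, path_a, triangle))  # empty: the graphs are not isomorphic
```

The brute-force search over permutations is used only because the example is tiny; it plays the role of the permutation of $I$ induced by a conjugating homeomorphism.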
To show this is a reduction one must show that if we are given two graphs $G = (\mathbb{N}, E)$ and $H = (\mathbb{N}, F)$, then the reduction produces diffeomorphisms $\phi_G$ and $\phi_H$ such that

• $G$ is isomorphic to $H$
if and only if
• $\phi_G$ is topologically equivalent to $\phi_H$.

If $\phi_G$ is topologically equivalent to $\phi_H$ then the witnessing homeomorphism fixes the south pole and induces a permutation of the points in $I$. This, in turn, gives a permutation of the natural numbers that is an isomorphism between the graphs $G$ and $H$.

The difficult direction is the opposite. Suppose that we are given a bijection $f : \mathbb{N} \to \mathbb{N}$ that is an isomorphism between $G$ and $H$. We need to find a homeomorphism that mimics $f$'s behavior on the countable set $I$ and fixes the south pole of the ball in $\mathbb{R}^3$. For a single transposition, this is easy in four dimensions (see Fig. 9). However, even two transpositions can interfere with each other. Having five dimensions gives sufficient room to prevent this.

The Summary Diagram
Figure 10 summarizes the situation for topological conjugacy of diffeomorphisms, as of July 1, 2022. The section above describes what was known on February 1, 2022 and hence does not exactly correspond with the diagram.

A Descriptive Set Theory Facts
We give a very quick review of descriptive set theoretic facts used in this article. For a survey that includes complete proofs of the facts stated here,
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 10 Conjugacy by homeomorphisms
see the article Naive Descriptive Set Theory (Foreman 2010). That article does not assume any familiarity with logic.

We assume the reader is familiar with the ordinals (see Halmos' Naive Set Theory), which are canonical representatives for any well-ordering. Ordinality is concerned with order, as opposed to cardinality, which is concerned with size as determined by bijections. They start with the natural numbers 0, 1, 2, . . . which form the ordinal $\omega$, and continue by putting a point on top to get $\omega + 1$ (Fig. 11).

After this comes a second copy of $\omega$ to get $\omega + \omega = \omega \cdot 2$. Every ordinal $\alpha$ has an immediate successor, $\alpha + 1$, and every collection of ordinals has a supremum. Every ordinal is either a successor ordinal or a limit ordinal. Informally, an ordinal is defined to be the set of smaller ordinals. So, 0 is the empty set, $1 = \{0\}$ and so forth. (See (Halmos 1974; Jech 2003) or (Levy 2002) for explanation.) We will use this convention here: $\alpha = \{\beta : \beta \text{ is an ordinal and } \beta < \alpha\}$.

In ZFC, the ordinals give canonical representatives of the isomorphism types of every well-ordering. The Axiom of Choice is equivalent to the statement that every set can be well-ordered. It follows that there are ordinals of every cardinality. The smallest uncountable ordinal is called $\omega_1$. Because the collection of real numbers is uncountable, it has cardinality at least the cardinality of $\omega_1$, and Cantor's Continuum Hypothesis says that the two cardinalities are the same.

Descriptive set theory is largely concerned with Polish spaces $X$: topological spaces whose topology is compatible with a separable complete metric. A Polish space is perfect if it contains no isolated points. A measure on a Polish space is standard if open sets are measurable and it is non-atomic, complete, and separable.

Very roughly, the main topic of descriptive set theory is understanding what can be done without the use of the uncountable Axiom of Choice. For example, the Polish spaces contain the hierarchy of Borel sets, the smallest $\sigma$-algebra of subsets of $X$ containing the open sets. This algebra is built by induction on the ordinals in a hierarchy of length $\omega_1$. Heuristically, it contains all sets that can be built with inherently countable information.

For classifying dynamical systems, Borel sets are not sufficient. One needs analytic and co-analytic sets. These are extensions of the Borel sets that are nonetheless universally measurable.

We now give an inductive construction of the Borel sets that begins with the open sets. To keep track of the level of complexity of the set, we introduce the following "logical" notation:

Definition 70. Let $(X, \tau)$ be a Polish topological space. The levels of the Borel hierarchy are as follows:

1. $\Sigma^0_1$ sets are the open subsets of $X$.
2. $\Pi^0_1$ sets are the closed sets.
3. Suppose the hierarchy has been defined up to (but not including) the ordinal level $\alpha < \omega_1$. Then $A \in \Sigma^0_\alpha$ if and only if there is a sequence $\langle B_i \mid i \in \omega \rangle$ of subsets of $X$ with $B_i \in \Pi^0_{\beta_i}$ such that for each $i \in \omega$, $\beta_i < \alpha$ and

$$A = \bigcup_{i \in \omega} B_i.$$

4. $B \in \Pi^0_\alpha$ if and only if there is a subset $A$ of $X$ with $A \in \Sigma^0_\alpha$ such that $B = X \setminus A$.

We write $\Delta^0_\alpha = \Sigma^0_\alpha \cap \Pi^0_\alpha$.

The second and fourth items imply that the collection defined this way is closed under complements and the third item says, in particular, that $\Sigma^0_\alpha$ sets are constructed by taking arbitrary
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 11 A few ordinals
countable unions of sets of lower complexity. Thus, $\Sigma^0_\alpha$ sets are closed under countable unions and $\Pi^0_\alpha$ sets are closed under countable intersections. But none of the $\Sigma^0_\alpha$'s are closed under countable intersections, nor are the $\Pi^0_\alpha$'s closed under countable unions. Thus these definitions organize the Borel sets in a hierarchy of length $\omega_1$. Because $\omega_1$ has uncountable cofinality, $\bigcup_{\alpha<\omega_1} \Sigma^0_\alpha = \bigcup_{\alpha<\omega_1} \Pi^0_\alpha$ forms a $\sigma$-algebra.

Here are some facts:

Fact. Let $X$ be an uncountable Polish space. Then for all countable $\alpha$:

• $\Sigma^0_{\alpha+1} \neq \Pi^0_{\alpha+1}$.
• Both $\Sigma^0_\alpha$ and $\Pi^0_\alpha$ are proper subsets of $\Delta^0_{\alpha+1}$.
• In turn, each $\Delta^0_\alpha$ is a proper subset of each $\Sigma^0_{\alpha+1}$ and each $\Pi^0_{\alpha+1}$.

These facts are summarized in Fig. 12.

Analytic and co-Analytic sets. We now turn to the next type higher. We define two collections of universally measurable sets using projections.

Let $X$ and $Y$ be Polish spaces and $B \subseteq X \times Y$ be a Borel set. The set

$$A = \{x : (\text{for some } y)\ (x, y) \in B\} \qquad (4)$$

is called an analytic set. Its complement

$$C = \{x : (\text{for all } y)\ (x, y) \notin B\} \qquad (5)$$

is called a co-analytic set. Note that we could get an equivalent definition of co-analytic by taking a Borel set $B$ and setting

$$C = \{x : (\text{for all } y)\ (x, y) \in B\}. \qquad (6)$$

To make sure the definition is clear: the analytic sets are all sets of the form of $A$ in Eq. 4 with $B$ a Borel set. The co-analytic sets are all sets of the form in Eq. 5, or equivalently all sets of the form in Eq. 6 with $B$ a Borel set. For purposes of showing that a set is analytic or co-analytic, it is useful to use logical notation: "for some $y$" can be written $\exists y$ and "for all $y$" can be written $\forall y$. Informally, analytic sets are "$(\exists y)$Borel" and co-analytic sets are "$(\forall y)$Borel." Because products of Polish spaces are Polish, a set that can be written "$(\exists y)(\exists z)$Borel" is analytic and "$(\forall y)(\forall z)$Borel" is co-analytic.

Notation. The "$\Sigma^1_1$ sets" are the analytic sets and the "$\Pi^1_1$ sets" are the co-analytic sets.

Fact. Every perfect Polish space contains a non-Borel analytic set. Moreover, the analytic sets are closed under countable intersections and unions. Hence, the co-analytic sets are also closed under countable unions and intersections.

In many ways, the analytic and co-analytic sets function similarly to the open and closed sets.

Here is a remarkable theorem about the relationship between analytic and co-analytic sets due to Lusin with a very important corollary due to Suslin. (See (Lusin 1933), or section 3.5 of
The Complexity and the Structure and Classification of Dynamical Systems, Fig. 12 The construction of the
Borel Sets
(Foreman 2010). Reference (Graham and Kantor 2009) gives a popular history account.)

Theorem 71. (Lusin's Separation Theorem) Suppose that $X$ is a Polish space and $A, B$ are disjoint analytic subsets of $X$. Then there is a Borel set $C$ with $A \subseteq C$ and $C \cap B = \emptyset$. (So $C$ separates $A$ from $B$.)

Corollary 72. (Suslin's Theorem) Let $A \subseteq X$. If both $A$ and $X \setminus A$ are analytic, then $A$ is Borel.

Example 73. For Borel group actions, the orbit equivalence relation is always analytic. If $G$ is acting on $X$ then

$$\{(T, x, y) : x \neq y \text{ and } (\text{for some } q)(\text{for all } n)\ d(T^n(x), T^n(y)) > 1/q\}$$

is Borel,

$$\{T : (\text{for all } x \neq y)(\text{for some } q)(\text{for all } n)\ d(T^n(x), T^n(y)) > 1/q\}$$

is in the form of Eq. 6. It follows that the collection of distal homeomorphisms of $X$ is co-analytic.

Reductions and Hierarchies
Fact. Let $X, Y$ be Polish spaces and $A \subseteq X$, $B \subseteq Y$. Let $f : X \to Y$ reduce $A$ to $B$.
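As a reminder of what "reduce" means here, and assuming the usual definition with $X$, $Y$, $A$, $B$, and $f$ as in the Fact above, the condition can be written out as follows. The closing remark about Borel $f$ is the standard closure property of these classes under Borel preimages, stated here as background rather than as the content of the Fact.

```latex
\[
  f \text{ reduces } A \text{ to } B
  \quad\Longleftrightarrow\quad
  \bigl(\forall x \in X\bigr)\ \bigl(x \in A \iff f(x) \in B\bigr)
  \quad\Longleftrightarrow\quad
  A = f^{-1}[B].
\]
% If f is a Borel function and B is Borel (respectively analytic, co-analytic),
% then A = f^{-1}[B] is Borel (respectively analytic, co-analytic).
```

Read contrapositively, this is how reductions transfer lower bounds: if $A$ is not Borel and $f$ is Borel, then $B$ cannot be Borel either.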
0-entropy, and K automorphisms: Is there a $\preceq^2_{\mathcal{B}}$ relationship between any of these classes, or between one of these classes and the class of arbitrary ergodic probability measure-preserving transformations?

Open Problem 6. Is isomorphism for ergodic measure-preserving transformations on $\sigma$-finite measure spaces Borel?
What is the $\preceq^2_{\mathcal{B}}$ relationship between isomorphism of ergodic probability measure-preserving transformations and isomorphism of ergodic $\sigma$-finite measure-preserving transformations?

Open Problem 7. Do the structure theorems for distality give Borel criteria for isomorphism? More precisely:

A. Let $X$ be the collection of minimal homeomorphisms on compact metric spaces and $\mathcal{D}$ be the collection of topologically distal transformations. Is there a Borel set $B \subseteq X \times X$ such that $B \cap (\mathcal{D} \times \mathcal{D})$ is the relation of topological conjugacy?
B. Let $\mathcal{MD}$ be the collection of ergodic measure distal transformations. Is there a Borel set $B \subseteq \text{EMPT} \times \text{EMPT}$ such that $B \cap (\mathcal{MD} \times \mathcal{MD})$ is the relation of measure conjugacy?

Open Problem 8. Suppose that $T : X \to X$ is an ergodic finite entropy transformation. Is there a compact manifold $M$ with a smooth volume element $\nu$ and a measure-preserving diffeomorphism $S$ such that $(X, \mathcal{B}, \mu, T)$ is measure isomorphic to $(M, \mathcal{C}, \nu, S)$?

Open Problem 9. Let $\mathcal{C}$ be the collection of diffeomorphisms of a compact manifold $M$ that are topologically conjugate to a structurally stable diffeomorphism. Let $E$ be the relation of topological conjugacy restricted to $\mathcal{C}$. Is $E$ Borel reducible to $=$? In other words, does $E$ have Borel computable, complete numerical invariants?
This can also be asked for any standard class of structurally stable transformations, such as the Morse-Smale transformations. Is the conjugacy relation Borel?
Similarly, in classes where there are existing invariants (such as the schemes that are used in dimension 3 in (Bonatti et al. 2019)) one can ask whether those invariants are themselves Borel.

Open Problem 10. Is the relation of topological conjugacy Borel when restricted to Axiom A diffeomorphisms of a compact surface? Can $E_0$ be embedded into topological conjugacy for Axiom A transformations?

Open Problem 11. Let $M$ be a smooth manifold with a smooth volume element $\nu$. Let SE be the collection of smooth, ergodic, $\nu$-measure-preserving transformations of $M$. Is the measure isomorphism relation on SE bi-reducible with the maximal $S_\infty$-action?

Open Problem 12. (Sabok's Conjecture) Is the measure isomorphism relation on ergodic measure-preserving transformations of the unit interval $\preceq^2_{\mathcal{B}}$ bi-reducible with the maximal Polish group action?

Open Problem 13. Where does the Kakutani equivalence relation on ergodic measure-preserving transformations sit among the analytic equivalence relations in $\preceq^2_{\mathcal{B}}$? Is it reducible to an $S_\infty$-action? It is known to be a complete analytic set, hence not Borel (see Theorem 61).
In particular, what is the $\preceq^2_{\mathcal{B}}$ relationship between the equivalence relation of Kakutani equivalence and isomorphism between ergodic probability measure-preserving transformations? Between Kakutani equivalence and isomorphism of ergodic measure-preserving transformations on $\sigma$-finite measure spaces?

Open Problem 14. Does $E_0$ reduce to the collection of topologically minimal diffeomorphisms of the 2-torus with the relation of topological conjugacy? What about topologically transitive diffeomorphisms?

Open Problem 15. Is topological conjugacy of minimal homeomorphisms of the Cantor set bi-reducible with the maximal $S_\infty$-action? What about topologically transitive homeomorphisms?

Open Problem 16. Suppose that $F$ is a Borel equivalence relation on a Polish space that is not Borel reducible to the orbit equivalence relation of a Polish group action. Is $E_1 \preceq^2_{\mathcal{B}} F$?
Acknowledgments The author received help from many people in writing and editing this entry. He particularly wants to acknowledge Filippo Calderoni, Marlies Gerber, Anton Gorodetski, Philipp Kunde, Andrew Marks, Ronnie Pavlov. Alexander Kechris made invaluable corrections on an early text and correspondence with Su Gao was essential in the completion of the text. As always, Benjamin Weiss provided important suggestions and comments and provided very helpful references.

Expository Books and Articles on Analytic Equivalence Relations
1. The book by Gao (2009),
2. The books by Kechris (1995), Kechris–Miller (2004) and Becker–Kechris (1996),
3. The paper by Friedman and Stanley (1989),
4. Dave Marker's website (Marker 2002).

The author would like to acknowledge partial support from NSF grant DMS-2100367.

Bibliography

Aaronson J (1997) An introduction to infinite ergodic theory, Mathematical surveys and monographs, vol 50. American Mathematical Society, Providence, RI
Barreira L, Valls C (2013) Dynamical systems, an introduction. Universitext, Springer, London
Becker H, Kechris AS (1996) The descriptive set theory of Polish group actions. London Mathematical Society lecture note series. Cambridge University Press, Cambridge
Beleznay F, Foreman M (1995) The collection of distal flows is not Borel. Am J Math 117(1):203–239
Beleznay F, Foreman M (1996) The complexity of the collection of measure-distal transformations. Ergodic Theory Dyn Syst 16(5):929–962
Bonatti C, Grines V, Pochinka O (2019) Topological classification of Morse-Smale diffeomorphisms on 3-manifolds. Duke Math J 168(13):2507–2558
Braverman M, Yampolsky M (2009) Computability of Julia sets, Algorithms and computation in mathematics, vol 23. Springer-Verlag, Berlin
Camerlo R, Gao S (2001) The completeness of the isomorphism relation for countable Boolean algebras. Trans Am Math Soc 353(2):491–518
Chacon RV (1969) Weakly mixing transformations which are not strongly mixing. Proc Am Math Soc 22:559–562
Clemens JD (2009) Isomorphism of subshifts is a universal countable Borel equivalence relation. Israel J Math 170:113–123
Ding L, Gao S (2014) Is there a spectral theory for all bounded linear operators? Not Am Math Soc 61(7):730–735
Dougherty R, Jackson S, Kechris AS (1994) The structure of hyperfinite Borel equivalence relations. Trans Am Math Soc 341(1):193–225
Effros EG (1965) Transformation groups and C*-algebras. Ann Math 81(2):38–55
Feldman J (1974) Borel structures and invariants for measurable transformations. Proc Am Math Soc 46:383–394
Feldman J, Moore CC (1977) Ergodic equivalence relations, cohomology, and von Neumann algebras. I. Trans Am Math Soc 234(2):289–324
Ferenczi V, Louveau A, Rosendal C (2009) The complexity of classifying separable Banach spaces up to isomorphism. J Lond Math Soc 79(2):323–345
Foreman M (2000) A descriptive view of ergodic theory, Descriptive set theory and dynamical systems (Marseille-Luminy, 1996), London Mathematical Society lecture note series, vol 277. Cambridge Univ. Press, Cambridge, pp 87–171
Foreman M (2010) Naive descriptive set theory, arXiv:2110.08881, 1–53.
Foreman M (2018) What is a Borel reduction? Not Am Math Soc 65(10):1263–1268
Foreman M, Weiss B (2019a) From odometers to circular systems: a global structure theorem. J Mod Dyn 15:345–423
Foreman M, Weiss B (2019b) A symbolic representation for Anosov-Katok systems. J Anal Math 137(2):603–661
Foreman M, Weiss B (2022) Measure preserving diffeomorphisms of the torus are unclassifiable. J Eur Math Soc (JEMS), 1–80.
Foreman M, Gorodetski A (2022) Anti-classification results for smooth dynamical systems, arXiv:2206.09322, 1–56.
Foreman M, Weiss B (2004) An anti-classification theorem for ergodic measure preserving transformations. J Eur Math Soc (JEMS) 6(3):277–292
Foreman M, Rudolph DJ, Weiss B (2011) The conjugacy problem in ergodic theory. Ann Math 173(3):1529–1586
Friedman H, Stanley L (1989) A Borel reducibility theory for classes of countable structures. J Symb Log 54(3):894–914
Furstenberg H (1963) The structure of distal flows. Am J Math 85:477–515
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory, Porter lectures. Princeton University Press, Princeton
Gao S (2009) Invariant descriptive set theory, A series of monographs and textbooks, vol 293. Taylor and Francis, Boca Raton
Gao S (2022) Dynamical systems and countable structures, Lecture notes: https://www.birs.ca/workshops/2022/22w5134/files/Su%20Gao/banff%20lecture%20gao%20nopause.pdf
Gao S, Pestov V (2003) On a universality property of some abelian Polish groups. Fundam Math 179(1):1–15
Gerber M, Kunde P (2021) Anti-classification results for the Kakutani equivalence relation, https://arxiv.org/abs/2109.06086v1, 1–72.
Glimm J (1961) Locally compact transformation groups. TAMS 101:124–138
Graham L, Kantor J-M (2009) Naming infinity. Belknap Press, Cambridge
Halmos PR (1944) In general a measure preserving transformation is mixing. Ann Math 45(2):786–792
Halmos PR (1956) Lectures on ergodic theory. Chelsea Publishing Company, New York
Halmos PR (1974) Naive set theory, Undergraduate texts in mathematics. Springer-Verlag, New York-Heidelberg
Halmos PR, von Neumann J (1942) Operator methods in classical mechanics. II. Ann Math 43(2):332–350
Harrington LA, Kechris AS, Louveau A (1990) A Glimm-Effros dichotomy for Borel equivalence relations. J Am Math Soc 3(4):903–928
Hasselblatt B (2007) Problems in dynamical systems and related topics, Dynamics, ergodic theory, and geometry, Mathematical Sciences Research Institute Publications, vol 54. Cambridge University Press, Cambridge, pp 273–324
Hjorth G (1999) Around nonclassifiability for countable torsion free abelian groups. In: Eklof PC, Göbel R (eds) Abelian Groups and Modules: International conference in Dublin, August 10–14, 1998. Birkhäuser, Basel, pp 269–292
Hjorth G (2000) Classification and orbit equivalence relations, Mathematical surveys and monographs, vol 75. American Mathematical Society, Providence, RI
Hjorth G (2001) On invariants for measure preserving transformations. Fundam Math 169(1):51–84
Jackson S, Kechris AS, Louveau A (2002) Countable Borel equivalence relations. J Math Log 2(1):1–80
Jech TJ (2003) Set theory: The third millenium edition. Springer, Berlin
Kanovei V (2008) Borel equivalence relations: structure and classification, University lecture series, vol 14. American Mathematical Society, Providence, RI
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems, Encyclopedia of Mathematics and its applications, vol 54. Cambridge University Press, Cambridge
Kechris A (1995) Classical descriptive set theory, Graduate texts in mathematics, 1st edn. Springer, New York
Kechris A (2010) Global aspects of Ergodic Group actions, Mathematical surveys and monographs. Am Math Soc 160
Kechris AS, Louveau A (1997) The classification of Hypersmooth Borel equivalence relations. J Am Math Soc 10(1):215–242
Khinchin AI (1949) Mathematical foundations of statistical mechanics. Dover Publications, Inc., New York. Translated by G. Gamow
King J (1986) The commutant is the weak closure of the powers, for rank-1 transformations. Ergodic Theory Dyn Syst 6(3):363–384
Krieger W (1970) On entropy and generators of measure-preserving transformations. Trans Am Math Soc 149:453–464
Krieger W (1972a) Erratum to: "On entropy and generators of measure-preserving transformations". Trans Am Math Soc 168:519
Krieger W (1972b) On unique ergodicity. In: Proceedings of the sixth Berkeley symposium on mathematical statistics and probability, University of California, Berkeley, 1970/1971, vol II: Probability theory, pp 327–346
Levy A (2002) Basic set theory. Dover Publications, Inc., Mineola, Reprint of the 1979 original [Springer, Berlin; MR0533962 (80k:04001)].
Lind D, Marcus B (1995) An introduction to symbolic dynamics and coding. Cambridge University Press, Cambridge
Lindenstrass J, Olsen G, Sternfeld Y (1978) The Poulsen simplex. Annales de l'institute Fourier 28(1):91–114
Lusin N (1933) Sur les classes des constituantes des complémentaires analytiques. Ann Scuola Norm Super Pisa Cl Sci 2(3):269–282
Marker D (2002) Descriptive set theory, http://homepages.math.uic.edu/~marker/math512/dst.pdf, 1–56.
Maruyama G (1949) The harmonic analysis of stationary stochastic processes. Mem Fac Sci Kyūsyū Univ A 4:45–106
Mekler AH (1981) Stability of nilpotent groups of class 2 and prime exponent. J Symb Log 46(4):781–788
Mirzakhani M, Feng T (2014) Introduction to ergodic theory, https://www.mit.edu/~fengt/ergodic_theory.pdf, 1–56.
Mitin AV (1998) Undecidability of the elementary theory of groups of measure-preserving transformations. Mat Zametki 63(3):414–420
Moschovakis YN (2009) Descriptive set theory: second edition, Mathematical surveys and monographs, no. 155, 2nd edn. American Mathematical Society, Providence, RI
Newhouse SE (1970) Nondensity of axiom A(a) on S2, global analysis. In: Proceedings of the Symposium on Pure Mathematics, vol XIV, Berkeley, 1968, American Mathematical Society, Providence, pp 191–202.
Kechris A, Miller BD (2004) Topics in orbit equivalence, Newhouse SE (1974) Diffeomorphisms with infinitely
Lecture notes in mathematics. Springer, Berlin/ many sinks. Topology 13:9–18
Heidelberg Ornstein D (1970a) Bernoulli shifts with the same entropy
Kechris AS, Sofronidis NE (2001) A strong generic ergo- are isomorphic. Adv Math 4:337–352
dicity property of unitary and self-adjoint operators. Ornstein D (1970b) The isomorphism problem for
Ergodic Theory Dyn Syst 21(5):1459–1479 measure-preserving transformations. In: Proceedings
Kechris AS, Tucker-Drob RD (2013) The complexity of of the symposium on functional analysis, Monterey,
classification problems in ergodic theory, Appalachian 1969. Academic Press, New York, pp 71–74. MR
set theory 2006–2012, London Mathematical Society 0262463
lecture note series, vol 406. Cambridge University Ornstein D (1974) Ergodic theory, randomness, and
Press, Cambridge, pp 265–299 dynamical systems, Yale mathematical monographs,
576 The Complexity and the Structure and Classification of Dynamical Systems
No. 5. Yale University Press, New Haven/London, Proceedings of the International Congress on Mathe-
James K. Whittemore. Lectures in Mathematics given maticians, Stockholm, 1962. Institute Mittag-Leffler,
at Yale University Djursholm, pp 49–496
Ornstein D (1975) Some open problems in Ergodic theory. Smale S (1967) Differentiable dynamical systems. Bull
Publications mathématiques et informatique de Am Math Soc 73:747–817
Rennes, no. S4 Soare R (1987) Recursively enumerable sets and degrees,
Ornstein DS, Shields PC (1973) An uncountable family of perspectives in mathematical logic. Springer, Berlin/
K-automorphisms. Adv Math 10:63–88 Heidelberg
Ornstein DS, Weiss B (1974) Finitely determined implies Thomas S (2003) The classification problem for torsion-
very weak Bernoulli. Israel J Math 17:94–104 free abelian groups of finite rank. J Am Math Soc 16(1):
Oshemkov AA, Sharko VV (1998) On the classification of 233–258
Morse-Smale flows on two-dimensional manifolds. Thomas S (2011) The classification problem for S-local
Mat Sb 189(8):93–140 torsion-free abelian groups of finite rank. Adv Math
Peixoto MM (1973) On the classification of flows on 226(4):3699–3723
2-manifolds. In: Proceedings of the Symposium Thomas S, Velickovic B (1999) On the complexity of the
dynamical systems, University of Bahia, Salvador, isomorphism relation for finitely generated groups.
1971, pp 389–419 J Algebra 217(1):352–373
Petersen KE (1983) Ergodic theory, Cambridge studies in Thouvenot J-P (2002) Entropy, isomorphism and equiva-
advanced mathematics. Cambridge University Press, lence in ergodic theory, Handbook of dynamical sys-
Cambridge tems, vol 1A. North-Holland, Amsterdam, pp 205–238
Rohlin V (1948) A “general” a measure-preserving trans- Uspenskiĭ VV (1986) A universal topological group with a
formation is not mixing. Doklady Akad Nauk SSSR 60: countable basis. Funktsional Anal i Prilozhen 20(2):
349–351 86–87
Rohlin V (1967) Lectures on the entropy theory of trans- Walters P (1982) An introduction to ergodic theory, Grad-
formations with invariant measure. Uspehi Mat Nauk uate texts in mathematics. Springer, New York
22(5):3–56, 137. Weihrauch K (2000) Computable analysis, Texts in theo-
Rosendal C (2005) Cofinal families of Borel equivalence retical computer science. An EATCS series. Springer,
relations and quasiorders. J Symb Log 70(4):1325–1340 Berlin, An introduction
Ryzhikov VV (1985) Representation of transformations Wikipedia. Hamiltonian system. https://en.wikipedia.org/
preserving the Lebesgue measure, in the form of a wiki/Hamiltonian_system
product of periodic transformations. Mat. Zametki Wikipedia. Rotation number. https://en.wikipedia.org/
38(6):860–865, 957. wiki/Rotation_number
Sabok M (2016) Completeness of the isomorphism prob- Zimmer RJ (1976) Ergodic actions with generalized dis-
lem for separable C*-algebras. Invent Math 204(3): crete spectrum. Ill J Math 20(4):555–588
833–868 Zippin L (2013) Transformation groups, Lectures in Topol-
Smale S (1963) Dynamical systems and the topological ogy; the University of Michigan Conference of 1940.
conjugacy problem for diffeomorphisms. In: University of Michigan Press, Ann Arbor
Ergodic Theory: Interactions with Combinatorics and Number Theory

Tom Ward
School of Mathematics, University of Leeds, Leeds, UK

This entry gives a brief overview of some of the ways in which number theory and combinatorics interact with ergodic theory. The main themes are illustrated by examples related to recurrence, mixing, orbit counting, and Diophantine analysis.

Article Outline

Glossary
Definition of the Subject
Introduction
Ergodic Theory
Frequency of Returns
Ergodic Ramsey Theory and Recurrence
Orbit-Counting as an Analogous Development
Diophantine Analysis as a Toolbox
Future Directions
Bibliography

Glossary

Almost everywhere (abbreviated a.e.) A property that makes sense for each point x in a measure space (X, ℬ, μ) is said to hold almost everywhere (or a.e.) if the set N ⊆ X on which it does not hold satisfies N ∈ ℬ and μ(N) = 0.
Baker’s theorem A lower bound for the absolute value of linear combinations of logarithms of algebraic numbers; this is a fundamental result in transcendental number theory.
Čech–Stone compactification of ℕ, βℕ A compact Hausdorff space containing ℕ as a dense subset with the property that any map from ℕ to a compact Hausdorff space K extends uniquely to a continuous map βℕ → K. This property and the fact that βℕ is a compact Hausdorff space containing ℕ characterizes βℕ up to homeomorphism.
Curvature An intrinsic measure of the curvature of a Riemannian manifold depending only on the Riemannian metric; in the case of a surface it determines whether the surface is locally convex (positive curvature), locally saddle-shaped (negative), or locally flat (zero).
Diophantine approximation Theory of the approximation of real numbers by rational numbers: how small can the distance from a given irrational real number to a rational number be made in terms of the denominator of the rational?
Diophantine problem A system of equations for which the set of integer solutions is sought. The description of the set of solutions may be qualitative (asserting that the set is finite), quantitative (bounding the number of solutions in terms of parameters like degrees), or effective (bounding the height of possible solutions, and as a result bounding both the number of solutions and their location).
Dynamical zeta function For a map with $f_n$ points of period n, the dynamical zeta function is formally defined to be $\exp\left(\sum_{n \geqslant 1} \frac{f_n}{n} z^n\right)$ for a complex variable z. Convergence in a disc of positive radius is associated with exponential bounds on the growth in the number of periodic points.
Equidistributed A sequence is equidistributed if the asymptotic proportion of time it spends in an interval is proportional to the length of the interval.
Ergodic A measure-preserving transformation is ergodic if the only invariant functions are equal to a constant almost everywhere (a.e.); equivalently if the transformation exhibits the convergence in the quasi-ergodic hypothesis.
Ergodic theory The study of statistical properties of orbits in abstract models of dynamical systems; more generally properties of measure-preserving (semi-)group actions on measure spaces.
Geodesic (flow) The shortest path between two points on a Riemannian manifold; such a geodesic path is uniquely determined by a starting point and the initial tangent vector to the path (that is, a point in the unit tangent bundle). The transformation on the unit tangent bundle defined by flowing along the geodesic defines the geodesic flow.
Haar measure (on a compact group) If G is a compact topological group, the unique measure μ defined on the Borel sets of G with the property that μ(A + g) = μ(A) for all g ∈ G and μ(G) = 1.
Measure-theoretic entropy A numerical invariant of measure-preserving systems that reflects the asymptotic growth in complexity of measurable partitions refined under iteration of the map.
Mixing A measure-preserving system is mixing if measurable sets (events) become asymptotically independent as they are moved apart in time (under iteration).
Orbit Dirichlet series For a map with $o_n$ closed orbits of length n, the Dirichlet series $\sum_{n \geqslant 1} \frac{o_n}{n^s}$ for a complex variable s. Convergence on a right half-plane is associated with polynomial bounds on orbit growth.
Pólya-Carlson theorem If a complex power series with integer coefficients has radius of convergence 1, then either it defines a rational function or it admits a natural boundary at its radius of convergence.
(Quasi) Ergodic hypothesis The assumption that, in a dynamical system evolving in time and preserving a natural measure, there are some reasonable conditions under which the “time average” along orbits of an observable (that is, the average value of a function defined on the phase space) will converge to the “space average” (that is, the integral of the function with respect to the preserved measure).
Recurrence Return of an orbit in a dynamical system close to its starting point infinitely often.
S-unit theorems A circle of results stating that linear equations in fields of zero characteristic have only finitely many solutions taken from finitely-generated multiplicative subgroups of the multiplicative group of the field (apart from infinite families of solutions arising from vanishing sub-sums).
Topological entropy A numerical invariant of topological dynamical systems that measures the asymptotic growth in the complexity of orbits under iteration. The variational principle states that the topological entropy of a topological dynamical system is the supremum over all invariant measures of the measure-theoretic entropies of the dynamical systems viewed as measurable dynamical systems.

Definition of the Subject

Number theory is a branch of pure mathematics concerned with the properties of numbers in general, and integers in particular. The areas of most relevance to this entry are Diophantine analysis (the study of how real numbers may be approximated by rational numbers, and the consequences for solutions of equations in integers); analytic number theory, and in particular asymptotic estimates for the number of primes smaller than X as a function of X; equidistribution, and questions about how the digits of real numbers are distributed. Combinatorics is concerned with identifying structures in discrete objects; of most interest here is that part of combinatorics connected with Ramsey theory, asserting that large subsets of highly structured objects must automatically contain large replicas of that structure. Ergodic theory is the study of asymptotic behavior of group actions preserving a probability measure; it has proved to be a powerful part of dynamical systems with wide applications.

Introduction

Ergodic theory, part of the mathematical study of dynamical systems, has pervasive connections with number theory and combinatorics. This entry briefly surveys how these arise through a small sample of results. Unsurprisingly, many details are suppressed, and of course the selection of topics reflects the author’s interests far more than it does the full extent of the flow of ideas
between ergodic theory and number theory. In addition, the selection of topics has been chosen in part to be complementary to those in related entries in the Encyclopedia. A particularly enormous lacuna is the theory of a class of systems called “arithmetic dynamical systems” itself; these are rational functions that map a variety to itself. The goal of the theory is to understand the arithmetic and geometry of orbits of points under iteration, and (depending on the field over which the variety is defined) it has strong connections to algebraic and arithmetic geometry. The monograph by Silverman (2007) gives a comprehensive overview.

More sophisticated aspects of this connection – in particular the connections between ergodic theory on homogeneous spaces and Diophantine analysis – are covered in the entries ▶ “Ergodic Theory on Homogeneous Spaces and Metric Number Theory” by Kleinbock and ▶ “Ergodic

this entry. The ideas and conditions surrounding the quasi-ergodic hypothesis were eventually placed on a firm mathematical footing by developments starting in 1890. For a single measure-preserving transformation T : X → X of a probability space (X, ℬ, μ), Poincaré (1890) showed a recurrence theorem: if E ∈ ℬ is any measurable set, then for a.e. x ∈ E there is an infinite set of return times, $0 < n_1 < n_2 < \cdots$ with $T^{n_j}(x) \in E$ (of course Poincaré noted this in a specific setting, concerned with a natural invariant measure for the “three-body” problem in planetary motion).

Poincaré’s qualitative result was made quantitative in the 1930s, when von Neumann (1932) used the approach of Koopman (1931) to show the mean ergodic theorem: if $f \in L^2(\mu)$ then there is some $\bar{f} \in L^2(\mu)$ for which
indicator function of a measurable set A shows that ergodicity guarantees that a.e. orbit spends an asymptotic proportion of time in A equal to the volume μ(A) of that set (as measured by the invariant measure). This points to the start of the pervasive connections between ergodic theory and number theory – but as this and other articles relate, the connections extend far beyond this.

Frequency of Returns

In this section, we illustrate the way in which a dynamical point of view may unify, explain, and extend quite disparate results from number

This was eventually proved by Kuz’min (1928) and Lévy (1929), and the probability distribution of the digits is the Gauss–Kuz’min law. Khinchin (1964) developed this further, showing for example that

\[ \lim_{n\to\infty} \bigl(a_1(x)\,a_2(x) \cdots a_n(x)\bigr)^{1/n} = \prod_{n=1}^{\infty} \left( \frac{(n+1)^2}{n(n+2)} \right)^{\log n/\log 2} = 2.68545\ldots \quad \text{for a.e. } x. \]

Lévy (1936) showed that the denominator $q_n(x)$ of the nth convergent $\frac{p_n(x)}{q_n(x)}$ (the rational
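The value 2.68545… attributed to Khinchin above can be checked numerically from the infinite product $\prod_{n \geqslant 1} \bigl((n+1)^2/(n(n+2))\bigr)^{\log n/\log 2}$. The following is only a minimal sketch: the function name `khinchin_partial_product` and the truncation points are illustrative assumptions, and the partial products converge slowly (the error after N terms is roughly of size (log N)/N).

```python
import math


def khinchin_partial_product(terms: int) -> float:
    """Partial product prod_{n=1}^{terms} ((n+1)^2 / (n(n+2)))^(log n / log 2).

    Hypothetical helper for illustration; it accumulates logarithms so that
    the many factors close to 1 are summed accurately.
    """
    log_total = 0.0
    for n in range(1, terms + 1):
        # (n+1)^2 / (n(n+2)) = 1 + 1/(n(n+2)), so log1p handles small terms well.
        log_total += math.log2(n) * math.log1p(1.0 / (n * (n + 2)))
    return math.exp(log_total)


if __name__ == "__main__":
    for terms in (10**3, 10**4, 10**5):
        print(terms, khinchin_partial_product(terms))
```

The partial products increase with the number of terms, since every factor is at least 1, and approach the limiting constant from below.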
α is irrational. This result was refined and extended in many directions; for example, Hlawka (1964) and others found rates for the convergence in terms of the discrepancy of the sequence, Weyl (1916) proved equidistribution for $\{n^2\alpha\}$, and Vinogradov for $\{p_n\alpha\}$ where $p_n$ is the nth prime.

averages $\frac{1}{N}\sum_{n=0}^{N-1} f(T^n x)$ must converge to $\int_X f\,d\mu_1$ a.e. with respect to $\mu_1$ and to $\int_X f\,d\mu_2$ a.e. with respect to $\mu_2$. Thus the presence of many invariant measures for a continuous map means that ergodic averages along the orbits of specific points need not converge to the space average with respect to a chosen invariant measure.
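The equidistribution statements quoted in this section for the fractional parts of nα and n²α can be explored empirically. The sketch below makes several illustrative assumptions: α = √2, the interval [0.2, 0.5), and the hypothetical helper names `fractional_parts` and `visit_frequency`. It uses floating-point arithmetic, so it is a numerical experiment rather than a proof.

```python
import math


def fractional_parts(alpha: float, count: int, power: int = 1) -> list:
    """Fractional parts of n^power * alpha for n = 1, ..., count."""
    return [((n ** power) * alpha) % 1.0 for n in range(1, count + 1)]


def visit_frequency(points: list, left: float, right: float) -> float:
    """Proportion of the points that lie in the interval [left, right)."""
    hits = sum(1 for p in points if left <= p < right)
    return hits / len(points)


if __name__ == "__main__":
    alpha = math.sqrt(2.0)  # assumed irrational rotation number
    for power in (1, 2):
        points = fractional_parts(alpha, 20000, power)
        # Equidistribution predicts a frequency close to the length 0.3.
        print(power, visit_frequency(points, 0.2, 0.5))
```

For an equidistributed sequence the printed frequencies should approach the interval length 0.3 as the number of terms grows; the linear sequence typically converges much faster than the quadratic one.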
Ergodic Ramsey Theory and Recurrence

In 1927, van der Waerden proved a conjecture attributed to Baudet: if the natural numbers are written as a disjoint union of finitely many sets,

\[ \lim_{i\to\infty} \frac{1}{N_i - M_i} \bigl|\{a \in A : M_i < a < N_i\}\bigr| > 0. \]

Erdős and Turán (1936) conjectured the stronger statement that any subset of ℕ with positive upper density must contain arbitrarily long arithmetic progressions. This statement was shown for
arithmetic progressions of length 3 by Roth (1952) in 1952, then for length 4 by Szemerédi (1969) in 1969. The general result was eventually proved by Szemerédi (1975) in 1975 in a lengthy and extremely difficult argument.

Furstenberg saw that Szemerédi’s Theorem would follow from a deep extension of the

Poincaré recurrence is the statement that ℕ is a set of recurrence. Furstenberg and Katznelson (1978) showed that if $T_1, \ldots, T_k$ form a family of commuting measure-preserving transformations and A is a set of positive measure, then
\[ \liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} \mu\left( \bigcap_{i=1}^{k} \Bigl( \prod_{j=1}^{t} T_j^{\,p_{i,j}(n)} \Bigr)^{-1} A \right) > 0. \]

Using the Furstenberg correspondence principle, this gives a multi-dimensional polynomial Szemerédi theorem: If $P : \mathbb{Z}^r \to \mathbb{Z}^\ell$ is a polynomial mapping with the property that P(0) = 0, and $F \subset \mathbb{Z}^r$ is a finite configuration, then any set $S \subset \mathbb{Z}^\ell$ of positive upper Banach density contains a set of the form u + P(nF) for some $u \in \mathbb{Z}^\ell$ and n ∈ ℕ.

In a different direction, motivated in part by Hindman’s theorem, the multiple recurrence results generalize to IP-sets. Furstenberg and Katznelson (1985) proved a linear IP-multiple recurrence theorem in which the recurrence is guaranteed to occur along an IP-set.

A combinatorial proof of this result has been found by Nagle et al. (2006). Bergelson and McCutcheon (2000) extended these results by proving a polynomial IP-multiple recurrence theorem. To formulate this, make the following definitions. Write ℱ for the family of non-empty finite subsets of ℕ, so that a sequence indexed by ℱ is an IP-set. More generally, an ℱ-sequence

There are a large number of deep combinatorial consequences of this result, not all of which seem accessible by other means.

Sets of Primes

In a remarkable development, Szemerédi’s theorem and some of the ideas behind ergodic Ramsey theory joined results of Goldston et al. (2009) in playing a part in Green and Tao’s proof (Green and Tao 2008) that the set of primes contains arbitrarily long arithmetic progressions. This profound result is surveyed from an ergodic point of view in the article of Kra (2006). As with Szemerédi’s theorem itself, this result has been extended to a polynomial setting by Tao and Ziegler (2008). Given integer-valued polynomials $f_1, \ldots, f_k \in \mathbb{Z}[t]$ with

\[ f_1(0) = \cdots = f_k(0) = 0 \]

and any ε > 0, Tao and Ziegler proved that there are infinitely many integers x, m with $1 \leqslant m \leqslant x^{\varepsilon}$ for which $x + f_1(m), \ldots, x + f_k(m)$ are primes. We refer to a survey of Ziegler for an overview of these developments (Ziegler 2014).
In a different direction, Lalley (1987) found orbit asymptotics for closed orbits satisfying constraints in the Axiom A setting without using Tauberian theorems. His more direct approach is still analytic, using complex transfer operators (the same objects used by Parry and Pollicott to study the dynamical zeta function at complex values) and indeed somewhat parallels a Tauberian argument.

Further resonances with number theory arise here. For example, there are results on the distribution of closed orbits for group extensions (analogous to Chebotarev’s theorem) and for orbits with homological constraints (see Sharp (1993), Katsuda and Sunada (1990)).

Of course the great diversity of dynamical systems subsumed in the phrase “prime orbit theorem” creates new problems and challenges, and in particular if there is not much geometry to work with then the reliance on Markov partitions and transfer operators makes it difficult to find higher-order asymptotics.

Dolgopyat (1998) has nonetheless managed to push the Markov methods to obtain uniform bounds on iterates of the associated transfer operators to the region $\Re(s) > s_0$ with $s_0 < 1$. This result has wide implications; an example most relevant to the analogy with number theory is the work of Pollicott and Sharp (1998) in which Dolgopyat’s result is used to show that for certain geodesic flows there is a two-term prime orbit theorem of the form

\[ \pi(X) = \operatorname{li}\bigl(e^{h_{\mathrm{top}} X}\bigr) + O\bigl(e^{cX}\bigr) \]

for some $c < h_{\mathrm{top}}$.

For non-positive curvature manifolds less is known: Knieper (1997) finds upper and lower bounds for the function counting closed geodesics on rank-1 manifolds of non-positive curvature of the form

\[ A\,\frac{e^{hX}}{X} \leqslant \pi(X) \leqslant B\,e^{hX} \]

for constants A, B > 0.

Counting Orbits for Group Endomorphisms

A prism through which to view some of the deeper issues that arise in section “Counting Orbits and Geodesics” is provided by group endomorphisms. The price paid for having simple closed formulas for all the quantities involved is of course a severe loss of generality, but the diversity of examples illustrates many of the phenomena that may be expected in more general settings when hyperbolicity is lost.

Consider an endomorphism T : X → X of a compact group with the property that

\[ F_n(T) = \bigl|\{x \in X : T^n x = x\}\bigr| < \infty \]

for all n ⩾ 1. The number of closed orbits of length n under T is then

\[ O_n(T) = \frac{1}{n} \sum_{d \mid n} \mu(n/d)\, F_d(T). \tag{7} \]

In simple situations (hyperbolic toral automorphisms for example), it is straightforward to show that

\[ \pi_T(X) = \bigl|\{\tau : \tau \text{ a closed orbit under } T \text{ of length} \leqslant X\}\bigr| \sim \frac{e^{(X+1)h_{\mathrm{top}}(T)}}{X}. \tag{8} \]

Waddington (1991) considered quasihyperbolic toral automorphisms, showing that the asymptotic (8) in this case is multiplied by an explicit almost-periodic function bounded away from zero and infinity.

This result has been extended further into non-hyperbolic territory, which is most easily seen via the so-called connected S-integer dynamical systems introduced by Chothi et al. (1997). Fix an algebraic number field $\mathbb{K}$ with set of places $P(\mathbb{K})$ and set of infinite places $P_\infty(\mathbb{K})$, an element of infinite multiplicative order $\xi \in \mathbb{K}^{\times}$, and a finite set $S \subset P(\mathbb{K}) \setminus P_\infty(\mathbb{K})$ with the property that $|\xi|_w \leqslant 1$ for all $w \notin S \cup P_\infty(\mathbb{K})$. The associated ring of S-integers is
Haar measure. On connected groups, the S-integer construction has been used by Baier et al. (2013) to show that the following “exotic” orbit growth properties occur.
• For any κ ∈ (0, 1), there is an ergodic compact connected group automorphism T : X → X with $M_T(N) \sim \kappa \log N$.
• For any r ∈ ℕ and κ > 0, there is an ergodic compact connected group automorphism T : X → X with $M_T(N) \sim \kappa (\log\log N)^r$.
• For any δ ∈ (0, 1) and κ > 0, there is an ergodic compact connected group automorphism T : X → X with $M_T(N) \sim \kappa (\log N)^{\delta}$.
We refer to the survey of Miles et al. (2015) for an overview.

Pólya–Carlson Dichotomy

Everest et al. (2005) noticed that the very simplest example of a nontrivial S-integer system, namely, the automorphism dual to $x \mapsto 2x$ on $\mathbb{Z}[\tfrac{1}{6}]$, had the property that its dynamical zeta function admitted a natural boundary at its circle of convergence. This phenomenon later emerged as a pervasive feature of group automorphisms, and Bell et al. (2014) formulated the conjecture that compact group automorphisms exhibit a Pólya–Carlson dichotomy: their dynamical zeta function is either rational or admits a natural boundary. This has been proved in many cases with S or its complement finite, but in general remains open. This property certainly does not hold for maps in general, as it is easy to find examples of maps whose dynamical zeta function is a nonrational function satisfying an algebraic identity.

Similar questions arise for endomorphisms of abelian varieties, and in positive characteristic, the relationship between rationality, algebraicity, and the existence of natural boundaries for the dynamical zeta function has been studied by Byszewski and Cornelissen (2018), where once again a dichotomy of Pólya–Carlson type is found.

theory in a direct way; we illustrate this by describing a selection of dynamical problems that call on particular parts of number theory in an essential way. The example of mixing in section “Mixing and Additive Relations in Fields” is particularly striking for two reasons: the results needed from number theory are relatively recent, and the ergodic application directly motivated a further development in number theory.

Orbit Growth and Convergence

The analysis of periodic orbits – how their number grows as the length grows and how they spread out through space – is of central importance in dynamics (see Katok (1980) for example). An instance of this is that for many simple kinds of dynamical systems T : X → X (where T is a continuous map of a compact metric space (X, d)), the logarithmic growth rate of the number of periodic points exists and coincides with the topological entropy h(T) (an invariant giving a quantitative measure of the average rate of growth in orbit complexity under T). That is,

\[ \frac{1}{n} \log F_n(T) \longrightarrow h_{\mathrm{top}}(T) \tag{10} \]

for many of the simplest dynamical systems. For example, if $X = \mathbb{T}^r$ is the r-torus and $T = T_A$ is the automorphism of the torus corresponding to a matrix A in $\mathrm{GL}_r(\mathbb{Z})$, then $T_A$ is ergodic with respect to Lebesgue measure if and only if no eigenvalue of A is a root of unity. Under this assumption, we have

\[ F_n(T_A) = \prod_{i=1}^{r} \bigl|\lambda_i^n - 1\bigr| \]

and

\[ h_{\mathrm{top}}(T_A) = \sum_{i=1}^{r} \log \max\{1, |\lambda_i|\}. \tag{11} \]
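The formulas above can be tried out on a concrete example. The sketch below assumes the hyperbolic matrix A = [[2, 1], [1, 1]] on the 2-torus (an illustrative choice, not taken from the text) and uses the standard identity that the product of |λ_i^n − 1| over the eigenvalues equals |det(A^n − I)|. It computes the periodic point counts F_n, the closed orbit counts O_n from the Möbius inversion formula (7), and compares (1/n) log F_n with the entropy from (11). All function names are hypothetical, and `entropy` assumes real eigenvalues.

```python
import math

CAT_MAP = ((2, 1), (1, 1))  # assumed example matrix in GL_2(Z)


def mat_mul(p, q):
    """Product of two 2x2 integer matrices given as nested tuples."""
    return (
        (p[0][0] * q[0][0] + p[0][1] * q[1][0], p[0][0] * q[0][1] + p[0][1] * q[1][1]),
        (p[1][0] * q[0][0] + p[1][1] * q[1][0], p[1][0] * q[0][1] + p[1][1] * q[1][1]),
    )


def mat_pow(p, n):
    """n-th power of a 2x2 integer matrix by repeated multiplication."""
    result = ((1, 0), (0, 1))
    for _ in range(n):
        result = mat_mul(result, p)
    return result


def periodic_points(a, n):
    """F_n(T_A) = |det(A^n - I)|, which equals prod |lambda_i^n - 1|."""
    m = mat_pow(a, n)
    return abs((m[0][0] - 1) * (m[1][1] - 1) - m[0][1] * m[1][0])


def mobius(n):
    """Moebius function, computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # a squared prime factor forces the value 0
            result = -result
        p += 1
    return -result if n > 1 else result


def closed_orbits(a, n):
    """O_n(T_A) from formula (7): (1/n) * sum over d | n of mobius(n/d) * F_d."""
    total = sum(mobius(n // d) * periodic_points(a, d)
                for d in range(1, n + 1) if n % d == 0)
    return total // n


def entropy(a):
    """h_top(T_A) from formula (11); assumes the two eigenvalues are real."""
    trace = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = math.sqrt(trace * trace - 4 * det)
    eigenvalues = ((trace + root) / 2, (trace - root) / 2)
    return sum(math.log(max(1.0, abs(ev))) for ev in eigenvalues)


if __name__ == "__main__":
    print([periodic_points(CAT_MAP, n) for n in range(1, 7)])
    print([closed_orbits(CAT_MAP, n) for n in range(1, 7)])
    print(math.log(periodic_points(CAT_MAP, 30)) / 30, entropy(CAT_MAP))
```

For this matrix the last line illustrates the convergence (10): the logarithmic growth rate of F_n is already very close to the entropy at n = 30.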
assumption, the convergence is less clear: for r ⩾ 4 the automorphism $T_A$ may be ergodic without being hyperbolic. That is, while no eigenvalues are unit roots some may have unit modulus. As pointed out by Lind (1982) in his study of these quasihyperbolic automorphisms, the convergence (10) does still hold for these systems, but this requires a significant Diophantine result (the theorem of Gel’fond (1960) suffices; one may also use Baker’s theorem (Baker 1990)). Even further from hyperbolicity lie the family of S-integer systems (Chothi et al. 1997; Ward 1998); their orbit-growth properties are intimately tied up with Artin’s conjecture on primitive roots and prime divisors of linear recurrence sequences.

Mixing and Additive Relations in Fields

The problem of higher-order mixing for commuting group automorphisms provides a striking example of the dialogue between ergodic theory and number theory, in which deep results from number theory have been used to solve problems in ergodic theory, and questions arising in ergodic theory have motivated further developments in number theory.

An action T of a countable group G on a probability space (X, ℬ, μ) is called k-fold mixing or mixing on (k + 1) sets if

\[ \mu\bigl(A_0 \cap T^{g_1} A_1 \cap \cdots \cap T^{g_k} A_k\bigr) \longrightarrow \mu(A_0) \cdots \mu(A_k) \tag{12} \]

as

\[ g_i g_j^{-1} \longrightarrow \infty \quad \text{for } i \neq j \]

with the convention that $g_0 = 1_G$, for any sets $A_0, \ldots, A_k \in \mathcal{B}$; $g_n \to \infty$ in G means that for any finite set F ⊂ G there is an N with $n > N \implies g_n \notin F$.

For k = 1 the property is called simply mixing. This notion for single transformations goes back to the foundational work of Rohlin (1949), where he showed that ergodic group endomorphisms are mixing of all orders (and so the notion is not useful for distinguishing between group endomorphisms as measurable dynamical systems). He raised the (still open) question of whether any measure-preserving transformation can be mixing without being mixing of all orders.

A class of group actions that are particularly easy to understand are the algebraic dynamical systems studied systematically by Schmidt (1995): here X is a compact abelian group, each $T_g$ is a continuous automorphism of X, and μ is the Haar measure on X. Schmidt (1989) related mixing properties of algebraic dynamical systems with $G = \mathbb{Z}^d$ to statements in arithmetic and showed that a mixing action on a connected group could only fail to mix in a certain way. Later Schmidt and the author (Schmidt and Ward 1993) showed that for X connected, mixing implies mixing of all orders. The proof proceeds by showing that the result is exactly equivalent to the following statement: if $\mathbb{K}$ is a field of characteristic zero, and G is a finitely generated subgroup of the multiplicative group $\mathbb{K}^{\times}$, then the equation

\[ a_1 x_1 + \cdots + a_n x_n = 1 \tag{13} \]

for fixed $a_1, \ldots, a_n \in \mathbb{K}^{\times}$ has a finite number of solutions $x_1, \ldots, x_n \in G$ for which no subsum $\sum_{i \in I} a_i x_i$ with $I \subsetneq \{1, \ldots, n\}$ vanishes. The bound on the number of solutions to (13) follows from the profound extensions to W. Schmidt’s subspace theorem in Diophantine geometry (Schmidt 1972) by Evertse and Schlickewei (see Evertse and Schlickewei (1999), van der Poorten and Schlickewei (1991), and Schlickewei (1990) for the details).

The argument in Schmidt and Ward (1993) may be cast as follows: failure of k-fold mixing in a connected algebraic dynamical system implies (via duality) an infinite set of solutions to an equation of the shape (13) in some field of characteristic zero. The S-unit theorem means that this can only happen if there is some proper subsum that vanishes infinitely often. This infinite family of solutions to a homogeneous form of (13) with fewer terms can then be translated back via duality to show that the system fails to mix for some strictly lower order, proving that mixing implies mixing of all orders by induction.

Stronger Diophantine results allow these arguments to be extended in various directions. Miles
and Ward (2006) used explicit bounds on the number of solutions of S-integer systems due to Evertse et al. (2002) to show that mixing implies mixing of all orders for actions of $\mathbb{Q}^d$ by automorphisms of a compact connected abelian group.

Mixing properties for algebraic dynamical systems without the assumption of connectedness are quite different, and in particular, it is possible to have mixing actions that are not mixing of all orders. This is a simple consequence of the fact that the constituents of a disconnected algebraic dynamical system are associated with fields of positive characteristic, where the presence of the Frobenius automorphism can prevent higher-order mixing. Ledrappier (1978) pointed this out via examples of the following shape. Let

\[ X = \{ x \in \mathbb{F}_2^{\mathbb{Z}^2} : x_{(a+1,b)} + x_{(a,b)} + x_{(a,b+1)} = 0 \pmod 2 \} \]

and define the $\mathbb{Z}^2$-action $T$ to be the natural shift action,

\[ (T^{(n,m)} x)_{(a,b)} = x_{(a+n,b+m)}. \]

It is readily seen that this action is mixing with respect to the Haar measure. The condition $x_{(a+1,b)} + x_{(a,b)} + x_{(a,b+1)} = 0 \pmod 2$ implies that, for any $k \geqslant 1$,

\[ x_{(0,2^k)} = \sum_{j=0}^{2^k} \binom{2^k}{j} x_{(j,0)} = x_{(0,0)} + x_{(2^k,0)} \pmod 2 \tag{14} \]

since every entry in the $2^k$th row of Pascal's triangle is even apart from the first and the last. Now let $A = \{x \in X : x_{(0,0)} = 0\}$ and let $x^* \in X$ be any element with $x^*_{(0,0)} = 1$. Then $X$ is the disjoint union of $A$ and $A + x^*$, so

\[ \mu(A) = \mu(A + x^*) = \tfrac{1}{2}. \]

However, (14) shows that

\[ x \in A \cap T^{-(2^k,0)} A \implies x \in T^{-(0,2^k)} A, \]

so

\[ A \cap T^{-(2^k,0)} A \cap T^{-(0,2^k)} (A + x^*) = \emptyset \]

for all $k \geqslant 1$, which shows that $T$ cannot be mixing on three sets. Arenas-Carmona et al. (2008) showed that if the dilates of the specific shape $\{(0, 0), (1, 0), (0, 1)\}$ are avoided, then Ledrappier's system exhibits mixing of all orders.

The full picture of higher-order mixing properties on disconnected groups is rather involved; see Schmidt's monograph (Schmidt 1995). A simple illustration is the construction by Einsiedler and the author (Einsiedler and Ward 2003) of systems with any prescribed order of mixing. When such systems fail to be mixing of all orders, they fail in a very specific way: along dilates of a specific shape (a finite subset of $\mathbb{Z}^d$). Some insight into the range of possible behaviors for algebraic dynamical systems is given by the following result. Call a shape $S$ "admissible" if it does not lie on a line in $\mathbb{Z}^d$, contains 0, and has the property that for any $k > 1$ the set $\frac{1}{k} S$ contains nonintegral points. Then it is shown in Ward (1997) that if $S$ and $T$ are admissible shapes, then there is an algebraic $\mathbb{Z}^d$-action that is mixing on $S$ and not mixing on $T$ unless a translate of $T$ is a subset of $S$. In Ledrappier's example above, the shape that fails to mix is $\{(0, 0), (1, 0), (0, 1)\}$. This gives an order of mixing as detected by shapes; computing this is in principle an algebraic problem. On the other hand, there is a more natural definition of the order of mixing, namely, the largest $k$ for which (12) holds; computing this is in principle a Diophantine problem. A conjecture emerged (formulated explicitly by Schmidt (2001)) that for any algebraic dynamical system, if every set of cardinality $r \geqslant 2$ is a mixing shape, then the system is mixing on $r$ sets.

This question motivated Masser (2004) to prove an appropriate analogue of the S-unit theorem on the number of solutions to (13) in positive characteristic as follows. Let $H$ be a multiplicative group and fix $n \in \mathbb{N}$. An infinite subset $A \subseteq H^n$ is called broad if it has both of the following properties:
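Returning to Ledrappier's example, identity (14) can be checked numerically. The following is a minimal sketch in Python, not part of the original argument: it fills in a finite triangular window of a point of $X$ from a random bottom row, using the defining relation in the form $x_{(a,b+1)} = x_{(a,b)} + x_{(a+1,b)} \pmod 2$, and then compares $x_{(0,2^k)}$ with $x_{(0,0)} + x_{(2^k,0)}$. The function names `ledrappier_rows`, `check_identity_14`, and `interior_binomials_even` are illustrative assumptions.

```python
import random
from math import comb


def ledrappier_rows(row0, height):
    """Extend a bottom row upward with x[(a, b+1)] = x[(a, b)] + x[(a+1, b)] mod 2.

    rows[b][a] holds x_{(a,b)}; each new row is one entry shorter than the last.
    """
    rows = [list(row0)]
    for _ in range(height):
        prev = rows[-1]
        rows.append([(prev[a] + prev[a + 1]) % 2 for a in range(len(prev) - 1)])
    return rows


def check_identity_14(k, trials=50, seed=0):
    """Test x_{(0,2^k)} = x_{(0,0)} + x_{(2^k,0)} (mod 2) on random bottom rows."""
    rng = random.Random(seed)
    n = 2 ** k
    for _ in range(trials):
        row0 = [rng.randint(0, 1) for _ in range(n + 1)]
        rows = ledrappier_rows(row0, n)
        if rows[n][0] != (row0[0] + row0[n]) % 2:
            return False
    return True


def interior_binomials_even(k):
    """Every entry of row 2^k of Pascal's triangle, except the two ends, is even."""
    n = 2 ** k
    return all(comb(n, j) % 2 == 0 for j in range(1, n))


if __name__ == "__main__":
    for k in range(1, 6):
        print(k, check_identity_14(k), interior_binomials_even(k))
```

In particular, whenever $x_{(0,0)} = 0$ and $x_{(2^k,0)} = 0$, the check forces $x_{(0,2^k)} = 0$, which is the relation behind the failure of mixing on three sets.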
rapidly, and many future directions of research are discussed in the entries ▶ “Ergodic Theory on Homogeneous Spaces and Metric Number Theory” by Kleinbock, ▶ “Ergodic Theory: Rigidity” by Niţică, and ▶ “Ergodic Theory: Recurrence” by Frantzikinakis and McCutcheon. Some of the directions most relevant to the examples discussed in this entry include the following.

The recent developments mentioned in section “Sets of Primes” clearly open many exciting prospects involving finding new structures in arithmetically significant sets (like the primes). The original conjecture of Erdős and Turán (1936) asked if $\sum_{a \in A \subseteq \mathbb{N}} \frac{1}{a} = \infty$ is sufficient to force the set $A$ to contain arbitrarily long arithmetic progressions and remains open. This would of course imply both Szemerédi’s theorem (Szemerédi 1975) and the result of Green and Tao (2008) on arithmetic progressions in the primes. More generally, it is clear that there is still much to come from the dialogue subsuming the four parallel proofs of Szemerédi’s theorem: one by purely combinatorial methods, one by ergodic theory, one by hypergraph theory, and one by Fourier analysis and additive combinatorics. For an overview, see the survey papers of Tao (2006, 2007a, b).

In the context of the orbit-counting results in section “Orbit-Counting as an Analogous Development,” a natural problem is to, on the one hand, obtain finer asymptotics with better control of the error terms and, on the other hand, to extend the situations that can be handled. In particular, relaxing the hypotheses related to hyperbolicity (or negative curvature) is a constant challenge.

The rate of mixing result at the end of section “Mixing and Additive Relations in Fields” points towards a large array of natural conjectures concerned with rate of mixing, central limit theorem phenomena, and rigidity of cocycles for $\mathbb{Z}^d$-actions.

Bibliography

Primary Literature
Adler RL, Weiss B (1970) Similarity of automorphisms of the torus. Memoirs of the American Mathematical Society, No. 98. American Mathematical Society, Providence
Anosov DV (1967) Geodesic flows on closed Riemannian manifolds of negative curvature. Trudy Mat Inst Steklov 90:209
Arenas-Carmona L, Berend D, Bergelson V (2008) Ledrappier’s system is almost mixing of all orders. Ergod Theor Dyn Syst 28(2):339–365
Auslander L, Green L, Hahn F (1963) Flows on homogeneous spaces. With the assistance of L. Markus and W. Massey, and an appendix by L. Greenberg. Annals of Mathematics Studies, No. 53. Princeton University Press, Princeton
Baier S, Jaidee S, Stevens S, Ward T (2013) Automorphisms with exotic orbit growth. Acta Arith 158(2):173–197
Baker A (1990) Transcendental number theory, 2nd edn. Cambridge mathematical library. Cambridge University Press, Cambridge
Bell J, Miles R, Ward T (2014) Towards a Pólya–Carlson dichotomy for algebraic dynamics. Indag Math (NS) 25(4):652–668
Benford F (1938) The law of anomalous numbers. Proc Am Philos Soc 78:551–572
Bergelson V (2003) Minimal idempotents and ergodic Ramsey theory. In: Topics in dynamics and ergodic theory, volume 310 of London Math. Soc. Lecture Note Ser. Cambridge University Press, Cambridge, pp 8–39
Bergelson V, Leibman A (1996) Polynomial extensions of van der Waerden’s and Szemerédi’s theorems. J Am Math Soc 9(3):725–753
Bergelson V, McCutcheon R (2000) An ergodic IP polynomial Szemerédi theorem. Mem Amer Math Soc 146(695):viii+106
Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci U S A 17:656–660
Bohl P (1909) Über ein in der Theorie der säkularen Störungen vorkommendes Problem. J Reine Angew Math 135:189–283
Borel E (1909) Les probabilités denombrables et leurs applications arithmetiques. Rend Circ Math Palermo 27:247–271
Bourgain J (1988a) An approach to pointwise ergodic theorems. In: Geometric aspects of functional analysis (1986/87), volume 1317 of Lecture Notes in Math. Springer, Berlin, pp 204–223
Bourgain J (1988b) On the maximal ergodic theorem for certain subsets of the integers. Israel J Math 61(1):39–72
Bowen R (1970) Markov partitions for Axiom A diffeomorphisms. Am J Math 92:725–747
Bowen R (1972) The equidistribution of closed geodesics. Am J Math 94:413–423
Bowen R (1973) Symbolic dynamics for hyperbolic flows. Am J Math 95:429–460
Byszewski J, Cornelissen G (2018) Dynamics on abelian varieties in positive characteristic. Algebra Number Theory 12(9):2185–2235
Chothi V, Everest G, Ward T (1997) S-integer dynamical systems: periodic points. J Reine Angew Math (489):99–132
Dolgopyat D (1998) On decay of correlations in Anosov flows. Ann Math (2) 147(2):357–390
Einsiedler M, Ward T (2003) Asymptotic geometry of non-mixing sequences. Ergod Theor Dyn Syst 23(1):75–85
Erdös P (1949) On a new method in elementary number theory which leads to an elementary proof of the prime number theorem. Proc Nat Acad Sci U S A 35:374–384
Erdős P, Turán P (1936) On some sequences of integers. J Lond Math Soc 11:261–264
Everest G, Stangoe V, Ward T (2005) Orbit counting with an isometric direction. In: Algebraic and topological dynamics, volume 385 of Contemp. Math. American Mathematical Society, Providence, pp 293–302
Everest G, Miles R, Stevens S, Ward T (2007) Orbit-counting in non-hyperbolic dynamical systems. J Reine Angew Math 608:155–182
Everest G, Miles R, Stevens S, Ward T (2010) Dirichlet series for finite combinatorial rank dynamics. Trans Am Math Soc 362(1):199–227
Evertse JH, Schlickewei HP (1999) The absolute subspace theorem and linear equations with unknowns from a multiplicative group. In: Number theory in progress, Vol. 1 (Zakopane-Kościelisko, 1997). de Gruyter, Berlin, pp 121–142
Evertse J-H, Schlickewei HP, Schmidt WM (2002) Linear equations in variables which lie in a multiplicative group. Ann Math (2) 155(3):807–836
Ferenczi S, Kamiński B (1995) Zero entropy and directional Bernoullicity of a Gaussian Z2-action. Proc Am Math Soc 123(10):3079–3083
Furstenberg H (1961) Strict ergodicity and transformation of the torus. Am J Math 83:573–601
Furstenberg H (1977) Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions. J Analyse Math 31:204–256
Furstenberg H, Katznelson Y (1978) An ergodic Szemerédi theorem for commuting transformations. J Analyse Math 34:275–291. (1979)
Furstenberg H, Katznelson Y (1985) An ergodic Szemerédi theorem for IP-systems and combinatorial theory. J Analyse Math 45:117–168
Furstenberg H, Weiss B (1978) Topological dynamics and combinatorial number theory. J Analyse Math 34:61–85. (1979)
Gel’fond AO (1960) Transcendental and algebraic numbers. Translated from the first Russian edition by Leo F. Boron. Dover, New York
Goldfeld D (2004) The elementary proof of the prime number theorem: an historical perspective. In: Number theory (New York, 2003). Springer, New York, pp 179–192
Goldston DA, Pintz J, Yildirim CY (2009) Primes in tuples. I. Ann Math (2) 170(2):819–862
Gorodnik A, Spatzier R (2015) Mixing properties of commuting nilmanifold automorphisms. Acta Math 215(1):127–159
Gowers WT (2007) Hypergraph regularity and the multidimensional Szemerédi theorem. Ann Math (2) 166(3):897–946
Green B, Tao T (2008) The primes contain arbitrarily long arithmetic progressions. Ann Math (2) 167(2):481–547
Green B, Tao T (2010) Linear equations in primes. Ann Math 171(3):1753–1850
Haynes A, White CJ (2014) Group automorphisms with prescribed growth of periodic points, and small primes in arithmetic progressions in intervals. Adv Math 252:572–585
Hill TP (1995) Base-invariance implies Benford’s law. Proc Am Math Soc 123(3):887–895
Hindman N (1974) Finite sums from sequences within cells of a partition of ℕ. J Combin Theory Ser A 17:1–11
Hlawka E (1964) Discrepancy and uniform distribution of sequences. Compos Math 16:83–91
Host B, Kra B (2005a) Convergence of polynomial ergodic averages. Israel J Math 149:1–19. Probability in mathematics
Host B, Kra B (2005b) Nonconventional ergodic averages and nilmanifolds. Ann Math (2) 161(1):397–488
Huber H (1959) Zur analytischen Theorie hyperbolischen Raumformen und Bewegungsgruppen. Math Ann 138:1–26
Katok A (1980) Lyapunov exponents, entropy and periodic orbits for diffeomorphisms. Inst Hautes Études Sci Publ Math 51:137–173
Katsuda A, Sunada T (1990) Closed orbits in homology classes. Inst Hautes Études Sci Publ Math 71:5–32
Khanin K, Lopes Dias J, Marklof J (2007) Multidimensional continued fractions, dynamical renormalization and KAM theory. Commun Math Phys 270(1):197–231
Khinchin AI (1964) Continued fractions. The University of Chicago Press, Chicago/London
Knieper G (1997) On the asymptotic geometry of nonpositively curved manifolds. Geom Funct Anal 7(4):755–782
Koopman B (1931) Hamiltonian systems and transformations in Hilbert spaces. Proc Natl Acad Sci U S A 17:315–318
Kuz’min RO (1928) A problem of Gauss. Dokl Akad Nauk 1928:375–380
Lagarias JC (1994) Geodesic multidimensional continued fractions. Proc Lond Math Soc (3) 69(3):464–488
Lalley SP (1987) Distribution of periodic orbits of symbolic and Axiom A flows. Adv Appl Math 8(2):154–193
Lalley SP (1988) The “prime number theorem” for the periodic orbits of a Bernoulli flow. Am Math Mon 95(5):385–398
Ledrappier F (1978) Un champ markovien peut etre d’entropie nulle et mélangeant. C R Acad Sci Paris Ser A-B 287(7):A561–A563
Leibman A (2005) Convergence of multiple ergodic averages along polynomials of several variables. Israel J Math 146:303–315
Levy P (1929) Sur les lois de probabilité dont dependent les quotients complets et incomplets d’une fraction continue. Bull Soc Math France 57:178–194
Lévy P (1936) Sur quelques points de la théorie des probabilités dénombrables. Ann Inst H Poincaré 6(2):153–184
Lind DA (1982) Dynamical properties of quasihyperbolic toral automorphisms. Ergod Theor Dyn Syst 2(1):49–68
Linnik UV (1944) On the least prime in an arithmetic progression. I. The basic theorem. Rec Math [Mat Sbornik] NS 15(57):139–178
Margulis GA (1969) Certain applications of ergodic theory to the investigation of manifolds of negative curvature. Funkcional Anal i Priložen 3(4):89–90
Margulis GA (2004) On some aspects of the theory of Anosov systems. Springer Monographs in Mathematics. Springer, Berlin. With a survey by Richard Sharp: Periodic orbits of hyperbolic flows, Translated from the Russian by Valentina Vladimirovna Szulikowska
Masser DW (2004) Mixing and linear equations over groups in positive characteristic. Israel J Math 142:189–204
Mertens F (1874) Ein Beitrag zur analytischen Zahlentheorie. J Reine Angew Math 78:46–62
Miles R, Ward T (2006) Mixing actions of the rationals. Ergod Theor Dyn Syst 26(6):1905–1911
Miles R, Ward T (2011) A directional uniformity of periodic point distribution and mixing. Discrete Contin Dyn Syst 30(4):1181–1189
Nagle B, Rödl V, Schacht M (2006) The counting lemma for regular k-uniform hypergraphs. Random Struct Algoritm 28(2):113–179
Newcomb S (1881) Note on the frequency of the use of digits in natural numbers. Am J Math 4(1):39–40
Noorani MSM (1999) Mertens’ theorem and closed orbits of ergodic toral automorphisms. Bull Malaysian Math Soc (2) 22(2):127–133
Oxtoby JC (1952) Ergodic sets. Bull Am Math Soc 58:116–136
Parry W (1969) Ergodic properties of affine transformations and flows on nilmanifolds. Am J Math 91:757–771
Parry W (1983) An analogue of the prime number theorem for closed orbits of shifts of finite type and their suspensions. Israel J Math 45(1):41–52
Parry W (1984) Bowen’s equidistribution theory and the Dirichlet density theorem. Ergod Theor Dyn Syst 4(1):117–134
Parry W, Pollicott M (1983) An analogue of the prime number theorem for closed orbits of Axiom A flows. Ann Math 118(3):573–591
Poincaré H (1890) Sur le problème des trois corps et les equations de la Dynamique. Acta Math 13:1–270
Pollicott M, Sharp R (1998) Exponential error terms for growth functions on negatively curved surfaces. Am J Math 120(5):1019–1042
Rado R (1933) Studien zur Kombinatorik. Math Z 36(1):424–470
Ratner M (1973) Markov partitions for Anosov flows on n-dimensional manifolds. Israel J Math 15:92–114
Rohlin VA (1949) On endomorphisms of compact commutative groups. Izvestiya Akad Nauk SSSR Ser Mat 13:329–340
Roth K (1952) Sur quelques ensembles d’entiers. C R Acad Sci Paris 234:388–390
Sárközy A (1978) On difference sets of sequences of integers. III. Acta Math Acad Sci Hungar 31(3–4):355–386
Schlickewei HP (1990) S-unit equations over number fields. Invent Math 102(1):95–107
Schmidt WM (1972) Norm form equations. Ann Math 96:526–551
Schmidt K (1989) Mixing automorphisms of compact groups and a theorem by Kurt Mahler. Pac J Math 137(2):371–385
Schmidt K, Ward T (1993) Mixing automorphisms of compact groups and a theorem of Schlickewei. Invent Math 111(1):69–76
Selberg A (1949) An elementary proof of the prime-number theorem. Ann Math 50(2):305–313
Selberg A (1956) Harmonic analysis and discontinuous groups in weakly symmetric Riemannian spaces with applications to Dirichlet series. J Indian Math Soc (NS) 20:47–87
Sharp R (1991) An analogue of Mertens’ theorem for closed orbits of Axiom A flows. Bol Soc Brasil Mat (NS) 21(2):205–229
Sharp R (1993) Closed orbits in homology classes for Anosov flows. Ergod Theor Dyn Syst 13(2):387–408
Sierpiński W (1910) Sur la valeur asymptotique d’une certaine somme. Bull Int Acad Pol Sci Lett (Cracovie) 1910:9–11
Sinaĭ JG (1966) Asymptotic behavior of closed geodesics on compact manifolds with negative curvature. Izv Akad Nauk SSSR Ser Mat 30:1275–1296
Sinaĭ JG (1968) Construction of Markov partitions. Funkcional Anal i Priložen 2(3):70–80
Smale S (1967) Differentiable dynamical systems. Bull Am Math Soc 73:747–817
Szemerédi E (1969) On sets of integers containing no four elements in arithmetic progression. Acta Math Acad Sci Hungar 20:89–104
Szemerédi E (1975) On sets of integers containing no k elements in arithmetic progression. Acta Arith 27:199–245
Tao T, Ziegler T (2008) The primes contain arbitrarily long polynomial progressions. Acta Math 201(2):213–305
van der Poorten AJ, Schlickewei HP (1991) Additive relations in fields. J Austral Math Soc Ser A 51(1):154–170
van der Waerden BL (1927) Beweis einer Baudet’schen Vermutung. Nieuw Arch Wisk 15:212–216
von Neumann J (1932) Proof of the quasi-ergodic hypothesis. Proc Natl Acad Sci U S A 18:70–82
Waddington S (1991) The prime orbit theorem for quasihyperbolic toral automorphisms. Monatsh Math 112(3):235–248
Waldschmidt M (1993) Minorations de combinaisons linéaires de logarithmes de nombres algébriques. Can J Math 45(1):176–224
Ward T (1997/98) Three results on mixing shapes. N Y J Math 3A (Proceedings of the New York Journal of Mathematics Conference, June 9–13, 1997):1–10
Ward T (1998) Almost all S-integer dynamical systems have many periodic points. Ergod Theor Dyn Syst 18(2):471–486
Ward T (2005) Group automorphisms with few and with many periodic points. Proc Am Math Soc 133(1):91–96
Weyl H (1910) Über die Gibbssche Erscheinung und verwandte Konvergenzphänomene. Rendiconti del Circolo Matematico di Palermo 30:377–407
Weyl H (1916) Über die Gleichverteilung von Zahlen mod Eins. Math Ann 77:313–352
Wiener N (1932) Tauberian theorems. Ann Math 33(1):1–100

Books and Reviews
Arnol’d VI, Avez A (1968) Ergodic problems of classical mechanics. Translated from the French by A. Avez. W. A. Benjamin, New York/Amsterdam
Bergelson V (1996) Ergodic Ramsey theory – an update. In: Ergodic theory of Zd actions (Warwick, 1993–1994), volume 228 of London Math. Soc. Lecture Note Ser. Cambridge University Press, Cambridge, pp 1–61
Bergelson V (2000) Ergodic theory and Diophantine problems. In: Topics in symbolic dynamics and applications (Temuco, 1997), volume 279 of London Math. Soc. Lecture Note Ser. Cambridge University Press, Cambridge, pp 167–205
Bergelson V (2006) Combinatorial and Diophantine applications of ergodic theory. In: Handbook of dynamical systems, vol 1B. Elsevier B. V., Amsterdam, pp 745–869. Appendix A by A. Leibman and Appendix B by Anthony Quas and Máté Wierdl
Cornfeld IP, Fomin SV, Sinaĭ YG (1982) Ergodic theory. Springer, New York
Dajani K, Kraaikamp C (2002) Ergodic theory of numbers, volume 29 of Carus mathematical monographs. Mathematical Association of America, Washington, DC
del Junco A (2008) Ergodic theorems. In: Encyclopedia of complexity and systems science. Springer
Denker M, Grillenberger C, Sigmund K (1976) Ergodic theory on compact spaces. Lecture notes in mathematics, vol 527. Springer, Berlin
Einsiedler M, Ward T (2011) Ergodic theory with a view towards number theory, volume 259 of Graduate texts in mathematics. Springer, London
Ellis R (1969) Lectures on topological dynamics. W. A. Benjamin, New York
Furstenberg H (1981) Recurrence in ergodic theory and combinatorial number theory. Princeton University Press, Princeton. M. B. Porter Lectures
Furstenberg H, Katznelson Y, Ornstein D (1982) The ergodic theoretical proof of Szemerédi’s theorem. Bull Am Math Soc (NS) 7(3):527–552
Glasner E (2003) Ergodic theory via joinings, volume 101 of Mathematical surveys and monographs. American Mathematical Society, Providence
Hejhal DA (1976) The Selberg trace formula and the Riemann zeta function. Duke Math J 43(3):441–482
Iosifescu M, Kraaikamp C (2002) Metrical theory of continued fractions, volume 547 of Mathematics and its applications. Kluwer Academic, Dordrecht
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems, volume 54 of Encyclopedia of mathematics and its applications. Cambridge University Press, Cambridge. With a supplementary chapter by Anatole Katok and Leonardo Mendoza
Kra B (2006) The Green-Tao theorem on arithmetic progressions in the primes: an ergodic point of view. Bull Am Math Soc (NS) 43(1):3–23 (electronic)
Krengel U (1985) Ergodic theorems, volume 6 of de Gruyter studies in mathematics. Walter de Gruyter, Berlin. With a supplement by Antoine Brunel
McCutcheon R (1999) Elemental methods in ergodic Ramsey theory, volume 1722 of Lecture notes in mathematics. Springer, Berlin
Miles R, Staines M, Ward T (2015) Dynamical invariants for group automorphisms. In: Recent trends in ergodic theory and dynamical systems, volume 631 of Contemp. Math. Amer. Math. Soc., Providence, pp 231–258
Petersen K (1989) Ergodic theory, volume 2 of Cambridge studies in advanced mathematics. Cambridge University Press, Cambridge. Corrected reprint of the 1983 original
Schmidt K (1995) Dynamical systems of algebraic origin, volume 128 of Progress in Mathematics. Birkhäuser, Basel
Schmidt K (2001) The dynamics of algebraic ℤd-actions. In: European congress of mathematics, Vol. I (Barcelona, 2000), volume 201 of Progr. Math. Birkhäuser, Basel, pp 543–553
Schweiger F (1995) Ergodic theory of fibred systems and metric number theory. Oxford Science Publications. The Clarendon Press/Oxford University Press, New York
Schweiger F (2000) Multidimensional continued fractions. Oxford Science Publications. Oxford University Press, Oxford
Silverman JH (2007) The arithmetic of dynamical systems, volume 241 of Graduate texts in mathematics. Springer, New York
Tao T (2006) Arithmetic progressions and the primes. Collect. Math., vol. Extra:37–88
Tao T (2007a) The dichotomy between structure and randomness, arithmetic progressions, and the primes. In: International congress of mathematicians, vol I. European Mathematical Society, Zürich, pp 581–608
Tao T (2007b) What is good mathematics. Bull Am Math Soc (NS) 44(4):623–634
Totoki H (1969) Ergodic theory. Lecture notes series, No. 14. Matematisk Institut, Aarhus Universitet, Aarhus
van der Waerden BL (1971) How the proof of Baudet’s conjecture was found. In: Studies in pure mathematics (presented to Richard Rado). Academic, London, pp 251–260
Walters P (1982) An introduction to ergodic theory, volume 79 of Graduate texts in mathematics. Springer, New York
Weil A (1967) Basic number theory. Die Grundlehren der mathematischen Wissenschaften, vol 144. Springer, New York
Ziegler T (2014) Linear equations in primes and dynamics of nilmanifolds. In: Proceedings of the international congress of mathematicians, Seoul 2014, vol II. Kyung Moon Sa, Seoul, pp 569–589

Ergodic Theory on Homogeneous Spaces and Metric Number Theory

Dmitry Kleinbock
Department of Mathematics, Brandeis University, Waltham, MA, USA

Metric number theory
Metric number theory (or, specifically, metric Diophantine approximation) refers to the study of sets of real numbers or vectors with prescribed Diophantine approximation properties.

Definition of the Subject
Here is a brief outline of the rest of the article. In the next section, we survey basic results, some classical, some obtained relatively recently, in metric Diophantine approximation. Section “Connection with Dynamics on the Space of Lattices” is devoted to a description of the connection between Diophantine approximation and dynamics, specifically flows on the space of lattices. In sections “Diophantine Approximation with Dependent Quantities: The Set-Up” and “Further Results,” we specialize to the set-up of Diophantine approximation on manifolds or, more generally, approximation properties of vectors with respect to measures satisfying some natural conditions and show how applications of homogeneous dynamics contributed to important recent developments in the field. Section “Future Directions” mentions several open questions and directions for further investigation.

Basic Facts

General references for this section: (Cassels 1957; Schmidt 1980). The starting point for all the explorations in metric number theory is Dirichlet’s Theorem (1842), stating that for any $Y \in M_{m,n}$ and for any $T > 1$, there exist $q = (q_1, \dots, q_n) \in \mathbb{Z}^n \setminus \{0\}$ and $p = (p_1, \dots, p_m) \in \mathbb{Z}^m$ satisfying the following system of inequalities:

\[ \| Yq - p \| < T^{-n/m} \quad \text{and} \quad \| q \| \le T. \tag{5} \]

In fact, it is this paper of Dirichlet which gave rise to his box principle. Later another proof of the same result was given by Minkowski.

A natural question to ask is whether one can improve (5) by replacing $T^{-n/m}$ by a smaller function, that is, consider the following system of inequalities:

\[ \| Yq - p \| < \psi(T) \quad \text{and} \quad \| q \| \le T, \tag{5a} \]

where $\psi$ is a positive, continuous, decreasing function which decays to zero at infinity. Historically, there have been two directions to pursue in this regard: looking for solvability of (5a) for an unbounded set of $T > 0$ versus for all large enough $T$. The former is sometimes referred to as asymptotic approximation and amounts to studying sets $W_{m,n}(\psi)$ mentioned in the previous section. The latter, less studied set-up of uniform approximation, has attracted some attention in recent years. Both set-ups can be approached using tools from dynamics on the space of lattices, as we shall see below.

The simplest choice for functions $\psi$ happens to be the following: let us denote

\[ \psi_{c,v}(x) = c x^{-v}. \]

From Dirichlet’s Theorem, it easily follows that $W_{m,n}(\psi_{1,n/m}) = M_{m,n}$. The constant $c = 1$ is not optimal: the smallest value of $c$ for which $W_{1,1}(\psi_{c,1}) = \mathbb{R}$ is $1/\sqrt{5}$, and the optimal constants are not known in higher dimensions, although some estimates can be given, see (Schmidt 1980).

Systems of linear forms which do not belong to $W_{m,n}(\psi_{c,n/m})$ for some positive $c$ are called badly approximable; that is, we set

\[ \mathrm{BA}_{m,n} \overset{\mathrm{def}}{=} M_{m,n} \setminus \bigcap_{c>0} W_{m,n}(\psi_{c,n/m}). \]

Their existence in arbitrary dimensions was shown by Perron. Note that a real number $y$ ($m = n = 1$) is badly approximable if and only if its continued fraction coefficients are uniformly bounded. It was proved by Jarnik (1929) in the case $m = n = 1$ and by Schmidt in the general case (Schmidt 1969) that badly approximable matrices form a set of full Hausdorff dimension: that is, $\dim(\mathrm{BA}_{m,n}) = mn$.

On the other hand, it can be shown that each of the sets $W_{m,n}(\psi_{c,n/m})$ for any $c > 0$ has full Lebesgue measure, and hence, the complement $\mathrm{BA}_{m,n}$ to their intersection has measure zero. This is a special case of a theorem due to Khintchine (1924) in the case $n = 1$ and to Groshev (1938) in full generality, which gives the precise condition on the function $\psi$ under which the set of $\psi$-approximable matrices has full measure. Namely, if $\psi$ is nonincreasing (this
assumption can be removed in higher dimensions but not for $n = 1$, see (Duffin and Schaeffer 1941)), then $\lambda$-almost no (resp., $\lambda$-almost every) $Y \in M_{m,n}$ is $\psi$-approximable, provided the sum

\[ \sum_{k=1}^{\infty} k^{n-1} \psi(k)^m \tag{6} \]

converges (resp., diverges). (Here and hereafter $\lambda$ stands for Lebesgue measure.) This statement is usually referred to as the Khintchine–Groshev Theorem. The convergence case of this theorem follows in a straightforward manner from the Borel–Cantelli Lemma, but the divergence case is harder. It was reproved and sharpened in 1960 by Schmidt (1960), who showed that if the sum (6) diverges, then for almost all $Y$, the number of solutions to (4) with $\|q\| \le T$ is asymptotic to the partial sum of the series (6) (up to a constant) and also gave an estimate for the error term.

A special case of the convergence part of the theorem shows that $W_{m,n}(\psi_{1,v})$ has measure zero whenever $v > n/m$. $Y$ is said to be very well approximable if it belongs to $W_{m,n}(\psi_{1,v})$ for some $v > n/m$. That is,

\[ \mathrm{VWA}_{m,n} \overset{\mathrm{def}}{=} \bigcup_{v > n/m} W_{m,n}(\psi_{1,v}). \]

More specifically, let us define the Diophantine exponent $\omega(Y)$ of $Y$ (sometimes called “the exact order” of $Y$) to be the supremum of $v > 0$ for which $Y \in W_{m,n}(\psi_{1,v})$. Then $\omega(Y)$ is never less than $n/m$ and is equal to $n/m$ for Lebesgue-a.e. $Y$; in fact, $\mathrm{VWA}_{m,n} = \{Y \in M_{m,n} : \omega(Y) > n/m\}$. The Hausdorff dimension of the null sets $W_{m,n}(\psi_{1,v})$ was computed independently by Besicovitch (1929) and Jarnik (1928) in the one-dimensional case and by Dodson (1992) in general: when $v > n/m$, one has

\[ \dim W_{m,n}(\psi_{1,v}) = (n - 1)m + \frac{m + n}{v + 1}. \tag{7} \]

See (Dodson 1993) for a nice exposition of ideas involved in the proof of both the aforementioned formula and the Khintchine–Groshev Theorem. Note that it follows from (7) that the null set $\mathrm{VWA}_{m,n}$ has full Hausdorff dimension. Matrices contained in the intersection

\[ \bigcap_{v>0} W_{m,n}(\psi_{1,v}) = \{Y \in M_{m,n} : \omega(Y) = \infty\} \]

are called Liouville and form a set of Hausdorff dimension $(n - 1)m$, that is, equal to the dimension of the set of $Y$ for which $Yq \in \mathbb{Z}^m$ for some $q \in \mathbb{Z}^n \setminus \{0\}$ (the latter belongs to $W_{m,n}(\psi)$ for any positive $\psi$).

Metric theory of uniform approximation was initiated by Davenport and Schmidt in (1969/1970, 1970). Let us say that $Y \in M_{m,n}$ is $\psi$-Dirichlet (notation: $Y \in D_{m,n}(\psi)$) if the system of inequalities (5a) has solutions in $(p, q) \in \mathbb{Z}^m \times (\mathbb{Z}^n \setminus \{0\})$ for all sufficiently large $T$. (Again, this definition is slightly different from the one used in several papers on the subject (Kleinbock and Wadleigh 2018, 2019; Kleinbock et al. 2021a), where powers of norms were considered.) Dirichlet’s Theorem asserts that $D_{m,n}(\psi_{1,n/m}) = M_{m,n}$. However, the constant 1 is now optimal: it was shown in (Davenport and Schmidt 1970) that the Lebesgue measure of $D_{m,n}(\psi_{c,n/m})$ is zero for any $c < 1$. This phenomenon can be easily explained by means of dynamics on homogeneous spaces, see (Dani 1985; Kleinbock and Weiss 2008). $Y$ is said to be Dirichlet-Improvable if it belongs to the null set $\mathrm{DI}_{m,n} \overset{\mathrm{def}}{=} \bigcup_{c<1} D_{m,n}(\psi_{c,n/m})$, and singular if it belongs to $\mathrm{SING}_{m,n} \overset{\mathrm{def}}{=} \bigcap_{c>0} D_{m,n}(\psi_{c,n/m})$. Note that Schmidt also showed that $\mathrm{DI}_{m,n}$ contains $\mathrm{BA}_{m,n}$, hence is a set of full Hausdorff dimension.

Note also that all the aforementioned properties behave nicely with respect to transposition; this is described by the so-called Khintchine’s Transference Principle (Chapter V in (Cassels 1957)). For example, $Y \in \mathrm{BA}_{m,n}$ if and only if $Y^T \in \mathrm{BA}_{n,m}$, and $Y \in \mathrm{VWA}_{m,n}$ if and only if $Y^T \in \mathrm{VWA}_{n,m}$. In particular, many problems related to approximation properties of vectors ($n = 1$) and linear forms ($m = 1$) reduce to one another.

We refer the readers to (Beresnevich et al. 2006; Harman 1998) for very detailed and comprehensive accounts of various further aspects of the theory.
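Several of the statements in this section can be explored numerically in the simplest case $m = n = 1$. The sketch below is only an illustration under stated assumptions: it uses exact rational arithmetic, takes $T$ to be a positive integer, and the function names `dirichlet_solution`, `partial_quotients`, and `dimension_formula_7` are hypothetical helpers, not notation from the text. The first function searches for a pair $(p, q)$ satisfying (5), the second lists continued fraction coefficients (whose boundedness characterizes badly approximable numbers), and the third evaluates the right-hand side of (7).

```python
from fractions import Fraction
from math import floor


def dirichlet_solution(y, T):
    """Search for (p, q) with 1 <= q <= T and |q*y - p| < 1/T (case m = n = 1).

    Assumes T is a positive integer and y is a Fraction, so the test is exact.
    """
    bound = Fraction(1, T)  # T^{-n/m} with m = n = 1
    for q in range(1, T + 1):
        p = round(q * y)  # nearest integer to q*y
        if abs(q * y - p) < bound:
            return p, q
    return None


def partial_quotients(y, count):
    """Return up to `count` continued fraction coefficients of the rational y."""
    coeffs = []
    for _ in range(count):
        a = floor(y)
        coeffs.append(a)
        frac = y - a
        if frac == 0:  # expansion of a rational number terminates
            break
        y = 1 / frac
    return coeffs


def dimension_formula_7(m, n, v):
    """Evaluate (n - 1)m + (m + n)/(v + 1), the right-hand side of (7)."""
    v = Fraction(v)
    if v <= Fraction(n, m):
        raise ValueError("formula (7) is stated only for v > n/m")
    return (n - 1) * m + Fraction(m + n) / (v + 1)


if __name__ == "__main__":
    y = Fraction(3141592653, 10**9)  # a rational stand-in for pi
    print(dirichlet_solution(y, 100))
    print(partial_quotients(Fraction(13, 8), 10))
    print(dimension_formula_7(1, 1, 2))
```

For $m = n = 1$, formula (7) reduces to $2/(v + 1)$, the value found by Jarnik and Besicovitch, and a ratio of consecutive Fibonacci numbers such as $13/8$ has small coefficients throughout, in line with the bounded-coefficient description of badly approximable numbers.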
of lattices $g_i\mathbb{Z}^k$ goes to infinity in $\Omega_k$, there exists a sequence $\{v_i \in \mathbb{Z}^k \setminus \{0\}\}$ such that $g_i(v_i) \to 0$ as $i \to \infty$. Equivalently, for $\varepsilon > 0$ consider a subset $K_\varepsilon$ of $\Omega_k$ consisting of lattices with no nonzero vectors of norm less than $\varepsilon$; then all the sets $K_\varepsilon$ are compact, and every compact subset of $\Omega_k$ is contained in one of them. Moreover, one can choose a metric on $\Omega_k$ such that $\operatorname{dist}(\Lambda, \mathbb{Z}^k)$ is, up to a uniform multiplicative constant, equal to $-\log \min_{v \in \Lambda \setminus \{0\}} \|v\|$ (see (Ding 1994)); then the length of the smallest nonzero vector in a lattice $\Lambda$ will determine how far away this lattice is in the “cusp” of $\Omega_k$. Note that Minkowski’s Convex Body Theorem shows that $K_\varepsilon = \varnothing$ when $\varepsilon > 1$. On the other hand, when $\varepsilon < 1$, each of the sets $K_\varepsilon$ has nonempty interior, and $K_1 = \bigcap_{\varepsilon < 1} K_\varepsilon$ is nonempty, but has empty interior and zero Haar measure. It is called the critical locus for the supremum norm, and its structure is described by the Hajós–Minkowski Theorem.

Using Mahler’s Criterion, it is not hard to show that $Y \in \mathrm{BA}_{m,n}$ if and only if the trajectory

$$\{ g_t L_Y \mathbb{Z}^k : t \in \mathbb{R}_+ \} \tag{11}$$

is bounded in $\Omega_k$. This was proved by Dani (1985) and later generalized in (Kleinbock and Margulis 1999) to produce a criterion for $Y$ to be $\psi$-approximable for any nonincreasing function $\psi$. Namely, given $\psi$ one can in a unique way define the function $r(t)$ such that $Y \in W_{m,n}(\psi)$ if and only if $g_t L_Y \mathbb{Z}^k \notin K_{r(t)}$ for an unbounded set of $t > 0$. An important special case is a criterion for a system of linear forms to be very well approximable: $Y \in \mathrm{VWA}_{m,n}$ if and only if the trajectory (11) has linear growth, that is, there exists a positive $\gamma$ such that $\operatorname{dist}(g_t L_Y \mathbb{Z}^k, \mathbb{Z}^k) > \gamma t$ for an unbounded set of $t > 0$. Similarly, one can produce a uniform version, that is, a criterion for $Y$ to be $\psi$-Dirichlet: $Y \in D_{m,n}(\psi)$ if and only if $g_t L_Y \mathbb{Z}^k \notin K_{r(t)}$ for all large enough $t > 0$. In particular, $Y$ is singular if and only if $\{g_t L_Y \mathbb{Z}^k\}$ is divergent and is Dirichlet-Improvable if and only if $g_t L_Y \mathbb{Z}^k$ is not in $K_\varepsilon$ for some $\varepsilon > 0$; that is, the $g_t$-trajectory of $L_Y \mathbb{Z}^k$ eventually stays away from the critical locus $K_1$.

This correspondence allows one to link various Diophantine and dynamical phenomena. For example, from the results of (Kleinbock and Margulis 1996) on abundance of bounded orbits on homogeneous spaces, one can deduce the aforementioned theorem of Schmidt (1969): the set $\mathrm{BA}_{m,n}$ has full Hausdorff dimension. See also Kleinbock and Weiss (2013) where a winning property of $\mathrm{BA}_{m,n}$ is established by dynamical methods. A dynamical Borel–Cantelli Lemma established in (Kleinbock and Margulis 1999) can be used for an alternative proof of the Khintchine–Groshev Theorem; see also Sullivan (1982) for an earlier geometric approach. Note that establishing a uniform approximation analogue of the Khintchine–Groshev Theorem, that is, finding a criterion for sets $D_{m,n}(\psi)$ to have zero or full measure, is still an open problem; it was solved in (Kleinbock and Wadleigh 2018) for $m = n = 1$ using continued fractions, and a partial solution for the general case was developed in Kleinbock et al. (2021a). It is worthwhile to point out that all the aforementioned proofs are based on the following two properties of the $g_t$-action: effective mixing, which forces points to return to compact subsets and makes preimages of cusp neighborhoods quasi-independent, and hyperbolicity, which implies that the behavior of points on unstable leaves is generic. The latter is important since the orbits of the group $\{L_Y : Y \in M_{m,n}\}$ are precisely the unstable leaves with respect to the $g_t$-action.

$$\|Yq - p\| < \psi(T) \quad \text{and} \quad \|q\| \le T, \tag{11a}$$

We note that other types of Diophantine problems, such as conjectures of Oppenheim and Littlewood mentioned in the previous section, can be reduced to statements involving actions on $\Omega_k$ by means of the same principle: Mahler’s Criterion is used to relate small values of some function at integer points to excursions to infinity in $\Omega_k$ of orbit of the stabilizer of this function. In particular, Littlewood’s Conjecture deals with multiplicative approximation and is related to a multi-parameter action by a certain semigroup of diagonal matrices, see the section “Further
Results” part of section “Diophantine Approximation with Dependent Quantities: The Set-Up.”

Other important and useful recent applications of homogeneous dynamics to metric Diophantine approximation are related to the circle of ideas roughly called “Diophantine approximation with dependent quantities” (terminology borrowed from (Sprindžuk 1979)), to be surveyed in the next two sections.

Further Results

Overall there have been numerous developments utilizing the dynamical approach to Diophantine problems. For example, the Hausdorff dimension of the null set $\mathrm{SING}_{m,n}$ of singular systems of linear forms was computed for $m = 2$, $n = 1$ by Cheung (2011) and then for $n = 1$ and arbitrary $m$ by Cheung and Chevallier (2016). In the general case, the sharp upper estimate was obtained in (Kadyrov et al. 2017), where dynamics was used via the method of integral inequalities for height functions on the space of lattices originally developed in (Eskin et al. 1998); those are also known as Margulis functions, see (Eskin and Mozes 2022) for a recent survey. A refinement of the technique from (Kadyrov et al. 2017), combined with the exponential mixing of the $g_t$-action on $\Omega_k$, produces a proof of the Dimension Drop Conjecture (Kleinbock and Mirzadeh 2020); its byproduct is a sharpening of the Davenport–Schmidt result, namely, establishing that

$$\dim D_{m,n}(\psi_{c,n/m}) < mn \quad \text{for any } c < 1.$$

The complementary lower estimate for the dimension of the set of singular matrices, producing the equality

$$\dim(\mathrm{SING}_{m,n}) = mn - \frac{mn}{m+n},$$

was established in (Das et al. 2019) using the method of parametric geometry of numbers, stemming from the work of Schmidt–Summerer (2013) and Roy (2015). The latter method is a powerful tool that gives a strong quantitative way to relate Diophantine properties of matrices $Y$ to the behavior of the trajectory $\{g_t L_Y \mathbb{Z}^k\}$. One of its applications is a better understanding of uniform Diophantine exponents of matrices; another one is showing that the set $\mathrm{DI}_{m,n} \setminus (\mathrm{BA}_{m,n} \cup \mathrm{SING}_{m,n})$ is uncountable, see (Beresnevich et al. 2020).

In another development, the correspondence between approximation and dynamics can be extended to inhomogeneous approximation, that is, studying values $Yq + z + p$ of systems of affine forms (here $Y \in M_{m,n}$ and $z \in \mathbb{R}^m$) at integer points $(q, p)$. This way one obtains a zero-infinity law for improvement of Dirichlet’s theorem, see (Kleinbock and Wadleigh 2019) and also (Kim and Kim 2022) for an alternative proof and a version for Hausdorff measures.

For recent results utilizing the correspondence between Diophantine approximation and homogeneous dynamics, see (Athreya et al. 2015; Dolgopyat et al. 2017; Björklund and Gorodnik 2019; Shapira and Weiss 2022) among many other papers.

Diophantine Approximation with Dependent Quantities: The Set-Up

General references for this section: (Bernik and Dodson 1999; Sprindžuk 1979).

Here we restrict ourselves to Diophantine properties of vectors in $\mathbb{R}^n$. In particular, we will look more closely at the set of very well approximable vectors, which we will simply denote by VWA, dropping the subscripts. In many cases, it does not matter whether one works with row or column vectors, in view of the duality remark made at the end of section “Basic Facts.” Note however that during recent years there has been a substantial progress toward establishing analogues of results from this section to the space of matrices (this was mentioned as work in progress in [Kleinbock and Margulis 1998]), see (Kleinbock et al. 2010; Beresnevich et al. 2015; Aka et al. 2018; Das et al. 2018; Yang 2019).

We begin with a non-example of an application of dynamics to Diophantine approximation: a celebrated and difficult theorem which currently, to the best of the author’s knowledge, has no dynamical proof. Suppose that $y = (y_1, \ldots, y_n) \in \mathbb{R}^n$ is such that each $y_i$ is algebraic and $1, y_1, \ldots, y_n$ are
linearly independent over $\mathbb{Q}$. It was established by Roth for $n = 1$ (Roth 1955) and then generalized to arbitrary $n$ by Schmidt (1972) that $y$ as above necessarily belongs to the complement of VWA. In other words, vectors with very special algebraic properties happen to follow the behavior of a generic vector in $\mathbb{R}^n$.

We would like to view the above example as a special case of a general class of problems. Namely, suppose we are given a Radon measure $\mu$ on $\mathbb{R}^n$. Let us say that $\mu$ is extremal (Sprindžuk 1980) if $\mu$-a.e. $y \in \mathbb{R}^n$ is not very well approximable. Further, define the Diophantine exponent $\omega(\mu)$ of $\mu$ to be the $\mu$-essential supremum of the function $\omega(\cdot)$; in other words,

$$\omega(\mu) \overset{\mathrm{def}}{=} \sup\{ v \mid \mu(W(\psi_{1,v})) > 0 \}.$$

Clearly it only depends on the measure class of $\mu$. If $\mu$ is naturally associated with a subset $M$ of $\mathbb{R}^n$ supporting $\mu$ (e.g., if $M$ is a smooth submanifold of $\mathbb{R}^n$ and $\mu$ is the measure class of the Riemannian volume on $M$, or, equivalently, the pushforward $f_*\lambda$ of $\lambda$ by a smooth map $f$ parametrizing $M$), one defines the Diophantine exponent $\omega(M)$ of $M$ to be equal to that of $\mu$ and says that $M$ is extremal if $f(x)$ is not very well approximable for $\mu$-a.e. $x$.

Then $\omega(\mu) \ge n$ for any $\mu$, and $\omega(\lambda) = \omega(\mathbb{R}^n)$ is equal to $n$. The latter justifies the use of the word “extremal”: $\mu$ is extremal if $\omega(\mu)$ is equal to $n$, that is, attains the smallest possible value. The aforementioned results of Roth and Schmidt then can be interpreted as the extremality of atomic measures supported on algebraic vectors without rational dependence relations.

Historically, the first measure (other than $\lambda$) to be considered in the set-up described above was the pushforward of $\lambda$ by the map

$$f(x) = (x, x^2, \ldots, x^n). \tag{12}$$

The extremality of $f_*\lambda$ for $f$ as above was conjectured in 1932 by K. Mahler (1932) and
very algebraic.” At about the same time, Schmidt (1964) proved the extremality of $f_*\lambda$ when $f : I \to \mathbb{R}^2$, $I \subset \mathbb{R}$, is $C^3$ and satisfies

$$\begin{vmatrix} f_1'(x) & f_2'(x) \\ f_1''(x) & f_2''(x) \end{vmatrix} \ne 0 \quad \text{for } \lambda\text{-a.e. } x \in I;$$

in other words, the curve parametrized by $f$ has nonzero curvature at almost all points. Since then, a lot of attention has been devoted to showing that measures $f_*\lambda$ are extremal for other smooth maps $f$.

To describe a broader class of examples, recall the following definition. Let $x \in \mathbb{R}^d$ and let $f = (f_1, \ldots, f_n)$ be a $C^k$ map from a neighborhood of $x$ to $\mathbb{R}^n$. Say that $f$ is nondegenerate at $x$ if $\mathbb{R}^n$ is spanned by partial derivatives of $f$ at $x$ up to some order. Say that $f$ is nondegenerate if it is nondegenerate at $\lambda$-a.e. $x$. It was conjectured by Sprindžuk (1979) in 1980 that $f_*\lambda$ for real analytic nondegenerate $f$ are extremal. Many special cases were established since then (see (Bernik and Dodson 1999) for a detailed exposition of the theory and many related results), but the general case stood open until the mid-1990s (Kleinbock and Margulis 1998), when Sprindžuk’s conjecture was proved using the dynamical approach (later Beresnevich (2002) succeeded in establishing and extending this result without use of dynamics).

The proof in (Kleinbock and Margulis 1998) uses the correspondence outlined in the previous section plus a measure estimate for flows on the space of lattices which is described below.

In the subsequent work, the method of (Kleinbock and Margulis 1998) was adapted to a much broader class of measures. To define it we need to introduce some more notation and definitions. If $x \in \mathbb{R}^d$ and $r > 0$, denote by $B(x, r)$ the open ball of radius $r$ centered at $x$. If $B = B(x, r)$ and $c > 0$, $cB$ will denote the ball $B(x, cr)$. For $B \subset \mathbb{R}^d$ and a real-valued function $f$ on $B$, let

$$\|f\|_B \overset{\mathrm{def}}{=} \sup_{x \in B} |f(x)|.$$
open. If $D > 0$ and $U \subset \mathbb{R}^d$ is an open subset, let us say that $\nu$ is $D$-Federer on $U$ if for any ball $B \subset U$ centered at $\operatorname{supp} \nu$, one has $\frac{\nu(3B)}{\nu(B)} < D$ whenever $3B \subset U$. This condition is often called “doubling” in the literature. See (Kleinbock et al. 2004; Mauldin and Urbański 1996) for examples and references. $\nu$ is called Federer if for $\nu$-a.e. $x \in \mathbb{R}^d$, there exist a neighborhood $U$ of $x$ and $D > 0$ such that $\nu$ is $D$-Federer on $U$.

Given $C, \alpha > 0$, open $U \subset \mathbb{R}^d$, and a measure $\nu$ on $U$, a function $f : U \to \mathbb{R}$ is called $(C, \alpha)$-good on $U$ with respect to $\nu$ if for any ball $B \subset U$ centered in $\operatorname{supp} \nu$ and any $\varepsilon > 0$, one has

$$\nu(\{x \in B : |f(x)| < \varepsilon\}) \le C \left( \frac{\varepsilon}{\|f\|_{\nu,B}} \right)^{\alpha} \nu(B). \tag{13}$$

This condition was formally introduced in (Kleinbock and Margulis 1998) for $\nu$ being Lebesgue measure and in (Kleinbock et al. 2004) for arbitrary $\nu$. A basic example is given by polynomials, and the upshot of the above definition is the formalization of a property needed for the proof of several basic facts (Dani 1979, 1986; Margulis 1975) about polynomial maps into the space of lattices. In (Kleinbock et al. 2004) a strengthening of this property was considered: $f$ was called absolutely $(C, \alpha)$-good on $U$ with respect to $\nu$ if for $B$ and $\varepsilon$ as above one has

$$\nu(\{x \in B : |f(x)| < \varepsilon\}) \le C \left( \frac{\varepsilon}{\|f\|_{B}} \right)^{\alpha} \nu(B). \tag{14}$$

There is no difference between (13) and (14) when $\nu$ has full support, but it turns out to be useful for describing measures supported on proper (e.g., fractal) subsets of $\mathbb{R}^d$.

Now suppose that we are given a measure $\nu$ on $\mathbb{R}^d$, an open $U \subset \mathbb{R}^d$ with $\nu(U) > 0$, and a map $f = (f_1, \ldots, f_n) : \mathbb{R}^d \to \mathbb{R}^n$. Following (Kleinbock and Weiss 2008), say that a pair $(f, \nu)$ is (absolutely) good on $U$ if any linear combination of $1, f_1, \ldots, f_n$ is (absolutely) $(C, \alpha)$-good on $U$ with respect to $\nu$. If for $\nu$-a.e. $x$, there exists a neighborhood $U$ of $x$ and $C, \alpha > 0$ such that $\nu$ is (absolutely) $(C, \alpha)$-good on $U$, we will say that the pair $(f, \nu)$ is (absolutely) good.

Another relevant notion is the nonplanarity of $(f, \nu)$. Namely, $(f, \nu)$ is said to be nonplanar if whenever $B$ is a ball with $\nu(B) > 0$, the restrictions of $1, f_1, \ldots, f_n$ to $B \cap \operatorname{supp} \nu$ are linearly independent over $\mathbb{R}$; in other words, $f(B \cap \operatorname{supp} \nu)$ is not contained in any proper affine subspace of $\mathbb{R}^n$. Note that absolutely good implies both good and nonplanar, but the converse is in general not true.

Many examples of (absolutely) good and nonplanar pairs $(f, \nu)$ can be found in the literature. Already the case $n = d$ and $f = \mathrm{Id}$ is very interesting. A measure $\mu$ on $\mathbb{R}^n$ is said to be friendly (respectively, absolutely friendly) if and only if it is Federer and the pair $(\mathrm{Id}, \mu)$ is good and nonplanar (respectively, absolutely good). See (Kleinbock et al. 2004; Stratmann and Urbański 2006; Urbański 2005) for many examples. An important class of measures is given by limit measures of irreducible system of self-similar or self-conformal contractions satisfying the Open Set Condition (Hutchinson 1981); those are shown to be absolutely friendly in (Kleinbock et al. 2004). The prime example is the middle-third Cantor set on the real line. The term “friendly” was cooked up as a loose abbreviation for “Federer, nonplanar and decaying” and later proved to be particularly friendly in dealing with problems arising in metric number theory, see, for example, (Fishman 2006).

Also let us say that a pair $(f, \nu)$ is nondegenerate if $f$ is nondegenerate at $\nu$-a.e. $x$. When $\nu$ is Lebesgue measure on $\mathbb{R}^d$, it is proved in Proposition 3.4 in (Kleinbock and Margulis 1998) that a nondegenerate $(f, \nu)$ is good and nonplanar. The same conclusion is derived in Proposition 7.3 in (Kleinbock et al. 2004), assuming that $\nu$ is absolutely friendly. Thus, volume measures on smooth nondegenerate manifolds are friendly, but not absolutely friendly.

It turns out that all the aforementioned examples of measures can be proved to be extremal by a generalization of the argument from (Kleinbock and Margulis 1998). Specifically, let $\nu$ be a Federer measure on $\mathbb{R}^d$, $U$ an open subset of $\mathbb{R}^d$, and $f : U \to \mathbb{R}^n$ a continuous map such that the pair $(f, \nu)$ is good and nonplanar, then $f_*\nu$ is extremal.
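
As a purely numerical illustration of the $(C, \alpha)$-good inequality (13) for Lebesgue measure on the real line, the following sketch samples a polynomial on an interval and compares the relative size of its sublevel set with the right-hand side of (13). It is not taken from the cited papers. The helper names `poly_eval`, `km_constant`, and `sublevel_check` are hypothetical, and the constants $C = 2k(k+1)^{1/k}$, $\alpha = 1/k$ for polynomials of degree at most $k$ are the ones usually attributed to (Kleinbock and Margulis 1998); they are used here as an assumption rather than as a verified quotation.

```python
def poly_eval(coeffs, x):
    """Evaluate a polynomial by Horner's rule; coeffs run from the highest degree down."""
    acc = 0.0
    for c in coeffs:
        acc = acc * x + c
    return acc


def km_constant(k):
    """Assumed goodness constant C = 2k(k+1)^(1/k) for degree <= k (see lead-in)."""
    return 2 * k * (k + 1) ** (1.0 / k)


def sublevel_check(coeffs, center, radius, eps, samples=20001):
    """Return (fraction, bound) for the ball B = (center - radius, center + radius).

    fraction approximates lambda({x in B : |f(x)| < eps}) / lambda(B) on a uniform grid;
    bound is C * (eps / sup_B |f|)^alpha with alpha = 1/k, k = degree >= 1.
    """
    k = len(coeffs) - 1
    step = 2.0 * radius / (samples - 1)
    values = [abs(poly_eval(coeffs, center - radius + i * step)) for i in range(samples)]
    fraction = sum(1 for v in values if v < eps) / samples
    sup_norm = max(values)  # grid approximation of ||f||_B
    bound = km_constant(k) * (eps / sup_norm) ** (1.0 / k)
    return fraction, bound


if __name__ == "__main__":
    # f(x) = x^2 on B = (-1, 1): the sublevel set is (-sqrt(eps), sqrt(eps)).
    for eps in (1e-1, 1e-2, 1e-3):
        fraction, bound = sublevel_check([1.0, 0.0, 0.0], 0.0, 1.0, eps)
        print(f"eps={eps:g}  fraction={fraction:.4f}  bound={bound:.4f}")
```

For $f(x) = x^2$ the fraction decays like $\varepsilon^{1/2}$, which matches the exponent $\alpha = 1/k$ with $k = 2$. The check concerns only the goodness hypothesis for a single function; it says nothing by itself about the extremality statement above.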
This can be derived from the Borel–Cantelli Lemma, the correspondence described in the previous section, and the following measure estimate: if $\nu$, $U$, and $f$ are as above, then for $\nu$-a.e. $x_0 \in U$, there exists a ball $B \subset U$ centered at $x_0$ and $C, \alpha > 0$ such that for any $t \in \mathbb{R}_+$ and any $\varepsilon > 0$,

$$\nu(\{x \in B : g_t L_{f(x)} \mathbb{Z}^{n+1} \notin K_\varepsilon\}) < C \varepsilon^{\alpha}. \tag{15}$$

Here $g_t$ is as in (9) with $m = 1$ (assuming that the row vector viewpoint is adopted). This is a quantitative way of saying that for fixed $t$, the “flow” $x \mapsto g_t L_{f(x)} \mathbb{Z}^{n+1}$, $B \to \Omega_{n+1}$, cannot diverge and in fact must spend a big (uniformly in $t$) proportion of time inside compact sets $K_\varepsilon$.

The inequality (15) is derived from a general “quantitative non-divergence” estimate, which can be thought of as a substantial generalization of theorems of Margulis and Dani (Dani 1979, 1986; Margulis 1975) on nondivergence of unipotent flows on homogeneous spaces. One of its most general versions (Kleinbock et al. 2004) deals with a measure $\nu$ on $\mathbb{R}^d$, a continuous map $h : \tilde{B} \to G$, where $B$ is a ball in $\mathbb{R}^d$ centered at $\operatorname{supp} \nu$ and $G$ is as in (10). To describe the assumptions on $h$, one needs to employ the combinatorial structure of lattices in $\mathbb{R}^k$, and it will be convenient to use the following notation: if $V$ is a nonzero rational subspace of $\mathbb{R}^k$ and $g \in G$, define $\ell_V(g)$ to be the covolume of $g(V \cap \mathbb{Z}^k)$ in $gV$. Then, given positive constants $C, D, \alpha$, there exists $C_1 = C_1(d, k, C, \alpha, D) > 0$ with the following property. Suppose $\nu$ is $D$-Federer on $\tilde{B}$, $0 < \rho \le 1$, and $h$ is such that for each rational $V \subset \mathbb{R}^k$, (i) $\ell_V \circ h$ is $(C, \alpha)$-good on $\tilde{B}$ with respect to $\nu$, and (ii) $\|\ell_V \circ h\|_{\nu,B} \ge \rho$, where $\tilde{B} = 3^{(k-1)} B$. Then (iii) for any positive $\varepsilon \le \rho$, one has

$$\nu(\{x \in B : h(x)\mathbb{Z}^k \notin K_\varepsilon\}) \le C_1 \left( \frac{\varepsilon}{\rho} \right)^{\alpha} \nu(B). \tag{16}$$

Taking $h(x) = g_t L_{f(x)}$ and unwinding the definitions of good and nonplanar pairs, one can show that (i) and (ii) can be verified for some balls $B$ centered at $\nu$-almost every point, and derive (15) from (16). For more detail on the proof, see the lecture notes (Kleinbock 2010).

Further Results

The approach to metric Diophantine approximation using quantitative nondivergence, that is, the implication (i) + (ii) $\Rightarrow$ (iii), is not omnipotent. In particular, it is difficult to use when more precise results are needed, such as computing/estimating the Hausdorff dimension of the set of $\psi_{1,v}$-approximable vectors on a manifold. See (Beresnevich et al. 2007; Beresnevich and Velani 2007) for such results. On the other hand, the dynamical approach can often treat much more general objects than its classical counterpart, and also can be perturbed in a lot of directions, producing many generalizations and modifications of the main theorems from the preceding section. It has numerous applications, most of which show up in a recent survey (Beresnevich and Kleinbock 2022).

One of the most important of them is the so-called multiplicative version of the set-up of section “Diophantine Approximation with Dependent Quantities: The Set-Up.” Namely, define functions $\Pi(x) \overset{\mathrm{def}}{=} \prod_i |x_i|$ and $\Pi_+(x) \overset{\mathrm{def}}{=} \prod_i \max(|x_i|, 1)$. Then, given a function $\psi : \mathbb{N} \to \mathbb{R}_+$, one says that $Y \in M_{m,n}$ is multiplicatively $\psi$-approximable (notation: $Y \in W^\times_{m,n}(\psi)$) if there are infinitely many $q \in \mathbb{Z}^n$ such that

$$\Pi(Yq + p)^{1/m} \le \psi\left(\Pi_+(q)^{1/n}\right) \tag{17}$$

for some $p \in \mathbb{Z}^m$. Since $\Pi(x) \le \Pi_+(x) \le \|x\|^k$ for $x \in \mathbb{R}^k$, any $\psi$-approximable $Y$ is multiplicatively $\psi$-approximable; but the converse is in general not true, see, for example, (Gallagher 1962). However if one, as before, considers the family $\{\psi_{1,v}\}$, the critical parameter for which the drop from full measure to measure zero occurs is again $n/m$. That is, if one defines the multiplicative Diophantine exponent $\omega^\times(Y)$ of $Y$ by $\omega^\times(Y) \overset{\mathrm{def}}{=} \sup\{ v : Y \in W^\times_{m,n}(\psi_{1,v}) \}$, then clearly $\omega^\times(Y) \ge \omega(Y)$ for all $Y$, and yet $\omega^\times(Y) = n/m$ for $\lambda$-a.e. $Y \in M_{m,n}$.
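
The comparison between the product functions and the supremum norm that drives the implication "approximable implies multiplicatively approximable" can be checked by hand on small integer vectors. The sketch below is only an illustration: the function names `pi_prod`, `pi_plus`, `sup_norm`, and `chain_holds` are hypothetical, and the check is restricted to nonzero integer vectors, where the supremum norm is at least 1 (for the left-hand side of (17) only the bound $\Pi(x) \le \|x\|^k$ is needed, and that one holds for every real vector).

```python
import math
from itertools import product


def pi_prod(x):
    """Pi(x): product of the absolute values of the coordinates."""
    return math.prod(abs(t) for t in x)


def pi_plus(x):
    """Pi_+(x): product of max(|x_i|, 1) over the coordinates."""
    return math.prod(max(abs(t), 1) for t in x)


def sup_norm(x):
    """Supremum norm of x."""
    return max(abs(t) for t in x)


def chain_holds(q):
    """Check Pi(q) <= Pi_+(q) <= ||q||^k for a vector q with k coordinates."""
    k = len(q)
    return pi_prod(q) <= pi_plus(q) <= sup_norm(q) ** k


if __name__ == "__main__":
    # Exhaustive check over nonzero integer vectors in a small box.
    box = [q for q in product(range(-3, 4), repeat=3) if any(q)]
    print(all(chain_holds(q) for q in box))
    # A lopsided vector: Pi_+ is far smaller than the norm to the power k.
    q = (1, 100)
    print(pi_plus(q), sup_norm(q) ** len(q))
```

For the lopsided vector $q = (1, 100)$ one gets $\Pi_+(q) = 100$ while $\|q\|^2 = 10000$, so for a nonincreasing $\psi$ the right-hand side of (17) is evaluated at a much smaller argument than in the standard set-up. This is the sense in which the multiplicative condition is easier to satisfy.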
Now specialize to $\mathbb{R}^n$ (by the same duality principle as before, it does not matter whether to think in terms of row or column vectors, but we will adopt the row vector set-up), and define the multiplicative exponent $\omega^\times(\mu)$ of a measure $\mu$ on $\mathbb{R}^n$ by $\omega^\times(\mu) \overset{\mathrm{def}}{=} \sup\{ v \mid \mu(W^\times(\psi_{1,v})) > 0 \}$; then $\omega^\times(\lambda) = n$. Following Sprindžuk (1980), say that $\mu$ is strongly extremal if $\omega^\times(\mu) = n$. It turns out that all the results mentioned in the previous section have their multiplicative analogues; that is, the measures described there happen to be strongly extremal. This was conjectured by A. Baker (1975) for the curve (12) and then by Sprindžuk in 1980 (Sprindžuk 1980) for analytic nondegenerate manifolds. (We remark that only very few results in this set-up can be obtained by the standard methods, see, e.g., (Beresnevich and Velani 2007)). The proof of this stronger statement is based on using the multi-parameter action of

$$g_{\mathbf{t}} = \operatorname{diag}\left(e^{t_1 + \cdots + t_n}, e^{-t_1}, \ldots, e^{-t_n}\right), \quad \text{where } \mathbf{t} = (t_1, \ldots, t_n),$$

instead of $g_t$ considered in the previous section. One can show that the choice $h(x) = g_{\mathbf{t}} L_{f(x)}$ allows one to verify (i) and (ii) uniformly in $\mathbf{t} \in \mathbb{R}^n_+$, and the proof is finished by applying a multi-parameter version of the correspondence described in section “Connection with Dynamics on the Space of Lattices.” Namely, one can show that $y \in \mathrm{VWA}^\times_{1,n}$ if and only if the trajectory $\{ g_{\mathbf{t}} L_y \mathbb{Z}^k : \mathbf{t} \in \mathbb{R}^n_+ \}$ grows linearly, that is, for some $\gamma > 0$, one has $\operatorname{dist}(g_{\mathbf{t}} L_y \mathbb{Z}^{n+1}, \mathbb{Z}^{n+1}) > \gamma \|\mathbf{t}\|$ for an unbounded set of $\mathbf{t} \in \mathbb{R}^n_+$. A similar correspondence was used in (Einsiedler et al. 2006) to prove that the (conjecturally empty) set of exceptions to Littlewood’s Conjecture, which, using the terminology introduced above, can be called badly multiplicatively approximable vectors:

$$\mathrm{BA}^\times_{n,1} \overset{\mathrm{def}}{=} \mathbb{R}^n \setminus \bigcup_{c > 0} W^\times_{n,1}(\psi_{c,1/n}) = \left\{ y : \inf_{q \in \mathbb{Z} \setminus \{0\},\ p \in \mathbb{Z}^n} |q| \, \Pi(qy - p) > 0 \right\}, \tag{18}$$

has Hausdorff dimension zero. This was done using a measure rigidity result for the action of the group of diagonal matrices on the space of lattices. See (Cassels and Swinnerton-Dyer 1955) for an implicit description of this correspondence and (Einsiedler and Lindenstrauss 2006; Lindenstrauss 2007; Margulis 1997) for more detail.

The dynamical approach also turned out to be fruitful in studying Diophantine properties of pairs $(f, \nu)$ for which the nonplanarity condition fails. Note that obvious examples of non-extremal measures are provided by proper affine subspaces of $\mathbb{R}^n$ whose coefficients are rational or are well enough approximable by rational numbers. On the other hand, it is clear from a Fubini argument that almost all translates of any given subspace are extremal. In (Kleinbock 2003) the method of (Kleinbock and Margulis 1998) was pushed further to produce criteria for the extremality, as well as the strong extremality, of arbitrary affine subspaces $\mathcal{L}$ of $\mathbb{R}^n$. Further, it was shown that if $\mathcal{L}$ is extremal (resp. strongly extremal), then so is any smooth submanifold of $\mathcal{L}$ which is nondegenerate in $\mathcal{L}$ at a.e. point. (The latter property is a straightforward generalization of the definition of nondegeneracy in $\mathbb{R}^n$: a map $f$ is nondegenerate in $\mathcal{L}$ at $x$ if the linear part of $\mathcal{L}$ is spanned by partial derivatives of $f$ at $x$.) In other words, extremality and strong extremality pass from affine subspaces to their nondegenerate submanifolds.

A more precise analysis makes it possible to study Diophantine exponents of measures with supports contained in arbitrary proper affine subspaces of $\mathbb{R}^n$. Namely, in (Kleinbock 2008) it is shown how to compute $\omega(\mathcal{L})$ for any $\mathcal{L}$ and furthermore proved that if $\nu$ is a Federer measure on $\mathbb{R}^d$, $U$ an open subset of $\mathbb{R}^d$, and $f : U \to \mathbb{R}^n$ a continuous map such that the pair $(f, \nu)$ is good and nonplanar in $\mathcal{L}$, then $\omega(f_*\nu) = \omega(\mathcal{L})$. Here we say, generalizing the definition from section “Diophantine Approximation with Dependent Quantities: The Set-Up,” that $(f, \nu)$ is nonplanar in $\mathcal{L}$ if for any ball $B$ with $\nu(B) > 0$, the $f$-image of $B \cap \operatorname{supp} \nu$ is not contained in any proper affine subspace of $\mathcal{L}$. (It is easy to see that for a smooth map $f : U \to \mathcal{L}$, $(f, \lambda)$ is good and nonplanar in $\mathcal{L}$ whenever $f$ is nondegenerate in $\mathcal{L}$ at a.e. point.) It is worthwhile to point out that these new applications require a strengthening of the measure estimate described at the end of section “Diophantine
Approximation with Dependent Quantities: The Set-Up”: it was shown in (Kleinbock 2008) that (i) and (ii) would still imply (iii) if $\rho$ in (ii) is replaced by $\rho^{\dim V}$. See (Huang 2022) for the most up-to-date account of what is known about Diophantine exponents of affine subspaces.

Another application concerns badly approximable vectors. Using the dynamical description of the set $\mathrm{BA} \subset \mathbb{R}^n$ due to Dani (1985), it turns out to be possible to find badly approximable vectors inside supports of certain measures on $\mathbb{R}^n$. Namely, if a subset $K$ of $\mathbb{R}^n$ supports an absolutely friendly measure, then $\mathrm{BA} \cap K$ has Hausdorff dimension not less than the Hausdorff dimension of this measure. In particular, it proves that limit measures of irreducible systems of self-similar/self-conformal contractions satisfying the Open Set Condition, such as the middle-third Cantor set on the real line, contain subsets of full Hausdorff dimension consisting of badly approximable vectors. This was established in (Kleinbock and Weiss 2005a) and later independently in (Kristensen et al. 2006) using a different approach. See also Fishman’s work (Fishman 2006) for a stronger result involving winning sets of Schmidt games (the winning property, due to the work of Schmidt (1966), implies full Hausdorff dimension and is stable with respect to countable intersections). Fishman’s result has been significantly generalized in 2012 (Broderick et al. 2012) by showing that BA has the hyperplane absolute winning (HAW) property, which is stronger than winning and is inherited by supports of absolutely friendly measures. An even more striking development occurred several years later, when Beresnevich (2015), Yang (2019), and then Beresnevich–Nesharim–Yang (2022) showed that any nondegenerate real analytic curve in $\mathbb{R}^n$ contains a winning set of badly approximable vectors. Moreover, they did it in the context of approximation with weights, thereby giving another proof of Schmidt’s conjecture from (Schmidt 1982), originally proved by Badziahin, Pollington, and Velani (2011). Note that quantitative nondivergence estimates play a crucial role in the argument of (Beresnevich et al. 2022).

It is important to mention that for some special classes of fractals much more can be said: namely, assuming some invariance properties of the fractal subset, one can aim at divergence Khintchine-type theorems, such as proving that the intersection of BA with the fractal is null with respect to some natural measure supported on the fractal. See (Simmons and Weiss 2019; Khalil and Luethi 2021) for several recent results in this direction. The argument in these papers involves studying nondivergence of random walks on the space of lattices and is based on the work of Benoist and Quint.

The quantitative nondivergence method also has applications to the problem of improving Dirichlet’s Theorem (uniform approximation). For example, in (Kleinbock and Weiss 2008) almost every point of any nondegenerate smooth manifold is proved not to lie in $D(\psi_{\varepsilon,n/m})$ for small enough $\varepsilon$ depending only on the manifold. Some earlier work was done in (Baker 1976; Bugeaud 2002; Davenport and Schmidt 1970) and also in (Kleinbock and Weiss 2005b) for the set of singular vectors. However, a different technique based on Ratner’s Theorem and the Dani–Margulis linearization method (Dani and Margulis 1993) enabled Shah (2009) to prove a much stronger result, namely, the equidistribution of $g_t$-translates of orbits $\{L_{f(x)} \mathbb{Z}^{n+1}\}$ for a real analytic nondegenerate $f$. This was later extended to multiplicative approximation (Shah 2010), smooth nondegenerate maps (Shah and Yang 2022b), affine subspaces (Shah and Yang 2022a; Kleinbock et al. 2021b; Chow and Yang 2019), and submanifolds of $M_{m,n}$ with $\min(m, n) > 1$ (Yang 2016, 2020; Shah and Yang 2020).

It is also worthwhile to mention that a generalization of the measure estimate discussed in section “Diophantine Approximation with Dependent Quantities: The Set-Up” was used in (Bernik et al. 2001) to estimate the measure of the set of points $x$ in a ball $B \subset \mathbb{R}^d$ for which the system

$$\begin{cases} |f(x) q + p| < \varepsilon \\ |f'(x) q| < \delta \\ |q_i| < Q_i, \quad i = 1, \ldots, n, \end{cases}$$

where $f$ is a smooth nondegenerate map $B \to \mathbb{R}^n$, has a nonzero integer solution. For that, $L_{f(x)}$ as in (15) has to be replaced by the matrix
Frantzikinakis–McCutcheon, Nitica, and Ward in this volume.

Acknowledgments The work on this paper was supported in part by NSF Grants DMS-0239463 and DMS-1900560.

Bibliography

Baker A (1975) Transcendental number theory. Cambridge University Press, London
Baker RC (1976) Metric diophantine approximation on manifolds. J Lond Math Soc 14:43–48
Baker RC (1978) Dirichlet’s theorem on diophantine approximation. Math Proc Camb Philos Soc 83:37–59
Bekka M, Mayer M (2000) Ergodic theory and topological dynamics of group actions on homogeneous spaces. Cambridge University Press, Cambridge
Beresnevich V (1999) On approximation of real numbers by real algebraic numbers. Acta Arith 90:97–112
Beresnevich V (2002) A Groshev type theorem for convergence on manifolds. Acta Math Hungar 94:99–130
Beresnevich V, Velani S (2007) A note on simultaneous Diophantine approximation on planar curves. Math Ann 337:769–796
Beresnevich V, Bernik VI, Kleinbock D, Margulis GA (2002) Metric Diophantine approximation: the Khintchine-Groshev theorem for nondegenerate manifolds. Moscow Math J 2:203–225
Beresnevich V, Dickinson H, Velani S (2006) Measure theoretic laws for lim sup sets. Memoirs Am Math Soc 179:1–91
Beresnevich V, Dickinson H, Velani S (2007) Diophantine approximation on planar curves and the distribution of rational points. Ann Math 166:367–426
Bernik VI (1984) A proof of Baker’s conjecture in the metric theory of transcendental numbers. Dokl Akad Nauk SSSR 277:1036–1039
Bernik VI, Dodson MM (1999) Metric Diophantine approximation on manifolds. Cambridge University Press, Cambridge
Bernik VI, Kleinbock D, Margulis GA (2001) Khintchine-type theorems on manifolds: convergence case for standard and multiplicative versions. Int Math Res Notices 2001:453–486
Besicovitch AS (1929) On linear sets of points of fractional dimensions. Math Ann 101:161–193
Bugeaud Y (2002) Approximation by algebraic integers and Hausdorff dimension. J Lond Math Soc 65:547–559
Bugeaud Y (2004) Approximation by algebraic numbers. Cambridge University Press, Cambridge
Cassels JWS (1957) An introduction to Diophantine approximation. Cambridge University Press, New York
Cassels JWS, Swinnerton-Dyer H (1955) On the product of three homogeneous linear forms and the indefinite ternary quadratic forms. Philos Trans R Soc Lond 248:73–96
Dani SG (1979) On invariant measures, minimal sets and a lemma of Margulis. Invent Math 51:239–260
Dani SG (1985) Divergent trajectories of flows on homogeneous spaces and diophantine approximation. J Reine Angew Math 359:55–89
Dani SG (1986) On orbits of unipotent flows on homogeneous spaces. II. Ergodic Theory Dyn Syst 6:167–182
Dani SG, Margulis GA (1993) Limit distributions of orbits of unipotent flows and values of quadratic forms. In: IM Gelfand Seminar. American Mathematical Society, Providence, pp 91–137
Davenport H, Schmidt WM (1969/1970) Dirichlet’s theorem on diophantine approximation. II. Acta Arith 16:413–424
Davenport H, Schmidt WM (1970) Dirichlet’s theorem on diophantine approximation. In: Symposia Mathematica. INDAM, Rome, pp 113–132
Ding J (1994) A proof of a conjecture of C. L. Siegel. J Number Theory 46:1–11
Dodson MM (1992) Hausdorff dimension, lower order and Khintchine’s theorem in metric Diophantine approximation. J Reine Angew Math 432:69–76
Dodson MM (1993) Geometric and probabilistic ideas in the metric theory of Diophantine approximations. Uspekhi Mat Nauk 48:77–106
Druţu C (2005) Diophantine approximation on rational quadrics. Math Ann 333:405–469
Duffin RJ, Schaeffer AC (1941) Khintchine’s problem in metric Diophantine approximation. Duke Math J 8:243–255
Einsiedler M, Kleinbock D (2007) Measure rigidity and p-adic Littlewood-type problems. Compos Math 143:689–702
Einsiedler M, Lindenstrauss E (2006) Diagonalizable flows on locally homogeneous spaces and number theory. In: Proceedings of the international congress of mathematicians. Eur Math Soc, Zurich, pp 1731–1759
Einsiedler M, Katok A, Lindenstrauss E (2006) Invariant measures and the set of exceptions to Littlewood’s conjecture. Ann Math 164:513–560
Eskin A (1998) Counting problems and semisimple groups. In: Proceedings of the international congress of mathematicians. Doc Math, Berlin, pp 539–552
Eskin A, Margulis GA, Mozes S (1998) Upper bounds and asymptotics in a quantitative version of the Oppenheim conjecture. Ann Math 147:93–141
Eskin A, Margulis GA, Mozes S (2005) Quadratic forms of signature (2,2) and eigenvalue spacings on rectangular 2-tori. Ann Math 161:679–725
Fishman L (2006) Schmidt’s games on certain fractals. Israel J Math (to appear)
Gallagher P (1962) Metric simultaneous diophantine approximation. J Lond Math Soc 37:387–390
Ghosh A (2005) A Khintchine type theorem for hyperplanes. J Lond Math Soc 72:293–304
Ghosh A (2006) Dynamics on homogeneous spaces and Diophantine approximation on manifolds. PhD thesis, Brandeis University, Waltham
Ghosh A (2007) Metric Diophantine approximation over a local field of positive characteristic. J Number Theory 124:454–469
Gorodnik A (2007) Open problems in dynamics and related fields. J Mod Dyn 1:1–35
Groshev AV (1938) Un théorème sur les systèmes des formes linéaires. Dokl Akad Nauk SSSR 9:151–152
Harman G (1998) Metric number theory. Clarendon Press/Oxford University Press, New York
Hutchinson JE (1981) Fractals and self-similarity. Indiana Univ Math J 30:713–747
Jarnik V (1928–9) Zur metrischen Theorie der diophantischen Approximationen. Prace Mat-Fiz 36:91–106
Jarnik V (1929) Diophantischen Approximationen und Hausdorffsches Mass. Mat Sb 36:371–382
Khanin K, Lopes-Dias L, Marklof J (2007) Multidimensional continued fractions, dynamical renormalization and KAM theory. Commun Math Phys 270:197–231
Khintchine A (1924) Einige Satze uber Kettenbruche, mit Anwendungen auf die Theorie der Diophantischen Approximationen. Math Ann 92:115–125
Khintchine A (1963) Continued fractions. P Noordhoff Ltd, Groningen
Kleinbock D (2001) Some applications of homogeneous dynamics to number theory. In: Smooth ergodic theory and its applications. American Mathematical Society, Providence, pp 639–660
Kleinbock D (2003) Extremal subspaces and their submanifolds. Geom Funct Anal 13:437–466
Kleinbock D (2004) Baker-Sprindžuk conjectures for complex analytic manifolds. In: Algebraic groups and arithmetic. Tata Inst Fund Res, Mumbai, pp 539–553
Kleinbock D, Lindenstrauss E, Weiss B (2004) On fractal measures and diophantine approximation. Selecta Math 10:479–523
Kontsevich M, Suhov Y (1999) Statistics of Klein polyhedra and multidimensional continued fractions. In: Pseudoperiodic topology. American Mathematical Society, Providence, pp 9–27
Kristensen S, Thorn R, Velani S (2006) Diophantine approximation and badly approximable sets. Adv Math 203:132–169
Lagarias JC (1994) Geodesic multidimensional continued fractions. Proc Lond Math Soc 69:464–488
Lindenstrauss E (2007) Some examples how to use measure classification in number theory. In: Equidistribution in number theory, an introduction. Springer, Dordrecht, pp 261–303
Mahler K (1932) Uber das Mass der Menge aller S-Zahlen. Math Ann 106:131–139
Margulis GA (1975) On the action of unipotent groups in the space of lattices. In: Lie groups and their representations (Budapest, 1971). Halsted, New York, pp 365–370
Margulis GA (1989) Discrete subgroups and ergodic theory. In: Number theory, trace formulas and discrete groups (Oslo, 1987). Academic, Boston, pp 377–398
Margulis GA (1997) Oppenheim conjecture. In: Fields medalists’ lectures. World Sci Publishing, River Edge, pp 272–327
Margulis GA (2002) Diophantine approximation, lattices
Kleinbock D (2008) An extension of quantitative non- and flows on homogeneous spaces. In: A panorama of
divergence and applications to Diophantine exponents. number theory or the view from Baker’s garden. Cam-
Trans AMS 360:6497–6523 bridge University Press, Cambridge, pp 280–310
Kleinbock D, Margulis GA (1996) Bounded orbits of non- Mauldin D, Urbański M (1996) Dimensions and measures
quasiunipotent flows on homogeneous spaces. In: in infinite iterated function systems. Proc Lond Math
Sina?‘s Moscow seminar on dynamical systems. Amer- Soc 73:105–154
ican Mathematical Society, Providence, pp 141–172 Mohammadi A, Salehi Golsefidy A (2008) S-arithmetic
Kleinbock D, Margulis GA (1998) Flows on homogeneous Khintchine-type theorem. Preprint
spaces and Diophantine approximation on manifolds. Moore CC (1966) Ergodicity of flows on homogeneous
Ann Math 148:339–360 spaces. Am J Math 88:154–178
Kleinbock D, Margulis GA (1999) Logarithm laws for Roth KF (1955) Rational approximations to algebraic
flows on homogeneous spaces. Invent Math 138: numbers. Mathematika 2:1–20
451–494 Schmidt WM (1960) A metrical theorem in diophantine
Kleinbock D, Tomanov G (2007) Flows on S? Arithmetic approximation. Can J Math 12:619–631
homogeneous spaces and applications to metric Schmidt WM (1964) Metrische Satze uber simultane
Diophantine approximation. Comment Math Helv 82: Approximation abhangiger Grosen. Monatsh Math
519–581 63:154–166
Kleinbock D, Weiss B (2005a) Badly approximable vec- Schmidt WM (1969) Badly approximable systems of linear
tors on fractals. Israel J Math 149:137–170 forms. J Number Theory 1:139–154
Kleinbock D, Weiss B (2005b) Friendly measures, homo- Schmidt WM (1972) Norm form equations. Ann Math 96:
geneous flows and singular vectors. In: Algebraic and 526–551
topological dynamics. American Mathematical Soci- Schmidt WM (1980) Diophantine approximation.
ety, Providence, pp 281–292 Springer, Berlin
Kleinbock D, Weiss B (2008) Dirichlet’s theorem on Sheingorn M (1993) Continued fractions and congruence
diophantine approximation and homogeneous flows. subgroup geodesics. In: Number theory with an empha-
J Mod Dyn 2:43–62 sis on the Markoff spectrum (Provo, UT, 1991). Dek-
Kleinbock D, Shah N, Starkov A (2002) Dynamics of ker, New York, pp 239–254
subgroup actions on homogeneous spaces of Lie Sprindžuk VG (1964) More on Mahler’s conjecture. Dokl
groups and applications to number theory. In: Hand- Akad Nauk SSSR 155:54–56
book on dynamical systems, vol 1A. Elsevier Science, Sprindžuk VG (1969) Mahler’s problem in metric number
North Holland, Amsterdam, pp 813–930 theory. American Mathematical Society, Providence
612 Ergodic Theory on Homogeneous Spaces and Metric Number Theory
Sprindžuk VG (1979) Metric theory of Diophantine Chow S, Yang L (2019) An effective Ratner
approximations. VH Winston & Sons, Washington, DC equidistribution theorem for multiplicative
Sprindžuk VG (1980) Achievements and problems of the Diophantine approximation on planar lines. Preprint
theory of Diophantine approximations. Uspekhi Mat arXiv:1902.06081
Nauk 35:3–68 Das T, Fishman L, Simmons D, Urbański M (2018)
Starkov A (2000) Dynamical systems on homogeneous Extremality and dynamically defined measures, part I:
spaces. American Mathematical Society, Providence Diophantine properties of quasi-decaying measures.
Stratmann B, Urbanski M (2006) Diophantine extremality Selecta Math 24:2165–2206
of the Patterson measure. Math Proc Camb Philos Soc Das T, Fishman L, Simmons D, Urbański M (2019)
140:297–304 A variational principle in the parametric geometry of
Sullivan D (1982) Disjoint spheres, approximation by numbers. Preprint arXiv:1901.06602
imaginary quadratic numbers, and the logarithm law Dolgopyat D, Fayad B, Vinogradov I (2017) Central limit
for geodesics. Acta Math 149:215–237 theorems for simultaneous Diophantine approxima-
Urbanski M (2005) Diophantine approximation of self- tions. J Éc polytech Math 4:1–36
conformal measures. J Number Theory 110:219–235 Eskin A, Mozes S (2022) Margulis functions and their
Želudeviĉ F (1986) Simultane diophantische applications. In: Dynamics, geometry, number theory:
Approximationen abhangiger Grossen in mehreren the impact of Margulis on modern mathematics. Uni-
Metriken. Acta Arith 46:285–296 versity of Chicago Press, Chicago, pp 342–361
Fishman L, Kleinbock D, Merrill K, Simmons D (2018)
Intrinsic Diophantine approximation on manifolds:
New List
general theory. TAMS 370:577–599
Aka M, Breuillard E, Rosenzweig L, de Saxcé N (2018)
Fishman L, Kleinbock D, Merrill K, Simmons D (2022)
Diophantine approximation of matrices and Lie groups.
Intrinsic Diophantine approximation on quadric hyper-
Geom Funct Anal 28:1–57
surfaces. JEMS 24:1045–1101
Andersen N, Duke W (2021) On a theorem of Davenport
Huang JJ (2022) Extremal affine subspaces and
and Schmidt. Acta Arith 198:37–75
Khintchine-Jarník type theorems. Preprint,
Athreya J, Ghosh A, Tseng J (2015) Spiraling of approxi-
arXiv:2208.04255
mations and spherical averages of Siegel transforms.
Kadyrov S, Kleinbock D, Lindenstrauss E, Margulis GA
J Lond Math Soc 91:383–404
(2017) Singular systems of linear forms and non-escape
Badziahin D, Pollington A, Velani S (2011) On a problem
of mass in the space of lattices. J Anal Math 133:
in simultaneous Diophantine approximation: Schmidt’s
253–277
conjecture. Ann Math 174:1837–1883
Khalil O, Luethi M (2021) Random walks, spectral gaps,
Beresnevich V (2015) Badly approximable points on mani-
and Khintchine’s theorem on fractals. Preprint
folds. Inv Math 202:1199–1240
arXiv:2101.05797
Beresnevich V, Kleinbock D (2022) Quantitative non-
Kim T, Kim W (2022) Hausdorff measure of sets of
divergence and Diophantine approximation on mani-
Dirichlet non-improvable affine forms. Adv Math
folds. In: Dynamics, geometry, number theory: the
403:Paper No. 108353
impact of Margulis on modern mathematics. University
Kleinbock D (2010) Quantitative nondivergence and its
of Chicago Press, Chicago, pp 303–341
Diophantine applications. Lecture notes for Clay Insti-
Beresnevich V, Kleinbock D, Margulis GA (2015) Non-
tute Summer School (Pisa 2007). In: Homogeneous
planarity and metric Diophantine approximation for
flows, moduli spaces and arithmetic, Clay Math Proc,
systems of linear forms. J Number Theory Bordeaux
pp 131–153
27:1–31
Kleinbock D, de Saxcé N (2018) Rational approximation
Beresnevich V, Guan L, Marnat A, Ramirez F, Velani
on quadrics: a simplex lemma and its consequences.
S (2020) Dirichlet is not just bad and singular. Preprint
Enseign Math 64:459–476
arXiv:2008.04043
Kleinbock D, Merrill K (2015) Rational approximation on
Beresnevich V, Nesharim E, Yang L (2022) Winning prop-
spheres. Israel J Math 209:293–322
erty of badly approximable points on curves. Duke
Kleinbock D, Mirzadeh S (2020) On the dimension drop
Math J, to appear
conjecture for diagonal flows on the space of lattices.
Björklund M, Gorodnik A (2019) Central limit theorems
Adv Math, to appear
for Diophantine approximants. Math Ann 374:
Kleinbock D, Rao A (2022a) A zero-one law for uniform
1371–1437
Diophantine approximation in Euclidean norm. Int
Broderick R, Kleinbock D, Fishman L, Reich A, Weiss
Math Res Not 2022:5617–5657
B (2012) The set of badly approximable vectors is
Kleinbock D, Rao A (2022b) Abundance of Dirichlet-
strongly C1 incompressible. Math Proc Camb Philos
improvable pairs with respect to arbitrary norms.
Soc 153:319–339
Mosc J Comb Number Theory 11:97–114
Cheung Y (2011) Hausdorff dimension of the set of singu-
Kleinbock D, Rao A (2022c) Weighted uniform
lar pairs. Ann Math 173:127–167
Diophantine approximation of systems of linear
Cheung Y, Chevallier N (2016) Hausdorff dimension of
forms. Pure Appl Math Q 18:1095–1112
singular vectors. Duke Math J 165:2273–2329
Ergodic Theory on Homogeneous Spaces and Metric Number Theory 613
Kleinbock D, Wadleigh N (2018) A zero-one law for Shah N (2010) Expanding translates of curves and
improvements to Dirichlet’s theorem. PAMS 146: Dirichlet-Minkowski theorem on linear forms. JAMS
1833–1844 23:563–589
Kleinbock D, Wadleigh N (2019) An inhomogeneous Shah N, Yang L (2020) Equidistribution of curves in
Dirichlet theorem via shrinking targets. Compos Math homogeneous spaces and Dirichlet’s approximation
155:1402–1423 theorem for matrices. Discrete Contin Dyn Syst 40:
Kleinbock D, Weiss B (2013) Modified Schmidt games 5247–5287
and a conjecture of Margulis. J Mod Dyn 7:429–460 Shah N, Yang P (2022a) Equidistribution of expanding
Kleinbock D, Margulis GA, Wang J (2010) Metric degenerate manifolds in the space of lattices. Preprint
Diophantine approximation for systems of linear arXiv:2112.13952
forms via dynamics. Int J Number Theory 6:1139–1168 Shah N, Yang P (2022b) Equidistribution of non-uniformly
Kleinbock D, Strombergsson A, Yu S (2021a) Measure stretching translates of shrinking smooth curves and
estimates and improvements to Dirichlet’s theorem. weighted Dirichlet approximation. Preprint
Proc. London Math Soc, to appear arXiv:2204.03194
Kleinbock D, de Saxcé N, Shah N, Yang P (2021b) Shapira U, Weiss B (2022) Geometric and arithmetic aspects
Equidistribution in the space of 3-lattices and of approximation vectors. Preprint arXiv:2206.05329
Dirichlet-improvable vectors on planar lines. Ann Sc Simmons D, Weiss B (2019) Random walks on homoge-
Norm Super Pisa, to appear neous spaces and Diophantine approximation on frac-
Lindenstrauss E, Margulis G (2014) Effective estimates on tals. Invent Math 216:337–394
indefinite ternary forms. Israel J Math 203:445–499 Solan O (2021) Parametric geometry of numbers with
Roy D (2015) On Schmidt and Summerer parametric general flow. Preprint, arXiv:2106.01707
geometry of numbers. Ann Math 182:739–786 Strömbergsson A, Vishe P (2020) An effective
Schmidt WM (1966) On badly approximable numbers and equidistribution result for SL(2,R)⋉(R2) k and appli-
certain games. TAMS 123:78–199 cation to inhomogeneous quadratic forms. J Lond Math
Schmidt WM (1982) Open problems in Diophantine Soc 102:143–204
approximation. In: Diophantine approximations and Yang L (2016) Equidistribution of expanding curves in
transcendental numbers. Progr. Math., 31, Birkhäuser homogeneous spaces and Diophantine approximation
Boston, Boston, MA, pp 271–287 on square matrices. PAMS 144:5291–5308
Schmidt WM, Summerer L (2013) Diophantine approxi- Yang L (2019) Badly approximable points on manifolds
mation and parametric geometry of numbers. Monatsh and unipotent orbits in homogeneous spaces. Geom
Math 169:51–104 Funct Anal 29:1194–1234
Shah N (2009) Equidistribution of expanding translates of Yang P (2020) Equidistribution of expanding translates of
curves and Dirichlet’s theorem on Diophantine approx- curves and Diophantine approximation on matrices.
imation. Invent Math 177:509–532 Invent Math 220:909–948
Ergodic Theory: Rigidity

Viorel Niţică
West Chester University, West Chester, PA, USA
Institute of Mathematics, Bucharest, Romania

Measure rigidity  Measure rigidity refers to the study of invariant measures for actions of abelian groups and semigroups.

Definition of the Subject
as well as definitions for some terminology used in the sequel can be found in the articles ▶ “Ergodic Theory on Homogeneous Spaces and Metric Number Theory” by Kleinbock, ▶ “Ergodic Theory: Recurrence” by Frantzikinakis, McCutcheon, and ▶ “Ergodic Theory: Interactions with Combinatorics and Number Theory” by Ward. The theory of hyperbolic dynamics is presented in the entry “Hyperbolic Dynamical Systems” by Viana and in the entry ▶ “Smooth Ergodic Theory” by Wilkinson.

Introduction

The first results about classification of lattices in semi-simple Lie groups were local, aimed at trying to understand the space of small perturbations of a given linear representation. A major contributor was Weil (1960, 1962, 1964), who proved local rigidity of linear representations for large classes of groups, in particular lattices. Another breakthrough was the contribution of Kazhdan (1967), who introduced property (T), which makes it possible to show that large classes of lattices are finitely generated. Rigidity theory matured due to the remarkable global rigidity results obtained by Mostow (1973) and Margulis (1991), leading to a complete classification of lattices in large classes of semi-simple Lie groups.

Briefly, a hyperbolic (Anosov) dynamical system is one that exhibits strong expansion and contraction along complementary directions. An early contribution introducing this class of objects is the paper of Smale (1967), in which basic examples and techniques are introduced. A breakthrough came with the results of Anosov (1967), who proved structural stability of the Anosov systems and ergodicity of the geodesic flow on a manifold of negative curvature. Motivated by questions arising in mathematical physics, chaos theory and other areas, hyperbolic dynamics emerged as one of the major fields of contemporary mathematics. From the beginning, a major unsolved problem in the field was the classification of Anosov diffeomorphisms and flows.

In the 80s a change in philosophy occurred, partially motivated by a program introduced by Zimmer (1987). The goal of the program was to classify the smooth actions of higher rank semi-simple Lie groups and of their (irreducible) lattices on compact manifolds. It was expected that any such lattice action that preserves a smooth volume form and is ergodic can be reduced to one of the following standard models: isometric actions, linear actions on infranil manifolds, and left translations on compact homogeneous spaces. This original conjecture was disproved by Katok and Lewis (1996): by blowing up a linear nilmanifold-action at some fixed points they exhibit real-analytic, volume preserving, ergodic lattice actions on manifolds with complicated topology.

Nevertheless, imposing extra assumptions on a higher rank action, for example the existence of some hyperbolicity, allows local and global classification results. The concept of Anosov action, that is, an action that contains at least one Anosov diffeomorphism, was introduced for general groups by Pugh and Shub (1972). The significant differences between the classical ℤ and ℝ cases and those of higher rank lattices, or at a more basic level, of higher rank abelian groups, went unnoticed for a while. The surge of activity in the 80s allowed these differences to surface: for lattices in the work of Hurder, Katok, Lewis, Zimmer (see (Hurder 1992; Katok and Lewis 1991, 1996; Katok et al. 1996)); and for higher rank abelian groups in the work of Katok and Lewis (1991) and Katok and Spatzier (1994a). As observed in these papers, local and global rigidity are typical for such Anosov actions. This generated additional research which is summarized in sections “Local Rigidity” and “Global Rigidity”.

Differentiable rigidity is covered in section “Differentiable Rigidity”. An interesting problem is to find moduli for the $C^k$ conjugacy, $k \ge 1$, of Anosov diffeomorphisms and flows. This was tackled so far only for low dimensional cases ($n = 2$ for diffeomorphisms and $n = 3$ for flows). Another direction that can be included here refers to finding obstructions for higher transverse regularity of the stable/unstable foliation of a hyperbolic system. A spin-off of the research done so far, which is of high interest by itself, and has applications to local and global rigidity,
consists of results lifting the regularity of solutions of cohomological equations over hyperbolic systems. In turn, these results motivated a more careful study of analytic questions about lifting the regularity of real valued continuous functions that enjoy higher regularity along webs of foliations. We also include in this section rigidity results for cocycles over higher rank abelian actions. These are crucial to the proof of local rigidity of higher rank abelian group actions. A more detailed presentation of the material relevant to differentiable rigidity can be found in the forthcoming monograph (Katok and Niţică).

Measure rigidity refers to the study of invariant measures under actions of abelian groups and semigroups. If the actions are hyperbolic, higher-rank, and satisfy natural algebraic and irreducibility assumptions, one expects the invariant measures to be rare. This direction was started by a question of Furstenberg, asking if any nonatomic probability measure on the circle, invariant and ergodic under multiplications by 2 and 3, is the Lebesgue measure. An early contribution is that of Rudolph (1990), who answered positively if the action has an element of strictly positive entropy. Katok and Spatzier (1996) extended the question to more general higher rank abelian actions, such as actions by linear automorphisms of tori and Weyl chamber flows. A related direction is the study of the invariant sets and measures under the action of horocycle flows, where important progress was made by Ratner (1991a, b) and earlier by Margulis (1989, 1997; Dani and Margulis 1990). An application of these results, present in the last papers, is the proof of the long standing Oppenheim conjecture about the density of the values of quadratic forms at integer points. Recent developments due to Einsiedler, Katok, Lindenstrauss (2006) give a partial answer to another outstanding conjecture in number theory, Littlewood’s conjecture, and emphasize measure rigidity as one of the more promising directions in rigidity. More details are given in section “Measure Rigidity”.

Four other recent surveys of rigidity theory, each one with a fair amount of overlap but also complementary in part to the present one, that discuss various aspects of the field and its significance are written by Fisher (2006), Lindenstrauss (2005), Niţică and Török (2002), and Spatzier (1995). Among these, (Fisher 2006) concentrates mostly on local and global rigidity, (Niţică and Török 2002) on differentiable rigidity, (Lindenstrauss 2005) on measure rigidity, and (Spatzier 1995) gives a general overview.

Here is a word of caution for the reader. Many times, instead of the most general results, we present an example that contains the essence of what is available. Also, several important facts that should have been included are left out. This is because stating complete results would require more space than allocated to this material. The limited knowledge of the author also plays a role here. He apologizes for any obvious omissions and hopes that the bibliography will help fill the gaps.

Basic Definitions and Examples

A detailed introduction to the theory of Anosov systems and hyperbolic dynamics is given in the monograph (Katok and Hasselblatt 1995). The proofs of the basic results for diffeomorphisms stated below can be found there. The proofs for flows are similar. Surveys about hyperbolic dynamics in this volume are the article “Hyperbolic Dynamical Systems” by Viana and the entry ▶ “Smooth Ergodic Theory” by Wilkinson.

Consider a compact differentiable manifold $M$ and $f : M \to M$ a $C^1$ diffeomorphism. Let $TM$ be the tangent bundle of $M$, and $Df : TM \to TM$ be the derivative of $f$. The map $f$ is said to be an Anosov diffeomorphism if there is a smooth Riemannian metric $\|\cdot\|$ on $M$, which induces a metric $d_M$ called adapted, a number $\lambda \in (0, 1)$, and a continuous $Df$ invariant splitting $TM = E^s \oplus E^u$ such that

$$\|Df\, v\| \le \lambda \|v\|, \quad v \in E^s, \qquad \|Df^{-1} v\| \le \lambda \|v\|, \quad v \in E^u.$$

For each $x \in M$ there is a pair of embedded $C^1$ discs $W^s_{loc}(x)$, $W^u_{loc}(x)$, called the local stable manifold and the local unstable manifold at $x$, respectively, such that:

1. $T_x W^s_{loc}(x) = E^s(x)$, $T_x W^u_{loc}(x) = E^u(x)$;
2. $f(W^s_{loc}(x)) \subset W^s_{loc}(fx)$, $f^{-1}(W^u_{loc}(x)) \subset W^u_{loc}(f^{-1}x)$;
3. For any $\mu \in (\lambda, 1)$, there exists a constant $C > 0$ such that for all $n \in \mathbb{N}$,

$$d_M(f^n x, f^n y) \le C\mu^n d_M(x, y), \quad \text{for } y \in W^s_{loc}(x),$$
$$d_M(f^{-n} x, f^{-n} y) \le C\mu^n d_M(x, y), \quad \text{for } y \in W^u_{loc}(x).$$

The local stable (unstable) manifolds can be extended to global stable (unstable) manifolds $W^s(x)$ and $W^u(x)$ which are well defined and smoothly injectively immersed. These global manifolds are the leaves of global foliations $W^s$ and $W^u$ of $M$. In general, these foliations are only continuous, but their leaves are differentiable.

Let $\varphi : \mathbb{R} \times M \to M$ be a $C^1$ flow. The flow $\varphi$ is said to be an Anosov flow if there is a Riemannian metric $\|\cdot\|$ on $M$, a constant $0 < \lambda < 1$, and a continuous $D\varphi$ invariant splitting $TM = E^s \oplus E^c \oplus E^u$ such that for all $x \in M$ and $t > 0$:

1. $\frac{d}{dt}\big|_{t=0} \varphi^t x \in E^c_x \setminus \{0\}$, $\dim E^c_x = 1$,
2. $\|D\varphi^t v\| \le \lambda^t \|v\|$, $v \in E^s$,
3. $\|D\varphi^{-t} v\| \le \lambda^t \|v\|$, $v \in E^u$.

For each $x \in M$ there is a pair of embedded $C^1$ discs $W^s_{loc}(x)$, $W^u_{loc}(x)$, called the local (strong) stable manifold and the local (strong) unstable manifold at $x$, respectively, such that:

1. $T_x W^s_{loc}(x) = E^s(x)$, $T_x W^u_{loc}(x) = E^u(x)$;
2. $\varphi^t W^s_{loc}(x) \subset W^s_{loc}(\varphi^t x)$, $\varphi^{-t} W^u_{loc}(x) \subset W^u_{loc}(\varphi^{-t} x)$ for $t > 0$;
3. For any $\mu \in (\lambda, 1)$, there exists a constant $C > 0$ such that

$$d_M(\varphi^t x, \varphi^t y) \le C\mu^t d_M(x, y), \quad \text{for } y \in W^s_{loc}(x),\ t > 0,$$
$$d_M(\varphi^{-t} x, \varphi^{-t} y) \le C\mu^t d_M(x, y), \quad \text{for } y \in W^u_{loc}(x),\ t > 0.$$

The local stable (unstable) manifolds can be extended to global stable (unstable) manifolds $W^s(x)$ and $W^u(x)$. These global manifolds are the leaves of global foliations $W^s$ and $W^u$ of $M$. One can also define weak stable and weak unstable foliations with leaves given by $W^{cs}(x) = \bigcup_{t \in \mathbb{R}} \varphi^t(W^s(x))$ and $W^{cu}(x) = \bigcup_{t \in \mathbb{R}} \varphi^t(W^u(x))$, which have as tangent distributions $E^{cs} = E^c \oplus E^s$ and $E^{cu} = E^c \oplus E^u$. In general, all these foliations are only continuous, but their leaves are differentiable.

Any Anosov diffeomorphism is structurally stable, that is, any $C^1$ diffeomorphism that is $C^1$ close to an Anosov diffeomorphism is topologically conjugate to the unperturbed one via a Hölder homeomorphism. An Anosov flow is structurally stable in the orbit equivalence sense: any $C^1$ small perturbation of an Anosov flow has the orbit foliation topologically conjugate via a Hölder homeomorphism to the orbit foliation of the unperturbed flow.

Let SL(n, ℝ) be the group of all n-dimensional square matrices with real valued entries of determinant 1. Let SL(n, ℤ) ⊂ SL(n, ℝ) be the subgroup with integer entries. Basic examples of Anosov diffeomorphisms are automorphisms of the n-torus $\mathbb{T}^n = \mathbb{R}^n/\mathbb{Z}^n$ induced by hyperbolic matrices in SL(n, ℤ). A hyperbolic matrix is one that has only nonzero eigenvalues, all away in absolute value from 1. A specific example of such a matrix in SL(2, ℤ) is

$$\begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}.$$

Basic examples of Anosov flows are given by the geodesic flows of surfaces of constant negative curvature. The unit tangent bundle of such a surface can be realized as M = Γ\PSL(2, ℝ), where PSL(2, ℝ) = SL(2, ℝ)/{±1} and Γ is a cocompact lattice in PSL(2, ℝ). The action of the geodesic flow on M is induced by right multiplication with elements in the diagonal one-parameter subgroup

$$\left\{ \begin{pmatrix} e^{t/2} & 0 \\ 0 & e^{-t/2} \end{pmatrix},\ t \in \mathbb{R} \right\}.$$

A related transformation, which is not hyperbolic, but will be of interest in this presentation, is the horocycle flow induced by right multiplication on M by elements in the one parameter subgroup

$$\left\{ \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix},\ t \in \mathbb{R} \right\}.$$

Of interest in this survey are also actions of more general groups than ℤ and ℝ. Typical examples of
higher rank $\mathbb{Z}^k$ Anosov actions are constructed on tori using groups of units in number fields. See (Katok et al. 2002) for more details about this construction. A particular example of Anosov $\mathbb{Z}^2$-action on $\mathbb{T}^3$ is induced by the hyperbolic matrices:

$$A = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 8 & 2 \end{pmatrix}, \qquad B = \begin{pmatrix} 2 & 1 & 0 \\ 0 & 2 & 1 \\ 1 & 8 & 4 \end{pmatrix}.$$

One can check, by looking at the eigenvalues, that A and B are not multiples of the same matrix. Moreover, A and B commute.

Typical examples of higher rank Anosov $\mathbb{R}^k$-actions are given by Weyl chamber flows, which we now describe using some notions from the theory of Lie groups. A good reference for the background in Lie group theory necessary here is the book of Helgason (1978). Note that for a hyperbolic element of such an action the center distribution is k-dimensional and coincides with the tangent distribution to the orbit foliation of $\mathbb{R}^k$.

Let G be a semi-simple connected real Lie group of the noncompact type, with Lie algebra $\mathfrak{g}$. Let K ⊂ G be a maximal compact subgroup that gives a Cartan decomposition $\mathfrak{g} = \mathfrak{k} + \mathfrak{p}$, where $\mathfrak{k}$ is the Lie algebra of K and $\mathfrak{p}$ is the orthogonal complement of $\mathfrak{k}$ with respect to the Killing form of $\mathfrak{g}$. Let $\mathfrak{a} \subset \mathfrak{p}$ be a maximal abelian subalgebra and $A = \exp \mathfrak{a}$ be the corresponding subgroup. The simultaneous diagonalization of $\mathrm{ad}_{\mathfrak{g}}(\mathfrak{a})$ gives the decomposition

$$\mathfrak{g} = \mathfrak{g}_0 + \sum_{\lambda \in \Lambda} \mathfrak{g}_\lambda, \qquad \mathfrak{g}_0 = \mathfrak{a} + \mathfrak{m},$$

[...] translations on Γ\G descends to an A-action on N ≔ Γ\G/M. This action is called a Weyl chamber flow. Any Weyl chamber flow is an Anosov action, that is, has an element that acts hyperbolically transversally to the orbit foliation of A. Note that all maximal connected ℝ diagonalizable subgroups of G are conjugate and their common dimension is called the ℝ-rank of G. If the ℝ-rank k of G is higher than 2, then the Weyl chamber flow is a higher rank hyperbolic $\mathbb{R}^k$-action.

An example of semi-simple Lie group is SL(n, ℝ). Let A be the diagonal subgroup of matrices with positive entries in SL(n, ℝ). An example of Weyl chamber flow that will be discussed in the sequel is the action of A by right translations on Γ\SL(n, ℝ), where Γ is a cocompact lattice. In this case the centralizer M is trivial. The rank of this action is n − 1. The picture of the Weyl chambers for n = 3 is shown in Fig. 1. The signs that appear in each chamber are the signs of half of the Lyapunov exponents of a regular element from the chamber with respect to a certain fixed basis. For this action, the Lyapunov exponents appear in pairs of opposite signs.

An example of higher rank lattice Anosov action that will be discussed in the sequel is the standard action of SL(n, ℤ) on the torus $\mathbb{T}^n$, $(A, x) \mapsto Ax$, $A \in SL(n, \mathbb{Z})$, $x \in \mathbb{T}^n$. SL(n, ℤ) is a (noncocompact!) lattice in SL(n, ℝ). As shown in (Katok and Lewis 1991), this action is generated by Anosov diffeomorphisms.
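The claims made about the two 3×3 integer matrices A and B of the ℤ²-action example in this section (determinant 1, commutation, no eigenvalue of absolute value 1) can be checked by direct computation. The Python sketch below is only an illustration: the entries are taken as printed in the example, the helper names (`matmul`, `det3`, `char_poly3`, `real_roots`, `has_real_hyperbolic_spectrum`) are hypothetical and not from the source, and the root finder assumes that all eigenvalues are real, simple, and lie in [-16, 16], which happens to hold for these two matrices.

```python
def matmul(X, Y):
    """Product of two square integer matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def det3(M):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = M
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def char_poly3(M):
    """Coefficients (1, c2, c1, c0) of det(x*I - M) = x^3 + c2*x^2 + c1*x + c0."""
    trace = M[0][0] + M[1][1] + M[2][2]
    # Sum of the three principal 2x2 minors.
    minors = (M[0][0] * M[1][1] - M[0][1] * M[1][0]
              + M[0][0] * M[2][2] - M[0][2] * M[2][0]
              + M[1][1] * M[2][2] - M[1][2] * M[2][1])
    return (1, -trace, minors, -det3(M))


def real_roots(coeffs, lo=-16, hi=16):
    """Real roots of a cubic in [lo, hi]: sign changes on a grid of step 1/8,
    refined by bisection. Assumes simple roots that avoid the grid points."""
    c3, c2, c1, c0 = coeffs

    def p(x):
        return ((c3 * x + c2) * x + c1) * x + c0

    grid = [lo + k / 8 for k in range(8 * (hi - lo) + 1)]
    roots = []
    for a, b in zip(grid, grid[1:]):
        if p(a) * p(b) < 0:
            for _ in range(60):
                m = (a + b) / 2
                if p(a) * p(m) <= 0:
                    b = m
                else:
                    a = m
            roots.append((a + b) / 2)
    return roots


def has_real_hyperbolic_spectrum(M, tol=1e-9):
    """True if M has three real eigenvalues, none of absolute value 1."""
    roots = real_roots(char_poly3(M))
    return len(roots) == 3 and all(abs(abs(r) - 1) > tol for r in roots)


# Entries as printed in the example (assumed to be free of lost signs).
A = [[0, 1, 0], [0, 0, 1], [1, 8, 2]]
B = [[2, 1, 0], [0, 2, 1], [1, 8, 4]]

print(det3(A), det3(B))                                            # 1 1
print(matmul(A, B) == matmul(B, A))                                # True
print(has_real_hyperbolic_spectrum(A), has_real_hyperbolic_spectrum(B))  # True True
```

With these entries B = A + 2I, which makes the commutation immediate; the eigenvalues of B are those of A shifted by 2.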
cohomology of two arbitrary cocycles with values in compact Lie groups. Parry’s result was generalized by Schmidt (1999) to cocycles with values in Lie groups that, in addition, satisfy a center bunching condition. Niţică and Török (1995) extended Livshits’s result to cocycles with values in the group $\mathrm{Diff}^k(M)$ of $C^k$ diffeomorphisms of a compact manifold M with stably trivial bundle, $k \ge 3$. Examples of such manifolds are the tori and the spheres. In this case, the transfer map takes values in $\mathrm{Diff}^{k-3}(M)$, and it is Hölder with respect to a natural metric on $\mathrm{Diff}^{k-3}(M)$. In (Niţică and Török 1995) one can also find a generalization of Livshits’s result to generic Anosov actions, that is, actions generated by families of Anosov diffeomorphisms that do not interchange the stable and unstable directions of elements in the family. An example of such an action is the standard action of SL(n, ℤ) on the n-dimensional torus.

A question of interest is the following: if two $C^k$ cocycles, $1 \le k \le \omega$, over a hyperbolic action, are cohomologous through a continuous/measurable transfer map P, what can be said about the higher regularity of P? For real valued cocycles the question can be reduced to one about cohomologically trivial cocycles. Livshits showed that for a real valued $C^1$ cocycle cohomologous to a constant the transfer map is $C^1$. He also obtained $C^\infty$ regularity results if the action is given by hyperbolic automorphisms of a torus. After preliminary results by Guillemin and Kazhdan for geodesic flows on surfaces of negative curvature, for general hyperbolic systems the question was answered positively by de la Llave, Marco, Moriyon (1986) in the $C^\infty$ case and by de la Llave (1997) in the real analytic case. Niţică and Török (1998) considered the lift of regularity for a transfer map between two cohomologous cocycles with values in a Lie group or a diffeomorphism group. In contrast to the case of cocycles cohomologous to trivial ones, here one needs to require for the transfer map a certain amount of Hölder regularity that depends on the ratio between the expansion/contraction that appears in the base and the expansion/contraction introduced by the cocycle in the fiber. This assumption is essential, as follows from a counterexample found by de la Llave (1992).

Useful tools in this development have been results from analysis that lift the regularity of a continuous real valued function which is assumed to have higher regularity along pairs of transverse Hölder foliations. Many times the foliations are the stable and unstable ones associated to a hyperbolic system. Journé (1988) proved the $C^{n,\alpha}$ regularity of a continuous function that is $C^{n,\alpha}$ along two transverse continuous foliations with $C^{n,\alpha}$ leaves. If one is interested only in $C^\infty$ regularity, a convenient alternative is a result of Hurder and Katok (1990). This has a simpler proof and can be applied to the more general situation in which the function is regular along a web of transverse foliations. A real analytic regularity result along these lines belongs to de la Llave (1997). In certain problems, for example when working with Weyl chamber flows, it is difficult to control the regularity in enough directions to span the whole tangent space. Nevertheless, the tangent space can be generated if one considers higher brackets of good directions. A $C^\infty$ regularity result for this case belongs to Katok and Spatzier (1994b). In order to apply this result, the foliations need to be $C^\infty$ not only along the leaves, but also transversally.

An application of the above regularity results is to questions about transverse regularity of the stable and unstable foliations of the geodesic flow on certain $C^\infty$ surfaces of nonpositive curvature. For compact negatively curved $C^\infty$ surfaces, E. Hopf showed that these foliations are $C^1$, and it follows from the work of Anosov that individual leaves are $C^\infty$. Hurder and Katok (1990) showed that once the weak-stable and weak-unstable foliations of a volume-preserving Anosov flow on a compact 3-manifold are $C^2$, they are $C^\infty$.

Another application of the regularity results is to the study of invariants for $C^k$ conjugacy of hyperbolic systems. By structural stability, a small $C^1$ perturbation of a hyperbolic system is $C^0$ conjugate to the unperturbed one. The conjugacy, in general, is only Hölder. If the conjugacy is $C^1$ then it preserves the eigenvalues of the derivative at the periodic orbit. The following two results describe the invariants of smooth
and real analytic conjugacy of low dimensional hyperbolic systems. They are proved in a series of papers written in various combinations by de la Llave, Marco, Moriyon (de la Llave 1987, 1997; de la Llave and Moriyon 1988; Marco and Moriyon 1987a, b).
Let X, Y be two C^∞ (C^ω) transitive Anosov vector fields on a compact three-dimensional manifold. If they are C^0 conjugate and the eigenvalues of the derivative at the corresponding periodic orbits are the same, then the conjugating homeomorphism is C^∞ (C^ω). In particular, any C^1 conjugacy is C^∞ (C^ω).
Assume now that f, g are two C^∞ (C^ω) Anosov diffeomorphisms on a compact two dimensional manifold. If they are C^0 conjugate and the eigenvalues of the derivative at the corresponding periodic orbits are the same, then the conjugating diffeomorphism is C^∞ (C^ω). In particular, any C^1 conjugacy is C^∞ (C^ω).
An important direction was initiated by Katok, Spatzier (1994a) who studied cohomological results over hyperbolic ℤ^k or ℝ^k-actions, k ≥ 2. They show that real valued smooth/Hölder cocycles over typical classes of hyperbolic ℤ^k or ℝ^k, k ≥ 2, actions are smoothly/Hölder cohomologous to constants. These results cover, in particular, actions by hyperbolic automorphisms of a torus, and Weyl chamber flows. The proofs rely on harmonic analysis techniques, such as Fourier transform and group representations for semi-simple Lie groups.
A geometric method for cocycle rigidity was developed in (Katok et al. 2000). One constructs a differentiable form using invariant structures along stable/unstable foliations, and the commutativity of the action. The form is exact if and only if the cocycle is cohomologous to a constant one. The method covers actions on nilmanifolds satisfying a condition called TNS (Totally Non-Symplectic). This condition means that the action is higher rank abelian hyperbolic, and that the tangent space is a direct sum of invariant distributions, with each pair of these included in the stable distribution of a hyperbolic element of the action. The method was also applied to small (i.e. close to identity on a set of generators) Lie group valued cocycles. A related paper is (Niţică and Török 2003) which contains rigidity results for cocycles over (TNS) actions with values in compact Lie groups. In this situation the number of cohomology classes is finite. An example of (TNS) action is given by the action of a maximal diagonalizable subgroup of SL(n, ℤ) on 𝕋^n.
Recently Damjanović, Katok (2005) developed a new method that was applied to the action of the matrix diagonal group on Γ\SL(n, ℝ). They use techniques from (Katok and Kononenko 1996), where one finds cohomology invariants for cocycles over partially hyperbolic actions that satisfy the accessibility property. Accessibility means that one can connect any two points from the manifold supporting the partially hyperbolic dynamical system by transverse piecewise smooth paths included in stable/unstable leaves. This notion was introduced by Brin, Pesin (1974) and it is playing a crucial role in the recent surge of activity in the field of partially hyperbolic diffeomorphisms. See (Burns et al. 2001) for a recent survey of the subject. The cohomology invariants described in (Katok and Kononenko 1996) are heights of the cocycle over cycles constructed in the base out of pieces inside stable/unstable leaves. They provide a complete set of obstructions for solving the cohomology equation. A new tool introduced in (Damjanović and Katok 2005) is algebraic K-theory (Milnor 1971). The method can be extended to cocycles with non-abelian range. In (Katok and Niţică 2007) one finds related results for small cocycles with values in a Lie group or the diffeomorphism group of a compact manifold.
The equivalent of the Livshits theorem in the higher-rank setting appears to be a description of the highest cohomology rather than the first cohomology. Indeed, for higher rank partially hyperbolic actions of the torus, the intermediate cohomologies are trivial, while for the highest one the closing conditions characterize the cohomology classes. This behavior provides a generalization of Veech's cohomological result and of the Katok, Spatzier cohomological result for toral automorphisms, and was discovered by A. Katok, S. Katok (1995, 2005).
Flaminio, Forni (2003) studied the cohomological equation over the horocycle flow. It is shown
that there are infinitely many obstructions to the existence of a smooth solution. Moreover, if these obstructions vanish, then one can solve the cohomological equation. In (Forni 1997) a similar result is shown for cocycles over area preserving flows on compact higher-genus surfaces under certain assumptions that hold generically. Mieczkowski (2007) extended these techniques and studied the cohomology of parabolic higher rank abelian actions. All these results rely on noncommutative Fourier analysis, more specifically representation theory of SL(2, ℝ) and SL(2, ℂ).

Local Rigidity

Let Γ be a finitely (or compactly) generated group, G a topological group, and π : Γ → G a homomorphism. The target of local rigidity theory is to understand the space of perturbations of various homomorphisms π. Trivial perturbations of a homomorphism arise from conjugation by an arbitrary element of G. In order to rule them out, one says that π is locally rigid if any nearby homomorphism π′ (that is, π′ close to π on a finite or compact set of generators of Γ) is conjugate to π by an element g ∈ G, that is, π(γ) = g π′(γ) g^{−1} for all γ ∈ Γ. If G is path-wise connected, one can also consider deformation rigidity, meaning that any nearby continuous path of homomorphisms is conjugated to the initial one via a continuous path of elements in G that has an end in the identity.
Initial results on local rigidity are about embeddings of lattices into semi-simple Lie groups. The main results belong to Weil (1960, 1962, 1964). He showed that if G is a semi-simple Lie group that is not locally isomorphic to SL(2, ℝ) and if Γ ⊂ G is an irreducible cocompact lattice, then the natural embedding of Γ into G is locally rigid. Earlier results were obtained by Selberg (1960), Calabi, Vesentini (1960), and Calabi (1961). Selberg proved the local rigidity of the natural embedding of cocompact lattices into SL(n, ℝ). His proof used the dynamics of iterates of matrices, in particular the existence of singular directions, or walls of Weyl chambers, in the maximal diagonalizable subgroups of SL(n, ℝ). Selberg’s approach inspired Mostow (1973) to use the boundaries at infinity in his proof of strong rigidity of lattices, which in turn was crucial to the development of superrigidity due to Margulis (1991). See section “Global Rigidity” for more details.
Recall that hyperbolic dynamical systems are structurally stable. Thus they are, in a certain sense, locally rigid. We introduce now a precise definition of local rigidity in the infinite-dimensional setup. The fact that for general group actions one needs to consider different regularities for the actions, perturbations and conjugacies is apparent from the description of structural stability for Anosov systems.
A C^k action α of a finitely generated discrete group Γ on a manifold M, that is, a homomorphism α : Γ → Diff^k(M), is said to be C^{k,l,p,r} locally rigid if any C^l perturbation α̃ which is C^p close to α on a family of generators is C^r conjugate to α, i.e. there exists a C^r diffeomorphism H : M → M which conjugates α̃ to α, that is, H ∘ α̃(g) = α(g) ∘ H for all g ∈ Γ. Note that for Anosov ℤ-actions, C^{1,1,1,0} rigidity is known as structural stability. One can also introduce the notion of deformation rigidity if the initial action and the perturbation are conjugate by a continuous path of diffeomorphisms that has an end coinciding with the identity.
A weaker notion of local rigidity can be defined in the presence of invariant foliations for the initial group action and for the perturbation. The map H is now required to preserve the leaves of the foliations and to conjugate only after factorization by the invariant foliations. The importance of this notion is apparent from the leafwise conjugacy structural stability theorem of Hirsch, Pugh, Shub (1977). See section “Basic Definitions and Examples”. Moreover, for Anosov flows this is the natural notion of structural stability, and appears by taking the invariant foliation to be the one-dimensional orbit foliation. For more general actions, of lattices or higher rank abelian groups, this property is often used in combination with cocycle rigidity in order to show local rigidity. We discuss more about this when we review local rigidity results for partially hyperbolic actions.
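The conjugacy equation H ∘ α̃(g) = α(g) ∘ H from the definition of C^{k,l,p,r} local rigidity can be checked numerically in a toy case. The following Python sketch is only an illustration under stated assumptions, none of which come from the results surveyed here: the generator α is taken to be the hyperbolic automorphism of 𝕋^2 given by the matrix with rows (2, 1) and (1, 1), the map H is a hypothetical explicit shear of the torus, and the perturbation α̃ is defined as H^{−1} ∘ α ∘ H, so it is conjugate to α by construction. The sketch therefore shows what the equation asserts; it does not test the rigidity of an arbitrary perturbation.

```python
import math
import random

EPS = 0.05  # hypothetical size of the coordinate change; any small value works


def wrap(t):
    """Reduce a real number to the circle R/Z."""
    return t % 1.0


def alpha(p):
    """Unperturbed generator: the automorphism [[2, 1], [1, 1]] of the 2-torus."""
    x, y = p
    return (wrap(2 * x + y), wrap(x + y))


def H(p):
    """A smooth torus diffeomorphism close to the identity (illustrative choice)."""
    x, y = p
    return (wrap(x + EPS * math.sin(2 * math.pi * y)), y)


def H_inv(p):
    """Explicit inverse of H."""
    x, y = p
    return (wrap(x - EPS * math.sin(2 * math.pi * y)), y)


def alpha_tilde(p):
    """Perturbed generator H^{-1} o alpha o H, conjugate to alpha by design."""
    return H_inv(alpha(H(p)))


def torus_dist(p, q):
    """Sup-distance on the torus, taking the wrap-around into account."""
    return max(min(abs(a - b), 1.0 - abs(a - b)) for a, b in zip(p, q))


def max_conjugacy_defect(samples=1000, seed=0):
    """Largest sampled distance between H(alpha_tilde(p)) and alpha(H(p))."""
    rng = random.Random(seed)
    points = [(rng.random(), rng.random()) for _ in range(samples)]
    return max(torus_dist(H(alpha_tilde(p)), alpha(H(p))) for p in points)
```

Calling max_conjugacy_defect() returns the largest sampled distance between H(α̃(p)) and α(H(p)). For this construction it should be at the level of floating point rounding, while α̃ itself differs visibly from α at a generic point.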
We summarize now several developments in local rigidity that emerged in the 80s. Initial results (Lewis 1991; Zimmer 1990) were about infinitesimal rigidity, that is, a weaker version of local rigidity suitable for discrete group representations in infinite-dimensional spaces of smooth vector fields. Then Hurder (1992) proved C^{∞,∞,1,∞} deformation rigidity and Katok, Lewis, Zimmer (Katok and Lewis 1991, 1996; Katok et al. 1996) proved C^{∞,∞,1,∞} local rigidity of the standard action of SL(n, ℤ), n ≥ 3, on the n-dimensional torus. In these results crucial use was made of the presence of an Anosov element in the action. Due to the uniqueness of the conjugacy coming from structural stability, one has a continuous candidate for the conjugacy between the actions. Margulis, Qian (2001) used the existence of a spanning family of directions that are hyperbolic for certain elements of the action to show local rigidity of partially hyperbolic actions that are not hyperbolic. Another important tool present in many proofs is Margulis and Zimmer superrigidity. These results allow one to produce a measurable conjugacy for the perturbation. Then one shows that the conjugacy has higher regularity using the presence of hyperbolicity. Having enough directions to span the whole tangent space is essential to lift the regularity. A cocycle to which superrigidity can be applied is the derivative cocycle.
The study of local rigidity of partially hyperbolic actions that contain a compact factor was initiated by Niţică, Török (1995, 2001). Let n ≥ 3 and d ≥ 1. Let ρ be the action of SL(n, ℤ) on 𝕋^{n+d} = 𝕋^n × 𝕋^d given by ρ(A)(x, y) = (Ax, y), x ∈ 𝕋^n, y ∈ 𝕋^d, A ∈ SL(n, ℤ). Then, for K ≥ 1, (Niţică and Török 1995) shows that ρ is C^{∞,∞,5,K−1} deformation rigid. The proof is based on three results in hyperbolic dynamics: the generalization of Livshits’s cohomological results to cocycles with values in diffeomorphism groups, the extension of Livshits’s result to general Anosov actions, and a version of the Hirsch, Pugh, Shub structural stability theorem improving the regularity of the conjugacy.
Assume now n ≥ 3 and K ≥ 1. If ρ is the action of SL(n, ℤ) on 𝕋^{n+1} = 𝕋^n × 𝕋 given by ρ(A)(x, y) = (Ax, y), x ∈ 𝕋^n, y ∈ 𝕋, A ∈ SL(n, ℤ), then (Niţică and Török 2001) shows that the action ρ is C^{∞,∞,2,K−1} locally rigid. Ingredients in the proof are two rigidity results, one about TNS actions, and one about actions of property (T) groups. A locally compact group has property (T) if the trivial representation is isolated in the Fell topology. This means that if G acts on a Hilbert space unitarily and it has almost invariant vectors, then it has invariant vectors. The Hirsch–Pugh–Shub theorem implies that perturbations of abelian partially hyperbolic actions of product type are conjugated to skew-products of abelian Anosov actions via cocycles with values in diffeomorphism groups. In addition, the TNS property implies that the sum of the stable and unstable distributions of any regular element of the perturbation is integrable. The leaves of the integral foliation are closed, covering the base simply. Thus one obtains a conjugacy between the perturbation and a product action. Property (T) is used to show that the conjugacy reduces the perturbed action to a family of perturbations of hyperbolic actions. But the last ones are already known to be conjugate to the hyperbolic action in the base.
Recent important progress in the question of local rigidity of lattice actions was made by Fisher, Margulis (2003, 2004, 2005). Their proofs are modeled along the proof of Weil’s local rigidity result (Weil 1964) and use an analog of Hamilton’s (1982) hard implicit function theorem. Let G be a connected semi-simple Lie group with all simple factors of rank at least two, and Γ ⊂ G a lattice. The main result shows that a volume preserving affine action ρ of G or Γ on a compact smooth manifold X is C1,1,1,1 locally rigid. Lower regularity results are also available. A component of the proof shows that if Γ is a group with property (T), X a compact smooth manifold, and ρ a smooth action of Γ on X by Riemannian isometries, then ρ is C1,1,1,1 locally rigid. An earlier local rigidity result for this type of actions by cocompact lattices was obtained by Benveniste (2000).
Many lattices act naturally on “boundaries” of type G/P, where G is a semi-simple algebraic Lie group and P is a parabolic subgroup. An example is given by G = SL(2, ℝ) and P the subgroup in G consisting of upper triangular matrices. Local
rigidity results for this type of actions were found by Ghys (1985), Kanai (1996) and Katok, Spatzier (1997).
Starting with the work of Katok and Lewis, a related direction was the study of local rigidity for higher rank abelian actions. They prove in (Katok and Lewis 1991) the C^{∞,∞,1,∞} local rigidity of the action of a ℤ^n maximal diagonalizable (over ℝ) subgroup of SL(n + 1, ℤ), n ≥ 2, acting on the torus 𝕋^{n+1}. These types of results were later pushed forward by Katok, Spatzier (1997). Using the theory of nonstationary normal forms developed in (Guysinsky 2002; Guysinsky and Katok 1998) by Katok, Guysinsky, they proved several local rigidity results. The first one assumes that G is a semi-simple Lie group with all simple factors of rank at least two, Γ a lattice in G, N a nilpotent Lie group and Λ a lattice in N. Then any Anosov affine action of Γ on N/Λ is C^{∞,∞,1,∞} locally rigid. Second, let ℤ^d be a group of affine transformations of N/Λ for which the derivatives are simultaneously diagonalizable over ℝ with no eigenvalues on the unit circle. Then the ℤ^d-action on N/Λ is C^{∞,∞,1,∞} locally rigid. A related result for continuous groups is the C^{∞,∞,1,∞} local rigidity (after factorization by the orbit foliation) of the action of a maximal abelian ℝ-split subgroup in an ℝ-split semi-simple Lie group of real rank at least two on G/Λ, where Λ is a cocompact lattice in G.
One can also study rigidity of higher rank abelian partially hyperbolic actions that are not hyperbolic. Natural examples appear as automorphisms of tori and as variants of Weyl chamber flows. For the case of ergodic actions by automorphisms of a torus, this was investigated using a version of the KAM (Kolmogorov, Arnold, Moser) method by Damjanović, Katok (2007). As usual in the KAM method, one starts with a linearization of the conjugacy equation. At each step of the iterative KAM scheme, some twisted cohomological equations are solved. The existence of the solutions is forced by the ergodicity of the action and the higher rank assumptions. Diophantine conditions present in this case allow one to control the fixed loss of regularity which is necessary for the convergence of these solutions to a conjugacy.

Global Rigidity

The first remarkable result in global rigidity belongs to Mostow (1973). For G a connected non-compact semi-simple Lie group not locally isomorphic to SL(2, ℝ), and two irreducible cocompact lattices Γ_1, Γ_2 ⊂ G, Mostow showed that any isomorphism θ from Γ_1 into Γ_2 extends to an isomorphism of G into itself. G has an involution s whose fixed set is a maximal compact subgroup K. One constructs the symmetric Riemannian space X = G/K. To each chamber of X corresponds a parabolic group and these parabolic groups are endowed with a Tits geometry similar to the projective geometry of lines, planes etc. formed in the classical case when G = PGL(n, ℝ). The proof of Mostow’s result starts by building a θ-equivariant pseudo-isometric map f : G/K_1 → G/K_2. The map f induces an incidence preserving θ-equivariant isomorphism f_0 of the Tits geometries. By Tits’ generalized fundamental theorem of projective geometry, f_0 is induced by an isomorphism of G. Finally, θ(g) = f_0 g f_0^{−1} gives the desired conclusion.
The next remarkable result in global rigidity is Margulis’ superrigidity theorem. An account of this development can be found in the monograph (Margulis 1991). For large classes of irreducible lattices in semi-simple Lie groups, this result classifies all finite dimensional representations. Let G be a semi-simple simply connected Lie group of rank higher than two and Γ < G an irreducible lattice. Then any linear representation π of Γ is almost the restriction of a linear representation of G. That is, there exists a linear representation π_1 of G and a bounded image representation π_2 of Γ such that π = π_1 π_2. The possible representations π_2 are also classified by Margulis up to some facts concerning finite image representations. As in the case of Mostow’s result, the proof involved the analysis of a map defined on the boundary at infinity. In this case the map is studied using deep results from dynamics like the multiplicative ergodic theorem of Oseledec (1968) or the theory of random walks on groups developed by Furstenberg (1963). An important consequence of Margulis’ superrigidity result is the
arithmeticity of irreducible lattices in connected semi-simple Lie groups of rank higher than two. A basic example of arithmetic lattice can be obtained by taking the integer points in a semi-simple Lie group that is a matrix group, like taking SL(n, ℤ) inside SL(n, ℝ). Special cases of superrigidity theorems were proved by Corlette (1992) and Gromov, Schoen (1992) for the rank one groups Sp(1, n) and respectively F_4 using the theory of harmonic maps. A consequence is the arithmeticity of lattices in these groups. Some of these results are put into differential geometric setting in (Mok et al. 1993).
Margulis’ superrigidity result was extended to cocycles by Zimmer. A detailed exposition, including a self contained presentation of several rigidity results of Margulis, can be found in the monograph (Zimmer 1984). We mention here a version of this result that can be found in (Fisher and Margulis 2003). Let M be a compact manifold, H a matrix group, P = M × H, and Γ a lattice in a simply connected, semi-simple Lie group with all factors of rank higher than two. Assume that Γ acts on M and H in a way that makes the projection from P to M equivariant. Moreover, the action of Γ on P is measure preserving and ergodic. Then there exists a measurable map s : M → H, a representation π : G → H, a compact subgroup K < H which commutes with π(G) and a measurable map k : Γ × M → K such that γ s(m) = k(γ, m) π(γ) s(γ m). One can easily check from the last equation that k is a cocycle. So, up to a measurable change of coordinates given by the map s, the action of Γ on P is a compact extension via a cocycle of a linear representation of G.
Developing further the method of Mostow for studying the Tits building associated to a symmetric space of non-positive curvature led Ballmann, Brin, Eberlein, Spatzier (1985a, b) to a number of characterizations of symmetric spaces. In particular, they showed that if M is a complete Riemannian manifold of non-positive curvature, finite volume, with simply connected cover, irreducible and of rank at least two, then M is isometric to a symmetric space with the connected component of Isom(M) having no compact factors.
A topological rigidity theorem has been proved by Farrell, Jones (1989). They showed that if N is a complete connected Riemannian manifold whose sectional curvature lies in a closed interval included in (−∞, 0], and M is a topological manifold of dimension greater than 5, then any proper homotopy equivalence f : M → N is properly homotopic to a homeomorphism. In particular, if M and N are both compact connected negatively curved Riemannian manifolds with isomorphic fundamental groups, then M and N are homeomorphic.
As in the case of local rigidity, a source of inspiration for results in global rigidity was the theory of hyperbolic systems, in particular their classification. The only known examples of Anosov diffeomorphisms are hyperbolic automorphisms of infranilmanifolds. Moreover, any Anosov diffeomorphism on an infranilmanifold is topologically conjugate to a hyperbolic automorphism (Franks 1970; Manning 1974). It is conjectured that any Anosov diffeomorphism is topologically conjugate to a hyperbolic automorphism of an infranilmanifold. Partial results are obtained in (Newhouse 1970), where the conjecture is proved for Anosov diffeomorphisms with codimension one stable/unstable foliation. The proof of the general conjecture has eluded all efforts so far. It is not even known if any Anosov diffeomorphism is topologically transitive, that is, if it has a dense orbit. A few positive results are available. Let M be a C^∞ compact manifold endowed with a C^∞ affine connection. Let f be a topologically transitive Anosov diffeomorphism preserving the connection and such that the stable and unstable distributions are C^∞. Then Benoist, Labourie (1993) proved that f is C^∞ conjugate to a hyperbolic automorphism of an infranilmanifold.
The situation for Anosov flows is somewhat different. As shown in (Franks and Williams 1980), there exist Anosov flows that are not topologically transitive, so a general analog of the conjecture is false. Nevertheless, for the case of codimension one stable or unstable foliation, it is conjectured in (Verjovsky 1974) that any Anosov flow on a manifold of dimension greater than three admits a global cross-section. This would
imply that the flow is topologically conjugate to the suspension of a linear automorphism of a torus.
For actions of groups larger than ℤ, or ℝ, global classification results are more abundant. A useful strategy in this type of results, which are quite technical, is to start by obtaining a measurable description of the action, most of the time using Margulis–Zimmer superrigidity results, and then use extra assumptions on the action, such as the presence of a hyperbolic element, or the presence of an invariant geometric structure, or both, in order to show that the measurable model is actually continuous or even differentiable. For actions of higher rank Lie groups and their lattices some representative papers are by Katok, Lewis, Zimmer (1996) and Goetze, Spatzier (1999). For actions of higher rank abelian groups see Kalinin, Spatzier (2007).

Measure Rigidity

Measure rigidity is the study of invariant measures for actions of one parameter and multi parameter abelian groups and semigroups acting on manifolds. Typical situations when interesting rigidity phenomena appear are for one parameter unipotent actions and higher rank hyperbolic actions, discrete or continuous.
A unipotent matrix is one all of whose eigenvalues are one. An important case where the action of a unipotent flow appears is that of the horocycle flow. The invariant measures for it were studied by Furstenberg (1973), who showed that the horocycle flow on a compact surface is uniquely ergodic, that is, the ergodic measure is unique. Dani and Smillie (1984) extended this result to the case of non-compact surfaces, with the only other ergodic measures appearing being those supported on compact horocycles. An important breakthrough is the work of Margulis (1989), who solved a long standing question in number theory, Oppenheim’s conjecture. The conjecture is about the density properties of the values of an indefinite quadratic form in three or more variables, provided the form is not proportional to a rational form. The proof of the conjecture is based on the study of the orbits for unipotent flows acting by translations on the homogeneous space SL(n, ℤ)\SL(n, ℝ). All these results were special cases of the Raghunathan conjecture about the structure of the orbits of the actions of unipotent flows on homogeneous spaces. Raghunathan’s conjecture was proved in full generality by Ratner (1991a, b). Borel, Prasad (1992) raised the question of an analog of Raghunathan’s conjecture for S-algebraic groups. S-algebraic groups are products of real and p-adic algebraic groups. This was answered independently by Margulis, Tomanov (1994) and Ratner (1995).
A basic example of higher rank abelian hyperbolic action is given by the action of S_{m,n}, the multiplicative semigroup of endomorphisms generated by the multiplication by m and n, two nontrivial integers, on the one dimensional torus 𝕋^1. In a pioneering paper (Furstenberg 1967) Furstenberg showed that for m, n that are not powers of the same integer the action of S_{m,n} has a unique closed, infinite invariant set, namely 𝕋^1 itself. Since there are many closed, infinite invariant sets for multiplication by m, and by n, this result shows a remarkable rigidity property of the joint action. Furstenberg’s result was generalized later by Berend for other group actions on higher dimensional tori and on other compact abelian groups in (Berend 1983, 1984).
Furstenberg further opened the field by raising the following question:

Conjecture 1 Let μ be a S_{m,n}-invariant and ergodic probability measure on 𝕋^1. Then μ is either an atomic measure supported on a finite union of (rational) periodic orbits or μ is the Lebesgue measure.

While the statement appears to be simple, proving it has been elusive. The first partial result was given by Lyons (1988) under the strong additional assumption that the measure makes one of the endomorphisms generating the action exact. Later Rudolph (1990) and Johnson (1992) weakened the exactness assumption and proved
de la Llave R (1997) Analytic regularity of solutions of Livshits’s cohomology equation and some applications to analytic conjugacy of hyperbolic dynamical systems. Ergodic Theory Dynam Syst 17:649–662
de la Llave R, Moriyon R (1988) Invariants for smooth conjugacy of hyperbolic dynamical systems. IV. Commun Math Phys 116:185–192
de la Llave R, Marco JM, Moriyon R (1986) Canonical perturbation theory of Anosov systems and regularity results for the Livšic cohomology equation. Ann Math 123:537–611
Einsiedler M, Katok A (2003) Invariant measures on G/Γ for split simple Lie groups. Commun Pure Appl Math 56:1184–1221
Einsiedler M, Lindenstrauss E (2003) Rigidity properties of ℤ^d-actions on tori and solenoids. Electron Res Announc AMS 9:99–110
Einsiedler M, Katok A, Lindenstrauss E (2006) Invariant measures and the set of exceptions to Littlewood conjecture. Ann Math 164:513–560
Farrell FT, Jones LE (1989) A topological analog of Mostow’s rigidity theorem. J AMS 2:237–370
Feldman J (1993) A generalization of a result of R Lyons about measures on [0,1]. Israel J Math 81:281–287
Fisher D (2006) Local rigidity of group actions: past, present, future. In: Dynamics, ergodic theory and geometry (2007). Cambridge University Press
Fisher D, Margulis GA (2003) Local rigidity for cocycles. In: Surv Diff Geom VIII. International Press, Cambridge, pp 191–234
Fisher D, Margulis GA (2004) Local rigidity of affine actions of higher rank Lie groups and their lattices. 2003
Fisher D, Margulis GA (2005) Almost isometric actions, property T, and local rigidity. Invent Math 162:19–80
Flaminio L, Forni G (2003) Invariant distributions and time averages for horocycle flows. Duke Math J 119:465–526
Forni G (1997) Solutions of the cohomological equation for area-preserving flows on compact surfaces of higher genus. Ann Math 146:295–344
Franks J (1970) Anosov diffeomorphisms. In: Chern SS, Smale S (eds) Global analysis (Proc Symp Pure Math, XIV, Berkeley 1968). AMS, Providence, pp 61–93
Franks J, Williams R (1980) Anomalous Anosov flows. In: Global theory of dynamical systems, Proc Inter Conf Evanston, 1979. Lecture notes in mathematics, vol 819. Springer, Berlin, pp 158–174
Furstenberg H (1963) A Poisson formula for semi-simple Lie groups. Ann Math 77:335–386
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Syst Theory 1:1–49
Furstenberg H (1973) The unique ergodicity of the horocycle flow. In: Recent advances in topological dynamics, Proc Conf Yale Univ, New Haven 1972. Lecture notes in mathematics, vol 318. Springer, Berlin, pp 95–115
Ghys E (1985) Actions localement libres du groupe affine. Invent Math 82:479–526
Goetze E, Spatzier R (1999) Smooth classification of Cartan actions of higher rank semi-simple Lie groups and their lattices. Ann Math 150:743–773
Gromov M, Schoen R (1992) Harmonic maps into singular spaces and p-adic superrigidity for lattices in groups of rank one. Publ Math IHES 76:165–246
Guysinsky M (2002) The theory of nonstationary normal forms. Ergodic Theory Dynam Syst 22:845–862
Guysinsky M, Katok A (1998) Normal forms and invariant geometric structures for dynamical systems with invariant contracting foliations. Math Res Lett 5:149–163
Hamilton R (1982) The inverse function theorem of Nash and Moser. Bull AMS 7:65–222
Helgason S (1978) Differential geometry, Lie groups and symmetric spaces. Academic, New York
Hirsch M, Pugh C, Shub M (1977) Invariant manifolds. Lecture notes in mathematics, vol 583. Springer, Berlin
Host B (1995) Nombres normaux, entropie, translations. Israel J Math 91:419–428
Hurder S (1992) Rigidity of Anosov actions of higher rank lattices. Ann Math 135:361–410
Hurder S, Katok A (1990) Differentiability, rigidity and Godbillon-Vey classes for Anosov flows. Publ Math IHES 72:5–61
Johnson AS (1992) Measures on the circle invariant under multiplication by a nonlacunary subsemigroup of integers. Israel J Math 77:211–240
Journé JL (1988) A regularity lemma for functions of several variables. Rev Mat Iberoam 4:187–193
Kalinin B, Katok A (2002) Measurable rigidity and disjointness for ℤ^k-actions by toral automorphisms. Ergodic Theory Dynam Syst 22:507–523
Kalinin B, Katok A (2007) Measure rigidity beyond uniform hyperbolicity: invariant measures for Cartan actions on tori. J Modern Dyn 1:123–146
Kalinin B, Spatzier R (2007) On the classification of Cartan actions. Geom Funct Anal 17:468–490
Kanai M (1996) A new approach to the rigidity of discrete group actions. Geom Funct Anal 6:943–1056
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems. Encyclopedia of mathematics and its applications, vol 54. Cambridge University Press, Cambridge
Katok A, Katok S (1995) Higher cohomology for abelian groups of toral automorphisms. Ergodic Theory Dynam Syst 15:569–592
Katok A, Katok S (2005) Higher cohomology for abelian groups of toral automorphisms. II. The partially hyperbolic case, and corrigendum. Ergodic Theory Dynam Syst 25:1909–1917
Katok A, Kononenko A (1996) Cocycles’ stability for partially hyperbolic systems. Math Res Lett 3:191–210
Katok A, Lewis J (1991) Local rigidity for certain groups of toral automorphisms. Israel J Math 75:203–241
Katok A, Lewis J (1996) Global rigidity results for lattice actions on tori and new examples of volume-preserving actions. Israel J Math 93:253–280
Katok A, Niţică V (2007) Rigidity of higher rank abelian cocycles with values in diffeomorphism groups. Geom Dedicata 124:109–131
Katok A, Niţică V. Differentiable rigidity of abelian group actions. Cambridge University Press (to appear)
Katok A, Spatzier R (1994a) First cohomology of Anosov actions of higher rank abelian groups and applications to rigidity. Publ Math IHES 79:131–156
Katok A, Spatzier R (1994b) Subelliptic estimates of polynomial differential operators and applications to rigidity of abelian actions. Math Res Lett 1:193–202
Katok A, Spatzier R (1996) Invariant measures for higher-rank abelian actions. Ergodic Theory Dynam Syst 16:751–778; Katok A, Spatzier R (1998) Corrections to: Invariant measures for higher-rank abelian actions. Ergodic Theory Dynam Syst 18:503–507
Margulis GA (1997) Oppenheim conjecture. In: Fields medalists lectures, vol 5. World Sci Ser 20th Century Math. World Sci Publ, River Edge, pp 272–327
Margulis GA (2000) Problems and conjectures in rigidity theory. In: Mathematics: frontiers and perspectives. AMS, Providence, pp 161–174
Margulis GA, Qian N (2001) Local rigidity of weakly hyperbolic actions of higher rank real Lie groups and their lattices. Ergodic Theory Dynam Syst 21:121–164
Margulis GA, Tomanov G (1994) Invariant measures for actions of unipotent groups over local fields on homogeneous spaces. Invent Math 116:347–392
Mieczkowski D (2007) The first cohomology of parabolic actions for some higher-rank abelian groups and representation theory. J Modern Dyn 1:61–92
Milnor J (1971) Introduction to algebraic K-theory. Princeton University Press, Princeton
Mok N, Siu YT, Yeung SK (1993) Geometric superrigidity. Invent Math 113:57–83
Katok A, Spatzier R (1997) Differential rigidity of Mostow GD (1973) Strong rigidity of locally symmetric
Anosov actions of higher rank abelian groups and spaces. Ann Math studies, vol 78. Princeton University
algebraic lattice actions. Trudy Mat Inst Stek 216: Press, Princeton
292–319 Newhouse SE (1970) On codimension one Anosov
Katok A, Lewis J, Zimmer R (1996) Cocycle superrigidity diffeomorphisms. Am J Math 92:761–770
and rigidity for lattice actions on tori. Topology 35:27–38 Niţică V, Török A (1995) Cohomology of dynamical sys-
Katok A, Niţică V, Török A (2000) Non-abelian cohomol- tems and rigidity of partially hyperbolic actions of
ogy of abelian Anosov actions. Ergodic Theory Dynam higher rank lattices. Duke Math J 79:751–810
Syst 2:259–288 Niţică V, Török A (1998) Regularity of the transfer map for
Katok A, Katok S, Schmidt K (2002) Rigidity of measur- cohomologous cocycles. Ergodic Theory Dynam Syst
able structure for Zd-actions by automorphisms of a 18:1187–1209
torus. Comment Math Helv 77:718–745 Niţică V, Török A (2001) Local rigidity of certain partially
Kazhdan DA (1967) On the connection of a dual space of a hyperbolic actions of product type. Ergodic Theory
group with the structure of its closed subgroups. Funkc Dynam Syst 21:1213–1237
Anal Prilozen 1:71–74 Niţică V, Török A (2002) On the cohomology of Anosov
Lewis J (1991) Infinitezimal rigidity for the action of SLn(- actions. In: Rigidity in dynamics and geometry, Cam-
Z) on Tn. Trans AMS 324:421–445 bridge, 2000. Springer, Berlin, pp 345–361
Lindenstrauss E (2005) Rigidity of multiparameter actions. Niţică V, Török A (2003) Cocycles over abelian TNS
Israel Math J 149:199–226 actions. Geom Dedicata 102:65–90
Lindenstrauss E (2006) Invariant measures and arithmetic Oseledec VI (1968) A multiplicative ergodic theorem.
quantum unique ergodicity. Ann Math 163:165–219 Characteristic Lyapunov, exponents of dynamical sys-
Livshits A (1971) Homology properties of Y-systems. tems. Trudy Mosk Mat Obsc 19:179–210
Math Zametki 10:758–763 Parry W (1999) The Livšic periodic point theorem for non-
Livshits A (1972) Cohomology of dynamical systems. abelian cocycles. Ergodic Theory Dynam Syst 19:
Izvestia 6:1278–1301 the Livšic cohomology equation. 687–701
Ann Math 123:537–611 Pugh C, Shub M (1972) Ergodicity of Anosov actions.
Lyons R (1988) On measures simultaneously 2- and Invent Math 15:1–23
3-invariant. Israel J Math 61:219–224 Ratner M (1991a) On Ragunathan’s measure conjecture.
Manning A (1974) There are no new Anosov Ann Math 134:545–607
diffeomorphisms on tori. Am J Math 96:422–429 Ratner M (1991b) Ragunathan’s topological conjecture
Marco JM, Moriyon R (1987a) Invariants for smooth and distributions of unipotent flows. Duke Math J 63:
conjugacy of hyperbolic dynamical systems. 235–280
I. Commun Math Phys 109:681–689 Ratner M (1995) Raghunathan’s conjecture for Cartesians
Marco JM, Moriyon R (1987b) Invariants for smooth products of real and p-adic Lie groups. Duke Math J 77:
conjugacy of hyperbolic dynamical systems. III. 275–382
Commun Math Phys 112:317–333 Rudolph D (1990) 2 and 3 invariant measures
Margulis GA (1989) Discrete subgroups and ergodic the- andentropy. Ergodic Theory Dynam Syst 10:395–406
ory. In: Number theory, trace formulas and discrete Schmidt K (1999) Remarks on Livšic’ theory for nonabelian
groups, Oslo, 1987. Academic, Boston, pp 277–298 cocycles. Ergodic Theory Dynam Syst 19:703–721
Margulis GA (1991) Discrete subgroups of semi-simple Selberg A (1960) On discontinuous groups in higher-
Lie groups. Springer, Berlin dimensional symmetric spaces. In: Contributions to
function theory. Inter colloq function theory, Bombay. Tata Institute of Fundamental Research, pp 147–164
Smale S (1967) Differentiable dynamical systems. Bull AMS 73:747–817
Spatzier R (1995) Harmonic analysis in rigidity theory. In: Ergodic theory and its connections with harmonic analysis. Alexandria, 1993. London Math Soc Lect Notes Ser, vol 205. Cambridge University Press, Cambridge, pp 153–205
Veech WA (1986) Periodic points and invariant pseudo-measures for toral endomorphisms. Ergodic Theory Dynam Syst 6:449–473
Verjovsky A (1974) Codimension one Anosov flows. Bull Soc Math Mex 19:49–77
Weil A (1960) On discrete subgroups of Lie groups. I. Ann Math 72:369–384
Weil A (1962) On discrete subgroups of Lie groups. II. Ann Math 75:578–602
Weil A (1964) Remarks on the cohomology of groups. Ann Math 80:149–157
Zimmer R (1984) Ergodic theory and semi-simple groups. Birkhäuser, Boston
Zimmer R (1987) Actions of semi-simple groups and discrete subgroups. In: Proc Inter Congress of Math (1986). AMS, Providence, pp 1247–1258
Zimmer R (1990) Infinitesimal rigidity of smooth actions of discrete subgroups of Lie groups. J Differ Geom 31:301–322

Books and Reviews
de la Harpe P, Valette A (1989) La propriété (T) de Kazhdan pour les groupes localement compacts. Astérisque 175
Feres R (1998) Dynamical systems and semi-simple groups: an introduction. Cambridge tracts in mathematics, vol 126. Cambridge University Press, Cambridge
Feres R, Katok A (2002) Ergodic theory and dynamics of G-spaces. In: Handbook in dynamical systems, vol 1A. Elsevier, Amsterdam, pp 665–763
Gromov M (1988) Rigid transformation groups. In: Bernard D, Choquet-Bruhat Y (eds) Géométrie Différentielle (Paris, 1986). Hermann, Paris, pp 65–139. Travaux en Cours 33
Kleinbock D, Shah N, Starkov A (2002) Dynamics of subgroup actions on homogeneous spaces of Lie groups and applications to number theory. In: Handbook in dynamical systems, vol 1A. Elsevier, Amsterdam, pp 813–930
Knapp A (2002) Lie groups beyond an introduction, 2nd edn. Progress in mathematics, 140. Birkhäuser, Boston
Raghunathan MS (1972) Discrete subgroups of Lie groups. Springer, Berlin
Witte MD (2005) Ratner’s theorems on unipotent flows. Chicago lectures in mathematics. University of Chicago Press, Chicago
Chaos and Ergodic Theory

Jérôme Buzzi
C.N.R.S. and Université Paris-Sud, Orsay, France

Article Outline

Glossary
Definition of the Subject
Introduction
Picking an Invariant Probability Measure
Tractable Chaotic Dynamics

Hyperbolicity A measure is hyperbolic in the sense of Pesin if at almost every point no Lyapunov exponent is zero. See ▶ “Smooth Ergodic Theory”.
Kolmogorov typicality A property is typical in the sense of Kolmogorov for a topological space $F$ of parametrized families $f = (f_t)_{t \in U}$, $U$ being an open subset of $\mathbb{R}^d$ for some $d \ge 1$, if it holds for $f_t$ for Lebesgue almost every $t$ and topologically generic $f \in F$.
Lyapunov exponents The Lyapunov exponents (▶ Smooth Ergodic Theory) are the limits, when they exist, $\lim_{n\to\infty} \frac{1}{n} \log \|(T^n)'(x) \cdot v\|$
Definition of the Subject

Chaotic dynamical systems are those which present unpredictable and/or complex behaviors. The existence and importance of such systems have been known at least since Hadamard (1898) and Poincaré (1892); however, they became well known only in the sixties. We refer to (Bergelson 2006; Collet et al. 2005; Fiedler 2002; Guckenheimer 1979; Gutzwiller 1990; Hasselblatt and Katok 2002; Murray 2002; Puu 2000; Saari 2005; Starkov 2000) for the relevance of such dynamics in other fields, mathematical or not (see also ▶ “Ergodic Theory: Interactions with Combinatorics and Number Theory”, ▶ “Ergodic Theory: Fractal Geometry”). The numerical simulations of

Attempts at Definition
We note that, like many ideas (Stewart 1964), chaos is not captured by a single mathematical definition, despite several attempts (see, e.g., (Blanchard et al. 2002; Glasner and Weiss 1993; Kolyada 2004; Ruette 2003b) for some discussions, as well as the monographs on chaotic dynamics (Arnol’d 1988; Arnol’d and Avez 1968; Brain and Berger 2001; Brin and Stuck 2002; Collet and Eckmann 2006; Eckmann and Ruelle 1985; Guckenheimer and Holmes 1990; Hasselblatt and Katok 2003; Robinson 2004; Viana 1997b; Young 1995)). Let us give some of the most well-known definitions, which have been given mostly from the topological point of view, i.e., in the setting of a self-map $T : X \to X$ on a compact metric space whose distance is denoted by $d$:
$T$ has sensitivity to initial conditions on $X' \subset X$ if there exists a constant $\rho > 0$ such that for every $x \in X'$, there exists $y \in X$, arbitrarily close to $x$, with a finite separating time:

$$\exists n \ge 0 \text{ such that } d(T^n y, T^n x) > \rho.$$

In other words, any uncertainty on the exact value of the initial condition $x$ makes $T^n(x)$ completely unknown for $n$ large enough. If $X$ is a manifold, then sensitivity to initial conditions in the sense of Guckenheimer (1979) means that the previous phenomenon occurs for a set $X'$ with nonzero volume.
$T$ is chaotic in the sense of Devaney (1989) if it admits a dense orbit and if the periodic points are dense in $X$. It implies sensitivity to initial conditions on $X$.
$T$ is chaotic in the sense of Li and Yorke (1975) if there exists an uncountable subset $X' \subset X$ of points, such that, for all $x \ne y \in X'$,

$$\liminf_{n\to\infty} d(T^n x, T^n y) = 0 \quad\text{and}\quad \limsup_{n\to\infty} d(T^n x, T^n y) > 0.$$

$T$ has generic chaos in the sense of Lasota (Piorek 1985) if the set

$$\Bigl\{(x, y) \in X \times X : \liminf_{n\to\infty} d(T^n x, T^n y) = 0 < \limsup_{n\to\infty} d(T^n x, T^n y)\Bigr\}$$

is topologically generic (see “Glossary”) in $X \times X$.
Topological chaos is also sometimes characterized by nonzero topological entropy (▶ Entropy in Ergodic Theory): there exist exponentially many orbit segments of a given length. This implies chaos in the sense of Li and Yorke by (Blanchard et al. 2002).
As we shall see, ergodic theory describes a number of chaotic properties, many of them implying some or all of the above topological ones. The main such property for a smooth dynamical system, say a $C^{1+\alpha}$-diffeomorphism of a compact manifold, is the existence of an invariant probability measure which is:

1. Ergodic (cannot be split) and aperiodic (not carried by a periodic orbit);
2. Hyperbolic (nearby orbits converge or diverge at a definite exponential rate);
3. Sinai–Ruelle–Bowen (as smooth as possible).

(For precise definitions we refer to ▶ “Smooth Ergodic Theory” or to the discussions below.) In particular such a situation implies nonzero entropy and sensitivity to initial conditions on a set of nonzero Lebesgue measure (i.e., positive volume).
Before starting our survey in earnest, we shall describe an elementary and classical example, the full tent map, on which the basic phenomena can be analyzed in a very elementary way. Then, in section “Picking an Invariant Probability Measure”, we shall give some motivations for introducing probability theory in the description of chaotic but deterministic systems, in particular the unpredictability of their individual orbits. We define two of the most relevant classes of invariant measures: the physical measures and those maximizing entropy. It is unknown in which generality these measures exist and can be analyzed but we describe in section “Tractable Chaotic Dynamics” the major classes of dynamics for which this has been done. In section “Statistical Properties” we describe some of the finer statistical properties that have been obtained for such good chaotic systems: sums of observables along orbits are statistically indistinguishable from sums of independent and identically distributed random variables. Section “Orbit Complexity” is devoted to the other side of chaos: the complexity of these dynamics and how, again, this complexity can be analyzed, and sometimes classified, using ergodic theory. Section “Stability” describes perhaps the most striking aspect of chaotic dynamics: the instability of individual orbits is linked to various forms of stability of the global dynamics. Finally we conclude by mentioning some of the most important topics that we could not address and we list some possible future directions.
Caveat. The subject-matter of this article is somewhat fuzzy and we have taken advantage of this to steer our path towards some of our favorite
theorems and to avoid the parts we know less (some of which are listed below). We make no pretense at exhaustivity, either in the topics or in the selected results, and we hope that our colleagues will excuse our shortcomings.

Remark 1 In this article we only consider compact, smooth and finite-dimensional dynamical systems in discrete time, i.e., defined by self-maps. In particular, we have omitted the natural and important variants applying to flows, e.g., evolutions defined by ordinary differential equations, but we refer to the textbooks (see, e.g., Arnol’d 1988; Hasselblatt and Katok 2002; Katok and Hasselblatt 1995) for these.

Elementary Chaos: A Simple Example
We start with a toy model: the full tent map $T$ of Fig. 1. Observe that for any point $x \in [0, 1]$, $T^{-n}(x) = \{(\sigma(k, n)x + k)2^{-n} : k = 0, 1, \dots, 2^n - 1\}$, where $\sigma(k, n) = \pm 1$. Hence $\bigcup_{n \ge 0} T^{-n}(x)$ is dense in $[0, 1]$. It easily follows that $T$ exhibits sensitive dependence on initial conditions. Even worse in this example, the qualitative asymptotic behavior can be completely changed by this arbitrarily small perturbation: $x$ may have a dense orbit whereas $y$ is eventually mapped to a fixed point! This is Devaney chaos (Devaney 1989).
This kind of instability was first discovered by J. Hadamard (1898) in his study of the geodesic flow (i.e., the frictionless movement of a point mass constrained to remain on a surface). At that time, such an unpredictability was considered a purely mathematical pathology, necessarily devoid of any physical meaning [Duhem qualified Hadamard’s result as “an example of a mathematical deduction which can never be used by physics” (see pp. 206–211 in (Duhem 1906))!].
Returning to our tent map, we can be more quantitative. At any point $x \in [0, 1]$ whose orbit never visits 1/2, the Lyapunov exponent $\lim_{n\to\infty} \frac{1}{n} \log |(T^n)'(x)|$ is $\log 2$. (See the “Glossary”.) Such a positive Lyapunov exponent corresponds to infinitesimally close orbits getting separated exponentially fast. This can be observed in Fig. 2. Note how this exponential speed creates a rather sharp transition.
It follows in particular that experimental or numerical errors can grow very quickly to size 1, [In double precision arithmetic the uncertainty is about $10^{-16}$, which grows to size 1 in about 53 iterations of $T$, since $2^{53} \approx 10^{16}$.] i.e., the approximate orbit may contain after a while no information about the true orbit. This casts a doubt on the reliability of simulations. Indeed, a simulation of $T$ on most computers will suggest that all orbits quickly converge to 0, which is completely false [Such a collapse to 0 does really occur but only for a countable subset of initial conditions in $[0, 1]$ whereas the points with dense orbit make a subset of $[0, 1]$
with full Lebesgue measure (see below). This artefact comes from the way numbers are represented – and approximated – on the computer: multiplication by even integers tends to “simplify” binary representations. Thus the computations involved in drawing Fig. 2 cannot be performed too naively.]. Though somewhat atypical in its dramatic character, this failure illustrates the unpredictability and instability of individual orbits in chaotic systems.
Does this mean that all quantitative predictions about orbits of $T$ are to be forfeited? Not at all, if we are ready to change our point of view and look beyond a single orbit. This can be seen easily in this case. Let us start with such a global analysis from the topological point of view. Associate to $x \in [0, 1]$ a sequence $i(x) = i = i_0 i_1 i_2 \dots$ of 0s and 1s according to $i_k = 0$ if $T^k x \le 1/2$, $i_k = 1$ otherwise. One can check that [Up to a countable set of exceptions.] $\{i(x) : x \in [0, 1]\}$ is the set $\Sigma_2 := \{0, 1\}^{\mathbb{N}}$ of all infinite sequences of 0s and 1s and that at most one $x \in [0, 1]$ can realize a given sequence as $i(x)$.
Notice how the transformation $T$ becomes trivial in this representation:

$$i(T(x)) = i_1 i_2 i_3 \dots \quad\text{if}\quad i(x) = i_0 i_1 i_2 i_3 \dots$$

Thus $T$ is represented by the simple and universal “left-shift” on sequences, which is denoted by $\sigma$. This representation of a rather general dynamical system by the left-shift on a space of sequences is called symbolic dynamics (▶ Symbolic Dynamics), (Lind and Marcus 1995).
This can be a very powerful tool. Observe for instance how here it makes obvious that we have complete combinatorial freedom over the orbits of $T$: one can easily build orbits with various asymptotic behaviors. To give two examples of the richness of the dynamics: if a sequence of $\Sigma_2$ contains all the finite sequences of 0s and 1s, then the corresponding point has a dense orbit; if the sequence is periodic, then the corresponding point is itself periodic.
More quantitatively, the number of distinct subsequences of length $n$ appearing in sequences $i(x)$, $x \in [0, 1]$, is $2^n$. It follows that the topological entropy (▶ Entropy in Ergodic Theory) of $T$ is $h_{top}(T) = \log 2$. [For the coincidence of the entropy and the Lyapunov exponent see below.] The positivity of the topological entropy can be considered as the signature of the complexity of the dynamics and as the definition, or at least the stamp, of a topologically chaotic dynamics.
Let us move on to a probabilistic point of view. Pick $x \in [0, 1]$ randomly according to, say, the uniform law in $[0, 1]$. It is then routine to check that $i(x)$ follows the (1/2, 1/2)-Bernoulli law: the probability that, for any given $k$, $i_k(x) = 0$ is 1/2 and the $i_k$ are independent. Thus $i(x)$, seen as a sequence of random 0s and 1s when $x$ is subject to the uniform law on $[0, 1]$, is statistically indistinguishable from coin tossing! This important remark leads to quantitative predictions. For instance, the strong law of large numbers implies that, for Lebesgue-almost every $x \in [0, 1]$ (i.e., for all $x \in [0, 1]$ except in a set of Lebesgue measure zero), the fraction of the time spent in any dyadic interval $I = [k 2^{-N}, \ell 2^{-N}] \subset [0, 1]$, $k, \ell, N \in \mathbb{N}$, by the orbit of $x$,

$$\lim_{n\to\infty} \frac{1}{n} \#\{0 \le k < n : T^k x \in I\} \qquad (1)$$

exists and is equal to the length, $(\ell - k) 2^{-N}$, of that interval. [Eq. (1) in fact holds for any interval $I$. This implies that the orbit of almost every $x \in [0, 1]$ visits all subintervals of $[0, 1]$, i.e., the orbit is dense: in complete contradiction with the above mentioned numerical simulation!]
More generally, we shall see that, if $\varphi : [0, 1] \to \mathbb{R}$ is any continuous function, then, for Lebesgue-almost every $x$,

$$\lim_{n\to\infty} \frac{1}{n} \sum_{k=0}^{n-1} \varphi(T^k x) \text{ exists and is equal to } \int_0^1 \varphi(x)\, dx. \qquad (2)$$

Using strong mixing properties (▶ Ergodicity and Mixing Properties) of the Lebesgue measure under $T$, one can prove further properties, e.g., sensitivity on initial conditions in the sense of Guckenheimer [The Lebesgue measure is weak-
a complex proof of the obvious fact that the time average at $x = 0$ (some set of full measure!) and the ensemble average with respect to $\delta_0$ are both equal to $\varphi(0)$. In the second case, we obtain a very general proof of the above Eq. (2).
Another type of example is provided by the contracting map $S : [0, 1] \to [0, 1]$, $S(x) = x/2$. $S$ has a unique invariant probability measure, $\delta_0$. For the Birkhoff theorem the situation is the same as that of $T$ and $\delta_0$: it asserts only that the orbit of 0 is described by $\delta_0$.
One can understand the Birkhoff theorem as a (first and rather weak) stability result: the time averages are independent of the initial condition, almost surely with respect to $\mu$.

Physical Measures
In the above silly example $S$, much more is true than the conclusion of the Birkhoff theorem: all points of $[0, 1]$ are described by $\delta_0$. This leads to the definition of the basin of a probability measure $\mu$ for a self-map $f$ of a space $M$:

$$\mathcal{B}(\mu) := \Bigl\{x \in M : \forall \varphi : M \to \mathbb{R} \text{ continuous, } \lim_{n\to\infty} \frac{1}{n} \sum_{k=0}^{n-1} \varphi(f^k x) = \int \varphi\, d\mu \Bigr\}.$$

If $M$ is a manifold, then there is a notion of volume and one can make the following definition. A physical measure is a probability measure whose basin has nonzero volume in $M$. Say that a dynamical system $f : M \to M$ on a manifold has a finite statistical description if there exist finitely many invariant probability measures $\mu_1, \dots, \mu_n$ the union of whose basins is the whole of $M$, up to a set of zero Lebesgue measure.
Physical measures are among the main subjects of interest as they are expected to be exactly those that are “experimentally visible”. Indeed, if $x_0 \in \mathcal{B}(\mu)$ and $\epsilon_0 > 0$ is small enough, then, by the Lebesgue density theorem, a point $x$ picked according to, say, the uniform law in the ball $B(x_0, \epsilon_0)$ of center $x_0$ and radius $\epsilon_0$, will be in $\mathcal{B}(\mu)$ with probability almost 1 and therefore its ergodic averages will be described by $\mu$. Hence “experiments” can be expected to follow the physical measures and this is what is numerically observed in most situations (see however the caveat in the discussion of the full tent map).
The existence of a finite statistical description (or even of a physical measure) is, as we shall see, neither automatic nor routine to prove. Attracting periodic points as in the above silly example provide a first type of physical measures. The Birkhoff ergodic theorem implies that absolutely continuous ergodic invariant measures, usually obtained from some expansion property, give another class of physical measures. These contracting and expanding types can be combined in the class of Sinai–Ruelle–Bowen measures (Ledrappier 1984), which are the invariant measures absolutely continuous “along expanding directions” (see ▶ Smooth Ergodic Theory for the precise but technical definition). Any Sinai–Ruelle–Bowen measure which is ergodic and without zero Lyapunov exponent [That is, the set of points $x \in M$ such that $\lim_{n\to\infty} \frac{1}{n} \log \|(f^n)'(x) \cdot v\| = 0$ for some $v \in T_x M$ has zero measure.] is a physical measure. Conversely, “most” physical measures [For counter-examples see (Hofbauer and Keller 1990).] are of this type (Tsujii 2005; Vasquez 2007).

Measures of Maximum Entropy
For all parameters $t \in [3.96, 4]$, the quadratic maps $Q_t(x) = tx(1 - x)$, $Q_t : [0, 1] \to [0, 1]$, have nonzero topological entropy (de Melo and van Strien 1993) and exponentially many periodic points (Hofbauer 1985):

$$\lim_{n\to\infty} \frac{\#\{x \in [0, 1] : Q_t^n(x) = x\}}{e^{n h_{top}(Q_t)}} = 1.$$

On the other hand, by a deep theorem (Graczyk and Świątek 1997; Lyubich 1997) there is an open and dense subset of $t \in [0, 4]$ such that $Q_t$ has a unique physical measure concentrated on a periodic orbit! Thus the physical measures can completely miss the topological complexity (and in particular the distribution of the periodic points). Hence one must look at other measures to get a statistical description of the complexity of such $Q_t$. Such a description is often provided by measures of maximum entropy $\mu_M$ whose
measured entropy [The usual phrases are “measure-theoretic entropy”, “metric entropy”.] (▶ Entropy in Ergodic Theory) satisfies:

$$h(f, \mu_M) = \sup_{\mu \in \mathcal{M}(f)} h(f, \mu) \stackrel{1}{=} h_{top}(f).$$

$\mathcal{M}(f)$ is the set of all invariant measures. [One can restrict this to the ergodic invariant measures without changing the value of the supremum (▶ Entropy in Ergodic Theory).] Equality 1 above is the variational principle: it holds for all continuous self-maps of compact metric spaces. One can say that the ergodic complexity (the complexity of $f$ as seen by its invariant measures) captures the full topological complexity (defined by counting all orbits).

Remark 3 The variational principle implies the existence of “complicated invariant measures” as soon as the topological entropy is nonzero (see (Bonano and Collet 2006) for a setting in which this is of interest).

Maximum entropy measures do not always exist. However, if $f$ is $C^\infty$ smooth, then maximum entropy measures exist by a theorem of Newhouse (1989) and they indeed describe the topological complexity in the following sense. Consider the probability measures:

$$\mu_{n,\epsilon} := \frac{1}{n\, \# E(n, \epsilon)} \sum_{k=0}^{n-1} \sum_{x \in E(n, \epsilon)} \delta_{f^k x}$$

where $E(n, \epsilon)$ is an arbitrary $(\epsilon, n)$-separated subset [See ▶ Entropy in Ergodic Theory: $\forall x, y \in E(n, \epsilon)$, $x \ne y \implies \exists\, 0 \le k < n$, $d(f^k x, f^k y) \ge \epsilon$.] of $M$ with maximum cardinality. Then the accumulation points, for the weak star topology on the space of probability measures on $M$, of $\mu_{n,\epsilon}$ when $n \to \infty$ and then $\epsilon \to 0$ are maximum entropy measures (Misiurewicz 1976a).
Let us quote two important additional properties, discovered by Margulis (2004), that often hold for the maximum entropy measures:

• The equidistribution of periodic points with respect to some maximum entropy measure $\mu_M$:

$$\mu_M = \lim_{n\to\infty} \frac{1}{\#\{x \in X : x = f^n x\}} \sum_{x = f^n x} \delta_x.$$

• The holonomy invariance, which can be loosely interpreted by saying that the past and the future are independent conditionally on the present.

Other Points of View
Many other invariant measures are of interest in various contexts and we have made no attempt at completeness: for instance, invariant measures maximizing dimension (Gatzouras and Peres 1997; Pesin 1997), or pressure in the sense of the thermodynamical formalism (Katok and Hasselblatt 1995; Ruelle 2004), or some energy (Anantharaman 2004; Contreras et al. 2001; Jenkinson 2007), or quasi-physical measures describing the dynamics around saddle-type invariant sets (Eckmann and Ruelle 1985) or in systems with holes (Chernov et al. 2000).

Tractable Chaotic Dynamics

The Palis Conjecture
There is, at this point, no general theory allowing the analysis of all dynamical systems or even of most of them, despite many recent and exciting developments in the theory of generic $C^1$-diffeomorphisms (Bonatti et al. 2005; Crovisier 2006a). In particular, the question of the generality in which physical measures exist remains open. One would like generic systems to have a finite statistical description (see section “Physical Measures”). This fails in some examples but these look exceptional and the following question is asked by Palis (2000):
Is it true that any dynamical system defined by a $C^r$-diffeomorphism on a compact manifold can be transformed by an arbitrarily small $C^r$-perturbation to another dynamical system having a finite statistical description?
This is completely open though widely believed [Observe, however, that such a statement is false for conservative diffeomorphisms with
high order smoothness as KAM theory implies stable existence of invariant tori foliating a subset of positive volume.]. Note that such a good description is not possible for all systems (see, e.g., Hofbauer and Keller 1990; Newhouse 1974). Note that one would really like to ask about unperturbed “typical” [The choice of the notion of typicality is a delicate issue. The Newhouse phenomenon shows that among $C^2$-diffeomorphisms of multidimensional compact manifolds, one cannot use topological genericity and get a positive answer. Popular notions are prevalence and Kolmogorov genericity – see the “Glossary”.] dynamical systems in a suitable sense, but of course this is even harder.
One is therefore led to make simplifying assumptions: typically of small dimension, uniform expansion/contraction or geometry.

Uniformly Expanding/Hyperbolic Systems
The most easily analyzed systems are those with uniform expansion and/or contraction, namely the uniformly expanding maps and uniformly hyperbolic diffeomorphisms, see ▶ “Smooth Ergodic Theory”. [We require uniform hyperbolicity on the so-called chain recurrent set. This is equivalent to the usual Axiom-A and no-cycle condition.] An important class of examples is obtained as follows. Consider $A : \mathbb{R}^d \to \mathbb{R}^d$, a linear map preserving $\mathbb{Z}^d$ (i.e., $A$ is a matrix with integer coefficients in the canonical basis) so that it defines a map $A : \mathbb{T}^d \to \mathbb{T}^d$ on the torus. If there is a constant $\Lambda > 1$ such that for all $v \in \mathbb{R}^d$, $\|A \cdot v\| \ge \Lambda \|v\|$, then $A$ is a uniformly expanding map. If $A$ has determinant $\pm 1$ and no eigenvalue on the unit circle, then $A$ is a uniformly hyperbolic diffeomorphism (▶ Smooth Ergodic Theory) (see also Brin and Stuck 2002; Katok and Hasselblatt 1995; Robinson 2004; Shub 1987). Moreover all $C^1$-perturbations of the previous examples are again uniformly expanding or uniformly hyperbolic. [One can define uniform hyperbolicity for flows and an important class of examples is provided by the geodesic flow on compact manifolds with negative sectional curvature (Katok and Hasselblatt 1995).] These uniform systems are sometimes called “strongly chaotic”.

Remark 4 The Mañé Stability Theorem (see below) shows that uniform hyperbolicity is a very natural notion. One can also understand, on a more technical level, uniform hyperbolicity as what is needed to apply an implicit function theorem in some functional space (see, e.g., Shub 1987).

The existence of a finite statistical description for such systems has been proved since the 1970s by Bowen, Ruelle and Sinai (Bowen and Ruelle 1975; Ruelle 1976; Sinai 1972) (the expanding case is much simpler (Krzyżewski and Szlenk 1969)).

Theorem 1 Let $f : M \to M$ be a $C^{1+\alpha}$ map of a compact manifold. Assume $f$ to be (i) a uniformly expanding map on $M$ or (ii) a uniformly hyperbolic diffeomorphism.

• $f$ admits a finite statistical description by ergodic and hyperbolic Sinai–Ruelle–Bowen measures (absolutely continuous in case (i)).
• $f$ has finitely many ergodic maximum entropy measures, each of which makes $f$ isomorphic to a finite state Markov chain. The periodic points are uniformly distributed according to some canonical average of these ergodic maximum entropy measures.
• $f$ is topologically conjugate [Up to some negligible subset.] to a subshift of finite type (see the “Glossary”).

The construction of absolutely continuous invariant measures for a uniformly expanding map $f$ can be done in a rather direct way by considering the pushed forward measures $\frac{1}{n} \sum_{k=0}^{n-1} f^k_* \mathrm{Leb}$ and taking weak star limits while preventing the appearance of singularities, by, e.g., bounding some Hölder norm of the density using expansion and distortion of $f$.
The classical approach to the uniformly hyperbolic dynamics (Bowen 1975; Ruelle 2004; Shub 1987) is through symbolic dynamics and coding. Under the above hypothesis one can build a finite partition of $M$ which is tailored to the dynamics (a Markov partition) so that the corresponding symbolic dynamics has a very simple structure: it is a full shift $\{1, \dots, d\}^{\mathbb{Z}}$, like in the example of the full tent map, or a subshift of finite type. The above problems can then be solved using the thermodynamical formalism inspired from the statistical mechanics of one-dimensional ferromagnets (Ruelle 1968): ergodic properties are

In this full generality, one already obtains significant results:

• The entropy is bounded by the expansion:
Lasota and Yorke (1973) found a suitable framework. They considered $C^2$ interval maps $T$ with $|T'(x)| \ge \mathrm{const} > 1$ except at finitely many points. They used the Ruelle transfer operator directly on the interval. Namely they studied

$$(Lf)(x) = \sum_{y \in T^{-1}x} \frac{f(y)}{|T'(y)|}$$

acting on functions $f : [0, 1] \to \mathbb{R}$ with bounded variation and obtained the invariant density as the eigenfunction associated to the eigenvalue 1. One can then prove a Lasota–Yorke inequality (which might more accurately be called Doeblin–Fortet since it was introduced in the theory of Markov chains much earlier):

$$\|Lf\|_{BV} \le \alpha \|f\|_{BV} + \beta \|f\|_1 \qquad (3)$$

where $\|\cdot\|_{BV}$, $\|\cdot\|_1$ are a strong and a weak norm, respectively, and $\alpha < 1$ and $\beta < \infty$. One can then apply general theorems (Ionescu Tulcea and Marinescu 1950) or (Nussbaum 1970) (see (Baladi 2000a) for a detailed presentation of this approach and its variants). Here $\alpha$ can essentially be taken as 2 (reflecting the locally simple discontinuities) divided by the minimum expansion: so $\alpha < 1$, perhaps after replacing $T$ with an iterate. In particular, the existence of a finite statistical description then follows (see Broise (1996a) for various generalizations and strengthenings of this result on the interval).

The situation in higher dimension is more complex for the reason explained above. One can obtain inequalities such as (3) on suitable if less simple functional spaces (see, e.g., Saussol 2000) but proving $\alpha < 1$ is another matter: discontinuities can get arbitrarily complex under iteration. (Buzzi 2001; Tsujii 2000b) show that indeed, in dimension 2 and higher, piecewise uniform expansion (with a finite number of pieces) is not enough to ensure a finite statistical description if the pieces of the map have only finite smoothness. In dimension 2, resp. 3 or more, piecewise real-analytic, resp. piecewise affine, is enough to exclude such examples (Buzzi 2000a; Tsujii 2000a), resp. (Tsujii 2001). (Cowieson 2002) has shown that, for any $r > 1$, an open and dense subset of piecewise $C^r$ and expanding maps has a finite statistical description.

Piecewise hyperbolic diffeomorphisms are more difficult to analyze though several results (conditioned on technical assumptions that can be checked in many cases) are available (Baladi and Gouëzel 2008; Chernov 1999; Sataev 1992; Young 1985).

Interval Maps with Critical Points

A more natural but also more difficult situation is a map for which the uniformity of the expansion fails because of the existence of critical points. [Note that, by a theorem of Mañé, a circle map without critical points or indifferent periodic point is either conjugate to a rotation or uniformly expanding (Mañé 1985).]

A class which has been completely analyzed at the level of the above conjecture is that of real-analytic families of maps of the interval $f_t : [0, 1] \to [0, 1]$, $t \in I$, with a unique critical point, the main example being the quadratic family $Q_t(x) = t x (1 - x)$ for $0 \le t \le 4$.

It is not very difficult to find quadratic maps with the following two types of behavior:

(stable) the orbit of Lebesgue-almost every $x \in [0, 1]$ tends to an attracting periodic orbit;
(chaotic) there is an absolutely continuous invariant probability measure $\mu$ whose basin contains Lebesgue-almost every $x \in [0, 1]$.

To realize the first it is enough to arrange the critical point to be periodic. One can easily prove that this stable behavior occurs on an open set of parameters; thus it is stable with respect to the parameter or the dynamical system. The second occurs for $Q_4$ with $\mu = dx / (\pi \sqrt{x(1 - x)})$. It is much more difficult to show that this chaotic behavior occurs for a set of parameters of positive Lebesgue measure. This is a theorem of Jakobson (1981) for the quadratic family (see (Young and Wang 2006) for a recent variant). Let us sketch two main ingredients of the various proofs of this theorem. The first is inducing: around Lebesgue-almost every point $x \in [0, 1]$ one tries to find a
time $t(x)$ and an interval $J(x)$ such that $f^{t(x)} : J(x) \to f^{t(x)}(J(x))$ is a map with good expansion and distortion properties. This powerful idea appears in many disguises in the non-uniform hyperbolic theory (see for instance (Hofbauer 1979; Young 1998)). The second ingredient is parameter exclusion: one removes the parameters at which a good inducing scheme cannot be built. More precisely one proceeds inductively, performing simultaneously the inducing and the exclusion, the good properties of the early stage of the inducing allowing one to control the measure of the parameters that need to be excluded to continue (Benedicks and Carleson 1985; Jakobson 1981). Indeed, the expansion established at a given stage allows one to transfer estimates in the dynamical space to the parameter space.

Using methods from complex analysis and renormalization theory one can go much further and prove the following difficult theorems (actually the product of the work of many people, including Avila, Graczyk, Kozlovski, Lyubich, de Melo, Moreira, Shen, van Strien, Świątek), which in particular solve the Palis conjecture in this setting:

Theorem 2 (Graczyk and Świątek 1997; Kozlovski et al. 2007; Lyubich 1997) Stable maps (that is, such that Lebesgue almost every orbit converges to one of finitely many periodic orbits) form an open and dense set among $C^r$ interval maps, for any $r \ge 2$. [In fact this is even true for polynomials.]

Theorem 3 (Avila and Moreira 2005; Avila et al. 2003; Graczyk and Świątek 1997; Kozlovski 2003; Lyubich 1997) Let $f_t : [0, 1] \to [0, 1]$, $t \in [t_0, t_1]$, be a real-analytic family of unimodal maps. Assume that it is not degenerate [$f_{t_0}$ and $f_{t_1}$ are not conjugate]. Then:

• The set of $t$ such that $f_t$ is chaotic in the above sense has positive Lebesgue measure;
• The set of $t$ such that $f_t$ is stable is open and dense;
• The remaining set of parameters has zero Lebesgue measure. [This set of “strange parameters” of zero Lebesgue measure has however positive Hausdorff dimension according to work of Avila and Moreira. In particular each of the following situations is realized on a set of parameters $t$ of positive Hausdorff dimension: non-existence of the Birkhoff limit at Lebesgue-almost every point, the physical measure is $\delta_p$ for $p$ a repelling fixed point, the physical measure is non-ergodic.]

We note that the theory underlying the above theorem yields many more results, including a very paradoxical rigidity of typical analytic families as above. See (Avila and Moreira 2005).

Non-uniform Expansion/Contraction

Beyond dimension 1, only partial results are available. The most general of those assume uniform contraction or expansion along some direction, restricting the non-uniform behavior to an invariant sub-bundle often one-dimensional, or “one-dimensional-like”.

A first, simpler situation is when there is a dominated decomposition with a uniformly expanding term: there is a continuous and invariant splitting of the tangent bundle, $T_\Lambda M = E^{uu} \oplus E^{cs}$, over some $\Lambda$ an attracting set: for all unit vectors $v^u \in E^{uu}$, $v^c \in E^{cs}$,

Standard techniques (pushing the Riemannian volume of a piece of unstable leaf and taking limits) allow the construction of Gibbs u-states as introduced by (Pesin and Sinaĭ 1982).

Theorem 4 (Alves–Bonatti–Viana (2000)) [A slightly different result is obtained in (Bonatti and Viana 2000).] Let $f : M \to M$ be a $C^2$ diffeomorphism with an invariant compact subset
$\Lambda$. Assume that there is a dominated splitting $T_\Lambda M = E^u \oplus E^{cs}$ such that, for some $c > 0$,

$$\limsup_{n \to \infty} \frac{1}{n} \sum_{k=0}^{n-1} \log \left\| f'(f^k x)|_{E^{cs}} \right\| \le -c < 0$$

on a subset of $\Lambda$ of positive Lebesgue measure. Then this subset is contained, up to a set of zero Lebesgue measure, in the union of the basins of finitely many ergodic and hyperbolic Sinai–Ruelle–Bowen measures.

The non-invertible, purely expansive version of the above theorem can be applied in particular to the following maps of the cylinder ($d \ge 16$, $a$ is properly chosen close to 2 and $\epsilon$ is small):

$$f(\theta, x) = (d\theta \bmod 2\pi,\ a - x^2 + \epsilon \sin(\theta))$$

$\Lambda \subset M$ on which the tangent bundle has an invariant continuous decomposition $T_\Lambda M = E^+ \oplus E^-$ such that all vectors of $E^+ \setminus \{0\}$, resp. $E^- \setminus \{0\}$, have positive, resp. negative, Lyapunov exponents. Then any zero-noise limit measure $\mu$ of $f$ is a Sinai–Ruelle–Bowen measure and therefore, if it is ergodic and hyperbolic, a physical measure.

One can hope that, typically, the latter ergodicity and hyperbolicity assumptions are satisfied (see, e.g., Baraviera and Bonatti 2003).

By pushing classical techniques and introducing new ideas for generic maps with one expanding and one weakly contracting direction, Tsujii has been able to prove the following generic result (which can be viewed as a 2-dimensional extension of some of the one-dimensional results above: one adds a uniformly expanding direction):
For almost twenty years the question of the existence of such an attractor (as opposed to an attracting periodic orbit with a very long period) remained open. Indeed, one knew since Newhouse that, for many such maps, there exist infinitely many such periodic orbits which are very difficult to distinguish numerically. But in 1991 Benedicks and Carleson succeeded in proposing an argument refining (with considerable difficulties) their earlier proof of Jakobson's one-dimensional theorem and established the first part of the following theorem:

Theorem 7 (Benedicks–Carleson 1991) For any $\epsilon > 0$, for $|b|$ small enough, there is a set $A$ with $\mathrm{Leb}(A) > 0$ satisfying: for all $a \in A$, there exists $z \in W^u(P)$ such that

• The orbit of $z$ is dense in $W^u(P)$;
• $\liminf_{n \to \infty} \frac{1}{n} \log \|(f^n)'(z)\| > 0$.

Further properties were then established, especially by Benedicks, Viana, Wang, Young (Benedicks and Viana 2001; Benedicks and Young 1993, 2000; Young and Wang 2001). Let us quote the following theorem of Wang and Young which includes the previous results:

Theorem 8 (Young and Wang 2001) Let $T_{ab} : S^1 \times [-1, 1] \to S^1 \times [-1, 1]$ be such that

• $T_{a0}(S^1 \times [-1, 1]) \subset S^1 \times \{0\}$;
• For $b > 0$, $T_{ab}$ is a diffeomorphism on its image with $c^{-1} b \le |\det T_{ab}(x, y)| \le c b$ for some $c > 1$ and all $(x, y) \in S^1 \times [-1, 1]$ and all $(a, b)$.

Let $f_a : S^1 \to S^1$ be the restriction of $T_{a0}$. Assume that $f = f_0$ satisfies:

• Non-degenerate critical points: $f'(c) = 0 \Rightarrow f''(c) \ne 0$;
• Negative Schwarzian derivative: for all $x \in S^1$ non-critical, $f'''(x)/f'(x) - \frac{3}{2} (f''(x)/f'(x))^2 < 0$;
• No indifferent or attracting periodic point, i.e., $x$ such that $f^n(x) = x$ and $|(f^n)'(x)| \le 1$;
• Misiurewicz condition: $d(f^n c, d) > c > 0$ for all $n \ge 1$ and all critical points $c, d$.

Assume the following transversality condition on $f$ at $a = 0$: for every critical point $c$, $(d/da)(f_a(c_a) - p_a) \ne 0$ if $c_a$ is the critical point of $f_a$ near $c$ and $p_a$ is the point having the same itinerary under $f_a$ as $f(c)$ under $f$. Assume the following non-degeneracy of $T$: $f_0'(c) = 0 \Rightarrow \partial T_{00}(c, 0)/\partial y \ne 0$.

• $T_{ab}$ restricted to a neighborhood of $S^1 \times \{0\}$ has a finite statistical description by a number of hyperbolic Sinai–Ruelle–Bowen measures bounded by the number of critical points of $f$;
• There is exponential decay of correlations and a Central Limit Theorem (see below), except, in an obvious way, if there is a periodic interval with period $> 1$;
• There is a natural coding of the orbits that remain forever close to $S^1 \times \{0\}$ by a closed invariant subset of a full shift.

Very importantly, the above dynamical situation has been shown to occur near typical homoclinic tangencies: (Mora and Viana 1993) proved that there is an open and dense subset of the set of all $C^3$ families of diffeomorphisms unfolding a first homoclinic tangency such that the above holds. However (Palis and Yoccoz 2001) shows that the set of parameters with a Hénon-like attractor has zero Lebesgue density at the bifurcation itself, at least under an assumption on the so-called stable and unstable Hausdorff dimensions. (Diaz et al. 1996) establishes positive density for another type of bifurcation. Furthermore (Moreira et al. 2001) has related the Hausdorff dimensions to the abundance of uniformly hyperbolic dynamics near the tangency.

Viana (1993) is able to treat situations with more than one contracting direction. More recently (Young and Wang 2008) has proposed a rather general framework, with easily checkable assumptions, in order to establish the existence of such dynamics in various applications. See also (Guckenheimer et al. 2006; Wang and Young 2003) for applications.
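As a numerical illustration of the second property in Theorem 7, the following minimal sketch estimates the largest Lyapunov exponent along one orbit of a Hénon map. It assumes the classical form (x, y) ↦ (1 − a x² + y, b x), which is not written out in the passage above, and the frequently used parameters a = 1.4, b = 0.3; nothing here asserts that these parameters belong to the set A of the theorem. The function name and its default arguments are illustrative choices, not part of the cited works.

```python
import math


def henon_lyapunov(a=1.4, b=0.3, n_iter=20000, n_burn=1000):
    """Finite-time estimate of the largest Lyapunov exponent of the
    (assumed) Henon map (x, y) -> (1 - a*x**2 + y, b*x)."""
    x, y = 0.0, 0.0
    # Discard a transient so that the orbit settles near the attractor.
    for _ in range(n_burn):
        x, y = 1.0 - a * x * x + y, b * x

    vx, vy = 1.0, 0.0  # tangent vector, renormalised at every step
    total = 0.0
    for _ in range(n_iter):
        # Jacobian at (x, y) is [[-2*a*x, 1], [b, 0]]; apply it to (vx, vy).
        vx, vy = -2.0 * a * x * vx + vy, b * vx
        norm = math.hypot(vx, vy)
        total += math.log(norm)
        vx, vy = vx / norm, vy / norm
        # Advance the base point.
        x, y = 1.0 - a * x * x + y, b * x
    return total / n_iter


if __name__ == "__main__":
    print(henon_lyapunov())
```

A positive finite-time estimate is only suggestive: as the text stresses, an attracting periodic orbit with a very long period is hard to distinguish numerically from a genuine strange attractor, which is precisely why the theorem is difficult.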
$\kappa_{d+1} \in \{0, \dots, d\}^{\mathbb{N}}$ if $d$ is the number of critical/discontinuity points. [The kneading invariants are the suitably defined (left and right) itineraries of the critical/discontinuity points and endpoints.] Namely,

$$\Sigma(T) = \left\{ \alpha \in \{0, \dots, d\}^{\mathbb{N}} : \forall n \ge 0,\ \kappa^{+}_{\alpha_n} \prec \sigma^n \alpha \prec \kappa^{-}_{\alpha_n + 1} \right\}$$

where $\prec$ is a total order on $\{0, \dots, d\}^{\mathbb{N}}$ making the coding $x \mapsto \alpha$ non-decreasing. Observe how the kneading invariants determine $\Sigma(T)$ in an effective way: knowing their first $n$ symbols is enough to know the sequences of length $n$ which begin sequences of $\Sigma(T)$. We refer to (de Melo and van Strien 1993) for the wealth of information that can be extracted from these kneading invariants following Milnor and Thurston (1988).

This form of global simplicity can be extended to other classes of non-uniformly expanding maps, including those like Eq. (4), using the notions of subshifts and puzzles of quasi-finite type (Buzzi 2005, 2006). This leads to the notion and analysis of entropy-expanding maps, a new open class of non-uniformly expanding maps admitting critical hypersurfaces, defined purely in terms of entropies, including the otherwise intractable examples of Eq. (4).

A generalization of the representation of uniform systems by subshifts of finite type is provided by strongly positive recurrent countable state Markov shifts, a subclass of Markov shifts (See “Glossary”) which shares many properties with the subshifts of finite type (Boyle et al. 2006; Gurevic 1996; Gurevic and Savchenko 1998; Ruette 2003a; Sarig 2001).

These “simple” systems admit a classification result which in particular identifies their measures with entropy close to the maximum (Boyle et al. 2006). Such a classification generalizes (Ladler and Marcus 1979). The “ideology” here is that complexity of individual orbits in a simple setting must come from randomness, but purely random systems are classified by their entropy according to Ornstein (1970).

$$h_{\mathrm{top}}(f) = \sup_{\mu \in M(f)} h(f, \mu)$$

Stability

By definition, chaotic dynamical systems have orbits which are unstable and numerically unpredictable. It is all the more surprising that, once one agrees to consider their dynamics globally, they exhibit very good stability properties.

Structural Stability

A simple form of stability is structural stability: a system $f : M \to M$ is structurally $C^r$-stable if any system $g$ sufficiently $C^r$-close to $f$ is topologically the same as $f$, formally: $g$ is topologically conjugate, i.e., there is some homeomorphism [If $h$ were $C^1$, the conjugacy would imply, among other things, that for every $p$-periodic point $x$: $\det((f^p)'(x)) = \det((g^p)'(h(x)))$, a much too strong requirement.] $h : M \to M$ mapping the orbits of $f$ to those of $g$, i.e., $g \circ h = h \circ f$.

Andronov and Pontryaguin argued in the 1930’s that only such structurally stable systems are physically relevant. Their idea was that the model of a physical system is always known only to some degree of approximation, hence a mathematical model whose structure depends on arbitrarily small changes should be irrelevant.

A first question is: What are these structurally stable systems? The answer is quite striking:

Theorem 12 (Mañé 1988) Let $f : M \to M$ be a $C^1$ diffeomorphism of a compact manifold. $f$ is structurally stable among $C^1$-diffeomorphisms of $M$ if and only if $f$ is uniformly hyperbolic on its chain recurrent set. [A point $x$ is chain recurrent if, for all $\epsilon > 0$, there exists a finite sequence $x_0, x_1, \dots, x_n$ such that $x_0 = x_n = x$ and $d(f(x_k), x_{k+1}) < \epsilon$. The chain recurrent set is the set of all chain recurrent points.]

A basic idea in the proof of the theorem is that failure of uniform hyperbolicity gives the opportunity to make an arbitrarily small perturbation contradicting the structural stability. In higher smoothness the required perturbation lemmas (e.g., the closing lemma (Arnaud 1998; Katok and Hasselblatt 1995; Mañé 1982)) are not available.
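The remark that the first n symbols of the kneading invariants already carry the information needed at length n can be made concrete for the quadratic family Q_t(x) = t x(1 − x) discussed earlier. The sketch below computes the first n symbols of the itinerary of the critical value. It assumes a simplified two-symbol coding (0 to the left of the critical point 1/2, 1 to its right, and the placeholder symbol C when the orbit lands exactly on it) rather than the left and right itineraries used in the text; the function name is hypothetical.

```python
def kneading_prefix(t, n):
    """First n symbols of the itinerary of Q_t(c), c = 1/2,
    for the quadratic map Q_t(x) = t * x * (1 - x), 0 <= t <= 4."""
    c = 0.5
    x = c
    symbols = []
    for _ in range(n):
        x = t * x * (1.0 - x)  # advance the critical orbit by one step
        if x < c:
            symbols.append("0")
        elif x > c:
            symbols.append("1")
        else:
            symbols.append("C")  # the orbit hits the critical point exactly
    return "".join(symbols)


if __name__ == "__main__":
    print(kneading_prefix(4.0, 8))  # critical orbit 1/2 -> 1 -> 0 -> 0 -> ...
    print(kneading_prefix(2.0, 4))  # the critical point is a fixed point
```

For t = 4 the critical orbit is 1/2 → 1 → 0 → 0 → …, so the prefix is a 1 followed by 0s; for t = 2 the critical point is fixed and every symbol is C. For other parameters the exact comparison with 1/2 is fragile in floating-point arithmetic, so a careful implementation would use interval or exact arithmetic.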
We note that uniform hyperbolicity without invertibility does not imply $C^1$-stability (Przytycki 1977).

A second question is: are these stable systems dense? (So that one could offer structurally stable models for all physical situations). A deep discovery around 1970 is that this is not the case:

Theorem 13 (Abraham–Smale, Simon (Abraham and Smale 1970; Simon 1972)) For any $r \ge 1$ and any compact manifold $M$ of dimension $\ge 3$, the set of uniformly hyperbolic diffeomorphisms is not dense in the space of $C^r$ diffeomorphisms of $M$. [They use the phenomenon called “heterodimensional homoclinic intersections”.]

Theorem 14 (Newhouse 1974) For any $r \ge 2$, for any compact manifold $M$ of dimension $\ge 2$, the set of uniformly hyperbolic diffeomorphisms is not dense in the space of $C^r$ diffeomorphisms of $M$. More precisely, there exists a non-empty open subset in this space which contains a dense $G_\delta$ subset of diffeomorphisms with infinitely many periodic sinks. [So these diffeomorphisms have no finite statistical description.]

Observe that it is possible that uniform hyperbolicity could be dense among surface $C^1$-diffeomorphisms (this is the case for $C^1$ circle maps by a theorem of Jakobson (1981)).

In light of Mañé's $C^1$-stability theorem this implies that structurally stable systems are not dense, thus one can robustly see behaviors that are topologically modified by arbitrarily small perturbations (at least in the $C^1$-topology)! So one needs to look beyond these and face that topological properties of relevant dynamical systems are not determined from “finite data”. It is natural to ask whether the dynamics is almost determined by “sufficient data”.

Continuity Properties of the Topological Dynamics

Structural stability asks the topological dynamics to remain unchanged by a small perturbation. It is probably at least as interesting to ask it to change continuously. This raises the delicate question of which topology should be put on the rather wild set of topological conjugacy classes. It is perhaps more natural to associate to the system a topological invariant taking value in a more manageable set and ask whether the resulting map is continuous.

A first possibility is Zeeman’s Tolerance Stability Conjecture. He associated to each diffeomorphism the set of all the closures of all of its orbits and he asked whether the resulting map is continuous on a dense $G_\delta$ subset of the class of $C^r$-diffeomorphisms for any $r \ge 0$. This conjecture remains open; we refer to (Crovisier 2006b) for a discussion and related progress.

A simpler possibility is to consider our favorite topological invariant, the topological entropy, and thus ask whether the dynamical complexity as measured by the entropy is a stable phenomenon. $f \mapsto h_{\mathrm{top}}(f)$ is lower semicontinuous for $f$ among $C^0$ maps of the interval [On the set of interval maps with a bounded number of critical points, the entropy is continuous (Misiurewicz 1995). Also $t \mapsto h_{\mathrm{top}}(Q_t)$ is non-decreasing by complex arguments (de Melo and van Strien 1993), though it is a non-smooth function.] (Misiurewicz 1979) and for $f$ among $C^{1+\epsilon}$-diffeomorphisms of a compact surface (Katok 1980). [It is an important open question whether this actually holds for $C^1$-diffeomorphisms. It fails for homeomorphisms (Rees 1981).] In both cases, one shows the existence of structurally stable invariant uniformly expanding or hyperbolic subsets with topological entropy close to that of the whole dynamics. On the other hand $f \mapsto h_{\mathrm{top}}(f)$ is upper semicontinuous for $C^\infty$ maps (Newhouse 1989; Yomdin 1987).

Statistical Stability

Statistical stability is the property that deterministic perturbations of the dynamical system cause only small changes in the physical measures, usually with respect to the weak star topology on the space of measures. When the physical measure $\mu_g$ is uniquely defined for all systems $g$ near $f$, statistical stability is the continuity of the map $g \mapsto \mu_g$ thus defined.

Statistical stability is known in the uniform setting and also in the piecewise uniform case,
particular the Pugh–Shub program around stable ergodicity of partially hyperbolic systems (Pugh and Shub 1999); random iterations (Arnold 1998; Kifer 1986, 1988).

A number of important problems have motivated the study of special forms of chaotic dynamics: equidistribution in number theory (Eskin and McMullen 1993; Furstenberg 1981; Host and Kra 2005) and geometry (Starkov 2000); quantum chaos (Anantharaman and Nonnenmacher 2007; Gutzwiller 1990); chaotic control (Saperstone and Yorke 1971); analysis of algorithms (Vallée 2006).

We have also omitted the important problem of applying the above results. Perhaps because of the lack of a general theory, this can often be a challenge (see for instance (Tucker 1999) for the already complex problem of verifying uniform hyperbolicity for a singular flow). Liverani has shown how theoretical results can lead to precise and efficient estimates for the toy model of piecewise expanding interval maps (Liverani 2001). Ergodic theory implies that, in some settings at least, adding noise may make some estimates more precise (see Kifer 1997). We refer to (Fiedler 2001) and the references therein.

Future Directions

We conclude this article with a (very partial) selection of open problems.

General Theory

In dimension 1, we have seen that the analogue of the Palis conjecture (see above) is established (Theorem 2). However the description of the typical dynamics in the Kolmogorov sense is only known in the unimodal, non-degenerate case by Theorem 3. Indeed, results like (Bruin et al. 1996) suggest that the multimodal picture could be more complex.

In higher dimensions, our understanding is much more limited. As far as a general theory is concerned, a deep problem is the paucity of results on the generic dynamics in $C^r$ smoothness with $r > 1$. The remarkable current progress in generic dynamics (culminating in the proof of the weak Palis conjecture, see (Crovisier 2006a), the references therein and (Bonatti et al. 2005) for background) seems restricted to the $C^1$ topology because of the lack of fundamental tools (e.g., closing lemmas) in higher smoothness. But Pesin theory requires higher smoothness at least technically. This is not only a hard technical issue: generic properties of physical measures, when they have been analyzed, are often completely different between the $C^1$ case and higher smoothness (Bochi and Viana 2005).

Physical Measures

In higher dimensions, Benedicks and Carleson's analysis of the Hénon map has given rise to a rather general theory of Hénon-like maps and more generally of the dynamical phenomena associated to homoclinic tangencies. However, the proofs are extremely technical. Could they be simplified? Current attempts like (Young and Wang 2008) center on the introduction of a simpler notion of critical points, possibly a non-inductive one (Pujals and Rodriguez-Hertz 2007).

Can this Hénon theory be extended to the weakly dissipative situation? To the conservative situation (for which the standard map is a well-known example defying analysis)? In the strongly dissipative setting, what are the typical phenomena on the complement of the Benedicks–Carleson set of parameters?

From a global perspective one of the main questions is the following:

Can infinitely many sinks coexist for a large set of parameters in a typical family or is the Newhouse phenomenon atypical in the Kolmogorov or prevalent sense?

This seems rather unlikely (see however (Araujo 2001)).

Away from such “critical dynamics”, there are many results about systems with dominated splitting satisfying additional conditions. Can these conditions be weakened so they would be satisfied by typical systems satisfying some natural conditions (like robust transitivity)? For instance:

Could one analyze the physical measures of volume-hyperbolic systems?

A more specific question is whether Tsujii’s striking analysis of surface maps with one uniformly expanding direction can be extended to higher dimensions? can one weaken the
uniformity of the expansion? The same question for the corresponding invertible situation is considered in (Cowieson and Young 2005).

Maximum Entropy Measures and Topological Complexity

As we explained, $C^\infty$ smoothness, by a Newhouse theorem, ensures the existence of maximum entropy measures, making the situation a little simpler than with respect to physical measures. This existence result allows in particular an easy formulation of the problem of the typicality of hyperbolicity:

Are maximum entropy ergodic measures of systems with positive entropy hyperbolic for most systems?

A more difficult problem is that of the finite multiplicity of the maximum entropy measures. For instance:

Do typical systems possess finitely many maximum entropy ergodic measures?

More specifically, can one prove intrinsic ergodicity (i.e., uniqueness of the measure of maximum entropy) for an isolated homoclinic class of some diffeomorphisms (perhaps $C^1$-generic)?

Can a generic $C^1$-diffeomorphism carry an infinite number of homoclinic classes, each with topological entropy bounded away from zero?

A perhaps more tractable question, given the recent progress in this area: Is a $C^1$-generic partially hyperbolic diffeomorphism, perhaps with central dimension 1 or 2, intrinsically ergodic?

We have seen how uniform systems have simple symbolic dynamics, i.e., subshifts of finite type, and how interval maps and more generally entropy-expanding maps keep some of this simplicity, defining subshifts or puzzles of quasi-finite type (Buzzi 2005; Buzzi 2006). (Young and Wang 2001) have defined symbolic dynamics for topological Hénon-like maps which seems close to that of a one-dimensional system.

Can one describe Wang and Young's symbolic dynamics of Hénon-like attractors and fit it in a class in which uniqueness of the maximum entropy measure could be proved?

More generally, can one define nice combinatorial descriptions for surface diffeomorphisms? Can one formulate variants of the entropy-expansion condition [For instance building on our “entropy-hyperbolicity”.] of (Buzzi 2000b, Buzzi), that would be satisfied by a large subset of the diffeomorphisms?

Another possible approach is illustrated by the pruning front conjecture of (Cvitanović et al. 1988) (see also (de Carvalho and Hall 2002; Ishii 1997)). It is an attempt to build a combinatorial description by trying to generalize the way that, for interval maps, kneading invariants determine the symbolic dynamics by considering the bifurcations from a trivial dynamics to an arbitrary one.

We hope that our reader has shared in our fascination with this subject, the many surprising and even paradoxical discoveries that have been made and the exciting current progress, despite the very real difficulties both in the analysis of such non-uniform systems as the Hénon map and in the attempts to establish a general (and practical) ergodic theory of chaotic dynamical systems.

Acknowledgments I am grateful for the advice and/or comments of the following colleagues: P. Collet, J.-R. Chazottes, and especially S. Ruette. I am also indebted to the anonymous referee.

Bibliography

Primary Literature

Aaronson J, Denker M (2001) Local limit theorems for partial sums of stationary sequences generated by Gibbs–Markov maps. Stochastics Dyn 1:193–237
Abdenur F, Bonatti C, Crovisier S (2006) Global dominated splittings and the $C^1$ Newhouse phenomenon. Proc Am Math Soc 134(8):2229–2237
Abraham R, Smale S (1970) Nongenericity of Ω-stability. In: Global analysis, vol XIV. Proc Sympos Pure Math. American Mathematical Society, Providence
Alves JF (2000) SRB measures for non-hyperbolic systems with multidimensional expansion. Ann Sci Ecole Norm Sup 33(4):1–32
Alves JF (2006) A survey of recent results on some statistical features of non-uniformly expanding maps. Discrete Contin Dyn Syst 15:1–20
Alves JF, Araujo V (2003) Random perturbations of non-uniformly expanding maps. Geometric methods in dynamics. I. Astérisque 286:25–62
Alves JF, Viana M (2002) Statistical stability for robust classes of maps with non-uniform expansion. Ergodic Theory Dynam Syst 22:1–32
Alves JF, Bonatti C, Viana M (2000) SRB measures for partially hyperbolic systems whose central direction is mostly expanding. Invent Math 140:351–398
Anantharaman N (2004) On the zero-temperature or vanishing viscosity limit for certain Markov processes arising from Lagrangian dynamics. J Eur Math Soc (JEMS) 6:207–276
Anantharaman N, Nonnenmacher S (2007) Half delocalization of the eigenfunctions for the Laplacian on an Anosov manifold. Ann Inst Fourier 57:2465–2523
Araujo V (2001) Infinitely many stochastically stable attractors. Nonlinearity 14:583–596
Araujo V, Pacifico MJ (2006) Large deviations for non-uniformly expanding maps. J Stat Phys 125:415–457
Araujo V, Tahzibi A (2005) Stochastic stability at the boundary of expanding maps. Nonlinearity 18:939–958
Arnaud MC (1998) Le “closing lemma” en topologie $C^1$. Mem Soc Math Fr (NS) 74
Arnol’d VI (1988) Geometrical methods in the theory of ordinary differential equations. Grundlehren der Mathematischen Wissenschaften, vol 250, 2nd edn. Springer, New York
Arnol’d VI, Avez A (1968) Ergodic problems of classical mechanics. W.A. Benjamin, New York
Arnold L (1998) Random dynamical systems. Springer monographs in mathematics. Springer, Berlin
Avila A, Moreira CG (2005) Statistical properties of unimodal maps: the quadratic family. Ann Math 161(2):831–881
Avila A, Lyubich M, de Melo W (2003) Regular or stochastic dynamics in real analytic families of unimodal maps. Invent Math 154:451–550
Baladi V (2000a) Positive transfer operators and decay of correlations. In: Advanced series in nonlinear dynamics, vol 16. World Scientific Publishing, River Edge
Baladi V, Gouëzel S (2008) Good Banach spaces for piecewise hyperbolic maps via interpolation. preprint arXiv:0711.1960. Available from http://www.arxiv.org
Baladi V, Ruelle D (1996) Sharp determinants. Invent Math 123:553–574
Baladi V, Tsujii M (2007) Anisotropic Hölder and Sobolev spaces for hyperbolic diffeomorphisms. Ann Inst Fourier (Grenoble) 57:127–154
Baladi V, Young LS (1993) On the spectra of randomly perturbed expanding maps. Commun Math Phys 156:355–385. Erratum: Comm Math Phys 166(1):219–220, 1994
Baladi V, Kondah A, Schmitt B (1996) Random correlations for small perturbations of expanding maps, English summary. Random Comput Dynam 4:179–204
Bálint P, Gouëzel S (2006) Limit theorems in the stadium billiard. Commun Math Phys 263:461–512
Baraviera AT, Bonatti C (2003) Removing zero Lyapunov exponents, English summary. Ergodic Theory Dynam Syst 23:1655–1670
Barreira L, Pesin Y, Schmeling J (1999) Dimension and product structure of hyperbolic measures. Ann Math 149(2):755–783
Benedicks M, Carleson L (1985) On iterations of $1 - ax^2$ on $(-1, 1)$. Ann Math (2) 122(1):1–25
Benedicks M, Carleson L (1991) The dynamics of the Hénon map. Ann Math 133(2):73–169
Benedicks M, Viana M (2001) Solution of the basin problem for Hénon-like attractors. Invent Math 143:375–434
Benedicks M, Viana M (2006) Random perturbations and statistical properties of Hénon-like maps. Ann Inst H Poincaré Anal Non Linéaire 23:713–752
Benedicks M, Young LS (1993) Sinaĭ–Bowen–Ruelle measures for certain Hénon maps. Invent Math 112:541–576
Benedicks M, Young LS (2000) Markov extensions and decay of correlations for certain Hénon maps. Géométrie complexe et systèmes dynamiques, Orsay, 1995. Astérisque 261:13–56
Bergelson V (2006) Ergodic Ramsey theory: a dynamical approach to static theorems. In: International congress of mathematicians, vol II. European Mathematical Society, Zürich, pp 1655–1678
Berkes I, Csáki E (2001) A universal result in almost sure central limit theory. Stochastic Process Appl 94(1):105–134
Bishop E (1967/1968) A constructive ergodic theorem. J Math Mech 17:631–639
Blanchard F, Glasner E, Kolyada S, Maass A (2002) On Li-Yorke pairs, English summary. J Reine Angew Math 547:51–68
Blank M (1989) Small perturbations of chaotic dynamical systems. (Russian). Uspekhi Mat Nauk 44:3–28, 203. Translation in: Russ Math Surv 44:1–33
Blank M (1997) Discreteness and continuity in problems of chaotic dynamics. In: Translations of mathematical monographs, vol 161. American Mathematical Society, Providence
Blank M, Keller G (1997) Stochastic stability versus localization in one-dimensional chaotic dynamical systems. Nonlinearity 10:81–107
Blank M, Keller G, Liverani C (2002) Ruelle–Perron–Frobenius spectrum for Anosov maps. Nonlinearity 15:1905–1973
Bochi J (2002) Genericity of zero Lyapunov exponents. Ergodic Theory Dynam Syst 22:1667–1696
Bochi J, Viana M (2005) The Lyapunov exponents of generic volume-preserving and symplectic maps. Ann Math 161(2):1423–1485
Bolsinov AV, Taimanov IA (2000) Integrable geodesic flows with positive topological entropy. Invent Math 140:639–650
Bonano C, Collet P (2006) Complexity for extended dynamical systems. arXiv:math/0609681
Bonatti C, Crovisier S (2004) Récurrence et généricité. Invent Math 158(1):33–104
Barreira L, Schmeling J (2000) Sets of “non-typical” points Bonatti C, Viana M (2000) SRB measures for partially
have full topological entropy and full Hausdorff dimen- hyperbolic systems whose central direction is mostly
sion. Israel J Math 116:29–70 contracting. Israel J Math 115:157–193
Chaos and Ergodic Theory 659
Bonatti C, Viana M (2004) Lyapunov exponents with Carleson L, Gamelin TW (1993) Complex dynamics. In:
multiplicity 1 for deterministic products of matrices. Universitext: tracts in mathematics. Springer,
Ergodic Theory Dynam Syst 24(5):1295–1330 New York
Bonatti C, Diaz L, Viana M (2005) Dynamics beyond Chazottes JR, Gouëzel S (2007) On almost-sure versions
uniform hyperbolicity. A global geometric and proba- of classical limit theorems for dynamical systems. Pro-
bilistic perspective. In: Mathematical physics, III. bab Theory Relat Fields 138:195–234
Encyclopedia of mathematical sciences, vol 102. Chernov N (1999) Statistical properties of piecewise
Springer, Berlin smooth hyperbolic systems in high dimensions. Dis-
Bowen R (1975) Equilibrium states and the ergodic theory crete Contin Dyn Syst 5(2):425–448
of Anosov diffeomorphisms. In: Lecture notes in math- Chernov N, Markarian R, Troubetzkoy S (2000) Invariant
ematics, vol 470. Springer, Berlin/New York measures for Anosov maps with small holes. Ergodic
Bowen R (1979) Hausdorff dimension of quasi-circles. Inst Theory Dynam Syst 20:1007–1044
Hautes Études Sci Publ Math 50:11–25 Christensen JRR (1972) On sets of Haar measure zero in
Bowen R, Ruelle D (1975) Ergodic theory of Axiom abelian Polish groups. Israel J Math 13:255–260
A flows. Invent Math 29:181–202 Collet P (1996) Some ergodic properties of maps of the
Boyle M, Downarowicz T (2004) The entropy theory of interval. In: Dynamical systems, Temuco, 1991/1992.
symbolic extensions. Invent Math 156:119–161 Travaux en Cours, 52. Hermann, Paris, pp 55–91
Boyle M, Fiebig D, Fiebig U (2002) Residual entropy, Collet P, Eckmann JP (2006) Concepts and results in cha-
conditional entropy and subshift covers. Forum Math otic dynamics: a short course. In: Theoretical and math-
14:713–757 ematical physics. Springer, Berlin
Boyle M, Buzzi J, Gomez R (2006) Ricardo almost iso- Collet P, Galves A (1995) Asymptotic distribution of
morphism for countable state Markov shifts. J Reine entrance times for expanding maps of the interval. In:
Angew Math 592:23–47 Dynamical systems and applications. World Sci Ser
Brain M, Berger A (2001) Chaos and chance. In: An Appl Anal, vol 4. World Scientific Publishing, River
introduction to stochastic aspects of dynamics. Walter Edge, pp 139–152
de Gruyter, Berlin Collet P, Courbage M, Mertens S, Neishtadt A, Zaslavsky
Brin M (2001) Appendix A. In: Barreira L, Pesin Y (eds) G (eds) (2005) Chaotic dynamics and transport in clas-
Lectures on Lyapunov exponents and smooth ergodic sical and quantum systems. In: Proceedings of the
theory. Proc Sympos Pure Math, 69, Smooth ergodic International Summer School of the NATO Advanced
theory and its applications, Seattle, WA, 1999. Ameri- Study Institute held in Cargese, August 18–30, 2003.
can Mathematical Society, Providence, pp 3–106 Edited by NATO science series II: mathematics, phys-
Brin M, Stuck G (2002) Introduction to dynamical sys- ics and chemistry, 182. Kluwer, Dordrecht
tems. Cambridge University Press, Cambridge Contreras G, Lopes AO, Thieullen P (2001) Lyapunov
Broise A (1996a) Transformations dilatantes de l’intervalle minimizing measures for expanding maps of the circle,
et theoremes limites. Etudes spectrales d’operateurs de English summary. Ergodic Theory Dynam Syst 21:
transfert et applications. Asterisque 238:1–109 1379–1409
Broise A (1996b) Transformations dilatantes de l’intervalle Cowieson WJ (2002) Absolutely continuous invariant
et théorèmes limites. Astérisque 238:1–109 measures for most piecewise smooth expanding maps.
Bruin H, Keller G, Nowicki T, van Strien S (1996) Wild Ergodic Theory Dynam Syst 22:1061–1078
Cantor attractors exist. Ann Math 143(2):97–130 Cowieson WJ, Young LS (2005) SRB measures as zero-
Buzzi J (1997) Intrinsic ergodicity of smooth interval noise limits. Ergodic Theory Dynam Syst 25:1115–1138
maps. Israel J Math 100:125–161 Crovisier S (2006a) Birth of homoclinic intersections: a
Buzzi J (2000a) Absolutely continuous invariant probabil- model for the central dynamics of partially hyperbolic
ity measures for arbitrary expanding piecewise systems. http://www.arxiv.org/abs/math/0605387
R-analytic mappings of the plane. Ergodic Theory Crovisier S (2006b) Periodic orbits and chain-transitive
Dynam Syst 20:697–708 sets of C1-diffeomorphisms. Publ Math Inst Hautes
Buzzi J (2000b) On entropy-expanding maps. Unpublished Etudes Sci 104:87–141
Buzzi J (2001) No or infinitely many a.c.i.p. for piecewise Crovisier S (2006c) Perturbation of C1-diffeomorphisms
expanding Cr maps in higher dimensions. Commun and generic conservative dynamics on surfaces.
Math Phys 222:495–501 Dynamique des diffeomorphismes conservatifs des sur-
Buzzi J (2005) Subshifts of quasi-finite type. Invent Math faces: un point de vue topologique, 21, Panor synthe-
159:369–406 ses. Soc Math France, Paris, pp 1–33
Buzzi J (2006) Puzzles of quasi-finite type, zeta functions Cvitanović P, Gunaratne GH, Procaccia I (1988) Topolog-
and symbolic dynamics for multi-dimensional maps. ical and metric properties of Henon-type strange
arXiv:math/0610911 attractors. Phys Rev A 38(3):1503–1520
Buzzi J. Hyperbolicity through Entropies. Saint-Flour de Carvalho A, Hall T (2002) How to prune a horseshoe.
Summer probability school lecture notes (to appear) Nonlinearity 15:R19–R68
660 Chaos and Ergodic Theory
de Castro A (2002) Backward inducing and exponential Gatzouras D, Peres Y (1997) Invariant measures of full
decay of correlations for partially hyperbolic attractors. dimension for some expanding maps. Ergodic Theory
Israel J Math 130:29–75 Dynam Syst 17:147–167
de la Llave R (2001) A tutorial on KAM theory. In: Smooth Glasner E, Weiss B (1993) Sensitive dependence on initial
ergodic theory and its applications, Seattle, WA, 1999. conditions. Nonlinearity 6:1067–1075
Proc Sympos Pure Math, vol 69. American Mathemat- Gordin MI (1969) The central limit theorem for stationary
ical Society, Providence, pp 175–292 processes. (Russian). Dokl Akad Nauk SSSR 188:
de Melo W, van Strien S (1993) One-dimensional dynam- 739–741
ics. In: Ergebnisse der Mathematik und ihrer Gouëzel S (2004) Central limit theorem and stable laws for
Grenzgebiete (3), 25. Springer, Berlin intermittent maps. Probab Theory Relat Fields 128:
Denker M, Philipp W (1984) Approximation by Brownian 82–122
motion for Gibbs measures and flows under a function. Gouëzel S (2005) Berry–Esseen theorem and local limit
Ergodic Theory Dynam Syst 4:541–552 theorem for non uniformly expanding maps. Ann Inst
Detmers MF, Liverani C (2007) Stability of statistical H Poincaré 41:997–1024
properties in two-dimensional piecewise hyperbolic Gouëzel S, Liverani C (2006) Banach spaces adapted to
maps. Trans Am Math Soc 360:4777–4814 Anosov systems. Ergodic Theory Dynam Syst 26:
Devaney RL (1989) An introduction to chaotic dynamical 189–217
systems, 2nd edn. Addison–Wesley, Redwood Graczyk J, Świątek G (1997) Generic hyperbolicity in the
Diaz L, Rocha J, Viana M (1996) Strange attractors in logistic family. Ann Math 146(2):1–52
saddle-node cycles: prevalence and globality. Invent Gromov M (1987) Entropy, homology and semialgebraic
Math 125:37–74 geometry. Séminaire Bourbaki, vol 1985/86.
Dolgopyat D (2000) On dynamics of mostly contracting Astérisque No 145–146:225–240
diffeomorphisms. Commun Math Phys 213:181–201 Guckenheimer J (1979) Sensitive dependence on initial
Dolgopyat D (2002) On mixing properties of compact conditions for one-dimensional maps. Commun Math
group extensions of hyperbolic systems. Israel J Math Phys 70:133–160
130:157–205 Guckenheimer J, Holmes P (1990) Nonlinear oscillations,
Dolgopyat D (2004a) Limit theorems for partially hyper- dynamical systems, and bifurcations of vector fields.
bolic systems. Trans Am Math Soc 356:1637–1689 Revised and corrected reprint of the 1983 original. In:
Dolgopyat D (2004b) On differentiability of SRB states for Applied mathematical sciences, vol 42. Springer,
partially hyperbolic systems. Invent Math 155: New York
389–449 Guckenheimer J, Wechselberger M, Young LS (2006) Cha-
Downarowicz T (2005) Entropy structure. J Anal Math 96: otic attractors of relaxation oscillators. Nonlinearity 19:
57–116 701–720
Downarowicz T, Newhouse S (2005) Symbolic extensions Guivarc’h Y, Hardy J (1998) Théorèmes limites pour une
and smooth dynamical systems. Invent Math 160: classe de chaînes de Markov et applications aux
453–499 difféomorphismes d’Anosov. Ann Inst H Poincaré 24:
Downarowicz T, Serafin J (2003) Possible entropy func- 73–98
tions. Israel J Math 135:221–250 Gurevic BM (1996) Stably recurrent nonnegative matrices.
Duhem P (1906) La théorie physique, son objet et sa (Russian) Uspekhi Mat Nauk 51(3)(309):195–196;
structure. Vrin, Paris, 1981 translation in: Russ Math Surv 51(3):551–552
Eckmann JP, Ruelle D (1985) Ergodic theory of chaos and Gurevic BM, Savchenko SV (1998) Thermodynamic for-
strange attractors. Rev Mod Phys 57:617–656 malism for symbolic Markov chains with a countable
Eskin A, McMullen C (1993) Mixing, counting, and number of states. (Russian). Uspekhi Mat Nauk
equidistribution in Lie groups. Duke Math J 71: 53(2)(320):3–106; translation in: Russ Math Surv 53(2):
181–209 245–344
Fiedler B (ed) (2001) Ergodic theory, analysis, and efficient Gutzwiller M (1990) Chaos in classical and quantum
simulation of dynamical systems. Springer, Berlin mechanics. In: Interdisciplinary applied mathematics,
Fiedler B (ed) (2002) Handbook of dynamical systems, vol 1. Springer, New York
vol 2. North-Holland, Amsterdam Hadamard J (1898) Les surfaces à courbures opposees et
Fisher A, Lopes AO (2001) Exact bounds for the polyno- leurs lignes geodesiques. J Math Pures Appl 4:27–73
mial decay of correlation, 1/f noise and the CLT for the Hasselblatt B, Katok A (eds) (2002, 2006) Handbook of
equilibrium state of a non-Hölder potential. Non- dynamical systems, vol 1A and 1B. Elsevier,
linearity 14:1071–1104 Amsterdam
Furstenberg H (1981) Recurrence in ergodic theory and Hasselblatt B, Katok A (2003) A first course in dynamics.
combinatorial number theory. In: MB Porter lectures. In: With a panorama of recent developments. Cam-
Princeton University Press, Princeton bridge University Press, New York
Gallavotti G (1999) Statistical mechanics. In: A short Hennion H (1993) Sur un théorème spectral et son appli-
treatise. Texts and monographs in physics. Springer, cation aux noyaux lipchitziens. Proc Am Math Soc 118:
Berlin 627–634
Chaos and Ergodic Theory 661
Hénon M (1976) A two-dimensional mapping with a Keller G, Liverani C (1999) Stability of the spectrum for
strange attractor. Commun Math Phys 50:69–77 transfer operators. Ann Scuola Norm Sup Pisa Cl Sci
Hesiod (1987) Theogony. Focus/R. Pullins, Newburyport 28(4):141–152
Hochman M (2006) Upcrossing inequalities for stationary Keller G, Nowicki T (1992) Spectral theory, zeta functions
sequences and applications. arXiv:math/0608311 and the distribution of periodic points for Collet-
Hofbauer F (1979) On intrinsic ergodicity of piecewise Eckmann maps. Commun Math Phys 149:31–69
monotonic transformations with positive entropy. Israel Kifer Y (1977) Small random perturbations of hyperbolic
J Math 34(3):213–237 limit sets. (Russian). Uspekhi Mat Nauk 32(1):193–194
Hofbauer F (1985) Periodic points for piecewise mono- Kifer Y (1986) Ergodic theory of random transformations.
tonic transformations. Ergodic Theory Dynam Syst 5: In: Progress in probability and statistics, vol 10.
237–256 Birkhäuser Boston, Inc, Boston
Hofbauer F, Keller G (1982) Ergodic properties of invari- Kifer Y (1988) Random perturbations of dynamical sys-
ant measures for piecewise monotonic transformations. tems. In: Progress in probability and statistics, vol 16.
Math Z 180:119–140 Birkhäuser Boston, Inc, Boston
Hofbauer F, Keller G (1990) Quadratic maps without Kifer Y (1990) Large deviations in dynamical systems and
asymptotic measure. Commun Math Phys 127: stochastic processes. Trans Am Math Soc 321:505–524
319–337 Kifer Y (1997) Computations in dynamical systems via
Hofer H, Zehnder E (1994) Symplectic invariants and random perturbations. (English summary). Discrete
Hamiltonian dynamics. Birkhäuser, Basel Contin Dyn Syst 3:457–476
Host B, Kra B (2005) Nonconventional ergodic averages Kolyada SF (2004) LI-Yorke sensitivity and other concepts
and nilmanifolds. Ann Math 161(2):397–488 of chaos. Ukr Math J 56:1242–1257
Hua Y, Saghin R, Xia Z (2006) Topological entropy and Kozlovski OS (2003) Axiom A maps are dense in the space
partially hyperbolic diffeomorphisms. arXiv:math/ of unimodal maps in the Ck topology. Ann Math
0608720 157(2):1–43
Hunt FY (1998) Unique ergodicity and the approximation Kozlovski O, Shen W, van Strien S (2007) Density of
of attractors and their invariant measures using Ulam’s Axiom A in dimension one. Ann Math 166:145–182
method. (English summary). Nonlinearity 11:307–317 Krengel U (1983) Ergodic theorems. De Gruyter, Berlin
Hunt BR, Sauer T, Yorke JA (1992) Prevalence: a Krzyżewski K, Szlenk W (1969) On invariant measures for
translation-invariant “almost every” on infinite- expanding differentiable mappings. Stud Math 33:
dimensional spaces. Bull Am Math Soc (NS) 27: 83–92
217–238 Lacroix Y (2002) Possible limit laws for entrance times of
Ibragimov I, Linnik Y, Kingman JFC (ed) (trans) an ergodic aperiodic dynamical system. Israel J Math
(1971) Independent and stationary sequences of ran- 132:253–263
dom variables. Wolters-Noordhoff, Groningen Ladler RL, Marcus B (1979) Topological entropy and
Ionescu Tulcea CT, Marinescu G (1950) Théorie ergodique equivalence of dynamical systems. Memoirs Am
pour des classes d’opérations non complètement con- Math Soc 20(219)
tinues. (French). Ann Math 52(2):140–147 Lastoa A, Yorke J (1973) On the existence of invariant
Ishii Y (1997) Towards a kneading theory for Lozi map- measures for piecewise monotonic transformations.
pings. I. A solution of the pruning front conjecture and Trans Am Math Soc 186:481–488
the first tangency problem. Nonlinearity 10:731–747 Ledrappier F (1984) Proprietes ergodiques des mesures de
Jakobson MV (1981) Absolutely continuous invariant Sinaï. Inst Hautes Etudes Sci Publ Math 59:163–188
measures for one-parameter families of one- Ledrappier F, Young LS (1985) The metric entropy of
dimensional maps. Commun Math Phys 81:39–88 diffeomorphisms. I Characterization of measures satis-
Jenkinson O (2007) Optimization and majorization of fying Pesin’s entropy formula. II Relations between
invariant measures. Electron Res Announc Am Math entropy, exponents and dimension. Ann Math 122(2):
Soc 13:1–12 509–539, 540–574
Katok A (1980) Lyapunov exponents, entropy and periodic Leplaideur R (2004) Existence of SRB-measures for some
orbits for diffeomorphisms. Inst Hautes Etudes Sci Publ topologically hyperbolic diffeomorphisms. Ergodic
Math 51:137–173 Theory Dynam Syst 24:1199–1225
Katok A, Hasselblatt B (1995) Introduction to the modern Li M, Vitanyi P (1997) An introduction to Kolmogorov
theory of dynamical systems. In: Encyclopedia of complexity and its applications. In: Graduate texts in
mathematics and its applications, 54. With a supple- computer science, 2nd edn. Springer, New York
mentary chapter by: Katok A, Mendoza L. Cambridge Li TY, Yorke JA (1975) Period three implies chaos. Am
University Press, Cambridge Math Monthly 82:985–992
Keller G (1984) On the rate of convergence to equilibrium Lind D, Marcus B (1995) An introduction to symbolic
in one-dimensional systems. Commun Math Phys 96: dynamics and coding. Cambridge University Press,
181–193 Cambridge
Keller G (1990) Exponents, attractors and Hopf decompo- Liu PD, Qian M, Zhao Y (2003) Large deviations in Axiom
sitions for interval maps. Ergodic Theory Dynam Syst A endomorphisms. Proc R Soc Edinb Sect A 133:
10:717–744 1379–1388
662 Chaos and Ergodic Theory
Liverani C (1995a) Decay of correlations. Ann Math 142: Nagaev SV (1957) Some limit theorems for stationary
239–301 Markov chains. Theor Probab Appl 2:378–406
Liverani C (1995b) Decay of correlations in Piecewise Newhouse SE (1974) Diffeomorphisms with infinitely
expanding maps. J Stat Phys 78:1111–1129 many sinks. Topology 13:9–18
Liverani C (1996) Central limit theorem for deterministic Newhouse SE (1989) Continuity properties of entropy.
systems. In: Ledrappier F, Lewowicz J, Newhouse Ann Math 129(2):215–235; Erratum: Ann of Math
S (eds) International conference on dynamical systems, 131(2):409–410
Montevideo, 1995. Pitman research notes. Math 362: Nussbaum RD (1970) The radius of the essential spectrum.
56–75 Duke Math J 37:473–478
Liverani C (2001) Rigorous numerical investigation of the Ornstein D (1970) Bernoulli shifts with the same entropy
statistical properties of piecewise expanding maps – a are isomorphic. Adv Math 4:337–352
feasibility study. Nonlinearity 14:463–490 Ornstein D, Weiss B (1988) On the Bernoulli nature of
Liverani C, Tsujii M (2006) Zeta functions and dynamical systems with some hyperbolic structure. Ergodic The-
systems. (English summary). Nonlinearity 19: ory Dynam Syst 18:441–456
2467–2473 Ovid (2005) Metamorphosis. W.W. Norton, New York
Lyubich M (1997) Dynamics of quadratic polynomials. I, Palis J (2000) A global view of dynamics and a conjecture
II. Acta Math 178:185–247, 247–297 on the denseness of tinitude of attractors. Asterisque
Mañé R (1982) An ergodic closing lemma. Ann Math 261:335–347
116(2):503–540 Palis J, Yoccoz JC (2001) Fers a cheval non-uniformement
Mañé R (1985) Hyperbolicity, sinks and measures in one- hyperboliques engendres par une bifurcation homocline
dimensional dynamics. Commun Math Phys 100: et densite nulle des attracteurs. CRAS 333:867–871
495–524 Pesin YB (1976) Families of invariant manifolds
Mañé R (1988) A proof of the C1 stability conjecture. Publ corresponding to non-zero characteristic exponents.
Math IHES 66:161–210 Math USSR Izv 10:1261–1302
Margulis G (2004) On some aspects of the theory of Pesin YB (1977) Characteristic exponents and smooth
Anosov systems. In: With a survey by Richard Sharp: ergodic theory. Russ Math Surv 324:55–114
periodic orbits of hyperbolic flows. Springer mono- Pesin Ya (1997) Dimension theory in dynamical systems.
graphs in mathematics. Springer, Berlin In: Contemporary views and applications. Chicago lec-
Melbourne I, Nicol M (2005) Almost sure invariance prin- tures in mathematics. University of Chicago Press,
ciple for nonuniformly hyperbolic systems. Commun Chicago
Math Phys 260(1):131–146 Pesin Ya, Sinaĭ Ya (1982) Gibbs measures for partially
Milnor J, Thurston W (1988) On iterated maps of the hyperbolic attractors. Ergodic Theory Dynam Syst 2:
interval. In: Dynamical systems, College Park, MD, 417–438
1986–87. Lecture Notes in Math, vol 1342. Springer, Piorek J (1985) On the generic chaos in dynamical sys-
Berlin, pp 465–563 tems. Univ Iagel Acta Math 25:293–298
Misiurewicz M (1976a) A short proof of the variational Plykin RV (2002) On the problem of the topological clas-
principle for a Zn+ action on a compact space. Bull sification of strange attractors of dynamical systems.
Acad Polon Sci Sér Sci Math Astronom Phys 24(12): Uspekhi Mat Nauk 57:123–166. Translation in: Russ
1069–1075 Math Surv 57:1163–1205
Misiurewicz M (1976b) Topological conditional entropy. Poincare H (1892) Les methodes nouvelles de la
Stud Math 55:175–200 mecanique céleste. Gauthier–Villars, Paris
Misiurewicz M (1979) Horseshoes for mappings of the Pollicott M, Sharp R (2002) Invariance principles for inter-
interval. Bull Acad Polon Sci Sér Sci Math 27:167–169 val maps with an indifferent fixed point. Commun Math
Misiurewicz M (1995) Continuity of entropy revisited. In: Phys 229:337–346
Dynamical systems and applications. World Sci Ser Przytycki F (1977) On U-stability and structural stability of
Appl Anal, vol 4. World Scientific Publishing, River endomorphisms satisfying. Axiom A Studia Math 60:
Edge, pp 495–503 61–77
Misiurewicz M, Smítal J (1988) Smooth chaotic maps with Pugh C, Shub M (1999) Ergodic attractors. Trans Am Math
zero topological entropy. Ergodic Theory Dynam Syst Soc 312:1–54
8:421–424 Pujals E, Rodriguez-Hertz F (2007) Critical points for
Mora L, Viana M (1993) Abundance of strange attractors. surface diffeomorphisms. J Mod Dyn 1:615–648
Acta Math 171:1–71 Puu T (2000) Attractors, bifurcations, and chaos. In: Non-
Moreira CG, Palis J, Viana M (2001) Homoclinic tangen- linear phenomena in economics. Springer, Berlin
cies and fractal invariants in arbitrary dimension. Rees M (1981) A minimal positive entropy homeomor-
CRAS 333:475–480 phism of the 2-torus. J Lond Math Soc 23(2):537–550
Murray JD (2002, 2003) Mathematical biology. I and II An Robinson RC (2004) An introduction to dynamical sys-
introduction, 3rd edn. In: Interdisciplinary applied tems: continuous and discrete. Pearson Prentice Hall,
mathematics, 17 and 18. Springer, New York Upper Saddle River
Chaos and Ergodic Theory 663
Rousseau-Egele J (1983) Un théorème de la limite locale Szász D (ed) (2000) Hard ball systems and the Lorentz gas.
pour une classe de transformations dilatantes et mono- In: Encyclopedia of mathematical sciences, 101. Math-
tones par morceaux. Ann Probab 11:772–788 ematical physics, II. Springer, Berlin
Ruelle D (1968) Statistical mechanics of a one-dimensional Tsujii M (1992) A measure on the space of smooth map-
lattice gas. Commun Math Phys 9:267–278 pings and dynamical system theory. J Math Soc Jpn 44:
Ruelle D (1976) A measure associated with axiom-A 415–425
attractors. Am J Math 98:619–654 Tsujii M (2000a) Absolutely continuous invariant mea-
Ruelle D (1978) An inequality for the entropy of differen- sures for piecewise real-analytic expanding maps on
tiable maps. Bol Soc Brasil Mat 9:83–87 the plane. Commun Math Phys 208:605–622
Ruelle D (1982) Repellers for real analytic maps. Ergodic Tsujii M (2000b) Piecewise expanding maps on the plane
Theory Dynam Syst 2:99–107 with singular ergodic properties. Ergodic Theory
Ruelle D (1989) The thermodynamic formalism for Dynam Syst 20:1851–1857
expanding maps. Commun Math Phys 125:239–262 Tsujii M (2001) Absolutely continuous invariant measures
Ruelle D (2004) Thermodynamic formalism. In: The math- for expanding piecewise linear maps. Invent Math 143:
ematical structures of equilibrium statistical mechanics, 349–373
2nd edn. Cambridge Mathematical Library, Cambridge Tsujii M (2005) Physical measures for partially hyperbolic
University Press, Cambridge surface endomorphisms. Acta Math 194:37–132
Ruelle D (2005) Differentiating the absolutely continuous Tucker W (1999) The Lorenz attractor exists. C R Acad Sci
invariant measure of an interval map f with respect to f. Paris Ser I Math 328:1197–1202
Commun Math Phys 258:445–453 Vallée B (2006) Euclidean dynamics. Discrete Contin Dyn
Ruette S (2003a) On the Vere-Jones classification and Syst 15:281–352
existence of maximal measures for countable topolog- van Strien S, Vargas E (2004) Real bounds, ergodicity and
ical Markov chains. Pacific J Math 209:366–380 negative Schwarzian for multimodal maps. J Am Math
Ruette S (2003b) Chaos on the interval. http://www.math. Soc 17:749–782
u-psud.fr/%7Eruette Vasquez CH (2007) Statistical stability for
Saari DG (2005) Collisions, rings, and other Newtonian diffeomorphisms with dominated splitting. Ergodic
N-body problems. In: CBMS regional conference Theory Dynam Syst 27:253–283
series in mathematics, 104. Published for the Confer- Viana M (1993) Strange attractors in higher dimensions.
ence Board of the Mathematical Sciences, Washington, Bol Soc Bras Mat (NS) 24:13–62
DC. American Mathematical Society, Providence Viana M (1997a) Multidimensional nonhyperbolic attractors.
Saperstone SH, Yorke JA (1971) Controllability of linear Inst Hautes Etudes Sci Publ Math No 85:63–96
oscillatory systems using positive controls. SIAM Viana M (1997b) Stochastic dynamics of deterministic
J Control 9:253–262 systems. In: Lecture notes 21st Braz Math Colloq
Sarig O (1999) Thermodynamic formalism for countable IMPA. Rio de Janeiro
Markov shifts. Ergodic Theory Dynam Syst 19: Viana M (1998) Dynamics: a probabilistic and geometric
1565–1593 perspective. In: Proceedings of the International Con-
Sarig O (2001) Phase transitions for countable topological gress of Mathematicians, vol I, Berlin, 1998. Doc Math
Markov shifts. Commun Math Phys 217:555–577 Extra I:557–578
Sataev EA (1992) Invariant measures for hyperbolic map- Wang Q, Young LS (2003) Strange attractors in
pings with singularities. Uspekhi Mat Nauk 47: periodically-kicked limit cycles and Hopf bifurcations.
147–202. Translation in: Russ Math Surv 47:191–251 Commun Math Phys 240:509–529
Saussol B (2000) Absolutely continuous invariant mea- Weiss B (2002) Single orbit dynamics. American Mathe-
sures for multidimensional expanding maps. Israel matical Society, Providence
J Math 116:223–248 Ya S (1972) Gibbs measures in ergodic theory. Uspekhi
Saussol B (2006) Recurrence rate in rapidly mixing Mat Nauk 27(166):21–64
dynamical systems. Discrete Contin Dyn Syst 15(1): Yomdin Y (1987) Volume growth and entropy. Israel
259–267 J Math 57:285–300
Shub M (1987) Global stability of dynamical systems. Yoshihara KI (2004) Weakly dependent stochastic
Springer, New York sequences and their applications. In: Recent topics on
Simon R (1972) A 3-dimensional Abraham-Smale exam- weak and strong limit theorems, vol XIV. Sanseido Co
ple. Proc Am Math Soc 34:629–630 Ltd, Chiyoda
Starkov AN (2000) Dynamical systems on homogeneous Young LS (1982) Dimension, entropy and Lyapunov expo-
spaces. In: Translations of mathematical mono- nents. Ergodic Theory Dynam Syst 6:311–319
graphs, vol 190. American Mathematical Society, Young LS (1985) Bowen–Ruelle measures for certain
Providence piecewise hyperbolic maps. Trans Am Math Soc 287:
Stewart P (1964) Jacobellis v Ohio. US Rep 378:184 41–48
664 Chaos and Ergodic Theory
describe using standard geometric tools. For some important classes of dynamical
systems, these complicated structures are intensively studied using notions of
dimension. In many cases it becomes possible to relate these notions of dimension
to other fundamental dynamical characteristics, such as Lyapunov exponents,
entropies, pressure, etc.

On the other hand, tools from dynamical systems, especially from ergodic theory
and thermodynamic formalism, are extremely useful for exploring the fractal
properties of the objects in question. This includes dimensions of limit sets of
geometric constructions (the standard Cantor set being the most famous example),
which, a priori, are not related to dynamical systems (Furstenberg 1967; Pesin and
Weiss 1996). Many dimension formulas for asymptotic sets of dynamical systems are
obtained by means of Bowen-type formulas, i.e., as roots of some functionals
arising from thermodynamic formalism.

The dimension of a set is a subtle characteristic which measures the geometric
complexity of the set at arbitrarily fine scales. There are many notions of
dimension, and most definitions involve a measurement of geometric complexity at
scale ε (which ignores the irregularities of the set at size less than ε) and then
consider the limiting measurement as ε → 0. A priori (and in general) these
notions can differ. An important result is the affirmative solution of the
Eckmann–Ruelle conjecture by Barreira, Pesin and Schmeling (1999), which says that
for smooth nonuniformly hyperbolic systems, the pointwise dimension is almost
everywhere constant with respect to a hyperbolic measure. This result implies that
many dimension characteristics for the measure coincide.

The deep connection between dynamical systems and dimension theory seems to have
been first discovered by Billingsley (1978) through several problems in number
theory.

Another link between dynamical systems and dimension theory is through Pesin's
theory of dimension-like characteristics. This general theory is a unification of
many notions of dimension along with many fundamental quantities in dynamical
systems such as entropies, pressure, etc.

However, there are numerous examples of dynamical systems exhibiting pathological
behavior with respect to fractal geometrical characteristics. In particular,
higher-dimensional systems seem to be as complicated as general objects considered
in geometric measure theory. Therefore, a clean and unified theory is still not
available.

The study of characteristic notions like entropy, exponents or dimensions is an
essential issue in the theory of dynamical systems. In many cases it helps to
classify or to understand the dynamics. Most of these characteristics were
introduced for different questions and concepts. For example, entropy was
introduced to distinguish non-isomorphic systems and turned out to be a complete
invariant for Bernoulli systems (Ornstein). Later, the thermodynamic formalism
(see Ruelle 1978) introduced new quantities like the pressure. Bowen (1979) and
also Ruelle discovered a remarkable connection between the thermodynamic formalism
and the dimension theory for invariant sets. Since then much effort has gone into
finding the relations between all these different quantities. It turned out that
the dimension of invariant sets or measures carries a lot of information about the
system, combining its combinatorial complexity with its geometric complexity.
Unfortunately, it is extremely difficult to compute the dimension in general. The
general flavor is that local divergence of orbits and global recurrence cause
complicated global behavior (chaos). It is impossible to study the exact
(infinite) trajectory of all orbits. One way out is to study the statistical
properties of "typical" orbits by means of an invariant measure. Although the
underlying system might be smooth, the invariant measures may often be singular.

Preliminaries

Throughout the article the following situation is considered. Let M be a compact
Riemannian manifold without boundary. A dynamical system generated by a C^{1+α}
diffeomorphism T : M → M acts on M. The presence of a dynamical system provides
several important additional tools and methods for the theory of fractal
dimensions. Also the theory of fractal dimensions allows one
to draw deep conclusions about the dynamical system. The importance and relevance of the study of fractal dimension will be explained in later sections.

In the next sections some of the most important tools in the fractal theory of dynamical systems are considered. The definitions given here are not necessarily the original definitions but rather the ones closer to contemporary use. More details can be found in (Pesin 1997b).

Some Ergodic Theory
Ergodic theory is a powerful method to analyze statistical properties of dynamical systems. All the following facts can be found in standard books on ergodic theory like (Petersen 1983; Walters 1982).

The main idea in ergodic theory is to relate global quantities to observations along single orbits. Let us consider an invariant measure: $m(T^{-1}A) = m(A)$ for all measurable sets $A$. Such a measure “selects” typical trajectories. It is important to note that the properties vary with the invariant measure. Any such invariant measure can be decomposed into elementary parts (ergodic components).

An invariant measure is called ergodic if for any invariant set $A = T^{-1}A$ one has $m(A)\,m(M \setminus A) = 0$ (with the agreement $0 \cdot \infty = 0$), i.e. from the measure-theoretic point of view there are no nontrivial invariant subsets.

The importance of ergodic probability measures (i.e. $m(M) = 1$) lies in the following theorem of Birkhoff.

Theorem 1 (Birkhoff) Let $m$ be an ergodic probability measure and $\varphi \in L^1(m)$. Then

$$\lim_{n\to\infty} \frac{1}{n}\sum_{k=0}^{n-1} \varphi(T^k x) = \int_M \varphi \, dm \qquad m\text{-a.e.}$$

Note that the limit defining $m_H(s, Z)$ exists. $m_H(s, Z)$ is called the $s$-dimensional outer Hausdorff measure of $Z$. It is immediate that there exists a unique value $s^*$, called the Hausdorff dimension of $Z$, at which $m_H(s, Z)$ jumps from $\infty$ to $0$.

In general it is very hard to find optimal coverings and hence it is often impossible to compute the Hausdorff dimension of a set. Therefore a simpler notion, the lower and upper box dimension, was introduced. The difference from the Hausdorff dimension is that the covering balls are assumed to have the same radius $\varepsilon$. Since the limit as $\varepsilon \to 0$ then does not have to exist, one arrives at the notions of upper and lower box dimension.

Dimension of a Measure
Definition 1 Let $Z \subset \mathbb{R}^N$ and let $m$ be a probability measure supported on $Z$. Define the Hausdorff dimension of the measure $m$ by

$$\dim_H(m) := \inf_{K \subset Z :\; m(K) = 1} \dim_H(K).$$

Pointwise Dimension
Most invariant sets or measures are not strongly self-similar, i.e. the local geometry at arbitrarily fine scales might look different from point to point. Therefore, the notion of pointwise dimension with respect to a Borel probability measure is defined.

Let $m$ be a Borel probability measure and denote by $B(x, \varepsilon)$ the ball with center $x$ and radius $\varepsilon$. The pointwise dimension of the measure $m$ at the point $x$ is defined as

$$d_m(x) := \lim_{\varepsilon \to 0} \frac{\log m(B(x, \varepsilon))}{\log \varepsilon}.$$
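The pointwise dimension can be illustrated numerically. The following is only a sketch under stated assumptions: it uses the standard middle-thirds Cantor measure (not an example taken from this article), approximates $m(B(x, \varepsilon))$ by the fraction of sampled points in the ball, and estimates the limit by a least-squares slope. All function names are hypothetical. For this measure the slope should be close to $\log 2 / \log 3$.

```python
import bisect
import math
import random


def cantor_samples(n_points, digits=30, seed=0):
    """Sample the middle-thirds Cantor measure: i.i.d. ternary digits in {0, 2}."""
    rng = random.Random(seed)
    pts = []
    for _ in range(n_points):
        x = 0.0
        for k in range(1, digits + 1):
            x += rng.choice((0, 2)) * 3.0 ** (-k)
        pts.append(x)
    pts.sort()
    return pts


def ball_mass(sorted_pts, x, eps):
    """Empirical measure of the open ball B(x, eps)."""
    lo = bisect.bisect_right(sorted_pts, x - eps)
    hi = bisect.bisect_left(sorted_pts, x + eps)
    return (hi - lo) / len(sorted_pts)


def pointwise_dimension(sorted_pts, x, scales):
    """Least-squares slope of log m(B(x, eps)) against log eps."""
    xs = [math.log(e) for e in scales]
    ys = [math.log(ball_mass(sorted_pts, x, e)) for e in scales]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    num = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    den = sum((a - mx) ** 2 for a in xs)
    return num / den


pts = cantor_samples(50_000)
x0 = pts[len(pts) // 2]  # a sample point, so every ball around it has positive mass
scales = [3.0 ** (-k) for k in range(1, 7)]
estimate = pointwise_dimension(pts, x0, scales)
print(estimate, math.log(2) / math.log(3))
```

The scales are chosen as powers of 1/3 so that each ball matches one construction interval of the Cantor set. For a general measure the choice of scales and of the fitting window is a judgment call, and the limit need not exist at all.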
One should not take the existence of a local dimension (even for good measures) for granted. Later on it will be seen that the spectrum of the pointwise dimension (dimension spectrum) is a main object of study in classical multifractal analysis.

The dimension of a set or of a measure quantifies its geometric complexity. However, in the presence of a dynamical system one also wants to measure the dynamical (combinatorial) complexity of the system. This leads to the notion of entropy.

Dimension-Like Characteristics and Topological Entropy
Pesin’s theory of dimension-like characteristics provides a unified treatment of dimensions and important dynamical quantities like entropies and pressure. The topological entropy of a continuous map $T$ with respect to a subset $Z$ in a metric space $(X, \rho)$ (in particular $X = M$, a Riemannian manifold) can be defined as a dimension-like characteristic. For each $n \in \mathbb{N}$ and $\varepsilon > 0$, define the Bowen ball $B_n(x, \varepsilon) = \{y \in X : \rho(T^i(x), T^i(y)) \le \varepsilon \text{ for } 0 \le i \le n\}$. Then let

$$m_h(Z, \alpha, \varepsilon) := \lim_{n\to\infty} \inf \Bigl\{ \sum_i e^{-\alpha n_i} : n_i > n,\ \bigcup_i B_{n_i}(x_i, \varepsilon) \supset Z \Bigr\}.$$

This gives rise to an outer measure that jumps from $\infty$ to $0$ at some value $\alpha^*$. This threshold value $\alpha^*$ is called the topological entropy of $Z$ (at scale $\varepsilon$). However, in many situations this value does not depend on $\varepsilon$. The topological entropy is denoted by $h_{top}(T|Z)$.

If $Z$ is $T$-invariant and compact, this definition of topological entropy coincides with the usual definition of topological entropy (Walters 1982). The entropy $h_m$ of a measure $m$ is defined as $h_m = \inf\{h_{top}(T|Z) : m(Z) = 1\}$. For ergodic measures this definition coincides with the Kolmogorov–Sinai entropy (see (Pesin 1997b)).

One has to note that in the definition of entropy metric (“round”) balls are substituted by Bowen balls and the metric diameter by the “depth” of the Bowen ball. Therefore, the relation between entropy and dimension is determined by the relation of “round” balls to “oval” dynamical Bowen balls. If one understands how metric balls can be used efficiently to cover dynamical balls, one can use entropy, a dynamical notion that is relatively easy to compute, to determine the dimension. However, in higher dimensions this relation is far from trivial. A heuristic argument for comparing “round” balls with dynamical balls is given in section “The Kaplan–Yorke Conjecture”.

The Pressure Functional
A useful tool in the dimension analysis of dynamical systems is the pressure functional. It was originally defined by means of statistical physics (thermodynamic formalism) as the free energy (or pressure) of a potential $\varphi$ (see for example (Ruelle 1978)). However, in this article a dimension-like definition (see (Pesin 1997b)) is more suitable. Again an outer measure using Bowen balls will be used. Let $\varphi : M \to \mathbb{R}$ be a continuous function and

$$m_P(Z, \alpha, \varepsilon, \varphi) = \lim_{n\to\infty} \inf \sum_i \exp\Bigl( -\alpha n_i + \sup_{x \in B_{n_i}(x_i, \varepsilon)} \sum_{k=0}^{n_i} \varphi(T^k x) \Bigr),$$

where, as in the definition of $m_h$, the infimum is taken over covers of $Z$ by Bowen balls $B_{n_i}(x_i, \varepsilon)$ with $n_i > n$.

This defines an outer measure that jumps from $\infty$ to $0$ as $\alpha$ increases. The threshold value $\alpha^*$ is called the topological pressure of the potential $\varphi$, denoted by $P(\varphi)$. In many situations it does not depend on $\varepsilon$.

There is also a third way of defining the pressure in terms of a variational principle (see (Walters 1982)):

$$P(\varphi) = \sup_{m \text{ invariant}} \Bigl( h_m + \int_M \varphi \, dm \Bigr).$$

Brief Tour Through Some Examples

Before describing the fractal theory of dynamical systems in more detail some ideas about its role are presented. The application of dimension theory has many different aspects. At this point some
(but by far not all) important examples are considered that should give the reader some feeling for the importance and wide use of dimension theory in dynamical system theory.

Dimension of Conformal Repellers: Ruelle’s Pressure Formula
Computing or estimating a dimension via a pressure formula is a fundamental technique. Explicit properties of pressure help to analyze subtle characteristics. For example, the smooth dependence of the Hausdorff dimension of basic sets for Axiom-A surface diffeomorphisms on the derivative of the map follows from smoothness of pressure.

Ruelle proved the following pressure formula for the Hausdorff dimension of a conformal repeller. A conformal repeller $J$ is an invariant set, $T(J) = J = \{x \in M : T^n x \in V \ \forall n \in \mathbb{N}\}$ for some neighborhood $V$ of $J$, such that for any $x \in J$ the differential is $D_xT = a(x)\,\mathrm{Iso}_x$, where $a(x)$ is a scalar with $|a(x)| > 1$ and $\mathrm{Iso}_x$ is an isometry of the tangent space $T_xM$.

following example explains some of those difficulties.

Let $1/2 < \lambda < 1$ and consider the maps $F_i : [0, 1] \to [0, 1]$ given by $F_1(x) = \lambda x$ and $F_2(x) = \lambda x + (1 - \lambda)$. Then the images of $F_1$, $F_2$ have an essential overlap and $J = [0, 1]$. If one randomizes this construction by applying both maps each with probability 1/2, a probability measure is induced on $J$. This measure might or might not be absolutely continuous with respect to Lebesgue measure. Erdős already realized that for some special values of $\lambda$ (for example for the inverse of the golden mean) the induced measure is singular. In a breakthrough paper B. Solomyak (1995) proved that for a.e. $\lambda$ the induced measure is absolutely continuous. A main ingredient in the proof is a transversality condition in the parameter space: the images of two arbitrary random samples of the (infinite) applications of the maps $F_i$ have to cross with nonzero speed when the parameter $\lambda$ changes. This is a general mechanism which allows one to handle more general situations.
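The randomized construction with the two maps $F_1$, $F_2$ can be simulated directly. The sketch below is illustrative only: it applies $F_1$ or $F_2$ with probability 1/2 a fixed number of times (the hypothetical parameter `depth` truncates the infinite composition) and returns the end points as samples of the induced measure. A histogram of such samples gives a visual impression of the measure but cannot decide absolute continuity or singularity.

```python
import random


def bernoulli_convolution_samples(lam, n_samples=10_000, depth=60, seed=0):
    """Sample the measure induced by F1(x) = lam*x and F2(x) = lam*x + (1 - lam),
    each applied with probability 1/2."""
    if not 0.5 < lam < 1.0:
        raise ValueError("this sketch assumes 1/2 < lam < 1")
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        x = 0.5  # starting point; its influence decays like lam**depth
        for _ in range(depth):
            if rng.random() < 0.5:
                x = lam * x                # F1
            else:
                x = lam * x + (1.0 - lam)  # F2
        samples.append(x)
    return samples


def histogram(samples, bins):
    """Count samples in equal-width bins on [0, 1]."""
    counts = [0] * bins
    for x in samples:
        counts[min(int(x * bins), bins - 1)] += 1
    return counts


golden_inverse = (5 ** 0.5 - 1) / 2  # the special value mentioned in the text
demo = bernoulli_convolution_samples(golden_inverse, n_samples=5_000)
print(histogram(demo, 10))
```

Running the same code for several values of `lam` and comparing the histograms is a cheap way to see how strongly the overlap structure depends on the parameter.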
More precisely, let $NW_m$ denote the set of nonwandering points of $T_m$ in an open neighborhood of $\Lambda_0$ after the homoclinic bifurcation. Let $\ell$ denote Lebesgue measure.

Theorem 5 (Eggleston 1952) The Hausdorff dimension of $X_p$ is given by

$$\dim_H X_p = \frac{1}{\log 2}\bigl[-p \log p - (1 - p)\log(1 - p)\bigr].$$
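The formula in Theorem 5 is easy to evaluate. The sketch below assumes, as in the classical Besicovitch–Eggleston setting, that $X_p$ is the set of points of $[0, 1]$ whose binary digits contain the digit 1 with asymptotic frequency $p$; that definition is not reproduced in this passage, so it is an assumption here. The function name is hypothetical.

```python
import math


def eggleston_dimension(p):
    """Evaluate (1/log 2) * [-p log p - (1 - p) log(1 - p)], with 0 log 0 = 0."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")

    def xlogx(t):
        return 0.0 if t == 0.0 else t * math.log(t)

    return -(xlogx(p) + xlogx(1.0 - p)) / math.log(2.0)


for p in (0.0, 0.1, 0.25, 0.5):
    print(p, eggleston_dimension(p))
```

The value is 1 at $p = 1/2$, is symmetric under $p \mapsto 1 - p$, and tends to 0 as $p$ approaches 0 or 1.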
properties of their continued fraction expansion (see for example (Aihua et al. 2005; Pollicott and Weiss 1999)).

Infinite Iterated Function Systems and Parabolic Systems
In the previous section a system with infinitely many branches appeared. This can be regarded as an iterated function system with infinitely many maps $F_i$. This situation is quite general. If one considers a (one-dimensional) system with a parabolic (indifferent) fixed point, i.e. a fixed point where the derivative has absolute value equal to 1, one often uses an induced system. For this one chooses, near the parabolic point, a higher iterate of the map in order to achieve uniform expansion away from the parabolic point. This leads to infinitely many branches since the number of iterates has to be increased the closer one is to the parabolic point. The main difference from the finite iterated function system is that the setting is no longer compact and many properties of the pressure functional are lost.

Mauldin and Urbański and others (see for example (Aihua et al. 2005; Mauldin and Urbański 1996, 2000, 2002)) developed a thermodynamic formalism adapted to the pressure functional for infinite iterated function systems. Besides noncompactness, one of the main problems is the loss of analyticity (phase transitions) and convexity of the pressure functional for infinite iterated function systems.

Complex Dynamics
Let $T : \mathbb{C} \to \mathbb{C}$ be a polynomial and $J$ its Julia set (repeller of this system). If this set is hyperbolic, i.e. the derivative at each point has absolute value larger than 1, the study of the dimension can be related to the study of a finite iterated function system. However, in the presence of a parabolic fixed point this leads to an infinite iterated function system.

If one considers the coefficients of the polynomial as parameters one often sees a qualitative change in the asymptotic behavior. For example, the classical Mandelbrot set for polynomials $z^2 + c$ is the locus of values $c \in \mathbb{C}$ for which the orbit of the origin stays bounded. This set is well known to be fractal. However, its complete description is still not available.

Embedology and Computational Aspects of Dimension
Tools from dynamical systems are becoming increasingly important to study the time evolution of deterministic systems in engineering and the physical and biological sciences. One of the main ideas is to model a “real world” system by a smooth dynamical system which possesses a strange attractor with a natural ergodic invariant measure. When studying a complicated real world system, one can measure only a very small number of variables. The challenge is to reconstruct the underlying attractor from the time measurement of a scalar quantity. An idealized measurement is considered as a function $h : M^n \to \mathbb{R}$.

The main tool researchers currently use to reconstruct the model system is called attractor reconstruction (see papers and references in (Ott et al. 1994)). This method is based on embedding with time delays (see the influential paper (Cruchfield et al. 1980), where the authors attribute the idea of delay coordinates to Ruelle), where one attempts to reconstruct the attractor for the model using a single long trajectory. Then one considers the points in $\mathbb{R}^{p+1}$ defined by $(x_k, x_{k+\tau}, x_{k+2\tau}, \ldots, x_{k+p\tau})$.

Takens (1981) showed that for a smooth $T : M^n \to M^n$ and for typical smooth $h$, the mapping $\varphi : M^n \to \mathbb{R}^{2n+1}$ defined by $x \mapsto (h(x), h(T^{\tau}(x)), \ldots, h(T^{2n\tau}(x)))$ is an embedding. Since the box dimension of the attractor $\Lambda$ may be much less than the dimension $n$ of the ambient manifold, an interesting mathematical question is whether there exists $p < 2n + 1$ such that the mapping on the attractor $\varphi : \Lambda \to \mathbb{R}^p$ defined by $x \mapsto (h(x), h(T(x)), \ldots, h(T^{p-1}(x)))$ is one-to-one. It is known that for a typical smooth $h$ the mapping $\varphi$ is one-to-one for $p > 2\dim_B(\Lambda)$ (Casdagli et al. 1991).

Denjoy Systems
This section will give some ideas indicating the principal difficulties that arise in systems with low complexity. Contrary to hyperbolic systems (each vector in the tangent space is either contracted or expanded) finer mechanisms determine the local
behavior of the scaling of balls. While in hyperbolic systems the dynamical scaling of small balls is exponential, in a low-complexity system this scaling is subexponential and hence the linearization error is of the same magnitude. Up to now there is no general dimension theory for low-complexity systems.

A specific example presented here is considered in (Kra and Schmeling 2002). Poincaré showed that to each orientation preserving homeomorphism $T$ of the circle $S^1 = \mathbb{R}/\mathbb{Z}$ is associated a unique real parameter $\alpha \in [0, 1)$, called the rotation number, so that the ordered orbit structure of $T$ is the same as that of the rigid rotation $R_\alpha$, where $R_\alpha(t) = (t + \alpha) \bmod 1$, provided that $\alpha$ is irrational. Half a century later, Denjoy (1932) constructed examples of $C^1$ diffeomorphisms that are not conjugate (via a homeomorphism) to rotations. This was improved later on by Herman (1979). In these examples, the minimal set of $T$ is necessarily a Cantor set $\Omega$.

The arithmetic properties of the rotation number have a strong effect on the properties of $T$. One area that has been well understood is the relation between the differentiability of $T$, the differentiability of the conjugation and the arithmetic properties of the rotation number (see, for example, Herman (1979)). Without stating any precise theorem, the results differ sharply for Diophantine and for Liouville rotation numbers (definition follows). Roughly speaking the conjugating map is always regular for Diophantine rotation numbers while it might not be smooth at all for Liouville rotation numbers.

Definition 2 An irrational number $\alpha$ is of Diophantine class $\nu = \nu(\alpha) \in \mathbb{R}^+$ if

$$\| q\alpha \| < \frac{1}{q^{\mu}}$$

has infinitely many solutions in integers $q$ for $\mu < \nu$ and at most finitely many for $\mu > \nu$, where $\| \cdot \|$ denotes the distance to the nearest integer.

In (Kra and Schmeling 2002) the effect of the rotation number on the dimension of $\Omega$ is studied. There the main result is

Theorem 6 Assume that $0 < \delta < 1$ and that $\alpha \in (0, 1)$ is of Diophantine class $\nu \in (0, \infty)$. Then an orientation preserving $C^{1+\delta}$ diffeomorphism of the circle with rotation number $\alpha$ and minimal set $\Omega^{\delta}_{\alpha}$ satisfies

$$\dim_H \Omega^{\delta}_{\alpha} \ge \frac{\delta}{\nu}.$$

Furthermore, these results are sharp, i.e. the standard Denjoy examples attain the minimum.

Return Times and Dimension
Recently an interesting connection between the pointwise dimensions, multifractal analysis, and recurrence behavior of trajectories was discovered (Afraimovich et al. 2000; Barreira and Saussol 2001a; Boshernitzan 1993). Roughly speaking, given an ergodic probability measure $m$, the return time asymptotics (as the neighborhood of the point shrinks) of $m$-a.e. point are determined by the pointwise dimension of $m$ at this point. A deeper understanding of this relation would help to get a unified approach to dimensions, exponents, entropies, recurrence times and correlation decay.

Dimension Theory of Low-Dimensional Dynamical Systems – Young’s Dimension Formula

In this section a remarkable extension of Ruelle’s dimension formula by Young (1982) for the dimension of a measure is discussed.

Theorem 7 Let $T : M^2 \to M^2$ be a $C^2$ surface diffeomorphism and let $m$ be an ergodic measure. Then

$$\dim_H(m) = h_m(T)\Bigl( \frac{1}{\lambda_1} - \frac{1}{\lambda_2} \Bigr),$$

where $\lambda_1 \ge \lambda_2$ are the two Lyapunov exponents for $m$.
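Young’s formula from Theorem 7 can be checked on two textbook cases. The sketch below additionally assumes $\lambda_1 > 0 > \lambda_2$ so that both reciprocals are defined. The two examples are standard ones and are not taken from this article: Lebesgue measure for the linear torus automorphism with matrix ((2, 1), (1, 1)), and the measure of maximal entropy of a linear horseshoe with expansion rate 3 and contraction rate 1/3. The function name is hypothetical.

```python
import math


def young_dimension(entropy, lam1, lam2):
    """dim_H(m) = h_m * (1/lam1 - 1/lam2), assuming lam1 > 0 > lam2."""
    if not lam1 > 0.0 > lam2:
        raise ValueError("this sketch assumes lam1 > 0 > lam2")
    return entropy * (1.0 / lam1 - 1.0 / lam2)


# Torus automorphism ((2, 1), (1, 1)) with Lebesgue measure:
# exponents are +/- log((3 + sqrt 5) / 2) and the entropy equals the positive one.
lam = math.log((3.0 + math.sqrt(5.0)) / 2.0)
dim_cat = young_dimension(lam, lam, -lam)

# Linear horseshoe, measure of maximal entropy: h = log 2, exponents +/- log 3.
dim_horseshoe = young_dimension(math.log(2.0), math.log(3.0), -math.log(3.0))

print(dim_cat, dim_horseshoe)
```

The first value is 2, as expected for a measure equivalent to area. The second is $2 \log 2 / \log 3$, the dimension of a product of two middle-thirds Cantor sets.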
$$P\bigl( -s \log \bigl| DT_x|_{E^u_x} \bigr| \bigr) = 0,$$

where $E^s$ and $E^u$ are the stable and unstable directions, respectively.

Some Remarks on Dimension Theory for Low-Dimensional Versus High-Dimensional Dynamical Systems
Unlike lower dimensions (one, two, or conformal repellers), for higher-dimensional dynamical systems there are no general dimension formulas (besides the Ledrappier–Young formula), and in general dimension theory is much more difficult. This is due to several problems:

1. The geometry of the Bowen balls differs in a substantial way from round balls.
2. Number theoretic properties of some scaling rates (Pollicott and Weiss 1994; Przytycki and Urbański 1989) enter into dimension calculations in ways they do not in low dimensions (see section “Iterated Function Systems”).
3. The dimension theory of sets is often reduced to the theory of invariant measures. However, there is no invariant measure of full dimension in general and measure-theoretic considerations do not apply (McCluskey and Manning 1983).
4. The stable and unstable foliations for higher dimensional systems are typically not $C^1$ (Hasselblatt 1994; Schmeling 1994). Hence, splitting the system into an expanding and a contracting part is far more subtle.

with $\ldots < \lambda < 1$.

The limit set

$$\Lambda := \bigcap_{n \in \mathbb{N}} T^n (S^1 \times D)$$

is called the attractor or the solenoid. It is an example of a structurally stable basic set and is one of the fundamental examples of a uniformly hyperbolic attractor.

The following result can be proved.

Theorem 9 (Bothe 1995; Hasselblatt and Schmeling 2004) For all $t$, the thermodynamic pressure satisfies

$$P\Bigl( \dim_H \Lambda^s_t \, \log \Bigl| \frac{\partial}{\partial y} \psi^{2}(t, y) \Bigr| \Bigr) = 0.$$

In particular, the stable dimension is independent of the stable section.

In this particular case the invariant axes for strong and weak contraction split the system smoothly and the difficulty is to show that the strong contraction is dominated by the weaker. In particular one has to ensure that effects as described in section “Iterated Function Systems” do not appear. In the general situation this is not the case and one lacks a similar theorem. In particular, the unstable
foliation is not better than Hölder and does not provide a “nice” coordinate.

Given $x \in M$ and a vector $v \in T_xM$ the Lyapunov exponent is defined as

Consider a small ball $B$ in the phase space. The image $T^nB$ is almost an ellipsoid with axes of length

For $1 \le i \le s$, cover $T^nB$ by balls of radius $e^{\chi_i n}$. Then approximately
(1985a) showed that this is indeed a necessary condition. They also provided an exact formula:

Theorem 10 (Ledrappier–Young (1985a, b)) With $d_0 = 0$, for a $C^2$ diffeomorphism one has

$$h_m(f) = \sum_{i=1}^{s} \chi_i \,(d_i - d_{i-1}),$$

where $d_i$ are the dimensions of the (conditional) measure on the $i$th unstable leaves.

The hypotheses of this theorem are sharp. Ledrappier and Misiurewicz (1985) constructed an example of a nonhyperbolic measure for which the pointwise dimension is not constant a.e. In (Pesin and Weiss 1997a), Pesin and Weiss present an example of a Hölder homeomorphism with Hölder constant arbitrarily close to one, where the pointwise dimension for the unique measure of maximal entropy does not exist a.e. There is also a one-dimensional example by Cutler (1990).
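The entropy formula of Theorem 10 can be evaluated mechanically once the exponents and the leaf dimensions are known. The sketch below assumes the indexing convention $\chi_1 > \cdots > \chi_s > 0$ with nondecreasing $d_1 \le \cdots \le d_s$, which this passage does not spell out; the function name and the numerical examples are illustrative and not taken from the article.

```python
import math


def ledrappier_young_entropy(exponents, dims):
    """h_m = sum_i chi_i * (d_i - d_{i-1}) with d_0 = 0."""
    if len(exponents) != len(dims):
        raise ValueError("need one dimension per exponent")
    total = 0.0
    previous = 0.0  # d_0 = 0
    for chi, d in zip(exponents, dims):
        total += chi * (d - previous)
        previous = d
    return total


# One unstable direction: h = chi_1 * d_1, i.e. d_1 = h / chi_1.
h_one = ledrappier_young_entropy([math.log(3.0)], [math.log(2.0) / math.log(3.0)])

# Two unstable directions with full leaf dimensions d_1 = 1, d_2 = 2:
# the sum reduces to chi_1 + chi_2.
h_two = ledrappier_young_entropy([1.5, 0.5], [1.0, 2.0])

print(h_one, h_two)
```

With a single positive exponent the formula gives $d_1 = h_m / \chi_1$, which is the unstable contribution in Young’s surface formula. With full leaf dimensions it reduces to the sum of the positive exponents.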
$$f^{E}_{L}(\alpha) = f^{E}_{m_D}(\dim_H \Lambda \cdot \alpha) \qquad (3)$$

where the entropy spectrum on the right-hand side is with respect to the measure of maximal dimension.

The following list summarizes the state of the art for the dynamical characteristic multifractal analysis of dynamical systems. The precise statements can be found in the original papers.

• (Pesin and Weiss 1997b; Weiss 1999) For conformal repellers and Axiom-A surface diffeomorphisms, a complete multifractal analysis exists for the Lyapunov exponent.
• (Barreira et al. 1997b; Pesin and Weiss 1997b; Weiss 1999) For a mixing subshift of finite type, a complete multifractal analysis exists for the Birkhoff average for a Hölder continuous potential and for the local entropy for a Gibbs measure with Hölder continuous potential.
• (Barreira and Saussol 2001b; Pesin and Sadovskaya 2001) There is a complete multifractal analysis for hyperbolic flows.
• (Takens and Verbitzki 1999) There is a generalization of the multifractal analysis on subshifts with specification and continuous potentials.
• (Barreira and Saussol 2001c; Barreira et al. 2002b) There is an analysis of “mixed” spectra like the dimension spectrum of local entropies and also an analysis of joint level sets determined by more than one (measurable) function.
• (Pollicott and Weiss 1999) For the Gauss map (and a class of nonuniformly hyperbolic maps) a complete multifractal analysis exists for the Lyapunov exponent.
• (Iommi 2005) A general approach to multifractal analysis for repellers with countably many branches is developed. In contrast to the case of finitely many branches, it shows features of nonanalytic behavior.

In the first three statements the multifractal spectra are analytic concave functions that can be computed by means of the Legendre transform of the pressure functional with respect to a suitably chosen family of potentials. In the remaining items this is no longer the case. Analyticity and convexity properties of the pressure functional are lost. However, the authors succeeded in providing a satisfactory theory in these cases.

Multifractal Analysis and Large Deviation Theory
There are deep connections between large deviation theory and multifractal analysis. The variational formula for pressure is an important tool in the analysis, and can be viewed (and proven) as a large deviation result (Ellis 1985). Some authors use large deviation theory as a tool to effect multifractal analysis.

Future Directions

Dimension theory is developing fast and is of great importance in the theory of dynamical systems. In the most ideal situations (low dimensions and hyperbolicity) a far-reaching and powerful theory has been developed. It uses ideas from statistical physics, fractal geometry, probability theory and other fields.

Unfortunately, the richness of this theory does not carry over to higher-dimensional systems. However, recent developments have shown that it is possible to obtain a general theory for the dimension of measures. Part of this theory is the development of the analytic tools of nonuniformly hyperbolic systems.

Overall, the dimension theory of dynamical systems is far from complete. In particular, it is usually difficult to apply the general theory to concrete examples, for instance if one really wants to compute the dimension. The general theory does not provide a way to compute the dimension but rather gives connections to other characteristics. Moreover, in the presence of neutral directions (zero Lyapunov exponents) one encounters all the difficulties arising in low-complexity systems.

Another important open problem is to understand the dimension theory of invariant sets in higher-dimensional spaces. One way would be to
relate the dimension of sets to the dimension of measures. Such a connection is not clear. The reason is that most systems do not admit a measure whose dimension coincides with the dimension of its support (invariant set). But there are some reasons to conjecture that any compact invariant set of an expanding map in any dimension carries a measure of maximal dimension (see (Gatzouras and Peres 1996; Kenyon and Peres 1996)). If this conjecture is true one obtains an invariant measure whose unstable dimension coincides with the unstable dimension of the invariant set. There is also a measure of maximal stable dimension. Combining these two measures one could establish a theory for invariant sets analogous to that for invariant measures.

Last but not least one has to mention the impact of the dimension theory of dynamical systems on other fields. In many cases this new point of view makes the posed problems more tractable. This is illustrated by examples from number theory, geometric limit constructions and others. The applications of the dimension theory of dynamical systems to other questions seem to be unlimited.

Bibliography

Primary Literature
Afraimovich VS, Chernov NI, Sataev EA (1995) Statistical properties of 2D generalized hyperbolic attractors. Chaos 5:238–252
Afraimovich VS, Schmeling J, Ugalde J, Urias J (2000) Spectra of dimension for Poincaré recurrences. Discret Contin Dyn Syst 6(4):901–914
Aihua F, Yunping J, Jun W (2005) Asymptotic Hausdorff dimensions of Cantor sets associated with an asymptotically non-hyperbolic family. Ergodic Theory Dynam Syst 25(6):1799–1808
Alexander J, Yorke J (1984) Fat Baker’s transformations. Ergodic Theory Dynam Syst 4:1–23
Ambroladze A, Schmeling J (2004) Lyapunov exponents are not stable with respect to arithmetic subsequences. In: Fractal geometry and stochastics III. Progr Probab, vol 57. Birkhäuser, Basel, pp 109–116
Artin E (1965) Ein mechanisches System mit quasi-ergodischen Bahnen, collected papers. Addison Wesley, pp 499–501
Barreira L (1995) Cantor sets with complicated geometry and modeled by general symbolic dynamics. Random Comput Dyn 3:213–239
Barreira L (1996a) A non-additive thermodynamic formalism and applications to dimension theory of hyperbolic dynamical systems. Ergodic Theory Dynam Syst 16:871–927
Barreira L (1996b) A non-additive thermodynamic formalism and dimension theory of hyperbolic dynamical systems. Math Res Lett 3:499–509
Barreira L. Variational properties of multifractal spectra. IST preprint
Barreira L, Saussol B (2001a) Hausdorff dimension of measure via Poincaré recurrence. Commun Math Phys 219(2):443–463
Barreira L, Saussol B (2001b) Multifractal analysis of hyperbolic flows. Commun Math Phys 219(2):443–463
Barreira L, Saussol B (2001c) Variational principles and mixed multifractal spectra. Trans Am Math Soc 353(10):3919–3944. (electronic)
Barreira L, Schmeling J (2000) Sets of “non-typical” points have full topological entropy and full Hausdorff dimension. Israel J Math 116:29–70
Barreira L, Pesin Y, Schmeling J (1997a) Multifractal spectra and multifractal rigidity for horseshoes. J Dyn Contr Syst 3:33–49
Barreira L, Pesin Y, Schmeling J (1997b) On a general concept of multifractal rigidity: multifractal spectra for dimensions, entropies, and Lyapunov exponents. Multifractal rigidity. Chaos 7:27–38
Barreira L, Pesin Y, Schmeling J (1999) Dimension and product structure of hyperbolic measures. Ann Math 149:755–783
Barreira L, Saussol B, Schmeling J (2002a) Distribution of frequencies of digits via multifractal analysis. J Number Theory 97(2):410–438
Barreira L, Saussol B, Schmeling J (2002b) Higher–dimensional multifractal analysis. J Math Pures Appl (9) 81(1):67–91
Belykh VP (1982) Models of discrete systems of phase synchronization. In: Shakhildyan VV, Belynshina LN (eds) Systems of phase synchronization. Radio i Svyaz, Moscow, pp 61–176
Billingsley P (1978) Ergodic theory and information. Krieger
Blinchevskaya M, Ilyashenko Y (1999) Estimate for the entropy dimension of the maximal attractor for k-contracting systems in an infinite-dimensional space. Russ J Math Phys 6(1):20–26
Boshernitzan M (1993) Quantitative recurrence results. Invent Math 113:617–631
Bothe H-G (1995) The Hausdorff dimension of certain solenoids. Ergodic Theory Dynam Syst 15:449–474
Bousch T (2000) Le poisson n’a pas d’arêtes. Ann IH Poincaré (Prob-Stat) 36(4):489–508
Bowen R (1973) Topological entropy for noncompact sets. Trans Am Math Soc 184:125–136
Bowen R (1979) Hausdorff dimension of quasi-circles. Publ Math IHES 50:11–25
Bylov D, Vinograd R, Grobman D, Nemyckii V (1966) Theory of Lyapunov exponents and its application to
problems of stability. Izdat “Nauka”, Moscow (in Russian)
Casdagli M, Sauer T, Yorke J (1991) Embedology. J Stat Phys 65:589–616
Ciliberto S, Eckmann JP, Kamphorst S, Ruelle D (1971) Liapunov exponents from times. Phys Rev A 34
Collet P, Lebowitz JL, Porzio A (1987) The dimension spectrum of some dynamical systems. J Stat Phys 47:609–644
Constantin P, Foias C (1988) Navier-Stokes equations. Chicago University Press
Cruchfield J, Farmer D, Packard N, Shaw R (1980) Geometry from a time series. Phys Rev Lett 45:712–724
Cutler C (1990) Connecting ergodicity and dimension in dynamical systems. Ergodic Theory Dynam Syst 10:451–462
Denjoy A (1932) Sur les courbes défines par les équations différentielles á la surface du tore. J Math Pures Appl 2:333–375
Ding M, Grebogi C, Ott E, Yorke J (1993) Estimating correlation dimension from a chaotic times series: when does the plateau onset occur? Phys D 69:404–424
Dodson M, Rynne B, Vickers J (1990) Diophantine approximation and a lower bound for Hausdorff dimension. Mathematika 37:59–73
Douady A, Oesterle J (1980) Dimension de Hausdorff Des Attracteurs. CRAS 290:1135–1138
Eggleston HG (1952) Sets of fractional dimension which occur in some problems of number theory. Proc Lond Math Soc 54:42–93
Ellis R (1984) Large deviations for a general class of random vectors. Ann Prob 12:1–12
Ellis R (1985) Entropy, large deviations, and statistical mechanics. Springer
Falconer K (1990) Fractal geometry, mathematical foundations and applications. Cambridge University Press, Cambridge
Fan AH, Feng DJ, Wu J (2001) Recurrence, dimension and entropy. J Lond Math Soc 64(2):229–244
Frederickson P, Kaplan J, Yorke E, Yorke J (1983) The Liapunov dimension of strange attractors. J Differ Equ 49:185–207
Frostman O (1935) Potential d’équilibre Et Capacité des Ensembles Avec Quelques Applications à la Théorie des Fonctions. Meddel Lunds Univ Math Semin 3:1–118
Furstenberg H (1967) Disjointness in ergodic theory, minimal sets, and a problem in Diophantine approximation. Math Syst Theory 1:1–49
Furstenberg H (1970) Intersections of Cantor sets and transversality of semigroups I. In: Problems in analysis. Sympos Salomon Bochner. Princeton University Press, pp 41–59
Gatzouras D, Peres Y (1996) The variational principle for Hausdorff dimension: a survey. In: Pollicott M et al (eds) Ergodic theory of Zd actions. Proc of the Warwick symposium. Lond Math Soc Lect Note Ser vol 228. Cambridge University Press, pp 113–125
Grassberger P, Procaccia I, Hentschel H (1983) On the characterization of chaotic motions, Lect notes. Physics 179:212–221
Halsey T, Jensen M, Kadanoff L, Procaccia I, Shraiman B (1986) Fractal measures and their singularities: the characterization of strange sets. Phys Rev A 33(N2):1141–1151
Hasselblatt B (1994) Regularity of the Anosov splitting and of horospheric foliations. Ergodic Theory Dynam Syst 14(4):645–666
Hasselblatt B, Schmeling J (2004) Dimension product structure of hyperbolic sets. In: Modern dynamical systems and applications. Cambridge University Press, Cambridge, pp 331–345
Henley D (1992) Continued fraction cantor sets, Hausdorff dimension, and functional analysis. J Number Theory 40:336–358
Hentschel HGE, Procaccia I (1983) The infinite number of generalized dimensions of fractals and strange attractors. Physica 8D:435–444
Herman MR (1979) Sur la conjugaison différentiable des difféomorphismes du cercle á des rotations. Publ l’Inst Math Hautes Études Sci 49:5–234
Hunt B (1996) Maximal local Lyapunov dimension bounds the box dimension of chaotic attractors. Nonlinearity 9:845–852
Iommi G (2005) Multifractal analysis for countable Markov shifts. Ergodic Theory Dynam Syst 25(6):1881–1907
Jarnik V (1931) Über die simultanen diophantischen Approximationen. Math Zeitschr 33:505–543
Jenkinson O (2001) Rotation, entropy, and equilibrium states. Trans Am Math Soc 353:3713–3739
Kalinin B, Sadovskaya V (2002) On pointwise dimension of non-hyperbolic measures. Ergodic Theory Dynam Syst 22(6):1783–1801
Kaplan JL, Yorke JA (1979) Functional differential equations and approximation of fixed points. Lecture notes. In: Mathematics, vol 730. Springer, Berlin, pp 204–227
Katznelson Y, Weiss B (1982) A simple proof of some ergodic theorems. Isr J Math 42:291–296
Kenyon R, Peres Y (1996) Measure of full dimension on affine invariant sets. Ergodic Theory Dynam Syst 16:307–323
Kesseböhmer M (1999) Multifrakale und Asymptotiken grosser Deviationen. Thesis U Göttingen, Göttingen
Kingman JFC (1968) The ergodic theory of subadditive stochastic processes. J R Stat Soc B30:499–510
Kleinbock D, Margulis G (1998) Flows on homogeneous spaces and Diophantine approximation on manifold. Ann Math 148:339–360
Kra B, Schmeling J (2002) Diophantine classes, dimension and Denjoy maps. Acta Arith 105(4):323–340
Ledrappier F (1981) Some relations between dimension and Lyapounov exponents. Commun Math Phys 81:229–238
Ledrappier F (1986) Dimension of invariant measures. Proceedings of the conference on ergodic theory and
680 Ergodic Theory: Fractal Geometry
related topics II (Georgenthal, 1986). Teubner–texte Palis J, Viana M (1988) On the continuity of Hausdorff
Math 94:137–173 dimension and limit capacity for horseshoes. Lecture
Ledrappier F, Misiurewicz M (1985) Dimension of invari- notes in math, vol 1331. Springer
ant measures for maps with exponent zero. Ergodic Pesin Y (1977) Characteristic exponents and smooth ergo-
Theory Dynam Syst 5:595–610 dic theory. Russ Math Surv 32(4):55–114
Ledrappier F, Strelcyn JM (1982) A proof of the estimate Pesin Y, Pitskel B (1984) Topological pressure and the
from below in Pesin’s entropy formula. Ergodic Theory variational principle for noncompact sets. Funct Anal
Dynam Syst 2:203–219 Appl 18:307–318
Ledrappier F, Young LS (1985a) The metric entropy of Pesin Y (1992) Dynamical systems with generalized
diffeomorphisms. I Characterization of measures satis- hyperbolic attractors: hyperbolic, ergodic and topolog-
fying Pesin’s entropy formula. Ann Math 122:509–539 ical properties. Ergodic Theory Dynam Syst 12:
Ledrappier F, Young LS (1985b) The metric entropy of 123–151
diffeomorphisms, II. Relations between entropy, expo- Pesin Y (1993) On rigorous mathematical definition of
nents and dimension. Ann Math 122:540–574 correlation dimension and generalized spectrum for
Lopes A (1989) The dimension spectrum of the maximal dimensions. J Stat Phys 71(3–4):529–547
measure. SIAM J Math Anal 20:1243–1254 Pesin Y (1997a) Dimension theory in dynamical systems:
Mãné R (1981) On the dimension of compact invariant sets rigorous results and applications. Cambridge Univer-
for certain nonlinear maps. Lecture notes in mathemat- sity Press, Cambridge
ics, vol 898. Springer Pesin Y (1997b) Dimension theory in dynamical systems:
Mãné R (1990) The Hausdorff dimension of horseshoes of contemporary views and applications. In: Chicago lec-
surfaces. Bol Soc Bras Math 20:1–24 tures in mathematics. Chicago University Press,
Mauldin RD, Urbański M (1996) Dimensions and mea- Chicago
sures in infinite iterated function systems. Proc Lond Pesin Y, Sadovskaya V (2001) Multifractal analysis of
Math Soc 73(1):105–154 conformal axiom A flows. Commun Math Phys
Mauldin RD, Urbański M (2000) Parabolic iterated func- 216(2):277–312
tion systems. Ergodic Theory Dynam Syst 20(5): Pesin Y, Tempelman A (1995) Correlation dimension of
1423–1447 measures invariant under group actions. Random
Mauldin RD, Urbański M (2002) Fractal measures for Comput Dyn 3(3):137–156
parabolic IFS. Adv Math 168(2):225–253 Pesin Y, Weiss H (1994) On the dimension of determin-
Mauldin DR, Urbański M (2003) Graph directed Markov istic and random cantor-like sets. Math Res Lett 1:
systems. In: Geometry and dynamics of limit sets. 519–529
Cambridge tracts in mathematics, vol 148. Cambridge Pesin Y, Weiss H (1996) On the dimension of deterministic
University Press, Cambridge and random cantor-like sets, symbolic dynamics, and
McCluskey H, Manning A (1983) Hausdorff dimension the Eckmann-Ruelle conjecture. Commun Math Phys
For horseshoes. Ergodic Theory Dynam Syst 3: 182:105–153
251–260 Pesin Y, Weiss H (1997a) A multifractal analysis of equi-
Moran P (1946) Additive functions of intervals and librium measures for conformal expanding maps and
Hausdorff dimension. Proc Cambr Philos Soc 42:15–23 Moran-like geometric constructions. J Stat Phys 86:
Moreira C, Yoccoz J (2001) Stable intersections of cantor 233–275
sets with large Hausdorff dimension. Ann Math 154(1): Pesin Y, Weiss H (1997b) The multifractal analysis of
45–96 Gibbs measures: motivation. Mathematical foundation
Neunhäuserer J (1999) An analysis of dimensional theo- and examples. Chaos 7:89–106
retical properties of some affine dynamical systems. Petersen K (1983) Ergodic theory. Cambridge studies in
Thesis, Free University Berlin, Berlin advanced mathematics 2. Cambridge University Press,
Oseledets V (1968) A multiplicative ergodic theorem. Cambridge
Liapunov characteristic numbers for dynamical sys- Pollicott M, Weiss H (1994) The dimensions of some self
tems. Trans Moscow Math Soc 19:197–221 affine limit sets in the plane and hyperbolic sets. J Stat
Ott E, Sauer T, Yorke J (1994) Part I Background. In: Phys 77:841–866
Coping with chaos. Wiley Ser Nonlinear Sci. Wiley, Pollicott M, Weiss H (1999) Multifractal analysis for the
New York, pp 1–62 continued fraction and Manneville-Pomeau transfor-
Palis J, Takens F (1987) Hyperbolicity and the creation of mations and applications to Diophantine approxima-
homoclinic orbits. Ann Math 125:337–374 tion. Commun Math Phys 207(1):145–171
Palis J, Takens F (1993) Hyperbolicity and sensitive cha- Przytycki F, Urbański M (1989) On Hausdorff dimension
otic dynamics at homoclinic bifurcations. Cambridge of some fractal sets. Stud Math 93:155–167
University Press, Cambridge Ruelle D (1978) Thermodynamic formalism. Addison-
Palis J, Takens F (1994) Homoclinic tangencies for hyper- Wesley
bolic sets of large Hausdorff dimension. Acta Math Ruelle D (1982) Repellers for real analytic maps. Ergodic
172:91–136 Theory Dynam Syst 2:99–107
Schmeling J (1994) Hölder continuity of the holonomy maps for hyperbolic basic sets II. Math Nachr 170:211–225
Schmeling J (1997) Symbolic dynamics for Beta-shifts and self-normal numbers. Ergodic Theory Dynam Syst 17:675–694
Schmeling J (1998) A dimension formula for endomorphisms – the Belykh family. Ergodic Theory Dynam Syst 18:1283–1309
Schmeling J (1999) On the completeness of multifractal spectra. Ergodic Theory Dynam Syst 19:1–22
Schmeling J (2001) Entropy preservation under Markov codings. J Stat Phys 104(3–4):799–815
Schmeling J, Troubetzkoy S (1998) Dimension and invertibility of hyperbolic endomorphisms with singularities. Ergodic Theory Dynam Syst 18:1257–1282
Schmeling J, Weiss H (2001) Dimension theory and dynamics. AMS Proc Symposia Pure Math Ser 69:429–488
Series C (1985) The modular surface and continued fractions. J Lond Math Soc 31:69–80
Simon K (1997) The Hausdorff dimension of the general Smale–Williams solenoid with different contraction coefficients. Proc Am Math Soc 125:1221–1228
Simpelaere D (1994) Dimension spectrum of axiom-A diffeomorphisms, II. Gibbs measures. J Stat Phys 76:1359–1375
Solomyak B (1995) On the random series ∑ ±λ^n (an Erdős problem). Ann Math 142(3):611–625
Solomyak B (2004) Notes on Bernoulli convolutions. In: Fractal geometry and applications: a jubilee of Benoît Mandelbrot. Proc Sympos Pure Math, vol 72, Part 1. American Mathematical Society, Providence, pp 207–230
Stratmann B (1995) Fractal dimensions for Jarnik limit sets of geometrically finite Kleinian groups; the semi-classical approach. Ark Mat 33:385–403
Takens F (1981) Detecting strange attractors in turbulence. Lecture notes in mathematics, vol 898. Springer
Takens F, Verbitzki E (1999) Multifractal analysis of local entropies for expansive homeomorphisms with specification. Commun Math Phys 203:593–612
Walters P (1982) Introduction to ergodic theory. Springer
Weiss H (1992) Some variational formulas for Hausdorff dimension, topological entropy, and SRB entropy for hyperbolic dynamical systems. J Stat Phys 69:879–886
Weiss H (1999) The Lyapunov spectrum of equilibrium measures for conformal expanding maps and axiom-A surface diffeomorphisms. J Stat Phys 95(3–4):615–632
Young LS (1981) Capacity of attractors. Ergodic Theory Dynam Syst 1:381–388
Young LS (1982) Dimension, entropy, and Lyapunov exponents. Ergodic Theory Dynam Syst 2:109–124
Books and Reviews
Bowen R (1975) Equilibrium states and the ergodic theory of Anosov diffeomorphisms. Lecture notes in mathematics, vol 470. Springer
Eckmann JP, Ruelle D (1985) Ergodic theory of chaos and strange attractors. Rev Mod Phys 57:617–656
Federer H (1969) Geometric measure theory. Springer
Hasselblatt B, Katok A (2002) Handbook of dynamical systems, vol 1, Survey 1. Principal structures. Elsevier
Katok A (1980) Lyapunov exponents, entropy and periodic orbits for diffeomorphisms. Inst Hautes Études Sci Publ Math 51:137–173
Katok A, Hasselblatt B (1995) Introduction to the modern theory of dynamical systems. Cambridge University Press, Cambridge
Keller G (1998) Equilibrium states in ergodic theory. In: London Mathematical Society student texts 42. Cambridge University Press, Cambridge
Mañé R (1987) Ergodic theory and differentiable dynamics. In: Ergebnisse der Mathematik und ihrer Grenzgebiete 3, vol 8. Springer
Roy M, Urbański M (2005) Regularity properties of Hausdorff dimension in infinite conformal iterated function systems. Ergodic Theory Dynam Syst 25(6):1961–1983
Mattila P (1995) Geometry of sets and measures in Euclidean spaces. In: Fractals and rectifiability. Cambridge University Press, Cambridge
Pugh C, Shub M (1989) Ergodic attractors. Trans Am Math Soc 312(1):1–54
Takens F (1988) Limit capacity and Hausdorff dimension of dynamically defined Cantor sets. Lecture notes in math, vol 1331. Springer
Index
Interval maps, 383, 643–645, 649, 652, 654–657
Invariant integrals, 462
Invariant measures, 294, 374
  entropy of, 374–375
Invariant probability measure, 635, 639, 644, 650–652
Invariants of étale equivalence relations, 497
Inverse limits, 27
Invertible measure preserving system, 66, 70, 71
Ising chain, 382
Ising model, 370
Isomorphism, 39, 493–498, 501, 502, 505, 507, 508, 510, 511, 516, 521, 522
Iterated function systems, 669
Iterated Kreiss condition, 479
ITPFI transformation, 251–252
J
Jacobs-deLeeuw-Glicksberg decomposition, 477
Jakobson one-dimensional theorem, 647
Jakobson’s method, 329
Jewett–Krieger theorem, 402, 506
Jewett-Krieger type realization, 494, 506
Jiang-Su algebra, 516
Jiang-Su stability, 516
Joinings, 294, 298
Joinings in ergodic theory
  and conjectures in number theory, 164
  definition, 149, 150
  disjointness, 153
  and factors, 155
  filtering problems, 160
  future research, 166
  and isomorphism, 154
  Markov intertwinings, 156
  and multiple ergodic averages, 163
  Ornstein’s and Krieger’s theorems, 161
  and Rohlin’s multifold mixing, 161, 163
  self-joining (see Self-joinings)
  set of, 152
Julia set, 22, 213
K
Kac’s formula, 235
Kakutani equivalence, 211–212, 562–563, 573
Kakutani-Rokhlin tower decomposition, 511, 517, 518
Kakutani transformation, 241
Kaplan–Yorke formula, 674
Katok formulae, 650
Katok fundamental class, 345
K-automorphism, 258
Keane conjecture, 347
Keane’s condition, 339
Khintchine–Groshev Theorem, 600
Khintchine’s recurrence theorem, 75
Khintchine’s transference principle, 600
Kingman’s theorem, 91
King’s weak closure theorem, 278
Kneading invariants, 652, 653, 657
Kochergin flows, 354
Kolmogorov–Arnold–Moser (KAM) theory, 314
Kolmogorov automorphism, 182
Kolmogorov-Chaitin complexity, 371
Kolmogorov extension theorem, 37
Kolmogorov process (K-process), 181–182
Kolmogorov–Sinai entropy, 205, 223
Kolmogorov-Sinai theorem, 177
Kontsevich-Zorich conjecture, 350
Koopman mixing, 257
Koopman operator, 36, 39, 221, 539
Koopman representation
  Alexeyev’s theorem, 115–116
  Alpern-Tikhonov topology, 123
  Banach problem, 112
  disjoint, 114
  entropy, 114
  ergodic dynamical system, 112
  Gaussian-Kronecker automorphism, 124
  Lamperti theorem, 113
  Markov operators, 113
  maximal spectral types, 114
  multiplicity (see Multiplicity function)
  pairwise independence property, 114
  spectral isomorphism, 112
  Thouvenot. Funny rank one, 123
Koopman unitary operator
  associated with nonsingular Gaussian transformation, 269
  associated with nonsingular Poisson transformation, 268–269
  for nonsingular system, 267–268
Kreiss bounded operators, 479
Krengel class, 259–260
Krengel entropy, 269
Krengel-Pinsker factor, 270
Krengel-Sucheston concept of mixing, 257
Krickeberg mixing, 255
Krieger embedding theorem, 442–443
Krieger generator theorem, 177
Krieger’s theorem, 161, 246, 494, 505
Kronecker factor, 48, 395
Kronecker subset, 217
Krylov–Bogoliubov theorem, 346, 537, 638
L
Lacunary sequence, 66
Lagrange multiplier, 372
Laplace–Beltrami operator, 585, 586
Large deviations principles, 371
Lasota–Yorke inequality, 644
Lebesgue density theorem, 639
Lebesgue measure, 5–7, 9, 10, 18, 19, 22, 37, 45, 86, 316, 599, 605
Lebesgue number, 185
Lebesgue probability space, 27, 39, 42
Lebesgue space, 23, 46
Orbit equivalence of minimal ℤ-actions on the Cantor set, 503
Orbit theory, 246
  cocycles of dynamical systems, 250–251
  continuous orbit equivalence, 248–249
  full groups, 246–247
  ITPFI transformation, 251–252
  Maharam extension, 247–248
  normalizer of the full group, 249–250
Ordered Bratteli diagram, 500
Orientation preserving, 6
  isometries, 15
Ornstein’s isomorphism theorem, 206–207
Ornstein’s theorem, 161, 206
Ornstein’s theory, 56, 182
Ornstein-Weiss tiling theorem, 511, 517
Orthogonality of sequences, 295
Orthonormal basis, 50
Oseledec’s theorem, 325
Oseledec and Pesin theory, 674
Oseledets theorem, 642
P
Pairwise independence property (PID), 114
Pairwise independently determined system (PID), 162
Palis conjecture, 640
PAPs, 229
Parabolic systems, 671
Parameter exclusion, 329
Parry entropy, 270
Partial hyperbolicity, 18, 324
Partially hyperbolic dynamical systems, 18
  Anosov flows, time-one maps of, 18
  uniformly hyperbolic systems, compact group extensions of, 18
Partially mixing, 49
Pascal adic transformation, 56
Periodic components, 339
Periodic points, 381
Perron-Frobenius theorem, 379–380
Perron-Frobenius theory, 446
Perron numbers, 452
Pesin’s theory of dimension-like characteristics, 668
Pesin theory, 19, 325, 642, 656
Phase space, 12, 35
Phase space average, 638, 648
Phase-structure grammars, 9
Physical measure, 639, 656
Piecewise C² expanding maps, 10
Piecewise monotone maps, 652
Pigeon hole principle, 236
Pinsker algebra, 214
Pinsker σ-algebra, 51
Pinsker-algebra theorem, 182
Pinsker factor, 181
Pinsker field, 181
Poincaré map, 12, 22
Poincaré recurrence, 64–65, 579, 583
  theorem, 390, 394
Poincaré sequences, 394
Point process, 217
Pointwise dimension, 667
Pointwise ergodic theorems, 88–90, 579
Poisson entropy, 270–271
Poisson factor, 226
Poisson flow, 262
Poisson point process, 1, 218–220
Poisson processes, 219
Poisson suspension, 220–223, 263–264, 281
Polish group action, 530, 544, 551, 555, 557–558
Polish space, 530–532, 545, 546, 549, 552, 553, 569, 570, 572
Pólya–Carlson dichotomy, 589
Polynomial deviations of ergodic averages, 350
Polynomial rate (PR), 305
Pontryagin duality, 540
Positive definite sequence, 67
Positive upper density, 582
Power-bounded operator
  on Banach space, 471, 473
  ergodic, 464, 477
  on Hilbert space, 473
  on reflexive Banach space, 462
Power series, 465
Predecessor sets, 9
Pressure functional, 668
Prime number theorem, 585, 586
Prime shift, 431
Prime volume, 305
Probabilistic limit theorems, 648
Probability distribution, 374
Probability measure, 8
Probability space, 4, 39, 43, 44, 170–172
Probability vector, 7, 171, 175, 177
Product measure, 7
Product transformation, 40
Prouhet-Thue-Morse (PTM) sequence, 8, 432
Proximal action, 520
Q
Qualitative theory, 533, 540
Quantitative nondivergence method, 608
Quantitative Poincaré recurrence, 64–65
Quantitative weak mixing, 358
Quasi-compact operator, 466
Quasi-ergodic hypothesis, 36, 577, 579
Quasi-genericity vs. logarithmic quasi-genericity, 295
Quasi-hyperbolic automorphisms, 590
R
Radius of comparison, 512–514
Radon-Nikodym derivative, 22, 244, 258
Radon–Nikodym theorem, 85
Ramsey theory, 420, 582–584
Z
Zeeman’s tolerance stability conjecture, 654
Zero-entropy, 180–181
  continuous interval maps, 304
  detection, 307
  Gaussian system, 224
Zeta function, 428, 436, 437, 439, 577, 585–587, 589
ℤ^d-odometers, 509
U
Uniform approximation, 599
Uniform ergodic theorem, 465–468
Uniform hyperbolicity, 641, 642, 650, 653, 654, 656
Uniform Kreiss resolvent condition, 479
Uniformly expanding map, 641, 649, 652, 653, 655
Uniformly recurrent subgroup, 523