Gravitational Million Body Problem
Gravitational Million Body Problem
Douglas Heggie
Department of Mathematics and Statistics
University of Edinburgh, Edinburgh EH9 3JZ, Scotland
Piet Hut
School of Natural Sciences
Institute for Advanced Study, Princeton, NJ 08540, U.S.A.
For Eiko, Linda, Caroline and Alison
Table of Contents
Preface iii
PART I INTRODUCTIONS 1
1 Astrophysics Introduction 3
2 Theoretical Physics Introduction 15
3 Computational Physics Introduction 25
4 Mathematical Introduction 35
PART II THE CONTINUUM LIMIT: N → ∞ 45
5 Paradoxical Thermodynamics 47
6 Statistical Mechanics 54
7 Motion in a Central Potential 63
8 Some Famous Models 74
9 Methods 84
PART III MEAN FIELD DYNAMICS: N = 106 98
10 Violent Relaxation 99
11 Internal Mass Loss 111
12 External Influences 119
PART IV MICROPHYSICS: N = 2 129
13 Exponential Orbit Instability 130
14 Two-Body Relaxation 138
15 From Kepler to Kustaanheimo 154
i
ii Table of Contents
Since this book is aimed at a broad audience within the physical sci-
ences, we expect most of our readers not to be experts in either astro-
physics or mathematics. For those readers, the title of this book may seem
puzzling at least. Why should they be interested in the gravitational at-
traction between bodies? What is so special about a millionindexmillion-
body problem, rather than a billion or a trillion bodies? What kind of
bodies do we have in mind? And finally, what is the problem with this
whole topic?
In physics, many complex systems can be modeled as an aggregate of
a large number of relatively simple entities with relatively simple interac-
tions between them. It is one of the most fascinating aspects of physics
that an enormous richness can be found in the collective phenomena that
emerge out of the interplay of the much simpler building blocks. Smoke
rings and turbulence in air, for example, are complex manifestations of
a system of air molecules with relatively simple interactions, strongly re-
pulsive at small scales and weakly attractive at larger scales. From the
spectrum of avalanches in sand piles to the instabilities in plasmas of
more than a million degrees in labs to study nuclear fusion, we deal with
one or a few constituents with simple prescribed forces. What is special
about gravitational interactions is the fact that gravity is the only force
that is mutually attractive. Unlike a handful of protons and electrons,
where like charges repel and opposite charges attract, a handful of stars
shows attraction between every pair of stars. As a result, a star cluster
holds itself together: there is no need for a container (as with a gas or
plasma in a lab) or a table (as with a sand pile). And for astronomers,
an extra reason to study star clusters is simply: because they are there.
iii
iv Preface
∗
Addressing an astronomer who had just remarked that “after all, a star is a pretty
simple thing”, R.O. Redman pointed out, that, “at a distance of 10 parsecs you’d
look pretty simple!”
Preface v
∗
Lectures on Celestial Mechanics
viii Preface
the theory. We realised also that the variety of topics which make up the
gravitational million-body problem are best treated at a variety of levels,
and that the interconnectedness of these topics forces interconnections
between the chapters. Some of the topics we address might even strike
some readers as being faintly whimsical. Recognising, therefore, that not
all chapters will be equally accessible or interesting to all readers, we
start each part of the book (each of which consists of several connected
chapters) with an outline of its contents. We hope that readers will thus
find their own optimal route between the contents page and the index.
For much the same variety of reasons we have relegated some material
to boxes. Often these contain details of some derivation or discussion, and
a reader in a hurry could avoid them. Sometimes, however, they contain
background which might help a reader over some difficulty. Sometimes
they collect some useful results which might act as a reference resource.
Sometimes they explore interesting or amusing by-ways that would oth-
erwise interrupt the text. Perhaps, therefore, they should not be passed
over too readily.
We hope the index will both help and intrigue the reader, but the
quirky and personal selection of names there is not meant as a compre-
hensive answer to the question of who did what. (Last time we looked, we
could not even find our own names there.) Even in the extensive list of
references to published work we have not attempted completeness. While
this risks antagonizing the reader for the omission of his or her own most
cherished paper, for which we apologise, it could be just as hazardous
if we tried to attribute every advance to its originator. Our main rea-
son for giving references is to give the reader an entry into the research
literature. Nowadays it is quite easy to move both forwards and back-
wards from a suitable entry point, using such invaluable bibliographical
resources as the Astronomy Abstract Service of the NASA Astrophysics
Data System.
Acknowledgements
Newton’s equations for the gravitational N -body problem are the start-
ing point for all four chapters in Part I, but each time seen in a different
light. To the astrophysicist (Chapter 1) they represent an accurate model
for the dynamical aspects of systems of stars, which is the subject known
as stellar dynamics. We distinguish this from celestial mechanics, and
sketch the distinction between the two main flavours of stellar dynamics.
This book is largely devoted to what is often (but maybe misleadingly)
called collisional stellar dynamics. This does not refer to actual physical
collisions, though these can happen, but to the dominant role of gravita-
tional encounters of pairs of stars. In dense stellar systems their role is
a major one. In collisionless stellar dynamics, by contrast, motions are
dominated by the average gravitational force exerted by great numbers
of stars.
We lay particular emphasis on the stellar systems known as globular
star clusters. We survey the gross features of their dynamics, and also
the reasons for their importance within the wider field of astrophysics.
Though understanding the million-body problem is not among the most
urgent problems in astrophysics, through globular clusters it has close
connections with several areas which are. Another practical topic we
deal with here is that of units, which may be elementary, but is one area
where the numbers can easily get out of hand.
Chapter 2 looks at the N -body equations from the point of view of
theoretical physicists. To them, the gravitational many-body problem is
just one of many interesting N -body problems. In some ways it is simple
(being scale-free), and in other ways complicated (long and short ranges
have comparable importance, for example, unlike in plasmas). Theoreti-
cal physicists understand the two-body problem, just like astrophysicists
do, but for large N their usual methods don’t apply very well. Statistical
1
2 PART I. Introduction
X
j=N
ri − rj
r̈i = −G mj , (1)
j=1,j6=i
|ri − rj |3
where rj is the position vector of the jth body at time t, mj is its mass,
G is a constant, and a dot denotes differentiation with respect to t. Now,
over 300 years later, our motivation has a very different emphasis. The
study of the solar system has lost none of its importance or fascination,
but it now competes with the study of star clusters, galaxies, and even
the structure of the universe as a whole. What is remarkable is that
Newton’s model is as central to this extended field of study as it was in
his own time.
Stellar Dynamics
It may seem surprising that the single, simple set of equations (1) forms a
good first approximation for modeling many astrophysical systems, such
as the solar system, star clusters, whole galaxies as well as clusters of
galaxies (Fig.1). The reason is that gravity, being an attractive long-
range force, dominates everything else in the Universe. The only other
long-range force, electromagnetism, is generally not important on very
large scales, since positive and negative charges tend to screen each other.
Short-range forces, such as gas pressure, are usually only important on
small scales, such as in the interiors of stars. On large scales, comparable
to the size of a galaxy, pressure is rarely important. On intermediate
3
4 1 Astrophysics Introduction
scales we have giant gas clouds in which both pressure and magnetic
fields can play a role.
This dominance of gravity on cosmic scales is a fortunate feature of
our Universe. It implies that it is relatively simple to perform detailed
computer simulations of many astronomical systems. Mastery of the
much more complicated physics of other specializations in astronomy,
such as plasma astrophysics, radiative transfer, or nuclear astrophysics,
is often not immediately necessary. Nor, in applications to star clusters,
is it usually necessary to use the more refined theory of general relativity.
The velocities are too low, except in a few situations involving degenerate
stars (neutron stars). Even the motions of stars round the black hole at
the centre of our Galaxy (Ghez et al. 2000), though it takes place on a
human time scale, can be understood with classical dynamics.
Stellar dynamics can be defined as studying the consequences of Eq.
(1) in astrophysical contexts. Traditionally these equations were discov-
ered by studying the motions of the moon and planets, and for the next
few centuries they were applied mainly to planetary dynamics. Before
the advent of electronic computers, most effort went into developing an-
alytical approximations to the nearly regular motion of the planets. This
field, known as celestial mechanics, had an important influence on devel-
opments both in physics and mathematics. An example is the study of
chaos, which first arose as an annoying complexity barring attempts to
make long-time predictions in celestial mechanics.
The solar system, however, is a very regular system. All planets move
in orbits close to the ecliptic, and all revolve in the same direction. The or-
bits are well-separated, and consequently no close encounters take place.
When we look around us on larger scales, that of star clusters and galax-
ies, then no such regularity applies. In our galaxy, most stars move in
the galactic disk, revolving in the same sense as the sun, but close en-
counters are not excluded. In many globular star clusters, there is not
even a preferred direction of rotation, and all stars move as they please
in any direction. Does this mean that analytic approximations are not
very useful for such systems?
The answer is: “it depends on what you would like to know”. In
general, the less regular a situation is, the larger the need for a computer
simulation. For example, during the collision of two galaxies each star in
each of the galaxies is so strongly perturbed that it becomes very difficult
to predict the overall outcome with pen and paper. This is indeed an
area of research which had to wait for computers to even get started,
in the early seventies (Toomre & Toomre 1972; but see the reference to
Holmberg in Chapter 3). On the other hand, when we want to understand
Stellar Dynamics 5
the conditions in a relatively isolated galaxy, such as our own Milky Way,
then there is some scope for pen-and-paper work. For example, a rich
variety of analytic as well as semi-numerical models has been constructed
for a range of galaxies. However, even in this case we often have to switch
to numerical simulations if we want to obtain more precise results.
components of our galactic system, and going their own way in wide orbits
around the galaxy (cf. Webbink 1988). They are nearly spherical because
they don’t rotate much by astronomical standards (cf. Merritt et al. 1997,
Davoust & Prugniel 1990). They are largely isolated from perturbing
influences of the galactic disk, and therefore form ideal laboratories for
stellar dynamics. Typically they have about a million stars each (Table
1), with a range of individual stellar masses (e.g. Paresce & De Marchi
2000).
Other galaxies also have globular clusters (Fig.1, bottom; Ashman &
Zepf 1998). In some galaxies they are not the oldest but some of the
youngest stellar components (see Lançon & Boily 2000). For instance the
cluster NGC 1866, which lies in the Large Magellanic Cloud, has a mass
of about 105 M but is only about 108 yrs old (Fischer et al. 1992, Testa
et al. 1999). Even in our own galaxy, the most massive young clusters are
found in the apparently hostile environment of the Galactic Centre (Figer
et al. 1999), and they are as massive as a modest old globular cluster
(though they will certainly not last as long, cf. Kim et al. 2000, Portegies
Zwart et al. 2001). The realisation that the formation of globular clusters
is still going on in the universe around us has taken hold only relatively
recently, and has helped to rejuvinate the study of these stellar systems.
For the most part, though, we concentrate on the old and better observed
systems in our own Galaxy.
Box 1. Units
Consider a stellar system whose total mass and energy are M and E,
respectively. Then we may choose M for the unit of mass, and in simple
GM 2
cases a common and useful unit of length is the virial radius Rv = − .
4E
(For example, for a binary star this is roughly the diameter of the relative
orbit.) Then, if we want units in which the constant of gravitation is unity,
3/2 √
the unit of time is required to be Ut = Rv / GM . Another √ common
unit of time is the crossing time, which is defined to be 2 2Ut .
Now in the evaluation of these expressions Rv and M will often be in
units favoured by astrophysicists (e.g. parsecs and solar masses, respec-
tively), while G is most familiar in one of the standard physical systems,
such as SI. The evaluation of Ut then requires conversion to a common
system of units. At this point, because of the enormous distances and
time scales which prevail in astronomy, the formulae bristle with enor-
mous exponents, until the final result is re-expressed in astrophysically
sensible units, and the numbers become reasonably once again. Indeed
it is one of the minor pleasures and surprises of a theorist’s life, and a
clue that he is on the right lines, when sensible numbers emerge at the
end of a long and tiresome calculation studded with huge powers of ten.
Sir Harold Jeffreys once remarked that “incorrect geophysical hypotheses
usually fail by extremely large margins” (Jeffreys 1929). The same is true
in astrophysics.
To avoid this unpleasantness it is much simpler to calculate entirely
within the astrophysical system, provided that you know the value of G
in these units. The following remarks will therefore save much time. Con-
sider units such that masses are measured in M , speeds are measured in
km/sec, and distances are measured in parsecs. Then G = 1/232 approx-
imately. Also, the corresponding unit of time is 106 years approximately.
In integrating the N -body equations numerically, still other units are
used. Part of the reason for this is to avoid carrying around the values
of physical constants, such as G, that would be needed in any physical
system, partly to avoid the huge and unreadable numbers that would
result from an inappropriate choice of physical units, but mainly so as not
to obscure the intrinsic scalability of many simple N -body calculations.
In simulations it is natural to choose units such that G = 1, and,
following Hénon (1971), the units of length and mass are conventionally
chosen so that the total mass of the system is M = 1 and the total
energy is E = −1/4. These choices might seem bizarre, but it then
follows that, if the system is in virial equilibrium (chapter 8), the potential
energy is W = −1/2, and the virial radius is then Rvir = 1. These units
are referred to as N -body units, and are in widespread, but not quite
universal, use. They become rather ambiguous when external fields or
many binary stars are included in the computations.
A Recipe for a Star Cluster 9
Fig. 2. The central region of the globular cluster M15. Image by courtesy of P.
Guhathakurta, UC Santa Cruz.
10 1 Astrophysics Introduction
dying off as their stars escape. Here we paint a few details on this picture,
showing what we think are the stages through which a cluster would have
to pass in order to arrive at something resembling those we see around
us.
To make a star cluster, in practice, one should start out with a large
gas cloud, which under the right conditions undergoes internal collapse on
several length and time scales, to produce a large number of stars. This is
a process which is being modelled with increasing sophistication (see, for
example, the beautiful movie at http://www.ukaff.ac.uk/movies.shtml, or
Klessen & Burkert 2001). Nevertheless the physical processes related to
star formation are very complex, and at present only partly understood.
Therefore, let us perform a thought experiment in which we will cheat
a bit. Let us take a bucket full of ready-made single stars, and sprinkle
them into a limited region of space, in order to watch just the stellar
dynamics between the stars at play.
The simplest way to place the stars in space, is to start from rest. As
soon as the stars are allowed to start moving, the whole system will begin
to collapse under the mutual gravitational attraction of all the stars. If
the stars are sprinkled in at random, the contraction will not proceed
very far, since the individual star-star interactions will cause deflections
from a purely radial infall. The stars will pass at some distance from the
centre, each one at a somewhat different time, and then move out again.
As a consequence, the whole system will breath a few times: globally
shrinking and expanding. However, phase coherence will be lost very
soon, and after a few breathing motions, the stars will be pretty well
mixed. Some stars will be lost, spilled into the surrounding space, never
to return, but most stars will remain bound to the system.
After those first few crossing times, not much will happen, for quite a
while. During a typical crossing time, a typical star will move through
the system on a rosette-shaped orbit, while almost conserving its energy
and angular momentum. On a longer time scale, though, the cumulative
effect of many distant encounters, and occasional closer encounters, will
affect the orbit. If we wait long enough, the memory of the original energy
and angular momentum of the star will be lost. This time scale is called
the relaxation time scale.
Even if the system was set off in a non-spherical distribution, such
as that of a pancake, say, the original order will be erased after a few
relaxation time scales, and the system will become spherical. There will
also be a steady trickle of stars that escape from the system. In addition,
energy will be transported from the centre to the outer regions, and as
a result the inner regions tend to contract. Before long, the centre will
gain in density, by many orders of magnitude, a phenomenon called core
collapse.
Dynamics of Dense Stellar Systems 11
After core collapse, double stars will be formed, that provide a source
of energy for the system, empowering the ongoing evaporation of the
system, until the whole star cluster is dissolved. While the star cluster
slowly evaporates, the system will undergo core oscillations, with vast
swings in central density, if the number of stars is large enough, well
over 104 . These so-called gravothermal oscillations are the result of the
fact that the inner regions, having much shorter relaxation times than
the rest of the system, grow impatient in the post-collapse phase, and
start collapsing all over again. Interestingly, these oscillations exhibit
mathematical chaos, one more example of the gravitational many-body
problem providing a stage for some of the most modern forms of applied
mathematics.
This general picture, of a star cluster being formed, coming to rest,
undergoing a slow form of core ‘collapse’, followed by evaporation, will
be spun out in much greater detail throughout the rest of the book. But
before we take a closer look to the specific million-body problem, let us
see how much the picture can change if we treat the stars as stars and
not as point masses.
in which globular star clusters did not have something important to say,
and this remains true to the present day.
Until the early 1970s these parallel tracks in the study of globular
clusters – the dynamical and the astrophysical – proceeded entirely in-
dependently. The two communities had nothing in common and did not
even need to talk to each other. By the 1990s all this had changed. What
happened?
More than anything else, it was the discovery of variable X-ray sources
in globular clusters in the 1970s which brought down the barriers be-
tween the two communities (Fig.3). Though their origin is still not quite
clear, it was soon realised that dynamical processes were important. Dy-
namicists also realised that here was a new process which could have a
profound influence on their understanding of dynamical evolution. Sud-
denly the dynamics of globular clusters had rejoined the mainstream of
astrophysics, and in the 1970s there was not much that seemed more
glamorous than X-ray astronomy.
The integration of the dynamical and stellar studies of globular clusters
has progressed steadily in the intervening decades, and there is even a new
breed of “cross-over” astrophysicists who are at home in both camps. This
grand unification of cluster studies has been strengthened by a stream of
exciting discoveries: binary stars (through the rapid explosion of data on
radial velocities, and other techniques), millisecond pulsars, white dwarfs,
blue stragglers in dense clusters, core collapse, and so on. They are ideal
Fig. 3. A recent X-ray image of the Andromeda galaxy, M31. More than 5% of
the sources lie in globular clusters belonging to Andromeda. Original colour im-
age by courtesy of the ROSAT Mission (http://www.xray.mpe.mpg.de/) and the
Max-Planck-Institut für extraterrestrische Physik (http://www.mpe.mpg.de/)
14 1 Astrophysics Introduction
Problems
1) Compute the virial radius (Box 1) of a binary consisting of two stars
of mass m1 and m2 in a circular relative orbit of radius a.
2) Compute the speed of a star in a circular orbit at the median tidal
radius (Table 1) around a cluster of median mass. Do this in astro-
physical, physical and N -body units.
3) Using the fact that the magnitude, m, and luminosity, L, of a source
at a distance d are related by
L d
m = −2.5 log10 + 5 log10 +M ,
L 10pc
where L and M are the luminosity and absolute magnitude of the
sun, find the mean surface brightness (within the tidal radius) of the
median globular cluster (using data from Table 1). Express your result
in magnitudes per square arc second.
2
Theoretical Physics Introduction
15
16 2 Theoretical Physics Introduction
N = 2, 3
The three-body problem is so famous precisely because it is one of the
oldest problems that cannot be solved. In contrast, the two-body problem
is one of the oldest solved problems. It was Newton’s great triumph
that he was able to explain why planets move in elliptical orbits around
the sun, as had been discovered earlier by the brilliant Kepler, using
observational data by Tycho Brahe. The significance of Newton’s solution
can hardly be overestimated, since the result was a breaching of the
seemingly separated realms of the temporal, human, sub-lunar world and
the eternal world beyond the Moon’s orb.
Though Newton had shown how to solve the two-body problem, the
presence of even an infinitesimal third body (the so-called “restricted”
three-body problem) is insoluble in the usual sense.∗ We should not
forget, however, that Newton was able to solve some three-body problems
approximately. He knew, for instance, that it is better to treat a planet as
revolving around the barycentre of the inner solar system than about the
sun itself. If we do not insist on a precise solution, as with the two-body
problem, in other words, if we look at it from the point of view of physics
rather than mathematics, as Newton did, many aspects of the behaviour
of the three-body problem are not so hard to understand. Indeed it is one
of the quiet triumphs of recent decades that our intuitive understanding
of three-body motions has developed to the extent it has. Most physics
undergraduates are still exposed to the two-body problem, and can easily
develop a feel for the motion in ellipses and hyperbolae, especially when
the scattering problems of atomic physics are encountered. We think that
it is possible to arrive at a similar feel for the richer behaviour of three-
body systems, and one of our aims in writing this book is to demonstrate
this.
The mathematical development of the three-body problem continued
in the hands of Euler, Lagrange, Laplace and many others, who system-
atised the methods of approximate solution. These led to the remarkable
and successful development of the theory of planetary motion; the suc-
cesses of the recovery of Ceres and the discovery of Neptune; and the
familiar but no less remarkable ability of astronomers to predict the mo-
∗
As an aside, it is amusing to see the progress of physics reflected in our (in)ability to
solve N-body problems. In Newtonian gravity we cannot solve the 3-body problem.
In general relativity we cannot solve the 2-body problem. In quantum electrody-
namics we cannot solve the 1-body problem. And in quantum chromodynamics we
cannot even solve the 0-body problem, the vacuum.
N →∞ 17
tions of the planets, satellites, asteroids, and comets. Equally, they led to
profound insights in the emerging fields of dynamical systems and chaos
(Chapter 4), which in turn have illuminated our understanding of N -
body problems. But they also led away from the sort of approach we
need if our goal is to find quantitative answers to questions about the
million-body problem.
N → ∞
Throughout classical physics there are several important analogues to
the million-body problem, and it is from this background, and not so
much from celestial mechanics, that the most fruitful approaches have
come. At first sight the closest analogy is with plasma physics, where
inverse square laws, as in Eq.(1.1), also arise, but the analogy has not
proved as fruitful as one might think.
In the first place plasmas are often nearly neutral, and for this reason
it makes sense to conceive of plasmas which are nearly uniform, nearly at
rest and of large spatial extent. By contrast, it is impossible to conceive
of an infinite uniform gravitational medium in equilibrium (Fig.1). New-
ton himself glimpsed this problem, and solved it by noting that the stars
had been placed at immense distances from each other, lest they should,
“by their gravity, fall on each other”. This was a satisfactory solution
if one supposed that the system of fixed stars were young enough. For
stellar systems the modern solution is that the random motions or circu-
lation of the stars maintain dynamic equilibrium, just as the motions of
the planets prevent their falling into the sun. Even so, we shall see that
the contraction of stellar systems in dynamic quasi-equilibrium is a real
and fascinating issue, though it is a much gentler process than the head-
long rush suggested by the phrase “gravitational collapse”. In problems
of cosmology, Newton’s dilemma is avoided by an overall expansion or
contraction of the entire universe, though a general relativistic treatment
is required.
The second basic feature of an infinite plasma is that several impor-
tant effects are localised within a Debye length, beyond which the rear-
rangement of charges effectively screens off the influence of an individual
charge (see, for example, Sturrock 1994). If one naively computes the
Debye length for a stellar system, it is found usually to be of the order of
the size of the system itself (Problem 2). Thus it is difficult to treat grav-
itational interactions as being localised within a stellar system, though it
is a difficulty that stellar dynamicists habitually ignore. In any event, all
these considerations help to explain why many of the well known proper-
ties of plasmas have little relevance to stellar systems. And yet there is
18 2 Theoretical Physics Introduction
-2
-2
-2
-2
-2 -1 0 1 2 -2 -1 0 1 2 -2 -1 0 1 2 -2 -1 0 1 2 -2 -1 0 1 2
Fig. 1. Collapse of a uniform “cold” sphere. Initially 2048 particles are dis-
tributed randomly within a sphere. In units such that G = 1, the initial radius
√
is 2.4, the total mass is 1, and successive frames are taken at intervals of 2 2/5
(cf. Box 1.1). Late in the collapse (frame 7 and 8) the distribution of particles
has becomes very irregular. In frame 9 the particles which arrived first have be-
gun to reexpand. By the last row of frames the remaining central condensation
has settled nearly into dynamical equilibrium.
one phrase that stellar dynamicists use all the time which betrays their
debt to plasma physics: the words “Coulomb logarithm”, which occur in
the theory of relaxation (Chapter 14).
Thermodynamics
The plasma analogy exploits the form of the force equation but not
the size of N . Physicists routinely study problems which great numbers
of particles using concepts of thermodynamics. How fruitful is such an
approach to the million-body problem?
From a formal point of view, unfortunately, the field of thermodynamics
excludes a description of self-gravitating systems. The reason is that
the existence of gravitational long-range forces violates the notion of an
asymptotic thermodynamic limit in which physical quantities are either
intensive or extensive. Gravity exhibits what is known in particle physics
Thermodynamics 19
∗
A consistent theory is possible if we let R → ∞ with N/R fixed (de Vega & S’anchez
2000), though what relevance this might have to the million-body problem is unclear
20 2 Theoretical Physics Introduction
A Lack of Handles
Even though we cannot use thermodynamics in a formal way, when
dealing with a star cluster, we can still describe the motion of the stars
in a way that is analogous to the treatment of the motion of molecules in
a gas studied in a laboratory. One important difference is that a swarm
of stars forms an open system, while a body of gas in a lab has to be
contained. Typical textbook experiments in thermodynamics show the
gas to reside inside a cylinder, with a movable piston that allows the
experimenter to change the volume of the gas. In a star cluster, there are
no cylinder and piston. Instead, the stars are confined by their collective
gravitational field.
The structural simplicity of a star cluster thus allows far less experi-
mentation than is the case for a body of gas in a lab situation. Whether in
6
0
y
-2
-4
-6
-8 -6 -4 -2 0 2 4 6 8
x
Fig. 2. Escape from an N -body system. This computer model of a star cluster
shows a thin stream of escapers emerging at the left and right. The escapers
are channeled into these streams because of external forces, and the streams are
curved because of Coriolis forces.
Towards an understanding of the million-body problem 21
to one of the most profound insights into the behaviour of stellar systems
– gravothermal stability.
In addition to fundamental approaches like this, there are several other
routes by which the behaviour of N -body systems are explored (Chapter
9). One can construct toy models, one can borrow models from other
areas (such as the theory of stellar evolution, or the kinetic theory of
gases), and these can be studied analytically or by numerical methods.
Finally, simulations may be based, with the minimum of simplifying as-
sumptions, on the numerical integration of Eqs.(1.1). As a result of all
these types of approaches, over the last few decades we have developed
a relatively deep and accurate understanding of the many subtle aspects
of the evolution of a system of gravitating point masses, the topic of this
book. Indeed the study of the gravitational N -body problem must rank
as one of those mature areas of research where the variety of approaches
enrich each other like a community of craftsmen.
Let us highlight a few points of interest for a general physics point of
view.
Perhaps the most fascinating aspect of self-gravitating systems is their
instability∗ , exemplified by the fact that these systems have a negative
heat capacity (Chapter 5). Thermodynamic purists would shudder, but
the idea is intuitively simple to grasp and very powerful. Removing heat
from a system means reducing its kinetic energy of random motions. If
this is done to a self-gravitating system, its constituents fall towards each
other a little, and in doing so actually pick up more kinetic energy than
they lost in the first place. When this concept is applied to part of a
stellar system (as it easily can be in thought experiments) an instability
can result. Imagine what might happen to a block of material with a
negative specific heat, as heat flows from a hot spot: the more heat it
would lose, the hotter it would get, and it would quickly burst into flame.
Next, the N -body problem is a fascinating example of a system for
which no useful equilibrium exists. Particles can and do escape (Fig.2).
A Maxwellian distribution of velocities would always allow particles of
arbitrarily high speed, but in self-gravitating systems there is a finite es-
cape speed. Therefore the notion of “local thermodynamic equilibrium”,
which is such a powerful idea in many areas, has limited usefulness. Even
if we prevent particles from escaping (in a thought experiment), ther-
modynamic equilibrium is possible, but may be unstable (because of the
negative specific heat). There is certainly no equilibrium in the sense of
solutions of Eqs.(1.1) in which all particles are at rest. Even the con-
∗
We exclude N = 2, and certain kinds of stable larger systems, e.g. hierarchical
triples (Chapter 25)
Towards an understanding of the million-body problem 23
Problems
1) How would Newton have solved the problem of the collapse of the
system of fixed stars if he had had access to a GRAPE∗ ?
2) In plasma physics the Debye length is defined to be
s
kTe
λD = .
4πe2 Ne
Translate this into the language of stellar dynamics, bearing in mind
that the rms speed of electrons is related to the electron temperature
by
s
3kTe
vTe = .
me
If a stellar system is in dynamic equilibrium then it approximately
obeys the virial relation 2T + W = 0, where T is now the total kinetic
energy and W the total potential energy (Chapter 9). By making
suitable estimates for the density and other parameters of the system,
show that its radius is comparable with the Debye length.
3) Newton’s Theorems on the gravitational force due to a uniform spher-
ical shell imply that the force inside vanishes, and the force outside
is the same as that due to an equal point mass at the centre of the
∗
instead of an Apple
24 2 Theoretical Physics Introduction
shell. Hence show that the force at a point due to an infinite uniform
medium can take any value we please.
4) Using Newton’s Theorems (Problem 3) show that the acceleration of
a point at a distance r from the centre of a uniform sphere of finite
radius a and total mass M is
GM r
r̈ = − 3 .
a
Deduce that the sphere collapses homologously, and that collapse is
complete at time
s
π2 a30
t= ,
8GM
where a0 is the initial value. Compute this for the system displayed
in Fig.1.
3
Computational
Physics Introduction
Length/time-scale problem
25
26 3 Computational Physics Introduction
billion years, giving a discrepancy of time scales of a factor 1014 for normal
stars, and 1020 for neutron stars.
indextime scales
Sophisticated algorithms have been developed over the years to deal
with these problems, using individual time step schemes, local coordinate
patches, and even the introduction of mappings into four dimensions in
order to regularize the 3-D Kepler problem (through a Hopf map to a 4-D
harmonic oscillator, cf. Chapter 15). While these algorithms have been
crucial to make the problem tractable, they are still very time-consuming.
Near-equilibrium problem
In the central regions of a star cluster, the two-body relaxation time scale,
which determines the rate at which energy can be conducted
through the system, can be far shorter than the time scale of evolution for
the system as a whole, by several orders of magnitude. For example, in
globular clusters, density contrasts between the centre and the half-mass
radius can easily be as large as 104 , which implies a similar discrepancy
in relaxation time scales.
As a consequence, thermal equilibrium is maintained to a very high
degree. Since it is precisely the deviation from thermal equilibrium that
drives the evolution of the system (Chapter 2), it is extremely difficult to
cut corners in the calculation of close encounters. If any systematic type
of error would slip in here, even at the level of, say, 10−6 , the result could
easily invalidate the whole calculation. It is for this reason that none of
the recently developed fast methods for approximate force calculations
has been adopted in this area, e.g. tree codes, P3 M codes, etc (Barnes
& Hut 1986, Greengard 1990, Efstathiou et al. 1985). All such methods
gain speed at the expense of relatively large errors in the force computa-
tion. The result is that the N -body simulations of stellar dynamics can
boast far fewer particles than in, say cosmological simulations. This is a
pity, because N is often used as a crude “figure of merit” in the art of
simulation, whereas what really matters is the value of the science that
comes out.
Computational requirements 27
Computational requirements
The cpu cost of a direct N -body calculation scales ∝ N 3 , where the inter-
particle forces contribute two powers in N (Problem 1) and the increased
time scale for heat conduction contributes the third factor of N . For
this reason the progress of N -body simulations of star clusters has been
painfully slow, from the earliest published work of S. von Hoerner in 1960
(Fig.1).
Almost none of this progress has been made by large general-purpose
supercomputers. The use of parallel computers, such as Cray-T3E, has
had less impact on this area than on many others, because of the com-
munication bottleneck. The force on each particle necessarily depends
on the position of every other, and therefore it is not usually efficient to
parallelise the force calculations (which are the main bottleneck in serial
codes). Another way of exploiting parallelism is to advance many parti-
cles simultaneously (Spurzem & Baumgardt 2001). While this works well
in simple cases, the enormous range of time scales can ruin the efficiency
of this approach also: individual time steps were introduced precisely so
100000 ?Makino
Makino
1000 Terlevich
Aarseth
100
von Hoerner
10
1960 1965 1970 1975 1980 1985 1990 1995 2000
Date
Fig. 1. The slow progress of N -body simulations of star clusters. Models com-
puted well into the late evolution are plotted against publication date. For a
human perspective, see Aarseth (1999) and von Hoerner (2001).
28 3 Computational Physics Introduction
that it should not be necessary to advance all particles with the tiny time
step required for one close binary!
Currently, with routine calculations, it is only feasible to model the
evolution of a globular cluster containing a few thousand stars, since this
requires some 1015 floating point calculations, equivalent to 10 Gflops-
day, or several months to a year on a typical workstation. Therefore, a
calculation with half a million stars, resembling a typical globular star
cluster, will require ∼ 10 Pflops-day.
In contrast, the memory requirements are and will remain very modest.
All that is needed is to keep track of N = 5 × 105 particles, each with
a mass, position, velocity, and a few higher derivatives for higher-order
integration algorithms. Adding a few extra diagnostics per particle still
will keep the total number of words per particle to about 25 or so. With
200 bytes per particle, the total memory requirement will be a mere 100
Mbytes.
Output requirements will not be severe either. A snapshot of the po-
sitions and velocities of all particles will only take 10 Mbytes. With, say,
105 snapshot outputs for a run, the total run worth 10 Pflops-day will
result in an output of only 1 Tbyte.
Special-purpose Hardware
While general-purpose supercomputers have not yet made much impact
in this field, from time to time it has attracted attention as a possible
application of special-purpose hardware (Fukushige et al. 1999). The
earliest idea along these lines was put into practice by Holmberg as long
ago as 1941 (Holmberg 1941; see also Tremaine 1981). He arranged a
set of light bulbs like the stars in a stellar system, and used photometers
to determine the illumination at each. Since light also obeys an inverse
square law, this provided an analogue estimate of the gravitational field.
The next step in astronomy took place not in stellar dynamics but
in celestial mechanics, with the building and development of the Digital
Orrery (Applegate et al. 1985). For several years it performed ground-
breaking calculations on the stability of the solar system, including the
discovery of chaos in the motion of Pluto (Sussman & Wisdom 1988), until
eventually being regretfully laid to rest in the Smithsonian Museum.
A significant step toward the modeling of globular star clusters was
made in 1995 with the completion of a special-purpose piece of hard-
ware, the GRAPE-4, by an ingenious group of astrophysicists at Tokyo
University (Makino & Taiji 1998). GRAPE, short for GRAvity PipE,
is the name of a family of pipeline processors that contain chips spe-
cially designed to calculate the Newtonian gravitational force between
particles. A GRAPE processor operates in cooperation with a general-
Special-purpose Hardware 29
Fig. 2. The Grape-6 at the University of Tokyo, with J. Makino (right). With
permission.
30 3 Computational Physics Introduction
the GRAPE does not allow the most efficient scalar algorithm to be
implemented.) In addition, cut-down versions of this computer can be
Special-purpose Hardware 31
Box 1 (continued)
do 119 i=1,n,8
ii = 8
if(i+ii .gt. n+1) ii = n- i + 1
call g3frc(x(1,i),awork,pwork,ii)
do 1198 j = 1,ii
do 1197 k = 1,3
f(k,i+j-1)=awork(k,j)
1197 continue
pot(i+j-1) = pwork(j) + mass(i+j-1)*epsinv
1198 continue
119 continue
The particle data are loaded beforehand with similar instructions.
and have been used for simulations in a wide range of other problems
in astrophysics (Hut & Makino 1999) and, indeed, other fields of science,
such as plasma physics, molecular dynamics, the study of turbulence, and
even protein folding.
There are several reasons for GRAPE’s success. In the first place it was
developed quickly, always keeping ahead of general-purpose computers.
Secondly, the mathematical model of inverse square laws is quite fixed,
and can be “hard-wired”. Thirdly, the GRAPE group ensured that the
devices could be made available to potential users throughout the world,
and this maximised the scientific returns.
The introduction of special-purpose hardware has been a truly revolu-
tionary advance, and not just in speed. Before GRAPE and its prede-
cessor, the Digital Orrery, the hardware used by theorists was bought off
the shelf from a computer dealer. By contrast observers have always built
their own hardware (or have had it built to their own specification), even
back to the time of Galileo. From this perspective, GRAPE represents
a remarkable culture shift in the way theorists can do science. In retro-
spect it is not surprising that it is in the area of dynamical astronomy
that this has happened, as it is here that the governing equations and the
underlying physical model are most stable. And we are not yet at the
end of the road: GRAPE 5 is already at work (Kawai et al. 2000) and, as
we write, GRAPE 6 is coming on stream (Fig.2). It is about 100 times
faster than GRAPE 4.
32 3 Computational Physics Introduction
Software Environments
Generating data is only half the job in any simulation. The other half
of the work of a computational theorist parallels that of an observer, and
lies in the job of data reduction. As in the observational case, here too
a good set of tools is essential. And not only that: unless the tools can
be used in a flexible and coherent software environment, their usefulness
will still be limited.
Three requirements are central in handling the data flow from a full-
scale star cluster simulation: modularity, flexibility, and compatibility.
For example, to set up a major simulation, it is very useful to have a set
of model building tools that are sufficiently modular, so that they can be
combined in many different ways. If the data representation is flexible
enough, it will be possible to add new physical variables whose use may
not have been foreseen at the time that the software package was first
developed. And in order for those new variables not to interfere with
existing programs, compatibility is a vital issue as well.
The two main specially constructed environments in use are called
Nemo and starlab (Box 2). Both consist of large collections of software
and tools satisfying the above requirements. In addition a great deal of
N -body work is carried out in the unix-type environments used univer-
sally by computational scientists. The principal codes used in this way
are the suite of N -body programmes written by S.J. Aarseth (1985). An
extremely simplified N -body code is provided in Appendix A.
Problems
1) Use either N -body code in Appendix A to investigate how the CPU
time depends on the number of particles. Try to explain the depen-
dence you find.
2) Code the N -body equations (Eq.(1.1)) using a Runge-Kutta solver,
either one specially prepared for the purpose, or one drawn from any
available numerical library, such as Press et al. (1992). Try to ensure
that the accuracy (judged, for example, by energy conservation) is
comparable with that in Appendix A. Compare the timing with that
of Problem 1, and explain the difference.
3) Code the N -body equations in a symbolic computation package, such
as Maple or Mathematica. Compare the timing with that in Problem
2, again with comparable accuracy, and explain the difference.
Software Environments 33
Box 2. Starlab
Box 2 (continued)
To some extent, Starlab is modeled on NEMO, a stellar dynamics soft-
ware environment developed during the 1980s at the Institute for Ad-
vanced Study, Princeton, in large part by Josh Barnes, with input from
Peter Teuben and Piet Hut (and subsequently maintained and extended
by Peter Teuben). Starlab differs from NEMO mainly in its use of UNIX
pipes, rather than temporary files, its use of tree structures rather than ar-
rays to represent N -body systems, and its guarantee of data conservation–
data which are not understood by a given module are simply passed on
to the next rather than filtered out and lost. The original version of
Starlab was written by Piet Hut in 1989, while on sabbatical at Tokyo
University. From 1993 onwards, Steve McMillan has extended starlab,
with help from Piet Hut, Jun Makino, Simon Portegies Zwart and Peter
Teuben. Visualisation tools, such as partiview, have been added by Stu-
art Levy. Nemo, starlab and partiview are all available at the web site
<http://www.manybody.org>.
Recently, the concept of a ‘virtual observatory’ has been the topic of
several workshops and conferences. The idea is to connect the major
observational archives, from radio to optical to X-ray observations, to
make available a ‘digital sky’ online. Archives of large-scale simulations,
such as those provided by Starlab, will be connected with those virtual
observatories as well, facilitating comparisons between observations and
simulations (Teuben et al. 2001).
4
Mathematical Introduction
X
j=N
ri − rj
r̈i = −G mj , (1)
j=1,j6=i
|ri − rj |3
35
36 4 Mathematical Introduction
variable t (Box 2). Why should one do this? One answer is that it is
an important setting in which to discuss the analytical properties of the
solutions, not only because of its pure-mathematical significance, but for
“practical” reasons also. For example, the numerical treatment of the
equations of motion requires special care in the vicinity of singularities
(cf. Chapter 22), and in the N -body problem these are usually to be
found in the complex t-plane. Another application (Chapter 21) is in
certain problems of three-body scattering, when a third body temporar-
ily approaches a short-period binary star; it turns out that the change
in its energy is determined by the point in the complex plane where the
intruder collides with the binary.
We have treated the two-body singularities as though they are to be
avoided at all costs. In fact they are quite innocuous. For example, sup-
pose a collision occurs in a two-body problem when t = 0, and we try to
study this problem by numerical integration of the equations of motion,
starting at some negative value of t during the approach to the collision.
As t approaches 0 from below, it will be found that the numerical inte-
gration requires ever smaller time steps as we approach the singularity
at t = 0, and there is nothing that can be done to get past it. Our
local theorems about the existence of solutions also take us no further.
When we examine the same problem analytically, however, it turns out
that the coordinates of the two bodies are expressible, in the run-up to
the collision, as power series in the variable t2/3 (Box 2). Since this se-
ries contains non-integer powers of t we recover the fact that x is not an
analytic function at t = 0. But we also observe that the series can be
expressed in powers of (t2 )1/3 , which may be equally well evaluated when
38 4 Mathematical Introduction
body collisions. We may say that the collision singularities have been
regularised.
There is an artificial but mathematically important class of N -body
problems in which these techniques are essential. Two-body collisions
occur naturally if one studies the collinear N -body problem, but when
these collisions have been neutralised, other, higher-order singularities
come into view. The most obvious of these is the triple-collision singular-
ity, in which the coordinates of three bodies tend to coincidence as some
value of t is approached. One way of studying this problem is touched
on in Chapter 21, and it shows that these singularities cannot usually be
handled by the technique of analytic continuation which works so well for
two-body collisions. The problem is that the exponents which occur in
the corresponding power series are not usually simple rational numbers,
such as the power of 2/3 which occurs in the case of two-body encoun- √
ters. Instead, the exponents may involve irrational numbers such as 13
(Chapter 21), even in the simplest case of equal masses, and a series in-
volving such powers of t cannot be evaluated (with a real result) for both
positive and negative values of t. Nevertheless the study of these singu-
larities has progressed to remarkable lengths, and these investigations are
not without their consequences for the astrophysical applications of the
N -body problem.
The collinear N -body problem exhibits also other classes of singulari-
ties, in which no more than two bodies collide at one time, but the colli-
sions occur more and more frequently as a certain time approaches (see
Marchal 1990). In order to give an impression of how this can happen, we
have to anticipate some results of Chapter 21. Consider first the notion
of a binary. In the collinear problem this is a pair of stars exhibiting
a relative motion like that in Eq.(4) of Box 2: they periodically bounce
off each other (as we assume that the relevant two-body encounters have
been regularised). Just after one such bounce the right-hand component
moves to the right at high speed. Now suppose a third body of low mass
approaches from the right and collides with the right-hand component
(Fig.2). After this collision the third body recedes at high speed, its
energy having been gained at the expense of the binary, which becomes
“tighter”. Suppose finally there is a fourth body, to the right of the third
and moving off to the right. The third body, moving very fast, catches
up with it and, being of relatively low mass, bounces back towards the
binary. With sufficient care its next encounter can be arranged to occur,
once again, just after a collision between the binary components. (In
Chapter 20 we shall see in a little more detail how careful choice of initial
conditions can lead, in an analogous problem, to an orbit with desired
properties.) Now we have the design of a powerful four-body machine
40 4 Mathematical Introduction
time
1 2 3 4
Examples like Xia’s are highly contrived and rare. In the N -body
problem there will usually be no singularities on the real time-axis. Even
if we regularise two-body collisions, however, there will usually be plenty
of singularities in the rest of the complex plane. These prevent us from
being able to express the solution of the N -body problem as power series
in t (or the appropriate regularising variable) which converge for all times.
However there is an amazingly simple transformation of the independent
variable (Box 3) which does allow us (in principle) to write the solution
as a series which converges for all times; it is not a power series in t,
however. Unfortunately the solution in this form has never been put to
practical use.
Since a solution of the 3-body problem may be expressed as a conver-
gent series, it is surprising to recall that this problem is often quoted as
one of the famous unsolved problems of applied mathematics. Clearly,
the issue hinges on what is meant by a “solution”. Though the series ex-
pression is a solution of sorts, it would be very hard to extract from this
series any information about the qualitative behaviour of the motion of
the N bodies (or even any quantitative results). Nor is it very useful for
numerical calculations as the rate of convergence is even more painfully
slow than an N -body simulation. One usually expects much more from
a satisfactory solution of a dynamical problem.
The best known class of soluble problems in dynamics are the so-called
“integrable” problems. We shall take this to mean a problem in which
sufficiently many constants of the motion can be found so that the solu-
tion may be written down in terms of “quadratures”, i.e. an integral of
a function of a single variable. How one finds these integrals is another
matter, and usually boils down to identifying a symmetry of the problem
at hand. For example the motion of a particle in any spherical poten-
tial may be integrated using the fact that the angular momentum and
energy are constant, and the existence of these integrals results from the
invariance of the potential under rotations and time-translation.
The question now arises whether the N -body problem is of this type.
For N = 2 the answer is affirmative, and indeed it may be reduced to
the problem of motion in a spherical potential (cf. Chapter 7). In fact in
this case the quadratures can be carried out analytically. Even for N =
3, however, insufficiently many integrals are known, and the search for
other integrals has even led to theorems proving their non-existence under
certain conditions (Whittaker 1927, Moser 1973). Chapter 20 describes a
particular kind of three-body problem where this question can be settled
rather directly. The existence of integrals was one of the questions which
motivated Poincaré to study the three-body problem and, in the process,
42 4 Mathematical Introduction
For some solution of the 3-body problem let us suppose that the sin-
gularity which is closest to the real t-axis is at a distance h > 0 (i.e. its
imaginary part is ±ih). Then the solution of the N -body equations is
analytic (without singularities) throughout a strip of width 2h (Fig.1).
Now the complex transformation
2h 1+τ
t= log
π 1−τ
maps the unit disk τ ≤ 1 into the strip just mentioned. For example, when
t is real the transformation is τ = tanh(πt/(4h)), and this varies from
τ = −1 to τ = 1 as t increases from −∞ to ∞. Since our solution of the
N -body problem is analytic in the stated strip in the t plane, it follows
that, if τ is used as independent variable, the solution of the N -body
problem has no singularities in the unit disk. Therefore, by elementary
theorems in the theory of analytic functions, the solution can be expanded
as a power series in τ for all τ in the unit disk. In principle, therefore,
a solution of the N -body problem may be represented as a convergent
power series for all time. See Saari (1990), Barrow-Green (1996).
Imt
singularity
h
Re t
-h
Fig. 1. A solution of the 3-body problem is analytic in a strip |Im t| < h, which
maps to the unit disc in the complex τ plane.
6N . With all such tricks even the three-body problem can be reduced
only to order 7 (though one more order can be removed by transformation
of the independent variable). Another way of expressing this is to say
that a three-body system moves on a 7-dimensional subspace of phase
space. By studying the topology of this subspace one can sometimes
reach rigorous conclusions on the stability of three-body systems (the
“c2 h” criterion, cf. Marchal 1990). But this topological problem also
has a pure-mathematical life of its own which recently led to a complete
census of all possible types of topology in this context (cf. Diacu 2000).
The topics raised in this chapter, and even the title of Poincaré’s book,
illustrate the remarkably close links that have existed between mathe-
matics and dynamical astronomy ever since the time of Newton. It is
significant, however, that most of this has occurred within the subject
that is even now termed “celestial mechanics”. Loosely speaking, this is
the mathematical study of few-body problems, usually with one domi-
nant mass, such as those found within the solar system. We think that
there is equally fertile ground for such cross-fertilisation between mathe-
matics and the N -body problem of stellar dynamics, which is the subject
of this book. That this is less well-developed than in the area of celes-
tial mechanics can be traced to the fact that astronomy as a whole has
become largely a part of physics. And yet it will be found from certain
sections of this book that tools which have been developed by mathemati-
cians for their own inscrutable reasons have turned out to have important
applications in stellar dynamics, often several decades afterwards.
Problems
1) Show that the set of initial conditions of the N -body problem which
lead to a collision between a specific pair of particles has codimension
2 (i.e. 2 less than the dimension of the set of all initial conditions).
2) Verify that x = −q(1 − σ2 ), y = 2qσ is a solution of Box 2, Eq.(1),
provided that q 6= 0 and
1 t
σ + σ3 = p 3 .
3 2q
Write down the appropriate collision solution, corresponding to q = 0,
and show that the orbit varies smoothly with q as one passes through
the collision orbit. Determine the geometric nature of the orbits. By
treating the x,y plane as a√complex z- plane and applying the Levi
Civita transformation ζ = z, determine the geometric nature of the
transformed orbits, and verify that they vary smoothly through the
collision orbit.
44 4 Mathematical Introduction
Even though this is a book about dense stellar systems (i.e. what is
often called “collisional” stellar dynamics, though no physical collisions
need take place), it rests on a foundation of “collisionless” stellar dy-
namics, and the relevant aspects are surveyed in these five chapters. In
addition we outline the various ways in which the effects of gravitational
encounters can be incorporated, though the details are deferred to later
sections of the book.
Chapter 5 begins with a discussion of the main aspects of the thermo-
dynamic behaviour of N -body systems: how a stellar system responds
to being put in contact with a “heat bath”, for instance. In fact stellar
systems tend to cool down if heat is added; paradoxical though this might
seem, it helps us to understand even the motion of an earth satellite. A
toy model helps to explain what is happening.
Chapter 6 introduces the basic tools used for describing large numbers
of gravitating particles: phase space, the distribution function f , the
gravitational potential, and the equation governing the evolution of f (the
“collisionless Boltzmann equation”). We outline some of its solutions, and
aspects of the manner in which they evolve, especially phase mixing. We
also look at the development of Jeans’ instability.
For our purpose the most important distribution functions are those
exhibiting spherical spatial symmetry. Therefore Chapter 7 is devoted
to the motion of stars in spherical potentials, including constants of the
motion and their link with symmetry. An important example which we
shall exploit later in other contexts is the Kepler problem. We outline
the theory for more general potentials as well. Finally we explore briefly
the nature of nearly radial motions (touching on their importance for
the stability of stellar systems) and those in time-dependent (but still
spherical) potentials.
45
46 PART II. THE CONTINUUM LIMIT: N → ∞
47
48 5 Paradoxical Thermodynamics
Earth’s surface
To Earth’s Centre
potential energy is lost, and therefore the kinetic part of the energy is
actually increased. It would be as if stepping on the brake in a car would
cause the car to speed up, and pushing the accelerator would slow the
car down (in a gravitational car, those pedals presumably would be called
the push and the decelerator, respectively).
Note the symmetry between both operations. Pushing the brake and
seeing the car speed up nonetheless could be accomplished when going
down a steep mountain road, with a brake that is not powerful enough.
In that case, however, pushing the accelerator would have a qualitatively
similar effect, of increasing the velocity (by an even larger amount), and
doing nothing would also lead to a rising speed. In the case of a satellite,
even a slight acceleration would lead to a higher orbit with a consequently
lower speed.
This property of orbits is also of importance in understanding the role
of resonances in the maintenance of spiral patterns in galaxies. In the
words of Lynden-Bell and Kalnajs (1972), “stars act like donkeys slowing
down when pulled forwards and speeding up when held back.” (Transport
analogies seem to be favoured in discussions of this behaviour.)
This paradoxical behaviour does not always occur for centripetal forces
like gravity. Consider, for instance, the artificial case of two-dimensional
gravity. This corresponds to the attraction of infinite parallel rods, which
varies as r−1 , like the force between two line charges in electrostatics.
A system of gravitating rods does not exhibit a negative specific heat
(Padmanabhan 1990, and Problem 2).
Never at Rest
Fig. 1. Energies of escapers in isolated N -body systems (after Giersz & Heggie
1994). In the first half of the evolution there are no binaries; escapers are caused
by two-body encounters (Chapter 16) and have low energy. The escape rate
increases as the core of the system collapses (Chapter 18). Eventually the core is
so dense that binaries form. In their evolution they eject stars with much higher
energies (Chapter 23). If the system were enclosed in a box, as in the text, they
would be perceived through the increased temperature of the walls. Energy is
measured in units of 2/3 of the mean kinetic energy per particle. Results from
many simulations are collected in the one figure.
A Toy Model
1.8
1.6
<T>
1.4
1.2
Fig. 2. Mean kinetic energy hT i against energy E for the toy model discussed
in the text. Both energies are scaled by GM/R.
∗
See Problem 6.3. The calculations are simpler in a related model discussed by
Padmanabhan (1990), but he takes a more advanced viewpoint thermodynamically.
Our model was suggested by a reading of this paper, which contains several other
highly illuminating toy models and discussions of the thermodynamic behaviour of
self-gravitating systems.
52 5 Paradoxical Thermodynamics
only a little bigger than at the boundary, i.e. hT i >∼ E + GM/R. Using
hT i as a surrogate for temperature, we see that, above some minimum
temperature, a system of a given temperature can be in either a high
energy or a low energy state (Fig.2). Furthermore the high energy state
has positive heat capacity dE/dhT i, and the low energy state has negative
heat capacity.
Now suppose we have many slightly interacting particles inside the
enclosure, one at low energy and high temperature (i.e. high up on the
left branch of the curve in Fig.2), and N others at high energy and low
temperature (i.e. lower down on the right branch). Because the slopes of
the two branches are comparable (though of opposite sign), it is easy to
see that a flow of heat from the hot particle to the remainder heats all the
particles, but the inner body is heated more, roughly in the ratio of N :1.
Thus the temperature difference is exacerbated, and the energy of the
single body decreases still further. The outer bodies strike the wall with
increasing energy, which is registered as an increase in the temperature
which the system presents to the outside world.
This is a toy model not just of paradoxical thermodynamic behaviour,
but of the million-body problem itself. If you turn Fig.2 upside down
it resembles part of Fig.17.2, which is concerned with the stability of
self-gravitating systems.
Problems
1) You are the pilot of a spacecraft about to dock with a satellite which
moves ahead of you on the same circular orbit. In which direction
should the thrusters of your spacecraft be fired in order to catch up
with your objective: forward or backward?
2rF
2) Using the theory in Box 1, show that Ṫ = vF and ṙ = − . If
mv
−n
the attractive force of the Earth were to vary as r , show that the
2rF n−1
results would become ṙ = − and Ṫ = vF . Deduce
mv(3 − n) 3−n
that paradoxical behaviour only occurs if 1 < n < 3.
(Harder) What happens if (a) n = 3, (b) n > 3, (c) n = 1, (d)
n < 1?
3) A particle of unit mass moves in the gravitational field of a fixed mass
M according to the equation of motion
GM r
r̈ = − .
r3
If the energy is E = v2 /2 − GM/r, show that r < R provided that
d2
E < −GM/R. Show that 2 r2 = 2T + 2E, where T is the kinetic
dt
A Toy Model 53
54
6 Statistical Mechanics 55
where
X
3
∂
∇r .(f v) = (f vi ),
r
i=1 i
-0.5
-1
-1.5
-2
φ
-2.5
-3
-3.5
-4
-3 -2 -1 0 1 2 3
x
Fig. 1. The potential well of a Plummer model (Chapter 8) for various numbers
of particles N . The potential φ is plotted for points along the x-axis, for N =
1 (the deepest), 4, 16, 64, 256 and 1024, and for the exact Plummer model
(corresponding to infinite N ; this is the heavy dashed curve). The curve for each
successive N was obtained by adding the appropriate number of particles to the
previous system, i.e. the various systems are not independent realisations of a
Plummer model. This figure illustrates the fact that the potential in a phase-
space description (the heavy dashed curve) may differ substantially from that in
an N -body description.
6 Statistical Mechanics 57
and simply write down the corresponding equation in the new coordinates
(cf. Problem 1).
It is worth remarking that Eq.(6) is equivalent to Eq.(3), and not just
deducible from it: if we seek a solution of Eq.(6) in the form δ(r−r(t), v−
ṙ(t)) then it turns out that r(t) must obey Eq.(3). The link between the
two is also evident from a mathematical point of view. The standard
method of solving a partial differential equation like Eq.(6) is by the
method of characteristics (e.g. Garabedian 1986, Chapter 2), and it turns
out that the characteristics of Eq.(6) are just the solutions of Eq.(3).
The analogy with ordinary fluids remains helpful in the interpretation
of Eq.(5). If a fluid is incompressible, then its “convective derivative”
vanishes; the convective derivative is simply the derivative following the
motion of the fluid, and so ∂ρ/∂t + v.∇ρ = 0. The resemblance to
Eq.(5) shows that, in the phase space description of a stellar system, f
is constant if we follow the motion of a star in phase space; in other
words, phase “fluid” is incompressible. This property often allows us to
visualise a solution of the collisionless Boltzmann equation that would be
quite difficult to write down (Fig.2).
∗
Even here, however, the phrase “Vlasov equation” commonly refers to the coupled
system consisting of the collisionless Boltzmann equation and Maxwell’s equations.
Spelling also differs from one source to another; if you need to look it up in an index,
try also Wlasow. See also Hénon (1982).
58 . 6 Statistical Mechanics
r
-2
-2
-1
-1
0 0
r
1
1
2
often called Jeans’ Theorem, after one of the founding fathers of stellar
dynamics. But Jeans was a fine applied mathematician, and in his own
writings (Jeans 1929) he attributed the result to Lagrange.
It is easy to be confused by the expression f (E, J), because this is not
the distribution function of E and J, but the distribution in phase space
expressed as a function of these two variables. To relate the two concepts,
let us define N (E, J)dEdJ as the number Rof stars with E and J in small
intervals dE, dJ , respectively. Then N = f (E, J)δ(v2 /2 + φ − E)δ(|r ×
v| − J)d3vd3 r. This is easily evaluated by first integrating with respect
to v, using r as polar axis. The result is that
where P (E, J) is the radial period of the orbit with parameters E and J
(cf. Chapter 7). Similarly, if f depends on E alone, then the number of
stars per unit energy is
Z rmax
2
N (E) = 16π f (E) r2 (2(E − φ))1/2 dr, (9)
0
where φ(rmax ) = E.
What determines f ? Beyond Jeans’ theorem, Eq.(6) is little guide,
and information on this question comes from other considerations, most
of which lie outside this chapter. First, the interpretation of f shows it
must be non-negative. Another consideration, which is not as irrelevant
as it might seem at first sight, is mathematical convenience. Much ef-
fort has gone into the construction of more-or-less simple distributions
which can be used as models for the interpretation of observations or
the construction of initial conditions for N -body models, and so on (cf.
Chapter 8). Of deeper significance are distribution functions f with phys-
ical meaning. Since Eq.(6) shows that f evolves in time, there is at least
a possibility that concepts of statistical mechanics might be applicable.
If so, it might be possible to suggest particular forms (e.g. entropy max-
ima) towards which f would evolve after sufficient time (cf. Chapter
10). Finally, there is the dynamics which is not embodied in Eq.(6),
i.e. the interactions between neighbouring particles. In a simple model
of an ordinary gas this leads to relaxation to a Maxwellian distribution
of velocities. In stellar dynamics the corresponding mechanism is called
“two-body relaxation” (Chapter 14). It is also known that three- and
four body interactions influence the distribution (Chapter 19, 26), but
these involve “internal degrees of freedom” which are not described by
our single-particle distribution function f .
60 6 Statistical Mechanics
Jeans’ Instability
One of the standard applications of the formalism introduced in this
chapter is the stellar-dynamic analogue of a problem in gas dynamics
studied by Jeans. The calculation is accessible in Binney and Tremaine
(1987), and we merely summarise the physical interpretation and touch
on a historical curiosity. This is a real digression: we have little further
use for the calculation in the remainder of the book.
The problem begins with the unlikely concept of a static, spatially
infinite and uniform medium, described by a potential φ0 , density ρ0 and
distribution function f0 , all of which are independent of position and
time. Next we suppose that these functions are subject to small (t- and
r-dependent) perturbations, denoted by subscript 1, and we proceed to
linearise Eqs.(2) and (5). We find that the terms of zeroth and first order
give the equations
∇2 φ0 = 4πGρ0 (10)
∇2 φ1 = 4πGρ1 (11)
−∇r φ0 .∇v f0 = 0, (12)
∂f1
+ v.∇r f1 − ∇r φ1 .∇v f0 = ∇r φ0 .∇v f1 . (13)
∂t
Unfortunately there is no potential φ0 satisfying both Eqs.(10) and
(12). This should not surprise us: as shown in Problem 2.3, the gravita-
tional force in an infinite uniform system can take any value we please.
We now have two choices. One choice is to revise the assumption of
a uniform, infinite medium in dynamic equilibrium. The second is to
invoke the ‘Jeans swindle’. Let us consider these in turn.
The initial assumptions may be modified in various ways which allow
progress. What we need is an unperturbed model whose potential does
not accelerate (or, more fundamentally, change the momentum of) a per-
turbed particle. (This is the interpretation of the troublesome term on
the right of Eq.(13).) An example is collapse of a “cold” uniform sphere
(Fig.2.1). If we use coordinates which collapse along with the sphere,
all particles are at rest in the absence of perturbation. There is a close
analogy here with the theory of gravitational instability in the expand-
ing universe, where the basic mechanism is well understood (cf. Peebles
1993). The stellar-dynamical problem is quite involved (Aarseth et al.
1988 and Problem 4)∗ , but again the system is unstable, which accounts
for the clumpiness visible in frames 6–8 in the cited figure. Another uni-
form model which does not accelerate a perturbed particle is described
∗
Even its numerical simulation has some subtleties (Theis & Spurzem 1999)
6 Statistical Mechanics 61
Problems
∂f ∂f ∂f
∂f /∂t + ṙ. + v̇r + v̇t = 0.
∂r ∂vr ∂vt
There are at least two reasons for studying this classical problem in a
modern book on the gravitational N -body problem. First, it approxi-
mately describes the motion of individual stars in a nearly spherical sys-
tem. Second, it describes the relative motion of two stars which happen
to come close together. These applications are complementary. In the
first we approximate the potential due to the other stars in the system
by a smooth spherically symmetric field (Fig.6.1). Occasionally, however,
two stars come close together, and then the field of the rest of the cluster
can be neglected temporarily. In this case it is the relative motion of the
two stars that is described by a central force equation:
G(m1 + m2 )
r̈ = − r, (1)
r3
where their masses are m1 and m2 , and r is their relative position vector.
The individual motion of each star can then be constructed if the motion
of their barycentre is known.
In a central force problem motion is described by an equation of the
form
r̈ = −∇φ(r),
where r = |r|. Thus choosing φ = −G(m1 + m2 )/r yields Eq.(1). We
define the specific angular momentum, i.e. angular momentum per unit
mass, by J = r × ṙ. It is immediately seen by differentiation that this
is conserved during motion in a central field, even if the field is time-
dependent. (Physically, a central force exerts no torque.) In turn this
implies that motion is on a fixed plane passing through the centre of the
force field, since J.r = 0. The other important invariant (for a static
potential) is the specific energy, defined by
1
E = ṙ2 + φ(r).
2
63
64 7 Motion in a Central Potential
Orbits which remain outside most of the mass will, on the other hand,
resemble a conic section, by Kepler’s First Law. If this is an ellipse of
semi-major axis a, it follows from the theory of Keplerian motion (Box
2) that the period is approximately
q
P ' 4π 2 a3 /(GM ). (4)
φ
eff
a r
φ= E
Fig. 1. Radial motion in a spherical potential. See the text for an explanation
of the symbols. The case φef f < E < 0 is illustrated.
stellar systems are liable (see, for example, Antonov 1973, Fridman &
Polyachenko 1984, Palmer 1994).
Let us consider orbits of a given energy E and small angular momen-
tum. For J = 0 the orbit is purely radial, and successive maxima in r
are separated (in θ) by an angle exactly equal to π. If J pis small the star
passes close to the origin, and its speed then is nearly 2(E − φ0 ). By
angular momentum conservation it follows that the distance of closest
approach to the origin is nearly proportional to J. It is also easy to see
that the transverse deviation from a purely radial orbit is proportional
to J, and therefore successive maxima in r are separated (in θ) by an-
gles which differ from π by a small angle proportional to J (Fig.2b and
Problem 2).
It is this tiny angle by which nearly radial orbits rotate which allows
them to cooperate with a non-radial perturbation in the potential to give
rise to the “radial orbit instability”, especially in systems where such
orbits are heavily populated. In such a system the pressure tensor is
very anisotropic, and the transverse velocity dispersion is relatively low.
In this sense such systems have a low “transverse temperature”. We al-
ready remarked (Chapter 6) that cold stellar systems tend to clump, by a
process analogous to the Jeans instability, and in much the same way sys-
tems with a low transverse temperature tend to clump into a bar-shaped
configuration (cf. Barnes 1985). A proper understanding of the mecha-
nism by which this happens involves a little more orbit theory, however,
and is deferred to a later remark in this chapter about resonances, and
Box 3.
Because an analysis of general orbits, i.e. those which are neither
nearly radial nor nearly circular, can be reduced to motion in radius (cf.
Fig.1), the results can be expressed as quadratures. For example, the
time between successive maxima in r is
Z rmax Z q
2 dr/ṙ = 2 dr/ 2(E − φ(r)) − J 2 /r2 , (8)
rmin
by Eq.(6). Only a few cases are known where the appropriate integrals can
be expressed in terms of elementary functions. Aside from the familiar
Keplerian and simple harmonic cases, one potential with this property is
GM
the “isochrone” potential φ = − √ , which was so named by
b + b2 + r2
Hénon because it has the property that all orbits of the same energy have
the same radial period (Problem 5).
Let us denote the frequency of radial motion by ωr , and similarly use
ωθ to denote the mean angular velocity θ̇. Thus the angle through which
the star rotates between two successive maxima in r is 2πωθ /ωr . For
7 Motion in a Central Potential 69
y(t) y(t)
1
2
0.5
1
-2 -1 0 1 2 -1 -0.5 0 0.5 1
x(t) x(t)
-1
-0.5
-2
-1
√
Fig. 2. Two orbits in the scaled Plummer potential φ = −1/ 1 + r2 . They start
at r = 1 on the x-axis, moving transversely upwards with speeds (a) 0.9 and (b)
0.1. Note that the angular separation of successive apocentres, when measured in
the direction of motion, lie between the values for the simple harmonic oscillator
(π) and Kepler motion (2π); see Problem 4.
nearly radial orbits this is nearly π, whence ωr ' 2ωθ for such orbits.
Therefore such orbits exhibit a resonance, which has an important role
in the “radial orbit instability”, as already mentioned. See Box 3.
We also mentioned that stars moving close to the centre ofpthe cluster
exhibit nearly simple harmonic motion with a certain period 3π/(Gρ0 ),
and it might have been thought that such orbits would also resonate.
However the number of such stars is small. Consider circular p orbits, for
example. The period of a circular orbit of radius r is 2π r/φ0 . Now φ
can be expressed as a power series in even powers of r (cf. Eq.(2)), and
so, therefore, can the period. Therefore the period varies approximately
linearly with the energy for low-energy orbits. Now in the most usual
models of star clusters, the volume of phase space available to stars of
energy less than E varies as (E − φ0 )3∗ , and so it follows that the number
of stars of a given period
p increases rapidly as the period increases above
the minimum value 3π/(Gρ0 ). Thus there is no significant population
of resonant stars near this frequency.
These remarks are relevant to the radial orbit instability, but there
are other instabilities which can afflict spherical stellar systems. For
example, though it is easy to build a spherical system with an almost
∗
Consider, for example, a three-dimensional simple harmonic oscillator. The energy
is E = (ṙ2 + ω 2 r2 )/2, and the right side is the square of the radius of an ellipsoid
in 6-dimensional phase space.
70 7 Motion in a Central Potential
Box 3 (continued)
Now let f (J, ψ) be the distribution of these two variables. Since ψ and
J are conjugate, f approximately obeys a Wlasow equation
∂f ∂f ∂f
+ ψ̇ + J˙ =0
∂t ∂ψ ∂J
(cf. Eq.(6.7)). We write f , ψ̇ and J˙ as a sum of a contribution from the
underlying spherical cluster model plus a contribution from the perturba-
tion, e.g. f = f0 (J) + f1 (J, ψ), and neglect all but linear terms involving
the perturbation. Then
∂f1 ∂f1 ∂f0
+ AJ ' B sin(2ψ) . (1)
∂t ∂ψ ∂J
(Notice, by the way, that the analogue of the term which gave us so much
trouble in the study of the Jeans instability (Eq.(6.13)) would have the
∂φ0 ∂f1
form , which vanishes because φ0 is spherically symmetric.)
∂ψ ∂J
The forced solution of Eq.(1) is
B ∂f0
f1 = − cos 2ψ. (2)
2AJ ∂J
If f1 is to be consistent with the assumed perturbation in the potential
then we need an enhancement of orbits along the x-axis (near ψ = 0).
By Eq.(2) we see that this will happen if J∂f0 /∂J < 0. It follows that,
for the radial-orbit instability to proceed, it is necessary that the distri-
bution function should increase as J decreases to zero from either side,
i.e. ∂f0 /∂J is positive (negative) if J is negative (positive). This means
that we expect to see the instability only in systems in which there is a
strong radial anisotropy.
There are some general lessons here for several kinds of instabilities in
stellar systems. In order for an instability to flourish, there must be a
resonance, and there must be a sufficient gradient of f0 across it (see, for
example, Palmer 1994, and the classic Lynden-Bell and Kalnajs 1972).
Problems
1) Explain why Eqs.(3) and (5) have the same coefficient.
2) For motion close to the x-axis in the x,y plane of a spherically sym-
metric potential φ(x, y), show that we have ÿ = y(ẍ/x) exactly, and
∂
ẍ = − φ(x, 0) approximately. By considering orbits with initial con-
∂x
dition x = R, y = 0, ẋ = 0 and ẏ = J/R, where R is a constant, show
that the rate of rotation of the orbit is nearly proportional to J.
7 Motion in a Central Potential 73
74
The Isothermal Model 75
∗
i.e. ∇2 u + exp u = 0
76 8 Some Equilibrium Models
Then the solution is uniquely specified (Fig.1), and so we see that the
family of isothermal models has two free dimensional parameters: the
velocity dispersion σ2 (which is related to j by Eq.(2)), and the cen-
tral density. There is, however, also a limiting case, called the singular
isothermal solution, which has infinite central density (Problem 1). We
missed it by our implicit assumption that ρ0 exists. Finite solutions ap-
proach the singular solution asymptotically at large radius (Problem 2,
and Fig.1).
Though it is important to understand the properties of this model
because of its thermodynamic significance (Chapter 17), its practical ap-
plications are limited. If a model of a stellar system is in dynamic equi-
librium there can be no escaping stars, and so f must vanish above the
energy of escape. Now the isothermal distribution, Eq.(1), never van-
ishes, and this means that there is no escape energy. The reason for this
is that the model turns out to have infinite mass (Problems 1 and 2).
King’s Model
0.1
ρ/ρ0 0.01
0.001
1e–4
1e–5
where E0 is the escape energy. This is King’s model (King 1966)∗, and it
has several attractive features, aside from finite mass and radius. Deep
inside the system E E0 , and so the distribution is nearly Maxwellian
where the system may be expected to be nearly relaxed (since the time
of relaxation is shortest where the density is highest, cf. Chapter 14).
Also, the distribution function Eq.(8) turns out to be a good approxima-
tion to a solution of the Fokker-Planck equation. What is occasionally
forgotten, however, is that this solution is not an equilibrium solution,
but one in which stars escape at a rate governed essentially by the two-
body relaxation time scale. Another pitfall is to imagine that the central
velocity dispersion is still given by Eq.(2), but this is only good in the
limit E0 → ∞.
King’s models are characterised by a third parameter in addition to the
two parameters needed for an isothermal model. This third parameter
is often taken to be the quantity W0 = 2j 2 (E0 − Ec ), where Ec is the
energy of a star at rest at the centre. Fig.2 illustrates the models with
W0 = 3 and 7. It is customary to characterise the size of the central
region by what is called the “core radius”, rc , though different definitions
exist. We adopt the definition given by the physically appealing relation
4πG
ρc rc2 = vc2 , (9)
3
where ρc and vc2 are the central density and mean square velocity, respec-
tively; i.e. vc2 = 3σc2 . For an isothermal model, rc is the radius at which
the space density drops to about one third of its central value, and the
projected (or surface) density falls to about one half of its central value.
This and other characteristic radii are plotted in Fig.3. A commonly
used measure of how centrally concentrated a model is is the quantity c
defined as c = log10 (rt /rc ), although King himself defined c without the
logarithm, and with a different definition of rc .
King’s model has remained a cornerstone of stellar dynamics for over 30
years. It has been extended to include anisotropy (Michie & Bodenheimer
1963), a mass spectrum (e.g. Gunn and Griffin 1979), and an external
gravitational field (Heggie and Ramamani 1995), though these extensions
do not have the same support from the Fokker-Planck equation. In ad-
dition, it quite unclear what is the appropriate theoretical definition of
∗
In another paper (King 1962) he proposed a simple mathematical function which
fits the profile of the surface brightness Σ of globular clusters, and it is sometimes
also referred to as King’s model, unfortunately. The formula is
2
Σ = k [1 + (r/rc )2 ]−1/2 − [1 + (rt /rc )2 ]−1/2 . We propose to call this King’s
Law, by analogy with de Vaucouleurs’ Law.
78 8 Some Equilibrium Models
0.1
0.01
ρ/ρ0 1e–3
1e–4
1e–5
1e–6 1 10
r/r c
10
Radius (N-Body Units)
0.1
0.01
0 2 4 6 8 10 12
W
0
Fig. 3. The tidal radius (solid), half-mass radius (dashed) and the core radius
(dotted) plotted against the scaled central potential W0 for the sequence of King
models, in N -body units (Box 1.1).
the core radius if stellar masses are unequal. Partly for this reason rather
more empirical measures are in common use (Casertano and Hut 1985).
Plummer’s Model 79
On the theoretical side, substantial work has been done on the modes of
oscillation of King models (e.g. Sobouti 1985). In particular, Weinberg
(1994a) has pointed out how slowly some modes are damped.
Plummer’s Model
Our next exhibit is the world’s favourite theoretical model, simply be-
cause its structure can be written down in terms of very simple functions;
sufficiently simple, in fact, that any theorist can memorise some of the ba-
sic data without difficulty (Table 1). It is actually the stellar-dynamical
analogue of an n = 5 polytrope (Plummer 1911∗ ; Box 1). Less well known
is the fact that it is one of another series of analytical models which in-
cludes the isochrone model (Chapter 7; see Dejonghe 1984). Evolutionary
calculations starting with Plummer initial conditions have been carried
out by many people (e.g. Spitzer & Shull 1975).
The free parameters of the model are the total mass M and the “scale
radius” a. Table 1 gives several quantities which we have not introduced
before. These include the projected mass density, Σ(d), which is the
density per unit surface area of the sky at the distance of the object (so
that, in astrophysical units, it would be measured in solar masses per
square parsec.) The line of sight passes at a distance d from the centre
of the object. Analogous to σ2 , σz2 is the mean square component of the
velocity along the line of sight. The total mass is M , the mass within
a sphere of radius r is M (r), and M (d) is the mass within a cylinder of
radius d parallel to the line of sight.
∗
He attributes the solution to Schuster, however.
80 8 Some Equilibrium Models
†
Some people say it was four, but the point is the same
Models of observational data 81
3.27/2 a2
Distribution function f (−E)7/2
7π G M 4 m
3 5
−5/2
3M r2
Mass density ρ 1 +
4πa3 a2
−2
M d2
Projected mass density Σ 1 +
πa2 a2
−3/2
a2
Mass within radius r M (r) M 1+ 2
r
−1
a2
Mass within projected radius d M (d) M 1+ 2
d
−1/2
GM r2
(Specific) Potential φ − 1+ 2
a a
1
One-dimensional velocity dispersion σ2 − φ(r)
6 −1/2
3π GM d2
Projected velocity dispersion σz2 1+ 2
64 a a
2
3π GM
Potential energy W −
32 a
W
Kinetic energy T −
2
W
Total energy E
2
a
Core radius rc √
2
16
Virial radius Rv a
3π
a
Half-mass radius Rh √ , ' 1.305a
2 −1
2/3
0.206N a3/2
Half-mass relaxation time trh √
GM ln Λ
to obtain ρ(r) (Merritt and Tremblay 1994), and then φ(r) comes from
solving Poisson’s equation. This gives ρ(φ), from which it is possible to
construct a distribution function f (E) depending on E alone, i.e. an
isotropic model (Problem 5). In turn this suggests that the anisotropy
is, in a sense, arbitrary, and that a whole sequence of models (differing
in the degree of anisotropy) can be constructed to fit a given observed
distribution Σ. Kinematic information is, therefore, essential to lift this
degeneracy, and provide more unambiguous information on the distribu-
tion f (E, J). Usually this is radial velocity data (of the sort catalogued
in Pryor & Meylan 1993) but useful internal proper motion data exist for
some clusters (e.g. van Leeuwen et al. 2000).
The problem of inverting the integral equations from observational data
rather than from mathematical functions is another issue. Consider, for
example, the problem of determining ρ(r). If information on the distribu-
tion of the stars is given in the form of star counts, or even the positions
of all the stars, the inversion usually magnifies the noisiness of the data.
One way to cope with this is to determine a function ρ(r) which best
fits the data with the constraint that it is not too noisy. In practice, one
represents ρ on a radial grid, writes down a corresponding expression for
Σ at the location of each star, and then maximises the likelihood, with an
additional penalty function which is large when the grid of values of ρ is
too noisy, as judged by the differences in the tabular values (in a suitable
sense.) This is not a purely academic discussion; Fischer & Gebhardt
(1995) have given some interesting applications of this kind of method.
There are merits and disadvantages in both methods of fitting models
to data. Parametric fitting can yield deceptively agreeable fits, while
the results of non-parametric fitting take insufficient account of what is
known on theoretical grounds about the distribution function, and the
result depends on the choice of penalty function, leaving considerable
freedom in the hands of the investigator. Nevertheless the velocity maps
of the rotating cluster ω Cen produced by Merritt et al. (1997) are a
striking demonstration of the power of this approach.
Problems
1) Verify that u = − ln(z 2 /2) is an exact solution of Eq.(6). Write down
the corresponding density, and compute the mass inside radius r, and
the projected density.
2) Transform Eq.(6) to variables v = u+log(z 2 /2) and t = log z. Deduce
that v → 0 as t → ∞, and hence show that all isothermal models have
infinite mass. By linearising about√ the equilibrium v = 0, show that
u(z) ' − ln(z 2 /2) + az −1/2 cos( 7(ln z)/2 + b), where a and b are
constants. The decaying oscillations are visible in Fig.1.
Models of observational data 83
∗
R.A. Lyttleton used to say that constructing a mathematical model was a bit like
plucking a chicken, but at the end of the job the bird has still to be airworthy.
Einstein said that everything should be made as simple as possible but not simpler.
†
In the previous chapter we have also used the word “model” in a different but
traditional sense, to denote a specific solution.
84
The scaling description: evaporative models 85
I¨ = 4E − 2W, (1)
∗
For spherical systems it is common to take for R the half-mass radius, which contains
half the mass. Then for many models (cf Chapter 8) β ' 0.4. Another common
choice is the virial radius, defined by taking β = 1/2.
86
GRAPE (Makino & Taiji 1998)
direct parallel (Spurzem 1999)
summation
serial (Aarseth 1999)
N−body model
tree code (McMillan & Aarseth 1993)
9 Methods
Parallel (Joshi et al 2000)
hybrid (Giersz & Spurzem 2000)
Fluid model anisotropic (Louis & Spurzem 1991)
gas
Fig.1. Models of the dynamics of dense stellar systems. We give what we judge to be the most informative introductory reference on
each method of some importance in the study of dense stellar systems; not necessarily the first, or most recent; nor do we include a num-
ber of hybrids, or methods used in other areas of stellar dynamics. The picture of M15 was obtained using NASA’s SkyView facility
(http://skyview.gsfc.nasa.gov) located at NASA Goddard Space Flight Center.
The scaling description: evaporative models 87
dM M
= −µ , (3)
dt tr
Radius
0 1 2 3 4 5 6 7 8 9 10
Time
Fig. 2. Virial oscillations in an N -body model. Initial conditions: Plummer
model (Chapter 8) with N = 16384, and velocities increased to make 2T /W =
−1.4. Units: N -body units (Box 1.1). Each curve is a Lagrangian radius, i.e.
the radius of a sphere containing a fixed mass and centred at the densest part of
the cluster. From the bottom the mass enclosed is 0.02, 0.05, 0.1, 0.2, 0.3, 0.4,
0.5, 0.75, 0.9. Except at large radii, where the time scale is larger, there is no
evidence of more than one complete cycle: the oscillations are heavily damped,
unless two-body relaxation (Chapter 14) is suppressed (cf. David & Theuns
1989). The form factor α (see text) is infinite (logarithmically divergent) for the
Plummer model.
88 9 Methods
This has to be proved from one of the more fundamental models coming
later in this chapter. As an illustration we start with the collisionless
Boltzmann equation
∂f ∂f ∂φ ∂φ
+ vi − = 0.
∂t ∂xi ∂xi ∂vi
If we define Z
I= mf r2 d3 rd3 v,
The first of these is evidently four times the kinetic energy, 4T , while
integration with respect to v shows that the second term is
Z Z Z
r · (r − r0 ) 3 3 0
−2 ρ(r)r · ∇φd3 r = −2G ρ(r)ρ(r0 ) d rd r .
|r − r0 |
Interchanging r and r0 shows that either expression is
Z Z
ρ(r)ρ(r0 ) 3 3 0
−G d rd r = 2W.
|r − r0 |
Finally, E = T + W , and so the virial theorem follows:
I¨ = 4E − 2W.
For a system in dynamical equilibrium, I¨ = 0 on average, and so 4E −
2W = 0, or 2T + W = 0. These are common forms in which the virial
theorem is applied.
The Fluid Description: gas models 89
Fig. 3. Four models of core collapse and gravothermal oscillations: (top left)
an extension of the evaporative model; (top right) the gaseous model; (bottom
left) the Fokker-Planck model; (bottom right) the N -body model. After Allen
& Heggie (1992)∗ , Heggie & Ramamani (1989), Cohn et al. (1989) and Makino
(1996). The density at the centre of a system is plotted against time. The initial
rise to the first maximum corresponds to the phenomenon of “core collapse”,
while the subsequent variations are referred to as “gravothermal oscillations”.
∗ ∗
With reference to Eqs.(17) in that paper, the model parameters used were T10 =
4.186, ξT = 0.00037 and Γ = 1.28, and the initial values were 3.437346, 3.442962
and 1.258366.
The Fluid Description: gas models 91
The resulting model has been very successful in developing our under-
standing of the million-body problem. The phenomenon of core collapse
(Chapter 18) was first understood in detail using models of this gen-
eral kind. Even more impressively, it was with these models that the
phenomenon of gravothermal oscillations (Chapter 28) was actually dis-
covered. Fig.3 shows this behaviour modelled with the four different
methods discussed in this chapter.
Generally speaking the gas model produces results which compare very
well with those produced by other techniques (Aarseth & Lecar 1975),
though the reason for this success has remained obscure. For example,
it has not been demonstrated that all the equations may be obtained by
taking moments of the Boltzmann equation. If one does take moments,
however (Larson 1970, Louis 1990) it is possible to obtain a set of equa-
tions which are rather similar in complexity and effectiveness to the gas
equations, provided that a suitable closure assumption is made, but they
The Fluid Description: gas models 93
Box 3 (continued)
In the case of the moment method, Eq.(1) is obtained from the Fokker-
Planck equation. For the present purpose this is best written in the form
∂f ∂f ∂φ ∂f ∂f
+ vi − = ,
∂t ∂xi ∂xi ∂vi ∂t enc
where the left-side is as in the collisionless Boltzmann equation, and the
right-hand side is the “encounter term”, the form of which is discussed
in Chapter 14. (Briefly, encounters alter the velocities of participating
stars, and this changes f at a rate described by this term.) Next, take
the moment with respect
R to v of m(v − hvi)2 , where the mean velocity is
defined by ρhvi = f vd3 v. The moment of the right-hand side vanishes,
as encounters do not change the total mass, momentum or kinetic energy
of stars in a small spatial volume. Computing the moment of the left-hand
side requires some care. It is also necessary to use the zeroth moment,
∂ρ
or continuity equation, i.e. + ∇ · (ρhvi) = 0, and the sort of tricks
∂t
such as integration by parts that were adopted in the derivation of the
virial theorem (Box 1). If it is assumed that the distribution function is
isotropic in a frame moving with velocity hvi, the result is exactly Eq.(1),
with an explicit form of L, i.e.
1
L = 4πr2 ρhr̂.(v − hvi)(v − hvi)2 i. (2)
2
The term on the right, which involves a third moment of the velocity
distribution, is clearly the flux of kinetic energy (in the rest frame of the
material) crossing a sphere of radius r.
The equation looks the same, but the essential difference with the gas
model is that, in the moment method, another equation must be written
down for the evolution of the third moment. This in turn involves a
fourth moment, whose evolution involves a fifth moment, and it is usual
at this point to close the chain by assuming that this fifth moment can
be expressed in terms of lower moments.
differ in detail (Box 3). One reason for this is that the mean free path
in a stellar system is very long (compared to the typical size of the sys-
tem; cf. Problem 14.5), whereas in an actual gas the mean free path is
short compared to the length scale of the system. Perhaps the success
of the gas equations may be explained if we suppose that so much of
the behaviour of stellar systems depends more on the thermodynamics of
94 9 Methods
Aarseth 1993), though this depends on the formulation (cf. Jernigan &
Porter 1989).
For an honest solution of an N -body problem, however, the problems
are of a different order. As is evident from Problem 3.2, it is all too
easy to code these equations inefficiently, and the present state of the art
represents about 40 years of sustained improvement (Fig.3.1). Different
particles must be integrated with different time steps (or a hierarchy of
timesteps). Depending on the hardware, the force of distant particles
can be summed less often than that from near neighbours (Ahmad &
Cohen 1973). The long-range force itself is readily evaluated nowadays
using special-purpose hardware (GRAPE). Special care is still needed to
deal with the singularities of Eqs.(6) (cf. Chapter 15). Long-lived triple
systems pose another batch of awkward problems. And all this before
one has even figured out how to analyse the data (cf. Casertano & Hut
1985, Eisenstein & Hut 1990), or considered adding the effects of stellar
evolution to the problem.
For some purposes there is no substitute for an honest N -body simu-
lation. One shouldn’t forget, however, that for a wide range of simpler
and yet important problems, more approximate methods such as Fokker-
Planck techniques may be quite adequate, and certainly much faster. For
such problems all techniques give quite similar results (e.g. Aarseth et
al. 1974, Giersz & Spurzem 1994, Takahashi & Portegies Zwart 1998).
Problems
4) Compute the half-mass and virial radii for Plummer’s model, i.e. a
model whose density is
ρ0
ρ= !5/2 ,
r2
1+ 2
a
where ρ0 is the central density and a is a constant.
5) Starting from the collisionless Boltzmann equation, either (a) fill out
the details of the calculation of Eqs.(1) and (2) in Box.3, or (b) derive
Eq.(4).
6) Estimate the conductivity in gas models by equating the time scale of
thermal energy transport with the relaxation time (Eq.(14.8)). Check
your estimate against the model of Lynden-Bell & Eggleton (1980),
who give L = −12πGmC ln Λr2 (ρ/σ)(∂σ2 /∂r), where C is a constant.
7) Suppose that the potential is dominated by a massive central black
hole, and that the density (ρ) and velocity dispersion are power laws in
r. Estimate the energy of stars between r and 2r, and the relaxation
time. If the luminosity L is constant (independent of radius) show that
ρ ∝ r−7/4 . (For a thorough treatment of this problem see Bahcall &
Wolf 1976.)
PART III. MEAN FIELD
DYNAMICS: N = 106
98
10
Violent Relaxation
99
100 10 Violent Relaxation
Phase Mixing
As a first concrete illustration, let us consider a system which is described
by the collisionless Boltzmann equation (Chapter 6). In order to simplify
the discussion to its utmost, let us consider instead the corresponding
one-dimensional problem in a time-independent potential φ(x), i.e.
∂f ∂f ∂φ ∂f
+v − = 0, (1)
∂t ∂x ∂x ∂v
where x is the position, v is the velocity. Now the motion of a particle in
a one-dimensional, time-independent potential is integrable, and instead
of using coordinates x and v in phase space, we may be better to use
action-angle variables, J and θ (e.g. Goldstein 1980). These are canonical
variables such that J is constant and θ (the “phase angle”) increases
linearly with time, its rate of change being some J-dependent frequency
ω(J). Similarly, the distribution function may be written as f (θ, J, t), and
then the collisionless Boltzmann equation, Eq.(1), takes the very simple
form
∂f ∂f
+ ω(J) = 0. (2)
∂t ∂θ
(Note that Eqs.(1) and (2) are different forms of the general equation
f˙ + [H, f ] = 0, i.e. Eq.(6.6), where square brackets denote the usual
Poisson bracket and H is the Hamiltonian. In the variables of Eq.(1) this
is H = v2 /2 + φ, while ω = dH(J)/dJ if action-angle variables are used.)
Phase Mixing 101
00 π 2π 0 π 2π 0 π 2π
ϑ ϑ ϑ
Fig. 1. Phase mixing. Here ω = J. The top three frames give the initial
conditions, as described in the text, and the solution at times t = 2 and 20. The
lower three frames give the same information from 100 initial points.
102 10 Violent Relaxation
initially that all particles are in the region where f0 6= 0, and that we
follow their motion in this diagram. At subsequent times the particles
will still occupy the region where f 6= 0. Eventually, however, the fine
detail which is visible in the top row of Fig.1 would not be discernible
in the distribution of points; certainly this would be true by the time
that the filaments are so fine that most do not contain any point. The
lower three frames of Fig.1 illustrate this. In the last frame the points
are confined to 10 or 11 nearly horizontal strips, as in the frame above,
but this is not evident unless one looks carefully for it (e.g. by looking
at the figure along the page). By this stage we see that the distribution
of points does not change perceptibly, even though the points themselves
are in rapid motion. There is a sense in which equilibrium is approached,
when we think in terms of particles rather than the distribution function;
it is a statistical equilibrium.
The same idea can be reexpressed in terms of the distribution function.
The distribution function is used to tell us how many particles there are
Rin regions of phase space. Let R be such a region. When we compute
R f dθdJ the fine filaments in f do not matter; we would obtain almost
the same answer if f were smoothed over little regions of phase space
which are small compared with R but large compared with the thickness
of the filaments.
It is not difficult to see that, after some time, this smoothed distri-
bution, which is often referred to as a “coarse-grained distribution” in
statistical mechanics, is given by averaging f0 with respect to θ. In other
words, in this smoothed or coarse-grained sense, f does indeed relax to
hf0 iθ . In fact such an equilibrium solution is asymptotically stable, in
the coarse-grained sense (Problem 1). And it is important to understand
the evolution of the coarse-grained density. When one attempts to infer
a phase density by looking at a galaxy with a telescope, or at an N -body
system with a computer, the best one can hope to do is to estimate a
coarse-grained density.
Curiously, the process we have discussed is not often referred to as
relaxation, but as “mixing”. In fact this word, like stability, is one which
is overloaded with meanings, only one or two of which we touch on in
this chapter. The particular type of mixing we have introduced is of a
relatively mild kind, more like setting the tea in a cup in steady circulation
(with a careful circular motion of the spoon) than churning it up with a
vigorous stir. In the dynamical context it is referred to as “phase mixing”,
for good reason. Consider two particles with neighbouring values of J and
the same phase θ at some time t. The original phases of these particles,
i.e. at t = 0, were θ − ω(J)t. Now ω depends on J in general, and so
if t is large the original phases differed greatly, even though the particles
now occupy neighbouring positions in phase space. The original phases
The End Point of Violent Relaxation 103
have become mixed together, by the same mechanism which led to the
stretched, narrow filaments in Fig.1.
Notice that f in Eq.(3) depends on two integrals of motion (i.e. J and
θ − ωt, which illustrates Jeans’ Theorem (Chapter 6). The two integrals
have rather different character, however. One (the first argument of f )
is phase-dependent, and irrelevant for smooth solutions (at late times, in
this case). It is an example of a non-isolating integral, because curves on
which it is constant lie extremely close together (at late times); they do
not “isolate” gross regions of phase space from each other. In all these
respects it is quite different from J, which is an example of an isolating
integral.
∗
As an example, let C(f ) = f 2 (a convex function), and suppose that f takes two
values f1 and f2 with equal probability. Then F = (f1 +f2 )/2, hC(f )i = (f12 +f22 )/2,
and C(F ) = hC(f )i − (f1 − f2 )2 /4
104 10 Violent Relaxation
R
C(F )dτ is less than (or at least never exceeds) its initial value, where
dτ is the element of phase volume (Tolman 1938, Tremaine et al. 1986).
Unfortunately, this is not quite the same thing as proving an H-theorem,
which states that there is a function H which decreases with t (Dejonghe
1987b, Soker 1996).
Unfortunately there are other reasons why this does not get us very
far: there is no compelling reason to prefer one form for C over any other,
and so we are no nearer finding the function to which F should tend. But
we can deploy a further argument. The evolution is constrained by the
fact that, as we follow the flow in phase space, the value of f around us
does not change. (Remember that this is what Eq.(1) means; the phase
fluid is incompressible (Chapter 6).) Therefore particles cannot crowd
together more closely than at time t = 0; they obey a kind of classical
exclusion principle. It is not too surprising, therefore, that the most
probable distribution of particles subject to this constraint is one which
resembles Fermi-Dirac statistics (Lynden-Bell 1967; see also Shu 1978).
Sadly, numerical experiments give at best ambiguous support to this
idea (e.g. Cuperman & Harten 1972), and it is not hard to see one
possible reason for this. We are relying on fluctuations in the potential
to drive the evolution. But the potential depends on the density, which is
an integral over the distribution function. We have seen, in the context of
phase mixing, that such integrals settle down to the integral of a steady,
coarse-grained distribution. Therefore one of the effects of the evolution
is to quench the very potential fluctuations which drive the relaxation.
It is therefore not surprising that the relaxation fails to complete its task
of bringing F to the expected equilibrium form.
None of this theory really vividly brings to mind the picture that as-
trophysicists see when “violent relaxation” is mentioned. They think of
galaxies colliding in what is little short of an explosion (Fig.2). The
galaxies are shattered, and some fragments may even be ejected from the
interaction region fast enough to escape. If this is so there will be almost
as many fragments with small positive energies as there are with small
negative energies. (The escape energy is a quantity set by conditions far
away from the place where the energies of the particles are being vio-
lently relaxed, and the relaxation process does not know about it.) In
other words, the number of stars per unit energy and angular momentum,
N (E, J), is almost independent of E near E = 0, at least for those stars
whose angular momenta are small enough for them to have come from
the ejection region. (If vesc is the escape speed from the edge of the in-
teraction region and R is its radius, the condition is J < Jmax = Rvesc .)
Stars with E ≥ 0 escape, whereas the remainder settle into dynamical
The End Point of Violent Relaxation 105
Fig. 2. Two colliding galaxies (from Mihos & Hernquist 1996). The time (in
units of approximately 1.3 Myr) is given at top right of each frame.
∗ ∂φ
In cylindrical polar coordinates the equations are r̈ − J 2 /r3 = − (r, z),
∂r
∂φ
z̈ = − (r, z). Unless z = 0 the oscillation in z influences the oscillation in r, and
∂z
vice versa.
Motion in Nonintegrable Potentials 107
1.8
1.6
1.4
1.2
1
J
0.8
0.6
0.4
0.2
0 1 2 3ϑ 4 5 6
1.8
1.6
1.4
1.2
1
J
0.8
0.6
0.4
0.2
0 1 2 3ϑ 4 5 6
1.8
1.6
1.4
1.2
1
J
0.8
0.6
0.4
0.2
0 1 2 3ϑ 4 5 6
studying the motion of a single particle, let us consider what happens if, at
time t = 0, we consider a set of particles occupying a little patch in phase
space, much as in our consideration of Fig.1. Again this patch becomes
stretched out as the evolution proceeds, but the rate at which it does so is
very different. In Fig.1, the stretching is caused by the linear increase of
the phase with time. In the corresponding motion in Fig.3c, however, the
factor by which the patch of phase is stretched increases approximately
exponentially with time† . There is mixing, and it is very different from
phase mixing, but it leads to the same end result: in the region where
this kind of behaviour occurs, the coarse-grained distribution function F
will approach an equilibrium form.
Different chaotic zones may well be separated by curves. In this case, a
particle which starts its evolution in one zone is forever separated from the
other. The consequence of this is that the equilibrium value of F in one
region may be quite different from that in another. We should picture a
coarse-grained distribution function which varies smoothly across regions
of regular motion, rather as in the one-dimensional case, but which is flat
in each chaotic zone.
We have spoken of the curves between neighbouring chaotic zones as
impermeable barriers. In fact some of them leak (and in three dimensions
all of them do). A few leak like sieves, with large holes which let particles
through readily; others are more like the paper in a coffee filter, and it
takes a long time before the leakage becomes noticeable. It follows that
the notion of an “equilibrium coarse-grained distribution” depends on the
interval of time, T , in which one is interested. If the boundary between
two chaotic regions is easily breached on a time scale much shorter than
T , then the appropriate distribution will be constant across both regions.
It may be different in two regions if the barrier between them is unlikely
to leak over a time of order T .
Unless the potential we are looking at is very simple, there will be
many regions of chaotic motions, and the barriers between them may be
breached on a wide variety of time scales. It would therefore follow that,
no matter how long we waited, the coarse-grained distribution function
would always be changing. There would always be barriers which could
only be breached on a time-scale comparable with the present age of the
system, and it is leakage across those barriers that would be the most
noticeable evolutionary process at that stage. The distribution function
would already have reached equilibrium across barriers which could be
†
This is different in character from the exponential divergence discussed in Chapter
13. A better picture to have in mind is Arnold’s famous “cat” map (cf. Lichtenberg
& Liebermann 1992, p.306).
Relaxation, Stability and Evolution in Dynamical Astronomy 109
Problems
1) Determine the t-dependence of f at a given point in Fig.1. If f is
averaged over a small interval δJ in J, show that hf i → 1/2, and
determine a time after which |hf i − 1/2| < for any given > 0.
2) Prove the quasi-H-theorem stated in the text for any convex function
C, when evolution is governed by the collisionless Boltzmann equation.
3) Fig.3 deals with the system θ̇ = J, J˙ = sin θ cos t, where = 0, 0.02
and 0.2, respectively, in the three frames. Defining φ = θ − t, show
that φ varies slowly near J = 1, and use the method of averaging to
show that J and φ evolve approximately according to the equations
φ̇ = J − 1
J˙ = sin φ
2
if is small. Within this approximation deduce that (J − 1)2 /2 +
(/2) cos φ is nearly constant, and hence determine the width (in J)
of the resonant region.
11
Internal Mass Loss
111
112 11 Internal Mass Loss
0.8
Fraction of mass remaining
0.6
0.4
0.2
0
7 7.5 8 8.5 9 9.5 10
log(time) (yr)
Fig. 1. Mass lost as a function of time for five possible model stellar populations.
The middle curve corresponds to an analytical approximation of a Miller-Scalo
mass function (Eggleton et al. 1989). The others give results for a power law
mass function with probability density f (m) ∝ m−α , with α = 3.35, 2.35, 1.35
and 0.35. (The value 2.35 comes from Salpeter 1955.) In each case the maximum
and minimum mass are 15 and 0.1 times the mass of the sun, respectively. The
prescription for stellar mass loss is taken from Chernoff & Weinberg (1990),
though they applied it to different mass functions.
Evolution of Length Scale 113
Fig. 2. Remaining bound mass following mass loss (after Lada et al. 1984).
The time scale of mass loss is τR , and τC is a measure of the crossing time. Slow
mass loss is less disruptive than fast mass loss.
small, bound system. Even if less than half the mass is lost, some stars
escape (Fig.2).
The role of dynamic equilibrium in this can be assessed by introducing
the virial ratio q = −T /W . Thus q = 1/2 in equilibrium, and if the
system is bound initially we require 0 < q < 1. If the above calculation
is repeated the result is that E 0 = E(1 − f )(1 − f − q)/(1 − q), and so
the condition that the final system is bound is that f < 1 − q. Thus
a “cool” system can lose a relatively high fraction of its mass and still
remain bound.
Now we return to systems in dynamic equilibrium, and consider what
happens if only a tiny fraction of the mass is lost. Then the change
in mass is dM = −f M , and so dE = E 0 − E ' −3EdM/M . If the
loss of mass is sufficiently slow, the system is able to remain close to
virial equilibrium. Therefore this relation is true for subsequent mass-loss
events, and it follows that E ∝ M −3 . Thus the system can, in the end,
lose an arbitrarily large fraction of its mass without becoming unbound.
Also, the length scale of the system can be estimated by R ∼ −E/M 2 ,
and so R ∝ M −1 .
Before passing on, it is worth stressing that all three scenarios lead to
an increase in the energy of the system, and since the energy is initially
negative, this means that the magnitude of the energy decreases. It might
114 11 Internal Mass Loss
seem that this is obvious, as the mass of the system has also decreased. It
is easy to see, however, that even the energy per unit mass increases, i.e
becomes less negative. In every sense, then, these mechanisms of slow or
impulsive mass loss lead to a heating of the system. We shall see in Part
VIII that some heating mechanism is needed in order to lead a stellar
system to a peaceful death, and nothing more complicated than loss of
mass is required.
Now we turn our attention, more briefly, to another important mech-
anism of mass loss, whose effects on the energy of a system could not
be more different. It occurs because of interactions of pairs of stars. In
such interactions the two stars exchange energy, and in some cases the
final energy of one of the stars may exceed the escape energy. Since the
energy of this escaper is positive, the energy of the remaining (bound)
members of the system is more negative than before. Let us assume, fur-
thermore, that the average energy carried off by the escaper is a certain
fraction of the energy of a single member of the stellar system. There-
fore we may assume that the changes in mass and energy are related
by dE = −f dM (E/M ), where f is a constant (unrelated to the con-
stant f above). Note that dE, dM and E are all negative here, and so
f > 0. It follows that E ∝ M −f , and so R ∝ M 2+f . The results of this
mass loss are therefore qualitatively very different from what we discussed
previously: as the mass of the system decreases the system shrinks. His-
torically, this was the first suggestion of the evolution of stellar systems to
a singular endpoint (Ambartsumian 1938, Spitzer 1940), an issue which
will play a major role in later chapters.
Evolution of Structure
So far the discussion has dealt with gross issues of mass, energy and
radius, but has said nothing about how the detailed structure of a stellar
system is altered. For this purpose we restrict attention to systems which
remain spherical, and close to dynamical equilibrium. Then we may
suppose the distribution function f depends only on E, J and t, where
now E refers to the specific energy of a single star. How, then, does
f (E, J, t) change in response to the loss of mass, i.e. to the resulting
changes in the potential?
One way of arriving at the correct answer is to substitute the expression
f (E, J, t) into the collisionless Boltzmann equation (Eq.6.5), remember-
ing that we must express E and J as functions of r and v. For example
E = v2 /2 + φ(r). Note that E depends on t because φ is time-dependent.
It follows that
∂f dE ∂f
+ = 0. (1)
∂t dt ∂E
Evolution of Structure 115
The argument we have just given is insecure. The rate of change of energy
dE/dt = ∂φ(r, t)/∂t, and so depends on the location of the star. Thus
Eq.(1) tells us that the rate of change of a function which depends on only
E, J and t depends also on r. It is natural, perhaps, to interpret the term
dE/dt in Eq.(1) as an average over the orbit of given energy and angular
momentum, and Box 1 confirms this guess with a better argument.
where N dEdJ is the number of stars with energy and angular momentum
in ranges dE and dJ , respectively, and P is their radial period; the sub-
script indicates that the time-derivative is to be taken at constant values
of Ir .
116 11 Internal Mass Loss
We shall not stop to solve these equations, but draw attention to one
or two qualitative points which their solutions would exhibit. We have
already seen that one consequence of mass loss may be the expansion of
the system. However, the angular momentum of each star is constant, and
determined by the size and velocity dispersion of the system initially. By
the time the system has expanded by a large factor, therefore, most orbits
will have low angular momentum for a system of such size. Even if the
initial distribution of stellar velocities had been isotropic, in the expanded
system nearly radial motions would predominate. It is also likely that this
is one mechanism by which the anisotropy of stellar systems grows in so-
called post-collapse evolution (Chapter 28), even though the mechanism
of the expansion looks rather different.
Despite these remarks frequent use is made in stellar dynamics of an
analogous equation for the evolution of f when the anisotropy is ne-
glected, viz.
∂f ∂s ∂f ∂s
− = 0, (2)
∂t ∂E ∂E ∂t
where s is the volume in phase space of the region in which the energy
is less than E (cf. Chapter 9 and this chapter, Problem 4.) Again this is
based on Eq.(1), except that the expression for Ė is obtained by averaging
over all points in phase space accessible to a star of energy E, instead of
the points accessible to a star of energy E and angular momentum J.
Problems
1) Suppose the main sequence lifetime of a star of mass m is tms =
1010 × (m/M )−2.5 yr (cf. Hansen 1999), that the entire mass of the
star is lost at this time, and that the mass function is as in the caption
to Fig.1. Compute the time scale of mass loss of the cluster, i.e.
−M/Ṁ. (For a more elaborate formula for the main sequence lifetime
see Cassisi & Castellani 1993.)
2) Assume that the mass in Keplerian motion varies as m(t) = mr +
(m0 − mr ) exp(−t/τ ), where m0 and mr are the initial and final (rem-
nant) masses, respectively, and τ is constant. The radial equation of
motion is
c2 Gm(t)
r̈ = 3 − ,
r r2
where c is the specific angular momentum; we assume that this is
constant. The orbit is initially circular with radius a, and we assume
that τ is much smaller than the orbital period, but not infinitesimal.
Show that, during the time when r does not vary significantly from
Evolution of Structure 117
GM x
ẍ − 2ω ẏ + (κ2 − 4ω 2 )x = −
r3
GM y
ÿ + 2ω ẋ = − (1)
r3
GM z
z̈ + ωz2 z = − .
r3
119
120 12 External Influences
Zero-Velocity Surfaces
The analysis of Eqs.(1) begins easily enough. There are two equilibrium
points, on the x-axis at x = ±rL , where
rL ≡ (GM/(4ω 2 − κ2 ))1/3. (3)
These are called Lagrangian points, as they correspond approximately
to two exact solutions of the general three-body problem discovered by
Lagrange (1772).
There is also an integral, called the Jacobi integral, given by C = (ẋ2 +
ẏ + ż 2 )/2 + (κ2 − 4ω 2 )x2 /2 + ωz2 z 2 /2 − GM/r. Its existence stems from
2
the fact that the potential is steady in this rotating, accelerating frame.
For a star at rest at one of the Lagrange points, C = CL ≡ −3GM/2rL.
The equipotential surfaces
(κ2 − 4ω 2 )x2 /2 + ωz2 z 2 /2 − GM/r = const (4)
are called “zero-velocity surfaces”, and are generalisations of what are
called “Hill’s curves” in the two-dimensional problem. These surfaces
exclude the motion from certain regions of space (Fig.1). For values of
C < CL they consist of three pieces, the innermost of which confines a
star to the vicinity of the cluster. Therefore if a star is situated within
122 12 External Influences
the cluster and has C < CL , it cannot escape. This is analogous to the
existence of an escape energy for stars in an isolated cluster. The lowest
value of C at which the surface opens out to infinity is C = CL , and so
it is usual to adopt the “radius” of this last closed surface, i.e. rL , as the
limiting radius of a cluster in a galactic potential. It is usually called the
tidal radius, denoted by rt .
When C CL the star is confined to the middle of the cluster, far from
the reach of the tidal field. It behaves almost like a star moving in the
potential of the cluster alone. As C increases the star can wander further
and the effect of the tidal perturbation becomes stronger. Fig.10.3 tells
us what to expect, and helps to understand the surface of section actually
at C = CL (Fig.2). This shows several planar orbits (z = 0), and plots
the phase space coordinates x and ẋ whenever y = 0 and ẏ > 0. The
points are confined by the Jacobi integral, and the orbits are a mixture
of chaotic and regular ones. The large area of regular orbits on the left
correspond to retrograde motions, i.e. the stars move around the cluster
in the opposite sense to the motion of the cluster around the galaxy. We
have already seen why it is that retrograde motions are so special, and
there is plenty of supporting numerical evidence (e.g. Keenan & Innanen
1975).
Now we return to questions of escape. We have just stated that there
is a limiting “energy” below which escape is impossible, just as in the
problem of escape from an isolated, spherical potential. There the resem-
blance ends. In an isolated cluster, a star with energy above the escape
0.6
0.4
0.2
z
0
-0.2
-0.4
-0.6
-1
-0.5
y0 -1
0.5 -0.5
0
0.5 x
1 1
4
3
2
1
dx/dt
0
-1
-2
-3
-4
-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8
x
Fig. 2. Surface of section for a modified form of Hill’s equations. Here the
potential of the cluster is a Plummer potential with a suitably chosen scale
√ radius.
The equations of motion are obtained from Eqs.(1) by replacing r by r2 + a2 ,
where a = 0.1 is a suitably chosen scale radius, and setting κ = ω = GM = 1.
For further explanation see text.
energy will almost certainly escape (unless, on its way out, it encounters
another star which once again reduces its energy sufficiently.) In a cluster
in the tidal field of a galaxy, however, this is not guaranteed. We have al-
ready seen that stars in retrograde motion may remain near (but outside)
the cluster indefinitely, and this may easily happen to cluster members in
retrograde motion with C > CL . (The link between epicyclic motion and
retrograde escapers was drawn to our attention by Jeremy Goodman.)
One might say that the tidal radius for retrograde orbits exceeds that for
prograde orbits (Danilov 1999).
Even more fascinating is the fate of stars in prograde motion with
C > CL , and is most easily discussed when C − CL is small. Then the
openings in the equipotential surface through which a star must pass in
order to escape are tiny. Stars which do escape must do so through these
openings, and it is clear that the time they take to find them depends
on the amount by which C exceeds CL (Fukushige & Heggie 2000). This
in turn depends on how much the energy can be changed by relaxation
effects (Chapter 14) during the time taken to find the gaps. The result
of this interplay is that the rate of escape from star clusters in a steady
tidal field depends on N in a quite unexpected way (Baumgardt 2001).
When stars do escape, they do so along the directions of the Lagrange
points, as expected (Fig.2.2). But the Coriolis acceleration (i.e. the
124 12 External Influences
terms with coefficient ±2ω in Eq.(1)), bends the orbits along the direction
of motion of the cluster round the galaxy (Fig.3, left). “Extratidal”
extensions of this general nature are now known around a number of
Galactic globular clusters (e.g. Grillmair et al. 1995) and can be modelled
quite successfully (Combes et al. 1999).
When C = CL the width of the gaps through which stars escape re-
duces to zero, and they lie at the Lagrangian points. These equilibrium
solutions exist only at this critical value of C. As C increases slightly,
they give birth to a pair of periodic orbits, called Liapounov orbits. These
orbits are unstable, but they control the way in which stars can escape.
(Incidentally, the analogous orbits near the Earth, perturbed by the sun,
lead to a series of orbits which have a role in space dynamics. On Decem-
ber 2 1995 the international satellite SOHO, for observations of the sun
and heliosphere, was placed on an orbit of this general kind, with occa-
sional adjustments to quench the instability (“station keeping”). An orbit
in this location is particularly suited for continuous solar observation.)
The manner in which the Liapounov orbit controls escape is a little
complicated, but is most easily understood for orbits in the x, y plane.
For such orbits the Liapounov orbit behaves like a saddle point. An
escaper, which necessarily has to come close to this orbit if C − CL is
small and positive, must pass on one side or the other. On one side it
is deflected by the Liapounov orbit out of the cluster, and it escapes in
every sense. If it approaches on the other side it is deflected back into the
cluster, and must try again later, when the same alternatives apply. Even
a star which does escape may, if it approaches too close to the Liapounov
orbit, spend a long time there before being deflected one way or the other.
Furthermore, the number of times a star has to approach this orbit before
escaping is an extremely irregular function of its position.
Incidentally, the Liapounov orbits (there is one near each Lagrangian
point) also play a critical role in the behaviour of the coorbital satel-
lites of Saturn, and there also give rise to sensitive dependence on initial
conditions. It is an example of “irregular scattering” (Hénon 1988).
Time-Dependent Tides
200 200
150 150
100 100
50 50
0 0
-50 -50
-100 -100
-150 -150
-200 -200
-150
-100
-50
0
50
100
150
-150
-100
-50
0
50
100
150
Fig. 3. Mass loss by a cluster in a tidal field. On the left a King model with
concentration W0 = 3 (Chapter 8) is placed on a circular galactic orbit, and is
shown after one revolution. Stars are lost steadily. On the right the same cluster
has been placed on an elliptical orbit of eccentricity 0.5; it starts at apogalacticon
with a radius corresponding to the tidal radius at perigalacticon, and is shown
after one revolution. In this case stars are lost mostly near perigalacticon, and
there is a gap in the distribution of escapers near the cluster. Galactic centre is
to the right.
126 12 External Influences
with a normal speed of order 100kms−1 . On the other hand the radius
of a typical cluster is of order 10pc, while typical speeds for stars within
the cluster are of order 10kms−1 . Therefore the timescales of passage
through the disk and orbital motions within the cluster are comparable.
This is why the process is often referred to as disk shocking.
While a cluster is traversing the disk, the relative effect of the disk
and the cluster on the internal motions within the cluster are propor-
tional to the differential accelerations, across the extent of the cluster,
of the fields of the disk and cluster, respectively. Since the density in
the disk (∼ 0.1M pc−3 ) is much smaller than that in a typical clus-
ter (∼ 103 M pc−3 ), and since the event is so short-lived, the effect of
the disk is relatively minor. Nevertheless, clusters experience of order
100 such shocks in their lifetime, and so their cumulative effect is not so
negligible.
Box 2 shows that each shock tends to heat the cluster (until it reviri-
alises, once the shock is over), and finally gives an estimate of the time
scale on which repeated shock heating becomes effective. Notice that none
of the parameters of the cluster appear in that expression. Therefore, in
a galaxy with a massive disk, all clusters will succumb to destruction by
disk shocking on a comparable time scale, unless some other process acts
even faster∗ . This is, admittedly, an oversimplification. The strength of
shock heating does depend in detail on the structure, size and mass of the
cluster, and the point in the plane where the crossing occurs: the den-
sity within the disk increases significantly towards the centre of a galaxy.
Also, our estimate of the importance of this effect was a global one; in
fact the effect of disk shocking is strongest on the outer stars of a cluster.
Indeed for many clusters this is the dominant mechanism for changing
the energies of the outermost stars, and has therefore been referred to as
“shock relaxation” (Kundić & Ostriker 1995). Detailed models suggest
that it has a substantial effect on the escape rate (Spitzer & Chevalier
1973).
There is one circumstance in which this effect is suppressed, and that is
when we are considering a cluster where the time scale of the disk crossing
considerably exceeds that of orbital motions within the cluster. One
might expect then that the energy of a star should behave as an adiabatic
invariant, but in fact the situation is more complicated and interesting. In
the first place the typical orbital time scales are longest in the outer parts
of a cluster, and so the effects of a moderately slow disk crossing continue
to be significant there. Secondly, in a general spherical potential the
∗
J.P. Ostriker pointed this out to us
Time-Dependent Tides 127
Problems
1) Show that the mean density within a tidally limited star cluster is
constant, and comparable with that of the galaxy.
2) In a galaxy with a flat rotation curve, show that the tidal radius (for a
cluster of given mass) varies with galactocentric distance as rt ∝ R2/3 .
(See Surdin (1995) for a comparison with observational data.)
3) Explain qualitatively why seas on Earth exhibit two tides each day.
4) Consider a very large, planar epicycle, given by Eqs.(2), which encir-
cles the cluster. By using the method of variation of parameters on the
full Eqs.(1), and averaging the resulting equations over the epicyclic
motion, show that the centre of the epicycle oscillates stably.
5) By linearising Eqs.(1) in the vicinity of the Lagrangian point, find
an approximation to the Liapounov orbits there, and examine their
stability, all within the linear approximation.
PART IV.
MICROPHYSICS: N = 2
129
13
Exponential Orbit Instability
Up to this point in the book we have largely turned our back on the
microscopic character of the million-body problem. Usually we have ap-
proximated the gravitational field by that of a smooth distribution of
matter. Now we concentrate on the interactions between small numbers
of stars in the system, often only two or three stars at a time. In later
parts of the book we shall see how these microscopic processes influence
the large-scale behaviour.
The purpose of this chapter is to look at the question of sensitivity
to initial conditions in the million-body problem. Much current research
in other dynamical problems is devoted to this question, because of its
importance for prediction, for the foundations of statistical mechanics,
and perhaps even for the survival of life on Earth. What lies behind this
remark is the fact that the question of sensitivity is linked to stability,
and the stability of the solar system is something we rely on implicitly.
But the collision of comet Shoemaker-Levy with Jupiter in 1994 reminded
us that the solar system is not the well regulated clock we often take it
for. Less well known is the recent realisation that the rotation of the
Earth (which itself influences climate strongly) appears to be stabilised
by the presence of the moon (see Laskar 1996).
Consider the 1-body problem. The star proceeds with uniform recti-
linear motion r1 (t). Another single star, not interacting with the first,
and starting with a similar initial velocity and position, exhibits sim-
ilar motion r2 (t). The distance between
p p the two stars in space, i.e.
(r1 − r2 )2 , or in phase space, i.e. (r1 − r2 )2 + (ṙ1 − ṙ2 )2 , eventu-
ally depends nearly linearly on t, after an initial transient, during which
they may actually approach. Their separation in momentum space is
constant, however.
130
13 Exponential Orbit Instability 131
To bring gravity into the picture, consider the relative motion of two
stars, i.e. the Kepler problem, with equation of motion
r̈ = −µr/r3 , (1)
where µ is constant. Now we play the same game of considering two Kep-
lerian motions started with similar initial conditions, but not influencing
each other. Assuming they have negative energy, in general these two mo-
tions will have slightly different periods, and so their separation in space
will grow nearly linearly while the separation remains small, except for
an initial transient, and any periodic oscillation. While their separation
δr is small, δr approximately obeys the variational equation
µ 3(r.δr)r
δ̈r = − 3
δr − , (2)
r r2
where r is a solution of Eq.(1). Note that the coefficient on the right-hand
side involves the derivative of the Keplerian acceleration, i.e. the “tidal”
acceleration of Chapter 12. (Observe the resemblance between the form
of these equations in Problem 1 and Eqs.(12.1)).
For a while, Eq.(2) describes approximately the evolution of the dis-
placement between the position on one orbit (r) and that on the other
(r + δr). For large times, however, both orbits remain bounded (being
Keplerian ellipses), and so, therefore, does their separation. The approx-
imation which leads to Eq.(2) breaks down, however, and the solution of
this equation continues to increase nearly linearly (Problem 1).
The three-body problem is non-integrable (cf. Moser 1973), and it is
harder to determine the rate at which solutions separate without recourse
to numerical methods. One problem that has been studied analytically is
Sitnikov’s problem (see Chapter 20). It is a special case of the “restricted”
three-body problem, in which one star, denoted by m3 , is massless. The
other two stars have equal masses and move as a binary on an elliptical
relative orbit. As m3 swings back and forth through the binary it is
subject to the time-dependent acceleration of the binary components.
Sometimes it gains energy and sometimes it loses energy, depending on
the phase of the binary as m3 passes through. It is easy to see that, if the
amplitude of its motion is large, the time between successive passages is
very large. In this case a small difference in the initial conditions leads
to a large difference in the phase of the binary at the time of closest
approach, and a large difference in the change of energy. The subsequent
passage through the binary will then be very different for the two motions.
The motion is very sensitive to initial conditions.
In fact for this example it is plausible to suppose that the difference
between the two motions, while it is still small, is increased roughly by
some factor during each passage through the binary, and therefore grows
132 13 Exponential Orbit Instability
roughly exponentially with time. This growth does not necessarily per-
sist, however, after the time at which the two motions lose all resemblance.
Moreover, in the long run it is likely that one and then the other motion
results in escape. By that stage their separation grows no longer exponen-
tially but nearly linearly (Fig.1). The exponential growth of separation
is also inapplicable at early times: before that first passage through the
binary, the motions separate less rapidly.
While this example might seem ludicrously artificial, it encapsulates
every essential aspect of these questions, even for the million-body prob-
lem. Here too the separation of solutions starting from neighbouring ini-
tial conditions is rather modest at first, and then there is a phase during
which they separate exponentially, until all the stars in the two motions
are separated by a distance of order the size of the system∗ . At still
later times, when stars escape, it may be conjectured that the separation
increases nearly linearly again. Overall, the factor by which the initial
displacement is magnified up until this time can be enormous, even for a
3-body system; factors of order 10200 have been measured in numerical
experiments (Dejonghe & Hut 1986).
In applying this idea to larger N -body systems, all that is missing is an
estimate of the timescale on which the exponential separation acts. After
pioneering numerical work by Miller (1964) there was a long gap before
2 10
1.5 5
1
log|z1 - z2|
0
0.5
0 -5
z
-0.5 -10
-1 -15
-1.5
-20
-2
0
100
200
300
400
500
600
700
800
900
1000
0
50
100
150
200
250
300
350
400
Time Time
Fig. 1. Exponential divergence in Sitnikov’s problem (see text and Chapter 20).
The left hand plot shows one solution of the equations of motion of this problem,
and the right hand plot shows (on a logarithmic scale) the absolute difference
between two initially neighbouring solutions. Up to the time when the third
body escapes (t ' 350) the growth of the difference is roughly exponential.
∗
√
More precisely, at a distance of order the size of the system divided by N ; cf. Box
1. Between that point and the time when √the separation is of order the size of the
system, the separation grows roughly as t.
13 Exponential Orbit Instability 133
Gm/(pV2 )
p D
Box 1 (continued)
Finally we discuss what this result means. First, it is applicable while
the separation between the orbits is small compared with the distance
of closest approach, p. (Recall that the foregoing estimate was based on
taking the differential of the expression GmD/(pV 2 ).) The appropriate
value of p here corresponds to the typical closest distance of approach
in a time of order a crossing time. During this time a star traverses
the entire system. If we project the positions of all stars onto a plane
orthogonal to its trajectory we obtain N stars within a disc of radius R,
i.e. the radius of the cluster. It is√clear, therefore, that the typical closest
encounter distance is of order R/ N . (This is another “nσv” argument!)
Because the growth of displacement is exponential while the displacement
remains small enough, it takes only a few crossing times for any reasonable
initial displacement to reach a size of this order; the dependence on the
initial displacement is logarithmic. After the displacement becomes of
this order, the exponential growth of s slows down (Problem 4).
the first theoretical attack on the problem. This was the work of V.G.
Gurzadyan and G.K. Savvidi (1986), who used a formulation of the prob-
lem which goes back to work of N. Krylov. In this theory the trajectory
of a set of N particles can be treated as a geodesic in their configuration
space when it is endowed with a certain metric (Maupertuis’ Principle).
Then the divergence between two trajectories is analysed with the help
of Jacobi’s equation of geodesic deviation. The timescale on which tra-
jectories diverge may be estimated by computing the scalar curvature of
the manifold, which in turn involves the square of the acceleration ex-
perienced by each body. Gurzadyan & Savvidi estimated this by using
the mean square acceleration, though this is formally divergent, and it is
necessary to impose a short-length cutoff, denoted by rc . Unfortunately
1/2
the estimate of the timescale depends sensitively (as rc ) on this choice,
and the choice made by Gurzadyan & Savvidi leads to a time scale which
subsequent work has not confirmed.
The actual acceleration is never infinite, unless a collision occurs, and
an alternative approach is to recognise that the acceleration on each par-
ticle varies with time, and to consider its cumulative effect. A rough
argument is given in Box 1. It indicates that the encounters which are
most effective in sustaining exponential divergence of neighbouring solu-
tions are those which take place typically once per crossing time. Those
which are closer are more effective, but too rare, while more distant en-
counters are just not powerful enough.
13 Exponential Orbit Instability 135
Problems
1) In Eq.(2), suppose that r corresponds to planar circular motion. In a
frame rotating with the corresponding angular velocity, show that the
coplanar components of Eq.(2) become
ẍ − 2ω ẏ − 3ω 2 x = 0
ÿ + 2ω ẋ = 0,
where ω 2 = µ/r3 . Solve these equations.
2) Show that the variational equations for the N -body problem (analo-
gous to Eq.(2)) are
!
X
N
Gmj 3([ri − rj ].[δri − δrj ])(ri − rj )
δr̈i = − δri − δrj − .
|r − rj |3
j=1,j6=i i
|ri − rj |2
3)
X Gm2
i) Let V = − be the potential function for the gravitational
a<b
rab
N -body problem with equal masses m. (This restriction is inessen-
tial; cf. Gurzadyan & Savvidi 1986). Suppose the 3N -dimensional
configuration space, with coordinates xi (1 ≤ i ≤ 3N ), is given the
metric ds2 = (E − V )dxi dxi , where E is a constant, and summation
convention is in use. Show that geodesics are orbits of the N -body
problem with energy E.
ii) Show that the scalar curvature is
3W,i W,i W,ii
R= (3N − 1)(N − 2) + (3N − 1),
4W 3 W2
where W ≡ E − V . Show that this vanishes if N = 2 (except at a sin-
gularity). Does this have anything to do with the lack of exponential
divergence in the 2-body problem?
iii) For the reduced planar Kepler problem in plane polar coordinates
we have
ds2 = (E + 1/r)(dr 2 + r2 dθ 2 ). (3)
Now consider an axisymmetric surface in R3 of the form (ρ(r), θ, z(r)),
where (ρ, θ, z) are cylindrical polar coordinates. Try to find functions
ρ(r), z(r), so that the metric induced on the surface by the Euclidean
metric in R3 is as in Eq.(3).
4) In the language of Box 1, suppose the separation s exceeds the impact
parameter p of the most effective encounters (p ∼ RN −1/2 , where R
is the size of the N -body system). Ignoring encounters with p
s, show that Eq.(1) in Box 1 is replaced by ln s = ln s0 + Gmntδt,
where δt ∼ 1/(nV s2 ). pUsing this to obtain a differential equation
for s(t), show that s ∼ Gmt/V asymptotically. Assuming that the
separation in energy,
p δE, may be estimated by δE/E ∼ s/R, deduce
that δE/E ∼ t/ts , where ts ∼ V 3 /(G2 m2 n) for a system in virial
equilibrium (Chapter 9).
14
Two-Body Relaxation
138
Theory of two-body encounters 139
integral off at p ∼ Gm/v2 ; it turns out that, when these results are applied
to a star cluster with N stars in approximate dynamical equilibrium, the
term neglected is smaller than the term we shall retain, by a factor of
order log N .
^
v
^
p
Box 1 (continued)
Eq.(2) is not quite accurate enough, even for small-angle scattering,
but can be improved by the following trick. From Eq.(2) it follows that
the angle of deflection is α ' 2G(m1 + m2 )/(pV 2 ), (cf. Box 7.2, Eq.(2)),
while energy conservation shows that the magnitude of ṙ equals V long
after the encounter. Thus a vector of this magnitude has simply been
rotated by this small angle, and so
2G(m1 + m2 ) 2G2 (m1 + m2 )2
δ ṙ = − p̂ − V̂, (3)
pV p2 V 3
where V̂ is a unit vector in the direction of the original relative velocity,
and the second term is a small-angle approximation of −(1 − cos α)V V̂.
The effect of the encounter on the velocity of m1 is
δ ṙ1 = m2 δ ṙ/(m1 + m2 ), (4)
since the barycentre of the two stars moves with constant velocity.
Now we sum the result over many encounters, assuming that different
encounters are independent, as discussed in the text. It is evident that
the average value of p̂ is zero, and so Eqs.(3) and (4) lead to
X X 2G2 m2 (m1 + m2 )
δ ṙ1 = − (ṙ1 − ṙ2 ).
p2 |ṙ1 − ṙ2 |4
In order to perform the sum, let f (ṙ2 ) be the number density, in phase
space, of stars of mass m2 and velocity ṙ2 . The rate at which encounters
occur with stars moving with a range of velocities d3 ṙ2 , with values of
p in the range (p, p + dp), is 2πp|ṙ1 − ṙ2 |f (ṙ2 )dpd3 ṙ2 , and so the mean
value of the sum, over all encounters in time t, is given by Eq.(1) in the
text.
The choice of pmax has given greater difficulty. At one time (Jeans
1929, Chandrasekhar 1942) it was thought that encounters must take
place one at a time, so that only the nearest neighbour of m1 could
contribute to its relaxation. Following more recent treatments, however
(beginning with Cohen et al. 1950 and Hénon 1958), we have assumed im-
plicitly that many relaxing encounters may occur simultaneously. This is
supported by numerical experiments (Farouki & Salpeter 1994, Fukushige
1995). In a finite stellar system it is assumed now that encounters con-
tribute up to a distance p at which the stellar density falls significantly
142 14 Two-Body Relaxation
below its local value∗ . Therefore for a star in the core, i.e. the densest
central part of the system, pmax may be of order the radius of the core.
For a typical star in a nearly spherical system, it may be of the order of
a suitably defined radius, such as the half-mass radius, rh , which is the
radius of an imaginary
R sphere containing the innermost half of the mass.
With this choice, dp/p = log(rh v2 /Gm). Furthermore, for a system in
R v ∼ GM/rh , where M is the total
virial equilibrium, we have typically 2
mass, and so it is usual to choose dp/p = log(γN ), where γ is a constant
of order unity.
By a refinement of the above argument, it was customary for a long
time to take γ = 0.4. More recently, however, comparison with N -body
simulations has consistently suggested a smaller value, around γ = 0.11,
for systems with equal masses (Giersz & Heggie 1994). When there is a
distribution of masses, considerably smaller values are suggested by both
theoretical arguments (Hénon 1975) and numerical experiments (Giersz
& Heggie 1996). Whatever the value of γ, this factor is often referred to
as the Coulomb logarithm, which betrays the analogy with the kinetic
theory of plasmas, where the interaction is electrostatic.
By this stage we have arrived at the result
X Z
4πG2 m2 (m1 + m2 )
δ ṙ1 = −t log γN (ṙ1 − ṙ2 )f (ṙ2)d3 ṙ2 . (2)
|ṙ1 − ṙ2 |3
The remaining integral has a familiar look. Apart from a factor, it is
the same as the formula for the gravitational acceleration due to a mass
distribution with spatial density f . Therefore for an isotropic distribution
of velocities (corresponding to a spherically symmetric distribution of
matter), the result may be written
Z
1X ṙ1
δ ṙ1 = −4πG2 m2 (m1 + m2 ) log γN f (ṙ2 )d3 ṙ2 . (3)
t |ṙ1 |3 |ṙ2 |<|ṙ1 |
The left side of this result is often written dṙ1 /dt, though it must
be remembered that it has a statistical meaning. At any rate it is an
important result. It implies that stars are subject to a deceleration,
which is called “dynamical friction”. Note that, in the case of isotropy,
it is caused by stars moving more slowly than m1 . The strength of this
deceleration increases with the mass m1 . This is unlike the gravitational
acceleration due to a static external body, which would be independent
of m1 , even though dynamical friction is caused by the gravitational
attraction of other stars. To understand this, and to gain a physical
feel for this phenomenon, imagine we move with a star whose velocity is
∗
One could, however, imagine a system, however unrealistic, in which a vast low-
density halo contributed most to relaxation
Theory of two-body encounters 143
high (Figs.1,2). Then the other stars approach predominantly from the
direction in which we are moving. They are deflected by our mass (m1 )
and this forms an enhancement in their density behind us. The larger
our mass the larger the enhancement, and it is this density enhancement
which decelerates us.
Since all stars are subject to dynamical friction, it would seem that all
stars must slow down. Recall, however, that these results are statistical,
and we shall see that they are not even sufficient to describe the main
statistical effect of encounters (cf. Eq.(5), below). It should be added
that the response of a finite, non-uniform stellar system can be a lot
more complicated than Eq.(3), which is a local approximation. Even if
an object orbits a stellar system outside all the members of the system,
so that the local value of f is zero, it will still experience an effective
deceleration caused by the response of the system. A theory dealing with
this and other issues was developed by Tremaine & Weinberg (1984).
Just as Eq.(2) sums the changes in ṙ1 , we may compute the sum of their
P
squares, or, more generally, the tensor δ ṙ1i δ ṙ1j . Looking at Eq.(3) in
Box 1, we see that this involves terms which vary with various powers of
p. Only terms in p−2 yield the Coulomb logarithm; terms which decrease
even more rapidly do not do so, and in fact are smaller by a factor of
this order. It is, however, necessary to treat large-angle scatterings more
exactly to establish this. Such terms are called “non-dominant” terms,
and are usually neglected in the standard theory.
0
y
-1
-2
-4 -3 -2 -1 0
x
Fig. 2. Density contours in the wake behind a fast-moving star. (See Problem
2)
The one result about the tensor which we shall quote is its trace. For
an isotropic distribution f (ṙ2 ) this is given by
Z
1X 1
(δ ṙ1 )2 = 8πG2 m22 log γN ( f (ṙ2 )d3 ṙ2 +
t v1 |ṙ2 |<|ṙ1 |
Z (4)
1
f (ṙ2 )d3 ṙ2 ).
|ṙ2 |>|ṙ1 | v2
Unlike dynamical friction, this result does not involve the mass m1 . This
will be important in Chapter 16. (By the way, just as the integral in
Eq.(3) stems from one analogous to gravitational acceleration, that in
Eq.(4) resembles the expression for the gravitational potential in a spher-
ical distribution. Indeed these integrals are sometimes referred to as
“Rosenbluth Potentials” (cf. Rosenbluth et al. 1957).)
One may continue to characterise the statistical effect of small-angle
scattering by computing the sums of products of degree 3 (and even
higher) in δ ṙ1 . The corresponding integrands decrease sufficiently rapidly
with increasing p, however, that all such terms are non-dominant, in the
sense introduced above.
With Eq.(3) and similar results, other important effects of relaxation
can be analysed. For example the sum of the changes in the specific
Kinetic Equations 145
Kinetic Equations
The information we have obtained describes the statistical effect of re-
laxation due to encounters on a single star. Now we must consider how
this information can be extended to describe its effect on a distribution
of stars. In practice this may be done by starting with the Kolmogorov-
Feller equation
Z
∂
f (ṙ1 , t) = (f (ṙ1 − δ ṙ1 )R(ṙ1 − δ ṙ1 , δ ṙ1 ) − f (ṙ1 )R(ṙ1 , δ ṙ1 )) d3 (δ ṙ1 ),
∂t
(6)
where R(ṙ1 , δ ṙ1 ) denotes the rate at which a star with velocity ṙ1 suffers
encounters which change its velocity by δ ṙ1 . The first term on the right
shows how the distribution function at a particular velocity is increased
by encounters which cause the velocity of a star to change from some
other value, while the second gives the rate at which it decreases as stars
with velocity ṙ1 are scattered to other values of the velocity.
Intuitively obvious though the meaning of this equation is, some fun-
damental assumptions are needed to justify it. For example, it describes
relaxation as a (continuous) Markov process, in which the evolution over
an infinitesimal time interval depends only on the state of f at the begin-
ning of that interval. The considerations of the previous chapter provide
some intuitive foundation for this assumption, though without rigour. On
the other hand the Kolmogorov-Feller equation clearly does not apply on
time scales shorter than the duration of a typical encounter.
It is not yet clear how our statistical information on the cumulative
effect of many encounters may be exploited in Eq.(6), but in fact this is
rather easy. Our small-angle approximation is satisfactory if δ ṙ1 is small,
and a truncated Taylor expansion of Eq.(6) leads to
Z
∂ ∂
f (ṙ1 , t) = − δ ṙ1i f (ṙ1 )R(ṙ1 , δ ṙ1 )d3 (δ ṙ1 )+
∂t ∂ ṙ1i
Z
1 ∂2
δ ṙ1i δ ṙ1j f (ṙ1 )R(ṙ1 , δ ṙ1 )d3 (δ ṙ1 ).
2 ∂ ṙ1i ∂ ṙ1j
146 14 Two-Body Relaxation
Now, except for the factor f (ṙ1 ), the integrals here are the statistical
sums calculated earlier, i.e this equation may be written
X
∂ ∂ 1
f (ṙ1 , t) = − δ ṙ1i f (ṙ1 ) +
∂t ∂ ṙ1i t
X (7)
1 ∂2 1
(δ ṙ1i δ ṙ1j )f (ṙ1 ) .
2 ∂ ṙ1i ∂ ṙ1j t
Note that the terms we have neglected in the Taylor expansion (of de-
gree 3 and higher) are smaller than these terms by a factor of order the
Coulomb logarithm. This is not a very big factor, even for the million-
body problem: for γ = 0.11, ln(γN ) varies from about 6 to about 12
for N in the range from 104 to 106 . Nevertheless, formulations exist
where some of the neglected effects can be accounted for (e.g. Ipser &
Semenzato 1983). Where it has been possible to check, the omission of
these terms seems to make little difference to the resulting behaviour of
a stellar system (Goodman 1983).
Eq.(7) is a form of Fokker-Planck equation. It is a diffusion equation
with a drift term, the first term on the right side resulting from the
action of dynamical friction. An important property of this equation
is that a Maxwellian distribution is a solution. This solution does not,
however, provide a thermal equilibrium solution to the structure of a
stellar system, for two reasons. One is that, as the system relaxes, the
evolution of the distribution of velocities leads to changes in the spatial
distribution of matter; in turn these cause the potential to vary, and
this changes the energies of the stars in addition to the changes due to
two-body relaxation. The problem of incorporating the effect of this
process on the evolution of the distribution function will be discussed in
a moment. The second reason why the Maxwellian distribution fails to
provide an equilibrium model is the boundary conditions: in a stellar
system stars may escape, especially those with high energy. The way in
which a system reconciles these two inconsistent requirements, thermal
equilibrium and boundary conditions, is the subject of chapter 17.
The most important lesson to learn from Eq.(7) is the time scale on
which two-body relaxation acts. In fact, from Eqs.(3), (4) and (7) it can
be seen that this is of order v3 /(G2 m2 n log γN ), where v is a typical
speed, m is a typical mass, and n is the number density of stars∗ . This is
an estimate of what is called the “time of relaxation”, though the precise
definition is a matter of convention; most authors follow Lyman Spitzer,
∗
R
Recall that f is the number density in phase space, so that n = f d3 ṙ
Kinetic Equations 147
Here P is the period of radial motion of a star with these parameters, and
the subscript indicates that the time derivative is taken with the radial
action Ir fixed.
The effect of two-body relaxation is simply to contribute an additional
encounter term to the right-hand side of Eq.(10). We cannot simply
express Eq.(10) in terms of f and then add the right side of Eq.(7),
however, as Eq.(7) deals with functions of velocity and position, whereas
the left side of Eq.(10) involves functions of E1 and J1 . One way of
proceeding is to average the collision term around an orbit of a star with
energy E1 and angular momentum J1 . It is, however, easier to approach
Relaxation and Orbital Motion 149
the problem less directly (Box 2), though the result is the same. It
is referred to as the orbit-averaged Fokker-Planck equation, though the
essential idea goes back to Kuzmin (1957).
There are two formulations of the resulting equation which look rela-
tively elegant and memorable. One, which we learned about from Donald
The effective upper limit of the second integral will usually be finite,
because f will vanish above a certain energy.
Averaging a function F (r) at energy E1 (i.e. over an energy hypersur-
face in phase space) means performing the integral
R
F δ(E1 − v12 /2 − φ)d3 r1 d3 v1
hF i = R .
δ(E1 − v12 /2 − φ)d3 r1 d3 v1
The denominator is just ∂s/∂E1 , and cancels in the expression for DE
(Box 2, Eq.(3)). Integrating with respect to v1 and then leaving only the
integration with respect to r ≡ |r1 | results in
Z φ−1 (E1 ) q
∂s
hF i = 16π 2 2(E1 − φ)F r2 dr.
∂E1 0
∂s
Setting F = 1 gives a convenient expression for .
∂E1
Applying
√ this averaging procedure to Eq.(1) we readily find that the
factor E1 − φ cancels out, and the first term in Eq.(1) yields the ex-
∂s
pression for , i.e. the derivative of the phase-space volume at energy
∂E2
E2 . The result is the surprisingly compact and symmetric expression
Z Z !
E1 ∂s ∂s ∞
DE = 16π G m ln Λ −
2 2 2
f (E2 )dE2 + f (E2 )dE2 .
φ(0) ∂E2 ∂E1 E1
Relaxation and Orbital Motion 151
Box 3 (continued)
A similar sequence of calculations needs to be performed for the coef-
ficient DEE . There are no surprises, but it helps to have the formula
Z −1
16π2 φ (E)
s(E) = {2(E − φ)}3/2 r2 dr (2)
3 0
at hand. The result is that
(Z Z )
E1 ∞
2 2 2
DEE = 32π G m ln Λ s(E2 )f (E2 )dE2 + s(E1 ) f (E2 )dE2 .
φ(0) E1
(3)
These results are now to be substituted into Box 2, Eq.(2). It is again
surprising how much the right-hand side simplifies, if one performs inte-
grations by parts to remove derivatives of s, and a little further
creative
∂f ∂s
manipulation. Also the left hand side may be written as ,
∂t s ∂E
where the subscript denotes a derivative at fixed s, but then we divide
by the second factor to reach Eq.(11).
152 14 Two-Body Relaxation
Problems
1) In Fig.1 let r, θ, φ be spherical polar coordinates centred on the mas-
sive body, which has mass m1 . Suppose stars are treated as test
particles and approach at speed V on paths parallel to the axis θ = 0
when at a large distance. Show that the path of a star with impact
parameter p is given by
p2
r= ,
p sin θ + a(1 − cos θ)
where a = Gm1 /V 2 . (Note that two values of p, of opposite sign, cor-
respond to each value of the pair (r, θ).) By using conservation of flux
between paths with impact parameter p and p + dp, and conservation
of energy, show that the density at r, θ is given by
s 2
1 ∂r
|p| 1 +
ρ r ∂θ
=s ,
ρ0 2a ∂r
1+ r sin θ
r ∂p
where ρ0 is the density far upstream. Adding the results for the two
values of p, show that
ρ r2 sin2 θ + 2ra(1 − cos θ)
= q .
ρ0 r sin θ r2 sin2 θ + 4ra(1 − cos θ)
X
2) Write down an expression for (δE1 )2 which is analogous to Eq.(5).
By means of Box 1, Eqs.(3) and (4), remove all non-dominant terms.
For the case in which the distribution f (ṙ2 ) is isotropic and both
masses are equal, deduce that
Z
1X 32π 2 G2 m2 ln Λ 1 v1
(δE1 )2 = ( v24 f (v2 )dv2
t 3 v1 0
Z ∞
+ v12 v2 f (v2 )dv2 ).
v1
154
Removing the collision singularity 155
of variable
x = − sin2 θ/h, (2)
and then we find that
(−2h)3/2
θ̇ = . (3)
4 sin2 θ
if we make an appropriate choice of sign. Hence
2θ + sin 2θ = (−2h)3/2(t − t0 ), (4)
where t0 is a constant. (This is a form of Kepler’s equation.)
Eqs.(2) and (4) are a parametric form of solution. They may not look
very elegant, but this solution has some remarkable features. First, the
singularity at x = 0 is no longer evident: the solution can be computed
for any time. Second, if we think of θ as the independent variable, we
may also write the solution (Eq.(2)) as x = −(1 − cos 2θ)/(2h), which is
simple harmonic motion with the centre of attraction displaced from the
origin. Therefore, if we transform from t to θ in the equation of motion
we may expect to arrive at the equation of a displaced simple harmonic
oscillator. In fact Eqs.(2) and (3) show
p that this transformation is given
in differential form by dt/dθ = x −2/h. Though the square root is
unpleasant, we can easily scale θ to remove it.
All this suggests the introduction of a new independent variable τ de-
fined by
dt/dτ = r, (5)
where r = |x|. Though τ looks like a scaled version of θ its status is ac-
tually quite different: τ will play the role of a transformed time-variable,
whereas θ was a transformed spatial coordinate. Anyway, the transfor-
mation of Eq.(1) is very simple, and indeed leads to an equation of the
expected form:
x00 = 2hx + 1,
where a prime means d/dτ . This also holds when h ≥ 0, and again the
remarkable feature of this equation is the removal of the singularity at
x = 0. Though the expression for h is singular at x = 0, it need not be
evaluated anywhere except for the initial conditions, since it is constant.
The procedure we have outlined is easily extended to the perturbed
case, and to three dimensions. Indeed it yields a practical equation for
the solution of the perturbed two-body problem (Burdet 1967). If we
simply transform from t to τ , defined by Eq.(5), it is a straightforward
exercise to show that the three-dimensional unperturbed Kepler problem
transforms to
r00 = 2hr − e, (6)
156 15 From Kepler to Kustaanheimo
where
e = ṙ × (r × ṙ) − r/r. (7)
This vector, like h, is a constant of the motion. It is usually called the
Lenz or Runge-Lenz vector. Again we notice that Eq.(6) is displaced
simple harmonic motion.
Most of this chapter is concerned with a different link between the
Kepler problem and the simple harmonic oscillator. Again it is suggested
by thepsolution of the one-dimensional problem, Eq.(2), where we notice
that x(θ) also yields simple harmonic oscillation (this time without
displacement) if θ is used as independent variable. This suggests the
introduction of a new dependent variable X defined by
x = X 2, (8)
in addition to the transformation from t to τ defined by Eq.(5). The
transformations are most easily carried out in the energy equation, and
then differentiation leads to the expected equation
d2 X 1
2
= hX, (9)
dτ 2
Note that the frequency is half that obtained in the previous transfor-
mation. In a way this is obvious from Eq.(8): one oscillation of X gives
two oscillations of x. There is also, however, a geometric significance to
this observation, as will become clear as we proceed. It also accounts for
the fact that Eqs.(6) and (7) are not the preferred route for dealing with
two-body encounters in N -body simulations: Eq.(9) and its descendents
are, in this sense, twice as efficient.
The tricky step with the transformation leading to Eq.(9) is its exten-
sion to three dimensions, and we proceed by way of the planar case. In
two dimensions Eq.(8) is easily generalised if we think of a 2-vector as
a complex number, z. Then the equation of motion is z̈ = −z/|z|3 , the
energy is h = |ż|2 /2 − 1/|z|, and we define new variables by z = Z 2 and
dt/dτ = |z|. Direct substitution yields the rather unpromising equation
2
d2 Z Z 1 dZ
=− + ,
dτ 2 2|Z|2 Z̄ dτ
but we see that the right side simplifies if we transform also the expression
for h. In fact if a prime denotes differentiation with respect to τ , we find
that Z 00 = hZ/2, in precise analogy with Eq.(9).
The transformation we introduced (a step due to Levi Civita 1906)
has a strong geometrical flavour. Without this transformation, Kepler
orbits passing close to the origin are nearly parabolas. The singularity
in the transformation z = Z 2 at z = 0 approximately straightens these
KS regularisation 157
curves out∗ . Indeed when h = 0 the curves are exactly parabolas, and the
motion in the Z-plane is uniform and rectilinear (Fig.1). Furthermore,
if the orbit has h < 0, one revolution in the Z plane corresponds to two
in the z plane: after a single revolution in the z-plane the Z-orbit moves
from one Riemann sheet of the transformation to the other, and after two
revolutions it moves back again.
KS regularisation
It took about 50 years before Levi Civita’s transformation was extended
to the practical case of the 3-dimensional Kepler problem. This delay
seems remarkable when one learns that the requisite tool (quaternions)
had been invented as long ago as 1843. Furthermore, they were invented
by Hamilton (1853) in order to provide a means of performing algebra
with 3-vectors; our requirements here are not dissimilar, i.e. to be able
to take the square root of a 3-vector. Another point of historical interest
is that, when the 3-dimensional extension eventually was carried out, it
was not done in the language of quaternions, but in terms of spinors.
So strongly did this approach hold sway that it was even said that the
requisite transformations could not be expressed in quaternions. Only in
1991† was the full story told, almost 150 years after Hamilton’s discovery.
z plane Z plane
c
c
C
BF D B
E
CG b AE b A
DH F H
G
a a
Fig. 1. Keplerian orbits in two dimensions (left) and in the transformed plane
(right). One circular and one parabolic orbit are shown, and the letters in the
two diagrams show corresponding points
∗
We might say that it “unfolds” the orbit, but this word is loaded mathematically.
It would be more accurate to say that it “blows up” the singularity, as McGehee
(1974) did so famously for the collinear 3-body problem. See also Waldvogel (1982)
for a similar approach to the non-collinear problem.
†
Neutsch (1991). The account in Vivarelli (1985) omits only the role of our Eq.(15)
158 15 From Kepler to Kustaanheimo
In the following account of the theory we shall try to convince the reader
that he could have found his way there with only the most rudimentary
knowledge of quaternions (as in the outline solution to Problem 1, cf.
Appendix B).
The usual convention is that a three-vector (x, y, z) corresponds to the
quaternion ix + jy + kz, where {1, i, j, k} are four basis vectors of the
space of quaternions. Now the real form of Levi Civita’s transform, in
obvious notation, is x = X 2 − Y 2 , y = 2XY . Let us try to treat the
space of quaternions spanned by i and j in the same way, i.e. we write
ix + jy = i(X 2 − Y 2 ) + 2jXY . The right side is easily factored as
i(X − kY )2 , which can be written in the form
ix + jy = (X + kY )i(X − kY ). (10)
The latter form is very convenient for handling rotations, and, now
that we know how to handle one plane with quaternions, we can handle
any plane, simply by rotating it into position. For example, to rotate a
quaternion ix + jy into the quaternion ix + ky (in the plane spanned by
i and k) we pre- and post-multiply as follows (cf. Problem 1):
1+i 1−i
√ (ix + jy) √ = ix + ky. (11)
2 2
1 1
ix + ky = √ (X + iX − jY + kY )i √ (X − iX + jY − kY ).
2 2
ix + jy + kz = qiq̄, (12)
q = q0 exp(iφ), (13)
KS regularisation 159
∗
Note the analogy with the local Abelian U(1)-gauge invariance of quantum electro-
dynamics.
160 15 From Kepler to Kustaanheimo
Numerical regularisation
Though collisions of point masses should not occur in computer simu-
lations of the million-problem, it is a common experience that numerical
methods behave badly even in the vicinity of a singularity. In N -body
simulations a moderately eccentric close binary is close enough to cause
Numerical regularisation 161
Problems
1) If α = ix + jy, show that x = −(αi + iα)/2 and ky = (iα − αi)/2.
Hence obtain Eq. (11).
2) Let qa = ia1 + ja2 + ka3 , where a1 , a2 and a3 are real, and similarly
define qb = ib1 + jb2 + kb3 . Show that qa qb = −a.b + [a, b], where a
is the three vector (a1 , a2 , a3 ), b is defined similarly, a.b is the usual
dot (scalar) product, and [a, b] is the quaternion corresponding to the
usual vector product.
3) A quaternionic view of circular motion
(i) By squaring Eq.(12) show that |r| = kqk2 , with obvious notation.
(This shows that motion on the unit sphere |r| = 1 in space maps to
the sphere kqk = 1 of unit quaternions. Now we construct a way of
visualising the latter.)
(ii) Show that stereographic projection from the quaternion 1 to the
space of imaginary quaternions (spanned by i, j, k) maps q0 +iq1 +jq2 +
162 15 From Kepler to Kustaanheimo
kq3 to (iq1 +jq2 +kq3 )/(1−q0 ). If the unit sphere (q02 +q12 )+(q22 +q32 ) =
1 is parameterised in an obvious way as
q0 = cos θ cos φ
q1 = cos θ sin φ
(18)
q2 = sin θ cos ψ
q3 = sin θ sin ψ,
show that this point maps to the quaternion equivalent to the 3-vector
R = (cos θ sin φ, sin θ cos ψ, sin θ sin ψ)/(1 − cos θ cos φ). (19)
This gives us a way of visualising a unit quaternion, but note that the
coordinates are singular when θ is a multiple of π/2.
(iii) Show that the quaternion given by Eq.(18) maps, via Eq.(12),
to the quaternion equivalent to the 3-vector
(Thus for unit vectors and quaternions, the KS map can be thought
of as the transformation from R to r, given by these formulae.)
(iv) Show that the point
R = (cos θ sin φ, sin θ sin φ, sin θ cos φ)/(1 − cos θ cos φ), 0 < φ < 2π
5) For the three-body problem in the plane with equal masses, let the
complex numbers xi , i = 1, 2, 3, be the position vectors of the three
bodies. Interpret geometrically the numbers
z1 = x3 − x2
1
z2 = x1 − (x2 + x3 ).
2
Finally define three real numbers ui by
u1 = |z1 |2 − |z2 |2
u2 + iu3 = 2z̄1 z2 .
Show that the ui are independent of rotation (and are therefore an
appropriate parameterisation of the shape of the triangle formed by the
three bodies), and that the map (z1 , z2 ) → (u1 , u2 , u3 ) is essentially
the KS map.
PART V.
GRAVOTHERMODYNAMICS:
N = 106
164
16
Escape and Mass Segregation
Escape
We consider the case of an isolated stellar system. For this case a star
with speed v will almost certainly escape if v2 /2 + φ > 0, where φ is the
smoothed potential of the system at the location of the star, with the
convention that φ → 0 at infinity. The exceptions are binary components
(for which the true potential differs significantly from the smoothed po-
tential), and an escaper which, on its way out, interacts with another
star in such a way that its energy once again becomes negative. The
latter possibility is rare in large systems (King 1959), precisely because
two-body relaxation takes place on a much longer time scale than orbital
motions (Chapter 14).
Several mechanisms may be responsible for the generation of an esca-
per. It may result from three- or four-body encounters involving binaries
(Chapter 23). It may happen that loss of mass by stars through their
internal evolution makes the potential well of the cluster sufficiently shal-
low that weakly bound stars become escapers. Here, however, we consider
165
166 16 Escape and Mass Segregation
We first compute the rate at which a star of given mass, velocity and
position experiences encounters which would cause it to escape. This is
Z
Q= V f (v2 )|dp̂|pdpd3 v2 , (1)
the integral being taken over the domain defined in a very implicit way
by text Eq.(1); here f is the distribution of stellar velocities, normalised
to the space density of stars. To make progress, we notice first (from
text Eqs.(2) and (4)) that, for given initial velocities of the two stars,
δ̂v1 is determined by p̂ and p. In fact in Eq.(1) we can transform the
integration with respect to these two variables to give
Z
2 2 f (v2 ) 2
Q = 4G m d δ̂v1 d3 v2 , (2)
(δv1 )3
where we have also used text Eq.(3).
Escape obviously depends on the magnitude of δv1 , which we denote
by δv1 , though it is not the change in v1 . Now from text Eqs.(3) and (4)
it follows that
α
(v1 − v2 ).δv1 = −V 2 sin2 ,
2
whence
v2k = v1 .δ̂v1 + δv1 , (3)
where the subscript k denotes the component parallel to δ̂v1 . This sug-
gests the use of cylindrical polar coordinates for v2 , with δ̂v1 as polar
axis. But since we assume that the distribution of v2 is isotropic it is also
advisable to adopt v2 as one of the variables. In fact an unorthodox but
useful choice is v2k (or δv1 ), v2 and φ, where φ defines the orientation of
v2 around δ̂v1 (i.e. the cylindrical polar angle), and so from Eqs.(2) and
(3) we find that
Z
f (v2 ) 2
Q = 4G2 m2 d δ̂v1 dδv1 dφv2 dv2 , (4)
(δv1 )3
though we have to add the restriction v2 > |v2k |, i.e.
Box 1 (continued)
So far we have not used the fact that we wish to compute the escape
rate. In fact this expression gives a general result for the rate of encoun-
ters leading to a given change in the velocity of the star. Now we add
the condition, text Eq.(1), and the integral with respect to δv1 is most
easily carried out using spherical polar coordinates with v̂1 as polar axis.
If the polar angle is denoted by θ, the conditions (5) and text Eq.(1) may
be expressed as
q
−v1 cos θ + v12 cos2 θ − 2E1 < δv1 < −v1 cos θ + v2 . (6)
Integration with respect to δv1 now yields
Z !
1 1
Q = 4πG m 2 2
f (v2 )v2 dv2 2π sin θdθ 2 − 2 ,
δv1min δv1max
where δv1min/max are the limits in Eq.(6), and the integration with re-
spect to θ is restricted to the range cos2 θ < (2E1 + v22 )/v12 . In this way
we arrive at text Eq.(5).
In order to evaluate the total escape rate over the entire cluster, we
must integrate over the initial velocity and position of the escaper. These
integrations hold no terrors, and the result may be expressed as
dN 256π 4 G2 m2 √
=− 2
dt 3
Z Z (6)
2 f1 f2 (E1 + E2 − φ)3/2
r dr dE1 dE2 ,
E1 +E2 >φ E12
where the distribution function f has been written as a function of the
energy E for both stars, and φ(r) is now the potential at radius r.
This result is more elegant than its proof, which quickly becomes in-
tractable if one sets a foot wrong anywhere along the route. It also reveals
the competing effects which control the net escape rate. The denominator
favours stars which are nearly unbound, as does the large spatial volume
which such stars occupy. On the other hand f is very small near E = 0.
Nevertheless, in models for which the calculations have been carried out,
the bulk of escapers are already weakly bound just prior to the encounter
which ejects them.
Hénon’s formula appears to work well when the assumptions behind it
are satisfied, i.e. the distribution of velocities is isotropic. It differs qual-
itatively, however, from many other formulae for the escape rate (those
Mass segregation 169
summarised in Giersz & Heggie 1994a, for example), which are based on
the time of relaxation. For example, for a Plummer model Hénon’s result
is Ṅ = −0.00088N/(trh ln Λ), while the classic result of Spitzer & Härm
(1958) for a square-well potential is Ṅ = −0.0085N/tr (Spitzer 1987).
The difference in the coefficient (even though this cannot reasonably be
explained by different definitions of the relaxation time) is not the issue.
The issue of principle is indicated by the fact that the Coulomb loga-
rithm only appears in one of the formulae. Hénon argued that escape
by diffusion does not work, because an increasingly weakly bound star
spends longer and longer on an orbit well outside the core, and relaxation
is ineffective; therefore escape is dominated by single-encounter events.
In fact a synthesis of the two basic ideas seems to work best: relaxation
gives rise to a halo of loosely bound particles (on the relaxation time
scale), which can then become unbound in a single passage through the
core (Spitzer & Shapiro 1972).
What is certain is that, in evolving systems, the spatial structure and
the velocity distribution of the stars both change with time. Even if we
continue to assume that the distribution of velocities is isotropic, it is
found that the rate of escape increases as the core contracts and becomes
denser (Chapter 17; see Giersz & Heggie 1994a). In fact, the distribution
also becomes increasingly anisotropic (if it was isotropic initially), and
this enhances the escape rate, in the sense that, in a series of models
which have the same spatial structure but different anisotropy, the escape
rate increases with the amount of radial anisotropy. For these reasons,
if one computes Hénon’s escape rate for a Plummer model (Problem 1),
it is a poor guide to the escape rate from a system which starts life as a
Plummer model.
This discussion summarises the case of isolated systems of equal masses.
As we have seen, escape favours stars of low mass. For a system consisting
of equal numbers of stars of some fixed mass and some very small mass,
the low-mass stars escape about 30 times as fast as the high-mass stars
(Hénon 1969). The total escape rate is enhanced by a similar factor
(compared to the case of equal masses) for more realistic mass functions.
When we place the cluster in a tidal field the rate of escape is further
enhanced, but it also becomes a more complicated dynamical problem
(Chapter 12).
Mass segregation
The preferential escape of stars of low mass is an extreme manifestation
of a tendency in all two-body encounters between stars of different mass,
even when neither escapes. It is most easily understood as the tendency
towards equipartition of kinetic energies, which ideally leads to the con-
170 16 Escape and Mass Segregation
∗
Earlier in the chapter E denoted the total energy (per unit mass)
Mass segregation 171
Problems
176
17 Gravothermal Instability 177
∗
Strictly, the system will evolve very slowly by the formation and hardening of a
binary (Chapter 5). The time scale is very long, however, of order N relaxation
times at least (cf. Problem 21.3).
178 17 Gravothermal Instability
0.6
0.4
0.2
0
x
-0.2
-0.4
d2 W ∂ 2 U dx ∂ 2 U
= + .
dλ2 ∂x∂λ dλ ∂λ2
If this is combined with Eq.(4) it follows that
!2
∂2U
d dW ∂x∂λ ∂2U
=− + ,
dλ dλ ∂2U ∂λ2
∂x2
and so the change of equilibrium can be detected by plotting dW/dλ
against λ and looking for a vertical tangent. If there are many degrees
of freedom with generalised coordinates {xi }, then the same conclusion
follows, the calculation being most easily carried out in coordinates which
diagonalise the second variation of U (x, λ) at fixed λ.
Now how do we apply all this to the gravothermal problem? In fact
almost everything is in place. Let us consider the series of equilibria
corresponding to a fixed total mass (M ) within an enclosure of fixed
radius (and therefore fixed volume, V ). For U we take the entropy S,
and the series of equilibria can be parameterised as we please, but it is
convenient to use the total energy E for the parameter λ. The reason
for this is that the quantity to be plotted, i.e. (∂S/∂E)M,V , then has a
simple interpretation: it is the inverse of the temperature, T . (Recall our
proof in Box 1 that δS = 0 in thermal equilibrium, which followed from
the result δE = T δS if T is uniform.) Therefore we plot 1/T against E
and look for vertical tangents in the resulting curve.
The result is shown in Fig.2. Several parts of this result can be un-
derstood without numerical computation. In the almost uniform, weakly
gravitating limit, it follows from Eq.(1) that T −1 ' 3M k/(2mE), which
yields the part of the curve going off to the right. Similarly the singular
isothermal solution and its asymptotic form (Problem 8.2) give rise to
the spiral.
Now we proceed from the right-hand end of the series of equilibria,
where the models are certainly stable, looking for a vertical tangent. It
is already evident that such points exits, and we need numerical methods
only to locate the first one. In fact it occurs when the scaled radius of
the enclosure is ze = 34.2, approximately. Stellar dynamicists remember
more easily the corresponding value of exp(ue ) = 1/709, cf. Eq.(8.4),
which is the ratio of the space density at the edge to the central value.
This is the last stable isothermal model inside a spherical, adiabatic con-
tainer. Models with a larger density contrast are unstable thermally.
182 17 Gravothermal Instability
2.4
2.2
1/T 2
1.8
1.6
1.4
1.2
1
-0.2 0 0.2 0.4 0.6 0.8
E
Fig. 2. The linear series of isothermal models of fixed mass and volume, in
suitable scaled units. Compare with Chapter 4, Fig.1.
After all this heavy analysis and theory, it is a relief to realise that the
physical origin of the instability is very easy to understand, provided that
one reads the appropriate paper (Lynden-Bell 1968). The arguments have
already essentially been deployed in Chapter 5. Imagine what happens if
a little thermal energy is transferred from the inner part of an enclosed
isothermal system to the outer part. The inner part should cool, but
the gravitational field causes the stars there to drop into lower orbits,
where they move faster than before. Just like a satellite spiralling slowly
to Earth under atmospheric friction, the net effect is that the stars are
moving faster, and the system is hotter, hotter even than at a time before
the heat was transferred. The thermal energy transferred to the outer
parts has a more familiar effect: these parts are held in not by gravity
but by the wall of the enclosure. Just as in a non-self-gravitating gas
in an enclosure, addition of thermal energy pushes up the temperature.
How much the temperature goes up depends on the heat capacity of
the outer part of the system. The larger and more extensive the outer
halo, the smaller the rise in temperature. For a sufficiently large halo
(sufficiently large re ) the rise in temperature of the outer part is smaller
than that of the inner part. Then a temperature gradient has been set
up, more heat flows from the inside to the outside (provided that the
gas is conducting), and the temperature gradient is enhanced. There is
a runaway, the famous “gravothermal catastrophe”.
This simple physical picture is suspicious, because it now seems as
though the boundary condition, an adiabatic rigid enclosure, plays a cru-
cial role. To show qualitatively that the boundary condition is not playing
an essential role, let us consider an isothermal model with a very large
value of re , i.e. very close to the centre of the spiral pattern in Fig.2.
There is a huge range of densities in this model. Therefore there is a
17 Gravothermal Instability 183
huge range in the time scale for the convective transport of heat. On the
time scale for heat transport in the inner parts, the outer parts act effec-
tively as an insulating envelope. Also, the amount of mass in the inner
parts is so small that adjustments in the structure there have little effect
on the outer parts, which are also effectively a rigid barrier. In other
words, the outer parts of the system have the properties of the boundary
which we artificially inserted in order to make our model precise. It fol-
lows that the central parts of an isothermal system will be subject to the
gravothermal instability, and that the boundary conditions far out in the
system are irrelevant.
Precisely how the growth of the instability takes place is beyond the
scope of the above analysis. In particular, it depends on the details of
the mechanism by which heat is transported (Makino & Hut 1991). But
various calculations confirm that the “first” isothermal model in which
thermal inhomogeneities can grow is one in which the density contrast is
approximately the famous 709. This has been shown not only for mod-
els which treat the system as a self-gravitating, conducting, perfect gas
(Nakada 1978), but also for Fokker-Planck calculations (Inagaki 1980).
As we have said, however, the precise boundary conditions are not strictly
relevant to the behaviour of self-gravitating systems, and the critical value
of the density contrast is not the main issue.
There is another way of looking at the role of boundary conditions. So
far we have imagined that energy can be exchanged only between one
part of the system and another. Suppose now that the entire system may
exchange energy with its surroundings. We imagine that the enclosure is
still rigid and allows no mass in or out, but that the system can exchange
energy with its surroundings, which are held at constant temperature.
Then the heat capacity of the system is simply ∂E/∂T . It is therefore
subject to the kind of instability outlined in Chapter 5 as soon as the
tangent to the curve in Fig.2 becomes horizontal. Not surprisingly, this
occurs for a smaller density contrast than in the previous case, because
now the enclosure is taking on the thermodynamic role previously played
by the outer parts of the system. The distinction between the two situa-
tions can also be described in terms of the appropriate ensemble, in the
sense of statistical physics: the microcanonical ensemble for systems of
fixed energy (the first case, in which the enclosure is adiabatic), and the
canonical ensemble for systems of fixed temperature (the second case.)
There is one final twist to this tale. We gave a physical explanation for
the gravothermal catastrophe by contemplating a transfer of heat from
the inside to the outside. But instabilities can grow in either direction;
what happens if heat is transferred from the outside to the inside? It is
not hard to see that both the inside and the outside cool down, but again
the cooling of the outer parts is slight if they are sufficiently massive and
184 17 Gravothermal Instability
Problems
1) A simple approximate model of an isothermal system in an enclosure
of radius re is defined by the density
ρ0 , if r < r0 ;
2
ρ(r) = r0
ρ0
, if r0 < r < re .
r
Compute M (r), i.e. the mass within radius r, and deduce that the
potential energy is
x GMe2
W =− (45x − 30 ln x − 42) ,
5(3x − 2)2 re
where Me is the total mass within the radius re and x = re /r0 > 1.
Using the appropriate form of virial theorem (Problem 9.1, where T
denotes the kinetic energy), compute the temperature and the total
energy. Hence plot the diagram analogous to Fig.2. Show that the
sequence includes models with positive and negative heat capacity.
2) In an enclosed isothermal model with enclosure radius re , the kinetic
3M
energy is kT , where T is the temperature. Use the appropriate
2m
form of the virial theorem, as in Problem 1, to show that the energy
is given by
0
Ere 1 u+ 3u ,
= e
GM 2 u02 2z
in the notation of the isothermal equation (Eq.(8.6)). By solving the
latter equation numerically, and locating the minimum of Ere /(GM 2 )
for varying z, show that the first unstable model corresponds to a
density contrast e−u = 709 approximately.
3) Does a two-dimensional self-gravitating system, in which the interac-
tion potential between two masses m is 2Gm2 ln r, exhibit gravother-
mal instability? Think about this first from the point of view of the
virial theorem (Problem 9.2). Then use the analytical solution (Prob-
lem 8.3) to carry out an analysis as in Fig.2. See Katz & Lynden-Bell
(1978).
18
Core Collapse Rate
for Star Clusters
The last two chapters have assembled most of the qualitative arguments
by which the evolution of the core of a stellar system can be understood.
In summary, the tendency towards equipartition drives the more mas-
sive stars to smaller radii. Unless their total mass is sufficiently small,
equipartition cannot be reached by the time the heavier stars become es-
sentially self-gravitating. When that happens they are eventually subject
to the gravothermal instability. It is the purpose of the present chapter
to flesh out this outline, but we shall do so in two passes, as it were.
First we shall examine the timescales on which these processes act, and a
number of factors which modify the simple picture; and we shall explain
the qualitative nature of the resulting evolution. Then we turn to a more
detailed description of one case which has been studied in great detail:
self-similar collapse in systems with stars of equal mass.
185
186 18 Core Collapse Rate for Star Clusters
cesses are associated with equipartition. Indeed, if one uses some detailed
model (Chapter 9) to follow the evolution of a multi-component model
(i.e. one in which the particles do not all have the same mass), the first
thing one observes is that the heaviest particles drift towards the centre,
increasing the central density and the central proportion of massive par-
ticles (Fig.16.1). The greater the individual masses of the most massive
particles, the faster the evolution.
There are factors which modify this picture in more realistic models
of stellar systems. In the first place, the most massive stars also evolve
(internally) most rapidly (Chapter 11). In the very rapid, late stages of
their evolution, they lose most or all of their mass. Therefore the range
of individual stellar masses shrinks, and this in turn affects the relevant
timescale te . The second point to notice is that the most massive species
may well consist of binaries with massive companions. Though the effects
of close interactions between binaries and single stars are complicated
(Parts VI and VII), in distant encounters a binary behaves essentially
like a single star with a mass equal to the total mass of its components.
Eventually the tendency towards equipartition slows down drastically.
As already mentioned, this does not mean that equipartition has been
achieved; rather, the central parts of the system are now dominated by
the most massive stars, and, as a first approximation, we may consider
that the role of the less massive stars is unimportant. As a step towards
understanding this new situation, we proceed to discuss the evolution of
the core in a single-component system, in which all stars have the same
mass. This problem is of obvious interest in its own right, and it has been
rather well studied.
First let us consider the situation discussed in Chapter 17, i.e. an
isothermal model enclosed in a spherical container. As we saw, this equi-
librium is unstable only if the density contrast (between the centre and
the enclosure radius) exceeds a factor of about 709. At the enclosure
radius the relaxation time is trc /709, where trc denotes the central value.
Therefore we may expect that the time scale on which the instability acts
is at least of this order. In fact it may be much longer, as the system is
nearly isothermal, which reduces the thermal flux.
Now we should consider a more normal stellar system, still consisting
of stars of equal mass, but bound only by their self-gravity, and not by
any external wall. Such a system cannot be isothermal, and so the flux
of heat which drives evolution of the core may be stronger than in the
near-isothermal situation considered previously. Qualitatively, however,
the result is the same: the centre contracts and becomes hotter. At the
higher densities which result, the relaxation time decreases, and in time it
becomes much shorter than at the larger radii where the heat flux drove
the initial contraction. By the time this happens, the hot, central region
The big picture 187
Fig. 1. Core collapse from a King model with W0 = 3, N = 1285. The initial
model is shown on the left, and the view on the right shows the same model to
the same scale after core collapse. The model has a tidal cutoff.
188 18 Core Collapse Rate for Star Clusters
also altered if the system is rotating (Lagoute & Longaretti 1996, Einsel
& Spurzem 1999, Kim et al. 2002).
heat capacity. If we were to derive the standard heat equation for such
a material, we would obtain the backward heat equation.
For the star cluster problem, derivation of the self-similar solution re-
quires a modicum of numerical integration, and our treatment will confine
itself to some basic observations and a brief description of the results. In
the first place let us note that the time scale on which core parameters
such as ρc vary is trc , the central relaxation time. In self-similar evolution
it follows that trc ρ̇c /ρc is constant, and the same is true for other core
parameters. It follows that these parameters will vary with each other
as power laws (the powers being determined by the various constants)
and, if we ignore variations of the Coulomb logarithm, that trc is a linear
function of time.
Let us suppose, then, that
ρc ∝ rc−α , (1)
where α is some constant. Since we are supposing that the evolution is
self-similar, the density profile at any time (i.e. the function ρ(r, t) at
some t) may be obtained from the profile at any other time by scaling.
In other words,
ρc (t1 ) rc (t0 )
ρ(r, t1 ) = ρ r, t0 . (2)
ρc (t0 ) rc (t1 )
Now at large radii, where the relaxation timescale is huge compared with
that in the core (or even with the evolution time scale), the density profile
must be almost stationary. By substituting (1) into Eq.(2) it follows that,
at large radii,
ρ(r) ∝ r−α . (3)
This is the density profile that must be left behind by the outlying ma-
terial of the contracting core.
Now we are in a position to say something about α on physical grounds.
From Eq.(3) it follows that the mass at large radii varies as r 3−α , and
so the gravitational acceleration F varies as r 1−α . By the condition of
hydrostatic balance, i.e.
∂
(ρσ2 ) = ρF,
∂r
it is easy to deduce that the velocity dispersion must vary as r 2−α . Since
the collapse of the core is driven by a negative temperature gradient it
follows that α > 2.
Now the velocity dispersion in the core is related to ρc and rc by the
definition of the core radius (Eq.(8.9)), and so it follows that
σc2 ∝ rc2−α . (4)
190 18 Core Collapse Rate for Star Clusters
Also, the energy contained within the core is of order ρc rc3 σc2 , and by
Eqs.(1) and (4) we see that this varies as rc5−2α . It is physically reasonable
to suppose that this must remain finite as the core radius contracts, and
so α < 5/2.
It has been noted (Lancellotti & Kiessling 2001) that such a solution
would not be a self-similar solution of the full Fokker-Planck equation.
But this requirement is too strong, for it requires that the orbital motions,
as well as the collapse of the density profile, take place on the same time
scale. Here we are studying collapse, on a relaxation time scale, of a
system where orbital motions (on the crossing time scale) are much faster.
Then it is appropriate to solve only the orbit-averaged Fokker-Planck
equation, for which a wider class of self-similar solutions are possible.
Since collapse is driven by relaxation, we may estimate that the time
to collapse, τ , is proportional to the relaxation time, i.e. τ ∝ σc3 /ρc
(Eq.(14.8), except that we ignore the Coulomb logarithm). By Eqs.(1)
2
and (4) we deduce that rc ∝ τ 6 − α . For our range of estimates of α the
power here lies between 0.5 and about 0.57.
It does not appear to be easy to sharpen the possible range of values
of α further without numerical methods. Indeed more detailed models
show that α ' 2.2 or 2.3, and Fig.2 shows the numerically computed
density profile for a self-similar gaseous model. For the corresponding
Fokker-Planck model the e-folding time scale for the central density is
approximately 280trc , which is not so far from the estimate of 709trc
obtained previously on the basis of physical arguments. It should be
pointed out, though, that this homologous phase of core collapse is a
good approximation only very late on. It is preceded by a much longer
phase where the collapse time scale is a smaller multiple of the central
relaxation time.
Finally it should be stressed that this process of core collapse is not
quite the catastrophic process which the words tend to convey. This is
no headlong rush like the collapse to a black hole. It takes place on a
timescale which is very long even relative to the central relaxation time.
It also involves a rapidly decreasing fraction of the total mass of a cluster.
But it does imply that the central density reaches arbitrarily high values
within a finite time. For many galactic globular clusters it is a time
considerably smaller than their age. Even at the present day, a couple of
clusters approach this endpoint of core collapse every billion years in our
galaxy (Hut & Djorgovski 1992). It is accompanied by a marked increase
in the rate at which stars escape (cf. Fig.5.1), and in the rate at which
Self-similar core collapse with equal masses. 191
Fig. 2. The self-similar model of Lynden-Bell & Eggleton (1980). The solid and
dashed straight lines correspond to the asymptotic isothermal relation ρ ∝ r−2
and the asymptotic profile of the collapsing model, ρ ∝ r−α . The variables
plotted are the scaled radius and density, r/rc and ρ/ρc , respectively.
Problems
1) Show from the isotropic Fokker-Planck equation (Eq.14.11) that the
rate of change of f at the bottom of the potential well (energy φ(0))
is given by
Z ∞
f˙ = 16π 2 G2 m2 ln Λ(f 0 f (E)dE + f 2 ),
φ(0)
Core collapse leads to high stellar densities, where interactions may in-
volve more than just two stars at a time. The chapters in this section
are therefore devoted to three-body interactions, especially interactions
between a binary star and a single star. One of our aims in these chap-
ters is to show that important aspects of the three-body problem can be
understood from various points of view, even though the problem itself
lacks a general mathematical solution.
Chapter 19 takes a phenomenological approach, applying notions of
equipartition and energy conservation. This already classifies encounters
according to whether the binary is hard or soft. In some interactions with
hard binaries the result (temporarily) is like a miniature star cluster of
three stars, and our previous knowledge of the behaviour of star clusters
can suggest how this evolves.
Chapter 20 takes an informal mathematical view of the same phenom-
ena. We see that the breakup of triple star clusters exhibits a sensitive
dependence on initial conditions, partly justifying a statistical treatment.
One of the standard examples in which this is most readily understood is
Sitnikov’s problem, which we use to introduce the Smale horseshoe. Fi-
nally we prove informally a theorem which shows that permanent capture
into a triple configuration is (practically) impossible, and end with some
recent surprising discoveries about permanently bound triple systems.
Chapter 21 takes a course half way between the previous ones, exploit-
ing a mixture of approximate analytical tools and physical arguments to
develop a theoretical results on the outcome of three-body interactions.
This is expressed in the language of cross sections, in analogy with simi-
lar problems in atomic scattering theory. Here the analogue of Coulomb
repulsion is gravitational focusing. We estimate the rate at which bi-
naries form within a million-body system. We treat in some detail the
case of adiabatic encounters between a hard binary and a single star,
193
194 PART VI. GRAVITATIONAL SCATTERING: N = 3
when the single star remains “outside” the binary and moves on a time
scale long compared with the period of the binary. In the opposite ex-
treme we analyse cases in which all three stars come within a comparable
small distance from each other, which produces results akin to those of
“threshold” scattering in the atomic context.
Far as these analytical techniques go, it is hard to make them com-
pletely quantitative and sufficiently accurate, which may be accomplished
by computer simulation (Chapter 22). We describe some of the issues in-
volved in integrating few-body systems efficiently (i.e. accurately and
quickly), and the manner in which the calculations may be organised and
automated.
Chapter 23 takes the first steps in applying these results to the evo-
lution of the million-body problem. The cross sections are turned into
quantities representing the rate of occurrence of three-body interactions,
and the mean rate of change of energy of a binary in three-body interac-
tions. One effect of these interactions is that binaries are driven from the
dense central core of an N -body system into the surrounding halo. The
most important results concern the rate at which energy is imparted to
the single stars, as it is this process which halts core collapse and allows
the system to evolve after that.
19
Thought Experiments
As has been seen in Chapter 18, two-body relaxation predicts its own
downfall. It leads to the collapse of the core and, at the level of simplified
models, infinite central density. Clearly, some new dynamical processes,
beyond two-body encounters, must come into play. The very high density
is the clue, for it suggests that a third body may, with increasing prob-
ability, intervene in the two-body encounters which mediate relaxation.
In Chapter 27 it will be seen that three-body encounters do indeed act
on a sufficiently short time scale, late in core collapse, to have a decisive
influence on events. As we note there, this is not the only mechanism
that can work, but we concentrate on it for the time being.
The mechanism is a two-stage one, both stages involving three-body
encounters∗. In the first stage, which we consider in Chapter 21, a three-
body interaction leads to the formation of a binary star and a single third
body (which acts as a kind of catalyst). In the second stage, this binary
interacts with other single stars (again in three-body reactions). In this
chapter we shall study three-body encounters in isolation, in order to
uncover those properties which allow them to play their crucial role in
rescuing the cluster from collapse. Clusters get into this difficulty because
of their negative heat capacity, and in fact it is the negative heat capacity
of binaries which comes to their rescue.
∗
We have in mind systems which initially have no binaries. If “primordial binaries”
(Part VII) are present, the first stage is not needed
195
196 19 Thought Experiments
where mi and vi are, respectively, the mass and speed of the ith compo-
nent (i = 1, 2), and r is their separation. It is more convenient to write
this as
Eb = ε + (m1 + m2 )V 2 /2, (1)
where V is the speed of the centre of mass of the binary and
1 m1 m2 2
ε= v − Gm1 m2 /r; (2)
2 m1 + m2
here v is the relative speed of the components. The “binding energy” of
the binary is the positive quantity −ε; ε, which is the quantity we prefer
to work with, is referred to as the “internal energy” of the binary. The
second term on the right of Eq.(1) can be thought of as the kinetic energy
of the barycentre. It is often called the “external energy” of the binary,
though if the binary is immersed in an N body system the potential
due to the other members would also be included. Note that the kinetic
energy of internal motion, when expressed in terms of the relative speed
v, involves the reduced mass m1 m2 /(m1 + m2 ) of the two components.
Now we add a third body, of mass m3 , and at first we suppose that it
is well separated from the binary. Then the energy of this triple system
is obtained from Eb by adding the kinetic energy of m3 . The result takes
its simplest form in the frame of the barycentre of the triple system, and
is then
1 (m1 + m2 )m3 2
E3 = ε + V , (3)
2 m1 + m2 + m3
where V is the speed of m3 relative to the barycentre of the binary. The
second term on the right is the combined kinetic energy of the third body
and the barycentre of the binary.
Now we suppose that the third body approaches the binary, interacts
with it briefly, and then departs once again to a great distance. (This
is what physicists refer to as a “scattering problem”.) Suppose that
the energy of the binary and the relative speed at a time long after the
encounter are, respectively, ε0 and V 0 . Then it is convenient to gauge the
effect of the interaction by the relative change in the internal energy of
the binary, i.e. ∆ = (ε0 − ε)/ε. If ∆ > 0 the internal energy of the binary
has become more negative, i.e. it has become tighter, or more bound. By
conservation of energy E3 it follows that V 0 > V in this case. It is this
possibility which is all-important in the termination of core collapse, and
so in this chapter we place considerable emphasis on deciding whether ∆
is greater or less than 0 on average.
There are three related qualitative arguments which suggest what the
net effect of such encounters may be. One is based on the negative specific
heat of self-gravitating systems (Chapter 5). We already saw in the last
Energetics of Three-Body Scattering 197
chapter that this property leads to increasing density in the most usual
situation. More precisely, this is what happens to a core in thermal
contact with a cooler halo, and we also saw that, if the surroundings are
hot, the evolution could run in reverse. One might guess that the same is
true of binaries, i.e. that binaries get tighter (more bound) as they react
with other, slow moving stars in three-body interactions, and looser (i.e.
less bound) if the stars it encounters move fast.
The second argument, based on the tendency to equipartition of ener-
gies, strengthens and clarifies these conclusions, and runs as follows. We
begin by noticing that there are degrees of freedom associated with both
terms on the right of Eq.(3). If the second term is much bigger than the
first (i.e. we consider fast encounters, or binaries of low binding energy),
the kinetic energy of relative motion will tend to diminish and the inter-
nal energy of the binary will tend to increase (i.e. it will tend to become
unbound, with ∆ < 0). Similarly, a binary with high binding energy will
tend to become even more bound, and V 0 > V on average.
For the third argument we assume that the interaction can be broken
up into an encounter between the third body and each of the binary com-
ponents separately. If the binary is hard, its components move fast, and
impart some of their energy to the incomer, rather like a stone deflected
by a rotary lawnmower. In this case energy is extracted from the binary,
which hardens. Similarly, if the incomer is moving fast, it transmits some
of this energy to the binary components, and the energy of the binary
increases.
The conclusions of all three arguments suggest that there is a watershed
binding energy, near
above which binaries tend to become more bound, and below which
they tend to become less bound. Though we shall mention some re-
finements shortly, we note here that this kind of result is amply sup-
ported by numerical experiments (Chapter 23; see, for example, Fig.23.1).
The distinction is often expressed by saying that binaries with |ε| <
(m1 + m2 )m3 V 2 /(m1 + m2 + m3 ) are “soft”, which indicates their rela-
tive fragility to encounters, while those with |ε| > (m1 +m2 )m3 V 2 /(m1 +
m2 + m3 ) are called “hard”. In practice the distinction is not quite that
simple. The important point is to distinguish those binaries which harden
on average from those which tend to soften. For equal masses, it turns
out that our expression for the threshold energy has the correct form, and
there is a numerical coefficient near unity. If the masses are equal and
there is a distribution of velocities V , then Eq.(4) is correct if we add a
different numerical coefficient and replace V 2 by its mean square value.
198 19 Thought Experiments
When the masses are different, rather less is known about the correct
threshold energy. When m3 is tiny, i.e. the incomer is a test particle, it
is clear that the critical value is independent of m3 , unlike an estimate
based on Eq.(4). For this case it has been shown that, if the two members
of the binary have equal mass, then the critical value of V is of order the
orbital speed of the binary components about each other (Hills 1990; see
also Fullerton & Hills 1982). Sigurdsson & Phinney (1993) have given
useful data for masses relevant to globular clusters.
Soft Binaries
The equipartition argument is attractively simple, provided one comes
to it with a sufficiently developed physical intuition. But it is not hard
to confirm its prediction with somewhat more concrete considerations, as
we now show. We first consider the case of a fast encounter, i.e. a very
soft binary, in which the first term on the right of Eq.(3) is much smaller
than the second. Then the encounter is nearly impulsive, the relative
position of the two components of the binary hardly alters, and the main
effect of the encounter is on their relative velocity v. If this changes by
an amount δv, then Eq.(2) shows that ε changes by
m1 m2 1
δε ' (v.δv + δv2 ). (5)
m1 + m2 2
In an impulsive encounter δv depends only on the position of the com-
ponents of the binary and not on v. Furthermore, in any reasonable sta-
tistical distribution of binaries, velocities ±v will be present with equal
probabilities, and so the mean value of v.δv will be zero. It follows that
the mean value of δε will be positive, i.e. that soft binaries soften on
average, as expected.
Qualitatively different encounters are possible. If m3 comes sufficiently
close to one component of the binary, then the velocity of that component
may change by so much that its new velocity exceeds the velocity of escape
from its companion. Such an encounter is referred to as an ionisation,
by analogy with atomic scattering. In the usual case, when the binary
survives, we may refer to the encounter as a “fly-by” (which is a name
with definite space-age connotations). There is a third possibility for
these fast encounters, and it is most easily understood in the case of equal
masses. In a very close two-body encounter between equal masses the two
masses almost exactly exchange velocities, just as when a billiard ball hits
another stationary one. If, therefore, in a fast encounter between m3 and
a binary, the intruder scores almost a direct hit on one component, that
component will leave the binary with almost the speed of approach of
m3 , and the latter takes its place as one component. Such an encounter
is an example of an exchange.
Hard Binaries 199
Hard Binaries
Now let us consider the opposite extreme, i.e. encounters in which the
second term on the right of Eq.(3) is small. We shall also suppose that
the encounter takes place in such a way that the minimum distance of
the intruder (m3 ) from the binary is not much greater than the size
of the binary (technically, the length of its semi-major axis). Typical
energies in the interaction are therefore of order ε. If a positive energy of
order |ε| is donated to the third body (or, more precisely, to the kinetic
energy of its motion relative to the binary), then the third body will
recede with much higher speed than its original speed of approach. (This
Fig. 1. One of the exchange orbits computed by Becker in 1920. The position
of one particle is used as the origin of coordinates, and the initial conditions
were chosen so that particles B and C are symmetric under time reversal and a
reflection.
200 19 Thought Experiments
follows from our assumption that |ε| much exceeds the second term on
the right of Eq.(3).) Also, by energy conservation, the binding energy of
the binary has increased by an amount of order |ε|, i.e. it has become
harder, as expected. What if, however, an energy of order |ε| is extracted
from the kinetic energy of the third body? Since |ε| much exceeds the
kinetic energy of the third body, the third body cannot escape, at least
not right away. What happens is that it must return to the vicinity of
the binary and there exchange another packet of energy. Again there are
two possibilities: either m3 can now escape or it cannot. It can be seen
that, in general, the only way in which the encounter can end is with the
eventual escape, after some large or small number of interactions, of one
of the participating stars (Fig.2). When it escapes, it will do so with a
kinetic energy of order ε, which much exceeds its initial kinetic energy,
and so again the net effect of the interaction will have been to harden the
binary.
From the above discussion it can be seen that the encounter may be over
quite promptly. If, on the other hand, there is a complicated interplay of
the three participants, then the encounter is described as a “resonance”.
The name again comes from atomic or nuclear scattering, where it refers
to the unusually large probability of interactions which can occur if the
participants can form a temporary bound state. In the gravitational case
what we see is a temporary, bound triple system. As we shall see in the
next chapter, such a configuration cannot last for ever, but while it lasts
it behaves very much like a star cluster with N = 3. In particular, the
motion exhibits sensitive dependence on initial conditions (Chapter 13),
which suggests that the behaviour of an ensemble of such systems may
exhibit some statistical features. Indeed the distribution of lifetimes of
these systems is nearly exponential (Agekyan et al. 1983), like the random
decay of an atomic nucleus. It turns out that this statistical approach is
a fruitful way of attempting a theoretical prediction of the distribution
of the energy of the eventual escaper (Heggie 1975, Nash & Monaghan
1978). More obviously, it suggests that, in the case of equal masses,
the probability that any of the three participants is the escaper is 1/3;
thus the probability that the resonance leads to exchange should be 2/3.
These resonances in which all three particles interact closely are therefore
sometimes called “democratic”, even in the case of unequal masses.
There is another interesting result which follows from thinking of a
resonance as a small star cluster. In the case of unequal masses our
knowledge of the escape rate in N -body systems (Chapter 16) suggests
that the body of lowest mass is most likely to escape. It follows that
a resonant encounter of a binary with a star whose mass considerably
exceeds that of one of the components is liable to lead to exchange of
that component. In other words, such encounters tend to increase the
mass of the components of a hard binary.
Finally we must turn briefly to the case in which the distance of closest
approach of the third body, denoted by rp , remains considerably greater
than a, the semi-major axis of the binary. Now the energy exchanged
between the two terms in Eq.(3) may be quite small. In fact, it may be
exponentially small, so that − ln |∆| varies roughly as a positive power of
rp /a. The reason for this is that, unless the masses are very disparate,
the timescale on which the third binary is within a distance of a few rp of
the binary much exceeds the period of the binary. Then the perturbation
by the intruder is adiabatic, and an exponentially small net perturbation
is to be expected (Chapter 21). Still, the encounter may result in a fly-by
or in formation of a resonance. Now, however, the third body has too
much angular momentum relative to the barycentre of the binary for a
close interaction, and the resonance is very unlikely to lead to exchange.
202 19 Thought Experiments
Problems
1) Answer the following riddle: How do you make a binary star with a
grain of sand? What this means is the following. Two stars approach
on a hyperbolic relative orbit, and your task is to throw in a grain of
sand so as to extract enough of their kinetic energy to bind them into
a binary. (This is classical mechanics of point masses.)
2) For an N -body system of energy E in virial equilibrium, show that
the threshold energy between hard and soft binaries is of order E/N .
3) Suppose single stars of mass m are distributed uniformly in space
with space density n and have a Maxwellian distribution of velocities.
Assuming that the two stars are uncorrelated, show that the number
density of very soft binaries of energy ε is
π3/2 G3 m6 n2 −5/2
nb ' |ε| .
4(kT )3/2
4) In quantum statistical mechanics, the equilibrium distribution of en-
ergies of hydrogen atoms is given by
f (E) = w(E)ρ(E) exp(−E/kT ),
where the first factor on the right is the statistical weight (multiplic-
ity) of one energy level, the second counts the density of levels per
unit energy, and the third is the usual Boltzmann factor. By applying
the correspondence principle to high-energy states, obtain the corre-
sponding result for the distribution of the binding energy of binaries
in statistical equilibrium, i.e. f (ε) ∝ |ε|−5/2 exp(−ε/kT ).
20
Mathematical Three-
Body Scattering
203
204 20 Mathematical Three-Body Scattering
∗
Note that we seem to have lost information because values of t differing by a multiple
of 2π occupy the same place, but in fact this does not matter: if two test particles
Fractals and Chaos 205
Fig. 1. Angle of deflection of the third body in a set of scattering events. The
system is coplanar, the masses are equal, the initial orbit of the binary is circular
and the initial speed of the third body is fixed. The initial conditions are then
determined by the two parameters b (which is the closest distance to which the
third body would approach the centre of the binary if it were undeflected) and φ,
which is the phase of the binary. For each pair of values the angle of deflection
of the third body is characterised by the shade of grey. There is one episode in
which the third body is almost ejected to infinity, and the “parallel” stripes arise
because the subsequent outcome is similar for motions in which the third body
arrives back at the binary after times which differ by a multiple of the binary
period. (From Boyd & McMillan 1992, copyright 1992 by the American Physical
Society)
arrive at z = 0 with the same value ż but at times differing by a multiple of 2π,
206 20 Mathematical Three-Body Scattering
1
0
0
1
1
0
0
1
11
00
00
11
specifies the time t at which the third body crosses the plane z = 0, and
its speed, for some solution of the problem. Following this solution, we
wait until it crosses the same plane the next time, again measure t and
v, and plot the corresponding point. In this way we have constructed a
map of the plane, called the Poincaré map, which takes the initial point
to the new point. We shall call this map φ, and say that it takes a point
(v0 , t0 ) to the point (v1 , t1 ) = φ(v0 , t0 ). Actually it is defined only on
a subset of the plane, consisting of those points which lead to motions
which do eventually recross the plane z = 0.
If e = 0 it is easy to visualise this map. In this case the two massive
bodies stay at a constant distance from the z-axis and the force experi-
enced by m3 is time-independent, even though the massive stars are in
motion. Hence it is clear that v1 = v0 . Also, the larger the value of v0
(below the escape speed) the longer the time t1 − t0 until m3 returns.
The result of these two facts is that φ rotates points along circles, the
angle of rotation being larger for larger circles (Fig.3). In fact in this
case φ is an example of a “twist” map, as the figure suggests. It can be
used to illustrate a number of concepts, such as “phase mixing”, and the
distinction between “isolating” and “non-isolating” integrals, which were
introduced in Chapter 10.
Now we go on to consider the case in which e is non-zero but small.
In this case the force experienced at a fixed point on the z-axis is time-
dependent. Therefore the energy of the massless particle is no longer
they follow the same subsequent motion with a constant time delay, because the
perturbing effect of the binary is 2π-periodic. For a similar reason it does not
matter that we have dropped the sign of ż by using v instead.
Fractals and Chaos 207
D
0
R
P
D =φ(D )
1 0
φ(R)
Fig.1. The domain of definition of the Poincaré map in the eccentric
Sitnikov problem, and its image.
Now draw a little curvilinear rectangle R in the corner beside P , where
the boundaries of D0 and D1 intersect, and consider what happens to R
under φ. As in the case e = 0, the image of R spirals infinitely many times
just inside the boundary of D1 . Therefore R itself contains infinitely many
pieces of φ(R). Different pieces correspond to different times at which m3
returns to the plane z = 0; for neighbouring pieces the times differ by
nearly one revolution of the binary.
Now a wonderful thing becomes apparent if we apply φ to R, with its
pattern of stripes, once again. Then it is apparent that, not only does
φ(R) cross R in these stripes, but within each stripe is a complete replica
of the infinite pattern of stripes. If we choose a point within a tiny stripe
inside a large stripe, we can then tell how many binary periods elapsed
between each pair of three successive crossings of the plane z = 0 (Fig.2).
Box 1 continued
Once this point is understood it is also clear that, if these numbers
of periods are large enough, they can be chosen arbitrarily, simply by
choosing an arbitrary stripe within an arbitrary stripe. And we need not
stop at three successive crossings. We can repeat the process of applying
φ indefinitely, obtaining a fractal pattern of stripes within stripes within
stripes....ad infinitum. And we can choose initial conditions leading to
an arbitrary sequence of times between successive crossings (as measured,
say, in the nearest whole number of binary periods).
φ(R)
R
S S2
1
φ(S )
φ(S ) 2
1
Fig.2. A piece of the surface of section and parts of its image under φ
and φ2 (i.e. φ applied twice). Only two of the infinitely many strips are
show.
Now it is also possible to extend the argument into the past. This
time, however, we get a fractal set of stripes which cross the first set
transversely. There is a very important set of points which are obtained
by taking the points which lie on both sets of stripes at all levels of
the fractal structure. These correspond to orbits which never escape, no
matter how far forward or back in time we go. Now we can construct
orbits which exhibit any sequence of times between successive crossings,
not only into the future, but into the past as well. Examples are given in
the text.
φ(l )
l
Fig. 3. Sitnikov’s problem for e = 0. The figure shows how several points on
a radial line l are taken, by the action of the Poincaré map φ (thin arcs with
arrowheads) to corresponding points on a spiral curve, which is φ(l).
Box 1 (continued)
Now consider two points which lie on the same component of the fractal
of horizontal strips, and on the same component of φn (R), for some largest
positive n. These points both lie on I, and are very close together. On the
other hand they lie on different components of φn+1 (R), by assumption.
Therefore after n crossings of the plane z = 0 the two orbits differ greatly
in the number of orbital periods before the next crossing. This illustrates
sensitive dependence on initial data.
210 20 Mathematical Three-Body Scattering
Capture
∗
For technical reasons the numbers must be chosen larger than some fixed number.
This is understood in all such statements hereafter.
†
Capture even by a single star is quite possible if the interaction raises tides on either
star; cf. Chapter 27
Capture 211
Following J.E. Littlewood, we shall say that the third body has been
captured if, after some time, all three bodies remain forever within some
sphere S (in space) centred on the barycentre of the three bodies. Now
consider the phase space of the three-body problem, and let x(t) denote an
orbit in this space, so that x is really a vector of positions and momenta.
Let V0 be the set of initial conditions x(0) such that all three bodies lie
within S for all positive times t ≥ 0 (Fig.1). Next, let V1 be the set of
points x(1) which these orbits, originating in V0 at time t = 0, occupy
at time t = 1. It follows from Liouville’s Theorem that the volumes (or
measures) of V0 and V1 are equal. It is also easy to see that V1 lies within
V0 : for any point P in V1 lies on an orbit which started at a point in
V0 , and on the resulting orbit all three bodies subsequently lie within S;
therefore P could be used as an initial condition for an orbit lying entirely
within S; therefore P lies in V0 . To summarise so far, what we have now
established is that V1 is contained within V0 and has the same volume.
Now we concentrate on the set of points which are in V0 but not in V1 .
By what has just been proved this set has zero volume. Let us use one of
these points as initial conditions. On this orbit, all three bodies lie within
S for t ≥ 0, since x(0) lies within V0 . If, in addition, all three bodies lay
within S during the interval from t = −1 to t = 0, then the point x(−1)
could be used as initial conditions for an orbit with the bodies inside S for
t ≥ 0; in other words, x(−1) would lie in V0 , and so x(0) would lie within
V1 . This contradiction establishes that the points in V0 which are not in
V1 include those orbits which are captured during the interval between
t = −1 and t = 0. (There may be others, including orbits in which one
of the bodies is temporarily ejected from S.) Therefore the measure of
capture orbits is zero.
Invariance under time reversal does not have a role in the proof, but
can be used to extend it. By reversing time, the corresponding result
on capture shows that orbits which were bound∗ forever in the past, but
eventually escape, are also rare. The only orbits which one expects to find
are of three sorts: (i) triples which are permanently bound, (ii) triples
which never become bound, and (iii) triples which become temporarily
bound (perhaps more than once, perhaps many times).
∗
Remember that we are here defining “bound” in a particular sense, based on the
locality of the three bodies.
212 20 Mathematical Three-Body Scattering
Box 2 (continued)
V
V1 0
x(1)
x(0)
x(-1)
Of these three types, the last two have become so familiar that many
people must have concluded that nothing else can happen. But stable
permanently bound triples do exist. Despite an earlier example inves-
tigated by Hénon (1976), one of the astonishing developments in this
subject took place around the start of 2000, when Chenciner & Mont-
gomery (2000) discovered, more or less by pure thought, a remarkable
stable periodic solution of the three-body problem with equal masses∗ .
The three particles chase each other round an orbit resembling a figure
8. Even more wonderful stable periodic motions are possible (Fig.4; see
also Montgomery 2001, and other papers and web references therein).
Problems
1) In a hierarchical triple the third body orbits well “outside” the inner
binary. Suppose the inner binary has period 2π and that the orbit of
the third body takes it close to the inner binary at times t0 < t1 <
t2 < . . .. We suppose that the energy of the outer binary remains
constant between close approaches, and that it changes at each close
approach by an amount that depends periodically on the phase of the
∗
It had been found previously by Moore (1993) using a numerical technique, but few
had paid much attention at that time.
Capture 213
Fig. 4. Some periodic motions of the 3-body problem (M. Ruffert). Top left is
the orbit of Chenciner & Montgomery, with numerical initial conditions found by
C. Simó. All three particles traverse the same orbit, and the motion is periodic in
an inertial frame. The other orbits are periodic in a rotating frame. At top right,
two bodies traverse the thicker curve, while one traverses the thinner curve. In
the remaining four frames the three bodies traverse different curves. All motions
are stable for equal masses, and have zero angular momentum.
214 20 Mathematical Three-Body Scattering
Cross Sections
What is important in applications (Chapters 23f) is the energetics of
these interactions, and that is why such stress was laid on the distinction
between soft and hard pairs in Chapter 19. In the present chapter this
consideration implies that we may be interested in interactions with bina-
ries of a given energy. Encounters with such binaries, however, are taking
place all the time with stars which approach from random directions and
random distances. Therefore, besides the energy of the binary, we usu-
ally do not know or care about the other properties of the participants,
except for their statistical distribution. This is true of the approach path
of the third body, and also usually it is true of the other 5 parameters
(besides the energy) which determine the relative motion of the binary
components∗.
An analogy with atomic scattering may now be clear. There also we
may be concerned with binaries (atoms) occupying a specific energy level
and orbital angular momentum (analogous to the energy and eccentricity
of a binary). In the atomic context interactions are characterised by the
∗
One exception may be the eccentricity; an important special case is that of binaries
in initially circular orbits.
215
216 21 Analytical Approximations
G5 m5 n3
Ṅb = 0.75 , (5)
σv9
where v2 = 3σv2 (Goodman & Hut 1993). (The subscript should help
avoid confusion with a cross section.)
It has already been said that we can define cross sections for encounters
of a given type. For example, if we are interested in the amount by which
the binding energy of the binary changes we may proceed as follows. Let
∆ be the relative change in the binding energy, and let σ(∆) be the
cross section of encounters in which the relative change exceeds ∆. In
fact we already know (from a discussion in the previous chapter) that
a substantial fraction of “close” encounters changes the energy of the
binary by an amount comparable with its original value, and so Eq.(4)
is an order-of-magnitude estimate of σ(1). Equivalent information on
the behaviour of interactions is contained in the differential cross section
∗
Fig.14.1 gives a picture of what this means
218 21 Analytical Approximations
Adiabatic Encounters
Let us consider first a “wide” encounter with a hard binary, i.e. one in
which Rp considerably exceeds a. Then the angular velocity of the third
body about the binary, when at closest approach, is Vp /Rp . Now the
mean angular velocity of the binary components is given by Kepler’s 3rd
Law in the form
q
n= GM12 /a3 . (6)
(Note the change of meaning of n.) By use of Eqs.(2) and (3), it follows
that the ratio of these frequencies is
3/2 !−1/2
nRp Rp 2M123 Rp aV 2
= + . (7)
Vp a M12 a GM12
We shall suppose that the masses are not very disparate. Since the binary
is hard, aV 2 /(GM12 ) 1. It follows that, if Rp considerably exceeds
a, the time scale of the encounter will be long compared with the period
of the binary. In this case the encounter is “adiabatic”. It does not
follow that the binary exhibits adiabatic invariance, with exponentially
Adiabatic Encounters 219
Fig. 1. Numerical and analytic estimates of cross sections for ionisation, ex-
change and resonance. The initial eccentricity of the binary is 0, and the relative
velocity of the incoming star is given in units of the minimum speed required
energetically to ionise the binary. Masses are equal. The dotted line gives the
theoretical estimate for exchange in the soft limit (v 1). The dashed line is the
theoretical estimate for ionisation, which occurs only if v ≥ 1. For equal masses
it happens to coincide very nearly with Eq.(4), the cross section for “close” en-
counters with a hard binary. This is why the numerical points follow this line
approximately for v 1. From Hut & Bahcall (1983).
0.008
0.006
0.004
0.002
δa/a
0
-0.002
-0.004
-0.006
-0.008
0 100 200 300 400
t
0.018
0.016
0.014
0.012
0.01
δe
0.008
0.006
0.004
0.002
0
0 100 200 300 400
t
Fig. 2. Relative change in the semi-major axis a (top) and the eccentricity e in
an adiabatic encounter, plotted against time. The binary is circular initially.
point where the third body would experience a collision with the centre of
mass of the binary, another instance of the curious role of singularities in
this subject. At this point the oscillatory terms of the integrand turn into
exponentials, and these determine the dependence of the energy change
(cf. Box 1, Eq.[7]). Not surprisingly, the exponent is essentially the ratio
of the time scales in the problem: the short period of the binary, and
the much longer time scale of the encounter. This is rather common
behaviour for the effect of adiabatic perturbations.
Now we show how to estimate the corresponding cross section. We use
Eq.(8) of Box 1 as an estimate of the relative energy change ∆, though
it is pointed out in the accompanying text that the coefficient of the
exponential also depends on the orbital parameters. When we solve for
p, however, this coefficient enters as a logarithm, and the resulting effect
on the cross section is weak. Aside from this logarithm, then, we estimate
Adiabatic Encounters 221
Let ε be the energy of a binary, and let r denote the position of one
component relative to the other. Thus
m1 m2 2 Gm1 m2
ε= ṙ − ,
2M12 r
where M12 = m1 + m2 . Also,
m1 m2
ε̇ = ṙ.f , (1)
M12
where f is the perturbative acceleration due to m3 , i.e. its contribution
to the equation for r̈. The exact expression for this is
R − µ2 r R + µ1 r
f = Gm3 − ,
|R − µ2 r|3 |R + µ1 r|3
where µi = m1 /M12 and R is the position of m3 relative to the barycentre
of the binary. For encounters which are not too close we approximate this
at lowest (quadrupole) order in a multipole expansion, i.e. write
Gm3 3r.R
f '− 3
r− R . (2)
R R2
In first order perturbation theory we substitute unperturbed motion
for r and R in Eqs.(1) and (2). Terms involving r are periodic with
period 2π/n, and so the right hand side of Eq.(1) may be written as a
Fourier series (with R-dependent coefficients). By averaging Eq.(1) at
fixed R, we see that this series has no constant term, and the harmonics
of lowest order involve a factor proportional to exp(±int). (Harmonics
of higher order oscillate more rapidly, and from what follows it would be
seen that they would yield a negligible contribution to our result.) We
use a parabolic approximation for the motion of the third body (i.e. R).
The solution is standard, if relatively unfamiliar:
R = q(1 + σ2 ), (3)
where q is the distance of closest approach, and σ (not a cross section!)
is related to the time via the equation
s
1 GM123
σ + σ3 = t. (4)
3 2q 3
Here we have taken t = 0 at the instant of closest approach of the third
body, and M123 = m1 + m2 + m3 .
The relative change in energy is given by integrating Eq.(1). This yields
several terms, of which we take the integral
222 21 Analytical Approximations
Box 1 (continued)
Z ∞
I= R−3 exp(int)dt (5)
−∞
that
4/3
πGM123 a
σ = πp2 ∼ 1/3
(3 ln |∆|)2/3 . (8)
M12 V 2
The differential cross section dσ/d∆ varies essentially as 1/|∆|. Note the
similarity to Eq.(4).
Now for the complication. Our previous treatment is really too rough,
because all three stars must arrive simultaneously within the small in-
teraction region of size of order a1 , and as they approach they interact
with each other in a way that cannot, as we have implicitly assumed, be
broken down into two-body approximations. Another way of saying the
same thing is that three-body gravitational focussing is different from the
two-body kind that we introduced earlier in the chapter.
A better way to handle the simultaneous close approach of three bodies
is to treat it as a perturbation from an exact triple collision. Solutions for
a triple collision can be written down exactly, and then linearisation about
one of these solutions can be adapted to provide the required information.
In the context of atomic scattering theory, and threshold scattering in
particular, this approach was initiated by G.H. Wannier (1953). It is
equally successful in the gravitational problem under consideration here.
It would take too long to analyse three-body gravitational scattering
in detail from this point of view, but we can illustrate the ideas involved
by studying close two-body encounters. In particular, we rederive a con-
sequence of Eq.(4) by looking at the relative motion of the third star and
the barycentre of the binary as a perturbation of a two-body collision or-
bit (Box 2). The trick is to find the time at which the deviation from the
collision orbit grows to become of the same order as the spatial size of the
collision solution itself. At this time the collision solution breaks down as
an approximation, and one can interpret the corresponding length scale
as the distance of closest approach.
It turns out that a similar calculation is possible for three-body mo-
tions (Simonovic & Grujic 1987, Heggie & Sweatman 1991), and it leads
to a similar relation which shows how the cross section for a close three-
body interaction within a small distance r depends on r. The calculation
is complicated because there are several configurations which yield exact
collision orbits, and because the resulting eigenvalue problems are some-
what more involved. The kind of result obtained may be illustrated by
the case of equal masses, which is quite easily worked out. In this case it
turns out that the cross section given in Eq.(9) does not √ have quite the
correct dependence on a1 : the exponent should be 2 + 13/2, ' 3.802....
While this discussion dealt with the case in which a binary is greatly
hardened by an encounter (∆ 1), similar considerations occur in en-
counters in which a binary is almost destroyed. In this case all three
participants must emerge from the encounter on orbits close to an exact
triple expansion solution. When the size of the configuration has enlarged
sufficiently, one pair “peel off” to reform a binary, leaving the single star
to escape. The way in which the cross section for such events depends on
Close Triple Encounters 225
Problems
1) A rapidly moving third body approaches one component of a soft
binary much more closely than the other. Use Eqs.(2) and (4) in Box
14.1, to estimate the change in velocity of this component. Substi-
tute this result into Eq.(19.5), using the statistical argument in the
associated text, and hence estimate the rate of softening of the binary.
(Impose a cutoff in impact parameter p at the semi-major axis (a) and
Close Triple Encounters 227
For a very hard binary, for which n V /Rp , the second term in the
bracket may be neglected. By deforming the contour further round
the branch point at t = iRp /V and a cut from there to t = i∞, show
that s
2πn nRp
I exp −
Rp3 V 3 V
asymptotically.
7) An “isosceles” triple system in the plane consists of three stars of equal
mass m = 1 at the points (x, y), (−x, y) and (0, −2y). Write down
equations of motion in units such that G = 1, and verify that these are
satisfied by the collision√solution x = A(−t)2/3 , y = B(−t)2/3, where
A = 3/24/3 and B = 3/24/3 . Obtain the linearised equations for
small relative perturbations δx, δy. Show that these have solutions in
which δx, δy vary as (−t)λ , for four possible values of λ (which you
should find).
22
Laboratory Experiments
229
230 22 Laboratory Experiments
(Hut & Bahcall 1983). Much of the organisation of these runs was still
done by hand, and the process was eventually fully automated by Piet
Hut and Steve McMillan by 1995 (McMillan & Hut 1996).
There are two issues to be confronted with numerical techniques for the
scattering problem. One is the question of how to integrate the equations
of motion, and the other is the matter of how to organise the results.
These methodological issues are the themes of this chapter.
Numerical Integration
∗
The derivative of the acceleration is sometimes called jerk. Perhaps the position and
its successive derivatives should be referred to by the letters x, v, a, j, s, c and p
(up to the 6th derivative), as we have been given to understand that some physicists
refer to the last three as snap, crackle and pop. See
http://www.weburbia.com/physics/jerk.html.
Numerical Integration 231
equations reduce to
xn+1 = xn + hvn+1/2 ,
(5)
vn+3/2 = vn+1/2 − hxn+1 .
∂(xn+1 , vn+3/2 )
By computing the Jacobian determinant it is easy to
∂(xn , vn+1/2 )
show that this is area-preserving∗
It might seem to have nothing to do with area preservation or sym-
plecticness, but in practice an algorithm like the leapfrog method is very
good for long-term energy conservation. The fact that the method is
area-preserving essentially shows that the numerically generated phase
path does not spiral out from (or in to) the origin, which is precisely
the behaviour exhibited by the Euler method (Problem 1 and Fig.1). In
terms of order, i.e. the way in which the local truncation error depends
on h, the leapfrog is very primitive. It is not hard, however, to invent
symplectic methods of higher order (Kinoshita et al. 1991).
Our discussion of the symplectic property of the leapfrog integrator
made the unstated assumption that h was constant. In practice, as we
have already seen, h should be allowed to vary, and then it is not clear
that the symplectic property of the algorithm is preserved. This stum-
bling block to the implementation of effective symplectic integrators was
removed by Hut, Makino and McMillan (1995), who realised that the
way out was to incorporate another exact property of equations (1): their
time-reversal invariance. Observe that, if ri (t) is a solution of Eq.(1) then
so is ri (−t). One consequence of this fact is that, if the velocities of three
stars in a triple system are reversed at some instant, then the stars will
exactly retrace their orbits. Here is another property which we should
perhaps try to incorporate into a numerical integrator.
Let us look at Eqs.(4) from this point of view. By inverting these we
immediately find
n+1 n+1/2
rn
i = ri + (−h)vi ,
n+1/2 n+3/2
vi = vi + (−h)an+1
i .
∗
Strictly speaking an integrator for a problem with one-degree of freedom is sym-
plectic if the map (xn , vn ) → (xn+1 , vn+1 ) is area-preserving. The fact that v is
discretised at intermediate times complicates the issue, but we shall not explore this
detail here.
Numerical Integration 233
2
1.5
1
0.5
0
-0.5
y
-1
-1.5
-2
-2.5
-3
-1 0 1 2 3 4 5 6
x
Since these are true at each step, we can reduce the superscript on the
second equation by one, and obtain
n+1 n+1/2
rn
i = ri + (−h)vi ,
n−1/2 n+1/2
vi = vi + (−h)an
i.
Note that these equations are of exactly the same form as Eqs.(4), except
that h is replaced by −h and the superscript is running downwards. Thus
if we stop a leapfrog integration at some point, reverse h, and continue
using the leapfrog algorithm, we recover the previous results (except for
rounding error). This is the analogue (for algorithms) of the time-reversal
invariance of the equations of motion.
Now the above argument is ruined if we allow the time step to vary, for
example as a function of the accelerations ai , unless we ensure that the
time-step respects the time-reversal invariance as well. One possibility is
to ensure that h depends on a symmetric function of the values at the
234 22 Laboratory Experiments
beginning and the end of the time step. It turns out that such a procedure
works very well.
It is still possible to break a time-symmetric, symplectic integrator
with sufficiently close approaches between two or three of the bodies.
There is a formulation, based on a different Hamiltonian, which will ac-
curately compute even a collision orbit (Mikkola & Tanikawa 1999, see
Chapter 15). The traditional way out, however, is the technique of two-
body regularisation, also discussed in Chapter 15. Its extension to three-
and other few-body systems is best carried out with the “chain regular-
isation” of Mikkola & Aarseth (1993), which is an extension of an old
3-body method devised by Aarseth & Zare (1974)∗. We recommend the
survey by Mikkola (1997) for a fascinating glimpse of the history behind
these methods. In recent years, however, it has even become possible to
combine these regularisation techniques with time-symmetrisation (Fu-
nato et al. 1996, Leimkuhler 1999).
The final point to be mentioned in connection with numerical methods
is the accuracy which it is feasible or desirable to insist on. This is really
the same question for N = 3 as we discussed in general in Chapter 13.
There we already saw that there are situations in which the slightest
change in the initial conditions of a three-body interaction can lead to
drastic changes in the final result. Dejonghe & Hut (1987) have discovered
examples in which an error in initial conditions may be magnified by over
100 orders of magnitude before the final fate of the triple system is settled.
These considerations show that one cannot hope to devise any method
which will produce an accurate solution for all triple interactions.
∗
For N = 3 there is an alternative due to G. Lemaitre (1955), though he is better
known nowadays for his work in cosmology.
Organisation of the calculations 235
may be the initial eccentricity and phase of the binary, and all but one
of the initial coordinates and all velocity components of the incomer.
Against these simplifications, however, we must not forget that the result
depends on the masses, though scaling (e.g. so that the total mass is
unity) removes one.
In a given problem it is usually clear how these parameters are to
be distributed. In a simple situation, for instance, we are interested in
encounters between stars of given masses, and the initial speed of the
intruder and eccentricity of the binary are specified. Then the initial
phase of the binary corresponds to a uniformly random choice of the
initial time, while the remaining parameters of the intruder correspond to
a uniform distribution in space and an isotropic distribution of velocities.
It is customary to select such initial conditions using a well designed
random number generator, but there is some advantage in these problems
in using quasi-random numbers (Press et al. 1992),which aim to sample
the domain of initial conditions more purposefully.
One of the parameters commonly used to specify the initial conditions
of the intruder is the impact parameter, p. i.e. the distance of closest
approach between it and the barycentre of the binary, if it was unde-
flected by the binary. This differs from all other parameters in that it is
unbounded, and it is the most troublesome parameter of all to deal with.
Some upper limit must be imposed, and it is legitimate to do so, for if
p is too large the intruder flies wide of the binary and does not affect
it significantly. But where should the limit be set? The problem with
the choice of maximum impact parameter is this: allowing too large an
impact parameter can imply a large waste of computer time on uninter-
esting orbits; while choosing too small an impact parameter (e.g. one
much smaller than the initial semi-major axis of the binary) will yield a
systematic underestimate of some cross sections, since some encounters
of interest will be missed.
One manual way of handling this situation is to carry out a pilot study
to determine the range of impact parameters in which the encounters of
interest are observed to take place. Only then does one embark on pro-
duction runs, intended to compute cross sections with desired statistical
accuracy (Fig.2).
It is now possible to automate the entire process (McMillan & Hut
1996). Rather than relying on human inspection of pilot calculations,
software has been written which includes an automatic feedback system
ensuring near-optimal coverage of the range of impact parameters while
guaranteeing completeness. The basic idea is to maintain a safety zone
outside the ‘bull’s eye’ region, defined as the area where interesting reac-
236 22 Laboratory Experiments
Fig. 2. Numerically determined cross section for hard binary scattering for equal
v 2 dσ
masses, and a semi-analytic fit. The scaled differential cross section P =
πa2 d∆
is plotted, with error bars, against the relative increase in binding energy, ∆,
where v is the initial relative speed of the third body and the binary, in units of
the minimum speed required for ionisation. From Heggie & Hut (1993).
Problems
1) Solve Eqs.(5) exactly, for initial conditions x0 = 1, v1/2 = 0. Find
out in what sense the solution is superior to that of the corresponding
Euler algorithm
xn+1 = xn + hvn ,
vn+1 = vn − hxn
with initial conditions x0 = 1, v0 = 0.
2) Compare the total of the cross sections at the bottom of Box 1 with
the estimate for the cross section of close encounters (Eq.(21.4)).
3) For fixed masses, how many parameters are needed to specify a three-
body scattering experiment (a) in two dimensions, (b) in one dimen-
sion?
238 22 Laboratory Experiments
The command sigma3 -d 128 -v 1.2 -m 0.3 uses part of the “star-
lab” package (http://www.manybody.org/starlab.html) to estimate the
cross section for various outcomes of encounters between a single star
and a binary. The binary component masses are 1 − m and m, and the
mass of the third star may be set with a further option. The other two
parameters are d, which is a measure of the density of experiments in
a central trial √
zone (the fractional error in the largest cross section will
be of order 1/ d), and v, the speed of the third body when at a large
distance, in units of the value at which the total barycentric energy of
the triple system is zero. Here is a summary of the results. It may not
look beautiful to the reader, but it does to anyone who has organised this
kind of thing by hand.
scatter profile:
m1=0.7 m2=0.3 m3=0.5 r1=0 r2=0 r3=0 v inf=1.2
----------------------------------------------------
total scatterings=25, CPU time=7.45
exch1 exch2 ion error stop
2.220 0.888 3.108 0.000 0.000
+-1.83 +-.628 +-2.03 +-.000 +-.000
----------------------------------------------------
total scatterings=60, CPU time=16.94
exch1 exch2 ion error stop
0.444 0.999 0.777 0.000 0.000
+-.222 +-.333 +-.294 +-.000 +-.000
----------------------------------------------------
total scatterings=312, CPU time=105.6
exch1 exch2 ion error stop
0.768 1.224 0.712 0.000 0.000
+-.158 +-.199 +-.153 +-.000 +-.000
----------------------------------------------------
total scatterings=1560, CPU time=487.97
exch1 exch2 ion error stop
1.025 1.181 0.861 0.000 0.000
+-.101 +-.105 +-.089 +-.000 +-.000
v2 σ
The values are those of , where v is defined as above. Notice that
πa2
the cross section for exchanging the heavier binary companion is smaller.
Organisation of the calculations 239
22
Laboratory Experiments
229
230 22 Laboratory Experiments
(Hut & Bahcall 1983). Much of the organisation of these runs was still
done by hand, and the process was eventually fully automated by Piet
Hut and Steve McMillan by 1995 (McMillan & Hut 1996).
There are two issues to be confronted with numerical techniques for the
scattering problem. One is the question of how to integrate the equations
of motion, and the other is the matter of how to organise the results.
These methodological issues are the themes of this chapter.
Numerical Integration
∗
The derivative of the acceleration is sometimes called jerk. Perhaps the position and
its successive derivatives should be referred to by the letters x, v, a, j, s, c and p
(up to the 6th derivative), as we have been given to understand that some physicists
refer to the last three as snap, crackle and pop. See
http://www.weburbia.com/physics/jerk.html.
Numerical Integration 231
equations reduce to
xn+1 = xn + hvn+1/2 ,
(5)
vn+3/2 = vn+1/2 − hxn+1 .
∂(xn+1 , vn+3/2 )
By computing the Jacobian determinant it is easy to
∂(xn , vn+1/2 )
show that this is area-preserving∗
It might seem to have nothing to do with area preservation or sym-
plecticness, but in practice an algorithm like the leapfrog method is very
good for long-term energy conservation. The fact that the method is
area-preserving essentially shows that the numerically generated phase
path does not spiral out from (or in to) the origin, which is precisely
the behaviour exhibited by the Euler method (Problem 1 and Fig.1). In
terms of order, i.e. the way in which the local truncation error depends
on h, the leapfrog is very primitive. It is not hard, however, to invent
symplectic methods of higher order (Kinoshita et al. 1991).
Our discussion of the symplectic property of the leapfrog integrator
made the unstated assumption that h was constant. In practice, as we
have already seen, h should be allowed to vary, and then it is not clear
that the symplectic property of the algorithm is preserved. This stum-
bling block to the implementation of effective symplectic integrators was
removed by Hut, Makino and McMillan (1995), who realised that the
way out was to incorporate another exact property of equations (1): their
time-reversal invariance. Observe that, if ri (t) is a solution of Eq.(1) then
so is ri (−t). One consequence of this fact is that, if the velocities of three
stars in a triple system are reversed at some instant, then the stars will
exactly retrace their orbits. Here is another property which we should
perhaps try to incorporate into a numerical integrator.
Let us look at Eqs.(4) from this point of view. By inverting these we
immediately find
n+1 n+1/2
rn
i = ri + (−h)vi ,
n+1/2 n+3/2
vi = vi + (−h)an+1
i .
∗
Strictly speaking an integrator for a problem with one-degree of freedom is sym-
plectic if the map (xn , vn ) → (xn+1 , vn+1 ) is area-preserving. The fact that v is
discretised at intermediate times complicates the issue, but we shall not explore this
detail here.
Numerical Integration 233
2
1.5
1
0.5
0
-0.5
y
-1
-1.5
-2
-2.5
-3
-1 0 1 2 3 4 5 6
x
Since these are true at each step, we can reduce the superscript on the
second equation by one, and obtain
n+1 n+1/2
rn
i = ri + (−h)vi ,
n−1/2 n+1/2
vi = vi + (−h)an
i.
Note that these equations are of exactly the same form as Eqs.(4), except
that h is replaced by −h and the superscript is running downwards. Thus
if we stop a leapfrog integration at some point, reverse h, and continue
using the leapfrog algorithm, we recover the previous results (except for
rounding error). This is the analogue (for algorithms) of the time-reversal
invariance of the equations of motion.
Now the above argument is ruined if we allow the time step to vary, for
example as a function of the accelerations ai , unless we ensure that the
time-step respects the time-reversal invariance as well. One possibility is
to ensure that h depends on a symmetric function of the values at the
234 22 Laboratory Experiments
beginning and the end of the time step. It turns out that such a procedure
works very well.
It is still possible to break a time-symmetric, symplectic integrator
with sufficiently close approaches between two or three of the bodies.
There is a formulation, based on a different Hamiltonian, which will ac-
curately compute even a collision orbit (Mikkola & Tanikawa 1999, see
Chapter 15). The traditional way out, however, is the technique of two-
body regularisation, also discussed in Chapter 15. Its extension to three-
and other few-body systems is best carried out with the “chain regular-
isation” of Mikkola & Aarseth (1993), which is an extension of an old
3-body method devised by Aarseth & Zare (1974)∗. We recommend the
survey by Mikkola (1997) for a fascinating glimpse of the history behind
these methods. In recent years, however, it has even become possible to
combine these regularisation techniques with time-symmetrisation (Fu-
nato et al. 1996, Leimkuhler 1999).
The final point to be mentioned in connection with numerical methods
is the accuracy which it is feasible or desirable to insist on. This is really
the same question for N = 3 as we discussed in general in Chapter 13.
There we already saw that there are situations in which the slightest
change in the initial conditions of a three-body interaction can lead to
drastic changes in the final result. Dejonghe & Hut (1987) have discovered
examples in which an error in initial conditions may be magnified by over
100 orders of magnitude before the final fate of the triple system is settled.
These considerations show that one cannot hope to devise any method
which will produce an accurate solution for all triple interactions.
∗
For N = 3 there is an alternative due to G. Lemaitre (1955), though he is better
known nowadays for his work in cosmology.
Organisation of the calculations 235
may be the initial eccentricity and phase of the binary, and all but one
of the initial coordinates and all velocity components of the incomer.
Against these simplifications, however, we must not forget that the result
depends on the masses, though scaling (e.g. so that the total mass is
unity) removes one.
In a given problem it is usually clear how these parameters are to
be distributed. In a simple situation, for instance, we are interested in
encounters between stars of given masses, and the initial speed of the
intruder and eccentricity of the binary are specified. Then the initial
phase of the binary corresponds to a uniformly random choice of the
initial time, while the remaining parameters of the intruder correspond to
a uniform distribution in space and an isotropic distribution of velocities.
It is customary to select such initial conditions using a well designed
random number generator, but there is some advantage in these problems
in using quasi-random numbers (Press et al. 1992),which aim to sample
the domain of initial conditions more purposefully.
One of the parameters commonly used to specify the initial conditions
of the intruder is the impact parameter, p. i.e. the distance of closest
approach between it and the barycentre of the binary, if it was unde-
flected by the binary. This differs from all other parameters in that it is
unbounded, and it is the most troublesome parameter of all to deal with.
Some upper limit must be imposed, and it is legitimate to do so, for if
p is too large the intruder flies wide of the binary and does not affect
it significantly. But where should the limit be set? The problem with
the choice of maximum impact parameter is this: allowing too large an
impact parameter can imply a large waste of computer time on uninter-
esting orbits; while choosing too small an impact parameter (e.g. one
much smaller than the initial semi-major axis of the binary) will yield a
systematic underestimate of some cross sections, since some encounters
of interest will be missed.
One manual way of handling this situation is to carry out a pilot study
to determine the range of impact parameters in which the encounters of
interest are observed to take place. Only then does one embark on pro-
duction runs, intended to compute cross sections with desired statistical
accuracy (Fig.2).
It is now possible to automate the entire process (McMillan & Hut
1996). Rather than relying on human inspection of pilot calculations,
software has been written which includes an automatic feedback system
ensuring near-optimal coverage of the range of impact parameters while
guaranteeing completeness. The basic idea is to maintain a safety zone
outside the ‘bull’s eye’ region, defined as the area where interesting reac-
236 22 Laboratory Experiments
Fig. 2. Numerically determined cross section for hard binary scattering for equal
v 2 dσ
masses, and a semi-analytic fit. The scaled differential cross section P =
πa2 d∆
is plotted, with error bars, against the relative increase in binding energy, ∆,
where v is the initial relative speed of the third body and the binary, in units of
the minimum speed required for ionisation. From Heggie & Hut (1993).
Problems
1) Solve Eqs.(5) exactly, for initial conditions x0 = 1, v1/2 = 0. Find
out in what sense the solution is superior to that of the corresponding
Euler algorithm
xn+1 = xn + hvn ,
vn+1 = vn − hxn
with initial conditions x0 = 1, v0 = 0.
2) Compare the total of the cross sections at the bottom of Box 1 with
the estimate for the cross section of close encounters (Eq.(21.4)).
3) For fixed masses, how many parameters are needed to specify a three-
body scattering experiment (a) in two dimensions, (b) in one dimen-
sion?
238 22 Laboratory Experiments
The command sigma3 -d 128 -v 1.2 -m 0.3 uses part of the “star-
lab” package (http://www.manybody.org/starlab.html) to estimate the
cross section for various outcomes of encounters between a single star
and a binary. The binary component masses are 1 − m and m, and the
mass of the third star may be set with a further option. The other two
parameters are d, which is a measure of the density of experiments in
a central trial √
zone (the fractional error in the largest cross section will
be of order 1/ d), and v, the speed of the third body when at a large
distance, in units of the value at which the total barycentric energy of
the triple system is zero. Here is a summary of the results. It may not
look beautiful to the reader, but it does to anyone who has organised this
kind of thing by hand.
scatter profile:
m1=0.7 m2=0.3 m3=0.5 r1=0 r2=0 r3=0 v inf=1.2
----------------------------------------------------
total scatterings=25, CPU time=7.45
exch1 exch2 ion error stop
2.220 0.888 3.108 0.000 0.000
+-1.83 +-.628 +-2.03 +-.000 +-.000
----------------------------------------------------
total scatterings=60, CPU time=16.94
exch1 exch2 ion error stop
0.444 0.999 0.777 0.000 0.000
+-.222 +-.333 +-.294 +-.000 +-.000
----------------------------------------------------
total scatterings=312, CPU time=105.6
exch1 exch2 ion error stop
0.768 1.224 0.712 0.000 0.000
+-.158 +-.199 +-.153 +-.000 +-.000
----------------------------------------------------
total scatterings=1560, CPU time=487.97
exch1 exch2 ion error stop
1.025 1.181 0.861 0.000 0.000
+-.101 +-.105 +-.089 +-.000 +-.000
v2 σ
The values are those of , where v is defined as above. Notice that
πa2
the cross section for exchanging the heavier binary companion is smaller.
Organisation of the calculations 239
23
Gravitational Burning
and Transmutation
where R is the rate (number per unit time) at which the binary experi-
ences encounters in which the relative change of its binding energy is ∆.
The function R becomes infinite as ∆ → 0, but in practice we can impose
a small positive cutoff on ∆, at the point where positive and negative
energy changes nearly cancel.
As an example, consider the formula
R ∝ ∆−α (1 + ∆)−4.5+α , (2)
which, with α ' 0.54, has been shown to provide a satisfactory fit to
numerical data for ∆ > 0.01 for equal masses in the very hard binary limit
(Heggie & Hut 1993). Then the integral in Eq.(1) is quite insensitive to
the choice of cutoff, and the result is h∆Ri ' 0.21. This result accounts
for the values at the extreme right in Fig.1, which plots h∆Ri, normalised
by the binding energy. Eq.(2) can also be used to show that about half
the hardening rate of a hard binary is accounted for by encounters in
which
∆ > ∆∗ ≡ 0.53. (3)
239
240 23 Gravitational Burning and Transmutation
Keeping this result for later, let us now turn to the effect of an encounter
on the motion of the binary. Let the initial and final velocity of the binary
relative to the intruder be V and V0 , respectively, and let ε be the initial
energy of the binary. By energy conservation (in the rest-frame of the
barycentre of the three bodies, assumed to have equal mass m) it follows
that
1 1
mV 2 + ε = mV 02 + ε(1 + ∆).
3 3
(The factor 1/3 arises because the reduced mass of a system consisting of a
binary of mass 2m and a single star of mass m is 2m/3.) For a sufficiently
hard binary and an average encounter, ∆ε mV 2 , and so this reduces
approximately to V 02 ' −3∆ε/m. In this same frame the velocities of the
binary and single star are Vb0 = V0 /3 and Vs0 = −2V0 /3, respectively,
since the relative velocity must be V0 and the total momentum in the
23 Gravitational Burning and Transmutation 241
barycentric frame must vanish. Finally, to compute the new speed of the
binary and single star in the rest frame of the whole cluster we must add
the velocity of the triple system, but again this is negligible for an average
encounter with a sufficiently hard binary. Thus the final p speeds of the
binary
p and single star in the cluster are approximately −∆ε/(3m) and
2 −∆ε/(3m), respectively. (Remember that ε < 0 and ∆ > 0 typically.)
Armed with this information, let us now follow the fate of a three-body
binary as it experiences a succession of encounters, starting from the time
of its formation in the core of a stellar system. Initially we do not expect
the predicted speeds to be significantly in excess of the velocity dispersion
of the core. The calculations of the preceding paragraph are very approx-
imate in this case, but the reaction products continue to move around in
the core, and their kinetic energy (in which, for a binary, we include only
the kinetic energy of the motion of its barycentre) has increased by |∆ε|
per encounter.
As time goes on and further encounters take place, the binary steadily
hardens, and typical values for the predicted speeds of the reaction prod-
ucts increase. The first thing to happen is that the single star can now
escape from the core, and eventually the time comes when the binary
achieves enough speed to do so also (Fig.2; see also Giersz & Spurzem
2000). This is no dream. There is a known binary (with an 8-hour period)
in the outskirts of the globular cluster M15 which probably got there by
ejection from the core in a three-body encounter (Phinney & Sigurdsson
1991). At least one of the components is a neutron star, and the binary
is sufficiently close to display a relativistic perihelion advance, just as the
planet Mercury does, but about 30, 000 times faster.
When the binary is ejected from the core it finds itself in a region
of lower density, and encounters are less frequent. If, as an example, we
consider a binary moving on a radial orbit, the total number of encounters
during one orbit (from the core to the maximum radius which it can reach
and back to the core again) varies as
Z
Nenc ∼ nσvdt, (4)
100
10
Radius
0.1
0.01
1e-06 1e-05 0.0001 0.001 0.01 0.1
Energy
Fig. 2. Radius and binding energy of all hard binaries in an N -body simulation
of a star cluster, with N = 8192. The horizontal banding at the bottom results
from the fact that the radius was given to only two decimal places; similarly the
vertical banding at the left. The vertical banding at the right results from a few
persistent hard binaries escaping to large radii and (in most cases) drifting back
to the core. The model was a initially a Plummer model, with equal masses and
no tide.
23 Gravitational Burning and Transmutation 243
part of the cluster equals the contribution which this part makes to the
central potential. Thus for a shell of material of mass M and radius r,
the heating is GM m/r. By this argument we also see that, if the star
escapes from the core but not from the cluster, it does not heat the core
alone, but instead all the material between its initial and final radii.
Though it is tangential to our present purpose, it is worth stating that
similar considerations apply to other forms of heating through mass loss.
S. Portegies Zwart has pointed out to us that mass loss from stellar evo-
lution is more important than one might think, because the mass is lost
by the most massive stars (which evolve fastest), and mass segregation
concentrates these in the core. It is here that the maximum energy is
generated per unit mass ejected.
Back to the binary. Eventually it is so hard that, not only does it
eject the single stars it interacts with, but it also ejects itself. Thus the
influence of each binary is self-limiting. Though a binary consisting of
two point masses can in principle release an infinite amount of energy,
the process by which it releases energy is its own bottleneck.
How much energy does the binary contribute before retiring? The
result is fairly easily estimated by adding up the contributions from the
various phases in its evolution (Box 1), which evaluates to about −24mφc .
It is usual to express the central potential in terms of the central one-
dimensional velocity dispersion, σc2 ∗ , though strictly there is no upper
bound on the ratio −φc /σc2 . Usually it is assumed that energy generation
by binaries is important in systems with small cores of very high density,
especially in late core collapse. In this case, as we have seen (Chapter 18)
the evolution of the core is self-similar, and in this case the above ratio is
constant: −φc /σ2 ' 13.5 in anisotropic Fokker-Planck models with stars
of equal mass (Takahashi 1995).
Putting our various estimates together, we find that each newly formed
hard binary eventually contributes an energy of approximately 110mσc2 .
Combining with an estimate for the rate at which such pairs form (see
Eq.(21.5)), we readily find that the rate of heating (per unit volume) is
given by
G5 m6 n3
ε ' 85 . (5)
σc7
∗
The subscript should avoid confusion with a cross section
23 Gravitational Burning and Transmutation 245
One final aspect of this problem which is less well understood and less
easy to model is how to handle the formation and evolution of binaries
in systems with a distribution of stellar masses. One interesting contrast
with the equal-mass case is the behaviour of the cross section for close
encounters (Eq.(21.4)). For stars of equal mass, as a binary hardens its
semi-major axis shrinks and so σ decreases. For stars of unequal mass,
we have already seen that a binary with one component of relatively low
mass is liable to have this component exchanged with a more massive
incomer. Now the energy of the binary need not change much, and an
increase in the mass of one component may cause a to increase. Thus the
close encounter cross section increases, because of both the increase of a
and the increase of the masses. Binary evolution has a runaway tendency
in these circumstances, at least until the components are about as heavy
as they can be.
Problems
1) For a binary on the hard/soft threshold, moving in a cluster in virial
equilibrium, use Eq.(4) and reasonable estimates for n, σ and v to
show that the number of close encounters in a relaxation time is of or-
der 1/ log Λ, i.e. the reciprocal of the Coulomb logarithm. How does
the result depend on the energy of the binary if it is hard? Use your
answer to show that the mean rate of hardening of a hard binary is in-
dependent of the semi-major axis a. Use this result in turn to estimate
the shape of the distribution of binding energies of hard binaries.
2) Assume that a binary is born with small energy, that its energy in-
creases in increments which are distributed as in Eq.(2), that the re-
sulting energy heats a cluster (with or without ejection of the single
star, as appropriate), and that the binary remains in the core until
it escapes. Perform Monte Carlo simulations to compute the mean
energy yield per binary, and compare with the result of Box 1.
3) Problem 18.2 showed that stellar-mass black holes would form a dense
core within the larger core of normal stars. In this core, black hole
binaries form by three-body encounters. Using arguments of Box 1,
estimate the number of single black holes which are ejected in the hard-
ening of each black hole binary, and hence estimate the approximate
fraction of black holes which are ejected as binaries.
Note: the black holes in some of those escaping binaries will later spi-
ral in to each other, under emission of gravitational radiation. These
binaries form a promising category for detection by LIGO, a pair of
laser-based gravitational wave detectors (Portegies Zwart & McMillan
2000).
PART VII. PRIMORDIAL
BINARIES: N = 4
246
24
Binaries in Star Clusters
In the original Star Wars movie there is a brief but memorable scene
of two stars shining down from the afternoon sky. We find it striking
because we are so used to seeing just a single star in the sky, but in fact
it is the sun that is unusual. In the neighbourhood of the sun most stars
are binaries, and some belong to triple systems or even little groups with
still larger numbers of stars (Duquennoy & Mayor 1991). Our nearest
neighbours, for example, are a binary (α Centauri) with a distant third
companion (Proxima Centauri, cf. Matthews & Gilmore 1993). Such
systems are unlikely to arise by chance encounters (Problem 1), and so
the abundance of binaries and triples suggests that most stars are born
that way. Indeed binaries and other multiples are most common among
the youngest stars (cf. Kroupa 1995).
247
248 24 Binaries in Star Clusters
though this argument turned out not to be needed, it takes the dynamics
of star clusters beyond the point-mass approximation for the first time
in this book.
Firm evidence of binaries eventually arrived, but it came from a most
unexpected direction: the discovery of variable sources of X-rays (cf.
Verbunt & Hut 1987, Grindlay 1988). Such sources were already known
elsewhere in the Galaxy, and those discovered in globular clusters were
interpreted in the same way; i.e. as binaries containing neutron stars.
The standard model is that the other star in the binary is a more-or-
less normal star whose surface layers are just on the point of becoming
gravitationally unbound by the attraction of the neutron star∗ . As this
normal star evolves its material expands slowly, and so there is a small
but steady stream of gas which leaves its surface and, by conservation of
angular momentum, orbits the neutron star in a disk. Viscosity in this
disk then slowly drags the material of the disk towards the neutron star,
and the conversion of the potential energy of the gas, as it sinks in the
gravitational field of the neutron star, into other forms of energy, gives
rise to the X-ray emission that makes these objects observable.
The discovery of these sources was the first clear evidence for the pres-
ence of neutron stars as well as for binaries, and was a major factor in
turning astrophysicists’ attention and interest to the globular clusters.
Though the inferred number of binaries was not yet large by galactic
standards, and though it was not yet clear whether they were born in
the clusters or manufactured there, the numbers of X-ray sources were
somehow far higher in globular clusters than in the rest of the Galaxy.
Suddenly these old and placid members of the Milky Way community
turned out to be fertile breeding grounds for some of the most interesting
and energetic objects in the nearby universe.
In due course a few more binaries were eventually discovered by the
radial velocity technique, though only a few percent of all observed stars
were found to be binaries (cf. Hut et al. 1992 for early developments
in this subject). Perhaps the first observers (Gunn & Griffin 1979) had
just been unlucky. At any rate it was realised that there were strong
selection effects which prevented more than a fraction of the binaries from
exhibiting observable changes in their radial velocity. One reason for this
is that it was only possible to observe the brighter stars. These are giants,
with radii considerably exceeding those of most normal stars. Therefore
∗
This process is referred to as Roche lobe overflow. The donor star is surrounded by
an imaginary surface, outside which its gravitational attraction is overcome by that
of the neutron star. The equipotential surfaces closely resemble those which control
escape from a star cluster on a circular orbit about a galaxy (cf. Fig.12.1, which
can be thought of as depicting the “Roche lobe” of a star cluster).
A brief observational history 249
very close binaries could not be expected to survive this giant stage of
stellar evolution, for the same reason that highly eccentric binaries would
not be expected. There was thus a rather limited range of separations
(between the two stars in the binary) where radial velocity binaries with
giant companions could be expected: too small, and the stars collide;
too large, and the binary would be destroyed by encounters. Difficult
though it is to allow accurately for such selection effects, by now it was
no longer at all certain that the proportion of binaries in globular clusters
was very much smaller than in the neighbourhood of the sun, except for
the destruction of wide binaries by encounters.
The hurdle having once been cleared, the evidence for binaries in glob-
ular clusters mounted steadily. Pulsars were observed in binary systems
(again by detecting a periodic Doppler shift, this time in the pulse fre-
quency; cf. Phinney 1996). One of these is even a triple system (Thorsett
et al. 1999). X-ray sources of lower luminosity, thought to be binaries
where a white dwarf takes the place of the neutron star in the brighter
sources, were discovered (cf. Livio 1996). In some clusters it may be
possible to detect binaries in colour-magnitude diagrams: unresolved bi-
naries have similar colours to the other stars but are noticeably brighter
(cf. Rubenstein & Bailyn 1997). Eclipsing binaries were discovered at
last (e.g. Fig.1), many as a by-product of searches for gravitational lenses
(e.g. Kaluzny et al. 1998)! And in recent years it has become possible to
find evidence for radial velocity binaries among the normal stars which
have not yet evolved into giants (Côté & Fischer 1996).
In short, there is now abundant evidence that a substantial fraction of
stars in globular clusters are binaries. Satisfactory as this may be to the
astrophysics community as a whole, stellar dynamicists have been a little
less wholehearted in their response. It is still easier to understand and
model clusters containing single stars, while the introduction of binaries
brings in its wake a host of new issues. Simplest, but still irksome, is
the need to decide, when constructing a dynamical model, what is the
distribution of semi-major axes, eccentricities and mass ratios of the bi-
naries. Much more complicated are the issues involving the finite radii
of the stars, the changes in their internal structure caused by their gravi-
tational interaction, and finally a whole host of questions concerning the
internal evolution of stars in binaries.
For the time being we shall continue to treat the stars in a binary as
point masses, and study the dynamical consequences of the existence of
primordial binaries. The other complications we defer to the final part
of the book. Of course it is by no means clear that these complications
really are so superficial that they can be treated as a minor perturbation
250 24 Binaries in Star Clusters
Fig. 1. The light curve of an eclipsing binary star in the globular cluster M71
(after McVean et al. 1997).
Fig. 2. Evolution of the semimajor axis of a soft binary in the Galaxy. From
Bahcall et al. (1985).
252 24 Binaries in Star Clusters
Problems
1) Estimate the rate of formation of hard binaries in the Galaxy, using
Eq.(21.5).
2) Suppose that the giants in a globular star cluster have mass 0.8M
and radius 0.3AU (cf. Harris & McClure 1983), and that the threshold
between hard and soft binaries occurs when the mean square relative
speed of the components is 10kms−1 . For binaries with one giant
component and one small main sequence component of comparable
mass, and eccentricity e, compare the semi-major axis for contact of
the components at pericentre with the semi-major axis of a binary on
the soft/hard threshold.
3) Binaries are distributed uniformly on the sky with surface density
Σ0 per unit area. Their semi-major axes a are distributed uniformly
in log a. If distance r is measured from the location of a particular
star, the mean surface density of companions, Σ(r), is defined to be
the average value of δN (r)/(2πrδr), where δN (r) is the number of
stars with distances between r and r + δr. Show that Σ(r) ∝ r −2 ,
approximately, for small r, and Σ(r) ' Σ0 for sufficiently large r.
25
Triple Formation and Evolution
Triple systems are very familiar. The motion of the earth and moon
around the sun is lightly perturbed by the other planets, and if such effects
are neglected it is a nice example of a triple system. Furthermore the
distance between the earth and moon is much smaller than their distance
from the sun, and so it is an example of what is called a hierarchical
triple system. The dynamics of such a system can be understood, to a
satisfactory first approximation, as two Keplerian motions. In the case
of the earth-moon-sun system, one of these is the familiar motion of the
moon relative to the earth, and the other is the motion of the barycentre
of the earth-moon system around the sun. The barycentre lies within
the earth, in fact, and we are more familiar with the picture that the
earth orbits around the sun, but it is more accurate to say that it is
the motion of the barycentre that is approximately Keplerian. This was
realised by Newton (Principia, Book I, Prop LXV), and it was he who
really originated the study of hierarchical triples.
The mass ratios in the earth-moon-sun system are rather extreme.
Even though the sun is so distant, its mass is so great that it exerts
a much greater force on the moon than the earth does. Therefore it is
not immediately clear why the moon continues to orbit the earth, and
why the Keplerian approximations are successful (Problem 1). In the
same way the gravitational pull of the sun on the earth greatly exceeds
that of the moon on the earth, and so it is strange that the tides in the
earth’s oceans generally rise and fall in step with the diurnal motion of
the moon and not that of the sun.
Beyond the solar system there are triple systems in which the masses
are more comparable. In the globular cluster Messier 4, for example,
there is a triple consisting of a neutron star in a binary with what is
presumed to be white dwarf, and a more distant substellar object (a
253
254 25 Triple Formation and Evolution
massive planet) in orbit about the inner binary. The masses are thought
to be about 1.35M , 0.35M and ∼ 0.01M (Thorsett et al. 1999).
Triple Formation
How can such a system arise? Well, maybe they were born like that
(cf. Clarke & Pringle 1991). Otherwise the obvious answer, at first sight,
is capture: a third star approaches a binary, and its perturbing action
slightly unbinds the binary and binds it to the binary. Unfortunately we
have already seen a mathematical argument (Chapter 20) that three-body
capture will not work. This is also readily understood physically (Chapter
19): if the perturbation of the third star is strong enough to bind it, on
successive orbits the energy of the third star will perform a random walk
and eventually become unbound again. But if binaries are sufficiently
abundant in a stellar system, binary-binary interactions are as common
as binary-single interactions. It is then natural to ask whether binary-
binary encounters could lead to the formation of hierarchical triples with
non-zero probability.
Consider first the case in which one of the binaries is far harder than the
other. It is then very unlikely that either component of the wider pair will
make a close approach with the close pair; close, that is, compared with
the smaller semi-major axis. Therefore in most such encounters the close
pair will act like a single object with mass equal to the total mass of its two
components. Since we are assuming that the masses of all the single stars
are comparable, this binary behaves like a relatively massive object. Now
we have already seen, in our study of binary-single interactions (Chapter
19), that an exchange interaction occurs with substantial probability, and
that a massive intruder is very likely to displace a less massive component
of the binary. In the case of binary-binary encounters, then, we may
expect that the close pair will eject one component of the wider pair,
probably the less massive one. The end result is that the tight binary
pairs up with the other component, which acts as a third body orbiting
the close pair in a hierarchical triple.
It turns out, as numerical scattering calculations show, that the prob-
ability of formation of a hierarchical triple system in these circumstances
is substantial. Even for equal masses it occurs in roughly 50% of all en-
counters in which the minimum distance of the two binaries is of order
the sum of their original semi-major axes. And even when the original
semi-major axes are equal, so that neither pair can be thought of as acting
as a single object, the probability is reduced by a factor of only two or so
(Mikkola 1984). Nor is hierarchical triple formation in binary-binary en-
counters of only theoretical interest: it is plausible that the triple system
in the globular cluster M4 formed in this way (Rasio et al. 1995).
Triple Evolution 255
Triple Evolution
As before we describe a hierarchical triple as a combination of two Kep-
lerian motions: the relative motion of the “inner” binary, and the motion
of the “outer” star relative to the barycentre, B, of the inner pair (Fig.1).
Let the masses of the components of the inner pair be m1 and m2 , and
let r be the position vector of the first relative to the second. Let the
third star have mass m3 and position vector R relative to B. Then the
energy of the triple, in the rest of frame of its centre of mass, is
E = Ei + Eo + V, (1)
where Ei and Eo are the energies of the inner and outer Keplerian mo-
tions, and V is a correction for the fact that the outer star is not actually
in orbit about a star of mass m1 + m2 at B; thus
∗
except for a misprint: in their Eq.(2), for 2.2 read −2.2
256 25 Triple Formation and Evolution
B r
2
∗
Care is needed if the inner and outer binaries are in resonance. In practice this
exception is important only when the ratio of the outer and inner semi-major axes
is small enough that the hierarchy is close to being unstable (cf. Kiseleva et al.
1994)
Triple Evolution 257
the orientation of the orbit of the third body within its plane. This is
remarkable, because it implies that the magnitude of the orbit-averaged
angular momentum of the outer binary, co , is constant. The argument
leading to this conclusion is analogous to the familiar fact that the angular
momentum of a system is conserved if the Lagrangian of the system
is invariant under rotations. It may also be viewed as an application
of Noether’s Theorem (Box 7.1), or as resulting from the fact that the
angular momentum of the outer orbit is conjugate to the orientation of
the orbit, in the sense of Hamiltonian dynamics.
258 25 Triple Formation and Evolution
Now we are in a position to exploit the fact that the total angular
momentum of the whole triple system, c, is constant, for c = co + ci , and
so
c2 = c2o + 2co ci cos i + c2i (3)
is constant, where i is the relative inclination of the inner and outer orbital
planes. Now, as we have seen, co is constant on average, and so the sum
of the last two terms on the right of Eq.(3) is also constant on average.
However, we see from Eq.(2) in Box 1, and the corresponding expression
for ci , that the last term on the right of Eq.(3) is negligible, since ai ao
in a hierarchical triple, and so the average value of co ci cos i is constant.
Finally, using the definition of ci in terms of the elements of the inner
q as the fact that ai and co are constant on average, we see
orbit, as well
that c⊥ = 1 − e2i cos i is constant on average. The notation reminds us
that c⊥ is proportional to the component of the inner angular momentum
perpendicular to the outer orbit, but we have omitted a factor involving
several quantities which are either constant or constant on average.
This is our first interesting non-trivial result. To investigate its con-
sequences, let us denote initial values by (0). Then, neglecting periodic
variations, what we have found is that
q q
1 − e2i cos i = 1 − e2i (0) cos i(0). (4)
Suppose, for example, that the inner and outer orbital planes are nearly
orthogonal, so that the right side is nearly zero. It follows that, as i and ei
evolve, ei may reach values close to ei = 1, (depending on the maximum
value of cos i). Then the distance of closest approach of the components
of the inner binary may be much smaller than their semi-major axis,
and may even be smaller than the sum of their physical radii. (We are
remembering now that these are not point masses, but stars.) This has
obvious astrophysical consequences, but of course it does not follow, just
because high eccentricities are possible, that they are inevitable. We still
have some work to do.
Before we go on it is interesting to mention another context in which
similar phenomena arise. It concerns a fascinating group of comets named
the Kreutz group, after the 19th century astronomer who first drew at-
tention to them. They are more usually called the “sungrazers”, because
they approach the sun’s surface to within a distance of about a solar ra-
dius. It has been proposed (Bailey et al. 1992) that they originated in
orbits of very high inclination and much larger (i.e. more normal) peri-
helion distance, and that the perturbations by the planets have evolved
their orbits so that they now have relatively low inclination and very high
eccentricity.
Triple Evolution 259
where ω (the ‘argument of pericentre’) is the angle between the major axis
of the inner orbit and the line in which the two orbital planes intersect
(the ‘line of nodes’).
Now we can visualise the secular evolution of the triple system. In
Eq.(5) there are really only three variables: ei , i and ω. Also, hhV ii is
a constant, which can be evaluated from the initial conditions, and so
Eq.(5) states that the evolution takes place on a certain surface in the
space defined by these three variables (Fig.3). Eq.(4) defines another
surface in the same space, and so the evolution must take place on their
intersection, which is a curve. It can be seen from the intersections that,
though the evolution takes two rather different forms, in both forms the
evolution of the eccentricity and the inclination are oscillatory. What
ω
i
∗
Not to be confused with energy!
260 25 Triple Formation and Evolution
6
5
6
4
4 ω3
ω 2
2 1
0 1
1 0.5
3 e
2 i 0.5 0 0
2.5 2 1.5 1 0.5
1 e 3
i
0 0
100
90
Inclination
80
70
60
50
40
30
0 5000 10000
Time
1
0.9
0.8
Eccentricity
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0 5000 10000
Time
Fig. 4. Evolution of the inclination (degrees) and inner eccentricity in a hier-
archical triple system in which the initial inclination was 90◦ . Both inner and
outer orbits were initially circular.
∗
Principia, Book I, Prop. LXVI, Cor.XI. See also Airy 1884, Moulton 1914.
262 25 Triple Formation and Evolution
of nodes. The same thing happens when m1 passes below the orbital
plane of m3 , and when m3 itself is in other positions.
We can easily estimate the angular velocity of precession, although in
Box 2 we do so only in the case in which ei ' 0, by way of illustration.
The result is that the period of precession is
2π 4 m1 + m2 + m3 Po2
= (1 − e2o )3/2 hcos ii, (6)
Ω 3 m3 Pi
where Pi and Po are the inner and outer periods, respectively, and the
average of cos i is taken over one Kozai cycle. This result implies that,
for given masses, eo and mean inclination, the periods are in geometric
progression: the outer period is of order the geometric mean of the inner
and precession periods. This fact, which was known to Newton∗, is a
general rule of thumb for the time scale of long-period perturbations in
hierarchical triples, and for some observed triple systems it leads to a
time scale short enough that the effects are detectable within a few years
(Jha et al. 2000). It also explains the sense in which the perturbations by
a distant or low-mass companion are weak: the evolutionary effects are
∗
Principia, Book III, Prop.XXIII
Triple Evolution 263
just as large as for a close or massive companion, but take much longer
to develop.
Anyone who has taken more than a casual interest in eclipses of the
sun and moon will be aware of our last result on time scales. Eclipses
occur at two times of the year, called “eclipse seasons”, when the sun lies
close to the line of nodes, i.e. the line of intersection of the orbital planes
of the moon (about the earth) and the earth (about the sun.) Because of
the perturbation of the sun on the earth-moon system, the orbital plane
of the moon precesses, the line of nodes rotates in the plane of motion of
the earth, and so the time of year at which eclipse seasons occur slowly
changes. The period of the precession in this case is 18.6 yr. Since the
ratio of the month to the year is about 13.4, Eq.(6) predicts 17.8 yr.
It is appropriate to end where we began, with the most familiar of
all hierarchical triple systems. It was here, moreover, that the peculiar
property of high-inclination orbits found one of its earliest applications
(Lidov 1963). It is little wonder that the moon is not on a high-inclination
orbit!
Problems
1) Explain why the moon continues to orbit the earth even though the
force exerted by the sun on the moon is much larger than that exerted
by the earth on the moon. Explain why the lunar tide is higher than
the solar tide, even though the gravitational attraction of the moon
on the earth is much weaker than that of the sun.
2) Use Eqs.(4) and (5) to compute the maximum eccentricity of the inner
orbit if it is initially nearly circular.
3) Complete the derivation of Eq.(5).
26
A Non-Renewable Energy Source
264
26 A Non-Renewable Energy Source 265
body system. Again, once a hard three-body binary has been formed it
is very difficult to destroy: it keeps hardening and heating the stellar sys-
tem until it is ejected. By contrast we have seen in the previous chapter
that binary-binary encounters can easily destroy one of the participants,
perhaps forming a hierarchical triple system, and this suggests that the
useful energy inside a population of primordial binaries is limited. On
the other hand, it is clear that the cross section for an encounter between
one binary and another exceeds that for interaction between the same
binary and a single star: gravitational focusing is somewhat stronger,
because the participating masses are larger, and the physical size of the
participants is somewhat greater. When we weigh these considerations
together, it is not very clear whether or not binary-binary interactions
will be as effective a source of heat as three-body interactions.
Let us look at these interactions in a little more detail. We saw in
Chapter 22 that, even for three-body interactions, the number of param-
eters required to specify an interesting set of encounters is already rather
substantial, and this problem is much worse for binary-binary encounters.
Therefore we shall make matters as simple as possible by considering stars
of equal mass. It might be thought also simplest to consider binaries with
equal initial energy or semi-major axis, but for our present purposes it is
helpful to return to the rather extreme form of binary-binary encounter
considered in the previous chapter, in which one binary is much harder
than the other.
Broadly speaking, the outcome of a binary-binary encounter may be
classified in the same way as three-body interactions: fly-by, resonance
and exchange (Box 1). (With our choice of parameters the closest analogy
in three-body scattering is encounters between a binary and a single star
in which the mass ratios are 1 : 1 : 2, the last of these being the harder
binary.) Indeed numerical scattering experiments show that the three
outcomes occur in roughly comparable amounts, for impact parameters
in the range in which exchange and resonance encounters can occur.
A fly-by is the easiest to understand by three-body analogies. The
less hard binary is hardened (by an amount of order its initial energy, or
h∆iε on average, cf. Chapter 23) and both reaction products retire from
the encounter at increased speed. Thus the encounter is exothermic (on
average), and both binaries survive. The result is the same if the harder
binary is captured in a “hierarchical resonance”, which will eventually
decay again.
Prompt exchanges are also easy to understand. The massive incomer
displaces one component of the initially wider binary, without necessarily
altering its semi-major axis much, and increases the binding energy of
266 26 A Non-Renewable Energy Source
Hierarchical
Resonance
Resonant
Decay Flyby
Resonant
Exchange
Ionisation
Prompt
Exchange
Hierarchical Triple
Fig. 1. Some of the reaction routes for interactions between one binary and
another much harder one. Initial state: double box; intermediate state: dashed
box; final state: solid box.
Box 1 (continued)
In general (when there is no restriction on the hardness of either binary)
the outcomes may be classified as follows. Let x, x denote the components
of the first binary, and y, y those of the second binary. We use notation in
which stars which are bound into a binary are enclosed in square brackets,
whereas braces (i.e. {}) are used to group objects which are not bound to
each other. Thus the initial configuration of a binary-binary encounter is
represented by {[x, x], [y, y]}. With this notation, Table 1 lists the various
possible final outcomes. For those with an asterisk, a new configuration is
obtained by interchanging x and y. Some listed outcomes would be very
rare, e.g. those involving a bound triple system such as [x, x, y] (Fig.2).
Table 1
Stable outcomes of a binary-binary encounter
Box 1 (continued)
that case the result is a hierarchical triple and a single escaper. But we
must pause before concluding that the result is like that of an exchange
encounter. During the evolution of the triple system, the three stars (one
of which is a close binary) have many opportunities for a close encounter.
Indeed numerical studies show that it is quite likely that, at some stage
before the evolution of a triple system is complete, the smallest separation
of some pair of stars is no more than about 1% of the initial separation
of the binary (Hut & Inagaki 1985), which here means the wider binary.
Now we did assume that the other binary was much harder than the wider
binary, but we now see that even a factor of 100 in hardness may not be
enough to ensure that the binary-binary interaction behaves like a triple
interaction, if a democratic resonance is formed. If indeed there is a close
interaction between the hard binary and one component of the wider pair,
this is likely to lead to the ejection of that component, and possibly also
of the binary. If the latter happens, the result of the encounter is a single
binary and two single stars; the energy released will be comparable with
that of the initially harder binary. If, however, the binary remains bound
to the other component of the originally wider binary, then the resulting
triple system may be either hierarchical or unstable.
This has been a somewhat elaborate discussion, which, incidentally,
illustrates that the problem of classifying these events automatically is
non-trivial. The bottom line, however, is that binary-binary interactions
release energy comparable with that of the less hard component (and
sometimes that of the harder component), but that the probability of
destroying one of the binaries is high, certainly above 50%. It follows
that the maximum energy that may be extracted from a binary, up to
the point of its destruction, is of order its binding energy. This is very
different from the behaviour of three-body binaries, whose energy yield
is determined by the central potential (Chapter 23). In other respects,
however, much the same considerations apply as with three-body binaries.
In particular, reaction products of binary-binary interactions are often
found well outside the core, from which they drizzle back into the core
by dynamical friction (Hut et al. 1992).
With these conclusions in mind let us return to the problem of a stellar
system containing a modest proportion of binaries initially, say f by
number∗ . The simplest way of estimating their effect is by comparing the
energy that they may release with the energy of the whole cluster. The
latter consists of two contributions: the internal energy of the binaries,
Eint , and all the rest, which by contrast we refer to as “external” energy,
∗
This means that, if the cluster contains N objects, where an “object” may be either
a binary or a single star, there are f N binaries
270 26 A Non-Renewable Energy Source
Problems
1) Following the guidance in Box 1, construct a list of temporary (in-
termediate) outcomes of a binary-binary encounter. Hence try to con-
struct pathways, as in Box 1 (Fig.1), leading to each permanent out-
come on the list in Table 1. (For some practical examples, see Aarseth
& Mardling 2001.)
2) Estimate the cross section for close interactions between two hard
binaries. If one is much harder than the other, how is the result
altered if one seeks the cross section for significant perturbation of
both binaries?
3) Obtain the starlab package
(http://www.sns.ias.edu/∼starlab/starlab.html and Box 22.1). The
command
scatter -i "-M 1 -r 0 -v 1 -t -a 1 -e 0 -p -a 1 -e 0"\
-t 100 -D 0.01|xstarplot
will generate a random head-on, equal-mass, binary-binary encounter.
By viewing the plot and inspecting the analysis of the final configura-
tion, try to follow the interaction via Box 1, Fig.1, or its extension to
binaries with comparable energies (Problem 1).
PART VIII. POST-COLLAPSE
EVOLUTION: N = 106
272
27
Surviving Core Collapse
The last 8 chapters, dealing as they have done with interactions between
only three or four stars, might seem a long digression away from the
subject implied by the title of this book. Yet we shall see, as we take up
the thread of the million-body problem where we broke off at the end of
Chapter 18, that an understanding of the behaviour of few-body systems
is crucial in following the evolution of the system through core collapse
and beyond.
We left the system rushing towards core collapse, its central density
rising inexorably, so that it would reach infinite values in finite time.
How is this catastrophe averted? In fact there is no shortage of choices,
for at least five different mechanisms have been proposed over the years.
Admittedly, two are rather out of favour at present: a central black hole
(e.g. Marchant & Shapiro 1980), or runaway coalescence and evolution of
massive stars (Lee 1987a, and Problem 1). The other three involve binary
stars in one guise or another, and it is not hard to see why this is attrac-
tive. After all, the mechanism responsible for core collapse is a two-body
one (Chapter 14). Therefore it is clear that higher-order interactions,
which we have neglected so far, might in principle eventually compete
with two-body relaxation when the density becomes high enough. And
three-body interactions can create binaries (Chapter 21 and Fig.1).
While the creation of binaries in three-body interactions at least brings
binaries into the picture, it is not the principal mechanism by which
core collapse is halted. Nor is this the only way in which binaries may
enter the scene. In this chapter we examine three possibilities: besides
“three-body binaries”, which is the name given to binaries formed in
the way we have just mentioned, we consider “tidal capture binaries”,
formed in dissipative encounters between two single stars (Chapter 31)
and “primordial binaries”, formed along with the single stars at the birth
of the stellar system (Chapter 24).
273
274 27 Surviving Core Collapse
Three-Body Binaries
3
2
2 1
1
0
-1
-2
-3
-4
-5
-6
-7 3
-8
-14 -12 -10 -8 -6 -4 -2 0
Fig. 1. Formation of a binary in a triple encounter. The stars are labelled at the
start of the motion. After a close encounter between 2 and 3, particle 1 absorbs
enough energy to bind them.
∗
A term 4πr2 ε would be added to the right hand side, where ε is the rate of generation
of energy per unit volume, much as in Eq.(1), Box 9.2.
Two-Body Binaries 277
Two-Body Binaries
In the problem we have just been discussing, the stars are effectively point
masses, and all are single stars initially. This is the cleanest scenario for
the investigation of “core bounce”, but it is essentially unrealistic. In fact
stars have finite radii (though the point-mass approximation is adequate
for the degenerate stars, such as neutron stars), and the proportion of
primordial binaries is non-negligible (Part VII).
Consider first a system consisting of single stars with finite radii, i.e.
we continue to ignore primordial binaries. Binaries can now form by tidal
interactions, and no third body is needed to carry off the energy required
to bind them (Chapter 31). The important point is that formation of
binaries by this process is a two-body process. It is not immediately clear,
therefore, that it becomes increasingly important as the core collapses,
because collisional relaxation (which is driving the collapse) is also a two-
body process. Nevertheless it turns out that tidal capture does eventually
dominate. Indeed studies show that, in almost all star clusters, this
process would become important well before binary formation in three-
body interactions had a chance to halt core collapse (Inagaki 1984). We
shall not pursue these questions further, however, partly because the
subsequent fate of a binary formed by tidal capture is a controversial area
278 27 Surviving Core Collapse
(see Chapter 31 again), but also because this process is itself dominated
by interactions involving primordial binaries.
Primordial Binaries
Now we return to the assumption that the radii of the stars can be
neglected, and consider what happens if a certain proportion of the stars
present initially are binaries. As in the previous case, interaction between
binaries, or between binaries and single stars, is really a two-body process,
in the sense that its rate is proportional to the square of the space density.
Just as with tidal-capture binaries, therefore, it is not quite obvious that
the energetics of these reactions can compete with that of relaxation, but
in fact it does, at least if the proportion of binaries is high enough.
To understand why this is so we have to consider two effects which
enter the picture but work in opposite directions. The first effect is that
primordial binaries act in many respects like a species of heavy star. By
the process of mass segregation, therefore, primordial binaries accumulate
in the core, where their abundance steadily rises above its initial value as
the core collapses. This greatly enhances the efficiency of binary-binary
encounters in particular. The second effect is the destructive nature of
binary-binary encounters, which often convert the two binaries into one
binary and two single stars (Chapter 26). It turns out, however, that this
is insufficient to prevent the binaries from halting core collapse.
It seems very likely that it is primordial binaries which would bring core
collapse to a halt in a real star cluster, rather than either three-body or
tidal-capture binaries. Therefore it is worth re-estimating the conditions
under which core collapse will cease if we now consider binary-binary
heating rather than that due to the formation and hardening of three-
body binaries. The result (Box 1) is a condition of the form
density of stars. This is in marked contrast with the arrest of core collapse
by the production of three-body binaries, where the number of stars in
the core at core bounce is independent of N . In that case, the depth
at which core collapse is arrested (as measured by the increase in the
central density) increases with N . When primordial binaries are present,
however, the depth of collapse at core bounce is almost independent of N ,
and deep collapses are prevented (Fig.2). One may say that the essential
physical reason for the difference in the two cases is that binary-binary
heating and mass segregation are two-body processes, while binary-single
heating is a three-body process.
There are, it is true, considerable approximations and simplifications
behind this assertion. In particular, primordial binaries do not last for-
ever. Some are consumed in arresting core collapse, and we must consider
whether enough survive to prevent a later recollapse before the cluster
dies (Chapter 29). For the meantime, however, core collapse has been
brought to a halt, and in the next chapter we consider what options the
system has for its subsequent behaviour.
Core radius
0.1
0.01
0 100 200 300 400 500 600 700 800
Time
Fig. 2. Core collapse with primordial binaries. Two N-body models contain
2425 objects. In one model (dashed line) 75 of these these are single stars with
twice the mass of the other stars, in the other (solid line) they are binaries. For
further details see Heggie & Aarseth 1992, where they are referred to as models
I and S, respectively.
282 27 Surviving Core Collapse
Problems
1) Loss of mass m from the core leads to an effective heating of the
cluster by an amount mφc , where φc is the central potential (Chapter
23). Suppose that some kind of stellar explosion leads to mass loss at
a rate Ṁ < 0 from the core. Neglecting numerical factors, show that
this is sufficient to arrest core collapse if −Ṁ ∼ Mc /trc , where Mc
is the mass of stars in the core and trc is the central relaxation time.
If the mass ejection is associated with stellar evolution, we may sup-
−Ṁ
pose that the associated luminosity of the core is Lc ∼ L ,
M /1010 yr
since the lifetime of the sun is of order 1010 yr. If the cluster is al-
most isothermal (Chapter! 8) out to the half-mass radius, deduce that
10
10 yr
Lc ∼ L rh /rc , where L is the luminosity of the cluster (in
trh
the usual astrophysical sense, but not as in Eq.(3)). (This would imply
a huge core luminosity; cf Goodman (1989), who considers the same
issue after core collapse.)
2) Write down the equilibrium equations in the gaseous model (Box
9.2). Show that no reasonable solution is possible if the rate of energy
generation, ε, takes the form in Eq.(23.5). Try to formulate a physical
understanding of this result.
28
Gravothermal Oscillations
In the last chapter we saw that core collapse ends in what is (by stellar
dynamics standards) a blaze of energy, which is emitted in interactions
involving primordial binaries. Dramatic though that sounds, the real
climax of the whole book is reached in the present chapter, where for the
first time we catch a glimpse of the entire lifespan of a cluster. Admittedly
we concentrate here on a highly idealised cluster, isolated from the rest
of the universe, consisting of stars of equal mass, and totally devoid of
primordial binaries. In the next chapter we shall relax some of these
idealisations.
rh ∝ GM 2 /|E|, (1)
283
284 28 Gravothermal Oscillations
∗
More precisely it is what is sometimes referred to (e.g. Giersz & Spurzem 2000) as
the external energy, i.e. the part not locked up inside binary stars (which is referred
to as the internal energy).
†
Perhaps a better analogy is the burning of deuterium.
Steady Post-Collapse Expansion 285
∗
See the first footnote on the previous page.
286 28 Gravothermal Oscillations
time since the end of core collapse. Using Eq.(14.9), we may deduce that
3/2
M 1/2 r
t − tcoll ∼ 1/2 h , (2)
G m log Λ
∗
Continuing an earlier remark, it is tempting to draw an analogy with the evolution
of red giants. That is too controversial a topic, however, for the analogy to be
enlightening.
Unsteady Post-Collapse Evolution 287
0.5
0.48
0.47
0.46
0.45
0.44
-3.5 -3 -2.5 -2 -1.5 -1 -0.5 0
Log radius
Problems
1) According to an isolated model of Hénon (1965) the time scale
of evolution in post-collapse expansion is given by rh /ṙh ' 10.8trh ,
where trh is the current half-mass relaxation time. N -body models of
this phase show that the number of particles diminishes (by escape)
roughly according to d ln N/d ln rh ' −0.12 (Giersz & Heggie 1994),
while the loss of mass in the precollapse phase is a few percent and
may be neglected here. Fokker-Planck models show that the duration
of core collapse for a Plummer model is about 18trh (0) (Spitzer &
Thuan 1972, Takahashi 1995), where trh (0) is the initial value. Es-
timate the entire lifetime of the model (i.e. until N ' 2) in units of
trh (0).
2) Several toy models illustrating gravothermal behaviour were investi-
gated by Allen & Heggie (1992). Among them was the simple system
dT1 Γ (4−α)/(α−2)
= ξ(T1 − T2 ) − cT1
dt Γ−1
dT2 1 −(6−α)/[2(α−2)]
= ξ(T1 − T2 ) − (T2 − 1)T1
dt Γ−1
where T1 is the temperature of the core, T2 is the temperature of
the region where a temperature inversion may develop, and Γ ' 1.28,
α ' 2.23 and ξ ' 0.00037 are constants. The parameter c ∝ N −2 , and
this term corresponds to binary heating. The forms of this and the
other terms were devised by simple phenomenological considerations
of gravothermal behaviour.
Using appropriate numerical or analytical tools, demonstrate the
following: (a) for sufficiently large c there are no equilibria; (b) for
small enough c there are two equilibria; (c) as c decreases, the hotter
equilibrium first changes from a stable node to a stable focus; (d) as c
decreases further, the hotter equilibrium loses stability and throws off
a stable limit cycle in a Hopf bifurcation.
These changes correspond physically to the following: (a) when N
is very small, the presence of a binary leads to the rapid expansion of
the core; (b) for larger N a stable balance is possible between binary
heating and gravothermal cooling; (c) for still larger N , the approach
to this stable balance has the nature of a damped oscillation (as is
observed in gas models); (d) for large N the balance between heating
and cooling is a dynamic (oscillatory) one.
A further variable is needed before a simple model like this can
demonstrate chaotic behaviour similar to well developed gravothermal
oscillations.
Unsteady Post-Collapse Evolution 293
The previous chapter dealt mostly with a highly idealised model. All
stars were single and had the same mass, and the system was isolated.
As we saw, even the presence of a spectrum of stellar masses changes
the picture, as it is found that gravothermal oscillations set in only for
considerably larger values of N . In the present chapter we shall also see
that the presence of primordial binaries further weakens their probable
relevance. Even when gravothermal oscillations do occur, they seriously
affect the structure of only the innermost 1% or so of a cluster. Therefore
in this chapter we concentrate once more on the steady post-collapse evo-
lution of a stellar system. Also, we mainly have in mind a system with a
significant population of primordial binaries, and boundary conditions set
by the tidal field of the surrounding galaxy. First, however, we consider
the simpler case of an isolated cluster.
Isolated Clusters
The first thing that is changed in post-collapse evolution when we add
primordial binaries is the radius of the core. A similar argument to that
of Box 28.1, shows that the ratio rh /rc is now almost independent of N
(cf. Problem 1). In fact the ratio depends more on the proportion of
binaries (which decreases as the binaries are consumed).
These statements greatly weaken the clues to the occurrence of gravo-
thermal oscillations which we discussed in the case of post-collapse evo-
lution powered by three-body binaries (Chapter 28). In the first place,
the ratio rh /rc at core bounce is also independent of N , if core collapse
is halted by primordial binaries (Chapter 27). Therefore, if there has to
be a period of adjustment to post-collapse conditions, at least it does
not get more serious as N increases. Second, the post-collapse core does
not get smaller and smaller as we consider systems with larger and larger
N . Therefore there is no need to posit a large (and unstable) isother-
294
Isolated Clusters 295
mal region within the half-mass radius. Finally, it is quite possible that
this post-collapse core is as large as that in a modest N -body system
powered by three-body binaries, and we know that these do not exhibit
gravothermal oscillations.
The only worry is that a system may not have enough primordial bi-
naries to sustain a sufficiently long period of post-collapse expansion. A
substantial fraction were already burned simply in halting core collapse
(Chapter 27), and the residual population is gradually used up in pow-
ering the post-collapse evolution. After that, the only effective heating
mechanism left would be the formation and hardening of three-body bi-
naries. The point at which this first happens clearly depends on the
initial proportion of binaries. In early simulations (Spitzer & Mathieu
1980) it was found that core collapse was delayed by less than the time of
core-collapse, even for large binary fractions. Nowadays, however, there
is a consensus (with various methods and authors) that, if the initial
proportion of binaries is of order 10% (depending on the distribution of
binary parameters), then the binaries can keep gravothermal oscillations
at bay for a time about three times the length of the phase of core col-
lapse (Fig.1; see also McMillan & Hut 1994). Even this may, however, be
less than the lifetime of a number of globular clusters.
Fig. 1. Evolution of a cluster with primordial binaries, after Gao et al. (1991).
From top to bottom the curves are the half-mass radius of single stars, all stars
and binaries, and the core radius. All are plotted against time. Mass segregation
during core collapse is followed by a relatively steady post-collapse phase until
the binaries are exhausted, at which point gravothermal oscillations begin.
296 29 Dissolution
With this caveat we assume that the evolution of the core need not
distract us, and turn to the evolution of the cluster as a whole. We
already saw in Chapter 28 what happens if the system is isolated and
has stars of equal mass. When a spectrum of masses is present we might
expect our simple picture to be spoiled by the continued action of mass
segregation. In fact, however, N -body simulations suggest that our simple
picture holds up remarkably well. They show that mass segregation slows
dramatically towards the end of core collapse, and that thereafter the
profile of mass segregation does not alter very much (Fig.16.1). Simple
relations like
3/2
M 1/2 r
t − tcoll ∼ 1/2 h , (1)
G m log Λ
(cf. Eq.(28.2)) apply with unexpected accuracy even when there is a
broad spectrum of stellar masses, though the reasons for our good fortune
in this respect are not understood.
of evolution, we see that the evolution speeds up as the system loses mass,
and in fact should come to an end at a finite time in the future, tend , say.
The time dependence is therefore obtained by equating the right-hand
side of Eq.(1) to the time remaining. Using Eq.(2) again, it follows that
M ∝ tend − t (cf. Fig.2), and that the remaining lifetime of the cluster
is proportional to the current half-mass relaxation time. In the model of
Hénon (1961) the constant of proportionality is approximately 22.4trh .
The tidal truncation progressively removes the stars of highest energy.
By mass segregation these tend to be the stars of lowest mass. The
result is that a dying cluster contains an increasing concentration of the
more massive objects. As in panning for gold (or neutron stars), the tide
washes out the lighter particles. Incidentally this helps to compensate for
the destruction of binaries in the binary-binary reactions that we have
relied on to sustain the post-collapse evolution.
Stars which escape from the cluster, or which are washed across the
tidal boundary, do not necessarily leave the vicinity of the cluster quickly.
Unless of high energy, e.g. escapers resulting from three-body interac-
60000
50000
Mass (solar masses)
40000
30000
20000
10000
0
0 5000 10000 15000 20000 25000
Time (Myr)
Fig. 2. Loss of mass in tidally limited N -body models. From right to left the
initial number of stars is N = 1024, 2048, 4096, 8192, 16384 and 32768. The
models have been scaled to a real cluster with initial mass 6 × 104 M in a tidal
field of a certain strength, and with the same mass function and initial structure
(see Heggie et al. 1998). The scaling is imperfect, for reasons which are discussed
in Baumgardt (2001). The modest increase in the escape rate after about half the
mass has escaped is associated with the change from pre-collapse to post-collapse
evolution.
298 29 Dissolution
tions, they cross the tidal boundary at low speed in the vicinity of the
Lagrangian points (Chapter 12). Thereafter they move off under the ac-
tion of tidal and inertial forces to form “tidal tails” of escaping matter
(Fig.12.3). When the cluster as a whole dissolves, in the sense that its
tidal radius shrinks to zero, there is still a residual concentration of stars
– the smile on the face of the Cheshire cat.
It is remarkable that post-collapse evolution in a tidal field was the first
evolutionary problem in the subject to be understood in detail (Hénon
1961). At that stage the theory was developed for the case of a system
in which all stars have the same mass, the distribution of velocities is
isotropic, and the tidal field is modelled as a cutoff at the tidal radius.
Empirically, however, it has been found that much the same conclusions
apply to much more realistic systems: unequal masses, anisotropic ve-
locity distribution, anisotropic tidal field, primordial binaries, etc. The
nearly linear dependence of the mass on time is one of the most robust
and memorable results in the whole of this subject, and a nice exam-
ple by J. Stodólkiewicz appears prominently on the front cover of the
proceedings of IAU Symposium No.113 (Stodólkiewicz 1985).
Problems
1) Use estimates in Box 27.1, to show that the total heating rate of pri-
mordial binaries in the core is of order f 2 mσc3 /rc , where f is the binary
fraction, m is the individual stellar mass, σc is the central velocity dis-
persion, and rc is the core radius. Deduce that, in steady post-collapse
expansion powered by primordial binaries, the ratio rc /rh is almost
independent of N .
2) (Spitzer 1987, p.59) In homologous evolution in a tidal field show that
the total energy varies with mass according to
d ln |E| 5
= .
d ln M 3
If stars escape across the tidal boundary, modelled as a cutoff at
the tidal radius rt , with negligible excess energy, show that Ė =
−(GM/rt )Ṁ, and deduce that E = −0.6GM 2 /rt . Infer that rt = 2.4
in N -body units (Box 1.1). (In the limit W0 → 0 King models have
this limiting radius [cf. Problem 8.6]. Most models have larger tidal
radii, however [cf. Fig.8.3] and require an additional source of energy
for homologous evolution.)
3) Suppose the estimate in Problem 1, with rc replaced by rh and σc the
rms 1-dimensional speed, can be used to estimate the rate of heating
in an isolated uncollapsed cluster with half-mass radius rh , and that
mass is lost by evaporation (Eq.(9.3), where tr is estimated by the half-
mass relaxation time trh ). Using standard estimates for a virialised
Tidally bound clusters 299
300
PART IX. STAR CLUSTER ECOLOGY 301
cussed and their interrelations, and present the overall evolution of a star
cluster, from birth to death, in a simplified diagram which encapsulates
the gross aspects of its spatial structure. We consider again the timescales
on which the evolution takes place. Finally we consider how it is that the
clusters we see have managed to survive for most of the lifetime of our
Galaxy, and the rate at which these last survivors are perishing.
30
Stellar and Dynamical Evolution
302
30 Stellar and Dynamical Evolution 303
Internal
Dynamics Stellar
Constituents
Core collapse
Mass segregation
Variable stars
Stellar escape
Mass spectrum
4-body encounters
Binaries
3-body encounters
LMXB
2-body encounters
msec pulsars
Stellar velocities
blue stragglers
Anisotropy
BHB stars
Gravothermal oscillations
Abundances
CM diagrams
External
Colours
Effects Black holes
Tidal truncation Stellar evolution
Hierarchical triples
Disk shocking
Bulge shocking
Fig. 1. Some of the problems of globular star clusters, and the interrelations
between them. Arrows between topics on the same side of the diagram are not
shown. Not all arrows imply cause and effect; the “stellar velocities” to “black
holes” means that the former could be an observational indicator of the latter;
in some cases the link is indirect. The diagram is considerably simplified: in fact
the mass spectrum affects almost everything on the left, and any of the first six
categories on the left could affect the CM diagram. Acronyms include: low mass
X-ray binary (LMXB), blue horizontal branch stars (BHB stars, also known as
“faint” or “hot” horizontal branch stars) and colour-magnitude diagrams (CM
diagrams).
therefore play a vital role in the evolution of star clusters. The term ‘ecol-
ogy’, introduced by Heggie (1992), captures the essence of this interplay.
We have already seen one way in which the evolution of the stars affects
the dynamics of a cluster. Towards the end of a star’s life on the main
sequence it loses much of its mass, and, as many stars complete their evo-
lution, the resulting unbinding of the cluster may even cause its complete
dissolution (Chapter 11). Other influences of stellar evolution on stellar
dynamics may be more subtle. According to point-mass dynamics, for
example, a hard binary tends remorselessly to get harder (Chapter 19f).
When we bear in mind that stars have finite radii, however, and that these
radii increase dramatically during certain phases of stellar evolution, the
resulting shrinking of the binary orbit cannot continue indefinitely. This
304 30 Stellar and Dynamical Evolution
may limit the energy which each binary may release, and its potential
importance for dynamical evolution.
Astronomers were also slow to realise the existence of links in the op-
posite direction, i.e. the role of dynamics in shaping stellar evolution.
But the evidence that stellar evolution in star clusters was different from
stellar evolution elsewhere gradually built up, and forced a paradigm shift
that is still not complete.
The earliest evidence of environmental factors in stellar evolution was
the discovery almost half a century ago by Sandage (1953) of “blue strag-
glers”, stars that seem to be substantially younger, and hence bluer, than
the bulk of the stars in a star cluster (Fig.2). The implications of this
discovery were not absorbed by the community at large, and for the next
twenty years, globular clusters still kept most of their secrets hidden.
Both authors of this book remember how, when we took our first course
in astronomy, globular clusters were portrayed as some of the oldest and
dullest objects in astrophysics. Even as we began our research careers,
the dynamics of star clusters was regarded as a quiet backwater in as-
trophysics, while all the excitement was being generated elsewhere; in
high-energy astrophysics, for instance. Paradoxically, it was precisely
from this direction that attention was once more directed to these “old
and boring” objects. In the mid seventies star-cluster research received an
enormous boost from the unexpected discovery of globular cluster X-ray
sources (cf. Fig.1.3).
It soon became clear that these sources had a much higher relative
abundance in globular clusters than in the galaxy as a whole (see Katz
1975, Hut & Verbunt 1983, Bailyn 1996). While the family of globular
clusters contain less than 0.1% of the stars in the galaxy, they harbor
more than 10% of the low-mass X-ray sources throughout the galaxy. The
search was on for an explanation, and an early suspect was what makes
globulars so special, dynamically: their high density. Various scenarios
were proposed, including binaries formed in stellar collisions (Sutantyo
1975), but quickly the dominant explanation arose that neutron stars had
been tidally captured by normal stars (Fabian et al. 1975), to form new
binary stars. In these binaries, mass overflow from the normal star could
then provide the fuel to turn the neutron star into an X-ray source.
More than ten years later, from 1987 onwards, another exciting type of
object was discovered in globular clusters, in rapidly growing numbers:
pulsars, rapidly spinning neutron stars that are discovered through the
pulsed radiation they emit (Fig.3; Kulkarni & Anderson 1996). Many
of these pulsars were found to be a member of a binary system, and
30 Stellar and Dynamical Evolution 305
tidal capture was once again proposed as the main mechanism for the
formation of this other class of unusual binaries.
Soon afterwards, it is true, two important aspects challenged the use
of tidal capture as a panacea to explain any type of weird star system
in globular clusters. On theoretical grounds, closer investigation of the
mechanism of tidal capture cast doubt on its efficiency (mainly Kochanek
1992, but see also Mardling 1995, Kumar & Goodman 1996). On obser-
vational grounds, however, a new mechanism presented itself naturally,
based on the discovery of large numbers of primordial binaries (Chapter
24).
Still, the shift in outlook was permanent. It was now clear that one
could not understand the kinds of stars that were found in globular clus-
ters without taking the stellar environment into account, and, equally,
that the study of dynamics in isolation from stellar evolution led, at best,
to an incomplete picture. Nor was the evidence for this confined to exotic
kinds of star: even the relative abundance of the most ordinary stars ap-
peared to depend on the stellar density, causing observational variations
306 30 Stellar and Dynamical Evolution
in the colour of a star cluster from centre to halo (e.g. Djorgovski et al.
1991). What we shall do in the next two chapters is to describe some
aspects of this interplay between dynamics and the stellar nature of the
bodies involved.
Despite the weakening of the original motivation for considering tidal
capture, we begin there in the next chapter. This process is one of the
simplest, best studied and most robust aspects of cluster dynamics in
which the finite radii of the stars must be taken into account. Tidal
captures and stellar collisions between single stars do take place at sig-
nificant rates in globular star clusters, and are also an important aspect
of interactions involving binaries, though by no means the whole story.
Those interactions are the subject of Chapter 32.
Problems
1) In a system in which all stars have solar mass and radius, and the
(three-dimensional) velocity dispersion is 20kms−1 , compute the en-
ergy of the most energetic binary, in units of 1kT. (The solar radius is
about 7 × 105 km.) Compare the energy with that of a whole globular
star cluster.
2) Consider the arrows in Fig.1, and the mechanisms by which these
influences occur.
31
Collisions and Capture
Once upon a time it was thought that novae arose from stellar colli-
sions. In a normal galactic environment, however, where most novae are
found, collisions are rare, in the sense that a given star is very unlikely to
experience a collision during the age of the universe. The stars are sim-
ply too far apart. In very dense environments, however, such as galactic
nuclei, collisions may even be sufficiently frequent to dominate dynami-
cal evolution. Globular clusters lie in between these extremes. In them,
collisions do affect large numbers of stars, but are not normally thought
to predominate over purely gravitational interactions (i.e. two-body re-
laxation). And the restriction to single stars is an unrealistic idealisation.
Globular clusters possess large populations of binaries, which automati-
cally place stars in close proximity. All in all, it is estimated that there is
a collision between two stars somewhere in the universe every ten seconds.
t−1
coll = 8πGmnRhV
−1
i, (1)
where V is the relative speed of the two stars, and the average is taken
over all stars that may be encountered. (This formula is based on Ke-
plerian relative motion of the two stars, and assumes that gravitational
focusing is large, for which the condition is that V 2 2Gm/R; this is
satisfied in practice in globular star clusters.) For stars of solar type, the
formula may be reexpressed more intelligibly as
n km/s
t−1 ' 2 × 10 −15
yr−1 .
coll
pc−3 V
307
308 31 Collisions and Capture
That very small coefficient shows why collisions are so rare in the neigh-
bourhood of the sun, where n ∼ 1pc−3 and V ∼ 20km/s. In globular
clusters V is not so different, but the density is much higher, typically by
a factor of about 104 at the cluster centre. Hence the collision time scale
is reduced to something of order 1012 yr. Since a cluster lives for roughly
1010 yr, this means that of order 1% of stars in the core of the cluster will
have experienced a direct collision within its lifetime.
No one number can cover the complexity of this issue. In the first place
the collision timescale varies with the stellar radius and mass. Almost
all stars in globular clusters have smaller radii and masses than that
of the sun, and so the collision rate is smaller than we have estimated.
The main exception is the evolved giant stars, whose radii may exceed
that of the sun by a factor of 100. Collisions are less effective than
one might think, because of their low-density envelopes. Even so, in a
collision at 10km/s between a star of mass 1M and a giant of mass 2M ,
the maximum distance of closest approach for capture is almost as large
as the radius of the giant (Bailey & Davies 1999). More importantly,
this phase of a star’s life is relatively short, roughly 15% of its entire
lifetime (Eggleton et al. 1989), and so the contribution of these stars is
not dominant. For example, in models of the star clusters 47 Tuc and ω
Centauri, Davies & Benz (1995) have estimated that the rate of collisions
between two main sequence stars exceeds the rate of collisions between a
main-sequence star and a red giant by at least a factor of two. Such results
depend on the adopted mass function, but the required computations are
easily performed using a simple extension of Eq.(1). Furthermore, there
are now convenient simple formulae for the evolution of single stars, of
any relevant mass, metallicity and age, including formulae for the radius
(Hurley et al. 2000).
The second complicating factor is that the collision timescale depends
on the stellar density. This is smaller outside the core, but on the other
hand there are more stars there. Still, collisions are a two-body
R process,
and so the number of collisions is roughly proportional to ρ2 r2 dr, where
ρ(r) is the density at radius r. For any reasonable density profile this
integral is dominated by the vicinity of the core (cf. Problem 3).
A third point for consideration is the fact that the stellar density and
mass function, and therefore the collision rate, vary enormously because
of the dynamical evolution of the cluster, especially during collapse of
the core (e.g. Statler et al. 1987). The total number of collisions is given
by the time integral of the rate, and an argument along the lines of our
discussion of the spatial variation of the collision rate shows that late
core collapse does not dominate the total number of collisions (Sugimoto
1996); though the collision rate is very high then, this phase is over
too quickly for this to matter. On the other hand mass segregation is
The rate of collisions 309
∗
The factor of 4 on the left-hand side results from half the reduced mass of two stars
of equal mass; cf. Chapter 19.
The effect of collisions 311
Fig. 1. Collision between two main sequence stars (after Sills et al. 2001). Ma-
terial in different layers of the stars is coarsely distinguished by different shades
of gray.
the moon, the water of the oceans must move across the bed of the ocean.
The resistance to this motion is especially strong near narrow channels
in shallow seas, as in the English Channel separating France from the
British Isles. This motion also takes time, and so the maximum height
of the water takes place some time after the moon is highest in the sky.
There is an analogy here with a mildly damped harmonic oscillator which
is forced at a frequency much smaller than its natural frequency. The
forced oscillation lags slightly behind the forcing.
Now return to the encounter between two passing stars, and let us think
of one star as taking the place of the earth with its oceans, while the other
plays the role of the moon. Again the second star raises tides on the first,
and again the high tide at some location on the first star occurs some
time after the second star has passed above that spot. Looking down at
the first star, the second star sees a nearly spherical object, except for
the tides he has raised. The nearer of these is behind him, in the sense
that he has to look in the opposite sense from his orbital motion to see it
(Fig.2). Now this tidal material exerts a weak gravitational pull on the
second star, in the sense opposite to his orbital motion around the first
star. Therefore it slows down his orbital motion.
What this argument shows is that the tidal interaction between the
two stars removes energy from their relative motion. It is clear that most
of it is removed around the time of closest approach. We are also now
imagining that the two stars have approached from a large distance where
they had a non-zero relative speed, of order 20km/sec, say, and therefore a
non-zero kinetic energy of relative motion. If this kinetic energy exceeds
the energy removed tidally at closest approach then the two stars will
eventually separate again. If the reverse is true, however, the two stars
are subsequently bound in a binary. This happens if the loss of energy is
large enough, i.e. if the encounter is sufficiently close. Tidal capture has
occurred.
What happens next is somewhat controversial. Certainly, because the
two stars are bound to each other, further close encounters will occur.
Fig. 2. The tidal bulge created by a passing star. Distortions of star 2 are not
shown. See also the upper right panel of Fig.1.
The effect of collisions 313
What effect these have depends on whether the tide raised by the first
encounter has had time to subside (Kochanek 1992, Mardling 1995). If
not, at the time of the next close approach the old tide could be in advance
of the passing star, and then its effect is to unbind the pair; at least that
would be the sense in which the interaction worked, though it might not
be strong enough to succeed. In this scenario, the newly formed binary
enters a phase of chaotic evolution, sometimes becoming less bound and
sometimes more bound. The rotation of the stars presumably further
complicates the interaction, however (see, for example, Witte & Savonije
1999).
At the same time the internal structure of the stars is being altered.
The tidal interaction (except in some chaotic phases) is extracting energy
from the relative motion of the two stars and transferring it to the tide.
The dissipation of each tide further contributes this energy to the internal
energy of the stars, which should expand somewhat in consequence (cf.
McMillan et al. 1987, Tout & Kembhavi 1993). This clearly has the
effect of enhancing the strength of the tidal interaction, and of leading to
a situation where the two stars can be brought into contact. The upshot
is that a single tidal encounter between two stars, even if it did not lead
directly to physical contact between the two stars, can eventually lead
to their merger. In other words the time scale for encounters leading to
mergers is somewhat shorter than the collision timescale given by Eq.(1).
The numerical size of the enhancement is, however, not large, as the
effects of tidal dissipation decrease as a rather high power of the distance
of closest approach (Problem 4).
Problems
1) Derive Eq.(1).
2) Two stars of mass m and radius R have relative speed V initially,
when they are far apart. Estimate the mass loss in an encounter, by
computing how much mass can be ejected (from the surface of one star
to infinity) by the initial energy of relative motion of the two stars.
(The result is of course zero if V = 0, in contradiction with results of
numerical simulations; cf. Table 2 in Lombardi et al. (1996).)
3) In an isothermal model (Chapter 8) the density at large radius varies
approximately as ρ ∝ r−2 . Show that the rate of collisions is domi-
nated by the contribution from the core.
More precisely, recall that the scaled potential φ satisfies the equa-
tion
2
φ00 + φ0 = 9 exp(−φ),
r
314 31 Collisions and Capture
315
316 32 Binary Star Evolution and Blue Stragglers
is more than one way to make a blue straggler (Ferraro et al. 1993), and
theorists are fortunately not short of ways of doing so too.
Many, if not most, of the collisions between single main sequence stars
in star clusters are likely to produce blue stragglers. One reason is that
mass segregation will produce an excess of heavier stars in the central
regions of a cluster. Other reasons are the larger radii of heavier stars,
together with their enhanced gravitational focusing, both of which favour
collisions for heavier, rather than lighter stars.
An even more efficient mechanism for inducing collisions is provided
by binary–single-star and binary–binary encounters, especially those in-
volving hard binaries, which have a significant chance to lead to resonant
encounters where three or four stars are temporarily captured in a small
area of space. Similar arguments again predict a larger collision probabil-
ity for heavier stars. A special feature of 3-body or 4-body channels for
blue straggler formation is that the mass of the final blue straggler may
well be significantly larger than twice the turn-off mass, if more than two
stars are involved in producing a single merger (Fig.1).
A third way of using binaries to produce blue stragglers is through mass
(Roche lobe) overflow from one component of a suitable binary onto the
other member. This process comes in several flavours: coalescence, and
mass transfer of various kinds (Leonard 1996). None of these mechanisms,
however, require a high density of neighbouring stars, and so they are not
special to the million-body problem.
As a simultaneous illustration of the last two types of blue straggler
formation events, a calculation by Simon Portegies Zwart is reproduced in
Fig.2. The calculations were performed in the Starlab environment, using
a version of the automated three-body scattering package described in
Chapter 22, integrated with a simplified recipes for binary-star evolution.
There is nothing contrived about this particular set of initial condi-
tions. Both the single star and the binary were drawn at random from
a prescribed cluster core population of single stars and binaries. At each
timestep of a thousand years, the probabilities for close encounters be-
tween different objects were calculated, and actual collisions were carried
out in Monte Carlo fashion by spinning a random number generator to
see which stars would actually undergo an encounter. At each timestep,
those stars and binaries that did not undergo an interaction (the vast
majority) were simply evolved with stellar evolution prescriptions: fit-
ting formulas for stellar evolution tracks in the case of single stars, and
simplified recipes in the case of binaries (for more details, see Portegies
Zwart 1996, Portegies Zwart & Verbunt 1996).
318 32 Binary Star Evolution and Blue Stragglers
Fig. 1. Collision between two binaries (after Goodman & Hernquist 1991).
Eventually all four stars merge into a single massive and extended stellar ob-
ject.
turnoff star. The secondary now evolves rather more quickly, but before
it can reach the turn-off, a third star happens to pass through the system,
2 Gyr later, and is captured into a three-body resonance scattering event,
displayed with time moving horizontally from left to right, for a duration
of about a hundred years.
The outcome of the event, not surprisingly, leads to the lightest star
being ejected, in this case the white dwarf, the end product of the original
primary. The incoming 0.8M star now takes the place of the primary.
The orbit has tightened significantly, as could also be expected (hard
binaries tend to get harder; cf. Chapter 19). After another 2.4 Gyr, the
original secondary, saddled with the extra mass from the original primary,
at last begins to climb the giant track. This leads to a second phase of
tidal circularisation, in which the eccentricity induced in the three-body
encounter is erased. After another 15 Myr, mass overflow takes place,
again 0.6M is lost, of which this time 0.2M is accepted by the star
which entered as a result of the exchange. This increases the mass of the
latter from 0.8M to 1.0M , thereby turning this star into a second blue
straggler. The original secondary soon turns into a dwarf, orbiting the
1M star. This second straggler phase finally ends around T = 17.8 Gyr,
during the last phase of giant evolution.
The final state is not displayed in this figure as it takes place in the
future: 17.8Gyr exceeds almost all estimates of the age of the universe.
But we can guess what will happen. After the second blue straggler fills its
Roche lobe, the binary will shrink significantly, and most likely a common
envelope system will be formed. This is likely to lead to the merger of the
two cores, resulting in a late type giant of 1.3M . A bit later, a white
dwarf will then be left behind, with a mass of around 0.7M .
Observationally, the first blue straggler formed by the transfer of mass
in a close binary will not be distinguishable from those formed by the
“classical” route, i.e. in collisions between single stars (Fig.30.2). Only
after the dynamical exchange interaction does the system become clearly
different from “ordinary” blue stragglers: the observation of a blue strag-
gler in an eccentric orbit or in a detached binary with a main-sequence
companion is a direct indication for formation by an exchange reaction.
that interactions involving both single stars and binaries are routes which
greatly enhance the proportion of blue stragglers.
An example is the old open cluster M67, for which Hurley et al. (2001)
have constructed an N -body model which includes the full panoply of
stellar and dynamical evolution. Though open clusters are often regarded
as rather low-grade fare in the world of N -body simulations, a 15000-body
simulation was needed to simulate the last half of its evolution. Twice
that number might have been needed to cover the entire evolution, which
would not have been feasible at present for a stellar system with a rich
complement of active binaries. Blue stragglers are present in M67 at an
abundance roughly 10 times higher than would occur in a population
of isolated binaries and single stars. As this simulation showed, it is the
dense environment in M67 which accounts for its wealth of blue stragglers.
Blue stragglers are just one of the unusual types of object that cannot
form through single star evolution, but require either the presence of pri-
mordial binaries, or the interaction between two, three, or more stars in
a dense stellar system. In addition, X-ray sources, pulsars, spectroscopic
binaries and eclipsing variables have been detected in significant numbers,
some of them right in the middle of the densest central regions of globular
clusters. This diverse collection of objects forms a gold mine for dynam-
icists modelling the evolution of globular clusters. Another metaphor,
coined by Melvyn Davies (Davies 1995) is also apt: the zoo of possible
objects that can be formed in few-body interactions is bewildering, and
rather little has been studied yet in this field — for those daring to brave
this jungle, many species still remain to be discovered.
While all these binaries and binary products provide useful diagnostics
of binaries and their interactions, some of them are even directly involved
in the physical processes of energy generation which drive the expansion
of the core after core collapse (Chapters 27–29). In this sense we are
beginning to get a direct look at the central engines which power the
later phases in the evolution of globular clusters.
Here is a final conceit. One may say that star cluster dynamics is
like a ballet with about a million performers, but it is the binary stars
which take centre stage. It is in their pas de deux that most of the action
develops, and it is through their interactions with other players, in the
form of occasional pas de trois and pas de quatre, that fascinating new
patterns (and even new characters) arise.
Problems
2) A stellar system has n single stars per unit volume, all with radius R.
Assuming that two stars merge if and only if their closest distance is
less than 2R, show that the rate of formation of mergers in two-body
encounters is 4πGmRn2 h1/V i, where V is the relative speed of the
two stars.
The system also has f n hard binaries per unit volume, all with semi-
major axis a, and all components have radius R. The cross section
for binary-single encounters in which at least two participants come
within a distance d is known to be approximately
0.4
πa2 3 Gm d
σ ' 8.5 2
V 2 a a
(Hut & Inagaki 1985). Compute the rate of mergers in binary-single
encounters, and compare with the rate of single-single mergers.
33
Star Cluster Evolution
323
324 33 Star Cluster Evolution
Tidal Cluster
Overflow Death
ch.12 ch.29,33
Mass
Loss Mass Function
from Evolution
Stellar
Evolution Cluster
ch.11,30-2
Mass
Segregation
ch.16
Two-body Collisional
Encounters Relaxation
ch.14
Core
Evolution
ch.17,18,27,28
3- and 4-Body Binary
Encounters Heating
ch.19-22,25 ch.23,26
Structural Evolution
Next we shall try to picture how the system evolves under these processes,
though we shall assume that certain of them are dominant. In particular,
we imagine the evolution of a star cluster bathed in the tidal field of a
parent galaxy and dominated by the following dynamical processes. First
is the internal evolution of the stars, which causes them to lose mass; this
weakens the potential well of the cluster, which may lead to much further
loss of mass. Next, there is two-body relaxation, which leads to mass
segregation and collapse of the core. Finally, collapse is arrested by the
intervention of primordial binaries; the cluster enters the final phase of
its life, continually shedding mass across the shrinking tidal boundary.
Throughout, it is shaken intermittently by tidal shocks.
We shall summarise these results in a diagram which reduces the struc-
ture of the cluster to its characteristic radii: the core radius, rc , the half-
Structural Evolution 325
mass radius, rh , and the tidal radius, rt . To reduce the problem to one
that can be plotted on paper, we take the ratios rh /rt and rc /rh . The
first of these tells us how liable the cluster is to tidal disruption, and the
second states how evolved it is dynamically.
rc C
_
r
h A B
B´
A´
B˝
A˝
rh
_
rt
0.6
0.5
0.4
rc/rh
0.3
0.2
0.1
0
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35
rh/rt
evolution, and the cluster expands by as little as 10%. If there are many,
the cluster may dissipate altogether. Roughly speaking, the fraction of
mass lost also gives the factor by which a cluster would expand, though
if it starts life nearly tidally limited then the mass loss may well result in
the death of the cluster by tidal dissipation. What this fraction is may be
seen in Fig.11.1, though this completely ignores loss of mass by expulsion
of gas, which is likely to dominate the first few million years.
the evolutionary paths denoted by A0 and B 0 . The time scale taken for
this stage of the evolution depends on a number of factors. For a sys-
tem which starts this stage of the evolution at a point where it is tidally
limited the relaxation time scale is proportional to the number of stars
times the galactic orbital period at the location of the cluster. For other
systems, the relevant time scale for the evolution of the core is smaller,
i.e. the evolution of the core is faster.
As we have mentioned, stars escape as rc /rh drops. If the cluster lasts
long enough, however, and assuming that it was born with a sufficient
proportion of primordial binary stars, these begin to release energy in
three- and four-body reactions when the mass inside the core has shrunk
to a certain fraction of the whole, i.e. when rc /rh is small enough. What
happens next depends on whether the cluster is living comfortably inside
its tidal boundary (case AA0 ), or already tidally limited (case BB 0 ). In
the second case the cluster sheds mass, evolving in a self-similar manner
first described many years ago by Hénon (B 00 ). Otherwise the release of
energy drives an expansion of the cluster until it joins its tidally bound
neighbour. This expansion is homologous out to the radius rh , and so the
evolution in our diagram is horizontal (A00 ) until the evolutionary track
arrives at B 00 .
Though these tracks are schematic, more realistic treatments confirm
the gross features. For example, Fig.2b illustrates an N -body simulation
on a diagram with the same axes. One can readily discern stages corre-
sponding to the phases A, A0 and A00 in Fig.2a, even though this cluster
is powered by 3-body binaries rather than primordial binaries. Stage A
is quite short.
Destruction mechanisms
Finally we would like to give a feel for the time scales on which all
this action takes place, with special attention to the lifetime of a stellar
system. A cluster is under attack both from the outside (tidal disruption,
disk shocking, etc.) and from its own internal diseases (stellar evolution,
two-body relaxation). To make matters simpler (without losing relevance
to the old star clusters around us, which must have survived this phase)
we shall ignore destruction on the time scale of stellar evolution, which
must have become ineffective long ago. It is a disease of infancy.
Essentially the task is to estimate the time scales of the main processes
which cause a cluster to lose mass. We ignore tidal overflow in the sense
that this is not a primary cause of loss of mass; it would seem quite
possible for a cluster to survive indefinitely in a tidal field if there were
no other mechanism causing loss of mass (Muzzio et al. 2001). It is true
that we have no reason to suppose that clusters are born in equilibrium
328 33 Star Cluster Evolution
with their tidal field, but again any outlying matter will be stripped on
a short time scale.
The first process to consider, and the one on which much emphasis has
been placed in this book, is two-body relaxation. For its time scale we
take the half-mass relaxation time (Eq.(14.9)), i.e.
1/2 3/2
0.138Mc rh
trh '
G1/2 m ln Λ,
where m is the individual stellar mass, and Mc , rh are the mass and
half-mass radius of the cluster, respectively. If we set this equal to the
ages of the typical old galactic globular star clusters we obtain a feel for
the gross parameters of clusters which can survive the destructive effects
of this process to the present day. The result (with some refinements) is
a line at the position of the lower left boundary in Fig.3, which is a plot
of cluster mass against half-mass radius.
The next process to consider is disk shocking (Chapter 12; see also
Gnedin et al. 1999). This does different things on different time scales.
R<12 kpc
R<8 kpc
R<5 kpc
R<3 kpc
Fig. 3. The survival of galactic globular clusters (from Gnedin & Ostriker 1997).
The masses of 119 galactic globular clusters are plotted against their half-mass
radius. Different symbols are used for clusters within different ranges of galac-
tocentric distance. The three stated destruction mechanisms are thought to
control their distribution. The key at top left refers to the assumed models for
the galactic potential and the distribution of the globular clusters.
Destruction mechanisms 329
The overall energy of the cluster is altered on the time scale given in Box
12.2, Eq.(3). But the energies of individual stars are altered on the time
scale of shock relaxation, which may, like ordinary two-body relaxation,
lead to escape. Therefore we adopt a time scale of the form
Mc RV
tsr ∼ ,
GΣ2 rh3
where R is the galactocentric distance of the cluster, V is its orbital
speed round the galaxy, and Σ is the surface density of the galactic disk.
Setting this (with a suitable estimate of the numerical coefficient) equal
to the lifetime of the globular clusters gives lines at the positions of the
lower right boundaries in Fig.3. Note that the time scale depends on
location within the galaxy, especially on R: it decreases with decreasing
galactocentric radius, not only because of the explicit factor of R, but
also because Σ increases as R decreases.
Now we see that the observed clusters in our galaxy fit rather comfort-
ably into the pocket between these two constraints. They do not, it is
true, explain the absence of clusters of very high mass. This could well
be primordial, i.e. it may be that no such clusters ever existed. There
is, however, another mechanism that might have depopulated this part
of the diagram. We introduced dynamical friction (Chapter 14) as a pro-
cess which slows down stars moving through clusters, especially massive
stars. In the same way it acts on clusters moving through the galaxy, and
Fig. 4. A dying globular cluster: E3. Image obtained by use of NASA’s SkyView
facility (http://skyview.gsfc.nasa.gov) located at NASA Goddard Space Flight
Center.
330 33 Star Cluster Evolution
acts more quickly on more massive objects. Estimates show that it would
have dragged most clusters of high mass (above the upper boundary on
Fig.3) into the Galactic Centre. Together with the two processes of disk
shocking and two-body relaxation, this shows that there is a rather well
constrained triangle of survival in this diagram, where almost all of the
present-day clusters are found.
The present distribution of cluster masses is well explained by a combi-
nation of these dynamical processes and a suitably chosen initial condition
(e.g. a power law). It resembles a log-normal distribution (i.e. normal in
the logarithm of the mass), and can equally well be approximated by a
flat distribution at low masses and a power law at high masses (cf. Baum-
gardt 1998, Vesperini 1998 and references therein, Fall & Zhang 2001).
The radial distribution of clusters is also affected (e.g. Capuzzo-Dolcetta
& Tesseri 1999).
Estimates have been made of the rate at which clusters are dying
(Aguilar et al. 1988, Hut & Djorgovski 1992, Murali & Weinberg 1997,
Johnston et al. 1999). It is likely that about half of the present popu-
lation of the globular clusters in our galaxy will perish within the next
1010 years, which is about how long the present population has lasted.
There is nothing surprising in this, any more than in the fact that the
age of most individuals in the human population of the world is com-
parable with the span of a human life. The clusters which will expire
within the next billion years are sparse objects. Likely candidates are the
clusters E3 (Fig.4) and Palomar 13 (Siegel et al. 2001). Even the names
suggest that these are miserable objects, and not in the same class as the
brilliant clusters that made themselves known to Messier. Almost all of
the low-mass stars have left such an object, leaving behind the heavier
populations which escape least easily: binary stars, white dwarfs. Soon,
by galactic standards, nothing will be left.
Problems
1) Add the line in Fig.3 corresponding to the tidal boundary condition,
assuming that the rotation curve of the galaxy is flat, with rotation
speed 220kms−1 , and that rh < ∼ 0.4rt . (There is a different line for
each galactic radius R.)
2) Suppose that the relaxation time tr of a cluster is always proportional
to the time until its dissolution (Eq.(29.1)). Suppose the initial distri-
bution of tr is a power law f (tr ) ∝ t−α
r , with α ∼ 2, say. Show that
the distribution of log tr eventually evolves into a form resembling a
normal distribution.
Appendix A
A Simple N -Body Integrator
Matlab
This code is presented mainly because of the integrated graphics capa-
bilities of Matlab (Fig.1). To run the code, enter ‘nbody(25,100,10)’ in
a matlab session, assuming that a file containing the following code is in
the matlab search path. The input data are the number of particles, the
number of outputs and the end time, respectively.
The first function initialises, runs at intervals of length dt (collecting
data in y), and outputs a final result.
function nbody(n,nout,tmax)
cpu = cputime;dt=tmax/nout;t=0;
[h,x,xdot,f,fdot,step,tlast,m,y] = initialise(n);
nsteps=0;
while t<tmax
[x,xdot,f,fdot,step,tlast,m,t,nsteps] = ...
hermite(x,xdot,f,fdot,step,tlast,m,t,t+dt,nsteps);
[y]=runtime output(h,x,y,t,n);
end
331
332 Appendix A
final output(t,cpu,nsteps,y)
The next function initialises various scalars, and chooses n positions
and velocities with a uniform distribution inside spheres whose radii are
chosen to give N -body units approximately. It then sets masses, and
times when particles were most recently updated (i.e. 0, initially). Forces,
force derivatives and initial timesteps are selected (cf. Makino & Aarseth
1992). Finally a plot window for showing a movie is initialised.
function [h,x,xdot,f,fdot,step,tlast,m,y] = ...
initialise(n)
y(1)=0;figure(1)
x = uniform(n,6/5);
xdot = uniform(n,sqrt(5/6));
m = ones(1,n)./n;
tlast = zeros(1,n);
A Simple N -Body Integrator 333
for i = 1:n
[f(1:3,i),fdot(1:3,i)] = ffdot(x,xdot,m,i);
step(i) = 0.01*sqrt(sum(f(1:3,i).^2)/sum(fdot(1:3,i).^2));
end
h=plot(x(1,:),x(2,:),’o’);
axis([-2 2 -2 2])
axis square
grid off
set(h,’EraseMode’,’xor’,’MarkerSize’,2)
function [x] = uniform(n,a)
r = a*rand(1,n).^(1/3);
cos theta = 2*rand(1,n)-1;
sin theta = sqrt(1-cos theta.^2);
phi = 2*pi*rand(1,n);
x(1,1:n) = r.*sin theta.*cos(phi);
x(2,1:n) = r.*sin theta.*sin(phi);
x(3,1:n) = r.*cos theta;
Particles are updated in this next function. In the main loop, the next
particle for updating is found (i), positions and velocities of all particles
are predicted, and force and force derivative on i are computed by a
call to ffdot (see below). Higher order Hermite corrections a2 and a3
are computed and the corrections applied. Finally the next timestep is
estimated and some variables are updated.
function [x,xdot,f,fdot,step,tlast,m,t,nsteps] = ...
hermite (x, xdot, f, fdot, step, tlast, m, t, tend,nsteps)
n = size(m,2);
while t < tend
[t,i] = min(step + tlast);
dt = t - tlast;
dts = dt.^2;
dtc = dts.*dt;
dt3 = dt(ones(1,3),:);
dts3 = dts(ones(1,3),:);
dtc3 = dtc(ones(1,3),:);
xtemp = x + xdot.*dt3 + f.*dts3/2 + fdot.*dtc3/6;
xdottemp = xdot + f.*dt3 + fdot.*dts3/2;
[fi,fidot] = ffdot(xtemp,xdottemp,m,i);
a2 = (-6*(f(1:3,i) - fi) - step(i)*(4*fdot(1:3,i) + ...
2*fidot))/step(i)^2;
a3 = (12*(f(1:3,i) - fi) + 6*step(i)*(fdot(1:3,i) + ...
fidot))/step(i)^3;
x(1:3,i) = xtemp(1:3,i) + step(i)^4*a2/24 + ...
334 Appendix A
step(i)^5*a3/120;
xdot(1:3,i) = xdottemp(1:3,i) + step(i)^3*a2/6 + ...
step(i)^4*a3/24;
f(1:3,i) = fi;
fdot(1:3,i) = fidot;
modfi = sum(fi.^2);
modfidot = sum(fidot.^2);
moda2 = sum(a2.^2);
moda3 = sum(a3.^2);
step(i) = min(1.2*step(i),sqrt(0.02*(sqrt(modfi*moda2) ...
+ modfidot)/(sqrt(modfidot*moda3) + moda2)));
tlast(i) = t;
nsteps = nsteps + 1;
end
This customisable function updates the movie and collects data for
plotting the rms distance of all particles against time.
function [y]=runtime output(h,x,y,t,n)
set(h,’XData’,x(1,:),’YData’,x(2,:));
drawnow
m = size(y,1);
y(m+1,1) = sqrt(sum(sum(x(:,:).^2)))/n;
y(m+1,2) = t;
C
The following code lacks comments, but is structured like the preceding
Matlab code, so that the comments there should be helpful.
The code may be compiled with gcc∗ nbody.c -o nbody -lm, and
run with nbody 25 100 10 (say), where the data are the number of stars,
the number of outputs and the end time. It runs about six times faster
than the Matlab code. No movie is produced. To plot the accumulated
data y, gnuplot (http://www.gnuplot.org/) could be used.
#include <stdio.h>
#include <math.h>
#include <stdlib.h>
#include <time.h>
#include <assert.h>
#define NMAX 1024
#define NDIM 3
#define loop(idx,last) for (idx = 0; idx < last ; idx++)
#define min(x, y) (x<y?x:y)
typedef double real;
static int i, j, k;
void uniform(int n, real a, real temp[NMAX][NDIM]) {
real r, cos theta, sin theta, phi;
loop(i, n) {
r = a*pow(drand48(), 1./3.);
cos theta = 2.*rand()/((real)RAND MAX+1)-1;
sin theta = sqrt(1-pow(cos theta, 2));
phi = 2*M PI*rand()/((real)RAND MAX+1);
temp[i][0] = r*sin theta*cos(phi);
temp[i][1] = r*sin theta*sin(phi);
temp[i][2] = r*cos theta;
}
}
void ffdot(real x[NMAX][NDIM], real xdot[NMAX][NDIM],
real m[ ], int i, int n, real fi[ ], real fidot[ ]) {
real rij[NDIM], rijdot[NDIM], r2, r3, r5, rrdot;
loop(k, NDIM) { fi[k] = fidot[k] = 0; }
loop(j, n) {
if (j!=i) {
r2 = rrdot = 0;
loop(k, NDIM) {
∗
We used version 2.95 on a Sun running Solaris 2.7, for example.
336 Appendix A
assert(argc==4);
n = atoi(argv[1]);
noutp = atoi(argv[2]);
dt = atof(argv[3])/(real)noutp;
t = 0;
initialise(n, x, xdot, f, fdot, step, tlast, m);
nsteps=0;
loop(j, noutp) {
nsteps += hermite(x, xdot, f, fdot, step, tlast, m, &t,
t+dt, n);
runtime output(x,t,n);
}
final output(t,cpu,nsteps,noutp);
}
Problems
1) Given acceleration a and its derivative at the beginning and end
(subscripts 0 and 1, respectively) of a time step of length h, obtain
the approximate formulae
1 4 (2) 1 5 (3)
x1 = x01 + h a0 + h a0
24 120
1 1
v1 = v10 + h3 a0 + h4 a0 .)
(2) (3)
6 24
2) Convert the code in Binney & Tremaine 1987 to the Hermite integra-
tor.
340 Appendix A
B.1 Part I
Chapter 1
Chapter 2
1) See Chapter 3.
2) For the radius of a stellar system, see Eq.(9.2), and footnote.
3) Choose an arbitrary centre O, and apply Newton’s Theorems to shells
with centre O.
4) r̈ = −(4π/3)Gρr, which is linear in r, and so collapse is homologous.
Solve the equation of motion at the edge of the sphere (r = a) using
formulae of Box 7.2, in case e = 1. Fig.1: π(6/5)3/2.
341
342 Appendix B
Chapter 3
Chapter 4
Chapter 5
Chapter 6
Chapter 7
Chapter 8
Chapter 9
R
1) T =R 6π pr2 dr. Integrate by parts, and use W =
−G
P P
M (r)dM (r)/r. P P
2) i P ri .(ri − rj )/|ri − rj |3 = j i6=j rj .(rj − ri )/|rj − ri |3 =
j6=iP
(1/2) i j6=i (ri − rj ).(ri − rj )/|ri − rj |3 .
4) Answers in Table 8.1.
Hints to Solution of Problems 345
5) (a)
Z Z
∂f ∂
m(v − hvi)2 d3 v = f m(v − hvi)2 d3 v+
∂t ∂t Z
+2 ˙ 3v
f m(v − hvi).hvid
∂
= 3ρσ2 + 0.
∂t
Similarly the second term gives
∇ · ρh(v − hvi)(v − hvi)2 i + 3ρσ2 hvi + 2ρσ2 ∇ · hvi
Chapter 10
Chapter 11
1) For a power law with extreme masses mmin and mmax M is pro-
max − mmin if α 6= 2. The quoted approximation to
2−α
portional to m2−α
the Miller-Scalo mass function (see caption to Fig.1) implies that the
fraction of stars with mass less than m is F , where
0.19F
m= .
(1 − F )0.75 + 0.032(1 − F )0.25
Show that Ṁ = mṁdF/dm, where the right side is evaluated at the
turnoff mass.
2) Set c2 = Gm0 a, r ' a in the right side of the equation of motion.
(∂E/∂t)s
3) (∂s/∂t)E = − .
(∂E/∂s)t
5) f ' constant, and so ρ ∝ (−φ)3/2 . Cf. Box 8.1.
346 Appendix B
Chapter 12
√
1) In Eq.(3) ω ∼ κ ∼ GρG (cf. Eq.(7.5)).
3) Consider the acceleration (due to the sun) on (i) the sea on the sunlit
surface, (ii) the solid earth, (iii) the sea on the unlit surface.
4) Replace t in the trigonometric terms by t − t0 . Then
ẋ = −a1 κ sin[κ(t−t0 )] if ȧ0 + ȧ1 cos[κ(t−t0 )]+κa1 ṫ0 sin[κ(t−t0 )] = 0.
Three more conditions come from the corresponding treatment of y
and the first two equations of motion (1). The averaging has to be
done numerically. See Ross et al. (1997).
5) See Fukushige & Heggie (2000).
Chapter 13
1) See any dynamics book (e.g. Goldstein 1980) for the transformation
to rotating coordinates.
2) Nearest neighbour distance is of order n−1/3 .
3) (i) Rather than looking up general formulae for geodesics in arbi-
trary metrics, it is quicker to derive the equation with bare hands,
starting from the Euler-Lagrange equation. One obtains
0
0 1 ∂V
f xi =− ,
2f ∂xi
q
where f = (E − V )/(xi0 xi0 ) and 0 = d/ds. (ii) The space is confor-
mally flat, and the expressions for Christoffel symbols and the Ricci
tensor take fairly simple forms. Cf. Synge (1964), though he considers
only the case of a four-dimensional space-time. For the last part con-
sider the two-dimensional two-body problem. (iii) The induced metric
is (ρ02 + z 02 )dr2 + ρ2 dθ 2 . A global solution seems not to be possible if
E < 0.
Chapter 14
Chapter 15
Chapter 16
Chapter 17
Chapter 18
Chapter 19
Chapter 20
1) Kepler’s 3rd Law (cf. Petrosky 1986, who calls the resulting map the
“Kepler” map). Area-preservation: compute the Jacobian. If = −0.1
and t0 = 0, there is interesting structure near E0 = 0.1, 0.24, 0.2344
and 0.23415. (The number of decimal places indicates the care with
which you have to look.) These structures are easier to see if the log
of the escape time is plotted against E0 .
2) Drop a particle parabolically into a Sitnikov machine, and imagine
finding a value for the initial phase at which the particle escapes hy-
perbolically after a number of interactions. Now vary the initial phase.
3) Adjust the phase of each visit to the binary so that the particle
more and more nearly escapes parabolically each time. One gets an
impression of what is needed from the case z(0) = 0, ż(0) = 2.81989
to t ' 1000, if the binary has e = 0.6, P = 2π and a = 1.
4) The equation u00 + (1/3)u0 − (2/9)u + 2u(u2 + 34/3 /4)3/2 = 0, where u
is the scaled z, has equilibria at u = 0 and u = ±37/6 /2. As the energy
decreases the oscillator either escapes or is trapped. In the latter case
the amplitude of the z motion increases without limit in general.
Chapter 21
Z
G2 m3 n a dp
1) ε̇ ∼ , using notation of Eq.(4), Box 14.1, and
V Gm/V 2 p
Eq.(5).
3) Compute Ṅb R3 tr .
350 Appendix B
Chapter 22
Chapter 23
Chapter 24
Chapter 25
1) The relative motion of the moon and earth is governed by the differ-
ence in their acceleration.
2) Put ei (0) = 0 in Eqs.(4) and (5), which imply cos2 i = cos2 i(0)/(1 −
e2i ) and e2 (2 − 5 sin2 i sin2 ω) = 0. Hence e = 0 unless sin2 i ≥ 2/5.
3) z = (a[1 − e cos E] sin ω + b sin E cos ω) sin i; hf (E)i =
Z
1 2π
f (E)(1−e cos E)dE.
2π 0
Chapter 26
2) In the second case, one component of the wider binary must interact
closely with the harder pair.
3) In the input string, M is the projectile mass, v is the initial relative
velocity of the two binaries, and a and e refer to the projectile (p)
and target (t), respectively. The output shows how the particles are
grouped, along with position and velocity components, masses and
energy control.
Chapter 27
1) Use Eq.(3) to obtain the estimate of Ṁ . See Problem 8.1, for the
mass-radius relation in an isothermal cluster.
2) Unless ρ and σ decrease very slowly with increasing r, the third
equation shows that L → 0 as r → ∞, which is inconsistent with the
fourth equation. Relaxation is too slow in the outer halo of a cluster
to remove all the energy generated.
Chapter 28
1) Use the fact that trh ∝ (N rh3 )1/2 (Eq.14.9). Neglecting variation of
the Coulomb logarithm, the unreduced result is approximately
k
t 10.8 N (0)
' + 18,
trh (0) 0.12k 2
where k = 3/0.24 − 1/2.
2) Good values are c = (a) 10−3 , (b) 10−5 , (c) 10−7 and (d) 10−8 .
352 Appendix B
Chapter 29
1) Use Eq.(8.9). Equate the estimate to the heat flux at rh , and use a
virial estimate of σc , assuming the cluster is nearly isothermal inside
rh .
2) For the first step use an estimate like Eq.(2), Box 12.2, and virial
estimates.
3) If the initial total mass is large enough, rh begins by decreasing with
M.
Chapter 30
Chapter 31
Chapter 32
Chapter 33
Appendix A
(2) (3)
1) Use a1 = a0 + hȧ0 + h2 a0 /2 + h3 a0 /6 and a similar expression for
ȧ1 .
5) The command
mksphere -n 25|flat hermite -D 0.1 -t 1|xstarplot
almost does this, but initial velocities are uniform in a cube instead of
in a sphere.
References
Aarseth S.J. (1985). Direct methods for N -body simulations, in Multiple Time
Scales, ed. Brackbill J.U. & Cohen B.I. (Academic Press, New York).
Aarseth S.J. (1999). From NBODY1 to NBODY6: the growth of an industry,
PASP 111, 1333.
Aarseth S.J., Hénon M., Wielen R. (1974). A comparison of numerical methods
for the study of star cluster dynamics, A&A 37, 183.
Aarseth, S.J., Lin, D.N.C. and Papaloizou, J.C.B. (1988). On the collapse and
violent relaxation of protoglobular clusters, ApJ 324, 288-310.
Aarseth S.J., Lecar M. (1975). Computer simulations of stellar systems,
ARA&A 13, 1.
Aarseth S.J., Mardling R.A. (2001). Evolution of binary and multiple star
systems, in “Eggletonfest”, a Meeting in Celebration of Peter Eggleton’s 60th
Birthday, eds. P. Podsiadlowski, S. Rappaport, A.R. King, F. D’Antona, L.
Burder, ASP Conf. Ser. 229 (ASP: San Francisco)
Aarseth S.J., Zare K. (1974). A regularization of the three-body problem, Celes.
Mech. 10, 185.
Abraham R., Marsden J.E., (1978). Foundations of Mechanics, 2nd ed.
(Benjamin/Cummings, Reading)
Agekyan T.A., Anosova Z.P., Orlov, V.V. (1983). Decay time of triple systems,
Astrophysics 19, 66.
Aguilar L., Hut P., Ostriker, J.P. (1988). On the evolution of globular cluster
systems. I - Present characteristics and rate of destruction in our Galaxy,
ApJ 335, 720.
Ahmad A., Cohen L. (1973). A numerical integration scheme for the N-body
gravitational problem, J. Comp. Phys. 12, 389.
Airy G.B. (1884). Gravitation : an elementary explanation of the principal
perturbations in the solar system, Macmillan, London
Alekseev V.M. (1981). Final motions in the three-body problem and symbolic
dynamics, Russ. Math. Surv. 36, 181.
354
References 355
Allen, F.S. and Heggie D.C. (1992). A model gravothermal oscillator, MNRAS
257, 245-56.
Ambartsumian, V.A. (1938). On the dynamics of open clusters, Uch. Zap.
L.G.U., No.22, p.19; transl. in Dynamics of Star Clusters, eds. J. Goodman
and P. Hut (D. Reidel, Dordrecht).
Antonov, V.A. (1962). (Most probable phase distribution in spherical star
systems and conditions for its existence), Vest. Leningrad Univ. 7, 135;
translated in Dynamics of Star Clusters, eds Goodman J., Hut P., (D. Reidel,
Dordrecht)
Antonov V.A. (1973). (On the instability of stationary spherical models with
purely radial motion), in T.B. Omarov, ed, The Dynamics of Galaxies and
Star Clusters, (Nauka Kazakh S.S.R., Alma Ata); transl. in P.T. de Zeeuw,
ed, Structure and Dynamics of Elliptical Galaxies, IAU Symp. 127 (Reidel,
Dordrecht)
Applegate J.H., Douglas M.R., Gürsel Y., Hunter P., Seitz C.L., Sussman G.J.
(1985). A digital orrery, IEEE Trans. Comp. C-34, 822.
Arabadjis J.S., Richstone D.O. (1998). The dynamical evolution of dense
rotating systems: paper I. Two-body relaxation effects, astro-ph/9810192
Arnold, V.I. (1978a). Mathematical Methods of Classical Mechanics
(Springer-Verlag, New York)
Arnold, V.I. (1978b). Ordinary Differential Equations (MIT Press, Cambridge)
Arnold V.I. (1983) Geometrical Methods in the Theory of Ordinary Differential
Equations (Springer, Berlin)
Arnold V.I., Kozlov V.V., Neishtadt A.I. (1997) Mathematical Aspects of
Classical and Celestial Mechanics, 2nd ed (Springer-Verlag, Berlin)
Ashman K.M., Zepf S.E. (1998). Globular cluster systems (Cambridge
University Press, Cambridge)
Bahcall J.N., Hut P., Tremaine S. (1985). Maximum mass of objects that
constitute unseen disk material, ApJ 290, 15.
Bahcall J.N., Wolf R.A. (1976). Star distribution around a massive black hole
in a globular cluster, ApJ 209, 214.
Bailey M.E., Chambers J.E., Hahn G. (1992). Origin of sungrazers - a frequent
cometary end-state, A&A 257, 315.
Bailey V.C., Davies M.B. (1999). Red giant collisions in the Galactic Centre,
MNRAS 308, 257.
Bailyn C.D. (1996). X-ray binaries in globular clusters, in The Origins,
Evolutions, and Destinies of Binary Stars in Clusters, ed. E.F. Milone and
J.-C. Mermilliod (ASP (Conf. Ser. 90), San Francisco).
Barnes J.E. (1985). Dynamical instabilities in spherical stellar systems, in
Dynamics of Star Clusters (IAU Symp. 113), ed. Goodman J., Hut P.
(Reidel, Dordrecht).
Barnes & Hut (1986). A hierarchical O(N log N ) force-calculation algorithm,
Nature 324, 446.
356 References
Chernoff D.F., Shapiro S.L. (1987). Globular cluster evolution in the Galaxy -
a global view, ApJ 322, 113.
Chernoff, D.F. and Weinberg, M.D. (1990). Evolution of globular clusters in
the galaxy, ApJ 351, 121.
Cipriani P., Pettini M. (2001). Strong chaos in N -body problem and
microcanonical thermodynamics of collisionless self-gravitating systems,
astro-ph/0102143
Clarke C.J., Pringle J.E. (1991). The role of discs in the formation of binary
and multiple star systems, MNRAS 249, 588.
Clayton D.D. (1983). Principles of Stellar Evolution and Nucleosynthesis,
(University of Chicago Press, Chicago)
Cohen R.S., Spitzer L., Jr., Routly P.McR. (1950). The electrical conductivity
of an ionized gas, Phys. Rev. 80, 230.
Cohn H. (1979). Numerical integration of the Fokker-Planck equation and the
evolution of star clusters, ApJ 234, 1036.
Cohn H.N. (1985). Direct Fokker-Planck calculations, in Dynamics of star
clusters, IAU Symp. 113, ed. Goodman J., Hut P. (Reidel, Dordrecht).
Cohn, H., Hut, P. and Wise, M. (1989). Gravothermal oscillations after core
collapse in globular cluster evolution, ApJ 342, 814-22.
Combes F., Leon S., Meylan G. (1999). N -body simulations of globular cluster
tides, A&A 352, 149.
Côté P., Fischer P. (1996). Spectroscopic binaries in globular clusters. I. A
search for ultra-hard binaries on the main sequence in M4, AJ 112, 565.
Côté P., Pryor C., McClure R.D., Fletcher J.M., Hesser J.E. (1996).
Spectroscopic binaries in globular clusters. II. A search for long-period
binaries in M22, AJ 112, 574.
Cuperman S., Harten A. (1972). The evolution of a multi-phase space density
collisionless one-dimensional stellar system, A&A 16, 13.
Cvitanović P. (1989). Universality in Chaos, 2nd edition (Adam Hilger, Bristol)
Danilov V.M. (1997). Numerical experiments simulating the dynamics of open
clusters in the Galactic field, Astron. Rep. 41, 163.
David M., Theuns T. (1989). Numerical experiments on radial virial oscillations
in N -body systems, MNRAS 240, 957.
Davies M.B., Benz W. (1995). A stellar audit: the computation of encounter
rates for 47 Tucanae and ω Centauri, MNRAS 276, 876.
Davoust E., Prugniel P. (1990). On the flattening of globular clusters, A&A
230, 67.
Dejonghe H. (1984). The construction of analytical models for globular clusters,
A&A 133, 225.
Dejonghe H. (1987a). A completely analytical family of anisotropic Plummer
models, MNRAS 224, 13.
Dejonghe, H. (1987b). On entropy and stellar systems, ApJ 320, 477-81.
Dejonghe H., Hut P. (1986). Round-off sensitivity in the N -body problem, in
The Use of Supercomputers in Stellar Dynamics, ed. Hut P., McMillan S.
(Lect. Notes Phys., 267, Springer-Verlag, Berlin).
358 References
Farouki R.T., Salpeter E.E. (1994). Mass segregation, relaxation, and the
Couloumb logarithm in N -body systems (again), ApJ 427, 676.
Ferraro F.R., Fusi Pecci F., Cacciari C., Corsi C., Buonanno R., Fahlman G.G.,
Richer H.B. (1993). Blue stragglers in the Galactic globular clusters M3:
evidence for two populations, AJ 106, 2324.
Feynman R.P. (1965). Lectures on Physics, Vol.III. (Addison-Wesley, Reading)
Chapter 6
Figer D.F., Kim S.S., Morris M., Serabyn E., Rich R.M., McLean I.S. (1999).
Hubble Space Telescope/NICMOS Observations of Massive Stellar Clusters
near the Galactic Center, ApJ 525, 750.
Fischer P., Welch D.L., Cote P., Mateo M., Madore B.F. (1992). Dynamics of
the young LMC cluster NGC 1866, AJ 103, 857.
Fischer P., Pryor C., Murray S., Mateo M., Richtler T. (1998). Mass segregation
in young Large Magellanic Cloud clusters. I - NGC 2157, AJ 115, 592.
Fridman A.M., Polyachenko V.L. (1984). Physics of Gravitating Systems,
(Springer Verlag, Berlin, 2 Vols)
Gebhardt K., Fischer P. (1995). Nonparametric dynamical analysis of globular
clusters: M15, 47 TUC, NGC 362, and NGC 3201, AJ 109, 209.
Freitag M., Benz W. (2001). A Monte Carlo code to investigate stellar
collisions in dense galactic nuclei, in Stellar Collisions, Mergers and their
Consequences, ed. Shara M., ASP Conf. Ser. 263 (ASP, San Francisco).
Fukushige T. (1995). Gravitational scattering experiments in infinite
homogeneous N -body systems, in Chaos in Gravitational N -body Systems,
eds. Muzzio J.C., Ferraz-Mello S., Henrard J. (Kluwer, Dordrecht)
Fukushige T., Heggie D.C. (2000). The time-scale of escape from star clusters,
MNRAS 318, 753.
Fukushige T., Hut P., Makino J. (1999). High-performance special-purpose
computers in science, IEEE Computing in Science and Engineering, 1, No. 2,
12
Fullerton L.W., Hills J.G. (1982). Computer simulations of close encounters
between binary and single stars - the effect of the impact velocity and the
stellar masses, AJ 87, 175.
Funato Y., Hut P., McMillan S., Makino J. (1996). Time-symmetrized
Kustaanheimo-Stiefel regularization, AJ 112, 1697.
Gao B., Goodman J., Cohn H., Murphy B. (1991). Fokker-Planck calculations
of star clusters with primordial binaries, ApJ 370, 567.
Garabedian P.R. (1986). Partial Differential Equations, 2nd edition (Chelsea,
New York)
Ghez A.M., Morris M., Becklin E.E., Tanner A., Kremenek T. (2000). The
accelerations of stars orbiting the Milky Way’s central black hole, Nature
407, 349.
Giersz M. (1998). Monte Carlo simulations of star clusters - I. First Results,
MNRAS 298, 1239.
Giersz M., Heggie D.C. (1994a). Statistics of N -body simulations – I. Equal
masses before core collapse, MNRAS 268, 257.
360 References
Giersz M., Heggie D.C. (1994b). Statistics of N -body simulations – II. Equal
masses after core collapse, MNRAS 270, 298.
Giersz M., Heggie D.C. (1996). Statistics of N -body simulations – III. Unequal
masses, MNRAS 279, 1037.
Giersz M., Heggie D.C. (1997). Statistics of N -body simulations – IV. Unequal
masses with a tidal field, MNRAS 286, 709-31.
Giersz M., Spurzem R. (1994). Comparing direct N -body integration with
anisotropic gaseous models of star clusters, MNRAS 269, 241.
Giersz M., Spurzem R. (2000). A stochastic Monte Carlo approach to model
real star cluster evolution - II. Self-consistent models and primordial binaries,
MNRAS 317, 581.
Gilliland R.L., Brown T.M., Guhathakurta P., Sarajedini A., Milone E.F.,
Albrow M.D., Baliber N.R., Bruntt H., Burrows A., Charbonneau D., Choi
P., Cochran W.D., Edmonds P.D., Frandsen S., Howell J.H., Lin D.N.C.,
Marcy G.W., Mayor M., Naef D., Sigurdsson S., Stagg C.R., Vandenberg
D.A., Vogt S.S., Williams M.D. (2000). A lack of planets in 47 Tucanae from
a Hubble Space Telescope search, ApJ 545L, 47.
Gnedin O.Y., Lee H.M., Ostriker J.P. (1999). Effects of tidal shocks on the
evolution of globular clusters, ApJ 522, 935.
Gnedin O.Y., Ostriker J.P. (1997). Destruction of the Galactic globular cluster
system, ApJ 474, 223.
Gnedin O.Y., Ostriker J.P. (1999). On the self-consistent response of stellar
systems to gravitational shocks, ApJ 513, 626.
Goldstein, H. (1980). Classical Mechanics, 2nd edition (Addison Wesley,
Reading, MA)
Goodman, J. (1983). Core collapse with strong encounters, ApJ 270, 700-10.
Goodman J. (1984). Homologous evolution of stellar systems after core collapse,
ApJ 280, 298.
Goodman J. (1987). On gravothermal oscillations, ApJ 313, 576.
Goodman J. (1989). Late evolution of globular clusters, in Dynamics of Dense
Stellar Systems, ed. Merritt D. (Cambridge University Press, Cambridge).
Goodman J., Heggie D.C., Hut P. (1993). On the exponential instability of
N -body systems, ApJ 415, 715.
Goodman J., Hernquist L. (1991). Hydrodynamics of collisions between binary
stars, ApJ 378, 637.
Goodman J., Hut P., eds (1985). Dynamics of Star Clusters, IAU Symp. 113
(Reidel, Dordrecht)
Goodman J., Hut P. (1989). Primordial binaries & globular cluster evolution,
Nature, 339, 40
Goodman J., Hut P. (1993). Binary-single-star scattering. V - Steady state
binary distribution in a homogeneous static background of single stars, ApJ
403, 271.
Greengard L. (1990). The numerical solution of the N -body problem,
Computers in Phys. Mar/Apr, 142-52
References 361
Grillmair C.J., Freeman K.C., Irwin M., Quinn P.J. (1995). Globular clusters
with tidal tails: deep two-color star counts, AJ 109, 2553.
Grindlay J.E. (1988). X-ray binaries in globular clusters, in The Harlow-Shapley
Symposium on Globular Cluster Systems in Galaxies (Proc. IAU Symp. 126),
ed. Grindlay J.E., Philip A.G.D. (Kluwer, Dordrecth).
Guhathakurta P., Yanny B., Schneider D.P., Bahcall J.N. (1996). Globular
cluster photometry with the Hubble Space Telescope. V. WFPC study of
M15’s central density cusp, AJ 111, 267.
Gunn, J.E. and Griffin, R.F. (1979). Dynamical studies of globular clusters
based on photoelectric radial velocities of individual stars. I - M3, AJ 84,
752-73.
Gurzadyan V.G., Savvidi G.K. (1986). Collective relaxation of stellar systems,
A&A 160, 203.
Hachisu I., Nakada Y., Nomoto K., Sugimoto, D. (1978). Gravothermal
catastrophe of finite amplitude, Prog. Theor. Phys. 60, 393.
Hamilton W.R. (1853). Lectures on quaternions (Hodges and Smith, Dublin)
Hansen B.M.S. (1999). Cooling models for old white dwarfs, ApJ 520, 680.
Harris H.C., McClure R.D. (1983). Radial velocities of a random sample of K
giant stars and implications concerning multiplicity among giant stars in
clusters, ApJ 265L, 77.
Harrington R.S. (1970). Encounter phenomena in triple stars, AJ 75, 1140.
Harris, W.E. (1996). A catalog of parameters for globular clusters in the Milky
Way, AJ 112, 1487.
Hayli A. (1970). More comparisons of numerical integrations in the gravitational
N -body problem, A&A 7, 249.
Heggie D.C. (1975). Binary evolution in stellar dynamics, MNRAS 173, 729.
Heggie D.C. (1984). Post-collapse evolution of a gaseous cluster model, MNRAS
206, 179.
Heggie D.C. (1991). Chaos in the N -body problem of stellar dynamics, in
Predictability, Stability, and Chaos in N -Body Dynamical Systems, ed. Roy
A.E. (Plenum Press, New York).
Heggie D.C. (2000). A new outcome of binary-binary scattering, MNRAS 318,
61.
Heggie D.C., Aarseth S.J. (1992). Dynamical effects of primordial binaries in
star clusters. I - Equal masses, MNRAS 257, 513.
Heggie D.C., Hut P. (1993). Binary-single-star scattering. IV - Analytic
approximations and fitting formulae for cross sections and reaction rates,
ApJS 85, 347.
Heggie D.C., Inagaki S., McMillan S.L.W. (1994). Gravothermal expansion in
an N -body system, MNRAS 271, 706.
Heggie, D.C. and Ramamani, N. (1989). Evolution of star clusters after core
collapse, MNRAS 237, 757.
Heggie, D.C. and Ramamani, N. (1995). Approximate self-consistent models for
tidally truncated star clusters, MNRAS 272, 317-22.
362 References
Heggie D.C., Sweatman W.L. (1991). Three-body scattering near triple collision
or expansion, MNRAS 250, 555.
Hénon M. (1958). Un calcul amélioré des perturbations des vitesses stellaires,
Ann. d’Astr. 21, 186.
Hénon M. (1960). L’evasion des étoiles hors des amas isolés , Ann.Astr. 23,
668.
Hénon M. (1961). Sur l’evolution dynamique des amas globulaires, Ann. d’Astr.
24, 369.
Hénon M. (1965). Sur l’evolution dynamique des amas globulaires II. – Amas
isolé, Ann. d’Astr. 28, 62.
Hénon M. (1969). Rates of escape from isolated clusters with an arbitrary mass
distribution, A&A 2, 151.
Hénon M. (1971). The Monte Carlo method, ApSpSci 14, 151.
Hénon, M. (1975). Two recent developments concerning the Monte Carlo
method, in Dynamics of Stellar Systems, ed. Hayli A. (Proc. IAU Coll. 69,
D. Reidel, Dordrecht).
Hénon M. (1976). A complete family of periodic solutions of the planar
three-body problem, and their stability, Cel.Mech. 13, 267.
Hénon M. (1982). Vlasov equation, A&A 114, 211.
Hénon M. (1988). Chaotic scattering modelled by an inclined billiard, Physica
D 33, 132-156.
Hénon M., Heiles C. (1964). The applicability of the third integral of motion:
Some numerical experiments, AJ 69, 73.
Herman J. (1710). Metodo d’investigare l’Orbite de’Pianeti, nell’ ipotesi che le
forze centrali o pure le gravita degli stessi Pianeti sono in regione reciproca
de’ quadrati delle distanze, che i medesimi tengono dal Centro, a cui si
dirigono le forze stesse, Giorn. Lett. d’Italia 2, 447.
Hernquist L., Hut P., Kormendy J. (1991). A post-collapse model for the
nucleus of M33, Nature, 354, 376
Hernquist L., Hut P., Makino J. (1993). Discreteness noise versus force errors
in N -body simulations, ApJ 402, L85.
Hill G.W. (1878). Researches in the lunar theory, Am.J.Math. 1, 5, 129 and
245.
Hills J.G. (1975). Encounters between binary and single stars and their effect
on the dynamical evolution of stellar systems, AJ 80, 809.
Hills, J.G. (1980). The effect of mass loss on the dynamical evolution of a stellar
system: analytic approximations, ApJ 235, 986-91.
Hills J.G. (1983). The effects of sudden mass loss and a random kick velocity
produced in a supernova explosion on the dynamics of a binary star of
arbitrary orbital eccentricity - applications to X-ray binaries and to the
binary pulsars, ApJ 267, 322.
Hills J.G. (1990). Encounters between single and binary stars - The effect of
intruder mass on the maximum impact velocity for which the mean change
in binding energy is positive, AJ 99, 979.
References 363
Hills J.G., Day C.A. (1976). Stellar collisions in globular clusters, Astrophys.
Lett. 17, 87.
Holmberg E. (1941). On the clustering tendencies among the nebulae. II. A
study of encounters between laboratory models of stellar systems by a new
integration procedure, ApJ 94, 385.
Horwitz G., Katz J. (1978). Steepest descent technique and stellar equilibrium
statistical mechanics. III Stability of various ensembles, ApJ 222, 941.
Hurley J., Pols O.R., Tout C.A. (2000). Comprehensive analytic formulae for
stellar evolution as a function of mass and metallicity, MNRAS 315, 543.
Hurley J.R., Tout C.A., Aarseth S.J., Pols O. (2001). Direct N -body modelling
of stellar populations: blue stragglers in M67, MNRAS 323, 630.
Hut P. (1983a). Binary-single star scattering. II - Analytic approximations for
high velocity, ApJ 268, 342.
Hut P. (1983b). The topology of three-body scattering, AJ 88, 1549.
Hut P. (1985). Binary formation and interactions with field stars, in Dynamics
of Star Clusters (Proc. IAU Symp. 113), ed. Hut P., Goodman J. (Reidel,
Dordrecht).
Hut P. (1989). Fundamental timescales in star cluster evolution, in Dynamics
of Dense Stellar Systems, ed. D. Merritt (Cambridge University Press), p.229
Hut P. (1996). Binary formation and interactions with field stars, in Dynamical
Evolution of Star Clusters, IAU Symp. 174, ed. Hut P., Makino J. (Kluwer,
Dordrecht).
Hut P. (1997). Gravitational thermodynamics, Complexity, 3, 38
Hut P., Bahcall J.N. (1983). Binary-single star scattering. I - Numerical
experiments for equal masses, ApJ 268, 319.
Hut P., Djorgovski S. (1992). Rates of collapse and evaporation of globular
clusters, Nature 359, 806.
Hut P., Inagaki S. (1985). Globular cluster evolution with finite-size stars -
Cross sections and reaction rates, ApJ 298, 502.
Hut P., McMillan S., Goodman J., Mateo M., Phinney E.S., Pryor C., Richer
H.B., Verbunt F., Weinberg M. (1992). Binaries in globular clusters, PASP
104, 981.
Hut P., Makino J., eds (1995). Dynamical Evolution of Star Clusters –
Confrontation of Theory and Observations, IAU Symp. 174 (Kluwer,
Dordrecht)
Hut P., Makino J. (1999). Astrophysics on the GRAPE family of special
purpose computers, Science 283, 501.
Hut P., Makino J., McMillan S. (1988). Modelling the evolution of globular
clusters, Nature 336, 31.
Hut P., Makino J., McMillan S. (1995). Building a better leapfrog, ApJ 443L,
93.
Hut P., McMillan S., Romani R.W. (1992). The evolution of a primordial
binary population in a globular cluster, ApJ 389, 527.
Hut P., Paczynski B. (1984). Effects of encounters with field stars on the
evolution of low mass semidetached binaries, ApJ 284, 675.
364 References
Hut P., Verbunt F. (1983). White dwarfs and neutron stars in globular cluster
X-ray sources, Nature 301, 587.
Hut, P. and Verhulst, F. (1981). Explosive mass loss in binary stars - the two
time-scale method, A&A 101, 134-7.
Iben I., Jr. (1985). The life and times of an intermediate mass star - In
isolation/in a close binary, QJRAS 26, 1.
Iben I., Jr., Tutukov A.V. (1997). The lives of stars: from birth to death and
beyond (Part 1), S&T 94, 36.
Iben I., Jr., Tutukov A.V. (1998). How close is close?, S&T 95, 47.
Illingworth G., King I.R. (1977). Dynamical models for M15 without a black
hole, ApJ 218L, 109.
Inagaki S. (1980). The gravothermal catastrophe of stellar systems, PASJ 32,
213.
Inagaki S. (1984). The effects of binaries on the evolution of globular clusters,
MNRAS 206, 149.
Inagaki S., Konishi T. (1993). Dynamical stability of a simple model similar to
self-gravitating systems, PASJ 45, 733.
Inagaki S., Wiyanto P. (1984). On equipartition of kinetic energies in
two-component star clusters, PASJ 36, 391.
Ipser J.R., Semenzato R. (1983). On the effects of strong encounters in stellar
systems. I - A basis for treating anisotropic systems, ApJ 271, 294.
Janes K., ed (1991) The Formation and Evolution of Star Clusters, ASP Conf.
Ser. 13 (ASP, San Francisco)
Jeans J.H. (1902). The stability of a spherical nebula, Phil. Trans. R. Soc. A
199, 49.
Jeans, J.H. (1929). Astronomy and Cosmogony (second edition, Cambridge
University Press, Cambridge); also Dover, New York, 1961
Jeffreys, H. (1929). The Earth, 2nd edition (Cambridge University Press,
Cambridge)
Jernigan J.G., Porter D.H. (1989). A tree code with logarithmic reduction of
force terms, hierarchical regularization of all variables, and explicit accuracy
controls, ApJS 71, 871.
Jha S., Torres G., Stefanik R.P., Latham D., Mazeh T. (2000). Studies
of multiple stellar systems – III. Modulations of orbital elements in the
triple-lined system HD 109648, MNRAS 317, 375.
Johnston H.M., Verbunt F. (1996). The globular cluster population of
low-luminosity X-ray sources, A&A 312, 80.
Johnston K.V., Sigurdsson S., Hernquist L. (1999). Measuring mass-loss rates
from Galactic satellites, MNRAS 302, 771.
Joshi K.J., Rasio F.A., Portegies Zwart S. (2000). Monte Carlo simulations of
globular cluster evolution. I. Method and test calculations, ApJ 540, 969.
Kaliberda V.S., Petrovskaya I.V. (1970). The velocity variation of a star as a
purely discontinuous random process. I. Zero mass stars, Astrofiz. 6, 135.
Kalogera V. (1996). Orbital characteristics of binary systems after asymmetric
supernova explosions, ApJ 471, 352.
References 365
Kaluzny J., Kubiak M., Szymanski M., Udalski A., Krzeminski W., Mateo M.,
Stanek K.Z. (1998). The optical gravitational lensing experiment. Variable
stars in globular clusters. IV. Fields 104A-E in 47 Tucanae, A&AS 128, 19.
Kandrup H.E., Sideris I.V. (2001). Chaos and the continuum limit in the
gravitational N -body problem. I. Integrable potentials, Phys. Rev. E 64,
056209-1.
Katz J. (1978). On the number of unstable modes of an equilibrium, MNRAS
183, 765.
Katz J.I. (1975). Two kinds of stellar collapse, Nature 253, 698.
Katz J., Lynden-Bell D. (1978). The gravothermal instability in two dimensions,
MNRAS 184, 709.
Kawai A., Fukushige T., Makino J., Taiji M. (2000). GRAPE-5: a
special-purpose computer for N -body simulations, PASJ 52, 659.
Keenan D.W., Innanen K.A. (1975). Numerical investigation of galactic tidal
effects on spherical stellar systems, AJ 80, 290.
Kepler, J. (1609). Astronomia Nova (Prague); Johannes Kepler Gesammelte
Werke, Vol.3, 2nd ed (Munich), 1990; english translation: New Astronomy,
trans. William H. Donohue (Cambridge: Cambridge University Press, 1992)
Kim E., Einsel C., Lee H.-M., Spurzem R., Lee M.G. (2002), Dynamical
evolution of rotating stellar systems – II. Post-collapse, equal mass system,
MNRAS, in press
Kim S.S., Figer D.F., Lee H.M., Morris M. (2000). N -body simulations of
compact young clusters near the Galactic Center, ApJ 545, 301.
King I. (1959). The escape of stars from clusters IV. The retardation of escaping
stars, AJ 64, 351.
King, I.R. (1962). The structure of star clusters. I. An empirical density law,
AJ 67, 471-85.
King, I.R. (1966). The structure of star clusters. III. Some simple dynamical
models, AJ 71, 64-75.
King I.R., Sosin C., Cool A.M. (1995). Mass segregation in the globular cluster
NGC 6397, ApJ 452L, 33.
Kinoshita H., Yoshida H., Nakai H. (1991). Symplectic integrators and their
application to dynamical astronomy, CeMDA 50, 59.
Kiseleva L.G., Eggleton P.P., Orlov V.V. (1994). Instability of close triple
systems with coplanar initial doubly circular motion, MNRAS 270, 936.
Klessen R.S., Burkert A. (2001). The formation of stellar clusters: Gaussian
cloud conditions. II., ApJ 549, 386.
Klessen R., Kroupa P. (2001). The mean surface density of companions in a
stellar-dynamical context, A&A 372, 105.
Kochanek C.S. (1992). The dynamical evolution of tidal capture binaries, ApJ
385, 604.
Kontizas M., Hatzidimitriou D., Bellas-Velidis I., Gouliermis D., Kontizas E.,
Cannon R.D. (1998). Mass segregation in two young clusters in the Large
Magellanic Cloud: SL 666 and NGC 2098, A&A 336, 503.
366 References
Press, W.H., Teukolsky, S.A., Vetterling, W.T. and Flannery B.P. (1992).
Numerical Recipes in FORTRAN, 2nd edition (Cambridge University Press,
Cambridge)
Prialnik D. (2000). An Introduction to the Theory of Stellar Structure and
Evolution, (Cambridge University Press, Cambridge)
Pryor, C. and Meylan, G. (1993). Velocity dispersions for Galactic globular
clusters, in Structure and Dynamics of Globular Clusters, eds S. Djorgovski
and G. Meylan, (ASP, San Francisco), Conf. Ser., vol. 50, p. 357
Quinlan G., Tremaine S. (1992). On the reliability of gravitational N -body
integrations, MNRAS 259, 505.
Rasio F.R., McMillan S., Hut P. (1995). Binary-binary interactions and the
formation of the PSR B1620-26 triple system in M4, ApJL, 438, L33
Rauch, K.P. and Tremaine, S. (1996). Resonant relaxation in stellar systems,
New Astron. 1, 149–70.
Reichl L.E. (1980). A Modern Course in Statistical Physics (University of Texas
Press, Austin)
Retterer J.M. (1980). The binding-energy distribution of the binaries in a star
cluster, AJ 85, 249.
Rosenbluth M.N., MacDonald W.M., Judd D.L. (1957). Fokker-Planck equation
for an inverse-square force, Phys. Rev. 107, 1.
Ross D.J., Mennim A., Heggie D.C. (1997). Escape from a tidally limited star
cluster, MNRAS 284, 811.
Rubenstein E.P., Bailyn C.D. (1997). Hubble Space Telescope observations of
the post–core-collapse globular cluster NGC 6752. II. A large main-sequence
binary population, ApJ 474, 701.
Ruffert M. (1992). Collisions between a white dwarf and a main-sequence star.
II - Simulations using multiple-nested refined grids, A&A 265, 82.
Saari D.G. (1990). A visit to the Newtonian N -body problem via elementary
complex-variables, Amer. Math. Mon. 97, 105.
Saari, D.G., Xia, Z. (1995). Off to infinity in finite time, Not. Amer. Math. Soc.
42, 538.
Saffer R.A., ed (1993). Blue Stragglers, Proc. Stars Journal Club Miniworkshop
(STScI, Baltimore)
Salpeter, E.E. (1955). The luminosity function and stellar evolution, ApJ 121,
161-7.
Sandage A.R. (1953). The color-magnitude diagram for the globular cluster
M3, AJ 58, 61.
Saslaw W.C., De Young D.S. (1971). On equipartition in galactic nuclei and
gravitating systems, ApJ 170, 423.
Schwarzschild, B. (2000). Theorists and experimenters seek to learn why gravity
is so weak, Phys. Today, 53, 9, 22
Schwarzschild M. (1958). Structure and Evolution of the Stars (Princeton
University Press, Princeton)
Schwarzschild M., Härm R. (1962). Red giants of Population II. II, ApJ 136,
158.
372 References
Seife, C. (2000). A slow carousel ride gauges gravity’s pull, Science, 288, 944
Shu F.H. (1978). On the statistical mechanics of violent relaxation, ApJ 225,
83.
Siegel M.H., Majewski S.R., Cudworth K.M., Takamiya, M. (2001). A cluster’s
last stand: the death of Palomar 13, AJ 121, 935.
Siegel C.L., Moser J.K. (1971). Lectures on Celestial Mechanics
(Springer-Verlag, Berlin)
Sigurdsson S., Hernquist L. (1993). Primordial black holes in globular clusters,
Nature 364, 423.
Sigurdsson S., Phinney E.S. (1993). Binary–single star interactions in globular
clusters, ApJ 415, 631.
Sills A., Faber J.A., Lombardi J.C., Jr., Rasio F.A., Warren A.R. (2001).
Evolution of stellar collision products in globular clusters. II. Off-axis
collisions, ApJ 548, 323.
Simonovic N., Grujic P. (1987). Small-energy three-body systems: I threshold
laws for the Coulomb interaction, J. Phys. B 20, 3427.
Smith H., Jr. (1979). The dependence of statistical results from N -body
calculations on N , A&A 76, 192.
Sobouti Y. (1985). Linear oscillations of isotropic stellar systems. II - Radial
modes of energy-truncated models, A&A 147, 61.
Soker N. (1996). H-function evolution during violent relaxation, ApJ 457,
287-90.
Spitzer, L., Jr. (1940). The stability of isolated clusters, MNRAS 100, 396.
Spitzer L.Jr. (1969). Equipartition and the formation of compact nuclei in
spherical stellar systems, ApJ 158, L139.
Spitzer, L., Jr. (1987). Dynamical Evolution of Globular Clusters (Princeton
University Press, Princeton).
Spitzer L., Jr., Chevalier R.A. (1973). Random gravitational encounters and
the evolution of spherical systems. V. Gravitational shocks, ApJ 183, 565.
Spitzer L., Jr., Härm R. (1958). Evaporation of stars from isolated clusters,
ApJ 127, 544.
Spitzer L., Jr., Hart M.H. (1971). Random gravitational encounters and the
evolution of spherical systems. I. Method, ApJ 164, 399.
Spitzer L., Jr., Mathieu R.D. (1980). Random gravitational encounters and the
evolution of spherical systems. VIII. Clusters with an initial distribution of
binaries, ApJ 241, 618.
Spitzer L., Jr., Shapiro S.L. (1972). Random gravitational encounters and the
evolution of spherical systems. III. Halo, ApJ 172, 529.
Spitzer L., Jr., Shull J.M. (1975). Random gravitational encounters and the
evolution of spherical systems. VI. Plummer’s model, ApJ 200, 339.
Spitzer L., Jr., Thuan T.X. (1972). Random gravitational encounters and the
evolution of spherical systems. IV. Isolated systems of identical stars, ApJ
175, 31.
References 373
X, ed. Primini F.A., Harnden Jr. F.R., Payne H.E., ASP Conf. Ser. 238
(ASP, San Francisco).
Theis C., Spurzem R. (1999). On the evolution of shape in N -body simulations,
A&A 341, 361.
Thompson, W.B. (1962). An Introduction to Plasma Physics (Pergamon Press,
Oxford)
Thorsett S.E., Arzoumanian Z., Camilo F., Lyne A.G. (1999). The triple pulsar
system PSR B1620-26 in M4, ApJ 523, 763.
Timmes F.X., Woosley S.E., Weaver T.A. (1996). The neutron star and black
hole initial mass function, ApJ 457, 834.
Tolman, R.C. (1936) The Principles of Statistical Mechanics (Clarendon Press,
Oxford)
Toomre A., Toomre J. (1972). Galactic bridges and tails, ApJ 178, 623.
Tout C.A., Aarseth S.J., Pols O.R. (1997). Rapid binary star evolution for
N -body simulations and population synthesis, MNRAS 291, 732.
Tout C.A., Kembhavi A. (1993). Structural changes in stars following tidal
capture, in The Globular Cluster-Galaxy Connection, eds. G.H. Smith and
J.P. Brodie, ASP Conf. Ser. 48 (ASP, San Francisco)
Trager S.C., King I.R., Djorgovski S. (1995). Catalogue of Galactic
globular-cluster surface-brightness profiles, AJ 109, 218.
Tremaine, S. (1981). Galaxy mergers, in The Structure and Evolution of
Normal Galaxies, ed. Fall, S.M. and Lynden-Bell, D. (Cambridge University
Press, Cambridge).
Tremaine, S., Hénon M. and Lynden-Bell, D. (1986). H-functions and mixing
in violent relaxation, MNRAS 219, 285-97.
Tremaine S., Weinberg M.D. (1984). Dynamical friction in spherical systems,
MNRAS 209, 729.
Tsuchiya T., Gouda N. (2000). Relaxation and Lyapunov time scales in a
one-dimensional gravitating sheet system, Phys. Rev. E 61, 948.
van Albada, T.S. (1982). Dissipationless galaxy formation and the r1/4 law,
MNRAS 201, 939-955.
van den Bergh, S. (1980). Star clusters as touchstones for theories of galactic
evolution – a few examples, in Star Clusters, ed. Hesser J.E., IAU Symp. 85
(D. Reidel, Dordrecht)
van Leeuwen F., Le Poole R.S., Reijns R.A., Freeman K.C., de Zeeuw, P.T.
(2000). A proper motion study of the globular cluster ω Centauri, A&A 360,
472.
Verbunt F., Hut P. (1987). The globular cluster population of X-ray binaries,
in The Origin and Evolution of Neutron Stars, ed. D.J. Helfand and J.-H.
Huang, IAUS 125 (Reidel, Dordrecht).
Vesperini E. (1998). Evolution of the mass function of the Galactic globular
cluster system, MNRAS 299, 1019.
Volk O. (1975). Zur Geschichte der Himmelsmechanik: Johannes Kepler,
Leonhard Euler und die Regularisierung, Preprint No.4, Math. Inst., Univ.
Würzburg.
References 375
von Hoerner, S. (1957). Internal structure of globular clusters, ApJ 125, 451.
von Hoerner S. (2001). How it all started, in Dynamics of Star Clusters and
the Milky Way, ed. S. Deiters, B. Fuchs, A. Just, R. Spurzem and R. Wielen
(ASP (Conf. Ser. 228), San Francisco).
Vivarelli M.D. (1985). The KS-transformation in hypercomplex form and the
quantization of the negative-energy orbit manifold of the Kepler problem,
Celes. Mech. 36, 349.
Waldvogel J. (1982). Symmetric and regularized coordinates on the plane triple
collision manifold, Celes. Mech. 28, 69.
Wannier G.H. (1953). The threshold law for single ionization of atoms or ions
by electrons, Phys. Rev. 90, 817.
Webbink R.F. (1988). Kinematics of the Galactic globular cluster system, in
The Harlow-Shapley Symposium on Globular Cluster Systems in Galaxies
IAU Symp. 126 (Kluwer, Dordrecht), p. 49
Weidemann V. (2000). Revision of the initial-to-final mass relation, A&A 363,
647.
Weinberg M.D. (1994a). Weakly damped modes in star clusters and galaxies,
ApJ 421, 481.
Weinberg M.D. (1994b). Adiabatic invariants in stellar dynamics. 1: Basic
concepts, AJ 108, 1398-1402.
Whittaker, Sir E.T. (1927). A Treatise on Analytical Dynamics, 3rd edition
(Cambridge University Press, Cambridge)
Wijers R.A.M.J., van Paradijs J. (1991). An upper limit to the number of
pulsars in globular clusters, A&A 241L, 37.
Witte M.G., Savonije G.J. (1999). Tidal evolution of eccentric orbits in massive
binary systems. A study of resonance locking, A&A 350, 129.
Yoshida H. (1982). A new derivation of the Kustaanheimo-Stiefel variables,
Celes. Mech. 28, 239.
Youngkins V.P., Miller B.N. (2000). Gravitational phase transitions in a
one-dimensional spherical system, Phys. Rev. E 62, 4583.
Index
376
Index 377
300, 302, 303, 305, 307, 309, 315, tidal capture, 273, 277, 278, 300,
321, 330, See also X-ray binaries 304–306, 312
binding energy of, 195–197, 252 two-body, 277
circular, 215, 220 unresolved, 249
circularisation of, 316, 318 visual, 247
contact, 252 black holes, 4, 23, 71, 97, 111, 190,
destruction of, 227, 240, 246, 247, 191, 237, 245, 273, 274, 302, 303,
249, 251, 265, 297 348
distribution of, 198, 202, 245, 249 blow-up, 157, 164, 188
eccentricity of, 160, 215, 219, 227, blue stragglers, 12, 13, 300, 302, 304,
247, 309, 318, 320, 321 305, 310, 315–318, 320, 321
eclipsing, 247, 249, 250, 321 abundance of, 321
energy of, 223 body, iii, iv
evolution of, 11, 245, 272, 276, 284, Boltzmann equation, 92, 94, 176
291, 315–319 collisionless, 45, 46, 57, 58, 61, 74,
84, 88, 89, 93, 94, 97, 100, 110,
formation of, 49, 195, 264, 273,
114, 115, 148
275–277, 284, 291
linearised, 60
rate of, 193, 202, 217, 227, 244,
Boltzmann factor, 20, 202
252, 272
Boltzmann’s constant, 92, 240
stochastic, 244
branch point, 228
hard, 26, 193, 197, 199, 203, 215,
Bulirsch-Stoer method, 230
217–219, 223, 226, 227, 236,
Burrau, 230
239–245, 265, 270, 274, 279, 303,
310, 317, 320, 322, 350 C, 331, 335
hard/soft threshold of, 26, 197, 198, C++, 236
202, 245, 252, 270 c2 h criterion, 43
hardness of, 270, 280 calculus, vi
motion of, 240 canonical ensemble, 20, 21, 183
number of, 285 canonical equations, 231
primordial, 195, 246, 249, 264, 265, canonical momentum, 70
270, 272, 273, 277–279, 281, 283, canonical transformation, 115
284, 291, 294, 295, 298, 305, 321, canonical variables, 100
323, 324, 327 capture, 23, 193, 210–212, 214, 254,
proportion of, 264, 269, 278–280, 266, 307, 308
294, 295, 298 temporary, 309, 317
rate of hardening of, 239, 240, 275, cat map, 108
276 catalyst, 274
soft, 26, 193, 197, 198, 202, 204, catastrophe, 273
215, 218, 226, 227, 240, 251 Cayley numbers, 160
spectroscopic, 321 celestial mechanics, 1, 4, 17, 28, 43
three-body, 269, 273–275, 277–279, central force, 63, 65
281, 287, 294, 295, 327 centrifugal force, 48, 119, 120
378 Index
energy flux, 92, 93, 284, See also heat equipotential, 121, 122, 248, 314
flux escape, 10, 19, 20, 22, 23, 44, 50, 67,
energy generation, 91, 244, 264, 276, 76, 77, 87, 98, 104, 109, 113, 114,
296, 321, See also heating 119, 120, 122, 124–126, 132, 138,
binary-binary, 278, 280, 281, 298 146, 164–167, 169, 170, 190, 200,
rate of, 91, 194, 244, 272, 275, 279, 201, 203, 207–211, 214, 224, 241,
282, 298, 352 243, 245, 248, 255, 267–269, 284,
self-regulating, 285 286, 297, 298, 302, 309, 327, 329,
stochastic, 291 349
energy hypersurface, 150 of binary, 243
energy level, 202, 215 preferential, 166, 169
energy transport, 10, 94, 97, 183, See escape energy, 77, 104, 114, 122, 123,
also heat transport 242, 243
English Channel, 312 escape rate, 123, 167–169, 174, 201,
entropy, 21, 59, 92, 164, 177–179, 286, 297
181, 276 escape speed, 22, 104, 172, 198, 206,
specific, 91 207, 316
environment, 47, 304, 320 Euclidean norm, 36
galactic, 307 Euler, 160
epicycle, 67, 128 Euler angles, 234
epicyclic frequency, 67, 120 Euler method, 232, 233, 237
epicyclic motion, 121, 123 Euler parameters, 160
equations of motion, 54, 55, 342 Euler-Lagrange equation, 346
equilibrium, 17, 22, 74, 91, 98–101, Euler-Lagrange solutions, 99
103, 108, 109, 121, 138, 153, 164, evaporation, 11, 19, 298
176, 177, 179, 181, 282, 284, 287, evaporative model, 85, 86, 89, 90, 153
288, 292 evolution equation, 84, 129
collisionless, 74 evolutionary track, 325–327
dynamic, 17, 18, 23, 79, 85, 88, 89, exchange, 198, 201, 219, 226, 238,
94, 100, 105, 109, 111–114, 140, 254, 265, 267, 269, 320
147, 227 prompt, 265, 266
hydrostatic, 89 resonant, 200
isothermal, 172 exclusion principle, 104
local thermodynamic, 22 existence, 36
quasi-, 17, 89, 109, 138 exothermic reaction, 264, 265, 267,
statistical, 102, 138 274
thermal, 25, 26, 146, 176, 178, 181, exponential divergence, 15, 108, 129,
185 132, 134, 135, 137
unstable, 99, See also instability external effects, 20, 77, 87, 98, 109,
virial, 8, 83, 96, 113, 137, 142, 153, 119, 138, 302, 309
202, 245
equipartition, 50, 164, 169, 172–174, fast motion, 256
185, 186, 193, 197, 198 Feigenbaum sequence, 272
382 Index
ionisation, 198, 202, 219, 226, 236, Kepler’s 3rd Law, 218, 349
240, 266, 267 Kepler’s equation, 38, 66, 155
irrational numbers, 39 Keplerian approximation, 216
isochrone model, 79 kick, 315
isochrone potential, 68, 73 kinematic data, 82
isocline, 75 kinetic theory, v, 22, 89, 145
isolated systems, 64, 109, 111, 119, King model, 46, 74, 76–80, 83, 89,
122, 124, 165, 169, 272, 283, 286, 125, 175, 187, 298, 326
294, 298, 325 multi-mass, 77
lifetime of, 292 King’s Law, 77
isothermal distribution, 76 Kolmogorov-Feller equation, 145
isothermal equation, 75, 184 Kozai cycle, 260, 262
one- and two-dimensional, 83 Kreutz group, 258
isothermal model, 46, 74–77, 82, 176, KS map, 162, 163
177, 181–183, 186–188, 191, 351 KS regularisation, 129, 157
density in, 287, 313, 314 KS transformation, 160, 161
instability of, 295 Kulkarni, ix
singular, 76, 181 Kustaanheimo, 160
isothermal system, 178, 179, 184 Kuzmin, 149
isotropic model, 82
Lada, ix
Jacobi integral, 121, 122, 125 Lagrange, 59
Jacobi’s equation, 134 Lagrange points, 121, 123, 128, 298
Jacobian, 232, 349 Lagrangian, 64, 257
Jeans, viii, 60, 138, 349 Lagrangian shell, 171, 276
Jeans equations, 46, 89 Lagrangian time-derivative, 276
Jeans swindle, 60, 61 Lagrangian variation, 178
Jeans’ Theorem, 59, 74, 83, 103, 148 Landau, 151
jerk, 230 Large Magellanic Cloud, 7
Johnston, ix leapfrog, 161, 231–233
Legendre polynomial, 347
KAM Theorem, vii, 209 Lemaitre, 234
Kepler, 16, 154 Lenz vector, 129, 156
Kepler map, 349 Levi Civita, 43, 156, 158, 160
Kepler motion, 65, 66, 69, 105, 116, Liapounov exponent, 133
131, 139, 161, 163, 166, 253, 255, Liapounov orbit, 124, 128
257, 259, 307, 343 light, 28
Kepler orbit, 156, 158, 233 LIGO, 245
Kepler potential, 67, 68, 343 likelihood, 82
Kepler problem, 26, 45, 53, 58, 64, 73, limit cycle, 292
131, 137, 154, 155, 157, 159, 160 line of sight, 79, 341, 344
five-dimensional, 160 linear series, 164, 179, 182
Kepler’s 1st Law, 65 Liouville’s equation, 75
Index 385
Liouville’s Theorem, 57, 211, 345 mass transfer, 11, 317, 320
Littlewood, 211 Mathematica, 32
LMXB, 302, 303 Matlab, 331, 332, 335
lockstep, 342 Maupertuis’ Principle, 134
log-normal distribution, 330 Maxwell, 138
logistic map, 289 Maxwell’s equations, 57
long-range force, 3, 18, 19, 21, 96 Maxwellian, See distribution
luminosity, 14, 97, 276 function, Maxwellian
Lynden-Bell, 151, 250, 344 McGehee, 157
Lyttleton, 84 McMillan, ix, 230, 232
McVean, ix
M15, 241, 274, 306 mean field, 98
M31, 13 mean free path, 6, 93, 147, 148
M33, 14 mean motion, 66
M4, 253, 254 measure, 36, 210, 211
M67, 321 memory, 28, 30
M71, 250 Mercury, 241
magnetic fields, 4 mergers, 11, 247, 313, 316, 318, 320,
magnitude, 14 322
Makino, ix, 232 rate of, 322
manifold, 134 Messier, 330
Maple, 32, 62, 342 metric, 134, 137, 346
Marchal, 199, 368 Euclidean, 137
Markov process, 145 microcanonical ensemble, 21, 183
mass microphysics, 129
conservation of, 56 Mikkola, 234
time-dependent, 98 Mikkola & Tanikawa, method of, 233,
mass function, 7, 294, 296, 297, 302, 234
305, 308, 323, 324, 326 million-body problem, iv, v, vii, 300,
Miller-Scalo, 112, 345 324
power law, 112 Milone, ix
mass loss, 87, 112, 113, 117, 119, 125, mixing, 102, 103, 108
165, 243, 282, 303, 323–327 model, 84
impulsive, 315 models of observational data, 46, 79
internal, 111 modes, 23, 79
stellar, 12 molecular dynamics, 31
mass overflow, 304, 315, 317, 320 molecules, 6, 20, 21
mass segregation, 75, 147, 164–166, moment, 129, 139, 149
169–173, 187, 191, 242, 243, 264, moment equations, 89, 92
278, 280, 281, 295–297, 302, 308, moment method, 86, 93
310, 317, 323, 324, 326, 348 moment of inertia, 85
primordial, 173 momentum, 54
mass spectrum, 291, 303 Monte Carlo models, 86, 95
386 Index
Monte Carlo sampling, 136, 237, 245, Noether’s Theorem, 64, 257
317 noise, 82
Montgomery, 212, 213 non-dominant terms, 143, 144, 152
moon, 98, 130, 263, 311, 351 non-parametric methods, 46, 82
Moore, 212 novae, 307
Moser, 35, 204 nuclear reactions, 284
mother board, 30 numerical analysis, 25
movie, 332, 334 numerical errors, 192, 230, 232, 233,
multi-component model, 186 237
multiples, 286
multiplicity, 202 one-body problem, 130
multipole expansion, 221, 257 opacity, 91
open systems, 20
N -body codes, 331 orbit average, 115, 149
N -body equations, 3, 30, 35, 46 orbital motion and relaxation, 148
N -body integration, 28, 331 order (of truncation error), 232
N -body methods, 26, 30, 54, 95, 244 oscillations
N -body models, 84, 86, 100, 290, 321 coupled, 106, 107
N -body problem, 1, 15, 35, 42, 63, damped, 87, 292
85, 98, 99, 129, 130, 137, 246, forced, 312
264, 286, 300, See also one-, two-, oscillator, 80, 349, See also simple
few-, many-, million-body harmonic oscillator
problem, etc damped, 214, 312, 344
collinear, 39 one-dimensional, 179
N -body simulations, 26, 27, 32, 90, oscillatory motion, 214
95, 171, 242, 296, 326, 327 Ostriker, 126
N -body systems, description of, 54, output, 28
74
N -body units, 8, 83, 298, 341 P3 M code, 26
nσv argument, 133, 134, 216, 241 Padmanabhan, 51
NBODYx, 331 Pal 13, 330
nearest neighbour, 136, 141, 346 parabolic motion, 66, 221
negative definiteness, 177 parallelisation, 27
Nemo, 32 parallelism, 29
Neptune, 16 parameters, free, 76, 80
neutron stars, 4, 25, 26, 111, 191, parametric methods, 82
241, 248, 249, 253, 274, 277, 297, partial differential equation, 57, 188
304, 305, 315, See also pulsars particle collider, 229, 237
Newton, 3, 16, 17, 35, 253, 261, 311 PCI, 30
Newton’s equations, 95 penalty function, 82
Newton’s laws of motion, 84 pericentre, 66, 233
Newton’s Theorems, 23, 24, 65, 341 argument of, 259, 260
nodes, 259, 261–263 perigalacticon, 125
Index 387
perihelion advance, 241 Poisson’s equation, 55, 74, 75, 80, 82,
period, 65, 257 83, 103
orbital, 69, 133, 325, 327 polytrope, 46, 79, 80, 83, 344
radial, 59, 68, 115 pop, 230
period-doubling, 272, 289–291 population, old, 305
periodic orbit, 124, 210, 213, 230 Portegies Zwart, ix, 243, 317
periodic table, 323 position vector, 35, 54
personal computer, 29 post-collapse evolution, 11, 116, 272,
perturbation, 221 274, 286, 291, 294, 296–298
perturbation theory steady, 295
canonical, 255 unsteady, 287
first order, 221 post-collapse expansion, 283, 285,
phase (of binary), 131, 204, 205, 212, 288, 289, 291, 292, 295
233, 235, 349 steady, 298
phase fluid, 57, 58, 104 potential, 45, 57, 64, 74, 75, 81, 243,
phase mixing, 45, 58, 98, 100, 101, 244, See also spherical, Kepler,
104, 106, 108, 109, 206 isochrone, square well potential
phase space, 42, 45, 54, 56–58, 94, central, 78, 269, 282
111, 211 fixed, 63, 83, 98, 103, 106
phase space average, 117 nonintegrable, 106
phase space volume, 69, 95, 116, 149, scaled, 78
152, 223 slowly varying, 115, 129, 148, 149,
physical laws, 84 152
pipeline, 28, 30 time-dependent, 71, 95, 103
plane of motion, 65 potential well, 56, 65, 75, 83, 165,
planet, 254 170, 172, 191, 242, 324
planetary motion, 16 power series, 37, 41, 42
planetary system, 11 precession, 261–263
planets, extrasolar, 14 pressure, 4, 89, 96
plasma, iii, v, 1, 4, 17, 18, 23, 31, 57, pressure ionisation, 204
91, 142, 151 pressure tensor, 68, 89
Platonic solids, 154 prograde motion, 123
Plummer model, 46, 56, 74, 79, 80, proper motion, 82
83, 87, 97, 135, 169, 171, 174, protein folding, 31
187, 191, 242, 292 Proxima Centauri, 247
Plummer potential, 69, 83, 123 Ptolemaic model, 67
Pluto, 28 pulsars, 249, 304, 306, 315, 321, See
Poincaré, vi, 35, 41, 106, 179, 204 also neutron stars
Poincaré map, 206 millisecond, v, 12, 13, 300, 302
point mass, 6, 11, 15, 23, 111, 120, Pythagorean problem, 230
248, 249
Poisson bracket, 57, 100 quadrature, 41, 68, 229
Poisson fluctuations, 62 quadruple, 11
388 Index
statistical weight, 202 temperature, 19, 47, 49, 50, 52, 75,
steepest descents, 219, 222 89, 92, 172, 174, 176, 181–184,
Stefan-Boltzmann constant, 91 187, 240, 276, 289, 348
stellar dynamics, 1, 4, 6, 10, 21, 23, temperature fluctuations, 290
25, 26, 28, 43, 46, 84, 86, 89, 99, temperature gradient, 182, 189,
109, 135, 138 283–285
collisional, 1, 6, 45, 55, 129 temperature inversion, 272, 288, 290,
collisionless, 1, 6, 45, 55, 84, 95 292
stellar systems, 63, 64, 84, 114, See test particle, 198
also star clusters Teuben, ix
bound, 112 theorems, 35
cold, 68 theoretical physics, 15
cool, 113 thermal conduction, 6, 27, 92, 283,
dense, 11, 14, 45, 84, 86, 187 284
expansion of, 111, 116, 285–290 thermal conductivity, 89, 91, 97
gross evolution of, 46, 94, 114, 272, thermal time scale, 318
See also post-collapse evolution thermodynamics, v, 18–22, 45, 47,
51, 76, 93, 164, 176, 179, 276
hot, 227
first law of, 92, 276
lifetime of, 297
three-body problem, v, 2, 16, 40–43,
one-component, 186
99, 121, 129, 131, 132, 163, 193,
radius of, 341
202, 223, 226, 230, 234, 255, See
rotating, 87, 188
also triples
spherical, 148
analytic solution of, 42
stereographic projection, 161
isosceles, 228
Stiefel, 160 numerical integration of, 230
Stodólkiewicz, 298 periodic solutions of, 99, 212
Stosszahlansatz, 23 restricted, 16, 131, 204
Stumpff variables, 66 three-body processes, 281, See also
Sugimoto, 288, 290 collisions; encounters
sun, 124, 247, 282 threshold scattering, 194
lifetime of, 26 tidal approximation, 121
radius of, 306 tidal boundary, 297, 324, 327, 330
sungrazers, 258 tidal bulge, 312
supernova, 112, 315 tidal circularisation, 320
surface brightness, 14, 77, 79 tidal cutoff, 118, 187, 298
surface of section, 122, 204, 207, 208 tidal disruption, 325, 327
survival triangle, 330 tidal dissipation, 313, 326
symmetry, 41, 64, 80, 120 tidal encounters, 277, 316
symplecticness, 35, 36, 161, 231, 232, tidal field, 122, 123, 125, 131, 169,
234 298, 309, 314, 324, 327
tidal heating, 125
target, 229, 237 tidal interaction, 313
392 Index