Coil 2013
Coil 2013
of the Universe
Alison L. Coil
Department of Physics, University of California, San Diego, CA,
USA
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420
T.D. Oswalt, W.C Keel (eds.), Planets, Stars and Stellar Systems. Volume 6: Extragalactic Astronomy and
Cosmology, DOI 10.1007/978-94-007-5609-0_8, © Springer Science+Business Media Dordrecht 2013
388 8 The Large-Scale Structure of the Universe
Abstract: Galaxies are not uniformly distributed in space. On large scales, the Universe dis-
plays coherent structure, with galaxies residing in groups and clusters on scales of ∼1–3 h− Mpc,
which lie at the intersections of long filaments of galaxies that are >10 h− Mpc in length. Vast
regions of relatively empty space, known as voids, contain very few galaxies and span the volume
in between these structures. This observed large-scale structure depends both on cosmological
parameters and on the formation and evolution of galaxies. Using the two-point correlation
function, one can trace the dependence of large-scale structure on galaxy properties, such as
luminosity, color, stellar mass, and track its evolution with redshift. Comparison of the observed
galaxy clustering signatures with dark matter simulations allows one to model and understand
the clustering of galaxies and their formation and evolution within their parent dark matter
halos. Clustering measurements can determine the parent dark matter halo mass of a given
galaxy population, connect observed galaxy populations at different epochs, and constrain cos-
mological parameters and galaxy evolution models. This chapter describes the methods used
to measure the two-point correlation function in both redshift and real space, presents the cur-
rent results of how the clustering amplitude depends on various galaxy properties, and discusses
quantitative measurements of the structures of voids and filaments. The interpretation of these
results with current theoretical models is also presented.
1 Historical Background
Large-scale structure is defined as the structure or inhomogeneity of the Universe on scales
larger than that of a galaxy. The idea of whether galaxies are distributed uniformly in space can
be traced to Edwin Hubble, who used his catalog of 400 “extragalactic nebulae” to test the homo-
geneity of the Universe (Hubble 1926), finding it to be generally uniform on large scales. In 1932,
the larger Shapley-Ames catalog of bright galaxies was published (Shapley and Ames 1932), in
which the authors note “the general unevenness in distribution” of the galaxies projected onto
the plane of the sky and the roughly factor of 2 difference in the numbers of galaxies in the
northern and southern galactic hemispheres. Using this larger statistical sample, Hubble (1934)
noted that on angular scales less than ∼10○ , there is an excess in the number counts of galaxies
above what would be expected for a random Poisson distribution, though the sample follows a
Gaussian distribution on larger scales. Hence, while the Universe appears to be homogeneous
on the largest scales, on smaller scales it is clearly clumpy.
Measurements of large-scale structure took a major leap forward with the Lick galaxy cat-
alog produced by Shane and Wirtanen (1967), which contained information on roughly a
million galaxies obtained using photographic plates at the 0.5-m refractor at Lick Observatory.
Seldner et al. (1977) published maps of the counts of galaxies in angular cells across the sky
(see > Fig. 8-1), which showed in much greater detail that the projected distribution of galax-
ies on the plane of the sky is not uniform. The maps display a rich structure with a foamlike
pattern containing possible walls or filaments with long strands of galaxies, clusters, and large
empty regions (see > Chap. 6). The statistical spatial distribution of galaxies from this cata-
log and that of Zwicky et al. (1968) was analyzed by Jim Peebles and collaborators in a series of
papers (e.g., Peebles 1975) that showed that the angular two-point correlation function (defined
below) roughly follows a power-law distribution over angular scales of ∼0.1–5○ . In these papers
it was discovered that the clustering amplitude is lower for fainter galaxy populations, which
The Large-Scale Structure of the Universe 8 389
⊡ Fig. 8-1
Angular distribution of counts of galaxies brighter than B ∼ 19 on the plane of the sky recon-
structed from the Lick galaxy catalog (from Seldner et al. 1977, reproduction created by Ed Groth).
This image shows the number of galaxies observed in 10′ × 10′ cells across the northern galactic
hemisphere, where brighter cells contain more galaxies. The northern galactic pole is at the cen-
ter, with the galactic equator at the edge. The distribution of galaxies is clearly not uniform; clumps
of galaxies are seen in white, with very few galaxies observed in the dark regions between
likely arises from larger projection effects along the line of sight. As faint galaxies typically lie at
larger distances, the projected clustering integrates over a wider volume of space and therefore
dilutes the effect.
These results in part spurred the first large scale redshift surveys, which obtained optical
spectra of individual galaxies in order to measure the redshifts and spatial distributions of
large galaxy samples. Pioneering work by Gregory and Thompson (1978) mapped the three-
dimensional spatial distribution of 238 galaxies around and towards the Coma/Abell 1367
supercluster. In addition to surveying the galaxies in the supercluster, they found that in the fore-
ground at lower redshift there were large regions (> h− Mpc, shown well in their Figure 2a)
with no galaxies, which they termed “voids.” Joeveer et al. (1978) used redshift information
from Sandge and Tammann (1975) to map the three-dimensional distribution of galaxies on
large scales in the southern galactic hemisphere. They mapped four separate volumes that
included clusters as well as field galaxies and showed that across large volumes, galaxies are
clearly clustered in three dimensions and often form “chains” of clusters (now recognized as
filaments).
Two additional redshifts surveys were the KOS survey (Kirshner et al. 1978) and the original
CfA survey (Davis et al. 1982). The KOS survey measured redshifts for 164 galaxies brighter
than magnitude 15 in eight separate fields on the sky, covering a total of 15 deg . Part of the
390 8 The Large-Scale Structure of the Universe
motivation for the survey was to study the three dimensional spatial distribution of galaxies,
about which the authors note that “although not entirely unexpected, it is striking how strongly
clustered our galaxies are in velocity space,” as seen in strongly peaked one dimensional redshift
histograms in each field.
The original CfA survey, completed in 1982, contained redshifts for 2,400 galaxies brighter
than magnitude 14.5 across the north and south galactic poles, covering a total of 2.7 stera-
dians. The major aims of the survey were cosmological and included quantifying the clus-
tering of galaxies in three-dimensions. This survey produced large area, moderately deep
three dimensional maps of large-scale structure (see > Fig. 8-2), in which one could iden-
tify galaxy clusters, voids, and an apparent “filamentary connected structure” between groups
of galaxies, which the authors caution could be random projections of distinct structures
(Davis et al. 1982). This paper also performed a comparison of the so-called complex topol-
ogy of the large-scale structure seen in the galaxy distribution with that seen in N-body
dark matter simulations, paving the way for future studies of theoretical models of structure
formation.
The second CfA redshift survey, which ran from 1985 to 1995, contained spectra for ∼5,800
galaxies and revealed the existence of the so-called Great Wall, a supercluster of galaxies that
extends over 170 h− Mpc, the width of the survey (Geller and Huchra 1989). Large underdense
voids were also commonly found, with a density 20% of the mean density.
Redshift surveys have rapidly progressed with the development of multi-object spectro-
graphs, which allow simultaneous observations of hundreds of galaxies, and larger telescopes,
13h
14h 12h
15h 11h
16h 10h
9h
17h 10000
5000
km s–1
10< δ <20
249 GALAXIES
0
⊡ Fig. 8-2
Distribution of galaxies in redshift space from the original CfA galaxy redshift survey (from Davis
et al. 1982). Plotted are 249 galaxies as a function of observed velocity (corresponding to a given
redshift) versus right ascension for a wedge in declination of 10○ < δ < 20○
The Large-Scale Structure of the Universe 8 391
10 h
2dF Galaxy Redshift Survey
3
h
11 h
2
h
12 h
1h
0.
0h
05
h
13
0.
1
re
23 h
3° slice d sh 0.1
ift 5
h
14
62559 galaxies
0.
220929 total 2
22 h
⊡ Fig. 8-3
The spatial distribution of galaxies as a function of redshift and right ascension (projected through
3○ in declination) from the 2dF Galaxy Redshift Survey (from Colless 2004)
which allow deeper surveys of both lower luminosity nearby galaxies and more distant, lumi-
nous galaxies. At present, the largest redshift surveys of galaxies at low redshift are the Two
Degree Field Galaxy Redshift Survey (2dFGRS, Colless et al. 2001) and the Sloan Digital Sky
Survey 9SDSS, York et al. 2000), which cover volumes of ∼ × h Mpc− and ∼ × h
Mpc− with spectroscopic redshifts for ∼220,000 and a million galaxies, respectively. These sur-
veys provide the best current maps of large-scale structure in the Universe today (see >Fig. 8-3),
revealing a spongelike pattern to the distribution of galaxies (Gott et al. 1986). Voids of ∼10 h−
Mpc are clearly seen, containing very few galaxies. Filaments stretching greater than 10 h− Mpc
surround the voids and intersect at the locations of galaxy groups and clusters.
The prevailing theoretical paradigm regarding the existence of large-scale structure is that
the initial fluctuations in the energy density of the early Universe, seen as temperature devi-
ations in the cosmic microwave background, grow through gravitational instability into the
structure seen today in the galaxy density field. The details of large-scale structure – the sizes,
densities, and distribution of the observed structure – depend both on cosmological parame-
ters such as the matter density and dark energy, as well as on the physics of galaxy formation
and evolution. Measurements of large-scale structure can therefore constrain both cosmology
(see > Chap. 13) and galaxy evolution physics.
In order to quantify the clustering of galaxies, one must survey not only galaxies in clusters (see
> Chap. 6) but rather the entire galaxy density distribution, from voids to superclusters. The
most commonly used quantitative measure of large-scale structure is the galaxy two-point cor-
relation function, ξ(r), which traces the amplitude of galaxy clustering as a function of scale.
392 8 The Large-Scale Structure of the Universe
ξ(r) is defined as a measure of the excess probability dP, above what is expected for an unclus-
tered random Poisson distribution, of finding a galaxy in a volume element dV at a separation
r from another galaxy,
where n is the mean number density of the galaxy sample in question (Peebles 1980). Measure-
ments of ξ(r) are generally performed in comoving space, with r having units of h− Mpc. The
Fourier transform of the two-point correlation function is the power spectrum, which is often
used to describe density fluctuations observed in the cosmic microwave background.
To measure ξ(r), one counts pairs of galaxies as a function of separation and divides by what
is expected for an unclustered distribution. To do this, one must construct a “random catalog”
that has the identical three-dimensional coverage as the data – including the same sky coverage
and smoothed redshift distribution – but is populated with random distribution points. The
ratio of pairs of galaxies observed in the data relative to pairs of points in the random catalog is
then used to estimate ξ(r). Several different estimators for ξ(r) have been proposed and tested.
An early estimator that was widely used is from Davis and Peebles (1983),
n R DD
ξ= − , (8.2)
n D DR
where DD and DR are counts of pairs of galaxies (in bins of separation) in the data catalog
and between the data and random catalogs, and n D and n R are the mean number densities of
galaxies in the data and random catalogs. Hamilton (1993) later introduced an estimator with
smaller statistical errors,
DD RR
ξ= − , (8.3)
(DR)
where RR is the count of pairs of galaxies as a function of separation in the random catalog. The
most commonly used estimator is from Landy and Szalay (1993),
nR nR
ξ= [DD ( ) − DR ( ) + RR] . (8.4)
RR nD nD
This estimator has been shown to perform as well as the Hamilton estimator (>8.3), and while it
requires more computational time, it is less sensitive to the size of the random catalog and han-
dles edge corrections well, which can affect clustering measurements on large scales (Kerscher
et al. 2000).
As can be seen from the form of the estimators given above, measuring ξ(r) depends sen-
sitively on having a random catalog which accurately reflects the various spatial and redshift
selection affects in the data. These can include effects such as edges of slitmasks or fiber plates,
overlapping slitmasks or plates, gaps between chips on the CCD, and changes in spatial sensi-
tivity within the detector (i.e., the effective radial dependence within X-ray detectors). If one
is measuring a full three-dimensional correlation function (discussed below), then the ran-
dom catalog must also accurately include the redshift selection of the data. The random catalog
should also be large enough to not introduce Poisson error in the estimator. This can be checked
by ensuring that the RR pair counts in the smallest bin are high enough such that Poisson errors
are subdominant.
The Large-Scale Structure of the Universe 8 393
3 Angular Clustering
The spatial distribution of galaxies can be measured either in two dimensions as projected
onto the plane of the sky or in three dimensions using the redshift of each galaxy. As it can
be observationally expensive to obtain spectra for large samples of (particularly faint) galax-
ies, redshift information is not always available for a given sample. One can then measure the
two-dimensional projected angular correlation function ω(θ) defined as the probability above
Poisson of finding two galaxies with an angular separation θ,
dP = N[ + ω(θ)]dΩ, (8.5)
where N is the mean number of galaxies per steradian and dΩ is the solid angle of a second
galaxy at a separation θ from a randomly chosen galaxy.
Measurements of ω(θ) are known to be low by an additive factor known as the “inte-
gral constraint,” which results from using the data sample itself (which often does not cover
large areas of the sky) to estimate the mean galaxy density. This correction becomes impor-
tant on angular scales comparable to the survey size; on much smaller scales, it is negligible.
One can either restrict measurements of the angular clustering to scales where the integral con-
straint is not important or estimate the amplitude of the integral constraint correction by doubly
integrating an assumed power-law form of ω(θ) over the survey area, Ω, using
AC =
Ω∫ ∫
ω(θ)dΩ dΩ , (8.6)
where Ω is the area subtended by the survey. In practice, this can be numerically estimated over
the survey geometry using the random catalog itself (see Roche and Eales 1999 for details).
The projected angular two-point correlation function, ω(θ), can generally be fit with a
power law,
ω(θ) = A ω θ δ , (8.7)
where A is the clustering amplitude at a given scale (often 1′ ) and δ is the slope of the correlation
function.
From measurements of ω(θ), one can infer the three-dimensional spatial two-point cor-
relation function, ξ(r), if the redshift distribution of the sources is well known. The two-point
correlation function, ξ(r), is usually fit as a power law, ξ(r) = (r/r )−γ , where r is the char-
acteristic scale length of the galaxy clustering defined as the scale at which ξ = . As the
two-dimensional galaxy clustering seen in the plane of the sky is a projection of the three-
dimensional clustering, ω(θ) is directly related to its three-dimensional analog ξ(r). For a given
ξ(r), one can predict the amplitude and slope of ω(θ) using Limber’s equation, effectively inte-
grating ξ(r) along the redshift direction (e.g., Peebles 1980). If one assumes ξ(r) (and thus
ω(θ)) to be a power law over the redshift range of interest, such that
r (z) γ
ξ(r, z) = [ ] , (8.8)
r
then
√ Γ[(γ − )/] A
w(θ) =π , (8.9)
Γ(γ/) θ γ−
where Γ is the usual gamma function. The amplitude factor, A, is given by
∞ γ
∫ r (z)g(z) ( dz ) dz
dN
A= , (8.10)
[∫∞ ( ddzN ) dz]
394 8 The Large-Scale Structure of the Universe
where d N/dz is the number of galaxies per unit redshift interval and g(z) depends on γ and
the cosmological model,
dz (−γ)
g(z) = ( ) r F(r). (8.11)
dr
Here, F(r) is the curvature factor in the Robertson-Walker metric,
ds = c dt − a [dr /F(r) + r dθ + r sin θdϕ ]. (8.12)
If the redshift distribution of sources, d N/dz, is well known, then the amplitude of ω(θ) can
be predicted for a given power-law model of ξ(r), such that measurements of ω(θ) can be used
to place constraints on the evolution of ξ(r).
Interpreting angular clustering results can be difficult, however, as there is a degeneracy
between the inherent clustering amplitude and the redshift distribution of the sources in the
sample. For example, an observed weakly clustered signal projected on the plane of the sky
could be due either to the galaxy population being intrinsically weakly clustered and projected
over a relatively short distance along the line of sight, or it could result from an inherently
strongly clustered distribution integrated over a long distance, which would wash out the signal.
The uncertainty on the redshift distribution is therefore often the dominant error in analyses
using angular clustering measurements. The assumed galaxy redshift distribution (d N/dz) has
varied widely in different studies, such that similar observed angular clustering results have
led to widely different conclusions. A further complication is that each sample usually spans
a large range of redshifts and is magnitude limited, such that the mean intrinsic luminosity of
the galaxies is changing with redshift within a sample, which can hinder interpretation of the
evolution of clustering measured in ω(θ) studies.
Many of the first measurements of large-scale structure were studies of angular clustering.
One of the earliest determinations was the pioneering work of Peebles (1975) using photo-
graphic plates from the Lick survey (> Fig. 8-1). They found ω(θ) to be well fit by a power law
with a slope of δ = −.. Later studies using CCDs were able to reach deeper magnitude limits
and found that fainter galaxies had a lower clustering amplitude. One such study was conducted
by Postman et al. (1998), who surveyed a contiguous 4○ × 4○ field to a depth of I AB = mag,
reaching to z ∼ . Later surveys that covered multiple fields on the sky found similar results. The
lower clustering amplitude observed for galaxies with fainter apparent magnitudes can in prin-
ciple be due either to clustering being a function of luminosity and/or a function of redshift. To
disentangle this dependence, each author assumes a d N/dz distribution for galaxies as a func-
tion of apparent magnitude and then fits the observed ω(θ) with different models of the redshift
dependence of clustering. While many authors measure similar values of the dependence of
ω(θ) on apparent magnitude, due to differences in the assumed d N/dz distributions, different
conclusions are reached regarding the amount of luminosity and redshift dependence to galaxy
clustering. Additionally, quoted error bars on the inferred values of r generally include only
Poisson and/or cosmic variance error estimates, while the dominant error is often the lack of
knowledge of d N/dz for the particular sample in question.
Because of the sensitivity of the inferred value of r on the redshift distribution of sources,
it is preferable to measure the three-dimensional correlation function. While it is much easier
to interpret three-dimensional clustering measurements, in cases where it is still not feasible to
obtain redshifts for a large fraction of galaxies in the sample, angular clustering measurements
are still employed. This is currently the case in particular with high redshift and/or very dusty,
optically obscured galaxy samples, such as submillimeter galaxies (e.g., Brodwin et al. 2008;
Maddox et al. 2010). However, without knowledge of the redshift distribution of the sources,
these measurements are hard to interpret.
The Large-Scale Structure of the Universe 8 395
Measurements of the two-point correlation function use the redshift of a galaxy, not its distance,
to infer its location along the line of sight. This introduces two complications: one is that a cos-
mological model has to be assumed to convert measured redshifts to inferred distances, and
the other is that peculiar velocities introduce redshift space distortions in ξ parallel to the line
of sight of Sargent and Turner (1977). On the first point, errors on the assumed cosmology are
generally subdominant, so that in theory one could assume different cosmological parameters
and check which results are consistent with the assumed values, which is generally not neces-
sary. On the second point, redshift space distortions can be measured to constrain cosmological
parameters, and they can also be integrated over to recover the underlying real-space correlation
function.
On small spatial scales (≲1 h− Mpc), within collapsed virialized overdensities such as
groups and clusters, galaxies have large random motions relative to each other. Therefore, while
all of the galaxies in the group or cluster have a similar physical distance from the observer,
they have somewhat different redshifts. This causes an elongation in redshift space maps along
the line of sight within overdense regions, which is referred to as “Fingers of God.” The result
is that groups and clusters appear to be radially extended along the line of sight toward the
observer. This effect can be seen clearly in > Fig. 8-4, where the lower left panel, shows galax-
ies in redshift space with large “Fingers of God” pointing back to the observer, while in the
lower right panel the “Fingers of God” have been modeled and removed. Redshift space dis-
tortions are also seen on larger scales (≳1 h− Mpc) due to streaming motions of galaxies that
are infalling onto structures that are still collapsing. Adjacent galaxies will all be moving in the
same direction, which leads to coherent motion and causes an apparent contraction of struc-
ture along the line of sight in redshift space (Kaiser 1987), in the opposite sense as the “Fingers
of God.”
Redshift space distortions can be clearly seen in measurements of galaxy clustering. While
redshift space distortions can be used to uncover information about the underlying matter den-
sity and thermal motions of the galaxies (discussed below), they complicate a measurement of
the two-point correlation function in real space. Instead of ξ(r), what is measured is ξ(s), where
s is the redshift space separation between a pair of galaxies. While some results in the literature
present measurements of ξ(s) for various galaxy populations, it is not straightforward to com-
pare results for different galaxy samples and different redshifts as the amplitude of redshift space
distortions differs depending on the galaxy type and redshift. Additionally, ξ(s) does not follow
a power law over the same scales as ξ(r), as redshift space distortions on both small and large
scales decrease the amplitude of clustering relative to intermediate scales.
The real-space correlation function, ξ(r), measures the underlying physical clustering of
galaxies independent of any peculiar velocities. Therefore, in order to recover the real-space
correlation function, one can measure ξ in two dimensions, both perpendicular to and along
the line of sight. Following Fisher et al. (1994), v and v are defined to be the redshift positions
of a pair of galaxies, s to be the redshift space separation (v − v ) and l = (v + v ) to be the
mean distance to the pair. The separation between the two galaxies across (r p ) and along (π)
the line of sight are defined as
s⋅l
π= , (8.13)
∣l∣
√
rp = s ⋅ s − π . (8.14)
396 8 The Large-Scale Structure of the Universe
ALL ALL
BEFORE AFTER
50
0
Comoving y [h–1 Mpc]
–50
FOGS FOGS
BEFORE AFTER
50
–50
⊡ Fig. 8-4
An illustration of the “Fingers of God” (FoG) or elongation of virialized structures along the line of
sight, from Tegmark et al. (2004). Shown are galaxies from a slice of the SDSS sample (projected
here through the declination direction) in two-dimensional comoving space. The top row shows
all galaxies in this slice (67,626 galaxies in total), while the bottom row shows galaxies that have
been identified as having “Fingers of God.” The right column shows the position of these galaxies in
this space after modeling and removing the effects of the “Fingers of God.” The observer is located
at (x,y=0,0), and the “Fingers of God” effect can be seen in the lower left panel as the positions of
galaxies being radially smeared along the line of sight toward the observer
One can then compute pair counts over a two-dimensional grid of separations to estimate
ξ(r p , π). ξ(s), the one-dimensional redshift space correlation function, is then equivalent to
the azimuthal average of ξ(r p , π).
An example of a measurement of ξ(r p , π) is shown in > Fig. 8-5. Plotted is ξ as a function
of separation r p (defined in this figure to be σ) across and π along the line of sight. What is
usually shown is the upper right quadrant of this figure, which here has been reflected about
both axes to emphasize the distortions. Contours of constant ξ follow the color coding, where
yellow corresponds to large ξ values and green to low values. On small scales across the line of
sight (r p or σ <∼ h− Mpc) the contours are clearly elongated in the π direction; this reflects
the “Fingers of God” from galaxies in virialized overdensities. On large scales across the line
of sight (r P or σ >∼ h− Mpc) the contours are flattened along the line of sight, due to “the
Kaiser effect.” This indicates that galaxies on these linear scales are coherently streaming onto
structures that are still collapsing.
The Large-Scale Structure of the Universe 8 397
As this effect is due to the gravitational infall of galaxies onto massive forming structures,
the strength of the signature depends on Ω matter . Kaiser (1987) derived that the large-scale
anisotropy in the ξ(r p , π) plane depends on β ≡ Ω matter /b on linear scales, where b is the
bias or the ratio of density fluctuations in the galaxy population relative to that of dark matter
(discussed further in the next section below). Anisotropies are quantified using the multipole
moments of ξ(r p , π) defined as
where s is the distance as measured in redshift space, Pl are Legendre polynomials, and θ is the
angle between s and the line of sight. The ratio ξ /ξ , the quadrupole-to-monopole moments
of the two-point correlation function, is related to β in a simple manner using linear theory
(Hamilton 1998),
β + β
ξ /ξ = f (n) , (8.16)
+ β+ β
where f (n) = (+ n)/n and n is the index of the two-point correlation function in a power-law
form: ξ ∝ r −(+n) (Hamilton 1992).
Peacock et al. (2001) find using measurements of the quadrupole-to-monopole ratio in the
2dFGRS data (see > Fig. 8-5) that β = . ± .. For a bias value of around unity (see > Sect. 5
below), this implies a low value of Ω matter ∼ .. Similar measurements have been made with
20
π / h−1Mpc
−20
−20 0 20
σ / h−1Mpc
⊡ Fig. 8-5
The two-dimensional redshift space correlation function from 2dFGRS (from Peacock et al. 2001).
Shown is ξ(rp , π) (in the figure, σ is used instead of rp ), the correlation function as a function of
separation across (σ or rp ) and along (π) the line of sight. Contours show lines of constant ξ at ξ =10,
5, 2, 1, 0.5, 0.2, 0.1. Data from the first quadrant (upper right) are reflected about both the σ and π
axes to emphasize deviations from circular symmetry due to redshift space distortions
398 8 The Large-Scale Structure of the Universe
clustering measurements using data from the SDSS. Very large galaxy samples are needed to
detect this coherent infall and obtain robust estimates of β. At higher redshift, Guzzo et al.
(2008) find β = . ± . at z = . using data from the VVDS and argue that measurements
of β as a function of redshift can be used to trace the expansion history of the Universe. We
return to the discussion of redshift space distortions on small scales below in > Sect. 6.3.
What is often desired, however, is a measurement of the real-space clustering of galaxies. To
recover ξ(r), one can then project ξ(r p , π) along the r p axis. As redshift space distortions affect
only the line-of-sight component of ξ(r p , π), integrating over the π direction leads to a statistic
w p (r p ), which is independent of redshift space distortions. Following Davis and Peebles (1983),
∞ ∞
/
w p (r p ) = ∫ dπ ξ(r p , π) = ∫ d y ξ(r p + y ) , (8.17)
where y is the real-space separation along the line of sight. If ξ(r) is modeled as a power law,
ξ(r) = (r/r )−γ , then r and γ can be readily extracted from the projected correlation function,
w p (r p ), using an analytic solution to ( > 8.17),
r
γ
Γ( )Γ( γ− )
w p (r p ) = r p ( ) γ
, (8.18)
rp Γ( )
where Γ is the usual gamma function. A power-law fit to w p (r p ) will then recover r and γ
for the real-space correlation function, ξ(r). In practice, ( > 8.17) is not integrated to infinite
separations. Often, values of πmax are ∼40–80 h− Mpc, which includes most correlated pairs. It
is worth noting that the values of r and γ inferred are covariant. One must therefore be careful
when comparing clustering amplitudes of different galaxy populations; simply comparing the
r values may be misleading if the correlation function slopes are different. It is often preferred
to compare the galaxy bias instead (see next section).
As a final note on measuring the two-point correlation function, as can be seen
from > Fig. 8-3, flux-limited galaxy samples contain a higher density of galaxies at lower red-
shift. This is purely an observational artifact due to the apparent magnitude limit including
intrinsically lower luminosity galaxies nearby, while only tracing the higher luminosity galax-
ies further away. As discussed below in > Sect. 6, because the clustering amplitude of galaxies
depends on their properties, including luminosity, one would ideally only measure ξ(r) in
volume-limited samples where galaxies of the same absolute magnitude are observed through-
out the entire volume of the sample, including at the highest redshifts. Therefore, often the full
observed galaxy population is not used in measurements of ξ(r), rather volume-limited sub-
samples are created where all galaxies are brighter than a given absolute magnitude limit. This
greatly facilitates the theoretical interpretation of clustering measurements (see > Sect. 8) and
the comparison of results from different surveys.
5 Galaxy Bias
It was realized decades ago that the spatial clustering of observable galaxies need not precisely
mirror the clustering of the bulk of the matter in the Universe. In its most general form, the
galaxy density can be a nonlocal and stochastic function of the underlying dark matter density.
This galaxy “bias” – the relationship between the spatial distribution of galaxies and the under-
lying dark matter density field – is a result of the varied physics of galaxy formation which
The Large-Scale Structure of the Universe 8 399
can cause the spatial distribution of baryons to differ from that of dark matter. Stochasticity
appears to have little effect on bias except for adding extra variance (e.g., Scoccimarro 2000),
and nonlocality can be taken into account to first order by using smoothed densities over larger
scales. In this approximation, the smoothed galaxy density contrast is a general function of the
underlying dark matter density contrast on some scale,
δ g = f (δ), (8.19)
where δ ≡ (ρ/ ρ̄) − and ρ̄ is the mean mass density on that scale. If we assume f (δ) is a linear
function of δ, then we can define the linear galaxies bias b as the ratio of the mean overdensity
of galaxies to the mean overdensity of mass,
b = δ g /δ, (8.20)
and can in theory depend on scale and galaxy properties such as luminosity, morphology, color,
and redshift. In terms of the correlation function, the linear bias is defined as the square root of
the ratio of the two-point correlation function of the galaxies relative to the dark matter
/
b = (ξ gal /ξ dark matter ) (8.21)
and is a function of scale. Note that ξ dark matter is the Fourier transform of the dark matter power
spectrum. The bias of galaxies relative to dark matter is often referred to as the absolute bias as
opposed to the relative bias between galaxy populations (discussed below).
The concept of galaxies being a biased tracer of the underlying total mass field (which is
dominated by dark matter) was introduced by Kaiser (1984) in an attempt to reconcile the dif-
ferent clustering scale lengths of galaxies and rich clusters, which could not both be unbiased
tracers of mass. Kaiser (1984) shows that clusters of galaxies would naturally have a large bias as
a result of being rare objects which formed at the highest density peaks of the mass distribution
above some critical threshold. This idea is further developed analytically by Bardeen et al. (1986)
for galaxies, who show that for a Gaussian distribution of initial mass density fluctuations, the
peaks which first collapse to form galaxies will be more clustered than the underlying mass dis-
tribution. Mo and White (1996) use extended Press-Schechter theory to determine that the bias
depends on the mass of the dark matter halo as well as the epoch of galaxy formation and that a
linear bias is a decent approximation well into the nonlinear regime, where δ > . The evolution
of bias with redshift is developed in theoretical work by Fry (1996) and Tegmark and Peebles
(1998), who find that the bias is naturally larger at earlier epochs of galaxy formation, as the first
galaxies to form will collapse in the most overdense regions of space, which are biased (akin to
mountain peaks being clustered). They further show that regardless of the initial amplitude of
the bias factor, with time galaxies, will become unbiased tracers of the mass distribution (b →
as t → ∞). Additionally, Mann et al. (1998) find that while bias is generally scale dependent,
the dependence is weak, and on large scales, the bias tends toward a constant value.
A galaxy population can be “anti-biased” if b < , indicating that galaxies are less clus-
tered than the dark matter distribution. As discussed below, this appears to be the case for
some galaxy samples at low redshift. The galaxy bias of a given observational sample is often
inferred by comparing the observed clustering of galaxies with the clustering of dark matter
measured in a cosmological simulation. Therefore, the bias depends on the cosmological model
used in the simulation. The dominant relevant cosmological parameter is σ defined as the stan-
dard deviation of galaxy count fluctuations in a sphere of radius 8 h− Mpc, and the absolute
bias value inferred can be simply scaled with the assumed value of σ . As discussed
400 8 The Large-Scale Structure of the Universe
in > Sect. 9.1 below, the absolute galaxy bias can also be estimated from the data directly, with-
out having to resort to comparisons with cosmological simulations, by using the ratio of the
two-point and three-point correlation functions, which have different dependencies on the bias.
While this measurement can be somewhat noisy, it has the advantage of not assuming a cosmo-
logical model from which to derive the dark matter clustering. This measurement is performed
by Verde et al. (2002) and Gaztañaga et al. (2005), who find that galaxies in 2dFGRS have a
linear bias value very close to unity on large scales.
The relative bias between different galaxy populations can also be measured and is defined
as the ratio of the clustering of one population relative to another. This is often measured using
the ratio of the projected correlation functions of each population,
/
b gal/gal = (w p,gal /w p,gal ) , (8.22)
where both measurements of w p (r p ) have been integrated to the same value of πmax . The rel-
ative bias is used to compare the clustering of galaxies as a function of observed parameters
and does not refer to the clustering of dark matter. It is a useful way to compare the observed
clustering for different galaxy populations without having to rely on an assumed value of σ for
the dark matter.
The two-point correlation function has long been known to depend on galaxy properties and
can vary as a function of galaxy luminosity, morphological or spectral type, color, stellar mass,
and redshift. The general trend is that galaxies that are more luminous, early-type, bulge-
dominated, optically red, and/or higher stellar mass are more clustered than galaxies that are
less luminous, late type, disk-dominated, optically blue, and/or lower stellar mass. Presented
below are relatively recent results indicating how clustering properties depend on galaxy prop-
erties from the largest redshifts surveys currently available. The physical interpretation behind
these trends is presented in > Sect. 8 below.
What is plotted is the spatial distribution of galaxies in a flux-limited sample, meaning that all
galaxies down to a given apparent magnitude limit are included. This results in the apparent
lack of galaxies or structure at higher redshift in the figure as at large distances only the most
luminous galaxies will be included in a flux-limited sample. In order to robustly determine
the underlying clustering, one should, if possible, create volume-limited subsamples in which
galaxies of the same luminosity can be detected at all redshifts. In this way, the mean luminosity
of the sample does not change with redshift and galaxies at all redshifts are weighted equally.
The left panel of > Fig. 8-6 shows the projected correlation function, w p (r p ), for galaxies
in SDSS in volume-limited subsamples corresponding to different absolute magnitude ranges.
The more luminous galaxies are more strongly clustered across a wide range in absolute optical
magnitude, from − < M r < −. Power-law fits on scales from ∼0.1 to ∼10 h− Mpc show that
while the clustering amplitude depends sensitively on luminosity, the slope does not. Only in
The Large-Scale Structure of the Universe 8 401
Mr
−18 −19 −20 −21 −22
2
Wp (rp) (h−1 Mpc)
100 1.5
b/b*
1
10 −23<Mr<−22
−22<Mr<−21
−21<Mr<−20 0.5
−20<Mr<−19
−19<Mr<−18 SDSS P(k) Tegmark et al. 04
−18<Mr<−17
2dF ξ(r) Norberg et al. 01
0.1 1 10
−1 0
rp (h−1 Mpc) log10 L/L*
⊡ Fig. 8-6
Luminosity dependence of galaxy clustering. On the left is shown the projected correlation func-
tion, wp (rp ), for SDSS galaxies in different absolute magnitude ranges where brighter galaxies are
seen to be more clustered. On the right is the relative bias of galaxies as a function of luminosity
(Both figures are from Zehavi et al. (2005))
the brightest and faintest magnitude bins does the slope deviate from γ ∼ . and have a steeper
value of γ ∼ .. Across this magnitude, range r varies from ∼2.8 h− Mpc at the faint end
to ∼10 h− Mpc at the bright end. This same general trend is found in the 2dFGRS and other
redshift surveys (e.g., Norberg et al. 2001).
The right panel of > Fig. 8-6 shows the relative bias of SDSS galaxies as a function of lumi-
nosity, relative to the clustering of L∗ galaxies, measured at the scale of r p = . h− Mpc,
which is in the nonlinear regime, where δ > (Zehavi et al. 2005). L∗ is the characteristic
galaxy luminosity defined as the luminosity of the break in the galaxy luminosity function.
The relative bias is seen to steadily increase at higher luminosity and rise sharply above L∗ . This
is in good agreement with the results from Tegmark et al. (2004), using the power spectrum of
SDSS galaxies measured in the linear regime on a scale of ∼100h− Mpc. The data also agree with
the clustering results of galaxies in the 2dFGRS from Norberg et al. (2001). The overall shape
of the relative bias with luminosity indicates a slow rise up to the value at L∗ , above which the
rise is much steeper. As discussed in > Sect. 8.2 below, this trend shows that brighter galaxies
reside in more massive dark matter halos than fainter galaxies.
The clustering strength of galaxies also depends on restframe color and spectral type, with a
stronger dependence than on luminosity. > Figure 8-7 shows the spatial distribution of galaxies
in SDSS, color coded as a function of restframe color. Red galaxies are seen to preferentially pop-
ulate the most overdense regions, while blue galaxies are more smoothly distributed in space.
402 8 The Large-Scale Structure of the Universe
12h
h
16
8h
0.1
4
0.1
0.1
2
4
0.1
0.1
0
2
0.1
0.0
8
0
0.0
0.0
6
8
0.0
ift z tz
0.0
4
shif
6
dsh
0.0
0.0 Red
2
4
Re
0.0
⊡ Fig. 8-7
The spatial distribution of galaxies in the SDSS main galaxy sample as a function of redshift and
right ascension projected through 8○ in declination and color-coded by restframe optical color. Red
galaxies are seen to be more clustered than blue galaxies and generally trace the centers of groups
and clusters, while blue galaxies populate further into the galaxy voids (Taken from Zehavi et al.
(2011))
This is reflected in the correlation function of galaxies split by restframe color. Red galaxies have
a larger correlation length and steeper slope than blue galaxies: r ∼ 5–6 h− Mpc and γ ∼ . for
red L∗ galaxies, while r ∼ 3–4 h− Mpc and γ ∼ . for blue L∗ galaxies in SDSS (Zehavi et al.
2005). Clustering studies from the 2dFGRS split the galaxy sample at low redshift by spectral
type into galaxies with emission line spectra versus absorption line spectra, corresponding to
star forming and quiescent galaxies, and find similar results: that quiescent galaxies have larger
correlation lengths and steeper clustering slopes than star-forming galaxies (Madgwick et al.
2003).
Red and blue galaxies have distinct luminosity-dependent clustering properties. As shown
in >Fig. 8-8, the general trends seen in r and γ with luminosity for all galaxies are well reflected
in the blue galaxy population; however, at faint luminosities (L ≲ .L∗ ), red galaxies have larger
clustering amplitudes and slopes than L∗ red galaxies. This reflects the fact that faint red galaxies
are often found distributed throughout galaxy clusters.
Galaxy clustering also depends on other galaxy properties such as stellar mass, concentra-
tion index, and the strength of the 4,000Å break (D , ) in that galaxies that have larger stellar
mass, more centrally concentrated light profiles, and/or larger D , measurements (indicat-
ing older stellar populations) are more clustered (Li et al. 2006). This is not surprising given the
observed trends with luminosity and color and the known dependencies of other galaxy prop-
erties with luminosity and color. Clearly, the galaxy bias is a complicated function of various
galaxy properties.
The Large-Scale Structure of the Universe 8 403
12
2.6 Red
10 All
2.4 Blue
r0 (h−1 Mpc)
8
2.2
g
6 2
4 1.8
1.6
2
0.1 1 0.1 1
L/L* L/L*
⊡ Fig. 8-8
The clustering scale length, r0 (left), and slope, γ (right), for all, red and blue, galaxies in SDSS as a
function of luminosity. While all galaxies are more clustered at brighter luminosities and red galax-
ies are more clustered than blue galaxies at all luminosities, below L∗ , the red galaxy clustering
length increases at fainter luminosities. The clustering slope for faint red galaxies is also much
steeper than at other luminosities (Taken from Zehavi et al. (2011))
The fact that red galaxies are more clustered than blue galaxies is related to the morphology-
density relation (Dressler 1980), which results from the fact that galaxies with elliptical
morphologies are more likely to be found in regions of space with a higher local sur-
face density of galaxies. The redshift space distortions seen for red and blue galaxies also
show this.
As discussed in > Sect. 4, redshift space distortions arise from two different phenomena:
virialized motions of galaxies within collapsed overdensities such as groups and clusters and the
coherent streaming motion of galaxies onto larger structures that are still collapsing. The former
is seen on relatively small scales (r p ≲ h− Mpc) while the latter is detected on larger scales
(r P ≳ h− Mpc). While the presence of redshift space distortions complicates the measurement
of the real space ξ(r), these distortions can be used to uncover information about the thermal
motions of galaxies in groups and clusters as well as the amplitude of the mass density of the
Universe, Ω matter .
> Figure 8-9 shows ξ(r p , π) for quiescent and star-forming galaxies in 2dF. The quiescent
galaxies on the left show larger “Fingers of God” than the star-forming galaxies on the right,
reflecting the fact that red, quiescent galaxies have larger motions relative to each others. This
naturally arises if red, quiescent galaxies reside in more massive, virialized overdensities with
larger random peculiar velocities than star-forming, optically blue galaxies. The large-scale
coherent infall of galaxies is seen both for blue and red galaxies, though it is often easier to
see for blue galaxies, due to their smaller “Fingers of God.”
404 8 The Large-Scale Structure of the Universe
20 20
p [h−1 Mpc]
p [h−1 Mpc]
0 0
−20 −20
−20 0 20 −20 0 20
a b
s [h−1 Mpc] s [h−1 Mpc]
⊡ Fig. 8-9
Two-dimensional redshift space correlation function ξ(rp , π) (as in >Fig. 8-6, here, σ is used instead
of rp ) for quiescent, absorption line galaxies on the left and star-forming, emission line galaxies on
the right. The redshift space distortions are different for the different galaxy populations, with qui-
escent and/or red galaxies showing more pronounced “Fingers of God.” Both galaxy types exhibit
coherent infall on large scales. Contours show lines of constant ξ at ξ =10, 5, 2, 1, 0.5, 0.2, 0.1 (Taken
from Madgwick et al. (2003))
These small-scale redshift space distortions can be quantified using the σ statistic known
as the pairwise velocity dispersion (Davis et al. 1978; Fisher et al. 1994). This is measured by
modeling ξ(r p , π) in real space, which is then convolved with a distribution of random pairwise
motions, f (v), such that
∞
ξ(r p , π) = ∫ ξ ′(r p , π − v/H ) f (v)dv, (8.23)
−∞
where the random motions are often taken to have an exponential form, which has been found
to fit observed data well:
√
∣v∣
f (v) = √ exp (− ). (8.24)
σ σ
In the 2dFGRS, Madgwick et al. (2003) find that σ = ± km s− for star-forming
galaxies and σ = ± km s− for quiescent galaxies measured on scales of 8–20 h− Mpc.
Using SDSS data, Zehavi et al. (2002) find that σ is ∼300–450 km s− for blue, star-forming
galaxies and ∼650–750 km s− for red, quiescent galaxies. It has been shown, however, that this
statistic can be sensitive to large, rare overdensities, such that samples covering large volumes
are needed to measures it robustly.
Madgwick et al. (2003) further measure the large-scale anisotropies seen in ξ(r p , π) for
galaxies split by spectral type and find that β = . ± . for star-forming galaxies and
β = . ± . for quiescent galaxies. This implies a similar bias for both galaxy types on large
scales, though they find that on smaller scales integrated up to 8 h− Mpc, the relative bias of
quiescent to star-forming galaxies is b rel = . ± ..
The Large-Scale Structure of the Universe 8 405
The observed clustering of galaxies is expected to evolve with time as structure continues to
grow due to gravity. The exact evolution depends on cosmological parameters such as Λ and
Ω matter . Larger values of Λ, for example, lead to larger voids and higher density contrasts
between overdense and underdense regions. By measuring the clustering of galaxies at higher
redshift, one can break degeneracies that exist between the galaxy bias and cosmological param-
eters that are constrained by low redshift clustering measurements. It is therefore useful to
determine the clustering of galaxies as a function of cosmic epoch not only to further constrain
cosmological parameters but also galaxy evolution.
One might expect the galaxy clustering amplitude r to increase over time as overdense
regions become more overdense as galaxies move toward groups and clusters due to gravity.
However, the exact evolution of the clustering of galaxies depends not only on gravity but also on
the expansion history of the Universe and therefore cosmological parameters such as Λ. Addi-
tionally, over time, new galaxies form while existing galaxies grow in both mass and luminosity.
Therefore, the expected changes of galaxy clustering as a function of redshift depend both on
relatively well-known cosmological parameters and more unknown galaxy formation and evo-
lution physics which likely depends on gas accretion, star formation, and feedback processes,
as well as mergers.
For a given cosmological model, one can predict how the clustering of dark matter should
evolve with time using cosmological N-body simulations. For a ΛCDM Universe, r for dark
matter particles is expected to increase from ∼0.8 h− Mpc at z = to ∼5 h− Mpc at z =
(Weinberg et al. 2004). However, according to hierarchical structure formation theories, at high
redshifts, the first galaxies to form will be the first structures to collapse, which will be biased
tracers of the mass. The galaxy bias is expected to be a strong function of redshift, initially >1 at
high redshift and approaching unity over time. Therefore, r for galaxies may be a much weaker
function of time than it is for dark matter as the same galaxies are not observed as a function
of redshift, and over time, new galaxies form in less biased locations in the Universe.
The projected angular and three-dimensional correlation functions of galaxies have been
observed to z ∼ . Star-forming Lyman break galaxies at z ∼ 3–5 are found to have r ∼ 4–6 h−
Mpc, with a bias relative to dark matter of ∼3–4 (Adelberger et al. 2005; Ouchi et al. 2004).
Brighter Lyman break galaxies are found to cluster more strongly than fainter Lyman break
galaxies. The correlation length, r , is found to be roughly constant between z = and z = ,
implying that the bias is increasing at earlier cosmic epoch. Spectroscopic galaxy surveys at
z > are currently limited to samples of at most a few thousand galaxies, so most clustering
measurements are angular at these epochs. In one such study by Wake et al. (2011), photometric
redshifts of tens of thousands of galaxies at < z < are used to measure the angular clustering
as a function of stellar mass. They find a strong dependence of clustering amplitude on stellar
mass in each of three redshift intervals in this range.
At z ∼ , larger spectroscopic galaxy samples exist, and three-dimensional two-point cluster-
ing analyses have been performed as a function of luminosity, color, stellar mass, and spectral
type. The same general clustering trends with galaxy property that are observed at z ∼ are
also seen at z ∼ , in that galaxies that are brighter, redder, early spectral type, and/or more
massive are more clustered. The clustering scale length of red galaxies is found to be ∼5–6 h−
Mpc, while for blue galaxies, it is ∼3.5–4.5 h− Mpc, depending on luminosity (Coil et al. 2008;
Meneux et al. 2006). At a given luminosity, the observed correlation length is only ∼15% smaller
at z = than z = , indicating that unlike for dark matter, the galaxy r is roughly constant over
time. These results are consistent with predictions from ΛCDM simulations.
406 8 The Large-Scale Structure of the Universe
The measured values of r at z ∼ imply that galaxies are more biased at z = than at
z = . Within the DEEP2 sample, the bias measured on scales of ∼1–10 h− Mpc varies from
∼1.25–1.55, with brighter galaxy samples being more biased tracers of the dark matter (Coil
et al. 2006). These results are consistent with the idea that galaxies formed early on in the most
overdense regions of space, which are biased.
As in the nearby Universe, the clustering amplitude is a stronger function of color than of
luminosity at z ∼ . Additionally, the color-density relation is found to already be in place at
z = , in that the relative bias of red to blue galaxies is as high at z = as at z = . (Coil et al.
2008). This implies that the color-density relation is not due to cluster-specific physics, as most
galaxies at z = in field spectroscopic surveys are not in clusters. Therefore, it must be physical
processes at play in galaxy groups that initially set the color and morphology-density relations.
Red galaxies show larger “Fingers of God” in ξ(r p , π) measurements than blue galaxies do, again
showing that red galaxies at z = lie preferentially in virialized, more massive overdensities
compared to blue galaxies. Both red and blue galaxies show coherent infall on large scales.
The current paradigm of galaxy formation posits that galaxies form in the center of larger dark
matter halos, collapsed overdensities in the dark matter distribution with ρ/ ρ̄ ∼ , inside of
which all mass is gravitationally bound. The clustering of galaxies can then be understood as a
combination of the clustering of dark matter halos, which depends on cosmological parameters,
and how galaxies populate dark matter halos, which depends on galaxy formation and evolu-
tion physics. For a given cosmological model, the properties of dark matter halos, including
their evolution with time, can be studied in detail using N-body simulations. The masses and
spatial distribution of dark matter halos should depend only on the properties of dark matter,
not baryonic matter, and the expansion history of the Universe; therefore, the clustering of dark
matter halos should be insensitive to baryon physics. However, the efficiency of galaxy forma-
tion is very dependent on the complicated baryonic physics of, for example, star formation,
gas cooling, and feedback processes. The halo model allows the relatively simple cosmological
dependence of galaxy clustering to be cleanly separated from the more complex baryonic astro-
physics, and it shows how clustering measurements for a range of galaxy types can be used to
constrain galaxy evolution physics.
One can use the observed large-scale clustering amplitude of different observed galaxy pop-
ulations to identify the typical mass of their parent dark matter halos in order to place these
galaxies in a cosmological context (see > Chap. 10). The large-scale clustering amplitude of
dark matter halos as a function of halo mass is well determined in N-body simulations, and
analytic fitting formula are provided by, for example, Mo and White (1996) and Sheth et al.
(2001). Analytic models can then predict the clustering of both dark matter particles and galax-
ies as a function of scale by using the clustering of dark matter halos and the radial density
profile of dark matter and galaxies within those halos (Ma and Fry 2000; Peacock and Smith
2000; Seljak 2000). In this scheme, on large, linear scales where δ < (ρ/ ρ̄ ∼ ), the clustering
The Large-Scale Structure of the Universe 8 407
of a given galaxy population can be used to determine the mean mass of the dark matter halos
hosting those galaxies for a given cosmological model. To achieve this, the large-scale bias is
estimated by comparing the observed galaxy clustering amplitude with that of dark matter in
an N-body simulation, and then galaxies are assumed to reside in halos of a given mass that
have the same bias in simulations.
Simulations show that higher mass halos cluster more strongly than lower mass halos (Sheth
and Tormen 1999). This then leads to an interpretation of galaxy clustering as a function of lumi-
nosity in which luminous galaxies reside in more massive dark matter halos than less luminous
galaxies. Similarly, red galaxies typically reside in more massive halos than blue galaxies of the
same luminosity; this is observationally verified by the larger “Fingers of God” observed for
red galaxies. Combining the large-scale bias with the observed galaxy number density further
allows one to constrain the fraction of halos that host a given galaxy type by comparing the
galaxy space density to the parent dark matter halo space density. This constrains the duty cycle
or fraction of halos hosting galaxies of a given population.
3
ΛCDM
Galaxies
Dark matter
APM survey
2
(Baugh 1996)
logξ(r)
1
−1
−0.5 0 0.5 1
log(r/h−1Mpc)
⊡ Fig. 8-10
Left: The large-scale structure seen in a ΛCDM N-body dark matter only simulation of size 141 × 141
× 8 h−3 Mpc3 . The gray scale indicates the density of dark matter, while the locations of galaxies are
shown with open circles. Galaxies are added to the simulation output using a semi-analytic model
which assumes that dark matter halos above a given mass threshold have at least one “central”
galaxy located at the center of the halo. Higher mass halos contain additional “satellite” galaxies,
which are assigned the location of a random dark matter particle in the halo (Taken from Benson
et al. (2000)). Right: The two-point correlation function of dark matter particles (dotted line) and
galaxies (solid line with dashed line showing Poisson error bars) in the simulation of Benson et al.
(2000) compared with the observed clustering of galaxies in the APM survey (open squares) (Baugh
1996)
two scales (corresponding to the “two-halo” and “one-halo” terms) is combined, a power law
results for the galaxy ξ(r) (right panel, >Fig. 8-10). Galaxies are found to be anti-biased relative
to dark matter (i.e., less clustered than the dark matter) on scales smaller than the typical halo,
though the bias is close to unity on larger scales. The clustering of galaxies that results from
this semi-analytic model is also found to match the observed clustering of galaxies in the APM
survey above a given luminosity threshold (Baugh 1996).
By defining the halo occupation distribution (HOD) as the probability that a halo of a given
mass contains N galaxies, P(N∣M), Berlind and Weinberg (2002) quantify how the observed
galaxy ξ(r) depends on different HOD model parameters. Using N-body simulations, they
identify dark matter halos and place galaxies into the simulation using a simple HOD model
with two parameters: a minimum mass at which a halo hosts, on average, one central galaxy
(M min ) at the center of the halo, and the slope (α) of the P(N∣M) function for satellite galaxies.
The latter determines how many satellite galaxies there are as a function of halo mass. They fur-
ther assume that the satellite galaxies follow an NFW profile, as the dark matter does, though the
concentration of the radial profile can be changed. They show that the “two-halo term” is simply
the halo center correlation function weighted by a large-scale bias factor, while the “one-halo
term” is sensitive to both α and the concentration of the galaxy profile within halos. Obtaining
a power law ξ(r) therefore strongly constrains the HOD model parameters.
The Large-Scale Structure of the Universe 8 409
Kravtsov et al. (2004) propose that the locations of satellite galaxies within dark matter halos
should correspond to locations of subhalos, distinct gravitationally bound regions within the
larger dark matter halos, instead of tracing random dark matter particles. Using cosmological
N-body simulations, they show that at z > ξ(r) for galaxies should deviate strongly from a
power law on small scales, due to a rise in the “one-halo term.” In this model, the clustering
of galaxies can be understood as the clustering of dark matter parent halos and subhalos, and
the power-law shape that is observed at z ∼ is a coincidence of the one- and two-halo terms
having similar amplitudes and slopes at the typical scale of halos. They find that the formation
and evolution of halos and subhalos through merging and dynamical processes are the main
physical drivers of large-scale structure.
With the unprecedentedly large galaxy sample with spectroscopic redshifts that is provided
by SDSS, departures from a power law ξ(r) were detected by Zehavi et al. (2004) using a volume-
limited subsample of 22,000 galaxies from a parent sample of 118,000 galaxies. The deviations
from a power law are small enough at z ∼ that a large sample covering a sufficiently large
cosmological volume is required to overcome the errors due to cosmic variance to detect these
small deviations. It is found that there is a change in the slope of ξ(r) on scales of ∼1–2 h− Mpc;
this corresponds to the scale at which the one- and two-halo term are equal (see > Fig. 8-11).
Zehavi et al. (2004) find that ξ(r) measured from the SDSS data is better fit by an HOD model,
which includes small deviations from a power law, than by a pure power law. The HOD model
that is fit has three parameters: the minimum mass to host a single central galaxy (M min ), the
minimum mass to host a single satellite galaxy (M ), and the slope of P(N∣M) (α), which
103
Mr<−21
102
wp(rp) (h−1Mpc)
10
1.5
1
0.5
0.1 1 10
rp (h−1Mpc)
⊡ Fig. 8-11
The projected correlation function, wp (rp ), for SDSS galaxies with Mr < −21 is shown as data points
with error bars. The best-fit HOD model is shown as a solid line, with the contributions from the one
and two-halo terms shown with dotted lines. The projected correlation function of dark matter at
this redshift is shown with a dashed line. The bottom panel shows deviations in wp (rp ) for the data
and the HOD model from the best-fit power law (Taken from Zehavi et al. (2004))
410 8 The Large-Scale Structure of the Universe
determines the average number of satellite galaxies as a function of host halo mass. In this
model, dark matter halos with M min < M < M host a single galaxy, while above M , they host,
on average, (M/M )α galaxies. Using w p (r p ), one can fit for M and α, while the observed
space density of galaxies is used to derive M min . For a galaxy sample with M r < −, the best-fit
HOD parameters are M min = . × h− M⊙ , M = . × h− M⊙ , and α = ..
In general, these HOD parameters reflect the efficiency of galaxy formation and evolution and
can be a function of galaxy properties such as luminosity, color, stellar mass, and morphology.
Zehavi et al. (2011) present HOD fits to SDSS samples as a function of luminosity and color
and find that α is generally ∼1.0–1.1, though it is a bit higher for the brightest galaxies
(∼1.3 for M r < −22.0). There is a strong trend between luminosity and halo mass; M min varies
as a function of luminosity from ∼10 h− M⊙ for M r < −18 to ∼10 h− M⊙ for M r < −22.
M is generally ∼17 times higher than the value of M min for all luminosity threshold samples
(see >Fig. 8-12). This implies that a halo with two galaxies above a given luminosity is ∼17 times
more massive than a halo hosting one galaxy above the same luminosity limit. Further, the frac-
tion of galaxies that are satellites decreases at higher luminosities, from ∼33% at M r < −18 to 4%
at M r < −22. The right panel of > Fig. 8-12 shows the mass-to-light ratio of the virial halo mass
to the central galaxy r-band luminosity as a function of halo mass. This figure shows that halos
of mass ∼ × h− M⊙ are maximally efficient at galaxy formation at converting baryons
into light.
15
log[Mh / (h−1M )]
14
Mh/Lcen (hM /L )
103
13
M1
12
Mmin
102
11
⊡ Fig. 8-12
Left: The characteristic mass scale of dark matter halos hosting galaxies as a function of the lumi-
nosity threshold of the galaxy sample. Both the minimum halo mass to host a single galaxy is shown
(Mmin ) as well as the minimum mass to host additional satellite galaxies (M1 ). A strong relationship
clearly exists between halo mass and galaxy luminosity. Right: The ratio of the halo mass to the
median central galaxy luminosity as a function of halo mass (Taken from Zehavi et al. (2011))
The Large-Scale Structure of the Universe 8 411
In terms of the color dependence of galaxy clustering, the trend at fainter luminosities of red
galaxies being strongly clustered (with a higher correlation slope, γ, see >Fig. 8-8) is due to faint
red galaxies being satellite galaxies in relatively massive halos that host bright red central galax-
ies (Berlind et al. 2005). HOD modeling therefore provides a clear explanation for the increased
clustering observed for faint red galaxies. For a given luminosity range (− < M r < −),
Zehavi et al. (2011) fit a simplified HOD model with one parameter only to find that the fraction
of galaxies that are satellites is much higher for red than for blue galaxies, with ∼25% of blue
galaxies being satellites and ∼60% of red galaxies being satellites. They find that blue galaxies
reside in halos with a median mass of . h− M⊙ , while red galaxies reside in higher mass
halos with a median mass of . h− M⊙ . However, at a given luminosity, there is not a strong
trend between color and halo mass (though there is a strong trend between luminosity and halo
mass). Instead, the differences in w p (r p ) reflect a trend between color and satellite fraction; the
increased satellite fraction, in particular, drives the slope of ξ(r) to be steeper for red galaxies
compared to blue galaxies. And while the HOD slope α does not change much with increasing
luminosity, it does with color, due to the dependence of the satellite fraction on color. Hav-
ing a higher satellite fraction also places more galaxies in high mass halos (as those host the
groups and clusters that contain the satellite galaxies), which increases the large-scale bias and
boosts the one-halo term relative to the two-halo term. The HOD model facilitates interpre-
tion of the observed luminosity and color dependence of galaxy clustering and provides strong,
crucial constraints on models of how galaxies form and evolve within their parent dark matter
halos.
As mentioned in > Sect. 7 above, the galaxies that are observed for clustering measurements
at different redshifts are not necessarily the same populations across cosmic time. A significant
hurdle in understanding galaxy evolution is knowing how to connect different observed popu-
lations at different redshifts. Galaxy clustering measurements can be combined with theoretical
models to trace observed populations with redshift in that for a given cosmology, one can model
how the clustering of a given population should evolve with time.
The observed evolution of the luminosity dependence of galaxy clustering can be fit surpris-
ingly well using a simple nonparametric, non-HOD, model that relates the galaxy luminosity
function to the halo mass function. Conroy et al. (2006) show that directly matching galaxies
as a function of luminosity to host halos and subhalos as a function of mass leads to a model
for the luminosity-dependent clustering that matches observation from z ∼ to z ∼ . In this
model, the only inputs are the observed galaxy luminosity function at each epoch of interest
and the dark matter halo (and subhalo) mass function from N-body simulations. Galaxies are
then ranked by luminosity and halos by mass and matched one to one, such that lower lumi-
nosity galaxies are associated with halos of lower mass, and galaxies above a given luminosity
threshold are assigned to halos above a given mass threshold with the same abundance or num-
ber density. This “abundance-matching” method uses as a proxy for halo mass the maximum
circular velocity (Vmax ) of the halo; for subhalos, they find that it is necessary to use the value
of Vmax when the subhalo is first accreted into a larger halo to avoid the effects of tidal strip-
ping. With this simple model, the clustering amplitude and shape as a function of luminosity
are matched for SDSS galaxies at z ∼ , DEEP2 galaxies at z ∼ , and Lyman break galaxies
at z ∼ . In particular, the clustering amplitude in both the one- and two-halo regimes is well
412 8 The Large-Scale Structure of the Universe
fit, including the deviations from a power law that seen at z > (Coil et al. 2006; Ouchi et al.
2005). These results imply a tight correlation between galaxy luminosity and halo mass from
z ∼ to z ∼ .
While abundance-matching techniques provide a simple, zero parameter model for how
galaxies populate halos, a richer understanding of the physical properties involved may be
gained by performing HOD modeling. Zheng et al. (2007) use HOD modeling to fit the
observed luminosity-dependent galaxy clustering at z ∼ measured in SDSS with that mea-
sured at z ∼ , in DEEP2 to confirm that at both epochs there is a tight relationship between
the central galaxy luminosity and host halo mass. At z ∼ the satellite fraction drops for higher
luminosities, as at z ∼ , but at a given luminosity, the satellite fraction is higher at z ∼ than
at z ∼ . They also find that at a given central luminosity, halos are ∼1.6 times more massive
at z ∼ than z ∼ , and at a given halo, mass galaxies are ∼1.4 times more luminous at z ∼
than z ∼ .
Zheng et al. (2007) further combine these HOD results with theoretical predictions of the
growth of dark matter halos from simulations to link z ∼ central galaxies to their descendants
at z ∼ and find that the growth of both halo mass and stellar mass as a function of redshift
depends on halo mass. Lower mass halos grow earlier, which is reflected in the fact that more
of their z ∼ mass is already assembled by z ∼ . A typical z ∼ halo with mass × h− M⊙
has about 70% of its final mass in place by z ∼ , while a z ∼ halo with mass h− M⊙ has
∼50% of its final mass in place at z ∼ . In terms of stellar mass, however, in a z ∼ halo of mass
× h− M⊙ , a central galaxy has ∼20% of its stellar mass in place at z ∼ , while the fraction
rises to ∼33% above a halo mass of × h− M⊙ . They further find that the mass scale of the
maximum star formation efficiency for central galaxies shifts to lower halo mass with time, with
a peak of ∼ h− M⊙ at z ∼ and ∼ × h− M⊙ at z ∼ .
At < z < , Wake et al. (2011) use precise photometric redshifts from the NEWFIRM
survey to measure the relationship between stellar mass and dark matter halo mass using HOD
models. At these higher redshifts, r varies from ∼6 to ∼11 h− Mpc for stellar masses ∼
to M⊙ . The galaxy bias is a function of both redshift and stellar mass and is ∼2.5 at z ∼
and increases to ∼3.5 at z ∼ . They find that the typical halo mass of both central and satel-
lite galaxies increases with stellar mass, while the satellite fraction drops at higher stellar mass,
qualitatively similar to what is found at lower redshift. They do not find evolution in the rela-
tionship between stellar mass and halo mass between z ∼ and z ∼ but do find evolution
compared to z ∼ . They also find that the peak of star formation efficiency shifts to lower halo
mass with time.
Simulations can also be used to connect different observed galaxy populations at different
redshifts. An example of the power of this method is shown by Conroy et al. (2008), who com-
pare the clustering and space density of star-forming galaxies at z ∼ with that of star-forming
and quiescent galaxies at z = and z = to infer both the typical descendants of the z ∼
star-forming galaxies and constrain the fraction that have merged with other galaxies by z = .
They use halos and subhalos identified in a ΛCDM N-body simulation to determine which
halos at z ∼ likely host star-forming galaxies, and then use the merger histories in the sim-
ulation to track these same halos to lower redshift. By comparing these results to observed
clustering of star-forming galaxies at z ∼ and z ∼ , they can identify the galaxy populations
at these epochs that are consistent with being descendants of the z ∼ galaxies. They find that
while the lower redshift descendent halos have clustering strengths similar to red galaxies at
both z ∼ and z ∼ , the z ∼ star forming galaxies cannot all evolve into red galaxies by lower
redshift as their space density is too high. There are many more lower redshift descendents
The Large-Scale Structure of the Universe 8 413
than there are red galaxies, even after taking into account mergers. They conclude that most
z ∼ star-forming galaxies evolve into typical L∗ galaxies today, while a non-negligible fraction
become satellite galaxies in larger galaxy groups and clusters.
In summary, N-body simulations and HOD modeling can be used to interpret the observed
evolution of galaxy clustering and further constrain both cosmological parameters and theo-
retical models of galaxy evolution beyond what can be gleaned from z ∼ observations alone.
They also establish links between distinct observed galaxy populations at different redshifts,
allowing one to create a coherent picture of how galaxies evolve over cosmic time.
Redshift surveys unveil a rich structure of galaxies as seen in >Fig. 8-3. In addition to measuring
the two-point correlation function to quantify the clustering amplitude as a function of galaxy
properties, one can also study higher-order clustering measurements as well as properties of
voids and filaments.
Higher-order clustering statistics reflect both the growth of initial density fluctuations as well as
the details of galaxy biasing (Bernardeau et al. 2002), such that measurements of higher-order
clustering can test the paradigm of structure formation through gravitational instability as well
as constrain the galaxy bias. In the linear regime, there is a degeneracy between the amplitude
of fluctuations in the dark matter density field and the galaxy bias in that a highly clustered
galaxy population may be biased and trace only the most overdense regions of the dark matter
or the dark matter itself may be highly clustered. However, this degeneracy can be broken in
the nonlinear regime on small scales. Over time, the density field becomes skewed toward high
density as δ becomes greater than unity in overdense regions (where δ ≡ (ρ/ ρ̄) − ) but cannot
become negative in underdense regions. Skewness in the galaxy density distribution can also
arise from galaxy bias, if galaxies preferentially form in the highest density peaks. One can
therefore use the shapes of the galaxy overdensities through measurements of the three-point
correlation function to test gravitational collapse versus galaxy bias.
To study higher-order clustering, one needs large samples that cover enormous volumes;
all studies to date have focused on low redshift galaxies. Verde et al. (2002) use 2dFGRS to
measure the Fourier transform of the three-point correlation function, called the bispectrum, to
constrain the galaxy bias without resorting to comparisons with N-body simulations in order to
measure the clustering of dark matter. Fry and Gaztanaga (1993) present the galaxy bias in terms
of a Taylor expansion of the density contrast, where the first-order term is the linear term, while
the second-order term is the nonlinear or quadratic term. Measured on scales of 5–30 h− Mpc,
Verde et al. (2002) find that the linear galaxy bias is consistent with unity (b = . ± .), while
the nonlinear quadratic bias is consistent with zero (b = −. ± .). When combined with
the redshift space distortions measured in the two-dimensional two-point correlation function
(ξ(r p , π)), they measure Ω matter = . ± . at z = .. This constraint on the matter density
of the Universe is derived entirely from large-scale structure data alone.
414 8 The Large-Scale Structure of the Universe
Gaztañaga et al. (2005) measure the three-point correlation function in 2dFGRS for tri-
angles of galaxy configurations with different shapes. Their results are consistent with ΛCDM
expectations regarding gravitational instability of initial Gaussian fluctuations. Furthermore,
they find that while the linear bias is consistent with unity (b = .+./−.), the quadratic
bias is nonzero (b /b = −.+./−.). This implies that there is a nongravitational contri-
bution to the three-point function, resulting from galaxy formation physics. These results differ
from those of Verde et al. (2002), which may be due to the inclusion by Gaztañaga et al. (2005)
of the covariance between measurements on different scales. Gaztañaga et al. (2005) combine
their results with the measured two-point correlation function to derive σ = . + ./ −..
If the density field follows a Gaussian distribution, the higher-order clustering terms can be
expressed solely in terms of the lower-order clustering terms. This “hierarchical scaling” holds
for the evolution of an initially Gaussian distribution of fluctuations under gravitational insta-
bility. Therefore, departures from hierarchical scaling can result either from a non-Gaussian
initial density field or from galaxy bias. Redshift space higher-order clustering measurements
in 2dFGRS are performed by Baugh et al. (2004) and Croton et al. (2004a), who measure up
to the six-point correlation function. They find that hierarchical scaling is obeyed on small
scales, though deviations exist on larger scales (∼10 h− Mpc). They show that on large scales,
the higher-order terms can be significantly affected by massive rare peaks such as superclus-
ters, which populate the tail of the overdensity distribution. Croton et al. (2004a) also show that
the three-point function has a weak luminosity dependence, implying that galaxy bias is not
entirely linear. These results are confirmed by Nichol et al. (2006) using galaxies in the SDSS,
who also measure a weak luminosity dependence in the three-point function. They find that
on scales >10 h− Mpc, the three-point function is greatly affected by the “Sloan Great Wall,”
a massive supercluster that is roughly 450 Mpc (Gott et al. 2005) in length and is associated
with tens of known Abell clusters. These results show that even 2dFGRS and SDSS are not large
enough samples to be unaffected by the most massive, rare structures.
Several studies have examined higher-order correlation functions for galaxies split by color.
Gaztañaga et al. (2005) find a strong dependence of the three-point function on color and
luminosity on scales <6 h− Mpc. Croton et al. (2007) measure up to the five-point correlation
function in 2dFGRS for both blue and red galaxies and find that red galaxies are more clustered
than blue galaxies in all of the N-point functions measured. They also find a luminosity depen-
dence in the hierarchical scaling amplitudes for red galaxies but not for blue galaxies. Taken
together, these results explain why the full galaxy population shows only a weak correlation
with luminosity.
9.2 Voids
In maps of the large-scale structure of galaxies, voids stand out starkly to the eye. There appear
to be vast regions of space with few, if any, L∗ galaxies. Voids are among the largest structures
observed in the Universe, spanning typically tens of h− Mpc.
The statistics of voids – their sizes, distribution, and underdensities – are closely tied to
cosmological parameters and the physical details of structure formation (e.g., Sheth and van
de Weygaert 2004). While the two-point correlation function provides a full description of
clustering for a Gaussian distribution, departures from Gaussianity can be tested with higher-
order correlation statistics and voids. For example, the abundance of voids can be used to
test the non-Gaussianity of primordial perturbations, which constrains models of inflation
The Large-Scale Structure of the Universe 8 415
The first challenge in measuring the properties of voids and void galaxies is defining the
physical extent of individual voids and identifying which galaxies are likely to be in voids. The
“void finder” algorithm of El-Ad and Piran (1997), which is based on the point distribution of
galaxies (i.e., does not perform any smoothing), is widely used. This algorithm does not assume
that voids are entirely devoid of galaxies and identifies void galaxies as those with three or less
neighboring galaxies within a sphere defined by the mean and standard deviation of the dis-
tance to the third nearest neighbor for all galaxies. All other galaxies are termed “wall” galaxies
(see > Fig. 8-13). An individual void is then identified as the maximal sphere that contains only
void galaxies. This algorithm is widely used by both theorists and observers.
Cosmological simulations of structure formation show that the distribution and density of
galaxy voids are sensitive to the values of Ω matter and Ω Λ (Kauffmann et al. 1999). Using ΛCDM
400
350
300
250
y (h−1 Mpc)
200
150
100
50
0
−300 −200 −100 0 100 200 300
x (h−1 Mpc)
⊡ Fig. 8-13
Void and wall galaxies in the SDSS. Shown is a projection of a 10 h−1 Mpc slab with wall galax-
ies plotted as black crosses and void galaxies plotted as red crosses. Blue circles indicate the
intersection of the maximal sphere of each void with the midplane of the slab (From Pan et al.
2012)
416 8 The Large-Scale Structure of the Universe
N-body dark matter simulations, Colberg et al. (2005) study the properties of voids within the
dark matter distribution and predicts that voids are very underdense (though not empty) up to
a well-defined sharp edge in the dark matter density. They predict that 61% of the volume of
space should be filled by voids at z = compared to 28% at z = and 9% at z = . They also find
that the mass function of dark matter halos in voids is steeper than in denser regions of space.
Using similar ΛCDM N-body simulations with a semi-analytic model for galaxy evolution,
Benson et al. (2003) show that voids should contain both dark matter and galaxies and that
the dark matter halos in voids tend to be low mass and therefore contain fewer galaxies than
in higher density regions. In particular, at density contrasts of δ < −., where δ ≡ (ρ/ρ mean ) −
, both dark matter halos and galaxies in voids should be anti-biased relative to dark matter.
However, galaxies are predicted to be more underdense than the dark matter halos, assuming
simple physically motivate prescriptions for galaxy evolution. They also predict the statistical
size distribution of voids, finding that there should be more voids with smaller radii (<10 h−
Mpc) than larger radii.
The advent of the 2dFGRS and SDSS provided the first very large samples of voids and
void galaxies that could be used to robustly measure their statistical properties. Applying the
“void finder” algorithm on the 2dFGRS dataset, Hoyle and Vogeley (2004) find that the typical
radius of voids is ∼15 h− Mpc. Voids are extremely underdense, with an average density of
δρ/ρ = −., with even lower densities at the center, where fewer galaxies lie. The volume of
space filled by voids is ∼40%. Probing an even larger volume of space using the SDSS dataset,
Pan et al. (2012) found a similar typical void radius and concluded that ∼60% of space is filled
by voids, which have δρ/ρ = −. at their edges. Voids have sharp density profiles in that
they remain extremely underdense to the void radius, where the galaxy density rises steeply.
These observational results agree well with the predictions of ΛCDM simulations discussed
above.
Studies of the properties of galaxy in voids allow an understanding of how galaxy forma-
tion and evolution progresses in the lowest density environments in the Universe, effectively
pursuing the other end of the density spectrum from cluster galaxies. Void galaxies are found
to be significantly bluer and fainter than wall galaxies (Rojas et al. 2004). The luminosity func-
tion of void galaxies shows a lack of bright galaxies but no difference in the measured faint
end slope (Croton et al. 2005; Hoyle et al. 2005), indicating that dwarf galaxies are not likely
to be more common in voids. The normalization of the luminosity function of wall galaxies is
roughly an order of magnitude higher than that of void galaxies; therefore, galaxies do exist
in voids just with a much lower space density. Studies of the optical spectra of void galax-
ies show that they have high star-formation rates, low 4,000Å spectral breaks indicative of
young stellar populations, and low stellar masses, resulting in high specific star formation rates
(Rojas et al. 2005).
However, red quiescent galaxies do exist in voids, just with a lower space density than
blue, star-forming galaxies (Croton et al. 2005). Croton and Farrar (2008) show that the
observed luminosity function of void galaxies can be replicated with a ΛCDM N-body sim-
ulation and simple semi-analytic prescriptions for galaxy evolution. They explain the existence
of red galaxies in voids as residing in the few massive dark matter halos that exist in voids.
Their model requires some form of star-formation quenching in massive halos (>∼10 M⊙ ),
but no additional physics that operates only at low density needs to be included in their
model to match the data. It is therefore the shift in the halo mass function in voids that leads
to different galaxy properties, not a change in the galaxy evolution physics in low-density
environments.
The Large-Scale Structure of the Universe 8 417
In addition to identifying individual voids and the galaxies in them, one can study the statistical
distribution of voids using the void probability function (VPF). Defined by White (1979), the
VPF is the probability that a randomly placed sphere of radius R within a point distribution will
not contain any points (i.e., galaxies, see > Fig. 8-14). The VPF is defined such that it depends
on the space density of points; therefore, one must be careful when comparing datasets and
simulation results to ensure that the same number density is used. The VPF traces clustering in
the weakly nonlinear regime, not in the highly nonlinear regime of galaxy groups and clusters.
Benson et al. (2003) predict using ΛCDM simulations that the VPF of galaxies should be
higher than that of dark matter, that voids as traced by galaxies are much larger than voids
traced by dark matter. This results from the bias of galaxies compared to dark matter in voids
and the fact that in this model the few dark matter halos that do exist in voids are low mass
and therefore often do not contain bright galaxies. Croton et al. (2004b) measure the VPF in
the 2dFGRS dataset and find that it follows hierarchical scaling laws in that all higher-order
correlation functions can be expressed in terms of the two-point correlation function. They
find that even on scales of ∼30 h− Mpc, higher-order correlations have an impact and that the
VPF of galaxies is observed to be different than that of dark matter in simulations.
Conroy et al. (2005) measure the VPF in SDSS galaxies at z ∼ . and DEEP2 galaxies at
z ∼ and find that voids traced by redder and/or brighter galaxy populations are larger than
z
0.77 0.80 0.82 0.85
30
20
rx (Mpc/h)
10
0
−10
−20
−30
1900 1950 2000 2050
rz (Mpc/h)
z
0.77 0.80 0.82 0.85
30
20
rx (Mpc/h)
10
0
−10
−20
−30
1900 1950 2000 2050
rz (Mpc/h)
⊡ Fig. 8-14
A schematic of the void probability function (VPF). The top panel shows the comoving distribution
of galaxies in a small portion of the DEEP2 survey (projected through 10 h−1 Mpc), while the lower
panel shows a fraction of the empty spheres identified with a radius of 6 h−1 Mpc in the same volume
(from Conroy et al. 2005). Because the figure is projected through one dimension, it may appear
that galaxies reside inside of identified voids; in three dimensions, the voids contain no galaxies
418 8 The Large-Scale Structure of the Universe
voids traced by bluer and/or fainter galaxies. They also find that voids are larger in comoving
coordinates at z ∼ . than at z ∼ ; i.e., voids grow over time, as expected. They show that the
differences observed in the VPF as traced by different galaxy populations are entirely consis-
tent with differences observed in the two-point correlation function and space density of these
galaxy populations. This implies that there does not appear to be additional higher-order infor-
mation in voids than in the two-point function alone. They also find excellent agreement with
predictions from ΛCDM simulations that include semi-analytic models of galaxy evolution.
Tinker et al. (2008) interpret the observed VPF in galaxy surveys in terms of the halo model
(see >Sect. 8). They compare the observed VPF in 2dFGRS and SDSS to halo model predictions
constrained to match the two-point correlation function and number density of galaxies using
a model in which the dark matter halo occupation depends on mass only. They find that with
this model, they can match the observed data very well, implying that there is no need for the
suppression of galaxy formation in voids, i.e., galaxy formation does not proceed differently in
low-density regions. They find that the sizes and emptiness of voids show excellent agreement
with predictions of ΛCDM models for galaxies at low redshift to luminosities of L ∼ .L∗.
9.3 Filaments
Galaxy filaments – long strings of galaxies – are the largest systems seen in maps of large-scale
structure and as such, provide a key test of theories of structure formation. Aragon et al. (2010)
show that in simulations while filaments occupy only ∼% of the volume of space, they account
for ∼40% of the mass content of the z = Universe. Measuring the typical and maximal length of
filaments, as well as their thickness and average density therefore constrains theoretical models.
Various statistical methods have been proposed to identify and characterize the morphologies
and properties of filaments (e.g., Sousbie et al. 2008 and references therein; see > Fig. 8-15).
In terms of their sizes, the largest length scale at which filaments are statistically significant,
and hence identified as real objects, is 50–80 h− Mpc, according to an analysis of galaxies in the
Las Campanas Redshift Survey (LCRS; Shectman et al. 1996) by Bharadwaj et al. (2004). They
show that while there appear to be filaments in the survey on longer scales, these arise from
chance alignments and projection effects and are not real structures. Sousbie et al. (2008) iden-
tify and study the length of filaments in SDSS by identifying ridges in the galaxy distribution
using the Hessian matrix (∂ ρ/∂x i ∂x j ) and its eigenvalues (see > Fig. 8-12). They find excellent
agreement between observations and ΛCDM numerical predictions for a flat, low Ω matter Uni-
verse. They argue that filament measurements are not highly sensitive to observational effects
such as redshift space distortions, edge effects, incompleteness, or galaxy bias, which makes
them a robust test of theoretical models.
Bond et al. (2010) use the eigenvectors of the Hessian matrix of the smoothed galaxy dis-
tribution to identify filaments in both SDSS data and ΛCDM simulations and find that the
distribution of filaments lengths is roughly exponential, with many more filaments of length
≲10 h− Mpc than >20 h− Mpc. They find that the filament width distribution agrees between
the SDSS data and N-body simulations. The mean filament width depends on the smoothing
length; for smoothing scales of 10 and h− Mpc, the mean filament widths are 5.5 and 8.4 h−
Mpc. In ΛCDM simulations, they find that the filamentary structure in the dark matter density
distribution is in place by z = , tracing a similar pattern of density ridges. This is in contrast to
what is found for voids, which become much more prominent and low density at later cosmic
epochs.
The Large-Scale Structure of the Universe 8 419
⊡ Fig. 8-15
Filaments identified in the SDSS galaxy distribution (from Sousbie et al. 2008). Individual filaments
are shown in green overlaid on the galaxy density field show in purple. The Sloan Great Wall is
identified in the foreground, lying between the red arrows
Choi et al. (2010) use the methods of Bond et al. (2010) to study the evolution of filamen-
tary structure from z ∼ . to z ∼ . using galaxies from the DEEP2 survey and the SDSS.
They find that neither the space density of filaments nor the distribution of filament lengths has
changed significantly over the last 7 Gyr of cosmic time, in agreement with ΛCDM numerical
predictions. The distribution of filament widths has changed, however, in that the distribution
is broader at lower redshift and has a smaller typical width. This observed evolution in the fila-
ment width distribution naturally results from nonlinear growth of structure and is consistent
with the results on voids discussed above in that over time, voids grow larger while filaments
become tighter (i.e., have a smaller typical width) though not necessarily longer.
This overview of our current understanding of the large-scale structure of the Universe has
shown that quantitative measurements of the clustering and spatial distribution of galaxies have
wide applications and implications. The nonuniform structure reveals properties of both the
galaxies and the dark matter halos that comprise this large-scale structure. Statistics such as
the two-point correlation function can be used not only to constrain cosmological parame-
ters but also to understand galaxy formation and evolution processes. The advent of extremely
420 8 The Large-Scale Structure of the Universe
large redshift surveys with samples of hundreds of thousands of galaxies has led to very precise
measurements of the clustering of galaxies at z ∼ . as a function of various galaxy properties
such as luminosity, color, and stellar mass, influencing our understanding of how galaxies form
and evolve. Initial studies at higher redshift have revealed that many of the general correlations
that are observed between galaxy properties and clustering at z ∼ were in place when the
Universe was a fraction of its current age. As larger redshift surveys are carried out at higher
redshifts, much more can be learned about how galaxy populations change with time. Theoret-
ical interpretations of galaxy clustering measurements such as the halo occupation distribution
model have also recently made great strides in terms of statistically linking various properties
of galaxies with those of their host dark matter halos. Such studies reveal not only how light
traces mass on large scales but how baryonic mass and dark matter coevolve with cosmic time.
There are many exciting future directions for studies of galaxy clustering and large-scale
structure. Precise cosmological constraints can be obtained using baryon acoustic oscillation
signatures observed in clustering measurements from wide-area surveys (Eisenstein et al. 2005).
Specific galaxy populations can be understood in greater detail by comparing their clustering
properties with those of galaxies in general. For example, the clustering of different types of
active galactic nuclei (AGN) can be used to constrain the AGN fueling mechanisms, lifetimes,
and host galaxy populations (Coil et al. 2009). As discussed above, measurements of galaxy clus-
tering have the power to place strong constraints on contemporary models of galaxy formation
and evolution and advance our understanding of how galaxies populate and evolve within dark
matter halos.
Acknowledgments
The author thanks James Aird, Mirko Krumpe, and Stephen Smith for providing comments on
earlier drafts of the text.
References
Adelberger, K. L., et al. 2005, ApJ, 619, 697 Bond, N. A., Strauss, M. A., & Cen, R. 2010, MNRAS,
Aragon, van de Weygaert, & Jones, 2010, MNRAS, 409, 156
408, 2163 Brodwin, M., et al. 2008, ApJL, 687, L65
Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S. Choi, E., et al. 2010, MNRAS, 406, 320
1986, ApJ, 304, 15 Coil, A. L., et al. 2006, ApJ, 644, 671
Baugh, C. M. 1996, MNRAS, 280, 267 Coil, A. L., et al. 2008, ApJ, 672, 153
Baugh, C. M., et al. 2004, MNRAS, 351, L44 Coil, A. L., et al. 2009, ApJ, 701, 1484
Benson, A. J., Cole, S., Frenk, C. S., Baugh, C. M., & Colberg, J. M., et al. 2005, MNRAS, 360, 216
Lacey, C. G. 2000, MNRAS, 311, 793 Colless, M., et al. 2001, MNRAS, 328, 1039
Benson, A. J., Hoyle, F., Torres, F., & Vogeley, M. S. Colless, M., 2004, Cosmological Results from the 2dF
2003, MNRAS, 340, 160 Galaxy Redshift Survey, Measuring and Mod-
Berlind, A. A., & Weinberg, D. H. 2002, ApJ, 575, eling the Universe, ed. Freedman W., Carnegie
587 Observatory Astrophysics Series Vol. 2, Cam-
Berlind, A. A., et al. 2005, ApJ, 629, 625 bridge University Press, 196
Bernardeau, F., Colombi, S., Gaztañaga, E., & Scocci- Conroy, C., Wechsler, R. H., & Kravtsov, A. V. 2006,
marro, R. 2002, PhR, 367, 1 ApJ, 647, 201
Bharadwaj, S., Bhavsar, S. P., & Sheth, J. V. 2004, ApJ, Conroy, C., et al. 2005, ApJ, 635, 990
606, 25 Conroy, C., et al. 2008, ApJ, 679, 1192
The Large-Scale Structure of the Universe 8 421
Croton, D. J., & Farrar, G. R. 2008, MNRAS, 386, 2285 Nichol, R. C., et al. 2006, MNRAS, 368, 1507
Croton, D. J., Norberg, P., Gaztañaga, E., & Baugh, Norberg, P., et al. 2001, MNRAS, 328, 64
C. M. 2007, MNRAS, 379, 1562 Ouchi, M., et al. 2004, ApJ, 611, 685
Croton, D. J., et al. 2004a, MNRAS, 352, 1232 Ouchi, M., et al. 2005, ApJL, 635, L117
Croton, D. J., et al. 2004b, MNRAS, 352, 828 Pan, D. C., Vogeley, M. S., Hoyle, F., Choi, Y.-Y., &
Croton, D. J., et al. 2005, MNRAS, 356, 1155 Park, C. 2012, Cosmic voids in Sloan Digital Sky
Davis, M., Geller, M. J., & Huchra, J. 1978, ApJ, 221, 1 Survey Data Release 7, MNRAS, 421, 926–934,
Davis, M., Huchra, J., Latham, D. W., & Tonry, J. 1982, doi:10.1111/j.1365-2966.2011.20197.x
ApJ, 253, 423 Peacock, J. A., & Smith, R. E. 2000, MNRAS,
Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465 318, 1144
Dressler, A. 1980, ApJ, 236, 351 Peacock, J. A., et al. 2001, Nature, 410, 169
Eisenstein, D. J., et al. 2005, ApJ, 633, 560 Peebles, P. J. E. 1975, ApJ, 196, 647
El-Ad, H., & Piran, T. 1997, ApJ, 491, 421 Peebles, P. J. E. 1980, The Large-Scale Structure of the
Fisher, K. B., Davis, M., Strauss, M. A., Yahil, A., & Universe (Princeton, NJ: Princeton University
Huchra, J. P. 1994, MNRAS, 267, 927 Press)
Fry, J. N. 1996, ApJL, 461, L65 Peebles, P. J. E. 2001, ApJ, 557, 495
Fry, J. N., & Gaztanaga, E. 1993, ApJ, 413, 447 Postman, M., Lauer, T. R., Szapudi, I., & Oegerle, W.
Gaztañaga, E., Norberg, P., Baugh, C. M., & Croton, 1998, ApJ, 506, 33
D. J. 2005, MNRAS, 364, 620 Roche, N., & Eales, S. A. 1999, MNRAS, 307, 703
Geller, M. J., & Huchra, J. P. 1989, Science, 246, 897 Rojas, R. R., Vogeley, M. S., Hoyle, F., & Brinkmann,
Gott, III, J. R., Dickinson, M., & Melott, A. L. 1986, J. 2004, ApJ, 617, 50
ApJ, 306, 341 Rojas, R. R., Vogeley, M. S., Hoyle, F., & Brinkmann,
Gott, III, J. R., et al. 2005, ApJ, 624, 463 J. 2005, ApJ, 624, 571
Gregory & Thompson, 1978, ApJ, 222, 784 Sandge & Tammann, 1975, ApJ, 196, 313
Guzzo, L., et al. 2008, Nature, 451, 541 Sargent, W. L. W., & Turner, E. L. 1977, ApJL,
Hamilton, A. J. S. 1992, ApJL, 385, L5 212, L3
Hamilton, A. J. S. 1993, ApJ, 417, 19 Scoccimarro, R. . 2000, ApJ, 544, 597
Hamilton, A. J. S. 1998, in ASSL Vol. 231: The Evolv- Seldner, M., Siebers, B., Groth, E. J., & Peebles, P. J. E.
ing Universe, 185 1977, AJ, 82, 249
Hoyle, F., Rojas, R. R., Vogeley, M. S., & Brinkmann, Seljak, U. 2000, MNRAS, 318, 203
J. 2005, ApJ, 620, 618 Shane, C. D., & Wirtanen, C. A. 1967, in Pub. Lick
Hoyle, F., & Vogeley, M. S. 2004, ApJ, 607, 751 Obs. Vol. 22, part 1
Hubble, E. 1934, ApJ, 79, 8 Shapley, H., & Ames, A. 1932, Ann. Harv. Coll. Obs.,
Hubble, E. P. 1926, ApJ, 64, 321 88, 41
Joeveer, Einasto, & Tago, 1978, MNRAS, 185, 357 Shectman, S. A., et al. 1996, ApJ, 470, 172
Kaiser, N. 1984, ApJL, 284, L9 Sheth, R. K., Mo, H. J., & Tormen, G. 2001, MNRAS,
Kaiser, N. 1987, MNRAS, 227, 1 323, 1
Kamionkowski, M., Verde, L., & Jimenez, R. 2009, Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119
JCAP, 1, 10 Sheth & van de Weygaert, 2004, MNRAS, 350, 517
Kauffmann, G., Colberg, J. M., Diaferio, A., & White, Sousbie, T., et al. 2008, ApJL, 672, L1
S. D. M. 1999, MNRAS, 303, 188 Tegmark, M., & Peebles, P. J. E. 1998, ApJL, 500, L79
Kerscher, M., Szapudi, I., & Szalay, A. S. 2000, ApJL, Tegmark, M., et al. 2004, ApJ, 606, 702
535, L13 Tinker, J. L., et al. 2008, ApJ, 686, 53
Kirshner, R. P., Oemler, A., Jr., & Schechter, P. L. Verde, L., et al. 2002, MNRAS, 335, 432
1978, AJ, 83, 1549 Wake, D. A., et al. 2011, ApJ, 728, 46
Kravtsov, A. V., et al. 2004, ApJ, 609, 35 Weinberg, D. H., Davé, R., Katz, N., & Hernquist, L.
Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 2004, ApJ, 601, 1
Li, C., et al. 2006, MNRAS, 368, 21 White, S. D. M. 1979, MNRAS, 186, 145
Ma, C.-P., & Fry, J. N. 2000, ApJ, 543, 503 York, D. G., et al. 2000, AJ, 120, 1579
Maddox, S. J., et al. 2010, A&A, 518, L11 Zehavi, I., et al. 2002, ApJ, 571, 172
Madgwick, D. S., et al. 2003, MNRAS, 344, 847 Zehavi, I., et al. 2004, ApJ, 608, 16
Mann, R. G., Peacock, J. A., & Heavens, A. F. 1998, Zehavi, I., et al. 2005, ApJ, 630, 1
MNRAS, 293, 209 Zehavi, I., et al. 2011, ApJ, 736, 59
Meneux, B., et al. 2006, A&A, 452, 387 Zheng, Z., Coil, A. L., & Zehavi, I. 2007, ApJ, 667, 760
Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 Zwicky, F., Herzog, E., & Wild, P. 1968, Catalogue of
Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, Galaxies and of Clusters of Galaxies (Pasadena:
ApJ, 490, 493 California Institute of Technology, 1961–1968)