Solid State Physics
Solid State Physics
i
p
2
i
2m
+
1
4
0
ij
Z
j
e
2
[r
i
R
j
[
+
1
2
1
4
0
i=i
e
2
[r
i
r
i
[
. (1)
This leaves out the kinetic energy of the nuclei and their interaction. We will come back to that later. Our rst
assumption is that the nuclei are xed charges in space. The electrons move so fast that that the do not notice the
motion of the nuclei. This is not true and the interaction between electrons and the motion of the nuclei can lead
to exciting phenomena such as superconductivity, but it is a good starting point. The rst term in the equation is
the kinetic energy of the electrons. The second term is the interaction between the electrons and the nuclei, where i
sums over all the electrons in the solid and j sums over all the nuclei with charge Z
j
e. The last term is the Coulomb
interaction between the electrons. The factor 1/2 avoid double counting. Including this term makes it impossible to
6
solve the problem, except for very small systems. We need to simplify something. Option number one is to ignore
it all together. This is a gross simplication, but doing that will help us to understand solids better. What is left is
essentially electrons moving around in a potential landscape built up by the potentials of the nuclei.
Why can we still get a good idea when neglecting something not particularly negligible such as interaction between
electrons. The trick is that we should not consider real nuclei, but eective nuclei. Consider lithium metal. Li has a
conguration 1s
2
2s. The electrons in the 2s orbitals do not feel the 3+ nucleus of Li. Instead, they feel a potential
which is closer to a Li
1+
ion, i.e. the nucleus plus the two 1s electrons. Therefore, there is an eective Z
e
= 1
nucleus as opposed to a Z = 3 nucleus. This concept of treating the electrons as an eective background potential is
very important in condensed-matter physics. However, it should be noted that often it fails miserably.
Crystals can be very complicated, so let us start with something simple: a chain of atoms. Although some materials
can indeed be described as a one-dimensional chain, we are more interested in the concepts, which are very similar
in any dimension. Unfortunately, even a chain is rather complicated to start out with, so let us begin with just two
atoms and later add more atoms to it to form the chain.
1. The hydrogen molecule
Let us also consider the simplest system: a system of Hydrogen-like atoms (again not very practical, since Hydrogen
forms H
2
molecules at room temperature, but still a model often used in theoretical physics). For each site, we only
consider one orbital (in our Hydrogen example, this would be the 1s orbital). So in total, we have two orbitals
1
(r) and
2
(r), since we have two sites. However, this creates the impression that we have two dierent orbitals,
whereas we only have one type of orbital but located at two dierent sites. We can therefore also write (r R
1
)
and (r R
2
), where (r) is the atomic orbital at the origin. With two orbitals in our basis set, we have a total of
four matrix elements
=
_
dr
(r R
1
)H(r R
1
) =
_
dr
(r R
2
)H(r R
2
) (2)
and
t =
_
dr
(r R
1
)H(r R
2
) =
__
dr
(r R
1
)H(r R
2
)
_
, (3)
where we use vector notation to allow easy generalization to arbitrary dimension even though we are considering only
problems in one dimension for the moment.
In the end, we only have two parameters and t. Note that for the two matrix elements are equivalent, since
the on-site energy does not depend on the position in space. Note that for Hydrogen atoms, if [R
1
R
2
[ , we
obtain = 13.6 eV. The wavefunction (r R) would be a 1s orbital. Since the nuclei are far apart the electron
in the wavefunction at site R
1
will not feel the potential of the nucleus at site R
2
(this is basically the assumption
when solving the Hydrogen atom, where the potential is that of a single nucleus). For t, the matrix elements are just
each others conjugate. t is known as the tranfer matrix element or hopping integral. We take put a minus sign here
so that t will be a positive number. This matrix element will go to zero in the limit [R
1
R
2
[ . Note that t
involves an integral of an orbital at site R
1
and one at R
2
. The obitals are localized (generally, their size is of the
order of the Bohr radius). If the nuclei are far apart, the product
(r R
1
)(r R
2
) is very small everywhere in
space. Hence, the matrix will be very small.
Let us take t to be a real number (which it is for Hydrogen atoms, since everything is real when using 1s orbitals).
We can also write this in a bra-ket notation
= R
1
[H[R
1
) = R
2
[H[R
2
) (4)
t = R
1
[H[R
2
) = R
2
[H[R
1
), (5)
where R
i
) denotes the atomic wavefunction at site R
i
. For s-like orbitals, the matrix element would be negative, so t
would be a positive number. This allows us to write the Hamiltonian for the two orbitals in the basis set as a matrix
H =
_
t
t
_
. (6)
Any vector can now be written as a linear combination of
[R
1
) =
_
1
0
_
and [R
2
) =
_
0
1
_
. (7)
7
|
2
|
|
2
FIG. 1: The bonding
B
and
AB
wavefunctions for the hydrogen H
2
are given on the left. The probabilities are on the right.
Let us work with the Hamiltonian on [R
1
)
H =
_
t
t
__
1
0
_
=
_
t
_
= [R
1
) t[R
2
) (8)
Therefore if t = 0, H[R
1
) = [R
1
), i.e. [R
1
) would be an eigenstate of the Hamiltonian. However, the matrix
element t brings the electron from atom 1 to site 2. The matrix element t is therefore often called a transfer matrix
element or a hopping matrix element. Note that the Hamiltonian is not diagonal and eigenstates will be of the form
[) = a
1
[R
1
) +a
2
[R
2
). We would like to solve the Schrodinger equation
H[) = E[). (9)
We can split this out into dierent components as
R
i
[H[) = ER
i
[), (10)
with i = 1, 2. There is a complication in that the overlap matrix elements S = R
1
[R
2
) =
_
dr
(rR
1
)(rR
2
) ,= 0.
Taking this into account, we obtain
_
t
t
_
[) = E
_
1 S
S 1
_
[) (11)
Note that R
1
[R
1
) = R
2
[R
2
) = 1. The eigenvalues can be found by solving the determinant
E t ES
t ES E
= 0. (12)
This gives the equation
( E)
2
(t ES)
2
= 0 E = (t ES) E
B/AB
=
t
1 S
. (13)
In this case, the overlap simply renormalizes the energies. The calculation becomes simpler if we forget about the
overlap between the orbitals, which we will do in the remainder, and take S = 0. This gives E
B/AB
= t. The
eigenstates are also easily obtained and are
[
B/AB
) =
1
2
([R
1
) [R
2
)). (14)
8
Note that for R
1
[H[R
2
) < 0 the bonding (B) combination is the lowest (the antibonding (AB) is the lowest for
R
1
[H[R
2
) > 0). An electron spends an equal amount of time on atom 1 as it does on atom 2, as it should, because
there is atom 1 is equivalent to atom 2. In the bonding orbital, the wavefunction is a lot smoother (note that the anti
bonding orbital has a node between the two atoms). This reduces the kinetic which is proportional to the gradient
of the wavefunction through to the
2
term in the Hamiltonian. The lowering of the kinetic energy due to the
delocalization of the wavefunction is also important in keeping solids together. These are the well-known bonding-
antibonding combinations of a H
2
molecule. Each atom contributes one electron. The two electrons will go into the
bonding orbital, giving
[
B
B
) =
1
2
([R
1
R
1
) +[R
1
R
2
) +[R
2
R
1
) +[R
2
R
2
)), (15)
where we can put two electrons in the bonding orbital due to the spin degree of freedom. Note that the probability of
nding the two electrons on the same atom equal that of nding one electron on each atom. This would be surprising
if electrons really interacted strongly with each other. However, we neglected that. As said, the energy gain is a result
of the delocalization of the wave function. This does not always work. Note that the antibonding states are actually
higher in energy. For example, a Helium atom has two electrons in the 1s. When bringing two He atoms together
two form a He
2
molecule, we can in principle form the same bonding and antibonding wavefunctions. However,
since we now have four electrons in total we have to ll up both the bonding and antibonding states gives an energy
of 2( t) + 2( + t) = 4. Therefore, He cannot take any energetic gain from the formation of the bonding and
antibonding combinations and no real bonding is formed. There is another way to look at this. The 2s electrons
from one helium atom cannot delocalize to the neighboring helium atom, since the 2s orbital of that atom is full and
Paulis principle forbids the delocalization. Therefore, He
2
does not bond and that is why He is a noble element.
2. A linear chain of 1s orbitals
Obviously, this is not much of a solid. However, it is straightforward to extend this procedure to chains of arbitrary
length N. Let us take the atoms on a chain to be numbered 1, 2, 3, , N. General wavefunctions for N sites can
then be written as
(r) =
R
a
R
(r R) or [) =
R
a
R
[R). (16)
Again, we want to solve the Schrodinger equation H[) = E[). For the R
th component, we obtain
R
[H[) = ER
[)
R
R
[H[R)a
R
= E
R
R
[R)a
R
= Ea
R
, (17)
where we have assumed R
[R) =
R,R
. We can write this in matrix form as
_
_
_
H
11
H
1N
. .
. .
H
N1
H
NN
_
_
_
_
_
_
a
1
.
.
a
N
_
_
_ = E
_
_
_
a
1
.
.
a
N
_
_
_, (18)
where H
ij
= R
i
[H[R
j
). As before, we have H
nn
= . If atoms are on neighboring sites, we have H
n,n+1
= t. In
principle, H
n,n+2
. H
n,n+3
, etc. are nonzero (known as next-nearest neighbor and next-next nearest neighbor hopping
matrix elements), but we will take them zero for simplicity. This gives a matrix
H =
_
_
_
_
_
_
_
_
_
t 0 0
t t 0 0
0 t t 0 0
0 0 t t 0 0
0 0 t t 0 0
0 0 t t 0 0
. . . .
_
_
_
_
_
_
_
_
_
(19)
Although, not easily solvable by hand, the energies can be found using some computer program that solves eigenvalue
problems. For dierent lengths N, the eigenvalues are given in Fig. 2. We have taken the constant shift = 0. The
9
FIG. 2: The eigenvalues of a linear chain of 2, 5, 9, 15, 100 atoms with a hopping matrix element t between neighboring atoms.
eigenvalues E
n
with n = 1, , N are ordered and plotted as a function of (n 1)/(N 1), i.e. from 0 to 1. As we
have seen before, for N = 2, the eigenvalues are t. However, if we increase the the length of the chain, we see that
the eigenvalues approach a continuous curve. The extrema of the curve are 2t. The question is what is this curve
and can we obtain this curve in dierent fashion apart from solving the matrix by brute force?
First, we have to realize that we want to study bulk systems or systems that are essentially innity. The study of
surfaces is a whole eld in itself but not our focus at the moment. For small chains the number of atoms close to
the edge of the chain is relatively large. So we can study a large system, but we can also introduce a mathematical
trick. Note, that none of the atoms in the chain is equivalent, since they all have a dierent distance to the end of
the chain. However, if we connect the two ends of the chain to each other, all the atoms become equivalent. This is
known as periodic boundary conditions. Although, this is essentially a mathematical trick, in some systems this is
physically realized. The best known example would be a benzene ring consisting of six carbon atoms. However, in
two and three dimensions, this is more dicult to realize. In two dimensions, the system would be a torus (which
could be achieved, maybe, with a carbon nanotube), but in three dimension, it is impossible to realize. Let us go
back to the diatomic molecule. Applying periodic boundary condition implies that 1 is connected to 2, but 2 is again
connected to 1. This leads to a doubling of the transfer matrix element, giving a Hamiltonian:
H =
_
2t
2t
_
. (20)
The eigenvalues are E
1,2
= 2t. Note that this is equal to the extrema in the continuous curve we obtained for large
N. This is a signicant dierence from the t, we obtained earlier. We could expect that, because all atoms in the
chain are at the end of the chain. For larger
H =
_
_
_
_
_
_
_
_
_
_
_
_
_
t 0 0 0 t
t t 0 0 0 0
0 t t 0 0
0 0 t t 0 0
0 0 t t 0 0
0 0 t t 0 0
. . . .
0 0 . . 0 t t
t 0 . . . 0 t
_
_
_
_
_
_
_
_
_
_
_
_
_
(21)
10
FIG. 3: The lines give the values of e
ikx
and e
(k+K)x
as a function of x. We have taken k = 0.3 and K = 2. The dots
indicate the values at the atomic positions. Note that the Fourier transform only includes these values.
Obviously, when N is very large (and number of atoms are generally very large numbers), this one extra matrix element
will not aect the majority of the eigenvalues, so the eigenvalues with and without periodic boundary conditions should
virtually identical. If you solve this matrix you nd out that the eigenvalues always lie on the curve that is obtained
when N . This curve is known as a band. The electronic structure is quite often known as the band structure.
So what is missing here? The essential ingredient is translational symmetry. Although we are all attached to real
space, it is the periodicity that is really important. When there are milemarkers along the road, you do not say
there is a mile marker at 1 mile, and one a 2 miles, and one at 3 miles. You say there is a mile marker every 1
mile. In a more mathematical term, we should be considering Fourier transforms. The quantity that is related to
space by Fourier transform is momentum (the other important dimension is time, which is related to frequency by
Fourier transform, which, in quantum mechanics is directly related to energy). In free space, space is continuous, and
therefore the momentum is continuous as well. Conservation of momentum in classical mechanics is inherently related
to the translational symmetry of space. When we go to a solid, the translational symmetry of space is no longer
continuous, but related to the interatomic distances and therefore discrete. We applied periodic boundary conditions
so we need to satisfy:
e
ik(x+L)
= e
ikx
(22)
where the length of the chain is L = Na, with a the distance between the atoms. This means is we displace ourselves
by L (the lenght of the chain), we end up at the same position due to the periodic boundary conditions. This implies
that
kL = 2n k =
2n
L
, (23)
where n is an integer. This would lead to an innite amount of wavenumbers k. This is not entirely correct. Before
doing a Fourier transform, we had N orbitals, one for each site. After the Fourier transform, we should still end up
with the same number of states. Therefore, since we have positive and negative numbers, the maximum n is N/2.
The allowed k values are therefore
k = 0,
2
L
,
4
L
, . . . ,
a
. (24)
So we only have k values in the region [
a
,
a
]. You might wonder what happened to all the other k values. From
a point a view of periodicity, they are exactly equivalent. Figure 3 shows a comparison between e
ikx
and e
i(k+2/a)x
(where for the example k is taken 0.3). Although the function are of course dierent, the have the same values at
the sites of the atoms (indicated by the blue dots). However, you might remember from Fourier analysis that you
often need all k values to build special functions (e.g. a square or a delta function) and there might be structure in
the wavefunction that Fourier components with a larger k value. However, the k is only related the periodic part of
the wavefunction. These smaller structures are related to the atomic part of the wavefunction
k
(r) =
1
R
e
ikR
(r R). (25)
Note that the Fourier transform only includes the values of the exponentials at the atomic positions. Therefore, any
k +2n, with n and integer, is equivalent to k. Essentially, this is a unitary transformation of N wavefunction in real
11
space to a k dependent basis. We can also inverse this procedure
(r R) =
1
k
e
ikR
k
(r). (26)
We can also write this in bra-ket notation
[k) =
1
R
e
ikR
[R) and [R) =
1
k
e
ikR
[k). (27)
We now want to understand what the Hamiltonian looks like in this new basis set. Let us rst rewrite the Hamiltonian
in bra-ket notation in real space. Note that
[R
i
)R
j
[ =
i
_
_
_
_
_
_
_
_
_
0
.
.
0
1
0
.
_
_
_
_
_
_
_
_
_
(0, 0, , 0, 1, 0, )
j
= i
_
_
_
_
_
. . .
0 0 0
0 1 0
0 0 0
. . .
_
_
_
_
_
(28)
j
using this we can rewrite the matrix in Eqn. (21)
H =
R,R
[R
)R
[H[R)R[ = t
R
[R+ )R[, (29)
where is a vector connecting R to its nearest neighbors (the atoms left and right of the atom in a chain). The
Fourier transform can be made by inserting the denitions into the Hamiltonian:
H = = t
R
1
N
k,k
e
ik(R+)
[k)k
[e
ik
R
(30)
= t
k,k
_
1
N
R
e
i(k
k)R
__
e
ik
_
[k)k
[ (31)
=
k,k
k,k
k
[k)k
[. (32)
We see that the Hamiltonian is diagonal in k due to the delta function. The diagonal matrix elements (which are
directly the eigenvalues) are given by
k
= t
e
ik
. (33)
The nearest-neighbor vectors for a chain in the x-direction are given by = a x, where x is a unit vector in the
x-direction.
k
= t(e
ika
+e
ika
) = 2t cos ka, (34)
where k = k x. Since goes from
a
to
a
, the eigenvalues lie on a cosine between 2t and 2t, see Fig. 4.
3. The benzene molecule
The Benzene molecule (C
6
H
6
) consists of 6 carbon atom that form a hexagon, see Fig. 5. The hydrogen atoms
are attached to the carbons. Since the atomic number of carbon is six, we have a total of 6 (1 + 6) = 42 electrons.
However, 12 of these are strongly bound in the Carbon 1s orbitals (with a binding energy of more than 280 eV). For
molecular bonding, we can ignore those. The more important ones are the 2s and 2p orbitals of carbon and the 1s
12
-2
-1
0
1
2
k
/
t
k
/a /a /2a /2a 0
FIG. 4: The eigenenergies for a single tight-binding band
k
= 2t cos k as a function of k.
orbitals of hydrogen. Some of these form strong covalent bonds including so-called sp
2
hydrids. We will deal with
covalent bonding later. However, from Fig. 5, we can see that there are 12 covalent bonds, known as (sigma) bonds.
These are comparable to the bonding states of the hydrogen molecule. We can see six bonds between the carbon
atoms and six between the carbon and hydrogen atoms. Since each bond can contain two electrons (again, compare
the H
2
molecule), this accounts for another 24 electrons. This leaves us with 42 12 24 = 6 electrons. In a model
based on bonding this allows for three more bonds, known as pi () bonding. These should be between the carbon
since the hydrogen already have their bond (related to the 1s orbitals) and each carbon needs four bonds (one for the
2s and three for the 2p). However, this leaves us with two possible congurations, see Fig. 5. The idea developed in
the nineteenth century by Kekule i is that the molecule switches between the two congurations.
However, we can apply the ideas from the previous sections to the benzene molecule. We can dene the matrix
element between p
z
orbitals on carbon atoms on neighboring sites i and j as
t = p
zi
[h[p
zj
) =
_
dr
p
z
(r R
i
)H
p
z
(r R
j
). (35)
Note that t > 0. We do not need the conjugate, since the p
z
are real orbitals. The determinant is given by
E
0
E t 0 0 0 t
t E
0
E t 0 0 0
0 t E
0
E t 0 0
0 0 t E
0
E t 0
0 0 0 t E
0
E t
t 0 0 0 t E
0
E
, (36)
where E
0
is the binding energy of the p
z
orbital. Note the presence of the t in the upper-right and bottom-left
corner. This is the matrix element completing the hexagon, i.e. connecting carbon atoms 1 and 6. This matrix
element would be absent when studying a linear hydrocarbon with six carbon atoms (hexadiene). The eigenvalues
are found to be
E = E
0
2t, E
0
t, E
0
t, E
0
+t, E
0
+t, E
0
+ 2t. (37)
The 6 electrons go into the lowest molecular orbitals giving a total energy of E = 6E
0
8t.
The wavefunction for the eigenstate with energy E
0
2t is completely symmetric
1
=
1
6
(
p
1z
+
p
2z
+
p
3z
+
p
4z
+
p
5z
+
p
6z
), (38)
and contains two electrons. This gives a dierent picture than that suggested by that of molecular bonds. An electron
in the
1
orbital is delocalized over all the six carbon atoms and not localized in bonds between two carbon atoms.
13
FIG. 5: The benzene molecule C
6
H
6
forms a hexagon. In the electronic structure of a benzene molecule there are strong
covalent bonds formed by the sp
2
hybrids. The p
z
orbitals that stick out of the plane of the molecule bond less strongly and the
electrons delocalize. Already in the nineteenth century, it was recognized by Kekule, that there are two dierent possibilities
for bonding which are both equivalent. A more modern way is considering the electrons delocalized in molecular bonds.
This is the reason why in modern notations of the benzen molecule, the states are indicated by a circle, see Fig. 5.
The two wavefunctions at energy E
0
t are
2
=
1
3
(
1
2
p
1z
1
2
p
2z
p
3z
1
2
p
4z
+
1
2
p
5z
+
p
6z
), (39)
and
3
=
1
2
(
p
1z
+
p
2z
p
4z
p
5z
). (40)
Since their energies are degenerate, any linear combination of
2
And
3
is also an eigenfunction. These two wave-
functions contain the other four electrons. The wavefunctions for E
0
+t are
4
=
1
3
(
1
2
p
1z
+
1
2
p
2z
p
3z
+
1
2
p
4z
+
1
2
p
5z
p
6z
), (41)
and
5
=
1
2
(
p
1z
p
2z
+
p
4z
p
5z
). (42)
Finally the wavefunction at energy E
0
+ 2t is given by
6
=
1
6
(
p
1z
p
2z
+
p
3z
p
4z
+
p
5z
p
6z
). (43)
The last three orbitals are empty. Note that they look very similar to the rst three wavefunction except that the
signs have changed. Instead of having bonding between neighboring atoms we have antibonding states.
Although at rst sight this might seem like some coecients which just happen to come out of a calculation, on
further inspection you can see that there is some system to the coecients. A rather trivial one is
1
=
1
p
nz
. (44)
14
But
6
we can write as
6
=
1
p
nz
cos n. (45)
Looking more carefully at the wavefunctions, we see that
2
=
1
p
nz
cos n
3
(46)
and
3
=
1
p
nz
sin n
3
. (47)
The same thing we can do for
4
and
5
, but now using 2/3. In addition, since
2
and
3
are degenerate, we can
make linear combinations of them that are still eigenfunctions
+
2,3
=
1
2
(
2
+i
3
) =
1
p
nz
e
in
3
, (48)
and
2,3
=
1
2
(
2
i
3
) =
1
p
nz
e
in
3
. (49)
The same can be done for
4
and
5
. So it appears that the eigenstates look like a Fourier transform
k
of the states
p
nz
in real space
k
=
1
p
nz
e
ik
n
, k
= 0,
3
,
2
3
, . (50)
In fact, even the eigenvalues can be written as E
k
= E
0
+ 2t cos k. Note that we have an eective lattice spacing of
a 1. We can always go back to the real lattice spacing by taking k =
k
a
and whereever we have k
we have to replace
it by ka. This all corresponds nicely to what we observed before that when applying periodic boundary conditions,
we could solve the eigenstates exactly. Here the periodic boundary conditions are not an articial construction, but
a physical reality due to the closed loop.
IV. THE ELECTRONIC STRUCTURE: NEARLY FREE-ELECTRON MODEL (1D)
AM Chapter 9
In the previous chapter, we started from atoms and atomic orbitals. The potential of the nucleus played a dominant
role. When bringing the atoms together, we observed the formation of bands. The gain in bringing the atoms
together was the delocalization of the wavefunction, giving rise to a lowering of the kinetic energy. We can also
directly start from the opposite end and assume the kinetic energy is the most important part in the formation of a
solid and add the potential of the nuclei only later.
In free space in the Schrodinger equation is given by
2
2m
2
(r) = E(r). (51)
The solution is well known and the eigenfunction are simple plane waves
(r) =
1
V
e
ikr
, (52)
where V is the size of the system. The eigenvalues are
k
=
2
k
2
2m
(53)
15
Note that we directly have bands (although generally one does not call them bands for free electrons). Again, we can
apply periodic boundary condition, see Eqn. (24), giving
k
i
=
2n
L
with n integer. (54)
Note that so far there is no restriction on the k values to the range [
a
,
a
]. The only thing we have done so far is
put the particles in a box, which is not yet a solid. The system becomes a solid by introducing the potential U(r) due
to the nuclei that are arrange in a periodic structure. In general, we can express any arbitrary potential in a Fourier
series
U(r) =
k
U
k
e
ikr
. (55)
However, we want to impose that the potential satises the periodicity of the lattice. Therefore, the potential should
be equivalent when moving to a dierent lattice site by the vectors R that indicate the lattice sites:
U(r +R) =
k
U
k
e
ik(r+R)
=
k
e
ikR
U
k
e
ikr
(56)
This is not going to work for any arbitray wavevector k. The vectors K that satisfy this are known as reciprocal
lattice vectors
e
iKR
= 1. (57)
These vectors return again and again since they are intimately related to the translational symmetry of the crystal.
We will discuss in more detail later on how to obtain these reciprocal lattice vectors and discuss various cases for
dierent crystals. Let us just satisfy ourselves that these vectors exist and only consider the case of one dimension
e
iKa
= 1 K =
2n
a
. (58)
Note that since a is much smaller than L, the K are much larger the separation between the k values. In general, the
periodic potential is therefore given by
U(r) =
K
U
K
e
iKr
. (59)
The matrix elements between dierent running waves are given by
k
[U[k) =
K
1
V
_
dxe
ik
r
U
K
e
iKr
e
ikr
=
K
U
K
1
V
_
dre
i(k
Kk)r
=
K
U
K
,k+K
. (60)
In bra-ket notation, we can write the potential as
U =
kK
U
K
[k +K)k[ (61)
This has the importance consequence that a running wave with wavevector k is coupled to the running waves with
wavevectors k2n/a with n = 0, 1, 2, 3, , see Fig. 6. However, this also means that k is no longer a good quantum
number. The periodic potential reduces the k values that are good quantum numbers. Although we are free to choose
the k values, generally the region [K/2, K/2] = [/a, /a] is chosen. The wavevectors inside this region do not
couple to each other with the periodic potential. In addition, any wavevector outside this region is coupled to one
inside by a reciprocal lattice vector K. The allowed k values are therefore equivalent to those found in Eqn. (24)
when discussing the tight-binding model.
Since only the k values separated by a reciprocal lattice vector K couple, we can write the eigenfunctions as a linear
combination of these running waves
k
(r) =
K
c
kK
e
i(kK)r
, (62)
where c
kK
is a coecient. For each k, we nd a set of coupled equations
kK
c
kK
+
U
K
c
kK+K
= Ec
kK
. (63)
16
k
k
/a /a 2/a 0 2/a
FIG. 6: The scattering of electrons between dierent running waves e
ikx
due to a periodic potential resulting from nuclei in a
chain at a distance a from each other. The potential couples k values with each other that are K = 2n/a apart. The blue
arrows show this scattering for an arbitrary k value (Note that only three k points are included). The red arrow shows the
particular case for k =
a
. This is a Bragg reection and the electrons scatters to k =
a
which has exactly the same energy
k
.
Note that this reproduces to E =
kK
in the absence of a periodic potential.
This model is known as the nearly-free electron model and is generally use only in the case the the periodic potential
is weak (such as K, Na, and Al metals). However, even a weak potential can have a large inuence if the energy
separation of the states that if couples is small (or even zero). For the energy dierence to be zero we require
k
=
kK
k
2
= (k K)
2
= k
2
2k K+K
2
, (64)
or
k K =
1
2
K
2
. (65)
Some of you might recognize this as the Laue condition for x-ray diraction (which, of course is equivalent to the Bragg
condition). In principle, this should not be a surprise. X-rays are scattered by a lattice through repeated interaction
with the nuclei. The same applies to electrons. In one dimension, k and K are either parallel or antiparallel and we
nd k =
1
2
K.Let us consider K = 2/a, for which k = K/2 = /a should have the same energy as k K = /a,
which is indeed correct. The eects are therefore strongest close to the k values indicated by the dotted lines in Fig.
6. In this region, we can neglect the coupling to other running waves if [
k
kK
[ U. We are then left with
solving the determinant
k
E U
K
U
K
kK
E
= 0. (66)
This leads to the quadratic equation
(
k
E)(
kK
E) [U
K
[
2
= 0, (67)
which has the solutions
E =
k
+
kK
2
kK
2
_
2
+[U
K
[
2
. (68)
In the limit that
k
=
kK
, we obtain
E =
k
[U
K
[. (69)
17
k
k
/a /a 2/a 0 2/a
2U
FIG. 7: Calculation of the bands in a nearly-free electron model. Shown are the energies
k
and
k+K
with K =
2
a
. The
calculated nearly-free electron bands are shown for the region [K/2, K/2] = [
a
,
a
]
We see that an energy gap opens at the k =
1
2
K. We can also verify that in this limit around k =
1
2
K, we have
E
k
=
2
m
(k
1
2
K), (70)
i.e. the derivative is zero on the Bragg planes and the bands are therefore at. (Not entirely surprising since we just
demonstrated that a gap opens, so the bands must be either in a maximum or a minimum).
We can extend this procedure to include additional reciprocal lattice vectors K. Let us consider the situation in one
dimension for K = 0 and K
=
2
a
. We can also use the following properties of the potential. Since the potential
is real, we have
U
K
= U
K
. (71)
If in addition, we have inversion symmetry (meaning that the system is unchanged when r r, which is the case
for a linear chain), we also have
U
K
= U
K
= U
K
. (72)
Using this we can write down the Hamiltonian.
H =
_
_
k
U
K
U
K
U
K
kK
U
2K
U
K
U
2K
kK
+
_
_
. (73)
The results are given in Fig. 7. Note that the energies
kK
=
2
(k K
)
2
/2m look in the region [K/2, K/2] as
parabolas centered around K
x
k=0, n=1
k=/a, n=1
k=/a, n=2
k=0, n=2
FIG. 8: The wavefunction of the nearly-free electron eigenstates for four eigenvalues E
0,1
, E
a
,1
, E
a
,2
, and E
0,2
. We have
assumed U
K
< 0 and U
2K
> 0.
the region [K/2, K/2] and the band index n. Because of the mixing, we can no longer identify eigenstates with the
free-electron bands. Even at k = 0, we have, for the lowest eigenfunction in Fig. 7,
[E
k=0,1
) = a[k)
_
1 a
2
([k K
) +[k K
+
)), (74)
where the coecient a
= 1 (actually a = 0.9978 for the example in the gure). Indeed close to [k), but still mixed
with [k K
). The repeated-zone scheme has its uses from time to time, as long as you keep in mind that it just
repeats the same information as is already contained in the rst Brillouin zone and nothing new is added.
V. COMPARISON OF RESULTS FOR TIGHT-BINDING AND NEARLY-FREE ELECTRON MODEL
In this section, we will compare the results from tight-binding and the nearly-free electron model. At rst, this
looks like a pointless excersize. For tight-biding we nd a single band given E
k
= 2t cos ka. For the nearly-free
electron model we have several bands which originate from free electron parabolas. Although, we are not going to
make the claim that the two are exactly equivalent, they have more in common that you might think at rst sight. The
tight-binding model starts from atomic states which include the potential of the nucleus fully and then starts adding
the kinetic energy due to the delocalization of the electrons when the solid is formed. The nearly-free electron model
starts from completely delocalized electron and starts adding the eect of the potential of the nuclei. Somewhere
along the way to two must have something in common. (They will never be entirely equivalent, since the treatment of
the potential in the nearly-free electron is somewhat oversimplied. The inclusion of only two Fourier compoments,
U
K
and U
2K
, is obviously insucient to describe the much more complicated shape of the potential.
Let us start by looking at the eigenfunctions in the nearly-free electron model. Let us start with the one with the
lowest energy. The eigenfunction is given in Eqn. (74). This gives a wave in real space for k = 0 and n = 1,
0,1
(x) =
1
L
(a 2
_
1 a
2
cos Kx). (75)
In the case that a = 1 (i.e., the limit U = 0), we just obtain that
0,1
=
1
L
constant throughout the system. The
higher-order oscillation are a result of the (small) mixing in of the K
terms.
Eect are more drastic around k = K
2
,i
(x) =
1
2
L
(e
i
a
x
sgn(U
K
)e
i
a
x
). (76)
with i = 1, 2. The wavefunctions depend on the sign of the coupling. Let us consider the situation U
K
< 0, shown in
19
Fig. 8, which gives
2
,n
(x) =
_
_
_
_
2
L
cos
a
x n = 1
i
_
2
L
sin
a
x n = 2,
(77)
where n is the band index which simple numbers the band with the lowest number assigned to the band lowest in
energy. We see that
2
,1
(x) has extrema on the positions of the nuclei, whereas
2
,2
(x) has nodes on the positions of
the nuclei. This is reminescent of atomic physics: s orbitals have their maxima at the nucleus, whereas, for example
the p
x
orbital has a node at the nucleus. From this we can understand one origin of the discrepance between our
tight-binding and nearly-free electron calculation. In our tight-binding calculation, we only included s orbitals, so
we only get one band hich resembles the lowest band of the nearly-free electron calculation, but not the higher-lying
band. On the other hand, we also see that the eect of folding the free-electron bands is to create orbital-like
structure related to the nuclei. For the highest energy of the second band (n = 2), the wavefunction has nodes not
only at the nuclei, but also between the nuclei, see Fig. 8.
However, we see that we can create a better connection between the two approaches by including p type orbitals.
Let us try to extend our tight-binding approach, following the approach by Slater and Koster, Phys. Rev. B 94, 1498
(1954). We adjust Eqn. (25) to allow the inclusion of more orbitals
k,s
(r) =
1
R
e
ikR
s
(r R). (78)
The dispersion for these states, we found to be
ss
k
= 2(ss) cos ka with (ss) =
_
dr
s
(r R
)H
s
(r R) (79)
where R and R
are nearest neigbors and (ss) = t (using the notation by Slater and Koster indicating that the
matrix element is between two s orbitals; the indicates the type of bonding. There are also and bonding, but
that is not relevant for the discussion here). We can repeat this approach for p orbitals giving
k,p
(r) =
1
R
e
ikR
p
(r R) (80)
with energies
pp
k
= 2(pp) cos ka. with (pp) =
_
dr
p
(r R
)H
p
(r R) (81)
This looks similar, but there is an important dierence. For typical Hamiltonians H, the matrix elements are negative
when the positive lobes are pointing to each other. This is always the case for the s orbital which is positive everywhere
(+)(+)(+)(+)(+)(+)(+)(+) , (82)
where (+) schematically indicates an s orbital at a certain site. This is the state found for k = 0 (and energy
k
= 2(ss) < 0). Note that this is positive everywhere just as the nearly-free electron wavefunction for k = 0 and
n = 1 in Fig. 8. For k =
a
(and energy
k
= 2(ss) > 0), the exponential changes sign at each site, giving
(+)()(+)()(+)()(+)() . (83)
This has nodes between the nuclei, just as the wavefunction for k =
a
and n = 1.
The p orbital on the other hand have a node
(+)(+)(+)(+)(+)(+)(+)(+) (84)
The convention we choose is that the postive lobe of the p orbital (+) is in the positive x direction (we can choose
any convention that we like, but in the end it should lead to the same results). In this convention, we see that the
positive lobes are pointing towards the negative lobes. The matrix elements are therefore negative. This also has as
a consquence that the solution with the lowest energy is the one where the sign changes for each sites, i.e.
(+)(+)(+)(+)(+)(+)(+)(+) (85)
20
However, this looks nice since the positive lobes are pointing towards the positive lobes and the negative lobes are
pointing towards the negative lobes. This is essentially the bonding state with no nodes between the nuclei (but with
nodes on the nuclei). This corresponds well with the lowest n = 2 nearly-free electrons solution for k =
a
(alternating
signs), which also has no nodes between the nuclei. The state with the higher energy is given in Eqn. (84), where
there are nodes between the nuclei, comparable to the nearly-free electron wavefunction for k = 0 and n = 2 in Fig.
8.
We can also compare the dispersion. In order to do this, we need to overcome one more hurdle, namely the matrix
elements between s and p are not zero,
(sp) =
_
dr
s
(r R
)H
p
(r R). (86)
The dispersion does not follow a cosine dependence since an s orbital, in our convention, couples dierently to the
left and the right
(+)(+)(+), (87)
a problem that did not arise when dealing with just s orbitals. Following Eqns. (33) and (34), we nd
sp
k
= (sp)(e
ika
e
ika
) = 2i(sp) sin ka. (88)
We now have a basis (
k,s
,
k,p
) giving a matrix
H =
_
ss
k
sp
k
(
sp
k
)
pp
k
_
(89)
Figure 9 gives a comparison between the lowest two bands (n = 1, 2) in the nearly-free electron model and the s and
p tight-binding model. The t has been done by eye. Although a perfect t cannot be expected due to the dierent
approximations, it is clear that they are giving comparable physics.
Although, at rst sight, it might seem surprising that similar physics comes out of a model starting with free
electrons and a model starting from atomic orbitals. A comparison between tight-binding and nearly-free electron
model can be given as follows:
tight binding nearly free electronmodel
starting point atomic properties kinetic energy of the electrons
basis functions atomic orbitals plane waves
k = n
2
L
periodic boundary conditions periodic boundary conditions
k [
a
,
a
] N atoms give N k values or
N
2
2
L
k
N
2
2
L
periodic potential couples k values
that dier by K = n
2
a
number of bands equals number of atomic orbitals equal number of reciprocal lattice vectors
approximations neglect orbitals U
K
= 0 for large K
(90)
However, when you think more of it, in principle it should not make a dierence. The starting wavefunctions are
just a basis. If my basis is complete (i.e. you can express any arbitrary wavefunction in your basis, a statement you
might remember from Fourier analysis) then solving the same Hamiltonian should give the same eigenfunction and
eigenenergies, regardless of the basis we started with. Dierences do arise because we start making approximations
along the way:
The free-electron model is beautiful as a basis (exponentials are the foundation of Fourier analysis): they are
orthogonal and complete. The tight-binding basis set is more problematic. In principle, the solutions of the
Hydrogen atom for one atom form a complete basis which is orthogonal. However, this is not what we are
doing. We are taking the solution for one atom and putting them on dierent sites. This basis set is not even
orthogonal.
The potential of the nuclei is much better treated in the tight-binding model. There is one important caveat: in
general we are not dealing with Hydrogen atom. The potential is treated much more poorly in the nearly-free
electron model. So far, we have included only two Fourier terms. Obviously, this will give a sinosoidal potential
and not the 1/r potential that one would expect from the nuclei. However, in principle nothing prevents us
from including all Fourier components. Then again, this defeats the purpose of a nearly-free electron model.
21
k
/a /a /2a /2a 0
E
FIG. 9: Comparison of the two bands lowest in energy (n = 1, 2) in a nearly-free electron (red) with a tight-binding calculation
(blue dotted) including an s and a p orbital.
Summarizing, although the nearly-free electron usually takes pride of place in most textbooks over the tight-binding
model (in that it is usually treated in an earlier chapter), the tight-binding model wins out in practicality. Tight-
binding is often used in the scientic literature. The electronic structure is nowadays generally calculated with complex
codes that have been developed over many years. However, one often would like to use the output of such calculations
as the input for other calculations (for the experts, for example, many-body calculations or molecular dynamics). This
is usually done by making a tight-binding t to the more sophisticated electronic structure calculations. Nearly-free
electron models are used much less often (typing in tight-binding on Physical Review online yielded more 3500 papers,
whereas nearly-free electron gave less than 300 papers). One of the reasons is that it works well for only a few systems,
usually involving s and sometimes p orbitals. One might actually wonder why it works at all. One needs that the
nucleus is very well screened. One has to realize that most electrons in a solid are very strongly bound to the nucleus
and only the electrons in the outer shells really form bands. However, this inner shell electrons can eectively screen
the nucleus. They are eectively a negative charge compensating the positive charge of the nucleus.
VI. OTHER WAYS OF KEEPING ATOMS TOGETHER
A. Covalent bonds
Covalent bonds are strong bonds with a clear direction. The concepts are very important conceptually for organic
chemistry. In solids, the ideas become somewhat less clearly dened. The most straightforward covalent bond is the
hydrogren bond of the H
2
atom. There are two electrons in the molecule that both reside in the bonding states. This
is a typical situation for covalent bonding, where a bond between two atoms is shared by two electrons.
When we consider carbon, the situation becomes a bit more complex. Suppose we want to make a linear chain
of carbon atoms. Let us take the chain in the z direction. The conguration of the Carbon atom is given by
(1s)
2
(2s)
2
(2p
x
)
1
(2p
y
)
1
. The 1s electrons are strongly bound and do not contribute to the bonding. These are known
as core electrons. The 2s and 2p orbitals are close in energy The strongest bonding occurs for the bonding between
the orbitals along the chain: the 2s orbital (which does not point in any particular direction) and the 2p
z
, which lies
along the chain. Their coupling is known as -bonding. The 2p
x
and 2p
y
orbitals, which are perpendicular to the
chain, bond as well (known as -bonding) but this is weaker since the orbitals are further apart. Let us return to the
-bonding. Now we have a problem since we have two orbitals along the chain, so which one forms the bonding. The
solution is that they form hybrids that can be expressed as
1
= a
1
s
+a
2
p
z
and
2
= b
1
s
+b
2
p
z
. (91)
22
Atomic orbitals
Hybrids
FIG. 10: The sp-hybrids in Carbon are formed by combining the 2s and 2p orbitals.
The coecients are determined such that the wavefunctions are orthogonal and normalized
_
1
dr = 1 and
_
2
dr = 1 and
_
2
dr = 0 (92)
We want that the new wavefunctions contain both an equal amount of s character. Combining all the equations leads
to the solution
1
=
1
2
(
s
+
p
z
) and
2
=
1
2
(
s
p
z
). (93)
This process is depicted in Fig. 10. What we obtain are two sp hybrids one pointing predominantly in the positive z
direction, whereas the other is pointing in the negative z direction. An example of an sp hybrid is acetylene (C
2
H
2
).
Acetylene is a linear molecule with the two carbon atoms in the center and the two hydrogen atoms at the ends of the
molecules. The two sp hybrids of the Carbon atoms pointing between the carbon atoms form a bonding molecular
orbital between the two carbon atoms. The sp hybrids pointing to the outside of the molecule form molecular bonds
with the hydrogen atoms. In addition to the bonding, we have the bonding orbitals, where the p
x
and p
y
orbital
of one carbon atom connect to the p
x
and p
y
orbital of the other carbon atom, respectively. This creates a total of
three bonds between the two carbon atoms. This is often denoted as
H C C H. (94)
Each carbon atom has four bonds corresponding to the four valence electrons of carbon. The amazing thing about
carbon is the enormous exibility of making covalent bonds. In addition to linear bonds, planar bonds can be made,
known as sp
2
hybrids Following the same procedure as described above, we can nd three new wavefunction consisting
of s, p
x
, and p
y
orbitals:
1
=
1
s
+
_
2
3
p
x
(95)
2
=
1
p
x
+
1
p
y
(96)
2
=
1
p
x
1
p
y
. (97)
These hybrids are depicted in Fig. 11. Note that every orbital has 1/3 s orbital and 2/3 p orbital character. The rst
hybrid has on the positive x side both the positive s orbital and the positive part of the p orbital. There the wave
23
(a)
(b)
(c)
FIG. 11: Hybrids that are formed by combining s and p orbitals: (a) sp, (b) sp
2
, and (c) sp
3
hybrids. From
http://www.chm.davidson.edu/vce/AtomicOrbitals/hybrid.html.
functions add up. On the negative x side, we have a positive s orbital and the negative lobe of the p orbital. There
the wavefunction partially cancel each other. For the second hydrid, the situation is very similar. Note that the wave
function can be written as
2
=
1
s
+
_
2
3
_
1
2
p
x
+
1
2
3
p
y
_
(98)
=
1
s
+
_
2
3
_
cos 120
p
x
+ sin 120
p
y
_
=
1
s
+
_
2
3
p
120
. (99)
The combination of p
x
and p
y
orbtital is eectively a p orbital at 120
degree angle with each other. Two of those couple two the Hydrogen atom, the other connects the two Carbon
atoms, forming a planar molecule. The remaining p is perpendicular to the plane of the molecule (the p
z
orbital).
The two p
z
orbitals of the Carbon atoms for a bonding with each other. Another example we have seen before
namely benzene C
6
H
6
, see Fig. 5. The six carbon atoms form a hexagon with 120
1
=
1
2
(
s
+
p
x
+
p
y
+
p
z
) (100)
2
=
1
2
(
s
+
p
x
p
y
p
z
) (101)
3
=
1
2
(
s
p
x
+
p
y
p
z
) (102)
4
=
1
2
(
s
p
x
p
y
+
p
z
). (103)
The orbitals form a tetrahedron and have mutual angles of 109
.28
. (104)
Obviously, this cost energy, since the electron was bound to the sodium nucleus. The chloride atom accepts the
electron
Cl(1s
2
2s
2
2p
6
3s
2
3p
5
) +e
Cl(1s
2
2s
2
2p
6
3s
2
3p
6
) + 3.61 eV. (105)
Although we gain energy in the latter process, it still costs 1.52 eV in total. However, now we have a positive and a
negative charge that should have an electrostatic attraction. The energy separation in crystalline NaCl is 2.81 10
10
25
m. A naive estimate gives an attractive energy of 5.1 eV. The real cohesive energy is somewhat higher
Na + Cl Na
+
Cl
(we can do the same thing for divalent ions or more complicated systems). The energy dierence is given by
= E(AB) E(A
+
B
). Now let us also introduce a coupling t (with t > 0) between the two congurations. We
then obtain a matrix
H =
_
0 t
t
_
[A
+
B
)
[AB)
(107)
The eigenvalues are given by
E
=
2
1
2
_
2
+ 4t
2
. (108)
The eigenvectors are
[E
) = cos [A
+
B
) = [A
+
B
) and [E
+
) =
[AB) . However, when t/ increases the system becomes more covalent and the eigenvectors are mixtures of both
congurations (note that we cannot let 0, since in this limit we also need to take uctuations to [A
B
+
) into
account). So in real systems, there is a continuous change from covalent to ionic. Numerical programs to calculate
structure do not make a dierence when calculating metals, covalent, or ionic materials.
Now that we understand a molecule, let us extend it to a linear chain. First we need to introduce an additional
term in the Hamiltonian. If we just had positive and negative ions the minimum distance between the ions will be
zero. Obviously, this does not happen because at some distance the electron clouds start to overlap and the ions will
start to repel each other. The total interaction energy on a particular ion is then
U
i
=
i=j
U
ij
, (111)
with
U
ij
= e
r
ij
/
1
4
0
e
2
r
ij
. (112)
The dependence of the repulsive potential is empirical where and are parameters; indicates the strength and
is the eective range of the repulsive potential. The second term is the potential between the ions where the + sign
is for like charges and the is for unlike charges. The total energy is determined by summing over the other ions.
The result we can multiply by N to account for the number of pairs (assuming we are dealing with an ionic crystal
with two dierent ions). This gives
U
tot
= NU
i
= Nze
R/
N
4
0
q
2
R
(113)
where R is the interatomic distance and z is the number of nearest neighbors. For a linear chain z = 2. The constant
is known as the Madelung potential. This constant follows from the summation over the ions in the chain
j=i
(1)
ji
r
ij
=
2
R
n=1
(1)
n
n
. (114)
The Madelung constant is then
= 2
_
1
1
2
+
1
3
1
4
+
_
= 2 ln 2
= 1.386. (115)
26
So for a linear chain we obtain
U
tot
= NU
i
= 2Ne
R/
2N ln 2
4
0
q
2
R
. (116)
Note that there is no additional factor two because we do not want to double count the bonds. Therefore, N should
be the number of pairs and not the number of ions. The evaluation for a chain is relatively easy. However, even for a
relatively simple structure, such as NaCl, nding is complicated since the series do not converge properly. However,
this is more a mathematical problem which can be solved and gives 1.7476 for NaCl.
The expression in Eqn. (113) can be used to nd the equilibrium distance. The minimum in energy is found for
dU
tot
dx
=
Nz
e
R/
+
4
0
q
2
R
2
= 0. (117)
This can also be written as
ze
R
0
/
=
R
0
1
4
0
q
2
R
0
. (118)
This allows us to write the total energy as
U
tot
=
N
4
0
q
2
R
0
_
1
R
0
_
. (119)
The range of is approximately 0.1R
0
.
Since the binding is mainly due to electrostatic forces and not through covalent bonding, the charges can often be
easily replaced by something else. For example, water molecules have a dipole moment and can eectively replace the
positive and negative charges, see Fig. 12. This has as a result that many ionic crystals dissolve in water. Obviously,
not every ionic crystal dissolves in every liquid. The ionic binding can be too strong. For example, monovalent ionic
crystals dissolve more easily than those with a higher valency. In addition, covalency could play a role in the binding.
Also, it depends strongly on the polarizability of the solvent.
C. Van der Waals forces
Van der Waals forces are important for the creation of solids of noble elements. As we saw when discussing the H
2
molecule, a noble element, such as helium, cannot take advantage of bonding since the 1s orbitals is full. Binding can
still occur between helium atoms due to polarization eects. The electric charge of one helium atom induces a dipole
in another helium atoms creating an eective binding. The binding is very weak and solids of noble elements occur
only at very low temperature. In fact, He only becomes a solid when applying pressure at very low temperatures. We
shall not further discuss the Van der Waals forces here.
VII. FORMALIZATION: BLOCH THEOREM
AM Chapter 8
In the chaper on the nearly-free electron model we saw that the wavefunction can be written as, copying Eqn. (62),
k
(r) =
K
c
kK
e
i(kK)r
. (120)
Since the exponentials form a complete basis set (which means we should be able to express any function in them),
this is a general statement. We can therefore express all eigenstates with eigenenergies E
nk
in this fashion. We also
saw that k is a good quantum number in a crystal. Let us therefore separate the part related to k
nk
(r) = e
ikr
K
c
n
kK
e
iKr
= e
ikr
u
nk
(r) with u
nk
(r) =
K
c
n
kK
e
iKr
. (121)
27
This shows that the eigenfunction can be expressed in a plane-wave part e
ikr
and a function u
nk
(r). The interesting
property of the latter is that it has the periodicity of the lattice
u
nk
(r +R) =
K
c
n
kK
e
iK(r+R)
= u
nk
(r), (122)
since e
iKR
= 1. This is related to the orbital structure that we obtained when folding k values back into the rst
Brillouin zone (in one dimension, the region [K/2, K/2]. An alternative way of expressing Blochs theorem is
nk
(r +R) = e
ik(r+R)
u
nk
(r +R) = e
ikR
nk
(r). (123)
That is nice you might say, but what does it physically mean? Let us consider free space. We know that the solutions
are given by plane waves
k,free
(r) =
1
V
e
ikr
. (124)
Free space is uniform. Therefore, we expect that the probability of nding an electron is equal everywhere. This is
the case since [
k,free
[
2
= 1/V everywhere (some of you might say that the probability does not have to be constant:
we can make wavepackages. This is correct, but you might also remember that wave packages are not eigenstates in
free space). However, even though the probability is constant, the wavefunction can still pick up a phase. Under a
displacement , we have
k,free
(r + ) =
1
V
e
ik(r+)
= e
ik
k,free
(r). (125)
In a solid, we do not expect the probability to be uniform in space. On the contrary, it is very natural to expect that
the probability is higher closer to the nuclei where the electron feel and attractive potential. So we do not expect
something for an arbitrary vector . However, we do expect that the probability is exactly the same when we displace
ourselves by a lattice vector R. After all, the solid looks exactly the same again. This is the underlying physics of
Eqn. (123). In a solid, under displament of R, the only change to the wavefunction can be a phase factor.
Does the Bloch theorem also hold for tight-binding wavefunctions that look dierent from Eqn. (120), see Eqn.
(25). Let us check
k
(r +R
) =
1
R
e
ikR
(r +R
R)
=
1
N
e
ikR
RR
e
ik(RR
)
(r +R
R) = e
ikR
k
(r), (126)
which is Blochs theorem.
VIII. PHONONS IN ONE DIMENSION
A. Monoatomic basis
A phonon is a lattice vibration. It is called a collective excitation, since atoms move at the same time. The easiest
way to derive the phonon dispersion is to start from Newtons equation of motion. The potential between the atoms
are considered to behave like springs. Or in other terms, the assumption is that the potential of the atom around its
equilibrium position is given by a parabola. In one dimension, the equation of motion for atom at site R = na where
n is an integer and a the distance between the atoms. is then given by
M
d
2
x
R
dt
2
= K [(x
R+a
x
R
) (x
R
x
Ra
)] (127)
= K(x
R+a
+x
Ra
2x
R
), (128)
We can look for normal modes of oscillation if we assume that
x
R
e
ikRit
. (129)
28
Inserting gives
2
M = K(e
ika
+e
ika
2) = 2K(cos ka 1) (130)
Using that cos 2x = 1 2 sin
2
x or 1 cos 2x = 2 sin
2
x, we nd
=
_
4K
M
sin
1
2
ka. (131)
Unlike electrons, the phonon dispersion is zero for k = 0. This can be understood by visualizing what the displace-
ments are in this limit. For k = 0, all atoms are moving in the same direction. Basically, this is a displacement of the
lattice which does not cost and elastic energy since the distance between the atoms does not change. This is known
as a Goldstone mode. The velocity of a wave packet is given by its group velocity
v
g
=
d
dk
. (132)
(remember that for a free electron, we have
v
g
=
d
dk
=
d()
d(k)
=
d
k
dp
=
d
dp
p
2
2m
=
p
m
, (133)
so the denition makes sense). For the dispersion in Eqn. (131), we nd
v
g
=
_
Ka
2
M
cos
1
2
ka. (134)
The group velocity is zero at the zone boundary k =
a
. For this wavevector, we have a standing wave. For small
wavevectors, we can use cos x
= 1, to obtain
v
g
=
_
Ka
2
M
. (135)
The group velocity in the limit ka 1 is independent of the wavevector k. In the same limit, we can use sin x
= x,
and write the dispersion relation as
=
_
4K
M
1
2
ka =
_
Ka
2
M
k = v
g
k (136)
IX. PERIODICITY AND BASIS
AM Chapters 4-7
In the previous sections, we studied a simple one-dimensional chain consisting of one type of atom. Taking this
chain along the x-direction, we can build up this chain by taking multiples of a primitive vector that gives the vector
needed to go from lattice site into another
a
1
= a x R = na
1
with n = , 2, 1, 0, 1, 2, (137)
However, more important for our considerations of the periodic potential was not so much the real space position
of the atoms, but their eect on reciprocal space. This is directly related to the fact that momentum k is a good
quantum number. Translational symmetry of the potential imposed, reproducing Eqn. (57),
e
iKR
= 1. (138)
In one dimension this gives reciprocal lattice vectors, which again can be expressed in a unit vector
b
1
=
2
a
x K = nb
1
with n = , 2, 1, 0, 1, 2, (139)
29
(b)
(a)
a
FIG. 13: Examples of one-dimensional chains that cannot be described by a single vector. For both (a) and (b), the green
vector is the lattice vector that describes the periodicity.
It is easy to see that R and K satisfy the condition e
iKR
= 1.
Basis. So far, we have considered one-dimensional systems with only one atom in the unit cell. Let us consider
for example a chain with alternating atoms at the lattice positions na
1
, see Fig. 13(a). It can be easily seen that the
periodicity is now 2a instead of a. We therefore need a new primitive vector
a
1
= 2a x R = na
1
with n = , 2, 1, 0, 1, 2, (140)
Using the vectors
r
1
= 0 and r
2
= a x. (141)
This is known as a basis of atoms. We can build the chain by putting atoms of type 1 at sites R+ r
1
and atoms of
type 2 at sites R+r
2
.
An alternative way to obtain a doubling of the primitive vector is by a displacement of the atom, as done, for
example, in Fig. 13(b). The basis is now r
1
= 0 and r
2
= a x with 0 < < 1. In this situation we place equivalent
atoms at the vectors R+r
i
with i = 1, 2. This is known as a Peierls distortion.
X. EFFECTS OF A BASIS ON THE ELECTRONIC STRUCTURE
In this section, we study the eects that the introduction of a basis can have on the electronic structure. Let us
consider the situation of two inequivalent atoms, see Fig. 13(a), which is slightly easier than the situation for displaced
atoms. We already saw that the one-dimensional chain is given by ,
k
= 2t cos ka, see Eqn. (34). Now one of the
atoms is replaced by a dierent atom. First, this could aect the hopping matrix element t. However, this eect is
not that interesting since it does not really make the atoms dierent and we ignore it. The next eect is the dierence
in on-site energy that an electron feels when it is on site. Let us take this dierence in on-site energy to be . We
can now write down the 2 2 tight-binding Hamiltonian for the chain
H =
_
0
k
k
_
[1)
[2)
(142)
Note that atoms of type 1 only couple to atoms of type 2. Atom of the same type do not couple to each other.
However, in principle, we could include next-nearest neighbor matrix elements, which leads to terms of the form
i
k
= 2t
i
cos 2ka for i = 1, 2 on the diagonal. This matrix is solved straightforwardly giving eigenvalues
E
=
2
1
2
_
2
+ 4
k
. (143)
The solutions are given in Fig. 14. The rst thing to note is that the increase of the periodicity from a to 2a leads to
a reduction of the size Brillouin zone from /a to /2a. Second, in the region [/a, /a], we now have two bands
which is directly related to the fact that we now have two atoms in the basis. However, since the states are strongly
mixed we cannot identify a particular band with a particular atom. Third, at the edges of the new Brillouin zone at
/2a, we see that a gap opens. In the limit 0, this gap disappears and the eigenvalues are E
=
k
. We
recover the original band, but folded back inside the smaller Brillouin zone.
For the distorted system, see Fig. 13(b), a similar thing happens. This result can have surprising eects. Let us
suppose we had an undistorted system with a half-lled band
k
. Since the Fermi level lies nicely inside a band, this
system is a metal. Now if the system for some reason distorts (and it is not unusual to have lattice distortions as a
function of temperature), then a gap opens. The lower band in Fig. 14 is then full, whereas the upper band is empty.
Therefore, an energy of is necessary to excite an electron from the occupied states to the unoccupied states. The
system is now an insulator/semiconductor.
30
-2
-1
0
1
2
k
/
t
k
/a
/a /2a /2a 0
FIG. 14: The eect of having alternating atoms 1 and 2 on the electronic structure for a one-dimensional chain. First, the
increase of the periodicity from a to 2a leads to a reduction of the Brillouin zone from /a to /2a. Second, at the edges of
the new Brillouin zone at /2a a gap opens up (for clarity, the new bands have been shifted down by /2).
XI. EFFECTS OF A BASIS ON THE PHONON DISPERSIONS
We now consider what the eect is of the presence of two atom in the basis on the phonon dispersion. We can
extend Eqn. (128) by considering two atoms
M
1
d
2
u
R
dt
2
= K(v
R+a
+v
Ra
2u
R
) (144)
M
2
d
2
v
R+a
dt
2
= K(u
R+2a
+u
R
2v
R+a
), (145)
where u
R
and v
R
are deviations from the equilibrium position R. Again, we look for solutions in the form of travelling
waves
u
R
= ue
ikRit
and v
R+a
= ve
ikRit
. (146)
Inserting in the equations of motion gives
2
M
1
u = Kv(1 +e
2ika
) 2Ku (147)
2
M
2
v = Ku(e
2ika
+ 1) 2Kv. (148)
We can nd the solutions of these two equations by solving the determinant
2K
2
M
1
K(1 +e
2ika
)
K(1 +e
2ika
) 2K
2
M
2
= 0. (149)
This gives
M
1
M
2
4
2K(M
1
+M
2
)
2
+ 2K
2
(1 cos 2ka) = 0. (150)
The solutions are given by
2
=
K(M
1
+M
2
)
M
1
M
2
1
2M
1
M
2
_
4K
2
(M
1
+M
2
)
2
8M
1
M
2
K
2
(1 cos 2ka). (151)
31
/a
k
/a /2a /2a 0
[2C (1/M
1
+1/M
2
)]
1/2
[2C (1/M
2
)]
1/2
[2C (1/M
1
)]
1/2
FIG. 15: Optical and acoustical branches of the phonon dispersion for a diatomic chain with M
1
> M
2
. The limiting frequencies
at k = 0 and k = /2a are indicated.
We can also look at the limiting values for k 0 and k = /2a. For k 0, we have cos 2ka
= 1
1
2
(2ka)
2
, giving
two roots
=
_
2K(
1
M
1
+
1
M
2
) (152)
=
K
2(M
1
+M
2
)
2ka. (153)
As in the previous section, the doubling of the unit cell from a to 2a leads to a reduction of the Brillouin zone from
k = /a to /2a. At the edge of the Brillouin zone the eigenvalues are
=
_
2K
M
1
and =
_
2K
M
2
. (154)
Although one-dimensional materials (or materials that act one-dimensional) are less common than three-dimensional
systems, the eects described here already give a good idea of the consequences of inclusion of more atoms in a basis.
Also in three-dimensional systems there are dierent reason to have two atoms in the basis. Obviously, one can have
two dierent atoms in the compounds, such as, e.g. NaCl. However, some crystals cannot be simply described by the
primitive vectors, but require a basis. Examples are silicon and diamond.
If the chain was really one-dimensional then the atoms can only move in one direction, i.e. along the chain. However,
for a one-dimensional chain in three dimensions the atoms can move in three directions, not only along the chain,
but also perpendicular to the chain. The vibrations along the chain give rise to longitudinal phonons. The modes
perpendicular to the chain lead to the transverse branches. Since there are two directions perpendicular to the chain,
there are two transverse modes. The corresponds to the notion that for a chain of N atoms, there are 3N degrees of
freedom. The splitting is clearly observed experimentally, see for example the phonon dispersions for silicon in Fig.
17. The longitudinal vibrations, being towards the neighboring atoms, cost more energy.
XII. CRYSTAL STRUCTURES
In one dimension, we do not have too many choices. We can chance a, the dierence between the atoms. One
might think that we can also change something by taking the vector not along the x but in a dierenct direction.
32
+
-
+ -
+
- +
-
+ -
+
-
+
-
+
-
+
-
+
-
+
-
+
-
FIG. 16:
FIG. 17: The phonon dispersion in Si in a high-symmetry direction in the Brillouin zone. Along other directions the transverse
modes split. Note that 100 cm
1
= 12.4 meV.
However, this would not change the one-dimensional crystal, but simply rotate the whole crystal. When going to two
dimensions there are many more possibilities. We can take two vectors a
1
, a
2
. Obviously, we have the condition that
the vectors are not parallel. However, they can have dierent lengths and the angle between the vectors can change.
The lattice can be constructed via
R = n
1
a
1
+n
2
a
2
(155)
When studying crystal structures, the lattice is assumed to stretch to innity. This means that u
1
and u
2
assume
any integer value. From each lattice point, the lattice looks entirely equivalent. There is not one particular choice to
reproduce a lattice, see Fig. 18(a) and (b). Some vectors can produce part of the lattice points, but do not reproduce
the entire lattice, see Fig. 18(c). They are therefore not proper lattice vectors.
The lattice points created by the vectors a
1
and a
2
do not directly correspond to atoms. For a simple system, one
can have a simple one to one correspondence between the lattice points and the atoms, see Fig. 19(a). However,
this is not necessary and we can simply displace the atoms from the lattice points, see Fig. 19(b). The two vectors
a
1
and a
2
build up a parallelogram. Each parallelogram contains exactly the same number of atoms (in this simple
case one atom). This parallelogram is a primitive lattice cell. But generally, several atoms can be related to each
lattice point, see Fig. 19(c). These can be equivalent atoms, dierent atoms, and, in a molecular crystal, correspond
to entire molecules. Again, there is no unique choice to create primitive lattice cells. There are an innite number of
ways to create primitive cells. However, they should all have the same area (or in three dimensions, the same volume)
and built up the plane (space) without leaving any empty space.
33
(a) (b)
(c)
FIG. 18: Lattice points in two dimensions. The choice of vectors is not unique. The vectors given in (a) and (b) both reproduce
the same lattice points. On the other hand, the vectors given in (c) do not reproduce the lattive points.
With two arbitrary vectors, an innite number of lattices can be built. However, for lattice vectors of unequal
length and with an arbitrary angle between them, there is not a lot of symmetry. Only rotations of and 2 about
any lattice point bring the lattice back unto itself. This is called an oblique lattice. However, special lattices can
be invariant under rotation of 2/3, 2/4, or 2/6. To achieve this invariance, stricter conditions must be imposed
onto the lattice vector, such as equal length or specic angles between the vectors, such as 90
and 120
. We can
distinguish ve distinct lattices, known as Bravais lattices, see Fig. 20. The most straightforward is the square lattice,
comparable with a two-dimensional Cartesian coordinate system. This is the lattice with the highest symmetry. We
can change the vectors to obtain lattice of lower symmetry. For example, by making their lengths unequal, we obtain
a rectangular lattice. Note that we could rotate the square lattice under 45
. Again, this lowers the symmetry. In the end, one might wonder why we care that one lattice is more symmetric
than another lattice. The answer lies in the fact that the dierent symmetries manifest themselves into their dierent
electronic structure and macroscopic properties. For example, dierent lattice parameters in dierent directions can
lead to a dierent resistivity.
In condensed-matter physics, one is dealing more often with three-dimensional systems. Again, we can built up a
crystal starting from the primitive vectors a
1
, a
2
, and a
3
. For a simple cubic system, these are given by a
1
= a x,
a
2
= a y, and a
3
= a z. Often however the crystal structures are more complicated, such as face-centered cubic or
body-centered cubic. (This is for a large part related to the fact that simple cubic is not close packed. Try obtaining
a simple cubic lattice by throwing marbles in a box . . . ). Let us look at the dierent lattices in more detail. We can
(a)
(b)
(c)
FIG. 19: Lattice vectors can build up dierent lattices. Figures (a) and (b) shows that lattice points do not necessarily coincide
with the position of the atoms. However, each primitive lattice cell contains the same number of atoms. Although it might
appear that there are more atoms in the primitive lattice cell in (a). This is because the atoms are shared by several primitive
lattice cells. Figure (c) shows an example of a lattice with a basis of two atoms.
34
FIG. 20: The ve fundamental two-dimensional Bravais lattices: 1: oblique, 2: rectangular, 3: centered rectangular, 4:
hexagonal, and 5: square. Source: wikipedia
FIG. 21: The symmetry operations of a cube.
classify the dierent lattice according to their dierent symmetry operations. These symmetry operations can include
translation, rotation, reections and inversion. We can separate two sets of operations for the Bravais lattices
Translations through Bravais lattice vectors.
Operations that leave a particular point of the lattice xed (point group operations).
A combination of the above.
Let us again look at the simple cube. The symmetry operations of a cube are shown in Fig. 21. A cube can be
brought onto itself by two-, three-, and four-fold rotations, nine mirror planes, and by inversion. The cube is the
basic shapes with the largest number of point group operations. In total, there are seven dierent basic shapes, see
Fig. basicshapes, that dene seven dierent lattice systems. Why do we care about the change in symmetry. The
reason for that is that the symmetry can teach us a lot about the splitting of the energy levels of dierent orbitals.
When we only look at the local point group operations, this is generally known as crystal eld splitting, see Fig. 23.
In quantum mechanics, we are taught the hydrogen atom, where the orbitals with the same n and l have the same
35
FIG. 22: The seven basic shapes that dene the seven lattice systems.
TABLE I: Real orbitals as a combination of the atomic orbitals
lm
.
.
l = 0 s
00
1
l = 1 p
x
1
2
(
11
1,1
) x
p
y
i
2
(
11
+
1,1
) y
p
z
10
z
l = 2 d
yz
i
2
(
21
+
2,1
)
3yz
d
zx
1
2
(
21
2,1
)
3zx
d
xy
i
2
(
22
2,2
)
3xy
d
x
2
y
2
1
2
(
22
+
2,2
)
3
2
(x
2
y
2
)
d
3z
2
r
2
20
(
3
2
z
1
2
r
2
)
energy (although you probably also learnt that they can be split by the spin-orbit coupling, which we will not consider
here). The quantum number l with its orbital components m = l, l + 1, , l 1, l can be seen as a symmetry
classication for spherical symmetry. Symmetry tells you that dierent l can have dierent energies, but there is no
reason that the 2l + 1 dierent levels classied with m should have dierent energies. The spherical group is known
as SO
3
. The 3 essentially indicates that all three axes are equivalent. This is no longer the case when one applies a
magnetic eld. When taking the eld along the z-axis, the x and y axis are still equivalent to each other, but the z
axis is dierent. This group is known as SO
2
. Symmetry considerations that are beyond the scope of this course tell
that this splits all the dierent levels for a particular l, i.e. all the m have dierent energies. The theory behind this
is known as group theory. Symmetry considerations can teach you many things. For example, many of the theory on
quarks in high-energy physics are based on group-theoretical considerations (in this case the SU
3
group).
36
FIG. 23: The eect of the lowering of the symmetry on the energy levels of the 3d electrons.
The same ideas hold for point-group symmetries. If a systems has a higher symmetry (i.e. more symmetry operations
bring the lattice unto itself) then more levels are degenerate. If the symmetry is lowered, the levels split. Let us
consider an atom in a cubic environment. The atom has six nearest-neighbors at equal distances, see Fig. 23. We
know that an s level in this surrounding does not split, simply it is just one state and it is impossible to split one
state. When we consider the p orbitals, again nothing changes. We can imagine the p
x
, p
y
, and p
z
orbitals and one
can easily verify that they have the same surroundings. Table I shows how to obtain these orbitals from the atomic
orbitals and Fig. 24 gives an idea of the size of the wavefunction. However, this is no longer the case when considering
d orbitals. The d orbitals have l = 2 and are therefore vefold degenerate (neglecting spin). Group symmetry tells you
that the d orbitals split into two groups, one twofold degenerate and one threefold degenerate. These are generally
known as e
g
and t
2g
orbitals, respectively. Although group theory tells you that this is the way they should split in
this symmetry, it does not tell not tell you what the energies are (compare high-energy physics. The symmetry tells
you what quarks there are not their masses). The symmetry tells you how things will split, but the Hamiltonian of
the system causes the actual splitting. In fact, the splitting can be due to dierent eects. If there is nothing in the
Hamiltonian of the six surrounding atoms that aect the atom in the center, see Fig. 23, the splitting is zero and all
the d levels are degenerate. Essentially from an electronic point of view, the symmetry is still spherical. The levels
can split because the neighboring atoms are not neutral (for example, in an ionic crystal, they have a real charge).
These arguments also hold for more complicated systems. Figure 23 shows a typical situation for the d orbitals, where
the t
2g
levels d
xy
, d
yz
, and d
zx
are lower in energy than the e
g
levels d
x
2
y
2 and d
3z
2
r
2. Generally, in a compound,
the transition metal ions are positively charged. For example, the formal valency of nickel in NiO is Ni
2+
. NiO has a
NaCl structure, so the nickel atom is surrounded by six oxygen ions. This is a cubic surrounding as shown in Fig. 23
(the formal notation is O
h
). The surrounding oxygens are negatively charged, O
2
. This means that the 3d orbitals
of nickel experience a potential of the six oxygens. Since this potential is not nicely spherical, the orbitals split. The
electrons in the e
g
orbitals feel the repulsive potential of the oxygens more strongly, since the lobes of the orbitals are
pointing towards the neighboring oxygens, see Fig. 24. An alternative way to split the orbitals is due to the formation
of covalent bonds between the transition metal and the oxygens. Even though the mechanism is dierent, the splitting
is the same. The symmetry can be lowered by, for example, removing the atoms in the z direction, see Fig. 23. This
surrounding has tetragonal symmetry, see Fig. basicshapes. When doing this, the p orbitals split in energy. Just by
looking at the surrounding ions, it is clear that the p
z
orbital is now dierent from the p
x
and p
y
orbitals. For the d
37
FIG. 24: Top: the three p orbitals (p
z
, p
x
, p
y
). Bottom: the ve d orbitals (x
2
y
2
, 3z
2
r
2
, xy, zx, yz).
orbitals, we see that the levels split further. Only the yz and zx orbitals are now equivalent from a symmetry point
of view. In a solid where all the axes are dierent, known as orthrombic symmetry, see Fig. basicshapes, also the
energies of these two orbitals will be dierent.
The point group symmetries lead to seven basic shapes, see Fig. basicshapes, which correspond to seven lattice
systems. However, we have considered now only the point group. We can also include the translational properties.
The combination of point group and translational operations is called the space group. Inclusion of these lead to a
total of fourteen Bravais lattices, see Fig. 25. For the cubic system, there are three inequivalent lattices: the simple
cube, the body-centered cube (bcc), and the face-centered cube (fcc). Although part of the cubic point group, the
primitive vectors of the simple cube do not produce the fcc and bcc lattices. We could build these lattices using the
primitive vectors of a simple cube plus a basis. For example, for bcc, the basis would be 0 and
a
2
( x + y + z) if the
sides of the cube have length a. However, although this might bring out the point group better, they are not the
primitive vectors of the bcc lattice. One choice that reproduces the bcc lattice is
a
1
= x a
2
= y a
3
=
a
2
( x + y + z), (156)
see the left side of Fig. 26. However, generally there are more choices. A more symmetric set is
a
1
=
a
2
( x + y + z) a
2
=
a
2
( x y + z) a
3
=
a
2
( x + y z), (157)
see the right side of Fig. 26. We cannot do this for all lattice systems since some of them are equivalent to other
Bravais lattices. This can already be seen in two dimensions. Whereas there is a centered rectangular Bravais lattice,
there is no centered square lattice. It is easily veried that a centered square lattice is in fact a square lattice with a
smaller lattice constant.
38
XIII. MEASURING CRYSTAL STRUCTURE: DIFFRACTION
The most common method to study crystal structure is X-ray diraction (although it can also be done with electrons
and neutrons). The X-rays scatter through repeated interactions with the atomic nuclei plus their surrounding electron
cloud. This scattering is very similar to the scattering of electrons on the periodic potential as we have seen earlier.
FIG. 25: The fourteen three-dimensional Bravais lattices. Source: wikipedia
39
FIG. 26: Two ways of choosing the primitive vectors for a bcc lattice.
FIG. 27: Bragg reection in back-scattering for a three-dimensional crystal (or a chain). The left side shows the condition for
which Bragg scattering occurs. The light coming in from the left (green dots) is the same. From top to bottom, the red line
shows the reection from the rst, second, and third crystal plane. The outgoing waves are all the same again and therefore
interfere constructively. The wavevector k = /a, with a the distance between the planes, corresponding to a wavelength of
= 2a satises the Bragg condition. The right side shows what happens for a wavevector that does not satisfy the Bragg
condition. Again, all the incoming waves are equivalent. However, since the scattered waves are dierent, there is a destructive
interference.
In fact, we already derived the conditions for the scattering of X-rays, see Eqn. (65),
k K =
1
2
K
2
, or k
K =
1
2
K, (158)
where is
Kis a unit vector in the direction of the reciprocal lattice vector K. When k is parallel to K, we have k =
1
2
K.
This is eectively like a one-dimensional case, where the reciprocal lattice vectors are given by K = 0,
2
a
,
4
a
, .
The smallest nonzero wavevector for which we have Bragg reection is then k =
a
. This corresponds to a wavelength
of 2a, where a is the distance between the lattice planes. This situation is depicted in the left side of Fig. 27. We see
that the scattered waves have all the same phase and therefore interfere constructively. On the right side of Fig. 27,
we see that for an arbitrary wavevector scattered waves from dierent crystal planes, the waves interfere destructively.
We can also see what happens if the planes are inequivalent. Figure 28 shows the situations for a wavevector k = /2a
or a wavelength of = 4a. If the planes were equivalent, one would not expect any constructive interference, which
can be seen from Fig. 28, where the scattered waves have opposite sign when going out of the crystal. This is expected
since k = /2a does not satisfy the condition k =
1
2
K. However, if there are alternating planes of dierent atoms, the
amplitude of the scattered waves are dierent and, although they still interfere destructively, the cancellation is not
complete and one still observes scattered intensity. This is expected since making alternating planes of dierent atoms
40
FIG. 28: Bragg reection in back-scattering for a three-dimensional crystal with k = /2a or a wavelength of = 4a, where a
is the distance between the planes. If the planes were equivalent, this wavevector would not satisfy the Bragg condition and
the scattered waves would interfere destructively. However, if the planes are inequivalent (for example, dierent atoms or the
atoms in one plane are slightly displaced) then there is not a complete cancellation and one can still observe a Bragg peak.
FIG. 29: The scattering of X-rays from a solid with lattice planes separated by d. The angle between the X-ray beam and the
planes is . The scattered angle is 2.
increases the periodicity from a to 2a. Therefore the lowest nonzero reciprocal lattice vector is K = 2/2a = /a and
therefore k = /2a does satisfy the condition k =
1
2
K.
Although we had already obtained the Bragg condition k K =
1
2
K
2
, it is instructive to look at a slightly dierent
derivation. Let us suppose there are X-rays coming in at an angle with respect to the lattice planes, see Fig. 29. As
we saw in the one-dimensional case, in order to obtain constructive interference, the dierence in path length needs
to be a multiple of the wavelengths of the x-rays. It is straightforward to obtain the dierence in pathlength, which
is 2d sin . Equating this to a multiple of the wavelengths gives Braggs law
2d sin = n. (159)
41
FIG. 30: A typical powder diraction pattern where the scattering angle is scanned (a -2 scan). The sample is a NbSe
3
powder. From Zhili Xiao.
This is equivalent to the result that we obtained before, which in X-ray diraction is called the Von Laue condition. For
scattering it basically states that for elastic scattering, the dierence between the incoming and outgoing wavevectors
should be equal to a reciprocal lattice vector
k k
= K or k
= k K. (160)
Since the wavelength before and after the scattering is equivalent (otherwise the scattering would not have been
elastic), the norm should remain the same
k = [k K[ k
2
= k
2
2k K+K
2
k K =
1
2
K
2
, (161)
which is the same result as for scattering of electrons. Note that the reciprocal lattice vectors for the planes in Fig.
29 is K = n
2
d
z, where the unit vector z is perpendicular to the lattice planes. We then obtain
k K =
2
n
2
d
cos(
2
) =
1
2
K
2
=
1
2
_
n
2
d
_
2
2d sin = n, (162)
which reproduces Braggs law.
There are several ways to do X-ray diraction
The Laue method. Here the scattering geometry is xed, but the wavelength is varied or, alternatively, using
not a monochromatic beam, but one containing X-rays with a certain bandwidth. If the crystal structure is
complicated, it is often dicult to properly assign the Bragg peaks. However, it has the advantage that many
Bragg peaks can be collected in a short period and is therefore interesting for time-dependent applications.
More often used methods involve using monochromatic X-rays and changing the scattering condition. In these
techniques, one can change the scattering angle (also known as -2 scan, see Fig. 30). However, this does not
produce all the Bragg peaks, since one also needs to vary the dierent orientations of the planes. This can be done
in two ways. When dealing with a single crystal, one can rotate the crystal to access the dierent orientations
of the crystal (the rotating crystal method). An alternative method is to measure the X-ray diraction of a
powder. Note that a powder is not an amorphous system, but consists of many small crystallites. Therefore,
the powder contains the dierent orientations of the crystal and an averaging occurs (this is no as the powder
or Debye-Scherrer method).
XIV. THE RECIPROCAL LATTICE
By now, it has become clear that when dealing with solids the wavevectors related to the lattice often seem to play
a more important role than the lattice in real space itself. This is a direct result of the fact that the wavevectors k
42
Real space:
Reciprocal space:
bcc
fcc
fcc
bcc
FIG. 31: The primitive unit cell for the bcc (left) and fcc (right) lattices. The spheres indicate the lattice points. Alternatively,
these gures represent the rst Brillouin zones of the fcc (left) and bcc (right) lattices. The spheres then represent the reciprocal
lattice vectors.
are good quantum numbers due to the periodicity of the lattice. This is nothing unusual in physics. The periodicity
in time of a wavefunction, makes the frequency (and at the same time, the energy a good quantum number. As
we have also seen, it is also directly related to symmetry. That momentum is a good quantum number is inherently
related to translational symmetry. Again, we have seen that in quantum mechanics. Angular momentum is a good
quantum number for the hydrogren atom because of spherical symmetry. Also, no one is surprised that momentum
is a good quantum number for a free particle. Conservation of momentum is a result of the translational symmetry
of space. In a solid, however, not every point in space is equivalent to all the other points. Only discrete points are
equivalent to each other. In this section, we take a closer look on how to go from real space to reciprocal space in
three dimensions.
As we have seen before, reciprocal lattice vectors need to satisfy the condition e
iKR
= 1. This can be done by
dening the primitive vectors
b
1
= 2
a
2
a
3
a
1
a
2
a
3
. (163)
The vectors b
2
and b
3
can be obtained by cyclic rotation, i.e 1 2 3 1. Note tha a
2
b
1
= 0 and a
3
b
1
= 0,
since the outer product makes b
1
perpendicular to a
2
and a
3
. In addition, a
1
b
1
= 2, and therefore the condition
e
iKR
= 1 is satised. The term V = a
1
a
2
a
3
is the volume of the parallelopiped spanned by the three primitive
vectors.
The situation is relative straightforward for a simple cubic lattice which can be generated by the primitive vectors
a
1
= a x and a
2
= a y and a
3
= a z R = n
1
a
1
+n
2
a
2
+n
3
a
3
. (164)
The reciprocal lattice vectors satisfying e
iKR
= 1 are also easily obtained
b
1
=
2
a
x and b
2
=
2
a
y and b
3
=
2
a
z K = n
1
b
1
+n
2
b
2
+n
3
b
3
. (165)
Let us now consider a somewhat more complicated lattice, such as the body-centered cube (bcc) as given in Eqn.
(157). Let us rst calculate V . Although we can start with the vectors in Eqn. (157), there is a somewhat simpler
method. When considering the nonprimitive simple cube unit cell, the bcc structure can be described by a lattice and
a basis. The volume of the simple cube is a
2
. However, there are two atoms in the basis. Therefore, the primitive
cell, containing only one atom, has a volume a
3
/2. The rst reciprocal lattice vector is then given by
b
1
= 2
2
a
3
_
a
2
_
2
( x y + z) ( x + y z) =
2
a
( y + z). (166)
Similarly, the other two are given by
b
2
=
2
a
( x + z) and b
2
=
2
a
( x + z) (167)
Interestingly, the reciprocal lattice of a bcc crystal structure is fcc. However, as we have seen, the more interesting
physics often occurs at the regions in reciprocal space where the Bragg condition is satised. The condition k
K =
1
2
K
denes a plane that has
K as its normal vector. In addition, the vector
1
2
K lies in the plane. The rst Brillouin
zone is dened as the smallest volume around a particular reciprocal lattice vector enclosed by the planes given by
the Bragg condition. The rst Brillouin zones for the bcc and fcc lattices are given in Fig. 31. We can use the same
43
procedure to dene a primitive unit cell in real space using planes given by r
R =
1
2
R. These are called Wigner-Seitz
cells and are given in Fig. 31 for the bcc and fcc lattices. However, in practice the Brillouin zone is of much greater
importance than the Wigner-Seitz cell.
Essentially, in experiments such as X-ray and neutron diraction, we are not measuring the real space distribution
of the charge density, but the distribution in reciprocal space, which is the Fourier transform of the real space. Since
the charge distribution is periodic due to the crystal symmetry, the Fourier transform is discrete
n(r) =
K
n
K
e
iKr
, (168)
where the Fourier coecients are given by the inverse Fourier transform
n
K
=
1
V
c
_
cell
n(r)e
iKr
dr, (169)
where the integration is over the volume V
c
of a cell of the crystal. We have seen such a Fourier series before when
introducing the periodic potential in the nearly free-electron model. Writing the density density as a Fourier transform
directly makes it periodic since n(r +R) = n(r) since exp(iK R) 1 denes the reciprocal lattice.
For example, the interaction between light and matter is included by including the vector potential in the kinetic
energy
H =
(p eA)
2
2m
=
p
2
2m
+
e
2m
(p A+A p) +
e
2
A
2
2m
. (170)
The second term causes electronic transitions, such as absorption. For diraction, we are interested in the third term.
I we take the vector potential to be given by a plane wave A(r) = ae
ikr
, where a gives the strength and direction of
the potential, then to lowest order in perturbation theory, the scattering matrix element is given by
[
e
2
A
2
(r)
2m
[) =
e
2
a
2
2m
_
dV
(r)e
i(kk
)r
(r) =
e
2
a
2
2m
_
dV n(r)e
ikr
=
e
2
a
2
2m
F. (171)
with k = k
K
_
dV n
K
e
i(Kk)r
. (172)
This only gives a strong contribution when
k = K, (173)
when
F = V n
K
= N
_
cell
n(r)e
iKr
dr = NS
K
, (174)
where S
K
is the structure factor. If there is only one atom in the basis then S
K
is a number. However, if there
is more than one atom in the basis it often makes sense to describe the charge distribution as a superposition of
separate charge distributions (Note that this is not necessary and sometimes not even possible. One can just brute
force compute S
K
, but separating it provides more insight),
n(r) =
j
n
j
(r r
j
), (175)
where the position of the atoms in the basis are given by R+r
j
. The structure factor is then given by
S
K
=
j
_
n
j
(r r
j
)e
iKr
dr =
j
e
iKr
j
_
n
j
()e
iK
d, (176)
where = r r
j
. If we dene the atomic form factor as
f
i
=
_
n
j
()e
iK
d, (177)
44
FIG. 32: Abolute experimental form factor for metallic Aluminium after Batterman, Chipman, and DeMarco. Al has a fcc
structure.
then the structure factor can be written as
S
K
=
j
f
j
e
iKr
j
. (178)
Note that the scattering intensity is the square of the matrix element which involves S
S.
Structure factor of the bcc lattice. We already studied the reciprocal lattice of a bcc crystal using the primitive
vectors, see Eqn. (166) and (167). However, we know that there is an alternative way of looking at the same lattice,
namely as a simple cubic lattice plus a basis 0 and
a
2
( x + y + z) if the cube has sides of length a. The simple cubic
basis vectors are given in Eqn. (165). The structure factor is then
S
K
= f
1
+f
2
e
i2(n
1
+n
2
+n
3
)
. (179)
We can distinguish two distinct situations
S
K
= f
1
f
2
n
1
+n
2
+n
3
= odd integer (180)
S
K
= f
1
+f
2
n
1
+n
2
+n
3
= even integer. (181)
For the situation that the atoms in the basis are equivalent f
1
= f
2
. Let us look at the xy plane. For (0, 0, 0), (2, 0, 0),
(0, 2, 0), (1, 1, 0), and (2, 2, 0), the structure factor is 2. For (1, 0, 0), (0, 1, 0), (2, 1, 0), and (1, 2, 0), S
K
= 0. This
forms a fcc reciprocal lattice as it should. However, the atoms in the basis can also be inequivalent. This is known as
a CsCl structure. In that case, we can observe the reections that are forbidden when the atoms are equivalent.
Momentum dependence atomic form factor. We want to evaluate the atomic form factor. Let us take the classical
limit
f
i
=
_
n
j
(r)e
iKr
dr. (182)
Let us take the angle between r and K to be and assume that the charge distribution is spherically symmetric.
The integral then becomes
f
i
=
_
dr
_
2
0
d
_
0
dr
2
sin n
j
(r)e
iKr cos
= 2
_
drr
2
n
j
(r)
e
iKr
e
iKr
iKr
, (183)
where n
j
(r) is the radial distribution of the charge density, giving
f
i
= 4
_
drr
2
n
j
(r)
sin Kr
Kr
. (184)
45
1
2
I
3
3
3
3
3
3
3 3
2
II
2
III
2
IV
3
I
FIG. 33: The rst three Brillouin zones for a square lattice.
In the limit that the charge density is close to the lattice point, i.e. much smaller than the distance between the atom,
sin Kr/Kr
a
,
a
], where the k-points
a
satisfy the Bragg condition k =
1
2
K, see Eqn. (158). To nd the rst Brillouin zone in two dimensions, we need
to nd the k points that satisfy the Bragg condition. Obviously, we need to unclude the reciprocal lattice vector
K = 0. The next reciprocal lattice vectors are
2
a
x and
2
a
y. This leads to the following Bragg condition in the
x-direction
k ( x) =
1
2
2
a
k
x
=
a
. (186)
Note that there is no restriction on k
y
. The Bragg condition is therefore satised for all points (0,
a
). This point
is more general. The condition k
K =
1
2
K always corresponds to a line perpendicular to
K that crosses the vector
K at half of its length. The next reciprocal lattice vectors are (
2
a
x,
2
a
y) and (
2
a
x,
2
a
y). The resulting lines
where the Bragg condition is satised are shown in Fig. 33. The rst Brillouin zones is the area around K = 0 up to
the rst line that satises a Bragg condition. Note that for a square lattice the rst Brillouin zone is determined by
the four shortest reciprocal lattice vectors (in general, it is not only determined by the shortest). We then calculate
the values of the energies
kK
=
2
2m
(k K)
2
(187)
inside the rst Brillouin zone. In two dimensions, it is no longer practical to show the values for all k points. One
therefore usually shows the bands in particular directions. A typical choice for two dimensions is (
a
,
a
) (0, 0)
(
a
, 0). The energies can be expressed in terms of =
2
2m
2
a
2
. Let us rst consider the direction from (0, 0) (0,
a
).
The energy
k
goes from 0 to . The energies of
kK
with K equal to
2
a
x and
2
a
y are all equal at (0, 0) (often
known as the point). The energy is given by 4. These degeneracies are directly related to the high symmetry
46
k
k
(/a,0)
(0,0)
(/a, /a) (/a, /a)
1
1,2
I
1
=1
=2
=3
=4
3
I
2
I
2
II
2
IV
2
III
2
I
2
II
2
IV
2
III
2
II
3
I
FIG. 34: The free-electron bands in two dimensions along particular directions indicated in Fig. 33.
0
1
2
3
4
0 1 2 3 4 5
a
2
/
FIG. 35: The green line gives the density of states as a function of energy. The red line gives the integrated density of states.
The dotted dark-blue line gives the density of states of the band from the rst Brillouin zone (indicated by 1 in Fig. 34). The
dotted light-blue gives it integral.
of the point. Moving away from the , the bands split:
k
a
x
decreases to and
k+
a
x
increases to 9. The
bands
k
a
y
remain degenerate and increase to 5. The point (
a
,
a
) is again a special point in the Brillouin zone at
the same distance from (0, 0), (
2
a
,
2
a
), (
2
a
, 0), and (0,
2
a
). The lowest four bands are therefore degenerate with an
2. Moving towards (0, 0), with k
x
= k
y
,
k
decreases to zero.
k
2
a
( x+ y)
increases to 8. The band
k
2
a
x/ y
are
degenerate and increase to 4.
Density of states. For many physical quantities the exact dispersion is not important and one is more interested
in the number of states at a particular energy.
technical intermezzo: Although, this density of states could have been calculated easily analytically, it was actually
done numerical (useful when moving away from the free-electron model). How do you do this. There are several
procedure to do this, some of them more ecient than others. Basically one discretizes the rst Brillouin zone in a
grid with a total of N
k
points. and calculates the eigenvalues for each point (since the matrix is diagonal (U = 0), this
47
0
1
0 1 2 3 4 5
D
O
S
/
FIG. 36: Typical numerical problems in calculating the density of states. Dashed blue line: insucient k points taken into
account in the calculation. Black line: improper treatment of the edges of the Brillouin zone.
is trivial). One could have produced a simple histrogram. This works wonders in case of a square density of states,
but does not quickly produce nice results in other cases. Instead each k point has been broadened by a Gaussian
w(E) =
N
k
e
_
E
k
_
2
, (188)
and all the Gaussians are added together. The Gaussian is normalized to give a weight of 1/N
k
. We have taken
= 0.022. The eect of broadening is visible at the edge of the density of states (E = 0). In addition, we have used
N
k
= 200
2
, which gives a nice smooth result. When N
k
= 50 is taken, one observes oscillations, see Fig. 36, due to the
undersampling of the Brillouin zone. The black line in Fig. 36 shows the eect of an improper treatment of the edges
of the Brillouin zone. The calculation uses only a quarter of the Brillouin zone (only positive k), since the other parts
are equivalent. In fact, we really only need 1/8 of the whole Brillouin zone due to symmetry. However, the k points at
the edges are shared by dierent regions. For example, the k points at the edge of the rst Brillouin zone lie for one
half in the rst and for the other half in the second Brillouin zone. When this is not taking into account properly (i.e.
by scaling these k points by a factor of
1
2
, one observes spikes in the density of states, see the black line in Fig. 36.
Obviously, these would become irrelevant when N
k
, by for N
k
= 100
2
they are clearly visible. There exist more
advanced and ecient methods to calculate the density of states (for example, the tetrahedron method, which ef-
fectively interpolates between the dierent k-points), but for a simple two-dimensional problem this works well enough.
The results are shown in Fig. 35. The end result looks simple: the density of states is constant. Obviously, we
could have derived this more easily. However, in general this cannot be done, so this is certainly a useful check to
see whether this routine works properly. Let us derive this density of states analytically. We ignore the spin of the
electron for the moment. Although we have a discrete number of k states, the density is always large enough that
we can approximate this by a continuous function. The amount of k space occupied by one k point is A
1
=
_
2
L
_
2
following from the periodic boundary conditions in Eqn. (24) appplied in both x and y directions. The number of k
states in a certain are can then be obtained by dividing the are by A
1
. For a free-electron model where the energy is
given by
k
=
2
k
2
2m
, the areas of constant energy are circles. The number of electron inside the circle is given by
N(k) = k
2
/A
1
= k
2
_
L
2
_
2
=
A
4
k
2
, (189)
where A is the area (i.e. the size) of the system. This number can be easily written in terms of energy
N(E) =
A
4
2m
2
E or n(E) =
N(E)
A
=
1
4
2m
2
E, (190)
48
FIG. 37: The Fermi surfaces for dierent number of electrons per site. The 1
st
, 2
nd
, and 3
rd
indicate from which Brillouin
zone the bands originate.
where n(E) is the density of electrons. Often we are also interested in what the density is for a certain energy
dn(E) = (E)dE. This gives the density of states
(E) =
dn(E)
dE
=
1
4
2m
2
. (191)
which is indeed a constant. The plot uses some normalizations. The energy is normalized to the value at (0,
a
),
i.e., =
2
2m
_
a
_
2
. Figure 35 also shows the integrated intensities. Furthermore, if we had one electron per site, the
electron density would be 1/a
2
. However, it is more convenient to think about 1 el/site, so let us multiply times a
2
.
(E)a
2
= a
2
1
4
2m
2
_
a
_
2
_
a
_
2
=
4
= 0.785, (192)
which is the value of the constant density of states in the gure. It is interesting to look at only the band in the rst
Brillouin zone (see the dotted lines). Note that the density of states is no longer constant. You can easily observe
this by drawing circles around the point. For values of k >
a
these turn into arcs, up to k =
a
(with energy 2),
where only one point at the edge of the Brillouin zone is left (actually, there are four points, but since al four points
are shared by four Brillouin zones there is only 1 point). The integral over the rst Brillouin zone goes to one. The
number of states in the rst Brillouin zone can be easily calculated
N
rst
=
_
2
a
_
2
_
L
2
_
2
=
L
2
a
2
=
A
a
2
= N. (193)
Or exactly one state per site. This is not too much dierent from the tight-binding model we saw earlier. There we
started from one state per site and lled up the entire rst Brillouin zone (in one dimension). Again, by imposing the
periodicity of the lattice, we are eectively creating orbitals on each site.
However, we have not applied any potential yet, so we are not going to ll up this orbital but ll up the free-electron
bands. The k-value for electrons per site is determined by
N = 2k
2
_
L
2
_
2
k
= 2
_
2
N =
_
2
a
(194)
49
E
k
(/a,0)
(0,0)
(/a, /a) (/a, /a)
E
F
(U=0), n=1
E
F
(U>0), n=1.4
E
F
(U>0), n=1
E
F
(U=0), n=1.4
FIG. 38: The dashed lines show the free-electron model in two dimensions. The red lines indicate the eect of applying a
potential U. The horizontal lines indicate the Fermi levels for 1.7 electrons per site.
where the factor 2 takes acount of the fact that each state has spin-up and a spin-down component. The corresponding
energy is given by
E
=
2
2m
k
2
=
2
. (195)
The maximum energy in the system (at least at T = 0) is known as the Fermi energy E
F
. For = 1 (1 electron
per site), the Fermi energy is 0.637 with a corresponding Fermi wavevector of 0.798
a
. The lies well inside the
rst Brillouin zone. This means that only the lowest band cross the Fermi level, see Fig. 34. This is essentially
the Sommerfeld model, since the presence of nuclei can be essentially neglected. For = 2, E
F
= 1.27. The
corresponding wavevector k = 1.13
a
lies outside the rst Brillouin zone. Although the pictures are easily constructed
by drawing a circle through the dierent Brillouin zones in Fig. 33, it is important to learn to think in the rst
Brilllouin zone, since no state-of-the-art calculations use free-electron approximations anymore. Things become more
obvious when we compare the Fermi surfaces in Fig. 33 with the bands in Fig. 34. For k from (0, 0) to (
a
, 0), we see
that the band from the rst Brillouin zone is entirely below E
F
for = 2. In Fig. 33, we see that in this direction
there is no Fermi surface in for the bands coming from the rst Brillouin zone (red), but we do see a Fermi surface
in the bands coming from the second Brillouin zone (blue). However, when going from (0, 0) to (
a
,
a
), we see that it
is the band from the rst Brillouin zone that crosses the Fermi level and we see that we have a Fermi surface in this
direction. When going to three electrons per site ( = 3), the part of the Fermi surface related to the bands from
the second Brillouin zone increases, but the rest remain similar. Note that along the (0, 0) (
a
,
a
), there is still a
Fermi surface since
_
6
<
2. When going to = 4, we also start involving the band coming from the third Brillouin
zone. Note that along the (0, 0) (
a
,
a
), rst the band from the second Brillouin zone crosses the Fermi level and
then the band from the third Brillouin zone. Therefore in the direction, we have two Fermi surfaces (blue and green).
However, along the (0, 0) (
a
, 0) direction, there is only the band from the second Brillouin zone crossing the Fermi
level.
XVI. NEARLY-FREE ELECTRON IN TWO DIMENSIONS
So far, we have shown how the free-electron bands are folded back into the rst Brillouin zone when the proper
crystal symmetry is applied. However, in the end, this does not change anything. Without potential, you will simply
end up with the Sommerfeld model. You cannot fool nature, you can pretend there are atoms, but without potential
they are still free electrons. Obviously, there are potentials and even if they are small, they will have an eect close to
the Bragg planes (okay, Bragg lines in two dimensions). For any nite U gaps will open up when the Bragg condition
is satised. However, when U is small the eects will be small as well and the Fermi surfaces will not be too much
50
0
0.5
1
1.5
2
-0.5 0 0.5 1 1.5 2 2.5
(
E
)
E
E
F
(U=0, n=1.4) E
F
(U>0, n=1.4)
FIG. 39: The density of states (E) (green) without (dotted) and with U (solid) in the (nearly) free electron model. The red
lines give their integrated intensity. The vertical lines indicate the Fermi level for 1.4 electrons per site. Since the density of
states is shown for only one spin degree of freedom, they occur when the integrated density of states equals 0.7. The density
of states is shown for the same energy region as the band structure in Fig. 38.
dierent from the free-electron ones in Fig. 37. Let us consider a larger U value and a density of one electron per site
(n = 1). We directly observe a substantial change in the band structure. Degeneracies are lifted and gaps open up,
Fig. 38. Since the band structure is more complicated, we can no longer easily determine the values of E
F
and k
F
.
However, we can calculate them numerically, by determining the density of states (E), see Fig. 39. E
F
is the energy
for which the integral of (E) with respect to energy equals the number of electrons (note that the density of states
is plotted for one spin degree of freedom (up spin or down spin), so we need to integrate up to 0.5). The density of
states is substantially dierent from the free electron one. First, we observe sharp peaks. They are called van Hove
singulaties. They are related to at-band regions. The van Hove singularity just below the Fermi surface for U > 0
is a result of the at-band in the lowest band around (
a
, 0). In addition, we observe gaps in the density of states.
For those energies there are no states in the band structure. Note that the band structure is only given for particular
directions. It is dicult to see from the bands if there is a gap for any k. However, (E) clearly shows the absence
of states. The integrate intensity of (E) up to the gap equals one (or two, if we take both spin degrees of freedom
into account). Therefore, if there were two electrons per site in the system, the material would be a semiconductor.
Figure 40 shows the Fermi surfaces for U = 0 and U > 0 with a density of n = 1. Note that they look very similar.
The surface for U > 0 has some slight distortion but it is close to a circle despite the relatively large value of U. The
reason for that is that the states that are at the Fermi level are not close to satisfying the Bragg condition. This is
the key ingredient for the surprising success of the Sommerfeld model for various materials. Let us see what happens
if the Fermi was close to the states at the edges of the Brillouin zone by changing the number of free electrons to 1.4
51
FIG. 40: The Fermi spheres for the band structures in Fig. 38 without (left) and with U (right).
per site. (This could be done with doping. However, the goal of this excersize is to demonstrate an eect that occurs
in noble metals, where 3d bands play an important role). As expected, when increasing U, gaps open up, see Fig.
38. However, let us consider 1.4 electrons per site. Figure 40 shows that for U = 0, the Fermi sphere is circular (as
we would have expected). However, for U > 0, the Fermi surfaces touches the edges of the Brillouin zone (of course,
the example is specically chosen that this happens). At rst, this looks comparable to what happens for the bands
coming from the rst Brillouin zone (red) in Fig. 37 for = 2. You might expect a picture of the surface for bands
coming from the second Brillouin zone (blue in 40). However, there is none. The trick to understand this is to follow
the Fermi energy in dierent directions. Along the (0, 0) (
a
, 0) directions for U = 0, we cross the free-electron
band centered around (0, 0) for a lling of 1.4 electrons per site, see Fig. 38. For U > 0, a gap opens up and this band
is pushed below the Fermi level. For two electrons per site, the E
F
is larger, see = 2 in Fig. 34, and the Fermi level
crosses the free-electron band centered around (
2
a
, 0), i.e. there is a Fermi surface for bands coming from the second
Brillouin zone, see Fig. 37. On the other hand, the second band for U > 0 is at higher enegy due to the opening
of a gap, see Fig. 38. We can no longer identify this as the band coming from the second Brillouin zone, since the
free-electron bands from the rst and second Brillouin zone are mixed by the periodic potential U. Let us now look
along the (
a
, 0) (
a
,
a
) direction. For a lling of 1.7 electrons per site, the Fermi level does not cross a band, see
Fig. 38. Since the Fermi surface lies entirely inside the rst Brillouin zone, , see Fig. 40, we do not expect a crossing
since the direction (
a
, 0) (
a
,
a
) lies at the edge of the Brillouin zone. For a lling of 2 electrons per site, see = 2
in Fig. 37, we do cross a band. In fact, two bands are crossed since the bands coming from the reciprocal lattice
vectors K = (0, 0) and (
2
a
, 0) are degenerate along the (
a
, 0) (
a
,
a
) direction. This can be understood by noting
that both K are always at the same distance from the line (
a
, 0) and (
a
,
a
). Note that there is a Fermi surface in
this direction for bands coming from the rst (red) and second (blue) Brillouin zones in Fig. 37. For a nite U, this
degeneracy is lifted, see Fig. 38. The Fermi level for a lling of 1.7 electrons per site just crosses the lowest band
and the Fermi surface therefore lies entirely inside the rst Brillouin zone. For the (0, 0) and (
a
,
a
) direction, the
situation is still very similar for U = 0 and U > 0, see Fig. 38.
52
FIG. 41: The tight-binding dispersion
k
for a two-dimensional square lattice along particular directions in the Brillouin zone.
The inset shows the Fermi surface in for a lling of one electron per site.
XVII. TIGHT-BINDING IN TWO DIMENSIONS
The derivation of the tight-binding dispersion in two-dimension is very similar to that in one dimension in Eqn.
(33)
k
= t
e
ik
= t(e
ik
x
a
+e
ik
x
a
+e
ik
y
a
+e
ik
y
a
) = 2t(cosk
x
a +cosk
y
a). (196)
We can plot this curve by hand. Along the (0, 0) (
a
, 0), we have 2t cos k
x
a, equivalent to the one-dimensional
dispersion; along the (
a
, 0) (
a
,
a
), we obtain 2t(1 + cos k
y
a); and along (0, 0) (
a
,
a
), the dispersion is
4t cos ka with k = k
x
= k
y
, see Fig. 41. The density of states (E) is shown in Fig. 42. Since the band is made
up of n 1s electron per site, the total band can accomodate two electrons (taking spin into account). The density
of states is also symmetric around E = 0. This might not be directly obvious by looking at the dispersion curves in
Fig. 41. However, displacing k by (, ) has as result that
k
k
. This directly implies that the Fermi level for
one electron per site is at E = 0. The inset in Fig. 41 shows that the Fermi surface is a square. This can be easily
understood since E = 0 implies cos k
x
+cos k
y
= 0, which is satised, for example when k
y
= k
x
and all the other
straight lines on the Fermi surface.
Another interesting thing to not is the remarkable similarity between the tight-binding dispersion in Fig. 41 and
the lowest band in the two-dimension nearly-free electron model in Fig. 38. Despite the entirely dierent starting
points, the resulting physics is very similar.
XVIII. THE PERIODIC TABLE
Up to now, we have seen discussed the nearly-free electron and the tight-binding approach to the electronic structure
of solids. In the end, we found similar results even though we started from opposite points of view. We now would
like to discuss the electronic structure of several elements. Even starting from a free-electron model, we noticed the
formation of atomic-like structure around the nuclei in the solid. Let us briey recapitulate some of the results from
atomic physics. The eigenfunctions for the Hydrogen atom were obtained by solving the Schrodinger equation
2
2m
2
(r) +V (r)(r) = E(r), (197)
53
0
0.2
0.4
0.6
0.8
1
-4 -2 0 2 4
(
E
)
E/t
FIG. 42: The density of states (E) for tight-binding dispersion
k
, shown in green. The red line shows its integral.
is the potential from the nucleus. This was solved by separation of variables leading to eigenfunctions that can be
written as
nlm
(r, , ) = R
nl
(r)Y
lm
(, ), (198)
where R
nl
are Laguerre polynomials and Y
lm
(, ) spherical harmonics. The quantum numbers are given by n =
1, 2, 3, , l = 0, 1, , n1, and m = l, l+1 , l. The orbitals are commonly denoted as s, p, d, f for l = 0, 1, 2, 3,
respectively. Although the solution is exact for Hydrogen, the interactions between the electrons does not allow us to
obtain exact solutions for larger atomic numbers. There are two approaches. We can try to do our best to treat the
electron-electron interactions as well as possible. This leads to typical atomic physics (Hunds rule, local moments
etc, see lecture notes from 560/1). However, this is many-body problem treating all the electrons explicitly and not
a feasible approach for solid-state physics. The other approach is to assume that the electron only feels an average
potential as a result of the presence of other electrons. This eectively modies the potential V (r). For example,
one can easily understand stand that the 3s electron does not feel the potential of the eleven protons in the nucleus
(a Na
11+
ion), but an eective potential due to the nucleus plus all the strongly bound electrons in the 1s, 2s, and
2p shells. However, we might not be able to calculate the exact shape of the potential and therefore the radial
wavefunction R
nl
(r).
A phenemenological scheme of lling the orbitals is
6s 6p
5s 5p 5d 5f
4s 4p 4d 4f
3s 3p 3d
2s 2p
1s
. (199)
How this works out for the periodic table is shown in Fig. 44.
When we look at solids, we can still observe the atomic character of the orbitals although often orbitals get mixed
and we nd states with, for example, both s and p character. Also, in solids we cannot easily determine the radial
wavefunction R
nl
without doing extensive calculations. However, some symmetry aspects still remain. For example,
the 1s and 2s orbitals have the same spherical part of the wavefunction (Y
00
(, ) = 1/
100
and
200
have to be orthogonal. This directly implies that the radial wavefunction of the 2s orbital R
20
must
have a node (and positive and negative parts), since R
10
does not have a node. This nodes make the orbitals more
extended in space. This leads us to another simple rule of thumbs. The rst orbitals of its kind are small. This means
the 1s, 2p, 3d, and 4f. Of particular importance for condensed-matter physics are the compounds where the valence
band contain 3d and 4f orbitals. Since the orbitals are small Coulomb interactions are important and we expect the
nearly-free electron model to work less well.
XIX. BAND STRUCTURE OF SELECTED MATERIALS: SIMPLE METALS AND NOBLE METALS
Let us rst look at the situation of simple metals. Where can we nd those in the periodic table? First, we do
not want small orbitals, because then strong electron-electron interactions play an important role. This rules out
all transition-metals (such as iron and cobalt), and all the rare earths. We want materials with s and p electrons.
However, some of these are usually only present as gases (such as He) often in the form of molecules (such as H
2
,
N
2
, and O
2
). Certain combinations of s and p electrons have the tendency to form sp hybrids (for example in Si and
carbon compounds such as graphene and diamond), which will be discussed later. We are therefore left with elements
from groups 1A (Na, K, Rb), 2A (Mg, Ca), and 3A (Al). Let us consider Aluminum.
Al has a face-centered cubic lattice (fcc), see Fig. 45. The lattice vectors are
a
1
=
a
2
( y + z), a
2
=
a
2
( z + x), a
3
=
a
2
( x + y). (200)
The reciprocal lattice vectors can be obtained using Eqn. (57)and are
b
1
=
4
a
1
2
( y + z x), b
2
=
4
a
1
2
( z + x y), b
3
=
4
a
1
2
( x + y z), (201)
which corresponds to the lattice vectors of a body-centered cube (bcc). The rst Brillouin is shown in Fig. 46.
Bragg reection occurs in planes perpendicular to the reciprocal lattice vectors at half the length of the vector. The
FIG. 43: The periodic table of elements.
55
FIG. 44: The table shows which shell are being lled when going through the periodic table.
nearest-neighbors in reciprocal space are at the centers of the cube along the (111) directions, giving rise to the largest
planes in the rst Brillouin zone. The next-nearest neighbors are along the sides of the cube in the (100) directions.
The electronic structure is shown in Fig. 47, where is is compared to a free-electron model. The periodic potential is
smaller than we chose for the two-dimensional example. The gap at point X in reciprocal space is less the 0.1 Ry or
of the order of one electron volt. We also observe (for example, along the XW direction), the lifting of degeneracies
due to the periodic potential. However, despite the added complexity of dealing with a fcc lattice, the (nearly) free
electron model is surprisingly successful.
There is another class of compounds, where the free-electron model is useful. These are the noble metals. Although
FIG. 45: The face-centered cubic lattice.
56
FIG. 46: The rst Brillouin zone for Aluminum. Since Al is fcc, the reciprocal lattice vectors form a bcc lattice. The lines
indicate the directions taken in plotting the electronic structure in Fig. 47.
[b]
FIG. 47: The electronic structure of Aluminum along several directions. The dashed lines indicate the results for the free-
electron model. From Segall, Phys. Rev. 124, 1797 (1961).
transition metals, the d shell is full for the noble metals. For example, the atomic structure of copper is [Ar]3d
10
4s
1
,
where [Ar] indicates the same shell as the noble element argon. These shells (1s to 3p) are all full and deep-lying
core levels. The relevant shells are the 3d and 4s levels. Note that even for atomic copper, the 3d shell if full. This
is an exception to the phenomenological diagram that indicate that the 4s shell is lled rst. This is true for all the
other 3d transition metals that have a conguration 3d
n
4s
2
. However, the lled 3d shell is so stable, that it eectively
steals an electron from the 4s shell. Note that the next element, zinc, has a [Ar]3d
10
4s
2
conguration, again with
the 3d shell lled. When looking the electronic structure of copper, see Fig. 48, we expect signicantly more bands
due to the presence of the 3d states. Since copper also has an fcc structure, we can try to compare it to Al, see
Fig. 47. The rst notable dierence is the cluster of bands with relatively narrow dispersion. These are the ve 3d
57
FIG. 48: The electronic structure of copper along several directions. The dashed lines indicate the results for the free-electron
model. From Burdick, Phys. Rev. 129, 138 (1963).
bands (sometimes we see fewer bands due to degeneracies along particular directions). The 3d bands are below the
Fermi level, corresponding to the fact that the 3d states are full. There is also one band which has a large dispersion
(indicated by
1
in the gure. This corresponds to the 4s states. The dierence is due to the hybridization matrix
elements. The dd matrix elements are much smaller due to the smaller radial extent of the 3d orbitals compared to
the 4s orbitals. However, the band related to the 4s does not simply cross the 3d bands. Along the X direction, it
more or less looks like the 4s band merges with the 3d bands, and then reemerges at higher energies. This is a result
of the nite matrix elements between the 4s and 3d orbitals. The dispersion of the 4s band is similar to the band
indicated by
1
for Al, see Fig. 47.
A good technique to determine the density of states is photoemission. Photoemission uses the photoelectric eect.
The kinetic energy E
kin
of the outgoing photoelectrons is measured and can be directly related to their binding
FIG. 49: The Fermi surface of copper (AM page 290). The left shows the Fermi surface in three dimensions. The right shows
the Fermi surface along particular cuts through the First Brillouin zone. The dashed line shows the free-electron result.
58
FIG. 50: Valence band photoemission of copper and silver from Eastman and Cashion, Phys. Rev. Lett. 24, 310 (1970) using
dierent energies for the incoming photon. Note the onset of the d band at 2 and 4 eV below the Fermi level for Cu (3d) and
Ag (4d), respectively. The bands crossing the Fermi level are the 4sp and 5sp bands, respectively.
energies E
b
via
E
kin
= E
b
, (202)
where (or h) is the energy of the photon. The spectra for copper and silver are given in Fig. 50. Dierent photon
energies are used. The spectra look dierent due to changes in the matrix elements as a function of photon energy.
What is clear is the strong emission from the d bands. The onset of is around 2 eV for copper and 4 eV for silver.
The band width for copper is smaller than that of silver (this is most clearly seen in the spectra taken at 21.2 eV).
This is in agreement with our notion that the 3d bands are smaller and therefore more localized than the 4d bands.
The density of states at the Fermi level is rather small and due to the sp bands.
Another interesting eect occurs at the L point in the Brillouin zone. The Fermi surface lies just above the band at
the edge of the Brillouin zone. This situation is very similar to our example for the nearly-free electron model in two
dimensions for n = 1.4 electrons per site, see Fig. 38 (in three dimensions, the L direction is (111), whereas in two
dimensions the (0, 0) ( in two dimensions) to (
a
,
a
) direction is (1, 1). Note that the plot directions are opposite).
We saw that in two dimensions a neck was formed touching the edge of the Brillouin zone. A comparable eect occurs
in three dimension, see Fig. 49, where we nd neck formation along the (111) directions.
59
XX. THERMAL PROPERTIES
In this section, we are going to study the thermal properties of a solid. In particular, we will look at the specic
heat
C =
U
t
. (203)
Let us remind ourselves of the classical gas. For an ideal gas, we are dealing with the equipartion theorem that states
that each particle has an energy
1
2
k
B
T. For three dimension, we then have
U =
3
2
k
B
T C =
U
t
=
3
2
k
B
, (204)
which means a constant specic heat. In a solid, there are contributions to the specic heat due to phonons and
electrons.
A. Electrons
Up till now we have considered the (nearly) free-electron model in one and two dimensions. Let us have a look at
the free-electron model in three dimensions. The Schrodinger equation is given by
H(r) = E(r)
1
2m
2
(r) = E(r). (205)
The solutions are plane waves
(r) =
1
V
e
ikr
with
k
=
2
k
2
2m
. (206)
As is common in condensed-matter physics, we apply periodic boundary conditions in all three directions, which give
k
i
= 0,
2
L
,
4
L
, (207)
where for simplicity we have assumed that the solid is a cube with sides L. For a suciently large systems, this
reproduces the bulk properties. Whereas in the nineteenth century, rst attempts by Drude at describing the electronic
properties of a solid assumed that the electrons behaved as classical particles (no surprise, since quantum mechanics
was not yet developed). However, this model fails in describing many properties and a treatment using quantum
statistics is essential. The average occupation of a level i with energy
i
is given by
n
i
) =
n
ne
n
i
n
e
n
i
(208)
with = 1/k
B
T. Electron are fermions and need to obey Paulis principle. Therefore, n can only assume the values
n = 0, 1.
n
i
) =
e
i
1 +e
i
=
1
e
i
+ 1
. (209)
However, this does not quite work. For T = 0, all states with
i
< 0 would be occupied and all states with
i
> 0
would be empty. However, at T = 0, the energy up to which the states are occupied is determined by the chemical
potential (or the Fermi energy). Substituting
i
E gives the Fermi-Dirac distribution function
f(E) =
1
e
E
k
B
T
+ 1
, (210)
where k
B
is Boltzmanns constant and T is the temperature; is the chemical potential. At T = 0, all the electron
levels below the chemical potential are full and those above are empty. At all temperatures, f() =
1
2
. In a free-
electron model, the position of the chemical potential or the Fermi energy at T = 0 is determined by the number of
electrons. To calculate this, we rst need to nd an expression for the number of levels or the density of states. The
60
k values are quantized in all three directions, and one particular k point occupies a volume of (2/L)
3
in k space.
Since the number of k states is very large (of the order of 10
23
), the Fermi surface is essentially a sphere. The number
of states is then
N = 2
_
L
2
_
3
4
3
k
3
F
=
V
3
2
k
3
F
, (211)
with V = L
3
and where the factor two comes from the spin degrees of freedom for each state. The Fermi wavevector
is then
k
F
=
_
3
2
N
V
_
1/3
(212)
giving a Fermi energy of
E
F
=
2
k
2
F
2m
=
2
2m
_
3
2
N
V
_
2/3
. (213)
Using this expression, we can also nd the number of states whose energy is less than E
N =
V
3
2
N
_
2mE
2
_
3/2
. (214)
When changing the energy dE, the number of states change by
(E) =
dN
dE
=
V
2
2
N
_
2m
2
_
3/2
E. (215)
The Fermi-Dirac statistic are the major cause for the dierence between the classical and quantum-mechanical
specic heats. Classically, a change in temperature aects all particles. Quantum-mechanically, only the electrons
close to the Fermi level are aected. Let us try to estimate the eect. The energy associated with the temperature
(k
B
T
= 25 meV at room temperature) is generally a lot smaller than the Fermi energy (typically of the order of
electron volts). The percentage of electrons that are thermally excited is of the order of T/T
F
. The change in energy
is of the order of k
B
T. The thermal kinetic energy (the kinetic energy at a nite temperature compared to the energy
at T = 0) is then
U
= N
T
T
F
k
B
T. (216)
This leads to a specic heat of
C =
U
t
= Nk
B
T
T
F
. (217)
We note the dierence between the specic heat for classical free particles, which is independent of energy. The specic
heat using the Fermi-Dirac function is temperature dependent. In addition to the change in energy as a function of
temperature, the number of electrons contributing to the specic heat also changes as a function of temperature. In
addition, the specic heat is only on the order of a few percent of the classical specic heat. This is due to the ratio
T/T
F
. At room temperature k
B
T
= 25 meV, so the thermal energy is of the order of meV. However, the Fermi energy
is generally of the order of several electron volts.
Let us now derive a more quantitative expression for the electronic heat capacity valid at temperatures for which
k
B
T E
F
. The change in energy at nite temperatures is given by
U =
_
0
dE E(E)f(E)
_
E
F
0
dEE(E), (218)
where f(E) is the Fermi-Dirac function and (E) is the density of states. We multiply the identity
N =
_
0
dE(E)f(E) =
_
E
F
0
dE(E) (219)
61
by E
F
to obtain
_
_
E
F
0
+
_
E
F
_
dEE
F
(E)f(E) =
_
E
F
0
E
F
(E). (220)
We can use this to rewrite Eqn. (221)
U =
_
E
F
dE (E E
F
)(E)f(E) +
_
E
F
0
(E
F
E)[1 f(E)](E). (221)
The rst term is the energy needed to take electrons from the Fermi level to the states above the Fermi level; the
second term is the energy needed to take electrons from below the Fermi to the Fermi level. Both terms are positive.
The heat capacity can be found by taking the derivative with respect to the temperature T. The only temperature
dependent term is the Fermi-Dirac function. Regrouping gives
C
el
=
dU
dT
=
_
0
dE (E E
F
)(E)
df(E)
dT
. (222)
At temperatures T E
F
, we see that the derivative is only large close to the Fermi level. Taking the density of
states constant gives
C
el
= (E
F
)
_
0
dE (E E
F
)
df(E)
dT
. (223)
There are some small dierence between the Fermi energy E
F
and the chemical potential . The chemical potential
is the point where f() =
1
2
. The Fermi energy is the energy where N =
_
E
F
0
dE(E)f(E). However, for k
B
T E
F
,
this dierence is negligible and we take
= E
F
. Taking the derivative of the Fermi-Dirac function with = k
B
T
gives
df(E)
d
=
E E
F
2
exp
EE
F
(exp
EE
F
+ 1)
2
. (224)
Setting x =
EE
F
and using
d
dT
= k
B
, we obtain
C
el
= (E
F
)
_
0
dE(E E
F
)
df(E)
d
d
dT
(225)
= (E
F
)
_
0
d
_
E E
F
_
(E E
F
)
2
2
exp
EE
F
(exp
EE
F
+ 1)
2
k
B
(226)
= k
2
B
T(E
F
)
_
E
F
/k
B
T
dxx
2
e
x
(e
x
+ 1)
2
. (227)
We can safely replace the lower limit by since the integrand is negligible in this region. The integrand is given by
_
dxx
2
e
x
(e
x
+ 1)
2
=
2
3
. (228)
This gives a heat capacity
C
el
=
2
3
(E
F
)k
2
B
T. (229)
The density of states at the Fermi level can be rewritten as
(E
F
) =
3N
2E
F
=
3N
2k
B
T
F
, (230)
giving
C
el
=
2
2
Nk
B
T
T
F
, (231)
apart from a factor, in agreement with our earlier estimate.
62
B. Phonons
In solids, there is also a heat capacity due to phonons. Let us start by calculating the density of states. The volume
occupied by one k point is again given by
_
2
L
_
3
=
8
3
V
. (232)
The number of states in a volume in reciprocal space around k = 0, is then given by
N =
V
8
3
4
3
k
3
=
V
6
2
k
3
, (233)
for each polarization. We obtain for the density of states
(E) =
dN
d
=
V
2
2
k
2
dk
d
. (234)
Although, in principle, it is possible to calculate numerically the density of states for any possible dispersion
relationship, it is more insightful to obtain analytical results for some simplied dispersions. In our calculation of
phonons in one dimension, we noticed that close to k
2
. (236)
A cut-o frequency
D
is determined from Eqn. (233)
D
=
6
2
v
3
N
V
. (237)
We also need the occupation. The average occupation is given by
n) =
n
ne
n
n
e
n
=
1
n
ln
_
n
e
n
_
. (238)
The summation is a simple geometric series
n
e
n
= 1 +e
+e
2
+ =
1
1 e
(239)
This gives
n()) =
1
ln
1
1 e
=
1
ln(1 e
) =
e
1 e
(240)
or
n) =
1
e
1
(241)
This is the well-known Planck distribution law.
We can now proceed with the calculation of the specic heat by noting that the thermal energy is given by
U =
_
d()n()) =
_
D
0
d
V
2
2
v
3
2
e
1
, (242)
63
for each polarization type. For simplicity, we assume that all polarizations have the same dispersion and the total
thermal energy is obtained by multiplying by a factor 3,
U =
3V
2
2
v
3
_
D
0
d
3
e
1
=
3V
2
2
v
3
_
k
B
T
_
4
_
x
D
0
dx
x
3
e
x
1
, (243)
where x = /k
B
T and x
D
=
D
/k
B
T = T
D
/T with the Debye temperature T
D
=
D
/k
B
. Using the expression
for the Debye frequency in Eqn. (237), we can express
T
D
=
D
k
B
=
_
6
2
N
V
_
1/3
v
k
B
. (244)
This gives a total phonon energy of
U = 9Nk
B
T
_
T
T
D
_
3
_
x
D
0
dx
x
3
e
x
1
. (245)
The heat capacity is found from
C
ph
=
U
T
=
3V
2
2
v
3
T
__
D
0
d
3
e
1
_
=
3V
2
2
v
3
_
_
D
0
d
3
(e
1)
2
_
e
1
k
B
T
2
_
(246)
=
3V
2
2
2
v
3
k
B
T
2
_
k
B
T
_
5
_
x
D
0
dx
x
4
e
x
(e
x
1)
2
(247)
= 9Nk
B
_
T
T
D
_
3
_
x
D
0
dx
x
4
e
x
(e
x
1)
2
(248)
For T T
D
, it approaches C
ph
= 3Nk
B
. The heat capacity approaches zero for T T
D
. In this limit, the upper
limit of the integral approaches innity and the integral becomes
4
/15 and e
x
1. The thermal energy is then
U =
3
5
4
Nk
B
T
4
T
3
D
(249)
The specic heat is then
C
ph
=
12
5
4
Nk
B
_
T
T
D
_
3
. (250)
This is known as the Debye T
3
approximation. An intuitive understanding can be obtained by assuming that only a
fraction of the total states are occupied. This volume fraction is determined by the temperature, where vk
T
= k
B
T,
giving a thermal wavevector k
T
= k
B
T/v. The volume fraction is therefore determined by the excited states relative
to the total number of states (determined by the Debye frequency) and is (T/T
D
)
3
. Each of the excited moments
has a thermal energy k
B
T. The total energy is then approximately 3Nk
B
T(T/T
D
)
3
. The specic heat is therefore
12Nk
B
(T/T
D
)
3
.
Einstein model. The Einstein model assumes a dispersion that is constant as a function of k. Alternatively, one can
view this as N independent oscillators. The thermal energy is then
U = Nn()) =
N
e
k
B
T
1
, (251)
where, for convenience, we write instead of
0
. The heat capacity is then
C
ph
=
U
T
= Nk
B
_
k
B
T
_
2
e
k
B
T
(e
k
B
T
1)
2
. (252)
In three dimensions, N should be replaced by 3N. In the high-temperature limit, we obtain again 3Nk
B
which is the
Dulong-Petit value.
64
Experimental heat capacity. At temperatures much below the Debye temperature and the Fermi temperature the
heat capacity can be written as the sum of both the electron and phonon contributions
C = C
el
+C
ph
= T +AT
3
. (253)
It is often convenient to plot this as
C
T
= +AT
2
. (254)
The observed values of the coecient are of the expected magnitude but the eective electron masses do not
correspond closely to the free-electron value. One often denes a thermal eective mass dened as
m
th
=
observed
free
m. (255)
The deviations in the electron model are directly related to the changes in dispersion as observed in the nearly-free
electron model or the tight-binding model. The changes can be dramatic and up to two orders of magnitude for
rare-earth compounds known as heavy-fermion systems.
XXI. OPTICAL PROPERTIES
In a regular course on electricity and magnetism, the electronic structure is treated in a very approximate way (this
section will, unlike AM, follow SI units). The assumption is that an external electric eld induces a polarization P
counteracting the external eld. The external eld is given by the displacement D which can be related to external
charges that create the external eld. The external eld and the polarization together produce the electric eld inside
the solid
E =
1
0
(DP), (256)
where
0
is the permittivity of free space (vacuum). We know from Gausss law that E = /
0
, where is the
charge density. Taking the divergence of the above equation gives
0
E = D P =
free
+
ind
, (257)
where we have used D =
free
and P =
ind
, giving the free and induced charges, respectively. The assumption
is now that the polarizability is directly proportional to the electric eld
P =
0
E, (258)
where is known as the electric susceptibility. This quantity is supposed to represent all the electronic structure of
the solid. Often the solid is represented as a simple electron gas. Although this assumption works often surprisingly
well (it explains for example why light is reected from metals, giving them the mirror-like appearance), it does not
work well when considering more details (for example, why is copper reddish, silver silver, and gold yellow). To obtain
some understanding about these details, we have to understand the interaction between light and solids in terms of
transitions in the electronic structure. However, let us rst continue with the standard E&M approach.
Inserting Eqn. (258) into (256), gives
E =
1
0
(D
0
E) E =
1
0
(1 +)
D =
D
r
=
D
, (259)
where we have introduced the relative permittivity
r
= 1 + , also known as the dielectric constant. We see that
once we have found , we can also directly relate the electric eld inside the solid to the externally applied eld.
The Lorentz oscillator. Let us rst consider the example of a Lorentz oscillator. This classical model is useful since
it can be related to quantum-mechanical interband transitions in a solid. We can also reduce it to the Drude model
which quantum mechanically includes all intraband transitions (i.e. no scattering with reciprocal lattice vectors).
The quantum-mechanical theory can therefore be seen as an extension of its classical analog. We start by considering
Newtons equation for an electron in an electric eld
F = ma eEmv m
2
0
r = ma. (260)
65
Classically, this means that the electron feels an electric force eE. It is held in its place by an eective spring force
m
2
0
r from the nucleus. In addition, it feels a viscous damping mv. Quantum mechanically, we will see that
0
corresponds to the energy of inter/intraband transtions. The damping is due to scattering giving a nite lifetime
(and therefore a broadening ) to the electronic states. Eqn. (260) leads to a dierential equation
d
2
r
dt
2
+
dr
dt
+
2
0
r =
e
m
E. (261)
If the local electric eld varies as e
it
, then r e
it
has a similar time dependence. Since r gives the displacement
of the electron with respect to the nucleus, this gives an induced dipole moment
r =
eE
m
1
2
0
2
i
p = er = ()E with () =
e
2
m
1
2
0
2
i
. (262)
The total polarization per unit volume is given by
P =
N
V
p = n()E
0
E =
n()
0
, (263)
where n = N/V is the electron density. This gives for the dielectric constant
r
= 1 + = 1 +
n()
0
= 1 +
ne
2
m
0
1
2
0
2
i
. (264)
We can reduce this to the Drude result by realizing that the Drude model excludes the attractive potential of the
nuclei (
0
= 0) and does not include scattering (). This gives
r
= 1
ne
2
m
0
1
2
= 1
2
p
2
, (265)
where
2
p
= ne
2
/
0
m is the plasma frequency. This result basically has the electron gas moving along with the electric
eld.
Let us see what a typical value is for the plasma frequency. Let us consider aluminum. Al is face-centered cubic
with a lattice constant a = 4.05
A. Since this is fcc there are four atoms in the cube, see Fig. 45 (eight at the
corners that count for 1/8 and six at the sides that count for a 1/2). Al has a 3s
2
3p conguration, so each Al atom
contributes three electrons. This gives a density n = 4 3/(4.05 10
10
)
3
= 1.8 10
29
m
3
. Inserting the numbers
gives
p
= 2.4 10
16
s
1
or in energy = 99 eV. This is undoubtedly going to be too high since we know that the
Drude model neglects the Fermi-Dirac statistics and therefore overestimates n.
Interaction of light with a solid. This is all nice, but how does this aect the interaction of light with a solid, i.e.
the optical properties of a solid? Let us go back to Maxwell equations. Two of them include the divergence of the
electric and magnetic elds
D =
free
(
R
E) =
free
= 0, (266)
in the absence of an applied electric eld. Some care should be taken here with the permittivity. We use here a real
permittivity
R
= Re[]. As we will see the complex part will be related to the currents. Furthermore, we also have
for the magnetic induction B = 0. The other two involve the rotation of the magnetic and electric elds.
E+
B
t
= 0 and H
D
t
= J
free
, (267)
where J
free
is the current due to the free charges. Since we are interested in the propagation of a wave in a solid, we
want to transform these equations into a wave equation. This can be done by taking the rotation as follows
E = ( E)
2
E =
B
t
. (268)
In an isotropic medium, we have the relation B = H (similar to the relation D = E). Since E = 0, we obtain
with the use of the other Maxwell equation
2
E =
H
t
=
t
(H) =
R
2
E
t
2
+
J
free
t
. (269)
66
For the current, we have apply Ohms law: J = E, where is the conductivity. This gives us the wave equation
2
E
R
2
E
t
2
=
E
t
. (270)
The term with is related to the optical conductivity related to optical transitions accompanying photon absorption.
Remember that the wave equation in free space is given by
2
E
0
2
E
t
2
= 0, (271)
where
0
0
= 1/c
2
, where c is the speed of light in vacuum. We will now look for solution of the electric eld in a
solid of the form
E = E
0
e
iqrit
. (272)
Inserting into Eqn. (269) gives
q
2
+
R
2
= i q
2
=
r
1
c
2
2
+i
r
0
=
r
2
c
2
_
1
+i
c
2
_
=
r
2
c
2
_
1
+i
_
, (273)
where
R
0
1
and
r
= 1 in a nonmagnetic solid. We can only nd a solution if the wavevector is complex, which
means we allow for evanescent waves. Let us dene a complex diractive index n,
q =
c
n =
c
(n +i), (274)
where n is the refractive index and is the extinction coecient (many books use k or K here, but this notation is a
bit confusing). Substituting gives
2
c
2
(n +i)
2
=
2
c
2
(n
2
2
+ 2in)
r
2
c
2
_
1
+i
_
, (275)
or
1
=
n
2
r
and
=
2n
r
. (276)
Often this is written as a complex dielectric function
r
=
1
+i
2
=
n
2
r
, (277)
giving
1
= =
n
2
r
and
2
=
=
2n
r
. (278)
Let us consider nonmagnetic systems from now on, for which we have
1
= = n
2
2
and
2
= 2n. (279)
We can also invert these equations to nd the expressions for n and in terms of
1
and
2
. Writing
n
2
2
2n
_
2
1
= 0 n
4
1
n
2
2
2
_
2
= 0 (280)
This is a cubic equation in n
2
, where the roots are given by
n
2
=
1
2
1
+
1
2
_
2
1
+ 4
_
2
2
_
2
, (281)
or
n =
_
1
2
1
+
1
2
_
2
1
+
2
2
. (282)
67
E
i
H
i
E
r
H
r
FIG. 51: The reectance of light at the surface of a solid.
Similarly, we can nd
=
_
1
2
1
+
1
2
_
2
1
+
2
2
. (283)
We can split the dielectric constant that we obtained before into a real and imaginary part
r
= 1 +
ne
2
m
0
1
2
0
2
i
. = 1 +
ne
2
m
0
2
0
2
(
2
0
2
)
2
+
2
2
+i
ne
2
m
0
(
2
0
2
)
2
+
2
2
. (284)
which gives
1
= 1 +
ne
2
m
0
2
0
2
(
2
0
2
)
2
+
2
2
and
2
=
ne
2
m
0
(
2
0
2
)
2
+
2
2
. (285)
As we will see later on,
2
is related to the absorption. Note that the function is like a Lorentzian (it is plotted in
Fig. 57), with a width related to . Note that in the limit 0 this function becomes a -function, going to zero
for ,=
0
and having a value of 1/ for =
0
. This means that the solid can only absorb light with energy
0
. Obviously, few solids have only one resonance frequency. However, we can easily generalize the Lorentz oscillator
problem for many oscillators. Let us take n
j
the density of electrons bound with resonance frequency
j
r
= 1 +
j
n
j
e
2
m
0
1
2
j
2
i
j
, (286)
with
j
n
j
= n.
Drude model and intraband transitions. Although the Drude model fails in a quantitative description of a solid (by
grossly overestimating the number of electrons relevant for the conductivity and other properties, due to the neglect
of Fermi-Dirac statistics), the idea that the conduction is determined by electrons that behave almost freely works
rather well. This implies that
0
= 0 The value of is can be related to the inverse of the mean free time between
collisions, = 1/. This gives for the dielectric functions
= 1
2
p
( +i)
1
= 1
2
p
2
1 +
2
2
and
2
=
2
p
(1 +
2
2
)
(287)
Reectance. Although there are many dierent types of optical spectroscopies, one of the most intuitive ones is
the reectance. Let us consider an wave coming in perpendicular, see Fig. 51. Let us take the incident electric eld
along the x-direction,
E
i
= E
i
e
i qzit
x, (288)
where q can be complex. We can nd the magnetic eld from the Maxwell equation
E =
H
t
= iH. (289)
68
20
15
10
5
0
5
10
0 0.5 1 1.5 2
/
p
1
,
2
FIG. 52: The dependence of
1
and
2
as a function of (normalized to the plasma frequency
p
) of the Drude model. The
scattering is small, i.e. =
1
p
.
The rotation can be obtained from
E =
x y z
z
E
x
0 0
=
E
x
z
y = i qE
x
y (290)
This gives
H
i
=
q
E
i
e
i qzit
y, (291)
which means H is perpendicular to E (as it should be). At the surface part of the wave is reected and part of it is
transmitted. However, the other Maxwell equations impose continuity of the electric eld
E
i
+E
r
= E
t
, (292)
where r and t stand for reected and transmitted, respectively. For the magnetic eld, we must satisfy
H
i
H
r
= H
t
, (293)
where we should take care the the direction of the magnetic eld changes for the reected light, see Fig. 51. We can
relate this to the electric eld by
q
1
(E
i
E
r
) =
q
2
E
t
, (294)
Using that q = n/c and that medium 1 is the vacuum for which we have n
1
= 1 and
1
=
0
and, in addition,
taking the solid nonmagnetic, which gives
2
=
0
, we obtain
E
i
E
r
= nE
t
. (295)
69
0
0.5
1
1.5
2
2.5
3
3.5
4
0 0.5 1 1.5 2
/
p
n
,
n
n
Transparent Reflecting
FIG. 53: The dependence of n and as a function of (normalized to the plasma frequency
p
) for the Drude model. Same
as in Fig. 52.
0
0.2
0.4
0.6
0.8
1
0 0.5 1 1.5 2 2.5
R
/
p
FIG. 54: The dependence of the reectivity R as a function of (normalized to the plasma frequency
p
) for the Drude model.
Same as in Fig. 52.
Combining this with Eqn. (292) gives
( n 1)E
i
+ (1 + n)E
r
= 0 r =
E
r
E
i
=
1 n
1 + n
. (296)
The reectivity is then given by
R = r
r =
1 n
1 + n
2
=
(1 n)
2
+
2
(1 +n)
2
+
2
(297)
70
FIG. 55: The reectance of aluminum. The plasma frequency is given by
p
= 14.7 eV. The decrease in reectance around
1.4 eV is due to a weak interband transition.
Given the expressions for
1
and
2
, see Fig. 52, for the Drude model, it is straightforward to calculate the index of
refraction n = n+i and the reectivity, see Fig. 53 and 54. Let us rst look at the most intuitive one: the reectivity,
see Fig. 54. In the Drude, we see that the reectivity is close 100% up to the plasma frequency
p
. How does this work
out for our prototype nearly-free electron system Al? Figure 55 shows that there is a remarkable agreement in the
reectivity. Close to 90% of the light is reected up to the plasma frequency. The wavelength of visible light ranges
from 380 (violet) to 750 (red) nm. This corresponds to an energy of 1.66-3.27 eV. The value of
p
for Al is 14.7 eV.
Therefore all the visible light is reected, explaining why aluminum foil has the mirror-like look. The dierence in the
two sides of the aluminum foil has to do with the manufacturing process. Since aluminum foil is so thin, two sheet
of foil are placed on top of each other in the nal rolling. When separated after rolling the inside is duller than the
outside due to dierent surface roughness. The reectance is 88% for the shiny side and 80% for the dull side. There
is a decrease in reectance deviating from the Drude-like behavior around 1.4 eV due to weak interband transitions.
The plasma frequency is
p
= 3.5 s
1
. This corresponds to a density of n = 3.9 10
27
m
3
. So when neglecting the
Fermi-Dirac distribution, we overestimate n by a factor 50. This is roughly the overestimation of the conductivity in
the Drude model, which is given by = ne
2
/m, where is the time between scattering events. The Drude model
assumes that the electrons are free, but does not take into account the Fermi-Dirac statistics. However, even though
the model completely misses the actual value of
p
, it gives a very nice description of the reectivity.
The reectance for frequencies less than the plasma frequency also manifests itself in other quantities. For <
p
,
the extinction coecient is nite, indicating that light does not penetrate the solid. For >
p
we obtain no extinction
and obtain a nite index of refraction. Thus aluminum will become transparent in the ultraviolet. We also observe a
71
k
E
E
F
Drude like excitations
Lorentz like excitations
"localized" bands
"free-electron" bands
FIG. 56: Schematic gure of the possible transitions. The Drude model describes intraband transition. For free electron like
systems and noble metals that would be in the free electron like bands that cross the Fermi level. The minimum energy needed
for these transitions is zero. However, for noble metals, one also has to consider interband transitions from the more localized
3d bands. These occur at a nite energy.
nite refractive index n close to = 0. This is due to absorption related to the fact that
0
= 0 for the Drude model.
Lorentz oscillators and interband transitions.
However, the Drude model appears to fail for noble metals, such as copper (reddish) and gold (yellow), although
it works reasonably well for silver. The electron density of copper is n = 4 1/(3.61 10
10
)
3
= 8.5 10
28
m
3
.
This about a factor two smaller than that of aluminum (although the absolute densities fail to predict the plasma
frequency, your rst order guess for the ratio of the plasma frequencies would be the ratio of the electron densities).
Naively one can expect the plasma frequency to be about a factor
1
= 1 +
2
p
2
0
2
= 0 =
_
p
+
2
0
. (298)
This gives or our example
p
_
1 +
_
1
2
_
2
= 1.12
p
. Above this energy, the solid is transparent. Below this energy the
solid is reecting down to the energy
0
. Below that energy, the system becomes transparent again. This situation is
very comparable to what is found for semiconductors or insulators. For a wide-band insulator (for example, diamond,
see Fig. 59) an optical transition occurs at a nite energy, i.e. the energy needed to excite an electron from the valence
band into the conduction band. Below that energy, the system becomes transparent. For diamond, the minimum
excitation energy is greater than 5 eV, larger than the energy of visible light. This makes diamond transparent for
visible light giving it a glass light appearance. However, the reectance is not entirely zero and some of the light will
therefore be reected. Diamonds can also have a color. This is due to impurities. These impurities create energy
states inside the insulating gap. In Fig. 58, the optical properties are divided into four regions. These regions roughly
indicate the dominant aspect of the material. For less than roughly 0.4
0
, the material is mainly transparent with
72
-10
-5
0
5
10
15
20
0 0.5 1 1.5 2
1
,
2
/
p
1
FIG. 57: The coecients
1
and
2
as a function of /
p
with
0
=
p
/2 and = 0.1
p
.
0
0.5
1
1.5
2
2.5
3
3.5
4
0 0.5 1 1.5 2
R
,
n
,
/
p
R
n
Reflecting Transparent
Absorbing
Transparent
R for 0
FIG. 58: The refractive index n, extinction coecient , and the reection coecient R for a Lorentz oscillator as a function
of /
p
with
0
=
p
/2 and = 0.1
p
.
a nite refractive index n and a small extinction coecient . Around
0
, the material is predominantly absorbing,
where the incoming light causes transitions of energy . Between 0.6
0
and
p
, the system is reecting with a
large R. For energies greater than
p
, we again observe transparent behavior.
When considering the reectance of noble metal such as copper and silver, see Fig. 60, one has to consider a
combination of both Drude and Lorentz excitations. The Drude excitations related to excitations in the 4sp bands
crossing the Fermi level dominate the reectance for low energies. However at approximately 2 and 4 eV for Cu and
Ag, respectively, we observe Lorentz-like excitations related to transitions from the occupied 3d band into the empty
4sp states, see also Fig. 50. To describe the reectance we cannot consider a single Lorentz oscillator, but we have to
take into account a large number of Lorentz oscillators of dierent energy. This causes a sharp drop in reectance.
For Ag, this hardly aects the reectance in the visible light, since 4 eV is in the ultraviolet. However, for copper, the
decrease in reectance occur in the visible region, cutting of the blue part of the spectrum, giving copper its reddish
73
FIG. 59: The band structure of diamond after Saslow, Bergstresser, and Cohen, Phys. Rev. Lett. 16, 354 (1966). The inset
shows the values of several optical transitions.
color. Note that another drop of occurs at higher energies, clearly visible in Ag around 7-8 eV. This drop-o is related
to the plasma frequency at an energy about half that in Al, in agreement with our simple estimate of the relative
plasma frequencies of Al and Ag/Cu.
XXII. QUANTUM-MECHANICAL TREATMENT OF OPTICAL SPECTROSCOPY
We now want to interpret optical spectroscopy in term of quantum mechanics and relate the Lorentz oscillators to
interband and intraband transitions. We can write our eigenstates in the absence of light as
H
0
= [kn) = E
kn
[kn), (299)
where H
0
is the Hamiltonian describing the solid and E
kn
are the eigenenergies that describe the electronic structure.
We want to study how the solid reacts to light and we therefore add a time-dependent perturbation to the Hamiltonian,
H = H
0
+H
(t). (300)
The wave function as a function of time can now be found from the time-dependent Schrodinger equation
i
t
[(t)) = [H
0
+H
(t)][(t)). (301)
The question is now, what states are involved in the wavefunction [(t))? We have to make some approximations
here. Obviously, we want to include the ground state, which is the Fermi sea, i.e. all the states lled up to the Fermi
level following Paulis principle (we assume that T = 0 K). For the Fermi sea, which we denote by [0), we have
H
0
[0) = E
0
[0) with E
0
=
E
kn
E
F
E
kn
. (302)
Let us now consider the excited states. To lowest order, the light will create electron-hole pair excitations with respect
to the Fermi sea. Let us denote these states by
[m) = [k
; kn) (303)
74
FIG. 60: The reectance of copper and silver, see Ehrenreich and Philipp, Phys. Rev. 128, 1622 (1962).
where the state [k
; kn) indicates a Fermi sea, but with an electron with wave vector k
above the
Fermi level and a hole (or missing electron) with wavevector k and band index n below the Fermi level. The energy
of this state is given by
H
0
[m) = E
m
[m) = (
m
+E
0
)[m) with
m
= E
k
n
E
kn
. (304)
We can now write the time-dependent wavefunction as
[(t)) =
m
a
m
(t)e
iE
m
t/
[m). (305)
Inserting this into the Schrodinger equation gives
i
m
_
da
m
(t)
dt
i
E
m
a(t)
_
e
iE
m
t/
[m) =
m
E
m
+H
(t) a
m
(t)e
iE
m
t/
[m). (306)
We see that the terms with E
m
cancel, giving
i
m
da
m
(t)
dt
e
iE
m
t/
[m) =
m
H
(t)a
m
(t)e
iE
m
t/
[m). (307)
We can multiply this on the left by m
[e
iE
m
t/
, giving
i
da
m
(t)
dt
=
m
m
[H
(t)[m)a
m
(t)e
i(E
m
E
m
)t/
. (308)
75
If we take the system to be initially in the ground state,
a
m
=
m,0
, t = 0, (309)
we obtain E
m
E
m
= E
m
E
0
=
m
. Using
H
m
= m
[H
(t)[0) =
_
dr
n
H
kn
, (310)
we can write, in rst order approximation theory,
i
da
m
(t)
dt
= H
m
e
i
m
t
. (311)
Note that the summation has disappeared since we only consider one transition, from the ground state [0) to and
excited state [m
).
So far we have not really specied the pertubation H
m
=
1
2
eE r)
m
(e
it
+e
it
), (314)
where
r)
m
= m[r[0) =
_
dr
k
n
r
kn
(315)
is the dipole matrix element, which follows the usual dipole selection rules Let us drop the prime in Eqn. (311) and
integrate the time from t = 0 to a particular time t
a
m
(t) =
1
i
1
2
eE r)
m
_
t
0
dt
_
e
i(
m
+)t
+e
i(
m
)t
_
(316)
=
eE r)
m
2
_
1 e
i(
m
+)t
m
+
+
1 e
i(
m
)t
_
. (317)
We are interested in comparing the classical with the quantum-mechanical dielectric function. We obtained the
classical dielectric function by studying the induced dipole moment due to the electric eld. Let us consider calculate
the induced dipole moment which can be directly related to the expectation value of r,
p = e(t)[r[(t)). (318)
However, we just obtained the expression for [(t)) within rst-order perturbation theory. Using this, we have
p = e
m,m
m
(t)a
m
(t)m
[r[m)e
i(E
m
E
m
)t/
. (319)
However, we expect that the coecients a
m
with m ,= 0 are relatively small. Therefore, the major contributions will
come from the terms where either m = 0 or m
m
_
a
m
(t)m[r[0)e
i
m
t
+a
m
(t)0[r[m)e
i
m
t
= e
m
_
a
m
(t)r)
m
e
i
m
t
+a
m
(t)r)
m
e
i
m
t
(320)
76
Let us now also assume that the excited states have a nite lifetime. This means that they obtain a complex frequency
m
z
m
=
m
+
i
2
m
, (321)
giving a time dependence e
it
1
2
m
t
, showing a decay of the wavefunction. In the remainder, we neglect the m
dependende of . We are now dealing with terms
a
m
(t)e
iz
m
t
=
eE r)
m
2
_
e
iz
m
t
e
it
z
m
+
+
e
iz
m
t
e
it
z
m
eE r)
m
2
_
e
it
z
m
+
+
e
it
z
m
_
, (322)
where in the last step we have used the fact that only the terms with the same frequency as the driving force remain
whn taking a time average. Let us for simplicity also take the electric eld along the x direction, with E = E x. This
gives also a dipole moment p
x
along the x direction. Using x
m
= m[x[0), we nd for the dipole moment
p =
m
e
2
E[x
m
[
2
2
(z
m
+z
m
)
_
e
it
(z
m
+)(z
m
)
+
e
it
(z
m
)(z
m
+)
_
, (323)
where z
m
+z
m
= 2
m
and
(z
m
)(z
m
) = (
m
+
i
2
)(
m
i
2
) (324)
= (
m
)(
m
) +
i
2
[
m
(
m
)]
1
4
2
(325)
=
2
m
1
4
2
i
2
m
2
i, (326)
where in the last step we assumed that the broadening is small relatively to the excitation energy
m
. This gives for
the dipole moment
p =
m
e
2
E[x
m
[
2
_
m
e
it
2
m
2
i
+
m
e
it
2
m
2
+i
_
. (327)
The polarization is then given by
P =
N
V
p =
m
2e
2
n[x
m
[
2
_
m
(
2
m
2
)
(
2
m
2
)
2
+
2
2
cos t +
m
(
2
m
2
)
2
+
2
2
sin t
_
E. (328)
In order to relate this to the dielectric function, we need to look at the response to a cosine wave. We know that
for an electric eld E e
it
, the polarization is given by P = ()
0
E. To construct a cosine, we also need ().
From Eqn. (264), we can see that replacing by gives () =
0
E(e
it
+
e
it
) =
1
2
0
E
_
(
1
1)(e
it
+e
it
) +i
2
(e
it
e
+it
)
(329)
=
0
(
1
1)E cos t +
0
2
E sin t. (330)
This means that the cosine term is related to
1
,
1
= 1 +
m
2e
2
n[x
m
[
2
m
(
2
m
2
)
(
2
m
2
)
2
+
2
2
(331)
and the sine term to
2
,
2
=
m
2e
2
n[x
m
[
2
(
2
m
2
)
2
+
2
2
. (332)
We can also write this in its complex form
r
= 1 +
ne
2
m
0
m
f
m
2
m
2
i
, (333)
with the oscillator strength given by
f
m
=
m
2m
2
[x
m
[
2
. (334)
77
XXIII. RELATION TO ABSORPTION
Earlier it was mentioned that
2
was related to absoprtion. This might not be directly obvious. We are used to
righting absorption and emission in terms of Fermis Golden Rule
I()
m
[m[H
[0)[
2
(
m
), (335)
where I() is the absorption intensity. However, we want rewrite the -function. Using
i
_
0
e
i(
m
)tt
dt =
1
m
+i
=
m
(
m
)
2
+
2
i
(
m
)
2
+
2
(336)
Note that the imaginary part works as a -function in the limit 0, when it is zero for ,=
m
and 1/ for
=
m
. The integral over this function, known as a Lorentzian, is equal to . Normalizing the -function to unity,
we have
(
m
) =
1
(
m
)
2
+
2
=
1
Im
1
m
+i
. (337)
The function (
m
+i)
1
is also known as a Greens function. Therefore we can also write Fermis golden fule as
I()
1
m
[x
m
[
2
Im
1
m
+i
=
1
m
[x
m
[
2
Im
1
z
m
, (338)
where we have used x
m
= m[H
m
[0[H
[m)[
2
(
m
+). (339)
This term requires some closer consideration. The conservation of energy says that we start we an initial state and
end up with an excitation with energy
m
and a photon with energy . Since energy is conserved this implies that
=
m
. Since the photon energy is always positive, this implies that
m
< 0. However,we saw before that the
m
corresponded to an electron-hole pair excitation, which is always positive. However, we can have this term if
there are excitations in the initial state, which can be present at nite temperature or due to laser excitations. Then
the energy
m
corresponds to the deexcitation of an electron-hole pair. We can also rewrite this in an integral form
i
_
0
e
i(
m
+)t+t
dt =
1
m
+ +i
=
1
+z
m
(340)
The gives for the intensity
I()
1
m
[x
m
[
2
Im
1
+z
m
, (341)
Adding the two terms together gives
1
z
1
z
m
+
=
z
m
+z
m
( z
m
)( +z
m
)
, (342)
which reproduces the e
it
term in Eqn. (326).
XXIV. THOMAS-FERMI SCREENING
As we have seen in the previous sections, the electrons respond to an externally applied electric eld. In the same
way, one can envision the electron cloud corresponding to the electric eld produced by an electron. This is called
screening. We can understand screening physically by noting that the electrons cloud reacts to the electron and will
move as a result of the potential created by the electron. However, in a system in equilibrium we assume that the
78
chemical potential = E
F
(r) U(r) remains constant. Thus the change in potential has to be compensated by a
change in number of electrons
dE
F
dn
n U = 0. (343)
In a free electron model, the Fermi energy is given by
N = 2
V
(2)
3
4
3
k
3
F
k
F
= (3
2
n)
1/3
E
F
=
2
k
2
F
2m
=
2
2m
(3n)
2/3
, (344)
giving dE
F
/dn =
2
3
E
F
/n. The induced charge is therefore
ind
= en =
3
2
enU
E
F
=
3
2
e
3
n
E
F
0
q
2
(345)
where use has been made of Laplaces equation to calculate the potential from the point charge e:
2
V = e/
0
(which also follows from Gausss equation E = e/
0
and E = V . or in momentum space q
2
V = e/
0
or
V = e/
0
q
2
. The potential energy is then U = eV = e
2
/
0
q
2
. An alternative way of obtaining the same result is
to make a Fourier transform of the Coulomb potential
U(r) =
1
4
0
e
2
r
. (346)
The use of this alernative way becomes clear in a little while. The Fourier transform gives
U
q
=
e
2
4
0
_
2
0
d
_
0
dr
_
0
dr
2
sin e
iqr cos
1
r
(347)
=
e
2
2
0
_
0
dr
_
1
1
d cos re
iqr cos
=
e
2
2iq
0
_
0
dr(e
iqr
e
iqr
) (348)
Here we have a small problem with the convergence of the integration. This can be solved by making the oscillation
damp out by e
r
, and in the end take = 0
U
q
=
e
2
2iq
0
_
0
dr(e
iqr
e
iqr
)e
r
=
e
2
2iq
0
_
0
dr(e
i(q+i)r
e
i(qi)r
) =
e
2
2q
0
_
1
q +i
+
1
q i
_
=
e
2
0
1
q
2
+
2
(349)
where a has been introduced to let the oscillations damp out slowly. In the limit 0, we obtain U
q
= e
2
/
0
q
2
.
For the dielectric function we then nd
r
= 1 + = 1
ind
free
= 1
ind
e
= 1 +
3e
2
n
2E
F
0
q
2
= 1 +
q
2
TF
q
2
(350)
with q
TF
=
_
3ne
2
/2
0
E
F
. We therefore nd an eective Coulomb potential
U
q
=
1
e
2
q
2
=
1
+ 0
e
2
r
q
2
=
1
4
0
e
2
q
2
+q
2
TF
. (351)
One might wonder to what potential this corresponds in real space. However, we see that the form is exactly the same
as the form obtained in deriving the Fourier transform of the potential. We can therefore write the eective screened
potential as
U(r) =
1
4
0
e
2
r
e
q
F
r
, (352)
which look like a Coulomb potential but with a reduced range.
79
XXV. MANY-PARTICLE WAVEFUNCTIONS
In quantum mechanics, particles are indistinguishable. This means that we should be able to interchange two
particles without seeing any eect on the probability function. We can distinguish two fundamentally dierent types
of particles, namely those with a symmetric and antisymmetric wavefunction. Under permutation between particle i
and j for a N particle wavefunction, one has the following possibilities
P
ij
n
1
n
N
(r
1
, , r
i
, , r
j
, , r
N
) =
n
1
n
N
(r
1
, , r
i
, , r
j
, , r
N
) (353)
where k
i
means that a particle is in a certain quantum state n
i
(for example a particle in a box where the quantum
numbers are n
i
= n
x
, n
y
, n
z
, but it could also refer to the quantum numbers of an atom n
i
= nlm). At rst sight
one might think it does not make any dierence since the real properties are related to the square of the wavefunction
and a plus or minus sign should not make too much of a dierence. However, the sign has drastic consequences for
the statistics. Let us look at the wavefunction for two particles:
k
1
,k
2
(r
1
, r
2
) =
1
2
(
k
1
(r
1
)
k
2
(r
2
)
k
2
(r
1
)
k
1
(r
2
)). (354)
where
k
are eigenfunctions of the Hamiltonian. The symmetric and antisymmetric wavefunctions give two fun-
damentally dierent particles. Particles whose wavefunction is symmetric are bosons, particles with antisymmetric
wavefunctions are called fermions. Examples of bosons are photons, and K mesons. These are particles with
integral spin. Also the harmonic oscillator and lattice vibrations (also known as phonons) behave like a boson-like
particle. Particles with half integral spin behave like fermions. Examples are electrons, protons, and neutrons. The
antisymmetric wavefunctions have a very special characteristic. Let us take two particle in the same quantum state
k,k
(r
1
, r
2
) =
1
2
(
k
(r
1
)
k
(r
2
)
k
(r
1
)
k
(r
2
)) = 0. (355)
Therefore, it is impossible to have two fermions with the same quantum numbers. Note that the spin is also considered
a quantum number. Therefore, we can have two electrons in the same wavefunction (say the same plane wave or the
same atomic orbital), but with opposite spins. The occupation of each fermion state is therefore 0 or 1.
Let us now generalize the wavefunctions for an arbitrary number of particles. For bosons the general wavefunction
is
k
1
,...,k
N
=
_
i
n
k
i
!
N!
k
1
(r
1
)
k
N
(r
N
), (356)
where the summations runs over all the possible permutations of dierent indices k
i
. This gives the normalization
constant. n
k
i
indicates how many k-values are k
i
. Note that the
i
n
k
i
= N. For two particles with S = 0, we have
k
1
,k
2
=
1
k
1
(r
1
)
k
2
(r
2
) +
k
2
(r
1
)
k
1
(r
2
) . (357)
For k
1
= k
2
= k, we nd
k
1
,k
2
=
_
2!
2!
k
(r
1
)
k
(r
2
) =
k
(r
1
)
k
(r
2
), (358)
using the fact that n
k
= 2. We see that it is possible to put two bosons in the same quantum state. In fact, we can
put any arbitrary number of bosons in a particle quantum state. From a statistical point of view, we therefore have
for the occupation numbers of a particular quantum state i
n
i
= 0, 1 fermions (359)
n
i
= 0, 1, 2, 3, bosons (360)
This is an important aspect that we have to include in our quantum statistics. Up till now the only aspect that we
have used is the indistinguishability.
The situation for electrons (and other spin-1/2 particles) is fact a bit more complicated since the total wavefunction,
i.e., orbital and spin, has to be asymmetric. This means
S
(r
1
, r
2
)
A
(1, 2) or
A
(r
1
, r
2
)
S
(1, 2), where (1, 2) is the
wavefunction for the spin. There are four ways to combine two spins
(1)
(2),
(1)
(2),
(1)
(2), and
(1)
(2).
80
(1)
(2) and
(1)
(2) are clearly symmetric spin functions, since interchange of the particles does not change the
sign of the wave function. These are the
11
(1, 2) and
1,1
(1, 2) components of the triplet (S = 1). Since S
z
= 1, 0, 1
for S = 1, there should also be a S = 0 component. This can be obtained by using the step operator for the spin
S
11
=
_
(1 + 1)(1 1 + 1)
10
=
2
10
=
(1)
(2) +
(1)
(2) (361)
10
=
1
2
[
(1)
(2) +
(1)
(2)] (362)
where we have used
S
SS
z
=
_
(S S
z
)(S S
z
+ 1)
S,S
z
1
(363)
Note that this is also a symmetric wavefunction. The remaining wave function has to be orthogonal to the previous
one
00
(1, 2) =
1
2
[
(1)
(2)
(1)
(2)] (364)
which is antisymmetric. Since there is only one component (called a singlet), this has to be S = 0. The four total
wavefunctions for two electrons are therefore
1S
z
(r
1
, r
2
) =
A
(r
1
, r
2
)
1S
z
(1, 2) with S
z
= 1, 0, 1 and
00
(r
1
, r
2
) =
S
(r
1
, r
2
)
00
(1, 2). The S = 1 and S
z
= 1 is given by
11
(r
1
, r
2
) =
A
(r
1
, r
2
)
11
(1, 2) =
1
2
[
k
1
(r
1
)
k
2
(r
2
)
k
2
(r
1
)
k
2
(r
1
)]
(1)
(2) (365)
=
1
2
[
k
1
(r
1
)
(1)
k
2
(r
2
)
(2)
k
2
(r
1
)
(1)
k
2
(r
1
)
(2)] (366)
=
1
k
1
(r
1
)
(1)
k
1
(r
2
)
(2)
k
2
(r
1
)
(1)
k
2
(r
2
)
(2)
(367)
The S = 0 and S
z
= 0 wavefunction is given by
00
(r
1
, r
2
) =
S
(r
1
, r
2
)
00
(1, 2) =
1
2
[
k
1
(r
1
)
k
2
(r
2
) +
k
2
(r
1
)
k
2
(r
1
)]
1
2
[
(1)
(2)
(1)
(2)] (368)
=
1
2
[
k
1
(r
1
)
(1)
k
2
(r
2
)
(2)
k
2
(r
1
)
(1)
k
1
(r
2
)
(2) (369)
k
1
(r
1
)
(1)
k
2
(r
2
)
(2) +
k
2
(r
1
)
(1)
k
1
(r
2
)
(2)] (370)
Note that this can also be written as a combination of Slater determinants
00
(r
1
, r
2
) =
1
2
_
1
k
1
(r
1
)
(1)
k
1
(r
2
)
(2)
k
2
(r
1
)
(1)
k
2
(r
2
)
(2)
k
1
(r
1
)
(1)
k
1
(r
2
)
(2)
k
2
(r
1
)
(1)
k
2
(r
2
)
(2)
_
(371)
=
1
2
[
k
1
,k
2
(r
1
, r
2
)
k
1
,k
2
(r
1
, r
2
)] (372)
Note that in general we can built up antisymmetric states from Slater determinants
k
1
1
,...,k
N
N
(r
1
r
N
) =
1
N!
k
1
(r
1
)
1
(1)
k
1
(r
N
)
1
(N)
. .
. .
k
N
(r
1
)
N
(1)
k
N
(r
N
)
N
(N)
(373)
where
i
=, . As a result of the properties of the determinant, having two columns with the same k and implies
that the determinant is zero, just as for the exclusion principle. Interchange of two columns also leads to a change in
sign of the determinant, giving us directly the antisymmetry. This is obvious for the S
z
= 1, 1 states of the triplet.
For the S
z
= 0 states, one needs a combination of Slater determinants
k
1
k
2
,00
=
1
2
(
k
1
,k
2
k
1
,k
2
) and
k
1
k
2
,10
=
1
2
(
k
1
,k
2
+
k
1
,k
2
) . (374)
You may have noticed that dealing with the statistic of many particles using wavefunction is quite cumbersome.
This is due to the fact that we have to permute all the dierent particles (or the dierent coordinates r
i
over all the
81
dierent wavefunctions. The statistics for fermions and bosons can greatly simplify if we start out with something
indistinguishable. This can be done by using operators. You probably remember that the analytical solution of the
harmonic oscillator involved the step up/down operators a
[n) =
with x and p, since this result is far more general. Note that a
eectively
put adds one to the quantum state, whereas a removes one from the quantum state. Let us now identify this with
bosons. a
creates a boson in the quantum state, whereas a removes a boson from the quantum state. Now the
harmonic oscillator is only one quantum state. We can in principle have many dierent quantum states k in which
we can put bosons. So we can generalize it to
a
k
[n
1
, n
2
, , n
k
, ) =
n
k
+ 1[n
1
, n
2
, , n
k
+ 1, ) and (376)
a
k
[n
1
, n
2
, , n
k
, ) =
n
k
[n
1
, n
2
, , n
k
1, ). (377)
These operators have some nice characteristics: they are indistinguishable. In addition, they are symmetric. This
follows from the generalized commutation relations
[a
k
, a
] =
k,k
and [a
k
, a
] = 0 and [a
k
, a
k
] = 0 (378)
Thus for k ,= k
, we can write
a
k
a
= a
k
. (379)
Therefore, interchange of two particles does not change the sign of the wavefunction.
The wavefunction for a single harmonic oscillator could be written as
[n) =
1
N!
(a
)
N
[0). (380)
For more than one k value, we can generalize the wavefunction as
[n
k
i
) =
1
_
i
n
k
i
i
(a
k
i
)
n
k
i
[0). (381)
Let us look at the two-particle wavefunctions. If the two particles are in the same state (i.e. have the same quantum
number) then the wavefunction should be similar to that of the harmonic oscillator:
[kk) =
1
2
a
k
a
k
[0). (382)
This wavefunction is now properly normalized since
kk[kk) =
1
2
0[a
k
a
k
a
k
a
k
[0) =
1
2
0[a
k
(1 +a
k
a
k
)a
k
[0) (383)
=
1
2
0[a
k
a
k
[0) +
1
2
0[a
k
a
k
a
k
a
k
[0) (384)
=
1
2
0[(1 +a
k
a
k
)[0) +
1
2
0[(1 +a
k
a
k
)(1 +a
k
a
k
)[0) = 1 (385)
(386)
since a
k
[0) = 0. For two dierent wavevectors, we have
[k
1
k
2
) = a
k
1
a
k
2
[0). (387)
This gives the right normalization since
k
1
k
2
[k
1
k
2
) = 0[a
k
1
a
k
2
a
k
2
a
k
1
[0) = 0[a
k
1
(1 +a
k
2
a
k
2
)a
k
1
[0) = 0[a
k
1
a
k
1
[0) = 0[(1 +a
k
1
a
k
1
)[0) = 1 (388)
Working with Slater determinants becomes quite elaborate. We can however do the same trick with operators to
account for the statistics as with bosons. The operator c
k
creates a fermion in a certain quantum state k, whereas c
k
removes it from this quantum state, i.e.
c
k
[0
k
) = [1
k
) and c
k
[1
k
) = [0
k
), (389)
82
where [0
k
) and [1
k
) denote the states with 0 and 1 electron in quantum state k, respectively. However, unlike bosons
that satisfy commutation relations, fermions satisfy anticommutation relations
c
k
, c
=
k,k
, c
k
, c
= 0, c
k
, c
k
= 0, (390)
where the term in braces is dened as
A, B = AB +BA. (391)
However, we cannot put two particles in the same state, since
c
k
c
k
[0) =
1
2
c
k
, c
k
[0) = 0. (392)
We can put them in dierent quantum states, thus for k ,= k
k
c
[0) = c
k
[0). (393)
Note that the operators directly take care of the antisymmetry of the wavefunctions. Including spin, we can write the
wavefunctions [k
1
k
2
, SS
z
) for two particles in states labeled with quantum numbers k
1
and k
2
with dierent spin S
and S
z
as
[k
1
k
2
, 11) = c
k
2
k
1
[0) , [k
1
k
2
, 10) =
1
2
_
c
k
2
k
1
+c
k
2
k
1
_
[0) , [k
1
k
2
, 1, 1) = c
k
2
k
1
[0) (394)
[k
1
k
2
, 00) =
1
2
_
c
k
2
k
1
k
2
k
1
_
[0). (395)
In general, we can write the many-body Fermion wavefunction as
[n
k
i
) =
i
(c
k
i
)[0). (396)
Note, that the normalization is unity since we can never have more than one fermion in the same quantum state.
Summarizing, we have seen that the a symmetric and antisymmetric wavefunction corresponds to particles with
completely dierent statistics. Bosons have a symmetric wavefunction that allows any number of bosons in a particular
quantum state. Fermions have antisymmetric wavefunctions and the occupation numbers are restricted to 0 and 1.
We also saw that these statistic can be closely related to operators and their corresponding commutation relationships.
Bosons commute, whereas fermions anticommute.
XXVI. MAGNETISM
From quantum mechanics, we learned that the electrons in an atom can have an orbital moment L and a spin
S. This is not only true for a single electron but also for many electrons on an atom. In addition, we saw that the
orbital and spin moment can be coupled through the spin-orbit coupling forming a total momentum J. This creates
an eective magnetic moment on a site. Magnetism is the phenomenon that the moments are aligned in the solid.
This can happen in several ways. First, we can apply a magnetic eld which, through its interaction with the orbital
and spin moment wants to align the magnetic moments. This is called diamagnetism and paramagnetism. Even more
interesting, is the spontaneous alignment of the magnetic moments below a particular temperature. There are several
types: ferromagnetism, antiferromagnetism, and ferrimagnetism.
A. Diamagnetism
Many atoms in a have magnetic moments. However, it is not essential to obtain a magnetization of a solid in a
magnetic eld. The magnetic eld will make the electrons precess in addition to their regular motion. This gives rise
to an orbital moment, which is aligned with the magnetic eld.
Classical derivation. Classically, the frequency of this procession is given by
=
eB
2m
, (397)
83
where B is the magnetic eld and m the mass of the electron. (note that this is comparable to the frequency of a
sphere around its axis and not that of an electron turning around in circles with a frequency eB/m, also known as
the cyclotron frequency). This frequency should be much lower than the frequency of the original motion in the eld
of the atom. This is generally satised, since atomic potential are a lot stronger than the change in frequency due to
the magnetic eld.
The rotation causes a current. For an atom with Z electrons, this gives
I =
Ze
T
= Ze
2
= Ze
1
2
eB
2m
. (398)
This current in itself creates a magnetic eld opposing the externally applied eld as in the case of the Biot-Savart
law. The magnetic eld is proportional to the magnetic moment given by the current times the area of the eective
loop
= I
2
) = Ze
1
2
eB
2m
2
) =
Ze
2
B
4m
2
). (399)
Note that
2
) is the area perpendicular to the magnetic eld. Taking the magnetic eld along the z-axis, one has
2
) = x
2
)+y
2
). For a uniform charge distribution, this can be related to the radius r
2
) = x
2
)+y
2
)+z
2
) =
2
3
2
)
if x
2
) = y
2
) = z
2
). This gives for the magnetic moment
=
Ze
2
B
6m
r
2
). (400)
For a solid with N equivalent atoms, the total magnetization is given by M = N.
We see that the induced magnetic moment is directly proportional to the magnetic eld. It is therefore useful to
dene a susceptibility
=
0
M
B
, (401)
where
0
= 4 10
7
N/A
2
is the magnetic constant or the permeability of vacuum. For diamagnetism, one has
=
N
B
=
NZ
0
e
2
6m
r
2
). (402)
This is the classical Langevin result.
In general, the magnetization does not have to be linearly proportional to the magnetic eld and we need to dene
the susceptibility as
=
0
M
B
. (403)
The susceptibility is a typical response function. It demonstrates how much the magnetization changes due to a
change in the magnetic eld.
Quantum-mechanical derivation. We can include the eect of the magnetic eld by replacing the momentum by
p p +eA. (404)
A is the vector potential and the magnetic eld is given by B = A. The kinetic part of the Hamiltonian then
becomes
H
(p +eA)
2
2m
=
p
2
2m
+
e
2m
(p A+A p) +
e
2
A
2
2m
. (405)
The rst term is the kinetic energy in the absence of a vector potential. The second term describes the interaction
between the electron and the vector potential. In general, we cannot take these two terms together since p is acting
on both the wavefunction and the vector potential A. We can take them together if we use the Lorentz gauge for
which A = 0. The last term describes the scattering of the vector potential. Let us consider the case of a uniform
magnetic eld along the z direction. The vector eld is then
A
x
=
1
2
yB
z
and A
y
=
1
2
xB
z
and A
z
= 0. (406)
84
This vector potential satises the Lorent gauge. The only nonzero component is then
B
z
= (A)
z
=
A
y
x
A
x
y
= B
z
, (407)
as it should be. We can then write
e
m
A p +
e
2
A
2
2m
=
ieB
z
2m
_
x
y
y
x
_
+
e
2
B
2
z
8m
(x
2
+y
2
). (408)
The second term gives a change in energy
E
=
e
2
B
2
z
8m
(x
2
) +y
2
)) =
e
2
B
2
z
12m
r
2
), (409)
using the same relation as used earlier for a homogenous charge distribution. The magnetic moment is given by
=
E
B
=
e
2
r
2
)
6m
B
z
, (410)
which reproduces the classical Langevin result.
B. Paramagnetism
The rst term can be rewritten as
ieB
z
2m
_
x
y
y
x
_
=
e
2m
L
z
B
z
=
B
(L B)
z
, (411)
where
B
= e/2m is the Bohr magneton. The angular momentum is dened as dimensionless (as opposed to the
often-used denition of units of ).
Let us consider the simple example of two electrons in a p shell. The dierent electron congurations are shown
in Table ??. We use the notation for a conguaration (m
k
_
c
k
(lm, lm)c
k
(l
, l
)F
k
(c
k
(lm, l
))
2
G
k
, (413)
where for p
2
, we have l = l
= 1. For
1
D, we nd
E(
1
D) = E(1
+
1
) =
k
(c
k
(11, 11))
2
F
k
= F
0
+
1
25
F
2
, (414)
where the values of c
k
are given in Table ??. In the case of
3
P, there is also an exchange contribution
E(
3
P) = E(1
+
0
+
) =
k
_
c
k
(11, 11)c
k
(10, 10)F
k
(c
k
(11, 10))
2
G
k
(415)
= F
0
2
25
F
2
(0G
0
+
3
25
G
2
) = F
0
5
25
F
2
,
85
where use has been made of the fact that within one shell G
k
= F
k
. The energy of the
1
S term cannot be calculated
this way. However, here we can make use of the fact that the sum of the diagonal energies is unchanged when
transforming from our basis set into the LS states,
E(1
+
1
) +E(1
1
+
) +E(0
+
0
) = 2
k
c
k
(11, 11)c
k
(1 1, 1 1)F
k
+
k
c
k
(10, 10)F
k
(416)
= 2(F
0
+
1
25
F
2
) +F
0
+
4
25
F
2
(417)
= E(
1
S) +E(
3
P) +E(
1
D) = E(
1
S) +
2
25
F
0
4
25
F
2
,
which gives E(
1
S) = F
0
+
10
25
F
2
.
Generally, one nds that for electrons in one shell the lowest term is the one with rst, the highest spin multiplicity,
and, secondly, the highest orbital multiplicity. In the case of p
2
, the
3
P term. This can be understood by noting that
for the highest spin electrons have to occupy dierent orbitals, therefore reducing the Coulomb interaction. The same
applies, in a more complicated manner for the angular momentum.
Thus we nd in general that that
2
P
2
P =
1,3
D +
1,3
P +
1,3
S (418)
giving a total degeneracy of 1 5 + 3 5 + 1 3 + 3 3 + 1 1 + 3 1 = 36, which is equal to 6 6. This would
be the case for two electrons in dierent p shells, i.e. pp. However, for p
2
, the Pauli exclusion principle states that
electrons cannot be in the same orbital. This makes matters a bit more complicated. We can easily see that
3
D is
forbidden, since this would require a conguration (1
+
1
+
). We know that the total number of congurations is 15.
So we still have 10 left. The other terms are 3, 9, 3, and 1 fold degenerate. Apparently the only way to make 10 is
to take the
3
P and the
1
S congurations, being 9 and 1-fold degenerate.
XXVII. FERROMAGNETISM
???????????????????????????????????????????
XXVIII. ANTIFERROMAGNETISM
?????????????????????????????????????????????????????????????
XXIX. PHONONS
Although this is the most convenient way to obtain the dispersion, it is often useful to derive this in a dierent
fashion. Let us consider again the Hamiltonian.
H =
n
p
2
R
2M
+V (r
R
1
, , r
R
N
), (419)
TABLE II: Dierent congurations for p
2
.
p
2
M
S
= 1 0 -1
M
L
= 2 (1
+
1
)
1 (1
+
0
+
) (1
+
0
)(1
0
+
) (1
)
0 (1
+
1
+
) (1
+
1
)(1
1
+
)(0
+
0
) (1
)
-1 (0
+
1
+
) (0
+
1
)(0
1
+
) (0
)
-2 (1
+
1
)
86
where r
R
n
and p
R
n
are the position and momentum, respectively, of the nth atom. We can expand the potential in
a Taylor series around the equilibrium positions r
R
n
= 0,
V (r
R
1
, , r
R
N
) = V (0, 0, ) +
R
r
R
R
V [
{r
n
=0}
+
1
2
R,R
(r
R
R
)(r
R
R
)V + (420)
The rst term is a simple shift of the energy, which we can ignore. The second term contains
R
V =
V
x
R
x +
V
y
R
y +
V
z
R
z (421)
is zero, since we have taken r
R
= 0 as the equilibrium position, i.e. V is in a minimum. The third term is a sum over
second partial derivatives
R,R
(r
R
R
)(r
R
R
)V =
R,R
=x,y,z
2
V
R
R
R
. (422)
When we denote V
R
=
2
V/
R
R
, we can also write this in matrix form
V =
1
2
R
r
T
R
V
R
R
r
R
=
1
2
R
(x
R
, y
R
, z
R
)
_
_
V
xx
R
R
V
xy
R
R
V
xz
R
R
V
yx
R
R
V
yy
R
R
V
yz
R
R
V
zx
R
R
V
zy
R
R
V
zz
R
R
_
_
_
_
x
R
y
R
z
R
_
_
. (423)
Since we are dealing with a crystal, we obviously do not want to stay in real space, but make a Fourier transform
over the positions of the nuclei in the solid to momentum space
r
R
=
1
q
e
iqR
r
q
and p
R
=
1
q
e
iqR
p
q
. (424)
The Hamiltonian then becomes
H =
q
p
q
p
q
2M
+
1
2
q
r
q
V
q
r
q
, (425)
where r
q
is a complex number so we need to take r
q
as opposed to simply transposing it. The interaction matrix is
now given by
V
q
=
_
_
V
xx
q
V
xy
q
V
xz
q
V
yx
q
V
yy
q
V
yz
q
V
zx
q
V
zy
q
V
zz
q
_
_
. (426)
Note that whereas there are cross terms V
R
R in real space (giving a double sum of R and R
), the interaction is
diagonal in q, since it only depends on the RR
,
V
q
=
e
iq
V
R+,R
, (427)
where runs over the lattice sites around the site R. Note that this is equivalent for each site R. Let us go back to
one dimension, to see what our next step is in solving the problem. In one dimension, the Fourier transform of the
Hamiltonian
H =
R
p
2
R
2M
+
RR
V
R
R
x
R
x
R
, (428)
with V
R
R
=
2
V
x
R
x
R
, becomes
H =
q
_
p
q
p
q
2M
+
1
2
m
2
q
x
q
x
q
_
, (429)
87
with
q
=
_
V
q
M
with V
q
=
e
iq
V
R+,R
, (430)
and the Fourier transforms
x
R
=
1
q
e
iqR
x
q
and p
R
=
1
q
e
iqR
p
q
or, inversely x
q
=
1
R
e
iqR
x
R
and p
q
=
1
R
e
iqR
p
R
However, apart from the summation over q, we readily recognize Eqn. (429) as the Hamiltonian for a harmonic
oscillator. We can diagonalize this by introducing the operators
a
q
=
1
_
2M
q
(M
q
x
R
+ip
q
) and a
q
=
1
_
2M
q
(M
q
x
R
ip
q
), (431)
leading to
H =
q
(a
q
a
q
+
1
2
). (432)
We would now like to determine the value of
q
. Let us consider only nearest-neighbor interactions. The interaction
is then
V =
R
1
2
K(x
R
x
R+a
)
2
=
R
K(x
2
R
x
R
x
R+a
) = K
R
x
2
R
K
2
=a
x
R+
x
R
(433)
Since this should be equivalent to V =
1
2
RR
V
RR
x
R
x
R
, we have V
RR
= 2K, V
Ra,R
= K, and V
R
R
= 0 for
[R R
e
iq
V
R+,R
= 2K K(e
iqa
+e
iqa
) = 2K(1 cos qa) = 4K sin
2
qa
2
, (434)
and
q
=
_
V
q
M
= 2
_
K
M
sin
qa
2
=
m
sin
qa
2
, (435)
with
m
=
_
K
M
. reproducing the result at the beginning of the chapter. We can also express the position and
momentum in terms of the step operators
x
q
=
2M
q
(a
q
+a
q
) and p
q
= i
_
2M
q
(a
q
a
q
), (436)
making use of the relations x
q
= x
q
and p
q
= p
q
and
q
=
q
.
Let us now return to three dimensions, where life is complicated by the fact that the interaction between the
nuclei is now a matrix, see Eqn. (426). However, we can turn this eectively into three independent one-dimensional
problems by diagonalizing V
q
. This gives three eigenvalues V
s
q
. There are three mutually perpendicular eigenvectors.
We can write those as unit vectors e
1
, e
2
, and e
3
in real space. These vectors are known as polarization vectors. The
direction of these vectors depend on the wavevectors which we omit here. If one of the polarization vectors is parallel
to q then there are longitudinally polarized phonons. Since the other two vectors are orthogonal, we also have two
transversely polarized phonons. However, this usually only occurs when q is along certain symmetry direction in the
crystal. Equation (425) then becomes
H =
qs
_
p
qs
p
qs
2M
+
1
2
V
s
q
r
qs
r
qs
_
, (437)
88
where the subscript s indicated that we are dealing with the component along a particular polarization vectors, i.e.
r
qs
= r
qs
e
s
and p
qs
= p
qs
e
s
. The frequencies are given by
qs
=
_
V
s
q
M
. (438)
The step operators are given by
a
qs
=
1
_
2M
qs
(M
qs
r
q
+ip
q
) e
s
(439)
a
qs
=
1
_
2M
qs
(M
qs
r
q
ip
q
) e
s
(440)
leading to the Hamiltonian
H =
qs
qs
(a
qs
a
qs
+
1
2
). (441)
XXX. ELECTRON-PHONON INTERACTION
In this section, we discuss the electron phonon interaction. The electrons are treated as a simple electron gas feeling
the potential of the nuclei. The unperturbed Hamiltonian is given by
H =
k
c
k
c
k
+
qs
qs
a
qs
a
qs
. (442)
We then add the potential of the atoms. Unlike what we considered before (as in the free-electron model), the nuclei.
We assume that the potential of the ion depends only on the distance from the equilibrium position of the ion. The
periodic potential is then described in second quantization as
H
U
=
1
N
kk
R
k
[U(r Rr
R
)[k)c
c
k
(443)
=
1
N
kqR
e
iq(R+r
R
)
U
q
c
k+q
c
k
(444)
with q = k
kk
R
e
iqR
U
q
c
k+q
c
k
(446)
U =
kK
U
K
c
k+K
c
k
(447)
(448)
The second term gives the electron-phonon interaction
H
ep
= i
1
N
kqR
e
iqR
q r
R
U
q
c
k+q
c
k
= i
1
N
kqR
e
iqR
1
e
iq
R
q r
q
U
q
c
k+q
c
k
(449)
= i
1
N
1
N
kqq
R
e
i(qq
)R
q r
q
U
q
c
k+q
c
k
(450)
= i
1
kq
q r
q
U
q
c
k+q
c
k
. (451)
89
Expressing the displacement in creation and annihilation operators using
r
q
=
2M
q
(a
qs
+a
qs
) e
s
(452)
gives
H
ep
= i
kqs
2NM
qs
q e
s
U
q
(a
qs
+a
qs
)c
k+q
c
k
. (453)
For simplicity, we shall assume that the phonon spectrum is isotropic. This implies that phonons are either lingitu-
dinally or transversely polarized. In that case, only the longitudinal phonon with the polarization vector parallel to
the wavevector contribute. In the following, we will also neglect the eects of the periodic potential U. We are then
left with the Fr ohlich Hamiltonian
H =
k
c
k
c
k
+
q
c
q
c
q
+
kq
M
q
(a
q
+a
q
)c
k+q
c
k
, (454)
where the matrix element is given by
M
q
= i
kqs
2NM
q
qU
q
. (455)
The Hamiltonian consists of two components. In one case, a
q
c
k+q
c
k
, the electron is scattered by the absorption of
a phonon with momentum q, in the other case, a
q
c
k+q
c
k
the electron is scattered by emission of a phonon with
momentum q. Note that the total momentum of the creation operators equals that of the annihilation operators (in
principle, they could dier by a reciprocal lattice vector K but we will neglect this and reduce everything to the rst
Brillouin zone). We now want to consider the change of the ground state energy due to the presence of electron-phonon
interactions. This will modify the excitation spectra and even result in an eective attractive interaction between the
electrons mediated by the phonons.
Let us study the change in total energy up to second-order perturbation theory, this gives
E = E
0
+[H
ep
[) +[H
ep
1
E
0
H
0
H
ep
[), (456)
where [) is the unperturbed state characterized by the occupation numbers n
q
for phonons and n
k
for electrons.
Since H
ep
involves the creation or annihilation of a phonon, the rst term is zero. The second term gives
E
2
= [
kq
M
q
(a
q
+a
q
)c
k+q
c
k
1
E
0
H
0
M
q
(a
q
+a
q
)c
+q
c
k
[). (457)
Since we start with the state [) and end with the same state, the H
ep
s have to cancel each others excitations. This
is achieved for q
kq
[M
q
[
2
_
[a
q
c
k
c
kq
1
E
0
H
0
a
q
c
kq
c
k
[) +[a
q
c
k
c
kq
1
E
0
H
0
a
q
c
kq
c
k
[)
_
. (458)
We see that the combinations of creation and annihilation operators form number operators n
k
= c
k
c
k
. Note that
c
k
c
k
= 1 n
k
. It is also straightforward to obtain the dierence in energy between the excited state and E
0
. This
leads to the expression
E
2
=
kq
[M
q
[
2
n
k
(1 n
kq
))
_
n
q
)
kq
+
q
+
n
q
+ 1)
kq
q
_
, (459)
where ) gives the expectation value. Since we considered an isotropic system, we can assume that
q
=
q
and
n
q
) = n
q
). The second term in the above equation can be split into two terms. We then obtain
E
2
=
kq
[M
q
[
2
n
k
(1 n
kq
))
_
2(
k
kq
)n
q
)
(
k
kq
)
2
(
q
)
2
+
1
kq
q
_
, (460)
90
FIG. 61: Schematic plot of the Kohn anomaly. The energy is given by E = q + ln
_
(1 2q/k
F
)
2
+
2
with = 0.1 and
= 0.03. In the absence of electron-phonon interactions, the phonon energy would be a straight line E = q. The presence of
these interactions gives rise to an anomaly around q/k
F
= 2.
The total energy is given by E
0
+E
2
. Note that the phonon part of the ground state energy can be given
q
q
n
q
).
We can nd the change in the phonon energy due to the electron-phonon interaction by determing the change in energy
with respect to a change in the phonon occupation number n
q
, giving
q
=
n
q
)
(E
0
+E
2
) =
q
+
k
[M
q
[
2
2n
k
(1 n
kq
))(
k
kq
)
(
k
kq
)
2
(
q
)
2
(461)
Since phonon energies are generally small, let us have a look at this result when we neglect
q
in the denominator:
q
=
q
+
k
[M
q
[
2
2n
k
(1 n
kq
))
kq
. (462)
We can interpret this expression as follows. A phonon will be absorbed under the creation of an electron-hole pair with
energy
k
kq
. The electron-hole pair is then annihilated under the emission of a phonon. We expect the largest
change in the phonon energy to occur when the energy of the electron-hole pair approaches zero. Let us suppose that
q is in the x direction. An electron-hole pair of zero energy can be created by scattering across the Fermi surface
with q = 2k
F
. Integrating over k leads to a logarithmic divergence. This leads to an innite group velocity of the
phonons, known as the Kohn anomaly.
Let us try to make a very simple of the Kohn anomaly and assume a one-dimensional system with a band with
a linear dispersion. Obviously, this is not very realistic, but the underlying idea is that the presence of an anomaly
should not depend too much on the details of the electronic structure. The dispersion is then given by
k
= [k[. We
are interested in what happens if the energy of the electron-hole pair is close to zero. This occurs for scattering from
k
F
to k
F
, i.e. for q = 2k
F
. Let us consider scattering from negative k-values to positive k values (the other way will
be equivalent). An electron at k (with k > 0) scatters to q k. We have to be careful with the integration. When
k
F
< q < 2k
F
, not all q-values give allowed transitions due to the Fermi-Dirac distribution. At T = 0, the maximum
allowed value is q k
F
. The integration over momentum then becomes
_
qk
F
0
dk
1
(q k) k
=
1
_
qk
F
0
dk
1
q 2k
= ln(q 2(q k
F
)) ln q = ln(
2k
F
q
1) (463)
For q > 2k
F
, we can always scatter from the negative k values to positive k values. The maximum k value is then
k
F
, since we cannot remove electrons from above the Fermi level. For q > 2k
F
, the momentum is always suciently
large to scatter the electrons and the upper bound is given by k
F
1
_
k
F
0
dk
1
q 2k
= ln(q 2k
F
) ln q = ln(
1 2k
F
q
). (464)
We can combine the two result, giving
q
=
q
+ const ln [1
2k
F
q
[ (465)
where the constant denes the strength of the interaction. We clearly see the Kohn anomaly at q = 2k
F
. Figure 61
shows a schematic picture of the Kohn anomaly. A broadening of the divergence has been included by using
q
=
q
+ const ln
(1
2k
F
q
)
2
+
2
. (466)
We can also do it the other way around and look at the change in electron energies due to the phonons. This can
be obtained by calculating the change in energy due to a change in electron occupation of the electron state with
momentum k in Eqn. (459). For simplicity, we assume that we are at low temperatures where the number of phonons
91
is small, i.e. n
q
). This gives
k
=
n
k
)
(E
0
+E
2
) (467)
=
n
k
)
_
k
n
k
) +
kk
[M
k
k
[
2
n
k
(1 n
k
))
1
k
q
_
(468)
=
k
+
[M
k
k
[
2
_
1 n
k
)
k
q
n
k
)
k
k
q
_
, (469)
where q = k
k
=
k
+
q
[M
q
[
2
_
1
k+q
2
q
n
k+q
)
(
k+q
k
)
2
(
q
)
2
_
. (470)
The rst term in independent of n
k+q
). We can see it as a single electron moving around in a lattice. It will create
phonons while scattering temporarily to a dierent state k + q. It will then reabsorb the phonon. Eectively, the
electron is moving around with a phonon cloud around it. This cloud will increase the eective mass of the particle.
This interaction can be very strong. We only treat it up to second-order perturbation. One can imagine that the
electron can emit more than one phonon at the time. The electron plus its accompanying phonon cloud is known
as a polaron. The name arises from the contraction of the lattice around the negatively-charge electron leading to a
polarization of the lattice.
The second term gives the change in the energy of the electron due to the presence of the other electrons. This
term gives a change in the group velocity of the electron. Note that for a free electron, the group velocity is given by
v
0
k
=
k
k
=
k
2
k
2
2m
=
k
m
=
p
m
. (471)
Taking the rst and third term in
k
and dierentiating with respect k gives
v
k
=
k
k
=
k
k
(472)
=
k
k
_
1
q
[M
q
[
2
2
q
n
k+q
)
(
k+q
k
)
2
(
q
)
2
_
. (473)
We now argue that the major contribution in the summation comes from the derivative of fast variation of n
k+q
)
around the Fermi energy E
F
. We replace the summation by an integration and approximate M
q
and
q
by their
averages M and . For the density of states, we take the value at E
F
. This gives
v
k
= v
0
k
_
1 2(E
F
)[M[
2
d
d
k
_
n(E))
(
k
E)
2
()
2
dE
_
. (474)
At low temperatures the occupation number is a step function and its derivative is therefore a -function at the Fermi
energy
dn(E))
d
k
= (E E
F
), (475)
where the minus arises from the fact that n(E) goes from 1 to 0.
v
k
= v
0
k
_
1 +
2(E
F
)[M[
2
(
k
E
F
)
2
()
2
_
. (476)
The innities around
k
= E
F
are a spurious result of our averaging procedure. We are often more interested
in how the group velocity around the Fermi level is aected by the interaction of the electron with the phonons. For
k
= E
F
, we have
v
k
= v
0
k
(1 ), (477)
92
with
=
2(E
F
)[M[
2
. (478)
An alternative way to look at it is by trying to reexpress
k
as an eective free-electron dispersion
v
k
=
k
m
= v
0
k
(1 ) =
k
m
(1 ), (479)
giving an eective mass of
m
=
m
1
. (480)
The electron-phonon interaction therefore gives a mass enhancement.
XXXI. ATTRACTIVE POTENTIAL
We now want to see if there is an eective attractive potential between electrons. We therefore look for a term
proportional to n
k
n
kq
) in Eqn. (459). Originally, we had ascribed this term to the scattering of an electron under
the emission of a phonon, followed by the recapturing of the phonon by the same electron. We want to reinterpret this
term as the interaction between two electrons through the mediation of a phonon. We can write the term proportional
to n
k
n
kq
) in the limit n
q
) 0 as
E
c
=
kq
[M
q
[
2
n
k
n
kq
)
kq
q
(481)
We can rewrite this by splitting up the summation and relabeling the second summation
E
c
=
1
2
kq
[M
q
[
2
n
k
n
kq
)
kq
1
2
kq
[M
q
[
2
n
kq
n
k
)
kq
q
(482)
=
1
2
kq
[M
q
[
2
2
q
(
kq
k
)
2
q
n
k
n
kq
), (483)
where we have used
q
=
q
. If we dene
V
q
= [M
q
[
2
2
q
(
kq
k
)
2
q
, (484)
we obtain an eective interaction
V =
1
2
kq
V
q
n
k
n
kq
). (485)
In general, we can write interaction between two particles as
V =
kk
q
V
q
c
k+q
c
q
c
k
c
k
. (486)
We can distinguish two contributions that are nonzero working on a Fermi sphere. We can take q = 0.
V =
1
2
kk
V
q
n
k
n
k
). (487)
This is called the Hartree or direct terms. If V
q
was a Coulomb interaction this would correspond to the interaction
of the electrons with each other. Another possibility is that k = k
q
V =
1
2
q
V
q
c
q
c
k
c
k
q
) =
1
2
kq
V
q
c
k
c
k
c
kq
c
kq
) =
1
2
kq
V
q
n
k
n
kq
), (488)
93
where the prime has been dropped on the right-hand side. Although usually the electron energies are signicantly
larger than the phonon energies, it is always possible to nd values for which [
kq
k
[ <
q
. This means that
the matrix element is negative. Although a negative matrix element in momentum space does not always guarantee
a negative interaction in real space, it does oer the possibility of an attractive interaction.
We can also obtain the attractive interaction by performing a canonical transformation on the Fr ohlich Hamiltonian.
Intermezzo:canonical transformation. We want to derive the eective interaction with a canonical transformation.
Let us rst review what a canonical transformation was. Let us consider a displaced harmonic oscillator
H = H
0
+H
ep
= a
a +M(a
+a), (489)
where we call the second term H
ep
to draw the parallel with the electron-phonon interaction. Note that this
Hamiltonian has similarities with the one we are considering. We can diagonalize this by taking displaced operators
A = a + and A
= a
+. (490)
We can then write
A
A = a
+ (a
+a) +
2
. (491)
This means we can write our Hamiltonian as
H = A
A, (492)
using M = or = M/ this gives =
2
= M
2
/ for the shift in energy. In order to treat a more
complicated Hamiltonian, we want to take a more systematic approach using a canonical transformation. We are
looking for a transformation of the form
H
= e
S
He
S
, (493)
that will eliminate H
ep
to lowest order. Expanding the exponentials gives
H
= (1 S +
1
2
S
2
)H(1 +S +
1
2
S
2
+ ) (494)
= H
0
+H
ep
+ [H
0
, S] + [H
ep
, S] +
1
2
[[H
0
, S], S] + . (495)
We neglect the higher-order terms and choose S such that the commutator with H
0
cancels H
ep
. If we achieve that,
we end up with
H
= H
0
+ [H
ep
, S]. . (496)
Let us try S or the form using
S = (a a
). (497)
We can now make a unitary transformation of a
a,
[a
a, S] = [a
a, a] [a
a, a] (498)
= (a
aa aa
a) (a
aa
a) (499)
= [a
, a]a a
[a, a
] (500)
= (a +a
) (501)
The idea for solving the electron-phonon interaction is very similar, except now we have k dependence and M is no
longer a simple constant, but contains c
c operators. However, in the end we want to look at the nal change in electron
energy, which is proportional to M
2
which would then become a c
k
c
k
c
k
+
q
a
q
a
q
(503)
94
and
H
kq
M
q
(a
q
+a
q
)c
k+q
c
k
(504)
we try
S =
kq
M
q
(Aa
q
+Ba
q
)c
k+q
c
k
. (505)
In order to obtain
H
ep
+ [H
0
, S] = 0, (506)
we need
A =
1
kq
k
+
q
and B =
1
kq
q
. (507)
This can be seen by working the relation on the ground state [E
0
). Let us consider the term with A, which gives
(H
0
S SH
0
)[E
0
) =
kq
M
q
A(H
0
a
q
c
k+q
c
k
a
q
c
k+q
c
k
H
0
)[E
0
) (508)
=
kq
M
q
A(E
0
+
kq
k
+
q
E
0
)a
q
c
k+q
c
k
[E
0
) (509)
=
kq
M
q
a
q
c
k+q
c
k
[E
0
), (510)
where in the last step we have used the value of A. Combined with the term with B, this will cancel H
ep
.
The transformed Hamiltonian then becomes
H
= H
0
1
2
_
_
kq
M
q
(a
q
+a
q
)c
k+q
c
k
,
kq
M
q
_
a
kq
k
+
q
+
a
q
kq
q
_
c
k+q
c
k
_
_
. (511)
Of the many terms in the Hamiltonian we pay special attention to those that arise from commuting the phonon
operators. This gives the eective Hamiltonian
H
= H
0
+
kq
[M
q
[
2
q
(
k
kq
) (
q
)
2
c
kq
c
k
c
kq
c
k
. (512)
XXXII. SUPERCONDUCTIVITY AND THE BCS HAMILTONIAN
Although the possibility of an attractive interaction between electrons is exciting, that it leads to such a surprising
phenomena as superconductivity is really startling. At most temperatures, conventional superconductors (i.e., not
high-temperature superconductors such as the cuprates) behave pretty much like normal metals. At the transition
temperature T
c
, the materials undergoes a phase transition which the spectacular eect that the resistivity goes to zero.
In addition, there is a discontinuity in the specic heat. Below T
c
, it has a dependence e
/k
B
T
, thereby providing
an indication of an energy gap in the excitation spectra. One also observes that magnetic elds are expelled from
the superconductor. This is known as the Meissner eect and can be used to levitate the superconductor.
Since the discovery of superconductivity in 1911 by Heike Kamerlingh Onnes, physicists have been looking for an
origin of the eect. Convicing evidence that the electron-phonon interaction is involved came from the discovery of
the isotope eect, where it was found that the critical temperature depended on the mass of the nuclei. Our starting
point is a Hamiltonian that includes the attractive interaction described in the preceding section
H = H
0
+
kk
W
kq
c
kq,
c
c
kq,
c
k
, (513)
95
where H
0
describes the Hamiltonian in the absence of electron-phonon interactions, ,
=
1
2
=, is the spin. The
matrix element is given by
W
kq
= [M
q
[
2
q
(
k
kq
) (
q
)
2
. (514)
We will not solve this problem according to the methods of Bardeen, Cooper, and Schrieer who used a variational
technique. We follow the method of Bogoliubov. Bogoliubovs assumption was that the condenstate should have zero
momentum. This implies that the electrons that couple with each other have opposite momentum, directly giving
q = 2k. We shall also assume that
= . It can be shown (which we will not do) that the ground state for opposite
spins (singlet pairing) is in fact lower in energy than the ground state for parallel spins (triplet pairing). This leads
to the eective Hamiltonian
H
BCS
=
k
c
k
c
k
kk
V
kk
c
c
k
c
k
, (515)
Where we have taken V
kk
= 2W
k,k
k
, i.e. the interaction is attractive when V
k
is positive. The factor 2 comes
from the summation over .
We will now look for eective operators that help diagonalize the Hamiltonian. The operators suggested by Bogoli-
ubov are
k
= u
k
c
k
v
k
c
k
and
k
= u
k
c
k
+v
k
c
k
, (516)
with conjugates
k
= u
k
c
k
v
k
c
k
and
k
= u
k
c
k
+v
k
c
k
. (517)
The constants u
k
and v
k
are real and positive and satisfy
u
2
k
+v
2
k
= 1. (518)
With this condition, the operators satisfy the commutation relations
k
,
k
=
k
,
k
=
k
,
= 0 (519)
k
,
k
=
k
,
=
k,k
. (520)
Using the Bogoliubov transformation does not allow us to diagonalize the Hamiltonian entirely. It will contain two-
particle term (including four operators). However, we hope that an appropiate choice of u
k
and v
k
will remove the
dominant terms.
We can rewrite H
BCS
, using the inverse operators
c
k
= u
k
k
+v
k
k
and c
k
= u
k
k
v
k
k
(521)
c
k
= u
k
k
+v
k
k
and c
k
= u
k
k
v
k
k
(522)
Let is rst rewrite the kinetic part of the Hamiltonian. Substituting the expressions for c and c
gives
H
0
=
k
c
k
c
k
=
k
[c
k
c
k
+c
k
c
k
] (523)
=
k
[u
2
k
k
+v
2
k
k
+u
k
v
k
(
k
+
k
k
) (524)
+v
2
k
k
+u
2
k
k
u
k
v
k
(
k
+
k
k
)]. (525)
H
0
can be simplied by introducing te number operators
m
k
=
k
; m
k
=
k
, (526)
and making use of the commutation relationships of the s. We then obtain
H
0
=
k
[2v
2
k
+ (u
2
k
v
2
k
)(m
k
+m
k
) + 2u
k
v
k
(
k
+
k
k
). (527)
96
There are three dierent types of terms here: a constant, a term containing the number operators m, and a term
containing creation and annihilation of pairs of quasi particles.
The interaction part of the Hamiltonian can now be written as
H
V
=
k
V
kk
(u
k
+v
k
k
)(u
k
v
k
k
)(u
k
k
v
k
k
)(u
k
k
+v
k
k
) (528)
=
k
V
k
[u
k
v
k
u
k
v
k
(1 m
k
m
k
)(1 m
k
m
k
) (529)
+u
k
v
k
(u
2
k
v
2
k
)(1 m
k
m
k
)(
k
+
k
k
)
_
+ higher order terms. (530)
We now want to eliminate all the o-diagonal terms to obtain a Hamiltonian of independent fermions . The rst
assumption is that there are the occupations m
k
and m
k
are zero in the ground state. We can later verify this
assumption. The o-diagonal terms that remain are
k
2
k
u
k
v
k
(
k
+
k
k
) +
kk
V
kk
u
k
v
k
(u
2
k
v
2
k
)(
k
+
k
k
) + higher order terms. (531)
If we neglect the higher-order contributions, we are left with
2
k
u
k
v
k
+ (u
2
k
v
2
k
)
V
kk
u
k
v
k
= 0. (532)
However, u
k
and v
k
are not independent due to the condition u
2
k
+ v
2
k
= 1, and we can express them in a single
variable x
k
,
u
k
=
_
1
2
x
k
and v
k
=
_
1
2
+x
k
. (533)
This transforms the equation into
2
k
_
1
4
x
2
k
+ 2x
k
V
kk
_
1
4
x
2
k
= 0. (534)
If we dene a new quantity
k
=
V
kk
_
1
4
x
2
k
, (535)
then we obtain
2
k
_
1
4
x
2
k
+ 2x
k
k
= 0
2
k
(
1
4
x
2
k
) = x
2
k
2
k
(536)
x
2
k
(
2
k
+
2
k
) =
1
4
2
k
(537)
or
x
k
=
k
2
_
2
k
+
2
k
(538)
Substituting x
k
in the expression for
k
, leads to the following integral equation
k
=
V
kk
k
_
2
k
+
k
, (539)
if V
kk
is known, we can use this to solve for
k
. Let us make a simple assumption for V
kk
. We know that the matrix
element is given by
V
kk
= 2[M
k
k
[
2
k
k
(
k
k
) (
k
k
)
2
. (540)
97
We want V
kk
to be positive for an attractive interaction. This is the region that [
k
k
[ <
k
k
. In general,
phonon frequency have an energy signicantly smaller the Fermi energy. E
F
is generally of the order of a few electron
volts, whereas phonon frequencies are in the region of several tenths of meV. Let us denote the maximum phonon
frequency
D
, known as the Debye frequency. Let us for mathematical simplicity (and also for lack of information of
a better k dependence) assume
V
kk
=
_
V if [
k
[ <
D
0 otherwise
(541)
Replacing the sum over momentum by an integral over energy, we obtain
=
1
2
_
(E)
E
2
+
2
V (E) =
1
2
V (E
F
)
_
D
E
2
+
2
dE, (542)
where we have assumed that the density of states does not vary too much in a region
D
around the Fermi surface.
This equation has the solution
=
D
sinh[1/V (E
F
)]
. (543)
An estimate of the magnitude of V (E
F
) can be obtained by noting that
V
=
[M
q
[
2
D
. (544)
Earlier we had seen that
[M
q
[
2
=
k
F
MN
D
[U
k
[
2
=
m
M
E
F
N
D
[U
k
[
2
. (545)
For a constant density of states, we can roughly estimate (E
F
) = N/E
F
. This leaves us with
[M
q
[
2
=
m
M
1
(E
F
)
D
[U
k
[
2
, (546)
and hence
V (E
F
)
=
m
M
_
U
k
D
_
2
. (547)
The value of
D
is of the order of 10
2
eV, whereas the screened Coulomb potential can be several eV. The mass
ratio m/M is about 10
5
. This makes V (E
F
) roughly equal to 0.1. We can therefore write
= 2
D
e
1/V (E
F
)
. (548)
This rough estimate shows that the is a small quantity, about 1% of the Debye energy and hence corresponding to
thermal energies of about 1 K.
Note that the total number of electrons is given by
N =
k
c
k
c
k
(549)
=
k
[2v
2
k
+ (u
2
k
v
2
k
)(m
k
+m
k
) + 2u
k
v
k
(
k
+
k
k
)] (550)
which reduces to
N) =
k
2v
2
k
. (551)
In the absence of interactions this would be
N) =
k
2, (552)
where the factor 2 comes from the spin. The presence of will cause a disturbance around the Fermi level given by
v
2
k
=
1
2
k
2
_
2
k
+
2
k
, (553)
which basically causes a smoothing of the electron density around E
F
, of the order of .
98
XXXIII. BCS GROUND STATE WAVEFUNCTION AND ENERGY GAP
First, we would like to see how the ground state energy is changed by the presence of an attractive electron-phonon
coupling. By performing a Bogliubov transformation, we have removed the o-diagonal terms. In addition, we have
assumed that m
k
= m
k
= 0. We can then write the ground state energies as
E
S
=
k
2
k
v
2
k
kk
V
kk
u
k
v
k
u
k
v
k
(554)
=
k
(1 + 2x
k
)
kk
V
kk
_
(
1
4
x
2
k
)(
1
4
x
2
k
). (555)
In the normal state, we have x
2
k
=
1
4
. We want to nd the dierence between the energy in the superconducting state
and the energy in the normal state, known as the condensation energy
E
c
= E
S
E
N
=
k<k
f
k
(2x
k
1) +
k>k
f
k
(2x
k
+ 1)
k
_
1
4
x
2
k
(556)
= 2(E
F
)
_
D
0
_
E
2E
2
+
2
2
E
2
+
2
_
dE (557)
= D(E
F
)
_
(
D
)
2
D
_
(
D
)
2
+
2
_
(558)
= (
D
)
2
(E
F
)
_
1 coth
_
1
V (E
F
)
__
. (559)
In the weak coupling regime, this becomes
E
c
= 2(
D
)
2
(E
F
)e
2/V (E
F
)
=
1
2
(E
F
)
2
. (560)
This condensation energy is very small, only of the order of 10
7
eV per electron, equivalent to about a millidegree
Kelvin. Obviously, these are all the electrons and only the electrons in the region E
F
to E
F
+ are acutally
aected by the attractive interaction, i.e. only a fraction /E
F
of the whole number. Note that we cannot expand
in terms of the interaction V , since it appears as 1/V and has a singularity as V 0. Therefore, superconductivity
cannot be understood in terms of perturbation theory.
In our derivation of the ground state, we had assumed that m
k
and m
k
are zero for all k. This means that there
are no quasiparticles in the ground state and
k
[0) =
k
[0) = 0 (561)
Since the s are fermions, we have
k
k
= 0 and
k
k
= 0, we can satisfy the above criterium by creating a
ground state by working with the s on the vacuum
(
k
)[0) = [
k
(u
k
c
k
v
k
c
k
)(u
k
c
k
+v
k
c
k
)][0) (562)
= [
k
(u
k
v
k
+v
2
k
c
k
c
k
)][0). (563)
We see that this is not normalized. We can do this by dividing by v
k
,
[
0
) = [
k
(u
k
+v
k
c
k
c
k
)][0). (564)
We see that this ground state does not conserve the total number of particles. However, we need to be sure that the
average number of particles is still equal to N.
The quasi-particle excitation of the system are created by the operators
k
and
k
. We can look again at the
Hamiltonian and look at the terms with m
k
and m
k
. This gives
H
BCS
= E
S
+
k
(m
k
+m
k
)[(u
2
k
v
2
k
)
k
+ 2u
k
v
k
V
kk
u
k
v
k
] + higher order terms, (565)
99
which becomes
H
BCS
= E
S
+
k
_
2
k
+
2
(m
k
+m
k
) + (566)
We see that the energy to make an elementary excitation is
E
k
=
_
2
k
+
2
. (567)
Note that this energy is always larger than . This is the opening of a superconducting gap.
Let us try to make an excitation with respect to the superconducting ground state. We now that
k
[
0
) =
k
[
0
) = 0, (568)
which therefore do not produce excitations. However, operating with their conjugates does
k
[
0
) = (u
k
c
k
v
k
c
k
)
k
(u
k
+v
k
c
k
c
k
)[0) (569)
= (u
2
k
c
k
+u
k
v
k
c
k
c
k
c
k
u
k
v
k
c
k
v
2
k
c
k
c
k
c
k
)
=k
(u
k
+v
k
c
)[0). (570)
The second term and third term are zero because of the c
k
c
k
and c
k
working on the vacuum, respectively. The
fourth term can be rewritten as v
2
k
c
k
c
k
c
k
= v
2
k
c
k
c
k
c
k
= v
2
k
c
k
, since c
k
c
k
working on the vacuum
gives unity. Combining the two remaining terms and making use of the fact that u
2
k
+v
2
k
= 1, gives
k
[
0
) = c
=k
(u
k
+v
k
c
)[0), (571)
which therefore creates an electron in a superconducting back ground. Similarly we can nd
k
[
0
) = c
=k
(u
k
+v
k
c
)[0). (572)
Apparently, the onset of superconductivity has changed the density of states. However, we see that there is a one-to-
one correspondence between the quasiparticle excitations and the electrons. We should therefore also have a unique
correspondence between the superconducting density of states
s
(E) and
n
() (note the energies are related to E
k
and
k
, respectively)
s
(E)dE =
n
()d. (573)
Since we are only interested in the changes in density of states very close to the Fermi level, we can assume that ()
is approximately constant. We then nd
s
(E)
n
()
=
d
dE
=
_
E
E
2
2
E >
0 E <
, (574)
where we have used E =
2
+
2
and the fact that no density of states is associated with energies less than the gap
energy .
XXXIV. TRANSITION TEMPERATURE
We managed to diagonalize the BCS Hamiltonian by choosing u
k
and v
k
such that the o-diagonal matrix elements
were zero. The resulting equation was of the form
k
[2
k
u
k
v
k
V
kk
u
k
v
k
(1 m
k
m
k
)(u
2
k
v
2
k
)](
k
+
k
k
) = 0, (575)
which we solved by putting m
k
= m
k
k
2
k
v
2
k
+
k
(u
2
k
v
2
k
)
k
(m
k
+m
k
)
kk
V
kk
u
k
v
k
u
k
v
k
(1 m
k
m
k
)(1 m
k
m
k
).
The energy to create a quasi-particle excitation is then given by
E
k
=
H
BCS
)
m
k
)
= (u
2
k
v
2
k
)
k
+ 2u
k
v
k
V
k
u
k
v
k
(1 m
k
) m
k
)). (576)
Since the quasiparticles are eective fermions, they should obey Fermi-Dirac statistics
m
k
) = m
k
). =
1
exp(E
k
/k
B
T) + 1
. (577)
There is no chemical potential since the quasiparticles are excitations with respect to the ground state. We can obtain
an approximate solution by replacing the number operators by the Fermi function
k
[2
k
u
k
v
k
(u
2
k
v
2
k
)
V
kk
u
k
v
k
[1 2f(E
k
)] = 0. (578)
The only dierence with the original condition of Eqn. (532) is the presence of the Fermi functions in the factor
(1 2f(E
k
). We again make the substitution for the gap energy but now including the Fermi function
k
(T) =
V
kk
_
1
4
x
2
k
[1 2f(E
k
)]. (579)
and obtain again
x
k
=
k
2
_
2
k
(T) +
2
k
. (580)
The temperature-dependent gap equation is then
k
(T) =
V
kk
k
(T)
_
2
k
+
k
(T)
[1 2f(E
k
)] (581)
and by taking the thermal average of the energy The energy to create a quasi-particle excitation is then given by
E
k
= (u
2
k
v
2
k
)
k
+ 2u
k
v
k
V
k
u
k
v
k
[1 2f(E
k
)] =
_
2
k
+
2
k
(T), (582)
which is equivalent to the quasiparticle excitation energy except that the gap is now temperature dependent.
Now since
1 2f(E
k
) = 1 2
1
e
E
k
k
B
T
+ 1
=
e
E
k
k
B
T
+ 1 2
e
E
k
k
B
T
+ 1
=
e
E
k
2k
B
T
e
E
k
2k
B
T
e
E
k
2k
B
T
+e
E
k
2k
B
T
= tanh
E
k
2k
B
T
, (583)
we obtain for the gap equation
k
(T) =
V
kk
k
(T)
_
2
k
+
k
(T)
tanh
_
2
k
+
2
k
(T)
2k
B
T
. (584)
Using the approximations of a constant density of states and an constant interaction limited to a region of width
2
D
around the Fermi level, we obtain the equation
V (E
F
)
_
0
dE
tanh
E+(T)
2k
B
T
_
E
2
+ (T)
= 1. (585)
101
At T = 0, the argument of the tanh, which contains 1/T goes to innity, and hence the tanh approaches 1 and
we recover the original gap equation. For nite temperatures, the numerator decreases, which implies that the
denominator should decrease as well. This means that the value of decreases. At rst, this decrease is very slow,
until k
B
T becomes comparable to the gap energy . We obtain a rapid increase in quasiparticle excitations. We then
obtain a more rapid drop in until it vanishes at the transition temperature. For (T
c
) = 0, we have
V (E
F
)
_
0
dE
tanh
E
k
B
T
c
E
= 1 (586)
or
V (E
F
)
_
0
dx
tanh x
x
= 1. (587)
The integral is somewhat annoying but partial integration gives
[ln xtanh x]
D
0
_
D
0
1
cosh
2
x
ln xdx =
1
V (E
F
)
. (588)
In the weak coupling limit, k
B
T
c
is expected to be small compared to
D
. This means that the tanh will be close to
unity and that we can take the integral to innity:
ln
D
2k
B
T
c
_
0
1
cosh
2
x
ln xdx =
1
V (E
F
)
. (589)
The integral turn out to be 0.44 (????0.56?), which gives
D
2k
B
T
c
= e
0.44+
1
V (E
F
)
k
B
T
c
= 2
D
e
0.44
e
1
V (E
F
)
= 1.14
D
e
1
V (E
F
)
. (590)
Comparing this with the equation for the gap in the weak-coupling limit gives
2(0)
k
B
T
c
= 3.50. (591)
This is reasonable agreement with most experimentally observed values for BCS-type superconductors.
XXXV. GINZBURG-LANDAU THEORY
Ginzburg-Landau theory is a phenomenological theory which was developed before BCS theory. Ginzburg and
Landau suggested a pseudowavefunction (r) and suggested a free energy expanded in terms of [[
2
and [[
2
.
Although relatively successful in explaining a number of phenomena, the lack of microscopic derivation led to an
initial neglect of the work in the Western literature. However, in 1959, Gorkov demonstrated that the Ginzburg-
Landau theory can be directly related to BCS theory in the limit that the spatial variations of (r) are not too rapid
and that the temperature are close to the transition temperature. The pseudowavefunction turns out to be directly
related to the gap parameter (r). The derivation is however rather lenghty and will be omitted.
The basic postulate of Ginzburg and Landau is that the free energy in the superconducting state F
s
can be expanded
in a series of the form
F
s
= F
n
+[[
2
+
2
[[
4
+
1
2m
[(ieA)[
2
, (592)
where F
n
is the free energy in the normal state; m
= 2m corresponding to the Cooper pairs in the supercurrent. However, the mass might
be dierent from 2m due to mass renormalization related to the electron-phonon interactions. When the spatial
variations are small and in the absence of external elds, the dierence is given by
F
s
F
n
= [[
2
+
2
[[
4
. (593)
102
As is obvious, we have to dierent cases. Either [[
2
= 0, in which case we have the normal state, or
[[
2
= [
[
2
=
, (594)
where by
, we imply the pseudowavefunction deep inside the superconductor or for an innite system where the
eects of external elds are negligible. Since [[
2
is always positive,
2
2
. (595)
This energy can be related to the Meissner eect found in 1933. Meissner and Ochsenfeld discovered the the magnetic
eld is excluded from a superconductor. In addition, not only is the magnetic eld excluded from a superconductor,
which might be explained by a perfect conductor, the magnetic eld is also expelled from the superconductor when
going through the transition temperature. This cannot be explained by perfect conductivity which would trap the
ux lines inside the superconductor. When can dene the critical eld as the work done by the magnetic eld to
destroy the superconductivity, i.e.
1
2
0
H
2
= F
n
F
s
=
2
2
. (596)
Apparently should change sign at the transition temperature. If we only take the leading term, we have
=
0
(
T
T
c
1). (597)
This directly says that
[[
2
(1
T
T
c
)
_
1
T
T
c
, (598)
where we have made use of the Gorkovs result that and are related. Using variational methods, we can also
write the problem in the celebrated Ginzburg-Landau dierential equation
+[[
2
+
1
2m
[(ieA)[
2
= 0. (599)
To gain some insight into this equation let us consider it in one dimension and in the absence of external elds and
introduce a normalized wavefunction f = /
with
2
= /, giving
2
2m
d
2
dx
2
+ +
3
= 0
2
2m
d
2
dx
2
+
= 0, (600)
which gives
2
2m
[[
d
2
f
dx
2
+f f
3
= 0. (601)
This makes it natural to introduce a characteristic length (T) given by
(T)
2
=
2
2m
[[
(T)
1
_
1
T
T
c
. (602)
(T) is known as the coherence length. Note that it diverges as the temperature approaches T
c
. This means that
the superconducting state becomes very sensitive to minor disturbances. Compare this to ferromagnetism, where the
103
susceptibility diverges close to the critical temperature. Although the magnetization is small, the magnetism is very
sensitive to external eects such as an applied magnetic eld. We can then rewrite the dierential equation as
(T)
2
d
2
f
dx
2
+f f
3
= 0. (603)
Let us consider the situation where the f is close to unity. We can then write f(x) = 1 + g(x), where g(x) 1. To
lowest order we can write
(T)
2
d
2
g
dx
2
+ (1 +g + ) (1 + 3g + ) = 0 (604)
or
d
2
g
dx
2
=
2
(T)
2
g, (605)
which has a solution
g(x) exp(
2x
(T)
). (606)
This means that a small disturbance of from
A)
2
2m
, (607)
where m
and e
are the eective mass and charge of the Cooper pairs, respectively. However, if the vector potential
is of the form
A = (A, 0, 0), (608)
where A is a constant, then A would vanish and there would be no magnetic eld. The eigenstates would be
(r) = exp[i(k
x
x +k
y
y +k
z
z)] (609)
with energies
E =
2
2m
_
(k
x
)
2
+k
2
y
+k
2
z
_
. (610)
Let us now consider a superconducting ring. The energy in the x direction is then given by
E =
2
2m
(k
x
2eA
)
2
, (611)
where we have introduced the eective charge 2e for superconducting pairs. We have dened an eective mass m
(which could in principle also include some bandstructure eects). The wavevector in the x direction is quantized
giving
E =
2
2m
(
2n
L
2eA
)
2
, (612)
Although the Meissner eect will remove the magnetic eld inside the superconductor, there can still be ux inside
the ring. This magnetic ux is given by
=
_
B da =
_
A da =
_
A dl = AL, (613)
104
where L is the circumference of the ring. We can then write the energy as
E =
2
2
m
L
2
(n
eAL
)
2
=
2
2
m
L
2
(n
e
)
2
. (614)
The total energy of the system as a function of the ux then becomes a periodic function of . The energy is given
by parabola centered around n
0
. If n = 0, there will be a minimum in the energy around zero. If the ux increases
the energy increases. However, if the ux exceeds /e, then if becomes energetically favorable for the electron to
make a transition to n = 1. The new energy minimum then lies around = /e, and so on. This leads us to the
conclusion that the ux prefers to be quantized in ux quanta
0
, i.e.
= n
0
= n
e
. (615)
We can also look at the current as a function of the ux. The current is related to the ux as follows
I m
dr
dt
= k 2eA =
2n
L
2e
L
E
. (616)
The current is therefore directly proportional to . It crosses zero at n
0
and makes discrete jumps around (n+
1
2
)
0
.
The concepts above can be applied to an interesting device known as a Josephson junction. Let us consider a ring
with a narrow insulating layer somewhere in the ring. This layer should be thin enough that it allows tunneling of
superconducting pairs. Let us rst envision an experiment where the ux is held constant. Since E =
A
t
=
1
L
t
,
the electric eld is zero and hence V = EL = 0. However, we saw that for ,= n
0
, there is a nite current.
Therefore, there must be a current a zero voltage. This is known as the DC (direct current) Josephson eect.
The other experiment that we can envision is keeping the voltage constant. This implies that the ux is = V t.
Therefore the current will be a sawtooth as a function of time. The current will be alternating with a frequency
0
= V T =
2
T
=
2V
0
=
2eV
, (617)
which is of the order of
1
2
GHz per micronvolt. This is known as the AC Josephson eect.
APPENDIX A: 666
The number 666 has a pretty bad reputation, related to its mentioning in the Book of Revelation 13:17-18:
And that no man might buy or sell, save he that had the mark, or the name of the beast, or the number of his
name. Here is wisdom. Let him that hath understanding count the number of the beast: for it is the number of a
man; and his number is six hundred threescore and six.
Many scholars believe that the number 666 stands for the emperor Nero. In Hebrew gematria, every letter corre-
sponds to a number. The Hebrew transliteration of the Greek Neron Kaisar adds up to 666.
It has some other nice properties as well:
66
i=1
i = 666, (A1)
so the number on the roulette table add up to 666. In Roman numerals, we have
666 = DCLXV I, (A2)
so 666 uses every Roman numeral symbol (except M). Also, 666 is the sum of the rst seven prime numbers
2
2
+ 3
2
+ 5
2
+ 7
2
+ 11
2
+ 13
2
+ 17
2
= 666. (A3)
If you are afraid of the symbolic meanings behind the number 666, you might be suering from hexakosioihexekonta-
hexaphobia. (If the number 13 worries you, it is called triskaidekaphobia.)
And poor 667? 23 29?
105
Homework 9/24/09
We want to consider the formation of a sp hybrid between carbon and hydrogen atoms. We take the z-axis along the
carbon-hydrogen bond.
1a. Let us consider the bonding orbitals. Denote the carbon 2s, carbon 2p
z
, and hydrogen 1s as [C
2s
), [C
2p
z
), and
[H
1s
), respectively. (the carbon 1s orbitals are strongly bound and can be neglected). Write the Hamiltonian as a
matrix, using
T
ss
= C
2s
[H[H
1s
), T
ps
= C
2p
z
[H[H
1s
),
C
= C
2s
[H[C
2s
) = C
2p
z
[H[C
2p
z
),
H
= H
1s
[H[H
1s
)
1b. We see that the carbon 2s and 2p
z
both couple to the hydrogen. Combine these two orbitals into sp hybdrids
and calculate the coupling between the hydbrids and the hydrogen.
1c. Write down the 33 matrix for the carbon sp hybrids and hydrogen and determine the eigenenergies for T
ss
= T
sp
.
Let us consider a innite one-dimensional chain of hydrogen atoms with interatomic distance a. Apply periodic
boundary conditions. We only consider the 1s orbitals.
2a. Derive the eigenenergies for a nearest-neighbor matrix element t
1
(distance a) and a next-nearest-neighbor matrix
elements t
2
(distance 2a).
2b. Plot the eigenvalues as a function of momentum k with
a
k
a
. Take t
2
=
1
2
t
1
< 0.
2c. For what k-values are the eigenvalues maximum.
106
Answers Homework 9/24/09
1a.
H =
_
_
O
0 T
ss
0
O
T
sp
T
ss
T
sp
H
_
_
(A4)
1b. The sp hybrids are given by
[O
sp
) =
1
2
([O
2s
) [O
2p
z
)). (A5)
The coupling to the Hydrogen is now
H
1s
[H[O
sp
) = H
1s
[H
1
2
([O
2s
) [O
2p
z
)) =
1
2
(T
ss
T
sp
) (A6)
1c.
H =
_
_
_
O
0
1
2
(T
ss
T
sp
)
0
O
1
2
(T
ss
+T
sp
)
1
2
(T
ss
T
sp
)
1
2
(T
ss
+T
sp
)
H
_
_
_ (A7)
when T = T
ss
= T
sp
, the eigenvalue problem becomes
H =
_
_
O
E 0 0
0
O
E
2T
0
2T
H
E
_
_
(A8)
This has one eigenvalue E =
O
, the other two eigenvalues are given by the determinant of the remaining 22 matrix:
(
O
E)(
H
E) 2T
2
= E
2
(
O
+
H
)E +
O
H
2T
2
(A9)
giving
E
1,2
=
1
2
(
O
+
H
)
1
2
_
(
O
+
H
)
2
4(
O
H
2T
2
) =
1
2
(
O
+
H
)
1
2
_
(
O
H
)
2
+ 8T
2
(A10)
2a.
k
= 2t
1
cos ka + 2t
2
cos 2ka. (A11)
2c.
d
k
dk
= 2t
1
sin ka 4t
2
sin 2ka = 2t
1
sin ka 8t
2
sin ka cos ka (A12)
= 2 sin ka(t
1
+ 4t
2
cos ka). (A13)
This gives
cos ka =
t
1
4t
2
=
1
2
k =
2
3a
(A14)
107
FIG. 62: The NaCl lattice.
Homework 10/15/09
1a. Figure 62 shows the lattice for sodium chloride (NaCl). Express the lattice in lattice vectors plus a basis.
1b. Calculate the structure factor S
K
.
1c. Show for what wave vectors Bragg diraction occurs. What type of lattice do these (reciprocal) wavevectors
form?
1d. Calculate the intensities of the Bragg peaks.
2a. An atom is surrounded by four other atoms in planar symmetry, see Fig. 63. We are considering the 2p orbitals
(p
with = x, y, z) on the central atom. On the surrounding atom we consider only the 1s orbitals (denoted as s
i
,
where i = 1, 2, 3, 4 numbers the surrounding atoms). The 2p orbitals hybridize with the neighboring 1s orbitals . In
other words, there are nite matrix elements p
[H[s
i
). However, due to symmetry not all matrix are nite. What
combinations of and i give a nonzero matrix element?
2b. Calculate the eigenenergies (denote the value of the nonzero matrix elements as t).
2c. The hybridization has split the 2p level. Is the splitting what you would expect from symmetry considerations?
(Explain).
FIG. 63: A central atom is surrounded by four atoms in a planar symmetry.