Advanced Computational Chemistry: Pekka Manninen 2009
Advanced Computational Chemistry: Pekka Manninen 2009
Pekka Manninen
2009
2
Contents
3
3.1.3 Gaussian-type orbitals . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.2 Gaussian basis sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.3 Integrals over Gaussian basis sets . . . . . . . . . . . . . . . . . . . . . . . 37
3.3.1 Gaussian overlap distributions . . . . . . . . . . . . . . . . . . . . . 38
3.3.2 Simple one-electron integrals . . . . . . . . . . . . . . . . . . . . . . 38
3.3.3 The Boys function . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.3.4 Obara–Saika scheme for one-electron Coulomb integrals . . . . . . . 40
3.3.5 Obara–Saika scheme for two-electron Coulomb integrals . . . . . . . 41
3.4 Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.5 Exercises for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
4 Second quantization 45
4.1 Fock space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
4.1.1 Creation and annihilation operators . . . . . . . . . . . . . . . . . . 46
4.1.2 Number-conserving operators . . . . . . . . . . . . . . . . . . . . . 48
4.2 The representation of one- and two-electron operators . . . . . . . . . . . . 49
4.3 Commutators and anticommutators . . . . . . . . . . . . . . . . . . . . . . 51
4.3.1 Evaluation of commutators and anti-commutators . . . . . . . . . . 52
4.4 Orbital rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
4.5 Spin in second quantization . . . . . . . . . . . . . . . . . . . . . . . . . . 53
4.5.1 Operators in the orbital basis . . . . . . . . . . . . . . . . . . . . . 54
4.5.2 Spin properties of determinants . . . . . . . . . . . . . . . . . . . . 57
4.6 Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
4.7 Exercises for Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
5 Hartree–Fock theory 61
5.1 The Hartree–Fock approximation . . . . . . . . . . . . . . . . . . . . . . . 61
5.2 Restricted and unrestricted Hartree–Fock theory . . . . . . . . . . . . . . . 63
5.2.1 Hartree–Fock treatment of H2 . . . . . . . . . . . . . . . . . . . . . 63
5.3 Roothaan–Hall equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
5.3.1 On the convergence of the SCF method . . . . . . . . . . . . . . . . 66
5.3.2 Integral-direct SCF . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
5.4 Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
5.5 Exercises for Chapter 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
6 Configuration interaction 73
6.1 Configuration-interaction wave function . . . . . . . . . . . . . . . . . . . . 73
6.1.1 Single-reference CI wave functions . . . . . . . . . . . . . . . . . . . 74
6.1.2 Multi-reference CI wave functions . . . . . . . . . . . . . . . . . . . 74
6.1.3 Optimization of the CI wave function . . . . . . . . . . . . . . . . . 75
6.1.4 On the disadvantages of the CI approach . . . . . . . . . . . . . . . 75
6.2 Multi-configurational SCF theory . . . . . . . . . . . . . . . . . . . . . . . 76
6.2.1 MCSCF wave function of H2 . . . . . . . . . . . . . . . . . . . . . . 76
4
6.2.2 Selection of the active space . . . . . . . . . . . . . . . . . . . . . . 77
6.3 Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
6.4 Exercises for Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
5
6
Chapter 1
where Ĥ is the molecular Hamiltonian, which is for a molecule that contains N electrons
and K nuclei (assumed to be fixed in space)
N
X N X
X N
Ĥ = ĥi + ĝij , (1.2)
i=1 i=1 j=i+1
The eigenfunctions or wave functions Ψ(x1 , . . . , xN ) describe the electronic structure and
properties of the molecule. {x} are used to collectively symbolize all variables needed to
refer to particles; here the position and the internal degree of freedom, spin. It is necessary
that the wave function is square-integrable,
Z
Ψ⋆ (x1 , x2 , . . . , xN )Ψ(x1 , x2 , . . . , xN )dx1 dx2 · · · dxN = K (K finite). (1.4)
7
energy values Em , En possess the orthogonality property
Z
Ψ⋆m Ψn dx1 dx2 · · · dxN = 0. (1.5)
The wave function is correspondingly dependent on both the electron and nuclear variables.
Usually, because of the large ratio mn /me , it is a good approximation to separate these,
and we may turn our attention to the electronic problem. This is referred to as the Born–
Oppenheimer approximation. The nuclear-motion Schrödinger equation plays, however, a
central role in molecular spectroscopy. The motion of nuclei can sometimes affect even
those properties that are considered electronic. This is the case e.g. in the Jahn–Teller
effect and the Renner effect. For fixed nuclei, we can write the total energy of the system
simply as
1 X ′ ZI ZJ
Etot = Eelectronic + . (1.7)
2 IJ |RI − RJ |
Eq. (1.1) provides a basis for molecular quantum mechanics and all the static electronic
properties of the molecule may be obtained by expectation values of the appropriate Her-
mitian operators or using perturbation theory. However, we should note the limitations of
this model:
• Eq. (1.1) is soluble exactly only for a few trivial one-electron systems, such as
hydrogen-like atom or the molecule H+2 (see Exercise 1.1)
• Real experiments are not carried out for an isolated molecule, and observation always
involves interaction with the system, and thus a time-independent description is not
sufficient; but the time-independent form of (1.1), ĤΨ = i∂Ψ/∂t, should be applied.
8
1.1.1 Helium-like atom
For a helium-like atom with a point-like nucleus of charge Z the electronic Hamiltonian,
Eq. (1.2), is
1 Z 1 Z 1
Ĥ = ĥ1 + ĥ2 + ĝ12 = − ∇21 − − ∇22 − + .
2 r1 2 r2 r12
Due to ĝ12 , this is a three-body problem, and thereby no closed form solution exists for
the eigenvalue equation (1.1). We start our consideration by an approximation, where ĝ12
is set to zero, i.e. Ĥ0 = ĥ1 + ĥ2 . We can separate the variables in the eigenvalue equation
H0 Ψ0 = EΨ0 by substituting2
meaning that χ1 and χ2 are solutions of the one-electron eigenvalue problem ĥχ = ǫχ. Φ
is a simultaneous eigenstate of the operators ĥ1 and ĥ2 ,
The 1-electron 1-center eigenfunctions that are solutions to the one-electron eigenvalue
problem (this holds only in the absence of ĝ) are the same with the solutions of the
Schrödinger equation for a hydrogen-like atom (note the factor Z),
9
In the above, L denotes the associated Laguerre polynomials
1 dn
Lαn (x) = exp(x)x−α n exp(−x)xn+α (1.11)
n! dx
and P the Legendre polynomials
(−1)m 2 m/2 d
l+m
(l − m)! m
Plm (x) = (1 − x ) (x2 − 1)l ; Pl−m = (−1)m P (x). (1.12)
2l l! dxl+m (l + m)! l
These shall be referred to as atomic orbitals (AO). However, this is used also for other
atomic-centered functions with different radial parts, such as sc. Slater- or Gaussian-type
functions.
φ(r) ∼ cA χA
100 (r) (densities close to nucleus A)
φ(r) ∼ cB χB
100 (r) (densities close to nucleus B)
where cA and cB are some numerical factors. An appropriate trial MO would then be
φ(r) = cA χA B
100 (r) + cB χ100 (r).
Due to the symmetry of the molecule, it is reasonably to assume that there is no higher
density in the other end than in the other; therefore
10
n=1 n=2 n=3
l=0 l=0 l=1 l=0 l=1 l=2
m=0
11
m=1
m=2
Table 1.1: Some lowest-energy solutions χnlm (Z = 1) (blue for positive, orange for negative phase)
Figure 1.1: The two lowest MOs of H+
2.
implies that cB = ±cA . Thus, it appears that we can construct two rudimentary MOs from
the 1s AOs on the two centers, denoted by
where the remaining numerical factors c′A and c′B are to be chosen so that the wave functions
will be normalized. These are sketched in Figure 1.1.
MOs build up this way, in linear combination of atomic orbitals (LCAO), are of great
importance throughout molecular quantum mechanics.
σ describes the spin state and is a solution of an eigenvalue equation ŝz σ = λσ.
If the orbital state is of energy ǫ, then
ĥψ = ǫψ (1.14)
ŝz ψ = λψ, (1.15)
and ψ is a state in which an electron simultaneously has definite energy and z-component
of spin.
12
According to observations, the eigenvalue equation (1.15) has only two solutions for
electrons:
1 1
ŝz α = α; ŝz β = − β
2 2
often referred to as “spin-up” and “spin-down”. Therefore an orbital φ yields two possible
spin-orbitals, ψ = φα and ψ ′ = φβ with the normalization and orthogonality properties
Z Z Z Z
⋆ ⋆
α αdms = β βdms = 1, α βdms = β ⋆ αdms = 0.
⋆
(1.16)
with eigenvalues of M and S(S + 1), respectively, are needed in the discussion of many-
electron system. They commute with a spin-free electronic Hamiltonian; and any exact
stationary state Ψ is an eigenfunction of them.
where the operator P̂ permutes the arguments and εP is +1 for an even and −1 for an odd
number of permutations.
The choice for an system consisting of N electrons is once and for all determined by the
antisymmetry principle (Pauli exclusion principle): The wave function Ψ(x1 , x2 , . . . , xN )
that describes any state of an N-electron system is antisymmetric under any permutation
of the electrons.
Let us now work out the N-electron wave function. We start by introducing a product
function of one-electron wave functions (spin-orbitals):
13
expressed as a Slater determinant
ψ1 (x1 ) ψ2 (x1 ) · · · ψN (x1 )
ψ1 (x2 ) ψ2 (x2 ) · · · ψN (x2 )
Φ(x1 , x2 , . . . , xN ) = M .. .. .. .. , (1.20)
. . . .
ψ (x ) ψ (x ) · · · ψ (x )
1 N 2 N N N
by which all symmetric permutations are automatically rejected, and thus the Pauli prin-
ciple is fulfilled. Expanded in Slater determinants the exact wave function is written as
X
Ψ(x1 , x2 , . . . , xN ) = ck Φk (x1 , x2 , . . . , xN ). (1.21)
k
The normalization factor M is reasoned in the following way: when expanded, Φ consists
of N! products, and thus an integrand Φ⋆ Φ would consist of (N!)2 products, each of a form
±[ψ1 (xi )ψ2 (xj ) · · · ψN (xk )][ψ1 (xi′ )ψ2 (xj ′ ) · · · ψN (xk′ )]. It is evident that unless i = i′ , j =
j ′ etc. in a particular product, the product will give no contribution to the result, due to the
orthogonality of spin-orbitals. Therefore, there are N! non-vanishing contributions, R ⋆ and as
every contribution is equal to 1 with normalized spin-orbitals, we have Φ Φdx1 · · · dxN =
M 2 N!, hence the normalization factor has the value M = (N!)−1/2 .
1
Φ2 = √ |1sα 2sα|
2
1
Φ3 = (|1sα 2sβ| + |1sβ 2sα|)
2
1
Ψ4 = √ |1sβ 2sβ|
2
1
Ψ5 = (|1sα 2sβ| − |1sβ 2sα|) .
2
14
1.4.2 Remarks on “interpretation”
What is the physical meaning of the wave functions introduced earlier? First of all, the
general formulation of quantum mechanics is concerned only with symbolic statements in-
volving relationships between operators and operands, and is independent of the employed
“representation” or “interpretation”.
• We may also think an electron of being delocalized, i.e. not having a more precise
position than that indicated by its spatial part of the wave function φ(r). Then
|φ(r)|2dr would be the amount of charge in an element dr. In other words, in this
point of view a MO can be thought to approximately describe an electron spread
around the nuclear framework. However, as we will see later, MOs can be unitarily
“rotated”, i.e. their form shaped, and they still correspond to the same definite
energy. Which MOs are then closest to the physical ones?
S is referred to as an overlap matrix. The equation (1.22) is the essence of the Slater
method. Observe the introduced Dirac notation for integrals. The coefficients c are then
15
determined by a condition called the secular determinant
H11 − E H12 ... H1n
H21 H22 − E ... H2n
.. .. .. .. = 0. (1.25)
. . . .
H Hn2 . . . Hnn − E
n1
but since the Hamiltonian does not operate on spins, we may perform the spin-integration
immediately [c.f. Eq. (1.16)],
Z
[α(ms1 )β(ms2 ) − α(ms2 )β(ms1 )]⋆ [α(ms1 )β(ms2 ) − α(ms2 )β(ms1 )] dms1 dms2 = 2.
Thus we have
Z
H11 = χ⋆100 (r1 )χ⋆100 (r2 )Ĥχ100 (r1 )χ100 (r2 )dr1 dr2
Z Z
= ⋆
χ100 (r1 )ĥ1 χ100 (r1 )dr1 χ⋆100 (r2 )χ100 (r2 )dr2
Z Z
+ χ100 (r2 )ĥ2 χ100 (r2 )dr2 χ⋆100 (r1 )χ100 (r1 )dr1
⋆
Z
+ χ⋆100 (r1 )χ⋆100 (r2 )ĝχ100 (r1 )χ100 (r2 )dr1 dr2
= 2ǫ1s + J
This result is still a crude approximation, the only difference to the independent-particle
model being the contribution J from the two-electron integral. The next step in the
determination of He energy levels would be to include more and more Slater determinants
16
corresponding to excited state configurations and evaluate the respective matrix elements.
As anticipated earlier, although the exact solution would require an infinite number of
included determinants, the energy spectrum would in practise converge towards the correct
one rather rapidly. However, the analysis of the many-electron problem using this kind of
direct expansion would obviously be extremely tedious, and a more general approach is
needed. Also, the evaluation of integrals gets far more complicated in the case of many
atomic cores.
The expectation value of the two-electron part is a bit more complicated, as contribu-
tions may arise from terms that differ by an interchange of two electrons; but essentially a
similar argumentation leads to
* +
X X
′ ′
Φ ĝij Φ = (hψr ψs |ĝ| ψr ψs i − hψr ψs |ĝ| ψs ψr i) , (1.28)
ij rs
where the former part is referred to as the Coulomb integral and the latter as the exchange
integral. the expectation value of the energy being simply the sum of (1.27) and (1.28).
We also need the off-diagonal elements. Fortunately enough, there are non-vanishing
elements only in two cases:
These results are called Slater’s rules. In Slater’s rules, the spin-orbitals are required to be
orthonormal, but the rules may be generalized for matrix elements even in a non-orthogonal
spin-orbital basis.
17
1.5 Further reading
• R. McWeeny, Methods of Molecular Quantum Mechanics (Academic Press, 1992)
where |φ1 φ2 | is the two-electron Slater determinant. Show that we may separate the
singlet function into a symmetric spatial part and an antisymmetric spin part:
1
Ψ(x1 , x2 ) = Ψ(r1 , r2 ) √ [α(1)β(2) − β(1)β(2)]
2
2. ⋆ Show, that the overlap integral between two normalized s orbitals centred on
nuclei A and B is
Z
1
S = 1sA (r)1sB (r)dr = (1 + R + R2 ) exp(−R)
3
where R is the internuclear separation. Hint: use the confocal elliptic coordinate
system (µ, ν, φ) with
rA + rB
µ = 1≤µ≤∞
R
rA − rB
ν = −1≤µ≤1
R
and 0 ≤ φ ≤ π. In it, the volume element is given by dr = 18 R3 (µ2 − ν 2 )dµdνdφ.
3. ⋆ Consider the solution of the Schrödinger equation for the hydrogen molecule using
a minimal basis, that is, 1s function for both of the atoms. Sketch the obtained 1σg
and 1σu orbitals, using the expression for the overlap integral given above.
18
Chapter 2
3. is variational in the sense that for all possible variations δΨ, which are orthogonal to
the wave function, the energy remains unchanged:
19
0.8
0.6
0.4
0.2
-0.2
Ψ1σ
-0.4 Ψ1σ*
-0.6
-3 -2 -1 0 1 2 3
Figure 2.1: Nuclear cusps. Wave functions corresponding to the two lowest eigenvalues of
H2 molecule, along the H–H bond (obtained from a FCI calculation in cc-pVDZ basis).
5. Within the non-relativistic theory, the exact stationary states are eigenfunctions of
the total and projected spin operators,
6. The molecular electronic Hamiltonian (1.2) is singular for ri = rj and thus the exact
wave function must possess a characteristic non-differentiable behavior for spatially
coinciding electrons, known as the electronic Coulomb cusp condition,
∂Ψ 1
lim = Ψ(rij = 0), (2.6)
rij →0 ∂rij ave 2
the description of which is a major obstacle in the accurate practical modelling of the
electronic wave function. There exists a condition similar to (2.6) also for electrons
coinciding point-like nuclei, known as the nuclear cusp condition.
8. The exact wave function transforms in a characteristic manner under gauge trans-
formations of the potentials associated with electromagnetic fields, ensuring that all
20
molecular properties described by the wave function are unaffected by the transfor-
mations.
As we saw in the introductionary survey, some of these – square integrability and Pauli
principle – are included straightforwardly, whereas size-extensivity and the cusp condition
are more difficult to impose but still desirable. Others are of interest only in special
situations.
21
Eq. (2.10) describes the situation when the electrons coincide in space and is referred to as
the Coulomb cusp condition; whereas Eq. (2.11) establishes the behavior of the ground-state
wave function in the vicinity of the nucleus and is known as the nuclear cusp condition.
Expanding the ground-state helium wave function around r2 = r1 and r12 = 0 we obtain
∂Ψ ∂Ψ
Ψ(r1 , r2 , 0) = Ψ(r1 , r1 , 0) + (r2 − r1 ) + r12 + ...,
∂r2 r2 =r1 ∂r12 r2 =r1
Therefore, the cusp condition leads to a wave function that is continuous but not smooth
(discontinuous first derivative) at r12 .
The nuclear cusp condition for the “first” electron when the wave function does not
vanish at r1 = 0 (such as the helium ground state) is satisfied if the wave function exhibits
an exponential dependence on r1 close to the nucleus:
Molecular electronic wave functions are usually expanded in simple analytical functions
centered on the atomic nuclei (AOs), and close to a given nucleus, the behavior of the
wave function is dominated by the analytical form of the AOs. In particular, the Slater-
type orbitals (introduced later), are compatible with the nuclear cusp condition, while the
Gaussian-type orbitals are not.
It should be noted that the cusp conditions in Eqs. (2.10) and (2.11) are written for the
totally symmetric singlet ground state of the helium atom. The cusp conditions in a more
general situation (but still for a wave function that does not vanish at the singularities)
should be written as
∂Ψ 1
lim = Ψ(rij = 0) (2.12)
rij →0 ∂rij ave 2
∂Ψ
lim = −ZΨ(ri = 0), (2.13)
ri →0 ∂ri ave
where the averaging over all directions is implied.
22
where ESD is a best single-determinant total energy and Eexact the “exact” energy in
the same basis. This definition is most usable when speaking of molecular ground
states and equilibrium geometries, while outside them it is untenable.
• Fermi correlation arises from the Pauli antisymmetry of the wave function and is
taken into account already at the single-determinant level.
• Static correlation, also known as near-degeneracy or nondynamical correlation, arises
from the near-degeneracy of electronic configurations.
• Dynamical correlation is associated with the instantaneous correlation among the
electrons arising from their mutual Coulombic repulsion. It is useful to distinguish
between
– long-range dynamical correlation and
– short-range dynamical correlation, which is related to the singularities in the
Hamiltonian and giving rise to the Coulomb cusp in the wave function.
where |Ψ̃i is some approximation to the eigenstate |Ψi. It provides a simple and powerful
procedure for generating approximate wave functions: for some proposed model for the
wave function, we express the electronic state |Ci in terms of a finite set of numerical
parameters; the stationary points of energy function
D E
C Ĥ C
E(C) = (2.15)
hC |Ci
are the approximate electronic states |Ci and the values E(C) at the stationary points
the approximate energies. Due to the variation principle, the expectation value of the
Hamiltonian is correct to second order in the error.
23
i.e. the approximate state is expanded in m-dimensional set of Slater determinants. We
further assume here that the wave function is real. The energy function of this state is
given by Eq. (2.15). In order to locate and to characterize the stationary points, we shall
employ the first and second derivatives with respect to the variational parameters:
D E
∂E(C) i Ĥ C − E(C) hi |Ci
(1)
Ei (C) = =2
∂Ci hC |Ci
D E
∂ 2 E(C) i Ĥ j − E(C) hi |ji hj |Ci hi |Ci
(2) (1) (1)
Eij (C) = =2 − 2Ei (C) − 2Ej (C) ,
∂Ci ∂Cj hC |Ci hC |Ci hC |Ci
K (2)
EM M is thus singular and K th state has exactly K − 1 negative eigenvalues. Therefore,
in the space orthogonal to CK , the first solution is a minimum, the second a first-order
saddle point, the third a second-order saddle point, and so forth.
24
In fact, for the Hellmann–Feynman theorem to hold, we need to demand from the
approximate wave functions that they are optimized with respect to the changes induced by
the perturbation: |Ψi → |Ψi + α |∂Ψ/∂αi. Usually in molecular calculations we construct
our wave function from a finite set of analytical functions attached to the atomic nuclei.
For example, when we distort the molecular geometry, we change the basis set in terms
of which our electronic wave function is expanded, and a wave function optimized at one
particular geometry is not accurately represented in terms of a basis set associated with
another geometry. As a result, the unperturbed electronic is not variational with respect
to these changes and the conditions of the Hellmann–Feynman theorem is not satisfied.
However, this is an artificial difficulty arising from the coordinate representation of
quantum mechanics we presently operate in. The picture presented by the second quan-
tization, the necessary conditions are fulfilled and we are able to employ the theorem for
approximate wave functions in finite bases in exactly the same manner as for exact wave
functions.
and applying this decomposition we may find the stationary points of the energy function
are found at6
∂ D E ∂ h 2 D E D Ei
0 = Ψα Ĥ(R) Ψα = α Ψ T̂ Ψ + α Ψ V̂ (αR) Ψ
∂α α=1 ∂α α=1
D E D E
dV (αR) H−F dE(αR)
⇒ 2 Ψ T̂ Ψ + Ψ V̂ (R) Ψ = − Ψ Ψ = − (2.20)
.
dα α=1 dα α=1
This result (2.20) is the quantum-mechanical virial theorem for a field-free non-relativistic
molecular Hamiltonian. The most important observation is that in the molecular equilib-
rium geometries (R = Re ), and in the case of atoms
D E 1 D E
Ψ T̂ Ψ = − Ψ V̂ (Re ) Ψ . (2.21)
2
6
Note that the unperturbed wave function corresponds here to α = 1 instead of α = 0.
25
Furthermore,
D E D E
Ψ T̂ Ψ |R=Re = −E(Re ), Ψ V̂ (Re ) Ψ = 2E(Re ). (2.22)
Also these expressions hold for any approximate wave function that is variational with
respect to a uniform scaling of the nuclear as well as electronic coordinates. Note that
according to the last result, no stationary points of positive energy may exist.
The scaling force may be easily related to classical Cartesian forces on the nuclei,
FI (R) = −dE(R)/dRI , by invoking the chain rule
dE(αR) X d(αRI ) dE(αR I ) X
= = − RI · FI (R)
dα α=1 I
dα
α=1 d(αR I
α=1 I
and combining this with virial theorem, we may extract the Cartesian forces experienced
by the nuclei, D E D E
X
RI · FI (R) = 2 Ψ T̂ Ψ + Ψ V̂ (R) Ψ . (2.23)
I
As we will see later, atomic orbitals are further approximated with a set of simpler
functions that are also functions of the coordinates of a single electron. This three-step
procedure is often referred to as the “standard model of quantum chemistry”. To arrive
at the exact solution, i) orbitals in terms of which the determinants are constructed must
form a complete set in one-electron space, and ii) in the expansion of N-electron wave
function, we must include all determinants that can be generated from these orbitals.
All approximations in the solution of the Schrödinger equation should be unambigious
and precisely defined. Ideally, they should be improvable in a systematic fashion; and to
yield more and more elaborate solutions that approach the exact solution. Thus we speak
of different models, levels of theory and hierarchies of approximations.
Computational electronic-structure theory has a few standard models for the construc-
tion of approximate electronic wave functions. At the simplest level, the wave function is
represented by a single Slater determinant (the Hartree–Fock approximation); and at the
most complex level the wave function is represented as a variationally determined superpo-
sition of all determinants in the N-electron Fock space (full configuration-interaction, FCI).
26
Figure 2.2: The two-way hierarchy of approximate molecular wave functions
Between these extremes, there are a vast number of intermediate models with variable cost
and accuracy.
It should be noted that none of the models is applicable to all systems. However, each
model is applicable in a broad range of molecular systems, providing solutions of known
quality and flexibility at a given computational cost.
27
2.5 Further reading
• D. P. Tew, W. Klopper, and T. Helgaker, Electron correlation: the many-body problem
at the heart of chemistry, J. Comput. Chem. 28, 1307 (2007)
(a) Show that the electronic energy, gradient and Hessian can be expanded around
a stationary point as
1
E(λ) = E + λT E(2) λ + O(kλk3 )
2
E(1) (λ) = E(2) λ + O(kλk2 )
E(2) (λ) = E(2) + O(kλk)
Here α is a real scaling factor and N the number of electrons. We assume that Ψ(ri )
is normalized.
hΨα |Ψα i = 1.
(b) Show that the expectation values of the kinetic and potential energies transform
as
D E D E
Ψα T̂ Ψα = α2 Ψ T̂ Ψ
28
3. ⋆ Here, we will consider the use of correlating functions to describe the Coulomb
cusp of many-electron systems. For a system with two electrons and K nuclei, with
the Hamiltonian
Ĥ = T̂ + V̂ + ĝ
1 1
T̂ = T̂1 + T̂2 = − ∇21 − ∇22
2 2
X ZI X ZI
V̂ = V̂1 + V̂2 = − −
I
|r1 − RI | I
|r2 − RI |
1
ĝ = ,
r12
write the wave function in the form Ψ = γΦ, where γ is a continuous correlating
function that depends only on r12 , and Φ is a smooth function of r1 and r2 . The
functions γ and Φ are, of course, finite.
(a) Show that the Schrödinger equation can be written in the form
(b) Consider the former result as r12 → 0 and ĝ becomes singular. Assuming that
the electrons and nuclei do not coincide, show that the singularity in ĝ must be
compensated for by a singularity in [T̂ , γ] such that ĝγΦ + [T̂ , γ] remains finite.
(c) Show that
2 r12
[T̂ , r12 ] = − − · (∇1 − ∇2 ),
r12 r12
n −1
and that for n > 1, [T̂ , r12 ] contains no terms proportional to r12 .
2
(d) Consider the expansion of γ = 1 + βr12 + O(r12 ) around r12 = 0; determine the
parameter β such that ĝγ + [T̂ , γ] becomes nonsingular.
(e) For more than two electrons, we may again model the wave function by γΦ,
where γ depends on the interelectronic coordinates rij and where Φ is a smooth
function of the electronic coordinates ri . Show that the form
X
γ =1+β rij
i>j
29
Y
γ = (1 + βrij )
i>j
!
X
γ = exp β rij
i>j
!
X X
γ = 1+ αp exp βp rij2
p i>j
30
Chapter 3
1. allow orderly and systematic extension towards completeness with respect to one-
electron square-integrable functions
2. allow rapid convergence to any atomic or molecular electronic state, requiring only a
few terms for a reasonably accurate description of molecular electronic distributions
3. have an analytical form for easy and accurate manipulation especially for molecular
integrals
1 Z
H = − ∇2 − .
2 r
31
and are written as
3/2 s l
2Z (n − l − 1)! 2Zr 2l+1 2Zr Zr
χnlm (r, θ, ϕ) = Ln−l−1 exp −
n 2n(n + 1)! n n n
| {z }
Rnl (r)
s
2l + 1 (l − m)! m
× P (cos θ) exp(imϕ).
4π (l + m)! l
| {z }
Ylm (θ,ϕ)
χLF LF
nlm (r, θ, ϕ) = Rnl (r)Ylm (θ, ϕ) (3.3)
s
LF (n − l − 1)!
Rnl = (2ζ)3/2 (2ζr)l L2l+2
n−l−1 (2ζr) exp(−ζr), (3.4)
(n + l + 1)!
referred to as the Laguerre functions. For fixed l and ζ the radial functions (3.4) with n > l
constitute a complete orthonormal set of functions, for which
LF 2n + 1
χnlm |r̂| χLF
nlm = , (3.5)
2ζ
32
1s 2s 2p 3s
0 5 10 0 5 10 0 5 10 0 5 10
being thus considerably more compact than the hydrogenic functions for large n and inde-
pendent of l. The Laguerre functions exhibit the same nodal structure as the hydrogenic
ones but their radial distributions R2 r 2 are quite different (ζ = 1/n, Z = 1).
One should note that the orthogonality of the Laguerre functions is valid only with
fixed exponents. It turns out, however, that the expansion of the orbital of a given l-value
of an atomic system requires a large number of fixed-exponent Laguerre functions with
different n. For non-orthogonal, variable-exponent functions Laguerre functions are not
an optimal choice, but we need to introduce functions specifically tailored to reproduce as
closely as possible the different orbitals of each atom.
χSTO STO
nlm (r, θ, ϕ) = Rnl (r)Y (θ, ϕ) (3.6)
(2ζ)3/2
RnSTO (r) = p (2ζr)n−1 exp(−ζr) (3.7)
Γ(2n + 1)
For the STOs we use the same 1s, 2p,... notation as for the hydrogenic functions, keeping
in mind that STOs are nodeless and now n only refers to the monomial factor r n−1 . The
expectation value of r̂ and the maximum in the radial distribution curve are given by
STO 2n + 1
χnlm |r̂| χSTO
nlm = (3.8)
2ζ
STO n
rmax = (3.9)
ζ
Also STOs would constitute a complete set of functions for a fixed exponent ζ, and we
could choose to work with a single exponent and include in our basis orbitals (1s, 2s, . . .),
(2p, 3p, . . .), (3d, 4d, . . .), all with the same exponent. Alternatively, we may describe the
radial space by functions with variable exponents. For each l, we employ only the functions
33
of the lowest n, yielding a basis of the type ((1s(ζ1s ), 1s(ζ2s ), . . .), (2p(ζ1p , 2p(ζ2p ), . . .), . . .)
These variable-exponent STOs are given by
In practice, a combined approach is employed. For example, for the carbon atom, we would
introduce sets of 1s, 2s and 2p functions, all with variable exponents that are chosen to
ensure an accurate representation of the wave function.
χGTO GTO
nlm (r, θ, ϕ) = Rnl (r)Ylm (θ, ϕ) (3.12)
s
GTO 2(2α) 3/4
22n−l−2 √
Rnl (r) = 1/4
( 2αr)2n−l−2 exp(−αr 2 ). (3.13)
π (4n − 2l − 3)!!
They form a complete set of nonorthogonal basis functions. In above, the double factorial
function is
1 n=0
n(n − 2)(n − 4) · · · 2 even n > 0
n!! = (3.14)
n(n − 2)(n − 4) · · · 1 odd n > 0
1
odd n < 0
(n+2)(n+4)···1
34
2.5 5.0 5.0 7.5
1s STO (ζ=1) 1s GTO (α=1) 1s 1s
2s STO (ζ=1) 2s GTO (α=1) STO (ζ=1) GTO (α=3.42)
2p GTO (α=1) GTO (α=1) GTO (α=0.62)
GTO (α=0.16)
5.0
2.5 2.5
2.5
In practice, the use of GTOs with variable exponents differs from the procedure for
STOs. For GTOs, we describe the radial space exclusively by means of variable exponents,
using for this purpose only spherical-harmonic GTOs with n = l + 1, only powers of r
introduced are thus associated with l of the spherical harmonics. The following set of
spherical-harmonic GTOs is used:
χGTO GTO
αnl lm (r, θ, ϕ) = Rαnl l (r)Ylm (θ, ϕ) (3.17)
s
2(2αnl ) 3/4
22n−l−2 √
RαGTO
nl l
(r) = ( 2αr)2n−l−2 exp(−αr 2 ). (3.18)
π 1/4 (4n − 2l − 3)!!
Let us finally consider the Gaussian product rule, that is the one of the main reasons
2
why GTOs are so widely used. The product of two s GTOs χA = exp(−αrA ) and χB =
2
exp(−βrB ) centered on rA and rB is a third Gaussian
αβ 2
χA χB = χC = exp − rAB exp −(α + β)rC2 (3.19)
α+β
centered on rC given by
αrA + βrB
rC = .
α+β
35
can be obtained for many problems already at the uncorrelated level of theory and using
the non-relativistic Hamiltonian.
The majority of quantum chemical applications employ contracted GTO (CGTO) sets,
i.e. linear combinations of GTO functions with coefficients optimized regarding some
criterion (usually SCF energy) that significantly increase efficiency. There are two widely
used contraction schemes: segmented contraction, where each GTO contributes only to a
single CGTO, and general contraction, where such a restriction is not applied. Examples of
the segmented scheme include the Pople-style basis set families (3-21G, 6-31G,...) and the
Karlsruhe basis sets. The correlation-consistent (cc) basis set families by Dunning and co-
workers are the most widely used generally contracted basis sets. The basic idea behind the
cc sets is that functions that contribute approximately the same amount to the correlation
energy are added to the basis in groups. The cc basis sets provide smooth, monotonic
convergence for the electronic energy as well as for many molecular properties, especially
those originating in the valence region. The polarized valence (cc-pVXZ, X = D, T, Q, 5,
6 corresponding to the number of CGTOs used to represent an occupied atomic orbital)
and core-valence (cc-pCVXZ) sets can be augmented with diffuse functions (aug-cc-pVXZ
and aug-cc-pCVXZ; d-aug-cc-pVXZ and t-aug-cc-pVXZ for doubly and triply augmented
sets, respectively). While these features are favorable and the sets are widely used, the cc-
basis sets become very large at large X, and it is not straightforward to extend the family
beyond the published X values or to new elements. The atomic natural orbital (ANO) basis
sets provide another general contraction approach. The contraction coefficients therein are
atomic ANO coefficients that are obtained by optimizing atomic energies.
The use of increasingly large basis sets that produce results converging to some partic-
ular value is usually regarded as a solution to the problem of basis set incompleteness. In
calculations of molecular properties that originate, e.g., in the region close to the nuclei
(examples include indirect spin-spin couplings and hyperfine couplings), approaching the
basis-set limit using the cc or comparable energy-optimized paradigms may lead to exces-
sively large basis sets prohibiting calculations of large molecules. An alternative and often
used approach is to uncontract the basis set and to supplement it by additional steep basis
functions in the l-shells relevant for the property under examination.
The performance of the different basis sets for a certain molecular property can be
qualitatively understood within a concept, which can be measured by the completeness
profiles
X
Y(α) = hg(α)|χmi2 . (3.20)
m
Here {χ} denotes a set of orthonormalized, contracted or primitive, basis functions, and
g(α) is a primitive “test” GTO with the exponent α, used to probe the completeness of
{χ}. The completeness profile becomes unity for all α, for all l-values in a CBS. Y(α)
is typically plotted in the logarithmic scale, against x = log α. In a certain exponent
interval [αmin , αmax ] this quantity is intuitively connected to the possibility – from the
point of view of one-particle space – of describing all details of the wave function in the
corresponding distance range from the atomic nuclei. Put simply, atomic properties that
36
1.0
s s
0.75 (a) p (b) p
Y(α)
0.5 d d
f
0.25
1.0
s s
0.75 (c) p (d) p
d
Y(α)
0.5 d f
f g
0.25 g h
1.0
s s
0.75 (e) p (f) p
d d
Y(α)
0.5 f f
g
0.25 h
i
1.0
s s
0.75 (g) p (h) p
d d
Y(α)
0.5 f f
0.25
4 3 2 1 0 -1 -2 -3 -4 4 3 2 1 0 -1 -2 -3 -4
Log (α) Log (α)
Figure 3.3: Completeness profiles of (a) cc-pVDZ (b) cc-pVTZ (c) cc-pVQZ (d) cc-pV5Z
(e) cc-pV6Z (f) cc-pCVTZ (g) aug-cc-pVTZ and (h) aug-cc-pCVTZ basis sets of fluorine.
√ √
obtain relevant contributions roughly within [1/ αmax , 1/ αmin ] from the nucleus can be
reproduced by a basis that has Y(α) = 1 in this interval. This way of thinking can be
generalized to molecular properties that may be dominated by phenomena occurring close
to the expansion centers of the basis functions, i.e., atomic nuclei (region described by
high-exponent basis functions) and/or in the valence region, further away from the nuclei.
37
E
Oµνλσ = hχµ (1)χν (1) Ô χλ (2)χσ (2)
Z
= dr1 dr2 χµ (r1 )χν (r1 )Ô(r)χλ (r2 )χσ (r2 ) (3.22)
such that the AOs (χ) are taken as fixed linear combinations of real-valued primitive
Cartesian GTOs
Gijk (r, a, R) = xiR yRj zRk exp(−arR
2
), (3.23)
where the ”Cartesian quantum numbers” i, j, k are greater than zero and l = i + j + k for
a given total angular momentum quantum number.
The integrals over primitive Cartesian GTOs may subsequently be transformed to in-
tegrals over contracted and/or spherical-harmonic GTOs by linear combinations.
where for example Gi (x, a, Rx ) = xiR exp(−ax2R ). We will need also a concept of Gaussian
overlap distribution
Ωab (r) = Gijk (r, a, RA )Glmn (r, b, RB ), (3.25)
that may also be factorized in the same way
Ωab (r) = Ωxij (x, a, b, RA,x , RB,x )Ωykl (y, a, b, RA,y , RB,y )Ωzmn (z, a, b, RA,z , RB,z ), (3.26)
Ωxij (x, a, b, RA,x , RB,x ) = Gi (x, a, RA,x )Gj (x, b, RB,x ). (3.27)
According to the Gaussian product rule (3.19), any Cartesian component of the overlap
distribution can be written as a single Gaussian at the centre of charge Px , and we may
thus evaluate the integral
Z ∞ Z ∞ r
x 2 2 π 2
Ω00 dx = exp(−µXAB ) exp(−pxp )dx = exp(−µXAB ), (3.29)
−∞ −∞ p
38
where µ denotes the reduced exponent ab/(a + b), p = a + b, and
aRA,x + bRB,x
Px = (3.30)
p
XAB = RA,x − RB,x . (3.31)
This result provides a basis for a set of recurrence relations by which we may evaluate
simple integrals – overlap or more complicated ones – over GTOs of arbitrary quantum
numbers. This procedure is known as Obara–Saika scheme. There exist other methods
for molecular integral evaluation but we will not discuss them here. The relations are
obtained by considering the behaviour of the integral under coordinate transformation and
their detailed derivation is omitted here but recommended further reading.
The simplest case is the overlap integral
39
1
0.8
0.6
0.4
0.2
2 4 6 8 10
Figure 3.4: The three lowest-order Boys functions. The zeroth-order function in cyan,
first-order in blue and second-order in dark blue.
The Boys functions of orders 0, 1 and 2 are illustrated in Figure 3.4. It is a strictly positive
and decreasing function. Its values at x = 0 have a closed-form expression
Z 1
1
Fn (0) = t2n dt = . (3.39)
0 2n + 1
Different methods for evaluating the Boys function has been introduced. One way is
through recursion; by integrating the function by parts, we see that the different orders
are related by
(2n + 1)Fn (x) − exp(−x)
Fn+1 (x) = (3.40)
2x
or
2xFn+1 (x) − exp(−x)
Fn (x) = . (3.41)
2n + 1
We would therefore need to calculate the Boys function for the highest or the lowest order
needed, and the others are obtained through the downward or upward recursion, of which
the former is numerically more robust.
40
We again factorize the distribution according to (3.26), and are about to obtain the
integrals through the recursion relations (c.f., below) for auxillary integrals ΘN
ijklmn :
1
ΘN N
i+1 jklmn = XP A Θijklmn + (iΘN N
i−1 jklmn + jΘi j−1 klmn )
2p
1
−XP C ΘN +1
ijklmn − (iΘN +1 N +1
i−1 jklmn + jΘi j−1 klmn ) (3.43)
2p
1
ΘN
i j+1 klmn = XP B Θ N
ijklmn + (iΘN N
i−1 jklmn + jΘi j−1 klmn )
2p
1
−XP C ΘN +1
ijklmn − (iΘN +1 N +1
i−1 jklmn + jΘi j−1 klmn ) (3.44)
2p
with the special cases (that also serve the starting points of the recursion)
41
Then, the second electron is treated using the integrals generated in (3.51):
bXAB + dXCD N i k N p
Θi0 k+1 0 = − Θi0k0 − ΘNi−1 0k0 + Θi0 k−1 0 − ΘN . (3.52)
q 2q 2q q i+1 0k0
In the final step, the Cartesian powers are ”transferred” between the orbitals of the same
electron:
In this way, we may construct the full set of Cartesian two-electron Coulomb integrals.
42
3.4 Further reading
• T. Helgaker and P. Taylor, Gaussian basis sets and molecular integrals, in Modern
Electronic Structure Theory, Part II, D. R. Yarkony (ed.), (World Scientific 1995),
pp. 725–856
• M. C. Strain, G. E. Scuseria and M. J. Frisch, Achieving linear scaling for the elec-
tronic quantum Coulomb problem, Science 271, 51 (1996).
of (a) the hydrogenic wave functions, (b) the Laguerre functions, (c) the radial STOs.
2. ⋆ In this exercise, we study the radial form of the GTOs, the most frequently used
basis functions in molecular electronic-structure theory:
GTO GTO
√
Rαl (r) = Nαl ( 2αr)l exp(−αr 2 ).
GTO
The related functions Rnl are obtained by replacing l by 2n − l − 2.
43
(c) By setting α2 = βα1 verify that the overlap Sα1 βα1 l is independent of α1 .
(d) Using the integral
Z ∞
n!
x2n+1 exp(−αx2 )dx = n ≥ 0, α > 0
0 2αn+1
show that the expectation value of r is
Z ∞
r
GTO GTO 2 2l (l + 1)!
hri = Rαl (r)rRαl (r)r 2dr = .
0 πα (2l + 1)!!
√
(e) Use the Stirling approximation for factorials k! ∼ k k+1/2 2πe−k (k ≫ 1) to
show that hri is for large l r
l
hri ∼ .
2α
(f) Verify that the radial distribution function 4πr 2 [Rαl
GTO
(r)]2 has a maximum at
r
GTO l+1
rmax = .
2α
(g) Consider two GTOs of different l and exponents. Determine the relation between
α1 and α2 so that the radial distribution of these GTOs have the same maximum.
3. Using the Obara–Saika scheme, evaluate the self-overlap integrals hpx |px i and hdx2 |dx2 i.
5. ⋆ Derive the Obara–Saika relations for overlap and differential operator integrals,
Eqs. (3.33-3.34) and (3.36-3.37).
44
Chapter 4
Second quantization
For an orthonormal set of spin-orbitals, we define the inner product between two ON vectors
as
YM
hm |ki = δmP kP . (4.2)
P =1
This definition is consistent with the overlap between two Slater determinants, but has a
well-defined but zero overlap between states with different electron numbers is a special
feature of the Fock-space formulation. It allows for a unified description of systems with
variable numbers of electrons.
• In a given spin-orbital basis, there is a one-to-one mapping between the Slater de-
terminants with spin-orbitals in a canonical order and the ON vectors in the Fock
space.
• However, ON vectors are not Slater determinants: ON vectors have no spatial struc-
ture but are just basis vectors in an abstract vector space.
45
where F (M, N) contains all ON vectors obtained by distributing N electrons among the
M spin-orbitals, in other words, all ON vectors for which the sum of occupation number
is N. The subspace F (M, 0) is the vacuum state,
where
P
Y −1
ΓkP = (−1)kQ . (4.7)
Q=1
therefore
a†P a†P = 0. (4.9)
For Q > P ,
46
Substitution of dummy indices shows that this holds for Q < P as well, and is true also
for Q = P . Since |ki is an arbitrary ON vector, we conclude the anticommutation relation
a†P a†Q + a†Q a†P = [a†P , a†Q ]+ = 0. (4.10)
The properties of the adjoint or conjugate operators aP can be reasoned from those of
the creation operators. Thus, the adjoint operators satisfy the anticommutation relation
[aP , aQ ]+ = 0. (4.11)
Let us invoke the resolution of the identity:
X
aP |ki = |mi hm |aP | ki ,
m
47
4.1.2 Number-conserving operators
The occupation-number operator that counts the number of electrons in spin-orbital P is
introduced as
N̂P = a†P aP , N̂P |ki = kP |ki (4.18)
The occupation-number operators are Hermitian, N̂ † = N̂ , as well as idempotent, N̂P2 =
N̂P . Using the basic anticommutation relations, we obtain
[N̂P , a†Q ] = δP Q a†Q (4.19)
h i
N̂P , aQ = −δP Q aQ (4.20)
which returns the number of electrons in an ON vector, N̂ |ki = N |ki, and is therefore
known as the particle-number operator. We find that the number operator commutes with
any string T̂ that contains an equal number of creation and annihilation operators. Such
strings are called number-conserving. In general, the application of the string X̂ to a
Fock-space vector increases the number of electrons by N X .
The simplest number-conserving operators are the elementary excitation operators
ÊQP = a†P aQ , (4.23)
which give as applied to an ON vector
a†P aQ |ki = a†P δkQ 1 ΓkQ |k1 , . . . , 0Q , . . . , kP , . . . , kM i
= δkP 0 δkQ 1 ΓkQ ΓkP εP Q |k1 , . . . , 0Q , . . . , 1P , . . . , kM i
k k
kP → 1
= εP Q ΓQ ΓP (1 − kP )kQ (4.24)
kQ → 0
where
1 P ≤Q
εP Q = .
−1 P > Q
The case P < Q differs from this result only in the interpretation of εP Q , whereas the case
P = Q is covered by Eq. (4.18). Therefore we may write
† k k
kP → 1
aP aQ |ki = εP QΓP ΓQ (1 − kP + δP Q )kQ
. (4.25)
kQ → δP Q
48
First quantization Second quantization
• One-electron operator: • One-electron operator:
X
N
X fˆ = fP Q a†P aQ
fˆc = f (xi ) PQ
i=1
• Operators are independent of the spin- • Operators depend on the spin-orbital ba-
orbital basis sis
• Operators depend on the number of elec- • Operators are independent of electrons
trons
• Operators are exact • Projected operators
The application of a single such operator gives a single excitation, two a double excitation
and so forth.
49
In the case of real spin-orbitals, the integrals exhibit the following permutational symme-
tries:
fP Q = fQP (4.29)
gP QRS = gQP RS = gP QSR = gQP SR, (4.30)
of which the latter may be further combined with Eq. (4.28) to yield an eight-fold symmetry.
Using the introduced second-quantization representation, we may now construct the
(non-relativistic, spin-free) molecular electronic Hamiltonian in the Born–Oppenheimer
approximation,
X 1
Ĥ = hP Q a†P aQ + gP QRS a†P a†R aS aQ + hnuc , (4.31)
PQ
2
where
Z !
1 X ZI
hP Q = ψP∗ (x) − ∇2 − ψQ (x)dx (4.32)
2 I
riI
Z
ψP∗ (x)ψR∗ (x)ψQ (x)ψS (x)
gP QRS = dx1 dx2 (4.33)
r12
1 X ZI ZJ
hnuc = . (4.34)
2 I6=J RIJ
The form of the second-quantization Hamiltonian can be interpreted in the following way:
Applied to an electronic state, the Hamiltonian produces a linear combination of the orig-
inal state with states generated by single and double electron excitations from this state.
With each such excitation, there is an associated amplitude hP Q or gP QRS , which represents
the probability of this event happening.
Let Âc , B̂ c and Ĉ c be one-electron operators in first quantization and Â, B̂ and Ĉ be
the corresponding second-quantization representations.
• The first-quantization operator aÂc + bB̂ c , where a and b are numbers, is represented
by a + bB̂.
• The standard relations
Â(B̂ Ĉ) = (ÂB̂)Ĉ (4.35)
(ÂB̂)† = B̂ † † (4.36)
are valid.
• For a complete one-electron basis
Ĉ c = Âc B̂ c ⇒ Ĉ = ÂB̂
but for finite basis sets this expression does not hold. The second-quantization op-
erators are projections of the exact operators onto a basis of spin-orbitals. For an
incomplete basis, the second-quantization representation of an operator product de-
pends on when the projection is made.
50
Second quantization treats operators and wave functions in a unified way – they are all
expressed in terms of the elementary creation and annihilation operators: any ON vector
may be written as "M #
Y †
|ki = X̂k |vaci = (aP )kP |vaci (4.37)
P =1
and therefore the matrix elements can be viewed as the vacuum expectation value of an
operator D E D E
k Ô m = vac X̂k† ÔX̂m vac ,
(4.38)
• The (particle) rank of a string of creation and annihilation operators is simply the
number of elementary operators divided by 2.
A simple rule
h i
Â, B̂1 B̂2 = [Â, B̂1 ]B̂2 + B̂1 [Â, B̂2 ] (4.39)
h i n
X
Â, B̂1 · · · B̂n = B̂1 · · · B̂k−1 [Â, B̂k ]B̂k+1 · · · B̂n (4.40)
k=1
h i
Â, B̂1 B̂2 = [Â, B̂1 ]+ B̂2 − B̂1 [Â, B̂2 ]+ (4.41)
51
h i n
X
Â, B̂1 · · · B̂n = (−1)k−1 B̂1 · · · [Â, B̂k ]+ · · · B̂n (n even) (4.42)
k=1
h i
Â, B̂1 B̂2 = [Â, B̂1 ]B̂2 + B̂1 [Â, B̂2 ]+ = [Â, B̂1 ]+ B̂2 − B̂1 [Â, B̂2 ] (4.43)
+
h i n
X
Â, B̂1 · · · B̂n = (−1)k−1 B̂1 · · · [Â, B̂k ]+ · · · B̂n (n odd). (4.44)
+
k=1
Similarly,
[aP , a†Q aR ] = δP Q aR . (4.47)
Then, let us evaluate the commutator between two excitation operators, using the
results from the previous example,
[a†P aQ , a†R aS ] = [a†P , a†R a†S ]aR + a†P [aQ , a†R aS ] = δQR a†P aS − δP S a†R aQ . (4.48)
A slightly more complicated commutator is evaluated by applying Eq. (4.39) and the
previous result:
The following double commutator can be evaluated by invoking the result (4.48) twice
[a†P aQ , [a†R aS , a†M aN ]] = δSM [a†P aQ , a†R a†N ] − δRN [a†P aQ , a†M aS ]
= δSM δQR a†P aN − δSM δP M a†R aQ − δRN δQM a†P aS
+δRN δP S a†M aQ . (4.50)
52
4.4 Orbital rotations
In many situations, e.g., during the optimization of an approximate electronic state or in
the calculation of the response of an electronic state to an external perturbation, it becomes
necessary to carry out transformations between different sets of orthonormal spin-orbitals.
We shall encounter especially cases where the occupation numbers refer to a set of
orthonormal spin-orbitals ψ̃P that is obtained from another set ψP by a unitary transfor-
mation X
ψ̃P = ψQ UQP . (4.51)
Q
U† U = UU† = 1. (4.52)
The spectral theorem states that any unitary matrix can be diagonalized as
U = VǫV† ,
where V is unitary and ǫ a complex diagonal matrix, ǫkk = exp(iδk ). Therefore any unitary
matrix can be written as the matrix exponential1 of an anti-Hermitian matrix:
U = V exp(iδ)V† = exp(iVδV†)
≡ exp(X), X = −X† . (4.53)
53
into account. Therefore, we shall separate the orbital space and the spin space,2 i.e. a spin
orbital will be denoted by ψP (x) = φp (r)σ(ms ) = φpσ . The theory of second quantization
holds unchanged; for example the basic anticommutator now becomes
where, for example, a†pσ is the creation operator associated with the spin-orbital φpσ .
The integrals vanish for opposite spins in the case of spin-free (or spinless) operators:
Z
fpσ,qτ = φ⋆p (r)σ ∗ (ms )f c (r)φq (r)τ (ms )drdms
Z X
= δστ φ⋆p (r)f c (r)φq (r)dr = fpq δστ = fpq Êpq ,
pq
1 X
ĝ = gpσ,qτ,rµ,sν a†pσ a†rµ asν aqτ .
2 pσqτ rµsν
where gpqrs are the two-electron integrals in ordinary space, that are always symmetric
with respect to the particle-interchange, gpqrs = grspq , and have the Hermitian symmetry
⋆
gpqrs = gqpsr in the case of complex orbitals; and in the case of real orbitals, we have a
permutational symmetry
gpqrs = gqprs = gpqsr = gqpsr .
2
Hereafter, the spin functions α and β will be denoted generically by σ, τ , µ and ν.
54
The second-quantization representation of a spin-free two-electron operator is then
1X X 1X 1X
ĝ = gpqrs a†pσ a†rτ asτ aqσ = gpqrs (Êpq Êrs − δqr Eps ) = gpqrsêpqrs . (4.61)
2 pqrs στ
2 pqrs
2 pqrs
We have introduced the two-electron singlet excitation operator in the last equality. We can
now express the spin-free, non-relativistic molecular electronic Hamiltonian in the orbital
basis:
X 1X
Ĥ = hpq Êpq + gpqrs êpqrs . (4.62)
pq
2 pqrs
The one- and two-electron integrals are the same as in (4.31), except that the integrations
are over the spatial coordinates only.
The one-electron second-quantization operators associated to operators that work in
spin space only may be written in the general form
XZ
ˆ
f = φ⋆p (r)σ ⋆ (ms )f c (ms )φq (r)τ (ms )drdms a†pσ aqτ
pσqτ
XZ X
= σ ⋆ (ms )f c (ms )τ (ms )dms a†pσ apτ . (4.63)
στ p
Three important examples of pure spin operators are the raising and lowering operators
Ŝ+c and Ŝ−c , and Ŝzc . The effect of these operators on the (one-particle state) spin functions
is
S+c β = α Ŝ+c α = 0
S−c β = 0 Ŝ−c α = β
1 1
Szc β = − β Ŝzc α = + α.
2 2
From these we arrive at the following expressions for the basic spin operators:
X
Ŝ+ = a†pα apβ (4.64)
p
X
Ŝ− = a†pβ apα (4.65)
p
1 X † †
Ŝz = apα apα − apβ apβ . (4.66)
2 p
The lowering operator is seen to be the Hermitian adjoint of the raising operator:
X X †
Ŝ+† = (a†pα apβ )† = apβ apα = Ŝ− .
p p
55
The operators for the x and y components of the spin angular momentum can be written
in terms of the raising and lowering operators as
1 c
Sxc = S+ + S−c
2
1
Syc = S+c − S−c ,
2i
from which we obtain the second-quantization counterparts,
1 X †
Ŝx = apα apβ + a†pβ apα (4.67)
2 p
1 X † †
Ŝy = apα apβ − apβ apα , (4.68)
2i p
which may be combined with (4.66) to give the operator for the total spin, Ŝ 2 = Ŝx2 +Ŝy2 +Ŝz2 .
As it contains products of one-electron operators, it is a two-electron operator and therefore
somewhat tedious to manipulate. However, the explicit form is seldom needed, because we
are able to employ the standard operator identities: In contrast to the orbital basis, the
spin basis is complete, hence the usual first-quantization commutation relations hold also
for the second-quantization spin operators. One such example is
" #
X X † X †
† †
[Ŝ+ , Ŝ− ] = apα apβ , aqβ , aqα = apα apα − apβ apβ = 2Ŝz . (4.69)
p q p
where Z
Vpqµ = φ⋆p (r)ξ(r)ℓcµφq (r)dr
56
and where the Cartesian components of the triplet excitation operators are given by
1 †
x
T̂pq = apα aqβ + a†pβ aqα (4.71)
2
1 †
y
T̂pq = apα aqβ − a†pβ aqα (4.72)
2i
1 †
z
T̂pq = apα aqα − a†pβ aqβ . (4.73)
2
The second-quantization representation of
• spin-free operators depend on the orbitals but have the same amplitudes (=integrals)
for alpha and beta spins
• pure spin operators are independent of the functional form of the orbitals
• mixed operators depend on both the spin of the electron and the functional form of
the orbitals.
By contrast, the Slater determinants are not eigenfunctions of the total spin. However, it
is possible to determine spin eigenfunctions as simple linear combinations of determinants.
A hint for the procedure is given by an observation that both the total and projected-spin
operators commute with the sum of the ON operators for alpha and beta spins:
h i
Ŝz , N̂pσ + N̂pβ = 0
h i
Ŝ 2 , N̂pσ + N̂pβ = 0,
because N̂pσ + N̂pβ are singlet operators. Such spin-adapted functions are known as con-
figuration state functions.
57
4.6 Further reading
• T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory (Wiley
2002), pp. 1–106.
2. Let κ̂ and fˆ be one-electron operators. Show that the commutator [κ̂, fˆ] can be
written as a one-electron operator
X
ˆ =
[κ̂, f] fPκ Q a†P aQ
PQ
3. One of the operators that describe the interaction between nuclear magnetic dipoles
and electrons is the Fermi contact operator
XX
c
HFC = γK δ(ri − RK )sci · IK
K i
where sci is the spin of electron i, IK and γK the nuclear spin and the magnetogyric
ratio of nucleus K, respectively, and δ the Dirac delta function. Determine the
second-quantization representation of the Fermi contact operator.
58
(a) Show that the second-quantization representation of the two-electron Darwin
operator becomes X
ĤDar(2) = Spqrs a†pα a†rβ asβ aqα ,
pqrs
where Z
π
Spqrs =− 2 φ⋆p (r)φq (r)φ⋆r (r)φs (r)dr.
c
(b) Show that in second quantization these operators are related as
ĤSSC = −2ĤDar(2) .
U = exp(−κ), κ† = −κ,
show, that the elementary operators ã†P , ãP as well as state 0̃ generated by the
unitary transformation may be expressed in terms of the untransformed operators
and states as
59
60
Chapter 5
Hartree–Fock theory
where |0i is some reference configuration and exp(−κ̂) an operator that carries out unitary
transformations among the spin orbitals; these orbital-rotation parameters for the Hartree–
Fock ground state are obtained by minimizing the energy:
D E
EHF = min κ Ĥ κ . (5.2)
κ
The optimization could be carried out using standard numerical analyisis; however, for the
most of purposes more “spesific” methods are required in order to cope with the compu-
tational cost. We recall that the Slater determinant represents a situation where electrons
behave as independent particles, but are subject to the Fermi correlation. Therefore, the
optimal determinant can be found by solving a set of effective one-electron Schrödinger
equations – the Hartree–Fock equations – for the spin-orbitals, with the associated Hamil-
tonian being the Fock operator X
fˆ = fP Q a†P aQ , (5.3)
PQ
where the elements fP Q are called the Fock matrix. A very essential feature of the Fock
operator is that the one-electron part of the “true” molecular Hamiltonian is retained,
but the two-electron part is replaced with the effective Fock operator, so that fˆ = ĥ + V̂ .
The Fock operator incorporates in an average sense the Coulomb interaction among the
electrons, corrected for Fermi correlation:
X
V̂ = VP Q a†P aQ (5.4)
PQ
61
X
VP Q = (gP QII − gP IIQ ) , (5.5)
I
where the integrals g are given in Eq. (4.33), with I running over the occupied and P and
Q all spin-orbitals. The first term in VP Q describes the classical Coulomb interaction of the
electron with the charge distribution caused by the others, and the second one, exchange
term, is a correction that arises from the antisymmetry of the wave function.
• The Hartree–Fock equations are solved by diagonalizing the Fock matrix. The re-
sulting eigenvectors are called the canonical spin-orbitals and the the eigenvalues of
the Fock matrix are the orbital energies, fP Q = δP Q ǫP .
• Since the Fock matrix is defined in terms of its own eigenvectors, the canonical spin-
orbitals and the orbital energies can only be obtained using an iterative procedure;
where the Fock matrix is constructed and diagonalized in every iteration until the
spin-orbitals obtained in a particular iteration become identical to those from which
the Fock matrix was constructed. This procedure is referred to as the self-consistent
field (SCF) method and the resulting wave function the SCF wave function.
• In the canonical representation, i.e., in the frame where the Fock matrix is diagonal,
the electrons occupy the spin-orbitals in accordance with the Pauli principle and
move independently one another in the electrostatic field caused by the stationary
nuclei and by the charce distribution of the N − 1 electrons. The orbital energies
are the one-electron energies associated with the independent electrons and may thus
be interpreted as the energies required to remove a single electron, that is, to ionize
the system. The identification of the orbital energies with the negative ionization
potentials is known as the Koopman’s theorem.
Finally, let us make a remark that the Hartree–Fock state is an eigenfunction of the Fock
operator with an eigenvalue equal to the sum of the orbital energies of the canonicial
spin-orbitals X
fˆ |HFi = 2 ǫi |HFi . (5.7)
i
62
Figure 5.1: The self-consistent field procedure.
63
Figure 5.2: Perfomance of RHF and UHF models in H2 .
Let us plot these wave functions as well as the FCI wave function for comparison using the
LCAO approximation for the orbital basis (cc-pVQZ basis set) in Figure 5.2.
At short internuclear distances the RHF and UHF wave functions are identical, pro-
viding a crude but reasonable representation of the potential energy surface. At larger
separations, the RHF wave function behaves poorly for energy, whereas the UHF wave
function falls towards the correct FCI curve. On the other hand, while the RHF wave
function remains a true singlet at all distances, the UHF wave function behaves incorrectly
with respect to the spin, yielding a spin intermediate of a singlet and a triplet.
where
D E
Dpq = HF Êpq HF (5.9)
dpqrs = hHF |êpqrs | HFi . (5.10)
For a closed-shell state, the only nonzero elements of (5.9) and (5.10) are
Dij = 2δij
1
dijkl = Dij Dkl − Dil Dkj = 4δij δkl − 2δil δkj ,
2
64
and thus the total Hartree–Fock energy is
X X
E (0) = 2 hii + (2giijj − gijji). (5.11)
i ij
The energy must be optimized subject to the orthonormality of the MOs. We introduce
the Hartree–Fock Lagrangian
X
L(C) = E(C) − 2 λij (hφi |φj i − δij )
ij
Since the multiplier matrix λ is symmetric (for real orbitals), it can be diagonalized by
an orthogonal transformation among the occupied orbitals, λ = UǫUT , the Hartree–Fock
energy being invariant to such transformations. The off-diagonal Lagrange multipliers may
thus be eliminated by an orthogonal transformation to a set of occupied MOs that satisfy
the canonical conditions. In the canonical basis, the variational conditions become
X X
AO
fµν Cνk = ǫk Sµν Cνk , (5.13)
ν ν
Since f AO is symmetric, we can extend the variational conditions to yield a set of orthonor-
mal virtual MOs that satisfy the same canonical conditions as the occupied MOs, i.e.,
fab = δab ǫa . Now, we are able to write the Hartree–Fock variational conditions as
f AO C = SCǫ, (5.15)
where ǫ is a diagonal matrix that contains the orbital energies. This equation is called the
Roothaan-Hall equations.
The AO Fock matrix can be evaluated entirely in the AO basis,
AO
X
AO 1
fµν = hµν + Dρσ gµνρσ − gµσρν , (5.16)
ρσ
2
65
Figure 5.3: Roothaan–Hall procedure.
Pn
subject to the constraint that i=1 wi = 1. For this purpose, we construct the Lagrangian
n n
!
X X
L= wi Bij wj − 2λ wi − 1 ,
i,j=1 i=1
66
100
SCF
10 DIIS
1
SCF
0.1 DIIS
∆ Eh 0.01
0.001
1e-04
1e-05
10 20 10 20
Iteration Iteration
Figure 5.4: On the left: H2 O in its equilibrium geometry, on the right: H2 O with its
bonds stretched to double (angle remaining constant). Calculations are carried out in the
cc-pVQZ basis.
• These are based on either the diagonalization of the Fock matrix or on a direct
optimization of the AO density matrix. Nevertheless, in some cases, even the DIIS
method (and alike) is difficult to converge, when it might be better to use the second-
order Newton method to the optimization of the Hartree–Fock energy. This has been
applied in the literature to the optimization of the AO density matrix (the basis for
67
the trust-region or restricted-step method) as well as in a method that carries out
rotations among the MOs.
68
5.4 Further reading
• T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory (Wiley
2002), pp. 433–522.
where
1
0,0
Ŝpq = √ (a†pα aqα + a†pβ aqβ )
2
1
1,0
T̂pq = √ (a†pα aqα − a†pβ aqβ ).
2
(b) Show, that the singlet and triplet energies are related as
D E
3 β
Eai =1 Eai − 2 HF Eia α
ĤEai HF ,
β
α
where Eai = a†aα aiα and Eai = a†aβ aiβ .
2. In this exercise, we shall determine the gradient for the closed-shell Hartree–Fock
energy D E
∂ cs exp(κ̂)Ĥ exp(−κ̂) cs
(1)
EP Q (κ) =
∂κP Q
69
for a non-vanishing κ̂ that contains only real parameters κP Q . According to exercise
4.3⋆ , the derivative of an exponential operator is
∂ exp(−κ̂) X exp [i(λR − λS )] − 1
=− UP⋆ R UQS − UQR
⋆
UP S ā†R āS exp(−κ̂),
∂κP Q RS
i(λR − λS )
where UP Q are the elements of the unitary matrix U that diagonalizes κ; and λP are
the real eigenvalues of iκ:
iκ = UλU† .
The ā†P are the creation operators in the eigenvector-basis of κ,
X †
ā†P = aR URP .
R
(1)
X exp [i(λR − λS )] − 1
EP Q (κ) = UP⋆ R UQS − UQR
⋆
UP S
RS
i(λR − λS )
(b) Using this result, write the first-order variation of exp(−κ̂) with respect to δκ̂
as ∞
X (−1)n+1
δ exp(−κ̂) = [κ̂, [κ̂, . . . [κ̂, δκ̂] . . .]] exp(−κ̂).
n=0
(n + 1)! | {z }
n commutators
70
(d) The eigenvectors U define a basis
X
ā†P = a†R URP .
R
Show that, in this eigenvector basis, the orbital-rotation operator and its first-
order variation may be expressed as
X
κ̂ = −i λP ā†P āP
P
X
δκ̂ = U† δκU P Q ā†P āQ .
PQ
(e) Using the previous expansions, rewrite the expression for δ exp(−κ̂) as
X exp[i(λP − λQ )] − 1
δ exp(−κ̂) = − [U† δκU]P Q ā†P āQ exp(−κ̂).
PQ
i(λ P − λ Q )
(f) Show that the first derivatives of the orbital-rotation operator are given by
∂ exp(−κ̂) X exp [i(λR − λS )] − 1
=− UP⋆ R UQS − UQR
⋆
UP S ā†R āS exp(−κ̂),
∂κP Q RS
i(λR − λS )
⋆
4. In the optimization of a closed-shell electronic system, the nonredundant orbital-
rotation parameters
0 κTAI
κ=
−κAI 0
determine the transformation
P from the original MOs φp (0) to a set of improved MOs
φp (κ) as φp (κ) = φ
q p (0) exp(−κ) qp . A further rotation of φp (κ) involving the
redundant parameters
red κIJ 0
ρ=κ =
0 κAB
P
gives a new MO basis ψp (ρ) = q φq (κ) exp(−ρ)qp which is equivalent to φp (κ) in the
sense that it gives the same energy; in other words, the total energy is invariant to this
transformation. In this exercise, we show that the choice of redundant parameters
ρ = 0 minimizes the function
X
D(ρ) = kφp (0) − ψp (ρ)k2 ,
p
which represents the difference between the original and optimized MOs.
(a) Show that the difference measure D(ρ) can be expressed in the matrix form
71
(b) Show that the first-order variation of D(ρ) at ρ = 0 becomes
δD(ρ) = 2TrWδρ,
where
WIJ 0 exp(−κ)IJ 0
W= = .
0 WAB 0 exp(−κ)AB
(c) Show that the exponential of κ can be written as
cos A −κTAI B−1 sin B
exp(−κ) = ,
κAI A−1 sin A cos B
δ 2 D = −TrWδρδρ.
(f) Show that W is positive definite for small κ and use this to show that the second-
order variation is positive, implying that the stationary point is a minimum.
Thus, for a closed-shell wave functions, ρ = 0 gives the maximum overlap
between the rotated and original MOs.
72
Chapter 6
Configuration interaction
73
R = Rref R = 2Rref
E − EFCI Weight E − EFCI Weight
RHF 0.2178 0.9410 0.3639 0.5896
CISD 0.0120 0.9980 0.0720 0.9487
CISDT 0.0090 0.9985 0.0560 0.9590
CISDTQ 0.0003 1.0000 0.0058 0.9988
CISDTQ5 0.0001 1.0000 0.0022 0.9995
Table 6.1: The electronic energies of truncated CI wave functions for the water molecule
(cc-pVDZ basis) relative to the FCI energy.
Thus, we may characterize the determinants in the FCI expansion as single (S), double (D),
triple (T), quadruple (Q), quintuple (5), and so forth excitations relative to the Hartree–
Fock state.
The number of determinants in an FCI wave function is
2 2
n n!
Ndet = = , (6.4)
k k!(n − k)!
for a system with n orbitals containing k alpha and k beta electrons. For example, when
n = 2k, Ndet ≈ 16k /kπ – for large k, the number of determinants increases by a factor of
16 for each new pair of electrons and orbitals!
74
reference determinant such as the Hartree–Fock determinant does not provide the best
possible description of the dissociation process. To overcome this, we may introduce CI
wave functions based on the idea of a reference space comprising more than a single deter-
minant: The multi-reference CI (MRCI) wave functions is generated by including in the
wave function all configurations belonging to this reference space as well as all excitations
up to a given level from each reference configuration, yielding, e.g., the multi-reference
singles-and-doubles CI (MRSDCI) wave function.
The construction of a MRCI wave function begins with the generation of a set of orbitals
and the reference space of configurations. For example, in the water molecule, we would
include in our reference space for instance the configurations obtained by distributing the
eight valence electrons among the six valence MOs in all possible ways (consistent with the
spin- and space-symmetry restrictions).
• MRSDCI model is very accurate, if all the important configurations are included in
the reference space.
• The number of important configurations becomes, however, very large, making thus
the model applicable to quite small systems.
• The choice of the active space requires a lot of chemical intuition and patience.
Another way of truncating the FCI expansion is to construct the wave function from
individual configurations, the selection of which is based on physical intuition or perturba-
tional estimates.
• For example, if two fragments are described at the CISD level, then a size-extensive
treatment of the compound system requires that the wave function is augmented
with certain triples and quadruples; more precisely those that represent products of
single and double excitations in the two subsystems.
75
• An approach called quadratic CI (QCI) is a size-extensive revision of the CI model.
Other widely used approach to overcome the lack of size-extensivity is the Davidson
correction applied on top of the truncated CI energy.
The CI description of the electronic system is not at all compact. Even though higher
excitations are less important than those of lower orders, their number is enormous. There-
fore, the CI wave function converges slowly with the number of variational parameters.
Both of these problems are due to the linear parameterization in the CI model (6.1).
The ground-state MCSCF wave function is obtained by minimizing the energy with respect
to the variational parameters,
D E
κ, C Ĥ κ, C
EMCSCF = min . (6.6)
κ,C hκ, C |κ, Ci
where the variation principle is invoked to optimize the configuration coefficients as well
as the orbitals. Its behaviour in H2 dissociation is sketched in Figure 6.2.1. The solid line
is the MCSCF potential energy curve in the cc-pVQZ basis set. The thick and the dashed
lines are for FCI and RHF, respectively.
For any larger system than the H2 molecule the selection of configuration space is
the greatest difficulty in the MCSCF calculation. It is very often impossible to generate
a sufficient MCSCF configuration, which is computationally tractable. The selection is
usually carried out by dividing the MO space in inactive, active, and secondary orbitals.
76
Figure 6.1: MCSCF potential energy curve of H2 .
77
Natural orbital occupation numbers, symmetry 1
• Smallest possible active space seems to be (2,0,2,0), i.e. four electrons in four MOs
3a1 , 4a1 , 1b2 , and 2b2 . This corresponds to a combination of 12 determinants.
• For more accurate description, eight electrons distributed on (3,1,2,0) active space,
corresponding to 37 configurations, gives almost similar potential curve.
78
6.3 Further reading
• T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory (Wiley
2002), pp. 523–647.
• I. Shavitt, The Method of Configuration Interaction, in H. F. Schaefer III (ed), Meth-
ods of Electronic Structure Theory (Plenum Press 1977).
• H.-J. Werner, Matrix-Formulated Direct Multiconfiguration Self-Consistent Field and
Multiconfiguration Reference Configuration-Interaction Methods, in K. P. Lawley
(ed), Ab Initio Methods in Quantum Chemistry (Wiley 1987).
where the operator D̂I = a†uI α a†uI β agI β agI α creates a double excitation from the
Hartree–Fock state for molecule I.
79
(b) Show that the FCI eigenvalue problem becomes
0 K K 0 c0 c0
K 2∆ 0 K c1 = Ecorr c1
K 0 2∆ K c2 c2
0 K K 4∆ c12 c12
K = ggugu
2∆ = 2huu − 2hgg + guuuu − ggggg .
3. ⋆ Assume that we are carrying out an MCSCF calculation of the electronic state
X (0)
|0i = Ci |ii
i
D E X I D E 1X
gvwxy hi |êvwxy | 0i + Ci0 I E
i Ĥ 0 = Fvw i Êvw 0 +
vw
2 vwxy
and I E the inactive energy. Obtain an expression for the MCSCF electronic gradient.
4. ⋆ Show that the FCI wave function is size-extensive but the truncated CI-doubles
(CID) wave function is not.
80
Chapter 7
where we have introduced a generic notation τ̂µ for an excitation operator of unspecified
excitation level (D, T, Q,...) and the associated cluster amplitude tµ . This (full) coupled-
cluster (CC) wave function differs from the FCI wave function by the presence of terms
that are nonlinear in the excitation operators. Of course, the CC and FCI wave functions
are completely equivalent provided that all the excitations are included.
Since the τ̂ s always excite from the set of occupied Hartree–Fock spin-orbitals to the
virtual ones, [τ̂µ , τ̂ν ] = 0. Due to the presence of the product excitations in the coupled-
cluster state (7.1), each determinant can thus be reached in several different ways. For
example, the determinant |µνi = τ̂µ τ̂ν |HFi = τ̂ν τ̂µ |HFi may be reached with an overall
amplitude equal to the sum of the individual amplitude. With respect to this determinant,
the double-excitation amplitude tµν is referred to as a connected cluster amplitude and
subsequent singles excitation amplitudes tµ tν as a disconnected cluster amplitude.
81
Instead, by multiplying the eigenvalue equation Ĥ |CCi = E |CCi by hµ| = hHF| τ̂µ† we
obtain the projected coupled-cluster equations
D E
µ Ĥ CC = E hµ |CCi , (7.2)
where the CC energy is obtained by projection against the Hartree–Fock state (note that
hHF |CCi = 1) D E
E = HF Ĥ CC . (7.3)
These equations are nonlinear in amplitudes as well, but unlike the variational conditions,
this expansion will, due to the Slater’s rules, terminate after few terms, as we shall see
later.
Each excitation operator in T̂ excites at least one electron from an occupied Hartree–Fock
spin-orbital to a virtual one. It should be noted that for a system with N electrons,
the expansion terminates after N terms. We further note that [T̂i , T̂j ] = 0 as a trivial
consequence of the individual excitation operator commutation relations. Moreover,
hHF| τ̂µ = 0
τ̂µ† |HFi = 0,
82
To compare the CC model with the CI model, we expand the exponential operator and
collect terms to the same order in excitation level:
N
X
exp(T̂ ) |HFi = Ĉi |HFi .
i=0
Ĉ0 = 1
Ĉ1 = T̂1
1 2
Ĉ2 = T̂2 + T̂
2! 1
1 3
Ĉ3 = T̂3 + T̂1 T̂2 + T̂
3! 1
1 1
Ĉ4 = T̂4 + T̂3 T̂1 + T̂12 T̂2 + T̂14 ,
2! 4!
demonstrating the composition of the excitation processes. The advantages of the cluster
parameterization arise in truncated wave functions and are related to the fact that even at
the truncated level, the coupled-cluster state contains contributions from all determinants
in the FCI wave function, with weights obtained from the different excitation processes.
Similarly with the CI theory, a hierarchy of CC models is introduced by truncating the
cluster operator at different excitation levels. The simplest successful CC wave function
is the coupled-cluster singles-and-doubles (CCSD) model. In this model, the T̂2 operator
describes the electron-pair interactions and T̂1 carries out the corresponding orbital re-
laxations. Its computational cost scales as O(N 6 ). For higher accuracy, the connected
triple excitations have to be taken into account, which leads to the coupled-cluster singles-
doubles-and-triples (CCSDT) model. This model describes already the dynamical correla-
tion almost perfectly, but is computationally very demanding, O(N 8 ) and is thus applicable
to smallest molecules only. The triples excitations can be estimated in a perturbational
way, giving rise to the CCSD(T) method, which is a good compromise between the com-
putational cost (exhibiting an O(N 7 ) scaling) and accuracy. It is often considered to be
the best single-reference treatment of molecular Schrödinger equation that is applicable in
practise.
83
Figure 7.1: On the left: The CC dissociation curves of the C2v water molecule in the
cc-pVDZ basis. On the right: corresponding differences between the CC and FCI energies.
RHF reference state is plotted the full line and UHF with the dashed.
84
Figure 7.2: The error with respect to FCI of CC and CI wave functions for the water
molecule at the equilibrium geometry in the cc-pVDZ basis.
According to the previous discussion, the projected coupled-cluster equations are writ-
ten as
D E
HF Ĥ exp(T̂ ) HF = E (7.9)
D E D E
µ Ĥ exp(T̂ ) HF = E µ exp(T̂ ) HF . (7.10)
• The excited projection manifold hµ| comprises the full set of all determinants up to
the chosen truncation level.
• For the full CC wave function, the number of equations is equal to the number
of determinants and the solution of the projected equations recovers the FCI wave
function.
• The nonlinear equations (7.9) must be solved iteratively, substituting in every itera-
tion the CC energy calculated from (7.10).
The equations (7.10) and (7.12) are completely equivalent (yet the equivalence is nontrivial
for all the truncated CC models!), meaning that they yield same amplitudes and energy
upon solution. They are referred to as the unlinked and linked coupled-cluster equations,
respectively. Their computational cost is practically the same, but the linked equations
85
have some advantages: Although both yield the same, size-extensive wave function, the
linked equations are size-extensive term by term, that simplifies some perturbational treat-
ments; in addition, the CC approach excited states in mind is more fruitful to start from
the linked equations. The main reason is however the following:
• When the Baker–Campbell–Hausdorff (BCH) expansion
1 1
exp(A)B exp(−A) = B + [A, B] + [A, [A, B]] + [A, [A, [A, B]]] + · · · (7.13)
2! 3!
is applied to the similarity-transformed Hamiltonian, one obtains
1 1 1
exp(−T̂ )Ĥ exp(T̂ ) = Ĥ+[Ĥ, T̂ ]+ [[Ĥ, T̂ ], T̂ ]+ [[[Ĥ, T̂ ], T̂ ], T̂ ]+ [[[[Ĥ, T̂ ], T̂ ], T̂ ], T̂ ],
2 6 24
(7.14)
the coupled-cluster equations are therefore no higher than quartic in the cluster am-
plitudes – for any truncation level or even for the full CC expansion.
• Due to the Brillouin theorem, the one-particle operators contribute only to second
order. Furthermore, cluster operators higher than the doubles do not contribute to
the CC energy since Ĥ is a two-particle operator. As a result, only singles and doubles
amplitudes contribute directly to the CC energy – irrespective of the truncation level
of the cluster operator. Of course, the higher-order excitations contribute indirectly,
since all amplitudes are coupled by the projected equations.
where Ω(0) is the vector function calculated from the amplitudes t(n) and Ω(1) the CC
Jacobian D E
Ω(1) (n) (n)
µν = µ exp(−T̂ )[Ĥ, τ̂ν ] exp( T̂ ) HF . (7.15)
Then an iterative scheme is established:
• Solve the set of linear equations
for ∆t(n)
• Form the improved estimate
86
• Iterate Equations (7.16) and (7.17) until convergence in the amplitudes.
In more sophisticated implementations, this Newton-method-like approach is replaced by
a scheme, where the solution of the linear equations as well as the construction of the
Jacobian are avoided by the approximation
∆t(n) −1 (0) (n)
µ = −εµ Ωµ (t ), (7.18)
where, e.g. εab
ij = (ǫa + ǫb − ǫi − ǫj ). Usually, the convergence is accelerated by the DIIS
method.
87
where we have denoted Lpqrs = 2gpqrs − gpsrq .
where the summation is carried out over all the excitation operators present in the cluster
operator. This expansion is referred to as the equation-of-motion coupled-cluster (EOM-
CC) model. The expansion parameters are optimized by the minimization of the energy
D E
c̄ exp(−T̂ )Ĥ exp(T̂ ) c
E(c, c̄) = . (7.26)
hc̄ |ci
where T̂O = T̂2 + T̂3 + · · · + T̂N . The OCC energy is obtained in the usual manner by
projecting the OCC Schrödinger equation against the reference state
D E
EOCC = 0 exp(−T̂O ) exp(κ̂)Ĥ exp(−κ̂) exp(T̂O ) 0 , (7.28)
88
whereas the cluster amplitudes are determined by projection against the manifold |µO i
spanned by T̂O D E
µO exp(−T̂O )Ĥ exp(−κ̂) exp(T̂O ) 0 = 0. (7.29)
The Brueckner coupled-cluster theory (BCC) is closely related to OCC and differs in some
details of the solution of orbital rotation parameters. Unlike in standard CC theory, the
OCC and BCC orbitals are optimized simultaneously with the optimization of the cluster
amplitudes – compare with the MCSCF theory. Surprisingly, the differences between stan-
dard and OCC/BCC wave functions are small. Only in the rare cases that are characterized
by the sc. Hartree–Fock singlet instability the OCC/BCC are clearly more successful than
the standard formulation.
Thus, the Hartree–Fock energy is equal to the sum of the zero- and first-order contribu-
tions; and by adding the second-order correction, we obtain the second-order Møller–Plesset
energy (MP2)
X |gAIBJ − gAJBI |2
EMP2 = EHF − , (7.34)
A>B,I>J
ǫA + ǫB − ǫI − ǫJ
which is very popular and successful approach to introducing electron correlation on top of
the Hartree–Fock wave function; providing surprisingly accurate and size-extensive correc-
tion at low computational cost. Higher-order corrections are derived in a similar manner,
89
Figure 7.3: The MPn dissociation curves of the C2v water molecule in the cc-pVDZ basis.
Restricted MP2 with full and unrestricted with dashed line.
yielding MP3 and MP4 models, but these become rather expensive and are not as appeal-
ing compromise between cost and accuracy as MP2 is. Furthermore, the MPPT series have
been shown to be inherently divergent.
In the coupled-cluster perturbation theory (CCPT) the Hamiltonian in the CC equa-
tions (7.11) and (7.12) is partitioned as in Eq. (7.30). The Fock and the cluster operator
obey the following results:
h i X
fˆ, T̂ = εµ tµ τ̂µ
µ
hh i i
f , T̂ , T̂ = [[[fˆ, T̂ ], T̂ ], T̂ ] = · · · = 0.
ˆ
90
and we arrive at the working equations of the CCPT:
D E
ˆ
E = E0 + HF Φ̄ HF (7.36)
D E
ˆ
εµ tµ = − µ Φ̄ HF . (7.37)
By expanding the cluster operator in the orders of the fluctuation potential we arrive at
the amplitude equations D E
ˆ (n)
εµ tµ(n) = − µ [Φ̄] HF (7.38)
ˆ ](n) contains the nth order part of the similarity-transformed fluctuation potential.
where [Φ̄
Then the amplitudes would be determined self-consistently from the projected equations
after truncating the cluster operator at some excitation level. The CCPT is most often
used to establish approximations to contributions from some excitations – the most famous
is the non-iterative triples, or (T) correction to the CCSD wave function. Also very suc-
cessful iterative CCPT schemes have been set up, e.g. the CC2 model approximates the
CCSD model with the computational cost O(N 5 ) and CC3 the CCSDT model with the
computational cost O(N 7 ). The CCPT series provides more systematic improvement to
the wave function than the MPPT.
Both MPPT and CCPT are limited to the ground states of systems dominated by a
single configuration. To overcome this, multiconfigurational perturbation theory has been
developed. Most important such theory is CAS perturbation theory (CASPT), where the
zero-order state is taken to be a CASSCF wave function. At the moment, it is the only
generally applicable method for the ab initio calculation of dynamical correlation effects of
open- and closed-shell multiconfigurational electronic system.
91
of the standard MO CC doubles (CCD) equations into the AO basis. Scuseria and
Ayala argued that the solution of the CC equations in the AO basis scales in principle
linearly with respect to N, but their approach seems so far not to have received any
widespread use.
• A fully operational local CCSD implementation have been presented by Werner et al.
This implementation, referred to as the local CCSD (LCCSD) method, is based on
localizing the occupied space as suggested by Pulay and coworkers, using traditional
MO localization schemes, and using for the virtual space a projected atomic orbital
(PAO) basis. LCCSD further relies on a spatial division of the system into domains.
Restrictions on the extent to which the interactions between different domains are
accounted for is important for achieving the low-power scaling of LCCSD making the
results deviate from the standard CCSD results.
• The equations of correlated wave function theories may also be derived in a biorthog-
onal projected atomic orbital basis, aiming at a black-box formulation that could be
applied to large systems. This formulation also constitutes a good platform to intro-
duce efficient approximation schemes to reduce the number of parameters and thus
the computational scaling. The locality of the PAO basis is reflected in the fact
that the cluster amplitudes, the integrals as well as the correlation energy contri-
butions decay exponentially with respect to the distance between the centers of the
participating PAOs.
92
7.4 Further reading
• T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory (Wiley
2002), pp. 648–816.
⋆
3. Verify the following commutation relations:
hh i i X
Ĥ, Êai , Êbj = −Pijab hib Êaj − Pijab (gpbiq êpjaq + gpqjaêpqbi )
pq
X
+ (gpbqa êpjqi + gjpiq êbpaq )
pq
hhh i i i X
abc
Ĥ, Êai , Êbj , Êck = Pijk (gkbip êpbic + gpbic êpjak ) ,
p
and the expressions for these commutators applied to the Hartree–Fock state:
hh i i
Ĥ, Êai , Êbj |HFi = 2Liajb |HFi
!
X X
−Pijab I Fib Êaj + Likjb Êak − Lcajb Êci |HFi
k c
!
X X
+ gbjcaÊbj Êci − gbjik Êbj Êak |HFi
bjc bjk
93
hhh i i i
abc
Ĥ, Êai , Êbj , Êck |HFi = −Pijk Ljbic Êak |HFi
!
X X
abc
+Pijk giljcÊal Êbk − gidbc Êaj Êdk |HFi .
l d
In the above, we have introduced the notation Lpqrs = 2gpqrs − gpsrq as well as used
the three-index permutation operator,
abc abc
Pijk Aijk = Aabc acb bac bca cba
ijk + Aikj + Ajik + Ajki + Akji .
⋆
4. Consider, once again, the H2 molecule in a minimal basis.
1
T̂ = T̂2 = tuu Êug Êug
2 gg
and that the biorthogonal basis can be expressed as
uu
= hvac| auβ auα .
gg
(d) Show, that the linked and unlinked coupled-cluster equations are equivalent.
94
6. ⋆ The basic idea behind the quantum-mechanical perturbation theory (known as the
Rayleigh–Schrödinger perturbation theory, RSPT) is to partition the Hamiltonian
into two parts, Ĥ = Ĥ0 + Û , where Ĥ0 is some zero-order Hamiltonian and Û the
perturbation
Ĥ − Ĥ0 . The orthonormal eigensolutions to the zero-order Hamiltonian,
Ψ(0) with the corresponding energy E (0) are then used to expand the eigenfunction
|Ψi of the exact Hamiltonian.
(a) Show, that
n
X
(0)
(n) (n−1)
(Ĥ0 − E ) Ψ
= −Û Ψ
++ E (k) Ψ(n−k)
k=1
(b) Write out explicitly three lowest-order corrections to the energy and to the wave
function.
7. ⋆ Consider the wave function correct to nth order in RSPT:
Xn
(
|Ψn i = Ψ i) .
i=0
in a nonorthogonal basis, whose overlap elements differ from the Kronecker delta:
hi |ji =
6 δij .
(a) Show that the zero-order equations and the condition for the intermediate nor-
malization become
H0 C(0) = E (0) SC(0)
(C(0) )T SC(0) = 0.
95
96
Chapter 8
Performance of the
electronic-structure models
Having now considered the most important approximations to the molecular electronic
wave function we will now address their performance in terms of a few numerical examples.
It should be remembered that none of these models (besides FCI, which is inapplicable for
most cases) is an all-around-good one: good performance in some problem and/or in some
system does not guarantee good performance in another situation. In addition, rehearse
the graph concerning the approximations to the one- and N-electron spaces.
97
Figure 8.1: Performance of various N-electron models in conjunction with one-electron
bases in 29 molecular bond lengths.
98
The calculations were carried out for sets of small molecules containing main-group ele-
ments. 29 bond distances and 10 bond angles were calculated, and the results are presented
in Figures 8.1 and 8.2.
Observations:
• The Hartree–Fock model is suitable for only preliminary calculations.
• The CCSD(T) model is capable of providing structures – both bond distances and
angles – of high quality.
• The MP2 model is less accurate but more generally applicable as it is considerably less
expensive. Higher-order MPPT is significantly less accurate but hideously expensive,
i.e., useless.
• The performance of the CCSD model is disappointing. It however performs more
robust than MP2 in some selected cases with complicated electronic structure.
• The CISD model is also rather useless, showing no improvement when the basis set
is increased.
• The basis-set requirements are different for different models. Whereas there is no
much use to use QZ-level basis sets with the Hartree–Fock model (as the results
are not improved), the correlated models require at least a TZ-level basis set, but
preferably the cc-pVQZ basis.
The results suggest the following recipe for the determination of molecular structures:
obtain an initial structure with HF using a DZ-level basis and improve it by using MP2
and a TZ basis. If ever possible, calculate the final geometry in the cc-pVQZ basis using
the CCSD(T) model.
99
Figure 8.2: Performance of various N-electron models in conjunction with one-electron
bases in 10 molecular bond angles.
100
¯ of the cc-pVQZ basis.
Figure 8.3: Changes in bond lengths calculated from ∆
101
Figure 8.4: Reaction enthalpies.
102
Total energy [Eh ] Energy corrections [mEh ]
CCSDT HF HF→SD SD→(T) (T)→T SO MV Dar
DZ -76.24121 -76.02680 -211.21 -3.04 -0.16
TZ -76.33228 -76.05716 -267.40 -7.65 -0.08
QZ -76.35981 -76.06482 -285.98 -8.99 -0.01 0.00 -255.87 203.69
5Z -76.36899 -76.06708 -292.44 -9.52 0.04
• It is remarkable how a tiny, less than one per cent fraction of the total energy, plays
such an important role in the determination of molecular properties.
• The leading-order relativistic correction, -52.18 mEh , is larger than the triples con-
tribution already in water, and gets far more pronounced in systems with heavier
elements present!
103