We have already seen in Chapter 10 that basic properties of electron
states in materials are determined by quantum effects. This impacts
all properties of materials, including their mechanical properties,
electrical and thermal conductivities, and optical properties. An
example of the inherently quantum mechanical nature of electrical
properties is provided by the role of virtual intermediate states
in the polarizability tensor in Section 15.3
We will now continue to illustrate quantum
effects in materials with a focus on effects that require the use
of second quantization or Lagrangian field theory, or at least the
knowledge of exchange interactions for a proper treatment. We will
start at the molecular level and then discuss the second
quantization of basic excitations in condensed materials.
The inception of the Schrödinger equation was
accompanied by a large number of immediate successes, including
atomic theory, the quantum theory of photon-atom interactions, and
quantum tunneling. Another of these important successes was the
development of the theory of covalent chemical bonding, which was
initiated by Burrau1,
Heitler and London2,
and others. This is an extremely important and well studied subject
in chemistry and molecular physics, and yet it never seemed to
reach the level of popularity and recognition that other areas of
applied quantum mechanics enjoy. One reason for this lack of
popularity might be the lack of simple, beautiful model systems
which can be solved analytically. Solvable model systems are of
great instructive and illustrative value, and often provide a level
of insight that is very hard to attain with systems which can only
be analyzed by approximation methods. However, the existence and
stability of covalent bonds is clearly an important property of
molecules and of materials in general, and a basic quantitative
understanding of the covalent bond should be part of the toolbox of
every chemist, physicist and materials scientist. Indeed, there is
a model system which can be analyzed to some extent by analytic
methods. If only basic qualitative features are required, the
analytic formulation can then be used for numerical evaluations
which do not require a huge amount of effort. This model system is
the hydrogen molecule ion H2 +, which is also
known as the dihydrogen cation. The analysis of electron states for
fixed locations of the two protons in this simplest molecular
system have been investigated already in the early years of quantum
mechanics3, and have
been a subject of research ever since, both in terms of the
semi-analytic analysis in prolate spheroidal coordinates4 used in
Section 19.2, and in terms of high precision variational
calculations5. Before
specializing to H2 + we will discuss the
interplay of nuclear and electronic coordinates and the role of the
Born-Oppenheimer approximation in molecular physics.
19.1 The Born-Oppenheimer approximation
Molecules can be described by first quantized
Hamiltonians of the form
if we use properly anti-symmetrized wave functions for the
electrons and symmetrized or anti-symmetrized wave functions for
bosonic or fermionic nuclei of the same kind. Here lower case
indices enumerate electrons while upper case indices refer to
nuclei.
(19.1)
Otherwise, we might just as well use the second
quantized Schrödinger picture Hamiltonian
where the labels A, B enumerate different kinds of nuclei.
We assume that there are N
e electrons and
nuclei in our molecule.
Realistically, we would restrict attention to valence electrons
(rather than all electrons), and the numbers A would enumerate different kinds of
ion cores. However, in the example of the hydrogen molecule ion
below this distinction is void. The choice of kinetic terms also
assumes that all the particles are non-relativistic. Indeed, this
also informs the choice of interaction terms in the
Born-Oppenheimer Hamiltonian. Electromagnetic interactions between
non-relativistic charged particles are dominated by the Coulomb
interaction, but if there are relativistic charged particles in the
system, photon exchange between charged particles through their
couplings to the vector potential becomes important.
Domination of the Coulomb interaction in the case of
non-relativistic electron-nucleus and electron-electron scattering
is demonstrated in Sections 22.2 and 22.4, respectively.
Equation (22.29) provides an estimate of the
relative importance of photon exchange versus Coulomb interactions
for non-relativistic electrons and nuclei.
(19.2)
Spin labels are suppressed in (19.2) and also in the
corresponding states below, because they enter trivially in the
equations of motion6.
Note that even in the valence electrons plus ion
cores approximation, the Hamiltonians (19.1, 19.2) describe an
incredibly complicated quantum mechanical system, even in the case
of a “simple” diatomic molecule. This is because the complete
spectrum of energy levels and eigenstates of (19.1) does not only
include bound molecular states (which is complicated enough), but
also scattering states of electrons and of molecular fragments. The
Hamiltonian for the hydrogen molecule H2 describes not
only bound states of two protons and two electrons, but also
electron scattering off an H2 + ion, atomic
hydrogen-hydrogen scattering, proton scattering off an
H− ion, and a plasma of free protons and electrons.
However, our primary interest concerns an understanding of the
nature of covalent bonds and of ground state properties of
molecules. In this case, we don’t have to include the scattering
states, and we can even neglect the motion of ion cores.
Born and Oppenheimer have pointed out that it
makes intuitive sense to separate nuclear and electronic motion by
first solving the electronic problem for fixed nuclear
coordinates, and then substituting the electronic solution into a
remnant nuclear Schrödinger equation7. In the framework of quantized
Schrödinger theory this amounts to an electronic Hamiltonian
with corresponding parameter dependent electronic states
Here is an electronic
creation operator and is a creation
operator for a nucleus of species A at the location . The set of quantum numbers
specifies the state (including the
energy level), and the notation
indicates that the electronic state also depends on the location of
the nuclei.
(19.3)
(19.4)
The equation of motion for the electronic
states (19.4) with the Hamiltonian (19.3) then follows as in
Section 17.6, except that here we use a
time-independent Schrödinger equation. The equation
yields with the short hand notation
the equation
The N e -electron wave functions
are complete in the 3N
e -dimensional
configuration space of the electrons, and therefore the wave
functions of the full -particle problem can be expanded in
the form
The sum over the quantum numbers also involves at least one
integration over a continuous quantum number for the scattering
states.
(19.5)
(19.6)
On the level of the second quantized theory, the
amplitude (19.6) corresponds to the -particle state
where the parameter-dependent electronic state
is given in (19.4).
Substituting (19.6) into the full
-particle Schrödinger equation
yields the equation
This can be resolved into a set of coupled equations for the
nuclear factors through
orthogonality of the electron factors .
If this is done, no approximation has been made so far to the
problem to solve the molecular Hamiltonian (19.2). However, if we are
in the center of mass frame of the nuclei, and if both rotational
and vibrational excitations are small, we can neglect the nuclear
kinetic terms, and we find for these nuclear configurations
that their energy levels can
be approximated by
The corresponding full molecular eigenstate in this approximation
has a wave function
and a corresponding second quantized state
(19.7)
(19.8)
(19.9)
It might be tempting to conclude
from (19.8) that the solution of the electronic
equation (19.5) eventually allows us to calculate the
nuclear equilibrium configuration in the aftermath from a
requirement .
However, this is not true:
The energy level
for a general nuclear configuration
represents only the electronic energy plus the
electrostatic nuclear potential energy for that
configuration. Equation (19.8) only states that
within the Born-Oppenheimer approximation, the energy and the full
molecular energy coincide in an equilibrium configuration, but that
does not imply that the two
energies coincide in a neighborhood of an equilibrium
configuration. As a consequence the energy and the full
molecular energy can (and generically will) have different gradients with respect to the
nuclear configuration, even in a molecular equilibrium
configuration. The function may have
non-vanishing gradient in the molecular equilibrium configuration
because it neglects the contributions from nuclear kinetic
terms.
Therefore we have to use a priori knowledge of the equilibrium
configuration , e.g. from scattering
experiments, to calculate the molecular energy in the
Born-Oppenheimer approximation. We
cannot calculate both the energy and the equilibrium configuration
from (19.5).
19.2 Covalent bonding: The dihydrogen cation
The stability of molecules is an issue in
classical physics in the same sense as the stability of atoms is an
issue. It is not surprising that sharing of electrons yields a net
attractive force between positively charged nuclei or atomic cores.
Consider e.g. two protons at separation b with an electron right in the middle
between the protons. The net classical electrostatic energy of the
system is attractive, but the problem is
again to prevent collapse of the system. The corresponding quantum
mechanical system is again stabilized by wave particle duality.
Squeezing the particles very tight together implies strongly peaked
wave functions, hence too much curvature in the wave functions, and
the ensuing increase in kinetic energy eventually cannot be
compensated any more by gains in potential energy terms for
normalizable wave functions.
We apply the basic tenet of the Born-Oppenheimer
approximation to the hydrogen molecule ion H2
+ and determine approximate molecular orbitals under the
assumption that the two protons are fixed at their equilibrium
separation b. The distances
of the electron from the two protons are given by
if we assume that the two protons are located on the z axis at z = ±b∕2. A suitable set of coordinates for
the 2-center Coulomb problem are given by
and the azimuthal angle around the z axis. These coordinates are known as
prolate spheroidal coordinates. They seem to have been used for the
analysis of classical 2-center gravitational or electrostatic
problems and for acoustic and electromagnetic radiation problems
since the 19th century.
(19.10)
The surfaces are ellipsoids with the protons
in the focal points, while the surfaces are the corresponding
hyperboloids. The
coordinate lines take us from one hyperboloid to another hyperboloid
for constant
and .
For given value of , going from to takes us from the south pole of the
ellipsoid to its north pole, i.e.
is similar to the
coordinate on a sphere, except that we move from negative
z to positive z for increasing .
The advantage of this is that z > 0 corresponds to , but the right handed prolate
spheroidal coordinate system is then .
The coordinate lines are hyperbolas
, with the protons in the focal
points. corresponds to the line − b∕2 ≤ z ≤ b∕2 on the z axis and takes us to infinite
distance from the protons, i.e. plays a role similar to the radius
r in spherical
coordinates.
We apply the methods of Section 5.4 to determine tangent vectors to
the coordinate lines and the relevant differential operators. We
have
and
and this implies also
The dual basis vectors (5.21) are in the present case
and
This yields a diagonal inverse metric with
components
and the volume measure (5.27) for follows as
The Laplace operator (5.26) in spheroidal coordinates is
therefore
On the other hand, the coordinate dependence of the electrostatic
potential of the electron is
and therefore the Hamiltonian in the representation
satisfies
(19.11)
(19.12)
(19.13)
The Hamiltonian H commutes with the azimuthal angular
momentum operator L
z , and
therefore we can discuss the spectrum and eigenfunctions of
H within the subspaces of
L z eigenvalues ,
Within these subspaces, the normalization condition on the bound
electron states becomes with (19.11),
and the Hamiltonian H
m acting within
these subspaces satisfies
Here the energy E differs
from the energy E
e
(19.8)
of the molecule in the Born-Oppenheimer approximation by the
electrostatic energy of the nuclei,
(19.14)
(19.15)
Since H
m is hermitian
with respect to the scalar product appearing in (19.14), the differential
operators D
+, m and
D −, m must be hermitian with respect
to the scalar products
and
respectively. The corresponding Sturm-Liouville type boundary
conditions can be read off from the differential operators. We must
certainly have
For azimuthal quantum numbers m ≠ 0 we must also require
Note that corresponds to the interval −
b∕2 ≤ z ≤ b∕2 on the z axis, while and correspond to the half-lines
z ≤ −b∕2 and z ≥ b∕2 on the z axis, respectively. The boundary
conditions (19.17) therefore imply that the wave functions
must vanish on the z axis
if m ≠ 0, which apparently
makes sense.
(19.16)
(19.17)
We certainly should not expect that the molecular
orbitals with m = 0 vanish
on the z axis, and the
differential operators D
±, 0 are actually hermitian on their respective domains
without extra boundary conditions at or except that the wave functions should
remain finite in those points.
The point of this discourse about hermiticity of
the operators D
±, m is that as
a consequence, separation of the electronic Schrödinger equation
for the hydrogen molecule ion H2 + in terms
of prolate spheroidal coordinates will not only give us solutions,
but a complete set of
solutions in the form
Energy is a third quantum number which is treated as implicit in
the notation for the states.
(19.18)
(19.19)
(19.20)
The equation (19.19) and the
equation (19.20) for e 2 = 0 are relevant for
radiation problems and have been studied extensively, see
[1] and references there. The
solutions are known as angular spheroidal functions and radial
spheroidal functions because of the angular and radial
interpretation of the coordinates and , respectively.
The limit of
equation (19.20) immediately tells us that we can satisfy
the boundary condition (19.16) only for negative energy,
and the asymptotic form of the solution should be
with .
(19.21)
We wish to analyze in particular the sector
m = 0, which should contain
the ground state of the H2 + ion.
Equation (19.20) with m = 0 has the form
where we substituted , ψ + → ψ because in the following it will be
clear from presence or absence of the Coulomb term whether we are considering the
radial or the angular spheroidal coordinates and wave
functions.
(19.22)
The length parameter
is closely related to the Bohr radius (7.62) of the hydrogen atom.
Since our solution should remain finite at
, we
make an ansatz
Substitution into (19.22) yields a two-step recursion relation
(19.23)
(19.24)
On the other hand, must satisfy the
differential equation (19.22) without electrostatic term: ,
and on the interval . This equation allows for even
and odd solutions under , and we expect the
ground state solution to be even. Therefore we try an ansatz
where we can set e.g.
because the product form
of the ground state implies a degeneracy between d 0 and the coefficient
c 0 in the
radial factor (19.23). The constant c 0 is then determined by
the normalization condition (19.14).
(19.25)
(19.26)
(19.27)
The expansions (19.23)
and (19.26) are not the standard expansions. For the
angular function (19.26) one rather uses an expansion in terms of
Legendre polynomials (or associated Legendre polynomials
for m ≠ 0), which are orthogonal
polynomials in and satisfy (19.25)
or (19.19) for κ = 0 and . For the polynomial factors in
the radial function (19.23) one rather uses Laguerre polynomials
or , because
are complete orthogonal functions in . The corresponding two-step
recursion relations for the coefficients in these expansions then
follow from the differential equations and recursion relations of
the orthogonal polynomials. However, for our purposes the simpler
expansions (19.23) and (19.26) are sufficient for
the illustration of basic solution techniques for the dihydrogen
cation.
We cannot go ahead and simply solve the recursion
relations (19.24) and (19.28) to some finite
order to get approximate wave functions for the electron, because
for generic values of and the resulting wave functions will
not be regular and square integrable in the domains and . Therefore, one first
has to determine which pairs of parameters and
allow for regular and square
integrable solutions.
A classical method for the approximate
calculation of the allowed parameter pairs and
in a two-step recursion relation
like (19.28) uses the ratios with the initial condition
from (19.28), . The recursion
relation (19.28) can then be written as an upwards
recursion ,
or as a downwards recursion ,
The requirement of finite limits of the angular wave
function implies that the solution of (19.29, 19.30) should satisfy
One way to derive the resulting condition on and
in approximate form is to use both
relations (19.30) and (19.29) for f n with the approximation
f N = 0 for some N ≫ n. Iteration of
equation (19.30) in N − n − 1 steps yields a relation of the
form ,
while on the other hand f
n is also
determined in n steps from
equation (19.29) and to yield functions . The
condition
then implicitly determines the relation between and
.
(19.29)
(19.30)
Another way to derive the relation between
and
writes the recursion
relation (19.28) as a matrix relation
with matrix elements
The condition
is then cut off for an (N +
1) × (N + 1) submatrix
F 0 ≤ n ≤ N, 0 ≤ n′ ≤ N to yield a relation between
and .
(19.31)
Once the relation between
and is established, application of the
same techniques to (19.24) implies a relation between the remaining
parameter and the parameter b∕a e . Since , this relation
determines the quantized energies of the even states (due to the
even ansatz (19.26)), with
m = 0.
Application of the same techniques with an odd
ansatz for or to the equations
with general m yields the
approximate energy levels and wave functions of the electron in the
dihydrogen cation with fixed centers. The matrix and determinant
condition for equation (19.24) are
(19.32)
Using only 3 × 3 matrices F and C in the
conditions (19.31) and (19.32) yields a ground
state energy
with eigenvalues and κ b = 1. 42 for a bond length
b = 105 pm. Using the
equivalent of a 4 × 4 matrix F and a 6 × 6 matrix
C in the
expansions with Legendre and Laguerre polynomials, Aubert
et al. 8 found E e = −16. 4 eV with κ b = 1. 485 for b = 2a. Either way, we find that the ground
state energy E
e is smaller
than the energy E
1 = −13. 6 eV of a hydrogen atom and a proton at large
distance, i.e. sharing the electron stabilizes the dihydrogen
cation in spite of the electrostatic repulsion of the protons. The
actual dissociation energy for the dihydrogen cation is
about 2.6 eV, i.e. the value of Aubert et al. from higher order
approximation of the recursion relations is much better, as
expected.
The coefficients which follow from the
relations (19.24), (19.28), (19.27)
and (19.14) for and κ b = 1. 42 are
(19.33)
The resulting function
along the symmetry axis of the cation is displayed in
Figure 19.1. The abscissa u is related to the z coordinate from
equation (19.10) through u = 2z∕b.
Fig. 19.1
The function
for the approximate ground state (19.33) is displayed along
the symmetry axis of the dihydrogen cation. The protons are located
at u = ±1. The abscissa
u = 2z∕b is in the range − 1 < u < 1, where . Outside of this range we have
for u < −1 () and for u > 1 ()
This low order approximation has already all the
characteristic features of the real ground state as confirmed by
higher order approximations. The electronic wave functions fall off
with a linear exponential for large values of the radial coordinate
,
and a double peak appears at the locations of the two protons.
However, higher order approximations yield lower energies with a
corresponding stronger exponential drop , κ b > 1. 42. This implies that the
values of along
the symmetry axis are actually underestimated in the approximation
in Figure 19.1, and the cusps become more pronounced in
higher order approximations.
Cusps are inevitable in many-particle wave
functions for charged particles. Kato had demonstrated that these
wave functions have cusps for coalescence of any two charged
particles9.
Specifically, if r
12 is the separation between two particles with charges
Z 1 e and Z 2 e, and if the wave function does not
vanish for r
12 → 0, the directional average of ∂ ψ∕∂
r 12 in the limit r 12 → 0 satisfies
The constant γ
12 is
where is the
reduced mass of the charged particles. In particular, coalescence
of two electrons or of electrons and protons corresponds to
19.3 Bloch and Wannier operators
The use of second quantized
Hamiltonians is ubiquitous in condensed matter physics, and in the
following sections we will introduce very common and useful
examples for this, viz. the
Hubbard Hamiltonian for electron-electron interactions, phonons,
and a basic Hamiltonian for electron-phonon coupling. We will
motivate the model Hamiltonians from basic Schrödinger field theory
or the classical Hamiltonian for lattice vibrations, respectively,
and refer the reader to more specialized monographs for alternative
derivations of these Hamiltonians.
However, before we embark on this journey, we
should generalize the results from Sections 10.1, 10.2 and 10.3 to
three dimensions and combine them with what we had learned in
Chapter 17 about quantization and Schrödinger
field operators.
The basic Schrödinger picture Hamiltonian for an
electron gas has the form
(19.34)
Suppose that this electron gas exists in a
lattice with basis vectors and dual basis vectors
(4.18). The lattice points are
with a
triplet of integers n
i . However, we
can also use the basis as a basis in ,
Note that the coordinates x
i and the
lattice basis vectors have the dimensions of length,
while the dual basis vectors have dimension length−1.
The coordinates ν
i are
dimensionless.
A Brillouin zone is a unit cell in the dual lattice
stretched by a factor 2π
and then shifted such that the center of the Brillouin zone is a
dual lattice point,
see also (10.10), where this notion was
introduced for one-dimensional lattices.
(19.35)
The vectors in a Brillouin zone have the
following useful properties, which are easily derived from Fourier
transformation on a one-dimensional lattice10 ,
Recall that the volume of a unit cell in the dual lattice is related to the
volume of a unit cell in the direct lattice through , (4.19).
(19.36)
(19.37)
If a unit cell in the lattice contains
N ions, electrons in the
lattice will also experience a lattice potential
where
enumerates the locations of the ions in the unit cell , and
n A e is the effective charge of the
A-th ion. On the level of
the quantized Schrödinger field theory, the
potential (19.38) adds the operator
to the Hamiltonian (19.34). We will focus on this potential term in
the remainder of this section and neglect the electron-electron
interaction term in (19.34). The corresponding first quantized
Hamiltonian
is invariant under lattice translations,
and therefore admits a complete set of Bloch type eigenstates,
see (10.14) for the one-dimensional case.
We can decompose the Schrödinger picture field operators
in terms of a
complete set of Bloch type eigenstates
with periodic Bloch factors
We denote integration over the unit cell of the lattice with
. Normalization of
the Bloch energy eigenfunctions then yields
and with (19.37) we find
Equation (19.42) also implies with the canonical
anticommutation relations for the Schrödinger field operators
and that the the
operators satisfy the relations
The second quantized state
is therefore a state with an electron in the first quantized
orbital Bloch state
and spin projection . Equation (19.41) and the conjugate
equation for are a special
case of our general observations (17.59) and (17.58) how annihilation and creation
operators for particles in specific states relate to the generic
operators and .
(19.38)
(19.39)
(19.40)
(19.41)
(19.42)
(19.43)
Since the operators are restricted to the
Brillouin zone, or equivalently are periodic in the rescaled dual
lattice with the Brillouin zone as unit cell,
we can expand them using equations (19.36, 19.37),
The operators in the direct
lattice satisfy
Substitution of (19.41) into (19.45) yields
with the Wannier states
These states satisfy the usual completeness relations as a
consequence of the completeness relations of the Bloch states
,
The operator therefore
generates an electron with spin projection in
the Wannier state .
(19.44)
(19.45)
(19.46)
We denote the operators and as Bloch
operators, and the operators and
as Wannier
operators.
19.4 The Hubbard model
The Hubbard model treats electron-electron
interactions in a tight binding approximation. Therefore we wish to
use the creation operators for
electrons in Wannier states.
The kinetic electron operator transforms into
Wannier type operators according to
(19.47)
This has the form of a hopping Hamiltonian for
jumps ,
with a hopping parameter
(19.48)
On the other hand, the electron-electron
interaction Hamiltonian becomes
with the Coulomb matrix element
H
e e would
certainly be dominated by terms on the same lattice site, and if we
restrict the discussion to a single band index, the
electron-electron interaction Hamiltonian assumes the simple form
with the spin polarized occupation number operators for lattice
site ,
The Hamiltonian (19.49) is known as the Hubbard Hamiltonian 11. This Hamiltonian was invented for
the analysis of ferromagnetic behavior in transition metals, and
soon became a very widely used model Hamiltonian in condensed
matter theory not only for magnetic ordering, but also for the
general investigation of electron correlations, conductivity
properties and disorder effects in many different classes of
materials12.
However, the Hubbard model also provides basic insight into the
relevance of delocalized Bloch states versus localized Wannier
states, as we will now discuss.
(19.49)
We assume that the hopping term is invariant
under translation and symmetric between sites, i.e.
If hopping is suppressed,
the Hamiltonian involves only the number operators ,
and the eigenstates and energy levels are given by particle states
with energy
Here N 1 and
N 2 are the
numbers of single and double occupied lattice sites, respectively.
This is also denoted as the atomic limit, since the electrons are
fixed at the atoms and the total energy is a sum of atomic
terms.
(19.50)
On the other hand, if we can neglect the
electron-electron interaction term, U = 0, we end up with a quadratic
Hamiltonian
(19.51)
We can map the electron operators on lattice
sites to electron operators (19.44) in the Brillouin
zone,
This diagonalizes the Hamiltonian (19.51),
(19.52)
(19.53)
(19.54)
The single particle eigenstate of the
Hamiltonian (19.53) with energy ,
is a Bloch state, while the single particle eigenstate
of the Hamiltonian (19.50) is a Wannier state. The magnitude of the
hopping terms
relative to U will
therefore determine the importance of itinerant
(or delocalized) Bloch electron states versus localized
Wannier electron states in the lattice.
19.5 Vibrations in molecules and lattices
Another basic excitation of lattices concerns
oscillations of lattice ions or atoms around their equilibrium
configurations. This kind of excitation is particularly amenable to
description in classical mechanical terms, but at the quantum level
lattice vibrations are very similar to quantum excitations of the
vacuum like electrons or photons. In particular, elementary lattice
vibrations can be spontaneously created and absorbed like photons,
and therefore require a quantum field theory which is similar to
the field theory for photons.
We will discuss the classical theory of small
oscillations of N-particle
systems in the present section as a preparation for the discussion
of quantized lattice vibrations in Section 19.6. We suspend summation
convention in this section, because we often encounter expressions
with three identical indices in a multiplicative term, and also
terms like without summation over the
repeated index.
Normal coordinates and normal
oscillations
We consider an N particle system with potential
.
The equilibrium condition
(19.55)
implies for the second order expansion around an
equilibrium configuration ,
where
parametrize the deviations from equilibrium and the coefficients
V i k, j
l are
The second order Lagrange function for small oscillations of the
system,
yields 3N coupled equations
of motion
Fourier transformation
yields the conditions
Writing this in the form
tells us that the 3N-dimensional vector
must have the form
where
is an eigenvector of the symmetric 3N × 3N matrix
with eigenvalue ω
I 2.
We assume that
is a stable equilibrium configuration such that all eigenvalues of
satisfy ω I 2 ≥ 0, and we define
as the
positive semi-definite roots.
(19.56)
(19.57)
(19.58)
(19.59)
(19.60)
(19.61)
(19.62)
Since is a symmetric real
3N × 3N matrix, we can find 3N orthogonal normalized real vectors
which solve the eigenvalue problem
The general solution (19.60) of the eigenvalue
problem with eigenvalue ω
I 2
will then have the form
with arbitrary complex factors .
The mode expansion (19.58) will therefore take the form
(19.63)
(19.64)
Equation (19.63) and imply the
orthogonality relations
This yields
where we assume that eigenvectors within degeneracy
subspaces have been orthonormalized.
(19.65)
Note that the normalization changes the
dimensions and the physical meaning of the coefficients. The
amplitudes a
I, i k in equation (19.64) have the
dimensions of a length, and the related eigenvectors and factors q I have the dimension of
mass1∕2 ×length. The normalized eigenvectors
are dimensionless, and
therefore the related coefficients have dimension
mass−1∕2. We will denote the related 3N dimensional vector
as an amplitude
vector.
The small oscillations of the system are then
determined by the eigenmodes (or equivalently
), and how strongly these
eigenmodes of oscillation are excited,
(19.66)
(19.67)
(19.68)
The 3N
complex amplitudes q
I are denoted as
normal coordinates of the
oscillating N particle
system, and the related eigenmodes of oscillation are also denoted
as normal modes. Note from
equations (19.66) or (19.68) that we can think
of the coefficients also as the components of a
3N × 3N transformation matrix between the
3N Cartesian coordinates
x i k (t) and the 3N normal coordinates q I of the oscillating system.
These 3N × 3N matrices satisfy the mass weighted
orthogonality properties (19.65) and
which follows from re-substitution of q I (19.68) into x i k (t) (19.66).
(19.69)
Appearance of the particular eigenvalue
ω I 2 = 0 implies that
the system is symmetric under rotations or translations. The
corresponding amplitude vectors
denote the tangential directions to rotations or translations of
the system.
We have learned that small oscillations of a
system are always superpositions of the normal oscillation modes or
eigenoscillations of the system. A
priori this does not seem to be particularly helpful to
determine the actual small oscillations of a system, because
finding the eigenmodes is equivalent to the diagonalization of the
3N × 3N matrix , which is anyhow the main task
in the solution of the equations of motion (19.57) using the Fourier
ansatz (19.64).
However, if the equilibrium configuration of the
system has symmetries, then we can often guess the form of some of
the eigenmodes which leaves us with a smaller diagonalization
problem for the determination of the remaining eigenmodes.
Eigenmodes of three masses
A simple example for the identification of normal
modes of a coupled particle system is given by three identical
masses in a regular triangle, see Figure 19.2.
Fig. 19.2
Three elastically bound masses with
equilibrium distance d
We will determine the eigenmodes in the plane of
the triangle. The potential of the coupled system in the harmonic
approximation is
The matrix V
i k, j l is
and we must have
Absence of external forces on the coupled system implies that there
must be two translational and one rotational eigenmode, see
Figures 19.2 and 19.3,
Fig. 19.3
The rotation mode
The equations
for I = 1, 2, 3 are readily
verified.
The symmetry reveals that another eigenmode can
be read off from Figure 19.4.
Fig. 19.4
The eigenmode
This yields the corresponding normalized
eigenvector
and application of
yields for the corresponding frequency
So far we have found four eigenmodes of the
planar system, and there must still be two remaining eigenmodes,
which must be orthogonal on the eigenmodes .
This yields for
the conditions
with general solutions
Application of reveals that these are
degenerate eigenvectors with eigenvalue
and an orthonormal basis in the degeneracy subspace is provided by
The corresponding eigenmodes are shown in
Figure 19.5.
Fig. 19.5
The eigenmodes and
The general small oscillation with ω > 0 is then given by
with
The diatomic linear chain
Lines of harmonically bound atoms provide
important model systems for oscillations in solid state physics. We
consider in particular a diatomic chain of 2N atoms with masses m and M, respectively. This model is shown in
Figure 19.6. The force constant between the atoms is
K and their equilibrium
distance is a∕2. The number
N of atom pairs is assumed
to be even for simplicity.
Fig. 19.6
A diatomic linear chain with masses
m and M and lattice constant a
We label the pairs of atoms with an index
n, 1 − (N∕2) ≤ n ≤ N∕2, and we use periodic boundary
conditions for the displadements x n and X n ,
The Lagrange function
yields equations of motion
which can be solved using Fourier decomposition on a finite
periodic chain,
with
(19.70)
(19.71)
The geometric series
implies that the inversion of (19.71) is
Since the resulting system of ordinary differential equations for
and is linear with constant
coefficients, we also use Fourier transformation to the frequency
domain,
and the coupled set of equations (19.70) separate into
coupled pairs of equations for different wave numbers k,
(19.72)
(19.73)
(19.74)
This implies that there is a unique set of
frequencies ω = ω
k for each wave
number k which has to
satisfy
This condition has two solutions (up to irrelevant overall signs of
ω k±),
and we have
(19.75)
Equation (19.75) reads in terms of the reduced mass
μ = m M∕(m + M) of the atom pair in the unit cell
An example of these dispersion relations with M = 1. 5m is displayed in
Figure 19.7.
(19.76)
Fig. 19.7
The frequencies ω k± from the dispersion
relation (19.76) for M = 1. 5m and 0 ≤ k a ≤ π. The frequencies ω k± are displayed in units of
, where μ is the reduced mass of the atom pair
in a unit cell
Note that the Lagrange function for a single atom
pair in the unit cell is
and therefore the oscillation frequency of the single pair is
.
The frequencies at k = 0 are ω 0− = 0 and .
The solution of (19.73, 19.74) for ω 0− = 0: , is a uniform
translation of the whole chain,
The solution for ω
0+: , is an
oscillation
The acoustic solution for k a = π is
i.e. only the heavy atoms oscillate,
On the other hand, the optical eigenmode with k a = π,
corresponds to an oscillation of the light atoms,
The general longitudinal oscillation will be a
superposition of all longitudinal eigenvibrations.
Quantization of N-particle oscillations
The Lagrange function (19.56) implies canonical
commutation relations
This yields commutation relations for the normal coordinates
Therefore we find canonical annihilation and creation operators for
the eigenvibrations in the form
The discussion of the diatomic chain taught us that for lattice
oscillations the eigenmodes also depend on wave vectors in a
Brillouin zone, and the following section will show that there can
be up to 3N branches if we
have N atoms per unit cell.
Therefore we will have annihilation and creation operators for
lattice vibrations which are related to the corresponding normal
modes through
The elementary excitations
of the lattice vibrations are denoted as phonons.
19.6 Quantized lattice vibrations: Phonons
We will first generalize the
previous discussion of vibrations in N-particle systems to the case of
three-dimensional lattices, and then quantize the lattice
vibrations
We denote the three basis vectors of a
three-dimensional lattice with , 1 ≤ i ≤ 3. Each location in the
lattice denotes a particular location of a corresponding unit cell,
and we can use or equivalently the three
integers n
i also to
address the particular unit cell to which the point belongs. Suppose we have
N atoms (or ions) per unit
cell in the lattice. We denote the displacement of the A-th atom from its equilibrium value in
cell by , and in the
harmonic approximation the displacements satisfy equations of
motion
corresponding to a Lagrange function
(19.77)
(19.78)
Substitution of Fourier transforms
into the equations of motion (19.77) yields the
eigenvalue conditions
with the symmetric matrices
Translation invariance in the lattice implies that
cannot depend on . Therefore we
can write
with inversion
Symmetry of the real matrix
under
implies
i.e.
(19.79)
(19.80)
(19.81)
(19.82)
Substitution of (19.81) and
in (19.79) yields,
For fixed value of , this is a hermitian eigenvalue
problem for the 3N-dimensional complex vector
Reality of the displacement vectors implies
and
For each point in the Brillouin zone, there will be
3N solutions and
of (19.83) which satisfy the orthogonality property
(19.83)
(19.84)
The hermiticity and transposition properties
imply that we have as a consequence of (19.83) for the normalized
solutions,
also the equations
and
(19.85)
(19.86)
(19.87)
Up to linear combinations within degeneracy
subspaces, the general set of solutions of the
conditions (19.83) will then have the form
with complex factors . This yields the general
lattice vibration in terms of the orthonormalized solutions
of (19.83),
The dual orthogonality relation to (19.84) follows from
re-substitution of into (19.88),
This is actually fulfilled due to two more fundamental completeness
relations. The first relation is completeness of 3N orthonormal unit vectors
in a 3N-dimensional vector
space,
where 1 is the
3N × 3N unit matrix, or if the atomic indices
are spelled out,
where now 1 is the 3 × 3
unit matrix referring to the spatial indices. The second relation
is the completeness relation (19.36).
(19.88)
(19.89)
(19.90)
(19.91)
(19.92)
The canonical quantization relations
imply
i.e. the phonon annihilation operator for the Ith mode with wave vector in the lattice is
and the displacement operators in terms of the phonon operators are
given by
with
(19.93)
(19.94)
(19.95)
The Lagrange function (19.78) implies a
Hamiltonian for the lattice vibrations,
This yields after substitution of equations (19.94, 19.95) and use of the
eigenvalue, hermiticity and orthogonality conditions for the
eigenvalue problem (19.85–19.87) the result13
(19.96)
It is uncommon but helpful for a better
understanding of Bloch and Wannier states of electrons to point out
an analogy with lattice vibrations at this point.
We have seen in Sections 10.1, 10.2 and 10.3 that
electrons in lattices can be described in terms of delocalized
Bloch states
or corresponding Wannier states w n, ν (x), w n, ν (x, t). Here ν labelled the different cells in the
lattice and n labelled the
different electron energy bands in the periodic potential of the
crystal. We have encountered the corresponding states in
three-dimensional lattices in equations (19.43, 19.46). To make the
connection to lattice vibrations, we re-express the
result (19.88) for the particular phonon energy band
I in the form
Instead of the continuous dependence of the Bloch or Wannier type
wave functions
and on
location , we have displacement variables at
the discrete locations in the lattice. However
with the correspondence of band indices , the Brillouin zone
representation of
the displacements corresponds to the Bloch waves (19.43) for electron
states, while the set of displacements
in the unit cell at corresponds to the Wannier
states (19.46).
19.7 Electron-phonon interactions
Phonons in the lattice of a solid material
naturally couple to electrons through the electrostatic interaction
between the electrons and the ion cores. If we neglect
electron-electron interactions, the basic Schrödinger picture
Hamiltonian for quantized electrons in a lattice of ion cores with
N atoms in the unit cell
has the form
We assume that the A-th atom or ion in the unit cell
couples to the electron with an effective charge n A e, and we treat the atoms or ions as
classical sources of electrostatic fields. However, we treat the
lattice vibrations on the quantum level, which according to
Sections 19.5 and 19.6 amounts to canonical quantization of the
lattice displacements
The leading order expansion of the Coulomb term
corresponds to a dipole approximation in the language of
Chapter 15, except that here the dipole
operator
is quantized according to (19.94, 19.95). This yields an electron-phonon
interaction Hamiltonian of the form
where we substituted the time-independent phonon operators
for the
Hamiltonian in the Schrödinger picture. For the electron operators,
we could substitute Bloch or Wannier type operators. However, Bloch
operators make much more sense, because the dipole
approximation (19.97) is a small oscillation approximation in
the sense ,
or otherwise we should include quadrupole and higher order terms.
This implies that matrix elements of electron states with the
lattice electric fields
must not be dominated by large terms from the ion cores. The linear
phonon coupling Hamiltonian H e−q should therefore not be a good
approximation for the localized electrons in Wannier states.
Evaluation of the substitution of the free electron operators
through Bloch operators (19.41) in H e−q uses the fact that integration
over can be split into summation over the
lattice and integration over the unit
lattice cell V,
and that the lattice electric fields satisfy
We denote the Bloch operators for the electrons by to avoid confusion
with the phonon operators. This yields the following form for the
electron-phonon interaction operator,
(19.97)
(19.98)
We can also write this as
with coupling matrices between the phonons and the Bloch electrons,
The products in (19.99) contain summations over the electron
energy band indices n,
n′.
(19.99)
(19.100)
Below we will need the following property of the
electron-phonon coupling functions,
(19.101)
The full Hamiltonian also contains the free
Hamiltonian for the phonons and the Bloch electrons
with
(19.102)
The two interaction terms in (19.99) describe
absorption and emission of a phonon of wave number by a Bloch electron. The resulting
exchange of virtual phonons between electron pairs will generate an
effective interaction between the electrons. If interband couplings
can be neglected,
and , a
simple method to estimate this phonon mediated electron-electron
interaction eliminates the first order phonon coupling through the
Lemma 1 (6.22) for exponentials of operators.
A unitary transformation
with
eliminates the leading order electron-phonon coupling term due to
and generates a direct electron-electron coupling term
In the next step we substitute
in the second term in H
e−e (q) and use the
properties (19.101) and .
This yields
(19.103)
Phonons with frequencies which are large compared
to the electron energy difference,
lower the energy of a two-electron state, thus implying an
energetically favorable correlation between electrons. Effectively,
a negative coefficient of
also amounts to an electron-electron attraction.
Compare (19.103) with the simplified expression for
free fermion operators,
In space this becomes
which is an attractive interaction for and repulsive otherwise.
The possible instability of Fermi surfaces
against phonon-induced energetically favored correlations between
electrons, and the ensuing suppression of electron scattering, had
been identified in the 1950s as the mechanism for low temperature
superconductivity14. Please consult [5, 17,
22, 25] for textbook discussions of low temperature
superconductivity.
19.8 Problems
19.1. Suppose we are using the
Born-Oppenheimer approximation for the hydrogen atom, i.e. we treat
the proton as fixed at location . This would
yield the same energy levels and energy eigenfunctions that we had
found in the exact solution in Chapter 7, except that the reduced mass
would be replaced
by the electron mass m
e in the result
for the Bohr radius a, and
therefore also in the energy eigenvalues and the wave
functions.
Show that the corresponding change in the mass
value δ μ = m e −μ satisfies . Show also that in
the center of mass frame, the neglected kinetic energy of the
proton is related to the kinetic energies of the electron and of
the relative motion according to
Expand the ground state wave function in the Born-Oppenheimer
approximation in first order in in terms of the exact energy
eigenstates from Chapter 7.
19.2. Show that the hermitian symmetric
matrix (19.80)
with eigenvalues and
corresponding normalized eigenvectors has
square roots ,
Hint: The column vectors can be
used to form a unitary matrix . The matrix
transforms
into diagonal form, or in turn can be used to generate
from its diagonal form .
Use this observation to construct all the possible square roots
in
terms of and .
19.3. Suppose the three particles with
masses m and M in Figure 19.8 can only move in one
dimension.
Fig. 19.8
Three particles with masses m and M. It is supposed that the particles
can only move along the line connecting them
The potential energy of the system is
Calculate the eigenvibrations and the eigenfrequencies of the
system.
Solution.
The potential in matrix notation is
and we have to find the eigenvectors of the corresponding matrix
cf. 19.62.
(19.104)
Rather than trying to solve
we can infer two eigenmodes from the translation and reflection
symmetry of the system.
Invariance of the potential under translations
implies that one eigenvector
of has the form
Reflection symmetry also suggests an eigenmode , x 2 = 0,
and application of yields the corresponding
eigenvalue
The remaining eigenvector follows from orthogonality on
and
,
and application of confirms that this is an
eigenmode with frequency
For the actual eigenvibration we have to go back
to the amplitude vector (19.60), because different
masses participate in the oscillation. The normalized amplitude
vector (19.65) is
The eigenvibrations and are shown in
Figure 19.9.
Fig. 19.9
The eigenvibrations and
19.4. Calculate the positive
semi-definite square root of the matrix in
equation (19.104). Use the hint from
Problem 2.
Answer.
19.5. The electron-phonon interaction
Hamiltonian (19.99) is very similar to the electron-photon
interaction Hamiltonian in the representation (18.92),
(19.105)
Which effective electron-electron interaction
Hamiltonian H
e−e (γ) would you get if you eliminate
the photon operators through a unitary transformation
similar to the transformation that we performed to transform
H e−q into H e−e (q) (19.103)?
Bibliography
1.
M. Abramowiz, I.A. Stegun
(eds.), Handbook of Mathematical
Functions, 10th printing (Wiley, New York, 1972)
5.
J. Callaway, Quantum Theory of the Solid State
(Academic press, Boston, 1991)
11.
P. Fulde, Electron Correlations in Molecules and
Solids, 2nd edn. (Springer, Berlin, 1993)
17.
H. Ibach, H. Lüth,
Solid State Physics – An
Introduction to Principles of Materials Science, 3rd edn.
(Springer, Berlin, 2003)
22.
C. Kittel, Quantum Theory of Solids, 2nd edn.
(Wiley, New York, 1987)
25.
O. Madelung, Introduction to Solid-State Theory
(Springer, Berlin, 1978)
Footnotes
1
Ø. Burrau, Naturwissenschaften 15, 16 (1927); K.
Danske Vidensk. Selsk., Mat.-fys. Medd. 7(14) (1927).
3
A.H. Wilson, Proc. Roy. Soc. London A 118, 617,
635 (1928); E. Teller, Z. Phys. 61, 458 (1930); E.A. Hylleraas, Z.
Phys. 71, 739 (1931); G. Jaffé, Z. Phys. 87, 535 (1934).
4
See e.g. G. Hunter, H.O. Pritchard, J. Chem.
Phys. 46, 2146 (1967); M. Aubert, N. Bessis, G. Bessis, Phys. Rev.
A 10, 51 (1974); T.C. Scott, M. Aubert-Frécon, J. Grotendorf, Chem.
Phys. 324, 323 (2006).
5
B. Grémaud, D. Delande, N. Billy, J. Phys. B 31,
383 (1998); M.M. Cassar, G.W.F. Drake, J. Phys. B 37, 2485 (2004);
H. Li, J. Wu, B.-L. Zhou, J.-M. Zhu, Z.-C. Yan, Phys. Rev. A 75,
012504 (2007).
6
We would have to be more careful if we would
discuss expectation values, because exchange integrals appear in
the expectation values of potential terms, see
Section 17.7
9
T. Kato, Commun. Pure Appl. Math. 10, 151 (1957).
See also R.T. Pack, W.B. Brown, J. Chem. Phys. 45, 556 (1966) and
Á. Nagy, C. Amovilli, Phys. Rev. A 82, 042510 (2010).
10
We have seen the corresponding one-dimensional
equations in (10.1–10.4). However, when comparing
equations (19.36) and (19.37)
with (10.1–10.4) please keep in mind that the continuous
variables κ
i play the role
of x there, while the
discrete lattice sites compare
to the discrete momenta 2π
n∕a in
equations (10.1–10.4), see also (10.12).
11
J. Hubbard, Proc. Roy. Soc. London A 276, 238
(1963), see also M.C. Gutzwiller, Phys. Rev. Lett. 10, 159
(1963).
13
You also have to use that the matrix has
a positive semi-definite square root , see
Problem 19.2. Therefore we also have e.g.
14
J. Bardeen, L.N. Cooper, J.R. Schrieffer, Phys.
Rev. 108, 1175 (1957); see also H. Fröhlich, Phys. Rev. 79, 845
(1950) and J. Bardeen, D. Pines, Phys. Rev. 99, 1140 (1955).