
Rainer Dick, Advanced Quantum Mechanics, Graduate Texts in Physics, doi:10.1007/978-3-319-25675-7_19

19. Quantum Aspects of Materials II

Rainer Dick

Department of Physics and Engineering Physics, University of Saskatchewan, Saskatoon, Saskatchewan, Canada

We have already seen in Chapter 10 that basic properties of electron states in materials are determined by quantum effects. This impacts all properties of materials, including their mechanical properties, electrical and thermal conductivities, and optical properties. An example of the inherently quantum mechanical nature of electrical properties is provided by the role of virtual intermediate states in the polarizability tensor in Section 15.3.

We will now continue to illustrate quantum effects in materials with a focus on effects that require the use of second quantization or Lagrangian field theory, or at least the knowledge of exchange interactions for a proper treatment. We will start at the molecular level and then discuss the second quantization of basic excitations in condensed materials.

The inception of the Schrödinger equation was accompanied by a large number of immediate successes, including atomic theory, the quantum theory of photon-atom interactions, and quantum tunneling. Another of these important successes was the development of the theory of covalent chemical bonding, which was initiated by Burrau¹, Heitler and London², and others. This is an extremely important and well studied subject in chemistry and molecular physics, and yet it never seemed to reach the level of popularity and recognition that other areas of applied quantum mechanics enjoy. One reason for this might be the lack of simple, beautiful model systems which can be solved analytically. Solvable model systems are of great instructive and illustrative value, and often provide a level of insight that is very hard to attain with systems which can only be analyzed by approximation methods. However, the existence and stability of covalent bonds is clearly an important property of molecules and of materials in general, and a basic quantitative understanding of the covalent bond should be part of the toolbox of every chemist, physicist and materials scientist. Indeed, there is a model system which can be analyzed to some extent by analytic methods. If only basic qualitative features are required, the analytic formulation can then be used for numerical evaluations which do not require a huge amount of effort. This model system is the hydrogen molecule ion H₂⁺, which is also known as the dihydrogen cation. The electron states for fixed locations of the two protons in this simplest molecular system were already investigated in the early years of quantum mechanics³, and have been a subject of research ever since, both in terms of the semi-analytic analysis in prolate spheroidal coordinates⁴ used in Section 19.2, and in terms of high precision variational calculations⁵.

Before specializing to H₂⁺ we will discuss the interplay of nuclear and electronic coordinates and the role of the Born-Oppenheimer approximation in molecular physics.

19.1 The Born-Oppenheimer approximation

Molecules can be described by first quantized Hamiltonians of the form

$\displaystyle\begin{array}{rcl} H& =& \sum _{i} \frac{\boldsymbol{p}_{i}^{2}} {2m_{e}} +\sum _{I} \frac{\boldsymbol{P}_{I}^{2}} {2M_{I}} +\sum _{I<J} \frac{Z_{I}Z_{J}e^{2}} {4\pi \epsilon _{0}\vert \boldsymbol{R}_{I} -\boldsymbol{ R}_{J}\vert } +\sum _{i<j} \frac{e^{2}} {4\pi \epsilon _{0}\vert \boldsymbol{r}_{i} -\boldsymbol{ r}_{j}\vert } \\ & &-\sum _{i,J} \frac{Z_{J}e^{2}} {4\pi \epsilon _{0}\vert \boldsymbol{r}_{i} -\boldsymbol{ R}_{J}\vert } {}\end{array}$

(19.1)

if we use properly anti-symmetrized wave functions for the electrons and symmetrized or anti-symmetrized wave functions for bosonic or fermionic nuclei of the same kind. Here lower case indices enumerate electrons while upper case indices refer to nuclei.

Otherwise, we might just as well use the second quantized Schrödinger picture Hamiltonian

$\displaystyle\begin{array}{rcl} H& =& \int \!d^{3}\boldsymbol{x}\left ( \frac{\hbar ^{2}} {2m_{e}}\boldsymbol{\nabla }\psi _{e}^{+}(\boldsymbol{x}) \cdot \boldsymbol{\nabla }\psi _{ e}(\boldsymbol{x})+\sum _{A} \frac{\hbar ^{2}} {2M_{A}}\boldsymbol{\nabla }\psi _{A}^{+}(\boldsymbol{x}) \cdot \boldsymbol{\nabla }\psi _{ A}(\boldsymbol{x})\right ) \\ & & +\int \!d^{3}\boldsymbol{x}\int \!d^{3}\boldsymbol{x}' \frac{e^{2}} {4\pi \epsilon _{0}\vert \boldsymbol{x}-\boldsymbol{x}'\vert }\left (\sum _{A<B}Z_{A}Z_{B}\psi _{A}^{+}(\boldsymbol{x})\psi _{ B}^{+}(\boldsymbol{x}')\psi _{ B}(\boldsymbol{x}')\psi _{A}(\boldsymbol{x})\right. \\ & & +\sum _{A}\frac{Z_{A}} {2} \psi _{A}^{+}(\boldsymbol{x})\psi _{ A}^{+}(\boldsymbol{x}')\psi _{ A}(\boldsymbol{x}')\psi _{A}(\boldsymbol{x}) + \frac{1} {2}\psi _{e}^{+}(\boldsymbol{x})\psi _{ e}^{+}(\boldsymbol{x}')\psi _{ e}(\boldsymbol{x}')\psi _{e}(\boldsymbol{x}) \\ & & \left.-\sum _{A}Z_{A}\psi _{e}^{+}(\boldsymbol{x})\psi _{ A}^{+}(\boldsymbol{x}')\psi _{ A}(\boldsymbol{x}')\psi _{e}(\boldsymbol{x})\right ), {}\end{array}$

(19.2)

where the labels A, B enumerate different kinds of nuclei. We assume that there are $N_{e}$ electrons and $N_{n} =\sum _{A}N_{A}$ nuclei in our molecule. Realistically, we would restrict attention to valence electrons (rather than all electrons), and the labels A would enumerate different kinds of ion cores. However, in the example of the hydrogen molecule ion below this distinction is void. The choice of kinetic terms also assumes that all the particles are non-relativistic. Indeed, this also informs the choice of interaction terms in the Born-Oppenheimer Hamiltonian. Electromagnetic interactions between non-relativistic charged particles are dominated by the Coulomb interaction, but if there are relativistic charged particles in the system, photon exchange between charged particles through their couplings to the vector potential $\boldsymbol{A}(\boldsymbol{x})$ becomes important. Domination of the Coulomb interaction in the case of non-relativistic electron-nucleus and electron-electron scattering is demonstrated in Sections 22.2 and 22.4, respectively. Equation (22.29) provides an estimate of the relative importance of photon exchange versus Coulomb interactions for non-relativistic electrons and nuclei.

Spin labels are suppressed in (19.2) and also in the corresponding states below, because they enter trivially in the equations of motion⁶.

Note that even in the valence electrons plus ion cores approximation, the Hamiltonians (19.1, 19.2) describe an incredibly complicated quantum mechanical system, even in the case of a “simple” diatomic molecule. This is because the complete spectrum of energy levels and eigenstates of (19.1) includes not only bound molecular states (which is complicated enough), but also scattering states of electrons and of molecular fragments. The Hamiltonian for the hydrogen molecule H₂ describes not only bound states of two protons and two electrons, but also electron scattering off an H₂⁺ ion, atomic hydrogen-hydrogen scattering, proton scattering off an H⁻ ion, and a plasma of free protons and electrons. However, our primary interest concerns an understanding of the nature of covalent bonds and of ground state properties of molecules. In this case, we don’t have to include the scattering states, and we can even neglect the motion of ion cores.

Born and Oppenheimer have pointed out that it makes intuitive sense to separate nuclear and electronic motion by first solving the electronic problem for fixed nuclear coordinates, and then substituting the electronic solution into a remnant nuclear Schrödinger equation⁷. In the framework of quantized Schrödinger theory this amounts to an electronic Hamiltonian

$\displaystyle{ H_{e} = H -\int \! d^{3}\boldsymbol{x}\,\sum _{ A} \frac{\hbar ^{2}} {2M_{A}}\boldsymbol{\nabla }\psi _{A}^{+}(\boldsymbol{x}) \cdot \boldsymbol{\nabla }\psi _{ A}(\boldsymbol{x}) }$

(19.3)

with corresponding parameter dependent electronic states

$\displaystyle\begin{array}{rcl} \boldsymbol{\vert }\boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\boldsymbol{\rangle }& =& \prod _{i=1}^{N_{e} }\int \!d^{3}\boldsymbol{x}_{ i}\,\psi _{e}^{+}(\boldsymbol{x}_{ i})\prod _{I=1}^{N_{n} }\psi _{A(I)}^{+}(\boldsymbol{X}_{ I})\boldsymbol{\vert }0\boldsymbol{\rangle } \\ & & \times \langle \boldsymbol{x}_{1},\ldots \boldsymbol{x}_{N_{e}}\vert \boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\rangle. {}\end{array}$

(19.4)

Here $\psi _{e}^{+}(\boldsymbol{x}_{i})$ is an electronic creation operator and $\psi _{A(I)}^{+}(\boldsymbol{X}_{I})$ is a creation operator for a nucleus of species $A(I)$ at the location $\boldsymbol{X}_{I}$. The set of quantum numbers $\boldsymbol{n}$ specifies the state (including the energy level), and the notation $\vert \boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\rangle$ indicates that the electronic state also depends on the location of the nuclei.

The equation of motion for the electronic states (19.4) with the Hamiltonian (19.3) then follows as in Section 17.6, except that here we use a time-independent Schrödinger equation. The equation

$\displaystyle{E_{e,\boldsymbol{n}}(\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}})\boldsymbol{\vert }\boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\boldsymbol{\rangle } = H_{e}\boldsymbol{\vert }\boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\boldsymbol{\rangle }}$

yields with the short hand notation $\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle \equiv \langle \boldsymbol{ x}_{1},\ldots \boldsymbol{x}_{N_{e}}\vert \boldsymbol{n};\boldsymbol{X}_{1},\ldots \boldsymbol{X}_{N_{n}}\rangle$ the equation

$\displaystyle\begin{array}{rcl} & & \!\!\!\!E_{e,\boldsymbol{n}}(\boldsymbol{X})\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle = -\, \frac{\hbar ^{2}} {2m_{e}}\sum _{i} \frac{\partial ^{2}} {\partial \boldsymbol{x}_{i}^{2}}\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle + \frac{e^{2}} {4\pi \epsilon _{0}}\! \\ & & \times \left (\sum _{i<j} \frac{1} {\vert \boldsymbol{x}_{i} -\boldsymbol{ x}_{j}\vert }-\sum _{i,I} \frac{Z_{A(I)}} {\vert \boldsymbol{x}_{i} -\boldsymbol{ X}_{I}\vert } +\sum _{I<J}\frac{Z_{A(I)}Z_{A(J)}} {\vert \boldsymbol{X}_{I} -\boldsymbol{ X}_{J}\vert }\right )\!\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle.{}\end{array}$

(19.5)

The $N_{e}$-electron wave functions $\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle$ are complete in the $3N_{e}$-dimensional configuration space of the electrons, and therefore the wave functions of the full $(N_{e} + N_{n})$-particle problem can be expanded in the form

$\displaystyle{ \langle \boldsymbol{x},\boldsymbol{X}\vert E\rangle =\sum _{\boldsymbol{n}}c(\boldsymbol{n};\boldsymbol{X})\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle. }$

(19.6)

The sum over the quantum numbers $\boldsymbol{n}$ also involves at least one integration over a continuous quantum number for the scattering states.

On the level of the second quantized theory, the amplitude (19.6) corresponds to the $(N_{e} + N_{n})$ -particle state

$\displaystyle\begin{array}{rcl} \boldsymbol{\vert }E\boldsymbol{\rangle }& =& \prod _{i=1}^{N_{e} }\int \!d^{3}\boldsymbol{x}_{ i}\,\psi _{e}^{+}(\boldsymbol{x}_{ i})\prod _{I=1}^{N_{n} }\int \!d^{3}\boldsymbol{X}_{ I}\,\psi _{A(I)}^{+}(\boldsymbol{X}_{ I})\boldsymbol{\vert }0\boldsymbol{\rangle }\langle \boldsymbol{x},\boldsymbol{X}\vert E\rangle {}\\ & =& \prod _{I=1}^{N_{n} }\int \!d^{3}\boldsymbol{X}_{ I}\,\sum _{\boldsymbol{n}}c(\boldsymbol{n};\boldsymbol{X})\boldsymbol{\vert }\boldsymbol{n};\boldsymbol{X}\boldsymbol{\rangle }, {}\\ \end{array}$

where the parameter-dependent electronic state $\boldsymbol{\vert }\boldsymbol{n};\boldsymbol{X}\boldsymbol{\rangle }$ is given in (19.4).

Substituting (19.6) into the full $(N_{e} + N_{n})$ -particle Schrödinger equation

$\displaystyle{H\boldsymbol{\vert }E\boldsymbol{\rangle } = E\boldsymbol{\vert }E\boldsymbol{\rangle }}$

yields the equation

$\displaystyle{ \sum _{\boldsymbol{n}}\left (\sum _{I=1}^{N_{n} } \frac{\hbar ^{2}} {2M_{A(I)}} \frac{\partial ^{2}} {\partial \boldsymbol{X}_{I}^{2}} - E_{e,\boldsymbol{n}}(\boldsymbol{X}) + E\right )c(\boldsymbol{n};\boldsymbol{X})\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle = 0. }$

(19.7)

This can be resolved into a set of coupled equations for the nuclear factors $c(\boldsymbol{n};\boldsymbol{X})$ through orthogonality of the electron factors $\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}\rangle$. If this is done, no approximation has been made so far in solving the molecular Hamiltonian (19.2). However, if we are in the center of mass frame of the nuclei, and if both rotational and vibrational excitations are small, we can neglect the nuclear kinetic terms, and we find for these nuclear configurations $\boldsymbol{X}^{(0)}$ that their energy levels can be approximated by

$\displaystyle{ E = E_{e,\boldsymbol{n}}(\boldsymbol{X}^{(0)}). }$

(19.8)

The corresponding full molecular eigenstate in this approximation has a wave function

$\displaystyle{ \langle \boldsymbol{x},\boldsymbol{X}\vert E_{e,\boldsymbol{n}}(\boldsymbol{X}^{(0)})\rangle =\delta (\boldsymbol{X} -\boldsymbol{ X}^{(0)})\langle \boldsymbol{x}\vert \boldsymbol{n};\boldsymbol{X}^{(0)}\rangle, }$

(19.9)

and a corresponding second quantized state

$\displaystyle\begin{array}{rcl} \boldsymbol{\vert }E_{e,\boldsymbol{n}}(\boldsymbol{X}^{(0)})\boldsymbol{\rangle }& =& \prod _{ i=1}^{N_{e} }\int \!d^{3}\boldsymbol{x}_{ i}\,\psi _{e}^{+}(\boldsymbol{x}_{ i})\prod _{I=1}^{N_{n} }\int \!d^{3}\boldsymbol{X}_{ I}\,\psi _{A(I)}^{+}(\boldsymbol{X}_{ I})\boldsymbol{\vert }0\boldsymbol{\rangle } {}\\ & & \times \langle \boldsymbol{x},\boldsymbol{X}\vert E_{e,\boldsymbol{n}}(\boldsymbol{X}^{(0)})\rangle =\boldsymbol{ \vert }\boldsymbol{n};\boldsymbol{X}^{(0)}\boldsymbol{\rangle }. {}\\ \end{array}$

It might be tempting to conclude from (19.8) that the solution of the electronic equation (19.5) eventually allows us to calculate the nuclear equilibrium configuration $\boldsymbol{X}^{(0)}$ in the aftermath from a requirement $[\partial E_{e,\boldsymbol{n}}(\boldsymbol{X})/\partial \boldsymbol{X}]_{\boldsymbol{X}=\boldsymbol{X}^{(0)}} = 0$ . However, this is not true: The energy level $E_{e,\boldsymbol{n}}(\boldsymbol{X})$ for a general nuclear configuration $\boldsymbol{X}$ represents only the electronic energy plus the electrostatic nuclear potential energy for that configuration. Equation (19.8) only states that within the Born-Oppenheimer approximation, the energy $E_{e,\boldsymbol{n}}(\boldsymbol{X})$ and the full molecular energy coincide in an equilibrium configuration, but that does not imply that the two energies coincide in a neighborhood of an equilibrium configuration. As a consequence the energy $E_{e,\boldsymbol{n}}(\boldsymbol{X})$ and the full molecular energy can (and generically will) have different gradients with respect to the nuclear configuration, even in a molecular equilibrium configuration. The function $E_{e,\boldsymbol{n}}(\boldsymbol{X})$ may have non-vanishing gradient in the molecular equilibrium configuration because it neglects the contributions from nuclear kinetic terms.

Therefore we have to use a priori knowledge of the equilibrium configuration $\boldsymbol{X}^{(0)}$ , e.g. from scattering experiments, to calculate the molecular energy in the Born-Oppenheimer approximation. We cannot calculate both the energy and the equilibrium configuration from (19.5).

19.2 Covalent bonding: The dihydrogen cation

The stability of molecules is an issue in classical physics in the same sense as the stability of atoms is an issue. It is not surprising that sharing of electrons yields a net attractive force between positively charged nuclei or atomic cores. Consider e.g. two protons at separation b with an electron right in the middle between the protons. The net classical electrostatic energy of the system $\propto -3e^{2}/b$ is negative, i.e. the configuration is bound, but the problem is again to prevent collapse of the system. The corresponding quantum mechanical system is again stabilized by wave-particle duality. Squeezing the particles very tightly together implies strongly peaked wave functions, hence too much curvature in the wave functions, and the ensuing increase in kinetic energy eventually cannot be compensated any more by gains in potential energy terms for normalizable wave functions.
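The factor $-3$ in the classical estimate is just the sum of the three pair energies. The following minimal Python sketch reproduces it in units where $e^{2}/(4\pi \epsilon _{0}) = 1$; the function name `coulomb_energy` and the choice b = 1 are illustrative assumptions, not part of the text.

```python
import itertools
import math

def coulomb_energy(charges, positions):
    """Sum of pair energies q_i q_j / |r_i - r_j|, in units e^2/(4 pi eps0) = 1."""
    total = 0.0
    for (qa, ra), (qb, rb) in itertools.combinations(zip(charges, positions), 2):
        total += qa * qb / math.dist(ra, rb)
    return total

b = 1.0  # proton separation in arbitrary length units (assumption)
charges = [+1, +1, -1]  # two protons and one electron
positions = [(0.0, 0.0, -b / 2), (0.0, 0.0, b / 2), (0.0, 0.0, 0.0)]

# Energy expressed in units of e^2/(4 pi eps0 b)
energy_in_units_of_e2_over_b = coulomb_energy(charges, positions) * b
print(energy_in_units_of_e2_over_b)  # -3.0
```

The proton-proton repulsion contributes $+1$ and each electron-proton pair at distance b∕2 contributes $-2$.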

We apply the basic tenet of the Born-Oppenheimer approximation to the hydrogen molecule ion H₂⁺ and determine approximate molecular orbitals under the assumption that the two protons are fixed at their equilibrium separation b. The distances of the electron from the two protons are given by

$\displaystyle{ r_{\pm }^{2} = x^{2} + y^{2} + (z \pm (b/2))^{2} }$

(19.10)

if we assume that the two protons are located on the z axis at z = ±b∕2. A suitable set of coordinates for the 2-center Coulomb problem is given by

$\displaystyle\begin{array}{rcl} \xi ^{+} = r_{ +} + r_{-},\quad b \leq \xi ^{+},& & {}\\ \xi ^{-} = r_{ +} - r_{-},\quad -\, b \leq \xi ^{-}\leq b& & {}\\ \end{array}$

and the azimuthal angle $\varphi$ around the z axis. These coordinates are known as prolate spheroidal coordinates. They seem to have been used for the analysis of classical 2-center gravitational or electrostatic problems and for acoustic and electromagnetic radiation problems since the 19th century.

The surfaces $\xi ^{+} = const.$ are ellipsoids with the protons in the focal points, while the surfaces $\xi ^{-} = const.$ are the corresponding hyperboloids. The $\xi ^{-}$ coordinate lines take us from one hyperboloid $\xi ^{-} = const.$ to another hyperboloid $\xi ^{-} = const.$ for constant $\xi ^{+}$ and $\varphi$ . For given value of $\xi ^{+}$ , going from $\xi ^{-} = -b$ to $\xi ^{-} = b$ takes us from the south pole of the ellipsoid $\xi ^{+} = const.$ to its north pole, i.e. $\xi ^{-}/b$ is similar to the $\vartheta$ coordinate on a sphere, except that we move from negative z to positive z for increasing $\xi ^{-}$ . The advantage of this is that z > 0 corresponds to $\xi ^{-}> 0$ , but the right handed prolate spheroidal coordinate system is then $\{\xi ^{-},\xi ^{+},\varphi \}$ .

The $\xi ^{+}$ coordinate lines are hyperbolas $\xi ^{-} = const.$ , $\varphi = const.$ with the protons in the focal points. $\xi ^{+} = b$ corresponds to the line − b∕2 ≤ z ≤ b∕2 on the z axis and $\xi ^{+} \rightarrow \infty$ takes us to infinite distance from the protons, i.e. $\xi ^{+}$ plays a role similar to the radius r in spherical coordinates.

We apply the methods of Section 5.4 to determine tangent vectors to the coordinate lines and the relevant differential operators. We have

$\displaystyle{2r^{2} + \frac{b^{2}} {2} = r_{+}^{2} + r_{ -}^{2} = \frac{1} {2}(\xi ^{+})^{2} + \frac{1} {2}(\xi ^{-})^{2}}$

and

$\displaystyle{z = \frac{\xi ^{+}\xi ^{-}} {2b},}$

and this implies also

$\displaystyle\begin{array}{rcl} & & x^{2} + y^{2} = \frac{b^{2}(\xi ^{+})^{2} + b^{2}(\xi ^{-})^{2} - (\xi ^{+}\xi ^{-})^{2} - b^{4}} {4b^{2}} = \frac{[(\xi ^{+})^{2} - b^{2}][b^{2} - (\xi ^{-})^{2}]} {4b^{2}}, {}\\ & & x = \frac{1} {2b}\sqrt{[(\xi ^{+ } )^{2 } - b^{2 } ][b^{2 } - (\xi ^{- } )^{2 } ]}\cos \varphi, {}\\ & & y = \frac{1} {2b}\sqrt{[(\xi ^{+ } )^{2 } - b^{2 } ][b^{2 } - (\xi ^{- } )^{2 } ]}\sin \varphi. {}\\ \end{array}$
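As a sanity check of the relations between Cartesian and prolate spheroidal coordinates, the following Python sketch maps a point to $(\xi ^{+},\xi ^{-},\varphi )$ and back. The function names and the sample point are hypothetical choices for illustration.

```python
import math

def to_spheroidal(x, y, z, b):
    """Cartesian -> (xi_plus, xi_minus, phi); r_plus is measured from z = -b/2."""
    r_plus = math.sqrt(x * x + y * y + (z + b / 2) ** 2)
    r_minus = math.sqrt(x * x + y * y + (z - b / 2) ** 2)
    return r_plus + r_minus, r_plus - r_minus, math.atan2(y, x)

def to_cartesian(xi_plus, xi_minus, phi, b):
    """Inverse map, using z = xi_plus*xi_minus/(2b) and the formula for x^2 + y^2."""
    product = (xi_plus**2 - b**2) * (b**2 - xi_minus**2)
    rho = math.sqrt(max(0.0, product)) / (2 * b)  # guard against rounding below zero
    return rho * math.cos(phi), rho * math.sin(phi), xi_plus * xi_minus / (2 * b)

b = 2.0                      # proton separation (arbitrary units, assumption)
point = (0.3, -0.7, 1.1)     # sample point (assumption)
xi_plus, xi_minus, phi = to_spheroidal(*point, b)
round_trip = to_cartesian(xi_plus, xi_minus, phi, b)
print(xi_plus, xi_minus, phi)
print(round_trip)
```

The sample point has z > 0, so the sketch also illustrates that positive z corresponds to positive $\xi ^{-}$.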

The dual basis vectors (5.21) are in the present case

$\displaystyle{\boldsymbol{\nabla }\xi ^{+} = \frac{1} {2r_{+}r_{-}}\left (2\xi ^{+}\boldsymbol{r} - b\xi ^{-}\boldsymbol{e}_{ z}\right ),\quad \boldsymbol{\nabla }\xi ^{-} = -\, \frac{1} {2r_{+}r_{-}}\left (2\xi ^{-}\boldsymbol{r} - b\xi ^{+}\boldsymbol{e}_{ z}\right ),}$

and

$\displaystyle{\boldsymbol{\nabla }\varphi = \frac{x\boldsymbol{e}_{y} - y\boldsymbol{e}_{x}} {x^{2} + y^{2}}.}$

This yields a diagonal inverse metric with components

$\displaystyle\begin{array}{rcl} & & g^{++} = 4 \frac{(\xi ^{+})^{2} - b^{2}} {(\xi ^{+})^{2} - (\xi ^{-})^{2}},\quad g^{--} = 4 \frac{b^{2} - (\xi ^{-})^{2}} {(\xi ^{+})^{2} - (\xi ^{-})^{2}}, {}\\ & & g^{\varphi \varphi } = \frac{4b^{2}} {[(\xi ^{+})^{2} - b^{2}][b^{2} - (\xi ^{-})^{2}]}, {}\\ \end{array}$

and the volume measure (5.27) for $d\xi ^{-}d\xi ^{+}d\varphi$ follows as

$\displaystyle{ \sqrt{g} = \left (g^{++}g^{--}g^{\varphi \varphi }\right )^{-1/2} = \frac{1} {8b}[(\xi ^{+})^{2} - (\xi ^{-})^{2}]. }$

(19.11)
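One way to test the volume measure (19.11) is to integrate it over the interior of an ellipsoid $\xi ^{+} = c$ and compare with the elementary volume formula: the ellipsoid has semi-major axis c∕2 and squared semi-minor axis $(c^{2} - b^{2})/4$. The Python sketch below does this with a simple midpoint rule; the function names and the values of b and c are assumptions for illustration.

```python
import math

def volume_from_measure(c, b, n=400):
    """Midpoint-rule integral of sqrt(g) over b <= xi_plus <= c, |xi_minus| <= b, 0 <= phi < 2 pi."""
    h_plus = (c - b) / n
    h_minus = 2 * b / n
    total = 0.0
    for i in range(n):
        xi_plus = b + (i + 0.5) * h_plus
        for j in range(n):
            xi_minus = -b + (j + 0.5) * h_minus
            total += (xi_plus**2 - xi_minus**2) / (8 * b)  # sqrt(g) from (19.11)
    return 2 * math.pi * total * h_plus * h_minus

def ellipsoid_volume(c, b):
    """Volume of the prolate ellipsoid r_plus + r_minus = c with foci at z = +-b/2."""
    semi_major = c / 2
    semi_minor_squared = (c**2 - b**2) / 4
    return 4 * math.pi / 3 * semi_major * semi_minor_squared

b, c = 2.0, 3.0  # sample values (assumption)
numeric = volume_from_measure(c, b)
exact = ellipsoid_volume(c, b)
print(numeric, exact)
```

Both numbers should agree up to the discretization error of the midpoint rule.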

The Laplace operator (5.26) in spheroidal coordinates is therefore

$\displaystyle\begin{array}{rcl} \Delta & =& \frac{4} {(\xi ^{+})^{2} - (\xi ^{-})^{2}}\left [\partial _{+}\left ((\xi ^{+})^{2} - b^{2}\right )\partial _{ +} + \partial _{-}\left (b^{2} - (\xi ^{-})^{2}\right )\partial _{ -}\right ] \\ & & +\, \frac{4b^{2}} {[(\xi ^{+})^{2} - b^{2}][b^{2} - (\xi ^{-})^{2}]}\partial _{\varphi }^{2}. {}\end{array}$

(19.12)

On the other hand, the coordinate dependence of the electrostatic potential of the electron is

$\displaystyle{ \frac{1} {r_{+}} + \frac{1} {r_{-}} = \frac{4\xi ^{+}} {(\xi ^{+})^{2} - (\xi ^{-})^{2}},}$

and therefore the Hamiltonian in the $\{\xi ^{+},\xi ^{-},\varphi \}$ representation satisfies

$\displaystyle\begin{array}{rcl} & & \!\!\!\! \frac{m_{e}} {2\hbar ^{2}}[(\xi ^{-})^{2} - (\xi ^{+})^{2}]H = \partial _{ +}\left ((\xi ^{+})^{2} - b^{2}\right )\partial _{ +} + \partial _{-}\left (b^{2} - (\xi ^{-})^{2}\right )\partial _{ -} \\ & & +\left ( \frac{b^{2}} {(\xi ^{+})^{2} - b^{2}} + \frac{b^{2}} {b^{2} - (\xi ^{-})^{2}}\right )\partial _{\varphi }^{2} + \frac{m_{e}e^{2}} {2\pi \epsilon _{0}\hbar ^{2}} \xi ^{+}. {}\end{array}$

(19.13)
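The Coulomb term in (19.13) rests on the identity $1/r_{+} + 1/r_{-} = 4\xi ^{+}/[(\xi ^{+})^{2} - (\xi ^{-})^{2}]$. A short Python sketch can confirm it at randomly chosen points; the sampling box, the seed and the function names are arbitrary assumptions.

```python
import math
import random

def distances(x, y, z, b):
    """Distances r_plus, r_minus from the protons at z = -b/2 and z = +b/2."""
    r_plus = math.sqrt(x * x + y * y + (z + b / 2) ** 2)
    r_minus = math.sqrt(x * x + y * y + (z - b / 2) ** 2)
    return r_plus, r_minus

def potential_spheroidal(xi_plus, xi_minus):
    """Right hand side of the identity, in terms of spheroidal coordinates."""
    return 4 * xi_plus / (xi_plus**2 - xi_minus**2)

random.seed(1)
b = 2.0
max_relative_deviation = 0.0
for _ in range(1000):
    x, y, z = (random.uniform(-3.0, 3.0) for _ in range(3))
    r_plus, r_minus = distances(x, y, z, b)
    direct = 1 / r_plus + 1 / r_minus
    via_xi = potential_spheroidal(r_plus + r_minus, r_plus - r_minus)
    max_relative_deviation = max(max_relative_deviation, abs(direct - via_xi) / direct)
print(max_relative_deviation)
```

The deviation should be at the level of floating point rounding.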

The Hamiltonian H commutes with the azimuthal angular momentum operator $L_{z}$, and therefore we can discuss the spectrum and eigenfunctions of H within the subspaces of $L_{z}$ eigenvalues $m\hbar$,

$\displaystyle{\psi _{m}(\xi ^{+},\xi ^{-},\varphi ) = \frac{1} {\sqrt{2\pi }}\psi (\xi ^{+},\xi ^{-})\exp (\mathrm{i}m\varphi ).}$

Within these subspaces, the normalization condition on the bound electron states becomes with (19.11),

$\displaystyle{ \int _{b}^{\infty }\!d\xi ^{+}\int _{ -b}^{b}\!d\xi ^{-}\,[(\xi ^{+})^{2} - (\xi ^{-})^{2}]\left \vert \psi (\xi ^{+},\xi ^{-})\right \vert ^{2} = 8b, }$

(19.14)

and the Hamiltonian $H_{m}$ acting within these subspaces satisfies

$\displaystyle\begin{array}{rcl} & & \frac{m_{e}} {2\hbar ^{2}}[(\xi ^{-})^{2} - (\xi ^{+})^{2}](H_{ m} - E) = D_{+,m}(\xi ^{+}) - D_{ -,m}(\xi ^{-}), {}\\ & & D_{+,m}(\xi ^{+}) = \partial _{ +}\left ((\xi ^{+})^{2} - b^{2}\right )\partial _{ +} - \frac{m^{2}b^{2}} {(\xi ^{+})^{2} - b^{2}} + \frac{m_{e}} {2\hbar ^{2}}E(\xi ^{+})^{2} + \frac{m_{e}e^{2}} {2\pi \epsilon _{0}\hbar ^{2}} \xi ^{+}, {}\\ & & D_{-,m}(\xi ^{-}) = \partial _{ -}\left ((\xi ^{-})^{2} - b^{2}\right )\partial _{ -}- \frac{m^{2}b^{2}} {(\xi ^{-})^{2} - b^{2}} + \frac{m_{e}} {2\hbar ^{2}}E(\xi ^{-})^{2}. {}\\ \end{array}$

Here the energy E differs from the energy $E_{e}$ (19.8) of the molecule in the Born-Oppenheimer approximation by the electrostatic energy of the nuclei,

$\displaystyle{ E_{e} = E + \frac{e^{2}} {4\pi \epsilon _{0}b}. }$

(19.15)
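To get a feeling for the size of the nuclear term in (19.15), the following Python sketch evaluates $e^{2}/(4\pi \epsilon _{0}b)$ in eV. The constants are CODATA values; the function names and the sample separation of 1.06 Å (roughly the measured bond length of the dihydrogen cation) are assumptions that do not appear in the text.

```python
import math

ELEMENTARY_CHARGE = 1.602176634e-19     # C (exact SI value)
VACUUM_PERMITTIVITY = 8.8541878128e-12  # F/m (CODATA 2018)

def nuclear_repulsion_ev(b_angstrom):
    """Electrostatic energy e^2/(4 pi eps0 b) of two protons, in eV."""
    b_meter = b_angstrom * 1e-10
    # Dividing the energy in joule by e gives the value in eV.
    return ELEMENTARY_CHARGE / (4 * math.pi * VACUUM_PERMITTIVITY * b_meter)

def molecular_energy_ev(electronic_energy_ev, b_angstrom):
    """E_e = E + e^2/(4 pi eps0 b), equation (19.15)."""
    return electronic_energy_ev + nuclear_repulsion_ev(b_angstrom)

sample_separation = 1.06  # angstrom, assumed value for illustration
print(nuclear_repulsion_ev(sample_separation))
```

The repulsion is of the same order as atomic binding energies, so it cannot be neglected when comparing E and $E_{e}$.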

Since $H_{m}$ is hermitian with respect to the scalar product appearing in (19.14), the differential operators $D_{+,m}$ and $D_{-,m}$ must be hermitian with respect to the scalar products

$\displaystyle{\langle \psi _{+}\vert \phi _{+}\rangle _{+} =\int _{ b}^{\infty }\!d\xi ^{+}\,\psi _{ +}^{+}(\xi ^{+})\phi _{ +}(\xi ^{+})}$

and

$\displaystyle{\langle \psi _{-}\vert \phi _{-}\rangle _{-} =\int _{ -b}^{b}\!d\xi ^{-}\,\psi _{ -}^{+}(\xi ^{-})\phi _{ -}(\xi ^{-}),}$

respectively. The corresponding Sturm-Liouville type boundary conditions can be read off from the differential operators. We must certainly have

$\displaystyle{ \lim _{\xi ^{+}\rightarrow \infty }\psi _{+}(\xi ^{+}) = 0. }$

(19.16)

For azimuthal quantum numbers m ≠ 0 we must also require

$\displaystyle{ \lim _{\xi ^{+}\rightarrow b}\psi _{+}(\xi ^{+}) = 0,\quad \lim _{\xi ^{ -}\rightarrow \pm b}\psi _{-}(\xi ^{-}) = 0. }$

(19.17)

Note that $\xi ^{+} = b$ corresponds to the interval − b∕2 ≤ z ≤ b∕2 on the z axis, while $\xi ^{-} = -b$ and $\xi ^{-} = b$ correspond to the half-lines z ≤ −b∕2 and z ≥ b∕2 on the z axis, respectively. The boundary conditions (19.17) therefore imply that the wave functions

$\displaystyle{\psi _{m}(\xi ^{+},\xi ^{-},\varphi ) = \frac{1} {\sqrt{2\pi }}\psi _{+}(\xi ^{+})\psi _{ -}(\xi ^{-})\exp (\mathrm{i}m\varphi )}$

must vanish on the z axis if m ≠ 0, which apparently makes sense.

We certainly should not expect that the molecular orbitals with m = 0 vanish on the z axis, and the differential operators $D_{\pm,0}$ are actually hermitian on their respective domains without extra boundary conditions at $\xi ^{-} = \pm b$ or $\xi ^{+} = b$, except that the wave functions should remain finite at those points.

The point of this discourse about hermiticity of the operators $D_{\pm,m}$ is that as a consequence, separation of the electronic Schrödinger equation for the hydrogen molecule ion H₂⁺ in terms of prolate spheroidal coordinates will not only give us solutions, but a complete set of solutions in the form

$\displaystyle{ \psi _{m,\lambda }(\xi ^{+},\xi ^{-},\varphi ) = \frac{1} {\sqrt{2\pi }}\psi _{+,\lambda }(\xi ^{+})\psi _{ -,\lambda }(\xi ^{-})\exp (\mathrm{i}m\varphi ), }$

(19.18)

$\displaystyle{ D_{-,m}(\xi ^{-})\psi _{ -,\lambda }(\xi ^{-}) =\lambda \psi _{ -,\lambda }(\xi ^{-}),\quad -\, b \leq \xi ^{-}\leq b, }$

(19.19)

$\displaystyle{ D_{+,m}(\xi ^{+})\psi _{ +,\lambda }(\xi ^{+}) =\lambda \psi _{ +,\lambda }(\xi ^{+}),\quad \xi ^{+} \geq b. }$

(19.20)

Energy is a third quantum number which is treated as implicit in the notation for the states.

The equation (19.19) and the equation (19.20) for $e^{2} = 0$ are relevant for radiation problems and have been studied extensively, see [1] and references there. The solutions are known as angular spheroidal functions and radial spheroidal functions because of the angular and radial interpretation of the coordinates $\xi ^{-}$ and $\xi ^{+}$, respectively.

The $\xi ^{+} \rightarrow \infty$ limit of equation (19.20) immediately tells us that we can satisfy the boundary condition (19.16) only for negative energy,

$\displaystyle{ \frac{m_{e}} {2\hbar ^{2}}E = -\,\kappa ^{2},}$

and the asymptotic form of the solution should be

$\displaystyle{ \psi _{+,\lambda }(\xi ^{+}) = f_{ +,\lambda }(\xi ^{+})\exp (-\kappa \xi ^{+}),\quad \kappa> 0, }$

(19.21)

with $\lim _{\xi \rightarrow \infty }f_{+,\lambda }(\xi )\exp (-\kappa \xi ) = 0$ .

We wish to analyze in particular the sector m = 0, which should contain the ground state of the H₂⁺ ion. Equation (19.20) with m = 0 has the form

$\displaystyle{ \partial _{\xi }\left (\xi ^{2} - b^{2}\right )\partial _{\xi }\psi _{\lambda }(\xi ) -\kappa ^{2}\xi ^{2}\psi _{ \lambda }(\xi ) + \frac{2} {a_{e}}\xi \psi _{\lambda }(\xi ) =\lambda \psi _{\lambda }(\xi ), }$

(19.22)

where we substituted $\xi ^{+} \rightarrow \xi$, $\psi _{+} \rightarrow \psi$ because in the following it will be clear from presence or absence of the Coulomb term $\propto 1/a_{e}$ whether we are considering the radial or the angular spheroidal coordinates and wave functions.

The length parameter

$\displaystyle{a_{e} = \frac{4\pi \epsilon _{0}\hbar ^{2}} {m_{e}e^{2}} = \frac{\mu } {m_{e}}a}$

is closely related to the Bohr radius (7.62) of the hydrogen atom.
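The difference between $a_{e}$ and the Bohr radius a of hydrogen is only the reduced mass factor $\mu /m_{e}$. A small Python sketch with CODATA 2018 constants makes the numbers explicit; the variable names are assumptions chosen for this illustration.

```python
import math

HBAR = 1.054571817e-34                  # J s
ELECTRON_MASS = 9.1093837015e-31        # kg
PROTON_MASS = 1.67262192369e-27         # kg
ELEMENTARY_CHARGE = 1.602176634e-19     # C
VACUUM_PERMITTIVITY = 8.8541878128e-12  # F/m

# a_e = 4 pi eps0 hbar^2 / (m_e e^2)
a_e = 4 * math.pi * VACUUM_PERMITTIVITY * HBAR**2 / (ELECTRON_MASS * ELEMENTARY_CHARGE**2)

# Reduced mass of the electron-proton system and the relation a_e = (mu/m_e) a
reduced_mass = ELECTRON_MASS * PROTON_MASS / (ELECTRON_MASS + PROTON_MASS)
bohr_radius_hydrogen = a_e * ELECTRON_MASS / reduced_mass

print(a_e)                         # in meters, about 0.529 angstrom
print(bohr_radius_hydrogen / a_e)  # equals 1 + m_e/m_p
```

The two lengths differ by about five parts in ten thousand, which is why $a_{e}$ can be treated as "the" Bohr radius for the purposes of this section.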

Since our solution should remain finite at $\xi = b$ , we make an ansatz

$\displaystyle{ \psi _{+,\lambda }(\xi ) =\sum _{n\geq 0}c_{n}\left (\frac{\xi -b} {b} \right )^{n}\exp [\kappa (b-\xi )]. }$

(19.23)

Substitution into (19.22) yields a two-step recursion relation

$\displaystyle\begin{array}{rcl} 2(n + 1)^{2}c_{ n+1}& =& \left (\lambda +\kappa ^{2}b^{2} + 2\kappa b -\frac{2b} {a_{e}} + 4\kappa bn - n(n + 1)\right )c_{n} \\ & & +\,2b\left (\kappa n - \frac{1} {a_{e}}\right )c_{n-1}. {}\end{array}$

(19.24)

On the other hand, $\psi _{-,\lambda }(\xi ^{-})$ must satisfy the differential equation (19.22) without electrostatic term: $a_{e} \rightarrow \infty$ ,

$\displaystyle{ \partial _{\xi }\left (\xi ^{2} - b^{2}\right )\partial _{\xi }\psi _{\lambda }(\xi ) -\kappa ^{2}\xi ^{2}\psi _{ \lambda }(\xi ) =\lambda \psi _{\lambda }(\xi ), }$

(19.25)

and on the interval $-b \leq \xi \leq b$ . This equation allows for even and odd solutions under $\xi ^{-} \rightarrow -\xi ^{-}$ , and we expect the ground state solution to be even. Therefore we try an ansatz

$\displaystyle{ \psi _{-,\lambda }(\xi ) =\sum _{n\geq 0}d_{n}\left ( \frac{\xi } {b}\right )^{2n}, }$

(19.26)

where we can set e.g.

$\displaystyle{ d_{0} = 1 }$

(19.27)

because the product form $\psi _{+,\lambda }(\xi ^{+})\psi _{-,\lambda }(\xi ^{-})/\sqrt{2\pi }$ of the ground state implies a degeneracy between $d_{0}$ and the coefficient $c_{0}$ in the radial factor (19.23). The constant $c_{0}$ is then determined by the normalization condition (19.14).

Substitution of (19.26) into (19.25) yields the recursion relation

$\displaystyle{ 2(n + 1)(2n + 1)d_{n+1} = \left (4n^{2} + 2n-\lambda \right )d_{ n} -\kappa ^{2}b^{2}d_{ n-1}. }$

(19.28)
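A quick consistency check of the recursion (19.28) is the limit κ = 0, where the series should terminate for $\lambda = l(l + 1)$ with even l and reproduce the Legendre polynomials up to normalization. The following Python sketch iterates (19.28) with exact rational arithmetic; the function name `angular_coefficients` is a hypothetical helper, not something defined in the text.

```python
from fractions import Fraction

def angular_coefficients(lam, kappa_b_sq, n_max):
    """Coefficients d_0..d_{n_max} of the ansatz (19.26) from (19.28), with d_0 = 1."""
    d = [Fraction(1)]
    for n in range(n_max):
        previous = d[n - 1] if n > 0 else Fraction(0)  # d_{-1} = 0
        numerator = (4 * n * n + 2 * n - lam) * d[n] - kappa_b_sq * previous
        d.append(numerator / (2 * (n + 1) * (2 * n + 1)))
    return d

# kappa = 0, lambda = 2*3: expect 1 - 3 s^2, proportional to P_2(s) = (3 s^2 - 1)/2
legendre_2 = angular_coefficients(6, 0, 3)
# kappa = 0, lambda = 4*5: expect 1 - 10 s^2 + (35/3) s^4, proportional to P_4(s)
legendre_4 = angular_coefficients(20, 0, 3)
print(legendre_2)
print(legendre_4)
```

For generic λ and non-vanishing $\kappa ^{2}b^{2}$ the series does not terminate, which is why the allowed parameter pairs have to be determined separately.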

The expansions (19.23) and (19.26) are not the standard expansions. For the angular function (19.26) one rather uses an expansion in terms of Legendre polynomials $P_{n}(\xi /b)$ (or associated Legendre polynomials $P_{n}^{m}(\xi /b)$ for m ≠ 0), which are orthogonal polynomials in $-b \leq \xi \leq b$ and satisfy (19.25) or (19.19) for κ = 0 and $\lambda = n(n + 1)$ . For the polynomial factors in the radial function (19.23) one rather uses Laguerre polynomials $L_{n}(2\kappa (\xi -b))$ or $L_{n}^{m}(2\kappa (\xi -b))$ , because $L_{n}^{m}(2\kappa (\xi -b))\exp [-\kappa (\xi -b)]$ are complete orthogonal functions in $b \leq \xi \leq \infty$ . The corresponding two-step recursion relations for the coefficients in these expansions then follow from the differential equations and recursion relations of the orthogonal polynomials. However, for our purposes the simpler expansions (19.23) and (19.26) are sufficient for the illustration of basic solution techniques for the dihydrogen cation.

We cannot go ahead and simply solve the recursion relations (19.24) and (19.28) to some finite order to get approximate wave functions for the electron, because for generic values of $\lambda$ and $\kappa ^{2}b^{2}$ the resulting wave functions will not be regular and square integrable in the domains $-b \leq \xi ^{-}\leq b$ and $b \leq \xi ^{+} < \infty$. Therefore, one first has to determine which pairs of parameters $\lambda$ and $\kappa ^{2}b^{2}$ allow for regular and square integrable solutions.

A classical method for the approximate calculation of the allowed parameter pairs $\lambda$ and $\kappa ^{2}b^{2}$ in a two-step recursion relation like (19.28) uses the ratios $f_{n} = d_{n+1}/d_{n}$ with the initial condition from (19.28), $f_{0} = -\lambda /2$ . The recursion relation (19.28) can then be written as an upwards recursion $f_{n-1} \rightarrow f_{n}$ ,

$\displaystyle{ f_{n} = \frac{n} {n + 1} - \frac{\lambda } {2(n + 1)(2n + 1)} - \frac{\kappa ^{2}b^{2}} {2(n + 1)(2n + 1)f_{n-1}}, }$

(19.29)

or as a downwards recursion $f_{n+1} \rightarrow f_{n}$ ,

$\displaystyle{ f_{n} = \frac{\kappa ^{2}b^{2}} {2(n + 1)(2n + 3) -\lambda -2(n + 2)(2n + 3)f_{n+1}}. }$

(19.30)

The requirement of finite limits $\psi _{-,\lambda }(\pm b)$ of the angular wave function implies that the solution of (19.29, 19.30) should satisfy

$\displaystyle{\lim _{n\rightarrow \infty }f_{n} = 0.}$

One way to derive the resulting condition on $\lambda$ and $\kappa ^{2}b^{2}$ in approximate form is to use both relations (19.30) and (19.29) for $f_{n}$ with the approximation $f_{N} = 0$ for some $N \gg n$. Iteration of equation (19.30) in $N - n - 1$ steps yields a relation of the form $f_{n} = f_{n}^{(-)}(\lambda,\kappa ^{2}b^{2},f_{N}) \simeq f_{n}^{(-)}(\lambda,\kappa ^{2}b^{2},0)$, while on the other hand $f_{n}$ is also determined in n steps from equation (19.29) and $f_{0} = -\lambda /2$ to yield functions $f_{n} = f_{n}^{(+)}(\lambda,\kappa ^{2}b^{2})$. The condition

$\displaystyle{f_{n}^{(-)}(\lambda,\kappa ^{2}b^{2},0) = f_{ n}^{(+)}(\lambda,\kappa ^{2}b^{2})}$

then implicitly determines the relation between $\lambda$ and $\kappa ^{2}b^{2}$ .

Another way to derive the relation between $\lambda$ and $\kappa ^{2}b^{2}$ writes the recursion relation (19.28) as a matrix relation

$\displaystyle{\underline{F} \cdot \boldsymbol{ d} =\lambda \boldsymbol{ d}}$

with matrix elements

$\displaystyle{F_{n\geq 0,n'\geq 0} = (4n^{2} + 2n)\delta _{ n,n'} -\kappa ^{2}b^{2}\delta _{ n,n'+1} - 2(n + 1)(2n + 1)\delta _{n,n'-1}.}$

The condition

$\displaystyle{ \mathrm{det}(\underline{F} -\lambda \underline{ 1}) = 0 }$

(19.31)

is then truncated to an $(N + 1) \times (N + 1)$ submatrix $F_{0\leq n\leq N,\,0\leq n'\leq N}$ to yield a relation between $\lambda$ and $\kappa ^{2}b^{2}$ .
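The truncation can be sketched numerically as follows. This is a minimal illustration, assuming NumPy and using only the matrix elements $F_{n,n'}$ stated above; the function names `angular_matrix` and `separation_constants` are hypothetical and not part of the text. For $\kappa = 0$ the matrix is triangular and the eigenvalues reduce to $\lambda = 2n(2n + 1)$ , i.e. the Legendre values for even index mentioned earlier, which provides a simple check.

```python
import numpy as np


def angular_matrix(kappa_b_sq, size):
    """Truncated (size x size) matrix F with the elements quoted in the text."""
    f = np.zeros((size, size))
    for n in range(size):
        f[n, n] = 4 * n**2 + 2 * n                      # diagonal term
        if n >= 1:
            f[n, n - 1] = -kappa_b_sq                   # couples d_n to d_{n-1}
        if n + 1 < size:
            f[n, n + 1] = -2 * (n + 1) * (2 * n + 1)    # couples d_n to d_{n+1}
    return f


def separation_constants(kappa_b_sq, size):
    """Sorted eigenvalues lambda of the truncated problem F.d = lambda d."""
    eigenvalues = np.linalg.eigvals(angular_matrix(kappa_b_sq, size))
    return np.sort(eigenvalues.real)


# The lowest eigenvalue as a function of the cutoff shows how quickly
# the truncation stabilizes for a given value of kappa^2 b^2.
for size in (3, 6, 12):
    print(size, separation_constants(1.42**2, size)[0])
```

Each allowed $\lambda$ for given $\kappa ^{2}b^{2}$ appears as one eigenvalue of the truncated matrix, and comparing different cutoffs gives a rough indication of the truncation error.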

Once the relation between $\lambda$ and $\kappa ^{2}b^{2}$ is established, application of the same techniques to (19.24) implies a relation between the remaining parameter $\kappa ^{2}b^{2}$ and the parameter $b/a_{e}$ . Since $\kappa ^{2}b^{2} \propto -E$ , this relation determines the quantized energies of the even states (due to the even ansatz (19.26)), with m = 0.

Application of the same techniques with an odd ansatz for $\psi _{-,\lambda }(\xi ^{-})$ or to the equations with general m yields the approximate energy levels and wave functions of the electron in the dihydrogen cation with fixed centers. The matrix and determinant condition for equation (19.24) are

$\displaystyle\begin{array}{rcl} & & C_{n\geq 0,n'\geq 0} = \left (n(n + 1) + 2 \frac{b} {a_{e}} - 4n\kappa b - 2\kappa b -\kappa ^{2}b^{2}\right )\delta _{ n,n'} \\ & & \qquad \qquad \quad +\, 2b\left ( \frac{1} {a_{e}} - n\kappa \right )\delta _{n,n'+1} + 2(n + 1)^{2}\delta _{ n,n'-1}, \\ & & \mathrm{det}(\underline{C} -\lambda \underline{ 1}) = 0 {}\end{array}$

(19.32)

Using only 3 × 3 matrices F and C in the conditions (19.31) and (19.32) yields a ground state energy

$\displaystyle{E_{e} = \frac{e^{2}} {4\pi \epsilon _{0}b} -\frac{2\hbar ^{2}} {m_{e}} \kappa ^{2} = -14.2\,\mathrm{eV}}$

with eigenvalues $\lambda = -0.490$ and $\kappa b = 1.42$ for a bond length b = 105 pm. Using the equivalent of a 4 × 4 matrix F and a 6 × 6 matrix C in the expansions with Legendre and Laguerre polynomials, Aubert et al.⁸ found $E_{e} = -16.4$ eV with $\kappa b = 1.485$ for b = 2a. Either way, the ground state energy $E_{e}$ lies below the energy $E_{1} = -13.6$ eV of a hydrogen atom and a proton at large distance, i.e. sharing the electron stabilizes the dihydrogen cation in spite of the electrostatic repulsion of the protons. The actual dissociation energy $D = E_{1} - E_{e}$ of the dihydrogen cation is about 2.6 eV. The 3 × 3 approximation yields only D ≈ 0.6 eV, whereas the value of Aubert et al. corresponds to D ≈ 2.8 eV, i.e. the higher order approximation of the recursion relations is much better, as expected.
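The arithmetic behind these energies can be checked with a short script. This is only a sketch of the energy formula above; the constant names and the function `ground_state_energy_ev` are hypothetical, the constants are CODATA 2018 values, and the second call assumes that a can be identified with the Bohr radius to the accuracy needed here.

```python
import math

# Physical constants in SI units (CODATA 2018)
HBAR = 1.054571817e-34        # J s
M_E = 9.1093837015e-31        # kg
E_CHARGE = 1.602176634e-19    # C
EPS0 = 8.8541878128e-12       # F/m
BOHR = 0.529177210903e-10     # m, used here as a stand-in for a


def ground_state_energy_ev(kappa_b, b):
    """E_e = e^2/(4 pi eps0 b) - 2 hbar^2 kappa^2 / m_e, returned in eV."""
    kappa = kappa_b / b
    proton_repulsion = E_CHARGE**2 / (4 * math.pi * EPS0 * b)
    electronic_term = 2 * HBAR**2 * kappa**2 / M_E
    return (proton_repulsion - electronic_term) / E_CHARGE


E_HYDROGEN = -13.6  # eV, energy of a hydrogen atom plus a distant proton

for label, kappa_b, b in [("3 x 3 truncation", 1.42, 105e-12),
                          ("Aubert et al.", 1.485, 2 * BOHR)]:
    energy = ground_state_energy_ev(kappa_b, b)
    print(f"{label}: E_e = {energy:.1f} eV, D = {E_HYDROGEN - energy:.1f} eV")
```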

The coefficients which follow from the relations (19.24), (19.28), (19.27) and (19.14) for $\lambda = -0.490$ and $\kappa b = 1.42$ are

$\displaystyle\begin{array}{rcl} d_{0}& =& 1,\quad d_{1} = 0.2451,\quad d_{2} = -0.0357, \\ c_{0}& =& 1.869,\quad c_{1} = 0.3760,\quad c_{2} = -0.0712.{}\end{array}$

(19.33)

The resulting function $b^{3}\vert \psi (\xi ^{+},\xi ^{-})\vert ^{2}/2\pi$ along the symmetry axis of the cation is displayed in Figure 19.1. The abscissa u is related to the z coordinate from equation (19.10) through u = 2z∕b.

Fig. 19.1

The function $b^{3}\vert \psi (\xi ^{+},\xi ^{-})\vert ^{2}/2\pi$ for the approximate ground state (19.33) is displayed along the symmetry axis of the dihydrogen cation. The protons are located at u = ±1. The abscissa u = 2z∕b is $u =\xi ^{-}/b$ in the range − 1 < u < 1, where $\xi ^{+} = b$ . Outside of this range we have $u = -\xi ^{+}/b$ for u < −1 ( $\xi ^{-} = -b$ ) and $u =\xi ^{+}/b$ for u > 1 ( $\xi ^{-} = b$ )

This low order approximation already has all the characteristic features of the real ground state as confirmed by higher order approximations. The electronic wave functions fall off with a linear exponential for large values of the radial coordinate $\xi ^{+}$ , and a double peak appears at the locations of the two protons. However, higher order approximations yield lower energies with a correspondingly stronger exponential drop $\exp (-\kappa \xi ^{+})$ , $\kappa b > 1.42$. This implies that the values of $b^{3}\vert \psi (\xi ^{+},\xi ^{-})\vert ^{2}$ along the symmetry axis are actually underestimated in the approximation in Figure 19.1, and the cusps become more pronounced in higher order approximations.

Cusps are inevitable in many-particle wave functions for charged particles. Kato demonstrated that these wave functions have cusps for coalescence of any two charged particles⁹. Specifically, if $r_{12}$ is the separation between two particles with charges $Z_{1}e$ and $Z_{2}e$, and if the wave function does not vanish for $r_{12} \rightarrow 0$, the directional average of $\partial \psi /\partial r_{12}$ in the limit $r_{12} \rightarrow 0$ satisfies

$\displaystyle{\lim _{r_{12}\rightarrow 0}\frac{1} {4\pi }\int _{0}^{\pi }\!d\vartheta \int _{ 0}^{2\pi }\!d\varphi \,\sin \vartheta \frac{\partial \psi } {\partial r_{12}} =\gamma _{12}\psi \Big\vert _{r_{12}=0}.}$

The constant $\gamma _{12}$ is

$\displaystyle{\gamma _{12} = Z_{1}Z_{2}\alpha \frac{\mu _{12}c} {\hbar } = Z_{1}Z_{2} \frac{\mu _{12}} {m_{e}} \frac{1} {a_{e}},}$

where $\mu _{12} = m_{1}m_{2}/(m_{1} + m_{2})$ is the reduced mass of the charged particles. In particular, coalescence of two electrons or of electrons and protons corresponds to

$\displaystyle{\gamma _{e^{-}e^{-}} = \frac{1} {2a_{e}}\quad \mathrm{and}\quad \gamma _{e^{-}p^{+}} = -\frac{1} {a}.}$
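As a brief consistency check of the electron-proton value (a sketch, assuming the familiar hydrogen ground state $\psi \propto \exp (-r/a)$ in the relative coordinate $r = r_{12}$ , where the two expressions for $\gamma _{12}$ above imply $a = a_{e}m_{e}/\mu _{12}$ ), the wave function is isotropic, so the directional average is trivial and

$\displaystyle{\frac{\partial \psi } {\partial r} = -\frac{1} {a}\psi \quad \Rightarrow \quad \lim _{r\rightarrow 0}\frac{\partial \psi } {\partial r} = -\frac{1} {a}\psi \Big\vert _{r=0} =\gamma _{e^{-}p^{+}}\psi \Big\vert _{r=0}.}$

The cusp of the hydrogen ground state at the location of the proton therefore has exactly the slope required by the cusp condition.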

19.3 Bloch and Wannier operators

The use of second quantized Hamiltonians is ubiquitous in condensed matter physics, and in the following sections we will introduce very common and useful examples for this, viz. the Hubbard Hamiltonian for electron-electron interactions, phonons, and a basic Hamiltonian for electron-phonon coupling. We will motivate the model Hamiltonians from basic Schrödinger field theory or the classical Hamiltonian for lattice vibrations, respectively, and refer the reader to more specialized monographs for alternative derivations of these Hamiltonians.

However, before we embark on this journey, we should generalize the results from Sections 10.1, 10.2 and 10.3 to three dimensions and combine them with what we had learned in Chapter 17 about quantization and Schrödinger field operators.

The basic Schrödinger picture Hamiltonian for an electron gas has the form

$\displaystyle\begin{array}{rcl} H& =& \int \!d^{3}\boldsymbol{x}\int \!d^{3}\boldsymbol{x}'\sum _{\sigma,\sigma '}\psi _{\sigma }^{+}(\boldsymbol{x})\psi _{\sigma '}^{+}(\boldsymbol{x}') \frac{e^{2}} {8\pi \epsilon _{0}\left \vert \boldsymbol{x} -\boldsymbol{ x}'\right \vert }\psi _{\sigma '}(\boldsymbol{x}')\psi _{\sigma }(\boldsymbol{x}) \\ & & +\int \!d^{3}\boldsymbol{x}\sum _{\sigma } \frac{\hbar ^{2}} {2m}\boldsymbol{\nabla }\psi _{\sigma }^{+}(\boldsymbol{x}) \cdot \boldsymbol{\nabla }\psi _{\sigma }(\boldsymbol{x}) \\ & =& \int \!d^{3}\boldsymbol{k}\int \!d^{3}\boldsymbol{k}'\int \!d^{3}\boldsymbol{q}\sum _{\sigma,\sigma '}a_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})a_{\sigma '}^{+}(\boldsymbol{k}' -\boldsymbol{ q}) \frac{e^{2}} {16\pi ^{3}\epsilon _{0}\boldsymbol{q}^{2}}a_{\sigma '}(\boldsymbol{k}')a_{\sigma }(\boldsymbol{k}) \\ & & +\int \!d^{3}\boldsymbol{k}\sum _{\sigma }\frac{\hbar ^{2}\boldsymbol{k}^{2}} {2m} a_{\sigma }^{+}(\boldsymbol{k})a_{\sigma }(\boldsymbol{k}). {}\end{array}$

(19.34)

Suppose that this electron gas exists in a lattice with basis vectors $\boldsymbol{a}_{i}$ and dual basis vectors $\boldsymbol{a}^{i}$ (4.18). The lattice points are $\boldsymbol{\ell}= n^{i}\boldsymbol{a}_{i}$ with a triplet of integers $n^{i}$ . However, we can also use the basis $\boldsymbol{a}_{i}$ as a basis in $\mathbb{R}^{3}$ ,

$\displaystyle{\boldsymbol{x} = x^{i}\boldsymbol{e}_{ i} =\nu ^{i}\boldsymbol{a}_{ i},\quad \boldsymbol{\nabla } =\boldsymbol{ e}^{i} \frac{\partial } {\partial x^{i}} =\boldsymbol{ a}^{i} \frac{\partial } {\partial \nu ^{i}}.}$

Note that the coordinates $x^{i}$ and the lattice basis vectors $\boldsymbol{a}_{i}$ have the dimensions of length, while the dual basis vectors have dimension length⁻¹. The coordinates $\nu ^{i}$ are dimensionless.

A Brillouin zone $\mathcal{B}$ is a unit cell in the dual lattice stretched by a factor 2π and then shifted such that the center of the Brillouin zone is a dual lattice point,

$\displaystyle{ \boldsymbol{k} =\kappa _{i}\boldsymbol{a}^{i},\quad -\pi <\kappa _{ i} \leq \pi, }$

(19.35)

see also (10.10), where this notion was introduced for one-dimensional lattices.

The $\boldsymbol{k}$ vectors in a Brillouin zone have the following useful properties, which are easily derived from Fourier transformation on a one-dimensional lattice¹⁰ with the identification $\kappa _{i} \equiv \kappa _{i} + 2\pi$ ,

$\displaystyle{ \int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,\exp [\mathrm{i}\boldsymbol{k} \cdot (\boldsymbol{\ell}-\boldsymbol{\ell}')] = (2\pi )^{3}\tilde{V }\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'},\quad d^{3}\boldsymbol{k} =\tilde{ V }d\kappa _{ 1}d\kappa _{2}d\kappa _{3}, }$

(19.36)

$\displaystyle{ \sum _{\boldsymbol{\ell}}\exp [\mathrm{i}(\boldsymbol{k} -\boldsymbol{ k}')\cdot \boldsymbol{\ell}] = (2\pi )^{3}\tilde{V }\delta (\boldsymbol{k} -\boldsymbol{ k}'). }$

(19.37)

Recall that the volume of a unit cell $\tilde{V }$ in the dual lattice is related to the volume of a unit cell in the direct lattice through $\tilde{V } = 1/V$ , (4.19).

If a unit cell in the lattice contains N ions, electrons in the lattice will also experience a lattice potential

$\displaystyle{ H_{V }(\boldsymbol{x}) = -\sum _{\boldsymbol{\ell},A} \frac{n_{A}e^{2}} {4\pi \epsilon _{0}\left \vert \boldsymbol{x} -\boldsymbol{ r}_{\boldsymbol{\ell},A}\right \vert }, }$

(19.38)

where

$\displaystyle{\boldsymbol{r}_{\boldsymbol{\ell},A} =\boldsymbol{\ell} +\boldsymbol{r}_{A}\quad 1 \leq A \leq N,}$

enumerates the locations of the ions in the unit cell $\boldsymbol{\ell}= n^{i}\boldsymbol{a}_{i}$ , and $n_{A}e$ is the effective charge of the A-th ion. On the level of the quantized Schrödinger field theory, the potential (19.38) adds the operator

$\displaystyle{ H_{V } = -\sum _{\boldsymbol{\ell},A}\int \!d^{3}\boldsymbol{x}\sum _{\sigma }\psi _{\sigma }^{+}(\boldsymbol{x}) \frac{n_{A}e^{2}} {4\pi \epsilon _{0}\left \vert \boldsymbol{x} -\boldsymbol{ r}_{\boldsymbol{\ell},A}\right \vert }\psi _{\sigma }(\boldsymbol{x}), }$

(19.39)

to the Hamiltonian (19.34). We will focus on this potential term in the remainder of this section and neglect the electron-electron interaction term in (19.34). The corresponding first quantized Hamiltonian

$\displaystyle{H = \frac{\mathbf{p}^{2}} {2m} + H_{V }(\mathbf{x}),}$

is invariant under lattice translations,

$\displaystyle{\exp \!\left ( \frac{\mathrm{i}} {\hbar }\boldsymbol{\ell} \cdot \mathbf{p}\right )H\exp \!\left (-\frac{\mathrm{i}} {\hbar }\boldsymbol{\ell} \cdot \mathbf{p}\right ) = \frac{\mathbf{p}^{2}} {2m} + H_{V }(\mathbf{x}+\boldsymbol{\ell}) = H,}$

and therefore admits a complete set of Bloch type eigenstates, see (10.14) for the one-dimensional case. We can decompose the Schrödinger picture field operators $\psi _{\sigma }(\boldsymbol{x})$ in terms of a complete set of Bloch type eigenstates

$\displaystyle{ \psi _{\sigma }(\boldsymbol{x}) =\sum _{n}\sqrt{ \frac{V } {(2\pi )^{3}}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,a_{ n,\sigma }(\boldsymbol{k})\exp (\mathrm{i}\boldsymbol{k} \cdot \boldsymbol{ x})u_{n}(\boldsymbol{k},\boldsymbol{x}), }$

(19.40)

$\displaystyle{ a_{n,\sigma }(\boldsymbol{k}) = \sqrt{ \frac{V } {(2\pi )^{3}}}\int \!d^{3}\boldsymbol{x}\,\exp (-\mathrm{i}\boldsymbol{k} \cdot \boldsymbol{ x})u_{ n}^{+}(\boldsymbol{k},\boldsymbol{x})\psi _{\sigma }(\boldsymbol{x}), }$

(19.41)

with periodic Bloch factors

$\displaystyle{u_{n}(\boldsymbol{k},\boldsymbol{x}+\boldsymbol{\ell}) = u_{n}(\boldsymbol{k},\boldsymbol{x}).}$

We denote integration over the unit cell of the lattice with $\int _{V }\!d^{3}\boldsymbol{x}$ . Normalization of the Bloch energy eigenfunctions then yields

$\displaystyle\begin{array}{rcl} \delta _{mn}\delta (\boldsymbol{k} -\boldsymbol{ k}')& =& \frac{V } {(2\pi )^{3}}\int \!d^{3}\boldsymbol{x}\,\exp [\mathrm{i}(\boldsymbol{k} -\boldsymbol{ k}') \cdot \boldsymbol{ x}]u_{ m}^{+}(\boldsymbol{k}',\boldsymbol{x})u_{ n}(\boldsymbol{k},\boldsymbol{x}) \\ & =& \frac{V } {(2\pi )^{3}}\sum _{\boldsymbol{\ell}}\exp [\mathrm{i}(\boldsymbol{k} -\boldsymbol{ k}')\cdot \boldsymbol{\ell}] \\ & & \times \int _{V }\!d^{3}\boldsymbol{x}\,\exp [\mathrm{i}(\boldsymbol{k} -\boldsymbol{ k}') \cdot \boldsymbol{ x}]u_{ m}^{+}(\boldsymbol{k}',\boldsymbol{x})u_{ n}(\boldsymbol{k},\boldsymbol{x}),{}\end{array}$

(19.42)

and with (19.37) we find

$\displaystyle{\int _{V }\!d^{3}\boldsymbol{x}\,u_{ m}^{+}(\boldsymbol{k},\boldsymbol{x})u_{ n}(\boldsymbol{k},\boldsymbol{x}) =\delta _{mn}.}$

Equation (19.42) also implies with the canonical anticommutation relations for the Schrödinger field operators $\psi _{\sigma }(\boldsymbol{x})$ and $\psi _{\sigma }^{+}(\boldsymbol{x})$ that the operators $a_{n,\sigma }(\boldsymbol{k})$ satisfy the relations

$\displaystyle{\{a_{n,\sigma }(\boldsymbol{k}),a_{n',\sigma '}(\boldsymbol{k}')\} = 0\quad \{a_{n,\sigma }(\boldsymbol{k}),a_{n',\sigma '}^{+}(\boldsymbol{k}')\} =\delta _{ n,n'}\delta _{\sigma \sigma '}\delta (\boldsymbol{k} -\boldsymbol{ k}').}$

The second quantized state

$\displaystyle{\boldsymbol{\vert }n,\sigma,\boldsymbol{k}\boldsymbol{\rangle } = a_{n,\sigma }^{+}(\boldsymbol{k})\boldsymbol{\vert }0\boldsymbol{\rangle }}$

is therefore a state with an electron in the first quantized orbital Bloch state

$\displaystyle{ \phi _{n}(\boldsymbol{k},\boldsymbol{x}) = \sqrt{ \frac{V } {(2\pi )^{3}}}\exp (\mathrm{i}\boldsymbol{k} \cdot \boldsymbol{ x})u_{n}(\boldsymbol{k},\boldsymbol{x}) }$

(19.43)

and spin projection $\sigma$ . Equation (19.41) and the conjugate equation for $a_{n,\sigma }^{+}(\boldsymbol{k})$ are a special case of our general observations (17.59) and (17.58) how annihilation and creation operators for particles in specific states relate to the generic operators $\psi _{\sigma }(\boldsymbol{x})$ and $\psi _{\sigma }^{+}(\boldsymbol{x})$ .

Since the operators $a_{n,\sigma }(\boldsymbol{k})$ are restricted to the Brillouin zone, or equivalently are periodic in the rescaled dual lattice with the Brillouin zone as unit cell,

$\displaystyle{a_{n,\sigma }(\boldsymbol{k}) = a_{n,\sigma }(\boldsymbol{k} + 2\pi \tilde{\boldsymbol{\ell}}),\quad \tilde{\boldsymbol{\ell}} = n_{i}\boldsymbol{a}^{i},}$

we can expand them using equations (19.36, 19.37),

$\displaystyle{ a_{n,\sigma }^{+}(\boldsymbol{k}) = \sqrt{ \frac{V } {(2\pi )^{3}}}\sum _{\boldsymbol{\ell}}\psi _{n,\sigma }^{+}(\boldsymbol{\ell})\exp (\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}), }$

(19.44)

$\displaystyle{ \psi _{n,\sigma }^{+}(\boldsymbol{\ell}) = \sqrt{ \frac{V } {(2\pi )^{3}}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,a_{ n,\sigma }^{+}(\boldsymbol{k})\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}). }$

(19.45)

The operators $\psi _{n,\sigma }(\boldsymbol{\ell})$ in the direct lattice satisfy

$\displaystyle{\{\psi _{n,\sigma }(\boldsymbol{\ell}),\psi _{n',\sigma '}(\boldsymbol{\ell}')\} = 0,\quad \{\psi _{n,\sigma }(\boldsymbol{\ell}),\psi _{n',\sigma '}^{+}(\boldsymbol{\ell}')\} =\delta _{ n,n'}\delta _{\sigma \sigma '}\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'}.}$

Substitution of (19.41) into (19.45) yields

$\displaystyle{\psi _{n,\sigma }^{+}(\boldsymbol{\ell}) =\int \! d^{3}\boldsymbol{x}\,w_{ n}(\boldsymbol{\ell},\boldsymbol{x})\psi _{\sigma }^{+}(\boldsymbol{x})}$

with the Wannier states

$\displaystyle{ w_{n}(\boldsymbol{\ell},\boldsymbol{x}) = \frac{V } {(2\pi )^{3}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,u_{ n}(\boldsymbol{k},\boldsymbol{x})\exp [\mathrm{i}\boldsymbol{k} \cdot (\boldsymbol{x}-\boldsymbol{\ell})] = w_{n}(\boldsymbol{x}-\boldsymbol{\ell}). }$

(19.46)

These states satisfy the usual orthonormality and completeness relations as a consequence of the corresponding relations for the Bloch states $\phi _{n}(\boldsymbol{k},\boldsymbol{x})$ ,

$\displaystyle\begin{array}{rcl} \int \!d^{3}\boldsymbol{x}\,w_{ n}^{+}(\boldsymbol{\ell},\boldsymbol{x})w_{ n'}(\boldsymbol{\ell}',\boldsymbol{x}) =\delta _{n,n'}\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'},& & {}\\ \sum _{n,\boldsymbol{\ell}}w_{n}(\boldsymbol{\ell},\boldsymbol{x})w_{n}^{+}(\boldsymbol{\ell},\boldsymbol{x}') =\delta (\boldsymbol{x} -\boldsymbol{ x}').& & {}\\ \end{array}$

The operator $\psi _{n,\sigma }^{+}(\boldsymbol{\ell})$ therefore generates an electron with spin projection $\sigma$ in the Wannier state $w_{n}(\boldsymbol{\ell},\boldsymbol{x})$ .

We denote the operators $a_{n,\sigma }(\boldsymbol{k})$ and $a_{n,\sigma }^{+}(\boldsymbol{k})$ as Bloch operators, and the operators $\psi _{n,\sigma }(\boldsymbol{\ell})$ and $\psi _{n,\sigma }^{+}(\boldsymbol{\ell})$ as Wannier operators.

19.4 The Hubbard model

The Hubbard model treats electron-electron interactions in a tight binding approximation. Therefore we wish to use the creation operators $\psi _{n,\sigma }^{+}(\boldsymbol{\ell})$ for electrons in Wannier states.

The kinetic electron operator transforms into Wannier type operators according to

$\displaystyle\begin{array}{rcl} H_{0}& =& \int \!d^{3}\boldsymbol{x}\,\sum _{\sigma } \frac{\hbar ^{2}} {2m}\boldsymbol{\nabla }\psi _{\sigma }^{+}(\boldsymbol{x}) \cdot \boldsymbol{\nabla }\psi _{\sigma }(\boldsymbol{x}) \\ & =& \int \!d^{3}\boldsymbol{x}\sum _{\sigma,n,\boldsymbol{\ell},n',\boldsymbol{\ell}'}\psi _{n,\sigma }^{+}(\boldsymbol{\ell}) \frac{\hbar ^{2}} {2m}\boldsymbol{\nabla }w_{n}^{+}(\boldsymbol{\ell},\boldsymbol{x}) \cdot \boldsymbol{\nabla }w_{ n'}(\boldsymbol{\ell}',\boldsymbol{x})\psi _{n',\sigma }(\boldsymbol{\ell}').{}\end{array}$

(19.47)

This has the form of a hopping Hamiltonian for jumps $n',\boldsymbol{\ell}' \rightarrow n,\boldsymbol{\ell}$ ,

$\displaystyle{ H_{0} =\sum _{\sigma,n,\boldsymbol{\ell},n',\boldsymbol{\ell}'}t_{n,\boldsymbol{\ell},n',\boldsymbol{\ell}'}\psi _{n,\sigma }^{+}(\boldsymbol{\ell})\psi _{ n',\sigma }(\boldsymbol{\ell}') }$

(19.48)

with a hopping parameter

$\displaystyle{t_{n,\boldsymbol{\ell},n',\boldsymbol{\ell}'} =\int \! d^{3}\boldsymbol{x}\, \frac{\hbar ^{2}} {2m}\boldsymbol{\nabla }w_{n}^{+}(\boldsymbol{\ell},\boldsymbol{x}) \cdot \boldsymbol{\nabla }w_{ n'}(\boldsymbol{\ell}',\boldsymbol{x}).}$

On the other hand, the electron-electron interaction Hamiltonian becomes

$\displaystyle{H_{ee} = \frac{1} {2}\sum _{\sigma,\sigma ',m,\boldsymbol{l},m',\boldsymbol{l}',n,\boldsymbol{\ell},n',\boldsymbol{\ell}'}U_{m,\boldsymbol{l},m',\boldsymbol{l}',n',\boldsymbol{\ell}',n,\boldsymbol{\ell}}\psi _{m,\sigma }^{+}(\boldsymbol{l})\psi _{ m',\sigma '}^{+}(\boldsymbol{l}')\psi _{ n',\sigma '}(\boldsymbol{\ell}')\psi _{n,\sigma }(\boldsymbol{\ell})}$

with the Coulomb matrix element

$\displaystyle\begin{array}{rcl} U_{m,\boldsymbol{l},m',\boldsymbol{l}',n',\boldsymbol{\ell}',n,\boldsymbol{\ell}}& =& \int \!d^{3}\boldsymbol{x}\int \!d^{3}\boldsymbol{x}'\,w_{ m}^{+}(\boldsymbol{l},\boldsymbol{x})w_{ m'}^{+}(\boldsymbol{l}',\boldsymbol{x}') {}\\ & & \times \frac{e^{2}} {4\pi \epsilon _{0}\left \vert \boldsymbol{x} -\boldsymbol{ x}'\right \vert }w_{n'}(\boldsymbol{\ell}',\boldsymbol{x}')w_{n}(\boldsymbol{\ell},\boldsymbol{x}). {}\\ \end{array}$

$H_{ee}$ would certainly be dominated by terms on the same lattice site, and if we restrict the discussion to a single band index, the Hamiltonian $H = H_{0} + H_{ee}$ with electron-electron interaction assumes the simple form

$\displaystyle{ H =\sum _{\boldsymbol{\ell},\boldsymbol{\ell}',\sigma }t_{\boldsymbol{\ell},\boldsymbol{\ell}'}\psi _{\sigma }^{+}(\boldsymbol{\ell})\psi _{\sigma }(\boldsymbol{\ell}') + U\sum _{\boldsymbol{\ell}}n_{ \uparrow,\boldsymbol{\ell}}n_{\downarrow,\boldsymbol{\ell}}, }$

(19.49)

with the spin polarized occupation number operators for lattice site $\boldsymbol{\ell}$ ,

$\displaystyle{n_{\sigma,\boldsymbol{\ell}} =\psi _{ \sigma }^{+}(\boldsymbol{\ell})\psi _{\sigma }(\boldsymbol{\ell}).}$

The Hamiltonian (19.49) is known as the Hubbard Hamiltonian¹¹. This Hamiltonian was invented for the analysis of ferromagnetic behavior in transition metals, and soon became a very widely used model Hamiltonian in condensed matter theory not only for magnetic ordering, but also for the general investigation of electron correlations, conductivity properties and disorder effects in many different classes of materials¹². However, the Hubbard model also provides basic insight into the relevance of delocalized Bloch states versus localized Wannier states, as we will now discuss.

We assume that the hopping term is invariant under translation and symmetric between sites, i.e.

$\displaystyle{t_{\boldsymbol{\ell},\boldsymbol{\ell}'} = t_{\boldsymbol{\ell}-\boldsymbol{\ell}'} = t_{\Delta \boldsymbol{\ell}} = t_{-\Delta \boldsymbol{\ell}}.}$

If hopping is suppressed,

$\displaystyle{t_{\boldsymbol{\ell},\boldsymbol{\ell}'} = t\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'},}$

the Hamiltonian involves only the number operators $n_{\sigma,\boldsymbol{\ell}}$ ,

$\displaystyle{ H = t\sum _{\sigma,\boldsymbol{\ell}}n_{\sigma,\boldsymbol{\ell}} + U\sum _{\boldsymbol{\ell}}n_{\uparrow,\boldsymbol{\ell}}n_{\downarrow,\boldsymbol{\ell}}, }$

(19.50)

and the eigenstates and energy levels are given by $N = N_{1} + 2N_{2}$ particle states

$\displaystyle{\boldsymbol{\vert }\sigma _{1},\boldsymbol{\ell}_{1};\ldots \sigma _{N},\boldsymbol{\ell}_{N}\boldsymbol{\rangle } =\psi _{ \sigma _{1}}^{+}(\boldsymbol{\ell}_{ 1})\ldots \psi _{\sigma _{N}}^{+}(\boldsymbol{\ell}_{ N})\boldsymbol{\vert }0\boldsymbol{\rangle }}$

with energy

$\displaystyle{E(N_{1},N_{2}) = t(N_{1} + 2N_{2}) + UN_{2}.}$

Here $N_{1}$ and $N_{2}$ are the numbers of singly and doubly occupied lattice sites, respectively. This is also denoted as the atomic limit, since the electrons are fixed at the atoms and the total energy is a sum of atomic terms.
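The counting in the atomic limit can be illustrated with a few lines of code. This is a minimal sketch in which an occupation pattern is given as two hypothetical lists `n_up` and `n_down` with entries 0 or 1 per lattice site; the function name and the parameter values are illustrative only. It evaluates (19.50) site by site and compares with the closed form for the energy in terms of the numbers of singly and doubly occupied sites.

```python
def atomic_limit_energy(t, u, n_up, n_down):
    """Energy of an occupation-number eigenstate of the Hamiltonian (19.50).

    Returns the site-by-site sum, the closed form t(N1 + 2 N2) + U N2,
    and the numbers N1, N2 of singly and doubly occupied sites.
    """
    pairs = list(zip(n_up, n_down))
    n1 = sum(1 for up, down in pairs if up + down == 1)
    n2 = sum(1 for up, down in pairs if up + down == 2)
    direct = sum(t * (up + down) + u * up * down for up, down in pairs)
    closed_form = t * (n1 + 2 * n2) + u * n2
    return direct, closed_form, n1, n2


# Four sites: one singly occupied, one empty, two doubly occupied.
direct, closed_form, n1, n2 = atomic_limit_energy(
    t=-0.5, u=4.0, n_up=[1, 0, 1, 1], n_down=[0, 0, 1, 1])
print(n1, n2, direct, closed_form)
```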

On the other hand, if we can neglect the electron-electron interaction term, U = 0, we end up with a quadratic Hamiltonian

$\displaystyle{ H =\sum _{\boldsymbol{\ell},\Delta \boldsymbol{\ell},\sigma }t_{\Delta \boldsymbol{\ell}}\psi _{\sigma }^{+}(\boldsymbol{\ell}+\Delta \boldsymbol{\ell})\psi _{\sigma }(\boldsymbol{\ell}). }$

(19.51)

We can map the electron operators on lattice sites to electron operators (19.44) in the Brillouin zone,

$\displaystyle{ a_{\sigma }(\boldsymbol{k}) = \frac{\sqrt{V}} {\sqrt{2\pi }^{3}}\sum _{\boldsymbol{\ell}}\psi _{\sigma }(\boldsymbol{\ell})\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}), }$

(19.52)

This diagonalizes the Hamiltonian (19.51),

$\displaystyle{ H =\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,E(\boldsymbol{k})\sum _{\sigma }a_{\sigma }^{+}(\boldsymbol{k})a_{\sigma }(\boldsymbol{k}), }$

(19.53)

$\displaystyle{ E(\boldsymbol{k}) =\sum _{\Delta \boldsymbol{\ell}}t_{\Delta \boldsymbol{\ell}}\exp (-\mathrm{i}\boldsymbol{k} \cdot \Delta \boldsymbol{\ell}) =\sum _{\Delta \boldsymbol{\ell}}t_{\Delta \boldsymbol{\ell}}\cos (\boldsymbol{k} \cdot \Delta \boldsymbol{\ell}). }$

(19.54)

The single particle eigenstate of the Hamiltonian (19.53) with energy $E(\boldsymbol{k})$ ,

$\displaystyle{a_{\sigma }^{+}(\boldsymbol{k})\boldsymbol{\vert }0\boldsymbol{\rangle } = \frac{\sqrt{V}} {\sqrt{2\pi }^{3}}\sum _{\boldsymbol{\ell}}\psi _{\sigma }^{+}(\boldsymbol{\ell})\exp (\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell})\boldsymbol{\vert }0\boldsymbol{\rangle },}$

is a Bloch state, while the single particle eigenstate $\psi _{\sigma }^{+}(\boldsymbol{\ell})\boldsymbol{\vert }0\boldsymbol{\rangle }$ of the Hamiltonian (19.50) is a Wannier state. The magnitude of the hopping terms $t_{\Delta \boldsymbol{\ell}\neq \boldsymbol{0}}$ relative to U will therefore determine the importance of itinerant (or delocalized) Bloch electron states versus localized Wannier electron states in the lattice.
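The diagonalization for U = 0 can be checked numerically in a simplified setting. The following sketch assumes a finite one-dimensional ring with periodic boundary conditions, an on-site term and nearest-neighbor hopping only, so that the Brillouin zone integral is replaced by a sum over the discrete values $\kappa = 2\pi j/L$ ; these restrictions, the function names and the parameter values are assumptions made for illustration and are not part of the text.

```python
import numpy as np


def hopping_matrix(num_sites, t_onsite, t_neighbor):
    """Single-particle matrix of (19.51) on a ring with num_sites >= 3 sites."""
    h = np.zeros((num_sites, num_sites))
    for site in range(num_sites):
        neighbor = (site + 1) % num_sites    # periodic boundary conditions
        h[site, site] = t_onsite
        h[site, neighbor] = t_neighbor
        h[neighbor, site] = t_neighbor
    return h


def dispersion(kappa, t_onsite, t_neighbor):
    """E(k) from (19.54) with hopping distances 0 and +-1."""
    return t_onsite + 2.0 * t_neighbor * np.cos(kappa)


num_sites, t_onsite, t_neighbor = 8, 0.5, -1.0
numeric_levels = np.sort(
    np.linalg.eigvalsh(hopping_matrix(num_sites, t_onsite, t_neighbor)))
kappas = 2.0 * np.pi * np.arange(num_sites) / num_sites
analytic_levels = np.sort(dispersion(kappas, t_onsite, t_neighbor))
print(np.allclose(numeric_levels, analytic_levels))
```

The eigenvectors of the hopping matrix are the plane waves $\exp (\mathrm{i}\kappa \ell)$ on the ring, i.e. the discrete analogs of the Bloch states, whereas the site basis in which the matrix is written corresponds to the Wannier states.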

19.5 Vibrations in molecules and lattices

Another basic excitation of lattices concerns oscillations of lattice ions or atoms around their equilibrium configurations. This kind of excitation is particularly amenable to description in classical mechanical terms, but at the quantum level lattice vibrations are very similar to quantum excitations of the vacuum like electrons or photons. In particular, elementary lattice vibrations can be spontaneously created and absorbed like photons, and therefore require a quantum field theory which is similar to the field theory for photons.

We will discuss the classical theory of small oscillations of N-particle systems in the present section as a preparation for the discussion of quantized lattice vibrations in Section 19.6. We suspend summation convention in this section, because we often encounter expressions with three identical indices in a multiplicative term, and also terms like $m_{i}\ddot{x}_{i}^{j}$ without summation over the repeated index.

Normal coordinates and normal oscillations

We consider an N particle system with potential $V (\boldsymbol{r}_{1},\ldots \boldsymbol{r}_{N})$ . The equilibrium condition

$\displaystyle{ \nabla _{i}V (\boldsymbol{r}_{1},\ldots \boldsymbol{r}_{N})\Big\vert _{\boldsymbol{r}_{j}=\boldsymbol{r}_{j}^{(0)}} =\boldsymbol{ 0} }$

(19.55)

implies for the second order expansion around an equilibrium configuration $\boldsymbol{r}_{1}^{(0)},\ldots \boldsymbol{r}_{N}^{(0)}$ ,

$\displaystyle{V (\boldsymbol{r}_{1},\ldots \boldsymbol{r}_{N}) = V (\boldsymbol{r}_{1}^{(0)},\ldots \boldsymbol{r}_{ N}^{(0)}) + \frac{1} {2}\sum _{ijkl}V _{ik,jl}x_{i}^{k}x_{ j}^{l},}$

where $\boldsymbol{x}_{i} =\boldsymbol{ r}_{i} -\boldsymbol{ r}_{i}^{(0)}$ parametrize the deviations from equilibrium and the coefficients $V_{ik,jl}$ are

$\displaystyle{V _{ik,jl} = \frac{\partial ^{2}} {\partial y_{i}^{k}\partial y_{j}^{l}}V (\boldsymbol{r}_{1}^{(0)} +\boldsymbol{ y}_{ 1},\ldots \boldsymbol{r}_{N}^{(0)} +\boldsymbol{ y}_{ N})\Big\vert _{\boldsymbol{y}_{m}=\boldsymbol{0}}.}$

The second order Lagrange function for small oscillations of the system,

$\displaystyle{ L = \frac{1} {2}\sum _{ik}m_{i}\dot{x}_{i}^{k}\dot{x}_{ i}^{k} -\frac{1} {2}\sum _{ijkl}V _{ik,jl}x_{i}^{k}x_{ j}^{l}, }$

(19.56)

yields 3N coupled equations of motion

$\displaystyle{ m_{i}\ddot{x}_{i}^{k} = -\sum _{ jl}V _{ik,jl}x_{j}^{l}. }$

(19.57)

Fourier transformation

$\displaystyle{ x_{i}^{k}(t) =\int \! d\omega \,a_{ i}^{k}(\omega )\exp (-\mathrm{i}\omega t),\quad [a_{ i}^{k}(\omega )]^{+} = a_{ i}^{k}(-\omega ), }$

(19.58)

yields the conditions

$\displaystyle{ \sum _{jl}(V _{ik,jl} - m_{i}\omega ^{2}\delta _{ ij}\delta _{kl})a_{j}^{l}(\omega ) = 0. }$

(19.59)

Writing this in the form

$\displaystyle{\sum _{jl}\bigg( \frac{V _{ik,jl}} {\sqrt{m_{i } m_{j}}} -\omega ^{2}\delta _{ ij}\delta _{kl}\bigg)\sqrt{m_{j}}a_{j}^{l}(\omega ) = 0}$

tells us that the 3N-dimensional vector

$\displaystyle{ \boldsymbol{Q}(\omega ) =\{ \sqrt{m_{1}}a_{1}^{1}(\omega ),\ldots \sqrt{m_{ N}}a_{N}^{3}(\omega )\} =\boldsymbol{ Q}^{+}(-\omega ) }$

(19.60)

must have the form

$\displaystyle{ \boldsymbol{Q}(\omega ) =\sum _{ I=1}^{3N}\left [\boldsymbol{Q}_{ I}\delta (\omega -\omega _{I}) +\boldsymbol{ Q}_{-I}\delta (\omega +\omega _{I})\right ], }$

(19.61)

where $\boldsymbol{Q}_{I} =\{ \sqrt{m_{1}}a_{I,1}^{1},\ldots \sqrt{m_{N}}a_{I,N}^{3}\} =\boldsymbol{ Q}_{-I}^{+}$ is an eigenvector of the symmetric 3N × 3N matrix

$\displaystyle{ \Omega _{ik,jl}^{2} = \frac{V _{ik,jl}} {\sqrt{m_{i } m_{j}}} }$

(19.62)

with eigenvalue $\omega _{I}^{2}$ . We assume that $\boldsymbol{r}_{1}^{(0)},\ldots \boldsymbol{r}_{N}^{(0)}$ is a stable equilibrium configuration such that all eigenvalues of $\Omega _{ik,jl}^{2}$ satisfy $\omega _{I}^{2} \geq 0$ , and we define $\omega _{I} = \sqrt{\omega _{I }^{2}} \geq 0$ as the non-negative roots.

Since $\underline{\Omega ^{2}}$ is a symmetric real 3N × 3N matrix, we can find 3N orthogonal normalized real vectors

$\displaystyle{\hat{\boldsymbol{Q}} _{I} =\{ \sqrt{m_{1}}\hat{a}_{I,1}^{1},\ldots \sqrt{m_{ N}}\hat{a}_{I,N}^{3}\}}$

which solve the eigenvalue problem

$\displaystyle{ \underline{\Omega ^{2}} \cdot \hat{\boldsymbol{Q}} _{ I} =\omega _{ I}^{2}\hat{\boldsymbol{Q}} _{ I}. }$

(19.63)

The general solution $\boldsymbol{Q}_{I}$ (19.60) of the eigenvalue problem with eigenvalue ω _I ² will then have the form

$\displaystyle{\boldsymbol{Q}_{I} = q_{I}\hat{\boldsymbol{Q}} _{I}}$

with arbitrary complex factors $q_{I} = \left \vert q_{I}\right \vert \exp (\mathrm{i}\varphi _{I})$ . The mode expansion (19.58) will therefore take the form

$\displaystyle\begin{array}{rcl} x_{i}^{k}(t)& =& \sum _{ I=1}^{3N}\,\hat{a}_{ I,i}^{k}\left [q_{ I}\exp (-\mathrm{i}\omega _{I}t) + q_{I}^{+}\exp (\mathrm{i}\omega _{ I}t)\right ] \\ & =& 2\sum _{I=1}^{3N}\,\hat{a}_{ I,i}^{k}\left \vert q_{ I}\right \vert \left [\cos (\varphi _{I})\cos (\omega _{I}t) +\sin (\varphi _{I})\sin (\omega _{I}t)\right ].{}\end{array}$

(19.64)

Equation (19.63) and $V _{ik,jl} = V _{jl,ik} = V _{ik,jl}^{+}$ imply the orthogonality relations

$\displaystyle{0 =\sum _{ijkl}(\hat{a}_{I,i}^{k}V _{ ik,jl}\hat{a}_{J,j}^{l} -\hat{ a}_{ I,i}^{k}V _{ ik,jl}\hat{a}_{J,j}^{l}) =\sum _{ ik}m_{i}\hat{a}_{I,i}^{k}\hat{a}_{ J,i}^{k}(\omega _{ I}^{2} -\omega _{ J}^{2}).}$

This yields

$\displaystyle{ \sum _{ik}m_{i}\hat{a}_{I,i}^{k}\hat{a}_{ J,i}^{k} =\delta _{ IJ}, }$

(19.65)

where we assume that eigenvectors $\hat{\boldsymbol{Q}} _{I}$ within degeneracy subspaces have been orthonormalized.

Note that the normalization changes the dimensions and the physical meaning of the coefficients. The amplitudes $a_{I,i}^{k}$ in equation (19.64) have the dimensions of a length, and the related eigenvectors $\boldsymbol{Q}_{I}$ and factors $q_{I}$ have the dimension of mass$^{1/2}\times$length. The normalized eigenvectors $\hat{\boldsymbol{Q}} _{I}$ are dimensionless, and therefore the related coefficients $\hat{a}_{I,i}^{k}$ have dimension mass$^{-1/2}$. We will denote the related 3N-dimensional vector $\hat{\boldsymbol{a}}_{I} =\{\hat{ a}_{I,1}^{1},\ldots,\hat{a}_{I,N}^{3}\}$ as an amplitude vector.

The small oscillations of the system are then determined by the eigenmodes $\hat{\boldsymbol{a}}_{I}$ (or equivalently $\hat{\boldsymbol{Q}} _{I}$ ), and how strongly these eigenmodes of oscillation are excited,

$\displaystyle{ x_{i}^{k}(t) =\sum _{ I=1}^{3N}\hat{a}_{ I,i}^{k}\left [q_{ I}\exp (-\mathrm{i}\omega _{I}t) + q_{I}^{+}\exp (\mathrm{i}\omega _{ I}t)\right ], }$

(19.66)

$\displaystyle{ \dot{x}_{i}^{k}(t) = -\,\mathrm{i}\sum _{ I=1}^{3N}\,\omega _{ I}\hat{a}_{I,i}^{k}\left [q_{ I}\exp (-\mathrm{i}\omega _{I}t) - q_{I}^{+}\exp (\mathrm{i}\omega _{ I}t)\right ], }$

(19.67)

$\displaystyle{ q_{I} = \frac{1} {2}\exp (\mathrm{i}\omega _{I}t)\sum _{ik}m_{i}\hat{a}_{I,i}^{k}\left (x_{ i}^{k}(t) + \frac{\mathrm{i}} {\omega _{I}}\dot{x}_{i}^{k}(t)\right ). }$

(19.68)

The 3N complex amplitudes $q_{I}$ are denoted as normal coordinates of the oscillating N particle system, and the related eigenmodes of oscillation are also denoted as normal modes. Note from equations (19.66) or (19.68) that we can think of the coefficients $\hat{a}_{I,i}^{k}$ also as the components of a 3N × 3N transformation matrix between the 3N Cartesian coordinates $x_{i}^{k}(t)$ and the 3N normal coordinates $q_{I}$ of the oscillating system. These 3N × 3N matrices satisfy the mass weighted orthogonality properties (19.65) and

$\displaystyle{ \sum _{I}\hat{a}_{I,i}^{k}\hat{a}_{ I,j}^{l} = \frac{1} {m_{i}}\delta _{ij}\delta ^{kl}, }$

(19.69)

which follows from re-substitution of $q_{I}$ (19.68) into $x_{i}^{k}(t)$ (19.66).
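The construction of the amplitude vectors and the relations (19.65) and (19.69) can be sketched numerically. The example below is a hypothetical minimal system, two unequal masses coupled by one spring and moving in one dimension; the function `normal_modes` and all parameter values are assumptions for illustration. For simplicity the code stores one mass per coordinate, which corresponds to repeating $m_{i}$ for the three components k in the notation of the text.

```python
import numpy as np


def normal_modes(coordinate_masses, v_matrix):
    """Return frequencies omega_I and amplitude vectors a_hat[I, coordinate].

    The mass-weighted matrix (19.62) is diagonalized as in (19.63); the
    amplitude vectors follow by dividing the orthonormal eigenvectors by
    the square roots of the masses.
    """
    m = np.asarray(coordinate_masses, dtype=float)
    omega_sq_matrix = v_matrix / np.sqrt(np.outer(m, m))
    omega_sq, q_hat = np.linalg.eigh(omega_sq_matrix)   # columns = eigenvectors
    omega = np.sqrt(np.clip(omega_sq, 0.0, None))       # remove rounding noise
    a_hat = (q_hat / np.sqrt(m)[:, None]).T
    return omega, a_hat


masses = np.array([1.0, 3.0])
spring = 2.0
v = spring * np.array([[1.0, -1.0],
                       [-1.0, 1.0]])

omega, a_hat = normal_modes(masses, v)
orthogonality = a_hat @ np.diag(masses) @ a_hat.T   # (19.65): identity matrix
completeness = a_hat.T @ a_hat                      # (19.69): diag(1/m_i)
print(omega**2)
print(orthogonality)
print(completeness)
```

The zero eigenvalue belongs to the uniform translation of both masses, and the non-vanishing eigenvalue equals the spring constant divided by the reduced mass of the pair.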

Appearance of the particular eigenvalue $\omega _{I}^{2} = 0$ implies that the system is symmetric under rotations or translations. The corresponding amplitude vectors $\hat{\boldsymbol{a}}_{I} =\{\hat{ a}_{I,i}^{k}\}$ denote the tangential directions to rotations or translations of the system.

We have learned that small oscillations of a system are always superpositions of the normal oscillation modes or eigenoscillations of the system. A priori this does not seem to be particularly helpful for determining the actual small oscillations of a system, because finding the eigenmodes is equivalent to the diagonalization of the 3N × 3N matrix $\Omega _{ik,jl}^{2}$ , which is in any case the main task in the solution of the equations of motion (19.57) using the Fourier ansatz (19.64).

However, if the equilibrium configuration of the system has symmetries, then we can often guess the form of some of the eigenmodes, which leaves us with a smaller diagonalization problem for the determination of the remaining eigenmodes.

Eigenmodes of three masses

A simple example for the identification of normal modes of a coupled particle system is given by three identical masses in a regular triangle, see Figure 19.2.

Fig. 19.2

Three elastically bound masses with equilibrium distance d

We will determine the eigenmodes in the plane of the triangle. The potential of the coupled system in the harmonic approximation is

$\displaystyle\begin{array}{rcl} V & =& \frac{K} {2} \Big((\vert \boldsymbol{r}_{1} -\boldsymbol{ r}_{2}\vert - d)^{2} + (\vert \boldsymbol{r}_{ 1} -\boldsymbol{ r}_{3}\vert - d)^{2} + (\vert \boldsymbol{r}_{ 2} -\boldsymbol{ r}_{3}\vert - d)^{2}\Big) {}\\ & \simeq & \frac{K} {2} \Big((x_{1}^{1} - x_{ 2}^{1})^{2} + \frac{1} {4}(x_{1}^{1} - x_{ 3}^{1})^{2} + \frac{1} {4}(x_{2}^{1} - x_{ 3}^{1})^{2} + \frac{3} {4}(x_{1}^{2} - x_{ 3}^{2})^{2} {}\\ & & +\,\frac{3} {4}(x_{2}^{2} - x_{ 3}^{2})^{2} + \frac{\sqrt{3}} {2} (x_{1}^{1}x_{ 1}^{2} - x_{ 1}^{1}x_{ 3}^{2} - x_{ 2}^{1}x_{ 2}^{2} + x_{ 2}^{1}x_{ 3}^{2} {}\\ & & -\,x_{3}^{1}x_{ 1}^{2} + x_{ 3}^{1}x_{ 2}^{2})\Big). {}\\ \end{array}$

The matrix V _{i k, j l} is

$\displaystyle{\underline{V } = m\underline{\Omega ^{2}} = K\left (\begin{array}{cccccc} \frac{5} {4} & \frac{\sqrt{3}} {4} & - 1 & 0 & -\frac{1} {4} & -\frac{\sqrt{3}} {4} \\ \frac{\sqrt{3}} {4} & \frac{3} {4} & 0 & 0 & -\frac{\sqrt{3}} {4} & -\frac{3} {4} \\ - 1 & 0 & \frac{5} {4} & -\frac{\sqrt{3}} {4} & -\frac{1} {4} & \frac{\sqrt{3}} {4} \\ 0 & 0 & -\frac{\sqrt{3}} {4} & \frac{3} {4} & \frac{\sqrt{3}} {4} & -\frac{3} {4} \\ -\frac{1} {4} & -\frac{\sqrt{3}} {4} & -\frac{1} {4} & \frac{\sqrt{3}} {4} & \frac{1} {2} & 0 \\ -\frac{\sqrt{3}} {4} & -\frac{3} {4} & \frac{\sqrt{3}} {4} & -\frac{3} {4} & 0 & \frac{3} {2}\\ \end{array} \right ),}$

and we must have

$\displaystyle{\mbox{ Det}(\underline{V } - m\omega ^{2}\underline{\boldsymbol{1}}) = 0.}$

Absence of external forces on the coupled system implies that there must be two translational and one rotational eigenmode, see Figures 19.2 and 19.3,

$\displaystyle{\hat{\boldsymbol{Q}}_{1} = \frac{1} {\sqrt{3}}\left (\begin{array}{c} 1\\ 0 \\ 1\\ 0 \\ 1\\ 0\\ \end{array} \right ),\quad \hat{\boldsymbol{Q}} _{2} = \frac{1} {\sqrt{3}}\left (\begin{array}{c} 0\\ 1 \\ 0\\ 1 \\ 0\\ 1\\ \end{array} \right ),\quad \hat{\boldsymbol{Q}} _{3} = \frac{1} {2\sqrt{3}}\left (\begin{array}{c} 1 \\ -\sqrt{3} \\ 1 \\ \sqrt{3} \\ - 2\\ 0\\ \end{array} \right ).}$

Fig. 19.3

The rotation mode $\hat{\boldsymbol{Q}} _{3}$

The equations $\underline{V } \cdot \hat{\boldsymbol{Q}} _{I} =\boldsymbol{ 0}$ for I = 1, 2, 3 are readily verified.

The symmetry reveals that another eigenmode can be read off from Figure 19.4.

Fig. 19.4

The eigenmode $\hat{\boldsymbol{Q}}_{4} = \sqrt{m}\hat{\boldsymbol{a}}_{4}$

This yields the corresponding normalized eigenvector

$\displaystyle{\hat{\boldsymbol{Q}}_{4} = \sqrt{m}\left (\begin{array}{c} a_{4,1}^{1} \\ a_{4,1}^{2} \\ a_{4,2}^{1} \\ a_{4,2}^{2} \\ a_{4,3}^{1} \\ a_{4,3}^{2}\\ \end{array} \right ) = \frac{1} {2\sqrt{3}}\left (\begin{array}{c} \sqrt{3} \\ 1 \\ -\sqrt{3} \\ 1\\ 0 \\ - 2\\ \end{array} \right ),}$

and application of

$\displaystyle{\omega _{4}^{2}\hat{\boldsymbol{Q}} _{ 4} = \frac{1} {m}\underline{V } \cdot \hat{\boldsymbol{Q}} _{4},}$

yields for the corresponding frequency

$\displaystyle{\omega _{4}^{2} = \frac{3K} {m}.}$

So far we have found four eigenmodes of the planar system, and there must still be two remaining eigenmodes, which must be orthogonal to the eigenmodes $\hat{\boldsymbol{Q}}_{1},\ldots,\hat{\boldsymbol{Q}} _{4}$ . This yields for

$\displaystyle{\hat{\boldsymbol{Q}} _{I} = \sqrt{m}\left (\begin{array}{c} a_{I,1}^{1} \\ a_{I,1}^{2} \\ a_{I,2}^{1} \\ a_{I,2}^{2} \\ a_{I,3}^{1} \\ a_{I,3}^{2}\\ \end{array} \right ),\quad I = 5,6}$

the conditions

$\displaystyle\begin{array}{rcl} & & \sqrt{3}(a_{I,1}^{1} - a_{ I,2}^{1}) + a_{ I,1}^{2} + a_{ I,2}^{2} - 2a_{ I,3}^{2} = 0, {}\\ & & a_{I,1}^{1} + a_{ I,2}^{1} + a_{ I,3}^{1} = 0, {}\\ & & a_{I,1}^{2} + a_{ I,2}^{2} + a_{ I,3}^{2} = 0, {}\\ & & a_{I,1}^{1} + a_{ I,2}^{1} - 2a_{ I,3}^{1} + \sqrt{3}(a_{ I,2}^{2} - a_{ I,1}^{2}) = 0, {}\\ \end{array}$

with general solutions

$\displaystyle{\hat{\boldsymbol{Q}} _{I=5,6} \sim \frac{A} {2\sqrt{3}}\left (\begin{array}{c} \sqrt{3} \\ - 1 \\ -\sqrt{3} \\ - 1\\ 0 \\ 2\\ \end{array} \right )+ \frac{B} {2\sqrt{3}}\left (\begin{array}{c} 1 \\ \sqrt{3} \\ 1 \\ -\sqrt{3} \\ - 2\\ 0\\ \end{array} \right ).}$

Application of $\underline{\Omega ^{2}}$ reveals that these are degenerate eigenvectors with eigenvalue

$\displaystyle{\omega _{5}^{2} =\omega _{ 6}^{2} = \frac{3K} {2m},}$

and an orthonormal basis in the degeneracy subspace is provided by

$\displaystyle{\hat{\boldsymbol{Q}}_{5} = \frac{1} {2\sqrt{3}}\left (\begin{array}{c} \sqrt{3} \\ - 1 \\ -\sqrt{3} \\ - 1\\ 0 \\ 2\\ \end{array} \right ),\quad \hat{\boldsymbol{Q}} _{6} = \frac{1} {2\sqrt{3}}\left (\begin{array}{c} 1 \\ \sqrt{3} \\ 1 \\ -\sqrt{3} \\ - 2\\ 0\\ \end{array} \right ).}$

The corresponding eigenmodes are shown in Figure 19.5.

Fig. 19.5

The eigenmodes $\hat{\boldsymbol{Q}}_{5}$ and $\hat{\boldsymbol{Q}}_{6}$

The general small oscillation with ω > 0 is then given by

$\displaystyle{\left (\begin{array}{c} x_{1}^{1}(t) \\ x_{1}^{2}(t) \\ x_{2}^{1}(t) \\ x_{2}^{2}(t) \\ x_{3}^{1}(t) \\ x_{3}^{2}(t)\\ \end{array} \right ) =\sum _{ I=4}^{6}\hat{\boldsymbol{Q}} _{ I}x_{I}(t)}$

with

$\displaystyle{x_{I}(t) = x_{I}(0)\cos (\omega _{I}t) + \frac{\dot{x}_{I}(0)} {\omega _{I}} \sin (\omega _{I}t).}$
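The eigenvalue equations for the six modes of the triangle can also be verified numerically. The following minimal sketch copies the matrix $\underline{V }/K$ and the vectors $\hat{\boldsymbol{Q}} _{1},\ldots,\hat{\boldsymbol{Q}} _{6}$ from the text and checks $\underline{V } \cdot \hat{\boldsymbol{Q}} _{I} = m\omega _{I}^{2}\hat{\boldsymbol{Q}} _{I}$ together with orthonormality. The helper names are illustrative assumptions.

```python
import math

s3 = math.sqrt(3.0)

# V / K from the text, in the ordering (x_1^1, x_1^2, x_2^1, x_2^2, x_3^1, x_3^2).
V_over_K = [
    [5 / 4, s3 / 4, -1.0, 0.0, -1 / 4, -s3 / 4],
    [s3 / 4, 3 / 4, 0.0, 0.0, -s3 / 4, -3 / 4],
    [-1.0, 0.0, 5 / 4, -s3 / 4, -1 / 4, s3 / 4],
    [0.0, 0.0, -s3 / 4, 3 / 4, s3 / 4, -3 / 4],
    [-1 / 4, -s3 / 4, -1 / 4, s3 / 4, 1 / 2, 0.0],
    [-s3 / 4, -3 / 4, s3 / 4, -3 / 4, 0.0, 3 / 2],
]


def scaled(norm, components):
    """Multiply every component by a common normalization factor."""
    return [norm * c for c in components]


n1 = 1.0 / s3
n2 = 1.0 / (2.0 * s3)
modes = [
    scaled(n1, (1, 0, 1, 0, 1, 0)),        # Q_1: translation along x^1
    scaled(n1, (0, 1, 0, 1, 0, 1)),        # Q_2: translation along x^2
    scaled(n2, (1, -s3, 1, s3, -2, 0)),    # Q_3: rotation
    scaled(n2, (s3, 1, -s3, 1, 0, -2)),    # Q_4
    scaled(n2, (s3, -1, -s3, -1, 0, 2)),   # Q_5
    scaled(n2, (1, s3, 1, -s3, -2, 0)),    # Q_6
]

# Values of m * omega_I^2 / K quoted in the text.
eigenvalues = [0.0, 0.0, 0.0, 3.0, 1.5, 1.5]


def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


# Largest component of (V/K) Q_I - (m omega_I^2 / K) Q_I for each mode.
residuals = [
    max(abs(w - lam * q) for w, q in zip(matvec(V_over_K, Q), Q))
    for Q, lam in zip(modes, eigenvalues)
]

# Gram matrix of the six vectors; orthonormality means it is the unit matrix.
gram = [[dot(P, Q) for Q in modes] for P in modes]

print("residuals:", residuals)
```

Checking a guessed eigenvector only requires a matrix-vector product, which is why the symmetry arguments above avoid a full diagonalization of the 6 × 6 matrix.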

The diatomic linear chain

Lines of harmonically bound atoms provide important model systems for oscillations in solid state physics. We consider in particular a diatomic chain of 2N atoms with masses m and M, respectively. This model is shown in Figure 19.6. The force constant between the atoms is K and their equilibrium distance is a∕2. The number N of atom pairs is assumed to be even for simplicity.

Fig. 19.6

A diatomic linear chain with masses m and M and lattice constant a

We label the pairs of atoms with an index n, 1 − (N∕2) ≤ n ≤ N∕2, and we use periodic boundary conditions for the displacements x _n and X _n ,

$\displaystyle{x_{n+N} = x_{n},\quad X_{n+N} = X_{n}.}$

The Lagrange function

$\displaystyle{L =\sum _{ n=1-(N/2)}^{N/2}\left (\frac{m} {2} \dot{x}_{n}^{2} + \frac{M} {2} \dot{X}_{n}^{2} -\frac{K} {2} (X_{n} - x_{n})^{2} -\frac{K} {2} (x_{n} - X_{n-1})^{2}\right )}$

yields equations of motion

$\displaystyle{ m\ddot{x}_{n} = -\,K(2x_{n} - X_{n} - X_{n-1}),\quad M\ddot{X}_{n} = -\,K(2X_{n} - x_{n} - x_{n+1}), }$

(19.70)

which can be solved using Fourier decomposition on a finite periodic chain,

$\displaystyle{ x_{n}(t) = \frac{1} {\sqrt{N}}\sum _{k}\tilde{q}_{k}(t)\exp (\mathrm{i}nka), }$

(19.71)

$\displaystyle{X_{n}(t) = \frac{1} {\sqrt{N}}\sum _{k}\tilde{Q}_{k}(t)\exp (\mathrm{i}nka),}$

with

$\displaystyle{k = \frac{2\pi \tilde{n}} {Na},\quad 1 -\frac{N} {2} \leq \tilde{ n} \leq \frac{N} {2}.}$

The geometric series

$\displaystyle\begin{array}{rcl} & & \!\!\!\!\sum _{n=1-(N/2)}^{N/2}\exp \left (2\pi \mathrm{i}n\frac{\tilde{n} -\tilde{ m}} {N} \right ) =\exp \left [\mathrm{i}\pi \left ( \frac{2} {N} - 1\right )(\tilde{n} -\tilde{ m})\right ] \\ & & \times \sum _{n=0}^{N-1}\exp \left (2\pi \mathrm{i}n\frac{\tilde{n} -\tilde{ m}} {N} \right ) =\exp \left [\mathrm{i}\pi \left ( \frac{2} {N} - 1\right )(\tilde{n} -\tilde{ m})\right ] \\ & & \times \frac{1 -\exp [2\pi \mathrm{i}(\tilde{n} -\tilde{ m})]} {1 -\exp \left [\frac{2\pi \mathrm{i}} {N}(\tilde{n} -\tilde{ m})\right ]} = N\delta _{\tilde{n},\tilde{m}} {}\end{array}$

(19.72)

implies that the inversion of (19.71) is

$\displaystyle{\tilde{q}_{k}(t) = \frac{1} {\sqrt{N}}\sum _{n=1-(N/2)}^{N/2}x_{ n}(t)\exp (-\mathrm{i}nka) =\tilde{ q}_{-k}^{+}(t).}$
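The orthogonality relation (19.72) and the resulting inversion can be illustrated for a short chain. This is a sketch with an arbitrarily chosen N = 8 and arbitrary real displacements; the function names are hypothetical and only serve the illustration.

```python
import cmath
import math

N = 8  # number of atom pairs; an arbitrary even value chosen for illustration
cells = range(1 - N // 2, N // 2 + 1)  # 1 - N/2 <= n <= N/2


def lattice_sum(n_tilde, m_tilde):
    """Left-hand side of (19.72)."""
    return sum(cmath.exp(2j * cmath.pi * n * (n_tilde - m_tilde) / N)
               for n in cells)


def to_modes(x):
    """Fourier amplitudes q_k for k a = 2 pi n_tilde / N, keyed by n_tilde."""
    return {nt: sum(x[n] * cmath.exp(-2j * cmath.pi * n * nt / N)
                    for n in cells) / math.sqrt(N)
            for nt in cells}


def from_modes(q):
    """Displacements x_n reconstructed from the amplitudes, as in (19.71)."""
    return {n: sum(q[nt] * cmath.exp(2j * cmath.pi * n * nt / N)
                   for nt in cells) / math.sqrt(N)
            for n in cells}


# Arbitrary real displacements, keyed by the cell index n.
x = {n: math.sin(0.9 * n) + 0.1 * n for n in cells}
q = to_modes(x)
x_back = from_modes(q)

print(max(abs(x_back[n] - x[n]) for n in cells))
```

For real displacements the amplitudes come in complex conjugate pairs for k and −k. The amplitude at the zone boundary ñ = N∕2 is its own partner, because ñ = −N∕2 describes the same wave number on the periodic chain.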

Since the resulting system of ordinary differential equations for $\tilde{q}_{k}(t)$ and $\tilde{Q}_{k}(t)$ is linear with constant coefficients, we also use Fourier transformation to the frequency domain,

$\displaystyle{\tilde{q}_{k}(t) =\int \! d\omega \,\tilde{q}_{k}(\omega )\exp (-\mathrm{i}\omega t),}$

and the coupled set of equations (19.70) separates into coupled pairs of equations for different wave numbers k,

$\displaystyle{ (m\omega ^{2} - 2K)\tilde{q}_{ k}(\omega ) + K(1 +\exp (-\mathrm{i}ka))\tilde{Q}_{k}(\omega ) = 0, }$

(19.73)

$\displaystyle{ (M\omega ^{2} - 2K)\tilde{Q}_{ k}(\omega ) + K(1 +\exp (\mathrm{i}ka))\tilde{q}_{k}(\omega ) = 0. }$

(19.74)

This pair of homogeneous linear equations has nontrivial solutions only if the determinant of its coefficients vanishes. Therefore the frequencies ω = ω _k for each wave number k have to satisfy

$\displaystyle{mM\omega _{k}^{4} - 2K(m + M)\omega _{ k}^{2} + 2K^{2}(1 -\cos (ka)) = 0.}$

This condition has two solutions (up to irrelevant overall signs of ω _k±),

$\displaystyle\begin{array}{rcl} \omega _{k\pm }^{2}& =& K\left ( \frac{1} {M} + \frac{1} {m}\right ) \pm K\sqrt{ \frac{1} {M^{2}} + \frac{1} {m^{2}} + \frac{2} {mM}\cos (ka)} \\ & =& K\left ( \frac{1} {M} + \frac{1} {m}\right ) \pm K\sqrt{\left ( \frac{1} {M} + \frac{1} {m}\right )^{2} - \frac{4} {mM}\sin ^{2}\left (\frac{ka} {2} \right )},{}\end{array}$

(19.75)

and we have

$\displaystyle{\tilde{q}_{k}(\omega ) =\tilde{ q}_{k+}\delta (\omega -\omega _{k+}) +\tilde{ q}_{k-}\delta (\omega -\omega _{k-}).}$

Equation (19.75) reads in terms of the reduced mass μ = m M∕(m + M) of the atom pair in the unit cell

$\displaystyle{ \omega _{k\pm }^{2} = \frac{K} {\mu } \left (1 \pm \sqrt{1 - \frac{4\mu } {m + M}\sin ^{2}\left (\frac{ka} {2} \right )}\right ). }$

(19.76)

An example of these dispersion relations with M = 1.5m is displayed in Figure 19.7.

Fig. 19.7

The frequencies ω _k± from the dispersion relation (19.76) for M = 1.5m and 0 ≤ k a ≤ π. The frequencies ω _k± are displayed in units of $\sqrt{K/\mu }$ , where μ is the reduced mass of the atom pair in a unit cell

Note that the Lagrange function for a single atom pair in the unit cell is

$\displaystyle{L = \frac{1} {2}(m + M)\dot{R}^{2} + \frac{\mu } {2}\dot{r}^{2} -\frac{K} {2} r^{2},\quad r = x - X,\quad R = \frac{mx + MX} {m + M},}$

and therefore the oscillation frequency of the single pair is $\sqrt{ K/\mu }$ .

The frequencies at k = 0 are ω ₀₋ = 0 and $\omega _{0+} = \sqrt{2K/\mu }$ .

The solution of (19.73, 19.74) for ω ₀₋ = 0: $\tilde{q}_{0-} =\tilde{ Q}_{0-}$ , is a uniform translation of the whole chain,

$\displaystyle{x_{n}(t) = X_{n}(t) =\tilde{ q}_{0-}/\sqrt{N}.}$

The solution for ω ₀₊: $m\tilde{q}_{0+} = -M\tilde{Q}_{0+}$ , is an oscillation

$\displaystyle{\left (\begin{array}{c} x_{n}(t) \\ X_{n}(t)\\ \end{array} \right ) = A\left (\begin{array}{c} M\\ - m\\ \end{array} \right )\cos \left (\sqrt{\frac{2K} {\mu }} t+\varphi \right ).}$

For M > m, the acoustic solution for k a = π is

$\displaystyle{\omega _{(\pi /a)-} = \sqrt{\frac{2K} {M}},\quad \tilde{q}_{(\pi /a)-} = 0,}$

i.e. only the heavy atoms oscillate,

$\displaystyle{\left (\begin{array}{c} x_{n}(t) \\ X_{n}(t)\\ \end{array} \right ) = (-)^{n}A\left (\begin{array}{c} 0 \\ 1\\ \end{array} \right )\cos \left (\sqrt{\frac{2K} {M}} t+\varphi \right ).}$

On the other hand, the optical eigenmode with k a = π,

$\displaystyle{\omega _{(\pi /a)+} = \sqrt{\frac{2K} {m}},\quad \tilde{Q}_{(\pi /a)+} = 0,}$

corresponds to an oscillation of the light atoms,

$\displaystyle{\left (\begin{array}{c} x_{n}(t) \\ X_{n}(t)\\ \end{array} \right ) = (-)^{n}A\left (\begin{array}{c} 1 \\ 0\\ \end{array} \right )\cos \left (\sqrt{\frac{2K} {m}} t+\varphi \right ).}$

The general longitudinal oscillation will be a superposition of all longitudinal eigenvibrations.
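The dispersion relation (19.76) and its limiting values can be evaluated with a few lines of code. The sketch below uses the mass ratio M = 1.5m of Figure 19.7 with K = m = 1 as an arbitrary choice of units. The function names are assumptions made for this illustration. It checks that both branches satisfy the quartic condition for ω _k, and it evaluates the zone center and zone boundary frequencies.

```python
import math


def omega_squared(ka, m, M, K):
    """Return (acoustic, optical) values of omega_k^2 from (19.76)."""
    mu = m * M / (m + M)
    disc = 1.0 - 4.0 * mu / (m + M) * math.sin(ka / 2.0) ** 2
    root = math.sqrt(max(disc, 0.0))  # guards against rounding when m == M
    return K / mu * (1.0 - root), K / mu * (1.0 + root)


def secular(w2, ka, m, M, K):
    """Left-hand side of the quartic condition preceding (19.75)."""
    return (m * M * w2 ** 2 - 2.0 * K * (m + M) * w2
            + 2.0 * K ** 2 * (1.0 - math.cos(ka)))


m, M, K = 1.0, 1.5, 1.0  # illustrative units, M = 1.5 m as in Figure 19.7
mu = m * M / (m + M)

for ka in (0.0, math.pi / 4, math.pi / 2, math.pi):
    acoustic, optical = omega_squared(ka, m, M, K)
    print(f"ka = {ka:.4f}: omega_-^2 = {acoustic:.6f}, omega_+^2 = {optical:.6f}")

center = omega_squared(0.0, m, M, K)        # expected (0, 2K/mu)
boundary = omega_squared(math.pi, m, M, K)  # expected (2K/M, 2K/m) for M > m
```

The gap between the two branches at k a = π is 2K∕m − 2K∕M in terms of ω², which closes only for equal masses.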

Quantization of N-particle oscillations

The Lagrange function (19.56) implies canonical commutation relations

$\displaystyle{[x_{i}^{k}(t),\dot{x}_{ j}^{l}(t)] = \frac{\mathrm{i}\hbar } {m_{i}}\delta _{ij}\delta ^{kl},\quad [x_{ i}^{k}(t),x_{ j}^{l}(t)] = 0,\quad [\dot{x}_{ i}^{k}(t),\dot{x}_{ j}^{l}(t)] = 0.}$

This yields commutation relations for the normal coordinates

$\displaystyle{[q_{I},q_{J}] = 0,\quad [q_{I},q_{J}^{+}] = \frac{\hbar } {2\omega _{I}}\delta _{IJ}.}$

Therefore we find canonical annihilation and creation operators for the eigenvibrations in the form

$\displaystyle{a_{I} = \sqrt{\frac{2\omega _{I } } {\hbar }} q_{I},\quad a_{I}^{+} = \sqrt{\frac{2\omega _{I } } {\hbar }} q_{I}^{+}.}$
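The step from the canonical commutators to the commutators of the normal coordinates can be made explicit for the modes with ω _I > 0. The following sketch assumes that (19.65) has the form $\sum _{i,k}m_{i}\hat{a}_{I,i}^{k}\hat{a}_{J,i}^{k} =\delta _{IJ}$ with real coefficients $\hat{a}_{I,i}^{k}$. Substitution of (19.68) and its adjoint yields

$\displaystyle\begin{array}{rcl} [q_{I},q_{J}^{+}]& =& \frac{1} {4}\exp [\mathrm{i}(\omega _{I} -\omega _{J})t]\sum _{ik,jl}m_{i}m_{j}\hat{a}_{I,i}^{k}\hat{a}_{J,j}^{l}\left [x_{i}^{k} + \frac{\mathrm{i}} {\omega _{I}}\dot{x}_{i}^{k},\,x_{j}^{l} - \frac{\mathrm{i}} {\omega _{J}}\dot{x}_{j}^{l}\right ] {}\\ & =& \frac{\hbar } {4}\exp [\mathrm{i}(\omega _{I} -\omega _{J})t]\left ( \frac{1} {\omega _{I}} + \frac{1} {\omega _{J}}\right )\sum _{ik}m_{i}\hat{a}_{I,i}^{k}\hat{a}_{J,i}^{k} = \frac{\hbar } {2\omega _{I}}\delta _{IJ}. {}\\ \end{array}$

In the commutator [q _I , q _J ] the two cross terms enter with opposite signs and are proportional to (1∕ω _I − 1∕ω _J ) δ _IJ , which vanishes.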

The discussion of the diatomic chain taught us that for lattice oscillations the eigenmodes also depend on wave vectors in a Brillouin zone, and the following section will show that there can be up to 3N branches if we have N atoms per unit cell. Therefore we will have annihilation and creation operators for lattice vibrations which are related to the corresponding normal modes through

$\displaystyle{a_{I}(\boldsymbol{k}) = \sqrt{\frac{2\omega _{I,\boldsymbol{k} } } {\hbar }} q_{I}(\boldsymbol{k}),\quad a_{I}^{+}(\boldsymbol{k}) = \sqrt{\frac{2\omega _{I,\boldsymbol{k} } } {\hbar }} q_{I}^{+}(\boldsymbol{k}).}$

The elementary excitations $a_{I}^{+}(\boldsymbol{k})\boldsymbol{\vert }0\boldsymbol{\rangle }$ of the lattice vibrations are denoted as phonons.

19.6 Quantized lattice vibrations: Phonons

We will first generalize the previous discussion of vibrations in N-particle systems to the case of three-dimensional lattices, and then quantize the lattice vibrations.

We denote the three basis vectors of a three-dimensional lattice with $\boldsymbol{a}_{i}$ , 1 ≤ i ≤ 3. Each location $\boldsymbol{\ell}= n^{i}\boldsymbol{a}_{i}$ in the lattice denotes a particular location of a corresponding unit cell, and we can use $\boldsymbol{\ell}$ or equivalently the three integers n ⁱ also to address the particular unit cell to which the point $\boldsymbol{\ell}$ belongs. Suppose we have N atoms (or ions) per unit cell in the lattice. We denote the displacement of the A-th atom from its equilibrium value in cell $\boldsymbol{\ell}$ by $\boldsymbol{x}_{\boldsymbol{\ell},A}(t)$ , and in the harmonic approximation the displacements satisfy equations of motion

$\displaystyle{ m_{A}\ddot{\boldsymbol{x}}_{\boldsymbol{\ell},A} +\sum _{\boldsymbol{\ell}',A'}\underline{V }_{\boldsymbol{\ell},A;\boldsymbol{\ell}',A'} \cdot \boldsymbol{ x}_{\boldsymbol{\ell}',A'} =\boldsymbol{ 0}, }$

(19.77)

corresponding to a Lagrange function

$\displaystyle{ L = \frac{1} {2}\sum _{\boldsymbol{\ell},A}m_{A}\dot{\boldsymbol{x}}_{\boldsymbol{\ell},A}^{2} -\frac{1} {2}\sum _{\boldsymbol{\ell},A;\boldsymbol{\ell}',A'}\boldsymbol{x}_{\boldsymbol{\ell},A} \cdot \underline{ V }_{\boldsymbol{\ell},A;\boldsymbol{\ell}',A'} \cdot \boldsymbol{ x}_{\boldsymbol{\ell}',A'}. }$

(19.78)

Substitution of Fourier transforms

$\displaystyle{\boldsymbol{x}_{\boldsymbol{\ell},A}(t) = \frac{1} {\sqrt{m_{A}}}\int \!d\omega \,\boldsymbol{Q}_{\boldsymbol{\ell},A}(\omega )\exp (-\mathrm{i}\omega t)}$

into the equations of motion (19.77) yields the eigenvalue conditions

$\displaystyle{ \sum _{\boldsymbol{\ell}',A'}\underline{\Omega ^{2}}_{ \boldsymbol{\ell},A;\boldsymbol{\ell}',A'} \cdot \boldsymbol{ Q}_{\boldsymbol{\ell}',A'}(\omega ) =\omega ^{2}\boldsymbol{Q}_{\boldsymbol{\ell},A}(\omega ) }$

(19.79)

with the symmetric matrices

$\displaystyle{ \underline{\Omega ^{2}}_{ \boldsymbol{\ell},A;\boldsymbol{\ell}',B} = \frac{1} {\sqrt{m_{A } m_{B}}}\underline{V }_{\boldsymbol{\ell},A;\boldsymbol{\ell}',B} =\underline{ \Omega ^{2}}_{ \boldsymbol{\ell}',B;\boldsymbol{\ell},A}^{T}. }$

(19.80)

Translation invariance in the lattice implies that $\underline{\Omega ^{2}}_{\boldsymbol{\ell},A;\boldsymbol{\ell}',B}$ cannot depend on $\boldsymbol{\ell}+\boldsymbol{\ell}'$ . Therefore we can write

$\displaystyle{ \underline{\Omega ^{2}}_{ \boldsymbol{\ell},A;\boldsymbol{\ell}',B} =\underline{ \Omega ^{2}}_{ A,B}(\boldsymbol{\ell}-\boldsymbol{\ell}') = \frac{V } {(2\pi )^{3}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,\underline{\tilde{\Omega }^{2}}_{ A,B}(\boldsymbol{k})\exp [\mathrm{i}\boldsymbol{k} \cdot (\boldsymbol{\ell}-\boldsymbol{\ell}')], }$

(19.81)

with inversion

$\displaystyle{\underline{\tilde{\Omega }^{2}}_{ A,B}(\boldsymbol{k}) =\sum _{\boldsymbol{\ell}}\underline{\Omega ^{2}}_{ A,B}(\boldsymbol{\ell})\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}).}$

Symmetry of the real matrix $\underline{\Omega ^{2}}_{\boldsymbol{\ell},A;\boldsymbol{\ell}',B}$ under $i,\boldsymbol{\ell},A \leftrightarrow j,\boldsymbol{\ell}',B$ implies

$\displaystyle\begin{array}{rcl} \tilde{\Omega }_{iA,jB}^{2}(\boldsymbol{k})& =& \sum _{\boldsymbol{\ell}}\Omega _{ iA,jB}^{2}(\boldsymbol{\ell})\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}) =\sum _{\boldsymbol{\ell}}\Omega _{ jB,iA}^{2}(-\boldsymbol{\ell})\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}) {}\\ & =& \sum _{\boldsymbol{\ell}}\Omega _{jB,iA}^{2}(\boldsymbol{\ell})\exp (\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}) =\tilde{ \Omega }_{ jB,iA}^{2,{\ast}}(\boldsymbol{k}) =\tilde{ \Omega }_{ iA,jB}^{2,+}(\boldsymbol{k}) {}\\ & =& \tilde{\Omega }_{jB,iA}^{2}(-\boldsymbol{k}), {}\\ \end{array}$

i.e.

$\displaystyle{ \underline{\tilde{\Omega }^{2}}(\boldsymbol{k}) =\underline{\tilde{ \Omega }^{2}}^{+}(\boldsymbol{k}) =\underline{\tilde{ \Omega }^{2}}^{T}(-\boldsymbol{k}). }$

(19.82)

Substitution of (19.81) and

$\displaystyle{\boldsymbol{Q}_{\boldsymbol{\ell},A}(\omega ) = \frac{V } {(2\pi )^{3}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\,\tilde{\boldsymbol{Q}}_{\boldsymbol{ k},A}(\omega )\exp (\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell})}$

in (19.79) yields

$\displaystyle{ \sum _{B}\underline{\tilde{\Omega }^{2}}_{ A,B}(\boldsymbol{k}) \cdot \tilde{\boldsymbol{ Q}}_{\boldsymbol{k},B}(\omega ) =\omega ^{2}\tilde{\boldsymbol{Q}}_{\boldsymbol{ k},A}(\omega ). }$

(19.83)

For fixed value of $\boldsymbol{k}$ , this is a hermitian eigenvalue problem for the 3N-dimensional complex vector

$\displaystyle{\tilde{\boldsymbol{Q}}_{\boldsymbol{k}}(\omega ) =\{\tilde{ Q}_{\boldsymbol{k},A}^{i}(\omega )\},\quad 1 \leq i \leq 3,\quad 1 \leq A \leq N.}$
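The diatomic chain of Section 19.5 provides a one-dimensional illustration of this eigenvalue problem with two atoms per unit cell. The following sketch makes several assumptions that are not spelled out in the text: the light atom x _n is labelled A = 1 and the heavy atom X _n is labelled A = 2, and the 2 × 2 matrix is read off from the equations of motion (19.70) with the Fourier convention of (19.81). The function names are hypothetical. The code checks the hermiticity and transposition properties (19.82) and compares the eigenvalues with the dispersion relation (19.76).

```python
import cmath
import math


def dynamical_matrix(ka, m, M, K):
    """Mass-weighted matrix in wave-number space for the diatomic chain.

    Assumed ordering: index 0 is the light atom x_n, index 1 the heavy atom X_n.
    """
    off = -K * (1.0 + cmath.exp(-1j * ka)) / math.sqrt(m * M)
    return [[complex(2.0 * K / m), off],
            [off.conjugate(), complex(2.0 * K / M)]]


def hermitian_eigenvalues(D):
    """Eigenvalues of a hermitian 2 x 2 matrix in closed form."""
    a, c = D[0][0].real, D[1][1].real
    mean = 0.5 * (a + c)
    radius = math.sqrt((0.5 * (a - c)) ** 2 + abs(D[0][1]) ** 2)
    return mean - radius, mean + radius


def dispersion(ka, m, M, K):
    """(acoustic, optical) values of omega_k^2 from (19.76)."""
    mu = m * M / (m + M)
    disc = 1.0 - 4.0 * mu / (m + M) * math.sin(ka / 2.0) ** 2
    root = math.sqrt(max(disc, 0.0))
    return K / mu * (1.0 - root), K / mu * (1.0 + root)


m, M, K = 1.0, 1.5, 1.0  # illustrative values
for ka in (0.0, 0.5, 1.7, math.pi):
    D = dynamical_matrix(ka, m, M, K)
    print(ka, hermitian_eigenvalues(D), dispersion(ka, m, M, K))
```

With two atoms per cell and one direction of motion there are two branches. In three dimensions with N atoms per cell the same construction yields a 3N × 3N hermitian matrix for each wave vector.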

Reality of the displacement vectors $\boldsymbol{x}_{\boldsymbol{\ell},A}(t)$ implies $\boldsymbol{Q}_{\boldsymbol{\ell},A}(\omega ) =\boldsymbol{ Q}_{\boldsymbol{\ell},A}^{+}(-\omega )$ and

$\displaystyle{\tilde{\boldsymbol{Q}}_{\boldsymbol{k}}^{+}(\omega ) =\tilde{\boldsymbol{ Q}}_{ -\boldsymbol{k}}(-\omega ).}$

For each point $\boldsymbol{k}$ in the Brillouin zone, there will be 3N solutions $\omega _{I}^{2}(\boldsymbol{k})$ and $\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k})$ of (19.83) which satisfy the orthogonality property

$\displaystyle\begin{array}{rcl} \hat{\boldsymbol{Q}}_{I}^{+}(\boldsymbol{k}) \cdot \hat{\boldsymbol{Q}} _{ J}(\boldsymbol{k})& \equiv & \sum _{A}\hat{\boldsymbol{Q}} _{I,A}^{+}(\boldsymbol{k}) \cdot \hat{\boldsymbol{Q}} _{ J,A}(\boldsymbol{k}) \equiv \sum _{i,A}\hat{Q} _{I,A}^{i+}(\boldsymbol{k})\hat{Q} _{ J,A}^{i}(\boldsymbol{k}) \\ & =& \delta _{IJ}. {}\end{array}$

(19.84)

The hermiticity and transposition properties imply that we have as a consequence of (19.83) for the normalized solutions,

$\displaystyle{ \sum _{B}\underline{\tilde{\Omega }^{2}}_{ A,B}(\boldsymbol{k}) \cdot \hat{\boldsymbol{Q}} _{I,B}(\boldsymbol{k}) =\omega _{ I}^{2}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,A}(\boldsymbol{k}), }$

(19.85)

also the equations

$\displaystyle{ \sum _{B}\underline{\tilde{\Omega }^{2}}_{ A,B}(-\boldsymbol{k}) \cdot \hat{\boldsymbol{Q}} _{I,B}^{+}(\boldsymbol{k}) =\omega _{ I}^{2}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{k}) }$

(19.86)

and

$\displaystyle{ \sum _{A}\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k}) \cdot \underline{\tilde{ \Omega }^{2}}_{ A,B}(-\boldsymbol{k}) =\omega _{ I}^{2}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,B}(\boldsymbol{k}). }$

(19.87)

Up to linear combinations within degeneracy subspaces, the general set of solutions of the conditions (19.83) will then have the form

$\displaystyle{\tilde{\boldsymbol{Q}}_{\boldsymbol{k}}(\omega )=\sum _{I}\!\left (q_{I}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k})\delta (\omega -\omega _{I}(\boldsymbol{k}))+q_{I}^{+}(-\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I}^{+}(-\boldsymbol{k})\delta (\omega +\omega _{ I}(-\boldsymbol{k}))\right )}$

with complex factors $q_{I}(\boldsymbol{k})$ . This yields the general lattice vibration in terms of the orthonormalized solutions of (19.83),

$\displaystyle\begin{array}{rcl} \boldsymbol{x}_{\boldsymbol{\ell},A}(t)& =& \frac{V } {(2\pi )^{3}\sqrt{m_{A}}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\sum _{ I}\left [q_{I}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k})\exp \!\Big(\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{I}(\boldsymbol{k})t]\Big)\right. \\ & & +\left.q_{I}^{+}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{k})\exp \!\Big(-\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{ I}(\boldsymbol{k})t]\Big)\right ], {}\end{array}$

(19.88)

$\displaystyle\begin{array}{rcl} \dot{\boldsymbol{x}}_{\boldsymbol{\ell},A}(t)& =& \int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\sum _{ I} \frac{-\mathrm{i}\omega _{I}(\boldsymbol{k})V } {(2\pi )^{3}\sqrt{m_{A}}}\left [q_{I}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k})\exp \!\Big(\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{I}(\boldsymbol{k})t]\Big)\right. \\ & & -\left.q_{I}^{+}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{k})\exp \!\Big(-\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{ I}(\boldsymbol{k})t]\Big)\right ], {}\end{array}$

(19.89)

$\displaystyle\begin{array}{rcl} q_{I}(\boldsymbol{k})& =& \frac{1} {2}\sum _{\boldsymbol{\ell},A}\exp \!\Big(-\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{I}(\boldsymbol{k})t]\Big)\sqrt{m_{A}} \\ & & \times \hat{\boldsymbol{Q}} _{I,A}^{+}(\boldsymbol{k}) \cdot \!\left (\boldsymbol{x}_{\boldsymbol{\ell},A}(t) + \frac{\mathrm{i}} {\omega _{I}(\boldsymbol{k})}\dot{\boldsymbol{x}}_{\boldsymbol{\ell},A}(t)\right ).{}\end{array}$

(19.90)

The dual orthogonality relation to (19.84) follows from re-substitution of $q_{I}(\boldsymbol{k})$ into (19.88),

$\displaystyle{ \frac{V } {(2\pi )^{3}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\sum _{ I}\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k}) \otimes \hat{\boldsymbol{Q}} _{I,B}^{+}(\boldsymbol{k})\exp [\mathrm{i}\boldsymbol{k} \cdot (\boldsymbol{\ell}-\boldsymbol{\ell}')] =\delta _{ AB}\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'}\underline{1}. }$

(19.91)

This is actually fulfilled due to two more fundamental completeness relations. The first relation is completeness of 3N orthonormal unit vectors $\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k}) \equiv \{ \hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k})\}_{1\leq A\leq N}$ in a 3N-dimensional vector space,

$\displaystyle{\sum _{I}\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k}) \otimes \hat{\boldsymbol{Q}} _{I}^{+}(\boldsymbol{k}) =\underline{ 1},}$

where 1 is the 3N × 3N unit matrix, or if the atomic indices are spelled out,

$\displaystyle{ \sum _{I}\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k}) \otimes \hat{\boldsymbol{Q}} _{I,B}^{+}(\boldsymbol{k}) =\delta _{ AB}\underline{1}, }$

(19.92)

where now 1 is the 3 × 3 unit matrix referring to the spatial indices. The second relation is the completeness relation (19.36).

The canonical quantization relations

$\displaystyle\begin{array}{rcl} & & [x_{\boldsymbol{\ell},A}^{i}(t),\dot{x}_{\boldsymbol{\ell}',B}^{j}(t)] = \frac{\mathrm{i}\hbar } {m_{A}}\delta _{AB}\delta _{\boldsymbol{\ell},\boldsymbol{\ell}'}\delta ^{ij}, {}\\ & & [x_{\boldsymbol{\ell},A}^{i}(t),x_{\boldsymbol{\ell}',B}^{j}(t)] = 0,\quad [\dot{x}_{\boldsymbol{\ell},A}^{i}(t),\dot{x}_{\boldsymbol{\ell}',B}^{j}(t)] = 0, {}\\ \end{array}$

imply

$\displaystyle{[q_{I}(\boldsymbol{k}),q_{J}(\boldsymbol{k}')] = 0,\quad [q_{I}(\boldsymbol{k}),q_{J}^{+}(\boldsymbol{k}')] = \frac{\hbar } {2\omega _{I}(\boldsymbol{k})} \frac{(2\pi )^{3}} {V } \delta _{IJ}\delta (\boldsymbol{k} -\boldsymbol{ k}'),}$

i.e. the phonon annihilation operator for the Ith mode with wave vector $\boldsymbol{k}$ in the lattice is

$\displaystyle{ a_{I}(\boldsymbol{k}) = \frac{1} {2}\sqrt{\frac{\omega _{I } (\boldsymbol{k})V } {\pi ^{3}\hbar }} q_{I}(\boldsymbol{k}), }$

(19.93)

and the displacement operators in terms of the phonon operators are given by

$\displaystyle{ \boldsymbol{x}_{\boldsymbol{\ell},A}(t) =\sum _{I}\boldsymbol{x}_{I,\boldsymbol{\ell},A}(t), }$

(19.94)

with

$\displaystyle\begin{array}{rcl} \boldsymbol{x}_{I,\boldsymbol{\ell},A}(t)& =& \sqrt{ \frac{\hbar V } {(2\pi )^{3}m_{A}}}\int _{\mathcal{B}}\! \frac{d^{3}\boldsymbol{k}} {\sqrt{2\omega _{I } (\boldsymbol{k})}}\left [a_{I}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k})\exp \!\Big(\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{I}(\boldsymbol{k})t]\Big)\right. \\ & & +\left.a_{I}^{+}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{k})\exp \!\Big(-\mathrm{i}[\boldsymbol{k} \cdot \boldsymbol{\ell}-\omega _{ I}(\boldsymbol{k})t]\Big)\right ]. {}\end{array}$

(19.95)

The Lagrange function (19.78) implies a Hamiltonian for the lattice vibrations,

$\displaystyle{H = \frac{1} {2}\sum _{\boldsymbol{\ell},A}m_{A}\dot{\boldsymbol{x}}_{\boldsymbol{\ell},A}^{2} + \frac{1} {2}\sum _{\boldsymbol{\ell},A;\boldsymbol{\ell}',A'}\sqrt{m_{A } m_{A'}}\boldsymbol{x}_{\boldsymbol{\ell},A} \cdot \underline{ \Omega ^{2}}_{ \boldsymbol{\ell},A;\boldsymbol{\ell}',A'} \cdot \boldsymbol{ x}_{\boldsymbol{\ell}',A'}.}$

This yields after substitution of equations (19.94, 19.95) and use of the eigenvalue, hermiticity and orthogonality conditions for the eigenvalue problem (19.85–19.87) the result¹³

$\displaystyle{ H =\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\sum _{ I}\hbar \omega _{I}(\boldsymbol{k})a_{I}^{+}(\boldsymbol{k})a_{ I}(\boldsymbol{k}). }$

(19.96)

It is uncommon, but helpful for a better understanding of Bloch and Wannier states of electrons, to point out an analogy with lattice vibrations at this point.

We have seen in Sections 10.1, 10.2 and 10.3 that electrons in lattices can be described in terms of delocalized Bloch states $\psi _{n}(k,x,t) =\psi _{n}(k,x)\exp [-\mathrm{i}\omega _{n}(k)t]$ or corresponding Wannier states w _n, ν (x), w _n, ν (x, t). Here ν labelled the different cells in the lattice and n labelled the different electron energy bands in the periodic potential of the crystal. We have encountered the corresponding states in three-dimensional lattices in equations (19.43, 19.46). To make the connection to lattice vibrations, we re-express the result (19.88) for the particular phonon energy band I in the form

$\displaystyle{\boldsymbol{x}_{I,\boldsymbol{\ell},A}(t) = \frac{V } {(2\pi )^{3}}\int \!d^{3}\boldsymbol{k}\,\tilde{\boldsymbol{x}}_{ I,\boldsymbol{k},A}(t)\exp (\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}),}$

$\displaystyle\begin{array}{rcl} \tilde{\boldsymbol{x}}_{I,\boldsymbol{k},A}(t)& =& \sum _{\boldsymbol{\ell}}\boldsymbol{x}_{I,\boldsymbol{\ell},A}(t)\exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{\ell}) =\tilde{\boldsymbol{ x}}_{I,-\boldsymbol{k},A}^{+}(t) {}\\ & =& \frac{q_{I}(\boldsymbol{k})} {\sqrt{m_{ A}}} \hat{\boldsymbol{Q}}_{I,A}(\boldsymbol{k})\exp [-\mathrm{i}\omega _{I}(\boldsymbol{k})t] {}\\ & & +\,\frac{q_{I}^{+}(-\boldsymbol{k})} {\sqrt{m_{A}}} \hat{\boldsymbol{Q}} _{I,A}^{+}(-\boldsymbol{k})\exp [\mathrm{i}\omega _{ I}(-\boldsymbol{k})t]. {}\\ \end{array}$

Instead of the continuous dependence of the Bloch or Wannier type wave functions $\phi _{n}(\boldsymbol{k},\boldsymbol{x},t) \sim \exp (\mathrm{i}\boldsymbol{k} \cdot \boldsymbol{ x})u_{n}(\boldsymbol{k},\boldsymbol{x},t)$ and $w_{n}(\boldsymbol{\ell},\boldsymbol{x},t)$ on location $\boldsymbol{x}$ , we have displacement variables at the discrete locations $\{\boldsymbol{\ell},A\}$ in the lattice. However with the correspondence of band indices $n \leftrightarrow I$ , the Brillouin zone representation $\tilde{\boldsymbol{x}}_{I,\boldsymbol{k},A}(t)$ of the displacements corresponds to the Bloch waves (19.43) for electron states, while the set of displacements $\{\boldsymbol{x}_{I,\boldsymbol{\ell},A}(t)\}_{1\leq A\leq N}$ in the unit cell at $\boldsymbol{\ell}$ corresponds to the Wannier states (19.46).

19.7 Electron-phonon interactions

Phonons in the lattice of a solid material naturally couple to electrons through the electrostatic interaction between the electrons and the ion cores. If we neglect electron-electron interactions, the basic Schrödinger picture Hamiltonian for quantized electrons in a lattice of ion cores with N atoms in the unit cell has the form

$\displaystyle{H = -\int \!d^{3}\boldsymbol{x}\sum _{\sigma }\psi _{\sigma }^{+}(\boldsymbol{x})\left ( \frac{\hbar ^{2}} {2m}\Delta +\sum _{\boldsymbol{\ell},A} \frac{n_{A}e^{2}} {4\pi \epsilon _{0}\left \vert \boldsymbol{x} -\boldsymbol{ r}_{\boldsymbol{\ell},A}\right \vert }\right )\psi _{\sigma }(\boldsymbol{x}).}$

We assume that the A-th atom or ion in the unit cell couples to the electron with an effective charge n _A e, and we treat the atoms or ions as classical sources of electrostatic fields. However, we treat the lattice vibrations on the quantum level, which according to Sections 19.5 and 19.6 amounts to canonical quantization of the lattice displacements

$\displaystyle{\boldsymbol{x}_{\boldsymbol{\ell},A} =\boldsymbol{ r}_{\boldsymbol{\ell},A} -\boldsymbol{ x}_{\boldsymbol{\ell},A}^{(0)}.}$

The leading order expansion of the Coulomb term

$\displaystyle{ \frac{n_{A}e} {\vert \boldsymbol{x} -\boldsymbol{ r}_{\boldsymbol{\ell},A}\vert } \simeq \frac{n_{A}e} {\vert \boldsymbol{x} -\boldsymbol{ x}_{\boldsymbol{\ell},A}^{(0)}\vert } + n_{A}e\frac{(\boldsymbol{x} -\boldsymbol{ x}_{\boldsymbol{\ell},A}^{(0)}) \cdot \boldsymbol{ x}_{\boldsymbol{\ell},A}} {\vert \boldsymbol{x} -\boldsymbol{ x}_{\boldsymbol{\ell},A}^{(0)}\vert ^{3}} }$

(19.97)

corresponds to a dipole approximation in the language of Chapter 15, except that here the dipole operator $\boldsymbol{d}_{\boldsymbol{\ell},A} = n_{A}e\boldsymbol{x}_{\boldsymbol{\ell},A}$ is quantized according to (19.94, 19.95). This yields an electron-phonon interaction Hamiltonian of the form

$\displaystyle\begin{array}{rcl} H_{e-q}& =& -\,\sqrt{ \frac{\hbar V } {(2\pi )^{3}}}\int \!d^{3}\boldsymbol{x}\sum _{\sigma,I,\boldsymbol{\ell},A}\psi _{\sigma }^{+}(\boldsymbol{x})\psi _{\sigma }(\boldsymbol{x})\int _{ \mathcal{B}}\! \frac{d^{3}\boldsymbol{q}} {\sqrt{2\omega _{I } (\boldsymbol{q})}} \frac{e} {\sqrt{m_{A}}} {}\\ & & \times \,\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x}) \cdot \!\left [a_{I}(\boldsymbol{q})\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{q})\exp (\mathrm{i}\boldsymbol{q}\cdot \boldsymbol{\ell})+a_{I}^{+}(\boldsymbol{q})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{q})\exp (-\mathrm{i}\boldsymbol{q}\cdot \boldsymbol{\ell})\right ], {}\\ \end{array}$

where we substituted the time-independent phonon operators $\boldsymbol{x}_{\boldsymbol{\ell},A}(0)$ for the Hamiltonian in the Schrödinger picture. For the electron operators, we could substitute Bloch or Wannier type operators. However, Bloch operators make much more sense, because the dipole approximation (19.97) is a small oscillation approximation in the sense $\vert \boldsymbol{x}_{\boldsymbol{\ell},A}\vert \ll \vert \boldsymbol{x} -\boldsymbol{ x}_{\boldsymbol{\ell},A}^{(0)}\vert$ , or otherwise we should include quadrupole and higher order terms. This implies that matrix elements of electron states with the lattice electric fields $\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x})$ must not be dominated by large terms from the ion cores. The linear phonon coupling Hamiltonian H _e−q should therefore not be a good approximation for the localized electrons in Wannier states. Evaluation of the substitution of the free electron operators through Bloch operators (19.41) in H _e−q uses the fact that integration over $\boldsymbol{x}$ can be split into summation over the lattice $\boldsymbol{l}$ and integration over the unit lattice cell V,

$\displaystyle{\int \!d^{3}\boldsymbol{x}\,f(\boldsymbol{x}) =\sum _{\boldsymbol{ l}}\int _{V }\!d^{3}\boldsymbol{x}\,f(\boldsymbol{l} +\boldsymbol{ x}),}$

and that the lattice electric fields satisfy

$\displaystyle{\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x}) =\boldsymbol{ E}_{\boldsymbol{0},A}(\boldsymbol{x}-\boldsymbol{\ell}).}$
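The splitting of the $\boldsymbol{x}$ integration into a lattice sum and a unit cell integral can be illustrated numerically. The following is only a one-dimensional sketch under assumptions that are not part of the text: a Gaussian test function, lattice constant `a = 1`, a finite number of cells, and a midpoint rule inside each cell. The function name `integral_by_cells` is hypothetical.

```python
import math

def integral_by_cells(f, a, n_cells, n_sub=200):
    """Approximate the integral of f over [-n_cells*a, n_cells*a]
    as a sum over lattice sites l of integrals over one cell [0, a]."""
    h = a / n_sub  # step of the midpoint rule inside a single cell
    total = 0.0
    for cell in range(-n_cells, n_cells):
        l = cell * a  # lattice site labelling the cell
        # midpoint rule for the cell integral of f(l + x), 0 <= x <= a
        total += sum(f(l + (j + 0.5) * h) for j in range(n_sub)) * h
    return total

def gauss(x):
    """Assumed test function; its integral over the real line is sqrt(pi)."""
    return math.exp(-x * x)

approx = integral_by_cells(gauss, a=1.0, n_cells=10)
print(approx, math.sqrt(math.pi))
```

The sum over cells reproduces the integral over the full line up to the truncation of the lattice sum and the quadrature error, both of which are negligible for this rapidly decaying test function.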

We denote the Bloch operators for the electrons by $c_{n,\sigma }(\boldsymbol{k})$ to avoid confusion with the phonon operators. This yields the following form for the electron-phonon interaction operator,

$\displaystyle\begin{array}{rcl} H_{e-q}& =& -\,\sqrt{ \frac{\hbar V } {(2\pi )^{3}}}\sum _{\sigma,I,\boldsymbol{\ell},A,n,n'}\int _{\mathcal{B}}\! \frac{d^{3}\boldsymbol{q}} {\sqrt{2\omega _{I } (\boldsymbol{q})}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\int _{ V }\!d^{3}\boldsymbol{x} \frac{e} {\sqrt{m_{A}}} \\ & & \boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x}) \cdot \!\left [u_{n}^{+}(\boldsymbol{k} +\boldsymbol{ q},\boldsymbol{x})c_{ n,\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})a_{ I}(\boldsymbol{q})\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{q})\exp [\mathrm{i}\boldsymbol{q} \cdot (\boldsymbol{\ell}-\boldsymbol{x})]\right. \\ & & +\left.u_{n}^{+}(\boldsymbol{k} -\boldsymbol{ q},\boldsymbol{x})c_{ n,\sigma }^{+}(\boldsymbol{k} -\boldsymbol{ q})a_{ I}^{+}(\boldsymbol{q})\hat{\boldsymbol{Q}} _{ I,A}^{+}(\boldsymbol{q})\exp [\mathrm{i}\boldsymbol{q} \cdot (\boldsymbol{x}-\boldsymbol{\ell})]\right ] \\ & & \times c_{n',\sigma }(\boldsymbol{k})u_{n'}(\boldsymbol{k},\boldsymbol{x}). {}\end{array}$

(19.98)

We can also write this as

$\displaystyle\begin{array}{rcl} H_{e-q}& =& \sum _{\sigma,I}\int _{\mathcal{B}}\! \frac{d^{3}\boldsymbol{q}} {\sqrt{2\omega _{I } (\boldsymbol{q})}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\left [c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q}) \cdot \underline{ U}_{ I}(\boldsymbol{k},\boldsymbol{q}) \cdot c_{\sigma }(\boldsymbol{k})a_{I}(\boldsymbol{q})\right. \\ & & +\left.a_{I}^{+}(\boldsymbol{q})c_{\sigma }^{+}(\boldsymbol{k}) \cdot \underline{ U}_{ I}^{+}(\boldsymbol{k},\boldsymbol{q}) \cdot c_{\sigma }(\boldsymbol{k} +\boldsymbol{ q})\right ], {}\end{array}$

(19.99)

with coupling matrices between the phonons and the Bloch electrons,

$\displaystyle\begin{array}{rcl} U_{I,n,n'}(\boldsymbol{k},\boldsymbol{q})& =& -\,\sqrt{ \frac{\hbar V } {(2\pi )^{3}}}\int _{V }\!d^{3}\boldsymbol{x}\sum _{\boldsymbol{\ell},A} \frac{e} {\sqrt{m_{A}}}\exp [\mathrm{i}\boldsymbol{q} \cdot (\boldsymbol{\ell}-\boldsymbol{x})] \\ & & \times u_{n}^{+}(\boldsymbol{k} +\boldsymbol{ q},\boldsymbol{x})\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x}) \cdot \hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{q})u_{n'}(\boldsymbol{k},\boldsymbol{x}).{}\end{array}$

(19.100)

The products in (19.99) contain summations over the electron energy band indices n, n′.

Below we will need the following property of the electron-phonon coupling functions,

$\displaystyle{ U_{I,n,n'}(\boldsymbol{k} +\boldsymbol{ q},-\boldsymbol{q}) = U_{I,n',n}^{+}(\boldsymbol{k},\boldsymbol{q}). }$

(19.101)
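A brief sketch of why (19.101) holds, under two assumptions that are not spelled out in this section: the superscript + on a matrix element denotes complex conjugation of that element (as it does for the Bloch factors $u_{n}^{+}$), and the phonon polarization vectors are chosen with the phase convention $\hat{\boldsymbol{Q}} _{I,A}(-\boldsymbol{q}) =\hat{\boldsymbol{ Q}} _{I,A}^{+}(\boldsymbol{q})$. Substituting $\boldsymbol{k} \rightarrow \boldsymbol{ k} +\boldsymbol{ q}$, $\boldsymbol{q} \rightarrow -\boldsymbol{q}$ in (19.100) gives

$\displaystyle\begin{array}{rcl} U_{I,n,n'}(\boldsymbol{k} +\boldsymbol{ q},-\boldsymbol{q})& =& -\,\sqrt{ \frac{\hbar V } {(2\pi )^{3}}}\int _{V }\!d^{3}\boldsymbol{x}\sum _{\boldsymbol{\ell},A} \frac{e} {\sqrt{m_{A}}}\exp [-\mathrm{i}\boldsymbol{q} \cdot (\boldsymbol{\ell}-\boldsymbol{x})] \\ & & \times u_{n}^{+}(\boldsymbol{k},\boldsymbol{x})\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x}) \cdot \hat{\boldsymbol{Q}} _{I,A}(-\boldsymbol{q})u_{n'}(\boldsymbol{k} +\boldsymbol{ q},\boldsymbol{x}).\end{array}$

On the other hand, complex conjugation of $U_{I,n',n}(\boldsymbol{k},\boldsymbol{q})$ in (19.100) yields the same expression with $\hat{\boldsymbol{Q}} _{I,A}(-\boldsymbol{q})$ replaced by $\hat{\boldsymbol{Q}} _{I,A}^{+}(\boldsymbol{q})$, because the lattice electric fields $\boldsymbol{E}_{\boldsymbol{\ell},A}(\boldsymbol{x})$ are real. The two expressions therefore agree if the assumed phase convention holds.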

The full Hamiltonian also contains the free Hamiltonian for the phonons and the Bloch electrons

$\displaystyle{H_{0} =\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\left (\sum _{ I}\hbar \omega _{I}(\boldsymbol{k})a_{I}^{+}(\boldsymbol{k})a_{ I}(\boldsymbol{k}) +\sum _{\sigma }c_{\sigma }^{+}(\boldsymbol{k}) \cdot \underline{ E}(\boldsymbol{k}) \cdot c_{\sigma }(\boldsymbol{k})\right )}$

with

$\displaystyle\begin{array}{rcl} E_{n,n'}(\boldsymbol{k})& =& \frac{\hbar ^{2}} {2m}\left (\int _{V }\!d^{3}\boldsymbol{x}\,\boldsymbol{\nabla }u_{ n}^{+}(\boldsymbol{k},\boldsymbol{x}) \cdot \boldsymbol{\nabla }u_{ n'}(\boldsymbol{k},\boldsymbol{x})\right. \\ & & -\left.\mathrm{i}\boldsymbol{k} \cdot \!\int _{V }\!d^{3}\boldsymbol{x}\,u_{ n}^{+}(\boldsymbol{k},\boldsymbol{x})\stackrel{\leftrightarrow }{\boldsymbol{\nabla }}u_{ n'}(\boldsymbol{k},\boldsymbol{x}) +\boldsymbol{ k}^{2}\delta _{ n,n'}\right ).{}\end{array}$

(19.102)

The two interaction terms in (19.99) describe absorption and emission of a phonon of wave number $\boldsymbol{q}$ by a Bloch electron. The resulting exchange of virtual phonons between electron pairs will generate an effective interaction between the electrons. If interband couplings can be neglected, $U_{I,n,n'}(\boldsymbol{k},\boldsymbol{q}) \propto \delta _{n,n'}$ and $E_{n,n'}(\boldsymbol{k}) \propto \delta _{n,n'}$, a simple method to estimate this phonon mediated electron-electron interaction eliminates the first order phonon coupling by means of Lemma 1 (6.22) for exponentials of operators. A unitary transformation $\boldsymbol{\vert }\Phi \boldsymbol{\rangle } \rightarrow \boldsymbol{\vert }\Phi '\boldsymbol{\rangle } =\exp (A)\boldsymbol{\vert }\Phi \boldsymbol{\rangle }$ with

$\displaystyle\begin{array}{rcl} A& =& \sum _{\sigma,I}\int _{\mathcal{B}} \frac{d^{3}\boldsymbol{q}} {\sqrt{2\omega _{I } (\boldsymbol{q})}}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\, \frac{1} {E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k}) - \hbar \omega _{I}(\boldsymbol{q})} {}\\ & & \times \left [a_{I}^{+}(\boldsymbol{q})c_{\sigma }^{+}(\boldsymbol{k})U_{ I}^{+}(\boldsymbol{k},\boldsymbol{q})c_{\sigma }(\boldsymbol{k} +\boldsymbol{ q}) - c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})U_{ I}(\boldsymbol{k},\boldsymbol{q})c_{\sigma }(\boldsymbol{k})a_{I}(\boldsymbol{q})\right ] {}\\ \end{array}$

eliminates the leading order electron-phonon coupling term due to

$\displaystyle{[A,H_{0}] + H_{e-q} = 0,}$

and generates a direct electron-electron coupling term

$\displaystyle\begin{array}{rcl} H_{e-e}^{(q)}& =& \left [\frac{1} {2}\stackrel{2}{[}\!\!A,H_{0}] + [A,H_{e-q}]\right ]_{c^{+}c^{+}cc} = \frac{1} {2}[A,H_{e-q}]\Big\vert _{c^{+}c^{+}cc} {}\\ & =& \sum _{\sigma,\sigma ',I}\int _{\mathcal{B}} \frac{d^{3}\boldsymbol{q}} {4\omega _{I}(\boldsymbol{q})}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\int _{ \mathcal{B}}\!d^{3}\boldsymbol{k}'\, \frac{1} {E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k}) - \hbar \omega _{I}(\boldsymbol{q})} {}\\ & & \times \left [c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})c_{\sigma '}^{+}(\boldsymbol{k}')U_{ I}^{+}(\boldsymbol{k}',\boldsymbol{q})U_{ I}(\boldsymbol{k},\boldsymbol{q})c_{\sigma '}(\boldsymbol{k}' +\boldsymbol{ q})c_{\sigma }(\boldsymbol{k})\right. {}\\ & & +\left.c_{\sigma }^{+}(\boldsymbol{k})c_{\sigma '}^{+}(\boldsymbol{k}' +\boldsymbol{ q})U_{ I}^{+}(\boldsymbol{k},\boldsymbol{q})U_{ I}(\boldsymbol{k}',\boldsymbol{q})c_{\sigma '}(\boldsymbol{k}')c_{\sigma }(\boldsymbol{k} +\boldsymbol{ q})\right ]. {}\\ \end{array}$

In the next step we substitute

$\displaystyle{\boldsymbol{k} \rightarrow \boldsymbol{ k} +\boldsymbol{ q},\quad \boldsymbol{k}' \rightarrow \boldsymbol{ k}' +\boldsymbol{ q},\quad \boldsymbol{q} \rightarrow -\boldsymbol{q},}$

in the second term in $H_{e-e}^{(q)}$ and use the properties (19.101) and $\omega _{I}(\boldsymbol{q}) =\omega _{I}(-\boldsymbol{q})$. This yields

$\displaystyle\begin{array}{rcl} H_{e-e}^{(q)}& =& \sum _{\sigma,\sigma ',I}\int _{\mathcal{B}} \frac{d^{3}\boldsymbol{q}} {4\omega _{I}(\boldsymbol{q})}\int _{\mathcal{B}}\!d^{3}\boldsymbol{k}\int _{ \mathcal{B}}\!d^{3}\boldsymbol{k}'\,c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})c_{\sigma '}^{+}(\boldsymbol{k}')U_{ I}^{+}(\boldsymbol{k}',\boldsymbol{q}) \\ & & \times U_{I}(\boldsymbol{k},\boldsymbol{q})c_{\sigma '}(\boldsymbol{k}' +\boldsymbol{ q})c_{\sigma }(\boldsymbol{k}) \\ & & \times \left [ \frac{1} {E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k}) - \hbar \omega _{I}(\boldsymbol{q})} - \frac{1} {E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k}) + \hbar \omega _{I}(\boldsymbol{q})}\right ] \\ & =& \frac{\hbar } {2}\sum _{\sigma,\sigma ',I}\int _{\mathcal{B}}\!d^{3}\boldsymbol{q}\int _{ \mathcal{B}}\!d^{3}\boldsymbol{k}\int _{ \mathcal{B}}\!d^{3}\boldsymbol{k}'\, \frac{1} {[E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k})]^{2} - \hbar ^{2}\omega _{I}^{2}(\boldsymbol{q})} \\ & & \times c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})c_{\sigma '}^{+}(\boldsymbol{k}')U_{ I}^{+}(\boldsymbol{k}',\boldsymbol{q})U_{ I}(\boldsymbol{k},\boldsymbol{q})c_{\sigma '}(\boldsymbol{k}' +\boldsymbol{ q})c_{\sigma }(\boldsymbol{k}). {}\end{array}$

(19.103)

Phonons with frequencies which are large compared to the electron energy difference,

$\displaystyle{\hbar \omega _{I}(\boldsymbol{q})> \left \vert E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k})\right \vert,}$

lower the energy of a two-electron state, thus implying an energetically favorable correlation between electrons. Effectively, a negative coefficient of $c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})c_{\sigma '}^{+}(\boldsymbol{k}')c_{\sigma '}(\boldsymbol{k}' +\boldsymbol{ q})c_{\sigma }(\boldsymbol{k})$ also amounts to an electron-electron attraction. Compare (19.103) with the simplified expression for free fermion operators,

$\displaystyle{H' = \Lambda \sum _{\sigma,\sigma '}\int \!d^{3}\boldsymbol{q}\int \!d^{3}\boldsymbol{k}\int \!d^{3}\boldsymbol{k}'\,c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})c_{\sigma '}^{+}(\boldsymbol{k}')c_{\sigma '}(\boldsymbol{k}' +\boldsymbol{ q})c_{\sigma }(\boldsymbol{k}).}$

In $\boldsymbol{x}$ space this becomes

$\displaystyle{H' = (2\pi )^{3}\Lambda \sum _{\sigma,\sigma '}\int \!d^{3}\boldsymbol{x}\,\psi _{\sigma }^{+}(\boldsymbol{x})\psi _{\sigma '}^{+}(\boldsymbol{x})\psi _{\sigma '}(\boldsymbol{x})\psi _{\sigma }(\boldsymbol{x}),}$

which is an attractive interaction for $\Lambda <0$ and repulsive otherwise.
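The sign of the energy denominator in (19.103) can be checked with exact rational arithmetic. The following is a minimal sketch, assuming units with $\hbar = 1$ by default and arbitrary illustrative values for the electron energy difference and the phonon energy. The function names `kernel_two_terms` and `kernel_combined` are hypothetical and only represent the scalar factors multiplying the operator products, not the full Hamiltonian.

```python
from fractions import Fraction

def kernel_two_terms(delta_e, hbar_omega, hbar=Fraction(1)):
    """Scalar factor before combining the two energy denominators:
    (1/(4*omega)) * [1/(dE - hbar*omega) - 1/(dE + hbar*omega)]."""
    omega = hbar_omega / hbar
    return (1 / (delta_e - hbar_omega) - 1 / (delta_e + hbar_omega)) / (4 * omega)

def kernel_combined(delta_e, hbar_omega, hbar=Fraction(1)):
    """Scalar factor after combining: (hbar/2) / (dE^2 - (hbar*omega)^2)."""
    return (hbar / 2) / (delta_e**2 - hbar_omega**2)

# Phonon energy larger than the electron energy difference: negative factor.
print(kernel_combined(Fraction(1), Fraction(3)))   # -1/16
# Phonon energy smaller than the electron energy difference: positive factor.
print(kernel_combined(Fraction(3), Fraction(1)))   # 1/16
```

Both forms agree, and the factor is negative exactly when $\hbar \omega _{I}(\boldsymbol{q})$ exceeds $\vert E(\boldsymbol{k} +\boldsymbol{ q}) - E(\boldsymbol{k})\vert$, which corresponds to the case $\Lambda <0$ in the simplified expression $H'$.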

The possible instability of Fermi surfaces against phonon-induced energetically favored correlations between electrons, and the ensuing suppression of electron scattering, was identified in the 1950s as the mechanism for low temperature superconductivity¹⁴. Please consult [5, 17, 22, 25] for textbook discussions of low temperature superconductivity.

19.8 Problems

19.1. Suppose we are using the Born-Oppenheimer approximation for the hydrogen atom, i.e. we treat the proton as fixed at location $\boldsymbol{X}_{p} =\boldsymbol{ 0}$. This would yield the same energy levels and energy eigenfunctions that we had found in the exact solution in Chapter 7, except that the reduced mass $\mu = m_{e}m_{p}/(m_{e} + m_{p})$ would be replaced by the electron mass $m_{e}$ in the result for the Bohr radius a, and therefore also in the energy eigenvalues and the wave functions.

Show that the corresponding change in the mass value $\delta \mu = m_{e} -\mu$ satisfies $\delta \mu /\mu = m_{e}/m_{p}$. Show also that in the center of mass frame, the neglected kinetic energy of the proton is related to the kinetic energies of the electron and of the relative motion according to

$\displaystyle{K_{p} = \frac{m_{e}} {m_{p}}K_{e} = \frac{m_{e}} {m_{e} + m_{p}}K_{r}.}$

Expand the ground state wave function in the Born-Oppenheimer approximation in first order in $m_{e}/m_{p}$ in terms of the exact energy eigenstates from Chapter 7.

19.2. Show that the hermitian symmetric matrix $\underline{\tilde{\Omega }^{2}}(\boldsymbol{k})$ (19.80) with eigenvalues $\omega _{I}^{2}(\boldsymbol{k}) \geq 0$ and corresponding normalized eigenvectors $\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k})$ has square roots $\underline{\tilde{\Omega }}(\boldsymbol{k})$ ,

$\displaystyle{\underline{\tilde{\Omega }^{2}}(\boldsymbol{k}) =\underline{\tilde{ \Omega }}^{2}(\boldsymbol{k}).}$

Hint: The column vectors $\hat{\boldsymbol{Q}} _{I}(\boldsymbol{k})$ can be used to form a unitary matrix $\underline{Q}(\boldsymbol{k})$. The matrix $\underline{Q}(\boldsymbol{k})$ transforms $\underline{\tilde{\Omega }^{2}}(\boldsymbol{k})$ into diagonal form, or in turn can be used to generate $\underline{\tilde{\Omega }^{2}}(\boldsymbol{k})$ from its diagonal form $\mathrm{diag}(\omega _{1}^{2}(\boldsymbol{k}),\ldots,\omega _{3N}^{2}(\boldsymbol{k}))$. Use this observation to construct all the possible square roots $\underline{\tilde{\Omega }}(\boldsymbol{k})$ in terms of $\underline{Q}(\boldsymbol{k})$ and $\mathrm{diag}(\pm \omega _{1}(\boldsymbol{k}),\ldots,\pm \omega _{3N}(\boldsymbol{k}))$.

19.3. Suppose the three particles with masses m and M in Figure 19.8 can only move in one dimension.

Fig. 19.8

Three particles with masses m and M. It is supposed that the particles can only move along the line connecting them

The potential energy of the system is

$\displaystyle{V = \frac{K} {2} (x_{1} - x_{2})^{2} + \frac{K} {2} (x_{2} - x_{3})^{2}.}$

Calculate the eigenvibrations and the eigenfrequencies of the system.

Solution.

The potential in matrix notation is

$\displaystyle{V = \frac{K} {2} (x_{1},x_{2},x_{3})\left (\begin{array}{ccc} \,\,\,\,1 & - 1& \,\,\,\,0\\ - 1 & \,\,\,\,2 & -1 \\ \,\,\,\,0 & - 1& \,\,\,\,1\\ \end{array} \right )\left (\begin{array}{c} x_{1} \\ x_{2} \\ x_{3}\\ \end{array} \right ),}$

and we have to find the eigenvectors of the corresponding matrix

$\displaystyle{ \underline{\Omega ^{2}} = K\left (\begin{array}{ccc} \,\,\,\, \frac{1} {m} & - \frac{1} {\sqrt{mM}} & \,\,\,\,0 \\ - \frac{1} {\sqrt{mM}} & \,\,\,\, \frac{2} {M} & - \frac{1} {\sqrt{mM}} \\ \,\,\,\,0 & - \frac{1} {\sqrt{mM}} & \,\,\,\, \frac{1} {m}\\ \end{array} \right ), }$

(19.104)

cf. (19.62).

Rather than trying to solve

$\displaystyle{\mbox{ det}(\underline{\Omega ^{2}} -\omega ^{2}\underline{\boldsymbol{1}}) = 0,}$

we can infer two eigenmodes from the translation and reflection symmetry of the system.

Invariance of the potential under translations $x_{1} = x_{2} = x_{3}$ implies that one eigenvector of $\underline{\Omega ^{2}}$ has the form

$\displaystyle{\hat{\boldsymbol{Q}} _{\omega _{1}=0} = \frac{1} {\sqrt{2m + M}}\left (\begin{array}{c} \sqrt{m} \\ \sqrt{M} \\ \sqrt{m}\\ \end{array} \right ).}$

Reflection symmetry also suggests an eigenmode $x_{1} = -x_{3}$, $x_{2} = 0$,

$\displaystyle{\hat{\boldsymbol{Q}}_{\omega _{2}} = \frac{1} {\sqrt{2}}\left (\begin{array}{c} \,\,\,\,1\\ \,\,\,\,0 \\ - 1\\ \end{array} \right ),}$

and application of $\underline{\Omega ^{2}}$ yields the corresponding eigenvalue

$\displaystyle{\omega _{2}^{2} = \frac{K} {m}.}$

The remaining eigenvector follows from orthogonality to $\hat{\boldsymbol{Q}}_{\omega _{ 1}}$ and $\hat{\boldsymbol{Q}}_{\omega _{ 2}}$,

$\displaystyle{\hat{\boldsymbol{Q}}_{\omega _{3}} = \frac{1} {\sqrt{2(2m + M)}}\left (\begin{array}{c} \,\,\,\,\sqrt{M} \\ - 2\sqrt{m} \\ \,\,\,\,\sqrt{M}\\ \end{array} \right ),}$

and application of $\underline{\Omega ^{2}}$ confirms that this is an eigenmode with frequency

$\displaystyle{\omega _{3}^{2} = \frac{K} {m} + \frac{2K} {M}.}$

For the actual eigenvibration we have to go back to the amplitude vector $\boldsymbol{a}_{\omega _{3}}$ (19.60), because different masses participate in the oscillation. The normalized amplitude vector (19.65) is

$\displaystyle{\hat{\boldsymbol{a}}_{\omega _{3}} = \frac{1} {\sqrt{2mM(2m + M)}}\left (\begin{array}{c} \,\,\,\,M\\ - 2m \\ \,\,\,\,M\\ \end{array} \right ).}$

The eigenvibrations $\boldsymbol{a}_{\omega _{2}}$ and $\boldsymbol{a}_{\omega _{3}}$ are shown in Figure 19.9.

Fig. 19.9

The eigenvibrations $\boldsymbol{a}_{\omega _{2}}$ and $\boldsymbol{a}_{\omega _{3}}$
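The eigenvectors and eigenfrequencies found above can be checked numerically by applying the matrix (19.104) to each vector. The following is a minimal sketch with arbitrarily chosen values for m, M and K. The helper names `omega_squared`, `matvec`, `modes` and `max_residual` are hypothetical and introduced only for this check.

```python
import math

def omega_squared(m, M, K):
    """Mass-weighted force matrix of equation (19.104)."""
    s = 1.0 / math.sqrt(m * M)
    return [[K / m, -K * s, 0.0],
            [-K * s, 2.0 * K / M, -K * s],
            [0.0, -K * s, K / m]]

def matvec(A, v):
    """Product of a matrix (list of rows) with a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def modes(m, M, K):
    """Pairs (omega^2, Q) as stated in the solution of Problem 19.3."""
    n1 = math.sqrt(2 * m + M)
    n3 = math.sqrt(2 * (2 * m + M))
    return [
        (0.0, [math.sqrt(m) / n1, math.sqrt(M) / n1, math.sqrt(m) / n1]),
        (K / m, [1 / math.sqrt(2), 0.0, -1 / math.sqrt(2)]),
        (K / m + 2 * K / M,
         [math.sqrt(M) / n3, -2 * math.sqrt(m) / n3, math.sqrt(M) / n3]),
    ]

def max_residual(m, M, K):
    """Largest component of Omega^2 Q - omega^2 Q over the three modes."""
    A = omega_squared(m, M, K)
    worst = 0.0
    for w2, q in modes(m, M, K):
        Aq = matvec(A, q)
        worst = max(worst, max(abs(a - w2 * x) for a, x in zip(Aq, q)))
    return worst

print(max_residual(1.0, 3.0, 2.0))  # should vanish up to rounding errors
```

A residual at the level of floating point rounding for several choices of the parameters is consistent with the three modes being eigenvectors of (19.104) with the stated eigenvalues.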

19.4. Calculate the positive semi-definite square root of the matrix $\underline{\Omega ^{2}}$ in equation (19.104). Use the hint from Problem 19.2.

Answer.

$\displaystyle\begin{array}{rcl} \underline{\Omega }& =& \frac{\sqrt{K}} {2\sqrt{mM(2m + M)}} {}\\ & & \times \left (\begin{array}{ccc} M + \sqrt{M(2m + M)}&\,\,\, -\, 2\sqrt{mM}&\,\,\,M -\sqrt{M(2m + M)} \\ -\, 2\sqrt{mM} & \,\,\,4m & \,\,\, -\, 2\sqrt{mM} \\ M -\sqrt{M(2m + M)}&\,\,\, -\, 2\sqrt{mM}&\,\,\,M + \sqrt{M(2m + M)}\\ \end{array} \right ).{}\\ \end{array}$
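The stated answer can be checked by squaring it numerically and comparing with (19.104). The following is a minimal sketch with arbitrarily chosen parameter values. The helper names `omega_squared`, `omega_root`, `matmul` and `max_deviation` are hypothetical.

```python
import math

def omega_squared(m, M, K):
    """Mass-weighted force matrix of equation (19.104)."""
    s = 1.0 / math.sqrt(m * M)
    return [[K / m, -K * s, 0.0],
            [-K * s, 2.0 * K / M, -K * s],
            [0.0, -K * s, K / m]]

def omega_root(m, M, K):
    """Square root matrix as given in the answer to Problem 19.4."""
    r = math.sqrt(M * (2 * m + M))
    s = 2.0 * math.sqrt(m * M)
    pref = math.sqrt(K) / (2.0 * math.sqrt(m * M * (2 * m + M)))
    rows = [[M + r, -s, M - r],
            [-s, 4.0 * m, -s],
            [M - r, -s, M + r]]
    return [[pref * x for x in row] for row in rows]

def matmul(A, B):
    """Product of two 3x3 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def max_deviation(m, M, K):
    """Largest entry of Omega * Omega - Omega^2."""
    root = omega_root(m, M, K)
    squared = matmul(root, root)
    target = omega_squared(m, M, K)
    return max(abs(squared[i][j] - target[i][j])
               for i in range(3) for j in range(3))

print(max_deviation(1.0, 3.0, 2.0))  # should vanish up to rounding errors
```

This only confirms that the matrix squares to (19.104). That it is the positive semi-definite root follows from its construction with the non-negative frequencies, as suggested by the hint.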

19.5. The electron-phonon interaction Hamiltonian (19.99) is very similar to the electron-photon interaction Hamiltonian in the representation (18.92),

$\displaystyle\begin{array}{rcl} H_{e-\gamma }& =& \frac{e\hbar c} {m_{e}} \sqrt{ \frac{\hbar \mu _{0 } } {(2\pi )^{3}}}\sum _{\sigma,\alpha }\int \! \frac{d^{3}\boldsymbol{q}} {\sqrt{2\omega (\boldsymbol{q})}}\int \!d^{3}\boldsymbol{k}\,\boldsymbol{k} \cdot \boldsymbol{\epsilon }_{\alpha }(\boldsymbol{q}) \\ & & \times \left [c_{\sigma }^{+}(\boldsymbol{k} +\boldsymbol{ q})a_{\alpha }(\boldsymbol{q})c_{\sigma }(\boldsymbol{k}) + c_{\sigma }^{+}(\boldsymbol{k})a_{\alpha }^{+}(\boldsymbol{q})c_{\sigma }(\boldsymbol{k} +\boldsymbol{ q})\right ].{}\end{array}$

(19.105)

Which effective electron-electron interaction Hamiltonian $H_{e-e}^{(\gamma )}$ would you get if you eliminate the photon operators through a unitary transformation $\boldsymbol{\vert }\Phi \boldsymbol{\rangle } \rightarrow \boldsymbol{\vert }\Phi '\boldsymbol{\rangle } =\exp (A)\boldsymbol{\vert }\Phi \boldsymbol{\rangle }$ similar to the transformation that we performed to transform $H_{e-q}$ into $H_{e-e}^{(q)}$ (19.103)?

Bibliography

1. M. Abramowitz, I.A. Stegun (eds.), Handbook of Mathematical Functions, 10th printing (Wiley, New York, 1972)

5. J. Callaway, Quantum Theory of the Solid State (Academic Press, Boston, 1991)

11. P. Fulde, Electron Correlations in Molecules and Solids, 2nd edn. (Springer, Berlin, 1993)

17. H. Ibach, H. Lüth, Solid State Physics – An Introduction to Principles of Materials Science, 3rd edn. (Springer, Berlin, 2003)

22. C. Kittel, Quantum Theory of Solids, 2nd edn. (Wiley, New York, 1987)

25. O. Madelung, Introduction to Solid-State Theory (Springer, Berlin, 1978)

Footnotes

1. Ø. Burrau, Naturwissenschaften 15, 16 (1927); K. Danske Vidensk. Selsk., Mat.-fys. Medd. 7(14) (1927).

2. W. Heitler, F. London, Z. Phys. 44, 455 (1927).

3. A.H. Wilson, Proc. Roy. Soc. London A 118, 617, 635 (1928); E. Teller, Z. Phys. 61, 458 (1930); E.A. Hylleraas, Z. Phys. 71, 739 (1931); G. Jaffé, Z. Phys. 87, 535 (1934).

4. See e.g. G. Hunter, H.O. Pritchard, J. Chem. Phys. 46, 2146 (1967); M. Aubert, N. Bessis, G. Bessis, Phys. Rev. A 10, 51 (1974); T.C. Scott, M. Aubert-Frécon, J. Grotendorst, Chem. Phys. 324, 323 (2006).

5. B. Grémaud, D. Delande, N. Billy, J. Phys. B 31, 383 (1998); M.M. Cassar, G.W.F. Drake, J. Phys. B 37, 2485 (2004); H. Li, J. Wu, B.-L. Zhou, J.-M. Zhu, Z.-C. Yan, Phys. Rev. A 75, 012504 (2007).

6. We would have to be more careful if we discussed expectation values, because exchange integrals appear in the expectation values of potential terms, see Section 17.7.

7. M. Born, J.R. Oppenheimer, Annalen Phys. 84, 457 (1927).

8. M. Aubert, N. Bessis, G. Bessis, Phys. Rev. A 10, 51 (1974).

9. T. Kato, Commun. Pure Appl. Math. 10, 151 (1957). See also R.T. Pack, W.B. Brown, J. Chem. Phys. 45, 556 (1966) and Á. Nagy, C. Amovilli, Phys. Rev. A 82, 042510 (2010).

10. We have seen the corresponding one-dimensional equations in (10.1–10.4). However, when comparing equations (19.36) and (19.37) with (10.1–10.4) please keep in mind that the continuous variables $\kappa _{i}$ play the role of x there, while the discrete lattice sites $\boldsymbol{\ell}= n^{i}\boldsymbol{a}_{i}$ compare to the discrete momenta $2\pi n/a$ in equations (10.1–10.4), see also (10.12).

11. J. Hubbard, Proc. Roy. Soc. London A 276, 238 (1963), see also M.C. Gutzwiller, Phys. Rev. Lett. 10, 159 (1963).

12. See e.g. J.E. Hirsch, Phys. Rev. B 31, 4403 (1985); I. Affleck, J.B. Marston, Phys. Rev. B 37, 3774 (1988); Y.M. Vilk, A.-M.S. Tremblay, J. Physique I 7, 1309 (1997). More comprehensive textbook discussions can be found in references [5, 11].

13. You also have to use that the matrix $\underline{\tilde{\Omega }^{2}}(\boldsymbol{k})$ has a positive semi-definite square root $\underline{\tilde{\Omega }}(\boldsymbol{k})$, see Problem 19.2. Therefore we also have e.g.

$\displaystyle{\sum _{A,B}\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k}) \cdot \underline{\tilde{ \Omega }^{2}}_{ A,B}(-\boldsymbol{k}) \cdot \hat{\boldsymbol{Q}} _{J,B}(-\boldsymbol{k}) =\omega _{I}(\boldsymbol{k})\omega _{J}(-\boldsymbol{k})\sum _{A}\hat{\boldsymbol{Q}} _{I,A}(\boldsymbol{k})\hat{\boldsymbol{Q}} _{J,A}(-\boldsymbol{k}).}$

14. J. Bardeen, L.N. Cooper, J.R. Schrieffer, Phys. Rev. 108, 1175 (1957); see also H. Fröhlich, Phys. Rev. 79, 845 (1950) and J. Bardeen, D. Pines, Phys. Rev. 99, 1140 (1955).
