The relevance of waves in quantum mechanics
naturally implies that the decomposition of arbitrary wave packets
in terms of monochromatic waves, commonly known as Fourier
decomposition after Jean-Baptiste Fourier’s Théorie analytique de la Chaleur
(1822), plays an important role in applications of the theory.
Dirac’s δ function, on the
other hand, gained prominence primarily through its use in quantum
mechanics, although today it is also commonly used in mechanics and
electrodynamics to describe sudden impulses, mass points, or point
charges. Both concepts are intimately connected to the completeness
of eigenfunctions of self-adjoint operators. From the quantum
mechanics perspective, the problem of completeness of sets of
functions concerns the problem of enumeration of all possible
states of a quantum system.
2.1 The δ function and Fourier transforms
Let f(x) be a smooth function in the interval
[a, b]. Dirichlet’s equation
[7]
motivates the formal definition
(2.1)
(2.2)
such that equation (2.1) can (in)formally be
written as
A justification for Dirichlet’s equation is given below in the
derivation of equation (2.8).
The generalization to three dimensions follows
immediately from Dirichlet’s formula in a three-dimensional cube,
and exhaustion of an arbitrary three-dimensional volume
V by increasingly finer
cubes. This yields
The case ( on the boundary of V ) must be analyzed on a case-by-case
basis.
(2.3)
Equation (2.3) implies
This can be used to introduce Fourier transforms by splitting the
previous equation into two equations,
with
Use of corresponds to the
-representation of quantum mechanics.
Use of corresponds to the
-representation or momentum-representation of quantum
mechanics.
(2.4)
(2.5)
The notation above for Fourier transforms is a
little sloppy, but convenient and common in quantum mechanics. From
a mathematical perspective, the Fourier transformed function
should actually be denoted
by to make it clear
that it is not the same
function as with different symbols for
the first three variables. The physics notation is motivated by the
observation that and are just different
representations of the same quantum mechanical state ψ.
Another often used convention for Fourier
transforms is to split the factor (2π)−3 asymmetrically, or
equivalently replace it with a factor 2π in the exponents,
or equivalently
with the vector of wave numbers
The conventions (2.4, 2.5) are used throughout this book.
The following is an argument for
equation (2.1) and its generalizations to other
representations of the δ
function. The idea is to first construct a limit for the Heaviside
step function or
function
and go from there. The value of is often chosen to suite the needs of
the problem at hands. The choice seems intuitive and is also
mathematically natural in the sense that any decomposition of a
discontinuous functions in a complete set of functions (e.g.
Fourier decomposition) will approximate the mean value between the
left and right limit for a finite discontinuity, but in many
applications other values of are preferred.
The function helps us to explain Dirichlet’s
equation (2.1) through the following construction. Suppose
d(x) is a normalized function,
The integral
(2.6)
satisfies
where we apparently defined as ,
but this plays no role for the following reasoning.
(2.7)
Equation (2.7) yields for f(x) differentiable in [a, b]
where we simply split
to arrive at the final result. Equation (2.8) confirms
or after shifting the argument,
From a mathematical perspective, equations like (2.9) mean that the action
of the δ distribution on a
smooth function corresponds to integration with a kernel
κ d(κ
x) and then taking the limit κ → ∞.
(2.8)
(2.9)
Equation (2.2) is an important particular realization of
equation (2.9) with the normalized sinc function
.
Another important realization uses the function ,
(2.10)
Note that we did not require d(x) to have a maximum at x = 0 to derive (2.9), and indeed we do
not need this requirement. Consider the following example,
This function has two maxima if α ⋅ β ≠ 0 and if a and b are sufficiently far apart, and it
even has a minimum at x = 0
if α = β and . Yet we still have
because the scaling with κ
scales the initial maxima near a and b to a∕κ → 0 and b∕κ → 0.
Sokhotsky-Plemelj relations
Indeed, for the practical evaluation of integrals
involving singular denominators, we virtually never use these
relations but evaluate the integrals with the left hand sides
directly using the Cauchy and residue theorems. The primary use of
the Sokhotsky-Plemelj relations in physics and technology is to
establish relations between different physical quantities. The
relation between retarded Green’s functions and local densities of
states is an example for this and will be derived in
Section 20.1
I will give a brief justification for the
Sokhotsky-Plemelj relations. The relations
imply
On the other hand, the real part is
This implies for integration with a bounded function f(x) in [a, b]
However, the weight factor
essentially cuts the region − 3ε < x < 3ε symmetrically from the integral
∫ a b dx f(x)∕x (the value 3ε is chosen because xK ε (x) = 0. 9 for x = ±3ε), see Figure 2.1. Therefore we can use
this factor as one possible definition of a principal value
integral,
Fig. 2.1
Comparison of 1∕x with the weight factor K ε (x)
2.2 Self-adjoint operators and completeness of eigenstates
The statistical interpretation of the wave
function implies that the wave
functions of single stable particles should be normalized,
Time-dependence plays no role and will be suppressed in the
following investigations.
(2.12)
Indeed, we have to require a little more than
just normalizability of the wave function itself, because the functions
,
, and for
admissible potentials should also be square
integrable. We will therefore also encounter functions which may not be normalized,
although they are square integrable,
Let and be two square integrable
functions. The identity
yields with the choice
the Schwarz inequality
The differential operators and
, which we associated with
momentum and kinetic energy, and the potential energy all have the following
properties,
and
(2.13)
(2.14)
(2.15)
Equation (2.15) is a consequence of the fact that
is a real function.
Equations (2.13, 2.14) are a direct consequence of partial
integrations and the fact that boundary terms at vanish
under the assumptions that we had imposed on the wave
functions.
If two operators and have the property
for all wave functions of
interest, then is denoted as adjoint to the operator . The mathematical notation for
the adjoint operator to is ,
Complex conjugation of (2.16) then immediately tells us .
(2.16)
An operator with the property is
denoted as a self-adjoint
or hermitian
operator2.
Self-adjoint operators are important in quantum mechanics because
they yield real expectation values,
Observable quantities like energy or momentum or location of a
particle are therefore implemented through self-adjoint operators,
e.g. momentum is implemented through the
self-adjoint differential operator . We have
seen one method to figure this out in equation (1.21). We will see another method in
equations (4.26, 4.27).
Self-adjoint operators have the further important
property that their eigenfunctions yield complete sets of functions.
Schematically this means the following: Suppose we can enumerate
all constants and functions which
satisfy the equation
with the set of discrete indices . The constants are eigenvalues and the functions
are
eigenfunctions of the
operator . Hermiticity of the operator
implies orthogonality of
eigenfunctions for different eigenvalues,
and therefore
However, even if for
different indices (i.e. if the
eigenvalue is degenerate because there exist at least
two eigenfunctions with the same eigenvalue), one can always chose
orthonormal sets of eigenfunctions for a degenerate eigenvalue. We
therefore require
(2.17)
(2.18)
Completeness of the set of functions means that
an “arbitrary” function can be expanded in terms of the
eigenfunctions of the self-adjoint operator in the form
with expansion coefficients
(2.19)
(2.20)
If we substitute equation (2.20)
into (2.19) and (in)formally exchange integration and
summation, we can express the completeness property of the set of
functions in the
completeness relation
(2.21)
Both the existence and the meaning of the series
expansions (2.19, 2.20) depends on what large a class of
“arbitrary” functions one considers. Minimal
constraints require boundedness of , and continuity if the
series (2.19) is supposed to converge pointwise. The
default constraints in non-relativistic quantum mechanics are
continuity of wave functions to ensure validity of the
Schrödinger equation with at most finite discontinuities in
potentials , and normalizability. Under
these circumstances the expansion (2.19, 2.20) for a wave function
will
converge pointwise to . However, it is convenient
for many applications of quantum mechanics to use limiting forms of
wave functions which are not normalizable in the sense of
equation (2.12) any more, e.g. plane wave states
,
and we will frequently also have to expand non-continuous
functions, e.g. functions of the form
with a discontinuous potential . However, finally we only have
to use expansions of the form (2.19, 2.20) in the evaluation
of integrals of the form ,
and here the concept of convergence in the mean comes to our
rescue in the sense that substitution of the series
expansion (2.19, 2.20) in the integral will converge to the same
value of the integral, even if the expansion (2.19, 2.20) does not converge
pointwise to the function .
A more thorough discussion of completeness of
sets of eigenfunctions of self-adjoint operators in the relatively
simple setting of wave functions confined to a finite
one-dimensional interval is presented in Appendix C. However,
for a first reading I would recommend to accept the series
expansions (2.19, 2.20) with the assurance that substitutions of
these series expansions is permissible in the calculation of
observables in quantum mechanics.
2.3 Problems
2.1. Suppose the function f(x) has only first order zeros, i.e. we
have non-vanishing slope at all nodes x i of the function,
Prove the following property of the δ function:
2.2. Calculate the Fourier transforms of
the following functions, where in all cases −∞ < x < ∞. Do not use any electronic
integration program.
2.2a. , ,
2.2b. , ,
2.2c. ,
, where n is a natural number.
2.3. The functions and are normalizable to
functions d(x) in the sense of
equation (2.6). Use this to find other derivations of the
Fourier representation of the δ function similar to
equation (2.10).
2.4. We consider a finite interval
[a, b] together with the set C (1, α)[a, b] of complex valued functions which
are continuous in [a, b] and differentiable in (a, b), and satisfy the pseudo-periodicity
condition
Show that the differential operator is self-adjoint on C (1, α)[a, b]. Give a complete set of eigenstates
of in C (1, α)[a, b].
2.5. We consider the finite interval
[a, b] together with the set C (2), 0[a, b] of complex valued functions which
are continuous in [a, b] and second order differentiable in
(a, b), and satisfy the boundary conditions
Show that that the differential operator d 2∕dx 2 is self-adjoint on
C
(2), 0[a, b]. Give a complete set of eigenstates
of d
2∕dx
2 in C
(2), 0[a, b].
2.6. We consider the finite interval
[a, b] together with the set C (1), 0[a, b] of complex valued functions which
are continuous in [a, b] and differentiable in (a, b), and satisfy the boundary conditions
Show that the symmetric differential operator with domain C (1), 0[a, b] is not self-adjoint in the sense
that h 1
+ can be defined on the larger set L 2[a, b] of square integrable functions over
[a, b].
Show that h 1 has no eigenstates,
while h 1
+ has complex eigenvalues and an overcomplete set of
eigenstates.
Bibliography
7.
R. Courant, D. Hilbert,
Methods of Mathematical
Physics, vols. 1 & 2 (Interscience Publ., New York,
1953, 1962)
21.
T. Kato, Perturbation Theory for Linear
Operators (Springer, Berlin, 1966)
Footnotes
1
Yu.V. Sokhotsky, Ph.D. thesis, University of St.
Petersburg, 1873; J. Plemelj, Monatshefte Math. Phys. 19, 205
(1908). The “physics” version (2.11) of the
Sokhotsky-Plemelj relations is of course more recent than the
original references because the δ distribution was only introduced much
later.
2
We are not addressing matters of definition of
domains of operators in function spaces, see e.g. [21] or Problem 2.6. If the operators
and can be defined on different
classes of functions, and holds
on the intersections of their domains, then is usually denoted as a
symmetric operator. The
notion of self-adjoint operator requires identical domains for both
and such that the domain of
neither operator can be extended. If the conditions on the domains
are violated, we can e.g. have a situation where has no eigenfunctions at all, or
where the eigenvalues of are complex and the set of
eigenfunctions is overcomplete. Hermiticity is sometimes defined as
equivalent to symmetry or as equivalent to the more restrictive
notion of self-adjointness of operators. We define Hermiticity as
self-adjointness.