/
Author: Rose M.E.
Tags: physics theoretical physics theory of relativity relativistic physics
Year: 1961
Text
RELATIVISTIC
ELECTRON
THEORY
M. E. ROSE
Chief Physicist
Oak Ridge National Laboratory
NEW YORK" LONDON, JOlIN WILEY & SONS, INC.
Copyright @ 1961 by John Wiley & Sons, Inc.
A II rights reserved. This book or any part
, thereof must not be reproduced in any fornl
without the \vrilten permission of the publisher.
Library of Congress Catalog Card Number: 61-5667
Printed in the U ni1.ed States of America
PREFACE
The preface of. a book is traditionally a device enabling the autho
to divulge his intentions and hopes as weU as his motivations. At the
same time it provides the reader of the book with a preview of things
to come. In that sense this Preface is in accord with tradition.
It-is not the purpose of these prefatory remarks to describe the con-
tents of this book in any detailed manner. A study of the table of con-
tents should provide an adequate guide to the material covered here,
as to the scope of the discussion as well as, possibly, to the level of
sophistication which has been. assumed on the pa.rt of the reader. Lest
there be any ambiguity with reference to the latter, it is assumed that
the.. reader has become acquainted with the general principles and
methods of quantum mecanics. In view of recent trends in the gradu-
ate curriculum, most first-year and virtually all second-year graduate
students should find themselves adequately prepared, and presumably
equipped to undertake the'study of relativistic electron theory. In this
connecti.on it is of interest to not,e that this book has been designed
for Use asa reference as well as a text.
It is important to recognize the place in the scheme of things which
this part of physics occupies. To begin with, we are here concerned
\vith the theory of all spin one-half particles (fermions) which are
lighter than a nucleon. Therefore the word "electron" in the title
stands for mu meson, neutrino, and their antiparticles as well. In mak-
ing this remark we recognize that, in light of recent developments, the
neutrino, in particular, may well require special discussion but this,
properly speaking, is an off-shoot of the more general electron theory
and is taken up in the last chapter of the book.
The second point of importance is to tecognize that we deal here
with what is sometimes called the "c-number ,eory." The fields with
which we are concerned are not quantized. 'This means that certain
vii
viii
PREFACE
effects, radiative corrections to electromagnetic processes for example,
are, stricdy speaking, beyond the scope of the present treatment. This
does not, of course, preclude a discussion of radiative processes: brems-
strahlung, Compton scattering, and the like. The contents of this book
ITlay. properly ,be referred to as the -single particle J?irac theory. This
theory is the extension of quantum mechanics to include the ,effects of
special relativity. As such, it may be thought of as forming a link
between the simpler form of the quantum theory and": the more ad-
vanced version wherein all the fields are treated as quantized entities
It will be quite evident, however, that the Dirac theory occupies a more
important position in the development of modern physics than... this
ancillary role would imply. As will be seen, it has a wde range of
applicability. To the extent that it does not give con1plete answers.. to aU
conceivable problems, in the realm of electron theory, it partakes of the
nature of all other physical theories which are useful, powerful, even
\
elegant, but not final.
This book is intended, then, as a comprehensive treatment of the
single particle description of relaivistic quantum mechanics. Explicitly
we are concerned wjth spin one-half' particles which are not subject to
strong couplings (for instance, the pi meson field); it, is well known t
however, that in. at least a-, formal way parallel considerations maybe
nlade for spin zero particles with mass. This would then exhaust the
non-field theoretic description of kno"wn particles in the quantum
theory.
1.n the pa.st it has been customary, in books expounding the principles
of quantum tnechanics) to conclude with an. all too brief chapter .00 the
relativistic single particle theory. That this kind of discussion, abbrevi-
ated as it D1USt usually be.. inevitably leaves the student with an inade..
quate understanding of this irnportant extension of qua11tum mechanics
is presumably very well appreciated by all the authors who have been
forced into this position for obvious practical reasons. Several yearS'
ago, when an appreciable fraction of graduate students .Nere not ex-
pected to acquire'a knowledge of quantum mechanics beyond the treat-
ment of the standard textbooks, this curtailed treatment of the relativ-
istic theory or, more pertinently the absence of a book dealing with
the subject in a detailed and extensive manner was not so serious a
drawback. It is clear, however, that this is no longer the case. At
present) courses dealing with the present subject have become rather
common in most graduate curricula.
()bviously, the degree of emphasis 'V\'hich each topic has received is a
rnatter of personal taste and judgment. 'The ITiotivation in n1aking such
PREFACE
ix
aecisions has been to give as much prominence as possible to the'
conceptual basis of the theory. Secondly, particular atentionhas been
given to the presentation of techniques which would enable the user
of the book fiot only to "follow the literature" but also to use. the
... theory on his own. Applications. of the theory appear throughout but
are most frequently found in later chapters.. /\ nUITlber of ingenious
solutions of he Dirac equations have not been included because it ap-
pears that their main interest is mathematical rather than physical.
About a hundred problems appear in the book. "These are to be
found at the end of each chapter. They present a wide range of con-
tent and a broad spectrum so far as degree of difficulty is concerned.
Withtbe exception of a few general references listed at the end of 'the
book, the literature cited in the text is referenced at the end of the
appropriate chapter. I offer my apologies to the ITlany contributors
wh()seexcellent papers have not been cited. No attempt has been made
to provide a complete bibliography. Instead, the references cited con-
stitute' recognition of the important early papers and the most recent
devloPD1ents in the case' of each topic discussed. In any event, these
references should furnish an adequate starting point for the reader in-
terested in pursuing any particular topic in. greater detail.
It is a pleasure to record my thanks to Dr. Roland H. Good, Jr., pf
Io""aState University forhis kindness in reading the manuscript.. Need-
less to say, the responsibility for all tbat follows rests entirely with me.
This applies especially to whatever errors of omission and/or com-
mission may exist herein.
M. E. ROSE
Oak Ridge, Tennessee
November, 1960
CONTENTS
I. NON.RELATIVISTIC SPIN rrHEORY
1
1. Introduction
2. Empirical Basis of the Spirt Theory
3. Formal Theory of Angular ?\1omentuln
Definition of Angular lvlomenturn. Eigenvalues ana
Eigenfunctions of the Angular MOlnentum Operators
4. Application to Spin One-Half
5. Spatial Rotations
6. Spin Projection Operators and Polarization
7.. Electron in a Central Field
1
2
3
8
14
17
19
Spin-Orbit Coupling. Pauli Spinors in a Celltral Field.
Anomalous Zeeman Effect
8. CoupHng of Angular Momenta 25
The Vector Addition Coefficients. Properties of the
Spin-Angular Functions
II. RELATIVISTIC QUANTUM MECHANICS
OF FREE PARTICLES 32
9. Postulates of the Theory 32
10. The Wave Equation 37
The Second-Order Equation. 1'he Dirac Wave Equation.
The Covariant Forln of the Wave Equation
11. The Dirac Matrices 44
12. Spin and Constants of the Motion 49
13. The Fundamental Theorem of Pauli 52
14. Lorentz Transformation" and Relativistic Covariance 5 5
Covariance of the Equations of Motion. The Transforma-
tion Matrix. Bilinear- Covarianl' .
xi
xii CONTENTS
In. DIRAC PLANE WAVES 68
15. The Four Plane Wave States 68
The Wave Functions. The Spin Operator
16. Negative Energy Solutions. The Positron 74
17. The. Properties of Free Positrons 78
18. The Diagonal Representation 8
Plane Waves. The Foldy-Wouthuysen Transformation
19. Projection Operators 92
General Properties. Energy Projection Operators. The
New Representation. The Spin Projection Operators-
20. Covariant Description of Spin 102
21. Application to Nuclear Beta DC{cay 105
IV. PARTICLE IN ELECTROMAGNETIC FIELDS 116
. 22. 1'he Wave Equation 116
Classical Electromagnetic Fields. The Equations- of Motion.
Magnetic Moment of the Electron. Foldy-Wouthuysen
Tran$/ormation with External. Fields
23. Spin Effects in Electric and Magnetic Fields 130
Polarization Effects and Covariant Spin Operator.
Virial Theorem
24. Charge Conjugation 134
25. Space . and Time Reflection 139
Space Reflection. Time Reflection. Transformation of the
Adjoint Function. Transformation o/the Bilinear Covariants
under Time Reflection. Unitary Transformations
v. DIRAC PARTICLE IN A CENTRAL FIELD 157
26. Wave Equation in Polar Coordinates 157
27. Free Particle Solutions 161
28. General Properties of the Radial Functions 163
Normalization of Bound State Wave Functions. Nodes of
the Radial Functions
29. Coulomb Field. Bound States
30. Anomalous Zeeman Effect
169
181
VI.
35.
36.
37.
38.
39.
40.
VII.
41.
.42.
CONTENTS
31. '
32.
33.
Hyperfine Structure
Coulomb ..Field. Continuum States
Scattering Theory
The Density Matrix. Formal Theory of Scattering 01 PoltJr-
ized Electrons. The Scattering Amplitudes
Time-Dependent Perturbations
34.
APPROXIMATION METHODS
'"
The Classical Limit
The Born Approximation
Retarded Interaction between Charged Particles. The Breit
Interaction. Scattering of Fa.ft Electrons by Nuclei
Compton Scattering of Circularly Polarized Radiation
Sommerfeld-Maue Appr.oximation
Finite Nuclear Size Effects
Wave Functions inside the Nucleus. Scattering Phase Shifts
The Dirac Equation at Hig Energies
NEUTRINO THEORY
Four-Component Formulation
Masso! the Neutrino. Neutrino Helicity. C:'harge Conjugate
States
The Two-Component Theory
The Weyl Equation. Relation to the Majorana Theory. Co-
variance of the Theory. Two-Component Neutrino in Beta
Decay. Angular Momentunt Representation
xiii
188
191
196
211
219
219
223
232
237
240
246
253
253
258
,
Appendix A. Notation 273
Appendix. B. Lorentz Ttansformations 276
Appndix C. Time-Dependent Operators 279
Appendix D. An Alternative Approach to the
Dirac Matrices 281
Appendix E. Retarded Electromagnetic Interaction 286
General References 293
Author Index 295
Subject Index 298
I.
NONuRELATIVISTIC SPIN TH}-::ORY
1. L'fTRODUCTION
The relativistic theory of the electron, as distinct from relativistic particle
theories in general, is a theory of a particle \vith spin In. By spin vIe shall
mean the intrinsic angular. momentum associated with the particle. In
contrast, the t.otal angular momentum is the resultant of the spin and the
orbital angular momentum which the particle possesses by virtue of.its
'motion. Thus the spin is the total angular momentum in the rest systeln.
This property of intrinsic angular momentum is, of course, a quantum
effect since/ it cannot appear in a classical theory, that is, the limiting fOl1n
of the theory as Ii -- O. By way of contrast, the orbital angular m(;mentum
Iii does have a classical1imit since hequantum number I is not bounded
in this limit.
Needless to say, the relativistic theory which forms the subject of our
discussion is, in a sense, more tha1l a description of the spin. I t is
immediately obvious that SUCl1 a theory must be consistent with the
in variance requirements of the special theory of relativity. Indeed, \vhen
this requirement is iroposed, a number of theories appear as possible
candidates. Moreover, each of these theories contains the result that the
particle under discussion does, in fact, exhibit a spin sn, \vhere 2s is any
non-negative integer. The particular form of the theory, unique for a
particle with non-vanishing mass, which corresponds to s = t is the
well-known Dirac theory with which we shall be almost exclusively
cccupied in this exposition. ConsequentJy, it is proper to say that, in
detail, the spin properties of an electron are a natural consequence of the
requirements of relativistic invariance-. The validity of this statement is
explicitly demonstrated in the seque1.
Historically, the concept of electron spin arose in a phenon1cnological
1
2
RELATIVISTIC ELECTRON THEORY
.
way.It The first formal theory! of spin was not a relativistic theory, a.nd
in view of the basic principle that simpler things come first, this historical
order was a most natural one. This theory, the Pauli theory, is the limiting
form of the rigorously correct theory.in the limit in which the velocity of
light (c) tends ,toward infinity. Consequently, all the quantitative results
of the non-relativistic theory can be obtained as limiting values of corre-
sponding results as given by the Dirac theory. In fact, it is a curious
circumstance that in some cases the exact theory yields these results in a
simplr and more straightforward manner. Accordingly, it might appear
logical to dispense with a discussion of the approximate'theory and obtain
all the results of the Pauli theory.'as limiting forms of the more rigorous
treatment. Nevertheless, it is advantageous to approach the study of the
relativistic theory from the standpoint of the Pauli theory, since the latter
provides a unique insight into the structure of the former.
2. EMPIRICAL BASIS OF THE SPIN THEORY
The concept of a spinning electron was. first suggested byCQII1ptpn 3 jn
1921 in connection, appropriately enough, with the origin of the natural
unit of magnetism. The idea became firmly established in physics when
in 1925 Uhlenbeck and Goudsmit 1 proped the electron asa point magnet
with intrinsic spin in order to clarify,. the anomalous Zeeman effect. The
main results of the argument were:
(a) The electron must have an intrinsic spin !11. Hence single electrorr
atomic levels must be characterized by half-integer angular momentum.
(b) The electron rnagJ?-etic moment arising from the spin must have a
rnagnitde equal to the Bohr magnelon:
Iftl = en/2mc5 #0 (1.1)
where m is the rest mass of the electron and -e < 0 is its charge.
Moreover, in termS ofa vector model, fL and s(the angular momentumifi
units of Ii) must be oppositely directed. Therefore a vector equation
f£ = -(e/mc)Jis (1.2)
can be written. It will appear that this equation is valid as an operator
equation. The magnetic moment (f.-l) as a measured quantity is then the
average (expectation) value of P'z for the state in which Sz has the constant
t Re(erences are found at the end of the chapter.
t More precisely, the non-relativistic spin theory can be correct only to order vIe,
where v is the velocity of the electron. In the case, of bound states, vIe,......, rJ.Z, where
<X = e 2 /hc 1/137 is the fine structure constant and Z is the atomic number.
NON-RELATIVISTIC SPIN THEORY
3
value!. The connection between the spiand the associated magnetism
implies that the spin must be a relativistic phenomenon.
. '
The gyro,nagnetic ratio is
gs = /-l/Pos = Z- (1.3)
in contrast to the result for the orbital angular momentum, where the
corresponding ratio is
gz = /lz/flo l = 1
(1.4)
and fit is the Inagnetic moment due to the orbital motion. It is possible to
make a classical argument4 based on relativistic invariance, whic leads to
the value gs = 2.. However, it is much simpler to obtain this result from
the complete quantum mechanical treatment of spin given in later chapters._
For the moment it is of interest to mention that, if gs = 1 is assumed, the
Zeeman effect with spin leads to the "normal" Zeeman triplet, contrary
to experimental evidence. Obviously, the conclusion that gs = 2 precisely
is obtained from neither the empirical evidence nor as a consequence of
the approximate spin theory. In this theory it must be taken as a postulate.
.. The postulated magnetic dipole to be associatd with the electron
immediately leads to a spin-orbit c0upling since, in the frame of reference
of he electron, the rest of the at0I!l provides a magnetic field which is
coupled with the electr<;>n magnetic dipole moment. In these terms the
doublet structure of the spectra of the alkali atoms' and other multiplet
. structure observed itl optical spectra could be understood. However, a
quantitative accounting for the measured doublet separations depends on
a. more detailed analysis of relativisticeeffects (section 7) than this simple
discussion would seem to entail.
3. FORMAL THEORY OF ;\NGULAR IVIOMENTUMt
Definition .ofAngular Momentum
....
Since we are concerned with a particle with angular momentum,
intrinsic and possibly orbital angular momentum as well, it is very useful
to establish in a formal way just what is meant by these terms. If we are
given a wave function "p which represents the state of a particle, there is a
procedure, as indicated below, by means of which we can determine what
t Here and henceforth the unit of angular momentum is h. Hence the term "angular
momentum" will refer to a dimensionless quantity which, as a matter of fact, is integer
or half-integer. The contents of this section appear in several other places;. for example,
see reference A.in the General References at the end of the book. In the text, references
in this general list will be denoted by an upper-case Roman letter.
4
RELATIVISTIC ELECTRON Tl-IEfJRY
angular momentum, if any, characterizes this. state. Since we do not in
general start with a given wave function, it i& more to the point to establish
SOlne properties which the requisite function must exhibit in order that it
propeJ;'ly describe the given angular m01nenturn associated with the state.
In the last analysis a definition of angular nlomentum must be based on
a measurement or set of measurements. However, the logical chain may
be reversed: angu]ar momentunl may be defined in terms of a formal
operation, and from this definition a connection wiJl eventually be
established between the angular momentum thus defined and a measured
quantity-a cross section, shape of an angular distribution, 01 the number
-of lines in a spectrum of some type of radiation emitted by an atom or by
a corresponding physical systern.
The concept of angular momentum is intimately connected with three-
dimensional rotations. 'This is clear in classical as wen as in non-relativistic
quantum mechanics where, as is to be expected., only orbitaJ anguJar
momentunl is involved. Nevertheless;; the connection \\lith rotations is a
general one. For instance, the statement that a physical system is rotation-
any invariant ir1ipHes in both classical and quantlun theory, that the
Hamiltonian describing it COlnmutes with the operator representing the
rotation. It will become evidenf that in the general cas4'e this leads to the
result that the total angular momentum is a constant of the motion.
Starting with a wave function VJ which depends on spatial coordinats
and possibly other coordinates as well, we consider a rotation R described
by three parameters. These can be taken to be the three Euler angJesor
the two angles specifying the orientation of a unit vector ii, the rotation
axis, and an angle .0, t.e rotation angle around n. Under the rotation 'I'
is . transformed to 1p' and
1p' = R(D, O)1p
(1.5)
Since R must be unitary, it can be written
R(ft, 0) = exp [ - is(n, 0)]
(1.6)
where S is hermitian and S(o.. 0) = o. Considering infinitesinlal rotations
around the X-, y-, and z-axes respectively, we \vrite, in each case
bVJ c= Rtp - 1p = -ie ( ) tp
of)ls=o
(1.7)
where (oS/oO)o=o depends on the axis of rotation. Since a rotation is a
continuous transformation, the function S TIlust have corresponding
properties and, for instance, the derivatives of S \vith respect to f) must
exist at any value of e.
NON..REL.TIVIS'TIC SPIN THEORY
5
The angular momentum operator, actually three operators, J, 1'V, Jz,
are defined by choosing fi along the X-, y-, and z-axes. Since infinitesimal
rotations commute, we can define the J i operators in terlllS of
( as ) .D.. J .... J "" J A T
.- =11. =n x +nll y+nJz
,08 8 = 0
Clearly Ja:, J y , and Jz are all hermitian. In (1.8) a con velltion has been
made 'with respect to a choice of sign, and, in detail, this choice is fixed in
terms of the manner in which a psitive rotation, for example, is specified.
From (1.7), .
(1.8)
R(ft, 6) = exp ( - i(}ft-J)
(1.9)
which has the property that t.o rotations around the same axis corpmute:
R(ii, (1) R(n, (2) = R(ft, 1)1 + ( 2 )
as is necessary.
Since,finite rotations do not commute, it is clear that the components of
J win not commute.. In fact, if a rotation around the y-axis through an
angle 01/ is followed by one around the x-axis through Ox, the result is not
the. same if the rotations are carried out in reverse ordere For simplicity
asum.e that both rotations are infinitesimal and consider terms of second
order in Og:, (}11. The the difference between the first pair of rotations and
the second pair (first pair in reverse order) produces the same displacenient
as an infinitesimal rotation around the zaxis through an angle 8 x 0'll.
Hence, \vith an obvious notation,
. R(x, Oa:) R(y, f)1I) - R(y, Oy) R(i, Ox) = -f);efJy(J:t;' J lI )
where we have introduced the commutator; that is,
(1.10)
(A, B) = AB - BA
From tl1e statement made above, the quantity on either side of (1.10) is
-i(Jx()z Hence
(J;e, J 1/) - iJ;;
It follows that two similar equations obtained by cyclic pernlutation of the
indices x, '!/, z are also valid. These three equations are sUITlmarized by
J X J = iJ
(1.11)
These are the commutation rules of the angular momentum operators.
It is evident that, if J and J' form two sets of operators conforn1ing with
(1.11) and if each component of J commutes with each component of J',
then the sum J + J' = J" also satisfies (1.11). Each component of J" is
an angular momentum operator, while J" itself is referred to as a vector
6
RELATIVISTIC ELECTRON THEORY
angular moentum operator. The measured quantity generally refe!r<i
to ,as the angular momentum of a physical system cannot be a vector
because this would imply that each component of that vector is a constant
of the motion, and that, in view of (1.11), is impossible. Qearly the
angular momentum must be' the eigenvalue of a rotationally invariant
operator and hence must be related to
J2 == J: + J: + J:
..
f 4 rom (1.11) it follows that J2 commutes with each of J re , J u , and J. anq
hence with the rotation operator R. Consequently we can make J2a
constant of the motion, and the eigenvalues of this operator will not
depend on the orientation of the coordinate axes.
Eigenvalues and Eigenfunctions
of the Angular Momentum...Operators
CQnsider a physical-system described by a Hamiltonian.H which is
rotationally invariant. This means that H commutes with each compoI1ent
of J and
Of course, it follows that
(H, R) = 0
(H 1 J2) = 0
so that J2 is a constant of the motion. In addition, one component QfJ,
say Jt;, can be made a con.stant of the motion. The angular momentum
representation in which J2 and J. are simultaneously..diagonaI with H is
given in terms of a set of eigenfunctions 'fJJf for which
J21p'1 = 'YJ i "Pi
( 1.12)
Jz1p7 = m1pi
In the first of (i.12) the notation implies tE.at the eigenvalues'?]; of J2
depend on a number j to be determined. Because 12 and Iz are hermitia!1,
rJ; is real and non-negative, m is real, and the eigenvalue of J2 - 1; = J; + J;
is'
Introducing the operators
'YJi - m 2 >= 0
(1.13)
and the function
J:t = J :I:: iJ '¥
(1.14) ;
we see that
c/>z == J z 'Pi
( 1.15)
J2cp-j; = 'YJic/>:t
NON-RELATIVISTIC SPIN THEORY
because (J2,J:t) = 0 and
Jz":t = [J :l:JI: + (J/lf J :I:)]1J17 == J j;(J 3 ::I: l)-tpi
= (m ::I: l)cp:t
'"Therefore 1>:1: is an eigenfunction of J2 with the same eigenvalue as 1pj
and is also an eigenfunction of Jz with eigenvalue m :I: 1. Thus
7
..J.. - r ,IJm:l:l
'f' oj: - :.t: T;
where r :i: is a constant the value of which is determined below.
Since application of J+ to "Pj raises the value of m, for givenj, it follows
from (1.13) that for some m, say m 2 , the resulting function cP+ must vanish;
that is
J + 1fJ't s = 0 .
(1.16a)
Then m <: mg <; 'YJj. Ina similar way we deduce that there exists a value
of m (say m 1 ) for which
J _1pjl = 0
(1.16b)
andm :> m 1 >: -1'];. Operating on (1.16a) with J_ and on (1.16b) with
J+ gives
J:r- J -J: 1pji = [J 2 - J(J:l:l)]V'ji
= [1}1 - mi(m i :I: 1)]"P7 i = 0
( 1.17)
and i = 1, 2 for ]ower and upper signs respectively. Since 1pj,1t i are bona
fide memb,ers .of the set, it follows that the square bracket in (1.17) must
yanish. Eliminating '7; from the two equations obtained in this way, we
find the result '
(m 2 + m 1 )(m2 - m 1 + 1) = 9
Since m2 > m 1 it follows that m 2 - m 1 + 1 cannot vanish and so m 1 = - m2-
Also, consecutive m-values ,differ by unity. Hence 11'12 - m 1 is a non-
negative integer which we denote by 2j; that is,
j = 0,1, 1,1,2,...
It follows then that
m 2 = j,
m 1 = -j
(1.18)
(1.19)
and, from (1.15),
"YJ; = j(j + 1)
Classically, j - 00 and the eigenvalue of J2 -:;. j2. Therefore it is to be
expected that the number j is the angular momentum (in units of Ii).
The linear term in j is a result of the uncertainty principle as expressed by
8
RELATIVISTIC ELECfRON TIIEORY
the commutation rules (1.11). This is apparent in (1.17). From (1.16}the
projection quantum number m is restricted by
-j <; m -< j (1.18')
so that there are 2j + 1 eigenfunctions for given j.
The next problem is that of determining the matrix elements of the
angular mornenum operators in the angular nlomentum representation.
These wiH be denoted by
(jmlJ klj'm')
for each ,of the operators J k . In writing these matrices in explicit form, the
first row refers to m = j and the first column to m' = j. The nth row and
column refer to m, m' = j --- n + 1. Clearly, -
(jm!Jlelj'm') = mc};j,d mm , (1.20)
A1so
(jmlJ2lj'm') = j(j + l)d j j't5 mm ,' (1.20/)
corresponding to the diagonalization of these operators. For the other
components \,\'e observe that
IF :t 1 2 = (J :t yj, J =t "1'1)
. (The detailed prescription for forming the scalar product wll be discussed
below.) The ''Pj are taken to form an.orthonormaL seLThus, using
..1I = J +, where * means hermitian conjugate,
Ir :f: 1 2 = (1Jlj, J :r-J:t 1J1i a ) = (1pi, [J 1 - J z(J z :i: l)]"PT)
= j(j + 1) - m(m :I: 1) = (j =F m)(j:l: m + 1)
The phase is chosen so that r :i: > 0:
r:f: = [(j T m)(j :l: m + 1)]!-i (1.20")
From (1.15) it follows that
(jmlJ ]:Ij'm') = r ]:<5 H 'c)m,m':i:l (1.20"')
These matrices therefore have non-vanishing elements only in the diagonals
adjacent to the principal diagonaL
4. APPLICATION TO SPIN ONE-HAI..F
Each of the matrices derived in the preceding section has 2j + 1 rows
and columns. For j = ! we obtain the angular momentum matrices for
the intrinsic spin of an electron. Using s for J in this case we write
s=!o
(1.21)
NON-RELATIVISTIC SPIN THEORY
9
and from. (1.20"1) it foHows that
10 1 )
a = ,
'" \1 0
(0
a,s: = ,
\ i
-i\
01'
/1
(T. = (
, ,0
0 )
-1
( 1.22)
in the representation v/herc. Sz or C1 z is diagonal. These a-nlatrices are the
v/ell-knovvn Pauli mtrices. Together 'Nith the 2 by 2 unit n1atrix /2 they
form a conlplete set in the serfse that any 2 by 2 matrix can be written in
terms of them. To see this we observe, first, that 1 2 and the three Pauli
matrices are evidentlv linear]v inde p endent. Thus
'" .I
Qol2 + a-a' :-= 0
if and only if 00 = 0 and a = O. Second, the trace of each a-matrix is
zero, whereas Tr 1 2 = 2. Hence, for any 2 by 2 rnatrix M,
Ai = }[Tr 1\1 + efr 1\10')-«]
(1.23)
In view of (1.11) \\'e can write ill1mediately
s X s = is
(1.24 )
or
a X u = 2ia
(1.24')
J n addition, the Pauli matrices ha.ve the foHoVv'ing properti.es:
axG y = - a l /:J:r, = fa z
fYy(J ::::: - a zU II == 1 a;;c
( 1.24")
(J zC;';,e = -- (J xU z = I cr!J
? 2 1
(f = (J = (1" =
trY Y Z
(1.24"')
In addition to being hermitian, each of the Pauli matrices is unitary:
a: == a-]. The existence of the inverse matrices foHows, since det (jk -=j:. 0.
For integer spins, for instance, the nlatrices are singular, as is clear since
one value ofm which ah\'ays occurs is In = O. The anticommuting
property of the a-matrices is peculiar to spin i. i\ corresponding property
does not appear for j * i. The last equality in (1 e24) also applies in the
case j == i- only. This equaJity \viH be written more succinctly by using
,Latin indices = 1, 2, 3 in place of the cartesian indices. Then
0' ,a k = iE " k lV ' .+ () ' k
1 ,1 v,, J
(1.25)
where €jkl is the antisymmetric third-rank tensor equal to + 1 if j, k, I is an
even perm utation of 1, 2, 3 and eq ual to - 1 if j k, ! is an odd permutation
of 1, 2, 1; otherwise € jkl = O.
10
RELATIVISTIC ELECTRON THEORY
Another. property which is extremely useful follows from the com..
mutation rules (1.24!1)o If A and B are two vectors which commute with
(1k but not necessarily with each other, then
a.A a.B = O"kAk(JzBz = A.D + (1 - OkZ)(]kGlAkBZ
Using (1.24"), this becomes
a.A aoB = AoB + io.(A)( ;8) (1.26)
This is an exan1ple of the decomposition of the type (1.23). It is clear that
no higher power of the PauJi spin matrices than the first need ever occur
in the formalism. By repeated application of the rule (1.26) it is easy to
construct the corresponding decomposition for the product of any number
of factors a.A'n. In view of what has already been said it is trivial to see
that this will always appear in the forn1 a + b.o.
As was mentione above, the form (1.22) of the Pauli matrices refers to
a particular representation: (] z diagonal. By a linear transformation with
a non-singular matrix S it is possible to \vrite the amatrices in other
representations. For example, in
O' = S(]kS-1 (1.27)
.S can be chosen so that any linear combination 0'.0, where n is an arbitrary
vector, can be made diagonaL When n is a real (unit) vector the unitary
transformation is a rotation in three-space. We shall return to this
problem in the next section. l\t this juncture it is important to remember
that all matrix equations are unchanged by the transformatio (1.27).
In particular, the commutation rules, (1.24') and (1.24"), are unchanged in
the sense that, if aU O'k in these equations are primed, the resulting equalities
are valid. A few simple cases can be discussed in1mediate1y. For exampI,
for S = (]x = S-l we find a = (J, a; == -G 1I , 0'; = -G z , which corre-
sponds to a rotation through 'TT around the :t;-axis. On the other hand, a
reflection (change of sign of an odd number of a's) is not a unitary trans..
formation because -0" does not fuUiIl the same comn1utation rules as
does (J.
The invariance of the cQlnmutation rules under the transformation (1.27)
does not actually require S to be unitary. It is sufficient that S be non..
singular so that Sl exists. However, in the present instance, where (/ and
(lk are both hermitian, S can always be chosen to be unitary.
As another example consider a representation in \\,hich (]; is diagonal.
Then, from the preceding it must have eigenvalues :l: 1, and we write it in
the form -
G= ( _)
.
NON-RELATIVISTIC SPIN THEORY
11
The linear transfornlation from the representation . (1.22) to the a'
representation corresponds to a rotation which carries the z-axis into the
x-axis. Since the positjons of the z' - and y'-axes are not specified, there
Intlst be some arbitrariness in 0'; and a;. Setting
a = {: :),
( at
0" =
z . t
C
b l )
d'
the requiremet that O';O' = iO': implies that
..
. ,
a = IG ,
. t
C = -IC ,
F I , , I fi d
. rom O'yO'x = - O'xCf y we n
b = ib '
.....
d = - id'
a=d=O
and
a = e :),. a = (: -;b)
From (]2 = or 0';2 = 1 we find
be = 1
and, with this, results (1(]; = ia = - 0'; a; follow automatically_ Also
()"<T; =__ iQ'= - (.j;(j is fulfilled. lIenee
at = (b1 :),
( 0
ai = ib- 1
-:b)
The lS'-matrix effecting the a-a' transformation is written
s = ( oc fJ )
y ,0
where oed - f3y -=P O. From
aS = Saf£
it follows that
rx = p,
= -y
From O";S = Say or O"S = SO"z we find .
(1..i = by
and thus
( -ib
S=y
1
- i b ) ,
-1
( O b X
x l
S* = Y
ib x
--:)
12
RELATIVISTIC ELECTRON THEORY
and
( lb l2 0 )
SS* = 21rl 2 0 t
Thus we can n1ake S* = S-1 by setting lyl2 = t, Ibl 2 = 1. For this choice
of Ibl 2 it is seen that SS* = S*S = 1. For b = 1, C!; = O'x') a; = u Y ' so
that a cyclic interchange of indices has taken place.
The converse theorem that, if S* = S-l and any matrix a is hermitian,
then .
a' = SaS- 1
is also hermitian is readily verified:
a'* = (SaS*)* = SaS* = SaS- 1 = a'
as required. Notice that, if a sequence of unitary transformations IS
carried out, the resulting overall transformation is also unitary.
The converse statement regarding (1.27) is also true: if a f and a are
two sets of three anticol11muting matrices ,vith a; = 0';2 = 1, then an S
exists for which (1.27) is valid. The proof is identical with that given in
section 13 for the Dirac matrices and will not be duplicated here. Finally,
we note that the trace and determinant of a matrix is unchanged by a
J . '
transformatIon of the type (1.27).
'Throughout this book the notation 0' or 0'.", where 2 by 2, matrices are
in1pJied, wiH refer to the representation (1.22).
The eigenvalue equations (1.12) for spin i will be written in the form
s2Xm = s(s - l)X m =.JtXrn
szX m = mx m , 111 := ::l:! (1.28)
There are t")vo eigenfunctions X:J:. From the matrix representation of Sz
and S2 it 1"oHo\"5 that the Im must b tnto-COfflponent functions. In fact,
with a simp1e choice of phases,
x H = () ;
_ 1.. ( 0 )
X =
1/
(1.29)
These may be regarded as single cohftnn matrices. 'Jv'e verify that these
form an orthonormal set.
(x m , Xtfl.') :.= omm'
»,here the scalar product 111eanS that Xm* is multiplied into X'm'. That is,
X m $ js the transpose, complex conjugate of X m :
x'. = (1 0);
- . (0
X -- =
1)
NON-RELATIVISTIC SPIN THEORY
13
so tnat Xm* are single row nlatrices. Ingenera1, a scalar product will
imply, unless explicitly stated to the contrary, integration over configura-
tion space and summation over the CO]UlTItl (or row) index labeling the
components of the spin function X'ru. Of course, in the present case we
are dealing with the eigenfunctions of the intrinsic spin, and they do not
depend on the space coordinates X k . Hence the first operation is here
unnecessary.
The" set of spin functions is obviously complete. Thus any two-
component function can be written as a linear superposition of them:
( a ) 1 1/
b = ax"" + bx->2
and hence the only two-component function \vhich is orthogonal to both
X!-1 and X- is the trivial one which is identically zero.
The appearance of a multicomponent wave function s characteristic of
the existence of a non-vanishing spin. Where the Vv'ave function 1p has a
single component depending only on the space coordinates the spin is
zero. In fact, the considerations of section 3 show that in this case
J = -irX V =L
(1.30) .
where L is the orbital angular momen.tum operator. Of course, in the
general case a particle with spin s (s > 0) rnay be characterized by a wave
functioll.1p \vhich has the form
'P1(X t )
"p2(x k )
'I) :::-..::
In this case the prescription of section 3 sho\vs that
J=L+s
( 1.30')
where s is a vector-matrix with ,,2s + 1 rows and columns and L is the
direct product of - ir X V and a unit matrix of the same rank. This
follows fron1 the fact that under rotations each component of .) must
transform into a linear combination of components. If this were not so,
the situation would arise in which rotations comn1 ute, contrary to fact.
For s = l
J == Ll 2 + !o
which is usually written
J=L+tO'
(1.31 )
14
RELATIVISTIC ELECTRON THEORY
In the foregoing considerations \vehave "t'..",(x k ) - 1 or 0, and the resulting
'lp = X m is a pure spin function. In section 7 we shall consider the problem
of introducing orbital motion.
Fron1 this discussion it follows that the wave function "p of a particle
with spin i is a function of the three X k , which form a continuum, and in
addition tp depends on a fourth variable which is dichotomic. That is to
say, the fourth variable has only two possible values and refers to the
"direction of the spin" or, more exactly, to the eigenvalue of SZ. Tp.us the
general form of 1p would bet
I -!-i ( "Pi( Xk) )
1p(x k , sz) = 'Vll(Xk)X + "P2(X k )X = )
"P2( X k
The notation indicates that this 1JJ is a superposition of the two states
m = :i:i. The interpretation of each terra is: l"Pl,2i 2 is the probabiliy per
unit volume that the particle is at the point X k with m = l, -i respectively.
Note that there is no interference between these two states:
(1p, "P) = ("PI' 1JJl) + (1J.'2' 1JJ2) = t Y'] 1 2 + 11f21 2
where the scalar product implies only summation over the column (row)
index of the spin functions. This in turn implies that (?p, 1p) is the
probability density when no observation of the spin (polarization measure-
ment) is made. A more detailed discussion of pol3:rization is.. given in
section 6.
5. SPATIAL ROTATIONS
It has already been emphasized that we cannot ascribe any meaning to
the statement that the spin vector is in a given direction. This would
imply the three equations 01p = ft"P where it is the spin direction. That
this equality is impossible follows from the fact that «(Jk, Ul) -=I=- O. However,
we can speak of the . average spin direction. This is given by (1p, aw).
If we introduce unit vectors e k a10ng the coordinate axes,
( ;to
e 3
a=
e 1 -t- ie 2
@1-A ie2 ) '
--e 3
For the p1!re spin functions,
(Xm,O'X m ) = ::tea,
for m = :1:1-
t s; is a number distinguishing tp fron1 the orthogonal wave function:
tp(x Je , --4;) = tpXYi - tpX-
NON-RELATIVISTIC SPIN THEORY
15
This result'is obviously directly connected to the choice of representation
in which (1 = O"a is diagonal.
It is useful to investigate other representations in which the con1ponent
ofa in any direction ii, that is a.6, is diagonal. Thus we write
to
x.= .! amX m
m
(1.32a)
and
a.ft X = AX
It is evident that the average spin ist
(1.32b)
(x :i:' aX:i:) = :I: ft
(1.32c)
From the fact that (0.6)2 = 1 (see Eq. 1.26), it foilows that A = ::I:: 1-
,Substituting (1.32a) into (1.32b), \ve find
( 3
n+
n_ ) fL
- !rz '" -- 1.-2
.. (ax + a_x \ ) = A(ax ,+ ax ' )
-1l
where fi-j; = fi! :i: in 2 . From (1..29} we obtain
n3a-i' -+ n_Q - = Ja!-i
12 + a} - naa -!4 -:- Aa - t
or
I A
n 3 - A
n-
=0
n+ -11 3 - A
giving ),,2 = j1;+ n+n_ = ft2 = 1 or A. = :l: 1 as n1entioned. Also
na - A
a-y% = - A aZi
n_
Writing n3 = cos .f}, n::i: = sin it eitp., so tllat {), f{J are the polar and
azimuth angles. of ii, and using the normalization condition
,
I 2 I 1 2 1
la1 + a-!-i =
we find, for A = 1,
i a , /1 2 = co s 2 Q. /2 .
i /"2 i . cU' ,
a -4/a!i = ei(jJ tan -D/2
and!; for A = -1,
la12 = sin 2 {}j2; a _ \A!a1A, = _eif'P cot fJ/2
t We anticipate that there will be two eigenfunctio!1s X = X::i:; :tee (1.33a) and (1.33b)
below. .
16
RELA TIV1sTIC ELECTRON l'HEOR Y
We choose the phases as follows:
A = 1: a! = e -'iqJ/2 cos {)/2,
A = -1:
a = _e- ifP / 2 sin {}12,
a _ /2= e iP / 2 sifl {}j2
a_ = e i q;/2 cps fJI2
(1.32d)
.t.2e)
Therefore the spin functions which diagonalize 0'.1\ with eigenvalue :f: 1 are
( 'e -irp/2 cos {}12 )
A = 1: X - (1.33a)
+ - e irpl2 sin r{}J2
( -e -if/J/2 sin Dj2\
). = -1: x- == ei'P/2 cos {}/2 ) (Ub)
These are, of course, a complete orthonOfInal set of spin functions. For iJ,
rp 0 the functions X:i: reduce to X:t!.
The tran.sformation just carried out can be written in another form.
We consider the matrix elements of R (see Eq. 1.9), in the angular
momentum representation and use the notationt
D:nm,(f3y) = (jmIRI.i 1n ') (1.34)
Here ct, 13, and yare the Euler angles of the rotation: (I) rotation through
tI.. around z, (2) rotation through f3 around resulting y-axis, (3) rotati0n
through y around final z-axis. Then under this rotation angular momentum
eigenfunction 'tJlj is transformed to
R"IJ"!" = D 3 'W'
T, m'1n,?
m-'
(r. 35)
It is important to notice that, if aU rotations are expressed in the original
coordinate system,A
R = e-i«Jze-ip.,TlIe--iyJ.,
( 1.35a)
In the present instance thefotation is. one \vhich carries the. z-axis, into
the direction ft. Hence the third Euler angle y is irrelevant (it introduces a
phase e- imy in D). The preceding choice of phase is equivalent to setting
y = O. It is clear then that
1 ( e-iCP/2coS19/2 eif/J/2SinfJ/2 ) _
D1A.(tp, 0,0) = . I. . , (1.35b)
_e-1.Q;/2 sin {)/2 eq;/2 cos lJ./2 '.;,:
A two-component function which transforms under rotations by the DYi.
matrix is called a spinor. Thus the pure spin functions X"n and X:t are
spinors. They will be referred to as Pauli spinors. The index which labels
the components will be referred to as the spinor index.. It is seen that
D( rp + 21Tn, f}, 0) = D( cp, {} + 21Tn, 0) = (_)11 D( cp, '0, 0)
t See Chapter I V of reference A.
NON-RELATIVISTIC SPIN THEORY
17
\Vher 1'1 is an inger.. For odd.n the complete rotationcarries1p to --V'
This two-to-one correspondence of the unitary transformation D and
three-space rotations is. characteristic of spinors.
6 SPIN PROJECTION OPERATORS AND POIARIZA1'ION
It was mentioned above that any set of two spinors like X 1: in (1.33)
forms a complete set in the two-dimensional spin space. As we have seen,
this means that any two-componnt function can be expanded as a
linear combination of these two. Alternatively, there is no non-vanishing
two-component function orthogonal to both x+ and X_a
If we write Xa. for X:i:' so that <X has two values, the tatements above
imply..that fot. any spinor -q; we can write
'Y = I c,/xf¥. = I (XIX, 'F}j(Z
« <X
In terIns of the spinor components,
. 'Yp = I X'YyX: = I bpy'Y y
<xv v
Th.erefore we obtain the completeness relation
I x;xx= lJ pv
CIC "
( 1.36)
Of!
I'x tlx xfl.* = 1
oX
(1.36 / )
'where X indicates a direct product of the t\VO spinors.
Consideriqg one term in (1.36), we define a pair of matrices pry. by
prz = Xf!. X x«*
(1.31)
or
pa = X IX X aX
p p ",v
( " 1'"' 7 ')
1.... /
Dropping the superscript (l. for the moment, we investigate the propertics
of P. First ,v observe that P is idempotent: that is, p2 = P and therefore
pn = P (n > 0); thus
(P2)UA = I PtlpP p ). = I XqX:XpX
p p
= ltTX = P tll
In order to understand this result we evaluate P for the spin function
(1.33a, b). Clearly,
p = I[Tr P + (Tr Pa).a]
18
REl.-ATIVIS11C ELECTRON THEORY
But
Tr P = I xpx = 1
p
and
Tr Pa = I Pp;.a).p = 2: xa).pXp
PA PA
= (x, ax)
Here, as elsewhere, the subscripts on spinors are spinor indices and on
tJ1atrices are corresponding ro\v-COlUn111 indices. "lith the results we find
directly from (1.32c)
p+ = x+ X X = !(l 'I- 0-6)
p- := x- X X = !(l - a-6)
(1.37a)
(1.37b)
As expected,
1)+ +P- = 1;'
(p:l:)2 = pi:.
(1.38)
It is evident that
]:J+P- = p-p+ = 0
(1.39)
and that
P+x- = 0, P-x+ = 0
The interpretation of these results is quite simple. If
'Y = 2 Ca.xa;
is an arbitrary superposition of the t\\lO spin states, tllen
P+'Y =c+x+, P-'Y= c-x-
and the operators p.:l:: project from 0/ the parts corresponding to + and -
spin along ft. Since p-J:X:f: = X:1' the idempotent property is obvious.
rrhe IDutuaUyexclusive character of P+ and P-- (viz., 1.39), is an expression
of the fact that there is no overlap in the portions of spin space proje(te(l
by these two operators. The exhaustive property, P++. p- = 1, is an
evidence of the fact that the two projected' subspaces together constitute
the whole spin space. In other words, fro.ffi a conglomerate of spin states
p+ projects or selects one state (A =- 1.), P- projects the other (l == --1),
and together these constitute the complete set of spin states. .
In general terms, if a projection opertor P exists, tbat is, p2 = P, then
P' = 1 - P forms with P a con1plete set (}f projection operators. 1"'hus
(1.40)
p + p" = 1,
pp' := }")']' :.:::: 0,
p,n. = P' (n:> 1)
"rhe projection operators given in (1.37a) (;nd (1..37b) are Pauli spin
. .
projection operators. As is to be expected, they ,"vill be very closely related
to the spin projection operators for a relativisti parti:.]e in the franle of
reference in which the particle is at rest. In connection \Jvith this discussion
NON...RELATIVISTIC SPIN THEORY
19
it should be recognized that projection operators for other dynamical
variables (for example, the energy) can be defined; see section 19. FinaJIy,
it is to be noted that for any matrix P vlhich fulfills p2 = P and P =1= 1 2 )
tbe determinant of P (det P) = 0 as will be readily verified by the reader.
rhus 1''1 is singular and p-l does not exist. In fact, the assumption that p-l
does exist leads to P = 1 2 in1n1ediately, but this does not yield a sensible
set of projection operators (that is, P' = 0 would follow in. this case).
It should now be fairly clear how the poJarization of a particlt Vtrith
spin -! is to be defined. If we again consider a state like (1.40) the polariza-
tion f!lJ will be defined bV 5 )6
.J
f!IJ = ('Y, a!> = Tr aP'Y (1.41)
('Y, 'Y) Tr P'Y
where the projection operator Pq: is
( P'I!-) p;' :== 'P- p \f"
If a beam of polarized particles is detected by a device which is sensitive
only to spin projection along :i:fi, the response of this device is proportional
to
where
:t = ("0/, P+o/) = !('Y,o/)(1 :i: EP.ft)
J+ -1- 5._ = (\f', 'F)
(1.42)
,
In general, the con1ponent of polarization in the direction fi is
&,.ft. = (0/, a'il 0/) = 1<:+ 1__Jc.:: r (1.43)
('Y, '0/) Ie + 1 2 + Ie _12
so that &'.0 = :l: 1 for states with (O'.ft)AV = :i: 1. It is to be emphasized
that this definition of polarization does not carry over without modification
to the relativistic case; cf. section 20.
7. ELECTRON IN A CENTRAL FIELD
Spin...Orbit Coupling
The main pl1ysical assumption o the Pauli theory is that the Hamiltonian
describing a system of particles is just the usual Schrodinger Hamiltonian
plus an additional term representing an interaction energy with the spin.
For a single p3:rticle this term is
Hsp = -fJ.-JIt' (1.44)
where fL is the magnetic moment operator (1.2) and JIt' is the magnetic
field at the position of the particle" Where there is only an external
20
RELATIVISTIC ELEC1'RON THEORY
magnetic field :!/e, Eq'.{1..44) gives the.cntirespinenergy. However,vvl1.i.t
there is also an electric field an additional interaction term of relativistic
origin arises. For anelectron moving.in an.. electrost.atic.field8,'ass
in the laboratory reference system or the reference 'system in which th
. aton1.as a whole is at rest, there is a contribution 'to the field d'l'givenpY'
l .= ! X 8
c
This corresponds to a.precessionofthe spin axis. around. the. fieldl
with Larmor precession frequen(;y
WI =el/mc (1.44')
.and a contribution
.
Hp = Iiw1-s
(1.44 / ')
to th.e couplin.g energy. However, this is not thetotalspinenergy.t<As
the electron Inoves in the field 8, it.undergoes an acceleration a -e8/m
and in time dt the velocity changes. from v to v +dvwith dV .=aat.
This change of the electron reference frame with respect" to the atom
reference .frame will introduce an additional precession of the spin 4 axis.:-,
It issl1Pwnil'l Appendix B thatintimedtthe reference frame at,achedto
the electron rotates through the angle
1 <:"
dO= -- (v X dv) (v )( dv)
v 2 2c 2
where /= (1 - v-2Jc 2 )-1A.. Hence the additional precession frequencyis
. 1 e .
(1)2 = - vX a = - (v X 8)
2c 2 2mc 2 .
Thus the total precession frequency is
f.A) = w 1 + (»2 = - (v X 8) = !wI
2mc 2
The total pin interaction energy is then
Ii
Hsp = nrots = - v X \IV
2mc 2
where we use eO = - V V. If V is a potential energy arising from a
central field,'
. 1 dV
Vv = -- r
r dr
t The follo".,ving discussion leding to Eq. (1.45) is based directly on the work of
L. H. Thomas, :reference 1..
NON...RELATIVISTIC SPIN THEORY
Then, using v = plm where
21
Ii
is =-\7
i
is the linear momentum operator, we find
Ii 1 df,-r _
Hsp = -- s.(r X p)
2n1 2 c 2 r dr ' -
We use small Jetters for .angular mOlnentum operators of a single particle,
and this becomes -
fz2 1 dV
Hsp = ? - --- s"l
2m 2 c'w r dr
(1.45)
where IJi is the orbital angular momentum operator.
The total Hamiltonian is nowt
H = jJ2 + V + Hsp
2m
and it is required to find the eigenfunctions of H. 1hi& win be done below
in an exact manner for the spin and angle dependence or the vvave function.
(1.46)
Pauli Spinors in a Central Field
In the absence of spin coupling the Hamiltopjan
. ft2
1-10 = - + VCr)
2m
commutes with Sz and lz. Therefore the \vave functions for v;dllch Ho, 1 2 ,
lz,and Sz are simultaneously bdiagonal are of the forrn
1fo= R(r) y;n(r) X m '
(1.47)
t By considering the limiting case of the Dirac equation, it will be shewn in section 22
that there are two additional term of the same order of magnitude a!: Hsp which shQuld
be added to (1.46). ,These ar'b
dE-I = [.-ieli 8.p + V p 2]/4m 2 c 2
The first term has no classical analogue. The second is a correction due to the variation
of mass with velocity, that is, a mechanical effect of relativity. l<;Teither of these terms is
spin-dependent, and for a central field they give merely a displacement but not a splitting
of the unperturbed magnetic sublevels. For a Coulomb field the first term gives a level
shift in first-order perturbation theory only for s-states and can be replaced by
Ji 2 e 2 Z c5(r)
2 m 2 c 2
22
RELATIVISTIC ELECfRON THEORY
. where X rn ' are the Pauli spin functions defined in (1.29). In (1.47), R(r) i
a radial function, Y(r) is the spherial harmonic which is the eigenfunction
of 1 2 and lz with eigenvalues l(i + 1) and m respectively_ The. phase
convention adopted is given by the explicit definition
m [ 21 + 1 (1 - nl)! J (_ei'P sin f})m ( 1 d ) l+m 2 £
Yz = --- ,--- (cos {) - 1)
41r (I + m)! 2 l l! d cos f)
(1.48)
and consequently
y;nX = (- )mYl--n
These functions are orthonormal:
(1.48')
j ymlxyrn2 sin {} dO dm = .
ll!1 'r ills mlm2
With Hsp present, neither l nor S commutes with H. Writing
J.s = Izsz + !(l+s_ + l_s+)
and using the algorithm
(1.49)
(A, BC) = (A, B)C + B(A C)
(1.50)
\\:" find
(Sz, 8-1) = i(llI - If/sa;) = -((" s-I)
Therefore Sz + Iz = Jz does com.mute with H. In addition, j2 = (I + S)2
commutes with H, and this is redily seen from the fact that this operator
commutes with Ifo and with any function of r while
2s.1 = fa - )2 - 8 2 (1.49')
which comn1utes with j2 since (1 2 ,lk) = 0 and (S2, SkY = O. Consequently
the required eigenfunctions simultaneously diagonalize .H, j2, jz as well as
1 2 and S2. Since the functions (1.47) form a cotnplete set we write
''Pf = R?(r) I cm(j) yr- m x m ;
m
m = ::J:l
(1.51)
where ft is the eigenvalue of jz. Thus
J % 1p == p,1p
automatically. Applyin; j2 to (1.51) and using (1.49), (1.49'), and (1.20'''),
we obtain the result
I [1(1 + 1) - j(j + 1) + t + 2m(p - m)]c m rr-n1 X m
M
+ [(I + !)2 - ft2] I C- m yr- m x m = 0
m
NON..RELl\ TIVIS'T(C SPIN THEORY
23
Since yr-mx m are linearly indepeildent, we obtain two linear 110rno-
geneous equations in c, c_}-2. Setting the determinant equal to zero
gives the resuJt
1(1 + 1) -- j(j + 1) + i = :!::(l + })
The two solutions are
j=l:t:!>!
which is the 'usual result of vector addition of angular, momenta / and !
Also
C1A(j) [(1 + -1)2 - ,u2J!ri
-........ = ------------
c-IA(j) 1(1 + 1) - j(j + 1) + f." + !
Normalizing the radial and spin-angular functions separately, that is,
L IC m (j)1 2 = 1
m
yields, \vit.h the conventional choice of phases,B
( I + IJ + 1 ) !1:
c(l + l) = .. = c_IA(l - !)
, 21 + 1
( 1. - It + 1 ) Y2 ,
c_(l + !) = = -c(l - tJ
\ 21 + 1
Thus the required eigenfunctions are
. ( ' C [ + jt + 1 )-i Y: Jl- )
Jl _ Rl+ !,;,,. l
'f'1+!ti - (21 + l)14\(l - {t + !)'" y/,+!ti
R ( - ( 1 - 11. + .l)}i' Y;U-1 )
p, l--!4 r- 2 L
'f'1- = (21 + 1)!ti (l + {t + !)!ti yt+!ti
(1.51a)
(I.51b)
These are the Pauli central field spinors. In later discussions we shall also
refer to the spin-angular part (tp exclusive of the radial functions R) as
central field spinors.
The spin-orbit energy is reedily obtained by using first-order perturba..
tion theory for the radial part of the problem. For a Coulolnb field
V(r) = -Ze 2 /r
and the additional energy due to Hap is
Ze'Ji" < 1 ) \
H sp ::'. - (8-1)
2m 2 c 2 ,.a.
r
24
REIJATIVISTIC ELECTRON THEORY
where the angular brackets are diagonal matrix elements.. Also m,.' is the
reduced mass of electron and nucleus.. For hydrogen-like orbits of
principal quantum number none finds c
< 1 ) (me2Z/Ji2)3
r = n 3 1(l 1)(1 + !) "
so that
"
Hsp = a(s.l)
= !aU(j + 1) - 1(1 + 1) - iJ
where
1 (a.Z)4 m c 2
a=- 7'
2 n 3 1(1 + 1)(1 + !)
and oc = e 2 /Jic is ,the fine structure constant. Each unperturbed level with
quantum numbers n, 1 splits into a doublet with the lower level having
j = I - 1-; that is, the doublet is normal for a single electron. The
splitting is
AE =.(H8P)1=Z+" - (H sP );=Z-!1 = a(l + t)
and the center of gravity of the doublet is unshifted since
'1 (2j + l)(H sp ); = 0
i
Anomalous Zeeman Effect
In the presence of an extern.al magnetic field K = curl A, the total_
Hamiltonian is
H = ( p + A ) 2 + V - JL' ( K + i. v X 8 ) (1.52)
2m c. 2c
In the Pauli approximation it is consistent to neglect the A2 term. Then,
for a homogeneous field, A = i(K X r) and div A = o. Thus
H = Ho + H'
-+2
Ho = L .+ V
2m
H' = .!!- A.p - p. . ( :K + i. v X 8 )
me 2c
(1.53)
The last term in H' gives the spin-orbit coupling. This can be written
as-} as before. The remaining terms due to the external field can be
written in terms of I and s so that
H' -= ,uo.(l + 2s) + al.s
(1.54)
NON-RELATIVISTIC SPIN THEORY
2S
ltis seen that with :Ye along the z-axis j commutes with H' and H.
The secular determinant, using the representation (1.51), is
E + 21 + 2 £ _ E
+ f' 21 + 1
€ [(1 + 1)2 - ,u2]
21 + 1,.
E_ + 2€1 '- E
'" 2l + 1
=0
£ [(I + }-)2 - ,u2]
21 + 1 .
where € = ilo:YE and E:f: are the (zero field) energies of the states with
j = I :I:!. Also p, is the eigenvalue of jz. The energy values are then
E = !(E+ + E_) + €/A:l: [( j + 21: 1 E + ( ;r T' (1.55)
where the ::I:: sign is associated with the level for which E = E:J: in the
limit :YE --+ 0: E+ = Eo + tal and E_ = Eo - ia(l + 1) and Eo is the
eigenvalue of Ho.
The result (1.55) shows that the member of the doublet with angular
momentum.i (in :YE = 0 limit) splits into 2j + 1 (non-degenerate) sub-
levels. Levels with the same ft do not cross and'j in general, E is an
increasing function of fl. For an s-level (I = 0) Eq. (1.55) does not apply.
Instead, from (1.54),
E.= Eo + 2£1', }t = :l:l
and Eo is the energy in the absence of the field. Thus for a 2PY2 ls
transition the Zeeman pattern will consist of four separate lines instead of
the Zeeman triplet expected without, spin. c
8. COUPLING OF ANGULAR MOMENTA t
.
The Vector Addition Coetlicients
The discussion of the preceding section shows how the eigenfunctions
of orbital and spin-angular momentum can be coupled to form eigen-
functions of j2 and jz \vhere j = I + s. This procedure can be generalized,
and it will be useful to do so for subsequent considerations. Consider
two vector angular momntum operators jl and j2 operating in different
spaces. The operators j; and jiz are diagonal with eigenvalues ji(ji + 1)
and n1 i respectively in the decoupled representation
1nf!11 'JJ m 2
T .11 T 12
Obviously JIZ + j2z is also diagonal with eigenvaluem 1 + m 2 ';'
t See Chapter III of reference A.
26
RELATIVISTIC ELECTRON THEORY
1"'.he required representation. must diagonalize j2 as well as j, j, j where
j = jl + j2
is alsu a vector angular momentum <)perator G rhis coupled representation
is obtained from the decoupled one by a unitary transformation where the
elements of the unitary matrix depend on jl' /2' j, m 1 , m 2 , and m where the
eigenvalue of j2 is j(j + 1) and m is the eigenvalue of jz. They are denoted
by C(jl}2j; m 1 m 2 m), the Clebsch-Gordan or vector addition coefficients.
For brevity they are sometinles referred to as Ci-coefficients. Thus
'PI} t. = ' r ( ; J . J '. li1 m m ) 111m} 'I 1j m?,
T j k "-".; 1 2 , 1 1 2 T jl T;2 ...
111 ffl?,
(1.56)
By appl)'ingjz, = jlz + j2z to (1.52) we find that
I (m - ml - In 2 )CUJJ; mlmm) 1fJf,.t 1p'J;.2 = 0
ml ffl 2
Since each term in this equation is linearly independent, it follows that
C(jlj2j; m1mgm) = 0 unless
ml + m2 = m
Hence one of tIle indices is redundant.. For instance, the last projection
number can be omitted with the understanding that it is the sum of the
other two. Then I
1pj = I C(jJJ; ml,m - ml) 1pii 1 1p-ml
ml
and only a single sum is involved.
From the unitary character of the transformation it follows that
I C(iJ; 1n 1 ,m - m 1 ) C(jlj2.i'; m1,m - ml) = 6;JI
11&1
(1.57)
and
I C(jLili; m1,m - m 1 ) C(jlj2j; m,'n' - m) = mlmi<5mm
;
(1.58)
Thus the nlatrix of C-coefficients withj labeling the rows and m 2 = m - m 1
labeling the columns is its own inverse.
The results of section 7 give the C-coefficients for jl = I, j2 = ,.
Arranged in conventional form these are
! -I
l+!
[ 1 + m + t ] !,.,
21 + 1
- [ I - m + I ]
21 + 1
[ I - m + i ] '
21 + 1
[ ' + m + I ]
21 + 1
(1.59)
C(llj; m - m2,m2):
l-i
NON-RELATIVISTIC SPIN THEORY
27
By enumerating the possible m values of aU states whiell can be formed
from the two states jl( -jt < m 1 < jl) and j2( -j2 < m 2 < }2) it is seen
that quite generally
Ijl - j21 <j <jl + j2
.
and that all possible j values, differing by an integer, which occur between
these limits are possible. This relation between jl' i2' and j is called a
triangular relation. That is, the three numbers jlj2j form the sides of a
triangle and either all three are integers or one is an integer, the other two
half-integers. The triangular relation is often abbreviated by the symbol
Jl(jlj2j).'
Properties of tbe Spin-Angular Functions
The wav function (1$51) is now written
1/f"! = R . ( r' "" C ( fl J .. JL - m1n) y#-m X na.
T1 ,.Ik I 'r > t. Z
m
(1.60)
Of course, "p) also depends on I, which is a good quantum number giving
the parity as well as the orbital angular momentunl" It is obvious from
section 7 that 'lJlf is an eigenfunction of Gel + 1 with eigenvalue
j(j + 1) - 1(1 + 1) + t = (j + !)2 - 1(1 + 1)
This number will be denoted by the symbol -1(. Thus
I for j = I - t
1(=
-1 - 1 for j = I + t
Therefore I( takes on all integer values excepi zero. We observe that
II<I == k gives the value of j according to
(1.61)
j=k-i
(1.62a)
In addition, specification of I( gives 1 or the parity of the wave function.
The latter is
'Tr z = (_)1 = (_ );+SIC
(1.62b)
where
SIC = K/k
(1.62c)
is the sign of K. It is now evident that the use of K introduces an economy
in the notation since its value gives both j and I:
K for I( > 0
1=
-K - 1 for K. < 0
(1.63)
28
RELATIVISTIC EL.ECTRON THEORY
Thus I is a function of K. Where 1 appears in the sequel its value is defined
by (1.63). In terms of spectroscopic notation 1<: = -1. 1, -2, 2, . . .
corresponds to s, PIA' PYi.' d4.') " · .. states.
We also introduce i = I_Ie; that is,
K - 1 for Ie > 0
1=
- K for K < 0
(1.64)
For a given j the two possible ,( values are x.(j + i). It is also seen that
1-1=81(
j == 1 - IS 1C
'The spin-angular function in (1.60) is now written as x::
x = I C(l!j; fJ - m,m) yr- m x m
1n
(1.60')
From the above,
(0-1 + l)X = -KX
Another useful property of these spinors is
rt' X JL = - X #
V r IC. - K.
(1.65)
(1.65')
where
rf1,. = I XiO'i
i
The proof of (1..65') follows: a,. is a scalar operator so that (1rX must
belong to the same j and it as X!:. That is, j2 clearly commutes with (J,..
For jz we have
r(jz., a r ) = (/z + Sz, xaa; + YO'y) = 0
since (lz, x) = iy, (lz, y) = --ix, (sz, (1a:) = ;(111'" and (sz, (]11)'= -i(f. Since
O',.,has odd parity it follows that
arX == ax': I(
where a 2 = 1 since (1; = 1 by (1.26). To evaluate a we can take r along
$e z-axis" Then setting {} = 0 in (1.48) we find
( ) J
y;n(es) = 21 1 ' b mo
and we obtain
( 21 + 1 ) }i
x= = C(llj; O#)X#
\ 411"
Thus
a(21 + l)H C(ltj; 0#) 2#(21 + 1) C{l!j; O,u)
NON-RELATIVISTIC SPIN THEORY
29
For all four possible cases j = I :J: I, /1, = :l::t we find a = -1, thus
establishing (1.65').
As an application we consider the expansion of a plane wave x m exp (ik-r)
into spherical \vaves. Such an expansion is useful in problems of scattering
and angular correlation. 8 For a free particle tIle radial function, llitherto
denoted by Rj(r), is a spherical Bessel function:
R;(r) =j!(kr) = CJ J!+H(kr) (1.66)
and J Z + 1A is the standard Bessel function. Thus Vie write
X m exp (ik-r) = ! aKJLjl(kr)x
KJ.I.
We use the Rayleigh expansion
(1.67)
exp (ik-r) = ! i L (2L + l)jL(kr) PL(cos 0)
L=O
where e is the angle between j{ and r; with the addition theorem of the
spherical harmonics this becomes
exp (ik-r) = 41T! i I 1L(kr) Jr!lX(k) Yf(r) (1.68)
LM
From the orthonormality of the X we obtain
jZQKJL = (X, X m exp (ik-r))
and \vith (1.68) and (1.60') this gives
jtQKJL =47T ! C(l!j, fl - T, T)Tm:2 iLjLYfX(k)Ll(;I'Jl-T
T LM
or
80 that
X m exp (ik..r) = 417! ilC(ltj;ft - m,m) Yi-mX(k)jz(kr) X
KJ.t
a KIl = 417i l C(llj; p, - yn,m) yr- 1nX (k)
(1.69)
(1.70)
For Ii along the z-axis this specializes to
Xraeikz = (41r) ! ;Z(21 + 1)1A C(ltj; O,m)jz(kr) Xr:
K.
= (27T) ! iZS:+(2j + 1)1A jz(kr) Xr;:
K
(1.70')
To obtain plane waves with the average spin direction along ii, that is o.ii
diagonal, the transformation with the D matrix can be carried out just as
in section 5. Then the X m is replaced by m,Dtn(ft)Xm' so that-in (1.70'),
for example, S;:+ X": is replaced by DmS:?-' + i X:' and the result summed
over m' = :l::l. .
30
RELA TIVISfIC ELECl"RON rrHEOR Y
PROBLEMS
1. Show that it is impossible to construct a non-vanishing 2 by 2 matrix
which anticommutes with each of the three Pauli matrices.
2. (i) Evaluate
a.AI a-A 2 a.As
in the form a + b.a.
(ii) Find the trace of
a.l a-A 2 a...4. 3 a-A..
(iii) Show that
C'! a-A - .A. = iA X 0' = A - a..A a
3. Show that if a nlatrix is idempotent, i.e.. p2 = P and P -:/= 1 2 , then tbe
determinant of P is zero. Thus a projection matrix is singular..
4. Solve the problem of the anomalous Zeeman effect using the ftecoupled
wave functions (1.47) as zero-order solutions.
5. From Eq. (1.35b) it is seen that
D!4(OD 0) D( 0 0) = DJA.(fPD 0)
but
D!4( 0 0) D(O{j 0) =F D2(q;{} 0)
Explain why both of these results should be expected.
6. Show that it is impossible to find a representation of the Pauli a-matrices
in which (a) all three are real, (b) two are pure imaginary (i.e., O' = -(]k) and
one is real.
7. If the numbers aik are the elements of a 3 by 3 orthogonal matrix, so that
:}:aik'1ik = ii
k
and
I,aiiQilC = d ik
i
then prove that
O' = ItLtiO'j
i
satisfies the same cOlnmutation rules as "i:
's 1 ' I ' I I .. I ( - -I- k)
u j =, O'SO'k = -O''kuS = 1€;kZO' J -r
8. Show that there is no 2 by 2 matrix which commutes with a.A other than
the trivial cases of the unit matrix and a multiple of a-A itself.
9. ....An electron in an atom interacts with the magnetic field produced by the
nuclear magnetic moment ILN- The vector potential of this magnetic field is
A = IJ.N x r = fJoN x r
:r 3 r 2
Show that the interction energy is
z:r eli {4 "' ( ) 3fLN.r s-r - P- N" S } """
nsp = - 7TIJoN- S () r + I 3
me r
NON-RELATIVISTIC SPIN THEORY
31
10. p"'rom the fact that an arbitrary two-component spin or is an eigenfunction
of Sl show that S2 must be diagonal.
11. Evaluate a"x!:x where xf: is the spin-angular function for central fields.
In particular, sho\\t that (J1/X!:X is, within a phase, x; /.J 0
REFERENCES
1. G. E. Uhlenbeck and S. A. Goudsmit, Naturwiss. 13. 953 (j 925); lValUie. 117, 264
(1926).
2. W. Pauli, Z. Phvsik 43, 601 (1927).
"
3. A. H. Compton, J. Franklin Ins!. Aug. 1921, p. 145.
4. 1-1. A. Kramers, Quantunl Mechanics, North Holland Publishing Co., Amsterdam)'
1957. .
S. U. Fano, Revs. Mod. Phys. 29, 74 (1957).
6. H. A. Tolhoek, Revs. Mod. Phys. 28, 277 (1956).
7. L. H. Thomas, Nature 117, 514 (1926); Phil. Mag. (VII) 3, 1 (1927).
8. L. C. Biedenhan1 and M. E. Rose, Revs. Mod. Phys. 25, 729 (1953).
II.
RELATIVISTIC QUANTUM MECHANICS
OF FREE PARTICLES
9. POSTULATES OF THE THEORY
The postulational basis of the relativistic electron tory has been
discussed by many authors. For example, for this as well as other questions.
reference may be made to the famous article of Pauli. D Although not'
generally stated explicitly, there are certain postulates which are comlnon
to quantum mechanical theories in general, and since they are of decisive
inlportance in guiding us to a relativistic theory they are dicussed below.
F or a more complete discussion the reader is referred to th work of Dirac. t
The postulates which follow apply quite generally to particles interacting
with fields as well as to free particles. However, it is once nlore to be
emphasized that th.ese fields are taken as given quantities and are not
quantized. The result is a single particle theory.t We list the postulates
below, deferring the discussion of them to the end of this section.
1. The theory shall be forn1ulated in terms' of a field, quantitatively
represented. by an amplitude function 1p, in such a way that the customary
statistical interpretation of quanturn phenomena will be valid.
2. The description of physical phenomena in the theory will be based on
an equation of motion describing the development in time of the system,
or of the field amplitude "p..
3. The superposition principle shall hold, and therefore the equation of
rr}otion must be linear in 1p.
4. The equation (or equations) of lnotion must be consistent with the
t Reference E, especially Chapter V.
t Often referred to as a c-nulnber theory in contrast to the q-number theory with
non..commuting fields in which creation and annihilation of particles is explicitly
provided for.
32
FREE PARTICl,E QUANTUM MECHANICS
33
principle of special relativity. t This, it\vill be seen in section 14, requires
that they may be written in covariant form as, for exan1ple, the Maxwell
equations of classieal electrodynamics.
5. In view of postulate 1 it n1ust be possible to define a probability
density p such that p is positive definite:
p>O
and the space integral of p has the properties
f p d 3 x = relativistic invariant
.!!.. f p d3X = 0
dt
These requirements permit a Lorentz-invariant meaning to a nOftnalization
condition such as
(2.1)
(2. 2a)
( 1 ....'11 )
......kD
f P d 3 x = 1
6. The theory should be consisfent with the cOf!espondence principle
and in the non-relativistic limit should reduce to the standard forIn of
quantun1 n1echanics already found applicable at low velocities. Further-
more, in the non-quantum limit the thory should yield the mechanics of
special relativity.
Postulates 1 and 3 appear to be necessary in view of such experimental
facts as scattering and the attendant difIraction effects observed in SUCll
phenomenao The tp-function referred to will be again caned a \vave
function. It will, in general, depend on the four space-time coordinates .'t p
and may be a multicomponent Vv'ave function. The latter shou1d be
expected jf the theory js to account for spin properties of the electron
(cf. Cha pter I).
Postulate 2 implies, as Dirac E has shown, that there exists an 0Ilerator
equation of the form
01.jJ
H1p = ili-
at
This gives for the time development of the systeln
. 00 1 I _ i H t ) n
1p(t) = e- tHt / 1 1p(O) == 2 - ( -- 1jJ(O)
n=O n! h
where 1p(O) refers to the function 1p at time t = o. Thus H/ili is the time
displacement operator; H itself has the dimensions of energy From the
general relation between time and energy in classical mechanics, including
(2.3 )
(2.3')
t General relativity, so far as is known) plays an extremely negligible role in typical
quantum mechanical processes. An outline of the necessary constructs of special
relativity is given in Appendix B.
34
RELATIVISTIC ELECTRON THEORY
specia1 relativity, we must expect H to be the energy operator. In (2.3')
we have assumed II to be explicitly independent of time; this assumption
is necessary for a system in which energy is conserved.. In vie",! of postulate
1 the scale of 1p, as. n1easured by its norm (1p, 1p) for example, should not
change as long as the system is left undisturbed. This iInplics that
exp (- iHtjJl) is a unitary operator and therefore that H is hermitian.
This is at least consistent with the energy identification since it is then
assured that eigenvalues of H will be real.'
In connection \vith postulate 4 it should be remarked that the occurrence
of the first thne derivative in the equation of motion will impJy that the
space derivatives must also occur to first order. The more or less obvious
requirement of sytnmetry in all four space-time variables is clearly not
.fulfilled by the non-relativistic form of quantum mechanics. Although
this symmetrical appearance of the four x /-l in the equations of motion vv'ill
actually be realized in the form of the theory to which one arrives, it must
be understood that it is not a sufficient condition for relativistic covariance
and that this covariance must actually be demonstrated, as it will be.
Postulate 5 needs t\\'o comn1ents. First, the positive definite character
of p implies that we speak of a particle and not of a charge den5,ity. It is
not clear a priori, in a given theory, whether the goal stipulated in (2.1) is .
attainable. For instance.. for charged spin zero particles only a charge
density can be defined.! The second remark is to the effect that (2.2b) is
assured if a continuity equation exists and if?jJ vanishes sufficiently strong1y
at the boundaries of the system. That is, a particle current density j must
exist such that .
div j + op = 0
at
(2.4)
Then, by Gauss' theorem,
:t f p tFx = - I div j t:f3x = - Ij" dS
where dS is an element of the bounding surface and jn is the .component of
j along the outward normaL The requirement that this vanish can be
stated explicitly only after j has been defined in terms of the wave functions.
I-Iowever, in a general way, the time independence of the volume integral
of p is assured if in vanishes sufficiently strongly on the infinite surface
bounding the physical region. '
If, after j and p are defined, it can be shown that j = sand icp = S4
form a four-vector, sl-l' the continuity equation can be written in the
Lorentz invariant fornl,
asp' -
-=0
oXp.
(2.4')
FREE PAR'TICl.E QUANTUM IvlECHANICS
35
so that the relevant staternents of postulate 5 will not depend on a particular
reference frame. This requirement should therefore be added as an
additional condition on j, p. (>f course, (2.4) has the usual interpretation
that a particle cannot disappear from a volume of space unless it crosses
the surface bounding that volume. As we shall Sl10W later, it will be
recognized that electrons can actually do this by means of pair annihilation
x
4
X.
I
,
x 4
X.
I
x.
J
"
\, d 3 x
',
Figure 2.1 Schematic representation of four..dimensional volume of integration.
The prirned coordinate system is obtained from the unprimerl system by a Lorentz
transformation.
and subsequent materialization of the quanta. Thus destruction or
creation of particles and antiparticles con tradict the conservation of
partictes but not the conservation <;>f charge. This apparent difficulty
disappears in a quantized field theory. In the questions discussed in this
book it raises no real probiem.
The invariance of the volurne integral of p can be demonstrated once
the continuity equation (2.4') is established. Assunle that s'-l vanishes on
the spatial boundaries of a closed four-dimensional space-time continuum.
That is, for large X k , S'-l -+ 0 for any x 4 . Consider a closed four-dimensional
volUllle in the fOfIn of a cylinder whose bases are X 4 = constant and
x = constant where x refers to a second Lorentz frame (Fig. 2.1). The
36
RELATIVISTIC ELECTROl THEOR):'
remaining surfaces correspond to x k = constant, which \ve take to be
large without limit. Then, by an application of Gauss' theorem, (2.4') is
transformed to a surface integral of the outward normal component of
the four-vector S Ii" On the surfces X k = constant the vector vanishes and,
it is assumed, sufficiently strongly that there is no contribution to the flux
integral. On the surface x 4 = constant the normal component is -Sa! and
on x = constant it is s. This is true if dX4/dx > o. Then the flux integral is
- f S 4 d 3 x + fS4 Jdx' = 0
or
f p cf3x = J p' d 3 x'
implying invariance under the Lorentz transformation.
Finally, with regard to postulate 6 it should be stated that no deliberate
effort is made to formulate the theory so that it is a priori evident that all
the requirements listed wiH be fulfilled. Nevertheless, these requirements
of the theory will' be seen to be satisfied without additional assumptions.
Particular attention will be given here to the last requirement mentioned:
that the relativistic wave properties reduce to relativistic but classical
orbits.t For free particles this means that the energy Wand m.omentun1 p
must be related according to
W 2 = C 2 (p2 + m 2 c 2 ) (2.5)
Here W includes the rest energy mc 2 as well as the kinetic energy W -:- mc 2 .
In quantum mechanics the wave aspcts of the field, as specified by tIle
frequency and wave vector k, are related to Wand p by the Bohr-deBrogIie
relations.
W = /iw
p = lik
(2.6 )
Since iW/c and p form a four-vector as d9 iw/c and k, these relations are
in covariant form and will. be valid in all Lorentz frames if they are vaUd
in anyone. The energy-rrlomentuln relatiol\ (2.5) then implies the foHowing
dispersion law for the relativistic deBroglie waves:
w 2 = c 2 (k 2 + k) (2.7)
where
ko = me/Ii
is the reciprocal Compton wavelength.
It will be seen that there are two branches
consideration: those corresponding to positive
(2.7')
for the waves under
and also to negative
t A detailed discussion of the non-quantunllimit is deferred until section 35.
FREE PARTICLE QUANTUM MECHANICS
37
frequencies. This existence of positive and negative frequencies is pecttll.ar
to every'relativistictheory, and the consequences of this fact win be seen-
tobe.profoupd. in the quanttun thepry.
In the limit k ko the dispersion law is characteristic of non-relativistic
quantun1 mechanics of free particles, as one should expectG I'herefore, if
the dispersion law is made the basis on which the theory is constructed,
it may be expected that the appropriate limithtg cases will automatically
emerge incorrect form.
10. THE WAVE EQUATION
The. Second-Order Equation
The dispersion law (2.7) contains, as stated above, the frequency-wave
number relationship applicabJe to non-relativistic deBroglie waves:
0) -- cko = w' Iik 2 f2m
(2.8)
corresponding to
-VV - 1?lC 2 = E r:::::!. p2J2m
(2.8')
If one introduces a wave packet,
1p = fUCk) exp [i(kr - roft)] d 3 k
it follows from (2.8) that
\
(2.9 )
01jJ !i 2
in . - = If . 1/) = - - . y2.",
t . nr j ,., T
U kIn
tIle usual non-relativistic wave equation for free particles. .
Applying the sanle technique t9 (2.7) vl,ithout the approxirnation of
smallk,weobtainwith.. (2.9) the. secondorder wave equation
( V 2 _.!.. (3.. _ k) tp = 0
c 2 at... /
(2.10)
or
( - k)1p = 0
aXJl a JL
This is known as the Klein-Gordon equation, and it has been proposed on
a number of occasions. l The form (2vlO') clearly sho\vs that the wave
equation is relativistically covariant. Even though this equation does not
(2.10')
38
RELA.. TI\tISTIC EloJEC1'R()N THEORY
have tpe fOfl'n of (2u3), it 'NiH be instructive to exaltlinc it furthtr. \\le
write the equation for 1P*:
( '\:12 1 a 2 r 2 ' )
\ v - 2 i)( - K fJ ; 1.p* = 0
By Inultiplying (2.10) with 'IP* on the le;ft and (2910 1 ') by 1p on the right and
subtracting, we obtain 8, continuity equation
(2. 10 ft )
as,.,. := 0
ax p
(2.11a)
where
( ' a?p 01p* )
S JL = const. "p* .-- - -:--- 1p
arp axp /
Because the II f.J are evidently the components of a four-vector, (2.11a) is
Lorentz invariant.. However, ,ve observe that
(2.11b)
a 1jJ '(}(p*
p ,-.".' "p* -- -- --- 1p
ot at
will not be positive definite if 1P obt;Ys onl}' the second-order equation (2.10).
'This is so because in' that eventuality lp and il1p/ot are independent. ]f at
tinJe t = 0 they are chosen so that p is posjtj,,] there is no guarantee that
p will rernain. positive. f-or this reason the second-order equation is usual1y
rejected a a description of the electron.. It is correct to reject the Klein..
Gordon dtscription as far as electrons are concerned. iro\vever, the
resol1ing is son1eV\/hat misleadlng. Fir$t, one recognizes that, if ?p is a
scalar (slngJc""eornponent) field, the second..order theory can appJ)' to spin
zero particles only. 'That it indeed does apply to such particles is ,ven
.kIloln.. However, tIle difficulty concerning the non-positive definite
character of p would appear to relYlain 1'he proper interpretation in th
case of charged spin zero particles has already been Inentioned:1. p can be
expressed as a f.1UTercnct? of two posltive de6.nite biHnear quantities of the
form CP*qJ .-. X*X, where:
, 1 d1.p
1(' ...'., -.-
t ! k
"0 UX4
I
In this fOfIH ex!)]icit fecognition is rnade ,)f the indeptndence of VJ and
Otp/ot, v/hich obey coupJed first-order equations, The interpretation of p
is then made correctly in tern1S of a charge density and the two-component
character of 'tp is connected with the existence of positive and negative
charges..
( ' (j?) ""' ,
xl
1
1J1 ---
ku
-!£
;i ,...
V"""4
(2.12)
FREf Pl\1irr(':LE QUAN1'U!v1 MECHANICS
39
The ass!gnment of the eq uation of motion (2.10) to spin zero partIcles
does not entirely c]ear up th( question at issue. I'he fact relTIa1nS that
according to postulate 6 the wave equation (2.10) must also apply for spin
i, or for any spin. "rhus, from (2.3) it would foUow that
! 2V! = __ _ H2'1f1 = (V"2 -- kh"
c 2 8t 2 Ji 2 c 2
or
11 2 = C 2 (p -J'2 + fn 2 c'2 ) {") 1 -=r )
_. \.i.... _.-
for free partIcles. If 1.p is not a s}ngle-eornpop.ent function, (2.10) appHes to
each cornponenL lfov\/ever, it i no longer true that t:ach C01Tlponentof 1P
,is deterrrdned. on(v by (2.10). In particular, it is not true that 1p and 01p/Ot
are independent. 'I'hey are, in fact related by (2.3). 'The existence of the
con.inuity equation (2.11) is not to be interpre.led in term.s of a conservation
theorem but is the direct result of the energy-nl0rnen1 uro rclation (2.5).,
Thus, for any 1jJ of the form
"P = I u+(k) exp [i(k.r - wt)J ,PI\- + f u._(k) exp [i(k.r -+ wt)] d 3 k
the continuity equation (2.11a) with sit given by (2.11 b) is automatically
fulfilJed.
The Dirac Wave Equation 3
The assumption is made that an equation of Inotion of the type (2,,3)
exists. Then the postulated property of p, that is) (2.2b), is autornaticaUy
valid with H herJuitian if
:« .'" 1 ,
P = 1fJ' tp (.::.: l'.J
Obviously, this p is pos.hive definite. f;or, jf (2 14) IS assurned 5 \\iC see that
f . op d 3 x = -- J f[ 'q}*H1/ J --.- (H'W):t1D J d3tI' ;::= (I
jot Ii.' "'!.' F. 1.
by virtue of H* = H.
Since the equation is hnear in a/ox 4 , the relativistic covan.ance inipHes
that.R is lineai in th<; spaCt: derivatives O/O.1: k or tl) rnornentunl operator.Pk
If we adrnit the possibihty that i is a 1TIulticcrnp0netrt fun.ction ,vith
components 1p).) 1-1 nrust bav( the forrn of a square m.atdx and the nlost
gf::neral forn1 of one of the clerflnts of thL" matrix in the absence of
interactions, is
H,'{a == c(Cl.r);'crj j " + fJ).(jrnc 2 (2.15)
the constants have been chosen so that the nurr1b(rs. {Xk.)..o aDd t3;. are
din1ensioniess. I'or ti partich; at rest \ve 111uSt expect the iirst .operator in
40
RELATIVISTIC ELECfRON THEORY
(2.15) to give zero when applie4 to 1p,andhence the second term. would be
associated -with what remains: the rest energy. The first ternl should
therefore be the kinetic energy operator.
'The postulated wave equation is, in matrix form,
H "" = [c«op + pmc 2 ]1p = ili d1p
at
(2.16)
,vIDell is a set of n equations where 1'1 is the nun1ber of components of 'f/J.
rfhe a..p term is an abbreviation for the sum of three terms,
.
3
-io "" ...
a.p = k (XkPk
]c=l
where j as mentioned, each of the three OC k is a square matrix.. The same is
true of fJ, and thf hermiticity of II, each Pk being hermitian, requires that
each of the four matrices ex and {J are hermitian:
cx. k = oc:
( k = 1 2 3 )
, ,
f1 = fJ* (2.17)
As stated above, this is sufficient for a continuity eq-uatlIL:. To see this in.,
detail and to identify the current density we write the hetHHtian. conjugate
of (2.16):
...,
(Htp)* = 1p*H = 1p*[ -c«op + PW C2 ] = -ili .!f
at
Multiplication of (2c16) by 1p* on the left and of (2.18) by'i.Jl art the"right
and subtraction give
(2.18)
iJi p = c( 1p*«opV' + 1p*a. o p1p)
at
= Cp.(IP*«1J')
because p is proportional to the gradient operator. Using p = -ilf\l we
obtain
d " op 0
IV J -!- - =
at
where p is given by (2..14) and
j = c1p*Cl.1p
The four-vector current density \voutd be expected to be
(j, icp) = S Jl = c1p*a p.V,1
(2.19)
(220)
where the four matrices a f.l are a (It = 1, 2, 3) and a. i == i multiplied IJY the
n by n unit I!1atrix (Jf course. it is yet to be pro'ved that s u. is a four-vector
FREE PARTICLE QUANTUM IECHANICS
41
since the properties of "p and 1p* under Lorentz transformations have not
yet been discussed. This question will be taken up in section 14.
We turn now to the connection with the second-order equation. From
(2013) it is required that
H 2 = C 2 [(OC i !X k + fXkfXi)PiPk + mC((J.,ifJ + fJtXi)Pi + ,82m 2 (;2]
== C2[jJii 1r YJn 2 C 2 ]
iden.tically in the components of p. Consequently,
OCifXk + fXJcfY.i = 2t5 ik
rxfJ + pcx i = 0
{32 = 1
(2.21a)
(2.21b)
(2.21c)
That is each of the four matrices Xi and rJ are their o\vn inverse, in addition
to being hermitian. They are therefore aU unitary.. Ivforeover the set of (Xi'
{J constit ll tes jour anticommuting Inatrices. t
The fact that the cti and fJ cannot be taken to be unit matrices (or
multiples thereof) means that tp must be a multicomponent function. The
process of linearization of the Klein-Gordon equation, which was Dirac's
innovation,3 has therefore led to the requirement of a multicomponent
wave function) wpich, as may correctJy be anticipated.. is c.onnected with
the existence of spin. However, the first striking fact is that "p cannot be a
two-component function like the Pauli spinors, because then each OCi and (J
would be a 2 by 2 matrix and we have seen that there can be only three
anticommutig 2 by 2 matrices: the Pauli a;..matrices and their transforms.
The dimensionality of the four Dirac matrices wiH be discussed in the
following section.. For the present we may observe that the number of
components n must be even. To see this we obs.erve that for each of the
fOUf Dirac matrices there is another matrix which anticommutes lith it.
Therefore, if b J.L is anyone of the four matrices and hv is a matrix which
anticommutes with hp., we have
Tr bJi. = Tr bJlb = Tr bvhJlhv = - Tr bpb = 0 (2.21')
since each b; = 1 and Tr AD = Tr BA. Thus each n1atrix has zero trace.
There exists a representation in which any b tl can be brought to diagonal
form, and, since the results b = 1 and Tr b p are independent of the
representation, we conclude that the eigenvalues of hI-'- in diagonal form
t Equations (2.21) can be written in more compact form by writing for example,
fJ = 4. But this carries the unfortunate connotation that fJ is connected in some way
with the fourth component of a four..vector whose space component is connected in the
same way with ex. The rult (2.20) indicates that "this is an incorrect interpretation,
as it is indeed.
42
RELATIVISTIC ELECTRON THEORY
ar :i: 1 and tbat there are as many + 1 as -1 eigenvalues. Thus the
number of rows and columns must be even. The minimum possible
number for n is 4, and it is easy to see that a 4 by 4 representation does, in
fact, exist. For example,
« = e :).
( 12
p=
o
I)
. (2.22)
where each entry is a 2 by 2 matrix. In detail, for the representation of
the Pauli a's given in (1.22),
0 0 0 1
0 0 1 0
(Xl = etc.
0 1 0 0
1 0 0 0
The matrices (2.22) will be referred to as the standard representation.
That the standard representation fulfills aU the rules (2.21) is readily
verified. Of course, any transform
a.' = 8«8- 1 ,
/3' = Sf3S-1
(2.22')
also fulfills (2.21) and is equivalent. This type of transfOfrnation arises
whn a non-singular linear transformation from one 'if' to another 1jJ' is
made.. Then
1p' = S1p,
1p = S-l1p/
and
H 1p = i.1i (a1jJrat)
becomes, with as/ot = 0,
H"lJl' = SHS-1tp' = ili (01p' tat)
In the present instance
H' = c(ct'.p -t {l'me)
where a.' and {3' are related to ex and fJ by (2.22'). \Ve con1pare the (spinor)
expectation value (not necessarily integrated over coordinates) in two
representations of a matrix Q, where 0 is a SUtTI of terrns formed by
const'ructing arbitrary products of fY..k and {3:
,
y;'*!!' 'fJ/ = 1jJ* 3* Sfl.S -1 S 1jJ = 1jJ* S* SQ1p
If S*S = 1, that is, if S is unitary,
1.p'*Q'V;' = 1p*Q1jJ
FREE PARTICLE QUANTUM MECHANICS
43
Another generalization which gives the same commutation rules is
ex = ( CX(4) 0 ) , {3 = ( f3(4) 0 ) (2.22")
o «(4) 0 P(4)
or
« = C4) 4} P = (p4) P4) (2.22")
where 0:(4) and f3(4) are the matrices given by (2.22). Thus these alternatives
are ..8 by 8 matrices. But they correspond to writing the wave equation
twice and yield nothing of additional significance.
The Covariant Form of the Wa1'e Equation
It is clear that, although (2.16) and (2.18) are in Hamiltonian form, the
time and space coordinates do not enter in a symmetric nlanner. Of
course, (2.16) could be written in the form
( -ilk (( + fJmc2 ) 1p == 0
Jl ox
J.l
where 0.:4 = i. But, in order to construct a covariant operator, each term
should be covariant and the rest energy term, in particular, should be
simply mc 2 . Therefore we multiply the preceding equation on the left by fJ
and define
'Yk = - ifJrt. k
(k = 1., f, 3)
(2.23)
Y4 = fJ
( 0 \
y,.-+ko)tp=O
oXp. ,
This will be referred to as the covariant form of the \\'ave equation.
(2.23) and (2.21) we obtain directly
to obtain
(2.24)
From
Y fl Y V + y v Y p = 2b Jl " ; ft, 'J} = 1: 2) 3, 4 (2.25 )
for the commutation rule of the four Dirc y-matrices. Of course, the
right side of (2.25) impJicitIy contains a unit n1atrix. In ,,'\ppendix D an
alternative approach culminating in the same wave equation and
commutation rules is presented.
It is clear by a preceding arguInent that the trace of anticommuting
matrices vanishes. Therefore, for alII).,
Tr Y Jl = 0 (2.26)
Also, by direct verification it i,s n'n1ediately seen that all y 1-' are bermitian.:
*
Y1-& = Y ll
(2.27)
44
RELATIVISTiC ELECTRON 1HBORY
For 1p* the covariant equation of motion is
011'* 01jJ*
- Yk - - Y4 + ko1jJ* = 0 (2.24')
oX k aX 4
This extremely unsymnletrical form indicates that for Lorentz trans-
formation properties it is not "1'* which should be considered. Instead if,
the adjoint to 1p, is introduced (following Pauli D ):,
'ip = 1p*y 4' 1p* = ipY 4 (2.28)
Then, inserting (2.28) in the equation for 1p* and multiplying by 1'4 on the
right yields
oip k - 0
- Y - O'{jJ =
ox Jl
Jl
In section 14 it will be shown that both equations of motion (2.24) and
(2.29) are covariant under Lorentz transformations. That is, in two
reference systems (2.24) and (2.29) are both valid if it is understood that
the wave functions in the two systems are related to each other by a
specified transformation.
(2.29)
11. THE DIRAC MATRICES'
In order to fix the dimensionality (rank) of the Dirac matrices we
consider the complete set of matrices which can be constructed Iforn them
by multiplication. As win be seen presently, there are 16 different matrices
Y.A (A = I, . . . 'I 16) which can be formed in this way, and we shall choose
a phase factor for each so that in all cases y = 1. Then also Y.A = y
f<?r all A.. We divide the matrices Y A into five groups as indicated below.
The letters used to label these groups refer to Lorentz transform.ation
properties discussed in section 14.
Group;S. This consists of a single nlatrix, the identity or unit matrix.
Obviously it can be formed from the Y Ii- in at least four ways: y = 1 for
eachJ.t.
Group V. These are just the four n1atrices Y Il.
Group T. These are the matrices formed by taking products in pairs.
Since y; = I and 'Y lAY'" = -y"y p when p, -=1= 11, there are just six linearly
independent matrices in this group:
if' IlY v
( =;1= 'v)
or) in detai1
'YIY2, iY2Ya, iYSY1' iYIY4' iY2Y4' iYaY4
F'REE PARTICLE QUANTUM MECHANICS
45
The first group of three can also be written, using (2.23), i(t.l 2' ioc 2 rJ.s, ;rJ.SOCl.
The sond group is -OCt, -0: 2 , and -rJ.s.
Group A. These are the four possible products formed by products of
three II/-l. Choosing the phase again as indicated above, these are: iY2YaY 4'
iYSYIY4' iYIY2Y4' and iYIY2Y3. These can be written in a more lucid fashion
by using the Ys matrix, defined below,. in the fornl iy pYs.
Group P. rrhis is the single matrix formed by multiplying all four Y fA :
/'5 = YIY2Y3Y
No other matrices can be formed from the y's in view of (2.25).
The designation "group" used above does not mean that these 16
matrices fornl a group in the technical sense (for exampJe, iYIY2 cannot be
\vrittcn as a product of two members of the set of 16). Mor€over, if the
factor i is olnitted in the T group, then there are no inverses of these
elements in the set. Nevertheless, the set of 16 matrices doe,s form a
n1athematical entity: a Clifford algebra. 5 This is synonomous ¥/ith
statements 1 through 5 below.
We now prove a nun1ber of statements concerning the 16 Dirac
matrices :4,6
Statement 1. For every Y A =f=. 1 there is at least one other matrix YB
(B =1= A) such that Y AYB= -YBf'.ll. This is obvious on inspection, and
in fact, for every Y A =1= 1, there are exactly 8 other YB \Vhich anticommute
with Y A and 8, counting Y A itself, which commute with it. Obviously the
unit matrix 1 is a member of the latter set.
Statement 2. For every Y A #:- 1 we deduce that
Tr YA = 0 (2.30)
This follows exactly as in (2.21').
Statement 3.. The Y A are linearly independent. This means that if
16
! C AY.4 = 0
. A=l
(2.31)
. then all CA = o. To prove this we multiply (2.31) by anyone of the 16 y's,
ay YD- Then
CD + ! C A YBY..1 = 0
A*B
But I'BYA =1= 1 or a multiple thereof when A =F B: Therefore on taking
the trace of this equation we find
f
Cn = 0
Since this applies for B = 1, . . . ,16 the statement IS demonstrated.
Thus the 16 r A are distinct.
46
RELATIVISTIC ELECfRON THEORY
Statement 4. An arbitrary n by n matrix M can be written as a linear
combination of the 16 Y A. 'fhe truth of this statelnent is demonstrated by
performing the decomposition. Thus
M = n1.A.Y,4
.....4.
,
and by multiplying by 'YB nd taking the trace we find
1
mA = -Try AM
..;... n
(2.32)
Statement 5. If a matrix M commutes with all 16 Y A it must be a
multiple of the unit matrix. We first write M in the form
M = mBYB + mAY A (2.33)
A=f;:B
where YB =I=- 1 but is otherwise arbitrary. Since a Yo can always be chosen
such that YBYa = -YOYB' where Yo is one of the 16 'Y A matrices, \ve can
write '
M = YaMYa = mBYer'BYa + InAYaY AYC
AB
= -mBYB + EAmA.YA; A. = ::t:l
A#B
since, for each A, YaY AllO = :i:y A. Multiplying this equation and (2..33)
by 'YB and taking the trace, we obtain
Tr yBM = nmn = -nmB
where the first equality follows from (2.33). It is then evident that
mB=O
Since 'YB was any member of the set excluding-I, it follows that M contains
no y-matrices other than 1. This demonstrates the validity of statement 5.t
Under the circumstance which has been established-that only one
element of the Dirac algebra commutes with all the rest-it i5 a property
of the algebra that it can be represented by n by n matrices v/here n 2 is the
number of elements of the set. Thus since n 2 = 16 we conclude that, aside
from trivial generalizations, the Dirac matrices have four rows and
columns. The wave function is a four-component function.
The matrices (2.22) do indeed constitute a 4 by 4 representation for the
(Xi and p. In this representation, the standard representation.
y = (: -a),
Y4 = ( _)
(1.34)
t This result is a special case of Schur's lemma. 7
FREE PARTICLE QUAN'TUM MECHANICS
47
and all equivalent representations are obtained from (section 13)
y = SyIlS- 1
For the T group described above we find
(2.35)
. ( ia jGk 0 ) ( C1l
zy iYk = . = -
o iajG k 0
:)
(2.36)
\vhere j, k, I are a cyclic permutation of 1, 2, 3. The matrix in (2.36) will
be designated by C1 io rrhat is, in general,
a = (: :)
or
0 1 0 0 0 -i 0 0
1 0 0 0 i 0 0 0
<11 = 0'2 = ,
0 0 0 1 0 0 0 -i
0 0 1 0, 0 0 i 0
/1 0 0
0 -1 0 0 (2. 7)
t1 3 =
0 0 1 0
0 0 0 -1
Each one of these three 4 by 4 matrices is a direct product of a unit Inatrix
and a Pauli spin n1atrix. The fact that the same notation is used for both
four- and two-ditl1ensional matrices should cause no confusion since the
context wiU distinguish between them.
It ",'ill be noted that all the [)irac matrices can be written as direct
products of two 2 by 2 matrics: one of these operating in the "Dirac
space" refers to the four areas of the 4 by 4: matrices delineated by dotted
Hnes belov/:
x x x x
x x x x
(*)
x x x oX
x x x x
48
RELATIVISTIC ELECTRON THEORY
The other which operates in the "Pauli space" refers to the four elements
within each of these four areas. Thus
('J.i =. Pl ai',
fJ = P3 1 2
(2.38a)
where
( 0 1\
PI = 1 0)'
P3 = ( _ )
(2.38b)
operate in Dirac space. It is to be understood that the direct product is
always implied for matrices, operating in. different spaces. A third matrix
operating in Dirac space is evidently
.'1 ( 0
P2 = i
i)
(2.38c)
so that 1, Pi' P2, Pa forIn a cOlnplete set like 1, aI' (12' era-
The Dirac matrices which have zero elements in the upper right and
lower left quadrants in the 4 by 4 array (*) are caned even in the Dirac
sense; those with zeroes in the upper 1eft and lower right quadrants are
called odd. Evidently matrices which are formed with PI and P2 are odd
and those formed with Pa are even. In a corresponding way the wave
function is written in the form
VJl
'P2 ( VJ u \
1p= = VJI)
'IP3
1f'.
where u and I refer to "upper" and "lower":
VJU = (:j, I = (:j
are each two-component spinors. Then, for example,
( _ i'fjJl )
P21Jl =
i'fjJu
while
( OVi )
U1p = P 1 G1p =
a Vi'"
From these examples it is clear that the matrices operating in the Dirac
space act 011 1jJu and 1pl while the matrices operating in Pauli space act on
FREE PARTICLE QUANTUM MECHANICS
49
the. two components in 1pU( 'If)1' VJ2) and in 1pZ( 'fJJ3, "P4). Odd Dirac matrices
couple 1ptl. and 1jJl while even ones couple 1pu with 1pu and VJZ \vith 1fJL. The
four-component 'lp, sometim,es referred to as a bispinor, will here be called
a spinor (or four-spinor). This nomenclature is justified since the four-
component functions transform under rotations in exactly the same way
as the Pauli spinors; see section 19.
.' 12. SPIN AND CONSTANTS OF THE MOTION
From the form (2.3) of the equation of motion it follows for any operator
a thatt
dO = (HO .. 00) = ! (H, 0)
dt Ii Ii
(2.39)
From the interpretation of 1J' provided by the density p as given in (2.14)
one U1USt calculate expectation values according to
(0) = J cJ3x tp*Otp
(2.40)
,
Hence it follows that the time derivative of the expectation value is the
expectation value of the time derivative.
(/ > = !!.. (fA)
\ dt dt
(2.41 )
This fors the basis of the connection with the realm of classical physics
via the correspondence principle.
For our present purpose it is more pertinent to recognize that by (2.39)
constants of the motion exist for a set of commuting operators if and only
if they commute With H. In connection with the angular momentum of
the electron we first calculate the commutator of /1 = - i(r )( \1)1 with H.
Only the kinetic energy term cu., is relevant since P commutes with Ii.
We.:tind
(1 1 , ex-p) = (J.,(11t P2) + ('/..3(1 1 , P3) = i( OC 2P3 - tJ. a P2)
or
(I, ct-p) = i(ct. X ji) =j:. 0 (2.42)
For the commutator \vith the square of the orbital angular momentum 1 2
we use (1.50) and obtain
(1 8 , (lep) = -i{(J.1[(12'PS)+ - (1 3 ,P2)+J + C(a[(la'Pl)+ - (1 1 ,13)+]
+ (1.3[(/ 1 , P2)+ - (1 2 , Pl)+]} (2.43)
t Time-dependent operators are discussed in Appendix C.
so
RELATIVISTIC ELECfRON THEORY
Here we use a subscript + to designate the antjcommutator:
(A, B)+ = AB -f- BA
It will be seen that none of the quantities in square brackets in (2.43)
vanishes. For example,
(1 3 ,Pl)+ - (1 1 , P3)+ = -iff! + 21 3 F1 - (ips + 21 1 '3)
= -2iP2 + 2(1 X P)2
= -2iPa + ! (r.pjJa - TaP 2 ) *' 0
Ii
The condition that (2.43) vanishes is that the coefficient of each (li vanishes.
It follows that the orbital angular momentum is not a constant of the
motion. On the other hand, we expect the total angular momentum to be
a constant of the motion since 110 direction in space is preferred. We
identify the vector operator for the total angular momentum as.t
j=l+!a
(2.44)
where a is the 4 by 4 lllatrix vector. This will certainly. give the Pauli
result when the non-relativisti( limit is taken. The commutator of jl with
H is C(j1' ex-,) since aJl components of 0 commute with f3 in the standard
representation and therefore in all representations. Thus
(jl «op) =-= ([1' cxp) + !( aI' a.p)
Using
c; = -YsCX = -«Yo
(2.45a)
(2.45b)
or
we find
ex = -Yrl 1 = -erys
t( 0"1' ex.p) = i(P2Ct..a - P a rJ..2)
and two sin1ilar equations obtained by cyclic permutation of the indices.
From (2.42) it follows that
(j, ex-p) = 0
(2.46)
Therefore j2 and any component of j, say js, may be made diagonal
simultaneously with the energy H. '
As a result of the foregoing consideration we must identify to == s as
the spin operator in the relativistic the<?ry.. Obviously S2 is diagonal with
eigenvalue s(s + 1) =! or s =!. This interpretation of the spin is
therefore in agreement ,vith the empirical results.
t The context should clearly distinguish between the symbol j used as the total
angular momentum operator and as the Dirac current density.
FREE PARTICLE QTJANTUM MECHANICS 51
Since j2 and 8 2 commute with 1! while 1 2 does not, it follows that the
spin-orbit coupling operator s..{ :=; iO'.1 also does not commute with H.
However, there is an operator related to a..} which is precisely the relativistic
analogue of (1.65), which does commute with .R and with each component
of j. This is the operator
K = p(a e ) + 1) , (2.47)
Obviously K COlnmutes witb {J. Therefore, for the commutator with H,
consider
({J a-I, a-p) + (13, at.p) (p (JI, a-p) + 213 a.p
(fJ O'.!, .p) == fJ( a. a.p) +
'fo evaluate this W note that. the extension of (1.26) to the 4 by 4 a...matrices
is imrnediate. T'hat is,
a.A aB =.:: A-B -1-- ja..A. X B (2.48)
This relation will be used very frequently in tIle sequel. l\fultiplying by
-Ys we get
But
a-A (l.D = a.A a.B = -ysA-B + i!l-(A X B)
For complet.eness we record the important result
(t.A .x-D = a..A O"B
(2.48')
(2.48")
"fherefore, from (2,,48'),
Then.we get
and
(u-R, (I,,,p)+ = -ys[ll)p -{- p1 + i a.(1 )( p .t Ii X I)]
But I-p -= p-I = 0 and
Ii I X Ii = -rft2 -1- (u-.p)p
Ii is X I = rp2 - (p.r)p __M i/p
I X P -1- P X I = 2ip
(K', ex.p) = 2p (l.p + 213"15 es.p = 0
1he con1mutator of K and j is obtained in a similar ¥lay
commutes with j and Te need only consider
(2.49)
Again {J
vvhile
({J a-I, I + lo) = p(a-I, I) -t- !p(a-I, a)
From (2.48) we obtain
(a e ), a) = 2ia )( I
From I )( I = il the first tern1 is
Hence
p(a-J, I) == - ifl(a X I)
(K, j) = 0
(2..50)
52
RELATIVISTIC ELECfRON THEORY.
The connection between K2 and j2 is revealed by
K 2 == (ael + 1)2 = }2 + io-l X I + 2a.} + 1
= 1 2 + a-I + 1 = j2 + !
Therefore the ,eigenvalue 1(2 of K2 is (j + !)2 or
I( = :l:(j + t)
just as in (1.62). More will be said about K in sections 26 and 42.
The Hamiltonian H = c( a-p + pm c) does not commute with the
operation of space inversion r -)0- - r. TIle term a-, is odd under inversion,
while {J is even. Hence the operator {3 times space inversion does commute
with H. This will be called the parity operator and will be denoted by tJI.ie
Since j and K contain axial vector operators which conlmute with (J, it
follows that the parity operator commutes with H, j, K. It is seen that K
gives the eigenvalue ofj2, and in section 26 it will be seen that it also gives
the eigenvalue of the parity operator in the angular momentum representa-
tion. This situation is therefore reminiscent of what was seen to apply in
the Pauli theory. The problem of space inversion will be discussed at
greater length in section 25.
There is another very obvious and very important representation. This
is the plane wave representation in which the set of commuting operators
is Hand p = -iliV. Because the eigenvalues of p constitute the vector
mo:r:nentum, the relativistic plane wave of fixed energy W is
'1p(r, t) = u(p) exp [ (p.r - Wt)]
(2.51)
In Chapter III, where the plane wave solutions are studied in greater
detail, it will be seen that there is another operator (there called ) con-
nected with the spin (analogous to cr) the component of which in any
direction can be made simultaneously diagonal with Hand p.
Of course, p does not commute with K or I.fJ. Therefore te two
representations described are alternative ones and they are connected by a
linear transformation; see section 27.
13. THE FUNDAMENTAL TI-IEOREM OF PAlJLI4,6
,
In what follows, extensive use is made of the fUt;ldamental theoreln of
Pauli. ,The content of the theorem is: If two sets of matrices I' p. and
y (p = 1, 2, 3, 4) obey the commutation rules
y 1I/'11 + Yvl'lt :::: 2pv;
, , + I' 2 .R
'Y IJ." v Y vI' Jl = U /lV
FREE PARTICLE QUANTUM MECHANICS
53
then there must exist a non-singular D1atrix S \vhich connects the two sets
accord in g to
rS = SY Il
From each set r and Yl we can build a set of 16 matrices in a parallel way.
Typical members of sets are caHed Y.A. and 1'4. respectively.t rhe theorem
also implies that
for each A..
It is first shown that (2.52) is valid if one makes the choice
yAS = SYA
(2.52)
16
S :.-= 2: y'nFYB
B=l
(2.52')
where F is arbitrary. Then
16
yASy A = 2: YYBFYBr A-
B=l
(2.52")
Each product Y AYB is) within a factor :i: 1 or :!:i, equal to SOll1e other
Inember of the set say Ye. Thus
rBf A = Aore (2.53a)
where Ao = :I: 1 or ::l:i. For each A as B ranges from 1 to 16, the 'Yo which
result constitute the cOlnplete set of 16 four by four Dirac matrices. To
see this, we assume the contrary. "fhat is, let
r.BY A = 1or c
and
rnYA = PD'YC, B #D
so that one particular matrix Yo occurs at least twice. Then
rB = AeYCYA = (Aa/PD)YD
This is contrary to the proven linear independence of the 16 t'-matrices
and so is itnpossible.
From (2.53a) it follows that
I' ..,
rBY A = ).'oYa
(2.53b)
since the rules for forming all the y' -matrices frorn y are exactly the same
as those for forming the y-matrices from y p' and the commutation rules
are the same for both sets. From (2.53b) we find
YYB = AolYa
t The previous statement in1plies that a one-to-one correspondence between mernbers
of the two sets exists such that for YAY B = Yo there corresponds the relation YY = Yo-
S4
RELATIVISTIC ELECfRON THEORY
by taking the inverse. Introducing (2,,53a) and (2.53b) into (2.52'''), we find
16
, S "'\:' ']-1 , F "
YAY A = k /\'0 Y c AcY 0
[':::::1
16
= I yoFYa= S
0=1
J By multiplying on the right by Y A we obtain (2.52).
It now remains to be shown that F can be chosen so that 8-- 1 exists 1>
'fo do this it is necessary to show first that }? can be chosen so that S -:J:. o.
If S were exactly a null matrix for all}', we could choose F to be a matrix
with only one element diffe.rent from zero:
F fJV == (j plIo flvvo
where Ilo, 1-'0 are an arbitrary index pair. Then IS = 0 inlplies that
18
I (YB)lIl0(YB)vo(1 = 0
B=l
Since this would follow for all '''0 and (1, each eletnent of the Inatrix
16
\ I (1 i b);" po l'.B
B::.::.l
would vanish. }'his again contradicts tbe linear independence of the YB- ,
lienee there must be SOll1e F for ,vhich 5 f =f=. O
1"0 denlonstrate the existence of S-l it i$ sl1o\vn that a non-zero matri S
exists sucb. tllat Ss = k, where k is a l11l.lltiple of the unit Inatrix. (oDsider
16
ti , "',.'
.:) -'= Y BG'J B
B :;;. 1
(2.54)
where G is; for the moment, arbitrary. This is constructed in a way similar
to (2..52') except for the interchange of f'B and lIB" From precisely the
same argument as led to (2.52) from (2..52') we deduce that
... l'f'
I.A = 0YA
(2.55)
This equation with (2.52) gives
'Y ASS = SyAS = JSSy A
Since ss commutes with all Y A. it is a multiple of a unit matrix by statement
5 of section 11. Thus. '
Ss= k
(2.56)
FREE PARTteLE QUANTUM MECHANICS
55
:Now G can be chosen sq that S =F 0 just as was done for Sf. Also F can
be chosen so that k =I- 0 since the assumption that k = 0 for all F leads to
16
2: SyBFYB = 0
B=1
by (2.56) and (2..52'). Again, if the ch.oice FJ'. = 0 PP.o <5""0 is made, one
obtains
16
2: (SYB)AfJO'YB = 0
B=l
B,ut this is irnpossible in view of the linear independence of the YB and the
fact that at least one coefficient in the sum above does not vanish: S =1= 0
and I'h includes the unit matrix. 1.hus k -1= 0 is possible for SOIne F.
Also, from (2.56),
det S det S#-O
and S-.1 therefore exists. From (2.52) it folJows that
Y.A = SY.A. S - 1
(2.57)
as was stated originally.
It can be further demonstrated that S is deterplined to within a numerical
factor by the two sets l'f-l and y. If there were two matrices Sl and S2
for which
y = SlY Il S 1 1
/ S S '-1
Y Jt = 2Y v. 2
then
SlY p.Sl1 = S2Y p.S;l
or
S2 1S lYP. = Yp.S;lS 1
Thus 82,- 1 8 1 corl1n1utes with the entire set of 16 matrices and is a multiple
of a unit matrix: S2 = kS1.follows at once. It is customary to choose the
arbitrary Dlultiplicative factor so that det S = 1. Then S is unique except
for a factor of modulus unity: explicitly, :!: 1 or :l:i"
14. LORENTZ TRANSFORMATIONS AND RELATIVISTIC
CO""ARlANCE
In this section we &halJ consider. e Lorentz transformations as th.ey
affect the equations of motion. O\ir explicit considerations are for the
moment restricted to the proper continuous group of transformations,
since the improper transformatio;Is are best studied in conjunction with
other considerations which. are taken up in section 25. In this connection
S6
RELATIVISTIC ELECTR.ON THEORY
it should be realized that covariance under the discontinuous transforma-
tion does not have quite so strong a basis of experimental justification as
do the continuous transformations. It will also appear subsequently that
the discussion of the present section is fully appJicable to an electron in an
electromagnetic field.
As outlined in Appendix B, the Lorentz transformation is defined by a
matrix a which connects the space-time variables in two reference systems
(primed and unprimed):
, -
x p, - a p..vxv
(2.58)
and invariance of x#x# implies that a is an orthogonal matrix
a ,.,.."a Il). = <5 vl ,
a p.vapv = 0/lP
(2.59)
or
a = a-I and det a = ::1::1
For transformations 'Continuous with the identity det a = 1. Since the X k
are real and X 4 is pure iInaginary and the same is true in the primed system,
it follows that Qik, ia4;' ia;4' and 044 are rea.l. For continuous transforma-
tions as well as for space reflection aM:> O. In the first case, since
L",(0p4YJ. = 1, it is seen that 0 44 :> 1 for space-time rotations.
Covariance of the Equations of Motion
We now consider equations (2.24) and (2.29) and investigate the
conditions under whtch relativistic covariance under proper Lorentz
transformations obtains. That is, if (2.24) and (2.29) apply in the unprimed
system, then it is necessary that 1p'(x') and ip'(x') exist such that
J'
and
(r It a: + ko) tp(x') , 0
(7.24")
oijl k -' ( ' ) . 0
;--; Y fl -'o1p X =
(,I x p.
(2.29')
We do not alter the y p, because these matrices are sinlply a devie for
writing four equations for the components of"p as a single matrix equation
(2.24). In the same way they permit the four complex conjugate equations
to be written in the compact form .29). If these four equations in the
components of vJ are written in detailed or expanded form in the unprimed
system, the corresponding .equations in tbe primed system, if l..orcl1tZ
covariance is to obtain, are realized by priming each component of 1p and.
by rel,lacing xp. by x in both o/OXp. and in the argument of the wave
I
FREE PARTICLE QIJANTUM MECHANICS
57
function. Then (2.4") and (2.29') are the conlpact forin of such equations
with same Y ft as in the system of unprimed equations.
For Q44 > 0 we write
"P' (x') = t\.1p( x)
(2.60)
where, it is assumed, A does not contain the coordihates. It is also assumed,
subject to verification, that the inverse A-I exists. Starting with (2.24")
we obtain
a
'Yp.a/l V - A1p + koA1p = 0
ox"
(2.24''')
where we have used
a '__ ox" () _ -1 0 _ a
-----a --a --
, , VII. a p"
uX Il uXp. (Jx" Xv uxv
That is, the four-gradient is a polar four-vector.
If we multiply (2.24"') by A-Ion the left it becomes the same as (2.24)
nrovided that
.I.
A -II' pa Jt"A = Yv
Using (2.59), we may write this alternatively in the form
(2.60a)
A-IYJLA = apvyv
(2. 6Ob)
The existence of a A which satisfies this condition is apparent from the
following. Let
r = a pvy,,,
Then
.'Yyl + yly = aJtvapY"Yp + a;'pap/lpYv
= 2a llv a;.p o vp = 2a iJp a).p
= 2 1
itA
Therefore. by Pauli's fundamental theorem there exists an S sh that
SYp,I.';-l = }I. By comparison with (2.60b) it is clear that A = S-lwithin
a multiplicative factor. Since the ap'v are not all real, the ')' defined above
need not be hermitian. Hence A (also 8- 1 and therefore S in this case)
will not be unitary in general.
Turning to the transformation of the adjoint equation, we write
ip' (x') _ 1p'*( x')y 4
= 1p*(x)A*Y4
= 1jj(x}Y4 A *Y4
(2.61 )
58
REl..ATIVISTIC ELECTRON 'THEORY
The operator Y4A *Y4 can be related to A-I as follows. Writing (2.60b)
for the two cases f-l = j = 1, 2, 3 and /.1. = 4, we have
ajkf'rc + a i 4Y4 = A---IJlji\
a 4k Yk + a 44 Y4 :=ow; A. -lY4 A
Taking the hermitian conjugate of these equations results in
ailt'Yk - t1 j4 'Y4 = A *Y1 A -1*
-a 4kf 'k + a44Y4 = A*r'4 A -1*
These can be combined into
III
a pp "F4Y p 'Y4 = A*Y4YIl('4 A - 1 * .
as is verified by setting !-t =.i and 1,£ ::: 4. Then we multiply (2.60b) on
the left and right by Y1 and substitute in the above to obtain
Y A -l y I.\, y - A * y Y Y A--l *
4 p..J. 4: - 4 p. 4
By multiplying by AY4 on the left and by i\. *Y4 on the right, the result is
Y p AY4. A *Y4 = AY.i A *Y4YP.
Therefore Ay 4 A *1'4 commutes with all,' J.l and is a multiple of 1.
..l\Y41\*Y4 = k
By taking the hermitian <?onjugate of this equation we find
Y4..l\'Y4 A *= k X
and multiplying by Y4 on right and left results in
AY4A Jir'o/4 = k X
Therefore k = k X and k is real. Since (2.60) does not fix a multiplicative
factor in A, k ca.n be chosen to have modulus unity. Later we show that
k has the same sign as a 44 so that in the present case k = 1. This result
can be very easily established by noting that A is a function of a set of
paralneters defining the rotation in the space--time continuum. One of
these parameters is () the angle of rotation, real for space rotations and
pure imaginary for space-time rotations. Examples are given in Eqs. (2.69)
and (2.71) below. As f) varies in a continuous \vay, k cannot change
discontinuously, and for () = 0, A = 1 and from (2.62) we deduce that
k = 1. Then, from (2.61),
(2.62)
ip(x') = 'tp(x) -,1
From (2.62) with k = ::I: 1 it follows that
fdet j\.12 = 1
(2.63)
FREE PARTICLE QUANTTJM 1\1ECHANICS
59
The transformation of (2.29') into (2.29) is now achieved by the condition
a pyA -11' 1l.J.\ = Y)I
which. is identical with (2.60a).
The Transformation l\tIatrix
To determine A. for particular continuous transformations vie consider
first an infinitesimal transfornlation
x = X fl +. IE J.w x ",
or
aJl:V = oJlV + Ew'l
The orthogonaJ cl1aracter of the transformation Ineans that to fir&, order
ail = 1 + (;" + E == 1
so that € is an antisymmetric Dlatrlx: E!-tv = -- Ei',u. The li matrix which
is deternuned by the 0j.{tI or € lAP is now of the form
A = 1 +. t€p.yTJlV (2.64)
where TIAJ1 constitutes a set of matrices one for each pair of indices p, 'V and
TPP c::: _TJJf.l. The inverse of .l\. is
_1\..->1 = 1 - !E!JV TIl V
Inserting (2.64) and (2.65) in (2.60b) gives
,,'
(1 - t E p " TP")y p(l + t€,"'r7y).r) = Y p + €p.vYw
Neglecting, as usual, the term of second order in IE, ,¥e find
€;"y(Yp.T AV - TAVyp) == €;..v(,'v'-' #1)" - rA.av)
(2.65)
or
(y, 'TA-V) = yy6 #1). - 'Y ).vp.
A so1ution sufficiently general for our PU11)ose is
T).v = trAY,! (A =f:. 11) (2.67)
F.or A. = 'V E<J. (266) is trivially valid.. For;::f- fl. =/--; P =f- A botb sides of
the equation vanish with the solution (2.67) since r,.,. (OmIn.utes with r AV
in that ca&, Ftnally, it is easily verified that, if p, = 'y 7'-= A or f.l = A :f.= "',
an identity is obtained.
We nO\1{ write
(2.66)
1pi( x') -. pt;) = tJtp = (1\ - 1 }tr'
= t€J.:",T P v1p
60
RELATIVISTIC ELECTRON THEORY
For a finite transformation, tp' - Atp results with
A = exp (!£JlVY JlYv)
Example 1. Consider a rotation around the xs-axis through an angle O.
Then
cos (} sin () 0 0
- sin 8 cos (} 0 0
a= (2.68)
0 0 1 0
..
0 0 0'1
and, for an infinitesimal transformation () 1,
€12 = -E21
and all other €o/),v = O. Since T IZ = - T 21 = 11'11'2' we obtain
A = e717s' , cos () + 1'11'2 sin ()
2 2
(2.69a)
A -1 = e-717S' = cos - 1'11'2 sin ()
2 2
The -expanded form of the exponential operator foUows because
(Yl"2)2n = (_)n
and, of course, (Yly2n+:i :::II (- )ny1I'z. We check these results by inserting
(2.69) into (2.60b) and obtain /'., = a"pY p for" = 3, 4 and
"11 cos () + 1'2 sin 8 = QIPY p = a 11 )'l + a l2 Y2
-Yl sin (J + 1'2 COS (J = a"p'Y p = a 2l Yl + a 2 2Y2
from which (2.68) is recovered. It will be noted that
A = exp G alJ) (2. 69c)
(2.69b)
In general, for a rotation through an angle () around the direction ii,
A = exp G a.M) (2.69d)
The A-matrix is then seen to be identical with the n1atrix of the Cayley..
Klein parameters. 8 On comparison wit.h DIA.(cp, fJ, 0) in Eq.. (1.35b), it is
seen from (2.69d) that A = D(O, 8, 0) if and only if ft is a unit vector
along the y...axis, as expected from the definition of the Euler rotation.
FREE PARTICLE QUANTUM MECHANICS
61
Example 2. Considera. Lorentz transformation corresponding to a
uniform motion with velocity v along the xs-axis. Since this is a rotation
in the X 3 -X 4 plane, the results are in complete parallel with the first exampJe
except that the angle () is pure imaginary. In fact,
· () .VI::
SIn = l -
c
cos (J = ';
= (1 - V 2 /C 2 )-tA
(2.70)
and with 0 = iw
A = eY3Y49 = cos (j + i sin (j
2 2
h w · h ())
=' cos 2: - d sIn
(2.71a)
= ( t-l f-( l f
A-I = ( I f + ( I f
The coordinate transformation obtained is just the familiar one corre-
sponding to uniform nl0tion along the x 3 -axis:
,
xl:::::: Xl'
,
X 2 = X 2
x = (X3 - t,t)
f 1: ( vxa )
t = t -- c 2
(2.72)
It is of interest to observe that for V 2 JC 2 1, A 1 so that tpf(X/) = 1p(x)
and (2..72) reduces to the Galilean transformation: x; = Xv (v = 1, 2, 4)
and x = Xa - vt.
Bilinear Covariants
Under the Lorentz transformation,j(x) f'ex'). T'hen the follo\ving
covariant quantities are of interest and occur in the Dirac theory.
(1) Scalar:
(2) Vector (polar):
(3) Tensor:
(4) Axial or pseudovector:
.
(5) Pseudoscalar:
fl(X ' ) = f(x)
f;(x') = ajlvfv(x)
fv(x') = a p ).a va f).(1(x)
f(x') = (det a)allvfv(x)
f'(x') = (det a)f(x)
\
62
RE.LATIVIS11C ELECTRON THEORY
Under proper transformations (det a = I) there is no distinction between
(1) and (5) or between (2) and (4).
That only Ithese tensors and pseudotensdrs of rank 0, 1" and 2 occur
is a consequence of the existence of the five groups into which the 16 Dirac
n1atrices were ciassified in the discussion of section 11. We proceed tQ
the construction of these five cuvariant quantities in terms of bilinear
combinations of the Dirac wave functions.
I Scalar s I"rom (2.60) and (2.3) it follows that Sex) = ip(x)1p(x) is a
scalar.
S!'(x') = ijj'(x') 1p'(x') = ip(x)A-' l A 'tp(x) = Sex)
,rector V.. We defint four ...Huantities V(x) by
(2.73)
V}t(X) = ijJ(x}y Ii tp( x)
(2.74)
Then
V(Xf) == ip'(x')y Il Vl(:c') ::=: (x)A -lYIlA tp(x)
= a p-v ip( x)l'v 1p(x) = a pw V v ( x)"
(2.74')
'rhrefore v- JI are the components of a four-vector which transforms
exactly like xp It win be verified thatthe vector ij/c and p (see Eq. 220),
is just Jt....u . This justifies the reference to s p. as a four-vector and demon-
strates th( invariance of the continuity equation.
'Tensor r'. Again we define a second-rank anti symmetric array by
1v( :1:) = i ip( x)y f.ly \' V'( x)
(It #: v)
(2.75)
The transformation of Tp.v is
T,,(x') = iip'(x')y,/yv 1P'(X') = i1fi(x)A -ly p.A.A -1/VJ\. 1p{x)
= iallfJaVtf ip(x}Y P Yl1 1p(x)
= a Jlp Q Wl Tpq( x)
(2.7 S')
"Note that in the secbnd last line the terms p = (/ do not contribute
because p. =I=-- 'I'. Hence Tp.'JI is a four-tensor antisyol111etric in the tensor
indices
Axial vector A This set of four quantities is defined by
A4 1l = i(x)y ItYs 1p(x)
The transformation law is most easlly studied by first evaluating
(2.76)
A -. 1 A A --1 A
Ys.l:l =:.: .ll. JJ 1 Y2l'aY4
¥kEE PARTICl.E QUANTUM MECHANICS
63
By introducing €rxpa't'JI an antisymmetrical fourth-rank tensor which vanishes
unless aU indices are different and is + 1 (-1) for (x, fJ, oc', p', an even (odd)
permutation of 1, 2, 3, 4 we can write
1
1'5 = - €a:/J«'{J'Yr/lpY«'Y{J'
4!
since each of the 4! terms in the sum is )15 by virtue of the fact that with
, f3, fX/, P' all different -YaY (3Y fI,'Y p' = ::t "5 according to whether €a.{1a.'fJ' = ::i: 1.
Hence
A -75A = ! e"p,,'p;(A -ly"A)(A-1YpA)(/\.-ly".A)(A-1Yp.A)
1
= 4! Eapa.' {J,a«p.a pvaa.' 1(,a (J'v'Y 1l'Y v Y .IJ'Y v'
1
= - (det a)EjJVJt'v'Yj./YvY/-t"'/v'
4!
, = (det a)ys
Returning to A Jl' we now see that
(2.77)
A(x') = i1fj(X}l\....lY:JA1\. -lyS'/\ 'IfJ(x) = (det a) apyAv(x)
so that AI' is a pseudo or axial vector.
Finally, the fact that
P(x) = 1p(X)Ys 'tp(x)
is a pseudoscalar is already evident from (2..77). That is,
P'(x') = (det a) P(x)
(2.78)
(2.78')
It is clear that ,ve can generaHze the covariants discussed above by
replacing 1p(X) but nt)t ,;(,'J:), by the \:v'ave function of another particle
which, ho"vever) transforrns just as 1P does. If the two particles are referred
to by labels a and b then, as an exalnplc J
iji''(x) 1pb(X)
is a scalar.
These covariant quantities p1ay an important role in the problem of
fonnulating the weak interaction of four fermions9-11 When this inter-
action energy is to be a l.Jorentz invariant it can evidenfly be constructed
by contractii1g the tensors of the five groups. rrhu if, for example,
T/'t = pa(x)YJ& V}J(x)
64
RELATIVISTIC ELECTRON THE()RY
where a and b are labels of different spin 1 partIcles, the contraction of
two vectors V;b, Vd is evidently a scalar. Similar scalars are constructed
from sab Sed TabT cd A ab Ar-d and pal.> [}cd
, J..'V !-tV' !J. p.' . ..
To facilitate comparison with the form in which these often appear in
the literature of nuclear beta decay the covariants are listed below in
terms of 1p* instead of ip and in terms of the (! {3 nlatrices.
s = 'P*f31jJ
V 4 = 1.p*1p,
T/ . *
k = -11p (J..k 1 P
Tjk = - tp*pal"P,
Ak = - VJ*ak, 1p,
P = 1p*Y4Y51/l
T 4k = 1p* tct.k 'ljJ
A4 = i'l.p*Y5'P
(2.79)
Here j, k, and I are a cyclic perm"Jtation of x, y., z or 1, 2, 3. Note that in
this forIn the tensor components appear as 1p*DAP, where Q need not be
hermitian. Those Q which are not hermitian are, however, antihernlitian
(i times a hermitian matrix) and on contraction of the co variants a factor i
appears twice. It should also be remarked that in many references a
representation in which a, fJ are replaced by -u) -fJ appears in the
literature. Again, on contraction this sign difference ",ould not appear.
The notation .S, V, T, A, P used above is based on the terminology of
the theory of beta decay; see section 21. In this theory it is necessary to
work with quantized field operators because particles are created and/or
destroyed in the decay processo However, after the formalism of the
perturbation theory is carried through, it is possible to evaluate the
observable results predicted by the theory in terms of wave functions of
the type discussed here.
PROBLEMS
1.. Find the transformation 111atrix S for which
a,. , = Sa..S- 1 = -<X
R' - S 'R('f- 1 - - 8
,., - p.J .- .
Can S be chosen to be unitary'! If 1p is a solution of the wave equation in the
<I, (J representation and is wri tten in the form
/tJ) \
( ,1
'ijt
1p = : j
\1J4j
FREE PARTICLE QUANTUM MECHANICS
65
express 1p (A. = 1, 2, 3, 4) in terms of "PAo Conlpare the four-density p and j
calculated by explicit matrix multiplication in the two represenations.
2.. Show that if
YpY + YvY p = y;y + yy; = 2d pv
and, with all y p, hermitian,
J1 = S'Y p,S-l
S*S commutes with a]) Yp, and S...S'Y* commutes with all j/. Consequently S*S
and SS* must both be multiples of a unit matrix, and in particular it is possible
to choose SS. = 1.
3. Consider two Lorentz transforn1ations defined by
I
XI' = a,..t"'x v ,
11 b I
X t-t = f.J.va.v
,"lith corresponding matrices Aa and j\b transforming the wave functions. If
AaY4i\Y4 = a 44 /la 44 I
t\bY4A:jl4 = b 44 /lb 44 1
show that
AbAaY4(Ab.L\a)f'4 = (b 44 /lb 44 D (a44!1a 44 !)
4. From the conditions of problem 3 show that
Idet Aal 2 = Idet Abl 2 = 1
Give an argument to show that for transformations continuous with the identity
the only possible value is
det A = 1
5. Show that the tensor covariants discussed in section 14 have the stipulated
transformation properties even when they are defined in terms of two types of
Dirac particles (for example, electron and mu meson); that is,
V,,(x) = 1jje(x)y" tpP,(x)
transforms like a four-vector.
6.. Show that the complete contraction of two covariant tensors of the same
rank is a l.orentz invariant.
7. Consider a Lorentz transformation for which a 4p = ap4 = p4' as in a space
rotation. Show that A commutes with Y <I and that in the representation (2.22)
it must have the form .
( AI 0 )
A=.o A 2
where Ab A 2 , and 0 are here 2 by 2 matrices. A matrix of this type is called
even (Dirac). Write the inverse matrix -1 in terms of A1-l and A2"l.
8. Referring to problem 7, carry out a similar investigation of the Lorentz
transforrnation in which a 3p = <5 3P . With what matrix does A commute in this
case? In the representation (2.22) can A be an even matrix if
9. A matrix of the form
( )
66
RELA-fI"IS'fIC ELECTRON TI-IEORY
where 4, B, and 0 are 2 by 2 matrices is cal1ed an odd Dirac matrix. R_eferring
to problcln 7, show that the product of two odd or t\VO even matrices j even
ihHe the product of an odd and an even matrix taken in either order s odd.
10. Consider the nl<itrices
P -J: :::::: t( 1 :t.: (J)
\Vrite these matrices in the, representation (2.22). Show that in any representation
det 1::t: ::= 0
Show also that in any representation
1'-'2 = P
:t: :i-: 5
p+p_ = p_p+ = 0,
P+"+"P_=l
so that P -+ and P._" form a complete set of projection operators. Can you suggest
an interpretation of these projection operators--:
11. Is it possible to construct a representation of the four y p in \vhich they
are an real? Show that it is hnpossibie for aU l' p. to be even in the Dirac sense.
12. Demonstrate that in every representation det? A =- 1 for al t 16 Dirac
matrices.
13. Prove t.he relations
Y",Yfj + YSY!l = 0,
p = t, 2" 3, 4
y: = Y,
2 1
t's = ?
J': = yS=-l
Are there any other matrices \Alhich anticomn1ute with all four of the }' f1 ?
14. Show that the four components A" defined by
A f.l( x) = i'tjj( x);, p.1' 5 1p(X)
transforlu like a third-rank tensor antisymmetric in all index pairs.
15" Show that Y;4 = Y f1 can al\vays be obtained from y p, by
Y = SY!6 S - 1
Can S be unitary in this case? If 5' is unitary explain why S must also be hermi-
tian.
16. A and B are two 4 by 4 matrices which can be written in the form
A = ( I all a 12 )
a 21 a 22
and similarly for B" where a ik and b ik are all 2 by 2 matrices. Sho\"v' that C = AB
can be written in the form
_ ( ell C12\
C - e 21 22)
where the 2 by 2 n1atrlx C ik is given by
elk = L a ij b ik
j
17. An invariant quadrilinear combination of the wave functions of four
different Dirac particles (0, b, c, d) is constructed by contracting the covariant
FREE PARTICLE QU-ANTIJM MECHANICS
67
forms as discussed in section 14. Write I n , n = 1, 2, 3, 4, 5 for Sflb..')Cd, V ab . J/cd,
Ta'J: TI';il, Aat.A('.l, pubprd,where the dots indicate the number of indices contracteq
Evidently, interchange of particles a and c would also give an invariant quadri-
linear form. ('all these five invariants Ln. Then show that
Ln = .4 nm J m
.wher the:; by 5 matrix A (Plerz matrix)12 is
1 1 I 1 -1
J
4 -2 0 ,., -4
J..,
A ==! 6 0 -2 0 6
4 2 0 -.2 -4
1 -1 1 -1 1
Verify that A2 = 1, as it should. Find the eigenvalues and corresponding
eigenvectors of 4. The Jatter are linear combi!1ations of the five J which are
equal, to within a factor, to the same linear combinations of L'ne
18. Cons1der a representation which differs fro!l"'t the standard one by inter-
change of Y, and Y5. Shovi that in this representation the Dirac equations can
be written as two coup!ed equations involving two-component spinors and that
the coupling is broken for zero rest fnass.
19. \Vrite each of the 16 Dirac matrices as the direct pr oduct of 2 by 2 matrices
in Dirac space and Pauli space.
RI£FEREN(ES
1. E. Schrodinger, Ann. PhJ'sik 79, 489 (1926); (). K!ein, Z. Ph/sik 37., g95 (1926):
VI. Gordon, Z, Physik 40, li7 (1926); V. Fock, Z. Physik 38,242 09:'6); 9J 226
(1926); .1. Kudar, Ann. Physik 81, 632 (1926); Th. deDonder and II. 'vB'.n J)ingen,
(""'ompt. rend... July J 926.
2. H. Feshbach and F. Villar, Revs. A-fod Phys. 30, 24 (1958).
3. P. A.. M. Dirac, Proc. Roy. Soc. (London) A117 1 610 (1928): .118 351 (1928).
4. '1\'. Pauh, Ann. lust. Henri Poincare 6, 109 (1936).
5. \V. K. Clifford, Afn. J. lv/alii. 1, :150 (1878).
'6. R H. Good, Jr., Revs. 1Y1od. Phjs. 27, 187 (1955).
7. J. Schur, Berliner Silzber. 406 (905).
8. II. Goldstein, Classical Mechanics, .:\.ddison..\Vesley, Cambridge" f\.fass.., 1950; p.116.
9. E. J. Konopinski, Ret's. Mod..Phys. 15, 209 (1943).
10. L. Michel and A. Wightfnan, Phys. Rev. 93, 354 (1954).
1 L R. W. King and D, C. PeasJee, Phys. Rev. 94, 1284 (1954).
1.2. 1. Fjerz Z, Physik 104, 553 (1937).
III.
DIRAC PLANE WAVES
IS. THE FOUR PLANE WAVE STATES
The Wave Functions
The eigenfunctions of definite energy have a time dependence
ili i) 'IjJ = w 'IjJ
at
(3.1)
and are therefore eigenfunctions of If with eigenvalue W. This is the
energy including the rest energy. We shall use rational relativistic units
wherein Ii = m = c = I. The rest energy, for example'! has the value 1.
The time-independent wave equation is then
H1p = (a.-p + fJ)1p = W1p
(3.2)
This is valid for the time-dependent or time-independent wave function.
The plane wave states are eigenfunctions of the mornentum operator
the eigenvaJues PI' P2' Pa constituting the components of the vector
momentum p. Note that without the arrow p is a set of three numbers.
We have
-
P1Jl = pip
(3.3)
Hence we write
1jJ = U(p) exp [i(p.r - JVt)]
'1 '1' )
t_o..,
where U(p) is a four.component spin or which satisfies the equation
hlJ = (a-p + (J)[) == WU
68
(3.4)
DIRAC PI.A.NE WAVES
69
This is the abridged notation for four linear algebraic equations in the
components of (}(p).
I'he upper and lower t\.vo-conlponent spinvrs in [7 are inn=oduced by
( 'u\
u= .
". I
, t'l
(3.4')
-rhis corresponds to the dcCOn1p0'3ition of the wave function In Dirac
3pace. Then (3.4) in the representation (2.22) becomes
apu = (fV + l)v
(3.5a)
O'''pt' 7.:':: (Vi - l)u
(3.5b)
Eliminating v, we find
(a.,p)2u = p 2 u == (H/ 2 - l)u
where (1.26) has bet;;fl uscd In the sarne \vay, elin1.inating u would give
(aop)2v = p 2 v = (H/ 2 - l)v
'"fherefore the four roots of the secular determinant of the eigenvalue
probleln under consideration are
T:fl === n. "-: (p .2 + 1 , ) 1,A).
n Po -,," , .; 'j
ILP ,.? 1 ) /'
rv ==. -- P.} :::::= -(P'" + ... ... '.
occurring t\vice
(3.6)
occurring twice
Consequently, there are four eigenstates of the energy operator Hand
these are degenerate in pairs: two with positive energy Po and two with
negative energy equal to -PO' The significance of this strange result-that
eigenstates \vith negative energies occur--wil1 be discussed in the next
section.
Considering first the positive energy solutions, there are, in general, two
linearly independent solutions.. This. fact is not altered by the existence of
the t\vo-fold degeneracy. The degeneracy simply l11eans that in the 4 by 4
deterrrtinant obtained by ""Tiring out the four linear equations corre-
sponding to (3.4) each minor vanishes ,vhen the determinant vanishes;
that is, when (3.6) is fuifilIed. 'rherefore the general solution is given in
terms of two constants G..l-.:
( a+\ /
1';<; __ LI)
tl= aJ=a+x-+a_ x ,-
70
RELATIVISTIC ELECTRON THEORY
which is the most general form of a two-component spinor. Alternatively,
the poitive energy wave functions define a two-dimensional space with
the basis
( Y2 )
U + r-.J a.; x
Po + 1
( - ) \
U - "" a.; x- I
Po -+ 1 !
These are unnorrnalized. The normalization to one partic]e per unit
volulne, that is,
tJ'*tp = 1
gives the normalized amplitudes
" ' l) i ( X:!: U \
u -. Po-r
:t -- (--;:;-- 0'.1' :!: J
\, .L P (1 --- X
, Po + t I
This normalization also corresponds to a current density equal to
j = p/Po = v, the average velocity. rhe proof is easHy obtained by direct
calculation:
j = 1p*ct.1J' = U*a. U = ') 1 _ [(X m , a O'.p X m ) + (O'.p X m , O'x m )]
... Po
(37)
= -1... [(X m , (a, O'.p)+ X m )] = 2!!. (X m , i m ) = 1-
2po' 2po Po
In (3.8) we have used the hermitian property of a-pc Obviously, other
normalizations are possible. For normalization to unit current the wave
functions are obtained from (3.7) by multiplication by v-. Normalization
per unit energy range requires multiplication of (3.7) by the square root
of PE' the density of state"s per unit range of E:
(3.8)
PE = %EPo :: PPo
(217)3 21T 2
The Spin Operator
In order to understand the physical significance of the spin in the
relativistic theory we first consider the non-relativistic limiL 'Then p <t 1
and Po is replaced by 1. Consequently, from (3.5) we see that in this limit
DIRAC PLL\NE WAVES
71
u v. Therefore for positive energy states U IS the so-cHed large
component, 1) the sinal! component. 'T'hen
U :!: --+ U :1:(0) = (>:!-i) (3.9)
so that we recover the PauJi spin functions.. For the non-relativistic wave
function "Pnr = ll:t:(O) exp [i(p.r -- Pot)] it is seen that
p .t',' -- 1 l '
jJ -J nr -, 1'21'
'I'herefore in this Jirnit 13 can be rep]aced by 1, the unit rnatrix. I'll the
general case the non-relativistic amplitude function, except for a nornJaliza...
tion factor, can be obtained from U tJY application of the projeetion
operator !(l + fJ):
I . 1-2
1 ( J + Q'\ l r .:.:= ( I p 'J -t 1 ' ) " U
z, - p) ,., '") nr
\ k Po !
It is also seen that odd IHlai; rnatrtCe5 (:ouple large \J/ith sHal COtDponents
while even ones coupJe large with large and small "v1:h cH'nalf. (cnscquci1'fly
in the non",relativistic limit the large contribution to quantIdes Eke 1;i*fhp
come frotn even 12 operators.
From the result (J.9) it would appear that the t\VO solutions U and 'fIJ-z.
correspond to rro spin orientations Hovlevef, lHlliJ.<e the PJ.uli spin ,,;ase
where a z is diagonal, we have
l ;--1/< \
: (J Y-" \
/ . -I · \ L') i ZA"
P . \ / .. i ·
T f'-1'
(Jz (J:1: == \ J .-, \ a.. a.p .J.. lA)" }
\ 2po I .3.___ X.....-'-
'Po + 1 /
and, while O'2'X:t.:-f = :f:X:i:, in the s:11all (or lower) component (f does
not commute with a.p unless p is in the z-direction. ]n that case it is true
that G z is diagonal: (JzU: t = ::J: [l+ for p = pze z . But in general neither "P
nor U is an eigenfunction of (j.: Sin<.;e.. in the case that p = pz it is true
that (jz == O'..p, it appears that O'p is diagonal in this special ea5e. In fact
a.p does comrrlute \vith the HamHtonian. lIowever (3.7) js not an
eigenstate of a.p in genera]:
( PO + 1 ) ( I o.p l:= )
a.lJ U:1: = 2p-- _ X x Yi
Po -1- 1 t
In fact we observe that for any unit vector ft
(a.o, .II) = (cr. n, a.p) :=.:: 2ia. ii X P
and the cotnmutator is zero only if fi X P = o. This implies that there i
a linear combination of lJ+ and U_ which is an eigenfunction of a..p, and
72
RELATIVISTIC EIJECTRON THEORY
\ve shaH subsequently discuss this in detaiL However, we are interested
in the interpretation of U+ or U.__ alone. For this purpose \ve note that
({3a.fi,11) = (fJon, tt-p) = fJ(afi, «.p)+
= - 2fJ)'5 n -P
and this vanishes if fiop = o. l'h.erefore if we introduce two unit vectors
e 1 and c 2 which together with .p form a right-handed coordinate system,
we can construct an operat(Jrt
(!) = O'-p P . pa.e 1 e 1 + fJa.e 2 c 2
and the comn1utator
(), H) = 0
(3JO)
(J.l1)
Of course, the three terms in (JJ do not COffilTIute \vith each other, so only
one of them can be made diagonal. 1'his is the relativistic generalization
of the spin operator which was a in the Pauli theory. In the non-relativistic
limit
(f) --)- a = a..p p + Q've 1 c 1 + Ge2 (\
It \vill now be sho't'vn that the representation (3.7) diagon,.dizes (9. 1"'he
reason for the preference for the >axis is the choice r1 z diagonal on which
(37) is based In the foUo\ving we note that
p X e 1 = e 2 ,
" x .... "
C 2 P = el,
e 1 Xe 2 =p
l"'hen.
( + " \2 ( / aofJe Xi fA ) \
(foel U :!: -. £(\,,- 1. ) ipa"e:t
.:I., Po - .-- X
Po + 1
and
a.e 2 U 1: =
( o.e 'V:i: \
I 1 2h
( P 0 2 +- ) ipa'el .:t: )
Po Po + 1 X · J
'"[hen
I \.' '
OzU:t:= ( po+l ) (a )
\ 2 Po ! \ b
where
a = [a"p pz + a.c 1 e 1 z .+- 0'4e 2 e 2z Jx i: },j = a zX:t !,}:' = ::t X:t
( 1 )l [ .J4 .".. " .A '1 + "f
Po + ) = p pz + la.e el _. io"e 1 e2.t% - .
.
arc eaCH two-component Spl110rs.
"1 See also section 20. We refer to the \;Olnponents uf {!J as spin operators in the sense
that they correspond to the Pauli operators a and not (f. An alternative non1encltu.re
for (f) is the '''polarization operator" sjnce a will be clear from the sequeL the po1ariza"-
tion of an electron beam is the ensen1b1e average of ((I.
DIRAC PLANE WAVES
73
For X+ ,\\1e use
( Ti' \
1 ' f' ...
ooY X = )
V+ 1
for any vector V and V-I- = Va; .+ iVIJ' 'Then
( ... \
(Po + 1) b = P . _ . p z _ _ )
t(e]Ze 2 + - e 2z e 1 +)1
But i(el z e2+ -, e Zz el+) = i(e l X e 2 )y + (e l X e 2 }r = P+q 'r'hus
(Po + 1) b = O'''p X:1:
and we obtain
tD z rJ + = [J +
(3.12)
Sjrr!ilarly, for X--i we use
lL ( V - )
a-V X-/ =
\- V z
where V_ = Va: - iVy, and so in this case
, . {i( e 1z e 2 - - e 2z e 1 -- Y } \
(Po .+ 1) b == p \
pz I
= pC(e 1 X c 2 )Y - (e l X C 2 },,)
" pz I
( ' --- p ) \
'- l'
= pz = -a.p X - ,10
Therefore
OzU _ = - [J _ (3.13)
The interpretation of (3. l2) and (3..13) is that the z-component of the
relativistic spin operator (!) is a constant of the motion and for the states
U:i the eigenvalues of (!) z are :f: 1.
Since a-p, a.c 1 , and a.e 2 all anticommute, we note that (!)2 = 3 and
«(!)_n)2 = 1 for any unit vector n. Hence the eigenvalues of (9.0 are ::t] in
general.
We can also interpret the spin properties of the state (3.7) in terms of
the average spin; tha tis,
( tp *, (!) 1p) = (U , (9 U :t:) = «(J) 1.
The expectation' value «(f):t is readily calculated:
«(f)-I = [it (a-p) r+ e 1 (fJa.e}) + e 2 <pa-cz)]:t
I-I ere we use
(ap):t: = :i:pz,
({3a.e i ) i: = :f: e iZ '
.. 1 ,..,
l == ., k
74
REL,A.TIVIS'fIC ELE(TRON TlIE,l)RY
and agait 1 I'S-'pj fj.e 1 , and a..e 2 have tll s(nne CO!nU1utatlclp rules as 0Z' U;r,
and Cl Y . 'rh us
/'1 ' \ -.
\Ci I::f: -
r ( A A + .... J\ + '" "' ) I "
::t: '- p zp t?J zC] e 2z e 2 , = ::t: e z
That is) the, tlveragt; value of the pl:1 operator has v.njt length and is in
1:h z-djrection for iJ_ L and in the -,;;direction for [t__ This is 'vvhat is
rn';ant by the uSljal tcrrninology: "spin up" (U_,,-), (;sph) do\vn>' (O'd). In
ection J 9 the Eenerahzation to an arbitr;, ry reD '.\:ntation t\/here <» is
(_J .. i '
an arbitl ary urHt ve::'(ot vnH be COIlStdered
Since lJ+ and tJ__ are cig}1functions of the hrmiti(lJ; operator t 7 -:; hav:ng
dlf1rent eigen'lalueso; it foUo\'\-"s that (hey rnust be 4)rthogonaL This is
verified by d1rect c<)lculation froro (3.7):
( TT { J ) ...-- (,T
l./ +;:, * -.J ........ \ v -.,
[J +) ,,..:::; 0
'rhe sarnc" of (,ourse aprlie to (?P'1 0 ' 'lp_) =::: (i__" 1p.+.). 'rhe t\VO spinor
\ .1; , ." t . 1 f " . 1
arn r -lntUG,e u o cnos1J,tute a com p ete set 0 POSItIVC encre,V P ...ane \.iaves.
w..t.- }. 4.---'.. .
l"he fact that (he {jirae v/ave functi()n haY'; four and n.ot (\/o cOlnponent£
\vIB he nter)fetcd. in the IH:xt section.
J.
1). .NI:(; A'r'\'I: E.}\U;i«;Y S()lfUI'ICH..L 'IRE P()SfIltON
fhe t)ccurrence of ntgative energy s()tDtkH1' i-; CLaraClcri::;tic 01' a
relativis.tic thear y bcause of the two possible solutions cf (2.5); rran1ely
If-> ""-1.' l' '4' 1 .
'1";.' :"'7.: =l:pfj' C;rOV10usay, a negative energy tatc 1)./0 U h.., rDrilY propertle.s
J l'k "*-'/\'fJo.' {' ..... 1 1 .;- d · <I.,' ,1 ¥. - ..... ,. '\,',, ., · ... ] .<:(,' .1 th " ('
,drul:....e L......,J," .)1 ad) 0 )c;:,.rve pardCn.... Ja. O} 1hS,..f.dA..''' .dJ a, (,t.',"I1Cal e..Jry,
and a ( 1 1(1nt"iltO theof' ) / lS \lileB) 1he aVr.-;rae ve1.ocjtrt is
,:.; -' I
orV
v, =-- ==
fl: "'"
°Pk
P ,..
1'-
Po
t The velocit.y operator, in ordinary :ulits, is (see .Appendix C)
. i
x == - (II X ) ;.. co:
It. ,
and, since the e:jgtn"alne8 of each component ot (1. have mod.alus unity, it might appear
that the vel()city ha& tht. value ::f:c. 'This is indeed the insta'1tan<;X\11S veleclty but Dot the
rneasured veloG;ty. In a representation in whicl1 the (>.nrg,) is (:Ol,tant H\ litne none of
the !.X., can he ill.ude diagonal. In frict,
, " ( '" "' ( T --\>. . . 2 ":)
tt :::':': I H a.j = -"....1 (1.1/ --- '9) = -. 2a X l' -t- .!pa
with n ::;: C := i. In physicaJ teffi1S this is a reflection of the fact that a precise vf;!Qcity
measurement rquire8 precise tirne and pos1tion rnea3urelnnts. so that the eneigy aikd
ulonlenturn, in that C9se, could not be t..-;onstants of th motion. The average veiocity
divided by c is ({fl.; -== p!Po for a plal1e wave of positivs energy.
11JRl\C: P[ut\tE ""1\ \rES
75
vv.here the last (qua1ity apphes for the negative energy 5tates Thus the
.:nomenturn and ve.]ocity .would he in opposite directions. In a classical
theory the negative energy states cause no trouble beca ue no tranr...t.tions
between positive and negative e.nergy states occur. 1'herefore if a particle
occupies a positive energy state at any time] it will never appear in a
negative energy state. The anomalous negative energy states are then
eliminated as a result of initial conditions vihich stipulate that no such
state occurred in the past. In a quantunl theory this device is no longer
admissible. Although the problem of coup1ing of })irac electrons with an
electromagnetic field has 110t yet been discussed, it is fairly obvious that
spontaneous emission of radiation can occur as long as a state of lower
energy is unoccupied and as long as conservation of angular mOInentum
and line,lf norncntum can b fuHdt:d. rfbese conservation princjples can
always be fulfilled under appropriate conditions (for exaJnple the presence
of another particle to take up rec:oB ITl0tl1entUIT! is necessary in brem-
sstrahlung). rrhus there is nothing to prevent an electron from. radihting
energy in making a transition from a positive energy state to a negative
energy state. What is more, therl is nothing to prevent it trorn c.ontinuing
to radiate, making transitions to lO'ler and loYer negative energy states.
This behavior is evidently to be rejected in a reasonable description of
nature, and the negative energy states are unphysicaJ so far 3.S observed
states are concerncd.t
The solution of tl1e difficulty of the negative energy st.ates is due to
Dirac. 1 One defines the vacuurn to (:onsist of no occupied positive energy
states and aU negativt:' energy states completely filled. This nleans that
,each negative energy state contains two electrons ..n electron therefore
is a particle in a positive energy st.ate vlith an negative energy states
occupicd No transitions to these states can occur because of the Pauli
principle. The interpretation of a single unoccupied negative energy state
is then a particle with positive energy, Po if the observed n10mentunl is p,
and \vith (average) velocity
"'i.! _ n I ' ')
'f - t J i 0
parallel to the mOI1H:ntum. This follo'\vS because to rrodiJce the vacuum
where all observables bave zero expectation value requires the addition of
an electron with energy Po and (average) velocity --Plpt)- It \vill be
apparent that a hoi( in the. negative energy states is equivalent to a particle
with the sarne nlas as the electron, and this rnass, In, which n1ay be
defined by
p = mv(l - V2/C2)-.1
t They may and do occur a& intermediate states in the usu(\l desc, iption of uch
processes as COinpton scatt€riIlg; see section 37.
76
RELArrIVISTIC ELE(TRON THEORY
is positive. When one examines the behavior of the particle (unoccupied
negative energy state) in an electromagnetic field it is seen that its charge
is e \vhere the electron char..ge (that of the particle in the occupied pus.jtive
energy state) is --e; set; section 21. 'The thtory therefore predicts the'
existence of a particle, the positron" 'vvith the same mass and opposite
charge as compared to an electroJl. It is well known that this particle \vas
discovered in 1933 by i\nderson. 2
Although the prediction of the positron is certainly a brilliant success
of the Dirac theory, some rather fOfl11idable questions still arise. With a
cOlnpleteiy fined "negative energy sea" the C0111 plete theory (hoole theory)
can no longer be a single-particle theory. The treatment of the problems
of electrodynamics is seriously complicated by the requisite elaborate
structure of the vacuun1 The fined negative energy states need produc no
observable electric field. However, if an external field is present the shift
in the negative energy states produces a polarizatjon of the vacuum and,
according to the theory) this pplarization is innoite. In a similar way it
can be shown that an electron acquires infinite inertia (self-energy) by the
coupling \vith the electromagnetic field which permits ernission and
absorption of virtual quanta. More recent developments show that these
infinities, while undesirable, are removable in the sense that they do not
contribute to observed resu]ts. 3 ,4, For exampJe, it can be shown that
starting with the parameters e and m for a bare Dirac particle the effect of
the "crowded" vacuum is to change these to ne\\ onstants e' and m ' ,
which must be identified with the observed charge and mass. The difference
between e and e' as well as bet\veen m and tn' arises principally from
coupling between the electron and the electromagnetic field \vherein
transitions to states of very high momentum p' can occur and the diver-
gences mentioned arise from the contribution from states with p' -)- co.
If these contributions were cut off in any reasonable Inanner, m' - nl and
e' - e would be of order rx =: e 2 /lic 1/137. No rigorous justification
for such a cut-off has yet been proposed.
An this Ineans tbat the present theory of electrons and fields is not
cOJnplete. 'This is, of course., a characteristic of n1any theories in physics.
The particles---the electron and j tS antiparticle the positron, or posjtiVt
and negative fiU rnesons.-are treated as "bare" particles. For problerrls
involving electrornagnetic field coupling this approxirnation ,",iH result in
an error of order ('/.. As an example, in section 22 it wiU be shovvn that the
Dirac theory predjcts a magnetic monlent fl :=' #0 for the electron'! whereas
a more c0I11plete treatment 5 of radiative effects gives ,u = .uo(I + CI..!27r)
which agrees very well with the very accurate measured valuet) of
ft/ /l-o = J.oo 1146 4 0.000012. l'his example is typica] in the sense that it
teJls us that the Dirac theory can be useful in a certain domain, a very
DIRAC PLAl'lE \;VAVES
77
broad domain, of physical problems. In other words, \ve can prescribe a
method for obtaining resuJts vvhich are consistent with experiment. It is
only fair to add that this is not an ad hoc procedure and that a reasonable
physical picture emerges from the the;:Jry. So far as the treatment of .
dynamic processes is concerned, it should be stressed that even when a
quantized field is necessary, as in decay processes) the present theory is not
only useful but also essential in obtaining results with \vhich experiment
can be confronted.
I
o---- ---
-mc 2
Figure 3.1 Energy spectrun1 showing positive and negative energy continua. A transi-
tion, indicated by the vertical arrow, of a negative energy electron from the initially
completely filled negative energy states to a positive energy state rcpreents the '-7eation
of a positron..electron pair.
In radiative P roblefi1s such as brerosstrahlung" ohotoelectric effect,
v .....
internal conversion, pair annihilation, and pair formation the theory is
used to obtain results in the lowest non-vanishing order of the perturbation
theory The process of pair production, for example, is then regarded as
the absorption of a photon \vith a transition of an e1ectron from a negative
to a positive energy state. AnnihiJati(:it is the reverse process except that
in the absence of a (nuclear) field only t\VO quantun1 annihilation is
pern1itted by the requiremer;.ts of energy and mon1entum conserv3tion.
The simple diagram of Fig" 3.1 sho\vs the envisaged transition. The shaded
78
RELATIVISTIC ELEC1'RON 'THEORY
region gives the possible continuum energy states in the absence of external
fieJds. \
Another brilliant success of the Dirac hypothesis is the prediction of
antinucleons, negati ve proton or antiproton and the antineutron. tfhese
particles, \ihich were discovered in recen t years in high energy nuclear
reactions,7 have (within the exprinuntal error) the predicted properties
that: (i) the charge of particle and that of antiparticle are OpposIte;
(ii) their mass is the same; and (iii) they are produced and annihilate
in pairs. The spins should be the same and their magnetic moments
(sectlon 22) should have opposite signs, but reliable data on these quantities
have not been obtained as yet. ()f course, for nucleons the prediction of
t}:e Inagnitnde of the magnetic lT10ment requires detailed consideration
of the interaction with pi mesons.
1'1: production processes for the antjrroton p appear to be
p M1- P - '.. J p t P
7f - -t- p --." p -t- n + p
Here 1T- is a negatively charged pi TI1eSOn4 For the antineutron n at least
one production mechanism is the so-called exchange scattering of p on p:
p + P --?o- n - ii
The antiparticles annihilate with partjcles giving predominantly 17":1:
meBons. -rbe strong interaction bctween nucleons and 7T' nlesons precludes
the use of the Dirac theory of are partie-Ies for any but essentially
qualitative applications so far as nucleons are concerned.
In contrast to tllis situation mu fl1esons of both charge signs, /AI., appear
to be "nor-trial" Dirac l)articJes. For exatnple, the accurate magnetic
:rneasurenlent 8 for the su+ gives a value (1.00122 :i: 0.0008) #0 \vhich agrees
\vith the electron value within the experimental error.t There seerns to be
Jittle doubt that the physical properties of the ,n meson can be explained
by the Dirac theory as well as in tl)(. (:ase of electrons. 9 The distinction
bet'-V\reen the particles eems to be entirely hl the much larger mass of the
# Incsons (207rn) whicb permits the decay process of I-t -»- e:l: -f.. neutrinos
whereas, of course e* are stable except for annihilation with each other.
17. THE PROPERTIES OF FRE:E POSITRONS
FrOIn. the results of section 15 it is seen that the normalized wave
fun(tions of a negative energy eictron with the eigenvalue of Ii = pare
1J':t( -- Po) = J7j:( -p) exp (i(p.r + Pot)]
tHere fto is ,the m.agneton unit dehned "\J,'hh the meson mass.
DIRAC PLp..NE W,A. YES
79
and
O'-p :t: i
Po + 1 X
:t
X
Here H1f':t( -Po) = -Po1p-J:.( -Po). "'fhe argument (-Po) is to indicate a
negative energy state. The large component is the lower spinor.. Thus in
this case Po 1 gives
V:t(-p) = ( PO + 1 ) !"
2po
(3.7')
V:!:(O) = C:H) = V llt
and fJ can be replaced by - 1. Also
i(l - f3)/:t( -p) = ( po + 1 ) "" Vnt
2po
The projection operators l( 1 ::t: 13) constitute a complete seL
The results given here and above for the positive energy states clarify
the problem of interpretation of the four..component wave function. The
occurrence of an "upper" and "lower" spinor is evidently connected with
the appearance of positive and negative energy states or, in more physical
terms, with the existence of positive and negative charge. This corresponds
to the decomposition of the wave function in Dirac space. The decom-
position in Pauli space is clearly associated with double-valued spin
orientation,. This interpretation is brought out more explicitly in the
diagonal representation discussed in section 19. '
The physical particle is not the negative energy state electron but the
positron.. The corresponding positron has energy Po and momentum -p.
We change the notation so that for the positron p has the meanjng of the
physical momentum of the particle. Hence the positron wave function is
1J.':t = V:t(p)exp [-i(p-r - Pot)]
(3.14)
where
cr-p ::f: !
V:t:(p) = ( po + l ) H Po + 1 X
2po 1L
X:f: 72
(3.14a)
The large component is again the lower two-component spinor, while the
slnaU component is the upper two..spinor. Even Dirac matrices have the
property of (:oupling large vdth large and small \vith small components,
just a for the electron. Sin1ilarly, odd matrices couple small and I.arge
components as before.
80
RELATIVISTIC ELECTRON THEORY
The V::t: are normalized and orthogonal
(V+, V_) = (V_, V+) = 0
for both V:t:(p) and V:t:( -p). Between the U and V amplitudes the following
relations hold:
(U m , Vm,(-p» = 0, '(U m , V m ,) = .!.(x m , (J.PX m ')
Po
where unless otherwise indicated the argument of U and V is p; for
brevity m and m' are used as indices in place of ::1:. The first of these
results shows that the four amplitudes U::t:(P), V::t:( -p) are linearly independ-
ent and constitute an orthonormal set. This is obvious from the following
facts: (1) they are eigenfunctions of Hwith different eigenvalues; (2) U:t:(p)
. are eigenfunctions of the hermitian (!)z with different eigenvalues; and
(3) V:t:( -p) are eigenfunctions of the hermitian operator:
(!) ' (!) ' ,.. A A + fJ '" A + R A ,.
Z = .e z = -a.l'pz 0'.e 1 el z pa.e 2 e2z
with eigenvalues + 1. On comparing with (!) Z' the change of sign in the
first term should be noted. This result is obtained in the following way:
We have
V:t(-p) = -iP2 U :t(P)
and P2 commutes with (J and anticommutes with fJ. Therefore
(!)V:t( -=p) = ip 2 m z U :t:(p)
= ::I: iP2 U :t:(p) = =F V:t:( -p)
Between. V:t:(P) and V:t:( - p) we obtain the scalar products
1
[Vm(p), V m ,( -p)] = - c5 mm '
Po
since, as is seen in the next paragraph, V:t:(P) are also eigenfunctions of m;
with the same eigenvalue as V::t:( - p). This result is, in fact, evident since
() and (!)' do not change under the transformation p -p. It follows
from the foregoing that U:t:(P), V:t:(P) , are four linearly independent
. I
amplitudes.
It is of considerable importance that there exists an operation which
converts an electron wave function into a positron wave function and
vice versa. We distinguish between these by writing them, for the moment,
as 1p(-e) and 1p(e) respectively. Then, with the standard representation
used here,
1p(e) = fJifJr:J. 2 1pX( - e)
(3.15)
DIRAC PLANE WAVES
8l
where 'YJ is a phase factor: 1171 = 1. To prove this, it is only necessary to
show that
V(p) = 'Y}ifJrx2UX{P)
,vhere, as indicated, a real phase factor 1] = :l: 1 may occur. Since
; {3a.2 = ( j(1 0 2 )
-10'2
we find
. x
lO' 2 G .p x:t
ifJa. 2 U = ( po + 1 ) J.i Po + 1
2po
- i a2x :i:
since X::!: is real. Now i<1 2 o X .p = - a.pi0'2 and
ia2x:t = TX=F
Then, combining these results,
V:t(p) = :l:ipU{p)
(3.16)
The operation of complex conjugation and multiplication by ifJoc 2 which
occurs in (3.15) and (3.16) is called charge conjugation. It appears here in
the standard representation, but it wjl1 be discussed in a general representa-
tion in section 24. It is seen to interchange a positron and electron and at
the same time' to reverse the spin state. This fact will be investigated
further below. Clearly, the charge conjugation operation works in both
directions. That is,
implies
1p :t ( e) = :J:. i fJ OC2, tp ( - e)
1J':t( -e) = :l:ifJrx21p(e)
(3.17a)
(3.17b)
The spin reversal is another way of saying that (!) and (!)', the spin operators
for electron and positron, have opposite eigenvalues; see also below.
The notation will be simplified by introducing the charge conjugation
matrix c:
C -1 = :l: ifJ.(t.2 = C
(3.18)
and writing
1p = C"P
(3.19)
We,refer to 1pc as the charge conjugate wave function. Then any operator
,
equation of the form
(1 tp:t: = lJ.) 11' :t:
82
RELATIVISTIC ELECT'RON THEORY
where Q is hermitian, co real, becomes
C-lrlxCV} = w1p
In the present case
C -'1 = C
(3.20)
and this, it will be seen, is a consequence of choosing the phase so as to
make C real. The operator
(;-lQXC ::::"".: QC
(3.21)
is the charge conjugate operator. Similarly, for a herlll1tian operator
\vhich is not diagonal we consider the (necessarily real) expectation value
W = (V'-:J:;, Q1P:t)
Then the charge conjugate equation is
(V c = (¥', Q c1f)) = (1p =+-., (-; .-lC X - 1 Q1p=F)X = (J)x = 0)
Mnce C ::= (: = eX = C-I.
Applying these results we find a C = (1" {Jc - f3, (lC = - a and
(a) Energy operator: HC = --Ii = _({tap + fJ)
(b) Momentum operator: pC = --p == iV
(c) Spin operator: (!Jc == - a.p p .t {30'ae1el + {Ja.e 2 e 2 = (!J'
(d) Angu1aI momentum> JC == IC -r- ta c = -J = -(I + to)
(e) Current four-vector: .i c = j, Fl = P
We proceed to a discussion of these results. In connection with the
energy operator we recognize that
i 0v:::. = i aC-1'IfJx = -C-1Jlx'lfJx = -C-1HxC'IfJ" = -H"'lfJc
at at
or
. a1Jl c
l - = -.... Po'tfJ
at
so Po is the eigenvalue of HC. Threfore vv'e are justified in calling HC the
energy operatofo Sin1ilarly,
pC1f'c = i\71jJc = p1pc
and so pt: is the ITtomentum operator. For the spin operator we note that
the component of spin along the mOl11entum has reversed in sign. This is
related to ti1e he/icily of the particle; that is, the expectation value (o-p).
r-rhe result that electrons and positrons have opposite helicity is well kno'W'n .
as an experinlental result in beta decay. '"fhe angular momentum operators
DIRAC PLANE WAVES
83
of the positron are the negative operators of the electr'on. For the positron,
then, the commutation rules are
JC )( JC = -iJc
The operator KC = {3C(oc.lc + 1) is equal to -K. The connection between
this result and the relative parity of positron and electron states win be
clarified in section 26. FinaHy, the result for the current four-vector is
fairly obvious. 1he space part of particle current density is in the direction
of the average velocity or momentum, and the particle denity is positive
definite. These results are precisely what one obtains from the hole theory.
18. THE DIAGONAL REPRESENTATION
Plane Waves
A special representation to which considerable interest is attached is
that in which the energy matrix h for plane waves is transformed to
diagonal form. Since the eigenvalues of hare :1::po the diagonal form of h
must be
h' = ShS- 1 = PofJ
(3.22)
This corresponds to the transformation fron1 U to V' = U o , ¥'.here
u = S-lU O
(3.23)
and
h'U o = ShS- 1 U O = Po{3U o = PoU o
or
{JU o = U o
(3.24 )
Here we confine our attention to positive energy states. The extension to
negative energy states \viIl be obvious. The resuJt (3e24) corresponds to an
electron in the rest frarne, and therefore the results of this section should
correspond to the non-relativistic limit. We return to a consideration of
this question below.
It is evident that if 8 1 and S2 are solutions of (3.22), that is
Si h = POPSi,
i = 1, 2
(3.22')
then any linear combination of 8 1 and "';2 is also a solution. All the
solutions can be generated in the following way. First a particular
solution is constructed. 1'his is
fJ + kat-p
84
RELATIVISTIC ELECTRON 'fHEOR Y
"vhere, k is a constant. J nsertion in (3.22') shows that
1
k=
Po + 1
and p = p2 -t- 1. We now consider a solution of (3.22') in the form
SA =: ')/..4((3 -f- ka-p)
where k may now be different from the value found above. Here YA is
one of the 16 matrices previously discussed, and it either con1mutes or
anticommutes \vith fl. lnsertion of SA in (3.22') shows that SA satisfies
this equation provided that
1
k = .----
1 + EPO
where € = + 1 if (y A' ;3) = 0 and € = -1 if (jl A' (J)-!- = O. It follows that
the Inost general solution of (J.22') is
s = ! c--t[YA(l + EPO){3 + YAr;tp]
A
where CA are 16 arbitrary constants not all zero. If vve write
M = !CAYA
A
then
S =: Mh + Pof3M
(3.25)
independently of E.. Direct substitution in (3.22') verifies that tIns S does
indeed satisfy this equation since h 2 = P5 times the unit matrix.
If the normalization of U o to
(rIo, U o ) = 1
is to be retained, it is necessary that S*S = SS* = 1 But
SS* = pMM* + Po[(MhM* (J)+ -r Po{31vIM*{3]
It is sufficiently general to consider the two cases: (i)}V! and .f3 commute
and (ii) lvE and fJ anticommute. In the first case \ve set f = J.10 and in the
second M' = -Y5MO' where (Mo, (J) == 0 but Mo is otherwise arbitrary.
When M is a linear cornbination of Afo and -YsMo, the result for Sand
therefore for U o is a corresponding combination of the results for cases
(i) and (ii).
It is now seen that
S S * 11 " M * ( " t) 2 '
I. = lVL koPO :f:: ...Po)
DIRAC PLANE WAVES
8S
where the upper sign corresponds to case (i) and the lower to case (ii).
The general form of Mo is
Mo = ( )
where, if we take a*a = b*b = 1, it follows that Mt Mo = 1. Therefore
in case (i) we choose S to be
S = [2po{Po -1- 1)]-!-2[Moh + PoPM o ]
(3.26a)
and Mri Mo = 1. In case (ii)
S = -[2po(Po -- 1)]-/Y5[kloh - Pof/Atl o ]
Substitution of S given by (3.26a) into U o = SU gives
(3.26b)
1 _ ( a xm )
V o -
o
(3.27)
where, as usual, m = :i:!. The general form of a is 0'-° 1 , where 8 1 is a
unit vector. The result (3.27) is a unitary transformation (with a which is
unitary) on the wave function
U(O) = (lorn)
representing the electron in the rest frame..
For case (ii) we find similarly
_ ( bO. P xm )
U o -
, 0
(3..28)
which is again a unitary transformation on U(O). )
For negative energy or positron states it is readily seen that the diagonal
representation results in
v = Cm)
where cc* = c*c = 1. The demonstration will be left as an exercise for
the reader.
It j natura] to think, in connection with these results, of the Lorentz
transforrnation which brings the electron to rest. To see the relation of
86
RELATIVISTIC ELECTRON TIIEOR {
this ,\\lith the unitary transformation used in the foregoing., we genca!ize
the transformation of Eq. (2.71a). This gives
( ' + 1 \! ( - 1 ) Y2 r ( + l ) !' J
A = ) - ex-p -= exp I -a.p cosh -1 -
2 , 2 L. L, /
and
-1 ( + 1 ) ,. ( - 1 ) ' /2
A = + a.p
\ 2 2
(3.29)
Here p is a unit vector in the direction of the velocity vector v of the
transformation. The correctness of (3.29) is verified by using (2.60b),
which can be written in the form
4a.uv = l'r (Yv A -lYpA)
or
8a}'v = Tr Yv[{ + l)YIt - (- l)a.p y a.p + (2 - l)i(cx.p, YJl)J
When this is evaluated for the various cases, I =I; 4, v =1= 4; fh = 4, v -:F. 4:
p = l' = 4, it is seen that the Lorentz transformation generated
,
X II = a ll"x"
agrees exactly ,;ith Eqs. (B.6) and (B.7) of Appendix B.
Now ..1\ given in (3.29) may be applied to VJ = U exp i(p.r - Pot). The
exponential factor is the scalar product of two fourvectors and is therefore
a Lorentz invariant. Then using == Po we obtain
_ ( l P + 1) X 11t ) _;'2 ( O'ap asp 'Vm )
A Po \ 0, P Po 1 /'"
U = - - ---- Po +
2 a-p X m 2
a.p Xrt1
and, since a-p a-t) = p, the result is
AU = po(xm )
,0
(3.3,0),
It is obvious that l\.U is not normalized in x-space. But, if Vie recall that
the Lorentz transformation does not change the two coordinates perpen-
dicular to Ii anp stretches the coordinate along p by a factor ::.: Po' it
follows that I
d 3 x' = Po d 3 x
Hence, if
r 'IjJ.'IjJ d 3 x = 1
.,
DIRAC PLANE WAVES
87
it follows that
f 1p'*V/ lfx' = 1
as should be expected from the invariance of f p d 3 x. In this sense it may
be said that the Lorentz transfOfn1ation and the S..transformation used at
the beginning of this section are equivalent. They differ by an arbitrary
unitary transforn1ation.
The Foldy-Wouthuysen Transformatin
The diagonal representation discussed above is closely related to an
extremely elegant and povverful transformation scheme due to FoJdy and
\Vuuthuysen. 10 This will be referred to briefly as the FW transforn1ation,
and "re shall here discllss on 1y its application to free particles. J n section 22
t.he rnethod will be applied to particles in an electromagnetic fieJd. In
general, the lnethod is equivalent to a non-relativistic expansion in which
lhe ratio of th momentum to me and the ratio of the kinetic and inter-
act.ion energies to the rest energy rnc is a paratneter of expansion. For
free particles the rn.ethod gi ves exact results in closed formw
The purpose of the FW transformation is to find a representation in
\vhich the srnall and large components are dcoupled. l'hus the electron
\,vaVt; function \vill be a [our-component spinor with the lower t\¥o-spinor
jdnicany zero, vvhile the positron. \vill be represented by a fouf<wspinor
\vjth the upper t\vo.spinor identically zero. The spinors
.(1 :i: fJ)VJ
JUive the property of vanishing small cOlnponents, but these do not
represent states of definite energy in the standard representation.
Instead of working in the plane wave representation, the V\'ave functi.on
will be left general and wiH depend on the coordinates in an unspecified
way.. The transf..)rmation
'Jf" __ <.:'11)" __ e iU Ui'
I "--;.J 1 -- T
introduees a wave function vhich fulfills
'V'
} J ''-Tit! . U -
T :=: l-
at
where
H · in H - iU at!
=-=e e ----
at
Vie shaH be interested In the case iJll/or == 0 so
III ::..:: eu He - iH
( " ..,.. \
-,.j'; )
88
RELATIVISTIC ELECfRON THEORY
Other operators transform in exactly the same way. For the hermitian
operator U the choicet
U = - 2- (Ja..p lp ( p- )
2m m
is made. Here we are using units with Ii = c = 1, but it is preferable to
keep the mass m in evidence so that
(3.32)
H = m.p + 13m
In (3.32) q; is a function of p/m defined in terms of the (Taylor) series
expansion of this function. It is to be recognized that fJa..p commutes
with q;. Since U anticommutes with H we can write
H' = e 2iU H = [cos (p'Pfm) + {3a..pp-l sin (plp/m)]H
= {3(m cos (plp/m) + p sin (pq;/m)]
+ a..pp-l[p cos (prp/m) - m sin (p!pjm)] (3.33)
which follows from the fact that ({3cx e p)2 = _p2. It is seen that H' is free
of odd operators if
q;(p/rtt) = (m/p) tan -J (p/,n)
00 (_ )n(p/m)2n
=I -
n=O 2n + 1
(3.34 )
and with this choice
H' - RW
- P p
(3.35)
where W p is an operator:
W p = (m 2 + p2)Y2 (3.35')
which can be expressed by the binomial expansibn in ascending powers of
(p/m)2 = -m- 2 V 2 .
It follows from (3.35) that 'Y' can be written as a sum of positive and
negative energy solutions:
'¥' = 'Y + 'P'
(3.36)
t Here and below p is an operator. The arrow is omitted for simplicity. In qJ only
pi = _ V2 will occur.
: For plane waves W p has the eigenvalue po, and the entire discussion reduces to
the diagonal representation. The unitary transformaion matrix is
S = e iU = [1(1 + l/po)]IA + P (I.p[-6-(l - l/Po)]}1
and this is obtained from (3.25) with either
M-= [2po(po - 1)]- a.p
or
M = [2po(po + l)]-! p
DIRAC PLAN'E WAVES
89
where
't:t'1't 1(1 ::t f3Y"Y.
and for these two solutions the wave equation reads
W \lj'" . \TJ'f'
. p r + = 1 u '.c -t. i at
_ W 'tf,'" =' a \TJ' r:.\
p I _ Ir .- Jot
To justify these statements the ,-,rave function 'F'(x)
Fourier integral:t
(3.37)
IS written as a
'f'(x) = f a(p') exp (ip',x) tfJp'
and
'I' :t(x) = f [1 ? Ja(p') exp (ip'.x) d 3 p' (338)
\"here H(p') = (Xpf + fJnl. Since i[l 1: 1l(Pf) JI}/;;l] is a positive (upper
sign)-or negative (lower sIgn) energy projection operator 0/...;_ and'¥ _ are
positive and negative energy solutions. Since S an be "vvritten in the form
S __ _ 11'1 + 'IV" *t- pa..p
--- [2Vp(Ylp + In)])1i
it is readily verifIed that
' I I'I = oiU'!f}
:i: \[". ...t
j ,.. r ., TJT ] 1 1 [ H ( r \
= 1(1 fJ) _'::!!.JL_' 1 :I:: ..:- P J I alp') exp (ip'<;:) J3 p '
!- J1 Vi .+. In 1. W;Jf .J
\vhlch, because of the (1 :l f3) explicitiy sho\vs that (he \lf, have only
large cOl11ponents. .
An extrcrnely interestIng result of the FW transforr:nation s that it Jeads
to a representation in which the operators have a non-local character.
To understand this we construct an opel'ator kernel K(x, x') which: acting
on 7(x'), gives 'P'(x). Thus, with
a(p') =, (217')'-3 f 1J'(x') exp (- ip'.x') (PX'
('j 3 1\ )
\_;:._7
we obtain
J il
, . '/ ( I I " 3 I
'¥ (x) == 1(\,x, X ) 'tp(x ) d x
(3.40)
,,,,,here
I t "- ? v 1 r PH" 1
K(x, x') = (217)-3 , - p' --! ; L 1 + - r :f; exp [ip'''(x - x')] d 3 p'
. - Vp' + nl..J.... J1;p1 ..
(3.41)
t Of course, where p and W 2J operate on plane "'"aves they may be replaced by
numbers: their plane wav eigenvalues.
90
RELATIVISTIC ELECTRON THEORY
It is se(fn that K(x, x') is not a Dirac delta function. In fact, it can be
shown that 0/' (x) is determined from 'Y(x') over a finite range of x' centered
around x and that this range is of the order of the Compton wavelength
Ii/mc. To see this w note that K is' a displacement kernel depending only
on R = x - x'. Then
f cFRK(R) = f L ( 2Wp' ) ! ( 1 + PH p , )] = 1
W 1J , + m 2 W p ' p'=o
The 1nean square R is
(R 2 ) = r d 3 RR 2 K(R)
eI
= - [ v. ( p' -r !' ( l + fJP: ) \ ] = 3
W p < 1- m _ 2 JV p ' p'=o 4
The unit is (n/mc)2, and this result substantiates the statement made above.
lhe interpretation of this curious result lies in the fact that x is no
longer the position operator. Instead, in the FW representation this is x'
\vhere
..,... . u iBct i .Ra. p P - ( 0 X P ) p
x' :.= eh-t xe - = x __ -!..- + fJ
2»p 2Hi(H;; + m)p
If X is the operator in the old representation, which in the F\V representa-
tion becomes just tbJ' old x, it is clear tlatll
X -iU iU + ifJa. i{Ja.ap p + (a X p)p
= e xe = x - -
2W p 2Wp(Jt + In)p
and its time derivative is
= i(H, X) = 1-!!..
dt W p W p
Thus for positive energy states this is just p/W 1J and for negative energy
states -p/Wl" so that dX/dt is the conventional velocity operator.
In this connection the results of Appendix C should be consulted. It is
shown there that the electron executes a cOITlplicated motion which is a
superposition of an average motion with the expected velocity plus an
oscillating motion with a frequency 2mc 2 /n. "[he latter type of motion is
called Zitterbewegung or trembling motion. 12 The origin of this trernbling
motion is seen when one considers a superposition of positive 'and negative
energy states. For instance, a general wave function is
\f'(x) = f a+(p) exp [i(p.x - Pot}] tfJp + J a -(p) exp [i(p.x + Pot)] tfJp
DIRAC PLANE WAVES
91
where o:f: are the positive and negative energy state anlplitudes.t rhe
average value of X k with this wave packet is
!tI
(x k > == J 'F*xk'F d 3 x
= f tJ3:J.: d 3 p cfp'[a:(p) exp [-i(P.x - Pot)]
+ a(p) exp [-i(p"x + Pot)]]
( . ) [ ( 1 ) - i1J t ( d ( . , ))
X -l a + P e 0 --- exp lp.X
ap !
+ a _ (p/)ei:..,ot ( exp (ip/.X) ) ' "1
a PI:: ...J
= ( 27T)3 J cFp{ateiPot + ae -iPot) { i ;0 _ ) (a+e-i:pot + a _eiPot)
\ °Pk'
where tl1e integration over x has been perforr11ed to give (27T)3('(P - p'),
and after the p' integration is made aU arguments of ax are equal to p.
If only Q+ or Q_ is different from zero, this result for (x k > is linear in t, as
is seen wh.en the differentiation '\vith respect to PTc is performed; but with
both a+ and G___ present there are cross terl11S of the type a.a_e2iPut and the
conjugate terrn which give rise to an osciHatory tinle dependence of (Xtt)
with the above-mentioned frequency.. .l\n understanding of these effects
can be obtained from the requirement of the theory that a probability
density exist, implying the possibility of a precise position measurement,13
and the requirement of energy and momentunl conservation, which implies
an uncertainty Llx:> lie/po t'. Ii/me. The trembling motion is a con-
sequence of the reconciliation of these two requireml)ts. "fo avoid this
we must redefine the position operator to be X in the old representation,
and not x. This operator X is ;),ppropriately referred to as the rnean po,rition
operator. lts transform in the FW scheme, as indicated, is just x
The transfornlatioll of otber operators in the FW scl1eme is readily
carried cut. We mention oniy that the momentlun operator, commuting
with S, is unchanged. For other cases the original literature Illay be
consulted. 10
A number of investigations extending the scope of the F"W tra.nsforma-
tion have been published. P.or exan1ple the extension to two Dirac
particles has been studied by Chrapl yvy 14, and by Barker and Gfover,15
the extension to an arbitrary number of particles by Pursey 16 who also
discusses the essentially unique character of the transforn1ation.. An
t These amplitudes are the unnormalized wave functions in momentum space.
92
RELATIVIS11C ELECfRON THORY
alternadve representation in which H' contains only an odd operator
(a.p UT 1J !p) can be constructed 17 (see Eq. 3.33)'1 and this is of interest for
high energy particles or, alternatively, for nlassless particles; see also
section 40 and Chapter VII. Other methods for decoupling the four Dirac
equations into two independent sets of two equations have been described
in the literature. 18 ,19
19 PROJECTION OPERATORS
(;eneral Properties
We ha.ve seen that the four-component structure of the wave functions,
as well as the existence of four linearly independent plane waves for a given
momentum p, is a direct result of the t\vo-valued nature of the spirt and of
tb.e sign of the energy. This doub]e division of the four states forms a
natural basis for the construction of projection operators which play an
extremely ilnportant role in the theory and :its applicatioJJs.
F 1 0r convenience the entire set of quantum l1Utnbers describing a plane
wave state win be designated by 'fj and the corresponding Wave functions
by 1.p{'l)(X). Only the space part of the wave functions will be needed
beca.use the time parts can(el out in the foIJo,ving procedure. In detail,
for the representation used abov, 'YJ win then sta.nd for the three nurnbers p,
the sign of the energy Sw = »r/ po , and the eigenvalue, A( = ::i: 1), of {!}z.
1'hen summation over 'fJ has the n1eaning
1 = (27rr- s r d 3 p 2:
r: ., sw).
since the number of states in the volume element of phase space d 3 p d 3 x
is (217)-3 d3p d 3 x and we consider unit volume in configuration space
Since the 11'(YJ) form a eornptete orthonorn1al set, we can expand a
four-cornponent function in terrns of thern:
F(x) = t c<) 'Ip<>(x) = [f cFx' 'f'<'1>*(x') F(X')J 'Ip<'O(x)
Here x' is used to distinguish the integration variable from x. In sI>inor
index notation the above reads
. F p(x) = [f d 3 ;c' "I.'1)X{x') F".( x') ] ''P>(x)
= <rp f d 3 x' F ,lx')I5(x -. x')
DIRAC PLANE WAVES 93
so that ,¥e obtain the completeness relation
! 1p,,)X(x') 1p")(x) = d ap <5(x - x') (3.42)
"
This relation would be valid for any complete orthonormal set. For the
plane wave case we use
.,,(")(x) = a (")(p ) exp (ip.x)
and (3.42) becomes
(21T)-a j ! tfJp a")X(p) a'1)(p) exp [ip.(x - x')]
Sw l
= "'Px - x') = (2'IT)-3tlP f tFp exp (i,.(x - x')]
by the Fourier expansion of the delta function. Hence
! a")(p) a")X(p) = pa (3.43)
8w).
In previous sections the notation was a(p) = U(P) for W = Po and
a = V( -p) for W = -Po. In (3.43) the label '1 should be interpreted
only as an abbreviation for the set of two nuulbers Sw and A. For
definiteness we label the four by 1j = 1, 2, 3, 4 according to the following
scheme:
1 234
Sw 1 1 -1 -1
A 1 -1 1-1
Thus 7J = 1 and 2 are positive energy states, 'YJ = 3 and 4 are negative
energy states. For 'fJ = 1 and 3 the spin component «(),)) along z is
positive; for 'YJ = 2 and 4 it is negative.. The notation
p",] = a")(p) a")x(p)
then refers to the elements of a matrix..P<TJ). Since
(a ('I) a ('I'» ) - -
, - u",,' - US w 8w,{I A.A'
the following properties are seen to hold:
( p<,,) p(If#» - p<,,) p(tt') - a(If) a(tOX a(I:') aCIJ')X
pet - pr 1(1 - P 1 r (1
- d a(") a<"')X = {) p(rr)
- II'" P tI II'" ptJ'
(3.44)
Hence
P<If) pC,,') = 0
if '1J =F rj'
(3..45a)
andt
[p<'1)]2 = pc,,) (3.45b)
t Since p("I) is not a unit matrix, it follows that det p(f1) - O. as it must for all pro..
jection operators.
94
RELATIVISTIC EI,ECfRON TJ.IEOR Y
Finally,
4:
p<q) = 1
11=1
These results. show that each p(TJ) is a projection operator: Eq.. (3.45a) is
the I1lutually exclusive property, (3.45b) the idempot.ent propel1y, and
(3.46) the exhaustive property.
If '1) --/= r;' the matrix p(r;) + p(r() is also a projection operator 'Thus, for
(3.46)
P(l2) = p(l) + p(2)
we have, fronl (3.45),
1\1S0
[P(12)]2 := [p(1)]2 + [p(2)]2 p(l) .+. pU) = P(12)
P(34) P(12) =: P(12) P(34) :' 0
by (3.45a) and
P(12) .t. .P(34) =:: 1
by (3.46).
l'he projection of greatest physical interest are tht four per;) and those
for positive and negative energy as \¥(n as positive and D.egative spin
projection 'The ener"Y projection operators are
P(Po) =:: p(l) '1'- pen
1.\ - Po) == p(3j -f.. p(4)
(3.48)
and fhe spir projection operators ar
pel) = 1.)(1) t- p{3}
P( -1) = p(2) -f- p(4)
(3.48 )
Energy Projection Operators
We first consider the positive energy projection operator. \Ve "vrite
16
P(Po) = 2: C .Lir'A.
A:::::l
and it follows that
4CA = 4c(J'A) = Tr YA.P(PO)
::; (''It')'' a(I1)a{1i)X
\ I /..../ AI) k p J\.
II =: 1,2
= 1/t;)X ( "v ). a('O
;- "..1. .i A, A/J P
11 1;.2
= I (a(l1) Y Aa(1 0 )
,,:;: 1,2
DIllAG PLANE WAVES
95
V{hereas the trace is indepen.dent of the representation, the explicit 'Y .A are
not. We use the standard representation. "lith a(1) = U+ and a(2) = U_
we obtain
4c(1) : I (a(Jf), all;)) = 2
'I
4c(fJ) = 2/ Po
4c(rX i ) = 2Pi/ Po
and all other C(,'A) == O. These resu1ts are obtained easily by recognizing
that Y A has one of t!O fOfJns: ,ither Y A is even:
'v = fa 0 )
i A \0 b
V\ l ith a = :l:b, or 'Y A is odd:
( ' 0 a )
YA = \b 0,
'\¥ith a* ='b and, sinc a (and therefore b) is either hermitian or anti-
hermitian, a = :l:b. 'fhen, for even Y A'
! (a("), Y Aa C ,,») = Po + 1 ! ( xm, ( a :t Po - 1 a.p aa.p) xm )
" 2 Po n Po -t- 1 I
for a = :!::b, and, for Y A odd,
I (a(l1), Y.d a ('1») = ? p 2 (x m , fa, a-p):t: x m )
" -Po m
where the anticonlmutator is used for a = b and the commutator,
(a, o..p)__, for a = -b. The fesult is then
P(Po) = P+ = ! ( l + (;t.p + P )
2 Po I
=: l ( l + ! )
2 Po
Fronl (3e49) the idelnpotcnt property is obvious
(3.49)
just as ',vas to be expected.
since h 2 /P5 = 1.
For the negative energy states, a(S) :.:: V+( -p), a(4) = V_( -p), and
P( - Po) = P - = .!:.! (a C "), Y Aa(")h' A
4 A '1 ::: 3 ,4
Interchanging small and large components and changing the sign of p in
the positive energy case, we have .
! (a("), Y Aa('1») = P!L,+ ! I r x"", ( Po - 1. a.pa a.p :!: a ) \ Xm 1
11 2po Tn L... \po + 1 ..J
96
RELATIVISTIC ELECTRON THEORY
for even r.A. and a = :i:b, and also
(a(I1), Y Aa('O) = - 2 p I (xn, (cs.j), a):f: x m )
" Po m
for Y...4. odd and a = 3:b. Hence C 4 changes sign for 'YA odd and for YA
even with a = -b, while CA remains the same for 'Y A even and a = b.
Thus we find
p _ = ! ( '1 - ex-, + P ) = ! ( '1 _ .! ) '
2 Po 2 Po
Again p = P _ is obvious and
(3.50)
p+p_ = p_p+ = 0,
P++P_=l
is readily checked.
In general, if "p is a linear combination of positive and negative energy
states
"P = 1JJ+ + 1p--
in an obvious notation,
P + "P = 1fl+
P-1Jl=1J'-
since P+"P- = P-"P+ = O. The result P"+1JJ = "p+ and VJ = 1J'- for any
integer n is then also obvious.
For the positron states the momentum is the negative of that for negative
energy states. Retaining the symbol p for the observed momentum gives
. the operator
1 ( cx.p - fJ ) X -1
PPOS = - 1 + = p = CP +C
2 . Po
This is a projection operator in the sense that P;os = Ppo' but clearly the
complete set of projection operators of which Ppos is a member contains
P+(-p) = tPi)-l(po - «.p + (J), as the other member.
(3.51)
Tbe New Representation
Instead of using P(::t: 1) or p(1'J) defined above, it is more useful to define
the spin projection operators in terms of eigenfunctions of -ft and (!}C-ii
where, as before, ..
(!) = a.p p + I fJcr.e i e i
i
(f)C = - a-, j) +. 2: fJa.e i e i
i
DIRAC PLANE WAVES
97
Since
...." " A " ( A )
k a-e i e i = 0 - a-p II = P X ,a X p = 0'.1
i
these can also be written in the fornl
;} = :i:a.p p + (3aJ.
CJearly a.l and O'c>p it anticommute since they are obtain.ed from (Jre, (111'
and G z by a rotation of the z-axis to the direction ft.. We note that
(!) = per 4- (1 - P) O"p P
(!Je =--= t 1a - (1 + fi) a-p p
so that in the nOll-relativistic limit f3 ._ 1, (!) (J and fJ --+ -1, (!}e -:,.. -0
as expected.
"The eigenvalue problem
(9-0 'f = ,,o/
must give eigenvalu(s A = :J: 1 since
( (f)..ft)2 = 1
The same is true for (9c"n. We write, for the electroD,
'Y = b+ "p+ + b_ "P- = A exp (ip.x)
where
1p = U :t:(p) exp (ip-x)
since '¥ is also an eigenfunction of Ii with eigenvalue p. Then the
exponential factors cancel. 'The eigenvalue problem is then
But
(9oD,,4 = (f)-f1(b+U + + b_U _.) = }(b+U + + b_U.__)
( ! (a-p p + 0';) X. m )
(!Mit! = ( Po + _! ) ..n
:t \ 2 ( " ) a-p m
. Po O'-p P - a.l X
Po + 1
(3.52)
We observe that a .l-n and a.1\ anticornmute, a11d (3.52) becornes
I bnlo.p p-n + O'l- n ) x rtt = A I h m x?n
t rn
frDnl the upper components and
I b 7r lp.ti + (fep a .Len) x 1n := ).. 2: b.mo"p X"
m. ?I'"
98
REI",TjVISTIC El"E(TRON THEORY
from the lower corflponents By operating on the left \vith a-p in the
second 'equation, this equation becomes identical with the first and the
probJem is reduced to the transformation of the upper components only,
that is, of the Pauli spin functions. 1\1 oreover, a.p p.ft + a J. -0 = a.o and
the transformation in question is one that diagonalizes a-f1 in the X m
representation. This transformation has already been carried out in
section 5, and those results can be taken over at once. Thus the coefficients
b-:t:. are identical with the am used there. 'The result is then
A A -"'lJ' 2 1} U ';mlO . r{} (T
= + = e <"t"j cos - + + e"""/" SIn - _
2 2
(3.53a)
for it = 1 and
f).
., A -i9Y/2,' t 1;
../:1. = _. = - e Sin ..- Co., +
2
.+ e i <P/2 COSo {} U _
...,
-"'"
(3.53b)
for )" = -- i. Here f} and rp are the pOlar and aZlrnuth angles of n as before.
For '{J == 0, {} = 0 these reduee to ,A:_t =.7.;: [J: t a they should. Jote that,
for f} = 0, the factor exp (:f: icpJ2) enters flS a crivial phase corresponding to
the fact that, if Ii is along the z-axis, the positjons of the x- and y-axes are
not specifiedw
Just as in section ] 5 \ve can show that
('lit:t, (9'¥ :1:) = (A:t, (9 A 1;) = :!: 0 (3.54)
This wouid be expected fron1 the general principle of covariance. However,
as a check the result (3.54) win be worked out for 'Y +. From (3.53),
('¥ +, (9'£"+) = cas:! (U +1l!JiU +) + sin 2 % (U _ 19IU_)
+ !e"P sin {l(U + !IU _) + 'e-iq; sin {feU -lmlU +)
Since (!J is hermitian the last tern1 is the conpix conjugate of the third.
I t is also seen that
/ 1 T I A '" I T 7 ) A Po J- 1 r 1 + p2 J l ( 'i11. .A m'\
\ v m ap Pi lJ 'tn' ::-:-.: P -- 2 -'-1 1 i -:-:'--;- ) 2 X , a-p X' )
Po.' (Pt 1 -r ...
"= p(X m , a-p X W ')
and
(U"m!fJa4lei etllJ m ,) = elxm, cr-e i X m );
where again U m is written for U:t' 1""hen since
(X m ., a;oV X m ) = V z for In = m' = !
= -v for In = m' = _1 2
h
i = 1, 2
=: Jt' - iVy
=: Jlx -i- i V:;
for m == --nt' = i
for n1 == -m' = -i
(3.55)
DIRAC PLANE W A YES
99
we obtain
('1',+, (9'1"+) = COs-o(pPz + ei;z)
+ 1. . {} f itp [ '" ( .... . A ) + ( A .... )]
2 Sin \e p p - ZPy f e j ,ej - le jll
+ complex conjugate}
Since n z -== cos'{} and fix :I:: {fly = sin {}e:l: i9i , this beconles
('Y +, (90/ +) = nf:e Z + !(n;l: + iiiy)(c x - iy) +. lena; - intl)(e + ie ll )
=0
( 3.56)
In a similar way we find
('F' _, (9'Y _) = - ft
(3.56')
since the vector n occurs only in the coefficients b m and, under the trans-
formation {} -+ 7T - {}, ffJ 'IT + cp, the coefficients bm, for A = 1 go over
into i times the coefficients b m for Ii. = - 1. The factor i, of course, does
not enter into the expectation value.
The two states 'Y :f: completely span the two-dimensional spin space for
positive energy states, just as was the case for 1pj:. For the positron states
the results are obtained most simply by charge conjugation. Thus
'Y = C'J1
so
A C = _e itp / 2 sin {} U C + e -ifJ'J/2 cos {} uc:..
+ 2 + 2
= e -if/J/2 cos {} V - e itp / 2 sin {} V_
2 + 2
and
A:' = e i q;/2 cos {} U + e- i q;/2 sin {} U':..
2 2
= e itp / 2 cos f!. V _ + e - iCP/2 sin {} V
22+
which should be compared with (3.53a, b). The eigenvalue equations for
the spin now read :.'
(9c.ra 'Y = +'Y
and, in addition,
(\f", e c'Y) = (C'Y, ((!)X (.' -lC'f.)
= ('Y =1=, .(!}':P =f)X = + it
100
RELATIVISTIC ELECTRON THEORY
The interpretation of 'Y, for a given vector 0, is then in one-to-one
correspondence with the interpretation of'F =F. Alternatively, the vector it
is replaced by -0 upon charge conjugation. This is in agreement with the
result that the V:1: are eigenfunctions of with eigenvalues =F 1.
The Spin Projection Operators
With the wave functions 'Y:i: and 'Y we construct the spin projection
operators, that is, the matrices with elements
Ppt1(:l:ft) = (\}J' :i:)p(7 :i:) = (A:i:)p(A:i:) (3.57)
Only P(ft) need be calculated since P( -0) is obtained by changing the
sign of ft. As before,
Pen) = ! .l (A+, Y4A+)YA
A.
Separating the various possible Y A into even and odd
facilitates the calculation of the expectation values.
matrices we find
(3.57')
Dirac matrices
For even Dirac
YA
1
{3
(A+, Y AA+)
1
fJo
-1
Po
p;l[V + (po - 1)p.V p]
V - POl(pO - l)p.V it
a
where
V = .l bbm'(Xm, ax m ')
mm'
For the odd Y A the results are
YA (A+, Y.AA+)
-1. V
Ys -Po p-
« p!Po
i{3« - POl(p X V)
i{3ys 0
The vector V is readily obtained from (3.53a) and the result is
V=ft
Consequently,
P(ft) = !{1 + Pol({3 + «-p) + pola-[ft + (Po - l)o.p it]
+ {3a.[ 0- POl(pO - 1)0-' p]
- ipOlpa..(p X ii) - PolY5ii.P} (3.58
DIRAC PLANE WAVES
101
We observe that
P(n) + P( -0) = l(l + h/po) = P +
That is, the positive energy projection operator is obtained by summing
over the two spin states in either basis, as expected.
For the positron the projection operator is readily obtained by charge
conjugation. This gives
PC(n) = l{l + pole -fJ + cx-p) - pola-[ft + (Po - 1)0-; it]
+ (la-[ft - POI(pO - 1 )ii.p p]
- ipO"lpa..(p X 0) + PolYsft-p}
In the rest system, p 0, Po 1, and
(3.59)
pen) - l(l + a-fi)l(l + fJ) = Po(ii)
PC(n) t(l - a.ft)!(l - p) = Pg(ii)
(3.60a)
(3.60b)
These are just the products of the non-relativistic spin projection operator
and the positive and negative energy projection operators in the rest
system. The projection operators P(fi) and PC(n) can be written more
compactly in terms of these rest system operators. F'oT this purpose we
define
V(O) = bm(')
which is the limit as p 0 of the actual wave functions. Then, for positive
energy,
P + V(O) = [1 + atop p fJ ] ! b m ( xorn)
( 'Vm )
Ai
_ b Po + 1 _ Po + 1 A
- m a.p - +
m 2po --- X m (2Po)
Po + 1
Then
P"p(ft) = V,,'F: = 2po (P +)"/l 'FiO) \F(O)(P +)rp
Po + 1
2po (
= P+PoP+)(J'P
Po + 1
Therefore
P(ii) = 2po P + (1 + a.n)(l + (3) P +
Po + 1 4
(3.61)
102
RELATIVISTIC ELECTRON THEORY
Similarly,
PC ( ft ) = _2po _ pc pc pc
+ 1 + 0 +
Po '
_ 2po P (1 - ait)(l - (3) P
- ---- 'os pos
Po -t- 1 4
The projection operators in this form will play an important role in the
theory of scattering as given in section 33 and the discussion of Compton
scattering, section 37. '
(3.62)
20. CO,,r ARIANT DESCRIPTION OF SPIN20,21
The spin projection operator obtained in the preceding section is readily
understood in terms of a covariant description of the spin.
We consider two reference systems: the rest system and the laboratory
system in which the electron or posit.ron has momentulTI p.. We use bars
to refer to the rest system and write the four-vector momentum:
PIL = (O i)
(3.63)
\here the first entry in the parentheses gives the space part of the vector,
l"'he spin vector win be
il,u = (no, 0)
vvhere ) is:the unit vector previously written 1\. Obviously
(3.64)
fz IlP Jl = 0
and therefore, in all reference systems,
nJlPIl = 0
(3.65)
where np and PP, are the components of the four-vector into \vhich flp and
Pll transform under the Lorentz transforlnation. Sinilarly,
where nil = (n, nJ and
n III Ji = 1. 0 2 + ni
(3.66)
PIlP IL = -1 = p2 - p
since P,t = (p, P4 = ipo).
Under a IJorentz transformation of the rest system with velocity
---v = --'pJ the particle acquires a velocity v. Then, since iip transforms
like a polar four-vector under the continuous Lorentz transformation, we
DIRAC PLANE WAVES
103
can use (B.6) and (B.7) of Appendix B with v replaced by -p/Po, = pO
and obtain
n'= 110 + (Po - l)fio.p P
(3.67)
.A
n 4 = 'I1o-P
From (3.65) we obtain
n 4 = in-pi Po
(3.67a)
From (3.66) we obtain
n 2 = 1 .t (00-p)2 > 1 (3.68)
Of course, (3.67) substituted into (3.67a) gives an identity.
For fto-p = 0, the vector n is a unit vector = 110. For Do X p = 0, so
that (flo_p)2 = p2, n has the magnitude Po and is again in the direction of6o.
For other cases these two vectors are not parallel.
The spin is described in a covariant manner by introducing the operator
Q(n) = iysY p'n Jl (3.69)
Clearly ipQ(n)"P transforms like a pseudoscalar under Lorentz trans-
formation just s afto transforms under the extended group of three-
dimensional space rotations and reflections. The operators
![l :f: Q(n)]
are, moreover, projection operators since
Q2(n) = -Y5Y p'n IlYSYVnv
= nJlnv!(Y P:Yv + YvY .)
=nn =1
Jl 1J
For the rest system
![1 :I: Q(n)] !(1 :J:: fJa.n) = .(1 :i:: pa-Do)
and for both electrons ({J - 1) and positrons ({J -+ -lY this gives the
expected results. We may also observe that the charge conjugated
I operator is
QC(l1) = Q(n)
(3.69')
We now show that Q(n) is completely equivalent to (9-0 0 for the electron
and to (Qc. Oo for the positron. It is first observed that
Q = i'sf3a.-n + iYSP n 4
= f1a-n + fJyso.p/ Po
With
(9-° 0 = ap paDo + fJo-f1o - paep p-ft o
104
RELATIVISTIC ELECTRON" 1-HE()RY
we obtain from (3.67) and (3.69)
Q(n) - (!)-Oo = ftop[(po - f)lla-p .+ pfJY5/ Po -to {3Y5(PO -. l)pj Po
-- a-p + pes.;]
= Oo-p[popa.p + p,'uP .-- a..p]
= flo-}} a.p[pop + a.pfJysp - 1]
= Do-P oep[Pnf f .-- f/a.p -- 1]
= Do-P (JpfJ(Po -. tt-p -- fJ)
= 2PoDo"P (f.p/3P - (3.70)
Consequently, for a positive energy state 1p for "\¥hich "¥.r ::.::: P.}-"P it follows
that
[Q(n') -- (v.] o :-:. 0 (3.71)
In other words, the operator Q(n) -" tt)fio is a nun operator for positive
energy states only. We observe that, in contrast £0 (V..rlo, Q(n) does not
commute with the free particle I-iatniJtonian. In fact:
[Q(n), h] = 2Y5Y4[n p p p + i(h - po)n/] =1= 0
However, since n,JJp. = 0 we see that
[Q(n), h] = -4in 4 J)sY4PoP-
and, again, this gives zero \vhen applied to a pos.itive energy statc. We
see, then, that every positive energy eigenstate of (0.00 is an eigenstate of
Q(n) with the same eigenvalue (:J: 1) and vice versa. This clarifies the
observation that ![1 :I:: Q(n)) are projection operators. 1-he same, of
course, is true of t(l :J: (9.-00), and these projection operators select the
spin eigenstates in the sense that these have been defined a.bove.
For the spin operator (!)C only the sign of the term without f3 must be
changed. The steps given in (3.70) yield the result
Q(n) - (!)c. no = 2PoDo"P a.pfJ(l --. Ppos)
where Pp08 is the positron energy projection operator introduced in. (3.51)
Therefore, for positrons as \\'ell, the operator Q(n) is fully equivalent to
(!)c.fto-.a result to be anticipated in vie,,, of (3.69').
The spin projection Qperator given in (3.58) or (3.61) is related to the
spin operator (!) by
P(no) = I' + !(1 + ..fio)
(3.72)
and the corresponding charge conjugate equation C),lso holds, of course.
This is the relation analogous to (3.60). Tra.(lSforrnations to other
coo.rdinate vsfeIns are faci1itated b ) ' using th, covariant form of the
DIRAC PLANE W A YES
105
projection operator. However, P(iio), as (3.72) shows, is not in such a form
since P + is not. In fact, the covariant energy projection operators are
{ P +fJ }
1(1 =F iY/lPll) = Po
- p fJ
where P = Ppos. Since the spin-independent terms in P(fto) constitute
the operator !P + the covariant form must be PoP(fi.o)f3. Direct calculation
yields the result
4PoP(fto)fJ = 1 - iY/1PJL + iYsYJLnJL + 'YIlYvTpy (3.74)
Here np' is given by (3.67) and TJJ" is an antisymmetric four-tensor whose
space-space components are
(3.73)
T;k = - ; EjkZ[PO(no)z - (Po - l)fio.p pz]
and space-time components are
7J4 = tEiklPk(n O ), (3.74")
The stated transformation properties of TfJv are readily verified by the
methods of Appendix B. Each term in (3.74) is evidently covariant; thus
PoipP(iio)f3'tp is an invariant. The appearance of Q(n) in (3.74) is to be noted.
The last term in (3.74) indicates that an alternative and equivalent covariant
description of spin is possible in terms of an antisymmetric four-tensor.
In the rest system, for example, Tik reduces to a multiple of floo
(3.74')
21. APPLICATION TO NUCLEAR BETA DECAy22.
The extensive literature of nuclear beta decay and weak interactions in
general bears testimony to the numerous phenomena involved. A com-
prehensive discussion of these phenomena is not our purpose, and our
attention is restricted to a brief outline of the foundations of the theory
and some applications.
A convenient starting point is that of the Lagrangian density p of the
Dirac field. The Dirac equations themselves can be derived from a
variation principle 23
b f ..<t'(x) d 4 x = 0
where, for free particles,
2'(x) = 1/'* (i - «op1/' - ,81/')
(3.75)
106
RELATIVISTIC ELECfRON THEORY
is a function of the four components of 1p* , those of V' and the derivatives
thereof. Variation with respect to 1pp gives the p-component of the wave
equation for 1p* and similarly for 1p/
For the interaction of four fermions whiell are taken to be Dirac
particles the total Lagrangian density must then be a sum of four terms
like (3.75), one for each particle, and an interaction term. The interaction
term must be Lorentz invariant at least for the continuous transformations,
and it will be assumed, in agreement with observations, that this interaction
contains each of the four particles linearly with no derivatives of the fields
occurring. Thus in the process
n -+ p + e- + ji
where, by definition the light neutral particle is an antineutrino, the
interaction density in the Lagrangian is
int(X) = - g J d 3 y( ij?'(x) r Jl.. V'''(x)X tp"(y) r Jl.. V'V(y))(x - y)
+ hermitian conjugate (3.76)
The -function implies a local interaction as in electromagnetic theory. It
, does not seem possible, within the present framework, to construct a
consistent relativistic theory with any other kernel corresponding to a
non-local theory. In (3.76) the r It.. may be one of the five groups of i'.A
matrices discussed in section 14. More generally, it is a linear combination
of them. Thereby the relativistic invariance is assured. The constant g is
determined empirically by the observed beta half-lives. The structure of
the first term of (3.76) corresponds to creation of a proton and a positive
energy electron and annihilation of a neutron and a negative energy
neutrino. The hermitian conjugate term corresponds to the process
p n + e+ + ')1
with a positive energy neutrino emitted with the positron The 11 and ji
are taken to be Dirac particles with zero rest mass. t
The Hamiltonian density is obtained from if in the usual way:
= I a 1ftr -
a 01jJ(J'
where (j runs over all fields and their conjugates. Since 2 int contains no
derivatives, the corresponding term in , that is, int will be just -int.
t Experimentally, the neutrino mass is known to be less than 10- 3 times the electron
mass; see section 41. No experiment yet devised is sufficiently accurate to distinguish
between zero mass and a mass of, say, 10- 4 m. 22
DIRAC PLANE W A YES
107
The beta interaction obtained by the foregoing prescription would have
the form
£int = g C {£ + h.c.
(3.77a)
II:
where x = S, V, T, A, and P and, from (2.79),
£' s = (1Jl'P*fJ1pnx tp6*fJV/')
JIe v = (1p'P*'lpn)( V/ 'I.jJ") - (1p'P*aVJn).( 1f,6*a.1Jl)
1? T = (1p'P*{3cs1pn}{ tpe*fJcs1pv) + (1p'P*fJa.1pn)e( 1p6*{3a.1pV)
£ A = ('tfJ'P*aVJn}( 'ljJe*a1pV) - ('Ip'P*Ys1jJn)( 1p6*Y61pV)
Jt' p = ('f/J'JJ*fJYs1pn)( 1pe*{Jys1pv)
Therefore there would be ten. coupling. constants since the C:e are, in
general, complex. There are, however, two important results which hear
on the interaction Yt'into The first is the well-known fact that the inter-
action is not parity conserving as the scalar character of (3.77a) would
indicate. 24 ,25 Actually, the assumption that ..Pint and therefore Yt'int must
be a scalar is not based on experimental fact but as initially made as a
natural assumption which is not only the simplest but is in complete
parallel with other interactions, notably those of electromagnetic type.
The observation, for example, that beta particles which are emitted from
nuclei are polarized 26 is a sufficient datum to cause the coventional theory
to be scrapped.
With this in nlind, Yt'int must be a combination of scalar and pseudo-
scalar terms since either one alone would give parity conservation.t
Therefore, one writes
(3.77b)
Yt'int = g (C gJrYf' x + C Yt') + h.c.
(3.78)
a;
where C are ten new constants and Yt'; differs from Yt'x in that each
lepton covaria.nt is replaced by its pseudo..form SP, V A, T-(-4 T.
This means that each 1p" in (3.77b) i replaced hy Ys1p" in Yt';.
'Thus far it would appear that ten constants have been replaced by
twenty. The additional complication in the theory is more than com-
pensated by the added variety of experiments which can be performed. 22
As a result of these the following values of the constants can be given. with
reasonably good accuracy:
C s = C s = C T = C = 0
C v r>J C v , C A ::-:: C A = -AC v
(3.79)
t We recall that transition probabilities depend on absolute squares of matrix
elements of .1t'int.
108
RELATIVISTIC ELECTRON THEORY
,vhere A 1.2; also g'2 = 2g2(C + C1) 21 X 10- 23 in rational
relativistic units.t An overal1 phase is irrelevant.t Also, as will be evident,
the equality of the so-called even and odd coupling constants (C re and C;')
means that parity breakdown effects are as large as possible. Although
this equality of Ca: and C is fairly well established, it is useful to write
C = EC:r, in order to study the effects of deviations from the condition of
equality. The significance of the equality will become much more apparent
in light of the discussion of the two-component neutrino theory; see
Chapter VII.
The choice of coupling constants given in (3.79) leads to the so-called
V - AA theory which, at present, seems to give good agreement with all
observations. With this choice we can write the part of lnt leading to
e- emission in the form
g-l1nt = (1jj"YJl(l + AYs) 1pft)(1jje yJl (l + 1'5) 1p")
= (1p1'*Y4YJl(1 + Ayo) 1pn)C'lf,e. y4y ,ll + YS) 1p")
for Ca; = C. For C[C = eC; the factor 1 + Ys in the A-independent term.
is replaced by 1 + €Ys and in the A terms by £ + 1'5 = Yo(l + EYo). To
introduce the E we designate this matrix by a + bY5 so that a and b
interchange their roles in going from V to A interactions.
When the interaction is used in a perturbation calculation of the
transition probability the following result is obtained. For the number of
transitions in which the electron has momentum between p and p + dp',
the antineutrino momentum is between q and q + dq, and the electron
spin state is specified, say by the unit vector fto in the rest system, the
result is
w dp dq = (27T)-5 g 2 C} dp dqlfffil 2 (3.80)
A sum over neutrino spin states is implied since it will be assumed that
this observation is not made. The transition probability w is in units
me 2 /1i. In (3.80) the matrix element is between final (I) and initial- (i)
nuclear states ('Yf and 'Fa
'. ',.
Jt> it = f d 'YjY4y,.(1 + AY5) 'Y.[ tp'Y4y,.(a + bY5) tp"]
The factor in square brackets is evaluated at the position of nucleon
number k, and we have suppressd the explicit appearance in front of'Y i
t The dimensions ofg are energy times volume, so thatg' in ordinary units is obtained
by multiplying by li31 m 2 c. Thus g' 2.2 x 10- 49 erg cm s .
:I: This means that for all purposes the constants may be assumed real and also that
£lnt is time reversal invariant; see section 25.
For E = 1 the interaction is equivalent to that of the two-component neutrino theory
of Chapter VII..
DIR.t\.C PLANE WAVES
109
of an operator which changes the kth nucleon from. a neutron to a proton
if it is a neutron and gives zero other\vise.. l"'his detail and, all effects
arising from the fact that nucleons are not actually bare Dirac particles.,
will affect only the nuclear ll1atrix elen1.cnts whih enter as described in the
next paragraph.. A SUln over all nucleons is also implied in r:-Yt/ fi .
For ?po and 1pfl plane waves are assurned.. Corrections due to Coulomb
fields will alter only the total intensity of beta particles, This assumption
of plane 'Naves gives a factor exp [..-i(p + q})xv], where XLV is the position
of a nucleo!L Since XiV ;;;;,; 1<'1 the nuclear ractius, and R in our units is
O.4cxA l (A is the mass number), it follows that for typical beta spectra,
in which p and q are of order 1, that (p + q).xN <{ I.. The exponential
will therefore be replaced by unity for the transitions of greatest probability,
and then o?pe in the above is given by (353a), For these ("allowed")
transitions only the even parts of the nuclear Dirac Jtlatrices should be
retained" Therefore v'fl becomes
;fffi = «(fe, (1 +. €)ls)[j\l)A1(1) .- A(U B , 0(1 + E)/s)Uv)-M(a) (3.81)
wher kf(J) and 1\1(<7) are nuclear matrix elcn1ents:
M(l) = f dXN '}";':I r i
''\. fd \:,'1'* n"'
J\1{a J == xJ.v l,a-l i
It is evident that aJl(')v(d transitions should then be characterized by no
nuclear parity change. 'f.his is indeed what is observed.
It is custolnary to observe only th( electron, and the usual rcsu]t of
interest is obtained by ,vriting an expression for the energy distribution.
"'fherefore we write, for the transition probability for electrons with energy
in the interval Po to Po + dpo'J direction p in tIle solid angle range d(}, and
neutrinos with fil0n1entumt q In the soHd angle range dQq" and "spin
direction" Do for the electrons,
9-2
W dpo d1p dg == $L,T dpo dO;. dllqS(Po)lJ1Oh,12
(217"X)
(3.82)
where, using the conservation of energy,
S(Po) = dp p2 J r dq t5(q - W o + PO)q2 = pPo(W o - Po)2
dpo
t q = W o po, \\fhere Vo is the total energy rele,se: maximum kinetic energy of the
beta particle plus its rest' energy.
110
RELATIVISTIC ELECTRON THEORY
S(po) is a statistical factor arising from the volume in momentum space
available to the two light particles. The nucleus, which is very accurately
treated as infinitely heavy, will take up recoil momentum but negligible
recoil energy.
For simplicity we shan consider separately those nuclear transitions for
which M(O") = 0 and those for which M(J) = o. The former case arises
when 22
Jf=Ji=O
and these are called pure Fermi transitions.. The latter arises when
J f - J i = :i::l
and these are called pure Gamow-Teller transitions. For J i = J f -=I=- 0
both matrix elements would contribute in generaL
For pure Fermi transitions we need to calc.ulate
, I f.il 2 = I M(1)12 2.: I U*\ (1 + E))5)U V l 2
8"
and the sum is over v spin states. Then
I t Te , (1 + €l's)u v I 2 = U;X(l + eys)ptY U; U;,(l + C:f'5)a' u;::<
= Pp'p(rto)(l + €Y5)P;Y P;(/,(l + l£?,S):' P '
where P(fto) is the spin projection operator of section 19 and pv is a similar
operator defined for the neutrino. When the sum over neutrino spins is
made, P" becomes the energy projection operator for the neutrino. For
· zero rest mass and physical momentum q this is
2.: pv = PC,,) = tel + «-4)
8"
(3.83)
since I'll = q. Then we obtain
lfii2 = IM(1)f2 Tr P(fto)(l + eys) P(vXl + eY5)
= IM(l)( 2 Tr P()[l + e 2 + 2eys] P( v)
(3.84)
since P(v) commutes with 1 +, €Ys. A parity conserving theory in which
£ = 0 gives 1 for the square bracket. Similarly, if only the €r5 term were
present instead of 1 + eys, the square bracket would be €2. Therefore the
parity non-conserving effects must arise from the 2€Jls cross term, and the
relative order of parity non-conserving terms to parity conserving terms is
j" = 2e
1. + €2
(3.85)
DIRAC PI.1ANE W A YES
111
For € = 1, f has its maximum value, namely 1, and for € = 1 + { with
o < 1 this factor f is 1 - b 2 /2..
The trace in (3.84) is ea.sily evaluated, and for the transition probability
one obtains 27
2
W = (f5 S(Po)IM(1)1 2 I F
(3.86)
where
g;r = !g2C(1 + e 2 )
and
IF = (1 + p-4/ Po)(l + &'-0 0 )
(3.86')
The beta particle polarization is
f/J = - f q + p + (Po - 1 )p-q P
Po + ,-ii
(3.87)
for electrons. For positrons the same result applies with the exception
that the sign of fIJ is cllanged.
If the electron polarization is not measured, IF is replaced by
\
! IF = 2(1 + pelt/po)
Be
and this gives the well-known electron-neutrino correlation, 1 + (vIe) cos{},
where {J is the angle between p and q. If the neutrino direction is not
observed, P(v) will be replaced by t and a. factor 41T from integrating over
dO, is introduced in }t'. Then
fJ' = -p/Po
Thus he polarization is longitudinal and the helicity is negative for
electrons, positie for positrons. In the general case r!J> can have any
direction in the p-q plane, even tansverse to p.28 It is seen that v/hen
j- = 1 the magnitude of f!P is unity; this implies that for Do = -f!lJ no
beta particles are emitted. lIenee in pure Fermi transitions only one of
two spin states is formed, provided the spin basis, or selection of spin
states, is made in terms of the vector fJJ defined by (3.87). lhen f < 1
the polarization is not conlplete and .both spin states are formed, though
not equally. It will be seen in Chapter VII that the explanation lies in the
fact that for f = 1, E = 1 only one neutrino spin state is possible. Hence
the averaging over neutrino spin states is superfluous in that case. F'or
f < 1 there are two neutrino spin states and the averaging process reduces
to a value less than unity.
112
RELATIVIS'fIC EI..:ECTRON T'HEORY
For the !>ure Ganlow- Tener case we calculate
'C:fit2 ' )2 2: U:X(ooM(a)(l + €YS»)PIL U U;,(a-M(a)(l -t- €Y5)Jt' U;
8,..
:::= A 2 Tr P(iIo) 0 0 1\1«(1)(1 {- €Ys) P(lI) aMx(a)(l .+ €,'5)
-= A,2 Tr 'P(fto) ooM(a)(l + £2 + 2€ys) P(v) a.MX(a)
The transition probability is no""' given 27 in terms of
2
g S ' ( .
' = - .Po)lGT
(21T}}
(3.88)
where
and
g 2 == ......2C 2 (1 + €2 )
A .../:>.A,
IGT = (Bl - P.A 1 )(1 + &"'iio)
Po
Here the poJarization 28 is
&' == :f: Ao - Bop. + (Po - l)p.A-o P
POBI - peAl
(3.89)
(3.90 )
and
An = :i:if ,n M)( M X + fl-'rn[I\i>l\I X q - (10M M X - q$lVI X M]
Bm = f1-mM.M X ::i: ifmq.M X ft'lX > 0
The upper sign refers to electrons, the lower to positrons. No\v the matrix
elements M and M X depend on the nuclear orientationc If aU substates
in the initial and final Ilucleus are uniforroJ.y popu.lated and the n.UCJCl are
not oriented, v.'e find
(M.M X ) = IM q l 2 = M(a}MX(a); Re (4-M M X ) :-':: liijM q f2
i(M X M X > = 0
where the angular brackets now indicate an average over nuclear substates.
Then, for f = 1,
fJJ = :I:: it} - p + i(Po - 1)p-q
Po - tp"q
Now 1.91 < 1 even thoughf"= 1. r"fhis result filight have been expected
in vievv of the averaging over nuclear states. For no observation of the
electron polarization the usual electron-neutrino correlation is observed.
f"or un oriented nuclei this is
(3.91)
I GT =. 21M o 1 2 (1 - tp 0 4/ Po)
DIRAC PLANE WAVES
113
If the direction of the neutrino is not observed and no polarization
nleasurement of the elctrons is made there is, from the p.At term in the
first factor of I GT' a correlation 25 between the direction of the electron
nlomentum and the nuclear polarization i(M X M X ) which is now not
zero. This vector is
i(M X M X > = N< : >MaI2
where (Ji/J i ) is the polarization of the initial nuclear state and 28
N = 1 for J i = J f + 1
= (J i + 1)"-1 for J i = J f
= -Ji/(J i + 1) for J i = J f - 1
PROBLEMS
1. Explain, from a consideration of the nlomentum spectrum resulting from
a precise position measurement, the fact that the instantaneous velocity of a
relativistic electron must have the value :l:c.
2. Show that each column of the 4 by 4 matrix h + Po where
h ::;: ex.p + tJ
is a solution of the amplitude equation (3.4). Are these four solutions linearly ,
independent? Answer the same questions for the columns of the matri (h + Po) A
where A is an arbitrary 4 by 4 matrix.
3. Obtain the electron and positron \vave functions in the representation
ex = Pa O , f3 = PI
where Pa and PI are given in (2.38b). Find S where
SploS-t = Pa o , SP:i S - 1 = Pt
Compare the non-relativistic limit of the wave functions in this representation
with those obtained in the standard representation. /
4. Using anticommutator relations, show that the expectation value of (3 for
positive energy states is
<fJ) = l/po
independent of the representation. In a similar way show that
(a) = p/Po
in all representations.
5. Let A be an arbitrary four-component spinor. Show that P +A = tp+,
where tp+ is an eignfunction of P + = 1(1 + h/po). Thus a positive energy
amplitude can be generated by P + operating on any four-component spinor.
6. Show that the spin operator (9 can be obtained from ° by a unitary trans...
formation; that is
SOS-l = (9
Find S subject to S* = S-l. Obtain the corresponding result for the positron.
114
RELATIVIs'rIC ELECTRON THEORY
7. For two vectors A and B, whose components commute with the components
of @, show that
l'J.A .B = A.B + i(!}.A x B
Write the corresponding result for thf; positron.
8. Show that the product of two projection operators ..4 and B is again a
projection operator if A and B commute. Is the converse theorem true?
9. The pair of operators A and B fulfill two of the following:
(a) A2 = 64, B2 = B
(b) AB = BA = 0
M A+B=1
Show that if (c) and (a) or (b) are true then (b) or (a) must be valid, but if (a)
and (b) hold then (c) is not necessarily vaJid. What is the relation between these
results and the existence of a complete set of eigenfunctions of a set of com-
muting operators?
10. Verify Eqs. (3.61) and (3.62).
11. \Vhat interpretation should be given to the projectio operator P(13) =
p{l) + p(3)? Compare the operator P(fi) with P +P(13). Under what circurn-
stances are they equal?
12. Show that (3C( QC(n) - 19 c .fi o ) is a nun operator for positron energy states.
13.. Considering plane wave states of given momentun1, show that in any
representation the Dirac current for positive and negative energy states with the
same physical momentum must always have the same magnitude and sign.
14. Show that for Do parallel or antiparallel to the lTIOrnentum p the spin
operator projected on no, that is; (9-° 0 , is equal to the helicity operator a.p while
for no perpendicular to p it is fJa.iio.
15. VerIfy that Tik and T j4 defined by Eqs. (3.74') and (3.74") do in fact
transform like a four-tensor.
16. Evaluate the position operator x" in the Foldy\Vouthuysen scheme to
obt3in the result given in the text. Evaluate the FW transform of the spin
operator a and of the orbital angular momentum I = r x p. Should}' -1- -ja'
conunute \vith H', the (new) FW Jlamiltonian?
17.. F<or a general wave packet consisting of a superposition of positive and
negative energy states show that the, current density has osciHatory terms cor-
responding 10 the Zitterbewegung. Is this also true of the average mOlnentum?
18. If in the beta decay formulas (3.86') and (3.89) the vector Do is replaced
by the Pauli spin matrix u, the polarization is
rr a I
flJ =-
TrI
How should this fact be interpreted '?
19.. From the result
@j(!)k = Ojk + i€jkrn(!)m
evaluate the anticommutator of (9.0 and (!}j, and ShOi from this that the expecta-
tion value of (!) is :i:il. Note that this proof does not require the use of a specific
representation.
DIRAC PLANE WAVES
115
REFERENCES
1. P. A. M. Dirac, Proc. Roy. Soc. (London) A 126, 360 (1930). See also P. A. M.
Dirac, Proc. Cambridge Phil. Soc. 30, 150 (1934); W. Heisenberg, Z. Physik 90,
209 (1934); V. Weisskopf, Proc. Dallish A cad, Sci. 24, No.6 (1936).
2. C. D. Anderson, Phys. Rev. 43, 491 (1933).
3. J. Schwinger, Phys. Rev. 74, 1439 (1948); 75, 651 (1949).
4. S. Tomonaga, Prog. Theoret. Phys. (Kyoto) 1, 27 (1946).
5. See, for example, J. M. Jquch llnd F. Rohrlich, TIle Theory 0.( Photons and Electrons,
Addison-Wesley Publishing Co., Cambridge, Mass., 1955, p. 342"
6. S. Koenig, A. G', Pradell and P. Kusch, Phys'. Rev. 88, 191 (1952).
7. An excellent sutnmary of the data as of April 1958 appears in the article by E. Segre,
Ann. Rev. Nuclear Sci: 8, 127 (1958).
8. R. L. Garwin, D. P. Hutchison, S. Penman, and G. Shapiro, Nevis Rept, 79 (1959).
9. J. Rainwater, Ann. Rev. Nuclear Sci., 7, 1 (1957).
10. L. Foidy and S. A. Wouthuysen. Phys. Rev. 78, 29 (1950). See also S. Tani} Progra
Theoret. Phys. 6, 267 (1957).
11. M. H. L. Pryce, Proc. Roy. Soc. (London) A 150, 166 (1935); A 195, 62 (1948).
12. E. Schrodinger, Berlin Ber. 419 (1930); 63 (1931).
13. T. D." Newton and E. P. Wigner, Revs. A/od. Phys. 21, 400 (1949).
14. Z. V. Chraplyvy, Phys. Rev. 91, 388 (1953); 92, 1310 (1953).
15. W. A... Barker and F. N. Glover, Phys. Rev. 99, 317 (1955).
16. D. L. Pursey, Nuclear Phys. 8, 595 (1958).
17. M. Cini and B. Touschek, Nuovo cimento 7, 422 (1958). See also S.. K. Bose,
A. Gamba, and E. C. G. Sudarshan, Phys. Rev. 113, 1661 (1959); P. Y. Pac. Progr.
Theoret. Phys. 21, 640 (1959); 22, 857 (1959).
18. R. A. Ferrell, Thesis, Princeton University, Princeton, New Jersey, 1951 (un-
published).
19. B. Kursunoglu, Phys. Rev. 101, 1419 (1956).
20. H. A. Tolhoek, Revs. Mod. Phys. 28, 277 (1956).
21. F. W. Lipps and H. A. Tolhoek, Physica 20, 85, 395 (1954).
22. For a general survey see, for example, M. Deutsch and O. Kofoed-Hansen, in
E. Segre (ed.), Experimental Nuclear Physics, John WHey and Sons, New York,
1959, Vol. III, Part XI, especially sections 3ff. Also M. E. Rose, Handbook of
Physics, McGraw-Hill Book Co. New York, 1958, Part 9, Chapter 5.
23. G. Wentzel, Q.lantum Theory of Fields, lnterscience Publishers, NevI York, 1949.
24. This hypothesis was originally suggested by T. D. Lee and (. N, Yang, Phys. Rev.
104, 254 (1956).
25. The first experiment which established parity non-conservation in beta decay was
carried out by C. S. Wu, E. Ambler, R. W. Hayward, D. D. Hoppes, and R. P.
Hudson, Phys. Rev. lOS, 1413 (1957). This experim.ent demonstrated an anisotropic
angular distribution of e- emitted by polarized C 0 60 nuclei.
26. A summary of the data as of 1957 is found in Proceedings of the Rehovoth Conference
on Nuclear Structure, H. J. Lipkin (ed.) North Holland Publishing Co., A.msterdam,
1958. See pp. 376-403. )
27. J. D. Jackson, S. B. Treiman, and H. W. Wyld, Jr., Phys. Rev. J06, 517 (1957).
28. R. H. Good, Jr., and !vI. E. Rose, Nuovo cimento 14, 812 (1959).
IV.
PARTICLE IN ELECTROIUAGNETIC FIELDS
22" THE WAVE EQUATION
Classical Electromagnetic Fields
In this discussIon we shall be concerned with the interaction of electrons
or positrons with external electromagnetic fields. While ,it is possible to
construct a Hamiltonian equation and a covariant wave equation for more
general cases, these seem to have mainly academic interest. There is one
exceptional case and that is the problem of the beta interaction vlhich was
discussed in section 21 and win again be considered in sections 25 and 42.
The electromagnetic fields are taken to be real classical Maxwell fields,
and in the present theory it is assuIned that they are given independently of
the dynamics of the Dirac field. These fields are then described in terms of
,two vectors 8 and K, the electric and Inagnetic fields for which the uual
MaxweH equations apply. In non..rational Gaussian units these are
47T . 1 08
cud K = - Je + - -
c c at
(4.1a)
curl 8 = _ ! a:¥e
· c at
(4.1 b)
div 8 = 4rrpc
div:¥e = 0
(4. Ie)
( 4.1d)
where Pc and jc are the electric charge density and current density respec-
tively. Then from (4.1a) and (4.!c) the continuity equation follows:
B',;;
div J . + aPe = 0
" ot
116
(4.1e)
PARrI(LE IN E.l..ECTROMAGNETIC FIELDS 117
Since the charge is
e =.r P. d 3 x
and this is taken to be a scalar invariant, Pc znust have the Lorentz trans-
formation property of the time part of a four-ve<tor whose space part is l..
More exactly, the four-vector is SIt = (jc, icpc) so that (4.1e) reads as
follows:
which is in covariant forrn.
The field equations (4.1a) and (4.tb) can be replaced by equations in the
ve(;tor potential A and sca.lar potential <I> by the definitions
8 = _ 1 8A _ 7f1>
c at
.05#
-=0
aX tt
( 4.2a)
e'Yl' == curl A
( 4.2b)
so that the 110mogeneous equations (4Jb) and (4.1d) are satisfied auto-
matically. Furthermore, if the Lorentz condition
div A + ! o = 0
c at
(4.2c)
is assumed, the A and <P satisfy a simple second-order wave equation. We
introduce a four-vector potential Ap = (A, i<P) and then
0 2 A", 411
. = - - S (4.)
dx v OX" c p,
For the vacuum, sp. = 0 and this is the zero m.ass Klein-Gordon equation.
Of course, the Ap are still not uniquely determined because, if they are
replaced by
a;'
A' = A + - (4.4)
f.l JI. a ;J:: 1J
where G is a scalar function satisfying the zero rnass Klein-Gordon
equation, the fieJd strengths
. 0 vf = - ! - V<!>'
C ot
;Yea' = curl AI'
are the same as 8 and ft 1ne transforlnation (4.4) is called a gauge
transforn1ation of the first kind..
118
RELATIVISTIC ELECTR..ON THEORY
To justify the description of Ap. as a four-vector it is necessary only to
observe that a 2 jox v OX,t in (4.3) is an invariant operator. The Lorentz
condition is
(J.,4 J.t
.--=0
ax)!
and is satisfied in an inertial frames if it is assumed true in anyone. The
Maxwell equations are then written in covariant form by introducing the
antisymmetric fieJd tensor
( 4.2c')
aA oAJl
F = -. - --=-F
IIV YJl
u X Il UX v
In detailed f orIn this is
0 1'3
._ 2
-3 0 ;Y{)
, 1
F p ,,: 2 -.Yt'l 0
it9\ ii)2 i8 3
Then the Maxwell equations become
of Py 41T
-=-$
ax" c JJ
(4.5)
--i8 1
-i8 2
- iB3
o
( 4. 6a)
for the inhomogeneous equations, and for the homogeneous equations the
result is
oF vp 0
E pvp;' - =
ax;.
which are in manifestly covariant form. Here, again, €f.jvp'). is the completely
antisymmetric unit tensor of rank four (section 14)a
(4.6b)
The Equations of l\1otion
In classical mechanics the equations of motion for a charged particle
(charge -e) in a field are obtained fron1 the free particle equations by
replacing the energy Po by Po + e<I> and the momentum by p + (ejc)A.
The correct Lorentz force -e(8 + v X jc) is then obtained. It will be
recalled that - e is the electron charge. The same prescription is valid in
non-relativistic quantum mechanics because, when the replacements
. " h C . 0 + ..Jh
l .- - 2,1 .- Cq..l
ot at
-iIiV--inv.+:A
c
PARTICLE IN ELECTROMAGNETIC FIELDS 119
are n1ade, the resulting wave equ(j.tion is gauge invariant. This is what is
meant: l.,ct H(A,J be the Hamiltonian in one gauge so that
H(A ,.)1jJ = ili 01jJ
at
Then it will be true that for another gauge (cf. 4.4),
H(A ' ) ' . f; d1p'
Il 'If' = lrt--
ot
where
tp' = exp (- ie G/lie) 11'
(4.7)
Equation (4..7) is a unitary transformation. In the present connection it is
called a gauge transf.ormation of the second kind, and it is evident that the
gauge transformation of the first kind is equivalent to (4.7) and conse-
quently no physical results are altered.
Exactly the same replacements are now made in the covariant free
particle Dirac equations of motion. The justification for this follows.
1. The equations are still consistent with relativity requirements.
2. They are gauge invariant exactly as described ab9ve.
3. The classical equations of motion for particles in electromagnetic
fields are obtained in the appropriate limit. Also the non-relativistic
quantum limit is obtained, as one should expect.
We shall defer discussion of point 3 until later.
The new form of the equation of motion is no\v
[yDp(--e) + ko]1p = 0
(4.8)
where
a ie
Dp.( -e) = - + - AJl
ox". lie
For the hern1itian conjugate 1p* we have
D( - e }'I'*y Jt + ko 1p* = 0
(4.9)
and
X ( ) 0 ie ( )
Dk - e = - - - Ak = Dk e
oX k lie '
X ( ) () ie ( )
D4 -e = - - + - A4 = -D 4 e
ox.) lie
Therefore, for xactly the same reason that nlotivated us in discussing free
particle, the adjoint function
ip == 1p*')14
120
RELATIVISTIC ELECTRON THEORY
is introduced. Then for fJ the wave equation is
D p(e)'ipy II - ko1P = 0 (4.10)
The equations for the positron will be discussed-in the next section.
Since Dp. transforms under a Lorentz transformation exactly as iJ/ox p ,
that is, like a four-vector, the argument concerning the covariance of (4.8)
and (4.10) is precisely the same as for free particles. Therefore nothing
further need be said about point 1. '
For the gauge invariance we observe that, replacing Ap by A; and 1f by
. ei'Ltp, we have
[ 0 ie ( oG )] . i
YP - + - Ap. + - e l 1.1p + koe l1p
ax p ne ax p
. ( 0 ie ) i ie oG. . ox
= r e tX - + - A "P + koe x1p + - r - extp + iy e'x - 1p
p ax lie P lie Jl ox p ax
p P Il
The sum of the first two terms vanishes by virtue of (4.8). The sum of
the second two terms will vanish if
e
X = --G
lie .
(4.11 )
as in (4.7). This justifies the statement made in point 2. It will be
recognized that. in any bilinear or quadrilinear combination of wave
functions such as generally occurs in matrix elements the transformation
(4.7) multiplies the wave function combination by
exp (-ie G//tc)
where e is the sum of the charges in the initial states minus the sum of
the charges in the flnal state. Hence, since charge is conserved, this sum is
zero and the factor given above is unity.
In terms of (X and p the wave equation is
Htp = ili otp
at
where
H = ccx-ii + pmc' - e<1>
(4.12)
and
n = p + A
c
(4.13)
is the standard kinetic momentum operator.
Since A is real, tbe continuity equation holds with the same four-current
sp as for free particles. This is in contrast to the non-relativistic case
PARTICLE IN ELECTRO¥AGNETIC IELDS 121
where A occurs explicitly in j. Of course, the 1p is different so that 1p* fl.1p'
for example, has a different value now and will certainly depend on the
fields present.
It is of interest, however, to note that the fields appear explicitly when
the current is decomposed into constituent parts which can be interpreted
in a sinlple way.l In
$Il = (j c) Jl = iecipy 1l"P = liec( 1py 1l1p + .;pI' 1l1p)
we replace ip by kOl D,,(e)ipyv in one term and tp by _ki)l D,,( -e)'Y,,"P in the
other. Then (jc)", can be writte as a sum of two parts, one arising from
the a/ax" term in D,,(:l::e) and one from the field terms. Alternatively, we
separate the terms with f' = 11 from those with Il =1= 1'. Then
(j c)1l = j<:> + j1)
where
.(0) ieh oM JlV
) =-
Il 2m ax v
'(1) ien { ( oip ie A - ) - ( a + ie A ) }
1 11 =- --- 1l1Jl1p-1p - - JJ tp
2m aXil nc / aX/J lic
Here the ft =1= 11 terms give jO) and
M /JV = - M v 11 = ipy" y 1l'fP
The tensor .J.\1 IJ.V has space-space parts given by
M jk = - i€ ikl¥'(1Z"P
(4.14)
(4.15)
(4.16)
(416a)
and time-space parts given by
M;4 = - i ip(1../t/J (4. 16b)
so that jO) can be interpreted as the current density moment associated
with a magnetization (density of magnetic dipoles) and an electric polariza--
tion (density of electric dipole moment). The space ..part of the second
term has just the form of the non-relativistic Schrodinger current:
'(1 ) ien ( oip - otp ) + e2 A -
lIe = - - 1p - 1p - - k1Jl1J'
2m aX k aX k me
However, note that ip and not 1p* occurs here. For the non-relativistic
limit where {3 can be replaced by 1 the distinction is irrelevant. Of course,
in the frame of reference in which the electron is moving there is also a
. ,
tIme part
.(1) en ( Oip _ otp ) + ie 2 ,,,-_
14 = - - 1J' - tp - - '-V1jJtp
2me at ot me
( 4. 14a)
(4. 14b)
122
RELATIVISTIC ELECfRQN THEORY
For both jO) and jl) the continuity equation holds.
The question of constants of the motion of (4.12) will be deferred until
the study of specific fields is taken up.
Magnetic Moment of the E1ectron
It has already been stated that the Uhlenbeck-Goudsmit hypothesis'
involves the existence of a magnetic filoment of the electron given in terms
of an operator
en
fL=--S
me
Exactly this magnetic Inoment, it \vill now be shown, emerges from the
Dirac theory. 1he magnitude of the measured moment is then the
maximUITl expectation value of fL, which is predicted to be
en
Po = --
2n1c
To see this we construct the second-order wave equation by operating on
(4.8) with rp.DI(-e) - ko. Then we obtain
( Y Jl Y v D Il D v -- k)1p = 0 ( 4.17)
where j) = D('-e) 1'1he terms in 'Yp,YvDpD" are evaluated as follows:
y py"D Il Dv = 'y;D + iY Ilrv(D pDv -- D"D Il)
_ D2 ie r;
- k Ji. + 2 9:..-4- y IJYv JL ,tLV
" flC
1'hen (4.17) becomes
( D I'D p - + 'Y p'YlI F P\. ) ' "I' = 0
2/ic
(4.18)
The'first two terms give the Klein-Gordon equation with the replacement
of a/oxj.t by D;.t. The space part of DfJDp. is
1 1 ( e \2
D]cDk = - - 2i 2 = - -- if + - A )
/i2 ;"2 C
so that the farniliar non-relativistic kinetic energy operator results after
multiplication by -Ji2J2fn. The last term in (4.18) is the spin-dependent
part:
;e Y Y F . == _ _f!.. ( O-eYe .- ia...8 )
2 f.;. P" In r
flC fiC _
PARTICI,E IN ELECTROMAGNETIC FIELDS 123
1"0 interpret these results in terms of a coupling energy with the field,
the equation (4.18) is multiplied by -1i2/2m, as indicated above, so that
the spin-dependent interaction energy is
Hsp = ,.eh (4J - ia..4)
2mc
(4.19)
By de.linition the magnetic moment operator couples to the magnetic
field to give a contribution
- iL.
to Hsp. Therefore
en eii
iL= --0= ---s
2mc me
( 4.20)
as predicted.
The occurrence of tbe antihermitian electric field interaction in (4.19) is
puzzling until it is realized that (4.18), after being multiplied by -1i2/2m,
does not have the Hamiltonian form
H1p = ili 1p
at
and the operator on 'tp in (4.18) need nc,t be hermitian. If we replace
iJ2/ax by WA/1i 2 c 2 and W = E + mc 2 , then for
E me 2 ,
e 2 <1>2 <{ m 2 c 4 ,
e 2 A 2 m2c'4
as is appropriate in this limit, and using (4.2c'), we obtain a time-
independent Hamiltonian equation valid in the non..relativistic limit:
[ ;'2 e l
- 2m V2 - eel> + c A-p + Hsp JVJ nr = E1pnr (4.21)
A.gain, the non-hermitian term in Hap does not present a real difficulty
because of the approximate nature of this equationt The correct
H"amiltonian (4.12) is herlnitian. The non...relativistic limit "vil] be studied
further immediately belovv and the defect in the form (4.21), it will be
seen, can be remedied when the limiting process is performed more
systematically.
Foldy-Wouthuysen Transformation with External Fields 2
The limiting process considered in the preceding discussion is equivalent
to writing the Dirac equatiO!l as a pair of coupled equations in the large
and slnall components and then elirninating the small component to obtain
t This does not imply that an approximate Ha.miltonian cannot be hermitian. The
n1anner in which the approximation is made is the decisive pointo
124
RELATIVISTIC ELECTRON THEORY
a second-order equation for the large component. As was evident, this
procedure suffers from the defect of giving non-hermitian operators. It is
also inconvenient in that, when expectation values are to be calculated to
order V 2 /C 2 , the small components cannot be ignored. The FoIdy-
Wouthuysen transformation considered in section 18 remedies both these
defects and at the same time provides more physical insight into the
mechanism whereby the relativistic description of the electron operates.
The appearance of hermitian operators only is assured since we start with
a hermitian Hamiltonian and perform only unitary transformations.
In contrast to the free particle studied before, it will be seen that it is
impossible to eliminate all odd operators from the Dirac Hamiltonian in a
finite sequence of transformations. This is connected with the observation
that whereas for free particles a clean-cut separation of positive and
negative energy states is achieved, this is no longer the case when external
fields are present. If these fields are weakt compared to mc 2 , the FW
transformation should converge rapidly and' something of the nature of
an approximate separation should be achieved. Fortunately for electro-
magnetic fields this usually occurs. .
The ambiguities which arise when fields are present can be illustrated by
the following example. Consider a particle subject to an external static
potential <I> and write V = -e(I) (for electrons). Then the wave. equation
for a stationary state with energy W is
(W - «., - (3)tp = V1p
( 4.22)
or
where "p is time independent. Operate on (4.22) from the left with
W + a-, + p to obtain
(W 2 - p2 - l)tp = (W + (X-p + (J) Vv.'
= (t-(jiV) 1p + V(W + (X-' + fJ) "p
= «-(pV) 1p + V(2W - V) 1p
[V 2 + (W - V)2 - 1]1p = a:.(pV) 1p
( 4.23)
Consider the case of a square central well:
V=-J/O
V=O
r < ro
r> ro
Then the. right side of (4.23) gives a Dirac delta function at r == roe
However, if we consider r ::/=- r 0' the right side of (4.23) can be set equal to
t More precisely, the relative change of the interaction teJms in a Compton wave-
ltmgth and in a time interval of II/me" must be small compared to unity.
PARTICLE IN ELECTROMAGNETIC FIELDS 125
zero, this equation is readily solved in both regions, and 1p is made
continuoust at '0. Therefore we consider the equations
.
[V2 + (W - V)2 - 1]1p = 0..
r < ro
( 4.24a)
and
,
,
(VI + W 2 - 1)1p = 0 r > ro (4. 24b)
Although these equations are proper ones to use, it must be remembered
that it would be incorrect to calculate all four components of 1p indepen-
dently from (4.24). Instead (4.24a) and (4.24b) could be used to obtain 1p",
the large component say, and then 1pl obtained from .
1pl = (W - V + 1)-1 a.p1pu (4.25a)
or, alternatively, from 1pl we could obtain 1pu by
1pU = (W - V - 1)-1 a.pv i
(4.25b)
We see that for, > To we have free particle solutions, but it is not
assumed that these are necessarily momentum eigenfunctions. It is some-
what more appropriate to consider that they are angular momentum
eigenfunctions. These are studied in dtail in Chapter V, but the particular
form which they assume is not essential for the present discussion. In the
inside region (, < To) we may select, for any W, a solution regular at , = o.
This means that 1p*1p is integrable over any domain, including the origin.
For W2 < 1 the solutions of (4.24b) are clearly of exponential type and a
square integrable solution is obtained only if-the decreasing exponential
solutions (/"'OooJ exp - [1 - W2]!-t r ) are chosen. There will consequenty
exist a set of discrete states in the interval -1 < W <: 1, if it is assumed
that [(W V)2 - I]r is sufficiently large to permit at least one level.
However, when we consider W2 > I, in particular W < -1, we obtain
results which are in complete variance with expectations based on the
behavior of a non-relativistic particle. In the region, > '0 we now obtain
oscillatory solutions. At, = 00 these are not square integrable in the
sense of a bounded value of J tJ8X1p*1p, but they are acceptable solutions in
the sense that continuum soltions generaJly are. In general, linear
combinations of the oscillatory solutions regular and irregular at , = 0
will be used in the outside region, and at r = 00 these are standing waves.
With these linear combinations a perfectly valid solution of (4.24) is
obtained since the inside solution furnishes values of 1pu and 1p1 at , = '0;
and (4.25a) with (4.25b) provides values of 1pu and 1p& at all points, > '0
once the starting values are specified.. Of course, here we set V = 0 in
both equations (4.25). As a consequence we find that a particle can have
deep lying negative energy tates which permit a "tunneling through" to
t A3 required by the postulate of a probability dnsity.
126
RELATIVIsrrIC ELECTRON THEORY
infinity in a region of classically non-allowed motion. This is, in fact, an
understatelnent since in the region r > To there is no exponential damping,
as "tunneling" usually implies.
The situation described here is an example of the so-called Klein paradox
which is a paradox only if we insist on an interpretation in which the wave
functions are supposed to describe particles of definite sign of the mass.
Instead, it is necessary to reject the customary intuitive notions connected
with a non-relativistic description. In the presence of very strong fields
the usefulness of a description in terms of positive and negative energy
states is seriously impaired.
Returning to the problem of the FW transformation, the Hamiltonian
is written in the form
H = fJm + fie + 0 0
(4.26)
where Qe is an even operator and 00 is odd. These shall be assumed time
independent. The rest mass term fJm is considered dominant and it is
desired to transform H to a new Hamiltonian in which the odd tenns are
of a given order in 11m. We shall successively transform H so that the
resulting Hamiltonian contains odd operators of order 11m, then lim 2 ,
and finally 11m 3 . The general prescription is to choose U in
H' = eiuHe- iU
(4.27)
to be
..
i
U = - - pOo
2m
When this is done, H' contains odd terms with a factor 11m or higher
powers of 11m. If these are substituted for 0 0 in (4.28) and a second unitary
transformation is carried out, the resulting Hamiltonian H" contains
odd-order terms with a factor 11m 2 or higher order in 11m. At each stage,
if the odd terms which are of order 11m" or higher are dropped, the
resulting Hamiltonian is correct to order 1 1m".
From (4.27) we can write
(4.28)
«J 1
H' = I - Tn
n=O n!
(4.29a)
where To = Hand
Tn = (iU, T n - l ) (4.29b)
defines all other Tn' n > 1. For the leading term in Tl we have
(iU pm) = !«(:Jo.o, {J)
--
Since fJ anticommutes with all odd operators, this is
(iU, pm) = ....:.no
PARTICLE IN ELECTROMAGNETIC FIELDS 127
which will cancel the 00 in To = H. The remaining terms are: first,
(iU.O.) = (po.o. 0.) = 1- (0 0 , 0..)
2m 2m
\\.hich is odd, and we have used the fact that f1 commutes with Qe; a
second term is
(iU. 0. 0 ) = ..!.. (pD,o. 0. 0 ) = 1. po.
2m m
and is even. There is one additional term arising from T which contributes
to order 11m. This is the term of T 2 arising fro111 the commutator of iU
and the dominant term of TI. With the numerical factor i going with T 2
the relavant contribution is
I(iU. -0. 0 ) = - PQ
Hence, to order 11m the Hamiltonian is
H' = pm + Q. + J.... pO: + J!- (0 0 , 0.)
2m 2m
(4.30)
If it is desired to obta;n the HamiJtonian correct to order l/ln, then we
carry out the same transformation but with U replaced by U', where
u' = - {3 L ( 0 Q )
2 ') fJ' 6
m .....m
i
= - -i (00' 11)
4m
(4.31)
Then
H'" = eiU'H'e- iU '
is written. in the form (4.29a)
"all == 1.- T '
.n k n
n==O n!
where T = H' and
T f ( ' . U ' 1 ' )
n = l , ." -1 ,
n > 1
The mass ternl gives
(iU', pm) = _.!... «0. 0 , 0.). fJ) = -- - p(no. Q.)
41n 2m
whih cancels the last term qf (430). l'teinaining tern1S are of order t 1m?.
Therefore the first three terms of (4.30) give the correct result to the
desired order.
128
RELATIVISTIC ELECfRON THEORY
If it is desired to. obtain the Hamiltonian in which odd-order terms are
of order 11m 3 , the preceding transformation which led to (4.30) must be
carried further to give terms of order 11m 2 . Then we must add the following
terms from i T 2 :
! (iU, (iU, OJ) = <Pa., fJ(D.o, D.J)
2 8m
1
= - -. (0 0 , (no, fie))
8m
and
! (iU, (iU, D.o» = (fJD.(), PD.:) = - D.:
2 4m 2m
and a term from Ts/6:
! (iU, (iU, -D.()) = - 1 2 (pD.(), PD.:) = 0.:
6 12m 6m
which involves the commutator of iU and the dominant term of T 2 . Then,
to order 11m 2 , we obtain
H' = pm + Oe + ..!.- pO: + L (0 0 , 0e)
2m 2m
- (0 0 , (0 0 , Oe» - 0:
8m 3m
. (4.32)
Repeating the same process gives
H " iU' H ' - iU'
= e e
with U' now given by
U' = - fJ[ L ( Q Q ) - Q3 J
2m 2m 0' e 3m 2 0
(4.33)
The commutator (iU', pm) gives a contribution from the first term of (4.33)
which cancels the fourth term of (4.32), and a contribution from the
second term of (4.33) which cancels the last of (4.32). Then, in addition
we obtain the ,following m- 2 contribution to H": ,f
(iU', 0e) = «0 0 , OJ, 0e)
4m
which is odd. All other terms are of order m- 3 . Each succeeding term in
the expansion of H" now gives a factor m- 2 , since this is the nt-dependence
of the dominant term of (4.33). Since tJ1e term (4.33/) can be removed by
(4.33')
PARTICLE IN ELECTROMAGNETIC FIELDS 129
another unitary transformation withoJt changing the m- 2 terlns, it follows
that the Hamiltonian to second order is
H n.. = fJm + 0. + J.- fJD, - -; (no. (no. D,.»
2m 811';""
(4.34)
This result is now applied to the electron in an electromagnetic field.
Then
0 0 = a.(p + eA)
Qe = -e<I>
A straightforward calculation gives
Hn.. = fJm - e<l> + L ( + eA)2 + ..!!- fJa.;Ye
2m 2m
+ a.8 X (p + eA) + divS
4m 8m
It can be checked that all terms in H nf are hermitian. For positive energies
fJ should be set equal to 1. Then, in ordinary units, and with {3 = 1
H nf" = mc 2 - eel> + l-. ( ii + .: A ) 2 + a.;Ye
2m c. 2mc
(4.35)
+ e a.S X ( Ii + A ) + en 2 _ div B
4,n u c 2 c 8m 2 c 2
(4.35')
The first three terms have an obvious interpretation. Then the magnetic
interaction of the field ;Ye with magnetic moment fL, given by (4.20), can
be recognized' in the fourth term. The fifth term gives the spin-orbit
coupling interaction. Finally, the last terln, the so-called Darwin term,3
gives a relativistic shift to s-levels for a Coulomb field. This follows since
div 8 = 41TPc = -41Tec5(r), and in the present approximation it is proper
to use non-relativistic wave functions for which only s-states have 1p(O) ==F O.
A simple way of interpreting this term is to recall that the electron motion
is characterized by an oscillatory component which was referred to as the
Zitterbe\\'-egung. If its coordinate is written r + Llr, where r is the
oscillatory part, the potential <I> at the position of the electron is
4t(r + r) = [1 + Llr.V + l(lir.V)2 + · · .]<I>(r)
The relevant -quantity is a time average of this. Thus, for the interaction
energy, we obtain
-e<D(r) - «(8r.V)2) Av <l> = -e$(r) - e (Ilr)i v V2<1>
2 6
130
RELATIVISTIC ELEC1RON l'HEOR'Y
Hence the additional energy is
e ( A ) 2 d ' n..
'6 Ur A v 1 V G
In this interpretation we would set (cf. 4.35')
3 ( Ii ) 2
(llr)lv = - -
4 nlC
which is exactly the result obtained in section 18..
2.1. SPIN EFFECTS IN ELECTRIC AND t[AGNETIC FIELDS
Polarization Effects and Covariant Spin Operator
As an application of the results of the preceding section we first consider
the behavior of a spinning electron in electric and magnetic fields. 4 ,5 FrQln
dQ = !. ( 1-1" Q )
dt Ii ' -
where oil/ot = 0 and HO is given by (4.12) we observe that with Q =: 0-1t,
d ie ( Ii..... ie -. m
dt a-"(C = ..- -h 'V, (J.1t) :;:::: Ii O"{P, 'V)
= -. eo-I:
(4.36)
where 8 = - V<I> is the static electric field. Therefore in a pure magnetic
field a.7t is a constant of the motion. From tIlls result it may be concluded
that a longitudulally polarized electron will remain longitudinally polarized
after passing through a static magnetic field. A second conclusion is that
a polarized beam of electrons will not be depolarized on passing through
a magnetic field if no electric field is present.
For a pure static electric field we consider n = 1t. Then
d1t
- = -elf
dt
Combining this result with (4.36) leads to the conclusion
da
rc. - = 0
dt
Here 1t = p. Therefore a beam of electrons (or p, D1esons) which is
originally polarized along the direction of the momentum-that is,
PARTICIE IN ELE(;TROMAGNETIC FJEI..DS 131
longitudinally--will rernain longitudinally polarized in passing through an
electric field whj<.h docs not deflect them.
1""'he justification for the int.erpretation of the results given above is
ultimately to be based on an appropriate definition of the spin operator in
the presence of fields This question was discussed for free particles in
section 20. It was shown there that a description of the spin st.ates could
be based on the operator
(9-60 -: 80-[0 + (fJ - l)p X (0 )( p)]
and despite the non-covariant appearance of this operator it is equivalent
to the manifestly covariant
Q(n) = iY6Y I4 n Jl
\vhere nJA is the four-vector into which ii", = Il Oj 0 transforms. This
description is based on the single-vector parameter 6 0 which gives the spin
direction in the rest system. J-lo'Never, when fields are present the spin
direction is no longer a constant of the motion. Instead, from classical
considerations, one expects a precession in a pure nlagnetic field, for
example. Therefore lV-Oo or Q(n) no longer provides a suitable description.
Another ,vay to say this is that lVftu does not lend itself to the gauge
invariant generalization jJ p. -+ 17 p = PI' + eA i which must be made when
electrolnagnetic fields are present.
TIle required operator is obtained by noting that for free particles
Q(n) = Tpn1J
vheret
TJl =: YG( iy jl - P Il)
since n/-lp/-l = o. For free particles it follows that
(TIl II) = 0
where we use iy!.lpp. = -1 as an operator relation for the relevant states.
Also,
1p /.L = P p.'T Il = 0
while the commutation rules of the T are
(Tp., Tv) = 2[/IlY -- t5 pv + i(?JIlPv - P,uYlI)]
(Tp., T v )+ = 2(b 1lv + PIlPv)
t These operators were fi.rst jntrcduced by 'V. Bargmann and £. P. Wigner, Proc.
Natl. A cad. Set. U.S. 34, 211 (1948). 'Their w!.l is IT;l> The 1,.l are generators of a
subgroup of the Lorentz transfoI1nations. i'hey were called to my attention by D. M.
Fradkin of Iowa State University.
In the rest frame T p- = T, T 4. 0', 0, where odd operators are repla.ced by zero.
132
RELATIVISTIC ELECTRON THEORY
In the ,presence of an electromagnetic field with four-potential A# the
Tp is defined by _ C . _ -+ )
& - 1'5 ty Jl. 1T Jl
The commutation rules with the Hamiltonian are now, for time-
independent fields,
(T, H) = - ie(a )( tYe - Y( 8 )
(T4' H) = ea-8
and, expJicitly, T4 = ia.n. Consequently, when\<- 8 = 0, r4 is a constant
of the motion as is the component of T along:¥e. When:¥e = 0 the
components of T perpendicular to {I are constants of the motion.
In general, <t!'
dTp. . F
- = te/'5/'4 p.v/'v
d'T
where Fp." is the electromagnetic field tensor introduced in section 22 and
dT = dt/E is the proper time interval. For slowly varying fields in which
the relative change of the fields pver the dimensions of a wave packet 1Jl
are negligible,
!{. (T p ) = -eF ,.. f 1p*i/'4'Y6/,y1J1 d3X
dT
In the rest system of the particle (1'4 1 in even operators) this is
d(T) _ F (T T )
e p.v Y
dT '
where r refer& to rest system and the gyromagnetic ratio eJmc here appears
as e. Here we use (TT) = i/,sY, (T) O. Then since Fjk = EijkJl'i we
see that
d(T") -e(T') )( :Ye
d'T
which is the classical equation of motion for the spin vector. Under a
Lorentz transformation to an arbitrary coordinate system., (T;) = a#,,(T:>
and
d(T) = -eF ( T' )
dT /.IV v
Virial Tneorem 6
As a second example we consider an entire'!y different question. The
virial theorem in physics has a very general significance. What form does
it take in the Dirac theory? In classical physics the form of this theorem is
- (r.F)Av = (T + LO)Av
(4.37)
PARTICLE IN EIJECfROMAGNETIC FIELDS 133
where (. · .)Av indicates a time average; F is the force, Tthe kinetic energy,
and Lo the Lagrangian for a free particle. Thus, in classical relativistic
mechanics, 1
T" = m 2 c 4 + c 2 n 2 ,
= (1 - v 2 fc")-1A
The corresponding quantum form is obtained by first observing that
,
d < d Ii )
- (p.r - r.p) = - 3 - = 0
dt dt i
Lo = -mc 2 /E
(4.38)
Angular brackets mean expectation values. Also,
d i
dt por = n [po(H, r) - (p, H)-r] (4.38')
Since (H, r) = CCI and (p, H) = ea:.(p, A) - e@, 4» = e@CI-A - pfb), it
follows that (H, r) commutes with is and @, H) commutes with r. Therefore
:t rop = [ro(H, p) - (r, H)' J = - :t por
Thus it follows from (4.38) and (4.38') that
<:/ op) = 0
(4.39)
and
(p.(H, r» = «ji, H)-r)
Applying this to (4.12) results In
C(CI.p) = -e(r.Vc1> - r.V (I-A) (4.40)
Since r and A commute, we can replace p by ft in (4.38). Hence
_ ! / d roA \ = (a.oA) + (ro(a.oV)A) = 0 (4.41)
c '\dt /
The quantity V CI-A in (4.40) is readily evaluated since the components of
(I are constants in the differentiation and
VCI.A = (X X curl A + (CI.V)A
Thus, for example, ..
(J ( A A A ) ( OAlI GAg: ) ( OA GAs )
;- «'z + ex. 11 + oc. s == <Xv -;- - _ a - tX s -;- ,- -;-
u X (J X Y I u1, uX
+ or. cA", + or. cA.. + or. cA..
ax 11 ay Z oz
134
RELATIVISTIC LECTRON THEORY
Substituting in (4.40) and using (4..41) results in.
c(<<-7t) = -- (r-F)
(4.42)
where
F = e[?f.J.) -- ot X curl A J (4.42')
This is just the Lorentz force when we replace the velocity by the operator
Cot
This result can be used to calculate 111atrix elements in a very simple
way. Since (4.42) can be written in the form
--(r.F) = (W - pmc 2 + e<I»
we find, for no ma.gnetic field,
c(a.-p) = (W + e .J.... (3rnc 2 )
For 4 = O this becolnes the free particle case and
c(ot.p) = W -- mc 2 (f3)
where "V = Po = (p2 -f- 1), and since <(J) = lnc 2 /W we obtain
.cp2
( a.;t > = -.:.....
\ l' J"V
which checks \vith the result < C() = cpJ lift.
r\n alternative method of rapid calculatio:n of matrix elements is now
illustrated. Consider t.hat the magnetic field is absent T'hen
fJl-I + HfJ = 2(mc 2 - pert.)
T11erefore
(fJ(Pa + e<I») = me 2
(4.43)
From this it can be seen that, at least in. the non..relativistic limit, (J _. 1,
the eigenvalues Ware less tl1an l'f!c2. for an attractive potential, eq> > o.
rhis, as should be expected, 1Adll be true in general. It would foI1o"w then,
that the upper con1ponents, for which fJ = 1, contribute a greater amount
to the average potential than do tIle lower cornponents for which f3 == -1.
24* CHARGE CONJIJGATION8,9
In section 17 it was seen that a plane,. wave for an electron 1p ,\\ras trans....
formed into a positron plane wave "Pc by the charge conjugation operation
'Ill' = C- 1 1px
(4$44)
PARTIC'LE IN ELECfROMAGNE'TIC FIELDS 135
We "\fish to show that there is a charge conjugation transformation in the
general representation and to determine the properties which the matrix C
will exhibit in this general case" For'tp we write
( a ;e ' )
Y P - -1- - A.u 1J' + kofJJ = 0
aXil ne I
(4.45)
and substitute (4.44) after taking the complex conjugate of this equation..
Then 'we obtain
X ( (7 ieAk )c , C X ( G ieA 4 )c C + 1 ..., C 0
Yk --- - -- tp + 1'4 - -- + -:- 1p .oC".p:=
eX k lie , GX 4 he
Multiplying by C.-l from the left, we see that if
C-1y:C = fk
C I - 1 ' y Xf1 = ._-v
4- ." I 4
( 4.46)
then we obtain for 1pc
( a ie A ) ere 0
y p. - - - II 1p + "o1p' =
oXp. he /
"
(4.47)
Since ts is just (4.45) with the sign of e reversed, we may interpret 1jJc as
the positron wave function.
The fact that charge conjugation involves complex. conjugation in any
representation can be seen to be a consequence of the requirements of
gauge invariance. The charge conjugation is therefore a non-linear
operation: .
[ a i tp i ]" * t aitp
for all constants ai. Instead
[ai'PiJ = aftp:
and the charge conjugation 0Eerator is antilinear.
Since 'Y and -Y obey the commutation rules of the YP-' it is established
by the fundamental theorem of section 13 that C exists. We introduce a
matrix B such that
C = BY4J'5
(4.48)
and B has the property that
It,j = "a/ X = .B' y B-1
rp III t').
(4 49)
136
RELArIVISTIC ELECTRON THEOR Y
Then both equations (4.46) are fulfilled. The matrix B is now shown to be
antisymmetric. The transpose of (4.49) is .
I
- »-1:: B - n-1B B-1B
i'" - 'YIA - 'Yp.
= (B- 1 B)-11'IA B - 1B .
..
Therefore B-1! commutes with all 'YP ana must be a multiple of a unit
matrix.
B-1n = k
ot
B= kB
Transposing gives
B = kB = k 2 B
so that k = :i: 1. To show that k = - J, consider the transpose of iBy I' Y v
where Jlt =1= ". This is .
(iBy,,/,v)- = ikjiyY IJB = - iky Ilr y B = - ikjl pBy y = - k(iBy Pyy)
Also, since
v - v X - Y x,,\/xvxv x - B v B -1
r5 - r6 - 1 r2 r3 r4 - r5
we obtain
(iBYpY5)- = ikY6Yl'B = ikYsByp. = ikBysYp = -k(iByp.Ys)
.
The choice k = 1 implies the existence of ten linearly independent anti-
symmetric matrices. Since there can only be six antisymmetric 4 by 4
matrices we conclude that k = -1. Thus
B= -B
( 4.50)
With this result the remaining five matrices By P and BY5 are anti-
symmetric and with (4.50) we find BrA constitutes six antisymmetric and
ten symmetric matrices.
It can be further shown that B may be chosen to be unitary: Thus the
hermitian conjugate of (4.49) is
y = (B-1)*YIlB*
so that, with (4.49), this yields
B*Byp = 'YB*B
Consequently B* B, commuting with all l' Il' is a multiple (k) of. a unit
matrix. Since the diagonal elements of B* B are necessarily positive
definite, k > 0 and k may be set equal to 1 since (4.49) does not define k.
PARTICLE IN ELECTROMAGNETIC FIELDS 137
These properties of B may now be used to establish some properties of C.
From (4.48) we see that
C = Y6Y4B = -Y5Y4 B = -BYSY4 = BY4YS = C
Thus C is symmetric. It is also easy to show that C is unitary if B is. Then
C.C = (BY4YS)*(BY4YS) = YSY4 B *BY4Y5 = 1
It is now seen that charge conjugation does not change the norm of 1p:
.
C'f/l', 1pC) = (C- l 1px, C- l 1pX) = (1px, CC- 1 1pX) = (1p, tp)x
and (1p,1p) is real. The same is therefore tru for any linear combination
2,i a i1pi with complex coefficients ai. This result is to be expected in view
of the interpretation of 1pc as the positron state. .
Finally, the charge conjugation property is reciprocal. This means that
the charge conjugate of tpc is 'P:
<''PC)C = C-1(V'C)X = C-1C-1Xtp
But C-l is C* and (C-l)X is (7*x = t; = C. Thus C-IC-IX = C*C = 1.
This proves the theorem. .f'J
As before, any operator equation
01p = w1p
where ()) is a number, on charge conjugation becomes
O/tpc = ())xtpc
where
QC = C-1QXC
(4.51)
and, for hermitian 11, ()) = ())X.
The correspondence between positive energy positron" states and
negative energy electron states should not be a property of a particular
Lorentz frame but should be independent of which inertial system is used.
This means that charge conjugation should be covariant. With 1px = C1pc
under a Lorentz transformation, 1p(x) -+ 1p'(x'); then if .
1p'(x') = A1p(x) .
. we should also have
tpC'(x') = A1pC(x)
This implies that
1p'C(z') = C- l 1p'X(x.') = C-1Ax 1pX(x)
"should be equal to
1pc'(x') = A1pC(x) = AC- l 1pX(x)
138
RELATIVISTIC ELECTROi 11Hf20RY
Therefore we require that
["f-1 A X ._ 1\ ,"""-1
v -- .t 1\....
Of, equivalently,
A Xc = CA
This constitutes an additional condition on A. For instance, for
( 4.52)
A _ J'j)'k8
1l. -- e
(j =1= k)
and
c = Y2
in the standard representation, the condition (4.52) Ieduces,to
Yr;;Y2 = Y2Y:irk
Since in this representation 1'2 is rea] while Yl and 1'3 are pure imaginary, it
is seen that the condition is indeed satisfied. For a Lorentz transformation
with uniform velocity the A used in section 14 will satisfy (4.52) if
<X:Y2 = Y2k
Again «2 is pure imaginary and anticolnlTlutes with -1'2 while (Xl and 3 are
real and commute with 12* Hence the condition is again satisfied.
It will be realized that the relations
c
G k = -Cl k ,
c_
a'k ,-- fX k
and the like are unchanged under a change of representation. Also
{ca.(p + ;A) + pmc 2 - eCI>}" = -{ca.(p - A) + pmc 2 + eCI>}
The minus sign in front of the curly bracket on the right is cancelled in
the equation of motion by another minus sign arising fronl charge
conjugation of the operator iJi %t.
It is important to recognize that any particular Lorentz transformation
can al\vays be replaced by two (or any nurnber of) other l.;orentz trans-
formations and conversely. Therefore if (4.52) is 'valid for two trans-
formations Ll and L 2 with corresponding Al and A 2 it rnust also be true
for the Lorentz transfoflnation obtained by applying them in succession.
That this is so is seen at once. If
J\i<c = CA 1
A:C = Cj\..!
then
(i\2Al)XC = AAC = l\:CA r
= Cl\.2 A l
PARTICLE IN ELECfROMAGNETIC FIELDS 139
It follows that the property (4.52) is preserved through any number of
Lorentz transformations. Improper transformations are discussed in the
next section, and it will be seen that (4.52) still applies.
25. SPACE AND TIME REFLECTION
In this section it is our purpose to investigate the transformation
properties of the Dirac equations of rnotion under the improper Lorentz
transfgrmations
,
x k = -X k ,
, -
X 4 - X 4
(4.53)
which is the space inversion of coordinates, and
x = x k ,
,
X 4 = - x 4
( 4.54)
or time reversal. In each case det a = -- 1. The case of reflection in a
plane, say x = -xl' x = xp. for !1- ::i= 1, is included in (4.53) since the
complete space reflection followed or preceded by a rotation around the
X 1 - or x-axis through an angle 17 reproduces the reflection in the X 2 -X a
plane. Clearly, Lorentz transformations of the type (4.53) and (4.54)
commute with all the three-space rotations although not with the general
continuous Lorentz transformation.
In the following discussion we shall trace the arguments concerning th"e
space and time inversion in classical and in non-relativistic quantum
mechanics, and this will shed considerable light on tlle discussion of the
corresponding problem with the Dirac equation.
Space Reflection
If we consider a charged particle in an electromagnetic field, then the
equations of motion
rn!!:. v · 1 = F = .-e(8 .+ v X Jfejc)
dt (1 - v 2 fc 2 )IA
with v. = dxfdt are unchanged under the space reflection (4.54) provided
that
v' = --v
'(X')= (x)
$/(3'/) = -4(x)
where the charge -e is an invariant. Here x'( = -x) on the left refers to
.the san1e point in space as x does on the right.. Therefore at a given point,
( 4.55)
140
RELATIVISTIC ELECTRON THEORY
described by different coordinates in the two reference frames, does
not change (axial vector) and " does (polar vector).
The transformations (4.55) are in accord with the deductions from the
form invariance of the Maxwell equations. 1"hus
«:)
-e = J J J Pc(x) cJ3x
-00
is transformed to
«:)
-e" = J J J Pc(x) cJ3x'
-ex;)
since the three sign changes in going from d 3 z. to d 3 x' compensate the
three sign changes required to interchange the limits in the integrals. Since
00
-e' = -e = J J J p(x') cJ3x'
-00
we conclude that Po is a scalar. From (4.1c) 8 is a polar vector and from
(4.1b) is an axial vector. Then from (4.1a) we conclude
j'(z') = -J(x)
so that, as expected, e continuity equation is
oS' (x')
II- = 0
ox'
p.
unchanged in form. From (4.2b) we conclude A(x') = -Arc(x), and from
(4.2a) <J>'(x') = (x). Hence it is still true that
A (x') == a ItVAy{ x)
In a quantum theory the space reflection requires
p' = -p, l' ::::: I
and since reflections commute with rotations it follows that
(4.56)
J' == J
for all angular momentum operators. Then the commutation rules
( :c l' %k) == (jJ;, Pk) == 0
(jJ i'l X k ) = - ind it
JxJ==iJ
PARTICLE IN ELECfROMAGNETIC FIELDS 141
are all unchanged by the space reflection. It follo'Ys then, from the known
properties of transformation theory, that there exists a unitary trans-
formation A such that
1p'(x') = A1p(x)
The non-relativistic equation
{ J... ( p + A ) I - eC1> } tp(X) = ili otp(x)
2m e at
(4.57)
goes over into the corresponding primed equations with A equal to any
operator which commutes with the Hamiltonian in curly brackets. ince a
second application of A gives the same coordinate system with which one
.
started, A2 = 1 and the eigenvalues are :f:: I. These are the well-known
even, odd parity states. Thus
1p'(X') = :f:1p(x)
implies that for every 1p(x) there is a functiqn 1p( -x) which is also an
eigenfunction of the energy operator with the same eigenvalue.
We may now consider the space reflection in the Dirac theory. Writing
(4..8) in the primed coordinate system, for example,
YJj ( -.!. + ie A(X') ) 1p'(x') + k o 1p'(x') = 0 (4.8')
ax lie
becomes
A -1, iJ E Il ( .3.... + ie A,,(x) ) Atp(x) + k o 1p(x) = 0
oXJj lie
.
where E"k = -1, £4 = -t-1. Consequently,
A -lYJlA = a pv ""
as before and, in detailed forln, this is
(4.58)
A -ly = -Yk
(4.58' )
A- 1 1'4.L\ = Y4
From the second of these equations A nlust be a linear combination of
Y4' 1'1Y2' Y2r3, Y3Y1, 'Y11'5' )'21'5, and 1'31'0" However, of these only 1'4 has
the property that it anticommutes ,vith aJl Yk. Hence
A = iY4 (4.59)
The choice of phase in (4.59) is arbitrary so far as (4.58) is concerned,
but the factor j is inserted so that the relation
A Xc = CA
142
RELATIVISTIC ELECTRON THEORY
is fulfiJled. In this way tb.e space-reversed positron is the charge conjugate
of the space-reversed eJectron. The fact that A "'" Y 4 is hardly surprising.
in view of the remark made above that y, times space inversion commutes
with the Dirac Hamiltonian and the additional circumstance that
'Y ( -!. + A,, ) + ko = Y4 ( H - ili 2- )
oX p lie Ii at
with H given by (4.12).
It should be emphasized that the covariance of the Dirac equation under
space inversion is in no way at variance with the breakdown of parity
conservation in beta decay. This phenOITlenOn is a Inanifestation of the
properties of the beta interaction. Whether or not parity conservation is
required for an electron or l')ositron in an electromagnetic field is a matter
for experiment alone to decide) and present data are entirely in agreement
\vith the position that the electromagnetic interaction is parity-conserving.
When one deals \/Vith a neutral particle like the neutrino (see Chapter VII)
there will be no a priori reason for insisting on a parity-conserving
Hamiltonian.
Time Reflection
Considering first the classical probJem of the motion of a charged
particl 1"1 an electromagnetic fietd we find from the IVlaxwell equations
that under time reflection
A(x') = '-'avAv(x) (4.60)
This is seen by noting that, with e a sca1ar,
;
Pc = Pc
but
j = -jc
as is required to preserve the forIn of the continuity equation Then from
(4. Ie)
8' =8
and from (4.tb)
- (
:Ye ' = -:Ye
Thus Eq. (4.1a) is fulfilled in the prirned fields and current density, with
primed coordinates (x). Consequently, froIn (4.2b), A' = -A and from
(4.Za), (1)' = $. This gives (4,,60); the sign change as compared to (4.56)
will be seen to have profound consequences.
For the classical orbits we see t.hat, v1'ith
v' = -v
PAR'fICLE IN ELECTROMAGNE11C FIELDS 143
the Lorentz force and. therefore the equations of motion do not change
under time inversion. This means that, if an orbit exists in which a particle
goes from A to B, . with momentum PA at t = 0 and PB at time t, then
another orbit exists in which the particle retraces its path and goes fconl
B to A with momenta -PlJ at Band -P-A at A. This is, of course, the
original orbit run backwards in thne as in a reversed motion picture.
Turning to the spin-independent non-relativistic theory, we see that if
1p/ (x', t/) satisfies (4.57) with primed variables, then
{ 1 ( it e A) 2 Jh ) ' ( ' ' ) . OJ:. 01p'(x/, t')
- p - - - tN:' 1p x, t = -In
2m c at
with t' = - t. Taking the cotnp]ex conjugate of this equation converts it
to (4.57). Therefore
tp'(x', t f ) = 1pX(x, t)
(4.61)
;expresses the time reversal properties in this case. 1ne occurrence of the
antilinear cornplex corugation operation could have been foreseen by
noting that none of the commutation rules is changed by time inversion
but the operator equation
d11 i
.- = ..-(H,Q)
dt Ii
is,-changed to the extent of replacing i ",ith - i, which is just the effect of
complex conjugation. This, of course, does not occur with space reflection.
"fhe n.ext step l[) the consideration of the Pauli equation. This is written
in the forIn .
[ -j a r(
H nr + .a()(J. J '!pet) = in )
where H'nr is the Hamiltonian in (4e57).. Writing this in the primed system
gIves
, I ." i!' ' ( " at; 81//( - t)
H",.1p (-t) - PiP"""" If -./} = --.In 3 -
Gt
In H". the vctor potential A' = -A occurs. We again take the complex
conjugate and set
1p/X( -t) = A x''P(t)
where i\. is a linear operator.. 'We remeniber that H = Hut" and this gives
A -lxfl nt.Ax 'p(t) - p.oA -1 X aX..?f(' AX'fJ(t) = in .?)
144
RELATIVISTIC ELECTRON 1'HP,ORY
Therefore it is required that
A -lxaxAx = -0
( 4.62a)
and
A-1XHn,AX = Hnr (4.62b)
It is clear from (4.62a) that j\ must be a 2 by 2 nlatrix in spin space. This
would commute with Hnr, which is a unit matrix in this space and so
(4.62b) is satisfied. Writing (4.62a) in the fonn
aA = -Ao x
we see that, in the standard representation of the Pauli matrices where <1 2
is pure imaginary and at, as are real, A is proportional to (]2- We choose a
phase consistent with A Xc = C/\.. and write
A = i<1 2
(4.63)
which is real. Of course, (4.63) will hold under a unitary transformation
also. The final result for a Pauli electron is then
1p'(X, t') = ia 2 1pX(x, t)
(4.64)
'The persistent appearance of the antilinear complex conjugation is to be
expecte:;p and will, of course, appear in the Dirac formalism as well.
Proceeding as before, we write (4.8') for the Dirac particle and insert
A& = -apvA"
to obtain
{ y € ( - ie All ) + ko } 1J"(x, t') = 0 t
II Pox lie
Il
where Efe = 1, E4 = -1. If there were no field present it would be possible
to write a linear relation
1p'(x,t') = A 1p(x, t)
( 4.65)
with
IJ,/l = AYk
Y4 A = -AY4
( 4.66)
with the solution
A = Yl"2Ya = 1'51'4
( 4.67)
However, this choice would not restore (4.8) when Ap "* 0 since the sign
of the charge would be reversed. The time-reversed solution (4.65) would
then correspond to opposite charge! To remedy this situation we need
the complex conjugation operation. Therefore we write
tp'X(x, t ' ) = CAtp(x, t)
( 4.68)
PARTICLE IN ELECTROMAGNETIC FIELDS 145
,.
The charge conjugation matrix C is inserted for convenience. Noting that
A == EIlAp (no sum on p,) and that (a/axx == E",(a/aX) (again no sum
on p,), we obtain (4.8) provided that
(CA)-lyCA == I'll
or
A-IC-lyCA == YP
From (4.46) it is seen that this reduces to
A -ly == i'k
A -11'4A == -I', (4.66')
so that A is the same as in (4.65) and the special solution (4.67) applies.
It is therefore still true that
A-II' pA == apv'Yv
and (4.53) gives apt' = Ep. <5",v (no sum 011 p,). Of course, A now plays an
entirely different role in the Lorentz transformation, as compared to
transformations with Q44 1.
In order that time reversal and charge conjugation commute we require
that
(,,c)' = (1p')C
The left-hand side is
CXAxtpcx.= CXAX(C-1VJX)X .=..CXAXC-1Xtp
The right-hand side is
C- 1 (C X Ax1J'X)X = C-1CAtp == A1p
Equating the operators and taking the complex conjugate gives
CAe -1== A x
or
CA = A xC
which is just (4.52). Thus, with (4.52) ,fulfilled, charge conjugation is
covariant under all Lorentz transform_ations. The solution A = 1'51'4 =
"1"21'3 does indeed fulfill this relation with C:: )'S" In the standard
representation A = ;'1)'2"8 is real, and this is just the condition for (4.52)
to be correct. From the definition of B given in (4.48) it is seen that (4.68)
is alternatively written
1p'X(x, t') = B1p(x, t)
With C == 1'2 the matrix B is
B := Cr'''4 == 7'."51', == -Yl1'3 = ieT"
(4.68')
(4.68 / ')
146
RELATIVISTIC ELECTRON THEORY
so that (4.68') reduces to (4.64), the t.ransformation equation of the. Pauli
electron. In the present context, however, (12 is a 4 by 4 nlatrix. Thus
both the large and the small components transform Jike the Pauli functions.
It is a consequence of (4.52) that if a seq uence of Lorentz transformations
is carried out the transformation matrix 1\. is the product of the A-matrices
for the individual transformations. This is valid for all types of trans..
formations. For example, consider a time reflectipn carried out first. Then
x = b JJvxv
1p'X(x,) = CA b 1p(x)
bpvYv == Ably p.Ab
Then, if this is followed by a Lorentz transformation which does not
involve time reflection, we have
" ,
X A = a;.pxJ1.
'fJ'"( x") = Aa 1p'( x')
a API' J.l = Al Y A,Aa
The net transformation gives
and
" b
x A . = a).J.I. fJ.yXy = C A.yX V
where
c;'vYv = a;'pb pv ". = aAIlA;lYpAb
= A;lA;ly;.Aa 1 \b
= (AaAb)-lYkAaAb
'li == A;lY;.Ac
Ac = AaAb
Also
tp"(x") = AaCxAtpX(x)
or
1p"X(X") = A:CA b 1p(x)
= CA c 1p(x)
Any matrix compounded from continuous space and tin1e reflection
matrices will also satisfY (4.52).10 If
A:C = CA a
and
A_C = CAb
PARTICLE IN ELECTROMAGNE'rIC FIELDS 147
then for Ac = Aa.i\iJ Ne see that
j\.C = A;AC = J\CAb
=:: CAaAo = CA o
as stat(d abvve. 'fhllS (4.52) characterizes all Lorentz transforrnations.
TransformatiuD of the Adjoint Function
In order to study the time reflection properties of the covariants ?py A.1p
(see section J4) it is necessary to determine the connection between 'V"(x')
and ip(x). F'or a transformation without time reflection it was seen in
Chapter II that
'II/(X') = ip(X)Y4 A *)/4
On the ottler hand, for a time reflection
{4.69)
(ij/(x'))X = (1p'*(X!)Y4)X
(1p,X)*y: = (Cl\ 'IfJ)*y;
= 1p:J\ *(:*Y.
- 1! 'Y A * C -LX
-- T il J 14
- ,:.Ie C '- l
= -- 'If'Y 41.\. Y 4
(4.70)
since C* = C-l and cr-ly, == -')"4.C-1u 'This result certainly d]ffeIS from
(4.69). Jlowever, this is expected since 1p itself transforrns in a different
way for time reversal than in other cases. In fact, in secjcn 14 it was
shown that
Y4 A *Y4 = A-I
'Nllen Q44 > I. Now when Q44 < -1 it will be shown that
Y4-I\.*Y4 = __A-1
In section 14 it vIas shown that
./\ Yi}.A. *'/4 = k (4 71)
where k 2 = 1 and therefore k == :f:: 1.
To see that.l, is identical witll the sign of a 44 , \vhich is wha.t is needed to
complete the proof, ,ve DIUlt.ipJy
'-l4 p Y p = ...1\ -lJI4A
from the left and then second time frOT.a the right by Y4. and add the
resulting equations to get
a 4P (:P4J'P -f- Y P Y4) = ?41\.--1.y_1\. + A- 1 Y4 A ,J 4
I'.) 7 " )
\.'"""
148
RELATIVISTIC ELECTRON THEORY
We use i'j'Y + 'Ypi'« == 264p and
I',A-l = kA*'Y4 (4.71')
which is obtained from (4.71) by taking the inverse of that equation. This
is used in the first term on the right of (4.72). In the second term use
A'Y4 = ky,A*-l
which is the inverse of (4.71'). Then (4.72) becomes
2a44 ::.k[A*A + A-IA*-l]
The matrix in square brackets is now seen to be diagonal, and from its
structure its elements must be positive definite. Therefore the sign of k is
the sign of a44 and the proof is thereby completed. Substituting the result
in (4.70) gives
'.
(ip')X = ipA -lC- 1
( 4.73)
Transformation of the Bilinear Covariants under Time Reftection
..
In section 14 the transformation properties of the bilinear co variants
1p'YBc/> were investigated for continuous Lorentz transformations. The
results obtained there also apply to space reflections which involve only
"linear" transformation. However, for time reflections, with the antilinear
transformation apparing, a separate investigation is necessary and a
different result Inay be expected.
The transformation rule for 1p and q, is given by (4.68) and for ip, f> by
(4.73). Then
ip'YBP' = (ijJ'X,,rp'X)X
== (ipA -1C-l,,CAcP)X
=: (1p*Y4A-IC-1yCA4»X
= ("P*Y4 A -lC-lyCA)*
since the covariant is a 1 by 1 and complex conjugation is equivalent to
hermitian conjugation. We use l' ,A -1 = ........l\ *"4 and obtain
ip'''B' = -(1p*A*Y4C-li'CAtp)"
== (1p* A ..,-l'Y:'YCA4».
since 1' 4 C- 1 = -C-l,,.
The matrix C-li':YC is related in a simple way to C"4'YBC-l. Moreover,
C"4'YBC- 1 == -'YB;;" (4.74)
where
'=1
t= -1
for "B == 1, iyo" Il' 'Yo
for I'B == "#£' iy pI'"
(P =1= ,,)
PARTICLE IN ELECTROMAGNETIC FIELDS 149
Taking the complex conjugate of (4.74) gives
CXy:yC-IX = -'(Y4"B)*
or
C-li':YC = -'(Y'YB)*
by the properties of C: C-l = C*, C = C. Then
ip'YBc!>' = -,( tp* i\ * yilY4 A 4»*
= -'(tp*A*Y4'YBA)
= -''Y4A *Y4y B A1p
= 'A -lyBA'tp
(4.75)
where the last stp follows from Y4AY41\* = -lor y,A*/'4 = _A-I;
cf. equations immediately preceding (4.71).
We see from (4.75) that if 1p and cp are the same the only change in the
transformation law is the factor ,. Thus, for the V and T covariants, a
minus sign is introduced s compared to (2.74') and (275/, In general,
then, the transformation laws could be written as in section 14 but with a
factor 5'(a44) = a 44 /la 44 1 inserted for the Vand T covariants. It is important
"to realize, however, that when 1p::j::. cp that time reflection reverses the
roles of these two. This is intuitively obvious when 1p( <p) represent, as in
beta interactions, a particle which appears in the final (initial) state.
The result (4.75) will now be applied to the study of time reversal
properties of interaction constructed frOITI contractions of covariants.
We consider, as a special case, a term of the type
..
H B = CBKB(ab) KB(cd) + CK;(ab) K;(cd)
where
K B( a b) == ipayBtptJ
and similarly for KB(cd). The C B are ordinary numbers playing the role
of coupling constants. We wish to investigate the consequences of the
assuDlption: H B is invariant under time reversal. It will be evident that
the conclusion will also apply to an interaction of the form LBH R .
We observe that
K;(ab) = (ipa j 'B1pb)*
= ( fp a*Y4YB1Jl b )*
= 1pb*Y;Y41pa
-1) * a
,= tp Y4'YB'Y41J'
ISO
RELATIVISTIC ELECTRON THEORY
We shall use the hermitian I'll so that we can writet
K;(C!b) = €BKB(ba)
where €B = 1(-1) if 1'4 and YB comrnute (anticommute). Moreover,
Kh(ab) = {B1pb 1 \ -'ly B A1pa
where we have recognized that' depends on YB by writing B" With
.A = Y5Y4 \ve have
A -l')J = Y4'YSYBY5?'4 = 1'}B?'/At'BY4
where t]B = 1 ( -1) according to whether '"B and Y5 commute or anti..
commute. Hence
Similarly,
KB(ab) = 'B'i'IBK;(ab)
'Kf(ab) = 'B'fJsKB(ab)
The same resu]ts apply for K(ed) arid KJ'(cd) witll the same phase factors.
I-Ience, since , = 1] = 1,
'liB = CBKj;(ab) K{cd) + CKB(ab) J(B(cd)
'Vhen this is compared with HE it is seen that the consequence of the
assumption of time reversal invarianee is
., e x
('B = 11
( 4 76)
or the coupling coefficients must be reaL 11
Unit8:ry Transfor'mations
In the discussion of charge conjugation, given in section 24, it ,vas stated
that the relation 1;Jetween a given matrix (1 and the corresponding charge
conjugate matrix Qc was independent of the representation. However, it
does not foHow that, if the charge ,conjugation matrix has a particular
realization, say 1'2 as in the standard representation, in another representa-
ton it V\,ill be the transform of Y2' that is, SY2S-1. OUf purpose here is
three-fold. First we determine the relation between C in different
represntations. Second, we show that when 1fJ undergoes a unitary
transformation.
'¥(x) = S1p(x) .
(4.77)
t Although this result is not actually needed in the present connection it is cited to
show the connection of the hermitian conjugate .covariants to the reverse decay processes.
If Y B is antihermitian there is an additional minus sign in the connection between
Kl(ah) and KB(ba) which disappears in the product entering HlJo The same remark
app lies in the time-reversed H 80
PARTICl"E IN El,ECfROMAGNETIC FIELDS 151
then the charge conjugate function undergoes the same transforrna.tion:
'Y C ( x) :-.= S 1pC( x)
Finally, it ';viII be demonstrated that the unitary transformation is covariant
under allJ-,orentz transformations; that is, j f (4.77) applies i one reference
frame it a]so applies in any other with the same S.
To avoid confusion \ve use capital letters 7) r '" rather than prhnes to
designate the wave function and Dirac matrices obtained after the S
transforn1ation. A_s usua1, prhnes are reserved to designate the '\-'lave
functions obtained after a I"orentz transformatio11. 'Then if
(YpD'J + k O )1p(X) = 0
the tral1sfornlation (4.77) gives
(r pDJL + ko)'Y(x) = 0
where
I = S y S-l
JL J4
Of course the cornmutation relations
(4.78)
f',ur v + I\.r p = 2b J.tV
are valid and r /-4 can be chosen hern1itian. Therefore, by a prevIous
argument, 5" can be chosen unitary and \ve shall so choose it. Thus
S* == S-1
(4.79)
For definiteness, we refer to the f'/-l representation as the "new" repre-
sentation in contrast to the "old" representation \vhere the Dirac matrices
are 'Tritten )' p." In the old representation we use Co for charge conjugation
and th.e associated Bo as defined by (4.48). In the new representation these
are replaced by C n and Bn. Thus .
EJc = 1, E4 = --1, and
C -1 f 'X rf D
n. po t", on = € Ii 1. J.(
( 4.80 )
B r B-' l = I'x
n /l n JI.
(4.81)
Substituting (4.78) into (4.81) yields
\
B S S-lB- 1 = SX J<S-lX
n Y P n y
= S--lBoYp,BowlS
by (4.49) and (479). Multiplying on the left by y,.I)-lB;;l and on the
right by S-lBoY p.' we obtain .
S-lB- 1 S- 1 B = S-lB-1S-1B
n oY p Y It -n 0
152
RELATIVISTIC ELECfRON THEORY
This result states that S-lB:;lS-1Bo commutes with aU "II and must
therefore be a multiple of a unit matrix:
Bo = kSBnS . (4.82)
By taking the hermitian conjugate of (4.49) the result
'y: = B;l*y JlB:
is obtained. Then with (4.49) we deduce that
B: BoY Ii = Y 1J!3 Bo
\
and hence B3 Bo is a multiple of a unit matrix which is, moreover, positive.
Since a scale factor is left open in the definition of Bo it can be chosen so
that Bo is. unitary. Precisely the same argument with r JI and Bft shows
that Bra can be chosen unitary. Consequently (4.82) becomes
"'"
1 = B:Bo = Ikl'S*B:S*SBnS
=.,lkI 2 S*B:B n S = IkI 2 .S*S = (k1 2
Therefore it is permissible to choose k = 1, and when this is done
Bo = SBnS
or
Bn = SXBOS-l
For the charge conjugation 111Rtrix
Cft = Bn r 4 r 5 = SXBoS-1SY4YsS-1
= SXCos- 1
(4.83)
'I
This is the desired cqnnection between C n and Co, the charge conjugation
matrices in the two representations.
TIle properties Co = Co and CS = Co! are preserved under the unitary
transformation:
[;n = S....lCOSX = SXCoS* = C n
and
c.c - S-l*C*SX*SxC 8- 1
n n- 0 0
= SC;lSX-1SXCOS-1
= SC;lCoS-1 = 1
From (4.83) the validity of the initial statement of this paragraph can
be checked. For example, if (jJ and Q represent an operator in the old and
new representations and if
Ct.)C = 'fJO)
PARTICLE IN ELECTROMAGNETIC FIELDS 153
where 'YJ =:: :!: 1, then with a = SroS- 1 we find
QC = C-1QXC
n. n
= SC;lS-lXSXwxS- 1 XSXCoS-l
= "SC;lWXCoS- 1 = S(OcS-l
= 1']8ooS- 1 ::= 1JO
as required. . .._'
The second problem is the determination of.ttie charge conjugate of tp
in the new representation: 'Fe" This is now obtained immediltely.
'F c = C'Yx
== SC:S-1XSX'lpx
= SC:tpx
== S "Pc
Thus "Pc and tp transform in exactly the same way.
With regard to Lorentz transformations we consider first those which
do not involve time reflection. We wish to show that if tp'(x') = Ao1p(x)
then, under (4.77), 'Y'(x') = S1p'(x') and 'Y'(x') = An'¥(x). Here Ao and
An are the A transformation matrices in the old and new representations
respectively. We find with (4.78) that
a ,uyr y = SA,;lS-1S y pS-lSAoS- 1
= sAQ1s-lr /JS1\.oS-1
== .i\;lr pAn
1
SO that
An == S1\P-l
as could be expected. It is also true that
..."-Cn = CnAn
which should be compared with (4.52), and
A n r 4 A:I'4 = S(a 44 )
as was the case in the old representation.
For the transformation of'Y'(x') we see that
'¥'(x') = Af£'Y(X) = SAoS-1S1p(x)
= SAo1p(x)
= 8,,'(x')
(4.84)
154
PEL.LL\ TIVISTI( ELECTRON TI-IEOI.tV
We turn now to the Lorentz transfofJr\ation vvith thne reflection, and
we shall 8ho\1J that th same result holds., 'Ne vlrite
,¥r(x') = (CnAn:'¥(x))x := (C ytl\.nStjJ(x))X
= (AnSX't)((x)
But 'tJ"(x') = Ct-AX(x) and so
'f'(x') = CA;SXj\:-lC:--l/(jJ'(x'')
We use (43) and (4.84) to obtain
'¥'(x') = (SC;S'-lX)(SXS-lX)(SXA;-lC:l)V"(XI)
= SC:AA:'-lCI:-11p'(X')
= SC:Ct-' 11 p'(x')
== Stp'(x) .
. as ,vas stated.
PROBLEMS
. 1" ShOVi tha.t the space pari of the current density j;?) defined in (4.14) is
. ?h
)(0) == 2"1 curl 'fP('';'F
2. Sho'W' that the expectation value of fJ for any state HUlst satisfy the inequality
--1 < <f3 > :< .1
3. Find a Inatrix B satisfying (4.49) when the standard representation is used.
Choose the arbitrary factor in B so that B is unitary. What a,:!)itrariness rema.ins?
With this B fine the charge conjugation" n1atri x C.
4. Pauli has used a charge conjugatioI1 rrlatr:: C.1J for whih
C Y C--l = _. y X
p p.. :fJ f.t
Sho\v that the charge conjugate wave functiqn is
/1/)t = C-1l'T,
. p r
5.. Find a represenation in. vvhich C is the unit matrix so that charge con-
jugation is identica1 \\,ith cornplex conjugation. Find. a representation in whic.h
C? =- "-}'5 \vhere 1/5 is in the standard repiesentation. 12 Vlrite the wave equation
for zero rest Inass.
6. Give an argument, based on the gauge transformation, which ilVould show
that the charge conjugation operation rnust involve cO.,.Ttplex conjugation.
7. ShOVi that the tin1e...reVer&ecr wave funct]on tp'(x, t') is equal to AtpC(x t)
where A is defined in (4.66) and (4.52) is assumed to be fulfilled.
PARTIC[.E IN ELECTROMAGNETIC FIELDS
,
155
8. Consider an interaction of the f OrIn
H B = C n( -rpa y B1pb) (ipc}, B1pd) + ('B{ ijja y B Y 5 pb) (ipc Y B1pd) -t- hermitian conju ga te
Show that the c.onseguence of invariance of if B undr space reflection is CB = O.
Alternatively, if Ell = _u HB, then e'E = o.
9.. Sho\v that covariants forrned wjth . :::::: S'fp are exactly the saIne as those
formed \vith 1/', 'hcre the IS transformation is unitary.
10.. The Majorana represents tion of the J)irac nlatrices is one in which the
three Dirac (t.k are real hi1e f.l is pure irnagioary. More specifically, it 1S stipulated
that the transformation to the Majorana representation leaves Ct.l and (;(3 un-
changed and replaces \X2 and (3 ,jth -.f] and Ct 2 respectively. Find. the S ma.trix
connecting the Majorana and standard representations, Take the fortner to be
the H.nw" representation. Find the charge conjugation matrix in the Majorana
representation. In the Majorana rt:presentation find the A rnatrix vv'hich ,effects
the Lorentz transforrnations for (a) a space rotation around thc'z..axis: (b) a
uniform translation along th x-axis; (c) a space reflection; x k :';.::: -;c, x =:: xli;
and (d) a tin1e rtftection: x 'c = Xk, x 4 :=:;; -'t:i.
11. Show that the adjoint tp" in the net(\! representation is connectd to the
adjoint 1jJ in the old representation by
\f(x) = ip(x)S
12. If BB* = 1 sho\v that C and C n are both unitary.
13. Feynman and Gell-l\fann 13 have pointed out that t instead of using four-
con1ponent wave functions satisfying linear equations, onecou!d use two
coupled second-order equations with t"No-co.mponent functions. Find a pair
of equations of this type equivalent to the standard Dirac equations.
14u In a nuclear beta transition the final and initial states, 'tp I and 'fIJi., are
stationary states of a Hamiltonian \vhich is assumed to have the form HN ==
a.-p + PNf + V, where V is an even operator. Using the FW transformation
to first order in "1.--1, evaluate the parts of the beta interaction (see. Chapter III,
Eq. 3.77) which contain odd operators in toe nuclear space.
15,. For a Lorentz tral1sformation in \vhich VJ(x')" = Atp(x) sho\\:' that
ip'T1p' = S(a44) (det a)ap.vii'T,,1p
and for 1p'(x') ACx1J'X(;) show that the above is changed '£0 the extent ofa
rninus sign..
REFEltENCES
1. W. Gordon, Z. Physik SO, 630 (1927).
2. L. Foldy and S. A. Wouthuysen, Pllys. Rev. 78, 29 (1950).
3. C. G. Darwin, Proc. Roy. Soc. (u 1 ndun) A 118, 654 (1928).
4. H. A. ToIhoek and S:R. de Groot, Physica 1" 17 (1951).
5. K. M. Case, Phys'. .Rev. 106 173 (1957).
6. M. E. Rose and T. A. Welton,. Phys. Rev. 86, 432 (1952); R.. M. Schectman and
R. H. GOOd;f Jr., ./1m. J. Phys. 25, 219 (1956).
156
RELATIVISTIC ELECTRON THEORY
7. H. Goldstein, Classical Mechanics, Addison-Wesley Publishing Co.. Cambridget
Mass. t 1953, Chapter 6.
8. W. Pauli, Ann. illst. Henri Poincare 6, 109 (1936).
9. R. H. Good t Jr., Revs. Mod. Phys. 27, 187 (1955).
lO. G. Racah, Nuovo cimento 14, 322 (1937).
ll. L. C. Biedenharn and M. E. Rose, Phys. Rev. 83, 459 (1951).
12. Cf. W. L. Bade and H. Jehle, Revs. Mod. Phys. :!S, 714 (1953).
13. R. P. Feynman and M. Gen..Mann, Phys. Rev. 109, 193 (19S8).
v.
DIRAC PARTICLE IN A "CENTRAL FIELD
This chapter, is devoted to some of the ,most important applications of
the theory which arise in connection with central field problems. The wave
functions obtained for hydrogen-like atoms in the Kepler problem will
also' be applied to perturbation calculations wherein the perturbed
Hamiltonian does not possess spherical symmetry.
26. WAVE EQUATION IN POLAR COORDINATES
1:
We recognize at the outset t.hat the central field problems which arise in
actual applications are not strictly one-body problems but present for
consideration at least a two-body problem in which the second "particle"
is the atomic nucleus.t Since the electron in an atom perturbs the nuclear
structure ,in an entirely negligible way and whatever perturbation exists
reacts back on the electron to a very mal1 extent, the nucleus can be
treated classically as a source of the static central field. The motion of the
center of mass of the system can be eliminated in a trivial way by taking
the mass of the nucleus to pe infinite. Alternatively, one can replace the
electron mass m which appears in the following equations, when ordinary
units are introduced, by the reduced mass. The latter does not have a
uD:iue definition in relativistic problems,1 but this ambiguity is mitigated
by the circumstance that the description of the nuclear motion can be
taken to be non-relativistic with a high degree of accuracy. Whatever
course is followed, the error introduced is less than one part in 10 3 , and
this is of order or less than the radiative cotrections. c In the following
treatment r is the vector defining the position of the Dirac particle relative
to the source of the field.
.
t Many electron atoms are briefly discussed in section 29.
157
158
RELATIVISTIC ELECTRON THEORY
To obtain the polar form of the wave equation we consider a stationary
. state of energy JV in a field with a central potential energy VCr) and
transfornl the kinetic energy term cx-,. fo do this use is made of the identity
v = f(i-V) -- r X (r)( V)
,
A a . r I
=r--l-X
or r
.
(5.1)
where, as usual, I = -ir X V is the orbital angular nlomentUITJ. in the
rational relativistic units used here. From (5.1) the kinetic energy operator
becornes
-+ . a 1 I
a-p == -1(;(" -- - - a-X' X
ar r
If in
ex-...t\. (I-B = A.B + ia-A X B
we set A = r, B = I Wf.. A find
GrO'-J = fa-,. X I
Hence
P · 21 + · CY..,.. I
ex- = -1<X - t - 0'.
. r dr r
This rf.sult rnay be.6ubstituted in the wave equation and, using the K
operator defined in section 12, we obtain
W1p'= Htp = [ iY50'T ( + ! - P K ) + V + P J Vl (5.2)
or r ,.
This is the wave equation in polar fotm.
As is evident from section 12 and from the fact thatj2,jz, and K commute
\vith r'{r), these three operators commute with H. We shall be interested
in a represen.tation which diagonaIizes these three operators in addition
to H. The eigenvalues OfJ2,jz' and K arej(j + 1), l,l, and -K respectively.
As has already been mentioned, the operator of space inversion times fJ is
also diagonalized in this representation. Writing
11/1 U )
'If} = \'11'
we have
(a-I + l)V l ' := _K1p'U
? I
(0-1 + l}lfJ = K1p
j2"pn = j(j + l)1pn
jzlpn = #'1)11.
DIRAc PARTICLE IN A. CENTRAL, FIEID 159
where, in the last two equations, n = u or I.. Since 1pu and VJZ are two-
component spinors, it follows that they are proportional to X and X/'-I<
respectively; cf. Eq. (1.60'). Therefore we may write. for 1p
( g(r)x )
1p - 1pll -
- K - \if(r)XK
(5.3)
wh.ere g(r) andf(r) are radial functions which will, in general, depend 011 K.
The phase i is introduced to make the radial equations for.r and g explicitly
real. For bound states and continuum standing waves fIg win be real.
Inserting (5.3) in (5.2) we obtain the two relations resulting from
equating upper and lower components on each side of th.e wave equation :
(W' - V - l),gX: == [ - ( a df + ) . !3f. J t-:;
\ r r r.
l - dg rr Kg l
(W - V .+ l)fxK = -- + Q. + - J IX-K
dr r r
Here (1rX = .- X'=-t< has been used; cf. (1.65'). F'rom these we arrive
finally at the radial equations
df K - 1 T
- = f - (W - 1 - Jt)g
dr r
.
d g . f( -t 1
- = (W - V + 1)/ - - - g
dr r
(5.4)
It is often convenient to use
Ul = rg
U2 = rf
for whic11. th.e alternative radial equations
(5.4')
d ( Ul ) ( -K/r W + J. - V' )( Ul ) \ (5.5)
a; U2 = -( - 1 - V) Kfr U2
apply.
To obtain the corresponding results for the positron we recall that the
charge conjugate solution is
1J l = v 1 / 'X
, 2 Y
in th.is representation. Howe\(r, in applying the charge conjugate operation
it D1USt be remerrlbered that it applies to the iinle..(hpendent functions and
therefore if i 01p/ot = W1p then i a1fl/ot = - V1f'c. .Hence the radial
160 -
RELATIVISTIC ELECfRON THEORY
functions must be altere.d to the extent of changing the sign of W.
call these altered radial functions IC and gC we obtain
( - iCa 'V IlX )
(1Jl:)C = 2-1(
ig C a 2 x: x
If we
From (1.60'),
C12X:X = IC(llj;/-l- m,m)<12x m (-)p-myrn-#
'\ ,n
But
C1 2 X tn = i( - )m-X-m
So
(]2X: X = i(-)P- I C(l!j;# - m,ln)x- my r- 1l
m
The summation letter In can be replaced by -m and the relation
.
C(l!j; -p, - m,m) = (- )l+-JC(I!j; f.l + m,-m)
it used. The validity of the latter may be verified from (1.59). Then
a2XX == i( - )Z-;+ Il X ; P
The reversal of the sign of p, is just what is expected from j: = - j..
Using this last result, we obtain
( ifo -1l )
("P:y = (_)'-1+ 1'+1- 1 . X-Ie
gCX; Il
Comparing this result with (5.3), we see that (apart from a phase) (5.-6) is
obtained from the former by making the replacements: -I( for K, _igc
for f, -ifc for g, /l. for - f.l. If these replacements are made in (5.4) and
the sign of W is changed, the result is
(5.6)
'!l.:.. = K - 1 IC _ (W _ 1 + V)gC
dr r
(5.7)
dg" = (W + V + 1)f" _ K +1 g"
dr r
In other words, since IC and gC are regular solutions as are j" and g, the
charge conjugate radial functions are obtained from the f and g of (5.4)
by changing the sign of V. For a positron in an electrostatic field (5_6)
. applies withfand g obtained from (5.4) but with the sign of Z reversed.
Therefore a positron wave function is
( - if( - Z)X= : )
("P:)P08 == g( _ Z)X; I'
(5.6')
DIRAC PARTICLE IN A CENTRAL FIELD J61
and the eigenvalues of (j2)C and j: are j(j + 1) and p, respectively. If we
apply the space inversion operator PIa to "Ppos, the eigenvalue is (_ )11t+ 1
where for the electron the same operator gives (_)'1(. But {Jcl, applied to
"P P 08 again gives (- )ll(tppos. '
27. FREE PARTICLE SOLUTIONS
The angular momentum repre$entation for free particles is obtained
through use of solutions of (5.4) or (5.5) with V = O. In general, a second-
order equation for u 1 (or U2) can be obtained by elimination of one of these
radial functions. For U 1 this second-order equation is, for any cenral V,
d2Ul + dVjdr dUl
dr 2 W - V + 1 dr
+ [ (W - V? - 1 - K(K + 1) + !5 dV/dr J U l = 0 (S.8)
r 2 rW-V+l
For V = 0 this becomes
d 2 U 1 +[ .2 K(K + l) J ..... -- 0
- p - - u 1 -- .
d r 2 r 2
(5.9)
where p2 = W 2 - 1. The solution regular at r = 0 is
U 1 = ArjzCpr) (5.10)
Here A is a normalization constant. For U 2 the first ot'Eqs. (5.5) is used
to give
1 ( d \
u 2 =- -+ ) Ul
W + 1 dr r
For the spherical Bessel functions \ve use the relations
<I ( ) I .. I + 1 . + .
JL X = - Jz - It+l = - Jz Jl-l
X X
where prime means differentiation with respect to the argument x = pr.
With these relations and ,the relation 1 - 1 = .K = K/I1<I, we find
AS pr . ( '\
U 2 = K --1 , pr)
W+ 1
(5.11)
162
RELATIVISTIC ELECTRON THEORY
which applies for both signs of K. For a constant potential, W is SiUlply
replaced by »'y' - V throughout.
If the free particle wave function
( jl.X )
'1"U - . S
TIC - zp K . It
W + 1 JiX-1<
(5.12)
is compared witll the plane wave solutions (W = Po) it is clear that the
spirt orientation quantum nUInber, i.e., the eigenvalue of (9z, h,as been
replaced by f..t, the eigenvalue of Jz. This does not mean that. a direct
rep]acement is made but rather that the operators play similar roles in a
given physical situation. For example, a polarized particle would be
represented by an ensemble in v/hich states of different p, would have
unequal weights. rrhe precise relation between the two representations is
obtained by an expansion of the plane wave into angular momentum waves
similar to that carried out in section 8. \¥ e \vrite
Tl e ip.r = L "" . 'wJ.L
t... :t ,t;.", ,s,. K P. T I(
KfJ.
(5.13)
where C!:t i given in (3.7) and 1f' in (5.12). Then we require that the
e(l nation.s
I )
Po + 1 m' ·
( .._" - X = '5' a ) (p r )yfJ
, 2 """ KJl l - I\IK
Po . KJl
(5.13a)
( +J.y-i .
Po . , a.pXm = 1]J ! SKaK/lh(pr)X':K
,2po I Po + 1 KJl
be fulfilJed. Here m = :f:!. The first equation is satisfied for
a",. = 47TiZ( po2: I f C(IV;p - m,m) Kr-mX(ft)
(5.13b)
(5.14)
as comparison with. (1.69) shows. That this value of the a KJl also satisfies
the second equation (S.13b) \vill be immediately apparent. In fact, we
kno,v that tJle small component of the plane wave is obtained from the
large cornponent by applying the operator a.p/(W + 1) to the latter. Since
the large COITlpOnent of the plane wave is equal to the large component in
the expansion (5.13) with alC!-t given by (5.14), it fo11o\\'s that the proof of
the statenlcnt consists in showing that a.p/(W + 1) applied to VJU = jzX:
gives the small conlponent of the spherical waves. But this is true, since
it is just the condition used to find the small component in vt:c. The
DIRAC PARTICLE IN A CENTR.AL FIELD 163
expansion of the plane wave is, therefore, obtained from that of the large
component, 'Arhich s
, \ 1 '
W = 4--r{ E o +_ .-!. ) ".! . i L C r I 1 J '. J1. -. m m) j '.{ p r ) y,.,.-mX (p A ) x ,.,.
T large · \ 2 ) k- \2"' , " i\ l K
\ Po .00Jl.
(5.15)
An expansion of this type is useful in scattering theory; see section 33.
A corresponding expansion for particles in a central field \vill be discussed
later (section 34).
It is useful to observe that, for p along the z-axis, f.l = m = ::I:!.
28. GENERAL PROPERTIES OF 'fHE RADIAI.J FUNCTIONS
Normalization of Bound State Wave Functions
For many problems, and for the Coulomb field in particular, the bound
state solutions are rather complicated functions and the problem of
normalizing them by direct ITlethods of calculation is rather formidable.
Fortunately, there exists a comparatively simple method for carrying out
the norn1alization. 2
First it is desirable to introduce the concept of "left" and "right"
solutions. Since the normalization requires that
f VJ*VJ d 3 x = f" r 2 (f2 + g2) dr = 1
it is clearly necessary that 1p*1f' be integrable over any domain in configura-
tion space. It is possible to find solutions which are integrable at the left
end of the interval 0 < r < OCJ; that is, they are regular at r = O. These
will, in general, not be integrable as r -+ 00 unless the energy parameter W
is given one of the values corresponding to the appropriate discrete
spectrum Such solutions, which depend on W, will be called left or L
solutions. Similarly, it is always possible and usually easy to construct
solutions which vanish at infinity in such a ""vay that I)*'lp i5 integrable
there. Such solutions will not be integrable at r = 0 unless W has one of
the appropriate values (eigenvalues). Such solutions we shaH call "right"
or R solutions. If an L solution is made to coincide with an R solution at
any point '1 say, that is,
;fL(r 1 ) ==.frt(r 1 )
gL(r 1 ) = gR(r t )
th.en they will coincide at aU J' and the solution obtained will be an eigen 4
solution. The solution is then hoth an L solution and an R solution. Th.e
164
RELATIVISTIC ELECfRON THEORY
continuity conditions at any point r 1 constitute a condition 011 W yielding
the correct spectrum of energy values. Actually, since an overall scale
factor is not fixed until the normalization is applied, it is only necessary that
PL(r 1 ) == fL(r 1 ) = PR(r 1 ) = fn(rl)
gL(rl) gR(rl)
In fact, p(r) is uniquely determined by the radial equations (5.4) or (5.5)
and the stipulation of a regularity condition either at r = 0 or at r = 00.
Again, PR and PL are functions of Was well as of r.
We now consider two timewodependent solutions corresponding to
different energies Wand W'. They are
'Y = 1pe- iWt
':1" = 1p' e -iW't
where the prime refers to the energy Wi. The fourcurrent formed from
'Y and 'Y' fulfills a continuity equation. Thus
div'Y'.ci'Y + o'¥'*'¥ = 0
at
or
div'f'.'*cx'Y = i(W - W')'Y'*'¥
Integrating over a closed volume we obtain
f 'Y'*ex..'Y dS = i(W - W') f 'Y'*'Y tf>x
where <X. n is the component of (I along the outward normal on the surface S
bounding the volume of integration. Now we let W' = W + dW and
obtain
f exntp dS = -if tp*tp tFx
since there is no outward current for a stationary state; that is,
f tp*ex.. tp dS = 0
We now specialize the volume of integration to be a spherical shell with
radii'1 and '2. Then, introducing Ul = rg and U 2 = f, we find
[ J l'i J 1"t
Utll UU 2 2 2
u 2 - - u 1 - = (U I + u 2 )dr
aw oJV r1 1'1
(5.16)
DIRAC PARTICLE IN A CENTRAL FIELD 165
Taking r 1 = 0 and r2 = r, the contribution to the left side of (5.16) from
r = Tl = 0 vanishes if "t and U 2 are the radial functions of an L solution.
Hence 0 i r
u a = - 0 (uf + u) dr (5.17)
where
PL = ( u 2 )
U 1 L
For r 1 = rand T 2 = 00 the contribution fron1 '2 on the left side of (5.16)
will vanish if Ul and U 2 are radial functions of the R type. Hence
a f oo
u 2 PR = ( u 2 + u 2 ) dr
1 aw r 1 2
(5.18)
and
PR = ( u 2 )
\U J R
We combine (5.17) and (5.18) and the nornlalized solution at any point r
is given by [ a a J -1
u = PR _ PL (5.19)
oW aw W n
where W"t is one of the eigenvalues of Wand n represents the set of
quantum numbers required to specify these eigenvalues.
The normalization procedure is then as follows. From (5.4) or (5.5) one
constructs solutions for any W which are regular at r = o. One also
constructs solutions regular at r = 00. From these PR and PL are obtained
as functions of W. The Land R solutions will each contain a normalization
constant. From either the L or R solutions the correct W n are obtained.
The ratios PR and PIJ do not depend on the normalization constants.
Hence, by differentiation with respect to Wand substitution of Wn., the
right side of (5.19) is calculated. Equating this to ur obtained from R or L
solutionst with W = W n gives the value of the normalization constant to
within the usual phase :!:l. This procedure will be carried out in detail
for the Coulomb field in the next section.
In connection with these questions it is useful to examine the asymptotic
form of (5.5) in the case of practical interest: VCr) -)0- 0 as r 00. Then,
for large r,
dU t K.
- = - - U 1 + (W + 1)u 2
dr r
dU2 ( ) K
- = - W - 1 u 1 + - U 2
dr r
t After setting W = W" these solutions are identical to within the unfixed normaliza-
tion constant.
166
REL.. 1'rVISTIC ELECTRON TlfEOR Y
The asymptotic solutions are
u 1 == A(r, W)e--;'r + B(r, fV)e).r
(W + 1)u 2 = -AA'(r, W)e-).7' + AB'(r, fV)eA.r
where
A = (1 - W2),
A' = A _ 1. dA
A dr '
B' = B + .! dB
A dr
The terms in ,,/r in the differential equations and also the contribution
from the potential energy J" serve to determine the functions A(r, »/) and
B(r, "T). These will generally have the form of finite powers of r. F1rom
the result just given it is seen that a bound state requires
-1 < W < 1
(5.20)
This demonstrates a general result that all bound states must be in the
interval from -1 to + 1 Of, in ordinary units, from -mc 2 to mc 2 . In
particular cases, of course, it is possible that the spectrum of discrete
eigenvalues is restricted to an even srnaller range. The Coulomb field is a
case in point. In the aSytnptotic form of Ul the term in eAr n1ust vanish.
Thus the eigenvalues W n may be obtained as roots of the equation
Jim B(r, W n ) = 0
(5.21)
r ...Ii> 00
We must at() h3ve lim B'(r, W n ) = 0 and this will be the case where
B(r, Jfl) has a factor which depends on U7 alone and which vanishes for
W = u-/1...--see (5.42) below--or where dBfdr < .ll for large r.
The term ..4(r, W) exp (-'Ar) is, of course, the asyrnptotic form of the
R solution.
Nodes of the R.sdial Functions 3
In the non-relativistic central field problem vi/e know that for given
orbital angular rnomentuln fhe solution for the bound state with lowest
energy is nodeless if we exclude the possible zero at the origin a.nd the
point at infinity. In this open interval frofTI 0 to (fj the number of nodes
increases by one in going from one state to the next of higher energy.
There is a corresponding result in the relativistic central field prob1em, but
the situation is n10re complicated because there are now two radial
functionsw t
t Of course, we can also write the non..relativistic radial equations as t\tVO coupled
first-order <;litferential equations, This can be done in nlany ways. For example, if
r!1t, where !:il is the radial wave function, is set equal to U I and dr/Rldr is set equa.l to Uz,
we obtain a pair of equations of the general character of (5.5). However, the connection
between Ul and Ut is somewhat more involved in the relativistic case. This is reflected
in the radically different secondorder equation (5.8).
DIRAC PARTICIJE IN A CENTRAL FIELD
167
We first recognize that the quantity W - I - v in (5.5) is similar to
E - V, the kinetic energy in the non-relativisti case. In the non-relativistic
case nodes of the radial \vave function can occur only jn the region of
classically allow'ed motion, that is, where E -- V > O. Of course, \ve
consider only proper wave functions with W or E equal to one of the
eigenvalues. It is now easy to see that exactly the same result applies in
the relativistic case: nodes can occur onJv where V,? - 1 - r' > o. We
'"
shaH prove this in the practical case that v" is everywhere negati'v'e and a
monotonic increasing function of f. Thus U/ - 1 - V = 0 at only one
point:t In (5.5) we set
G K
= r U l'
F = r-- K u 2
Then in the open interval 0 to 00 in which the end points are excluded.,
nodes off and g coincide "vith nodes of }? and G. We see that
; = -r- 2 ,.,(W - 1 - V)G;
dG == r 2K (W + 1 -- V)F'
dr
No\v \ve consider a node of [/ at r = t 1 and !irb 1 .trafily assume that F < 0
for r <:: '1 and F > 0 for r > 1"1. If W" .--- 1 -.. V < 0, and s'nce }fl' + 1 -- V
is every\vhere :>0'1 jt foHows that, at '1' G is poitive and goes through a
mininlum; that is, the curvature of G i poitive. This behavior is
imoossible because F and G must both vanish at c(). "rhus, for some
l
f ::=;: r ;:> Y 1 ' the function F must reach a rnaximurn bev011d "which F
:
decreases. 1 q he point '2 i deHned so that between r a}d fr;: there are no
roots or extrem.a of F. At r 2 then, G 1Dust vanish. 'But, since;; at 1'1 ,ve saw
that G > 0 and d 2 Gjdr 2 :;> 0, it follo\vs that between '1 and r 2 the function
G must reach a Inaxin1um. 'This is not possjbJe because at such a point F
would vanish, contrary to assumption.
On. the other hand, jf J1/ ....' 1 --- J.Y :;;. 0 then at r 1 ,vhere F = 0 and
dFfdr > 0 the function (i <:: 0 and dGfdr == o. Moreovcr d 2 Gld.r 2 ;> 0 so
that G reaches a tninitnUlTI, at rt. This is a valid type of solution. For
r > rr, F and G may have otl,er zeros or G may approach zero without
crossing the axis. in that case F'reaches a ITlaximum for some r :> '1 and
then approaches zero \vithout crossing the axis for any Jarger value of r.
'I'hi& discussioD also jHutfates a p\)lnt \vhich is faidy obviou:3. It is
i!1'lpossible for.;f and g1 or I' and G.. to vanish sin1 u]ta!"ieous,ty at any point
where J/" is fJnte. If both j" and g or P' anli G vI/ere to vanish at the same
point, the equations, (5.4) for example, sho\v that j' and g would vanish
evervw]lerc. }\ second relnark ,\t'hich j at the base of our discussiol1
..
t 'fhe proof can be generalized to Sh0W that H0ies ('f;\::;ur only \vhere (j,i/ - V)1 -1 > 0,
When 1/ < 0 this is identical with the condition H7.. 1/' _a ] :'>' O.
168
RELATIVISTIC ELECTRON THEOR Y
concerns the fact that where V is continuous f and g must be not only
continuous (which is always necessary) but they must also have con-
tinuous first derivatives.
To discuss the nodes of.! and g it is useful to introduce p = fig once
more. For p we have the Ricatti equation
dp 21<p ( r ( ) 2
-=-- V-l-v)- W+l-Vp
dr r
If we also introduce cp according to
f = p sin r.p
g = p cos cp
( 5.22)
it is seen that
op = (1 + p2) Of/!
ar or
Therefore, where g = 0 and hence p = 00, aplar and oq;lor are both
negative (V < 0). Where f = 0, p = 0, both aplar and oq;jor have the
opposite sign to M/' - 1 - V. But, since this must be positive where
rt'odes occur, aplar and orp/or are negative again. Hence) in thef-g plane,
the vector representing f and g rotates clockwise whenever it crosses the
axes f = 0 or g = 0 with r increasing. Frorn the discussion of th.e functions
of F and G it js seen that the nodes off and g alternate; that is, between
every pair of adjacent nodes of j' (or g) there is one node pf g (or f).
For an eigensolution .
( ) t /
A 1 - W (',
p(oo) = - = - < 0
W+l l+W
(5.23)
Hence at 00 the functions f and g have opposite signs. Thus, to determine
the relative number of nodes off and g, we must examine the behavior at
the origin. Two cases suffice for the discussion: V(O) = constant an1
V(r) = - 'I r for small r with , > 0. 2 - 4 In the first case the behavior at
the origin is the saIne as in the free particle case; see Eq. (5.12). Thus, for
small r,
II g > 0 for K:> 0
II g < 0 for I< <' 0
For a Coulomb-like behavior of V near r = 0 we find from (5.5) that
U 1 = Ar}',
U2 = Br}'
for small rand
A(K + y) = {B
B(K - y) = 'A
DIRAC PARTICLE IN A CENTRAL FIELD 169
so that
,..2 = «2 _ ,2
The regular solutions (for all K) must correspond to {2 < 1 and y > o.
Hence
lim [ = B = K + Y
r....O g A
Thus, for K > 0, fig> 0 and, for K < 0, ,fig <.:: 0 just as in the first case
discussed. For K > 0, the angle cp at r = 0 is in the first or third quadrant
and, for K < 0, g>(O) is in the second or fourth quadrant. At r = ':I:) we
have cp"'in the second or fourth quadrant.
It follows that for 1<: > 0 the number off' nodes exceeds the number of
g nodes by 1, while for K < 0 the numbers of nodes off and g are equal.
It is seen that the number of nodes of the large component g in every'case
follows the same rule that applies to the non-relativistic radial function.
The bound state eigenfunctions for a Coulomb field are studied below,
and the results are shown in F'igs. 5.2 through 5.5. These may be compared
with the statelTIents made .here.
(5.24)
29. COUI..IO:rvm FIELD. BOUND STATES
\\Te shall consider hydrogen-like atoms for which
V = --Ze 2 /r
although for many electron atoms screening corrections due to the presence
of other electrons constitute an appreciable rnodification of the energy.
The radial equations (5.5) are now
dU l = _ KUl + ( w + 1 + r ) u 2
dr r r,
dU2 ( { ) KU?
-- == - J-V - 1 + - U 1 + --=
dr , r r
(5.25)
where { = e 2 Z = Cl2: < 1. In these equations the substitutions
U 1 = (1 -1- W)}-e-;'r( 971 + tP2)
U 2 = (1 - W)e-;"r(q;l - f{J2)
are made. Here A = (1 - W2)1/2 as before. If we also use
x = 2).r
(5.26)
170
RELATIVISTIC ELECTRON THEC)RY
the resulting equations are
dCPl _
-.. ......1
dx
( W ) ( K> )
1 - - <Pi - - + - l:f2
AX x AX
d f{J2 ( K ) , fV
- = - - + -=-- 'PI + - CfJ2
dx x AX }x
These equations are solved by substituting the power series:
( 5.27)
ao
fn = x'Y (X xfn
:rl k m
m=O
ao
m - x Y R x m
'1'2 - k Pm
m=O
which, after like po,vers of x are equated, gives the recurrence relations
W' I ' )
(J.. ( m + y) = rJ.. 1 - --- CI.. - I K. ,I- R
2n 1Yt. - ') m \ . '" P m
1\ \ /:,
( ,\ ult
,Bm(m + y) = - K -t- -: ) rJ..m. + ---:-=- Pm
\ .It A
For m > 0 this determines all 1Xm and 13m in terms of c(o and {Jo. For
m = 0 we find a pair of homogeneous linear equations in (1.(\ and Po which
are consistent jf and on]y if the determnant of the (oeftleients vanishes:
(5.28)
or
I y + 1;'>/A I( + /A
I K - /A y _ W/;i. = 0
,2
y2 __ r 2 W 2 /A 2 = K2 _ _
A 2
Using the value of )\. given above, we obtain
y2 = K2 ._ 2
as above. The regular solutions are obtained by taking the positive square
root:t y = (K 2 _ '2) (5.29)
Using this value of y we have
fJ,n 'I). - K
-=---
1Xm {V/A - :v - rn
k -, 1 / ' A
- ";,
--- ----
n' .-. nl
t For the negative root, r 2 (l2 + g2) r- 2i ' near r = 0, and this gives a divergent
result for y > t. The lTtinimum y occurs for K 2 = 1 so that in this case the negative
root would require' > iv'3 or Z > 109. For K 2 > I there is no value of Z \\'hich
permits a regular solution to be constructed.
DIRA( PARTICLE IN A CEN1RAL FIELD 171
where
n 1 = W/A - y (5.30)
Inserting tlus into the first of Eqs. (5.28) yields the result
n' - In m ( n I -- 1) · · . (n' _. m)
CXm = - CX'rn-l = (-) CXo
m(2y + m) m !(2y + 1) . · . (21' + m)
= g_ n')(2 - !l') · . · (m - n' ! (5.31)
In !(2y + 1) . · · (2y + in)
and
(3 = < - r n' · · · en' - Tn + 1) f3
m 111 !(2y + 1.) . · · (21' + nz) 0
From. the second of (5.28),
!Xo _ Y - W'/A _ n'
- --- - - - - -..,--
Po K -¥ {I A Ie - ,/ A
Tht8e results may nO'N be used in the power series for CPt and f{J2. \Vhen.
this is done we recognize that the series cpJ and. f{Jz are confluent hyper-
geometric functions. "f.hese functions can be defined by
(5.32)
(5,32')
a a(a + 1) x 2
F(a, c, x) = 1 + - x + ---- -- + · .. ·
c c(c + 1) 2!
(5.33)
wl1ere
00 m
= ! 1n _
m::;O c tn n1!
(a -{.- 111 -- I)! r(a + m)
a = --.--------- =
Tn (a . 1)! rOta)
The series (533) converges uniforrnly over the entire complex plane In
terms of the confluent hypergeon1ctrie function
(fJ1 = CI,,)x Y .P'(l - n', 2y -1-- 1. .:r.)
(5 _ 34)
fP2 = {iox} FC -- fl', 2y + 1, x)
K - 'I A
= - (1...0:(;"1 F( - n I, 2 y + 1, x)
n'
(5.35)
The asymptotic behavior of (5.33) j.s given by5
F(a, c. x) = r(c) <-X)-n [ l + o() J - + r(c) exxa-c l r-l + oU ) "l
r(c - a) \.t;; rea) .x -'
As a consequence, as r -+ oo,f'and g bthave hke eAr and are not regular at
infinity. Therefore the series (5,34) and (5.35) must be so tern1inated that
172
RELATIVISTIC ELECTRON THEORY
both of the confluent hypergeolnetric functions are simply poJynomials.
This means that. n' is a non-negative integer: n' = 0" 1, 2, . . .. The case
n' = 0 gives an acceptable solution since then rt o = 0 (see below). The
integer n' gives the number of nodes of f!'2, and it will be seen that this is
the same as the number of nodes of g. In addition to n', it is useful to
introduce the principal quantum number n where
n = n' + k = n' + IKr
Then Eq. (5.30) gives the eigenvalues
_ [ ( ' ) 2 J -- _ [ ( , ) 2 J -!i
W nk - 1 + - 1 +
n' + y n - k + Y
The eigenvalues are seen to lie in the interval
?'l < W n < 1
(5.36)
(5.37)
where the lower limit corresponds to the ls state: K = -1, n' = 0,
n = 1. In (5.37) we have attached a subscript to y which gives the value
of k: 7'1 = (1 - '2)Y2 is the ls energy.
From the result (5.36) it is seen that W depends only on 11 and k; the
non-relativistic degeneracy for the Coulomb field is partially lifted.
Whereas for principal quantum number n the 2n 2 states described by
o < I < n - 1, -I <: m l < I, and ms =:i:-! had the non...relativistic
binding 1 -- W = l'2fn 2 , now the levels with the same n and I but with
j = I :f: t are split. These levels correspond to K = k = jl + t and
K = -k - 1 = -jl - 3/2 = -j2 - 1. Here jl = I - i and)2 = I -I- I.
The level with the higher j lies higher as the x-ray data require. 6 This
splitting represents a spin-orbit energy, but only in the non..relativistic
limit will it be the same as the values given in section 7.
In Fig. 5.1 the predicted position of the levels for n = 1 and 2 is shown
for Z == 82 and for both the relativistic (r) and non-relativistic (nr) cases.
Note that the scales for n = 1 and 2 are not the same. The nUInbers on
the ordinate scale are J;J 7 nk - 1. We see that the relativistic binding is
greater, and this is generaIly the case.
Since W nk for given n depends on k = II<I but not on the sign of K, it
follows that the levels of the sarne j and n are predicted to be degenerate.
This degeneracy which, of course, also exists in the non-relativistic energy,
is an accidental degeneracy peculiar to the Coulomb field. In many...
electron atoms, where V deviates from a Coulomb field due to screening,
the level with lower llies below that with higher /. However, in hydrogen
where no screening is involved there is nevertheless a very small 2s-2p
splitting which is the well-kno\\'n Lamb shift. 6 In frequency units this
DIRAC PARTICLE IN A CENTRAL FIELD 173
splitting is 1057 megacycles per second, and so tlE/E 28 = 1.4 X 10- 3 .
This is of the same order as other radiative corrections (for ins! (laGe: the
correction to the magnetic moment).
Expanding (5.36) in powers of = rxZ) we see that
1 ,2 4 ( 3 1 )
W - 1 = E = - - - + r - - - + 0 ( Y6 )
n n 2 n 2 «;;, 8n 4 2kn 3
(5.38)
-0.040
n := 2 -0.045
2P3
:.'2
2 S1 t 2 p.
1'2 /2
-0.050 -
-0, 60
n = /j -0.180
- 0.200
is
/.
2
nr
ir
I
Figure 5.1. Energy level diagram for n = 1 and 2. The numerical values refer to
W nk - 1 and are calculated for the non-relativistic (nr) and relativistic (r) cases
for Z :r=: 82.
The first term is the non-relativistic energy, exclusive of the rest energy.
The terms in '4 are exactly what is obtained in the Pauli theory if a first-
order perturbation calculation is used to evaluate the contribution of the
sum of the following three terms:
(i) the additional energy due to variation of mass with velocityt
-teE - V)2;
(ii) the spin-orbit coupling as given in Chapter I;
(iii) the Darwin "fluctuation" term given in Chapter IVe
t The kinetic energy is (1 + p2)Yi, where p is the local momentum. Expanding in
powers of p2, the p4. term gives -l(E - )/)2, where p2 = 2(£ - V) to this order.
174
F..ELATIVIS'fIC ELECl'RON THEORY
The first terrn (1) con tributes to ijf -,. J the amount
1 1'"1"2 + 2E I -1) -+ '2,. ._,., 1 f l 4 4 t! ]
-- :: LC \"r · r" \r ,'.I = .... :2 4 4 - n 4 + n 3 (l +- 1)
,4 ( I! 3 )
"': ._... -..----- -- -
2n 4 l + i 4,
1he second term contributes (section 7)
4. 1
-- .--.........--------..-.
2n 3 (2.1 + 1)(1 + 1)
for j = 1 + t
and
<.'4 1
---- .......--- ----------
2n 3 f(21 + 1')
for j = 1 -- 1
FinaHy, the Dar"win term (4.35) gives
y4
....?- ,/.
am
" n 3
.t...
Adding tiu;se (aud noting that k == I i- J for j == i -1- t and k = i for
j = I - -i)7 confirms the validity of the stt1ernent made above.
The relativj::)tic corrections to the Coulornb energy levels are seen to be
of relative order 2 ":'.= (<XZ)i and are n10st in'portant for heavy elements
for \vhich ,xZ is no much less than unit Yo "{'his is expected because for
large Z the approximatio:1 ! Jf! mc 2 is 110t justified. Lt\S the fornl (5.38)
sho,vs" tIle corrections are )not irnportant for small principal cluantum
number. I-lo\vevcr, comparison of :\bsolute value of the calculated and
measured energies' shows that the effect of screening by the other atomic
electrons is quite in1portant. The influence of St:reening can be included
by using an average central potential so that
r
I,' 5 11 )
::= - \ r.
r
where the scr(ening factor S( depends on the choice of rnodeL Numerical
integration of the radia1 quat.i()n:s \ith :.-C; given by the Thomas-Fermi-
Dirac mocel \vith exchange, effects included yie1ds energy values in
rea,;onably good agreernent. 'ith observations.j.j l"he influence of screening
is essential for the splitting of levels \.vJth t.he same j It is also not negligibJe
for the fine-strUt:tut'e'spHtting, and calcu1ated vatues are in good agreement
with the measured ones.
DIRAC PARTICIJE IN A (':ENTRAI.l }7:IELD 175
Returning to the wave functions, we discuss first the case n' -.::;:: 0 which
requires separat comrnent. Frotn (5.32 1 ') it i eel1 that if K > 0 it is
necessary that
K = k = /A
This gives
i
W - ( 1 r2I k 2 )
- ,".1. --- ':, I
which agrees with (5.36). For K. < 0 we would obtain <Xr, ::.; 0 Vv'hen n' :::: O.
That is, r:J.o/n' is finite. In this case ({J1 0 and f{J2 takes the simple form
, ( 1' 1 " ' 1 1
CfJ2 = 'lO K -- A):t.
where tX = rxo/n' is a norn1aJization eonstant.
To determine (xo in the general ease v.; proceed as ou.tlined in the
preceding section. Starting wIth (5.5), we introduce
U 1 = (1 + W)(<I>l + <1>2)
. 1 ,.,
U 2 = (1 --. W)/2(<P 1 - <fJ 2 )
and the variable x = 2lr = 2(1 - vJl2)ir. From the resulting differential
equations for <PI and $2'
d<l>.. ( 1 W ) ( K' )
dX'- = 2 - X; <[>1'- -; + AX <[>2
= ( _ K + 1. ) <1>1 _ ( ! _ 'W ) ' <1>2
dx X AX 2 Ax
we eliminate <1>1 to obtain a second-order equation for <1>2:
(5.39)
d 2 <1>2 + ! d<I>2 + [ _! + ( 'W + ! ) ! _ j/2 - ] q>2 = 0
d x 2 X d x 4 A 2 X ;l2_
This equation can be put in nOfrnal forln by using
9Jl = x}itt> 2
as dependent variable. l"hen
d 2 9Jl + [ - ! -- ( f V + ! ) - _ y:l - ..f J Wt = 0
dx 2 . 4 . A 2 X x2
The solution of (5.40) regular at the origin is 10
en> ( ,..," ) - x y + /'e - Yi.:rr F ( ! -I .. .....J --. k ' ".,. .J... 1 A )
»"k' ,'V "',/ - 2" { , .w" r ..Ii, "'"
(5.40)
(5.41)
with
k' =: (WIA) .-,-. !
(5.41')
176
RELATIVISTIC ELECTRON THEOR)'
This agrees with the results already obtained. A solution regular at x = 00
is the Whittaker function,lO Wk',y(x), which, for our purposes, can be
defined byt
m, (x) = r( -2y) . 9JL (x ) r(2y) 9Jl, (x) (5.42 )
k.1 rei - y - k') k,r + r(i- + y _ k') k,-1
The asymptotic expansion of m3,y(x) for large x is lO
ID k , y( x) t-.....J e - z x k '
{ <X) [y2 _ (k' _ !)2][y2 _ (k' _ j)2] · . · [y2 - (k' - 'V + l)2] )
x 1+2:
v=l v!X V
(5.43)
Equating Wkl y and mk,y so that they are identical functions to within a
constant factor yields the result (5.36) for the energy. This is most readily
seen from (5.42), ,,<'hich requires that r(i + y - k') = 00 or
t + Y - k' = -n' (5.43')
where n' is a non-negative integer. For <1>2 we write
2 = z- }-iWk"y(x) (5.44a)
For <PI we use the second equation in (5.39) and the relation 5
d' a
- F(a, c, x) = - F(a + 1, c + 1, x)
dx c
a-c
= F(a, c + 1, x) + F(a, c, x)
c
(5.43")
to obtain
<PI = X-!-i(K + '/A)Wk'-l,y(X)
(5.44b)
Consequently,
( 1 - W ) (K + '/A)W k '-l1-' - W k ,
P - " "
R - 1 + W (I< + '/A)'lB k '-l;t + Wk',y
(5.45)
This result is to be differentiated with respect to W, and after differentiation
(5.43') is used. If we evaluate (5.19) at r = 0, in the sense of a limit for
small r, it is unnecessary to consider OPL/iJW as (5.17) and (5.18) show that
lim ( oprJOW ) = -Urn r[u + u] = 0
1"-+0 OPR/OW 1'-+0
t The notation used here corresponds to Whittaker and Watson's notation in the
following way: y = m, k' = k, 9Rk':Y = Mkm" Wk',y = Wkm. Equation (5.42) appears
on p. 346 of reference 10.
DIRAC PARTICLE IN A CENI'RAL FIELD 177
In calculating OPR/aW it is unnecessary to differentiate 9J(k,_y(x) because
it is multiplied by [r(t + y - k')]-l which eventually is set equal to zero.
No indeterminate forms arise thereby. Finally, we notice that only the
leading terms in IDl,t',:l:Y need be considered since we are to take the limit
of oPRloW at r = O. In evaluating (opu/oW)w n we need to use
[ 0 1 ] (_ )n'+l (Ok' )
-oW r(l + r - k') w.,= - n'! , aw w.,
(_)n'+ln'! [ ( n'+y2 )J
::..-: -- - 1 +
7T ,
in which elementary properties of the gamlna function are used.t The
remaining details of the rather lengthy m.anipulations may be left to the
reader. The result is expressible in terms of a value of oc. Taking the
negative square root as a matter of cOl1vention we get
An' [ 1'(2y _J r n' + 1) J
0C0 = - I'(2y + 1) 2'( -I< + 'IA)n'!
(5.46)
in which the value of A. must be inserted according to
A = [1 + ( n' t r j]-
= '[n 2 - 2n'(k - y)]-
(5.46')
The final results for the bound state wave functions are obtained from
(5.26), (5.34), and (5.46). They are
f == - 2A% [ r(2 Y + n' + 1)(1 - W) J (2Ar)Y-le-'\T
r(2y + 1) n'! ,(, - A;c)
x [n'F( -n' + 1, 2y + 1, 2).r) - (K - /A)F( -n', 2y + 1,2A.r)]
(5.47)
= 2?-i';\,% [ r(2 Y + n + 1)(1 + W) J (2Ar)Y-le-'\T
g r(2y + 1) n'! ,(, - AI() .
x [-n'F(-n' + 1,21' + 1, 2Ar) - (I( - '/A)F(-n', 2y + 1, 2A.r)]
(5.48)
In the non-relativistic limit is neglected compared to unity, so that
" = k, W is set equal to 1 and A = 'It1. Then/vanishes and g becomes
t [d log I'(z)/dz]z== --n' +€ = --l/E for € 1. Also,
r( -n' + e) = 1T/{n')! sin (n' + 1 - E}rr
Table 5.1. Parameters Defining the K and L SheD Radial Wave Functions for the Coulomb Field
I j
I
Subshell 'Y W A Qo a1 , c1) I C 1 N
.L
I - --------j------!----
(2')Y+
K (ls1i) (1 - '2J , 1 I I I
I' o 1 I 0 I
K = -1 I [2f(2y + 1 )]
I I
!
, i I I
'" i ' '" ..,. I y+! -, 1,-1
, . I
L 1 K( 2.s!1 1 ! (1 - e ;,)
I
L 11 (2p) (1 :Y2'\lL ( 1 + Y ) H I
.... - ':, ]=,zl --- 2
Ie = 1 I I
L 1 : 1 (2 P % ; I (4 - ;.) t?J
,
2W 2(W + 1)
,
2JV
tT1
2" + 1 T!' 2 U -t 1 (2') r 21' -r 1 I ' t"'"
- W 21' + 11 2JJ : - W 2" + 1 ! 2(2J1I)1'+1 L P(2y + 1)(2 W + IJ g
, 2W - 11 I, 2W - 11 (20r+ [ 2i' + 1 J H
- W 21' + 1 i 2(W - 1) 1 , - HI 2y + 1 12(2 w) 1'+! r(2y + lX2W -1) 'Z
I I
I , y+
o 1: 0 _ tr1
[ ' I [2r(2y + 1)];.t
I
2W
t,
1
...-
-.l
00
tT.t
t""4
>
<:
en
('")
DIRAC PARTICIJE IN A CENTRAL FIELD 179
the non-relativistic radial function, as can be seen by use of the contiguous
relation
xP"(a + 1, c + 1, x) = c.F'(a + 1, c, x) - cF(a, c, x)
All the states with k = 1 exhibit a \veak (but square integrable) divergence
of f and g at r = o. This is typical of the relativistic wave functions" It
0.6
0.2
ITlIiTll-j-r-r--n-- j
J I , I J I I I I ' I I
I Ig' ----rgnrT ---t : -----r--r-- t -- i Tll-- i l
i- I - --- I ---T- -,- --t--t-Ti--i-- I
I ! I
o I - .
! I I I I ! I I 1 ,
---t i rf, -+-t-t-ti-r--+-t-+--- L -._-
-L_ _l I -1-LLLl-1-1____L__ -J
4 6 8 10 12 14 16
I
0.4
-0.2
-0.4
o
2
r
Figure 5.2 Normalized radial wave functions rnultiplied by r for the ls state and
Z = 82. The abscissa gives r in units of "Ime. The subscript nr refers to the non..
relativistic radial function.
does not appear in the non-relativistic radial function because it is there
assumed that V <{ me 2 , which is clearly invalid near r = O. Then only the
centrifugal term 1(1 + 1)/r 2 determines the small r behavior. In the
relativistic case the second-order equation contains V2:::.. '2Jr 2 , which
counteracts the centrifugal repulsion to some extent and reduces the
exponent in the indicial behavior of the wave functions from k _u. ] to Y - 1.
For the K and L shells the wave functions j" and g may be \vritten as
follows:
f = -N(l - W)'-2r)'-le-).r(ao + aIr)
g = N(J .t W)r,-1e- ).r{C O -t- c1r)
(5.49a)
(5.49b)
'J'able 5.1 give8 the values of N, y, and ai' c. i ' For the K shell we have
n = 1, n' = 0, K = -1. For L 1: n = 2, n' = 1, K .= -1; I.'II: n = 2,
n' = 1, K = 1; L11111: n = 2, n' = 0, K = -2. In Figs. 5.2 to 5.5 these
radial functions multiplied by r are given in graphical forin for Z = 82
together with the non-relativistic rg. It is apparent that for ls and 2pi
(K and Ln [ respectively), the ratio fig is constant and, in fact, equal to
p(oo) = -(1 - W);(l 4- W)!r1 as would be required. From (5.22) it is
seen that this occurs only for the Coulolnb field and then only for n' = 0,
K < O. In these cases W = 'yJk in generaL
180
RELATIVISTIC ELECTRON THEORY
0.2-
0.1
o
-0.1
-0.2
'--0.3
-0.4
o
5
10
15
20
25
r
Figure 5.3 Same as Fig. 5.2 but for the 2s state.
0.1
I
o
-0.1
-0.2
-0.3
-0.4
-0.5
o
10
15
20
25
5
r
t *
I
I
30
35
.
I
30
35
Figure 5.4 Same as Fig. 5.2 but for th 2p state. The infinite slope of rg at r = 0
is not discernible on this scale.
0.4
0.3
0.2
0.1
o
-0.1
o
5
10
15
25
r
Figure 5.5 Same as Fig. 5.2 but for the 2p state..
."'"
30
35
DIRAC PARTICLE IN A CENTRAL FIELD 181
The ratio of the magnitudes of small to large component is of the order
p( 00) ,-...; , = ('J.,Z. However, large departures from this ratio occur,
especially near nodes of f or g or both. The order of magnitude of
l(g - gnr)lgnt is (ocZ)2 except, of course, where one of these radial functions
has a node.
30. ANOMALOUS ZEEMAN EFFECT
In a homogeneous magnetic field :Ye in addition to the Coulomb field
the Hamiltonian for an electron is
H = Ho + H'
where Ho is the Hamiltonian with the Coulomb field and
H' = ea.A = - e a-r X :!/e
2
(5.50)
In this relativistic formulation of the anomalous Zeeman effect it is
apparent that,there are no explicit A2 terms. These appear only in a
second-order equation, and therefore the present treatment includes,
among other effects, the influence of the A2 term previously neglected.
If the z-axis is chosen as the direction of the magnetic field, the
perturbation (5.50) becomes
,
H' = -1 (oczY - oc"x) (5.51)
The total Hamiltonian still commutes with jz since
(lG z , Cl..a:Y - Cl..yx) = i(a.-r - Cl..zz) = -(lz, Cl..zY - IX1JX)
Therefore matrix elements of H' exist only between states of the same fl.
Ho\vever,j2 does not commute with H', as may be verified directly.t The
matrix elements of H' with the wave functions in the angular momentum
representation are
( I H ' I Jl ) _ _ ie 1UJ [( 'L I( . ) I I' JI. )
1p1(1 "p", - 2 dl- - gKX". 0' X r z J K'X -I\.'
- (fKXKI(CI X r)Z'gK'X,)J (5.52)
The econd term in (5.52) can be put into a form similar to the first by
noting that O'rX = - X':,c and
O'r 0' X r (J". = -a X r
t A simple way to see this is to note that If' transforms under rotations like a first-
rank tensor (or a first-order spherical harmonic). Therefore it can connect states with
angular moentum j' and j if 8.(j j' 1) is fulfilled: j' = j, j ::i: 1.
182
RELArrIVISTIC ELECTRON THEORY
Hence ll
( ,w 1 " fH ' I .w Jl ) - -- ! R A
TK TI\'. - ') KK.' KK'
k
wh.ere
KKK' = LX) r(g , ,JK + g,J",) dr
is a radial matrix element sYInmetric in K, 1<:' and
r-
A KK , = J dn(X:I(o X r).IXK')
is an angular integral. Using
(5.53)
(] X ,n = 'y' - 1n
00 1',;
m . ( ) -m -m
ayX = I --. X
and the definition (1.60') of the spin-angular functions we obtail1 the result
A"K' = i(fr{C(lj;ft - t,nCo'!j';p, + !.-nJdQyt'-XY1-lyr
+ C(lV;p, + l,--t) ql'j';p,- t,t) f d.QYl'+xyyr}
'Chi'" . ){'f l "¥ a ', .... l 'r',h:.'l> g r aIs a .' et
.J .j..,,\e.,,1 (.";".,."t:,.,,, .. .L f..-\.,J,.."-J>... ...
J.. . ......
". r 3 2 1" L 1"\!
I dfl y.u i: xl" i: 1 Y . ' / T 1 :::: l - _..:.:.....2._, C e l'll- 00) C ( i'11" =F 1- :i: 1 )
. ' 1., 41T 21 + 1 J '. , ft .,.'
(5.53')
and
A . f 2(21' + l ) J ["- } ' 11 ' 0 0) '"
. A K p\ ILL 2/ + 1 \... {... ,...
x.I C(l!j;p, - T'i) C(l'!j';p. +. r,-T) C(1'11;# + 7',-27) (5.54)
l'
Fron1 this result it is apparent that
if + 1 + 1 = even integer
j' -- J " = 0 ..1- 1
J ,
as is evident fjorn the odd parity property of (a X r)z/r and its rotational
properties.. In addition, I - l' = :t: 1 from the triangular condition !i(ll'l).
Since if = I' -- K' it follows that If + I Inust be even and I - l' = 0, ::1:2"
The diagonal matrix eletnents of h't' exist for al1 states: K = K' = k,
j = j' == k - i 1 === l' = k -. 1, }, = k - 2. Tb.e non-diagonal matrix
t Reference A, D 62.
j.
DJRAC PARTICLE IN A CENTRAL F'IELD 183
elements which do not vanish occur between the following pairs of states:
(a): K = -k, 1(' = k + 1, j = k - i = j' - 1, I = k - 1 = /' - 2.
I' = k; (b): K = k, K' = -k -- 1, j = k - ! = j' - 1, 1 = l' = k,
-,
I = k + 1 Son1e examples of (a) are si - d'2' p; - J% and of (b) are
Pt. - P3A.' d4 - d s ,,;.. Obviously, the value of p for the two coupled states
D1USt be tIle same.
In constructing the secular determinant for the operator H' we shall
restrict our consideration to states within a given shell, neglecting matrix
elements between states in different shells since their energy separation is
large. In particular we consider the K and L shells. The onJy non.-diagonal
matrix elements are between 2PIA. and 2p4. for !.t = :f:!- In order to
calculate A KIC , in general \ve need vector addition coefficients of the type
C(jl 1 j; m - m 2 , m). These are listed in Table 5.2. B Using these results
and the C(llj; m - m 2 , mJ given in (1.59), it is a straightforward procedure
to calculate AICI(I for all relevant cases. For the diagonal elements we find
A _ 4il#
kk - 41 2 _ 1 '
k=1
A = _ i(l + 1),u
-k, -k; (21 + 1)(2l + 3) ,
or, in general,
k=l+l
A = 4iKp,
KK 4k 2 - 1
(5.54')
For the non-diagonal elements the results are
_ . [(1 + !)2 -- ,u2]
Ak -k-l - 1 ,
· 21 + 1
. [(I + J)2 - ,u2]
A =-}
-kk+l 21 + 3 '
1 = k
l=k-l
For s states in the K and L shells the total energy is simply
W - W _ 2elK R (n)
- nl 3 -1,-1#
(5.55)
where we have elnphasized that the radial matrix element depends on the
principal quantun1 number n. For the 1\1 shell and higher shells this result
will not apply because of non-diagonal elements between ltS and nd 3A .
For the p-states in the L shell Vle obtain a simple result for 2P%, ,u = :!:l:
W = W 22 =F ielKR_ 2 ,-2
(S.SSa)
00
,.J:::..
Table 5.2. Vector Addition Coefficients C(j1lj; m -- m 2 , mJ
j m 2 = 1 m" =0 m 2 = -1 tr1
... >
------_. ------- -l
joo-oJ
-<
[(.it + m)(h + m + I)J [(h - m + l)(jl + m + l)J [(h - m)(h - m + l)J
jl + 1 t.n
(2jl + 1)(2j1 + 2) (2jl + 1)(j1 + 1) (2jl + 1)(2jl + 2) ...,
(j
tn
t'11
_ f<jl + 111)(j1 - m + 1ll m [(h - m)(jl + m + I)J q
}l L 2jl(jl + 1) J [jl(jl + 1)] 2jlVl + 1)
C
Z
eh - m)(jl - m + l)J _ r(jl - nl)(h + !n)l [(h + m + l)(h + m)]!1i trJ
jl - 1 0
2jl(2jl + 1) L h(2jl + 1) J 2jl(2jl + 1)
DIRAC PARTICLE IN A CENTRAL FIELD 185
For It = ::I:! we use the secular determinant
W 21 + (2pIH'12p) - W
(2pIH'12p2)
We find from the above that
(2pIH'12P%)
W 22 + (2p%IH'12P%) - W
= 0 (5.56)
(2piH'12p) = i eYCR l1f t
(2pIH't2p) = -16eyeR-at-2fl
(2pIH'12p) = (2pzIH'12p)
eye ( 9 ) i
= 6 4 _,.,,2 R 1 ,-2
(5.56a)
(5.56b)
(5.56c)
The roots of (5.56) are
w = !{ W 21 + W 22 + H + ll%
::l: [(L\ W + H2% - !1!4J1.)2 + 4(H.-21)2]} (5.57)
where L\ W = W 22 - W 21 is the 2p% - 2P! splitting without magnetic
field. In (5.57) we have used an obvious abbreviation for the matrix
elements of H'.
The radial integrals are obtained from the results of the preceding
section. A straightforward calcula{ion gives
Rt-l = -t(2Yl + 1)
R,-l = - W(2 + 1) (1 _ W2)
Rll = W(2 - 1) (1 _ W2)
(S.58a)
(5.58b)
(5.S8c)
where ,. = W 21 .
R_ 2 ,-2 = -!(2Y2 + 1)
R = _ 2Yl+Ys+lWis+1(2Wl + l)J.-2r(Yl + Y2 + 2)
1,-2 '(1 + W 1 Y'l +Ya+ 2 [r(2Yl + 1)r(2Y2 + 1)]
X { [(l - W)(1 + )] [ w _ )11 + )12 + 2 l
] 2 1 (2W 1 + 1)(W 1 + l)J
+ [( '1 - W )( l + W )] [ w - 1 - Yl + Y2 + 2 J}
2 . 1 1 (2W 1 + 1)(W 1 + 1)
(5.S8e)
(5.58d)
186
RELATIVISTIC ELECfRON THEORY
1.6 I
l
o
l i I
i s ' J.L = '2 --....
W-W nt X 103
W l14
1.2 -+
0.8 -------
0.4
I
2 s 1 t fL =: - 2
V z
-0.4
is. tP.=-2
'2
-0.8
-1.2
-1.6
o
0.4
0.8
e "}.I /2 W n 1
1.2
(XiO- 3 )
Figure 5.6 Magnetic energy for s-levels, (K and L I shells) Z = 82. The ordinate gives
the additional energy due to the homogeneous magnetic field in units of th (oulomb
energy. The abscissa is ""o/'nk' where /1-0 = el2 is the Bohr magneton. Both ordinate
and abscissa scales are different for ls}-2 and 2s!.-}, but the slope of the Jines is independent
of the scale factor W n1 . The Coulomb values are 1 - W n = 0.1989 and 1 - »"21 =
0.0510.
DIRAC PARTICLE IN A CENTRAL FIELD
187
4'°1--1--
W-Wn.t 3
- W..d: - X 10 +
30 - --- -
'r- -t 2P3/. . ,u.=3t 2 -
I 2
I I
2.0 ---+---
-3.0 t-------i.
I
I
,
-4.0 L
() 0.4
\
I
t
2P3 t fL= 2
:.'2
2 Pi' fL :::: /2
/2
o
2 p IJ_=-';-
1/ 2 I ,..... 2
I -
_____L...--.--- 1 .
I 2 P 3 '2 · I'- = - 2
I
--2.0
2 p u= - 3f?-
3/ 2 ' r ... i
---+-----
0.8
e1.J/2 W nk
1.2
I
I
J
XtO-3)
F'igure 5.7 The rnagnetic energy for the 2p levels and for Z = 82. The coordinates are
the same as in Fig. 5.6 and the unit of energy is slightly different for 2PIA and 2pi.
The Coulomb value of 1 - }V 22 is 0.0458..
188
RELATIVISTIC ELECTRON THEORY
For greater clarity we have written Yk = (k 2 - 2), and in (5.58e) the
subscript on Win (5.58e) gives the value of k:
WI = e YI f
W 2 = !Y2
We can readily verify that in the non-relativistic limit these results indeed
go over into the results given in section 7. The radial matrix elements are,
in fact, independent of Z in the limit Z - 0 because, while f r-; ClwZg, the
matrix elenlent of r always involves a factor 1/ A "'-' l/<xZ. Indted, the
diagonal radial matrix elements can be written in a form which displays
this in an explicit \-vay. From (5.4) we obtain, by multiplying the first
equation by f and the second by g and adding,
f df + dg 2 .r + K - 1 f 2 I< + 1 2
- g- = :/g - g
dr dr' r r
Therefore
l C() 1 00 r ( df d ) l
2 0 rIgdr = 0 Lr 3 I dr + g d - (K - 1)r2 + (K + 1)r 2 g 2 J dr
= -f + (K + 1) roo r 2 g 2 dr - (K - 1) [ 1 _ (00 r 2 g 2 drl
J o J o
re oo
= -(K + t) + 2KL r 2 g 2 dr
= K - t - 2Kf: rap dr (5.59)
In the non-relativistic limit the last term may be neglcted. As a final
check \\'e observe that R lf -2 is equal to unity in the non-relativistic limit.
Figure 5.6 shows the magnetic energy W/W n1 -- 1 versus e:Ye/2W nl for
the s-states (n = 1 and 2) and for Z = 82. In ordinary units the abscissa
is the ratio of flo:Ye to W n1 - In Fig. 5.7 the corresponding results are
given for the 2p states. For a more extended discussion of the anomalous
Zeeman effect reference C should be consulted.
31. HYPERFINE STRUCTURE12.13
TIle relativistic corrections to the hyperfine structure can be determined
in a manner very similar to that used in the treatment of the anomalous
Zeeman effect. We must now consider the entire system of nucleus plus
DIRAC PARTICLE IN A CENTRAL FIELD 189
electron. If the nuclear spin is I, the angular ITlom.entum of the nuclear
plus electron system is F where F(F + 1) is the eigenvalue of F2 = (I + j)2.
The total system of nucleus plus electron is described by a wave function
o/; = 2 C(ljF; m - j1,p.y1>?- Jl 1p j
J{
Here <l> is the nuclear wave function and VJY is the same as lpl introduced
in (5.3).
The perturbation is
1-1' = ea.-A = ea.- m X r = em- r X ( 5.60 ) '
3
r' r
In (5.60) tn is the nuclear magnetic momnt operator, \vhich is
111 = gs!l..v I,
eli
fl4V = ") 1\ If .
,,-,1{1 C
\\lith M the proton mass and g;.v the nuclear gyroTIlagnetic ratio. The
t1rst-order perturbation energy is then
J1f' == egNf-lJ.V 2 C(Ij.f; n1 - p!);) C(IjF m - I U ' ,p')
/l p: X (I m _ ,u I 111 m - ,u /). (j,u r 3 (1 I j,u' )
(5.61)
The matrix elenlent (5.61) can be worked out by Inethods described in
reference At with the result
W' _ e _ F(F. + 1) - J(I + . 1) - j(L + 0 ( 111:'1 ) ( .1 !.. X . ) \
- Kv!-lN 2[1(1 + l)j(j + 1)r..-,; \, Ii ) I 1'2 . )
(5.62)
,vhere the double-barred quantities are "reduced n1atrix elements" which
can be defined by
and
(Imllzllm) = C(llI; mO)(IIiIIi I )
(j,u (r a (1) j,u) = C(j1j; ,uO)(; I r 1\ j)
It followst that
and
(1111111) = [1(/ + l)J
(J ' r X (1 .' ) [j(j + l)]i ( . (r X a.)z J 'n ) '
. ) = Jll r
r 3 . fl r 3
t Chapter VI, particularly Eq. (6.21).
Reference A, pp. 85-88 and Eq. (5.13).
190
RELATIVISTIC ELECTRON THEORY
Hence
W' = eg.v,us[F(F + 1) - 1(1 + 1) - j(j + l)](j;.t (r 3 a) jjp),u-l
(5.63)
The matrix element in (5.63) is exactly like the one worked out in the
preceding section except for a change in the radial integrals. Takin g' over
those results, we have
( . (r X a)z . ) "A flJl
J# ---a- JP- = -1 KK K
r
(5.64)
\vhere AKl< is defined in (5.53) and
.9t", = 2 [OOgJ" dr
--0
From (5.54') we obtain the result 14 - 16
W' = 4k; 1 egN,uzv[F(F + 1) - 1(1 + 1) - j(j + l)]K (5.65)
(5.64')
The radial integrals for the K and L shell states are
/J/j ' 1 ) 23
9t- l ( sH = -
YI(2!'l - 1)
_ 2 (1 - tV2)
3i_ 1 (2s!i) = - 2W 2 (2W - 1)(2W 2 - 1)(4W II - 3)
, 2 (1 - WJ);
&f (2plL) = -
2 7" 2w 2 (2JV + 1)(2W 2 - 1)(4W 2 - 3)
(;3
9l ( 2 3 ) = -
- 2 P J-2 4 (2 1 )
Y2 Y2-
In the non-relativistic Jhnit all these radial integrals are proportional to
(cx.Z)3, as they should be. 17 To obtain an idea of the magnitude of the
corrections to the non-relativistic limit we give the expansion of 9£1( to
two terms:
(5.66a)
(566b)
(5.66c)
(5.66d)
-l(lsi) -23(1.+ !2)
3 ( ' t 7 )
!!II_l(2sj,!) -"4 1 + 8 2
/JlJ I" ,3 ( 47 Y2 )
a I 2 p ,,(. ) - 1 + -
l' 2 - 12 24 ..
-l2p%) c:::-L - 3 ( 1 + 2. 2 )
- 24 24
DIRAC Pl\RTICLE IN A CEN'"fRAL }.:'IEIJD 191
In all cases the hyperfine rnultiplet splitting has bee.n increased. For
Z == 82, , 0.60 and the relativistic correction 1S quite appreciable.
32. CC)ULOMB FIELD CONTINUlJ1\f STA l'ES
When 'V > 1 the energy spectrum for the (oulomb field is continuous
and the "vave functions which are regular at r = 0 become standing waves.
To obtain these \vave functions 'e consider the counterpart of (5.39)"
Tha t is, we set
U 1 == (V + 1// 2 ($1 + <P 2 )
U 2 -= i{ Hl - 1)!:-2(<D 1 - <1>2)
(5.67)
and use
p = (H/ 2 - 1 )'
In place of A. Hence p is the rnagnitude of the local mon1e.nturn at r = roo
If we set
x = 2i pr
we obtain
d<l>l = ( '.! + Pt' )<1>l _ ( _ i )<1>2 (5.68a)
dx 2 px I \x PXI
d<D 2 = _ ( !:. -+ -! ' )11>1 - ( ! + ! JW )11>2 (5.68b)
cl x x px, 2 px I
If we take the cOll1plex conjugate of these equations" remembering that x
is pure imaginary, we find
d<D = _ ( ' + iW ) <I) _ ( K + 1- ) <p
dx 2 px .X px
d<1?: = __ ( ._ f )<J> + {+ W ) (p:
d x ,x px, \.c.. px
1-'hese equations are identical with (5.68a) and (5.68b) if we set
<I>f = $2 (5.69)
Hence u 1 and U 2 can be chosen real, as is obvious from the original radial
equations. These real functions will therefore gi'"vre standing waves.1"
Eliminating <P 2 , we find for $1 the second-order equation
d 2 cI>t + q\ _ f! + (! + iW ) ! + Y J <I>l = 0
dx 2 X dx L4 \2 p:1: x 2
t The extension to outgoing or ingoing waves will be obvious from the sequel.
192
REI.,ATIVISTIC ELECTRON rrHEORY
where )l has the sa1T!e meaning as before. T' o put this in normal form we
again write
1.<:
9)1 =.= X,r 2 q, 1
and find
d 2 m
--
dx 2
[1 ! 1 °1 U r ) 1 2 1 J
,';" rt ' Y - 4:
- + ( - + - - + ---,-:;-- ffi1 = 0
4 2 p J X x'"
(5.70)
This should be con1pared with the corresponding equation (5,40) in the
bound state problelTl. The regular (at r = 0) olution of (5.70) is
9Jl(x) = x y !- /2e - );/2F(y ..J r - 1 + iy 2y + 1 x)
where we have introduced
y = V)p
"r e set
<1>1 = N(y + iy)eil1(2pr}Ve-'iPTF'(y + ,+ iy, 2y + 1, 2ipr)
= lV(y + iy)e ill (2p)Y(P(r)
where N is a real norlnaiization factor "/hich) for the 1110111ent, is irrelevant.
The phase Yj Inust now be dcterll1ined so that (J>2 evaluated fforn (5.68a) is
indeed <I>. This requires that
- 2i Y + i Y r r 1 d(J) . ( y ) (j) 1
e ,'I = - Y _ iy _ iy/w L q; x dr --tp 1 + p;/ $X _ J
'The evaluation of exp ( - 2ft,) is facilitated by th use of K. urnrrler's
formula. 10
e- x / 2 F(j' + 1 + iy, 2y + 1, x) = e X / 2 },\y - iy, 2y + 1, -x)
With this and the additional help of the contiguous relation
x P'( a -f.. 1, c -t. 1, x) = c [F (a .+ 1, c, x) - F ( a, C, x)]
of the hypergeometric function, \ve find
2in. K - iy/W
e =- - -----
i' .-t 4 i JI
(5.71')
For the radial functions \ve can no\v write
rf = i( W - 1)1i N(2prY' {(/) + iy)e- i1n '+ il1
X F(? + 1 -t iy 2y + 1 2ipr) - c.c.}
(5.71)
rg = (W + 1)!,iN(2pr)/'{(}' + iy)e-ivr+it,
X }'(y + 1 + iy 2y + 1, 2ipr) + c.c}
D1RAC PARTICLE IN A CENTRAL FIELD 193
where N is again the normalization factor and 'YJ is the phase determined
by (5.71') to within an additive multiple of?T. In (5.71) c.c. means complex
conjugate.
The solutions are now norlTlahzed in the energy scale. This means that,
if 1pw and 1f'JV' are solutions corresponding to energies Wand W', ·
f d 3 x 'I/ltv''Pw = d(W - W')
An alternative normalization is to one particle in a sphere of very large
radius R. If at r = 00
(5.72)
rf = -A(W - l);i sin (pr + b)
rg = A(J-V -1- 1) cos (pr + J)
then the normalization in the sphre requires that
f d 3 x tp*V! = i R r 2 (j2 + g2) dr = A 2 WR = 1
(5.73)
Then
A = (WR)-
For normalization according to (5.72)t
A = (1Tp)-
(5.74)
We use the asymptotic behavior of the confluent hypergeonletric
functions. s l'he relevant part of this in our case is
F ( a c x) _)-- r'(c) xa-ce x + . · .
, , / r( a)
so that at r -+ co
[ ( +') ipr+i'1 J
rf -+ i(W - 1)!/Nr(2y + L)(2pr)Y lye. (2iprYll-Y - c.c.
I (y + 1 + lY)
[ ( +') i1J'1' + iq J
rg (W + 1)1.Ll\lr(2y + 1)(2pr)Y ly\e . (2iprYv-y + C.c.
I (f" + 1 + lY)
We write
y + iy = exp [ - i arg r(y + iy)]
r(y + 1 + iy) Ir(y + iy)1
(2iprY Y = e-n-1I/2 e itdog2 p r
i-Y = e-n-i Y /2
t cr., for example, reference C, p. 23.
194
RELATIVISI'll-: ELEC"fRON THEORY
Then rfand rg hae the asyn1ptotic behavIor given by (5.73), \vhere
21Ve .- 1TV /21'( 2 y + J)
A :.:: "--- (5.74')
!r(y.+ iy)1
and
tJ == b K =:. y log 2pr -- rg r(y + iy) + rJ - !'iry (5.75)
The occurrence of the r-dependent logarithm tern... is characteristic of the
COUIOlTLb field and arises from the 810\\/ decrease of V"(r). F'or lim r J/ ---+ 0
as r 00 it \vould not appear. '"fhis r-dependent phase \vill not affect any
physical results of the Coulon1b field alone. For example, it will not
appear in interference terms in scattering amplitudes since the log terrrr is
independent of K.
'rhe energy scale norn1alization fixes N when (5.74) and (5.74') are
compared. The final results are then
.' W l) ( " ) }' 11' f/i2 (J -' ( + . )1
r 1 = 1\ - "'P; e - 1/ lY" (e-- tpr+iJ,{ , + i )
2(7Tp)!'r(2y + 1) \ ) Y
x };(y + 1 +, iy, 2y + 1, 2ipr) -. c.c.} (5.76)
, (W + 1)(2prYe"1I1211\y + iy)1 r -ipr+il l ( +')
rg = 2(7Tp)J..2r(2y + 1) .-- le y. lY
X F(y + 1 + iy,2y + 1, 2ipr) + c.c.} (5.77)
Since rJ is defined only to within an additive 1nultiple of 1T, there is the
usual sign ambiguity in,f and g, but fig is unambiguous. The onJy factor
in /1( and g K which depends on the sign of Ie is e:t irj
For rnany purposes, in particular the calculation of radial Inatrix
elernents involving! or g, the integral representation. of these functions is
useful. With the same nOfrnalizatjon as in (5.76) and (5.77) \ve have 5 ,]&
{ f \ { it w - 1)2 } . e1rVI2(pr)Y
r g J = (W + 1)! 2(7Tp)!W ( Y +- iy)1
x [ e iq f+lfiT""X(l - xy-l-ill(1 + x)1+ill dx =t= c.c. ] (5.77')
.... -1
lxpansion of exp (iprx) and integratiol1 term by tern1 give the series
so] utions again.
The asymptotic behavior of the solution (5.76') and (5.77) is
rf = - ( W -=- .! tsin (pr + b)
71]) /
( Jf V + 1 \l
rg = -;;p-) cos (pr + <5)
where the phase {; is given in (5.75).
(5.78a)
(5.78b)
DIRAC PJ-\-RTICLE IN A CENTRl\L FIELD 195
Clearly) wherever a physical problen1 involves emission of electrons into
the continuum these wave functions win be important. Exan1ples of their
application occur in electron scattering, internal conversion, photoelectric
effect, nuclear beta decay, and e1ectron..positron pair formation. They
would also be relevant for Inany other problems which have hitherto been
solved only with approxirnate wave funetions. Among these we may
mention bremsstrahlung and Auger emission. Approximate "'ave functions
are therefore of son1e utility and win be discussed in the next chapter. A
formal application to scattering wiH be made later on in this chapter.
It will be recognized that the same w(:.ak singularity as appeared in the
bound state wave functions occurs in the continuum solutions at r = 0
for j = i. This behavior in both continuum and bound solutions implies
a marked modification of the description of processes in which the small r
region is important. Internal conversion is a case in point. However, in
all such cases it 1Tlay be necessary to remember that at very small distances
the potential energy' function is again modif1ed by the effect of the finite
size of the nucleus. This problem will also he discllssed in the next chapter.
Finally, it is of interest to note that screening effects on the continUUi11
solutions are usually less important than for the bound state functions.
As a simple application of the continuum wave functions we consider
the density of electrons near the nucleus. A quantity of this sort appears
in the beta-decay transition probability. Then, since we are interested in
small r, the confluent hypergeometric functions in (5.76) can be set equal to
unity. The factor of interest inf2 or g2 or both is
== e" Y II'(y + iY)12
and we consider this factor for small momentum. Then, since y is large,
we use Stirling 7 s approximtion5 and
21re 7T1I (y2 + y2Y'-!e-21'(y + iy)i1J(y _ iy)-ill
The product of the last two factors is
( + . ¥Y
I y Y , = e- 2Y luetan I'll
\1' - lY,
= e-2'V{"-1Iv+ ...)
Hence
= 21T(y2 + y2YI-}i
This result applies for electrons. For positrons j I'(y + iy)1 is unchanged,
but the factoi" exp (7T1I) becomes exp (-:7Y) and we find
POfJ = e -21111 el
196
RELATIVISTIC ELEC'TRON THEORY
Consequently, as p - 0, the number of positrons near the nucleus is very
strongly suppressed in cOlnparison with the number of slo\\' electrons.
This is evidently an influence of the Coulomb repulsion acting on the
forlrler.
If non-relativistic wave functions are used, the value of ¥,2(O), when
tp2( 00)1 = 1, is known to be
2TrY
----
I - e - 2rr1/
for electrons and
2TTY
e 21lY - 1
for' positrons (y = rxZ/p). Again, for y -+ CX), the ratio of 11JI(O)Ios/11JI(O)I1
is exp (-27TY), as would be expected.
33. SCATTFRING THEORY
The relativistic treatment of the scattering problem was first given by
Mott 19 in a famous paper in which he- also showed that the electrons are
polarized in the process of scattering. The physical origin of this polariza-
tion is connected with the spin dependence of the interaction (as evidenced
by spin-orbit coupling) which is built into the theory. The analysis of the
polarization may then be made by a second scattering, whereupon an
azimuthal asymmetry in the scattered intensity appears. Since any asym-
metry in a scattering process ""'herein the wave vector is scattered from p
to p' must be a scalar of the form &>.p X p', where f!I> is the polarization
vector, it is clear that the polarization must be at least partially transverse.
As will be seen, the direction of the polarization is along the normal to the
scattering plane, as could be expected on elen1entary principles of symmetry.
In this section we shall first develop the scattering theory fo! polarized
electrons in a purely formal way. It will be shown then that, in contrast
to the spin-independent description, there will be (»70 scattering arnplitudes
corresponding to the two possible orie1.1tations of the electron spin The
problem of obtaining an explicit form of these scattering amplitudes will
be then taken up for th case of a central field. For the first part of the
discussion '\Ie shaH follow the treatment of Miihlschlegel and Koppe. 20
As a preparation for the treatment of the scattering problem we first
introduce the concept of the density matrix. The present discussion will
be only a very brief one; for a more comprehensive treatment the literature
may be consulted. 21
DIRAC PARTICI4E IN A CENTRAl.. FIELD
197
The Density Matrix
When we use a single vvave fUliction or, generally, a state vector to
describe the electron, there is a tacit assuJnption that there exists an
experiment, designed for exan1ple to InegUre the spin con1ponent in some
direction, which will give the result + -k wjth certainty R For such a pure
state the electron polarization is complete: of unit Inagnitude and of
definite direction. Thus any Jinear cOITlbination of plane waves U_+: exp (ip.r)
not only specifies the luomentum and energy uniquely but also diagonalizes
a.a for sotne unit vector n. The precise specification of ft depends only on
the coefficients in the pure state envisaged. However, we must recognize
the existence of situations in v/hich this characteristic of maxima] inforrrla-
tion does not apply: Suppose that an electron is emitted from a nucleus in
beta decay. J n general, one does pot perform an xperilnent .in which all
observables are measured. For instance, The neutrino (or antjneutrino)
rnay not be observed in coincidence; the nuclear magnetic substate are
averaged over because the tnlitter is not prepared in a definite one of the
substates nor is the recoi1 nucleus ob.serv,.:.d in a. definite substate¥ l\S a
consequence, the electron polarization is less than unity and "\-vhat is
measured is an average value. The Inathel11atical device for perforrning
the average'1 \vhi<.;h js carried out incoherentl}', is the density matrix. 'The
probability for electron emission in SOlTle spin state is calculated and the
average of this quantity. quadratic in the electron amplitude, is taken. Thus
the electron can be thought of as being in an inlpure state. Alternatively,
'We deal with an ensefnble of pure states, each member of the ensen1ble
corresponding, in the example above, to emission with all other physical
parameters being simultaneously Ineasured.
The forolalism of the density matrix technique is based on the following
definition. Consider a pure state 'If which is expanded into a set of basic
states VJn:
'Y = C 11)
£., n'17i
n
(5.79)
Then any observable represented by an operator !2 has the average value
f\tf"j'-)Illf') _ "'\,""" () x
'- - :l,1 T } -- "'n'r.CnfC n
nn'
where Qn'n are the matrix elemtnts of !2 in the 1p basis. Now we consider
an impure state. The enseJnble average of Q is
(1l) === I ql't J6 (i) I Q)'l}'(i»)
i
where qi is the probability, or statistical weight, that corresponds to any
one of the pure states '¥(i). The latter are different states of the form (5.79).
198
R.ELATIVJSTIC ELECfRON 11IEORY
It is evident that.
fA' } = "'" , q . C (i)X C (i)
\ "- l.n n k 1 n' n
nn' i
wllere c} are the expansion coefficients of'l"(i) in the 1Pn basis. If the
matrix p is defmed by
P = q . C (i)X C (i)
nn' .k' n' n
i
we may write
(Q) == IQn'nPnn' = "rr(Qp)
nn'
(5.80)
The density matrix p is defined by Eq. (5.80). Obviously, it depends in a
quadratic way on the amplitudes c) and linearly on the probability
parameters qi-
Some relevant properties of pare:
(i) In order that <0) be real when Q is Jlermitian, it is necessary that
p be hermitian:
" X
Pnn' = Pn''YI.
(5.80a)
(H) When (2 = 1 we must require th.at (0) = 1. Hence
Tr p = 1 (5.80b)
(iii) If Q is diagonal with Onn :> 0, we Inust require that (0) > O.
Hence all diagonal elements of p are non-negative:
Pnn > 0
(5.80c)
Suppose that
c< i) = .
11 1La
corresponding to 'Y i = 1Jli. Then
Pnn' = t5. n n.,qn
(5.8Od)
which says that p is diagonal and its elements are the probabilities for
finding the system in one of the base states. Consequently rr p = 1 is
the usual normalization for the probability parameters.
(iv) If p is brought to diagonal form by a unitary transformation which
does' not change Tr p, we see that (Pn == Pn,,):
Tr p2 = p < ( pn ) 2 = (Tr p)2 = 1
n n
(5.80e)
Thus Tr p2. < 1 and each e]ement Pnn' has a square modu]us equal to or
less than unity.
DIRAC PARTIGLE IN A CENTRAL FIELD 199
There is a relation between p and the projection operators discussed in
section 19. Suppose that (12) is equal to the expectation value ofO. Then
( r\ ) _ \Tl'*Q'l''' _ 'Yxr\ \TJ'
" - I - k O'tGaA. T ;.
6).
where the spinor index summation is now explicitly indicated. It follows
then that
(0) = QalP AO' = Tr !1P
0').
where P is the projection operator: P),(j = q;',t'Y as before. Therefore in
this case the density matrix and projection operator are the same. This
case corresponds to complete polarization, f!lJ =.1. If the average polariza-
tion vector is fYJ, where {!) < 1, the density matrix can be obtained from
the projection operator by replacing the unit vector {!fJ by the average
polarization vector. Thus for the non-relativistic case the density matrix is
Po( OJ) = }{ 1 + .9-0), #2 < 1
This satisfies all four conditions (5.80a, b, c, e). Thus
p = p * ; T r p = 1; p Y2 = t( 1 + .07J z), P - /f -. :/f = l( 1 - {ljJ,)
(5.81)
The latter two elements of p are both less than unity and positive; Tr pi =
Tr 1(1 + f!lJ2 + 2[1JJ.a) = !(l + &,2) < 1.
For the relativistic electron the density matrix corresponding to
momentum, or wave vector, equal to p and polarization fIJ will be obtained
in a similar way from (3.61).
p(p,9') = 2po P + (p)t(l + P)Po(9')!(l + fJ)P +(p)
Po + 1
(5.82)
where
P +(p) = ( 1 + ,,-t.l )
.t.. 1"0
is the positive energy projection operator. In the same v.lay as before, when
we considered Po(f!IJ), it may be veril1ed that the four fundamental
properties are indeed satisfied. t
Formal Theory of Scattering of Polarized Electrons
We consider an elastic scattering process in v/hich p and f!lJ describe the
initial state and p' a.nd fJJ' describe the final state. The cross section
per unit solid angle for this process win be denoted by a(p', p, 9).
t For instance, to verify that p is hermitian it is sufficient to observe that p,,(fIJ) and
!(l + fJ) commute.
200
RELATIVISTIC ELEC1RON THEORY
We define a transition amplitude A(p', p) by
a(p', p, flJ) Po(&J') = A(p', p) po(&» A*(p', p) (5.83)
From this definition it appears that A is a 2 by 2 nlatrix. In part, the
succeeding development will explain \vhy this is the relevant transition
amplitude even though the scattering of a Dirac electron would seem to
involve four by four matrices. OUf eventual purpose is to define A
explicitly (for example, in terms of phase shifts) and to relate (] and .9' to A.
The solution of the scattering probJem leads to a wave function which
has the asymptotic formt
"p = a(p) exp (ip.r) + b(p') exp iJ.?!.
r
Here a and b are four-component spinors. We define T(p', p) by
b(p') = T(p', p) a(p)
( ; 0 8 )
( 5.85)
so that T transforn1s the incident. amplitude to the amplitude of the
outgoing wave. The cross section in (5.83) is
(J = b*b (5.86)
The density matrix for the incident beam is pep, f/J) as given in (5.82).
It is constructed from the incident wave amplitudes a(p) as shown in
section 19. Thus
PCT,t(P, f!lJ) = a(1a
(5.87a)
Although the notation does not explicitly indicate it, the fact is that a must
also depend on the direction of &J. For the final state density Inatrix we
Inust use p(p', [!P'), and this is constructed in a similar \vay fronl the b
amplitudes. We write
Pu,t(p', &J') = .h'bab (5.87b)
where % is a normalization factor chosen to make Tr p = 1. From (5.85),
%-1 Pa;'(P', &J') = [T(p', p) a(p)]u[T(p', p) a(p)]
= [Tp(p, &J)T*]a,t (5.87c)
Hence
(] = b*b = b,tb = %-1 Tr p
Therefore
%-1 = a(p', p, &J)
and (5.87b) reads
a(p', p, &J) p(p', flJ') = b X b* = T(p', p) p(p, &» T*(pf, p)
t There is no loss of generality in omitting the explicit appearance of a possible
logarithmic term in the phase of incident or scattered wave.
DIRAC PARTICLE IN A CENTRAL FIELD 201
Making use of the identity
p +(p') pep', flJ') P +(p') = p(p', .9')
we find that
G(p' p, flJ) p(p', f!I>') = P.+(p') T(p', p) pep, fJ» J'*(p', p) P +(p') (5.88)
Substituting (5.82) jnto (5.88) and writing pep', fYJ') in the form (5.82)
with primed variables (p = Po), we obtain
P +(p') T(p', p) P +(p)(1 + (J) Po(.9)(l + f3) P +(p) T*(p', p) P +(p')
= a(p', p, &J)P+(p')(l + fJ) Po(.9')(l + P)l)+(p') (5.89)
We now multiply (5.83) on the left by P +(p')(l + fJ) and on the right by
(1 + fJ)P +(p') to obtain
a(p', p, &» P +(p')(1 + (» Po(flJ')(l + (J) P +(p')
= P + (p')(l + P)A Po(&J)A*(l + (J) P +(p') (5.90)
The right side of (5.89) and the left side of (5.90) are identical. 1_herefore
we equate the remaining members to get a relation between A and T.
This relation has the form
A = !11](Po)12(1 + fJ) P +(p') T(p') p) P .,.(p)(l + (3) (5.91)
'tV here the constant I1JP is to be fixed. Then we obtain a result
1" + (p') T(p', p) p + (p )(1 -)- (3) Po( &')(1 + fJ) P -+ (p) 7*(p : p) p + (p')
= !11l1 4p +(p')(l + (3)P +(p') 11(p', p)P -t.(l»)(l + {J)Po(f!IJ)(l + fJ)
X P +(p) 1"*(p' p)P +(p')(1 + fJ)P +(p') , (5.92)
Tlus is simplified by use of the identity
! P +(p'Xl + (J) P +(p') = Po + 1 P +(p')
2 2po
Then the two sides of (5.92) are equal if
(5.92')
")
1171 2 = kPO
Po + 1
Substituting this in (5.91) fixes A in terms of J'. To obtain the inter-
pretation of A we see that
b = Ta = P +(p')b = P +(p')Ta
202
RELATIVISTIC ELECTRON THEORY
since b is a positive energy amplitude for which P+(p') has the eigenvaJue 1.
From this it follows that
t(l + (J)b =--= t(l + 13) P +(p')Ta
L
a = P + (p)a = Po P + (p)[l(l + fJ)]2a
Po + 1
\vhere the last equality follows by use of the identity (5.92') wherein p is
substituted for p'. As a consequence of this last result we may write
But
r 2 1 1 1
hl + f3)b = I- - (1 + (3) P +(p') T(p', p) P +(p) - (1 + (3) J t(l + p)a
"'Po + 1 2 2
The quantity in the square brackets is A. Hence
i(l + {J)b = A!(l + fJ)a
( 5.93)
This means that A transforms the large components of the incident wave
anlplitude into the large components of the outgoing wave ampJitude. It
fOUO'NS that the scattering is cOlnpletely described by the manner in which
the large con1ponents are influenced by the scattering field. When this part
of the incident and outgoing waves is specified, the small components are
automatically corret1y adjusted. This is a consequence of the fact that in
the as)'111ptotjc ",vave function the amplitudes are those corresponding to
essentiaIJy plane waves for which the small components are determined in
a specified and simpJe \vay from the large ones.
From (5.83) it follows that
a(p', p, flJ) = Tr .A. Pot eP)A *
( 5.94)
and
f1J' = Tr aA Po( &')£4 *
Tr A Po()A*
(5.95)
Both (J and f!lJl are therefore fixed from a knowledge of A which is forth-
coming from a detailed analysis of the scattering process. Ho\\'ever, it is
clear that A? a 2 by 2 matrix, must have the form F + Go-a, where n is
conveniently taken to be a unit vector. From a syrnrnetry consideration
ft must lie jn the direction of the normal to the scattering plane because, as
will be evident, for an initially un polarized beam :YJ' is parallel to ft and no
other direction is uniquely defined. We take
A P X p'
n=
Ip X p'l
J)1].AC PARTICLE IN A CENTRAL FIELD 203
In Fig. 5.8, ft points into the plane of the paper for the first scattering and
out of this plane for the second scattering if the scattered particle proceeds
along the vector there labeled p.
From (5.94) we find for the cross section
a(p', p, 9J) == IFI2 -t. tGj2 + (FxG + G X F}!7'.ft
(5.96)
OUTGOiNG
/
FIRST
,SCATTERING
,
OUTGOING
INCOMING
Figure 5.8 Schematic diagram illust.rating double scattering. -The outgoing momentum
p is parallel to the incoming n10mentur{t p. Th' outgoing morncntum p" makes the
same angle \vith p' ?s does the outgoing tT10mentum p_
The scattered intensity is therefore dependent on the initial polarization if
this does not lie in the cattedng plane. F0f the polarization after scattering
a some\vhat lengthier hue sirnpl calcuL3.tion gives
a(pf, I), fP)fl P = "(FG)' + (;Fx + 2.niGI?)
+ .9(;£1 2 - tG1 2 ) + i&> X fiCFG x - GF X ) (5.97)
]f the initial bean1 is unpoJarized, fj.iJ = 0 and then after the scattering the
polarization is
;tn' JA FG X '1- (; r X
fr = .r M = 11<"1 2 = -:--Gr I
This is the Mott polarization. 22 It is along n as stated above.
(5.97a)
204
REloJATIVISTIC ELECTRON THEOR)"
There are some convenient relations which can be derived from (5.79a).
For instance,
1 _ g/2 = (1 - glJw)(l :- 91 2 )
(1 + fIJ M. fIJ)2
may be verified by direct substitution. Also
fi- flJ' = 0-( f/J + f!IJ AI)
1 + fP lYE- fJJ
From these equations we can deduce some interesting consequences. If
r!P = - OJ {, then f!IJ' = 0, or the scattered beam is unpolarized. Figure
5.8 shows how this situation could be realized The beanl before the first
scattering is unpoJarized, and so after the first scattering it is polarized
with fIJ = f/J M. --fhis [IJJ M points into the plane of the paper and depends
on the scattering angle {}l. For the second scattering wherein the outgoing
partiele is parallel to the original direction of n10tion, the original polariza-
tion is - OJ lu(2), where fP M(2) is the Mott polarization that would ensue
if an initially unpolarized electron were scattered from pi to p for which ft
has the opposite direction to the ft of the first scattering. Hence, after the
two scatterings, both the wave vector and the (zero) polarization are
unchanged.
On the other hand, in Fig. 5.8, the scattering intensity along p and p"
after the econd scattering will be different, although the scattering angle
is the same for these two directions. For this it is not necessary that
'{}1 = {}2. We denote the amplitudes F and G at {} = 1}i by F i and G i - Also
11 1 and 1\2 are the unit normals for the first and second scattering. For
instance, if Po is the initial wave vector,
(5.97b)
(S.97c)
while
I
ftl = Po X p
Ipo X p' I
,.. p' X P
"2 =
Ip' X pi
for scattering into the direction p and
,.. p' X pI!
"2 =
Ip ' X p"l
for scattering into p". Thus ft 1 -ft 2 is -1 in the first case and + 1 in the
second. Then after two scatterings we find, from (5.96) and (5.97a),
a = I F 1 2 + I G -2 ::f: (F 2 G: + F:G 2 )(F 1 Gf + FfG t ) ( 5.98 )
2 21 I F ll2 + IG 1 !2
DIRAC PARTICLE IN A CENTRAL FIELD 205
where :i:: is the value of 81- 8 2 . This is the well-known analysis of polariza-
tion by double scattering. 22 Of course, it js now known that it is unnecessary
to scatter electrons in order to polarize them. Electrons emitted in beta
decay are polarized, and it is important to measure this polarization. If an
analysis of the polarization of beta particles is to be made, it can be done
in single scattering by comparing the intensity along two directions with
the same scattering angle. Thus, in the notation of Fig. 5.8, the relative
difference of intensity along p and p" is
a(p") - a(p) }"G x + FXG
= &'-8 (5.99)
a(p") + O'(p) If"1 2 + IGI2
where fJJ is the polarization in the incident beam and
X "
ft = pine p
Ipine X p";
This measures only the polarization component along 0, but this direction
may obviously be varied at will.
From (5.97b) we see that a completely polarized beam remains completely
polarized after scattering although the direction of the polarization may
change. If conditions are chosen so that
J.11-3J :> - [1 - (1 - .9f)]
f9'.ft ;;. _ 1 - (1 - f9')
fP:&f
then the polarization after scattering is at least as great as the incident
polarization but, of course, f!JJ' < 1 always. Finally, we notice that the
incident and final polarization are parallel (or antiparallel) only if the
initial polarization is completely transverse, that is, f/J X ft = o.
or
The Scattering Amplitudes
The formal solution of the scattering problem is complete when the
scattering amplitudes F and G are expressed in calculable form. We do
this for an arbitrary central field. The starting point is the expansion of
the plane wave into spherical waves carried out in section 27, Eqs. (5.13)
and (5.14). In this expansion we shall make the following changes: First,
since there should be no confusion between positive and negative energy
states, we shaH write Po = W throughout. Second, we shall consider a
superposition of the two basic plane wave states. This means that we
must replace X m in the plane wave by
! cmX m
m
206
R ELA TIVISTIC ELECrRON THEORY
where c and c_ are arbitrary constants. Third, for convenience we
change the normalization of 1p as defined by (5.12) so that the amplitudes
at r -+ 00 are the same as those given for the COUIOITlb field in (5.78).
That is, we normalize the free particle spherical wave solutions in the
energy scale. These renormalized solutions are denoted by 1p(O), to
emphasize that they are free particle solutions, and
[ peW + 1) J !i ( . jlX; )
1Jl:(O) =
17' p S.. II-
W + 1 KJlX-K
The asymptotic behavior may be checked by noting that
xU x ) -+ COS (x _ 1 1 7T)
xjj(x)-+ -SK sin (x _ 1 1_ 7T)
Thus, for Z == 0, the phase = <51(0) is
biO) = - 1 1 7T
and a definite ohoice of phase has already been made in (5.12). Since the
Coulomb phase shift must reduce to t111 for Z = 0 we observe, with
cos 2'1 = _ /(y + y2fW
y2 + y2
. 2 _ Y(K + yiW)
SIn 1] - 2 2
Y + y
that rJ is in the third quadrant for J( > 0 and in the first quadrant for K < o.
-'Nith these changes the expansion of the plane wave is
( , ) !i
1pp = 47T cmiz C(l!j; p, - m,m) Yf-m X(p) (O) (5.101)
2Wp KJ! m
( 5.1(0)
For the Coulomb field we require a solution which has the asymptotic
behaviort
+ b i(pr+1I1og 2pr)
1Jl -+ 1ppl - e
r
so that it is asymptotically a plane wave plus outgoing waves.
t Fer fields falling faster than the Coulomb field the logarithmic term is omitted.
DIRAC PARTICLE IN A CENTRAL FIELD
207
For 1p we writ("
/ )
7T -- 'l 1- . . ""/' - m X IJ.
f/J=47T t - I!sKCm 1 C(12J,fl-- m ,m)lL (f))"PK
\2 JV p K JJ m
(5.102)
where 'tp is defined by (5.3) with f and g given by the (oulomb radial
functions (5.76) and (5.77). '"fhe argument of X/g in 1p is the unit vector r
which is in the direction of observation: that is, r = p', the unit vector
in the direction of scattering. The constants SK are fixed so that the
required asymptotic behavior is obtained. Using (5,,78), we find that
. ;
s = e 1 ,o K.
K
(5.102')
where
b = 1] -- !1Tr' - arg I\y + iy) + !(l + 1)71
(5.103)
That is, o is the difference between the Coulonlb phase shift exclusive of
the logarithmic term and the Z = 0 phase shift K.(O). Jt is therefore the
additional phase due to the Coulomb fIeld. For a chfferent central field
the phase o has a similar definition but a different value, of course. It is
only in these phase shifts that the detailed structure of the central held
enters. \\lith this value of SI( the alnplitude of the outgoiQg wave is obtained
i1nmediate.iy. Writing only the relevant large components, \ve tind
2 . ( *' + 1 )
i(l + fJ)b = - 2 2 Cm(e2ia - 1) C(fj J-t - In,nl)
p 2W / I(fl. 1n
X yt.-?t't X(p) X(p')
2 . (W + J'
1T I I.. \ I')ib' -, '\'
= -- - ) ! 2 C m(e- K - 1) (,(/ 2 J:, p. - yn,n1)
p ,2JtV Kit mT
X C( 1i- i' iJ. - T 'Z- ) y.u - m X (p " ) ' y,.l- T{ )j')yT
. ,.. ,J , I' , l l \ l ,'/"
(5.104)
wherein i1ei"K(O) = - i has been used. \Ve may sin1piify this result by
choosing the z-axis along the direction of the incident beam. Then, since
( ]A
Y JJ - m X 0' ) _4- 2 L + 1 )
l \ P U J.: 'tit
417' .
we can write
},,"1 + K ) ' b = ( W' + I J \ ')' (' Em r
2 V ? W "'- m T Z
\ _ rnr .
(5.105)
where
. 14-
Bn:= - 7T --I(e2iO _ 1)(2/ + l)! r2 C(l-!j; Om) C{lj; fn - r,'"r) Y7 t -- 1 (p')
p K
(5.106)
208
RELATIVISTIC ELECTRON THEORY
The transition amplitude A of the previous section is then obtained from
! cmBXr = A ! cmX m
mr m
Therefore
IB BllA )
}'2 .., .... ,
A = I}.f _ = F + Go-.ft
\B_ !; B-!4
where Ii' is to be determined. We write
G = Go'
and
A = ( F + G z G_ )
G+ F - G z ,
where Gx. == G x :f: iG,r Consequently,
F - .l (B + B -Y2 )
- 2!-t - !
G z = I(B - B=t1)
G B G B -
+ = - 3--2, - = !4
.For p = e z , a unit vector along the z-axis, and
p' = e sin {} cos cp + e y sin {} sin cp + e z cos {}
we find
With
p X p' = e y sin -0 cos f{J - e3: sin 1? sin cp
A X A'
n= P P
I, x p'l
we see that
n% = 0,
fix :;: - sin rp,
ii' ll = cos q;
Therefore
n:!: = :!:ie:t:ifP
We can now determine the direction o[ft' relative to n. First the component
G 3 is evaluated. This gives a sum of two terms:
{[C(iij; O,!)]2 - C(!j; O,_!)]2} Y:(p')
Using the relation
C(llj; m 1 ,m 2 ) = (- )l+-j C(l!j; -nl 1 ,-m 2 )
we see that G z = O. Next we evaluate
'
AI G B !. . . C(ltj; O,!) C'(l!j; 1,-i) Y(p')
n+ + - K
fi = G_ = B!! = I" · C(l!j; 0,-1) C(llj; -1,1) Y,-l(')
I(
DIRAC PARTICLE IN A CENTRAL FIELD 209
Here the dots indicate factors, depending only on /<, which are the same in
numerator and denominator. Using the relation between C...coefficients
just given above, we see that the product of these coefficients is the same in
numerator and denominator. From the definition (1.48) of the spherical
harmonics,
ym = [ 21 + 1 l - m)! J i ( _ ) meimqJ pm ( cosD )
l 41T (l+m)! l
where Pi is the associated Legendre function. From }TtX = (- yn Yl-'n
we deduce that
Y l _ _ 2ifP y -l
l - e t
and hence
AI
n+ Zitp n+
- = -e = -
fz'- n_
This demonstrates that ft and 0' are either parallel (6' = ft) or antiparallel
(0' = -Ii). As a matter of definition we can take ft.' = 1\, since Gft' is
unaffected by the choice we make.
For the scattering amplitudes we obtain
F = -. 1- }: (e2i - 1)(21 + 1) Pt(cos {}) ! [C(lij; Or))2
4p K r
= - 2 (e2iC - 1)(21 + 1) P,CCO:; 1J)
4p K
by (1.57). For G we find, from G = -ie-ifPG+,
G = -ie-itpB!
(5.107)
-i
= - :...- '7T I ( e 2iO :C - 1)(21 + 1)-2 C(ltj; 01) C(l!j; 1,-1) Y(fa')
p K
FroIn (1.59),
C ( r1 .. 01. ) C ( l '. 1 _1 ) = -8. [1(1 + 1)]
7];] , 2 'lJ,,"! K 21 + 1
U sing this and
yl (p "' ) = _ [ 21 + 1 1 J ei'P pl ( COS # )
I 4Tr 1(1 + 1) l
the result for G becomes
G = - J.. 2 S,.{e'UlJ',. - 1) p}{cos I}) (5.108)
2p K
Equations (5.107) and (5.108) [with (5.103) for the Coulomb field]
complete the formal solution of the scattering problem. In order to obtain
210
R.ELATIVISTIC ELECTRON THEC)RY
specific results, nUlnerical procedures are necessary in general. Without
attempting an exhaustive survey of the literature, rnention may be made of
the calculations of Bartlett and \'/atson2 for Hg and of Bartlett and
Weltol1,24 also for fIg" in \vhich screening is taken into account by straight-
forward numerical calculation and by using various approximation
methods. Aore recently Shern1,:tn 25 has giv€"!l numerical results for Hg,
Cd, and Al for the unscreened field. Additional numerical values for
cross sections have been given by Doggett and Spencer 26 for Z = 6, 13,
0.06
o 6 r-- T --r--I----r-- I --y-- ----r-----r-
. I I r---t--- __L-__ ----1---- I -----..:t::.::.
O.4 ._ _ _ __ ____ __.
\---105° , . (--I
t f ..
=- I = :- .-- . ft---f-_l- f +-- j - t i
j____,--L -- --'
0.5 0.6 0.7 0.8 0.9
O.i
0.04
-1
0.02
0.2
03
0.4
vie
Figure 5.9 The asyrnmetry factor SeD) versus vJe for I-Ig (after Sherman 2 &).
29, 50, 82, and 92. These authors a1so give results for positron scattering.
For the purpose of more easHy extending the results to other elements
McKinley and Fcshbach 27 have given analytic expressions obtained by
expanding the Mott scattering in powers of r.x.Z and ocZcjv. As their Figs.
2 and 3 show, the scattering is less than Rutherford scattering for almost
all Z at large scattering angles but exceeds Rutherford scattering for heavy
elements and scattering angles in the intertnediate range. Sherman's
calculation of the am.plitudes F and G has been used 25 to calculate the
scattering asymmetry when the incident beanl is polarized. This asymmetry
factor S(fi) is defined so that the double scattering cross section is (cfs Eq.
(5.98)]
a( {}1' IJ 2 , rp2) a( {'I) a( {1 2 ) [ 1 +. S( 1)1) S({'2) COS 12]
where O-({}l) and a( {}2) are the single scattering cross sections and 4>2 is the
azimuthal angJe of the second scattering about the direction of the beam
after the first scattering. As an illustration of the results Sherman"s values
of .S(O) for Hg have been plotted in Fig. 5.9. It is clear from (5.99) that a
DIRAC PARTICLE IN A CENTRAL FI]-2LD 211
measuremnt of the azimuthal asymmetry in the single scattering of
polarized electrons together with a kno\vledge of S determines the
polarization of the incident beam.
34. rIME-DEPENDENT PERT{JRJjATICN
A development similar to that employed in obtaining the scattered wave
in the preceding section is necessary when one wishes tc answer questions
concerning the angular distribution or polarization, or both, of electrons
emitted in electromagnetic or weak interaction processes Iiere we need to
know the solutions of the Dirac cC1uation in a Coulonlb field which behave
j.
like outoing ,vavs at large distances and, as i-n the scattering problcln,
correspond to a definite direction of nlotion. In developing the necessary
formalism we follow the work of Rosc) Biedenharn, and Aifken. 28
Consider a time-dependent perturbation II' e -iwt + H' *e'iut. In a process
wherein energy is absorbed by the electron in going from an in.itial state
to a final state, only the first terl11 contributes. 'Therefore .e write the
equation of motion as
(H + H'e-irot)'t'''(r, I) = io'Y(r, t)/ot
(5.109)
Here H is the Hamiltonian in the Cou1omb field. If \ve introduce the
Fourier transfornl1p( W, i) according to
'Y(r, t) = r V->(W, r)e- iWI dW (5.110)
'"
which corresponds to writing 'f\r, t) as a superposition of stationary
states, we obtain from (5.109)
(Ii - W)1p(W, r) = -H"tp(W - oJ,r)
(5.111)
"{Jnder the conservation of energy, W -- OJ is the initial energy. -Moreover,
in a perturbation treatrnent in which H'is {;onsidered only to first order,
tp on the right side of (5.1 ] 1) should be replaced by the initial stationary
state wave function ':pi" Therefore we obtain an inhomogeneous equation
to solve:
(H - W)w == --H"'Pi
(5.112)
In order to solve (5.112) it is necessary to obtain the Green's function
of the operator Ii - W. This Ineans th.at we are required to solve
(ex-p + fJ + V ,- Vv') G(r, r{) = d(r - r')l (5.113)
where on the right side we have emphasized that there is a 4 by 4 unit
matrix (1). We recognize that G is actually a 4 by 4 matrix.
212
RELATIVISTIC ELECTRON THEORY
The solution of the Green's function problem is more easily obtained by
first considering the free particle case. Then we have
(a.p + (3 - W) Go(r, r') = J(r - r')l
(5.114)
and we see that
e il1R
Go(r, r/) = (W + ,B + a-p) - I
4'11" R
where R = r - r'. This result follows since
(5.115)
e ipR
(V2 + p2) - = -47Tb(r - r')
]{
We now introduce the well-known expansion
ipR
= ip Z hz(pr»jl(pr<) Y(r) yrn XCr')
41TR lm
( 5.115')
where h z is the spherical Hankel function of the first kind:
( )
hz{x) = H}Y2(x)
2x,
To carry out the evaluation of Go as expressed in (5.115) we observe that
11 A/Jl
l/'",K
[tp:Jout = [ pew 7T + l) Ti ip S/Ch/X':.K
W+ 1
(5.116)
is a solution of the free particle Dirac equation. We consider r > " and
construct the matrix
ap = Z {[ tp(r)Jout}O"{ 1p(r')};
KJI.
where 1p is the standing wave solutions with h z in (5.116) replaced by it.
We write this matrix as
t(1 = ( 1l :12 )
ry 21 '9 22
where each of r:g 11' etc., is a 2 by 2 matrix: Then, for example,
11 = peW + 1) L hl(r)jz(r')x(r)xX(r')
1T KJl
and
W -1
(122 = 11
W+ 1
DIRAC PARTICLE IN A CENTRAL FIELD 213
since replacing K by - K in the summand does not change the value of the
sum over K. But
I x(i) x:X(i') = 2: ! C(llj; It - T,r) C(l!j; It - 'T',T')
jp. TT' ip.
X XTXT'X y- T(f) yr-T'x(f')
where the sum over j is carried out with / fixed. The sum over j of the two
C-coefficients gives ':'t" from Eq. (1.58). Then we observe that
2:X T X TX = ( 1 0 ) = 1 2
T 0 1
In the sum over p, and T which remains, we set It - T = m and sum over
m and T. Then .
2: X:(i) xX(r') = 2: Y;n(t) y;nX(r')I 2
;p m
Hence
11 = peW + 1) 2: hl(pr) jl(pr') Y;"(i') Y;"X(i")I a
7T 1m
Con1paring this with (5.113), we see that
' G ) JV + 1 " G )
l 0 11 = ( 0 22
W -1
= ip(W + 1) 2: hl(pr) jz(pr') X(f) x:X(r')
ICp
and, in a similar way,
(G O )12 = -(G O )21 = - p2 2: SlChl(pr) jlpr') XIC(i') x:X(r')
K/J
Consequently, for, > ,',
Go = i1T 2: [1p(r)]out 1p;X(r')
K1J.
(5.117)
For, < " we need only interchange rand r', since Go is symmetric in its
arguments. This result is strongly reminiscent of the completeness relation,
but it should be stressed that the "P do not form a complete set of states.
To obtain the Green's function for the Coulomb field we need only
replace the radial functions by the Coulomb functions, since the particular
form of the radial functions which appear in the free particle solutions
played no essential role in the development above. The desired wave
function is then
'P(r) = - f d 3 r' G(r, r') R'(r') 'Pir')
(5.118)
214
RELATIVISTIC ELECTRO THEORY
and from (5.113) it is evident that (:)o'f 12) is satisfied. For the asynJp10tic
ehavior of 1p, which is all that is needld to calculate the outgoing currt;nt,
we have
( ) . 1 . "It.""' [ Jl ( ' .... {Jl i H n )
"Pas r = -11'1 lITJ 2..: "P,f r)Jout\'lf'Kf .'lfJi.
r -+ IX> f\ 11.
(5.119)
v/here the quantity in brackets is the 111atrix element of the time-independent
perturbation H'. For ('p)out we lTIUst choose radial functions which have
the asyutptotic behavior
( \ 1 /
W 1'/
r[fKr)Jcut -- i -;,. ) ei(l1TH)
. . /W + 1 ) .
r[gK(r)] nut --+ I et('[Jr+lJ)
\ 1Tp
since the phase must reduce to that of the I-Iankel functions in the Z = 0
limit. Conseq.u cntly ,
( ) l'Pr { (V +. l)XJt(p) )
"pa3 = - i , !!: 1: ei61C(lp;:IJl't"Pi) '- _ K,
P r 1.: Jl \ -- t W' -- 1) X - {l) ),
\vhere p is in the direction of the outgoing electron. r-rhe spinor in (5.120)
is an eig\;nfunction of (I-P + {3 with eigenvalue W, so that 1.p81S is indeed a
plane v\,Tave w-1th momentum p.
l:\S ,applications of (5.] 20) we may !nention two. First, if we consider
internal conversion,29 "Pi is the initial bound state and 'fp are states in the
<:ontinuurrl. The perturbatjon H' is e( a..A . <l') where A and <D are
outgoing \vave solutions of the Ivtaxvvell field.t The SUITt on K and fl is
restricted by selection rules arising from the matdx clerrient in (5..120);
for example) angular mOl11entum conservation imposes the triangular
condition il(j, L, jJ. As a second example the emission of beta particles
rnay be considered. Then H' is the nuclear beta Int{;raction of the forIn
'YjQ'l,'t i .!"2(1 + Ys), ,vhere Q is a Dirac matrix (of V or A type, say), and
'I!f and '¥i are nuclear wave functions. 'rhe dot indicates a contraction
over the tensor indiees of Q. Also 1Pi is no\v a neutrino state of negative
energy, so t.hat negative beta emission in\'olv::s absorption of a negatIve
energy neutrino or the creation of an anti neutrino. To calculate the
intensity of outgoing beta particles one considers '):a'lpa" whereas for the
polarization of the beta particles the quantity involved is (1J}:s(,()'tp,s)/( VJ;slp3S)'
where (f) is the operator discussed in section 15. In all cases the logarithmic
(5.120)
t For a nuclear transition J i -+- J h these Maxwell fields arc svperpositions of eigen-
functions of the electromagnetic angular moment L with II, -- Jfl -« L < J i + J f and
they are moreover eigenfunctions of parity. Hence for each L the potentials describe
a Inultipolc field.
DIRAC I"A'RTICLE IN A CENTRAL. FIELD
215
term in OK factors out of (5 120) as an irrelevant phase factor. It win be
recognized that(5.120) is independent of the sign convention for r; occurring
in K since 1f' also changes sign when e h7 does, but, of course, the sign
ambiguity in 1pas persists in that 1J'i is not fixed as to sign.
If we think of (5.120) as a matrix element of the form
( 1f' f III I i V'i)
where Vlf plays the role of a "final state wave function," it is clear that the
Coulomb phase enters "PI as e-- i6iC . lIenee 1Pt is not the scattered wave
discussed in the previous section but, since changing the sign of the phase
shifts converts outgoing waves into incoming waves, 1pf has the behavior
of a plane T\\rave plus an ingoing spherical wave!. This has been discussed
by Breit and Bethe. 30
PROBJ.JEJ\1S
1" What is the eigenvalue of the operator l s 13 for the wave function (5.3),
Is being the space inversion operator? How are the eigenvalues of I s f3 for
1p and ?pc, given by Eq. (5.6), related? Is this relationship a generally valid one?
2. Calculate the perturbation energy for the L shell states in the case of an
electron in a unifornl electric field.
3. Using first-order perturbation theory, find the shift of the lS!-2 and 2s
energy levels due to screening when the screening function is
5' -:: e -A.y
and .A. is chosen so that (dSjdr)r=o has the same value as for a Thoinas--Fermi
screening Inode 1.
4. Assume that the nucieus is a uniformly charged sphere of radius R = tA !.
Estimate the energy level shift of the 2s and 2p1A. levels by using first-order
perturbation theory. Under \vhat t::ircuITtstanccs, if any, would this perturbation
result be an accurate representation of the energy shift?
5. Using first-order perturbation theory, calculate the shift in the 2p% level
tinder the influence of a perturbation
e 2 Q
H' = -- 2r 3 P2(COS{)
where Q is the "nuclear quadrupole moment." rhis type of perturbation would
arise from the non-spherical shape of a- nucleus. Obtain a nUITlerical estimate
for the energy shift for Q = 10- 24 cm 2 and for Z = 63. Show that the first-order
perturbation of H' vanishes for aU levels with j = i.
6. Apply the operators rlJ (space reversal) and ia 2 times complex conjugation
(tirne reversal) to the scattered wave discussed in section 33 and. compare the
result with 1pas given by (5.1 20). A.
7. Verify the result for e 2 ' i l'J,as given in Eq. (5.71') of the texL
216
RELA TIVISTIC ELECTRON THEORY
8. Show that p(p, &J) as defined in section 33 does have the properties of a
density matrix.
9. How does one obtain the scattering cross section and polarization after
scattering for a positron, given the corresponding results for an electron?
10. In the theory of beta decay: when the Coulomb field is included, the
spectrum depends upon the following bilinear cOfi1binations of radial functions,
evaluated at the nuclear radius :31
fk 2 + g!..,
fk + g
I-kg -k - [kgk
The subscripts (k > 1) give the value of K. How are these quantities related in
positron and electron emission?
11. The influence of the Coulomb field in allowed beta transitions is expressed
in terms of the Fermi function F(Z, W). This is defined by
1 ( 2 2
F(Z, W) = 2p2 g-l + [1)
evaluated at the nuclear radius. Taking only the first term in the series expansion
of the confluent hypergeometric functions, obtain an expression for f'(Z, W).
Verify that, for Z = 0, F = 1 and that, for p -+ 0, pF has a finite limit when
Z > 0 and F vanishes for p --.. 0 when Z < O. Note: For the p = 0 limit,
Stirling's approximation for the gamma function is useful.
12. Find the bound state solutions for j = ! in a square well,
V = - V o < 0 for r < ro
V = 0 for r > Yo
What is the minimum depth V o with given "0 for a bound state? What happens
to the energy levels and wave functions as V o increases indefinite1y?
13. Consider the emission of elect.ric dipole radiation for which the selection
rules are
l = :f: 1
t:,. j = 0, :f: 1
Discuss the spectrum to be expected in the transition between states \vith
principal quantum numbers 2 and 3, and cODlpare the number of lines predicted
with the result that would be expected in the Schrodinger theory (no spin).
What differences, if any, are to be expected in the Pauli spin theory and the Dirac
theory?
14. Evaluate the scattering amplitudes F and G and the cross section a(O)
in the limit of small rxZ. What is the Mott polarization to the same order?
15. Define the irregular solutions in the continuum as those obtained by
replacing y in the regular solutions by -yo What linear combination of regular
and irregular standing waves has the asymptotic behavior of the outgoing waves
designated by [ff,l]out in section 34?
16. Find the solutjons of the radial wave equations in a Coulomb field for zero
kinetic energy at infinity (W = 1).
DIRAC PARTICLE IN A CENTRAL FIELD
217
17. Show that there exist radial functions in the Coulomb field for which the
asymptotic behavior is .
( W-l ) .
rfo --> i 1TP e,(pr +)
( w + 1 ) Y2 .
rgo ->- 1TP e,(pr + 6)
Hint: Ifjandg are the real irregular (at r = 0) solutions, consider the r depend-
ence of r 2 (fg - Ii).
18. In the scattering wave (5.102) take the terms I( = :!: 1 on]y and evaluate
"p in the litnit of small r. Note that "p n1ay be written as a spinor which closely
resembles a plane wave. With this 1jJ, for sman r construct a projection operator 32
with elements Pa.j3 = ¥ta.1p.
19. For free electrons of definite n10J11entU111 p and energy Po, show from the
definition of the density Inatrix p that
Tr po. = p!Po, Tr pfJ = Ilpo, Tr pfJys = 0
and that the traces of PYs, pfla, and pfla are linearly related to the Tr pO. Hence,
show that
4p = 1 + (a.p + fJ)/po + (Tr pa).a - (Tr pa.p)p.fJo/po
+ po(Tr pa)-(Ja + ip X (Tr pa).{3a - (Tr po.p) "Is/Po
REFERENCES
1. M. H. L. Pryce, Proc. Roy. Soc. (London) At95, 62 (1948).
2. M. E. Rose, Phys. Rev. 82, 389 (1951).
3. M. E. Rose and R. R. Newton J Phys. Rev. 82, 470 (1951).
4. K. M. Case, Phys. Rev. 80, 797 (1950).
5. Ifigher Transcendental Functions, Bateman Manuscript Project, McGraw-Hill
Book Co., New York, 1953, Vol. I, Chapter VI.
6. W. E. Lamb and R. C. Retherford, Phys. Rev. 72, 241 (1947). Also see Phys. Rev.
75, 1325 (1949); 79, 549 (1950); 81, 222 (1951); 85, 259 (1952); and 86, 1014
(1952) by W. E. Lamb and co-workers.
7. R. D. Hill, £,. L. Church, and J W. Mihelich, Rev. Sci. Ins/r. 23, 523 (1952).
8. J. R. Reitz, Phys. Rev. 77, 10 (1950).
9. R. Christy and J. KeHer, Phys. Rev. 61, 147 (1942).
10. E. T. Whittaker and G. N. Watson J Modern Analysis, Cambridge University Press,
An1erican Edition (1943), Chapter XVI.
11. H. Margenau, Phys. Rev. 57, 383 (1940).
12. G. Breit, Phys. Rev. 35, 1447 (1930).
13. G. Racah, Z. Physik 71, 431 (1931).
14. E. Fermi, Z. Physik 60, 320 (1930).
15. G. Breit, Phys. Rev. 38 463 (1931).
16. G. E. Browo J Proc. Natl. A cad. Sci. U.S. 36, 15 (1950).
17. See. for example, H. A. Bethe and R. F. Bacher, Revs. Mod. Phys. 8, 82 (1936).
18. M. E. Rose, Phys. Re:J. 51, 484 (1937).
218
RELATIVISTIC El,ECTRON TI-IEORy"
19. N. F. Mott', Proc. Roy. Soc. (London) AI24, 438 (1929).
20. H. I\flihlschlgel and loot Koppe, Z. Physik 150, 474 (1958).
21. For example, U. Fano, Revs. Mod. Phys. 29, 74 (1957). AJso R. C. Tolman, The
Principles 0..( Statistical "fechanics, Oxford University Press, Oxford, 1938.
22. N. F. l\1ott, Proc. Roy. Soc. (London) A135, 438 (1932).
23. J. ff. Bartlett, Jr., and R. E. \\latson, Phys. Rev. 56, 612 (1939).
24. J. J-l. Bartlett, Jr., and T. A. Welton, Pilys. Rev. 59, 281 (1941).
25. N. Shennan, Phys. Rev. 103, 160J (1956).
26. J. A. Doggett and V L. Spencer, Phys. Rev. 103, 1597 (1956).
27. W. lL lcKnley, Jr., and H. Feshbach) Phys. Rev. 74, 1759 (1948).
28. 1\1. E. }"{ose, L. C. Biedenham, and G. B. l-\rfken, Phys. Rev. 85, 5 (1952).
29. M. E. F,ose, Nlultipole Fields, John \Vile)' and Sons New 'York, 1955.
30. G. Breit and .H. A. Bethe, Phys. Rev. 93, 888 (J 954).
31. For exarnple, ?VI. Deutsch and O. Kofoed-Hanen1 in E. Segre (Ed.), Experbnental
l\luclear Phyj'ics, Vol. III, John WHey and Son;;, New Yark, 1959, p. 523.
32. J. D. Jackson, S. B. Treiman, aDd }-1. H. \Vyld, Jr., Z. Plzysik J50, 640 (1958).
VI.
ApPROXIMATION METHODS
For many problems, exact solutions of the Dirac equations are not
available and it is highly important to develop methods of approximation.
In this chapter we discuss some of the more important methods ,vhich
have been developed. Some of these are applicable to stationary state
problems, others to dynamical processes, and some to both types of
problems.
35. THE CLASSICAL LIMIT
The classical limit of the Dirac equations is obtained by considering
that Jj is small compared to such quantities as (xft). In this limit one
shou1d recover the relativistic Hamilton-Jacobi equations. Pauli 1 has
considered tlns limiting case in some detail and, except for some rl1odifica...
tlon of notation, we shall follow his presentation
Restoring the constants Ii, m, and c, the wave equation for a particle in
a field with vector potential A and scalar potential <D = Ao is
[ ( Ii e ) Ii a e ]
a;. -: v + - A + -: - - - Ao + fJmc 1p = 0
lei (} Xo c ,
where Xo = ct. The field cOl11ponents A and Ao are taken to be rea1. We
introduce the usual representation of 'lp in terms of the action function S:
(6.1)
"p = a exp (iSII1)
(6.2)
Then, with
e
1t = VS + - A
c
( 6.3a)
as e
71"0 = - - + - Ao
ax c
o
219
(6.3b)
220
RELATIVISTIC ELECTRON THEORY
we find
Cle ( V + n ) a + ( - 7TO ) a + pmca = 0 (6.4)
I I (} Xo
The expansion in powers of Ii is made in the amplitude a:
Ii
a = a o + -: at + · · ·
J
Then the coefficient of lio is
(<<en + fJmc - '"o)a o = 0 (6.5)
and the coefficient of Ii gives
(men + flmc - ?'To)a 1 = - ( a.eV + 1- ) a o (6.6)
oX o
The homogeneous equations (6.5) are consistent with a o =/:- 0 if
n 2 + m 2 c l = 11: (6.7)
as can be seen, for instance, by operating on the left of (6.5) with
(l e 7C + fJmc + '"0. Equation (6.7) is the relativistic Hamilton-Jacobi
equation.
By taking the hermitian conjugate of (6.5) we obtain
a: (<<en + fJmc - ?To) == 0 (6.8)
We have used the fact that real fields imply a real S and hence realn and 170.
From (6.8) it follows that
a: (necx + pmc - '"o)a n = 0
for any n = 0, 1, . . .. Hence, from (6.6) with n = 1, we see. that
a: ( CltoVao + oa o ) = 0
oXo
The hermitian conjugate of this is
. oari 0
vao.«ao + - a o =
oX o
and, adding (6.9a) and (6.9b), we obtain the continuity equation
(6.9a).
(6.9b)
div j + = 0
ot
with
. *
J = ta o Ciao
p = aria o
APPROXIMATION METHODS
221
TIle relation of this current density to the velocity of the particle is seen
from the following; E(. (6.5) is multiplied on the Jeft by aa and (6.8) is
multiplied on the right by «a o . Adding these, we find
ari(<< cx..1t + a.'Jt a)a o = 21Toaaao
or since
a .7t + «.'Jt (I = 27t
we see that
.,., oj = C p'lt
(6.10)
The velocity is deduced from the canonical equations:
. oR
X k = -
OPk
with
H = - as = _ c oS = C ( 170 _ e Ao )
at ax o c
Hence, from (6.7),
. C11'k C11'k
X k = 1-'" = --
(71: 2 + m 2 c 2 )' 2 7To
(6.11 )
Consequently,
jk = pik
(6.12)
Here the positive root in (6.7) was chosen. For the negative root,
. CTT' k
X k = - -
(71: 2 .t- 1112c2)
C1rk
=-
11'0
so that (6.12) applies in any case. It follows that the orbit of the particle
is along the direction of j: the current is convective, as would be expected
classically.
The preceding discussion is a formal one designed to exhibit the classical
]imit. However, as is well known, the limiting form of the theory for
Ii 0 can sometimes be used as an approximation for getting wave
functions and eigenvalues in the limit of large quantum numbers. This
involves, of course, the application of the Wentzel-Kramers-Brillouin
(WKB) method to the Dirac equation. 2 ClearlY:J the application of the
method is facilitated very greatly if the wave equation can be reduced to
one degree of freedom although a more general treatment is obviously
possible, in principle.
For the radial part of the central field problem we consider the second-
order equation (5.8). The application of the WKB method to this equation
222
RELATIVISTIC ELEC'"fRON THEORY
has been discussed at some length by Bessey3 and by Good$4 If the units
are restored, this equation now reads
u; + - V' 2 u + (Qo + Ql)U 1 = 0
W - V + mc
(6.13)
where the prime. means differentiation with respect to rand
(W - V)2 - m 2 c 4 1(2
Qo = ft2 2 - 2
c r
K I<. V'
Ql = - - + -
r 2 r W - V + mc 2
The terms have been grouped as shown because it is consistent to treat 1<:
as large, and Kn/r is then of order mc. Consequently Ql must be treated
as sn1aUer than Qo by one order of magnitude. Thus the two terms in Ql
are of order
1/ K ,"'-I (h/mcr) and C1..Z/ K
tin1es the term K2Jr 2 in Qo. 'The large value of r compared to Ii/mc can be
understood in terms of a pair of turning points which delineate a classically
allowed region of Illotion which encompasses the point at infinity as Ii -+- O.
The lerln in u{ in (6.13) can be eIiminated by the substitution
U 1 = VI(W - J/ + mc2)-'2
and this gives
v + (Qo + Ql + Q2)V 1 = 0
where
3 ( J.l' ) 2 1 VI!
Q2 = - 4 w - V + mc 2 - "2 w - V + mc 2
Both terms in Q2 are negligible in the classicallirnit. They are, in fact, of
second order relative to Qo, and they will be dropped in the following
treatment. Hence the WKB solutions are of the form (Qo + Ql > 0)
( W - V + mc2 ) r f r 1 / ( Q ) ]
U ex p :f:i Q 2 1 + --L dr
1 -- Q Y2 0 2Q
OJ..... rl 0
(6.14)
The small component u 2 is obtained from (5.5). These solutions are for
the region where Qo -t- Ql > O. It is obvious that for Qo + Ql < 0 the
solutions have the real exponential form and i(Qo + Ql) is replaced by
IQo + Ql[. In (6.14) an expansion has been made for Ql <{ Qo and r 1 is
a root of the integrand.
APPROXIMATION METHODS
223
For a bound state, neglecting Qt, it is seen that Qo has two real positive
roots and Qo > 0 between these roots. If these roots are denoted by r 1
and '2' with '2 :> tv the energy quantization condition is
J 1'2 Q;? dr = (n r + !)7T
Tl
( 6.15)
where n r is the number of radial nodes. 'The evaluation of the integral for
a C01Jlornb field is eleInentary, and it. gives the resu]t (5.36) provided that
we identify 11", .+ t with n'. This identification is not exact, but it is
pern1issible in the lirnit of Jarge n'. If the Ql correction is addd to the
integrand, the eigenvalues depend on the sign of 1<, contrary to the behavior
of the exact rcsulL
A.n inlproved foral of the WKB solutions has been given by GOOd,4
who has applied the method to the continuum states in the calc'ulation of
the F'erl11i function v/hich describes the influence of the screened Coulomb
field on the energy spectrum of allowed beta transitions. For bound
states this modified method gives the san-Ie energy eigenvalues as the
standard WKB procedure.
36. THE BORN APPROXIl\IATION
Retarded Interaction between Chal 4 ged Particles
One of the ITIOst iIl1portant i11ethods \vhich has been used very extensively,
especially in dynamical problems, is the Born approximation. The term
Born approxinlation is used in two different contexts. In one it involves
neglect of all interactions in zerC order so that the zero-order vv'ave
functions are plane \laVeS or free particle spherical waves. The broader
meaning of the Born approxitnation Inay bf.: illustrated by a specific
exarrlple.. In the treatrne-nt of the probieru of beta decay the electrons are
in a Coulomb field and the de(ay process takes place by virtue of the beta
coupling. It is customary and justified to treat the latter as small and to
calculate transition probabilities to lo\;vest order in the beta coupling
constant. This is a Born approximation in which the expansion parameter
is expressed in terms of the coupling constant. If the Coulomb field is
neglected and plane waves are used for the beta particle, this constitutes
an additional approximation, a Born approximation in the sense first
described above. -rhen the expansion parameter depends on the Coulomb
field and is essentially cxZ. In this case, then, a double expansion in powers
of two paranleters is being ll1ade. In the present application we shall be
concerned \vi th situations in \vhich the electronlagnetic coupling is the only
224
RELATIVISTIC ELECTRON THEORY
one present. In this kind of problem the term Born approximation has the
same two-fold meaning. For instance in the electromagnetic interaction
between t\VO charged particles, electrons say, there may also be an external
field which, for example is due to the presence of the nucleus. We can
take this field as fixed: essentially this means that the nucleus is treated as
a classical system of charges (and currents). l'hen the Born approxin1ation
consists of a perturbation treatment of the tvvo..electron interaction \vhich
is equivalent to an expansion in powers of e ::-:..: (Y.., In addjtion, we may
use the Born approximatIon in the sense that tb:e external field is neglected
"so that the electrons are represented by plane 'Naves for exarnple. The
sense in which the approximationg are made 8hould be clear from the
context. ..
The problem of interaction b(tweer! charged particles \\'ilj be formulated
by assuming that two Dirac particles ate conpJed to the electromagnetic
field. Then for each particle the equation of n1Gtion
( ' a if:: ) \ , '"
r /l , ax /l + he A It '!p + ko 1f = \J
applies. Here AJ.t' the four-potentia] of the eiectrornagnetic field, is
evaluated at Xl in the equation for particle 1 and slrni]arly at Xz in the
equation for particle 2. We shall make no distinction between 1 1 and t 2 ,
using a common time t for both particles. The field A p, at x] , say, is
generated by particle 2 according to
o2Av 4.11'
-. Sy (6.1.7)
axp oX)L - - -;;
where the current four-vector due to 2 is
/ . 1 .....,
, 6 c l Q 1
Sv = - iecip(2)Y2V 1f'{?'}
( 6. i8)
If we consider a dynarriical problem where particle 2 maks a transition
from state 1p2) to 1JlY1.), \ve iTIUst replace s". by
S = -- ; ec 'f7; 2} 111 ,w2)
V f.,r T f 't'2v'f z
(6.18')
" ;
Then
1 t- . / if )
A / ) J S ,.6\ r:h" d 3
I r 1 t = - ---- rf)
V\' C R -"
( ;' "'0/' )
C. Lt"
where R = r 1 - r 2 and t ' == t -- Rlc. Under the influence of A"., particle
1 is considered to make a transition fforo state .)l) to 1f.P. l"'hen the rnatrix
eleljlent for this is written, in correspondence princIple fashion, as
If . f -(H A ( ) '1' A
fi = Ie )i Ylp I f 1 1.fJi' tl r 1
r.J
(6.19)
APPROXIMATION METHODS
225
where we recognize that only the space part of the potential A{4 enters in
the results of the perturbation theory. In fact, sp(r 2 , t) has a time
dependence given by exp UCW f - Wi)t/h] = exp (-hot), and replacing t by
t - R/c gives a factor
e -iwteikR
where k = ({)je = (Wi - Wf)flic. The All in (6.19) includes everything
but the time factor, exp ( - iwt).
From (6.18') and (6.18") we find
r eikll
H Ii = e 2 ifJ2) ifJ}l'Yl ll f'21l R - 'VI)2) 'VI?' d 3 r t d 3 r 2
(6.20)
This may be written in the alternative forIn
R
H Ii = e 2 J 'VI2)*'VIt)*(1 - a. t °Ot 2 ) e R 'VI2) '1jJ1) d3rt d 3 r 2
( 6.20')
rhis is the wen-known retarded interaction between two electrons. It
was first obtained by M0l1erS and has been discussed by many other
authors. 6 The retardation is expressed through the scalar Green's function
(exp ikR)jR. The first term is recognized as the Coulomb repulsion, the
second is the relativistic current-current interaction. While the operator
appearing in (6.20) is a contraction of two four-vectors and is relativistically
invariant, it should be emphasized that the matrix element as obtained here
is correct only to first order in e 2 or <X. This is brought out explicitly in a
more.detailed derivation given in Appendix E. There it will be seen more
clearly that the Coulomb repulsion term arises from the virtual emission
and absorption of longitudinally polarized quanta, while both transverse
and longitudinal polarizations contribute to the (11-(12 term. The wave
functions in (6.20) or (6.20') need not be free particle wave functions.
For instance, in the Auger effect "1'(1) and 'lp(2) are wave functions in the field
of the nucleus. For two electrons antisytnmetrization in initial, final states
is necessary. Tn internal conversion this is not needed, of course, because
then one of the particles is a nucleon, the other a Dirac electron. For this
problem a in the nuclear space should be thought of as a current operator
whose precise specification depends on questions of nuclear dynamics.
The matrix element (6.20') would also include a sum over all nucleons.
'The Breit Interaction
A different approach to the problem of two electrons has been developed
by Breit 7 In this treatment of the problem the instantaneous Coulomb
repulsion is included as a zero-order term in the total Hamiltonian, and an
226
RELATIVISTIC ELECTRON THEORY
additional approximately relativistic current-current interaction B is
deduced from the mutual emission and absorption of transverse quanta
between the two electrons. Thus we would write the equation of motion
for a stationary state in the form
( W - Ro(1) - He(2) - ) 'f = Bo/
\ 2
(6.2J)
Here W is the total energy of the system, Ho(n) = cx n ,,1i n + t3n + Vext(r n )
where nn = Pn + eAxt(rn) in general. To determine B the zero-order
solutions of (6.21) with B = 0 are used in a second-order perturbation
treatment. This gives the interaction energy
AW = _ e 2 ! J d3k' .L { (011Xv. exp (ik'or 1 )ln)(nI 1X 2l exp (-ik o r 2 )IO)
47T 2 ).. k' n t k' + W n - W o
+ (OIIX2). exp (i'or2)ln)<n!lXu exp (- ikor1)IO) }
k + W n - JV o
This result corresponds to emission of a quantum with wave number k',
polarization A by particle 2 and its absorption by particle 1 (first term)
and (second), the same process in which particles 1 and 2 are interchanged.
The numerical factor comes about from the (27T)-3 coming from the
density of states of wave number k' and a factor 211/k' from the normaliza-
tion of the radiation field.t The index n designates an intermediate state
in which the energy is k' + J¥n e
The Breit interaction results from the neglect of the energy differences
W n - "V o compared to k'. This implies a neglect of retardation. The
justification for this step may be made by noting that in an atom of atomic
number Z the important values of k' are of order < 1/ r) ,-....; ocZ while
W n - W o r-...; (ocZ)2. Thus We must assume ocZ <{ 1. When this is done
the sum over n is carried out by the completeness relation, giving
W = (OIBIO)
where
e 2 f d3k'
B = - 27T 2 .t k'2 exp (ik' oR) IXUIX2l
( 6.22)
The SUITl over polarization states A is carried out for transverse degrees of
polarization only. Then
.! <X 1 ,l(X2J. = (Xl - k' (Xl. k ')-( «2 - k' a 2 .k')
A
A, r_I
= «1-«2 - Cll.k CX2.K
t See Appendix E.
APPROXIMATION lVIETI-IODS
227
and
2 J -- d 3 k '
e '. , I ",
B == - - - exp (lk .R) (CX 1 .CX 2 - cx 1 -k tX 2 -k)
21T 2 k,2
(6.22')
The integrals are evaluated by elementary means:
f d3k' 2n 2
- exp (ik'-R) = -
k,2 R
j '" d3 k' . J
_ 2 exp (ik'-R) (cx]-k')(O:2- k ') = !. «1-\7 R d 3 k' exp (ik'.R) CX 2 -V k , 1-
k' '2 " k,2
2 A
= 7T (CXl-V It) C'l. 2 '"R
2 A A
= -- (ct 1 e a 2 - af,R cx 2 .R)
R
The integrals over k' are carried out by inserting a Hconvergence factor"
e- rxk ' and then taking the limit C1., -> 0" Hence the Breit operator becomes
e 2 A A
B = - 2R (CX 1 'CX 2 + cx1.R cx 2 .R)
(6.23)
The cOD1plete two-electron interaction is then
2
V;2 = + B
R
( 6.24)
in the Breit theory. Tn contrast to this) fron1 (6.20') we would take this
interaction to be
ikR
V2 = e 2 ( 1 - (11-(12)
R
( 6.25)
in a process involving energy transfer k. The retardation does not appear
in (6.24). Indeed, the two interactions should not be expected to give
identical results since the Breit interaction invo]ves the additional assump-
tion that V 2 JC 2 1 and is therefore not an invariant quantity under
Lorentz transformations.
To compare these two results quantitatively \\'e cousider the rnatrix
element for electron-electron scattering, using }/]2 given by (6.25) ZInc! then
by (6.24). \Ve consider a coHision process in which there are no external
fields and plane ",raves are used in zero order. The ini tial state j s described
by electron 1 (Pt, WI)' electron 2 (P2' W 2 ) where the syrn bots in parentheses
are the momenta and energies. After the scattering 'e have, for (;1ectron 1,
228
IELATIVISTIC ELECTRON THE,QRY
(p, W{) and, for electron 2 (p., W). We use conservation of energy and
momentum so that
q = PI - P = - (P2 - p)
and
k = WI - W = -(JV 2 - w;)
The cross section per unit solid angle in units of (tzjmc)2 is
27T6111 fil 2 n( H;, rV)
(J=
.hnc
( 6.26)
",here 6 is a sum over final spin orientations and an average over initial
orientations, n( fV{, V) is the density of states, and Jinc is the incident
current density. The n1atrix element for the !Y1011er interaction (6.25) is
thent
f ikR
Hfi = e 2 JaR e R exp (iq.R)[U*(pD U*(p)(l - a 1 ,a 2 ) U(Pl) U(P2)]
(6.27)
The integral in (6.27) is
I '3 R exp (ikR + iq-R) 27T J oo dR ikR ( iqIl -iqR )
cl =- e e --e
R iq 0
This is eva]uated, as usual, by inserting a convergence factor e-I.R and
taking the limit r:I. ._- 0 after integration. Then the integral is
27T ( 1 1 ) 47T
q k -t- q - k - q = q2 - k 2
and
HI; = "241Te2 k 2 [lJ*(p{) U*(p)(l - a 1 .( 2 ) [{(PI) U(P2)]
q -
(6.28)
We shall not evaluate this further, although a subsequent exanlple will
illustrate the nlanner in which this would be done. Instead, the Breit
interaction will be used to obtain a result to compare with (6.28). For
this case,
H 2 f d3R ( . R " U ' ( ' ' ) T -.' ,0 )
h = e Ii- exp HI" ){ 'I" .'l LI '\"'tP2
A A-
X [1 - {al-a2 + ((l-R (X2- R )] U(Pl) U(P2)}
t Nonnalizing in a box of unit volume. For our present purpose it is not necessary
explicitly to consider antisyn1metrization since we shall eventually compare two different
operafors which are treated in the same way.
Ll\.PPROXIM/\'TI()N METIIODS
229
The second part, involving B, ls sbnp1y done by ttjng (6.22') for B rather
than the fi)rnl displayed here. T'hen.8 contributes
2 r f d3 t f
H J ,(B) == - 3- d 3 R ex p (i q .,R) exr (.!l(eR' )
2 2 " k "f 2 ,
*"1T ...
x [U*(p) {]*(p)«(.'(laa2 - tX1-:k f tt 2 e :t{') U(P1) (I(P2)]
T'h;;; integration over R gives (27T)3 o(q + k') and
4. f) ( I )
l/ (B) ' 7 re 4J T 1'::.0;/ ' ) L ' ( ' ) ('tlt1q tt2'"Q Iv ) U( )
t Ii j = -. --:;-- u' PI I' P2 a 1 ll a: 2 -- -'--;'--- (PI P2
q q
TJ, I '" r ' f
.uut q ';= PI -. It l' l1ere ore
lJ*(p) alq U(Pl) = U*(p) aI-PI [l(Pl) - [al.p U(p;)]* lJ(PI)
=.: l]*(p){ Vl - PI) U(Pl) - l]*(p)(J¥; - PI) U(Pl)
.=' kU*(p) U(Pl)
Similarly, q = -(P2 - p) and
£J*(p;) a.q U(P2) = kU-*(P2) [J(P2)
u
Li..-,..."o3r" ( k2\ l
;-l"jJL) =--= - :::..! t: 'l'(pj) [I*(p) C( l tOt 2 - -2/ fJ(pJ) ll(P2)
4 '". \ q , ..J
1 "-. . . , id . P J 4- . p ."')., -:"I;' t., ,..\. f 2 /R W ." f ·
J u. 1 c.; t1{. i\..(. to n, ,Xl.;.1 t. 1 )" LiCm'l.lTl (0 e . c use
2 f dR ( ' ) 47Te2
e 1 -Ii: exp Iq"R = -qi-
and the total n1fu.rlx ele1nent, from (624)5 becomes
Hfi = ;:2 {U*(P) U*(p;) ( 1 + ;: - (1(1"(12) U(Pl) U(P2)} (6.29)
for the Breit interaction. ({)mparing with the T\1[0tler result (6.28)" we see
that the t\VO are identical on!y if the retardation as expressed by k is
neglected. 'This assurnption of k = 0 will be exact if p] = P. and hence
p ::::: p. For exan1plc, if r n 2 :-:.::; 0 then k = 0 only for forward scattering.
For sn1all p and p', k S jq2 <f, 1 and the t\\-'o results again become identical.
r[o obtain the cross section for electron-electron scattering we must
antisyrnmetrize the wave functions. Reults have' been given by M011er. 5
These sho\v that for st11all relative velocities the cross section given by the
two interactions agree up to order v 2 jc 2 only.
230
RELATIVISTIC EI..ECTROt 1'HEORY
Scattering of Fast Electrons by Nuclei
As an application of the Born approximation vve consider the effect of
the finite extent of the nucleus in the scattering of electrons. 8 When the
deBroglie wavelength of the electrons is of the order of nuclear di;:nensionst
we may expect destructive interference from \.vaves elnanating from
different parts of the nucleus. Essentially exact ca lculations 9 show that the
}3orn approximation gives approximately correct results except over certain
angular ranges where strong destructive interference occurs; the scattering
in Born approximation is then predicted to be smaller than the true value.
Our purpose here, however, is illustrative rather than one of obtaining
precise nun1erical cross sections.
The nucleus \vill be considered a static charge distribution with density
Zep. The cross section (6.26) for unit solid angle is
w 2 I f (r ) 1 2
a = 817 2 ZV6 d 3 r tPr N p R N 1Jl;(r) 1Jlb) I
(6.30)
where we disregard retardation in (6.20') since only elastic scattering is
considered. "Ne neglect the negligible current term in view of the non-
relativistic treatment of the nucleus. Here p(r).,,) = 1fJ:r"PJ.v replaces 1.p}2)*"P2).
The matrix element in (6.30) is
U*(p') V(p) f tPr d 3 rN p(rN) exp iq.r2
where q = p - p' is the mon1entUJTI transfer and R == r - r N' Tbe
integral is
f 3 . f eXD ( i q .R ) 3
d rN p(rN) exp (/q.rN) A R d R
= 17 f d 3 rNP(rN) exp (iq.rN) l ro (eiqR - e- iqR ) dR
zq 0
= f d 3 rNP(rN) exp (iq.rN)
where use of the convergence factor is again made. Since we assume a
spherically symmetric charge distribution p(rN), the integration over the
directions of r. N may be made to give for the integral
(47T)2 j OO sin qr
- rJ.vp(rJ-wT) _-11 dr.'V.
q2 0 qrN
t More exactly, the nuclear radius times the mornentum transfer is of order unity.
APPR()XIJ\l{ {TJOI rAEl'fIODS
'j" 1
..".)
The SUln over spiri. orientations is done by standard llleans:
6IU*(p') U(p)1 2 = 6U(p') Ug(p') Ur(p) (}f(p)
- Tr P ( n' ) ' :'J> ( )
- -\- t' i + 1".1
1 1.. (1 r
= .+ --- -, P . p )
'2 '
w 2 -- 1 -t- p2 cas l} 2 I fl ') {) )
= ----- '" = - + pu cos2-.
W W 2 \ 2
where{} is the scattering angle and we have usedp = pro Collecting results,
",re obtain for the cross section
2
(J = (JoFo
(6.31)
where
Z 2 e 4 [1 + p2 cos 2 ({j/2) ]
(J - .
o - 4 p 4 sin 4 ({)j2)
is the scattering per unit solid angle from a point nucleus and
(6.31a)
l CX) .
F 4 ' ) S I n q r lV 2
'"10 = 17 ptrN r 1y (11 IV
o qr;.V
is the forn1 factor for scat1ering fron1 the distribution p. If the nucleus has
a sharp radius '0 and a constant den5ity p = 3f4'n"rg, the forrJ factor is
(ci.31b)
3 {sin qro \
Fo == ---;;\_._-- .-- cosqroJ
(or ( ,)'"" \ ( 1 1" 0
'L j,
For qrJ.v < 1 for all rN for \\lhich p(r. y ) is appreciabe the form factor can
be expanded to give
F = 1 - - q 2 < r2T'> + q -!lr4 ) ; -. .
o 3 ! , , .Av 5! \ N .A '\
In fIrst approximation the correction for finite size of the scatterer depends
. 011 (r})AV' In aU cass Fe) .« 1, since p is normalized to unity. Noting
, that q = 2p sin !19, it is evident that Fo = 1 in the fOf"vvard direction. l'his
is expected since all waves arising frorn different parts of the s<..attering
volun1e are then in phase.
The validity of the Born apprOXinJd.tion in scattering has been invcsti-
. gated by Parzen. 10 For a central field \vith potential energy /fr) \vhich is
not singular at r = 0 (finite nuciear size) 2.nd 'lhich falls at infit1ity faster
than a Coulomb field (screening) the requiren1ent is found to he
I ' ('''' V( 1') dr l ' <{ 1
J o
232
REI.,r\TIVISTIC ELECTRON TfIE()R"-[
In contrast to the expectat10n foHowing from tbe non-relativistic treatment
it does not foHow that at sufficiently high energy the iorn approxilnation.
is valid. If V(r) violates the condition above, as jlvvnuJd for high Z., 1he
Born approximation cannot be justified. The distHlctiun in the f\VO cases
lies in the interpretation 0; the Born pan:uTIeter e 2 Z)hv "'v1'dch pproaches
zero if we superficially take v -- (X') as in tht; non-relativistic description
but approaches rxZ in actuality. f\S a rough estimate of 1 he a bove integral
we can cut off the Coulornb field at r == ltXA1 at the lo\v:r Jirnit (\vhich is
of order of the nuclear radius) and at an upper hrnitequal to (Cj{.L':-)--;"'3
which is of the order of the atornic radjus. T'hen
J I I) ( 1 ", ""y.
V(r) dr ! 2.0: rxZ In :-A;: -
which is roughly of order y.Z.
37. COMPTON S(ATT:ERING OF CII({JLARl.lY 'P\OLj\Ft!Z}D
RADIATION
Strictly spfaking, the discussion of processes invo1-'ving f.;USton and
absorption of quanta, electrrnn3gnctic or nther\vise in\i()]ve ttH: formula
tion and a p plcatjon of a the:or y .. in vvhich these fic1d: arc {Tuantizt.\.L
. ;,;
HO\''le'' e r a we L 1 a .. / f' rl lI ',..;\(i.' f'f""l p '. J 't 1 <,;';7f{ ,)....$", F:"') l '!i;-Ilt,p r'f fr'..:lpI\: 1 " t 1p n
'- " Y u .1 ,,-<.. V L... \ II.- tv.. t A .t. ......J 1, i '-l ,.A W\ L ':&',..f., ."., ,,' (..... i........" t. -\ ..1 .I '... .. 4"" _ .t_ LJ.&.
probabilj ties and cross sections 'is to a large extent 2ri :{pphc:atlcn of tht,
singJe particle theory developed in this book A an ; nu:;;rafJon f)f tbrs
fact \VC considel the (()1TIpton 'catteri1lg,} st(1rt1ng vnth the genraJ fOflnuias
of the perturbation theory in vvhich the h)\vest-order ficn..vanishing
contributions in a power series in Ct := e 2 j Ilc a re retained,> 1'his lS often an
cxceHent approximation') and this rt:m.a.rk applies to th' (o!npton scattering
in the range of energies (nuc]ear garnnla-ray encrgie5) where IT'iany pr(1ct,.cal
applications are n1ade
'The effect we shall discuss is the a nalysis of circutady poladz,ed radjation
where the application in mrnd is to nuclear gatnrna rays foUo\ving beta
decay. As is well known, the effect of non,.conser\'atiOI1 or parity jn bet1
decay results in a residual nuclear state whi<.:h is pOJarizedt /ven in a!l<y.;;ed
transitions. "Then a gamma ray ernitted fforn such a nucl:?ar tate, observed
in coincidence with the beta particle, is circularly polarized. T'o anatyz
the circular polarization COlnpton scallt:ring by polarized electrons
constitutes the 1nost practical procedure. H ,
l' In a representation in which the density matrix is diagonal th:: nuclear sur.'states
with magnetic quantum nurnber A1 can be described by a population disH ibntion
(following allowed transitions) of the forn1 of a linear function of M'.
...
APPROXIMATION METHODS
233
r-rhe influence of the Coulomb field in the Compton scattering is
negligible, and the electron can be represented by plane waves. Hence the
present consideration represents an example of the Born approximation
in both senses in which this term is ernployed. 1'he present section
constitutes a.n application of not only the Born approximation but also of
several other considerations which have been discussed in previous
chapters.
The electron is initially at rest but in a specified spin state. The photon
has initial momentum to, and after scattering its momentum is k. The
electron therefore acquires momentum p given byt
p=ko-k
( 6.32)
1'hen the final state is reached from the initial state via either of two
intermediate states:
(i) The initial photon is absorbed and the electron acquires m()mentum
p' = ko
(6.33a)
and energy
vV' ;:..-= :f: (p2 + 1) ,'i
(6.33b)
rhe electron of this energy will exjst in either of two possible spin states)
and so in summing over intermediate states ::dl four states of the given
momentum must be counted
(ii) The final photon is eJnitted first, so that 1n this intermediate state
there are two photons, vvith momentum ko and k, and an electron with
momentum
p" = .-k
(6.33c)
and energy
WI! _ :i:(p"2 + 1)Y2
( 6.33d)
Again both spIn states and hence four intermediate states are to be
included.
The final state is reached from (i) by emission of k and from (ii) by
absorption of ko In all cases it will be assuIned that no observation of the
final polarization of electron or photon is made. This, again, corresponds
to the situation most often realized in practice.
The transition probability in units with 11 = m = c = 1. is
21TP,:! :! H(k, -r') Hio(kr" -r) + I H,lke., r) (k, :J 2 (6.34)
r' i Eo - E i ii Eo - E ii
t Linear momentum conservation is a consequence of the non-vanishing of the matrix
elements; see below.
234
REI-iATIVIS'rIC ELECTRON THEORY
Tn (6.34) Pi is the density of final states per unit range of E, = W + k,
where 2 1/6
tV = (p .+ 1)" 2 > 1
j.s the final electron energy. AJso in (6.34) the matrix eIenJents Hflk, T')
aild 30 on are the plane \vave rnatrix elements for emission (with the
COITlplex conjugation) and absorption (without complex conjugation). The
arguments indicate the InOlnentum of the photon eillitted or absorbed and
the polarization of that photon. 'lye normalize the radiation fieJd so that
the energy in unit volume is k or ko- "fhen the coupling energy with the
field is ( ) 1 .'
2 /2
e k: (X.a, exp (iko'r) = e(X.A (6.35)
for the photon with energy ko, n10mentum ko. Here 3.. is a unit (complex)
polarization vector: ") - !--/'" + . .... ) (6. 1 5 ' )
a r = k; el ITe 2
"
where e 1 , e 2 , and e3 = ko form a right-handed system and T = :l: 1. For
T = -(-1 the radiation is right circularly polarized (by definition), and
T ;;:.-;; - I corresponds to left circular polarization. Also, aT' is expressed in
a sirnilar v,'ay in terms of two directions vvhich vvith .k form a right-handed
oordin(: te system. The superscript (X) on the matrix elements n:eans that
only the vector potential A in (6.35) is conjugated. Since the space."
dependent exponential factors cancel out by linear n10nlentum conserva-
tiCIL this means that only aT' viH be; conjugated. The sum over if jnlplies
that even this operation is actu.any unessential (1h == 2_ -;,,). }\. surn over
fir'ai spin states is in1plied in (0,34). Final!y the energy denorrdnators in
(6.3:4) are
En = J -t k {}
1:.7. = Vf
t
(6.36a)
( t:.. ;>$ 6 )
\.J.':; 0
E'ii = k + ko + JV"
The densIty of final states is
( 6.36(; )
dk k 2 dO
Pr =:-: dE f (2,")3
where the volume 'Of the box, in \;hich the entire system is enclosed, has
been set equal to unity. In (6.37), d!.! is the elernent of solid angle f0r the
outgoing phot or. l'rorn
( 6.37)
V11 := (k. t- k 2 - 2ko.j{ -1- l)
we nnd
Ji/k k 2 dl
Pi = - k _ ( '1... ) 3
o ..... IT
(6.37')
APPROXIMATI()N MErIIODS
235
The cross section per unit so1id angle is obtained from (6.34) by dividing
by c = 1 and hence, in units of (hfmc)2..
(J = e 4 W k 2 I Z pIIX'alp') (p'IIX',a,IO) + Z (pla:.arlp") (P"I(X'IO) \6.38)
k r' i 1 .+ ko - W ii 1 - k - W I
"fhis refers to one electron. In an aton1 of atomic number Z the electrons
scatter incoherently, and the cress section would then be multiplied by Z.
In (6.38) the plane wave states have been labeled by their momentum.
It is important to remember that the initial state labeled 0 is an igenstate
corresponding to a definite spin direction s: that is, it is an eigenstate of
(!J.s, \vhere s may be taken to be unit vector for the present. 'rhese states
were discussed in section 19. For the remaining states ..he spin rpresenta-
tion need not be specified explicitly since a sum over spin states is to be
carried out.
The sums over four intermediate states in (6.38) are carried out as
follows. In the first sum we multiply numerator and denominator by
1 + ko + W' so that the denOl11inator then becomes (1 + ko)2 - W'2 = 2ko
and is independent of the specification of the intermediate state spin and
energy sign. The factor 1 +. ko + Jill" in the numerator can be taken into
either matrix elen1ent where, multiplying the state with momentum p', it
is replaced by 1 + ko + h(k o ), where
h(k o ) = a..ko + f3
Then the first sum has the form
with
2O (piQl!P')(P'!Q210)
Ql = a..a[l +. ko + h(ko)]
Q2. = a..a r
Introducing spinor indices, this sum is
[p J;( Q 1) P;.[p'] [P'];(Q2)P' ;.,[OJ,t'
(6.39)
wherein the n10menta in square brackets are used as symbols for the
amplitudes. The only quantity involved in the sum over intermediate
states is [P']A[P'] and
[P'JA[P/];' = t5 ;,p'
i
since the states labeled by p' form a complete set of given momentum.
Hence (6.39) becomes simply
tpjQIQ210)
236
RELAT'lVISTIC ELECTRON THEORY
,
In a similar "vay, the second sum in (6.38) is evaluated by multiplying
nUJllerator and denominator bv 1 -.. k + W". The result is
.I
(j = e4 2 _I ' plf!.110) _ (PIf!210) 2
4 k r' ku k
( 6.40)
"here
£1 1 = a.a[l ko + h(ko)Ja..a r
U? = a-a r [ 1 - k - h( - k)]a..a
( 6.40a)
(6.4Gb)
Carrying out the square operation and introducing the projection operators
previously studied in section 19, vie find
e 4 W k 2
a = -- I Tr (.11 -- B - C + D)
4kg r'
( 6.41)
where
kgA = P +ltP(S)o.;
k 2.J) = P -r-112P(S)Q:
- *
kkoB = P +11P(S)!.}2
kkoC = P +f1 2 P(s)O:
Here we have already carried out the spin sum for the final electron states
by introducing the (positive) energy projection operator
p+ = (1 + ct.+ E )
(6.41a)
Compare Eq. (3.49). Vle have also introduced the projection operator
P(s) for the initial electron. This is,
P( s) == i (1. -to 0' -s )( 1 + {3)
(6.41b)
Compare Eq. (360a).
The evaluati.on of the traces in (6.41) is quite lengthy but straight-
forward. [t is facilitated by noting relations such as
a a x -- '".I a x -. 1
f" r .-- ".'- T' --
. x I" ' k '"
la r X ar' = T e a = 'T
x
;I a.a ' a. p a.a ' = -2a.K P .k
...., r r
r'
The final result is, for a single electron,12
1 2 k 2 [ ko k . 2 -Q ( 1 Q. ) (k {} k )]
(] = - r - - t - -- S1n 'U .-- 7' --- cas v s. cos + 0
2 0 k k ko .
( 6.42)
APPROXI1\1A TIO'N METH()DS
237
where cos {} = .k-k o is the scattering angle of the photon and \ve have
introduced the classical \::lectron radius ro = e 2 ,/lnc 2 = e 2 in our units. The
first three ternlS 111 (6.42) give the \'Vell-known K]ein-Nishina formula for
COlnpton scattering The last term, <;lependent on T, gives the anisotropy
due to the circ:ular polarization of the photen and the magnetization of the
electron. F'or scattering in magnetized iron, s would be replaced by its
average value so thar the anisotropic term would be of order %6 0.08.
38. SOfvll\1ERFI:J.:D...j\1AlfE APPRO'XJMATION
Plane wa"lcs for the Dirac particles have been used very frequently for
the caicuJation of a number of other etrccts. l\mong these mention rnay be
ITlade of hrernsstrah1ung,!3 external pair forn1ation,13 photoelectric effect,14
and inL;rn.al pair forn:ulticn.1 5 In most ca&e,: appreciable deviations from
thr:se Born appf(fX1IP8tiont re:sults are se;n to occur '\vhcn essentially exact
caicuiitiqr'I(;; >1",,;, r.!rr'-d oc 1 tb i\':jnc tht' exac+ calcnlt;o:<]S ar,n. vfrv
..J....... - '-,..:.,., ,,'., ,"". 1 """ L.':>.oJ I..... "" i. "'10 ....,;1 ,"w< .- w' .,. - - .... L ' J" f........ ..I. 1. .,; .J
laborious, etIl irDprOVenJcnt on the Born a pproxin/a6'1Jn is desirabJe. l"his
1S afforded bv us\ of the Sornmerfeld..Maue Vlave funct 1 ons. 1 '7 These 'wave
functio-ns \vttich h3 \/l) heen \vorked out for the (oulomb field" correspond
very closely to the \veU.»kno"vn solutions of the !lon-relativistic probIenl in
parabolic coordinates. As such, they give an approA:imate solution of the
l)irac equatton in closed form which can be used when the direction of
motion of the particle at infinity is specified. The solutions are approxi-
mate, as couhi be expected, since the Dirac equation for central fields,
unlike its Don-relativistic counterpart, is not separable in parabolic
coordinates. The solutions exhibited beloVv T have been used to obtaitl
improved 'values for bremsstrahlung and external pair production c,ross
sections 18 and pbotoelectric cross sectionsJ9
1'he starting point is the exact second-order \vave equation (4.23) which
is \vrittt:n in the forol
('q"rJ , ' Jl r '" ". v y .". j .. --; . ' J .... ..."'t
: I i " J,j .., , * . ), .-- I Ira ' ) , ' · / '"' l' jJ
\, -. - ", -- "-' jI _u j r ,..;; .r."'\!t" fir -..., 'f'
i 6.43)
where Ii ()perates orJy on V;i the r.:entral flthi potentiaL I'he terrns on the
right side of (6.43) are to be treatcd as snlaH correction terlns, and the
second terrn with V2 is regarded as a second-order terrn while the first
gives a tlrst-order correc1 ion. lIenee \V; \A/rite
'if == 'lj'o + ')l 1 'IP? -1- · · .
Then \\'ith p2 = W"z. 1 and
D = y'2 -t- p2 --- 2 VV
-r In :he narro\v sensc.
238
RELA TJVISTJC ELEC"fR01'! Tl-IfOR Y
\ve obtain the series of equations
D'ljJl = -ia'{VV)o
D1p2 = - iet-(V V)"Pl - J;"2O
etc.
( 6.44a)
(6.44b)
(6.44c)
D'lf'o = 0
OUf considera.tion is restricted to 1pO and the firstnorder correctJon (j'l' so
that only (6.44a) and (6.44b) will be con&idcred.
We recognize that the operator D is diagonaL Since for V ---)- 0 the
solution '«Po must become the free particle solutions, we write
1jJo = U(p) fer)
( 6.45)
where U(p) is the usual fourcornponent fJ>jra.c spinor \vhich gives the plane
¥laVes .in IllomentulJl space. Then fo! V -)0- 0 \Ve require ,/'0_> exp (ip"l')
To simplify (6.44b) vve set
"Pl = - [a.-eft
(6.45')
and (6.44b) becomes
Dc.p = (V' 1/) 'lpo
( 6.45")
The solution of (6.44a) for the (oulomb field is exactly the saIne as the
non-relativistic equation, except that the rest mass of the latter equation is
replaced by the moving mass. This is seen by introducing pr as a variable
in (6.44a); then the equation becomes identical with the non-relativistic
one if the coefficient cxZ f1/jp of 1/ r in the Coulolnb term is \vritten rxZ/v.
rhe solution 20 of the non-relativistic equation in parabolic coordinates
can be taken over so that
fer) = exp (ip<r) F( -11,1, u)
( 6.46)
where
u = i(pr .- p..r)
(6.46a)
and tbe cont1uent hypergeOlTIetric function can be represented by the
convergent series
n(n - 1)u 2
F ( -n 1 U ) = 1 - nu + + . . . ( 6.46b )
, , (2 !)2
with
. zw
n = -l --
P
lJ./e veri(y first thatfreduces to a plane wave for Z = O. That (6.46) is a
solution is checked by using
(6.46c)
v2j= exp(ip..r){-p 2 F -t 2ipo\7F + y 2 F}
,h}.PPRO){!lv1A TION MEI'IJOl)S
239
and
VF = i(pr - p)F"
\vhere prime means the derivau ve \\-ith respect to u. Also
y2F = div V'F = i[(1"r p).\"'j--" + pF' di v rJ
?in
== - 2 p( p - p. f')f'lI -t- -_.-- F'
r
I-lence
Df:.-;: 2p ..:xp (ip-r)[uf" + (J - Il)F' + nFJ = 0
,...
;
since the square bracket, set equal to zero, js the dilferential equation for
(he hypergeometric function,31
1he follo,ving procedure gives, in 'Jutline forr.o, th:: method of obtaining
qJ'. l.let WO and VJ be the 30Jutions of (6.44a) fot encrgies fj/ and Hi"
rcspctively. I"fhen, by n1uttiplicatior of (6.4421.) by 11' * and its counterpaft
for 'I'P!* by 'lpo and subtraction, the orthogonality relation
!It
j 1f"(W + W' - 2V)'Po d 3 r = 0
(6.47)
is obtaind. Here Gauss" theorem is used to reITIOVe an integral of
div [1p6*Vtpo - (v1p)*¥'o]. In a similar lUanneI' we demonstrate that
f \O*(W + W' - 2V)( c:p - 2 . _;'" V'iPIJ) d 3 r = 0
e; \ ... rV I
Hence c.p -- \-; 11'0/2 7 n1ust be a solution of (6.44a) since the solutions form
a complete set. 'fherefore
1
<p = - "VV;o .+ X,
2J11
and X is deterrnined so that, in the limit Z - 0, the ratio cp/lIYo -). O. The
quantity v'lpo is obta;ned fronj the above:
DX=Q
( 6.48)
Y'V;o = U(I))vj'
= i[J(p) exp (ip.r)[pF + (pr -- p)f']
(6.49)
}--;"rom (6.46b) it is seen that F' is proportional t.o n or rxZ. On the other
hand, the first term in (6.49) is of the same order asV)o' }Ienee, since this
terrn is jllst ip1.PU' we can choose X so that th.is tern) is cancelled. Hence
th.e Sornn1erfehl. Malle wav(:: function is
[ " 1
. ". 1 " . '"'If I '.,.. ,
'Jf = exp (lp. r) F + ---- a..(pr -" p)F I t) (p)
2W _J
(6.50)
240
RELATIVISTIC ELECTRON 'THEORY
,
where F is given in (6.46b). Of course, for Z = 0 they reduce to the
famiIjr plane waves. Detailed analysis of the Somlnerfeld-Maue approxi-
mation for the photoelectric etfect 19 and for bremsstrahJung 18 shows that
the expansion parameter is of order Cl.Z/ W, and hence these functions are
very well suited for high energy processes. Sommerfeld and Maue I7 have
used the wave functions to calculate the Coulomb scattering. We note
that the wave function as given in (6.50) does indeed have the asymptotic
form 21 of a scattering function,;: plane '.\lave plus outgoing spherical wave.
As an application of the result (6.50), we consider the modification of
the positive energy projection operator due to the Coulomb field. We
evaluate
[P + (Z)]rrp = ("Pq1J':)r=o
since this is the pertinent quantity for applications to beta decay where,
following standard practice, the wave functions are evaluated on the
nucleus. Then with F replaced by 1 and F' by -Jl and recalling that
n X = -n, we find
p +(Z) = [ 1 - ..!!..- a.-(pr - p)lp + [ 1 + a..(pr - P) J
2W J 2W
Keeping only first-order corrections in 11, we obtain
p +(Z) = P + - iCl.Z [P(a:-.. - (X-p) + 2ia.p X rJ
2W
(6.51)
To first order in n this is a unitary transformation on the Z = 0 projection
operator P +" For negative energy states we find
P _(Z) = p _ + ;rJ.Z [p(a-r - a-i» + 2ia.p X r]
2W
and P+(Z) + P_(Z) = 1; also P+(Z)P_(Z) = P_(Z)P+(Z) = 0, and
Tr P+(Z) = Tr P_(Z) = 2just as for Z = O. The results (6.51) and (6.51')
apply for waves which are plane waves plus outgoing waves at infinity.
For the case in which the Coulomb field produces an incoming wave at
infinity the sign of the Z-dependent terms is changed. 22
(6.51')
39. FINITE NUCLEAR SIZE EFFECTS
In this section we take up the question of corrections due to the finite
size of the nucleus. These arise in beta decay, internal conversion, electron
scattering, isotope shift, hyperfine structure, and in many other situations.
APPROXIMATION METHODS
241
"'ave Functions inside the Nucleus 23
We develop a Inethod by which the solution of the Dirac equations can
be expressed as an infinite serie of quadratures for any central field. The
method is essentially exact since it can be made to yield results of any
desired degree of accuracy. The only approximation enters in terminating
the series with a finite number of terms.
We write (5.5) in the form
dU I K
- = - - u] -t. €]2U2
d r i,e
(6.52a)
dU2 K
- = E 2J U 1 + -1l2
dr r
(6.52b)
with
£12 = W + 1 - V,
£21 = -(W - 1 - V)
'-fhen these equations can be put into the form of integral equations:
II I = r-{ C I + i r r'K E12 (r') u 2 (r') dr'J
U 2 = r" r C z + (r r' -. ""2L(r') lII(r') dr' J "'"
L .! 0
(6.53a)
(6.53b)
These equations can now be solved by iteration. We consider the case of
K <:: 0 and K ::-'> 0 separately.
For K = -k < 0 the condition of integrability requires that C 2 ;.--= o.
We write (6.53) in the form
ill == C1r k + f 1 u 2
U 2 = .f 2 u 1
where J 1 and Y2 are 1inear integral operators:
Yly = r k i Tf ,-Ic E12 (r') y(,.') dr'
f 2 y = r- k f r l""'E 21 (r') y(r') dr'
.,' 0
(6.54a)
(6.54b)
Eliminating U z ,
III = C1r ' .: + .f r f 2 u 1
or, by iteration,
or.
1I 1 = C1(1 - .f"r_yf)-lrk = C 1 ! (u1.f'2)nrk
o
(6.55)
Uo = --yf "U 1
'" ...
242
RELATIVISTIC ELECTRON l"'HEOR'"y
r;or K == k ::-> 0 the equations (6.53) beconle, with C 1 = 0,
4"
U - 't) Z '
1 - eJ" 1 't
( k rJ:
lt 2 = -'2 r + c/' 2 U l
and JfJ and Jf2 are linear int.egral operators \vhich differ fronl 51 and J 2
only in that the sign of k is changed. For this case
Xl
i ....,. C "" ( 0; ;; '1; ) n ."k
."'I ....._, 2 } [F t) .. t
.... ,""" \.(. " ....
o
U 1 = f1 U 2
(6.56)
1n each case the solutiol15 contain one arbitrary (normalization) constant,
C 1 or C 2 ) but U 1 /U 2 is uniq ueJy determined. These solutions ,\'ill be of
practical use for a1irnited range of r although, as will be Geen, they converge
for all r with potentials of interest. Fer the finite nuclear size problem the
soh;tions (6.55) and (6,56) are joined to linear combinations of regu1ar and
irre g ular Couhvrnh soJu1ions gnd the }t.1inin b o condition together with the
J
norrnaEzatioD fixes a a the censtar:ts.
\iVe ITlay first r,:cogHizc that if /'(r) is a polynon1iai with positive PO\I'lcrs
of r each tern1 in (6.55) or (6.56) is a polynonlial in r and the degree of
. r- L. . '1 ,--..P :. l' " (, , . '.," < .. . t 1 r> ;>. .' 0. . ... ,.,:>. r (:< b ' .., ]' -. V . 4- r t
f.:h.,p Su,..,...",.-LGiHt;, P0.tyHolU.i.at 1., u)C ,')(:,t h",S Hll V...aSeL ,y ... d lS a cons Lan..
over a range r <::: '0' the tern1S n the Fol: y not'1ials c:t.n be reordered to give
the series expansion. of th(: pheric(-l Bssel functions. It is hardly necessary
o show this in detail :in'2c the -Taylor expansions of U 1 and U 2 are unique.
If Jl'is bounded, as it is fOf a 'lucleus of fAnitc SIze; a:t upper !i:nit for each
tern'l and for the series is obtained by nrlacing / \"ith its 111aximum
pGsti've or ngative ",alue. The resulting series 1S, of course, again the
series of the soherical Bessel functions 'hich CO!lVergc ever y \vhere. Hence,
j
for bounded i/ r , the solutions (6.55) and (t.56) converge. For a potential
which js negative defini'ie and monotonic, such ps the Coulomb field \vith
tlnite size modification there is in the discrete spcctrurn (ij/ <: 1) one
turning point r 1 " /(rl) == w r - J, such thHt for r .< '1" E 21 < 0 "nd, for
.. ".- " 0 F p.. - ' 0 H {' b t1 r't.:\ r.( itf:t.r .. (i'
1 -> fJ' ."21''-> . . or a.1 t, t12 >. cD.....e O.J1 oJL.l.l...,S a.. e ad...' n....unt) In
the regioll r <' = 1"1 and an upper liu]ir 01" the error is obtained fro(n the
first -ern1. negJt;(;ted. !n genraj, for 1,)oundcd f,' the expansion parttrncter
is of order r{yV -- V)2 - IJr 2 , as can 1e seen by I:r']a(jn.t; C1S and €l in.
(6.54) by <;'average" values.. It will b seen that the integra.is \viU eX)Sl and
the series will converge for aU cases \Vhere1o ihl1 rv T ::-:: 0 for r -;'t. O. Hence
the Coulofnb field, as is usual_ is a speej"l case and the method does not
'work fOt fhis '''singuiar'' field.
APPROXIMATION METHODS
243
Considering tl).e leading terms, we have for K = -k,
u1(r) C1r k
C r k + 1 i 1
u 2 (r) 1 w(x)<:12(rx) dx
2k + 1 0
where
w(x) = (2k + 1)x 2k
is a normalized weight function:
[ 1 w(x) dx = 1
o
In genera], then u 1 is determined by the centrifugal term but U 2 is strongly
dependent on the potential at points between 0 and ,. For very large k the
weight" function becomes a de]ta function at x = 1, and the value of u 2 (r)
no longer depends on the details of the potential at points closer to the
origin than r. This is readily understood in terms of the repulsion of the
centrifugal terms. For K = k the functions U 1 and U 2 interchange their
roles:
U 2 ""': C 2 r k
rk+l i 1
U 1 ro..J C 2 w(x)E'21(rx) dx
2k + 1 0
For the constant density nucleus of radius '0'
}.7 = _ ::Z ( 3 _ r: )
2ro ro
The radial functions for K = k are
_ ( / r ) ' 1c+l ( r ) 2n
Ul -'- 2 an -
'0 0 r 0/
(6.57a)
with
( 'r ) kOO ( 'r ) 2n
U 2 = p- I b n -
\r" 0 '0
(6.57b)
b .- (2k + l)ao
0-
ro(W + 1) + 3Z/2
and the recurrence relations
[ rxZ ] rxZ
(2k + 2n + l)on = ro(W + 1) + 3 2" b n - '2 b n - 1
[ Z J Z
2(n + 1)b n + 1 = -- ro(W - 1) + 3""2 0'j + '2 °n-l
(6.58a)
(6.58b)
244
RELATIVISTIC ELECTRON THEORY
determine the remaining coefficients in terms of Go, which is a normalization
constant. For K = -k we interchange U 1 and U2 and change the sign of
Wand Z; see section 26. 'These wave functions have been used ,in a
number of problems. 24 For beta-particle energies three terms of the series
usually give the wave functions for all r < '0 to better than 1 percent.
Scattering Phase Shifts
Phase shifts for scattering Inay be obtained by joining the solutions
inside the nucleus (see above) to the Coulomb solutions outside the nucleus.
This procedure is sometimes laborious, and a more direct method for
obtaining the phase shifts win be of interest. Generally) the method to be
described is approximate. However, when the potential energy V deviates
from the Coulomb value over a finite distance and the solutions inside th.e
nucleus are known (see above) exactly, the method becomes exact.
The difference between the actual potential V and the Coulomb value is
denoted by i (r ) :
r(r) = V + a.Z/r
Then the radial equations (5.5) become
(6.59)
d ( Ul ) ( Ul ) ( 0 -r )( u 1 )
dr U 2 - M(r) U 2 =.y 0 U 2
where the matrix M is given by
M _ _ ( -1<./r W + 1 + ocz/r )
( 6.60')
-w + 1 - ocZjr 1<:/r
The real Coulomb (1/ = 0) radial functions multiplied by r are denoted
by Vl and V 2 for the solution regular at the origin and by VI and V2 for the
solution irregular at the origin. Then the Wronskian
(6.60)
V 1 V2 - V 2 V 1
has a constant value, as may be verified by differentiation and the use of
(5.5). We normalize the Coulomb solutions so that
V 2 V 1 - V 1 V 2 = 1
Then an integral equation equivalent to (6.60) is
tlj(r) = vlr)[ 1 - LX) v;(r') ulT') 1/"(r') dr']
- vlr) J: vir') uj(r') j'(r') dr' (6.62)
(6.61 )
APPROXIMATION METHODS
245
Repeated indices. are to be sUDlmed over the two values 1, 2. This
expression for U i can be written more compactly in terms of the Green
function matrix of (6.60):
U i = Vi - LX> Gij(r, r') uj(r') (r') dr' (6.63)
where
Gij(r, r') = vlr) v;(r'),
= vir) vier'),
r' > r
r > r'
(6.63')
The asymptotic behavior of Vi and Vi with the normalization (6.61) is
(cf. sections 32 and 34)
Vi --+ -[p/(W - l)],cos (pr + 0)
V 2 [(W - l)/p]!4 sin (pr + )
Z)l-+ [peW - 1)]!4 sin (pr + 0)
6 2 [(Tt'" - l)/pJ cos (pr + )
where is defined in (5.75). The subscripts K are omitted throughout.
Thus the solutions of section 32 have been multiplied by -1T. The
irregular solutions are obtained from these by the change of sign of y.
The asymptotic behavior of U 1 and U2 win be
U 1 -[p/(W - l)][ct?s (pr + 0) - tan sin (pr + 0)]
U 2 -+ [(W - l)/p]![sin (pr + ) + tan cos (pr + 0)]
so that is the additional phase shift produced by the deviation from the
Coulomb field. Comparing with (6.63), we find
tan A. = - LX> vj(r') ulr') (r') dr' (6.64)
When j/" -=1= 0 for, < '0 only, the integral is taken over the finite region
r < roe However, this result has the disadvantage that the solutions u;
must be normalized, and the solutions for r > '0 must be continued to
infinity in order to do this. An alternative expresion for the phase can be
obtained from (6.63) and (6.64).23,25 This is
-cot Ll =
LX> dr(r) u;(r) ulr) + LX> dr LX> dr' (r) u;(r) Gii(r, r') uir') (r')
LC)dr(r) uir) vj(r)T
(6.65)
246
RELATIVISTIC ELECtRON THE()RY
with sums over repeated indices implied. The normalization constant in
u j now cancels out. The expression (6.65) is actually stationary2.!} with
respect to first-order variations in uj(r). This is not true of (6.64). When
"f/(r) = 0 for r > ro, the solution U j for r < '0 obtained as described
above can be used to evaluate cot directly.
40. THE DIRAC: EQUA'flON AT HIGH ENERGIES
A high energy electron behaves very much like a particle with zero rest
mass. In this respect at high energies the properties of electrons are
related to those of the neutrinos to be studied in Chapter VII. Of course,
one n1ajor distinction, compared to the neutrino case, is the fact that for
an electron interactions with electromagnetic fields are possible. A second,
as will be seen, is the fundamentally different polarization possible.
We consider an electron in a central field V. Instead of the ot and {3
standard representation \\'e use 9
,
«' = Pa fl = (: J, P' = Pl = e) (6.66)
which can be obtained from the standard representation by
pJG = SP10S-1
where the unitary matrix S is
s = J2 G
1 \ 1
) = --;= (Pa + PI) X 1 2
-1 2
(6.67)
and each element appearing in the first form of S is a 2 by 2 matrix.
The wave equation written in terms of upper and lower components is
now
(o,p + V - W}'P tu = 0
(-G-p + V - W) 1p ' l = 0
( 6.68)
where the transformed wave function is
, = ( 1p IU )
1p fZ
1jJ
and where the rest mass term is neglected, It is seen that as a conseq"uence
of the representation (6.66) the upper and lower components are now
decoupled. HO\\lever, we still deal with a four-component wave function
APPROXIMATION METHODS
247
For a free particle,
G.' 1p'U = W1f"U
O'-P 1p't = - W1f'"
The plane wave solutions 1J/ = A' exp (ip-r) or
(6.69)
"P"''' = A"u exp (ip.r)
'V,tl = .A. il exp (ip'r)
are obtained with the amplitudes given by the helicity eigenvalue equations
(I I)pA ftt = A 'U
a-pAil = _A'l
These eigenvalue problclTlS were solved in Chapter I? where u.p was
diagonalized in th.e Pauli theory. From (1.33a) we deduce that
(6.70)
( e - i'P/2 cos {f l2 )
C 1 I
. f
eZ'P1 SkO v!2 t
A.' =
( ' -- e -irpl2 sin fJ/2' )
(0 ' / 2 I I
'"' elp,
\ ' cos l}j2 /
(6.71)
\vhere 0 and rp are th.e po}ar and zjrrtuth a'ragles of p. 'To under5tEud !he
significance of the constants C 1 and C 2 \ve transforITt the stallda.rd
representation wave function 1Pt.
, V'st = U trlp) exp (ipr)
\vbere, in the high energy lin1it,
1 ( ' X'm )
U 1n (p) = /,-
.y 2 .PXm
Then from (6.67) the transfor-rned wave function is
1p I = S1jJ3t, = [T tXp (ip.r)
U' "-! (0 + a"Ph m )
". 2\(1 -- a.p)xm
::::: ( p+ (p) X\
P-l p "' ) ./"J
, \ fool I
(6.72)
where p:i:(p) are the Pauli spin projection operators
248
RELATIVISTIC ELECTRON THEORY
Writing these in detail,
1 + cos {J cos t, {) /2
U' 1 sin () e itp sin {} /2 cos {} /2 eif'
1-' = - -
2 1 - cos {} sin 2 {}f2
..
- sin () e itp -sin {}J2 cos #/2 e ilfl
sin {} e- illJ sin {}12 cos f}/2 e- icp
U = ! 1 - cos {} sin 2 {}J2
-
-
2 -sin {} e- irp -sin {}f2 cos {}f2 e- illJ
1 +. cos f} cos 2 {}12
(6.73a)
(6.73b)
These amplitude spinors diagonalize {J = S@zS-l, where (f) is the spin
operator discussed in section 15. To diagonalize (f)J .ft) where ft is a unit
vector with poJar and azimuth angles {} n' qJ"", the usual procedure is
followed. The amplitudes are transformed to
A = cos n e-i"'''/2U{,;, + sin n ei'P"/2U!--2
A = -sin IJ n e -i'P,,/2 U + cos 1} n e i 'P,,/2 U:.
2 2
We now take ft = P so that A':.t.: will be amplitudes for states of positive
and negative helicity respectively. Then {} n = fJ, qJn = q;, and
e - illJ/2 cos {}/2
e illJ / 2 sin {}/2
A =
(6.74a)
o
o
o
o
A:' =
_e-,illJ/2 sin {}j2
e illJ / 2 cos 1Jj2
We see that the solutions for mean spin along xii are effectively t"'O-
component spinors with Pauli spin functions. From (6.71) these two
solutions are obtained by setting C 1 = 1, C 2 = 0 and C 1 = 0, C 2 = 1. Of
course. this result would have been predicted, since the upper and lower
(6.74b)
APPROXIMATION METHODS
249
components in (6.70) l1ave opposite signs for the eigenvalues of o-p, so that
if this operator is diagonal one of the components must be identically zero.
It is cleat that throughout this development the nODlenclature "large
components" and "small components" is no longer significant. The upper
and lower cOlnponents have the same order of fi1agnitude.
It is useful to observe that for free particles it is possible to remove the
rest mass term from the Hamiltonian in a rigorous manner.t This is done
by the (unitary) Foldy-Wouthuysen transformation discussed in section 18.
Front
fl1p = }i/1p
we transform again to 1p' by
1p' = e iU 1jJ
with the hermitian U written exactly as in (3.32). Then
with
H'1p I = W1J"
H' = eiuHe- iU
given by Eq.. (3.33). J-Iowever, this time \ve elirrlinate fJ by choosing
tan p cpjm = --mlp
(6.75)
]n this case
} "J' ex-p ( 2 + 2 ) .
=--p m
p
flere p is everywhere an operator. If we take a plane wave solution, then
p is replaced by the rrtomentum eigenvalue Consequently, with
'ljJ = A exp (ip..r)
VJ' = A' exp (ip..r)
A' =: eiHA
we have
aopA' = A'
The amplitude A' is to be distinguished from (6.71). Ifnow the representa-
tion (6.66) is used, the equations (6.70) apply rigorously to the amplitude
A" = ( 1
2 1
I ) "
A'
--1
The transformation from A to A' is made with
.. r 1 ( p ) ] r 1 ( p ) 1 Y2
e!I = I - 1 + - - pa>p' -. 1 - - J
L2 W L2 "fJ'
(6.76)
t See reference 17 of Chapter III.
250
RELATIVIS1'IC ELECTRON THEORY
The connection with the transfonnation discussed at the beginning of this
section is obvious. If W 00, the second term in (6.76) vanishes and
exp (iU) = 1 so that A' = A and tp' = 1jJ. Then only the transformation
(6.67) is needed to go from A to A tl .
It is also of interest to discuss the central field solutions. in the angular
momentum representation. These are obtained from (5.3) and (6.67).
1 \ 1 gX + if Xl!:.. Ie )
1"" = -
i K ,--
2 gX - ifX':K
( 6.77)
The total angular lTIOmentun1 is diagonal} as is its z-component:
j21p = j(j + 1)1p
J . II'JJIJ. = 11. 'l1JJl..
zr!( rTN.
and PIs has tbe eigenvalue (- y-+ 1, where fJ is given hy (6.66). Similarly
K = fJ(a-) + 1) has the eigenvalue -I<. with the same fJ.
The radial functi<:)ns for the Coulomb field are obtained from (5.76)
and (5.77) ',vith
?i" K
e - :::::; -- ----
;J ,L ictZ
y is everywhere replaced by CJ.Z and tIle factors (Jv:i: l)!- are replaced. by
W. From the radial equations in the high energy limit,
dg I( _ K + 1 + / w T/). {
- d --. gK -"Jjl(
r r
d" = -(W _ V)g" + K - 1 fIt
ar r
we see that changing the sign of K restores the equation if the replacements
f - g and g -)- f are made. Hence
I-I( = -gK'
l? - K = ffr.
(6.78)
From the asymptotic behavior given in (5.78a) and (5* 78b) it is then
evident that
(j-K = b K + 7T/2
(6.79)
a relation which is useful in the analysis of high energy scattering. 26 From
(5.75), this relation between the phases is equivalent to
'Y1 - /1I"j / - /')
.'-_1( III( - Jijk
}\PPROXIrvL TION METI-IODS
251
and the definition of e 2iYJ given above is seen to give a verification of this
result. In fact, the correction term is seen froln
exp [2i(1JK - 'YJ- K)] - ( 1 _ 2iO: )
pKI
to be con1pJetely negligible at energies of order 50 Mev. The relative signs
of''1 _I( and 1]1( were fixed by the choice ,of phase made in (6.78). ()bVlously,
f0r given j and gJ( it would be equally valid to reverse the sign of both
f __I( and g -K. The application of thse central field solutions to the high
energy scattering problem has been rnade by Yennie et a1. 9
For the extrelne relativistic Ihnit a rough approximation using the
asyrnptotic fonn of the radial funct.ions.f and g \viU often yield useful
results. This is the so...caUed C:asimir limit.
PR()BI.EMS
1. Find the cross section for lectron-electron scattering v\/'ith the ivI011er
interaction in the limit of srnaiI scattedng angles. Take one electron to be initially
at rest.
2. Show that in the Born approxu11?-tion the scatter; ng arnplitudes F and G
as defined in Stction 33 are out of phase by 7T/2 and hence that "there it) no effect
on the scattering of the initial electron polarization.
3.. Derive the expression (6.65) for the phase shifts.
4. ObtaiQ the high energy solutions for a positron. Write phase in the forrD.
of wave functions for positive and negativ helicity
5. Find the transfonTled spin operator () in the representation (666).
6. Show that for the state described by (6.7]) \vith c 1 = C 2 = } the jnean spin
along the direction of n10tion vanishes. ·
7.. In beta decay Ihe electL)Cl i emitted V\lith polarization -vie along the
direction of the momentulTI. \Vha t does this mean vvith regard to the r}ative
amplitude of positive and negative helicity states. In what way, if a!1Y \vould
the answer change if one considers the limit v c.
8. EstiITlate the order of mgnitude of the phase shifts in scattering due to the
finite size of the nucleus (1) by assunling a constant proton charge density and
(2) by using only the first terms of th. f:xpansion of the wave functions ccurring
in (6.65). COffil11ent on the validity of this approximation at high scatterIng
energies.
9. Ine!astic scattering of electrons by atolTIS is obtained H the Born approxi.,
mati on by using the matrix elenlent (6.20'), replacing Cf. for the atomic electrons
by the operator \vhich gives the nonrelativjstic current density, and sumnling
over aU bound electrons. 27 Shnw that if polarized electrons are inclasticaHy
scattered there is no scattering Hyn1.metry in the Born approximation.
252
RELATIVISTIC ELECfRON THEORY
10. Discuss the high energy approximation in which one uses
S' = ( t 1 \
'/2 \ - 1 1 )
in place of(6.67). Start with the standard representation and find the transformed
wave functions which diagonalize S'lVapS"-l.
REFERENCES
1. W. Pauli, Helv. Phys. Acta 5, 179 (1932).
2. See, for example, L. 1. Schiff, Quantuln Nfechanics, M.cGraw-Hill Book Co., New
York, 2nd ed., 1955, secti.on 8.
3. R. J. Bessey, Thesis, University of Michigan (1942)7 unpublished.
4. R. H. Good, Jr., Phys. Rev. 90, ]31 (1953); 94, 931 (1954).
5. C.. M011er, Z. Physik 70, 786 (1931); An!l. Physik 14, 531 (1932).
6. L. Rosenfeld, Z. Physik 73, 253 (1931); H'. A. Bethe and E. Fermi, Z'. Physik 77,
296 (1932); W. Heitler and L. Nordheim j J. phys. S!t 449 (1934).
7. G. Breit, Phys. RelJ. 34, 553 (1929); 36, 383 (1930); 39, 616 (1932). See also J. R.
Oppenheimer, Pllys. Rev. 35, 461 (1930).
8. See, for example, M. E. Rose, Phys. Rev 13'1 279 (1948).
9. D. R. Yennie, J). G. Ravenhall, and R N. "Vilson, Phys. Ret'. 95, 500 (1954).
10. (J. Parzen, Phys. Rev. 80, 261 (1950),
1 L H. Schopper, Nuclear Instr. 3, 158 (1958); L. Grodzins, Prog. in lVuclear Phys.
7, 163 (1959).
12. W. Franz, Ann. Physik 33, 689 (1938); F. V.I. Lipps aDd H. A. Tolhoek, .Physica 20,
85, 395 (1954).
13. \V. I-Ieitler, Quantum Theol Y o.f lfadiation, Oxford Press, 3rd ed... 1954.
14. F. Sauter) Ann. Physik 9, 217 (1931); 11.. 454 (1931).
15, I'lt E. Rose, Phys. Rev, 76, 678 (1949); 78'1 184 (1950).
16. Pbooelectric effect: tI. R. Hulme, J. McDougal, R. Buckingham, and R. Fowler,
Proc. Roy. Soc. (London) A149, 131 (1935). Internal pairs: J. C. Jager and fI. R.
Hulme, Proc. Roy. Soc. (London) A148, 708 (1935).
17. A. Sommerfeld and A. W. Maue) Ann. Physik 22s 629 (1935): see also \"1. Furry,
Phys. Rev. 46 391 (1934).
18. H. A. Bethe and L. Maximol1, Pllys. ReI). 93, 768 (1954). See also H. Davies,
H. A. Bethe, and L. Maximon, Phys. Ret'. 93, 788 (1954); H. QJsen, Phy.,:. Reo. 99,
1335 (1955).
19. H. Banerjee, Nuovo cimento 10,863 (t958). Also T. Erber, Ann. Phys. 8, 435 (1959).
20. G. Temple, Proc. ROJ. Soc. (London) i\J2t, 673 (1928); A. Sommcrfeld, Ann.
Physik 11, 257 (1931).
21. Higl(er Transcendental Functions, Bateman Manuscript Project, McGrawHill
Book Co., New York 1953 Vol. I, Chapter VI.
22. J. D. Jackson, S. B. Treiman, and H. W. "Wyld: Jr., Z. Physik 150, 640 (1958).
23. M. E. R.ose, Phys. Rev. 82, 389 (1951).
24. L. K. Acheson, Jr., Phys. Rev. 82,488 (1951); 1. A. Green and M. E, Rose.. Phys.
Rev. 110, 105 (1958); M. E. Rose and D. !(. Holmes, Phys. Rev.. 83, 190 (1951).
25. J. M. Blatt and J. D. Jackson, Phys. _Rei'. 76, 18 (1949).
26. H. J;eshbach, Phys. Rev. 84, 1206 (1951).
27. H. A:Bethe, Handbuch der Physik, XXIV/I, Julius Springer, Berlin, p. 495.
Vll.
NEUTRINO THEORY
41. FOUR-COMPONENf FORMULATION
Mass of the Neutrino
The unique position of the neutrino arises from the null character of
many of its physical attributes. That it has no charge is obvious. The
neutrino magnetic moment is either zero or so extremely small as to
preclude any likelihood of its observation. In fact, since the only inter-
action known to exist for this particle is the very small coupling leading to
decay processes, there seems to be no mechanism for providing the neutrino
with an appreciable magnetic moment. Finally, the neutrino mass is
extremely small (in units of m) and the experiments are consistent with
this mass mv being zero.
The most reliable value for the mass of the neutrino comes from the
observation of the shape of the nuclear beta spectrum near the maximum
electron energy where, if there were a non-zero neutrino rest mass, its
effect would be noticeable when the corresponding rest energy is of the
order of the neutrino kinetic energy. In order that this portion of the
spectrum constitute a non-negligible portion of the total spectrum an
emitter with a small energy release is studied. The best source for this
purpose is H3.
The effect of the possible non-zero neutrino rest mass is observed in the
alteration of the statistical factot
p"J. dp f q2 dq .5(W o - W - W y )
= dW pllV(W o - W)[(W o - W)2 - In;] (7.1)
Here q is the magnitude of the neutrino momentum and W)' is 'its total
253
254
FtEI_ATIVISTIC ELECTRON TI-IEORY
energy.. Besides this statistical fa.ctor, the spec.truln for emISSIon of
electrons is proportional to
T (1 (top + P ) (\/ 1 . { l (teq:r: Pm,, ) it. (1 )
r +. ---- ,,\.\ ,i> ';i;- \ c + l. -+ Y
\ W ,".) J \ Jf / ' 5
where
(7.2)
Q = jW(1) -,- ,1o$.M(()")
contains the nuclear ITlatrix elements. rrhe upper sign above corresponds
to e-- clnissjon accornpanied .by the enlission of an antineutrino (charge
conjugate of a positive el1ergy state) and the lO'Ner sign implies elni$ion
of a positive energy neutrino. lt this stage the distinction is purely a
formal one. The only contribution to (7.2) arising frorll nI, is
rn" 'Ir R1t(1 -J- ) 81l*ll + ,' )
WW. t', Ys I , h
tlo\vever, {l as well as 5.1* commutes 'Nlth 1 + Y5 while
fJ(l + Y5) ::: (1 yJP
He-nee we obt:lin the trace of a Inalrix containing (1 + r::)( 1 -- ?5) as a
fae-toe and this product is identically zero. If the (1 -1- Ys) factors i.n (7.2)
are repl.ced by ]. + €?/f» the resuH of (2) gives a ternl in 1Jl p proportional
!) 1 -. t. 2 and experilTlcntaHy 1 -- {. -< 1. Consequently, the altered form
of the statistic[tl \veight is he on1y effect \vhih needs to be considered.
1'he experirnental re5uI t 1 is rnJ < 10 -3.
Since this n1ass is so sOla1l and rnlY \vel] be /:ero:, Vle shaH take rnv = 0
in 'rVhat follo,vs. In this connectlon it hould be realized that ail Inasurable
quantities are continuous in the 1irnlt In y = 0, and so far as experIments
are conce'rned there (,eerns to be no prospect of distinguishing between
theories \vith zero and non-zero but very Sl113.11 rest rnass of the neutrino.
Neutrino Helicity
'Nith l1'lp = 0 the theory deveJoped in section 40 for high energy electrons
holds rigorously for the neutrino. t]sing the fh1tation q for nl0mcntun1
and q > 0 for the energy, we "tNritc
rJ. · \/ 1,p ==
(} '<p
--- - -_....
at
(7.J)
and with the p]ane v\'ave soJu1tons
1jJ =-= A exp [i( q.r -- qt)]
(7.4)
we find
"''' A
a.q /1. = ","1
(7.5)
NEUTRINO TI-IEORY
255
where the unit vector q is
q = q/q
In the representation (6.66) ",'herein
tt= ( a 0 )
o ---0;
we obtain, as before, the general solution for positive energies,
A = ( cIA + )
c,>A _/
...
(7.6)
where
( e - i1fJ/2 cos {}/2\
A-f'- = )
e'ltp/2 sin 1..9>/2
= ( ' --- e -- i<r/? sin {j /2 )
A__
iq;/2 cos 012
and fJ, cp are the polar and azimuth angles of q. In (7.6), Cl and C2 are
constants which may be subject to the normaJizing condit.ion
(7.7a)
(7. 7b )
Ic l l 2 + jC212 = 1
and A+, A_ are positive and negative helicity solutions:
a.q A:t = ::I:: A :1: (7.8)
We now introduce two projection operators which have the property
of selecting one or the other of the two eigenstates of the so-called chiralityt
operator YS' These projection operators are
Wi: = i(l :i: Ys)
(7.9 )
where, clearly,
(1)+ + OJ_ = 1
2
(J).:i:: = ({):i:
w+w_ = W__({)+ = 0
The states !1) i: 1/1 will be denoted by 4>-=F where the reason for the inversion
of indices (::I::) will be clear presently; th us
c/; t = (J):t 'lp
= B:;:. exp [i(qr -- qt)J
B:r: = (0::t A
t The chirality transforrn. of tp is j I 5V1. 2
(7.10)
256
RELATIVISTIC ELECTRON THEORY
Then, from (7.5),
0'-4 B 1= = !(G'.4 =F (t.q)A
= l(T 1 - Ys)«.4 A
= T 1(1 :i: Y5)A
= =FB+
(7.11 )
Hence B+, B_ are the amplitudes of the positive, negative helicity states
respectively and t(l ::I:: Ys) tnay be regarded as helicity projection operators:
positive helicity (w_) and negative helicity «(0+).
In detail, we see that in the representation (6.66) used here
( 0 -1 )
Ys = S -1 ° S-1
where S == S-l is given by (6.67). Thus
( -1 0 )
Y5 = 0 1
and
w+ = ( ),
w_ = ( )
(7.12)
lienee, choosing C 1 and C 2 in (7.6) to obtain normalized functions,
B+ = (A O +)' B_ = COJ
(7.12')
as was expected.
Charge Conjugate States
The charge conjugate state to 1Jl is
1pc = Cx1px
where the charge conjugation matrix C is most easily obtained from (4.83)
with Co = ifJrJ..2 in the standard 'representation. Then
( 0 -0"2 )
C = SXCOS- 1 = i = -Co
0"2 0
The charge conjugate solution is then
(7.13)
1pC == A exp [ -i(q-r - qt)]
l\1EUTRINO THEORY
257
wher
A C = eX AX
Since CI is real, we find from (7.5) that
Cetx.q C- 1 A c = A C
or since C-l = C and
0'2 a X a 2 = -(1
the eigenvalue equation for AC beCOTI1eS
a.q A C = A C
just as in (7.5). In fact, direct application of (7.13) to (7.6) gives
A C == _._ ( C:A+ )
c 1 A_,
which differs from A given by (7.6) in only a trivial way. For 1pc it is
apparent tht
(7.14)
(7.15)
01pC
a:.Y1pc = _ _
dt
which should be compared with (7.3). It should be emphasized that the
charge conjugate solutions are not connected to a reversal of sign of
electromagnetic coupHngs, which, of course, are absent, but rather to the
existence of negative energy state solutions. As before, the antiparticle
represented by 1pc is to be interpreted, according to the hole theory, as a
vacancy in the other\vise filled negative energy states.
From the fact that (7.15) and (7.S) have the same form, it appears that
. (J) :l:Ac are eigenfunctions of a-q with eigenvalue + 1 (for a)_AC) and -1
(for w+AC). However, the charge conjugate of the positive and negative
heIicity states have amplitudes B and B respectively. Thus
B = !(1 - y)AC
= !(1 -b ys)A C = (u+A c = -cB_
which is a negative helicity state. Sirnilarly,
.8:' = }-(1 + Y5)4C
= !(l - ys)A C = cu_A c = -c:B+
which is a positive heJicity state In fact, we verify from (7.12) that
B c - B
+ - - -
B == - B +
(7.16)
258
REI-JATIVISTIC ELECTRON l'HEOR Y
This illustrates a general rule which has aJready been seen in operation in
the discussion of the spin operator j n section 17: the particle and anti..
particle states which are charge conjugate to each other have the opposite
sign of the helicity. For example, if by some mechanistn a particle is
emitted into a mixture of states such that the average longitudinal polariza-
tion (or heJicity) is positive, then in the process \vhere the charge conjugate
particles are emitted the corresponding antiparticle is JongitudinaHy
polarized with the opposite sign relative fo its direction of motion. A case
ill point is e* emission in beta decay where e-- is accompanied by emission
into one type of neutrino state, e t by neutrinos in the charge conjugate
states. By dfinition, the latter are caUed neutrinos (v) and the former are
the chargeconjugate antineutrinos (v).
In the description of processes; S\Hh as beta decay, in terLf1S .of a four..
eomRonent theory of the neutrino it h; to be expected a priori that the
neutrino polarization will not be compJete in general. The four states have
the interpretation given in preceding dis<Hssion. Even jf e- emission is
accompanied by v only, both spin (helicity) states are available for v and
the polarization of ji depends on the relative number of decays into the
tNO helicity eigenstates. It is primarily in just this respect that the two-
component theory of the neutrino, to be described ifl the next se,ction,
introduces sOInething new.
42.. THE TWO...COMPONEN'f TI-IEORY
The Weyl Equation
Shortly after the introduction by .Dirac of the relativistic theory of the
electron, a two-con1ponent theory Vias proposed by Wey13 for the massless
parti(;leM The c,ase of zero mass and no couplings. with external fields is
described in the Dirac theory by the wave equation (7.3). Since here there
are only three anticommuting matrices, there is the possibility of identifying
the u in (7.3) vlith the three Pauli spin n1atrices. This identification leaves
a sign an1biguous: ct ---)-- ::to'. The wave equation would therefore be
equivalent to two equations, and the wave function, which is designated
by cp, is a two-component spinor. Thus we write
ia-V' CiS = i orp
at
(7.17)
Originally, this Weyl forrn of the theory \vas discarded on the groJnds that
it did not give a space-reflection invariant equation. Under the in1petus
of the discovery of parity nOfi-(,OnServatioll in weak intera(;tions the
NEUTRINO THEORY
259
two-component theory was revived,4-6 because it ,vas then evident that the
former objection was not actually cogent. At this juncture, it should be
emphasized that the properties of weak interactions and the decay processes
induced by them are not consequences of the properties of the free neutrino
but are instead consequences of the nature of the we"ak interaction itself.
This becomes evident when it is recalled that a four-component conven..
tional Dirac neutrino does not preclude asymmetries of the type attributed
to violation of parity conservation. The validity of a two-component
neutrino theory must be decided on grounds independent of the existence
of non-conservation of parity in beta decay. Of course, as already
emphasized, the predicted magnitude of the asymmetries depends on
whether or not the two-component theory is valid.
It is clear that the two equations (7.16) are not enough because from
four cOlnponents we can obtain two components by a projection, for
instance with tel - Y5). The complementary projection (1 + Y5) yields
another two-component spinor.t I'hus, if we take; the complex conjugate
of (7.16) and introduce
1>C = cXrj>x
we see that
a .J.. c
CXC;XC-lX.V c = J..-
at
If the two-component matrix fulfills eX = :f:c, as we shall assume, and if
caXc- 1 = -a
(7.18)
as will be required, then we obtain
· T7.J..C . ocP c
-la. v 'jJ = -
ot
(7.19)
The two functions cp and <pc, each of which is a two-component spinor,
replace the four-componpnt solutions previously studied.
In order to interpret this result we introduce plane waves
= a exp (i(q.r - qt)]
where I ql = q and then (7.16) gives
"-
a.q a = -a
(7.20)
and, from (7.18),
a.q aO = aD (7.21)
The implication of these results follows. The plane wave state 4> has only
one possible spin state, and this corresponds to negative helicity, that is,
t Of course, these are four-component functions with either the upper or lower
components zero. However, these are equivalent to a two-component wave function.
260
RELATIVISTIC ELECTRON THEORY
momentum and spin antiparallel. For cpo there is similarly only one spin
state, and this corresponds to parallel spin and momentum or positive
helicity. Thus one of 4> and tpc arises from the negative energy states and
the other from the positive energy states. According to our identification,
which is a matter of convention only, cp corresponds to the neutrino in a
positive energy state; 4>0 to the antineutrino. . This convention is in accord
with the practice of calling the light neutral particle in e- emission the
antineutrino, because according to this rule 'V is left-handed (G-4 has
eigenvalue -1) and;; is right-handed (0'-4 has eigenvalue +1). Thus the
reduction of the number of independent states of given momentum from
four to two is a reflection of the fact that for each type of state only one
and not two spin states arise. The neutrino polarization on this theory is
t,hen always complete, either :I: 1 along (}. Note that the relation between
spin and momentum (and therefore the helicity) is the same for a negative
energy state particle as fot the "hole" in these states.
Relation to the Majorana Tbeory7
We now show that the two-component theory is a projection of the
Dirac theory with the additional condition thatt
1p = tpc
(7.22)
This condition that 'fJJ be self charge conjugate leads to a theory originally
proposed by) Majorana. What is discussed here is the unquantized or
c-number form of the Majorana theory. The relation between the
Majorana and two-component theories has been discussed by Serpe,s
McLennan,9 and Case. 10
In the representation (6.66) the charge conjugation matrix is given in
(7.13). Hence with
"PI
= :: == (:')
"P4
we see that (7.22) requires
i ( -(]2CPX ) = ( q/ )
(J2'P'X cp
t It is important to distinguish between the condition of self charge conjugate solu-
tions as applied to VJ where it is pertinent and as applied to a projected wave function
of the fonn (:) or (:) where it is not pertinent.
NEUTRINO THEORY
261
or
'I/J -- 'I"x
rl - - r4 ,
VJ2 = tp:
Consequently,
q:/ = ( "PI ) = ( -: ) = -iG 2 f{Jx
'lfJ2 'fJJa
This, it will be seen, has the form
q/ = cXq;X
where c = - i0'2 has the propertyt
caXc- 1 = -a
x - -1 *
C = C = -c = -c =-c
.
(7.18')
Hence the self charge conjugate 1p is
= (-i:2X) = (;C)
(7.23 )
and
The wave equation (7.3) now becomes
a.V = o
at,
OfPc
o..Vcpc = .- -
at
(7.17 ')
(7.19')
in agreement with (7.17) and (7.19) when we identify q; '\lith rp. Since
(7 .19') fallows from (7.17') by taking the complex conjugate, it is clear that
the four Dirac equations are equivalent to two equations.
If the projections with the heHcity operators are now constructed, we
find that
_ = W + Y5) = (0 )
Jp
(7.24)
and
+ = l(t _ Y5) = (:C)
(7.24')
are two-component neutrino functions for which the following auxiliary
conditions obviously hold:
Y 111 = " 'P
5,- -
(7.25)
Y 5 1 jJ + = - 'ljJ +
(7.26 )
t The replacement of c by c- 1 in (7.18) is irrelevant since by any choice of phase
c = :1:.:- 1 .
262
RELATIVISTIC ELECTRON THEORY
Here y = 1 is used. It is of interest to note that the conditions (7.25)
and (7.26) are consistent only with zero neutrino rest mass. Thus, from
"If a1p + k '10 = 0
/lJ a VT
xp.
where kv is the reciprocal Compton wavelength of the neutrino, we have,
by multiplication on the left by }Is,
01p
--Y}LY5 + kvys'tfJ = 0
dX Jt
since Y/l and Y5 anticommute. From (7.25) or (7.26) this equatio..t can be
consistent on]y if kv = 0 or the neutrino mass lTIUst vanish. lI The converse,
that mv = 0 implies (7.25) or (7.26), is not a va1id conclusion.
It will be recognized that the conditions (7.25) and (7.26) applied to a
four-component wave function as described by (7.3) lead to the two..
component description as given, for example, by (7.17) or (7.19). This is
another way of saying that the two-component theory is a projection of
the usual four-component fornlalism.
For th p]ane wave solutions the explicit form of the amplitudes has
already been given. For the positive helicity solution "P+ this amplitude is
B+ defined by (7.12') and (7.7a). For 1fJ- we use B__ defined in (7.12')
and (7 .7b).
The complete set of projection operators for the two-component
neutrino is two in number and not four, since there are only two states of
given n10n1entum. The new projection, operators are then
p=F = U):iP(V
'Nhere P, on the right side, refers to the four-component solutions with
zero rest mass. We see that
p:t = !( 1 ::I:: a-q)({):+
as was to be expected.
Covariance of the Theory
We now turn to the question of the Lorentz covariance of the theory.
For the most part it is sufficient to consider only the negative helicity
solution, since the corresponding results for the positive helicity state is
obtained by conjugation. 'Then we can discuss the properties of either "P_
ar (p. Tn the forn1cr case a would be the set of 4 by 4 matrices which are
the direct products of unity in Dirac space and the Pauli G. In the latter
NEUTRINO THEORY
263
case a is the set of 2 by 2 matrices. We discuss the equations first. in
two-component forrrt. Then (7.17) is written
a . orp ::.= 0
jJ ox
J1.
(7.27)
where
(]4 = - i
The hermitian conjugate of (7.27) reads
am*
....:!.- (j = 0
ax p. p.
since X 4 = it is pure imaginary and a4 = -(14' Then, in the usual manner,
we multiply (7.27) by cp* on the left and (7.27') by f/J on the right, and,
after adding,
(7.27')
a
- cp*a p.C(J = 0
oXJl
Thus we obtain a continuity equation. That
(7.28)
. *
J il = -qJ apcp
(7.29)
is a four-vector will be justified be]ow. The space and time parts are
j = - rp*ocp
(7 .29a)
(7 .29b)
. . *
1)4 ::-:::: p = q; f{J
In ternlS of 1Jl- the current density would be -1p:' GVl- and, recalling (7.25),
this is the san1e as 'lfJ CX'lp_. The positive definite p defines a normalization
according to
f p tfx = 1
and, if j/t is a four-vector, this integral is a relativistic invariant according
to arguments previously adduced.
For the Lorentz transforn1ations we first consider the proper rotations
,
xp. = a JIVX"
Then
a m' ( x' ) = 0
Jt , T
vXp
becomes
a Jl.vCf,. q/(x') = 0
ax y
264
RELA'flVIS'fIC ELECTRON THEOR'Y
Setting
cp'(x') = Ar{x)
we obtain a covariant result if
a jlvO'", == .l\ * (T j\
(7.30)
(7.30')
This differs from the defining equation (2..60b). Nevertheless we can see
that the A obtained from (7 30') is the same as the 111atrix for the continuous
transformations discussed in Chapte II, section 14. rro see this, consider
a rotation around a direction fi through an angle 8. Let the vector Oft be
denoted by D. Then ta ke 12
- A = exp (in-aI2) (7.31)
as in section 14. To verify the correctness of this.. we calculate the right
side of (7.30') first for a space rotation: ail! = a 41 = 0, a 4t = 1.
A *aiA ---: (cos !O - ;fi.a sin to) Gi(cos 10 -t- lOna sin iO)
= a i cos 2 iO + n.e ai'iu sin 2 i8 + i cos !8 sin to (Gin-a -- Deer O'i)
The last factor in parentheses is
<1 i i\.o - D$(T Ui = 2ia$(ei X 1\) = 2i€kim Olflm(/k
=..": 2iEkimrl mak
."
where e i is a unit vector in the direction of the ith coordinate axis and €klm
is the antisymmetric unit tensor of third rank. Also
Roa fJifi.a = [Di -t- ia o { 11 X e i ) ]f1ot G
= nin.." - a-(n X e t ) X fi
= 2n.ii-o - cr
l
Hence (7.30') becomes
aijC1 j = C1 i + nift.O'(l - cas 0) + €ikmnm.(1k sin ()
If we multiply this by (jz and evaluate the trace of the resulting equation,
we obtain
ail = oa cos (J -- ninll 1- cos ()) -t €amiim sin (j
(7.32 )
That this gives just the expected result can be verified easily. For example,
if Ii is along the z-axis we find
cos () sin fJ 0 :\
- sin () {;OS e 0
a=
0 ('J 1 !
\,1
0 0 0
NEUTRINO THEORY
265
Since there is nothing special about the z-axis, this constitutes a sufficient
proof that (7.32) decribes a three-dimensional rotation.
For a translation in the direction v with uniform velocity v the vector n
becomes
n = it arctanh vJe
and that this leads to the appropriate Lorentz matrix a may be left as an
exercise for the reader.
It is of interest to examine the properties of n under three-space rotations
If the rotation is described by the 140rentz matrix a whose elements are Oil'
then
a.. (j. = A *a. A
'l.1 J
where, under the rotation, "p(x) -+ tjj(x) = A 1p(x). Then
tp'(x') = exp (in-oJ2)VJ(x) = A1p(x)
and
ip'( x') = . f\. 1p'( x')
ip'(x') = A A1p(x)
-= A AA-lip(x)
== exp (;0-0/2) ip(x)
In the case of the rotation considered here A -I = A *; that is, A is unitary.
Hence
Consequently,
exp (iii-a/2) = A exp {in-a/2) A *
( , - - )
= exp 2 Ao-aA *
Since
we find
AO'k A * = (jiaik
fi i = a ik 1'lk
Thus n behaves like any three-space vector under rotations. The trans-
formation law (7.30') is supplemented by
p*'(x') = <p*(x)A*
Then we readily see that
j(x') = -<p*(x)A*opA <p(x)
= Qllvjv(x)
justifying the statement that the jfC are the components of a four-vector.
266
RELATIVISTIC ELECTRON THEORY
Turning to the improper Lorentz tratlsformations, we see that under
either
1
Xi = -Xi'
,
X 4 = X 4
or
,
Xi = Xi'
,
X 4 = - x 4
that is, space or time reflections, the wave equation (7.27) would acquire a
relative sign change between the space and ti111e derivative terms. Starting
with
i
a '
(J (x') = 0
Jl ax'
Jl
a linear transformation of the type
tp'(x') = Aq:>(x)
would require that
A -laA = -a
which is impossible, since there is no 2 by 2 matrix which anticommutes
with all three a i . Therefore we consider the antilinear transformation and
set
<p'(x') = [A<p(x)]X
(7.33)
Then we find
A -la:A = -a k
From this we can conclude that A is equal to a 2 , within a phase. We write
q/(x') = E(ia2<P(x))X
where ;(12 is real but E may not be; however, lei = 1.
(7.34)
Two-Component Neutriao in Beta Decay
We can discuss the problem of covariants by considering the important
question of beta decay. The interaction density may be tentatively written
Hp = '1 gl1Jl:, !}i1Jln)('P:' 11 i 1JJ-)
i
(7.35)
for negative electron emission. The positron emission would arise from
H/J*. Here the annihilated neutrino is represented by 1p_. In (7.35) the gi
are the coupling constants a\nd i runs over the five possible interaction
types; see section 14. The ,Qi are the corresponding operators formed
from products of the Dirac y's, and a contraction between nucleon and
lepton bilinear covariant quantities is tacitly assumed in (7.35) and in the
following variants of the interaction. All this follows standard procedure.
NEUTRINO THEORY
267
We shall consider the behavior of Hp under space reflection. Then, for
the 1pP" 1p II' and "Pe we can write (see section 25)
Vl(x') = 17).fJ1Jl).(x)
where.lt stands for p, 11, or e and r;;. is a phase: 1r;;.1 = 1. For "P- we must use
1Jl'-(x') = ( 0 )
<p' (x')
From (7.24) and (7.34), the transform of "P- under space reflection is
1Jl'-(x') = ( 0 ) = €1Jlv(x)
Eia 2 qJX( x).
Consequently, under space reflection Hp transforms into the right side of
(7.36) :
H'p L g{rJ'Y} n'YJ: E ( 1Jl:, f3[J. i f31p n)( 1Jl:, fJOi''Pv)
i
= L gir;'YJ nr;€( Vlp, 12 i 1Jl n)( 1Jl:, o'i(J1p..,)
i
(7.36)
since Qi and {3 either commute or anticommute: {JOif3 = :i:Qi and this
operation has been carried out twice in (7.36). It is clear that there is no
choice of phases which will make the right side of (7.36) the same as Hp.
Alternatively, we can consider the interaction
H " - X X ( * {\. '\ ( * {". Ii ....
{J - k gi'Y}p'J'}n'YJe€ "P'P' l.irpnJ 1.pe' l.JifJ"P"')
i
(7.37)
so that under space reflection IIp H;. To complete our consideration
of transformation we examine the form of Hi under the space reflection.
Then
( 0 ) ( 0 )
Vl'(x') = = E'x = - eXw_(x)
v, i0'2CP'X(X') (i0'2)2qJ(x) .
and
H'P - L gi( r;:TJ n'Y})2EXE( 1p:, Qi1Jl n)( VJ:, o {if -)
i
This is the same as H; if we choose -e X e('YJ:'YJn17;)2 = 1, which is al'.vays
possible. Consequently,
H fJ = Hft + Hp
is invariant under the space reflection transformation. Either H; or Hi
alone is not. Experiment requires that the coefficients of IIp and Hi in H
cannot be equat In fact, if the present experimental data are interpreted
as a demonstration of maximum parity breakdown, thn HfJ is identical
268
RELATIVISTIC ELECTRON THEORY
wih Hfi or with Hi. The choice of HfJ = Hp depends on the measurement
of the neutrino helicity.13 Thus we write
H fJ = H = ! ! gi( 1p:, ililp n)( 11':, !li( 1 + )'5)1J')
i
where tp is the four-component neutrino state. Since
fJ1pv = "p+
where p is given by (6.66), it follows that under space reflection HfJ changes
into a form corresponding to the opposite helicity assignment for the
neutrino.
In a linear combination of H/J and H; we would find that the neutrino
",'ave function enters as
![gi(l + 1'5) + g;(l - Ys)]1p = W'f/J
where gi and g are the coupling coefficients in H; and H; respectively.
'Comparing with the notation of section 21,
gi -t. g; = 2gC i
gi - g; = 2gC;
It is evident that we are now describing the neutrino as a superposition of
negative helicity and positive helicity states. If we take wtp as above, for
example, the am.plitude of the negative helicity state is gi and for the
positive helicity state it is g:. Hence the average neut.rino polarization for
a particular interaction of type i would be
f?lJ = ICi - en! - ICi + C;I
v ICi - C:1 2 + ICi + C i l 2
_ 2ReC i C;X
- --
Ic i t 2 + IC12
(7.38)
For C i = C; this would give complete poJarization anti parallel to the
momentum, and the beta interaction would contain the factor 1 + 1'5
. operating on the four-component neutrino, as compared, to the parity-
conserving interaction where the corresponding factor is 1. i'his, as we
have already mentioned, is the formulation of the interaction according to
the t\\'o-component neutrino theory and, as is evident, this fornl of the
the<?ry of the weak interactions implies maximum polarizations of the
emittd particles. As can be seen without great difficulty, it also implies
maXImum anisotropy in angular distributions where such anisotropies
arise from parity violation of HfJ; see section 21.
NEUTRINO THEORY
269
We note that the condition C"i = C; appropriate to the two-component
neutrino theory is equivalent to f = 1 in the discussion of section 21. This
was seen to correspond to maximum breakdown of parity conservation in
the beta interaction, as the foregoing remarks would imply.
Angular Momentum Representation
l"he wave equations (7 17') and (7.19') are readily solved in the angular
momentum representation. Considering the latter equation, we write for a
stationary state
0'- v"1 cpt := i q gl
(7.39)
Since j2 and j still commute \vith a.V while 1 2 does not, the solutions will
still contain x/ KG However, a,,1 does not commute with (J-V, as the results
::t:
of section 12 demonstrate. Nor does the .operation of space inversion
multiplied by any 2 by 2 tnatrix conlmute with a-\7q That is,
71 a-v .= -Ta-'V 1
fJ
and there is no 2 by 2 matrix 7' which anticcHnmutes with fJ. lienee the
olutions will not have a definite padt)' and '#iH be linear combinations of
.x' '(. This means that K no longer represents a constant of the motion, as
is e\ ident ffonl the fact that (K, tJ.. \7) * O. 1'his again shows the intimate
:;onnection be!!een the .K operator and parity.
Set
( mC" ) JI == gX ' l + I . (,y Ii-
...,. K > K ;),,,--K
(7.40)
Compare .E.q. (0.77). Then, since
I a 1 ,.
Go V = (Jr \ - - - G-I )
fJr r
and
a-I X = -(K ..... l)X
rl 'V /J == _ 1l Jl
utA.1( lv-!\.
we find the follo\,ving differential equations for the radial functions:
d g I< -1- 1
-5... = qf -- ---- g
dr r-
(7.41)
dJ K -. t
--- = ---- f - qg
dr r
These are exactly the sanle as (5.4) with V = 0 and the rest mass term
(there represented by ::i: 1 added to W - J/) set equal to zero. Of course,
270
RELATIVISTIC ELECTRON THEOR"Y
W = q in our present notation. l'he solution, regular at r = 0, as
comparison with (5.12) shows, is
g = jz(qr)
f = S/Cjl(qr)
(7 42)
The results for <p are obtained by charge conjugation in exactly the same
way as in section 26.
We observe that the wave functions for .1( = l and K = -k are not
linearly independent. Instead,
(cpC)k = i( q/))
Hence, as compared to a four-component description, there are half as
many states in the present theory. This corresponds to the reduction of
the number of plane wave states by one-half which, as \ve have seen, is a
consequence of the unique helicity of the two-component neutrino.
It is seen that the solution (7.40) is obtained froln the four-conlponent
solution 1p, as given for instance by (5.3), by application of ,,/2('o_S,
where S in turn is given by (6.67) The ren1aining projection is
rp = '2w+S1p = gX - ifxK
( '7 , .. ..., ' )
" " . l.t -'
and the linearly independent solutions (7.40), (7.43) may be compared
with (6.77). Two-cornponent plane waves expanded as a superposition of
these spherical waves can be \vritten down at once by applying the operators
(JJ:f::.S to each member of the four-component expansion given in section 27.
PROBLEMS
1. Assume that for the massless neutrino there n1ay be a magnetic moment
different from zero. If the interaction of this monlent with an electromagnetic
fiel is written as Pauli has suggested, the wave equation is
(y p iJ: t , + ;"Y p Y v Fpv)'P = 0
Nhere A is proportional to the n1agnetic moment and Ff.H' is the electromagnetic
field tensor as defined in Eq. (4.5). Show that in the t\vo-component theory
A. = 0 is a necessary condition, so that the neutrino cannot have a magnetic
moment if this theory is accepted.
2. Investigate the effect of replacing each wave function, in the beta inter...
action terms HjJ and H'/J as defined in (7.35) and (7.37), by the charge conjugate
functions. Is either Hp or Hp invariant under such an operation? What linear
combinations of HjJ and Hi are invariant?
NEUTRINO THEORY
271
3.. In \-vhat sense does the \Veyl equation (7.17) con1e in conflict with covariance
under improper l,orentz transformations'? .&
4. What is the effect of Inaking the space retlcction transformation a.nd re-
placing cp vvith q;C in the Weyl equation?
5. What meaning, if any, can be given to the follovving statement? A particle
observed in a Inirror is equivalent to the corresponding antiparticle.
6. Co 60 decays with negative eJectrons which are observed to be preferentially
en1itted antiparal1el to an externally applied magnetic fieJd. If the same type
of observation is inade with CO fi 8 \vhich errlits positrons, what should be the
observed directi0n of preferential Clllission?
7. If the electrons en1itted in bet:l dcay are described \N'Eth a V - 11.A inter-
action, implying that C.'A = -).Cv, v"hat .sign should the polarization of these
en1itted electrons have? .A.nsv/er the same question for positron enlission.
8. In the decay of a !h n1cson an electron and t\VO neutrinos are emitted One
may atternpt to describe this process by the tensor coupling
lfdeeay ,........ (pyt.i)J (;Y f3V',)(lp;i' :.',Y f;()' 13 11 ' p,)
\vhere ¥1'V = ::l:;J 5 ?p,.. is the t\,vo-e-olnponent neutrino wave function. Show that
this Hrlecay vanishes identically. If "/lJ.;J{J is replaced by Yc>: and then, in an alter..
native form for [{decay, by Y(y'?'5' ho\l\ that the latter t\VO aHernative forn1s of
Hdecay are identical and do not necessarily vanish,
9. If the fl., meson decay rcfern.d to ill problem 8 is described by
H deca'/ ,-..! (1P;JI t1f.}p 1")( VJy 4 0 'f/J,,)
what are the properties of H iiC[i,Y for fl =: y (f.}' {J? Comp_i.re the results for n = Yex
and 12 =- ')'C(Y p . In all cases assurne a tv/o COlnpO!:ent neutrino theory.
10. 'The proce.ss of double beta (fJ--) decay is the transformation of a nucleus
with Z protons and IV neutrons to one \vith Z + 2 protons and lV - 2 neutrons
with the emission of two electrons. Alternative descriptions of this process
are (1) no neutrinos:
III -t- n2 -+ Pi -f- P2 + e 1 + e;
(2) (anti) neutrinos emitted:
n 1 +. n2 -+ PI .t- P2 +. e 1 + e 2 + 'i\ +. V2
Sho\v that in a two-COITlpOnent description of the neutrinos process (1) is
irrlpossible.
11. In the classical beta interaction one can replace each field 'l.J' by the pro-
jection w+1f'. The beta interaction for fermions 0, b, c, d is then
H{3 :== Cx [(w+yP)*y 4f2.<yw+"Pb] [(co+1pC) *y 4 Q X w +1pd] .,- h.c.
...1:"
where Ox = 1, Y It' i 1 pY4J (p, ¥- v), ?J /t/'5' )J 5 for "'¥ = S, V, T, A, P respectively.
Show tllat only the Vand A interactions actually QCCur in H fJ and that the others
"" ill vanish identically.
"Ii 2. Show by direct application of the charge conjugation operation that the
two-component neutrino wave functions given in (7.40) and (7.43) are charge
conjugates of each other.
272
RELATIVISTIC ELECTR()N THEORY
REFERENC:S
1. Dft R. Hamilton, W. P. Alford, and L. Gross) Phys. Rev. 92, 1521 (1953).
2. See S. Watanabe, Phys. Rev. 106, 1306 (1957); E. C. G. Sudarshan and R. E..
Marshak, Phy.. Rev. 109, 1860 (1958)
3. Ii. Weyl, Z. Physik 56, 330 (1929).
4. L. Landau Nuclear Phys. 3, 127 (1957).
5. A. Salam, Nuovo cinlento 5, 299 (1957).
6. T. D, Lee and C. N. Yang, Phys. Rev. lOS, 1671 (1957).
7. E. Majorana) Nuovo cimeno 14, 171 (1937)
8. J. Serpe, Physica 18, 295 (1952).
9. J. A. McLennan, Jr., Phys. Rev. 106, 821 (1957).
10. K. M. Case, Phys. Rev. 107, 307 (1957).
11. Cf. W. Pauli, Nuovo clmenlo 6, 204 (1957).
12. C. L. Hammer and R. H. Good, Jr., Phys. Rev. 108, 882 (1957).
13. M. Goldhaber, L. GI'odzins and A. W. Sunyar, Phys. Rev. 109, 1015 (1958).
APPENDIX A.
NOTATION
In the chapters of this book certain notations have been introduced.
Although these are generally explained at the point of introduction, they
are summarized here for convenience. Much of the notation used here is
fairly standard and, as a rule, wherever a commonty accepted set of symbols
occurs in the literature it has been adopted here.. However, it is certainly
fair to say that in many cases there does not exist a notation which is
comtnon to more than two or three authors. 'While the following remarks
are not intended as a glossary of symbols used in the text, they are intended
as an explanation of some genera] characteristics of the notation used.
In the first chapter a notation for a unit vector is introduced. and this is
followed throughout the book. This consists of a (circumflex accent) n1ark
placed above the vector syrnbo1. Boldface symbols as used here refer to
three-component vectors, either operators or vectors with con1ponents
which are numbers. Generally, a boldface synlbol appearing as a square
is a scalar: for instance, f = j; + j; + j; is an operator which is to be
distinguished fromj2) which is a number related to the eigenvalue [that is,
j(j + 1)] of jt. Where it is convenient to use the same symbol for an
operator as for its eigenvalue, the two are usually distinguished by the use
of an arrow written above the operator. In gen.eral, the direction of the
arrow indicates the direction in which the operator acts. The only exception
to this rule occurs where no confusion is occasioned by the omission of
the arrow.
In connection with matrices and wave functions (spinors) \vith more
than one component the following notations are used:
(*) hermitian conjugate: transpose and complex conjugate
(X) complex conjugate
(f'../) transpose
( -1) inverse
273
274
RELATIVISTIC ELECTRON TI-IEORY
In the case "of conjugation and inverse operations the syn1bol, here placed
in parentheses, appears as usual as a superscript to the right. Of course,
complex conjugation and the inverse occur in connection with ordinary
numbers (1 by 1 quantities) and operators \vhich are 1 by 1 in their matrix
structure. An additional syrnboi appearing as a superscript, is c, which
connotes charge conjugation.
Spino!" indices generally appear as (Greek) subscripts. W"hen an
additional subscript is needed to distinguish between two or more matrices
(for example) parentheses are used for greater clarity. Thus a /.tv is an
element of a 1natrix a and it is associated with the p,tll row and vth column;
similarly, the corresponding elernent of the matrix Cl. k is (<X':).u v . Indices such
as k in th.e last exan1ple often refer to the three cartesian directions.
Synl bols x, y, z and indices 1, 2, 3 occur frequently and are interchangeable.
That is, (Xl' , tXa and (I.;)], Cl.. 1I 'J OC z are notations for the same set of three
quantities (matrices). Si111ilarly, Xl' x 2 , x 3 is used interchangeably with
x, y, z.
The sum of the diagonal elements of a matrix A is denoted by Tr A,
rneaning Htrace of A." For a matrix element of an operator Q between
states 'ljJn and "Pm anyone of the following equivalent notations is used:
('lpn,01p111) = (1fJ:D.1Jlm)
= (VJnl o lVJm)
= (nIQI,n)
Of course, the parentheses 'in the second form of the matrix element are
not really necessary 'Nhen only a spinor index summation is involved.
Diagonal matrix elements, or expectation values, are often designated with
angular brackets:
('lpn' D."Pn) = (Q)
Whether a matrix element involves only spinor index summation or
integration over space coordinates as weJl should be clear from the context.
\Vhere a labeling of a wave function (or spinol" amplitude) with eigen-
values of diagonal operators occurs, these eigenvalues usually appear as
subscripts or superscripts or both, bu sometimes as arguments wherever a
simplification of notation is thereby elfected. When the angular momentum
representation occurs, the general practice is to write the total angular
mon1entum as a subscript and the eigenvalue of the z-projection of angular
momentum as a superscript. The only excptions are: the pure spin Pauli
eigenfunctions always refer to angular lTIOmentum i and therefore this
subscript is omitted; the spin-angular functions in a central field carry a
subscript K which gives the total angular momentum j and the parity as well.
In many parts of the book it is cumbersome to carry along constants
APPENDIX A
275
m, c, 11 as well as e. It is customary in the description of the relativistic
electron to use units such that m = c = Ii = 1, and for definiteness m is
the electron mass. Then e 2 = <X 1/137. When two particles of different
n1asses are involved in the sanle discussion, as in the decay of the meson,
one can either set 11 = 1 and then the electron mass is m/p, 1}207 or
one can take m = 1 and the mu meson mass is 111m 207. If m is the
unit mass, the units of some other quantities of interest are
energy W: me 2
momentum p: me
length r: liln'lc
time t: 11/ mc 2
wave number k: mc/Ii
After each quantity there appears a symbol which is frequently used for it.
To conver(any result expressed in these rational reativistic units to ordinary
units the energy symbol W should be replaced by Jt"/mc 2 , p by p/mc, and
so foth. Then the syrnbols JV, p, . . are the energy, momentum, ., . . in
ordinary units. It is obvious that this process automatically introduces an
appropriate combination of m, c, and Ii so that correct dimensions appear
in each equation. For instance, a cross section will be replaced byaj(Jijmc)2
and a transition probability w = l/,T will be replaced by wiijn1c 2 . To
introduce ordinary units in a wave function it is only necessary to specify
the normalization. For example, if 1p is a bound state so that
r-
J d 3 x'IjJ* 'IjJ = 1
then 1J' in rational relativistic units is replaced by (mcjli)-%1p in ordinary
units. Conversion to atomic units is easiI)' carried out by noting that in
the latter system m = Ii = e = 1, so that c = l/ex. For example) the unit
of energy in this systen1 is rz 2 mc 2 , which is devoid of c as it should be.
In the discussion of Lorentz transformations the four space-time
coordinates are generally denoted by x p with Greek indices always having
the range 1 to 4. Here X 4 = fet. Latin letters range from 1 to 3 and refer
to the space part of four-vectors, for example. The dumrny index ruJe
involving summation over repeated indices is used rather frequently,
although where greater clarity is achieved by explicit use of the summation
sign this additional symbol appears.
The finite number of characters available in the alphabets which are
more or less common knowledget has unavoidably led to duplication
wherein a single letter is used in more than one context. An attempt has
been made to void such duplication in a single chapter, and there seems
to be no reason why confusion should result.
t And readily available from the printer. i
API)ENDIX B.
LORENTZ TR..\NSFORMATIONS
For convenience the relevant facts concerning Lorentz transformations
are summarized in this appendix. As indicated in Appendix A,
;r 1 , x 2 , x 3 ; x 4 = X, 1/, z; ict = it. Then one considers all the transforn1ations
which leave dxfl. dx,., invariant. These can be written in the form
x = a 1t VXV -I- b
(B.l)
where b is a constant four-vector corresponding to a space-time displace-
ment. The transformations indicated in (B.l) constitute the inhomogeneous
Lorentz group. "Ve shall be primarily interested in the homogeneous
subgroup, b = O. Then
I
X J.l = a pv x "
(BJ ')
The requiren1cnt that
dx dx = dXjl dx p
implies that the 4 by 4 matrix a is orthogonal:
a P\l a p.p = 6 vp
a fJVa..tv = JlA
and henee a = a- 1 " Therefore
det a = :I:: 1
(B.2)
(B.3)
The transfortnations with det a = + 1 constitute the subgroup of the
proper l...orentz transformations. Those with det a = -1 are the improper
transformations. The latter include
(a) space retlection a ik = -ik' a44 = 1, a j4 = a 4i = 0
(b) time reflection a ik = b ik , a 44 = -1, a j4 = a 4j = 0
and any product of a proper transformation \vith space or time reflection.
Combined space-tirne reflection gives a = - 1 and is a proper l,orentz
TI6
APPENDIX B
277
transforrnation. But, since it is not obtained in a continuous way from the
identity a = 1, it is properly conidered in a separate category. The proper
Lorentz transforrrlations with tl 4 4 ::.> 0 constitute a subgroup of trans-
formations continuous with the identity. They are four-space rotations.
This subgroup is composed of three-space rotations in \vhich a 44 := ],
Q4i = Qi4 = 0 and of translations with uniform velocity along some
direction as wen as products of these two.
For a uniform translation with velocity v along the xl-axis,
0 0 iv/c
0 1 0 0
a(e1v) = 0 0 1 0 (B.4)
- ivlc 0 0
= (1 - v 2 jc 2 )-IA = (1 - v2)-A
so that the components of th four-vector X/i perpendicular to v do not
change In (B.4) e 1 is a unit vector along the xl-axis. This is easily
generalized to a transfornlation with arbitrary but uniform velocity v..
We have
r = rev V + V X (r )( v)
(B.5)
where the first ternl is parallel to v and the second is perpendicular to v.
Similarly,
r' = r'et v + v X (r X v)
= (r .+ iC- 1 vx 4 ).y v +, v X (r X it)
where the first tern1 is transformed according to (B.4). Using (B.S) again,
this becoJtles
r' = r + ( - l)r-v v + iC-l'Tx4
and, by an obvious generalization of. (B.4),
x = (- ic-1v.r -1- x 4 )
1"herefore the elements of a(v) are
(B.6)
(B.7)
aik = ik + ( - l)v i v k
a k-A = --- Q41c = i i;v k / C
(B.7').
a 4 4 =
In the discussion of spin-orbit coupling (section 7) it was necessary to
consider two Lorentz transformations: a( -v) followed by a(v + u),
278
RELATIVISTIC ELEC1"RON THEORY
where u = a dt was an infinitesimal velocity. From (B.7') it follows thAt,
to first order in u,
a(v + u) = a(v) + b(u)
where
- l r 1 3 2C 2 ) V.U ]
b ik = -"2 "2 ViV k + UiV k + ViU 'c
v 2 L - 1 t' C
b k4 = -b 4k = (;2 : v k + Uk)
b 44 = 3 u.v
c 2
(B.8)
Thus
a(v + u) a( -v) = 1 + b(u) a( --v) = 1 .+- a'
and
, - 1. ( "
a 'l :=: ... U . v. ._ U. V 1,. )
. 2 fC 1, I> ,
v'"
, ,. -1 [ +-( 1: 1) "''' ]
a k4 = -a 4k = lC !;- Uk - VkV.U
(B.9)
a = 0
Thus ] + 0' is the Lorentz matrix for two infinitesimal transforlnations,
which obviously ommute, and these are an infinitesimal translation with
a velocity given by
[u + ( - l)v-uv]
(B.1 0)
and an infinit.esimal rotation with the rotation axis and angle given by
- 1 . ,
- (v X OJ
v 2
(B.l1)
The angular velocity of the precession arising from (J 11), with u = a dt, is
-. 1 . 1
w = . tv X a ) - \ (v X a\
2 \ --- 2 2 "
V C
where the last expression applies as an approximation for V 2 /C 2 1..
APPENDIX C.
TIME-DEPENDENT OPERATORS
For any operator Q(O) in the time-independent (Schrodinger) .representa-
tion used throughout most of this book, a titDe-dependent operator Q(t)
(Heisenberg representation) can be defined by
J 1j!"'(t)O(O) 1j!'(t) d 3 x = J 1j! * (0) O(t) '1"(0) d 3 x (C.1)
where the prinle is' used to distinguih two \vave functions, belonging to
different energies for example.
Since
1p(t) = e- iHt / 1t 1jJ(O)
'P' (t) = e - ill t 1 ft1p' (0)
it is seen that
Q(t) = eiHt/1tQ(O)e-iHtlh
(C.2)
From (C.2) it follows that (00-/01 = 0):
dfl = i. (HQ - flH)
dt Ii
(C.3)
where, on the right, Q(t) is meant. Hence, for any operator D. commuting
with H, df2/dt = O. This is always applicable ,,'hen Q = H. For Dirac
plane waves fl = Ii is time-independent.
As an example we consider the operator for the velocity. T'his is
x =:: ! (fIx - xli) = CeL
Ii
(C.4)
for the Dirac particle. Equation (C.4) is also appl.lable in the presence o
279
280
RELATIVISTIC ELECTRON THEORY
interactions which are not momentum-dependent. For the acceleration,
assuming free particles,
1 .. . i (H H) 2 i ( -+ H)
-x=a=- a,-(I =- Cp-(I
c n n
2i
= Ii (Ha. - cp)
The last equation can be integrated since Hand p are constants in time.
Thus
«(t) = tIoH-le2iHt/A + cH-lji (C.5)
where CXo is a time-independent operator. Hence the velocity operator is
composed of two parts. One, the first erm in (C.5), is an oscillatory term
with frequency 2mc 2 /1i, and the second is the usual constant term with
eigenvalue c 2 pJpo. The physical interpretation of the oscillatory part is
given in section 18. FrOIn (C.5) another integration gives x(t), which will
clearly have three terms: an oscillatory term with frequency 2mc 2 //1, a
term increasing linearly with t, viz., c 2 pt/po, and a constant term.
APPENDIX D.
AN ALTERNATIVE APPROACH
TO THE DIRAC MATRICESt
In the discussion of the Lorentz covariance of the equation
( Yfl 1- + ko ) "P(X) = 0
oXp.
given in section 14 the 'Ytl. are recognized to be a given set of matrices
introduced as a device for writing four equations in one. Therefore they
do not change under the Lorentz transformation;
(0.1)
,
X 11. = a IJVXV
(D.2)
Nevertheless, with each Lorentz transformation there is associated a linear
transformation of the ')lIt which was written in Eq. (2..60b) in the form
, A -I A
Yll = 'Yp. = a/JvYv
(D.3)
This can be regarded as a linear (vector) transformation law. With it is
associated a transformation law of the other Dirac matrices. For instance,
sincea 3 = - iYl"2,
G = -iyy; = -iA- 1 y 1 AA-I y2 A
= - ia 1 /Ja 2v Y I&Y"
For a space rotation in which Qi4 = 04i = 0 it follows that
I
(1'3 = a 1k a 21 €klra(]m
(D.4)
where - iYkYz = €kl;fl,(f m - i lk has been used. Here €klm = Elm is the
antisymn1etric third-rank unit tensor introduced in section 4.
t The material in Appendix D follows the development given by H. Feshbach and
F. Villars, Revs. Mod. Phys. 30, 24 (1958).
281
282
RELATIVISTIC ELECTRON THEORY
The transformation. (D.3) is engendered by the coordinate four-rotation
(D.2). It should not be confused \vith a function space transformation in
which
'fJJ'(x) = S1p( x)
The transfornlation (D.S) changes (D. I) into
( SYvS-1-J- + ko ) 1p'(x) = 0
ax y
while (D.2) and (D.3) with 1p'(x') := A1p(x) changes (D. I) into
( aV/lY + ko ) 'lp'(x') = 0
-iJx y
Returning to (D.4), it is clear that if we introduce an antisymmetric
tensor f/ tl" whose space part is
(D.5)
:7'ik = Eikl(]Z
(D.6a)
then
[/k = ailakm//zm
(D.6b)
Equation (D.6a) introduces a three-vector whose components are
('p'
r:.7 23 = aI'
[/31 = (12'
9'12 :.= 0"3
and (D.6b) is the generalization of (D4) to which it reduces \vhen i = 1,
k = 2.
The approach of Feshbach and Villars is to start with the particle in the
rest frame where the Pauli theory applies. Here the spin matrices are just
the lY i and the commutation rules are
(Jja k = i€ikl(]Z
(j :# k)
(D.7a)
or
a j = iE jkZ(]l(Jk
(D. 7b )
In addition,
cr; = 1 for all' i (D.7c)
The relativistic description is no,\' obtained by Inaking a Lorentz transfor-
mation to the system in which the velocity is given by vie = plmc = cpJpo
where in ordinary units the total energy Po = mc2 = lnc 2 (1 + p2Jm 2 «2)'A.
To do this we recognize that 9ik is the space-space part of an anti-
symmetric four-tensor Y 1J. The space-tiIne part will be denoted by a.
That is,
Y' 4 = 0("
Y4 = Y ii = 0
Although we show that these 'Xi have all the properties of the Dirac OC i ,
the similarity of notation should not be taken as an implication that this
APPENDIX D
283
has been proved. We must regard the (1.,i as unknown for the moment.
Under the Lorentz transformation f/!-t v g''V, where
9'v = a fJAayp!/' lp
For instance, a Lorentz transformation in the Xl-X, plane, see Eq. (B.4)
of Appendix B, gives
I
0"1 = <1 1
O' = ( 0'2 + i!? (la )
\ c
a = (O'a - i (l2 )
(D.8a)
and
,
(Xl = <Xl
(l = ( (l2 + i aa)
= ((la- i 0'2)
Similar relations follow for transformations in the X 2 -X 4 and x a -x 4 plane.
The relations (D.7) must be valid in the primed system. Hence it is
concluded that
(D.8b)
O"l'i = <XjO":;
(D.9)
and
0" i = i€ jkl(J.ZCt.k
(D.lO)
To see this evaluate (1(1 from CD.8a). 1"hs is
r. 2 ]
,,? IV V
0"20"3 = " l <12(1s + - (o;S0'3 - (]'2 OC 2) + '2 (1.,3(1.,2
C C
Equating -2a(/ to i(1 - V2/(2)(j = i-2(Jl and comparing coefficients of
vIe gives an identity for the coefficient of (vlc)O,
<XaO"a = <122
for the coefficient of vie, and
(J I = i CXa(Xz
for the coefficient of V 2 /C 2 . Then (0.9) and (0.10) are the self...evident
generalizations of these results. In a similar fashion 0'2 = 1 gives
V v 2 v 2
1 + i - (a 2' (XS) + - -- (X = 1 - -
C c 2 c 2
284
RELATIVISTIC ELECTRON THEORY
or
C1 2 Cl 3 + CY..3C12 = 0
(X = 1
These immediately generalize to
O'i(Xk + (Xl/ Y i = 0
ex; = 1
(i =1= k)
(0.11)
(D.12)
It is now possible to deduce the commutation rules of the (Xi which fixes
them within a linear (unitary) transformation. Thus, from (0.10) and
(D.7b), it follows that
(Ji(fk = € ikZElnmanrf.. m
for j =1= k. Since €tnm is antisymmetric in nand m and (fj(Jk =1= 0, it follows
that (Xn(Xm must b antisymmetric in nand m also. Therefore
(X nrL m + CY..mrL n = 0
(m -::/:; n)
and from (D.12) we conclude that
(J..n fX 1n + r:t.mfX n = 20 nm
(0.13).
in general. These are, of course, the commutation rules of the Dirac
«-matrices.
The discussion is not yet complete because there are four fundamental
Dirac matrices and we have obtained only three of them.. To remedy this
omission we define four matrices Y It by
(I'll' :/ aP) = Y p.!/ ap - !7 rxp'Y p. = 2i( b ItPY a - 0 #laY p)
(D.14)
In the rest frame this becomes a well-known relation where, with indices
running from 1 to 3, Yk are the components of a vector in three-space.
To obtain the properties of y p, we first evaluate
(/, 11' sP;v) = !/' Ilv(r /1' Y II v) + (y It' [/ /1v)/7 p.v (D. IS)
where now no sum over f.l is implied. But !7v = 1 for all p, =1= 11 and O.for
fL = v. Therefore the commutator in (0.15) is zero. From (D,J4) we find
(I' p' 5P ,..,,) = - 2iy_ for !l =F 'V and (0.15) becomes
- 2i(/7 /1 vI'" + Yv Y Jlv) = 0
(Y!l' g' Wt') + = 0
(0.16)
where again no sum on It is implied.
For the commutator of y and Y)rx the identity
(t'; YaP) = YDt(Ya, Y/ ap )+ - (rex, Y(JfJ)+Ya.
APPENDIX D
285
holds. Again there is no sum on repeated indices. Using (D.16), it follows
that
(y, !/ a.p) = 0
(no sum on ex)
(D .17)
This shows that 1'; must be a scalar So since it commutes with all rotations.
Therefore, for all p"
y = So (0.18)
From (I' J.4 f/ p,,) = - 2iy" we obtain by multiplication with Yv on the left
- 2iy Jl'Y" = 1';9' Jl" - Y JlY Jl"Y Jl
and by multiplication with I' p on the right
-2iy"y JL = Y p,!7 /lvY Il - 51' /lvY;
Therefore, by (D.17),
(no sum)
(no sum)
(Yll'YV)+=O
p,=/=v
so that
YjJY" + YVYJL = 28 0 Jl" (all fl, v)
The final step is to show that So commutes with all Y p. This is trivial
since So = Y; (any ft). Then
(So, Y Jl) == (1';, I' Jl) = 0
Consequently by Schur's lemma we can set So equal to the unit matrix and
there exist four matrices I' p for which
Y Ill'" + Y,,'Y Jl = 2d JlV
This completes the chain of reasoning which leads, via the Lorentz
transformation, from the non-relativistic Pauli representation of the spin
to the relativistic Dirac representation. A wave equation can then be
obtained by contracting I' p ith the four-vector a/axp, and this operator
acting on 1p must be a scalar times 1p. This leads to Eq. (2.24), where ko is
identified (up to a sign) by the requirement that one obtain the Klein-
Gordon equation as a second-order equation.
The agreement with the transformation properties already deduced in
Chapter II is to be noted. Thus, ipy p1p is a four-vector, ipr:J.l'P = ip[/i41JJ is
the space-time component of an antisymmetric four-tensor whose space-
space components are ipY jk 1p. Here a minus sign is not uniquely defined
because the con1mutation rules are unchanged if all (Xi are replaced by - r:J.ie
APPENDIX E.
RETARDED EIJECTROMAGNETIC
INT}RA errI ON
en section 36, Eq. (6.20'), \ve obtained a result according t.o which tile
interaction between two electrons in a transition with energy transfer k is
e ikR
e 2 (1 - (Xl-a 2 ) ...:-. (E.t)
R
where (Xl and a2 are the Dirac rnatrices for electrons 1 and 2 and R i5 the
distance between then1. 'The present purpose is to clarify this result by
sho"ving in greater detail the assumptions which are at the basis of (E. I)..
The problern is described in terms of t\VO electrons which, in zero order
in a perturbation theoretic sense, are completely decoupled. The overall
process is one in ,'\, hich electron 1 is initially in the ground state of zero '
energy (since an additive constaOnt to the energy is of no re]evance) and
electron 2 is in a state of energy W. -In the final tate electron 2 has zero
energy and electron 1 is in a state of energy E. 'The question at issue is 10
determine the transition probability per unit tirne It \vill not be necessary
to complicate matters by taking the exclusion principle into account in an
explicit way until the end of the calculation. Each electron is coupled to
the electromagnetic field with a coupling energy He- iwt + H"*e- iwt , where
H = e( a-A - <1». The transition is to be picturd as a two-step process
in which a quantum is first emitted and then absorbed so that there 'are
two intermediate states: TIle first, for which the probability anlplitude is
a, is reached by the ernission of a quantun1 of frequency w, or 'Nave vector
Ii. (k = co), polarization specified by an index A, and electron 2 is in the
ground state. From this intermediate state the final state (an1plitude at)
is reached by absorption of this quantum by electron 1 which goes to
energy state E. The alternative path consists of an en1ission of a quantum
specified by k, A with electron 1 in state E forming an intermediate state
with amplitude a! and then the absorption of this quantum by electron 2
286
APPENDIX E
287
'lhich thereby proceeds to the ground state. FIgure £.1 illustrates these
transitions in a sche1natic way. The symbols HVWl(W) which appear in
this figure have the ITleaning of absorption n1atrix elerrlents of H for
tl ---
w
-- -.q ... ...
2
w --.
/
a
/
./
,
I
I
I
"'VV\AIa- W E _...... _--
--:-°f 1 r
(2) X
A }1 0W (w) i (1 L
I H EO \w)
o I 2 I) _..1-_-.,
2
2
w ·
INITIAL
STAT£:
o.
!
\
\ E _n"'--
a' t
\0 _ J W)
- (J f --......
'11---,------
I
I
, "'
E-r-
,
I J {Z) (
I I /O f V.L:)
,
t 2
o ---...----..-
,
0- ..
iNTE RM EDtATE
S;,ATES
F!NAL
S T f.J. -; E
Fljg. F.l. Diagracfl sho\viog transitions ioyoJvt::d in the eJe.!.:.(.ronag.n.elic .\nteraction
between t'NO chrged particles. The relevant transittons. are labeh:d \vJth thf; 8ppropriatc ,,.
matrix eJements \vhi.ch art defined in the text. Tile an1pHtude of the vDricus states
a .. I <;.1 d
ppear as ai, a" a , ,..n 21.
frequency (J) by electron. (n) going frorn energy I/r/ to H7" (if cot1.rse,
}{V1 (co) is the elniion rl1atrlx elemen.L Clearly,
'f l (-YI)X - '\ H ( n) ( ' )
1: r, v '; "';u I -- (V, = W TA: v co
. 2 f.'y l' /' ' 2 y 1
Wifh tl i the initial state proba.bility anp1itude and 'with the expansion
\J}I (t) (t) 1..I'" , ( \l"fJ' I 1 1 [Of'" t ' + 'Hf.' /1\
I: tot =,;: a i " r I (). J} -:t. ._- a x " ) D,' a f ( ,tV')
the equation& of notiol1 are
. .. Dr. Lj't2) + ' [I ll) ,
la i = ,'., a i . /.:..i. C10Jpa oW FlG a
(E.2a)
lid = wa + HWa; + f dE aiE) lI<Jx
(E .2b)
288
RELATIVISTIC ELECTRON THEORY
iii' = (co + E + W)a' + HIj;Xai + f dE al(E) L HX (E.2e)
iO t = Eat + aH + a'H (E.2d)
Throughout, the k sign is meant to include a sum over all frequencies,
directions of propagation, and polarization. The latter inc1udes the
longitudinal components of the electromagnetic field; see below. The
argument of all matrix elements is w. To write these explicitly we first
remember that in zero order the wave functions in initial and final states
are simply products of wave functions of each particle We write the
stationary state wave functions as "PI and 1J1i for electron 1 and 4>1 and i
for electron 2. For the space part of the vector potential we '\Trite
A = (27T/W)IA. a ). exp (it-r)
(E .3)
where the constant factor (27T/ro)!4 is chosen so that the total energy in the
radiation field is (0; that is,t
f d 3 r( 8.8 x + .,*,X) = co
The con1plete set of poJarization vectors is aI' a 2 , and as, where these are
unit vectors ¥.:ith aa along k and a 1 and a 2 are perpendicular to k.. These
vectors satisfy;
a..t.a = GAl'
(E.4)
For the longitudinal field A = 3 and we note that, from the Lorentz
condition <I> = 0 for A = 1, 2 and for A = 3,
<I> = as exp (ik-r) = exp (ik-r)
since the time dependence is always exp ( - imt). Then
HIj; = f:t e f tPrl 'P:(a..a;. - J.3) exp (ik.r 1 ) 'Pi
1I = e:f ef tPr 2 CP:(a..a;. - ;'3)exp(ik.r2) cP;
(E.5a)
(E.5b)
t HereC = -oAlat -- V«>, = curl A, where A and (f> areconplexfields. The fields
are normalized in a box of volume V, and then V is set equal to 1 since it cancels from
Dnal results; see M. E. Rose, Multipole Fields, John Wiley and Sons, New Y{)I'k, 1955,
p. 42.
; For example, with k along tbe z-axis vie can choose 83 = e, 81 = 2-(e + tell)'
a z = 2-(e - ie tl ).
APPENDIX E
289
The solution of the equations of motion which gives at to second order
in the matrix elements is
a. = e- ffVt
(E.6a)
H (2) X
OJV ( -iwt -iWt )
a= e-e
(0- W
(E. 6b)
. H(l)X
a' = EO e-iWt[e-i(w+E)t - 1]
w+E
(E. 6c)
H (1) H (2)X [e i (E- U'')t 1 e i(E-co)t 1J
'" EO Of-V - -
at = £", -
W-w E-W E-w
+ H1tHriX [ ei(E-fV)t - 1 _ e-i(JV+w)t - 1 J
,k (E.6d)
1 (J) + E E - W W + OJ
Only the first term in each square bracket in (E.6d) gives energy conserva-
tio\lt and a result for the total number of transitions increasing linearly
with time. Retaining only these terms, the transition probability per unit
ti me is
!!.. J dE 1 a J I2 = 21TIH fi'= TV
dt
where
H ' = I [ H!JM w )H'H - w) + Hh1H w )H( - W) ] (E.7)
I W - OJ + h} v w + W - i'YJ
We have here replaced w by (JJ - i'YJ since, as will be evident very soon,
this prescription for perforn1ing the sunl on ro gives outgoing waves.
The sum is now replaced in the usual way by a sum on A and by an
integration in k-space. Thus
2 = ( 27T}-a J d 3 k I
).
= (271')-3 J J d1lw 2 dw t
The,n in .the integration on w the integrand of the second tern1 in (E.7)
becomes identical with the integrand of the first term if we replace 0) by
-(J). The limits of the second term are thereby - 00 to O. Hence the two
t See, for example, L. 1. Schiff, Quantu1n Mechanics, McGraw...Hill Book Co.,
New York, 1955, pp. 201-202#
290
RELATIVISTIC ELECTRON THEORY
terms combine to the integral of the first over (J) from - 00 to 00.' The
nlatrix element Hli, is then
e 2 J (JJ dw f 3 * ( ) ( .
Hfi = 2 . .k d rl'lJlf al-a,,\ - U)'3 exp ,k.r l ) V'i
4-n- W - OJ + l'YJ l
X f d 3 r z <p:( (%2oa - 15;'3) exp ( - ik o r 2 ) <Pi
For A = 3 the cross-terms in the vector and scalar potential are simply
evaluated by using
A = VX,
<P = iroX
where
x = - .!.. exp (ik-r)
w
and noting that
a-A = avx = -i(X, Ho)
where Ho is the zero-order Hamiltonian: a-p + f3 + V ext , "where V ext
may include a nuclear Coulomb field for exan1ple. Then using
I-Il)1jJi = H2).p! = 0
H6 1 )1pj = W'lJlI'
J!2}i = W 4>i
we find
H . = f co dw dO. ff d 3 r d3,.. * A.. *
11, 4 2 W + . A- 1 2"P f 'fJ f
7T - W l1'] X
X [(%loa.1. (%2oa + (1 - 2: ) 15.t3 ] exp (ik o R)"I'i<P.
where R = f 1 - f 2 .
The sum over A is evaluated with
! (ll-a l a2-a = (l1- I - cx 2 = Ct 1 -cx 2
).
where
I = ! a;.a
).
is the unit dyadic. Therefore
H . = e 2 f w dw d!}
11. 471"2 W - w + ir;
X f f d 3 r1 d 3 r 2 "1':<p:[ (%1 0 (%2 + 1 - 2: J exp (ik.R)"I'i<Pi
APPENDJX E
291
Then the integration over the directions of k is done by (OJ = k)
J ds:! exp (ikoR) = .2TT (e ik1i __ e- ikR )
I OJ R
so that
2 d
H - 3-'- J (I)
Ii - . .
211"1 W - 0) + rr;
X f J d 3 r 1 d 3 r 2 ¥';.p; [(Xl 0 01:2. + 1
- 2 t V ] (eikR _ e - ikl)R - 1. W ...J).
. . 'f'?
OJ
Finally, integrating over (V, the path is closed in the upper half-plane for
the ternl e ikR so the pole at (lJ = W + iYj is encircled. For the term-e-ikR
the path must be closed in the lower haJf-plane and no pole is enclosed.
Hence onJy the outgoing \\'ave part contributes and, evaluating the residue
and then taking the Iiolit 1J 0, we obtain
, 11> I ' iTVIl
q 3 3 * *. e .
1-/ Ii = e-- j d rId r 21pf 1>, (1 - CX 1 -«2) "Picfi
· R
which is the desired result. To include the efiect of the exclusion principle,
the final and initial state ,vave functions are antisymrnetrized. -rhus 'Pfr$/
is replaced by
(l. 8)
2 - }'2 [ tW(l) ,/...(2) _ 1,,(2) ,-h(1)l
Tf f TI f
and similarly for the initial state.
The derivation given above is based on the plane /ave representation
of the electrornagnetic field. One may use any complete set of states and
obtain the sanle result. For an cxarnple using angular mon1entum waves
the reference given on p. 287 n1ay be consuIted.t
t See also N. TraIl: and G. Goertzel Phys. Rf!o. 83, 399 (1951); lI. R. Hulme,
Proc, Roy. Soc. (London), A154'1 487 (1936).
(;ENERAL REFERENCES
A. M. E. Rose, Elementary Theory of Angular Momentum, John Wiley and Sons,
New York, 1957.
B. E. U. Condon and G. H. Shortley, The Theory of Atomic Spectra, Cambridge
University Press, Cambridge, England, 1935.
C. H. A. Bethe and E. E. Salpeter, "Quantum Mechanics of One.. and Two.. Electron
Systems," Encyclopedia of Physics, Julius Springer, Berlin, 1957, Vol. XXXV.
D. W. Pauli,"The General Principles of Quantum Mechanics," Encyclopedia ofPhysics
Julius Springer, Berlin, 1957, Vol. vII.
E. P. A. M. Dirac, Quantum Mechanics, Oxford. University Press, Oxford, England,
fourth edition, 1958.
293
AUTHOR INDEX
Acheson, L. K., Jr., 244, 252
Alford, W. P.'} 254, 272
Ambler, E., 107, 113, 115
Anderson, ( D., 76. lIS
A r,lke n, G . B., 211, 2 1 7
Bacher, R. F., 190, 217
Bade,. W. L, 154, 156
Banerjee, H., 237, 240. 252
Bargmann, V., 131
Barker, W. A., 91, 115
Bartlett, J. H., Jr., 210,218
Bessey, R. J., 222, 252
Bethe, H. A., 24, 25, 157, 188, 190,
192, 215, 217, 218, 225, 237, 240,
251, 252 29
Biedel1harn-, L. C., 29, 31, 150, 156,
211,217
Blatt, J. M., 245, 246, 252
Bohr, N., 36
Bose, S. K., 92, 115
Breit, G., 188, 190,215,217,218,225,
252
Brown, G. E., 190, 217
Buckingham, R., 237 1 252
Case, K. 1. 130, 155, 168, 217, 260,
272
ChrapJyvy, Z. Vo, 9 I, 115
Christy, R., 174, 217 .
Church, E. 1.10' 174, 211
Cini, Mo, 92, 115
Clifford, W. K.. 45, 67
Con1pton, PL fJ. 2, 31
Condon, Eo V., 23, 183, 217, 293
Darwin, (;. G., 129, 155
Davies, H., 237, 240, 252
deBroglie, Lo, 36
de Dander, Th., 37; 67
de Groot, So R., 130, 155
Deutsch, M., 105, 115, 216, 218
Dirac, Po Ao M., 32, 33, 39, 41, 67, 75,
115, 293
Doggett, J. Ao, 210, 218
Erber, To) 237, 240, 252
Fano, V., 19, 31, 196, 218
FOermi, E., 190,217,225,252
Ferrell, R. A., 92, 115
Feshbach, H., 34, 67, 210, 218, 250,
252, 281
Feynman, R Po, 155, 156
Fierz, M., 67
Fock, Vo, 37, 67
Foldy, L., 87, 115, 123, 155
Fowler, R., 237, 252
Fradkin, D. M., 131
Franz, W., 236, 252
Furry, W., 231, 252
Gamba, Ao, 92, 115
Garwin, R. L, 78, 115
Gell-Mann, Mo, 155, ] 56
Glover, F. N., 91, 115
Goertzel, G., 291
Goldhaber, M., 268, 272
Goldstein, I-I., 60, 67, 133, 156
Good, R . Ii., Jr., 52, 6"7, J 11, 112, i 1 3,
] IS, 132, 134, 155, 156, 222, 223
252, 264, 272
295
296
AUTHOR INDEX
Gordon, W., 37, 67, 121, 155
Goudsmit, S. A., 2, 31, 122
Green, T. A., 244, 252
Grodzins, L., 232, 252, 268, 272
Gross, L.., 254, 272
Hamilton, D. R., 254, 272
Hammer, C. L., 264 272
Hayward, R. W., 107, 113, 115
Heisenberg, W., 75, 115
Heitler, W., 225, 237, 252
Hill, R. D., 174, 217
Hoppes, D. D., 107, 113, 115
Hudson, R. P., 107, 113, } 15
Hulme, H. R., 237, 252, 291
Hutchison, D. P., 78, 115
Jackson, J. D., 111, 112, 115, 211, 218,
240, 245, 246, 252
Jaeger, J. C., 237, 252
Jauch, J. M., 76, 115
Jehle, H, 154, 156
Keller, J., 174, 217
King, R. W.,63, 67
Klein, 0., 37, 67
Koenig, S., 76, 115
Kofoed-Hansen, 0., 105, 115, 216, 218
Konopinski, E. J., 63, 67
Koppe, H., 196, 218
Kramers, H. A., 3, 31
Kudar, J., 37, 67
Kursunoglu, B., 92, 115
Kusch, P., 76, 115
Lamb, W. E., 172. 217
Landau, L., 259, 272
Lee, T. D., 107, 115, 259, 272
Lipkin, H. J., 107, 115
Lipps, F. W., 102, 115, 236, 252
McDougal, J., 237, 252
McKinley, W. A., Jr., 210,218
McLennan, J. A., 260, 272
Majorana, E., 260, 272
Margenau, H., 182, 217
Marshak, R. E., 255, 272
Maue, l\. W., 237, 240, 252
Maximon, L., 237, 240, 252
...
Michel, L., 63, 67
Mihelich, J. W., 174, 217
Mller, C., 225, 229, 252
Mott, N. F., 196, 203, 205, 218
Miihlschlegel, H., 196, 218
Newton, R. R., 166, 217
Newton, T. D., 91, 115
Nordheim, L., 22, 252
Olsen, H., 237, 240, 252
Oppenheimer, J. R., 225, 252
Pac, P. Y., 92, 115
Parzen, G., 231, 252
Pauli, W., 2, 31, 32, 44, 52, 67, 134,
156, 219, 252, 262, 270, 272, 293
Peaslee, D. C., 63, 67
Penman, S., 78, 115
Praden, A. G., 76, 115
Pryce, M. H. L., 90, 11.5 , 157, 21 7
Pursey, D. L., 91, 115
Racah, G., 146, 156, 188,217
Rainwater, J., 78, 115
RavenhalI, D. G.. 230, 246, 251, 252
Reitz, J. R., 174, 217
Retherford R. C., 172, 217
Rohrlich, F., 76, 115
Rose, f. E., 3, 16, 25, 29,31, 105, lIlt
112, 113, 1]5, 132, 150, 155, 156,
166, 168, 182, 189, 194, 211, 214,
217, 230, 237, 241, 244, 245, 252,
287, 293
Rosenfeld, L., 225, 252
Salam, A., 259, 272
Salpeter, E., 24, 25, 157, 188, 192, 293
Sauter, F., 237, 252
Schectn1an, R. M., 132, 155
Schiff, L. I., 221, 252, 289
Schopper, H., 232, 252
Schrodinger, E., 37,67,90,115
Schur, I., 67
Schwinger, J., 76, 115
Segre, E., 78, 115
Serpe, J'J 260, 272
Shapiro, G., 78, 115
Sherman, N., 210, 218
AUTHOYt INDEX
ShortJey, G., 23, 183 217, 293
Sommerfeld, A., 237, 240, 252
Spence,r, V. L., 210, 217
Sudarshan, E. C. G., 92, 115, 255, 272
Sunvar A. W., 268, 272
.. ,
Tani, S., 87, 115
Temple, G., 238, 252
Thomas, L. H., 20, 31
ToJhoek, H. A., 19,31, 102,115, 130,
155, 236, 252
Tolman, R. C., 196, 218
TomoJ1aga, S., 76, 115
Touschek, B., 92, 115
Tralli, N., 291
Treiman, S. B., 111, 112, 115, 217,
'''. 218, 240, 252
Uhlenbeck, G. E., 2, 31, 122
Van Dingen, H., 37, 67
297
Villar, F., 34, 67, 281
Watanabe, S., 255, 272
Watson, G. N., 175, 176, 192, 217
Watson, R. E., 210, 218
Weisskopf, V., 75, 115
Welton, T. A., 132, 155, 210, 2,18
\\'entzel, G., 105, 115
WeyJ, H., 258, 272
Whittaker, E. T., 175, 176,192,217
Wightman, A., 63, 67
Wigner, E. P., 91, 115, 131
Wilson, R. N., 230, 246, 25<1, 252
Wouthuysen, S. A., 87, 115, 123, 155
Wu, C. S., 107, 113, 115
Wyld, H. W., Jr., 111, 112, 115,217,
2] 8, 240, 252
Yang, C. N., 107, 115, 259, 272
Yennie, p. R., 230, 246, 251, 252
SUBJECT INDEX
Pdjoint function, 44, 119, 147, 155
Amplitudes!)- Dirac plane waves, 68,
247
for scattering, 200, 205, 216
Angular displacements, cornmutation
of, 5
Angular momentum, conservation, 77
coupling, 25
eigenfunctions, 6
eigenvalues, 6
intrinsic, 2
operators, 4-8, 82, 83, 158
orbital 13, 114
spin matrices, 9
Angular momentum representation, 21-
23, 158, 161, 181, 250, 269
Anisotropy, in Compton scattering, 237
in nuclear beta decay, 111, 113
Antilinear transformation, 135, 143,
144, 266 .
Antineutrino, see Neutrino
Antineutron, 78
Antiproton, 78
Asymptotic wave functions, 125, 165}
168, 171, 176, 193, 194, 200, 214.
216, 217, 240, 245, 250, 251
Auger effect, 195, 225
Axial vector covariant, 62, 66
Axial vector interaction, 107
Beta decay, 64, 82, 1 05 ff. 114, 116,
142, 155, 195, 205, 214, 216 223,
240, 251, 253, 258, 266, 268, 271
Born approximation, 223 if., 230, 233,
237, 251
Bound state wave functions, 163, 169,
223
energy values, 125, 172, 183, 190,
223
Breit interaction.. 225, 228
Breit operator t 22.7
Brelnsstrahlung 77 t 195, 237, 240
C:asimir limit, 251
Center of mass, 157
Central fields 21, 157 if., 196
Charge conjugation, and Lorentz trans-
formations, 137
and space reflection, 142
and time reflection, 145
for zero mass 256
in Dirac equation, 81, 134, 159-160
in s.tandard representation, 81, 138,
154
in t\vo-component theory, 259, 260
of spin operator, - 82, 99, 103
reciprocal character of, 81, 137
under unitary transformation, 150-
152
unitary property, 137
Charge density, 38, 116
Charge conservation, 35, 120
Chirality, 255
Classical limit r 7, 33, 219 if.
Clebsch-Gordan coefficients, see Vec.
tor addition coefficients
CJjffortl lgebra, 45
Commutation rules, angular momen-
tum, 5,. 83
central field operators, 22, 49-51
298
SUBJEC1"' INDEX
Commutation rules, of Dirac matrices,
41, 43, 52, 65, 284
of even and odd matrices, 126
of spin one-half, 9
relativistic spin operator, 72
under space reflection, 140
under time reversal, 143
Completeness, 13,74,93,213; 226,
235
Compton scattering, 75, 102, 232
Constants of motion, for free particls,
52
in central field, 22, 27, 49
-in---static fields, 130, ] 32
in hvo-component theory, 269
in Zeeman effect, 181
Continuity equation, 34, 38, 62, 116,
122, 140, 164, 220, 263
Continuum states, 191 ff.
Coulomb field, 23, 163, 166, 169 fi.,
206, 216, 238, 240, 242, 250
Covariance, of charge conjugation, 137 9
145
of description of spin, 102, 114, 130
of Dirac equations, 33, 55-61, 116
of MaxweU equations, 33, 117-118
of unitary transformation, 153
of Weyl theory, 262
Covariants, bilinear, 61, 65, 66, 147 t
148, 155, 266
vector, 62, 149
Current decomposition, 121
Current density, 34, 40, 82, 83, 114,
116, 121, 154, 221) 263
Darwin fluctuation energy, 129, 173
Decomposition of wave function, 48,
69, 79
Density matrix, 197-198, 217, 232
DiagonaJization, of energy, 25
of Pauli spin component, 15
of spin operator, 97
Diagonal representation, 79, 83 fi., 87,
88
Dirac equation. quadrature solution,
103
Dirac matrices, alternative approach to,
281
299
Dirac matrices) commutation. rules, 41,
43, 52
complete set) 45
properties of, 66
rank" 41, 46
trace ot 41, 45
transfonnation of, 42: 52
Dirac space, 47, 67, 69, 79, 262
IJirect product, 48
Double scattering. 196, 204, 210
f)oublet structure, 3, 24
Eigenvalues, of angular momentum, 6
of Coulomb energy, 172
of free particle energy 1 69
of spin operator, 9, 96
Electromagnetic interaction, 224, 286
Energy degeneracy, 69, 172
Energy levels, in anom.atous Zeeman ef-
fect, 24, 186, 187
in Coulomb field 172-174
Energy operator, 2 L 24, 34, 39, 123
Even matrices, 48, 65, 71. 79, 95
Expectation values, 49, 82, 113, 133,
154
Fermi function, 216, 223
Fermi transition, 110
Fierz matrix, 67
Fine structure, 24, 174
Foldy-Wouthuysen (FW) transforma-
tion, 87 fI., 114. 123 ff., 155, 249
Four-component representation, 46, 79,
92, 258
Free particles, 87, 104, 124, 131, 161,
223
Gamow. Teller transition, 110
Gauge invariance, 117, 118, 120, 154
Gauge transformation, 117, 119
Green's function, 211, 225, 245
Gyromagnetic ratio. 3, 132, 189
Hamilton-J acobi equation, 220
Heisenberg representation, 279
Helicity, 82, 111, 247, 248, 255 fT.
Jlermitian density matrix, 198
Hermitian matrices. 12 34, 40, 82, 88
300
SUBJECT INDEX
Hermitian operators, 123, 124, 150
-' .{igb energy representation, 92, 246,
252, 254
Hole theory, 75--76, 83, 257
Hyperfine coupling t 30, 188 if., 240
Hypergeometric function, confluent,
. 171, 239
contiguous relations, 176, 179, 192
Idempotent operators, 17, 30, 94
Inelastic scattering, 251
Internal conversion, 77, 195, 214, 225,
240
lnvariance,d)f beta interaction, 106
of norn1alization, 33, 263
Isotope shift, 240
Klein paradox, 126
Klein-Gordon equation, 37, 117, 122,
285
Klein-Nishina formula, 237
K shell, energy levels, 178
hypemne structure, 190
wave functions, 178, 179
Zeeman effect in, 183
Lamb shift, 172
Large component. 71, 79, 89, 123, 146,
169, 181, 202, 249
Larmor precession, 20
Linear independence, of Dirac mat-
rices, 45
of four-component wave functions,
69, 92, 113
of spin functions, 13
Lorentz transformations, see also Space
reflection; Time reflection
continuous, 56, 58, 85, 102, 106, 276
discontinuous, 103, 139-147, 266,
276, 282
general, 65, 120, 155, 276
infinitesimal,. 59
in two...component theory, 262 fi.
sochronous, 153
uniform translations, 61, 86, 87, 138,
265, 277
L shell, energy levels, 178
hyperfine structure, 190
wave functions, 178, 179
Zeeman effect in, 183
Magnetic moment, 2, 76, 78, 122, 129,
189, 270
Majorana neutrino theory, 260
Majorana representation, 155
Mass variation, 21 t 173
Matrix elernents, 8, 25
in hyperfine coupling, 189 ff.
in Zeeman effect, 181 ff.
Maxwell equations, 33, 116
lHer interaction, 225, 228, 251
Mott polarization, 203, 216
Multipole field, 214
Mu meson, 76, 130, 271
Negative energy states, 69, 74 ff., 85,
88, 90, 95, 114, 126, 240, 257, 260
Negative frequencies, 36
Neutrino, 78, 106, 142, 214, 253 fI.
Nodal properties, 166 ff.
Non-quantum limit, see Classical limit
Non-relativistic limit, 2, 33, 70, 79, 83,
87, 97, 101, 113, 119, 121, 122,
134, 171, 177, 188, 190, 199
Normalization, of bound state eigen-
functions, 163, 175
of continuum wave functions, 193
of radiation field, 226, 234, 287
Notation, 273
Nuclear form factor, 231
Nuclear size effects, 195, 230, 240 fI.
Odd matrices, 48, 66, 71, 79, 88, 95,
124
Orbital angular momentum, see Angu-
lar momentum
Pair annihilation, 77
Pair formation, 77, 195, 237
Parabolic coordinate so1utions, 237
Parity, in two-component theory, 269
of Dirac wave functions, 52 83, 161
Parity non-conservation, 107, 115, 232,
258 fI., 267
Pauli space, 48, 67, 79
Pauli spin matrices, see Spin matrices
Pauli theorem, 52, 135
Pauli theory, 2, 173, 216, 247
Perturbation theory, 77, 173,215,224,
226, 232, 286
SUBJECT INDEX
Phase shifts, 194,215,244,250
Photoelectric effect, 77 t 195, 237 9 240
Pi fiteSOns, 78
Plane waves, 52, 68 tr., 88, 89, 92, 109,
114, 223, 247. 259
expansion into spherical waves, 29,
162, 206. 270
Polarization, electric, 121
electron, 19,72,111-113,114,130,
162, 196, 203-205, 2 J 4, 251 f 271
longitudinal, 130
magnetic, 121
neutrino, 258. 260, 268
nuclear, 112, 1 J 3
photon, 225, 232
Position operator, 90, 91
Positron, 76, 79, 85, 96, 99. 103, l11,
135, 137. 159, 195, 216, 27J
Precession of reference franlf':, 20
Projection operators, energy, 66, 79
89,
94, 101, 110, 113, 236, 240
genera] properties, 114
helicity, 256, 259, 26l
spin, 17" 92 ff., 96, 100, 103, 104.
110, 199,217,236,247
Projection quantum number, 6, 8
Pseudoscalar covariant, 63
Pseudo')calar spin operator, 103
Quadrature solution of Dirac equation,
241
Quadrupole interaction. 215
Radial wave equation, 159. 160, 269
Radial wave functions, 159: 161: 163 ff..
175-181, 270
indicia] behavior, 170
regularity at infinity, 163
reg;]larlty 1U ()rigin
163, 169, 170
square integrability, 163
Relativistic invariance, 32, 35
Representation invarianc:e of rnHtrix
equations
10, t 14
Retardation, 225, 226, 227
Rotation matrices, 16, 30
Rotation operator, 4«- 5
Rotations in thrc
-space) 4
301
Scalar covariant, 62
Scattering, 194, 195, 196 fr., 205 fr.,
227, 240
phases in, 200, 245, 250
Schur's lenlma, 46, 285
Screening effects, 174, 195,215,223
Second-order equation, approximate
form, 123, 124, 129, 237
for free particles, 37
in electromagnetic fields, 122
155
Selection rules. J10, 214,2]6
Singularity. of Dirdc wave functions,
1 79, 19.5
of projection operators, 19,
3
SOlan cornponent, 71t i9, J23, 146,
181, 249
SomrnerfeJd-Maue rnethod, 237
Space inversion. 52
158, 215. ::69
Space reflection, J 39 ff., 155, 266 ff.
'lnd charge conjugation, 142
Spherical harmonics. 22
Spherical \vavcs, 206, 214
· . " I '
li o
Ingo1ng, .. _, £. ,
Spin, average? 15
componen.t of, 14
covar.iant description) 102, 130
of Dirac partic]
, 1--3
Pauli, 2. 9, 30, 114, 248
Spin-angular funcdons; 27, 3
160
Spin matrices, 9. 114, 25P.
Spin operator, 8--12, 162
in Dirac theory, 72
pseudo8calar, 103
relat{visti,
, 72, 113! 1 i 4, 214, 235,
248. 251
Spin-orbit coupling, 19\ ] 73. 196, 2i7
Spin-orbit spHtting, j 72
Spinors. four-COmpi)nent, 49
Pauli, 12
t,vo-compoD{:nt. 16. 3]; 7.5R
St
tic f1elds, ! 10
'"rensor cou.pHng} 27 t
Tensor covari(l1)t, 62, 69, 149
Tensor spin operat.or, 105
Thomas
Fen1\i."I)irac Model, 174
Titne-dependi'nt operators
49
rinle reflectlo!1, and eharg
conjuga-
tion, 145
302
SUBJECT INDEX
Time reflection, in beta interact jon,
108, 149
in classical theory, 142
in Dirac theory, 144, 154
in non
relativistic quantum theory,
143
in Pauli theory, 143
in scattering, 2! 5
in two-component theory 1 266
of the adjoint, 147
Trembling motion, see Zitterbe'lrvegang
Two,component spinors, 248
"rwo-componcnt theory, 108, 258 if.
Unitary transforn18tions, 10, 34, 64, 66,
83, 86, 87, 88, 113, 124, 150--154
lJnit tensors, fourth..rafik, 63, 118
third-rank, 9, 281
Units, 275
Variationai rnethod, 244
Vector addition coefficients, 26, 1 H4
'lector covariant. 62, 149
VectoriPteraction, 107
Vector operator, 5, 40
Velocity operator. 1 t 3, 279
Virial theorem, 132
Virtual quanta, 76. 226
Wave equation, 37, 39, 219, 258
polar coordinate form, 158
Wave functions, see also Asymptotic
wave functions; Bound state wave
functions
angu Iar mOInentum, 6, 12, 22, 26
ct.ntra] field, 27
Coulonl b field
177, 194
decomposition, 48, 69. 79
free particle, 29
integral representation, 194
left-handed, 255, 260
muJticomponent, 13
right-handed, 255, 260
Whittaker function, 176
WKB Inethod, 221
Zeeman effect, anomalous, 2, 24, 30,
181
Zero energy solutions, 216
Zero mass particles, 67, 106, 154, 253,
262
Zero spin particles, 216
Zitterbewegung, 90, 114, 129, 280