Author: Schwinger J.  

Tags: physics   quantum mechanics  

ISBN: 3-540-41408-8

Year: 2001

Text
                    Quantum Mechanics
Symbolism
of Atomic
Measurements
Edited by
Berthold-Georg Englert
Springer


JULIAN SCHWINGER Quantum Mechanics Symbolism of Atomic Measurements Edited by Berthold-Georg Englert « Springer
Julian Schwinger (1918-1994) Dr. Berthold-Georg Englert Clarice Schwinger Gleissenweg 23 10727 Stradella Court 85737 Ismaning, Germany Los Angeles, CA 90077, USA With 78 Drawings and Figures, and 351 Problems Library of Congress Cataloging-in-Publication Data. Die Deutsche Bibliothek - CIP-Einheitsaufnahme Schwinger, Julian Seymour: Quantum mechanics : symbolism of atomic measurements / Julian Schwinger. Ed. by Bertold-Georg Englert. - Berlin; Heidelberg; New York; Barcelona; Hong Kong; London; Milan; Paris; Singapore; Tokyo: Springer, 2001 (Physics and astronomy online library) ISBN 3-540-41408-8 ISBN 3-540-41408-8 Springer-Verlag Berlin Heidelberg New York This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution under the German Copyright Law. Springer-Verlag Berlin Heidelberg New York a member of BertelsmannSpringer Science+Business Media GmbH © Springer-Verlag Berlin Heidelberg 2001 Printed in Germany The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Typesetting: Camera ready copy by the editor using a Springer TgK macropackage Cover design: Erich Kirchner, Heidelberg Printed on acid-free paper SPIN 10761836 55/3141/tr 543210
To Clarice and Ola
I I m% '* p* Julian Schwinger (1918-1994)
Preface Julian Schwinger had plans to write a textbook on quantum mechanics since the 1950s when he was teaching the subject at Harvard University regularly.* Roger Newton remembers:"!' [A] group of us (Stanley Deser, Dick Arnowitt, Chuck Zemach, Paul Martin and I forgot who else) wrote up lecture notes on his Quantum Mechanics course but he never wanted them published because he "had not yet found the perfect way to do quantum mechanics." The only text of those days that got published eventually - following a suggestion by, and with the help of, Robert Kohler* - were the notes to the lectures that, Schwinger presented at Les Houches in 1955. The book was reissued in 1991, with this Special Preface by Schwinger [3]: The first two chapters of this book are devoted to Quantum Kinematics. In 1985 I had the opportunity to review that development in connection with the celebration of the 100th anniversary of Hermann Weyl's birthday. [... ] In presenting my lecture [4] I felt the need to alter only one thing: the notation. Lest one think this rather trivial, recall that the ultimate abandonment, early in the 19th century, of Newton's method of fluxions in favor of the Leibnizian calculus, stemmed from the greater flexibility of the latter's notation. Instead of the symbol of measurement: M(a',b'), I now write: \a'b'\, combining reference to what is selected and what is produced, with an indication that the act of measurement has a beginning and an end. Then, with the conceptual analysis of \a'b'\ into two stages, one of annihilation and one of creation, as symbolized by |o'6'| = |o')(6'|, the fictitious null state, and the symbols *F and ¢, can be discarded. As for Quantum Dynamics, I have long regretted that these chapters did not contain numerous examples of the practical use of the Quantum Action Principle in solving physical problems. Perhaps that can be remedied in another book, on Quantum Mechanics. [ ... ] *See chapter 10 in the recent biography by Mehra and Milton [1]. ^As quoted by Schweber in section 7.11 of [2]. *See the preface to [3].
VIII Preface The change in notation mentioned here was systematically incorporated in the set of lecture notes that Schwinger wrote up for the students of the three- quarter course on quantum mechanics that he taught twice in the mid-1980s at the University of California, Los Angeles (UCLA). I had the great luck to still be at UCLA during much of the first round, taking care, in fact, of office hours and problem sessions, and I continued to receive Schwinger's handwritten lecture notes after I had left. Indeed, these notes were meant to be the basis of the intended book for which Schwinger had selected the natural title Quantum Mechanics and the less obvious - to others, that is, not to him - subtitle Symbolism of Atomic Measurements. This choice is the succinct pronouncement of his philosophy, which is spelt out in the Prologue. The quote from page 10: [PJhysics is an experimental science; it is concerned only with those statements which in some sense can be verified by an experiment. The purpose of the theory is to provide a unification, a codification, or however you want to say it, of those results which can be tested by means of some experiment. Therefore, what is fundamental to any theory of a specific department of nature is the theory of measurement within that domain. is to the point. Schwinger's continuing interest in frontier physics was a permanent and, in hindsight, unfortunate distraction from the book-writing enterprise. Eventually, his untimely death put an end to all plans, and so his quantum mechanics book is not to be. Yet there are those UCLA lecture notes. Although they are certainly not identical to the book Schwinger would have written, they do represent a first draft and are the closest thing to the unwritten book that is available. Prom many conversations I had with him, I know that Schwinger was quite happy with the way he induces the general structure of quantum kinematics and establishes the dynamical principle, his quantum action principle. I think that he had finally "found the perfect way to do quantum mechanics." I always thought that the notes should be put into a form that makes them accessible to the broad public, but it needed the encouragement of a few friends to actually go about it. Particularly decisive was the gentle push by Robert Finkelstein who, in response to my remark - during a lunch session at the UCLA faculty club (the Chatham had disappeared years earlier) - that somebody should put the notes into print, just said: " You should do it." Thank you, Bob. And then, of course, there was the consistent support by Clarice Schwinger who gave me the feeling - very calmly and, I'm sure, very consciously - that she couldn't think of anyone else to do it. Thank you, Clarice. My dear wife Ola had to be content with a much too small share of my attention while I was working on this project. Knowing well how much the book means to me, and why it does, she never complained. Dziekuje, Ci, Olu.
Preface IX The lecture notes of Schwinger's UCLA course consist of three parts corresponding to the three quarters of teaching. Here is a brief summary of the contents. Part A, the material of the fall quarter, begins with an analysis of experiments of the Stern-Gerlach type that accomplishes "a self-contained physical and mathematical development of the general structure of quantum kinematics" [4]. Much technical material is delivered in passing. In particular, unitary transformations are studied from various angles, and the algebra of angular momentum is treated in depth. Then, an analysis of Galilean invariance yields the non-relativistic Hamilton operator. The winter quarter, Part B, proceeds from there. The response to infinitesimal time displacements establishes the equations of motion. Then the Quantum Action Principle is derived, and accepted as a fundamental principle. In a sense, the rest of Part B and all of Part C consist of instructive applications of the action principle - the "numerous examples" referred to above. Part B contains treatments of, among others, the (driven) harmonic oscillator, bound-state properties of hydrogenic atoms, and Rutherford scattering. Part C (spring quarter) begins with the two-particle Coulomb problem, including the modifications for two identical particles. The treatment of systems with many identical particles follows, where the notion of second quantization eventually leads to the concept of the quantized field. As a first application, the Hartree-Fock and Thomas-Fermi approaches to many-electron atoms are presented, the latter in considerable detail.§ The second and final application is the quantum theory of electromagnetic radiation, which is developed to the extent necessary for an understanding of (the non-relativistic aspects of) the Lamb shift. During his oral lectures, and in the handed-out notes, Schwinger never took any credit for his own very substantial and highly original contributions. But, of course, he mentioned the names of others whenever appropriate. I decided to stick to this practice when preparing the notes for print. Distributing notes to the students that attend your lectures is one thing, writing a book for the anonymous reader is quite another. So, some editing was unavoidable in the course of turning Schwinger's lecture notes into book form, but I tried to change as little as possible. In addition to the UCLA notes of the mid-1980s, Chapter 12 contains some material from lectures that Schwinger gave at the University of New Mexico, Albuquerque in 1987. The Prologue is based on the transcript of the audio record of a public lecture that he delivered in the early 1960s.^ Most of the problems are as formulated by Schwinger; in addition to the ones that came with the lecture notes, I discovered many good problems in the Schwinger Papers [5] that are archived at the UCLA Research Library. In fact, all the raw material that I used can ^Tliis is an example that teaching and research were closely related activities for Schwinger. At the time of these lectures, he was working on refinements of the Thomas- Fermi method. 'Section 7.10 in [2] comments on this lecture.
X Preface be found there. Charlotte Brown, Curator of the UCLA Special Collections, has been very helpful in my search of the Schwinger Papers. I thank her sincerely. I wish to thank Herbert Walther for the splendid hospitality extended to me over the years at the Max-Planck-Institut fur Quantenoptik in Garching; the institute's infrastructure was of great help while I was working on this book. During the past year, the crucial stage of this undertaking, I was supported by the Universitat Ulm; I thank Wolfgang Schleich for the generous invitation to join his Abteilung Quantenphysik temporarily. I acknowledge with gratitude the support by the editorial staff of Springer- Verlag; in particular, the help of Wolf Beiglbock, Christian Caron, and Brigitte Reichel-Mayer was invaluable. And I thank Jens Schneider, who turned the handwritten notes into electronic files that I could then work on. Ismaning, September 2000 BG Englert References 1. Jagdish Mehra and Kimball A. Milton: Climbing the Mountain. The Scientific- Biography of Julian Schwinger (Oxford University Press, Oxford and New York 2000) 2. Silvan S. Schweber: QED and the Men Who Made It: Dyson, Feynman, Schwinger, and Tomonaga (Princeton University Press, Princeton 1994) 3. Julian Schwinger: Quantum Kinematics and Dynamics (W.A. Benjamin, New York 1970; reprinted by Addison-Wesley, Redwood City 1991) 4. Julian Schwinger: 'Hermann Weyl and Quantum Kinematics'. In: Exact Sciences and their Philosophical Foundations, Proceedings of the International Hermann Weyl Congress, Kiel, Germany, 1985, edited by Wolfgang Deppert et al. (Verlag Peter Lang, Frankfurt/Main and New York 1988) pp. 107-129 5. Julian Schwinger Papers (Collection 371), Department of Special Collections, University Research Library, University of California, Los Angeles
Contents Prologue 1 Part A. Fall Quarter: Quantum Kinematics 1. Measurement Algebra 29 1.1 Stern-Gerlach experiment 29 1.2 Measurement symbols 31 1.3 State vectors 36 1.4 Successive measurements. Probabilities 38 1.5 Probability amplitudes. Interference 41 1.6 "Measurement disturbs the system" 46 1.7 Observables 48 1.8 Algebra of Pauli's operators 50 1.9 Adjoint symbols, Hermitian symbols 53 1.10 Matrix representations 56 1.11 Traces 57 1.12 Unitary geometry 59 1.12.1 Column and row vectors, wave functions 59 1.12.2 Two arbitrary components of Pauli's vector operator . 63 1.13 Unitary operators 67 1.14 Unitary operator bases. Complementarity 69 1.15 Quantum degrees of freedom 76 1.16 The continuum limit 82 1.16.1 Heisenberg's commutation relation 82 1.16.2 Schrodinger's differential-operator representation .... 86 Problems 88 2. Continuous q,p Degree of Freedom 101 2.1 Wave functions 101 2.2 Expectation values and their spreads 109 2.3 States of minimal uncertainty Ill 2.4 States of stationary uncertainty 114 2.5 Hermite polynomials 118
XII Contents 2.6 Completeness of stationary-uncertainty states 123 2.7 Eigenvectors of non-Hermitian operators 125 2.8 Classical limit 132 2.9 More about stationary-uncertainty states 135 Problems 136 3. Angular Momentum 149 3.1 Infinitesimal unitary transformations 149 3.2 Infinitesimal rotations 150 3.3 Common eigenvectors of J2 and Jz 152 3.4 Decomposition into spins 155 3.5 Angular momentum of a composite system 158 3.6 Finite rotations. Eulerian angles 162 3.7 Rotated angular-momentum eigenvectors 168 Problems 177 4. Galilean Invariance 183 4.1 Generators of infinitesimal transformations 183 4.2 Hamilton operator for a system of elementary particles 190 Problems 191 Part B. Winter Quarter: Quantum Dynamics 5. Quantum Action Principle 195 5.1 Equations of motion 195 5.2 Conservation laws 197 5.3 Sets of q,p pairs of variables 199 5.4 Wave functions for force-free motion 202 5.5 Quantum action principle 207 5.6 Principle of stationary action 210 5.7 Change of description 213 5.8 Permissible variations 214 Problems 216 6. Elementary Applications 223 6.1 Time transformation functions 223 6.1.1 Free particle 223 6.1.2 Constant force 224 6.1.3 Linear restoring force: Harmonic oscillator 226 6.2 Short times 227 6.3 Harmonic oscillator: Energy eigenvalues 229 6.4 Free particle and constant force: State density 231 6.5 Harmonic oscillator: Energy eigenstates 234 6.6 Free particle and constant force: Energy eigenstates 237
Contents XIII 6.7 Constant force: Asymptotic wave functions 239 6.8 WKB approximation 243 6.9 Zeros and extrema of the Airy function 248 6.10 Constant restoring force 252 6.11 Rayleigh-Ritz variational method 255 Problems 257 7. Harmonic Oscillators 269 7.1 Non-Hermitian operators 269 7.2 Driven oscillator 272 7.2.1 Time-independent drive 274 7.2.2 Slowly varying drive 276 7.2.3 Temporary drive 278 7.3 Remarks on Laguerre polynomials 286 7.4 Two-dimensional oscillator 288 7.5 Three-dimensional oscillator 295 Problems 298 8. Hydrogenic Atoms 303 8.1 Bound states 303 8.2 Parameter dependence of energy eigenvalues 307 8.3 Virial theorem 309 8.4 Parabolic coordinates 313 8.5 Weak external electric field 316 8.6 Weak external magnetic field 319 8.7 Insertion: Charge in a homogeneous magnetic field 324 8.8 Scattering states 328 Problems 333 Part C. Spring Quarter: Interacting Particles 9. Two-Particle Coulomb Problem 343 9.1 Internal and external motion 343 9.2 Rutherford scattering revisited 346 9.3 Additional short-range forces 352 9.4 Scattering of identical particles 355 9.5 Conserved axial vector 358 9.6 Weak external fields 365 Problems 368 10. Identical Particles 375 10.1 Modes. Creation and annihilation operators 375 10.2 One-particle and two-particle operators 381 10.3 Multi-particle states 385
XIV Contents 10.4 Dynamical basics 386 10.5 Example: General spin dynamics 387 10.6 General dynamics 392 10.7 Operator fields 395 10.8 Non-interacting particles 397 Problems 403 11. Many-Electron Atoms 405 11.1 Hartree-Fock method 405 11.2 Semiclassical treatment: Thomas-Fermi model 410 11.3 Correction for strongly bound electrons 420 11.4 Quantum corrections and exchange energy 425 11.5 Energy oscillations 428 Problems 430 12. Electromagnetic Radiation 437 12.1 Lagrangian, modes, equations of motion 437 12.2 Effective action 441 12.3 Consistency check 444 12.4 Free-space photon mode functions 447 12.5 Physical mass 449 12.6 Infrared photons 452 12.7 Effective Hamiltonian 455 12.8 Energy shift 459 12.9 Transition rates 461 12.10 Thomson scattering 465 Problems 467 Index 473
Prologue It seems to me that there are deep philosophical lessons to be learned in the way in which the practicing theoretical physicist thinks about the foundations of the subject, i.e., the manner in which he approaches the problems, the general criteria he brings to bear on what is a reasonable solution. So, the important thing then is to display the general world view, the world picture that the theoretical physicists has. This is particularly significant in connection with the philosophical implications of quantum physics, because quantum physics or quantum mechanics - by which I think we mean finally the rational mode of understanding of microscopic or atomic phenomena - has perhaps had the greatest impact of any of the developments of physics upon the mode of thinking or the world picture of the physicist and thereby, indirectly, of the general citizen. Now, if we want to understand specifically the origins of quantum physics, we must go back to see how the stage has been set through the developments of what is called classical physics, and then compare with quantum physics. By classical physics, we mean the precise formulation of all of the properties of matter as they were finally expressed in their essentially perfect form at the beginning of the 20th century. Classical physics is characterized by the fact that, whereas the underlying conceptions are idealizations - it was no easy job to be a Galileo or to be a Newton - nevertheless, these conceptions still strike very close to common, ordinary everyday affairs. To understand the principles of physics, as they are expressed within these great generalizations of classical physics, is not very difficult. Our school children manage it all the time. But quantum physics is something different. In quantum physics, you go far beyond the ordinary situations of everyday life. We strike at a level of idealization that is hard to appreciate until you have seen how this historical line of development has come about. The first grand physical theory, of course, was that of Newtonian physics. This is a theory of massive point particles which interact by means of actions at a distance. The traditional theory of gravitation is the classic example of this. And to characterize this theory in a general way in terms of its philosophical foundations, let me say that Newtonian physics, or Newtonian mechanics is a causal, deterministic theory.
2 Prologue By causal, one means essentially that when the state of the system is given at a particular time - and we must return to precisely what we mean by "state" - then the state is completely determined at any other time; this is what we mean by causality. Causality is inference in time: given the state of affairs at one time, the state of affairs is uniquely determined at another time. What makes it deterministic is that the knowledge of the state also determines all possible physical phenomena precisely. This distinction between causal and deterministic may not seem very important until we come up to the rather different situation that appears in quantum mechanics. I have spoken of the state within the framework of Newtonian physics. It is familiar that when we specify the state of the system of particles which interact with each other by means of instantaneous forces as the gravitational force was conceived at that time, that the full specification of state is the indicating of precisely where the particles are at a given instant of time and how they are moving. In more technical language, it is the specification of the positions and the momenta of the particles. If these are known at a given instant of time, and it is known precisely under what law of force the particles are moving, such as the grand statement of the inverse square law of gravitation, then the physical situation is specified completely. The state at any other time can be predicted and also, since this is a deterministic theory, the knowledge of the state is the origin of the full knowledge of the answer to all possible physical questions that can be asked. Now, to indicate that Newtonian physics is not something that lies completely behind us in the history of physics, let me perhaps remind you of the fact that the triumph of Newtonian physics is indicated every time we have an announcement of a successful space mission. That is, Newtonian physics, as any general theory of physics must, remains perfectly valid in its own domain. Here, of course, the domain is the motion of material bodies under the action of known laws of force which are instantaneous interactions and cover, therefore, fully the motion of artificial satellites in the perfectly well-known field of force that is provided by the gravitational attraction of the Earth, or the Moon, the planets, and so on. Physics introduces new theories not because the theories in a particular domain are found to be unsatisfactory, although they may be so also, if the technique of experiment becomes finer and finer and new phenomena are found which pass beyond the level of accuracy in the earlier theories, but primarily because the domains of physics which come into question are ever extended. To follow the line of historical development: the introduction in the 17th century of the Newtonian concepts led to a steady development of these ideas, their applications being primarily to astronomical phenomena, which lasted for essentially a full 200 years, while the technical means for the inference, in a precise sense, of the implications of these laws were developed until one could fully carry out the calculations necessary to follow the paths of the planets in full detail and so on. It was, however, during the 19th
Prologue 3 century that new areas of physical experience began to be met, in particular in the domain of electromagnetism. And then we came finally, towards the second half of the 19th century, to essentially a new physical theory going beyond anything that had been, so to speak, conceived within the framework of Newtonian physics. This is Maxwell's field theory of electromagnetism. As far as the broad categorization of these theories is concerned, this was also a causal, deterministic theory. But what made it so very different was what was involved in the specification of state. Recall that the Newtonian theories were concerned with point particles and the specification of state was the indication of where these particles were and how they were moving at any particular time. It is a discrete description: a finite number of particles, and a finite number of quantities is needed to characterize everything about this physical system. By contrast, in a field theory - let's continue to have in mind the very specific example of the electromagnetic field - the specification of state requires not a finite number of things (where the particles are and how they are moving) but an infinite number. We must, in principle at least, specify what the electromagnetic field is doing; how the electric field is pointed; and how the magnetic field is pointed at every point of space - and this at a given time. And what makes this a causal theory is that if we know the state, if we know the distribution throughout space of the electric and magnetic field at a given time, then we can predict at any later time what the distribution of the electric and magnetic field will be. Given the state at one time, the state is uniquely implied at another time. That makes it a causal theory. Again, what makes it deterministic is that the knowledge of the state, the knowledge of the electric and magnetic fields, completely determines the answers to all questions that can be asked about the electromagnetic field. And, we should note again that - while we are talking about a domain of classical physics, with the inference that this is not the final word - nevertheless, the quantitative success of Maxwell's theory is demonstrated every day. We just have to look at the ever-expanding development of radio communication systems, and microwaves of radar or of television, not to mention the ubiquitous cellular phones. This is not a past history in the development of physics, it is something whose validity is confirmed all the time. The point, however, is that it refers to a limited domain of experience. It is not all of the physical world. So, here then we have two very different kinds of physical theories, both causal and both deterministic, but widely different in the nature of what characterizes the specification of a state. One lies, so to speak, at one limiting domain of the spectrum, it is a discrete description; a finite number of quantities is specified. The other, at the other end, is a continuous description; fields are involved, distributed throughout all space. These then are the models of the two limits of classical behavior: the discrete, the continuous. And it is particularly interesting to see how these two entirely different clas-
4 Prologue sical modes of description have in a sense become unified or, perhaps better, transcended in the further development of quantum physics. Still continuing with our historical development, I recall what is all familiar to you, that at the beginning of the 20th century there were further very important developments, associated with Einstein's name - the developments of the special and general theory of relativity. But yet, these were not radically new developments in the sense that I mean quantum mechanics to be; they were rounding out the framework of classical physics. They were the recognition that - once we had placed the field phenomena, the electromagnetic field specifically, on the same foundation as the theories of particles - there was a modification in the strict Newtonian point of view. While we are still dealing, within this framework, with point particles, they no longer interact via instantaneous forces. We now recognize, as is particularly emphasized by the relativity theory, that the interactions between particles are propagated through space by means of the intermediary of the field. Incidentally, I should also emphasize this difference between the two classical modes: the strict Newtonian viewpoint is one of the instantaneous interaction at a distance; the field point of view is one of local interaction propagated from one point of space to the contiguous points. Within the field concept, there is no longer any idea of instantaneous propagation. It is propagation through space and time by means of a mechanism which is, in fact, intrinsically limited in speed. This is, of course, the famous constancy of the speed of light, the starting point for all the investigations on the special theory of relativity. And so, what finally emerges from all of this is a theory with a dualistic point of view, in which there are particles and fields, standing side by side, neither explained in terms of the other - a fundamental duality that is the culmination of classical physics: the strict Newtonian, discrete, point of view of the particle has been modified because we now recognize that the interactions between particles are not instantaneous but are propagated through the mechanism of the field with its continuous point of view. The field is there to supply the dynamical agency by which particles interact. It was to be the purpose of further developments of quantum mechanics that these two distinct classical concepts are merged and become transcended in something that has no classical counterpart - the quantized field that is a new conception on its own, a unity that replaces the classical duality. We must try to trace the development of quantum mechanics, starting from this classical background, up to this much deeper quantum mechanical foundation and explanation. So much then for a summary which can hardly do justice to several hundred years of hard work by many physicists in attempting to lay the foundations of these laws of what I will now call macroscopic phenomena because it was, of course, in the investigation of the microcosmos - of atomic phenomena - that an entirely new world and a new system of order was opened up. It was here that it was found that the laws, which served so very well „. BJrt „ w™s8&>ii^&l
Prologue 5 to range from ordinary experience on the Earth to extraterrestrial experience in terms of the motions of the planets, are not in accordance with the observed phenomena. When we turn not outward, but inward, we find new laws of motions, new laws of physics, new ways of thinking, new philosophical conceptions. Now, how did this come about? Of course, in just a few sentences I cannot give a fair account of the tremendous development of physics which occurred during the last years of the 19th century and the early years of the 20th, but let me remind you that these developments began in what may seem to be a paradox. As pointed out above, we had two distinct laws of behavior in classical physics, and one never trampled on the other. We either had particles, and they were discrete objects, or we had fields, and they were continuous objects distributed throughout space. The fields could be attenuated as much as you want - a radio wave, as we travel out through space, becomes weaker and weaker and weaker in a perfectly continuous way. A particle, on the other hand, has discrete properties which it carries with it. And so the remarkable thing was the discovery, in investigations of various atomic phenomena, of an apparent paradox. Light, for example, was known, from various interference phenomena, to possess properties typical of waves, which are field phenomena; and light is spread out through space in a characteristic field way. When performing suitable experiments, one now found that these light waves appeared to acquire particle-like properties under certain experimental conditions. Unlike the classical notion of a light wave, which suggests that its energy is simply distributed continuously throughout the area that it occupies, light exhibited the ability to transfer definite and discrete amounts of energy, acting then as if it were a particle - first simply called light quantum, but now more commonly photon - whose characteristic, of course, is to have associated with it, in a certain state of motion, a definite energy, a definite momentum. This was the paradox found in the early days of these developments: that light waves exhibit, under certain circumstances, definite particle-like behavior. The two quite distinct classical notions, mutually exclusive as they are, are nevertheless in some sense realized jointly within the microscopic domain. An example is the classic experiment of the photoelectric effect in which light waves, falling upon metals, would transfer energy to electrons and liberate them, whereby a definite amount of energy was absorbed every time, despite the picture of a classical field distributed throughout space in which you might absorb more or less energy, depending upon the accidental circumstances of that particular electron. Here then was light, a characteristic example of a wave or a field phenomenon acting in a particle-like manner. The converse was also true, although the experimental proof of this would have to wait for some 27 years. But at this distance in time, I think we can lump all these things together and say that experimentally, as an important aspect of this same development, the converse was true. Electrons were the characteristic example of
6 Prologue microscopic particles. Electron beams could be produced in evacuated tubes and they would move in straight lines. And when exposed to electric and magnetic fields, they would change their direction just as material bodies were supposed to do. But, nevertheless, under appropriate circumstances, namely when electron beams are scattered from crystalline bodies, one found interference rings which would be typical for the type of wave phenomena that is characteristically associated with a distributed field. In other words, instead of being scattered as material objects would be, electrons moving through a crystal would be scattered in the way that would produce a characteristic interference pattern much as light of a certain wavelength would do. Here then were objects originally thought to be essentially classical particles which, under new experimental conditions, would exhibit continuum or wave-like phenomena. We had then a remarkable duality in which apparently the same objects could, under some circumstances, act as classical particles; under some other circumstances, they could act as classical waves. And, of course, this was something for which there was no preparation in any other phenomena of physics as they had been known. Meanwhile, the detailed investigation of the properties of atoms, as they were revealed by spectroscopic experiments, had proceeded. The possibility of producing atomic spectra in the suitable circumstances of very thin gases had made it possible to study the behavior of individual atoms. There were attempts to understand the observed spectra in terms of the motion of electrons within the atoms, which ended with a complete failure of classical physics to account for these phenomena. Indeed, the mere existence of atoms and their ability to radiate precise spectral lines is a conflict with classical physics. If we'd accept any picture of an atom as electrons moving around some central nucleus, as Rutherford discovered to be the situation in 1911, then, according to the classical laws of electromagnetism, the accelerated charges in their motions around the nucleus would always continue to radiate until finally they had exhausted all possible energy and would fall into the nucleus. First of all, this meant that atoms were not stable, a quite flagrant violation of simple experience; and more than that, in the course of this process the electrons would radiate spectral lines whose frequencies would change as they got closer and closer to the nucleus, and you would have nothing analogous to the empirical situation of sharp spectral lines, of definite frequencies, characteristic of individual atoms. Clearly, the laws of macroscopic physics failed completely within the microscopic world. In the detailed analysis as it was carried out, primarily in the hands of Niels Bohr, and others of this important Copenhagen School of Physics, it was found that the only successful attempt to understand the properties of atomic spectra consisted of introducing a bold hypothesis: that physical quantities - such as energy and angular momentum, the two most important examples - can only have certain definite values. In classical mechanics, they would
Prologue 7 assume any possible values; classically they are continuous objects. A particle in ordinary life can be given any energy one wishes by simply providing the appropriate amount of energy; and if you set a body into rotation, the angular momentum that this will have can be given any value. There is no particular selected set of values that are natural. But, nevertheless, the analysis of the facts of atomic spectroscopy indicated that the energy values that atoms, or electrons in atoms, could have were not continuous but assumed definite values. This was the only explanation that could be given of the discreteness of spectral lines. And all of the certain definite values of angular momenta are simple multiples of a new natural constant, known as Planck's constant of action, which was first discovered in connection with other attempts to understand particularly significant characteristics of atomic phenomena. So here, then, we had first of all the major break with the phenomena of classical physics: quantities which classically would be given continuous values, now had discrete values. This, in other words, is the general observation of the microscopic world: that the phenomenon of atomicity is all-prevailing. We must account for the very existence of atoms, which after all is not a classical conception. Classically, there should be no limit to the extent to which you could subdivide matter. The fact that this subdivision cannot be carried out indefinitely, but ceases when we reach the atomic scale is, of course, the most fundamental statement that something new is involved. Here is the phenomenon of atomicity, not only in the mere existence of atoms but also in the laws of mechanical motion: an atomicity of angular momentum, an atomicity of action, to put it in the most general way, was a basic phenomenon of microscopic physics. And we simply had to accept - I say "we" but, of course, I was not involved at the time; there is, nevertheless, the feeling of kinship here - that the physical properties of atoms must be understood in terms of new laws which transcended anything that was familiar before. This was a new world. Beyond this phenomenon of atomicity, which I mark as the one basic fact, the new fact of microscopic physics, there is another one which appears at the same time. This is the essential statistical nature of microscopic phenomena - another fundamental feature which must be accepted as the way that the microscopic world operates. That the microscopic world is necessarily statistical can be understood in the example mentioned above of the diffraction phenomenon that an electron beam, interacting with a crystal, exhibits. Now, let's think about how this diffraction phenomenon would come about. A crystal is a regular arrangement of atoms with a characteristic distance separating them and, by virtue of this characteristic dimension, for any wave phenomenon that falls upon the crystal, there will be certain preferred definite directions of scattering, selected by the relation of the wavelength to the interatomic spacing of the crystal. (This phenomenon is, for example, well known in the case of X-rays - in fact, its demonstration represented one of the experimental proofs that X-rays are wave phenomena.) So. when we
8 Prologue carry out an experiment in which a beam of electrons falls upon this crystal, and then moves in various directions toward a screen, it will produce a characteristic interference pattern there; instead of the electrons arriving more or less at random, with a uniform intensity all over this screen, you will find preferred places. Now, if I make the beam so weak that one electron within a perfectly definite time interval moves through this crystal in some way, it will finally be detected by landing in a perfectly definite place on this screen. This may be a scintillation screen, for example, and if you have a weak beam of electrons and you look at the screen, you'll suddenly see a flash of light, not all over the screen but in one place. The electron exhibits its particle-like characteristics when it is finally detected, when it finally exhibits its position by producing an appropriate chemical process, the result of which is a flash of light. And we observe that the electron is here. For one electron, there is certainly no interference pattern. The interference pattern does not appear all at once. You have simply an individual electron. Now, a second electron arrives in the course of this very weak beam. What happens to it? Does it land at the same spot? No, it lands at random, at some other point that has no relation to the first. But as we continue, and more and more electrons land upon this scintillation screen - all coming under the same experimental conditions, but each arriving independently of the others - eventually the pattern of this interference behavior emerges. Many electrons land here, none land there, and so on, until the final picture of intensity is one which gives the overall pattern. But nevertheless, this has come about, as the result of the random landing of the electrons at various points on the screen. To emphasize another aspect, now suppose we had carried through such an experiment and ten billion electrons had finally produced a certain interference pattern, a picture of intensity of relative number. Then I repeat the experiment. I prepare exactly the same circumstances, and I turn on the electron gun and see what happens. Again, the first electron moves through the crystal. Will it land at the same place as the first electron of the previous run? Certainly not. We begin all over, but the pattern as a whole repeats itself in a random way. The first electron of the second experiment will land somewhere; the second electron of the repeat experiment will land somewhere else - with no relation between them or the electrons of the first experiment. In other words, the individual particles arrive in a perfectly random statistical way. There is no possibility of controlling this. This is, of course, a generalization from half a century of attempts to do so: the picture which we must accept is that, within the domain of microscopic phenomena, we are unable to control how the individual particles will behave. But what is perfectly determined, and will be reproduced every time you repeat the experiment, is the interference pattern; the characteristic features of the pattern are predictable and reproducible. Once we have this apparent duality of entities - electrons that behave under some circumstances like discrete entities, particles, landing at
Prologue 9 definite places on that screen, but in other respects act as waves, producing in their overall intensity characteristics the wave phenomenon of interference - we must accept that the interference pattern is not going to be repeated in miniature every time an electron lands; and that, therefore, there must be an aspect of randomness about where the electrons do land. The interference pattern is merely finally the statement of relative probabilities: with endless repetitions, you will find many more particles there than here - and this in a perfectly regular way. It is then, in a sense, almost an automatic inference of everything that we have said, but I would prefer to take it as really the more fundamental thing: it is a basic characteristic of the laws of microscopic phenomena that they are statistical. It is not possible to predict, in general, the outcome of a specific event. But what one can predict is the average result, the statistical result, the net situation for the repetition, a sufficiently large number of times, of the same experiment - and it is the purpose of microscopic physics or quantum mechanics to make such predictions. So here, then, we have the great challenge of microscopic physics - that there are these two new basic aspects which we must incorporate into a world picture: the fact that phenomena are atomistic and that they are statistical. But that the new physics is statistical and therefore fundamentally different from the fully deterministic classical physics does not mean that we have failed. We simply recognize what the nature of this new microscopic physics must be. It is not to predict the outcome of each individual event. It is to predict rather what the outcome must be on average; what the probable outcome must be. And as the electron-scattering experiment - or rather this simple-minded description of the experiment - indicates, this is something that we necessarily must put up with. In fact, as we should perhaps mention here, if we attempt in any way to control precisely where the first electron shall land, we can indeed do that; we can produce a new experimental situation in which, with essential certainty, the electrons will land at the pre-assigned spot - all of them. But then we shall have no interference pattern. In other words, there are now two situations that we're talking about. The first, in which an interference pattern does appear, is one definite experimental situation, and in that it is not possible to predict or control in any way where the individual electrons will appear. Then there is the second experimental situation, in which we can control and predict where the electrons will appear in the course of moving through some apparatus. It will be a different apparatus and that apparatus could never produce an interference pattern. We are dealing, so to speak, with two distinct aspects of the microscopic world and it requires a different experimental situation to display one or the other. Let's think now of how - once having recognized that we have two basic features of the microscopic world, the aspects of atomicity and of the statis-
10 Prologue tical nature of microscopic events - we could proceed to construct a theory that would incorporate this very bizarre situation. We must have a mathematical theory which, in some way, will represent a suitable mathematical model or idealization and enable us to predict in a coherent way - in much the same manner as physics has always done - what the outcome of experiments will be if we are given correctly all the conditions that fully characterize the nature of the experiment. To see what we have to do, I think we must go back and think a little more consciously of some of the fundamental principles (call them philosophical, if you like) which underlie classical physics - or shall I say macroscopic physics, because that's now the distinction. I think specifically of the theory of measurement. And here, of course, we have to recognize the fundamental philosophical conception that physics is an experimental science; it is concerned only with those statements which in some sense can be verified by an experiment. The purpose of the theory is to provide a unification, a codification, or however you want to say it, of those results which can be tested by means of some experiment. Therefore, what is fundamental to any theory of a specific department of nature is the theory of measurement within that domain. Now, what was characteristic of the theory of measurement in the macroscopic classical physics? Well, the essential thing that was basic to it was the conception of a non-disturbing measurement. It is, of course, perfectly obvious to anyone who has ever come near a physics laboratory that, in the process of a measurement, aimed at gaining information about a particular object, we must interact with it physically in some way. But nevertheless we would like to be able to idealize that interaction in such a way that we could meaningfully state what that property would be as though the interaction did not occur. (You may take as the simplest model the insertion of a thermometer into a body of water with the objective of determining the temperature of the water - ideally as it would be without any disturbance by means of the thermometer. Without the presence of the thermometer, however, there is no means to determine what that temperature is.) As you know, whenever the interaction occurs there must be some disturbance as a net effect of that interaction with the object in question. (The insertion of a thermometer into a pail of water changes the mass of the water. It will change the temperature that is to be measured in some way.) But what is characteristic of classical physics is that we can state, and correctly so, that it is meaningful to talk of an idealization in which that interaction can be made as small as we please without, however, it becoming zero; because if it is zero, we have no means of gaining information. That the interaction is so small that it does not disturb the object of interest, is in fact not always necessary, nor is it always possible. For example, some measurements might represent chemical changes, which are large alterations in the nature of the substance, and these arc certainly not negligible
Prologue 11 disturbances. It is here that a second aspect of macroscopic physics comes into play. Since classical physics is causal and deterministic, we can calculate as accurately as we please and correct for the effect of these unavoidable disturbances. It is indeed familiar that in classical physics any measurement has a theoretical description associated with it, which represents the recognition of what the disturbance has been and the calculation of how to correct for it in order, therefore, to come back to what an idealized non-disturbing measurement would be. So, for the record: the two basic features of measurements in classical physics are that we can either make the interaction so small that there is a negligible disturbance; or, in a particular experimental circumstance, by the nature of the experiment we wish to perform, if we cannot make the disturbance arbitrarily small, we can calculate the effect of that disturbance and compensate for it with arbitrary precision. This is, simply stated, the theory of classical measurement; and associated with it is the idea that there is then no limit to the accuracy with which we could make measurements simultaneously of any number of physical properties, as in the very statement of the concept of state, for example, in Newtonian physics. When I assert that the state is the specification of the positions and the momenta of all the particles in question, then implicit in that is the assumption, consistent with the whole scheme, that in fact I can carry out the measurements necessary to give the numerical values that those quantities have at every time. And similar remarks apply to measuring the distribution of electromagnetic fields throughout all of space. In summary then, the classical theory of measurement says that there is no limit to the accuracy with which we can assign numerical values to all the quantities that are needed to specify the state, and since all of these are deterministic theories, that means to all physical properties at once. Since physical properties can be assigned numerical values consistently, one has never in classical physics drawn any distinction between the physical properties and the numerical values which they have at any particular time. In classical physics, we are always able to assign to the physical properties, considered as an abstract thing, a very concrete representation by means of numerical values which a non-disturbing measurement would find for them at a particular time. This restates the foundations of classical physics: the idealization of non- disturbing measurements and the corresponding foundations of the mathematical representation, the consequent identification of physical properties with numbers because nothing stands in the way of the continual assignment of numerical values to these physical properties. Now, by contrast, what is the situation in microscopic physics? Drawing upon the vast body of experimental data, accumulated over the course of several decades, I have summarized above the properties of microscopic physics - or, if you like, of microscopic measurements - under these two basic headings: atomicity and the statistical nature of the phenomena.
12 Prologue What does this mean? First of all, atomicity: atomicity means that the microscopic entities have many of their properties carried in certain basic units. There is no half of an electron. The electron has a definite mass; it has a definite charge. If the interactions that I am concerned with are electrostatic in nature, I cannot reduce them arbitrarily in strength because there is no half of a unit of charge. This indicates to you immediately, I think, the basic difference between the laws of microscopic measurement and macroscopic measurement. I must take into account the fact that the strength of the interaction - which must be present if I am to talk of measurement at all and, therefore, talk meaningfully of physical phenomena - cannot in general be made arbitrarily small because the physical objects that interact (the atoms, the electrons) in general have relevant physical properties which come in certain units - quanta, the origin of the name of the subject that we are discussing: quantum mechanics. Now, this might seem as though it were not an unsurmountable difficulty. We recognized, even in classical physics, that there might be circumstances in which the act of measurement produced definite disturbances that we could not minimize because of the particular kind of measurement we carried out. In classical physics, we said the situation may be such that the measurement interaction is very strong and cannot be made arbitrarily weak, but this does not upset the underlying philosophy of measurement because I can calculate with arbitrary precision what the effect of that interaction was and compensate for it, correct for it. Can I still do that now, in the realm of atomic measurements? The answer is no, because this is where the second fundamental aspect of microscopic measurement comes into play; namely, the phenomenon of statistics, the fact that we cannot predict in detail what each individual event will do but only make predictions on an average or statistical scale. The measurement act involves a strong interaction - I repeat: on the microscopic scale it is necessarily strong because we cannot cut the strengths of the charges in half; we cannot change the properties of these fundamental particles; we must accept them as they are - and so the measurement unavoidably produces a large disturbance, which we cannot correct for in each individual instance, for we cannot control what happens in each individual event in any detail. We can only predict or control what happens on the average, never in any individual instance. Therefore, the program of computing what the effect of the disturbance was and correcting for it is, in general, impossible. Accordingly, the two basic tenets of the theory of macroscopic measurement are both violated. Either the interactions cannot be made arbitrarily weak because of the phenomenon of atomicity, or if we wish to accept this and correct for it, we cannot do so because we do not have a detailed, deterministic theory of each individual event; we have only the ability to anticipate or control the statistical average.
Prologue 13 So, here then is the general implication from the mass of experimental data that for microscopic physics, if we are to construct a theory, we need a whole new theory of microscopic measurement, and to go with this we need a whole new scheme of mathematics, which is to say that we can no longer speak meaningfully of the numerical values that physical properties have at a given time. Put differently, I wish to point out that the failure of these fundamental assumptions means equally well a failure of the ability to represent physical phenomena in the microscopic realm by numbers which change in time as we do in the macroscopic or classical domain. Something of an entirely different mathematical nature is needed, such that it represents, or mimics, the basic facts of microscopic measurement. To emphasize the relevant point, I may say this. Macroscopically, we can measure one physical property; we assign a number to it. We measure a second physical property; we assign a number to it. We can then speak of this pair of numbers as the values of this pair of physical properties at a given time. There is no contradiction here. We can perfectly well go back and check that the first property has still the same value it had before if we could, in an idealized way, carry out these measurements rapidly enough (or regenerate the physical circumstances in such a way that we could repeat the first measurement). By contrast, suppose we have indeed succeeded in measuring in some way one physical property of an atomic system. Now we go on to make a measurement of the second physical property. That measurement necessarily will involve an interaction, the strength of which is not arbitrarily weak and the effect of which is not controllable, in such a way that it will, in general, produce changes in the physical circumstances that specify the conditions of the first measurement. In other words, the system that is being measured is disturbed in an uncontrollable way in such a manner that if we now went back and asked for the value of the first physical property, checking to see that it still had the same value as before, we would now find not at all the same value but a random assortment of all the possible values that it could assume, with various probabilities that depend in detail upon precisely what we have done. This is so because the second measurement has introduced a new physical situation; the interaction of the second measurement has disturbed the physical system of interest so violently that we have no way of knowing, except under very special circumstances, whether the system has been left in precisely the same physical situation that would enable us to say that the first physical property still has the same value it had before the second measurement. In other words, once we recognize that the act of measurement introduces in the object of measurement changes which are not arbitrarily small, and which cannot be precisely controlled, then we must acknowledge that every time we make a measurement we introduce a new physical situation that is essentially different from the situation before the measurement.
14 Prologue So, if you measure two physical properties in one order, and then the other - which classically, of course, would make absolutely no difference - these are simply two different experiments in the microscopic realm. You have two different physical situations which come about depending upon whether you first measure property A and then property B, two successions of disturbances which have this microscopic character, or do it in the reverse order, first B and then A, which involves entirely different disturbances. And since the final physical situation depends crucially upon the order in which the microscopic measurements are performed, it is, in general, no longer possible to say that property A and property B have these values. That would only have meaning if you could get the same numerical values no matter in which order the measurement was carried out. Exceptional situations aside, in the "first A then B" order of measurements only B emerges with a known value, and only A in the "first B then A" order. Therefore, the mathematical scheme for microscopic measurements can certainly not be the representation of physical properties by numbers, because numbers do not have this property of depending upon the order in which the measurements are carried out. The assignment of a pair of numbers to two physical properties introduces no sense of order, no sense of sequence. We must instead look for a new mathematical scheme in which the order of performing physical operations is represented by an order of performance of mathematical operations. The mathematical scheme that was finally found to be necessary and successful is the representation, in a very abstract way, of physical properties not by numbers but by elements of an algebra for which the sense of multiplication matters. In other words, the multiplication of these algebraic symbols was found to be the proper counterpart of the successive performance of measurements: that the order of measurements is significant, as a consequence of the unavoidable disturbances, is reflected in that, correspondingly, the sense of multiplication of these symbols must be significant. And so we are led to a much more sophisticated and deep mathematical scheme in which physical properties are set into correspondence with the non- commutative elements of an algebra or, as they often are referred to, non- commutative operators as compared to the very elementary representation of physical properties by numbers. And with every physical state - the idea of state reoccurs - we associate a vector in a suitable abstract space on which these operators act. As a result of all this, a very beautiful mathematical scheme has emerged which gives a wonderful account of all of these seemingly bizarre and incomprehensible facts of microscopic physics. This symbolization of atomic measurements is quantum mechanics, developed by Heisenberg, Born, Schrodinger and others, essentially in the years 1925 to 1927, still very distant from our present point of view.
Prologue 15 Let me now describe, within the same general framework, what the nature of quantum mechanics is. It is still a causal theory. Given the state at one time, the state at any other time is uniquely determined, but what makes it different is that it is not a deterministic theory. It is a causal, statistically deterministic theory. The knowledge of the state at one time fixes the state at another time, but what information is obtainable from this knowledge of the state? Recall that classically, if you knew the state, you knew everything; if you knew where the particles were and how they were moving, you could predict any other physical property you happened to be interested in with arbitrary precision. But, as we have just said, arbitrary precision of individual predictions cannot exist in the microscopic world. Nevertheless, in a science of observation we must be able to make precise predictions - and we are, only that in quantum mechanics these precise predictions are of a statistical nature. The knowledge of the state enables you to predict the statistical, the average outcome of the measurement of any physical property, but never the result of any specific event. In other words, if you know the state, you can then predict what the result of repeated trials of measurement of a particular physical property will be. You will have perfectly determinate, statistical predictions but no longer individual predictions. I repeat: the causal connections between states at different times is still present. (This seems to be fundamental in any branch of physics as we know it.) But what has changed drastically is that the knowledge of the state does not imply a detailed knowledge of every physical property but merely, in general, of what the average or statistical behavior of physical properties may be. This, in a sense, is the final understanding of these remarkable apparent paradoxes in the earlier developments of the theory. They are now resolved in terms of this statistical determinate rather than individually determinate theory. I have spoken of states but have in no way indicated how a state is to be defined. The answer to this can be given if we think of a model of a physical system which still comes very close to classical models. For example, we began by thinking that atoms were to be understood simply by electrons (small material bodies, each carrying a unit of negative electric charge) moving in a certain definite field of force, the Coulomb force field of attraction of the positively charged nucleus. Here is a situation which seems to fit the Newtonian mold; a definite law of force, a finite definite number of material bodies. What failed was not that the dynamical picture was not correct but that the laws of microscopic physics were different. They were not such as to permit a detailed, deterministic prediction but they have this character of the statistically deterministic theory. We have a physical model which is classical in picture. When we describe an atom, we say how many electrons there are and what the nuclear charge is and the picture is still classical, at least as far as our minds are concerned. What is very different is how we go about calculating. For simplicity, think of
16 Prologue a hydrogen atom where there is only one electron. So, here is one electron and the electron has associated with it physical properties of position and physical properties of momenta, and classically we would say that there is no limit to the accuracy with which we could measure those positions and momenta simultaneously. But the distillation of what physicists have learned throughout this line of development that culminated eventually in the mathematical scheme of quantum mechanics, is that this is not the proper definition of a state of an electron in an atom. The best we can do to specify a state is not to assign numerical values simultaneously to all of these classical properties of position and momenta but to only half of them. We can, in fact, produce experimental situations in which we know precisely where an electron is. It lands on a scintillation screen, and the flash of light reveals essentially the position of the electron, or we can produce experimental situations in which we know precisely what the momentum of the particle is. That, in fact, is what I had in mind when I described this beam experiment. Here is the particle, moving in a definite direction with a definite speed. And, having a definite mass, that means I know the momentum; but when I know the momentum, in a sense I cannot know where the position is, and the appearance of the interference pattern formed by the random falling of the particles on the screen is the sign of that fact. On the other hand, I could produce a very different experimental situation in which I arrange matters so that the electrons always land at a pre-chosen site. Then I'd have a position measurement, and I can predict precisely what the result of such a measurement would be. We will then never have the interference pattern that is characteristic of the very different physical situation in which the momenta are perfectly definite. What has changed basically is the nature of a state. If we have a certain number of particles - electrons, for example - the specification of the quantum state amounts to telling where the particles are at a given time or, alternatively, how they are moving at a given time, but never both together. So, indeed, by comparison with what would be a full specification of state in classical physics - where the electrons are and how they are moving - in quantum mechanics the state is specified by telling, with arbitrary precision, what the result of the measurement would be of half of those properties. But then we are completely incapable of predicting the individual values of the other half. They will then simply have random probability distributions. If you make measurements again and again on this state, about which you know precisely the positions, you will always find that position, of course, comes out precisely as it should. But if, on that state, you make momentum measurements, you will never find a definite value, you will find a random statistical distribution. And the more precisely you specify the position, the wider will be this momentum distribution. This simple situation is essentially a statement of what is perhaps the widest philosophical principle that has emerged from these studies of micro-
Prologue 17 scopic physics. This is what is known as Bohr's principle of complementarity. We have used, and shall continue to use, the example of the so-called wave- particle duality, tied to the names of Einstein and de Broglie, for the illustration of complementarity. The general development, however, establishes that wave-particle duality is just one consequence of the fundamental complement arity. So, by the idea of complementarity we mean the final unification, within these general principles, of what began by seeming to be a paradox. Electrons, under certain perfectly definite experimental situations, act as particles would, and under other experimental situations, they act as waves - this is, so to speak, what we have expressed now in a more precise way using the insight that the definition of state never refers to all of these physical properties but to only half of them. You have the privilege of designing experiments in which different choices are made as to which will be the physical properties whose values are precisely known. Waves, for instance, represent the option, the choice on the part of the experimenter, to produce an experimental situation in which the momenta are selected to have definite values, the positions can then not be controlled. Bohr's principle of complementarity is the statement that we have in microscopic physics first of all a new world (that's the important thing to recognize), in which classical analogies fail. But, nevertheless, there are certain situations in which analogies of a classical nature do hold — situations, for example, in which it is possible to speak meaningfully of particles with regard to certain circumstances and certain measurements; and other situations where electrons, or what have you, can be spoken of as waves. Two distinct classical pictures can hold under different physical situations, never simultaneously, and the applicability of one picture prevents the applicability of the other - the two classical analogies are mutually exclusive. But both pictures are on the same footing. We can produce experimental situations in which either classical picture can be applied, and the other is then inapplicable. The idea of complementarity is that a full understanding of this microscopic world comes only from the possibility of applying both pictures; neither in itself is complete. Both must be present, but when one is applied, the other is excluded. This is, in essence, the entirely new situation which has no counterpart in any of the classical philosophical modes of thought. It is something that simply must be accepted. At least all physicists have accepted it - it is essentially the way in which the laws of microscopic physics have been understood, and as the result of which the enormously successful development of quantum mechanics has arisen. Within the space of a few years, the application of these understandings in microscopic phenomena has completely swept away what was traditionally regarded as the great classical problems of physics. At least in principle, if not in practice, a reduction of chemistry to physics has been brought about. The understanding of all of the various diverse properties of
18 Prologue matter in all its forms under all ordinary circumstances is reduced to a few simple facts. The laws of quantum mechanics and the specification of which particular configuration you happen to be talking about is all you need - in principle at least, and with great success in practice - to understand the properties of material bodies, as indicated by the enormous developments of the underlying theory of the solid state and many other applications. All of these are in large measure the expression of the understanding of the laws of atomic physics that have been codified - and unified - in quantum mechanics. In a fundamental sense, this was completed in 1927. It is by no means the end, however, of physicists' investigation of the physical world. The development has gone on and gone on in the direction of looking for entirely new realms of physical experience within the domain of higher energies, smaller distances. We have spoken again and again of the atom, but in the center of the atom is the nucleus, and within the nucleus are the nucleons, and as we now understand, the nucleons are made up of other still more fundamental primordial entities. The search goes on. To understand a little bit in what language this development is continued, I must come back to an idea that I mentioned earlier: the notion of the quantized field, because here we have perhaps the deepest expression of what has been learned within the framework of these microscopic phenomena. Let me introduce this in terms of another basic philosophical idea which is given an entirely new turn within the phenomena of microscopic physics. This is the concept of identity or indistinguishability. It is, of course, perfectly clear to you that when we speak even in classical physics of electrons that mere terminology indicates that we understand that one electron is just like another. If we measure any of the fundamental non-accidental properties of this electron, or that electron - its mass, its charge, and whatever more sophisticated properties you may be concerned with - these are invariably the same. This is, if you like, the fundamental conception of the uniformity of nature without which physics could never begin to operate. We must assume that one sample of a particular substance is like any other sample if no relevant circumstances are involved. So, if we take any two electrons, describing them classically as we might have if we had two beams of electrons moving in some evacuated chamber, then we understand that one electron is just like another, and in classical physics there are no particular implications in this, because despite the identity, the indiscernability in principle of the two electrons, in classical physics we are never in any difficulty of being able to specifically distinguish them. We can say this electron originated from this region of space; the second electron from that region of space. Classically, I am able to follow in detail the trajectories of these particles, and I can at every instance say precisely where this particular electron came from and trace its path continuously to the point of origin. No matter how these electrons may interact in some com-
Prologue 19 plicated way within a vacuum chamber or a radio tube, for example, I could always identify - in principle at least, thereby using nothing but the classical laws of physics - at any stage precisely which each electron is which. Now, I think you may see immediately that the situation must be very different when we recognize the reality of the microscopic world which is not governed by those classical laws. Consider, for example, a collision experiment. Suppose I have two beams of electrons. (You could make them protons just as well. It is, of course, important to realize that the laws of nature which I am speaking about govern all of the various manifestations of matter. If I speak of electrons, that is historical convention. Those could be protons, neutrons, hyperons, 71 mesons, the laws of physics are the same.) If the electrons are to collide, they must come into intimate interaction. Of course, what they do not do now is carry out, as they would do classically, a detailed trajectory because for that to be meaningful, I must be able to always, without disturbing the nature of this experiment, check precisely what a particular electron is doing at every instant of time. That calls for a degree of control, and thus of determinism, that is impossible in the microscopic world. If I have produced an experimental situation in which the particles head toward each other, then they have rather definite momenta. Then, as I have tried to suggest, the complementary physical measurements no longer can be specified in detail. I have no way of knowing precisely where these electrons are, and can give no meaning to such statements about their positions. And when finally these beams separate, so I no longer have any doubt about which is which, I have no right or ability to tell whether this electron is that one - or that one - because I have not been able to follow in detail precisely what has happened. In other words, these basic physical phenomena, the atomicity, the statistical nature of things, the inability to control in detail individual events, imply correspondingly the absence of an ability in the fundamental experimental sense to tell which particle is which at every stage of the interaction. This requires, therefore, that my description must take into account this fundamental failure of being able to place a tag on every particle. I repeat: there is no experiment I can perform that gives reality to that label because to do so would represent an intervention into this experiment, the performance of a detailed microscopic localization experiment, which would change completely the nature of the collision experiment. It would no longer be a simple collision, there would be something else there, equally important, interacting with these particles. It is a different experiment. And this is the whole point, of course: the recognition that I must indicate in detail precisely every experiment I have in mind because every measurement I wish to perform changes the conditions of the experiment, produces a strong, non-controllable interference in the other things that are going on. The net result of all of this is the recognition that the description of the states of several particles necessarily can only be done in a way which incorporates from the very beginning the fact that they are indistinguishable,
20 Prologue that particular labels have no significance. This simply means that when we have several indistinguishable particles, the states can only be described in a way that is perfectly symmetrical among all the particles that contribute to it. Now the word "symmetrical" is to be understood in a more general sense. The traditional manner in which states are specified is in terms of numerical quantities that are known as wave functions. I spoke of vectors, state vectors, which are abstract entities, and wave functions are particular numerical representations of these abstract vectors. I have several identical particles and the wave function is a function of their positions, for example. The only description that is tenable is one in which this wave function is completely symmetrical among all these positions, so that all the particles are on exactly the same footing. The arbitrary labels - which could say this particle is the first, that particle is the second, and so on - must be deprived of any distinguishing significance by insisting that, no matter how the labels are assigned, the wave function of the state is the same. The states themselves must remain unaltered if I decide to distribute the arbitrary labels differently among the identical particles. But this can be ensured either by making the states completely symmetrical or completely antisymmetrical - the more general sense of "symmetrical" referred to above. In this way we recognize the existence of two very different types of systems of identical particles. In fact, of the two basic examples we have used, the photon and the electron, the detailed properties of physical phenomena show that the photon belongs to this class of symmetrical states, and the electron falls into the framework of antisymmetrical states. Let's take a closer look at those wave functions, beginning with just a single particle. Its wave function is simply a function of its position - or, if you like, of the three coordinates that specify the position - and of time. Here you might say that I have, in a sense, a field; a field is after all physically a distributed object in space and time. This is very much analogous, as far as this space-time structure is concerned, to the electromagnetic field of Maxwell's theory; and indeed it might seem that a wave function is just a field of some other kind. But, as soon as we deal with several particles, the wave function cannot be a field in the conventional sense, because it is now the function of two, three, or more positions (and time). In other words, here I have rather a much more abstract, multidimensional configuration space, and the multiparticle wave function is a way of indicating what several particles are doing at once in a highly hypothetical mathematical space, which is very remote from ordinary three-dimensional space because there are many three- dimensional spaces now being considered at the same time. So, we have states of single particles, two particles, three particles, and so on, and clearly the picture is getting ever more complicated. An enormously fruitful change of perspective begins by recognizing that there is also a no- particle state - the vacuum. States with just one particle can then be thought
Prologue 21 of as brought about by creating a particle. First you have the vacuum, then you create a particle and you have a one-particle state. You create another particle and have a two-particle state - then a third, a fourth particle, and so on. At the moment this is nothing but imagination, idealized, abstract creation processes that, in a sense, just make book-keeping easier. What makes it useful and important is that I can now deal with experimental situations with varying numbers of particles - situations in which particles are physically created and annihilated. In the mathematical scheme, then, we describe the creation by the application of a creation operator - apply it, for instance, to a state with five particles and the result is a state of six particles. It is an operator, rather than a numerical quantity, because it symbolizes a physical property that is something beyond what we are accustomed to thinking of, and also because it acts on a state to produce another one with one more particle. So, I will imagine, for example, that I apply the creation operator to the vacuum state and create particle 1 at the time t; I will then create particle 2 at the time t (or any number of additional particles by the repeated action of this creation operator, but let's have just two right now). In this manner, I produce something which has, essentially, the nature of a two-particle wave function which involves both positions and the time t. And the important thing is that these requirements of symmetry (or antisymmetry) on this wave function can now be converted into algebraic statements on these operators. If the situation is one of symmetry, as in the case of photons, then I assert that whether the creation operators are multiplied in one order or the other, the result is the same; this independence of the multiplication order produces symmetry. If it is antisymmetry, as in the case of electrons, I assert that if they are multiplied in one order, the result is the negative of what occurs when they are multiplied in the reverse order. In other words, the properties of these two classes of identical particles - or statistics as it usually is referred to - becomes replaced by an algebraic property of these operators. In short, we are led, instead of talking about a system of a definite number of particles, to think of physical systems with an indefinite number of particles because we can produce whatever number we are interested in by the application of this creation operator. We now then, so to speak, transfer our attention to this operator as the basic physical object. And this is what I mean by the quantized field: it is on the one hand a field, a mathematical quantity which varies continuously in time and space; on the other hand, it is certainly not a classical field because these operators are not physical quantities which can be measured simultaneously. In the operator character, in the fact that the sense of multiplication is significant (in a much deeper way than I can describe it here) we have the elements of discontinuity which is the essence of the particle concept. The two entirely unrelated classical conceptions of discreteness of particles, of continuity of the field, are now unified in this entirely new conception
22 Prologue of the quantized field - more than unified: transcended because the new conception is beyond either. The two are, after all, incompatible in the classical sense because there is nothing that is both discrete and continuous. We have arrived at something which has neither of those properties but which, in limited domains, can be characterized in terms of either of these conventional concepts. So, here is the fundamental unification, brought about by this idea of the quantized field. I have emphasized how the quantized field enables us to speak meaningfully of creation acts, but it should be obvious that, correspondingly, the inverse processes of annihilation, in which particles get destroyed at various points in space and time, are equally accounted for by the quantized field. As it arose historically, the quantized field was merely a convenient way of summarizing the mathematical properties of indistinguishable particles, but soon, through the ever-broadening developments of experimental science, what was conceived of simply as a mathematical idealization became reality. I am referring to the ability, as enough energy was available, to supply the rest energy of particles according to the famous Einstein relation E = mc2: given the energy E that corresponds to its mass m, a physical particle can be produced. (It may be necessary for other reasons, such as the conservation of electric charge, to produce them in pairs, as is in fact the case for the negatively charged electron and its positively charged counterpart, the positron.) By the early 1930s, these experiments had been performed. In later years, pairs of protons and antiprotons, of neutrons and antineutrons were produced for which vastly much greater amounts of energy are required, and by now experiments have created (pairs of) most of the other particles known to be the building blocks of nature. These experiments alone then give physical reality to the quantized field, in its interpretation of symbolizing acts of creation and destruction. But now, in the course of the development, it soon became realized that the moment you have introduced the quantized field in direct association with particles as we know them, inevitably this situation could not persist. A new level of abstraction had to be reached and was reached. It occurred essentially during the late 1940s and the 1950s while attempting, as demanded by the refinement of experimental data, to understand better the properties of atomic phenomena that were successfully accounted for in the first flush of the development of quantum mechanics. The experiments went on; more and more refined properties became known; further and sharper applications of the theory were required. For example, in the case of electrons in atoms, from the field point of view, we are really concerned with the dynamics of two fields. There is the quantized field which is associated with the electrons and also their counterpart, the positrons; there is the quantized field which is associated with the electromagnetic field, to use its classical name, the field
Prologue 23 of photons. The photon field and the electron field are in interaction. And as a consequence, the identification of each field by these physical names is only an approximate one. Only if the interaction between the two fields is weak, as to a large extent it is in this example, can we use physical names in relation to these mathematical objects. But in a more refined theory in which the interaction between them must now be taken into account, we have to recognize that what we call physically an electron is only partially to be associated with that electron field alone. It is also partially to be associated with the photon field, because the two are in interaction. Physically, an electron can sometimes radiate a photon; it can then also reabsorb it. In other words, what we physically call an electron would, at a deeper level, be described as sometimes the action of the electron field only, but another fraction of the total history also involves the action of the photon field. And conversely, what we call a photon, propagating through empty space, is not merely the result of the creation act of an analogous photon field, because . the photon can occasionally materialize itself in space and become replaced by an electron and a positron which then, in the course of time, recombine to reform the photon. In other words, the physical object we call the photon is not what is created all the time by applying the mathematical photon-creation operator to the vacuum state. The other operators, the quantities that represent the creation of electrons and positrons, also come into play. Once you recognize this, we draw a distinction between two levels of physical description. There is the phenomenological level at which we recognize the properties of electrons and photons as we see them, out of, of course, the enormously detailed analysis of microscopic experiments. Then there is the attempt to deepen the understanding in terms of more primitive objects which are these quantized fields, which are no longer placed in immediate correspondence with the phenomenological particles but through a chain of dynamical development. And, in fact, this program of rcnormalization, to mention the technical term, as it was applied specifically to the case of electrons and photons, through the development of what is called quantum electrodynamics, led to a description on a more fundamental level of some of the finer features of electron and photon behavior. What were once considered to be anomalies, things that were unexpected, became the predicted outcome of this deepened understanding. The lesson was learned that our most profound understanding is not to be found in terms of what we actually see but at a more fundamental level. So it has always gone throughout the history of physics. We begin with atoms as fundamental objects and then we attempt to understand the properties of atoms in terms of electrons and a nucleus, which is taken as unan- alyzable. Then we move down to the level of the nucleus, and analyze it in terms of the properties of nucleons and so on and on. In very simple terms, this is the conception of how we go about it in terms of smaller and smaller particles, smaller and smaller regions of space.
24 Prologue The analysis of particles as we know them and as we associate them with fields is an attempt at understanding, at a deeper level, that strives for a simplification in terms of yet more fundamental quantized fields which have fewer properties, that strives for deeper, more symbolic laws with fewer arbitrary constants. For example, unlike the experimental situation in which the charge of the electron, the mass of the electron, the magnetic moment of the electron, are all unrelated constants, the deeper understanding attempts to explain one or more of these in terms of a fewer number of fundamental things. So it has gone in the case of quantum electrodynamics. This has been a very successful application of this idea that it is the quantized field conception which is the statement of our deepest level of understanding of microscopic phenomena. But as I mentioned, and it must be familiar to you to some extent, in the course of the development of higher and higher energy machines, more and more particles have become known and these have appeared in a bewildering array of properties. Some of them are stable; some of them are unstable. They decay into each other in all possible conceivable ways. As sufficient energy is available, they are produced very copiously as the result of obviously strong interactions. They then proceed very slowly to die successively, moving down to the final stable particles that we know, which are still the electrons, protons, plus a few others. Now, the interactions which are involved here in these basic studies of nuclear phenomena are of an entirely different kind than the electromagnetic ones. The electromagnetic forces are essentially rather weak. And on the basis of this, one has been able to develop technical methods of handling these interactions. But when one cornea to the very much stronger bonds that not only hold the nucleus together, but hold together the particles that compose the nucleons, we are at a much more difficult level in the sense that not only are the phenomena bewilderingly complicated, but we also lack the mathematical means to draw the implications of any particular hypothesis about what is going on. And, as a result of this, there is doubt about what should be the fundamental nature of an explanation at this level. Should it be the continuation of this point of view of the searching for deeper understanding in terms of ideally a very small number of fundamental" fields, which in their dynamic interplay, and as a result of the complexity of that dynamics, finally bring about the manifold nature of the world as we see it? Or must we really abandon this attempt completely, replacing the difficulties by the anticipation of a fundamental impossibility, and simply describe nature in terms of what happens when we take various microscopic particles and perform experiments on them? We send electrons, protons into the various kinds of nucleons, where we perform experiments in which these particles enter a certain very small region. We make no attempt to describe what goes on there and simply try to finally characterize what emerges when the particles are separated again. Is the purpose of theoretical physics to be
Prologue 25 no more than a cataloging of all the things that can happen when particles interact with each other and separate? Or is it to be an understanding at a deeper level in which there are things that are not directly observable (as the underlying quantized fields are) but in terms of which we shall have a more fundamental understanding? Well, this question - idealized, frankly, beyond all recognition - is in a sense the deep philosophical problem that confronts theoretical physics at the frontier of high-energy physics, where we attempt to understand the structure of matter as it is revealed to us, in all of its complexity, using the ever-rising level of energy that has become available to study the basic building blocks of matter and, in the course of this, to create new kinds of matter.
Part A Fall Quarter: Quantum Kinematics
1. Measurement Algebra 1.1 Stern-Gerlach experiment I presume that all of you have already been exposed to some undergraduate course in Quantum Mechanics, one that leans heavily on de Broglie* waves and the Schrodingert equation. I have never thought that this simple wave approach was acceptable as a general basis for the whole subject, and I intend to move immediately to replace it in your minds by a foundation that is perfectly general. In checking out my impression of undergraduate courses, I happened to glance through a particular elementary textbook and found this statement: The laws of quantum mechanics cannot be derived, any more than can Newton's* laws or Maxwell's^ equations. Ideally, however, one might hope that these laws could be deduced, more or less directly, as the simplest logical consequence of some well-selected set of experiments. Unfortunately, the quantum mechanical description of nature is too abstract to make this possible. Despite that last pessimistic assertion, I propose to present just such an ideal induction (the more accurate term) of the general laws of quantum mechanics from a well-selected set of experiments - indeed, from a single type of experiment. The experiments are atomic beam measurements of magnetic properties, developments of the original experiment carried out by Stern^ and GerlachU in 1922. With the exception of ferromagnets, matter in bulk is unmagnetized in the absence of a magnetic field. A paramagnetic substance acquires a magnetization proportional to an applied magnetic field, which is understood as the field lining up the individual atomic magnetic moments against the disorganizing effect of thermal agitation. This, then, is an indirect measurement of an atomic magnetic moment. The Stern-Gerlach experiment attempts a direct measurement of the atomic moment through the mechanical effect of the field. *Prince Louis-Victor de Broglie (1892-4987) fErwin Schrodinger (1889-1961) *Sir Isaac Newton (1643-1727) §James Clerk Maxwell (1831-1879) 'Otto Stern (1888-1969) !iWalther Gerlach (1889-1979)
30 1. Measurement Algebra A magnetic dipole moment /z in the magnetic field B has the energy —/i ■ B, so that the force on it is F = -V(-n-B{r))=Vn-B{r) . (1.1.1) In the Stern-Gerlach experiment we meet a situation where, at a particular point, B has a single component, Bz, which varies strongly in the z direction. So, the z component of force is Fz = nz (j^Bz\ . (1.1.2) Here is the experimental setup: Ae1 ' • "g 1 1 1 ^ 1 C| —1 1 1 1 M jH Oven Slits Magnet Screen > 2000°C Frontal view of magnet Silver atoms evaporate from the oven and pass through the slits, traversing the magnet at the center point where there is a strong variation of Bz in the z direction. Depending on the value of /j,z for a particular atom, the force on it will deflect the atom up or down and correspondingly deposit it on the screen. Thanks to the high temperature, the Ag atoms certainly have their magnetic moments (m.m.) distributed uniformly in all directions, which is to say that the distribution of \xz should be uniform between the limits /j, and —(j,. So, after the beam has run for a while, the distribution of atoms on the screen should be What did Stern and Gerlach find? This: The atoms were deflected up or down; nothing in between! It is as though the atoms emerging from the oven have already sensed the direction of the field in the magnet and have lined up accordingly. Of course, if you believe that, there's nothing I can do for you. No, we must accept this outcome as an irreducible fact of life and learn to live with it!
1.2 Measurement symbols 31 As a first step, we refine the Stern-Gerlach experiment by dealing with single beams. (In the following we speak of a +m.m. or a —m.m. according as the atom is deflected up or down.) Suppose we stop the —m.m. beam: 1 or the +m.m. beam: +m.m. beam -m.m. beam (1.1.3) (1.1.4) How can we be sure that we have produced a pure +m.m. beam or a pure —m.m. beam? By repeating these selective measurements: 1) J I 1 ■' I 1 " + selector + selector + beam Yes, a + beam. 2) I I I I + selector - selector Nothing Not a -- beam. Pure + beam. (1.1.5) In sequence 1) we look for a +m.m. beam and find it; in sequence 2) we look for a —m.m. beam and do not find it. Together that establishes that the initial +m.m. selection indeed selects +m.m. atoms. Similar pictures apply to an initial —m.m. selection. 1.2 Measurement symbols The Stern-Gerlach experiment using silver atoms is the measurement of a physical quantity, fj,z, that happens to have only two possible values, say +fj, and — n- We now generalize by regarding \xz as just an example of a physical quantity A that has the possible values 01,02,. • .on; a typical value will be designated as a! or a". The specific physical apparatus that measures \xz and selects a particular outcome (say +m.m.), as suggested by then becomes an unspecific apparatus, suggested by
32 1. Measurement Algebra Measure A Select a' all of which carries the implication that a measurement is a physical act occupying a finite region of space (and time). The above box is awkward as a symbol. We simplify it as follows Measure A Select a' i ' ' a a (1.2.1) which retains the implication of a finite region associated with the measurement act. Physical property A is implicit, adequately implied by a'. But why the repetition of a'? First, it prepares the way for a generalization; second, it is a reminder that a selective measurement involves an initial act followed by its verification, as in the sequence 1) of (1.1.5). We also introduce symbols for two particularly simple measurement acts: the unit symbol for the one that does nothing at all - selects everythi without bias, Accept Everything 1 (1.2.2) and the null symbol for the one that rejects everything, accepts nothing: (1.2.3) Accept Nothing 0. A first step toward constructing an algebra for these symbols is made on representing successive acts of measurement - displaced in time - by sequential multiplication of the respective symbols. Thus the generalization of 1) in (1.1.5), which says that the repetition of a selective measurement confirms the measurement is symbolized by I a'a'I \a'a'\ = \a'a'\ (1.2.4) The generalization of 2), that two distinct selection acts end up by selecting nothing, is a'^a": \a'a'\\a"a"\ = 0. (1.2.5) Then, as reasonably obvious statements about multiplication of the measurement symbols 1 and 0, we have |a'a'|l = l|a'a'| = |a'a'| 11 = 1 , 10 = 01 = 0, (1.2.6)
1.2 Measurement symbols 33 and |o'o'|0 = 0|o'a'| =0, 00 = 0. We regress temporarily to note the equivalence Measure A Select a' Measure A — a' Select 0 What then do we mean by (a' / a") Measure {A - a!) {A - a") Select 0 (1.2.7) (1.2.6 (1.2.9) This is an A measurement in which either the outcome a', or a", is accepted without distinction. It is a less selective measurement, which I propose to represent by the addition of the respective symbols: a' £ a" : Measure {A - a') {A - a") Select 0 = \a'a'\ + \a"a"\ (1.2.10) which incorporates the complete symmetry between a' and a". Continuing in this way, a' / a" / a'" / a' : Measure (A - a') (A - a") (A - a"') Select 0 \a'a'\ + \a"a"\ + \a'"a'"\ we end with Measure {A- a\) ■ ■■{A — an) Select 0 (1.2.11) = |oiOi| + ■ ■ ■ + |anan| = 2_Ja'a'l ' a' (1.2.12) Just as there are 2 = 2! equivalent sums in (1.2.10), permutations give 3! = 6 equivalent forms in (1.2.11) and n! ones in (1.2.12). The measurement that accepts all possible outcomes without distinction is symbolized by 1. So the sum in (1.2.12) must be equal to the unit symbol, J2\a'a'\ = 1 . (1.2.13) a' which states the completeness of the measurement symbols a'o'l.
34 1. Measurement Algebra Before continuing, notice the simple properties of 0 with respect to addition. Given the option of accepting either something or nothing one ends up with something: \a'a'\+0 = 0+\a'a'\ = \a'a'\ , 1+0=0+1=1, 0+0=0. (1.2.14) Is it consistent with the known properties of the |a'a'| that /J|a'o'| acts like 1, the unit symbol? If so, we must have o' (\ ai i it \ i a m 1////1 y Ja a I I \a a | = \a a | . J (1.2.15) This is indeed true if a product with a sum is the sum of the products (distributive law of multiplication): (Ela'a'l) |a"a"| = £|a'a'||a"a"| \ a' / a' = \a"a"\\a"a"\+ V \a'a'\\a"a"\ i ii i z_/ i ii i = \a"a"\ +0 + --- + 0= \a"a"\. (1.2.16) We therefore accept the distributive law of multiplication. The notation |a'a'| is an invitation to generalization: |o'o"|, a' ^ a". What can this mean? Return to the m.m. example and consider arrangement 2) of (1.1.5): I 1 ' + selector I selector As it stands it stops everything. But suppose in the region between the + selector and the — selector we reversed the direction of the m.m.? A homogeneous magnetic field can do this; a dipole precesses around the direction of the field: B It is only necessary to control the time in the field. The outcome is a selective measurement in which only +m.m.'s enter, and only —m.m.'s leave: 1-)— .
1.2 Measurement symbols 35 For successive measurements of this general type we have (1.2.17) and \a'a"\\a'"a'v\ = 0 if a" / a'" . (1.2.18) Again, we have the proper behavior for the unit symbol 1, (\ A i / /1 1 i a in i \ A i i 111 a n y\aa I I |o a | = > |o o ||o a a' / a' = |a"a"||a"a'"|+0 + --- = |a"a'"| . (1.2.19) Now notice this: \n'n"\\n"n'\ ~ \n'n'\ (1.2.20) The products on the left side differ only in the order of multiplication; the right sides are different if a' / a". The order of multiplication can be significant! This evolving algebra is non-commutative for multiplication. And, as we could have noticed before, it is not a division algebra, a'^a": \a'a'\\a"a"\ = 0 (1.2.21) does not imply that either |o'o'| or |o"o"| is 0; similarly a'^a": \a'a"\\a'a"\ = 0 (1.2.22) does not imply |o'o"| = 0. The commutator [X,Y]=XY-YX (1.2.23) of two measurement symbols X and Y vanishes if the order of their multiplication does not matter; if it does, one has [X, Y] / 0. A related quantity is the anticommutator {X, Y}, defined by {X,Y}=XY + YX. (1.2.24) It equals 2XY or 2YX if [X, Y] = 0; otherwise one can regard \{X, Y} as a symmetrized product of X and Y. Note the identity XY =\{X,Y}+\[X,Y], (1.2.25) an immediate consequence of these definitions.
36 1. Measurement Algebra The outcome of a measurement is a number. We must have numbers as well as abstract symbols of measurement in our algebra. The obvious definitions of the basic numbers one and zero are 1 la'a" = la'a" 0 \a'a"\ = 0. (1.2.26) number number symbol They are convenient in synthesizing the products (\a'a'v\ = l\a'a'v\ if a" = a'" 0 = 0\a'a'v\ if a" /a'" !/ // \\ / , = o(a ,a )\a a where S(a",a'") = 1 if a" = a'" , 0 if a" / a'" , (1.2.27) (1.2.28) is Kronecker's* delta symbol. Notice something else. What meaning shall we give to \a'a"\ + \a'a"\ ? Well, if we accept the distributive law generally, this is l\a'a"\ + l\a'a"\ = (l + l)|o'o'' And what is one plus one? Two, naturally! l\a'a"\ + l\a'a"\ = 2\a'a"\ . (1.2.29) (1.2.30) 1.3 State vectors Let's think a little more about the meaning of |o'o"|. Only an atom having the value a' of property A, an a' atom for short, can enter (left to right reading, indicated by L -> R where necessary) and what leaves is an a" atom. It is as though the entering a' atom is destroyed and in its place an a" atom is created. This is a mental two-step process that is indistinguishable from the actual one. We symbolize the composite viewpoint by introducing little brackets 'Leopold Kronecker (1823-1891)
1.3 State vectors 37 L -> R \a'a"\ = \a')(a"\ . a" created a in a" out a' destroyed (1.3.1) So far this is innocuous. But we take a giant step forward by viewing this as the product of two symbols of a new type! But is it compatible with the known algebraic properties of the symbol \a'a"\ ? We must have I l\ I //H ///\ I lv\ :( II lll\\ l\l lv\ \a ) {a \\a ) {a | = o(a ,a )\a ){a | % ^ ' = (a la ) (1.3.2) where we simplify the notation by writing (a"\a'") rather than (a"\ \a'") here and in all subsequent products of this kind. Observe that (1.3.2) is satisfied if (a"\a'"} = S(a",a'") . Note the consistent physical meaning: (1.3.3) [l -> r] (a" I a'") a" = a'" : Yes, represented by number 1, a" / a"' : No, represented by number 0, destroy a'" atom create a" atom (1.3.4) if this creation-annihilation act is considered in isolation, so that the destruction of an a'" atom, where only an a" atom, a" / a'", is available, is not possible. So now we have n symbols (a'| and n symbols \a') such that the product ( | ) of those with the same label equals one; the product of those with different labels equals zero. This has the ring of familiarity, most obviously in the three unit vectors i, j, k of a spatial coordinate system, or more systematically, e^, k = 1,2,3. Indeed, ek -ei = f 1 if k = l\ {OiikjLlj 6{k,l) =5ki (1.3.5) characterizes unit orthogonal vectors. This statement about orthonormality is supplemented by the completeness relation
38 1. Measurement Algebra Y^ekek (1.3.6) a sum of dyadic products. Accordingly, we shall speak of these symbols (a'| and |a') as vectors, n component vectors. But notice that the numerically-valued product (a'\a")=S{a',a") (1.3.7) involves two distinct types of vectors; certainly (a'|, symbolizing a creation act, cannot be equated to |a'), representing an act of destruction (reading L -> R). The kind of geometrical space in which these vectors lie must be somewhat more general than a Euclidean* space. We shall speak of (a'| as a left-vector: and la') as a right-vector:, from the respective positions in the numerical product. [Dime* calls (a'|a") a bracket, and the vectors: bra and ket, respectively.] 1.4 Successive measurements. Probabilities The measurement of the magnetic moment in the z direction, or of property A, is only one of a myriad of possible measurements, of properties B,C, This is most obvious in the m.m. example through the possibility of measuring the m.m. in any other direction: ,ti // In the rotated apparatus atoms are also deflected up or down, in the direction set by the rotated magnetic field. Considered in itself, this experiment is indistinguishable from the original one. But what happens when a beam from one apparatus is sent through the other one, as suggest by V There are two situations where we already know the answer: When the angle 6 is 0° or 180°. *Euclid of Alexandria (fl. B.C. 300) fPaul Adrien Maurice Dirac (1902 4984)
1.4 Successive measurements. Probabilities 39 0° 9 = 180° The atoms in the approaching beam are all +m.m. The transmitted beam is entirely bent up. The atoms in the approaching beam are --m.m. The transmitted beam is entirely bent down. Then, what about 9 = 90° It helps to think of the first apparatus as being gradually rotated from 9 = 0° to 9 = 180°. As one does so, the initial situation of the transmitted beam, entirely up, no atoms in the down beam, must change, with fewer up atoms and more down atoms until one comes to 180°, where there are only down atoms. It is then clear that, with 9 = 90°, half way between the limits of 0° and 180°, half of the atoms will be in the up beam (+m.m.'s) and half will be in the down beam (—m.m.'s). But what does an individual atom do? It doesn't split in half! This atom is deflected up and that atom is deflected down. We have no way of predicting or controlling what an individual atom will do; we can only be sure of what will happen, on the average, to very many atoms. Speaking of averages let's list the average m.m. in fi units, for the transmitted beams of the three arrangements: = 0° all +m.m. Average +1 9 = 90° 50% +m.m., 50% -m.m. 9 = 180° all - m.m. Can we come up with a reasonable result for any value of 91 The initially measured m.m. along the 9 direction can be decomposed into the component along the z direction, and the perpendicular component:
40 1. Measurement Algebra /F + cos8 ¥■ It is natural to assume that the average m.m. measured in the z direction is just the z projection of its known value in the 9 direction; that is, cos 9 which correctly reproduces the values for 9 = 0°,90°, 180°. The average m.m. is the weighted average of the two possible outcomes, +1,-1, the weights being the fraction of a large number of atoms in the up, or down, beam. When used in this sense (sufficiently large number of atoms), we speak of the fraction as the probabilities for the respective outcomes. Thus, for the initial selection of a +m.m. in the 9 direction, [L -+ R] and SO R] cos0 = (+l)p(+,+) + (-l)p(+,-) l=P( + ,+)+P(+,-) : P( + ,+): l + cos e 2 1 — cos 9 : COS2 (10) . :sin2(i0) . The last versions use the trigonometric identities cos# = cos (\0 + \6) = cos2(i#) - sin2(i( :2cos2(i#)- §0) =cos2(i< 1 l"2sin2(i6>) . Note v.2 -, that the two probabilities in (1.4.3) are really one: (1.4.1) (1.4.2) (1.4.3) (1.4.4) the probabilities of a — outcome is the same as the probability of a + outcome for the angle n — 9,
the figure 1.5 Probability amplitudes. Interference 41 cos (i(,r-0))=sin2(i0) ,2/1 (1.4.5) «*(§(*-0)) sin(|#) 1 reminds us of its geometrical significance. With this in mind, we can immediately write down the probabilities for an initial choice of — m.m.: [l -> R] p(~,+) = cos2(i(7r-^))=sin2(^); p(-,-) = sin2(i(,r-0)) =cos2(^) (1.4.6) The table P(,) + + cos2(^) sin2(|6>) sin2(i6>) cos2(i6>) displays the four probabilities. 1.5 Probability amplitudes. Interference Now, more generally, we first measure some property A and select the particular outcome a'; (in L -> R reading) we symbolize that by the creation of an a' atom: (a'|. Then we make an as yet unspecified type of B measurement and symbolize it by M(B), so that we now have (a'\M(B). The final step is the annihilation (detection) of the a' atom, which produces the number [L p(a',M(B)) = (o'|M(B)|o'>. (1.5.1)
42 1. Measurement Algebra We consider three types of M{B): 1. The B measurement that selects only &': M(B) = \b')(b'\ , [p(a',\b'b'\) =] p{a\b') = (a'\b')(b'\a') . (1.5.2) 2. The B measurement that selects either b' or b", b" ^ b': M(B) = \b')(b'\ + \b"}(b"\, p(a',b' or b") = (a'\b')(b'\a'} + (a'\b")(b"\a'} = p(a',b')+p(a',b"). (1.5.3) 3. The B measurement that selects all b' without bias: M(B) = 5>'><6'|=1, b' p(oM) = J2(a'\b')(b'\a') = (a'\l\a') b' = ^p(o',6') = l- (1-5-4) b' Here are the properties that qualify p(a',b') to be the probability that a B measurement performed on an a' atom will have the specific outcome b'■ For, it should be true that the less specific measurement that selects either b' or b", that has both b' and b" atoms in the transmitted beam, has an outcome with the greater probability: p{a!, b') +p{a', &"); and that the outcome of the least specific measurement has the greatest probability: >Jp(a',&') = 1. b' We can also verify the probability formula in one simple situation. That is when B is just A, as in the two m.m.'s with 9 — 0; or is directly related to A, as in the two m.m.'s with 9 = 180°. So suppose that b' = a". Then, surely, p(a',a") = {j jf£ J°« }=*(«>") (1-5-5) and indeed (a'\a")(a"\a') = {S(a',a")}2 = 5{a',a") . (1.5.6) As we see in the results, and in cos2(|#), a probability is a number in the range between zero and one. What kind of numbers are the (a'\b') so that (a'\b')(b'\a') does lie in this range and is not a negative or a complex number, for example?
1.5 Probability amplitudes. Interference 43 There are two possibilities: 1. (a'\b') is a real number and (b'\a') = (o'|b'). Then p{a',b')=[(a'\b')]2 >0 (1.5.7) and automatically p(a',b') < 1, since the sum of all the non-negative probabilities is equal to one. The objection comes from (f/lc/HW) 1 ' (1.5.8) which seems to say that left and right vectors are the same, which they actually are not, physically. Indeed we shall see explicitly that a scheme with only real numbers, in which there is no number with a square equal to —1, does not work. 2. (a'|6') is a complex number and (b'\a') its conjugate, (b'\a') = (a'|&')*. Then p(a',&')= |(a'|&')|2>0. (1.5.9) Now left and right vectors are not the same, but are interconverted by a process involving complex conjugation (which we shall expand upon later). We accept complex numbers, because the doubling produced by the existence of two kinds of vectors fits perfectly with the doubling provided by the complex numbers. The construction of probabilities as absolute squares supplies a name for the complex numbers (a'\b'): probability amplitude. This is a purely functional description, that real, positive probabilities are derived as absolute squares of complex valued probability amplitudes. However, this relationship is also evocative of analogies, which we proceed to develop. Consider a sequence of measurements on three (in general different) physical properties : A, B, C. First, perform an A measurement and select the outcome a'; that is, we create an a' atom: (a'\ (l -> R reading). Then perform an as yet unspecified B measurement, symbolized by M(B). Finally we select the outcome c' of the C measurement, symbolized by the annihilation of a c' atom. The number produced by this sequence is (a'lM(-B)lc'), which is a probability amplitude from which we derive the probability for the success of this compound measurement: [l-»R] p(o',M(B)1c') = \(a'\M{B)\c'}\2 . (1.5.10) We consider three examples.
44 1. Measurement Algebra 1. M(B) = |&')(&'|, a selective measurement: [p(o', |6'6'|,c') =] p{a',b',c')= \(a'\b')(b'\c')\' = p(a',b')p(b',c'). (1.5.11) This is as it should be. The beam of a' atoms is subjected to a selective measurement that leaves the fraction of atoms p(a', &'). Then these atoms are subjected to another selective measurement, of probability p(b',c'), that gives the net fraction, or probability, as the product: p(a', b')p{b', c'). 2. M(B) = Y^bi \b')(b'\ = 1, the measurement that accepts everything without discrimination; no measurement. For (o'|l|c') we have, equivalently, <a'|c'> = E<a'l&'><&'lc'>> (1.5.12) b' and p(a, 1,c') = p(a,c) , (1.5.13) as it should. Incidentally, (1.5.12) is the counterpart of a familiar three- dimensional vector relation: A-C = YJAkCk = Y.A-ekek-C ^ (L5-14) k k the evaluation of a scalar product in terms of components relative to some coordinate frame. Note the appearance of the completeness relation (1.3.6). 3. The non-selective B measurement; the B measuring apparatus functions but no selection of b' atoms is performed. In the m.m. example, this means that the up and down beams are physically separated, but then the two beams are run along together to the next stage. The result, probability p(a', b, c'), is the sum for each independent choice of &', which have been physically distinguished by the experiment: p(a', b, c') = J2P(o\ b')p(b',c') . (1.5.15) Neither the B measurement that accepts everything, nor the non-selective B measurement, involves a rejection of atoms at the intermediate stage. So the total fraction, the total probability, for any outcome equals one, for both:
1.5 Probability amplitudes. Interference 45 5>(a',l,c') = 5>(a')C') = l, c' c' 5>(a\6,c') = 5>(a',&') £>(&', c') = ! ■ (L5-16) = 1 = 1 Nevertheless, in general p(a', 1, c') ^ p(a', b, c'). As an example consider A,B,C, the m.m. in three directions: A zf -»B C All probabilities for the angle 90° equal |. So (l -> R reading) a 6 6 c , a b be p{+,b, +) = P{+, +)p{+, +) + p(+,-)p(-, +) = \\ + \\ = | , p(+,b,-) =p(+,+)p(+,-) +P(+,-)p(-,-) = H + H = I ■ (!-5-17) However p(+,l,+)=p(+,+) = 0 (1.5.18) since a m.m. that is + in the z direction is certainly not + in the —z direction. And, p(+,l,-)=p(+,-) = l (1.5.19) since — in the — z direction is the same as + along +z. These probabilities also add up to one. To see how this difference must come about, let's look in detail at p(+,l,±): [l -> r] p(+,l,+) = a c p(+, 1,-) = a , b . b . c . a . b . . b , c . (+1+)(+1+) + (+|-)(-|+) a . b . . b , c . . a . b . . b . c . (+1+)(+1-) + <+-><--> (1.5.20) Note that the absolute square of each individual term is \\ = |, and keeping only the absolute squares gives the non-selective measurement. Accordingly, each product of probability amplitudes is | in magnitude. So it must be (we show this explicity later) that in p(+, 1, —) the two terms add up:
46 1. Measurement Algebra p(+,l,-)=|l + I|2 = l, (1.5.21) whereas in p(+, 1, +) they subtract: P(+,1,+)=|!-||2 = 0. (1.5.22) In short, we have situations of constructive or destructive interference, typical of wave phenomena. On the basis of this analogy, one speaks alternatively of a probability amplitude as a wave function. 1.6 "Measurement disturbs the system" Language aside, there is a very important lesson in the fact that p(a!, l,c'), in which there is no actual B measurement: p(a',l,c') = J2(a'\b')(b'\c') (1.6.1) is generally different from p(a', b, c'), in which the B measurement takes place, but all atoms are retained: p{a',b,c') = Y.\W)(h'V')\2 ■ (L6-2) That fact is usually expressed as "measurement disturbs the system". This is very different from the situation in classical physics where it is assumed that in principle, as an idealized limit, measurement does not disturb the system. Consider the familiar problem of measuring the electric field at a point, to which one responds by placing a test charge at that point (itself an idealization) and measuring the force on it. To the objection that the presence of the test charge changes the field being measured, one usually says: "yes, but I can make the magnitude of the test charge arbitrarily small." All very well, until one reaches the atom and the electron within it and discovers that (current speculations aside) there are no smaller charges. In short the atomicity of matter - and of physical properties associated with matter - sets a fundamental limit to the basic idealization that is implicit in classical physics. And that's what quantum mechanics is all about. One gets a quantitative version of "measurement disturbs the system" by asking for a symbol, M&, of the non-selective B measurement, so that J2 |(«>')(&'lc')|2 = \(a'\Mb\c'}\2 (1-6.3) in some sense. It helps to go back to p(a', 1, c') and write it out as
1.6 "Measurement disturbs the system" 47 p(a',l,c') = YtWW EW'X&'V) V b" 12 = X)Ko'l6'X&'lc'>l + E («'l&')*(&'lc')*(«>")(&"lc') ■ ^^ =p(a',6,c') (1.6.4) As we have already seen in an example, p(a', b, c') is derived from p(a', 1, c') by keeping only the absolute squares, by omitting the cross products, b' ^ b"■ We can convey this by choosing Mb = Y}b')Mh'\b'\, (1.6.5) b' where the f(b') are real numbers, phase angles, about which more in a moment. The effect of this in the above is given by the substitutions: (a'\b")(b"\c') —► (a'\b")e[^b"\b"\c') , (a'|&')*(&'|c')* —► (a'\b'}*e-{rtb'\b'\c')* . (1.6.6) So there is no change in the terms with b" = &', e-iv(b')eMb') = \eMb')\2 = iy (1.6.7) whereas the terms with b" ^ b' are multiplied by p-i<p(b') MV) (1.6.5 We succeed in removing these cross products if each <p(b') is not a definite angle, but rather is a randomly distributed quantity. Then a particular value p, and p + 7T, are just as likely to occur and the average is zero (e17r = -1). It is this randomness of the phases that conveys the uncontrollable nature of the disturbance produced by a measurement. That is important because at a higher level of classical measurement, disturbances produced by a measurement can be admitted if they are known and therefore could be compensated for. The combination J2\l/')Mb"\b"\ (1.6.9) b" has a wider range of applicability if one allows some of the <p(b") to assume imaginary values, specifically positive infinite imaginary values, ioo, where ei(ioo) = e-oo = 0 _ (1 g 1Q)
48 1. Measurement Algebra If one does this for all b" ^ b', then all those terms disappear and we are left with [ip(b') is real] M = Mh">\b'b'\ (1.6.11) where the remaining phase factor can be ignored in the context of the probability |(o'|M|c')| . We have returned to the measurement that selects only b' atoms, all others being discarded, the stopping of those beams now being represented as a kind of complete absorption (e—°°). The extension of this viewpoint is immediate. If infinite imaginary phases apply for all but b' and b" 7^ &', and these two real phases are locked together: <p(b') = <p(b"), we effectively produced the measurement Ib'b' I + |b"b" |. And finally, if all <p(b') are real, and locked together as a common phase, we arrive at the measurement that accepts everything, M(B) = 1. 1.7 Observables The symbolic representation of measurements and the use of probabilities is a far cry from classical physics where theory is formulated in terms of the values of physical properties. But the latter must be latent in the quantum description. Can we find a symbol that is associated with a physical property as a whole? A natural place to start is the average value, or expectation value, of a physical property B, produced by a measurement on a' atoms. This is [L H- R] <B> -' = £K££) b' = <°'l (Z»'<*>1 ) 1«') ' (1-7.1) V ^{a'\b'){b'\a'yb' 7 which exhibits a clear separation between what is measured, B = J2\b')b'(b'l (1.7.2) b' and information about the atoms (a' 1,1 a'), namely {B)a, = (a'\B\a'). (1.7.3) We test the suitability of B as the symbol of property B. First, consider (b'\B = Y, (b'\b") b"(b"\ = b'(b'\ (1.7.4)
1.7 Observables 49 inasmuch as all terms of the b" summation vanish, except b" = b'. The result can be read (l -> R) as: create a b' atom and measure property B; the outcome is the number b'. An alternative version is B\b") - 5»' (b'\b") = \b")b" . = S(b',b") (1.7.5) We might also present these results in another way. Consider B — b'l where any particular value b' multiplies the unit symbol: Now, we get B - b'l = 5»"(&"| - b' $>"}<&" | b" b" = $>">(&" -b')(b"\. b" (b'\{B-b')=0, {B-b')\b'}=0 (1.7.6) (1.7.7) since b" — b' = 0 for b" = b'. We have also suppressed the unit symbol, its implicit presence being clear in this context. There is something rather basic we expect of symbol B. If the possible outcomes of measuring B are &i, &2, ■ ■ ■ ,bn, then the outcomes of measuring B2 must be (&i)2, (fe)2, ■ ■ ■ , (&n)2> and similarly for higher powers. Well, B2 = B Y, \b')b'(b' | = 5>')(&')2(&'| b' b' and so on, just as it should. In a related way, consider B^b1=J2\b')(b'^b1)(b'\ and then (b - &,)(£- fc) = EIW - W - &2)(&'l b' terminating with n (B-fci)---(B-6n)=n(B-6*) (1.7.S (1.7.9) (1.7.10) = 5» n(&'-&*) fe=i (&'|=0 (1.7.11) since any b' is one of the bk's. So B obeys an algebraic equation of degree n. Accordingly, any function of B expressed by a power series is no more than a linear combination, with numerical multiples, of 1, B, B2,... Bn_1. As such
50 1. Measurement Algebra f(B) =. J2 f(b')\b'b'\ = $»/W| ■ (1-7.12) b' b' This can be taken as the general definition of the function f(B) of the measurement symbol B, or of the observable B; any measurement of the physical property represented by B, measures all functions of B. All that is required of the numerical function /(&') is that it has a well defined value for each potential measurement result b' ■ As an illustration of a function of B expressed by a power series of degree n — 1, consider w)= n (f^) (!-7-13) b"&b') y ' a product of n — 1 linear factors. If B happens to have the value &', this function equals one; if B has a value different from &', the function equals zero, whence the delta symbol designation. And what is this function in terms of measurement symbols? It is 6(B,b') = Y^\b"W>b')(b"\ = \b'b'\ > (L7-14) b" the symbol of the b' selective measurement. To go along with this, notice that multiplying 5(B,b') by the factor B — b' produces the polynomial of degree n that is zero. So 0 = (B-b')S(B,b') = (B-b')\b')(b'\ =0, (1.7.15) as it should. Incidentally, the unit symbol 1 is also a function of B, one that has the value 1 no matter what the value of B: (&'|1 = (&'| = l(6'| . (1.7.16) We can exhibit the unit symbol as a power series in B by using the power series construction of the \b'b'\: ' = £Il(f5£)- <"■"> b' b"&b') y J Indeed for any value of B, the right side equals 1+0 + 1-0 = 1. 1.8 Algebra of Pauli's operators In seeking to illustrate this symbolic representation of physical quantities we naturally turn to Stern-Gerlach experiments and \xzj'\x which has the possible
1.8 Algebra of Pauli's operators 51 values +1, —1. Following Pauli* (1927) we designate this physical quantity as az, where a'z = +1,-1. Then (<7z-1)(<7*+1)=<7*-1=0 (1.8.1) and (a'z and 1 are omitted, the sign suffices) T"~r — ct,-1 l-o-2 1 ' -1-1 2 Indeed, (1.8.2) |++| + |—1 = 1, |++|-| \=az. (1.8.3) We also note that, from of = 1, we get J7* ) =-(l + l+2o2) = —-^- = |++|, i^y=i(l + l-2o2) = ^ = | —|, (1.8.4) and Ml-I = ^^ = ^ = 0. d«) The totality of measurement symbols \a'za"\,a'z = ±1, o" = ±1, is 4 = 22. We have displayed I++I and I 1, or, equivalently, 1 and az. Now, in three- dimensional space there must also be a ax and a ay. And since all directions in space are equivalent, the outcome of measuring ax and ay must also be just ±1, so that we must have 02 = 1, 02 = 1 . (1.8.6) Can we exhibit such physical properties in terms of | + —| and |—|-|? Let's write out their multiplication properties: |+-||+-|=0, |-+||-+|=0, |+~||-+| = |++| , 1- + 11+-1 = 1 — I- (I-8-7) Since the squares of \-\—| and |—h| are zero, we must have both present if we are to form an object of unit square. Try 'Wolfgang Pauli (1900-1958)
52 1. Measurement Algebra -,2 -+I + I+-IJ = —H —h + H— —H + —^ H— M" H— H— = ' (1-8-8) = o =1++1 =1 I =° so we define ** = |-+| + |+-| • (1-8-9) The only combination left is the difference: -+1-1+-11 =0-1++1-1 —1+0=-1 ! (1.8.10) If space where two-dimensional, we could settle for ax and az. But in three- dimensional space there is no way to get that last square equal to +1, except by introducing i2 = — 1. Real numbers alone won't work. So az and ax are joined by ay =i|- + | ~i|+-| ■ (1-8.11) Note the converse constructions, \+-\ = \(?x+iay), \- + \ = \(ox-iay) (1.8.12) for which we get 0=[|+-|] = \(ax+iayf = \[l-l+i{axay+ayax)] , (1.8.13) or axay +ayax = 0; ayax = -axay . (1.8.14) Then we look at \-\—11—h| = |++|, or \ (ax +iay)(ax -iffy) = \{l + az) , (1.8.15) > v ' = 1 + 1 - 2\OX0y which gives axay = \az , (1.8.16) and, in conjunction with (1.8.14), we get crxcrz + azax = 0, ayaz + azay = 0 . (1.8.17) Also (by moving the left-hand factor twice):
1.9 Adjoint symbols, Hermitian symbols 53 oxayaz = i = ayazax = azaxay (1.8.18) which shows that the indices can be cyclically permuted. So axay = \az is joined by ayaz = iax, azax = iay . (1.8.19) This direct conversion of the algebra of the four |±±| into that of the four l,ax,ay,az is a bit clumsy, but the algebraic properties of Pauli's vector operator cr are not. One way to express them compactly considers two numerical vectors, say a and b. Then, a ■ a a ■ b = (axax + ayay + crzaz)(axbx + ayby + azbz) (1.8./U) = axbx + ayby + azbz + iax(aybz - azby) H , or cracrb = ab + icraxb. (1.8.21) This unifies all the multiplication properties. In particular, let a = b = n, an arbitrarily directed unit vector. Then (cr-n)2 = l, (1.8.22) which shows that a measurement of the component of a in any direction will have the outcomes +1 and —1. The apparent contradiction between discrete results of measurements and the possibility of continuous change in the direction of measurement is resolved by the non-commutativity of the symbols of different components. 1.9 Adjoint symbols, Hermitian symbols Now that we have a few more objects to work with, let's develop some more machinery. First, notice that the symbolization of successive acts of measurement by multiplication of corresponding symbols has an element of arbitrariness: we placed them in sequence from left to right (l -> R). But that is a cultural bias - we can equally well order them from right to left. As with the freedom of choice of coordinate systems, the validity of a physical statement cannot depend on which conventions we adopt. So a correct relation expressed in one convention must also be a correct relation when we systematically switch to the other convention. We raise this to the status of a mathematical operation, called the adjoint, or Hermitian* conjugation, and symbolize it by the dagger: f. Incidentally, here is another reason that (a'\b') cannot be identified with a probability, by itself. On reversing the convention, the meaning of (a'\b') "Charles Hermite (1822-1901)
54 1. Measurement Algebra is taken over by (&'|a'). The physical probability must be unchanged, which (a'\b') is not, but, (a'\b')(b'\a') is. The measurement symbol \a'a"\ has the interpretation of selecting a' atoms and producing a" atoms, when read from left to right - dextrally. If we read it from right to left - sinistrally - it means selecting a" atoms and producing a' atoms. We express this by \a'a"\* = \a"a'\ , (1.9.1) which gives the new interpretation in the original convention. Read this: the adjoint (or Hermitian conjugate) of \a'a"\ is |o"o'|. [Compare with (PAN)* = NAP, for instance.] Note the particular example ja'a'jt = \a'a'\ ; (1.9.2) any such object is said to be self-adjoint or Hermitian. [Compare with palindromes such as (MOM)t = MOM.] The left and right vectors (o'|, |o') are read dextrally as the creation of an a' atom, the destruction of an a' atom, respectively. In the sinistral convention, (o'l symbolizes the destruction, and la') the creation of the a' atom. So (a'\* = \a'), \a'y = (a'\. (1.9.3) Here, and in general, the repetition of f recovers the original object, as in (X*)*=X. (1.9.4) We illustrate another general rule by combining what we have learned: (\a')(a"\)* = \a")(a'\ = (<o" |)' (|o'>)' (1-9.5) or (XY)f = FfXt ; (1.9.6) the adjoint of a product is the product of the adjoints, in the opposite order. Here's a more elaborate example: (ja'&'Hc'd'iy = \d'c'\\b'a'\ = |c/d'|t|a'6'|t . (1.9.7) We learn something new by rewriting the ingredients of the first equality: \a'b'\\c'd'\ = \a'}(b'\c')(d'\ = (b'\c'}\a'd'\ , \d'c'\\b'a'\ = \d'}(c'\b')(a'\ = (b'\c'y\d'a'\ , (1.9.8) namely,
1.9 Adjoint symbols, Hermitian symbols 55 (<&'|c'>|a'rf'|)t = <fe'|c'>*|a'rf'|t ; (1.9.9) in taking adjoints, numbers are replaced by their complex conjugates, (AX)f = A*Xf , (1.9.10) where A is any arbitrary complex number. We must also remark that f reverses sequence, as expressed by multiplication; it has no effect upon addition: (X + Y)f = X*+Y* . (1.9.11) In consequence the unit symbol is Hermitian, if = (j2\a'a'\) =Ela'a'lt = Ela'a'l= 1- (L9-12) V a' / a' a' Indeed, the symbol of any physical property, B = J2\b')b'(b'\ with b" = bf, (1.9.13) b' is Hermitian: fit = £<&'|t&'*|&')t = 5»'(&'| = B . (1.9.14) b' b' Let's try this out on 1 = |++| + | 1 , ®z = I i r| — | | j Ox = |—H + JH 1 , ay = i|-+| -i|+-| ■ (1.9.15) Now, 1 and az are examples of what we have just shown; they are Hermitian. ax and rjy, however, are not presented in the same form. But surely the adjoint operation is not going to distinguish one direction in space from the other? Fear not: 4 = |-+|f + l+-|f = |+-| + |-+l = °* > at - _j|—(_|t +i|_|—|t _ _i|_)—| +j|—h| _ Gy _ (1.9.16) Let's also notice how the algebra of the ct's behaves under Hermitian conjugation. The adjoint of (a, b are real) cracrb = ab + icraxb (1.9.17) is cr-bcr-a = a-b — icr-axb = ba + icrbxa: (1.9.18) it is the same statement.
56 1. Measurement Algebra 1.10 Matrix representations The Pauli operators ax and ay are examples of writing one physical quantity, B, in terms of the measurement symbols of another property, A. We do this generally by exploiting the structure of the unit symbol 1: B = IB1 = X>'><0'IBZ>"X°"I a' a" = Y,{a'\B\a")\a'a"\. (1.10.1) a',a" This exhibits B as a linear combination of the n2 measurement symbols \a'a"\ as specified by the values of the n x n = n2 numbers (o'|jB|o"). These numbers can be displayed in an n x n square array, or matrix: a" -> column ai /(oi|jB|oi) ••• (oi|jB|o„)\ row V<o„|B|oi>"- (an\B\an)J (1.10.2) For example +- ax: + (jj).^=(0i~o).^:(j_l)' 1:(oi) ' (L1°-3) which are known as the Pauli matrices. The physical property B is determined by its matrix, the a matrix of B. So all algebraic operations involving B, and other physical properties, can be expressed in terms of their matrices. Here are the basic algebraic operations: addition of B and C: (a'\ (B + C) \a") = (a'\B\a") + (a'\C\a") ; (1.10.4) multiplication of B and C: (a'\BC\a'") = (a'\BlC\a'") = Y,{a'\B\a"){a"\C\a'") ; (1.10.5) a" multiplication of B by a number: (a'\XB\a") = X(a'\B\a") . (1.10.6)
1.11 Traces 57 Notice that the notion of adjoint does not enter here, and B,C can be replaced by not necessarily Hermitian X, Y, where, say, (the change of labeling is for convenience) X = ^2(a"\X\a')\a"a'\ ■ (1.10.7) a',a" Then X* = J2 (a"\X\a')*\a'a"\ , (1.10.8) a',a" which is to say (a'\X*\a") = (a"\X\a'y ; (1.10.9) the matrix of an adjoint quantity is the complex conjugate, transposed (row <-> column) matrix of the quantity. If X is the Hermitian B, its matrix is restricted by (a'\B\a") = {a"\B\a')* ; (1.10.10) one can check this for the Pauli matrices. Notice that complex conjugation is the adjoint for numbers; that is, one evaluates it by taking the adjoint of its multiplicative components, as in {a'\b'Y = \by{a\^ = {b'\a') (1.10.11) and (a"\X\a'Y = \a'yx*(a"\t = (a'\X*\a") . (1.10.12) 1.11 Traces Here's another useful concept in which one reverses the order of things. The measurement symbol \a'a"\ is a product of vectors |o')(o"|. If one reverses the multiplication order of the vectors, one gets anumber: (o"|o') = 6(a',a"). This operations is called the trace [German: Spur]. So tr{\a'a"\} =S(a',a") . (1.11.1) Equally well one should have tr { \b'b"\} =5{b',b"). (1.11.2)
58 1. Measurement Algebra But is this consistent with the linear relation between the two types of symbols, namely \b'b"\= ^|a')(a'|&')(&"|a")(a"| a' ,a" = 5](«>')(&"K>K«"|? (!-U-3) a' ,a" Mindful that the trace has no effect upon addition or on numerical multipliers, we answer this by a symmetric treatment of the first line: tr {\b'b"\} = ^(&"|a")(a"|a')(a'|&') a',a" = (b"\b')=S(b',b") . (1.11.4) The trace has a simple meaning in terms of matrix elements: X= J2(a'\X\a")\a'a"\ (1.11.5) a',a" gives tr{X} = J2 (a'\X\a")S{a',a") = ^(a'|X|a') , (1.11.6) a',a" a' the so-called diagonal sum of the matrix. A glance at the Pauli matrices (1.10.3) shows a vanishing trace for each component: tr{cr} = 0. (1.11.7) The Pauli matrix for the unit symbol 1 illustrates the general form (a'\l\a") = 6{a',a") (1.11.8) from which we get tr{l} = n, (1,11.9) the number of different values assumed by property A, or B, or ... . One can reverse the trace procedure so that any number involving vectors can be presented as a trace. Thus (a'\X\a")=tr{X\a"a'\} =tr{\a"a'\X} . (1.11.10) If one multiplies these equivalent forms by (a"|F|a') and sums over a', a" it becomes tr {XY} = tr {YX} ; (1.11.11)
1.12 Unitary geometry 59 the trace of a product of such symbols is independent of the multiplication order. As an example of that symmetry take the trace of a ■ a cr ■ b = a ■ b + \cr ■ a x b, tr{cr-acr-b} = 2a-b, (1.11.12) which is indeed symmetrical in a and b. The trace version of a matrix element can be applied to (B)a, =(a'\B\a') = ti{\a'a'\B} (1.11.13) in which B can be replaced by any f(B). Consider in particular the function S(B,b') = \b'b'\- As a quantity that is one, if B is b', and zero otherwise, (5(B,b'))a, =p(a',b') = tr{\a!a'\\b'b'\} . (1.11.14) Of course, we check directly that this is (o'|&')(6'|o'). To see the advantages of the trace formula recall that ^ = +l:|++|=—^— ; ^=-l:| —1 = _— (1.11.15) or \a'za'z\ = \(l+a'zaz). (1.11.16) There is nothing special about the z direction. We apply the same formula to properties A and B, which are cr ■ n\ and cr ■ n2, respectively, where n\ ,2 are unit vectors in two different directions. So ,, f l+g^cr-ni l + a!2cr-n2\ p(a1,a2)=trt[ j = |tr {1 + a[a ■ n\ + a'2a ■ n2 + o'xa'2a ■ n\a ■ n2\ = 1(1 + a[a!2m ■ n2) = | (1+(7^ cos6>) , (1.11.17) where 8 is the angle between the two directions. Hence, 1 + cos 9 .->,, - = CO" 'L' 1 — cos ( p(+,+) =P(- -) = -^- = cos2(i0) , p(+, -) = P(-, +) = J = Sin (^) ' (1.11.18) as we had already surmised. 1.12 Unitary geometry 1.12.1 Column and row vectors, wave functions It is instructive to look at the az matrix of l ,/i l+<r'„cr-n (1.12.1)
60 1. Measurement Algebra where we use spherical coordinates to parameterize unit vector n: z a ■ n = ax sin 1? cos ip +ay sin 1? sin </3 +az cost? . Then 0^0(J has the matrix (1.12.2) <\< (a'z\cr • n\a") = cos 1? sin 1? e lLP sin 1? e1^ — cos 1? (1.12.3) So (a>\a>n = +1>« = +1|^'> = (^|i±^|ff») COS2(|'l?) cos(|i?)sin(ii?) e -1(,9 cos(±tf)sin(|tf) elV ndicat af a co cos(itf) (1.12.4) which must factor, as indicated by (0^0^ = +l)(a'n = +l|a"). Indeed, this matrix is the product of a column and a row: sin(|tf) e1^ (cos(itf) , sin(§#) e_1V) (1.12.5) or, alternatively, multiplying the column by e 2^ and the row by e2^, we get « = +%") = (cos(i^) e^ , sin(^) e"^) • (1-12-6) As required by (0^0^ = +1)* = (a'n = +l|o^), the elements of the row are the complex conjugates of the column elements. The analogous results for a'n = — 1 are \Oz\On = -lA°n = -1^/ = (°*| 0 r*/ sin2(Itf) 2„y -sin(|t?)cos(it?)e_iV -sin(ii?)cos(ii?) e^ cos2(|tf) -sin(itf) e"2^\ / ; ; s V2 ; ■ ' (-sin(|i?)e2^ ,cos(|i?)e"2^J cos(|tf) e& (1.12.7)
1.12 Unitary geometry 61 The significance of this is expressed generally by writing some vector, say |&'), as a linear combination of the a vectors: |6') = l|6'> = ^|o'><o'|6'> (1.12.8) a' which exhibits the (a'\b') as components of the vector |&') relative to the orthonormal system of a vectors - the a coordinate system. An alternative notation that accompanies the notion of 'wave function' is (a'\b')=^(a') (1.12.9) so that \b') = J2\a')^b-(a') ■ (1.12.10) a' The adjoint of this is <&'|=X>.(a'r<a'|. (1.12.11) a' To express the product of the two vectors in terms of their components we can multiply directly: (b'\b") = J2^(a'r (a'\a")^(a") a'a" V " ', „ = 8{a',a") = X>'(°')V*»(a')> (1.12.12) or use 1 = >J|a')(a'|: a' (b'\b") =X>'l°'><a'|&"> = 5>fc< (a')*<Ma') • (1.12.13) a' a' Notice that the squared length of the b' vector is (b'\b') = J2 Hv (a') |2 >0. (1.12.14) a' Here, for complex-valued components of vectors, is what replaces the sum of the squares of Euclidean geometry: This kind of geometry is called unitary. The orthonormality of the b' vectors is conveyed by a' b' £ b" : J2 ^' («')>&" («') = ° • (1.12.15)
62 1. Measurement Algebra Here, then, are two wave functions: " ^ sin(^) e^ ) , .,. /-sin(itf)e-^ \ cos (|tf) e^ (1.12.16) Since cos (^) + sin (~d) = 1 it is clear that both of them describe unit vectors of the unitary geometry. And the two vectors are orthogonal: (a'n = +l\a'n = -1> = (cos(itf) e^) (-sin(§tf) e"^) + (sin(|#) e-^) (cos(itf) e^) = 0 . (1.12.17) The two wave functions are really the same wave function for, as before, a'n = —1 is a'n = +1 in the — to direction. According to the side and top views ■K-d —n TC + <£, —n we get —n by the substitution 1?->7T — $, ip -+ tt + <p, and </V' =+i - /cos(f(7r-^e"^^) = . /-sin^e"""^ 1 V cos(|t?) e^ = i^V'=-i • (1.12.18) Notice the factor of i. Is that significant? No. The original identification of V>_i from the product tp-i (a'z)tp^i (ff")* is arbitrary to the extent of a phase factor ela. Any value of a will do. It has no effect upon orthonormality, the statement of the unit symbol, or the physical interpretation in terms of probabilities (see Problem 1-8). The probability amplitude interpretation of a wave function is expressed by P{a'nia'z) = k<K)| (1.12.19) where the results are, of course (■& here is the angle between the z direction and to):
1.12 Unitary geometry 63 p(+,+)=pK~) = cos2(itf); P(+,-)=pK+) = sin2(itf). (1.12.20) 1.12.2 Two arbitrary components of Pauli's vector operator It is interesting to use these wave functions to rederive the probabilities associated with two arbitrary directions n\ and n2 (specified by $i,<£>i and $2,^2) respectively), according to (^KHE^kx^khe^ fcr^M)- (1-12-21) It will suffice to consider (+, 1|+, 2) = cos {\dx) cos (|tf2) ei^1 - ^ + sin(|#i) sin(|tf2) e^t^1 _ ^ . (1.12.22) First note two special situations in which n\ and n2 lie in plane. 1. If both are in the x,y plane, we have fl\ = 1?2 = tt/2 and get <+,l|+,2> = ie2t' + ie = cos(i0) , i „-£6 (1.12.23) which is a real number and gives the right probability. 2. If both are in the x,z plane, we have ipi = ip2 = 0 and get (+,l|+,2) = cos(|i?i)cos(|i?2) + sin(|i?i)sin(|i?2) = cos(i#) , which is a real number and gives the right probability. = 01-02 (1.12.24)
64 1. Measurement Algebra We see again that we might get by with real numbers in two dimensions. But, in verifying that we have the right answer in general, we see that complex numbers are essential. Now 3s(£tfi) cos(itf2) e^1 ~ ^ + sin(itfi) sin(itf2) e"^1 ~ ^ = cos2(ii?i)cos2(|i?2)+sin2(|i?i)sin2(ii?2) + sin(^i) cos(itfi) sin(|i?2) cos(±tf2) (V^1 ~ ^ + e-1^1 _ V2A v v ' = 2cos(i/3] -1/32) = 1(1 + COS1?i)|(l + costf2) + |(1 - COS1?i)|(l - costf2) + |sini?! sini?2cos((/3! - tp-i) = Hl + cosi?i cos$2 +sim?i sini?2cos((/3i — <p-2)] = cos2(i#) . (1.12.25) = COS# There's another way of looking at these matters that is suggested by writing out a ■ n as a ■ n = (ax cos ip + ay sin ip) sin 1? + az cos d . (1.12.26) We begin with ax cos ip + ay sin <p = ax (cos ip + \az sin <p) = (cos </3 - \az sin y>) ct^ . (1.12.27) That directs our attention to cos ip ± \az sin <p = e±i(T^ , (1.12.28) where the use of Euler's* identity involves only the fact that (iaz) = —1, just like i2 = — 1. Another approach starts with the exponentiated function of crz, and its two possible values: p±io-z<fi - 1 + az i<p , l^az -i<p = cos ip ± \oz sin ip . (1.12.29) So now we have ax cosy? + ay simp = ax elip<Tz = e"lip<Tzax , (1.12.30) "Leonhard Euler (1707-1783)
1.12 Unitary geometry 65 where the two forms on the right are connected by the anticommutativity of ax and az. We again apply this property in getting a more symmetrical form: e-~{Va>ax = e"^^ e'^f^ax = e-&a*ax e^ff* , (1.12.31) so that axcostp + aysiinp = e-%V<T*axe2{Pa* (1.12.32) and an= e~~^az (az costf + ax sintf) e^^ (1.12.33) since e-iV'Oi e&a* = az e"^"' e&a* = az . (1.12.34) Notice that we have made free use of exponential properties of the form eiA, A ei\2A = ei(Ai + \2)A ^ (1.12.35) which we justify by eiA, A ei\2A = J2 eiAi«' |o'o'| £ eiA2fl" \a"a"\ a1 a" = Y, ei(Al + A2>a' \a'a'\ = e1^1 + X^A . (1.12.36) a' In a analogous way: <p -+ $, ax -+ az, ay -+ ax, az -+ ay, we have ff2 costf+ 0-^01^= e^^e^^ (1.12.37) and an= e-^^e-^^^e^^e^'^r1^ (1.12.38) where U = e^a» e^a' , U^1 = e"^"" e"^^ (1.12.39) are indeed inverse since UU'1 = e2§av ekv°* e-fra' e-^ = 1 , £/-1 £/ = e~^a" e-^vv e^av e&a* = 1 , (1.12.40) which illustrates the general statement
so that 66 1. Measurement Algebra (XY)"1 =1^1^1 . (1.12.41) Before commenting on this relationship between different physical quantities, let's supplement it by a connection between vectors. Write az = Y^H = o'y{< = o'\ (1.12.42) and get a n = J2 tf_1K = a')a'(az = a'\U a' = £|< = a>'«=a'|, (1.12.43) a' (v'n = v'\ = (v'z = v'\U , \a'n = a') = U^1 \a'z = a') . (1.12.44) The necessary adjoint relation between these left and right vectors tells us that {ia'z\U)X =V^\a'z) , (1.12.45) where the left-hand side can equivalently be presented as [/t(a;|t=[/t|a;>, (1.12.46) so that f/t = £/-i , \j\\j = J7f/t = l . (1.12.47) This one checks directly, first by noting that (>*')'= (coef+i^sinf)' f ■ ■ f = cos \az sin — 2 2 = e~2faz (1.12.48) which, once worked through, is seen to be just i -> —i. Accordingly, f/t = (e^av e&a*y = <r^a* e-^°v = J/"1 , (1.12.49) as required. The same property of U guarantees that, like az,cr-n is Hermitian (f/tt = U):
1.13 Unitary operators 67 (U*azU)* =U*azU . (1.12.50) And notice how the requirement (<r • n)2 = 1 is satisfied (<r ■ nf = U-latUU-lazU = U~1IU = 1. (1.12.51) 1.13 Unitary operators We express what we have recognized here more generally. Property A has the possible values 01,02,--- ,on. Property B also has the same possible values 61 = Oi,... , bn = an. Now consider n Uab = ^2\ak)(bk\ , Uba = ^2\bk)(ak\ , k=\ k Ul = Uba , Ul = Uab , (1.13.1) for which UabUba = ^2\cik) (bk\bi)(ai\ = ^2\ak)(ak\ = 1 , k,i *—w—' k = S(k,l) UbaUab = Y,\bk)(ak\ai)(bi\ = 1 , (1.13.2) k,l so that Also, U]ab = Uba = U^, ul = Uab = U£. (1-13-3) (ak\Uab = (bk\ , Uab\bk) = \ak) , (bk\Uba = (ak\ , Uba\ak) = \bk) ■ (1.13.4) The U symbols act, or operate, on one orthonormal set of vectors to produce the other orthonormal set. They maintain the metrical relations of the unitary geometry: (bk\bi) = (ak\UabUba\ai) = (ak\ai) = 5ki (1.13.5) and so are called unitary operators. Indeed the term operator is applied, mathematically, to every element X of the measurement algebra: X = J2 (a'\X\a")\a'a"\ , (1.13.6)
68 1. Measurement Algebra which can be thought of as operating on any vector to produce another vector (of the same kind): (a'\X = J2(a'\X\a")(a"\, a" X\a") = J2\a')(a'\X\a"} ■ (L13-7) a' The word 'operator' is frequently used as synonym for 'measurement symbol' - we have done so above when speaking of Pauli operators, for instance. In particular the Hermitian operator A = Y}a')a'{a'\ (1.13.8) operates on an a vector to produce a multiple of the same vector (as we know): (a'\A = a'(a'\ , A\a') = \a')a'; (1.13.9) such vectors are termed eigenvectors of the operator. The numbers a' are correspondingly called eigenvalues. The Hermitian operator B, the symbol of physical property B, is B = ^2\bk)bk(bk\ k = 2_^ Uf,a\ak)ak{ak\Uab k = UbaAUab (1.13.10) indicating the manner in which Uab operates on A to produce B. The totality of b vectors is expressed in terms of the a vectors by, for example, h) = Zh)H&') (1.13.11) k where the set of n2 numbers, (aJ&/), which are generalizations of direction cosines, is called the ab transformation function. That is a mathematical term. Physically these are probability amplitudes or wave functions. They are also the matrix elements (^16,) = (¾^¾). (1.13.12) We illustrate this for the example of a Stern-Gerlach measurement, where A = ax, B = cr-n, Uba = U^1 = e^*"7* e~^av , (1.13.13)
1.14 Unitary operator bases. Complementarity 69 so that ^e-^-^a^cos^ia.sin^K) az\an -az = /cosfe"^-sinfe"^\ 3 V sinf e^ cosf eh<P ) The first column of this matrix is the wave function ipa-n=+i (<r'z), the second column is tparl=-i(az), consistent with (1.12.6). 1.14 Unitary operator bases. Complementarity Thus far we have had only one physical example before us - the two-valued magnetic moment of the Stern-Gerlach experiment. The time has come to break out of that beachhead. And we shall do it by examining some apparently innocuous questions. A unitary operator converts one orthonormal set of vectors into another such set, or, possibly the same set. Certainly unity is a unitary operator: ltl = H = l (1.14.1) and indeed (a'\l = (a'\, l\a') = \a'). (1.14.2) Now suppose we consider a unitary operator that produces the same set of a vectors but in a different order, as produced by numbering them: 01,02,... ,an, and then, for example, cyclically permuting them: U\ak) = \ak+1) U\a„) = |oi) . n-1 (1.14.3) We have the simplest example of this in ax: °*h) = l+)> ^1+) = 1-). (1.14.4) In this example, repetition gives unity, a\ = 1. Now
70 1. Measurement Algebra U'2\ak) = U\ak+1) = \ak+2) (1.14.5) and Un\ak) = \ak+n) = \ak) , (1.14.6) where the index k is understood modulo n, n + 1 = n + 1 (mod n) = 1, n + 2 = n + 2 (mod n) = 2 , et cetera. (1.14.7) We have Un = 1 ; (1.14.8) n is called the period of U. Let's think about the numbers u' that satisfy (1.14.8): («')"=!, u'= e2lr[n ; k =1,2,...,n (1.14.9) so that the various m' are the n different nth roots of unity (for n = 2, e.g., the two roots are u' = e 2 = —1, e 2 = 1). What we are saying is that U obeys the algebraic equation n Un-l=J[(U-uk) = 0 with uk = e27Tin . (1.14.10) The factorization of Un — 1 into a particular m — uk and all the rest is done this way: n-l £/" - 1 = (f//Mfc)n - 1 = (U/uk ~l)J2 ^1^)1 (1.14.11) /=o which makes use of the familiar identity (X - 1) (1 + X + X2 + ... I""1) = Xn - 1 . (1.14.12) This result (I = n is as good as I = 0) ([/ - ufc) J] (£//«*)' = J2 (U/u*)1 (U -uk)=0 (1.14.13) /=1 /=1 suggests the symbol for the measurement of U that yields the value uk: -. n h«fc| = -^(t/M)' ; (1.14.14) n , /=1 indeed, 1½¾ -> 1 for f/ -> u^., as it should.
1.14 Unitary operator bases. Complementarity 71 It is important to note that this operator is Hermitian: K«*r = IE (ut/Ol = IE vi"*vl n n-\ /=0 / = 1 = \ukuk\ . (1.14.15) It is clear that the algebra here is quite the same as with a Hermitian operator and indeed a unitary operator can be considered to be a complex valued function of a Hermitian operator, as in e^^az. Now consider 1 n \uk){uk |o„> = - Y, uk' Ul |o„> (1.14.16) = 1¾) with the consequence 1 1 {an\uk){uk\an) = \{uk\an)\ =-u^n = - . (1.14.17) We choose (uk\an) = \j\fn and get, consistently, \uk, /n /=i and for (uk\=-Lj2e^kl(ai\, (1.14.19) n , /=1 1 MW 1 (wJa,)= ^=e^Tfc'; = -= for I = n . (1.14.20) 1 ' \/n Vn Now that we have another set of n orthonormal vectors, let's define a second unitary operator, V, by cyclically permuting them: (uk\V = (uk+1\ , (uk\V2 = (uk+2\ , (uk\Vm = (uk+m\ , (uk\Vn=(uk\. (1.14.21)
72 1. Measurement Algebra Again Vn = l and vi=e2lT'ln, l=l,...n (1.14.22) so that \VlVl\ n , 111' = \ E (W* , (1-14-23) leading to <u„h>H = -Ee^H(«*l . (1.14.24) and then |(un|uj)| = 1/n. With (un\vi) = 1/\A* we get |^) = ^V|Mfc)e^W. (1.14.25) However, and therefore \vi ^e^W = (Mfc|a,) (1.14.26) > = 5Z|«*><«*|ai> = |«i> I (1-14.27) We are led back to the initial set of vectors. What we have, then, is the reciprocal definition of two unitary operators, ^h> = |»J+i>, (uk\V = (uk+1\ . (1.14.28) These unitary operators obey Un = l, Vn = l, (1.14.29) and one more relation derived, for example, by considering (uk\VU = (uk+1\U = e^(fc + 1)(Mfc+1| , (uk\UV= e^{uk\V = e^*(ufc+i| , (1.14.30)
1.14 Unitary operator bases. Complementarity 73 namely We also have VU= e^UV . (1.14.31) (^1^,) = 4=6^, (1.14.32) and the same exponential occurs in V'Uk = e^fklUkVl , (1.14.33) which we get, either by repetition, or from the comparison: (um\VlUk = (um+l\Uk = e™klm + l)(um+l\ , (um\UkVl = e^km(um\Vl = e^km(um+l\ . (1.14.34) For n = 2, we have [/2 = 1, V2 = 1 , UV = -VU (1.14.35) which we recognize as the properties of ax and ay (or any other pair); these Hermitian operators are also unitary: oxax = (crx)'2 = 1. In the progression from n = 2 to n = oo, we go from anticommuting U and V to commuting U and V. Commutativity, characteristic of ordinary numbers, is the hall mark of classical physics. Indeed, from our viewpoint, in which atomic mechanics, quantum mechanics, is fundamental, we shall derive the characteristics of the physical properties that are recognized in the classical limit. The four basic operators of n = 2 are constructed from the fundamental U,V pairast/fcV', k,l = 1,2: UlVl = axay = iaz, U1V2=ax, U2Vl = ay, U2V2 = 1 . (1.14.36) For general n, UkVl with k,l = l...n, gives n2 operators, again just the number of measurement symbols. Can we exhibit each of the measurement symbols, say \u'u"\ in terms of the UkVll We do it by beginning with \uk){uk\ = ¾ (U/uk)1 = - V e-¥klUl . (1.14.37) Then multiply on the right by Vm k and get
74 1. Measurement Algebra \ukum\ = ~Y e~2-^klUlVm~k , (1.14.38) i which also applies for m < k since V"1 = V™"1, for example. And any operator, say F, which is a linear combination of the n2 measurement symbols \u'u"\ is also a linear combination of the n2 unitary operators UkVl: n F=Y,hiUkVl = f{U,V), (1.14.39) it is a function of the fundamental pair U and V. Although we can think of U and V as complex valued functions of two physical properties symbolized by Hermitian operators, it is more direct to work with U and V■ And so we have the probability p(u', v') that the particular outcome v' will come from a V measurement performed on atoms that have been selected to have the value u' of U. It is (u' = u^, v' = Vi) 1 (1.14.40) n independent of the particular choice of u' and v'. This statistical aspect of U and V is emphasized by considering a measurement that includes a nonselective measurement: p(u',v,u") = X)P(«',«')P(«',«") = Y.\\ = ^ > (L1441) v' v' as compared with p{u', 1, u") = p(u', u") = S(u', u") ; (1.14.42) the intervening non-selective v measurement has removed all knowledge of the initial u' value - all outcomes are equally probable. Physical properties whose measurements interfere are called incompatible; U and V display optimal incompatibility. That, and the fact that both U and V are needed in the description of the system is what is meant by calling U and V complementary properties: Both are required, but the measurement of one wipes out any prior knowledge of the other. Yes, the essence of Bohr's* complementarity (1927) is just that: All quantum objects must possess mutually exclusive properties. That's quantum mechanics. Here's a nice way to evaluate the trace of an operator F given as a function of U and V: F = f(U,V) = J2hiUkVl. (1.14.43) k,l p(u',v') = Jn P—kl e n "Niels Henrik David Bohr (1885-1962)
1.14 Unitary operator bases. Complementarity 75 Take the matrix element with u' row and v' column: (u'\F\v') = (u'\J2 fkiUkVl\v') = /(«', »')<«'|»') (1.14.44) k,l and then tr {F} = Y,(U'\F\U') = E (u'\F\v')(v'\u') u' u',v' or tr{/(£/,V)} = i £/(«>')• (1-14.46) n If F is a physical property, or a function of one, we have and, since tr {|/'/'|} = 1> f = EI/')/'(/'l (L14-47) /' tr{F} = £/', (1.14.48) /' that is E/' = ^E/(M>')- (L14-49) /' u'v' Here, then, the sum of all physical values of f(U, V) is given by a summation over all the numbers, f(u',«/), that are produced by independent assignment of values to U and V: an Ergodic Theorem. As a preliminary to a trace evaluation, note that E(M*h)HM™) = <^m = -E e^Me"^lffl / n / = -E(e~) (L14-5°) i or, equivalently, 1E(-r = (i !or rnz°> (1.14.51) n, (0 for 0<r<«, v '
— k 76 1. Measurement Algebra which is just a property of the nth roots of unity, Uk = e^~K. Now consider tr {UkV1} = -^2 («')*(«')' • (1.14.52) n u', V' Clearly, for k = I = n, we get tr{l} = n, (1.14.53) which is (1.11.9), but for any other choice: k = 1,..., n-1 or I = 1,..., n-1 (1.14.54) " *■ ' n2 — 1 in number we have tr{UkVl}=0. (1.14.55) This is the generalization of the n = 2 property (1.11.7). 1.15 Quantum degrees of freedom For each choice of n = 2,3,4,... we get a different pair of complementary quantities, a different physical system. However, for n = 4 = 2x2, orn = 6=2x3, generally n = nin2, it is possible to consider the system to be a composite of two simpler ones characterized by n\ and n2. A step in this direction comes from considering Ui = Um , U2 = Uni , Vi = Vhn2 , V2 = Vhni , (1.15.1) where l\, l2 are integers such that U?1 = Un = 1 , f/2n2 = Un = 1 , ym _ ylin = j ^ y2«2 = y/2n = j ^ (1.15.2) as well as, e.g., Vi£/2 = vhnaUni = e^LhnU2V1 = U2V-i (1.15.3) and Vif/i = y«i"2[/"2 = e^^C/iVi = e^TC/iVi , (1.15.4) provided Iiri2 = 1 (mod rii). There is a unique choice of l\ and ^) if ni and ri2 have no common factors - for m = 2, n2 = 3, for instance, one gets l\ = 1,
1.15 Quantum degrees of freedom 77 h = 2. However, this does not cover 4 = 2x2, say. Instead we proceed this way: replace the linear counting k = 1,... , n by the rectangular counting, h\k-2 1,1 1,2 ... l,n2 2,1 2,2 ... 2,n2 (1.15.5) ni,l 1,2 ... ni,n2 where ri] rows and n2 columns account for riin2 = n pairs fcj,fc2. Then we can introduce operators ¢/1,¼ that permute the rows, and independently, operators £/2,½ that permute the columns. With —fci — k-i uiki = e"i = ulfcl ; u2k2 = e"2 = «;2fc2 (1.15.6) we get the reciprocal definitions: (wifci,W2A:2|Vl = («lfci + l,W2fc2| , (^1^,^2^1½ = (ulkl,U2k2+l\ , (1.15.7) Ui\vikx,V2k2) = \vik! + l,V2k2) > C^2 !«!*!!, «2*!2 ) = |«l*!l,«2*!2 + l) • (1.15.8) and Within each set, Ui,Vi and £/2,14, all is as before, U?1 = V?1 = 1 , f/2n2 = v.p = 1 , VlU1 = e^U1V1, V2U2 = e^iU2V2 . (1.15.9) The operators Ui,Vi act on one index of the vectors; the operators £/2,V2 act on the other index. It does not matter in which order the two different operator sets act, e.g., (uiki, U2k2\UlV2 =^1^(^1^,¾¾]½ = Uik1(uik1,U2k2 + l\ , (Mlfci, 1^1½^ = («1*!, «2*2 + 1 1^1 = U1k1{uik1,U2k2 + l\ ■ (1.15.10) So UiU2 = U2UX , V1V2 = V2V, , UiV2 = V2U! , VlU2 = U2V1 ■ (1.15.11) Two physical properties such that the measurement of one does not alter prior knowledge of the other, in which the sequence of measurement does not matter - as represented by the commutativity of the associated symbols -
78 1. Measurement Algebra are called compatible. The pair of operators U\,V\ is compatible with the operator pair ¢/2,½. It's a useful fiction to think of U\,V\ and ¢/2,½ as describing two independent systems, which can be created individually, in either order, or individually destroyed: /' 'I /M/'l /'l/'l W>M2| = (Ml|\M2| = (M2KMl| ) K,<> = K>K> = K>K>, (1-15-12) and so or explicitly, {u\,u'.2\v'l,v'.;) = (u\\v'l){u'.2\v'l), (1.15.13) 1 1 \ 1 2iiJfci/i 1 — k2l (uikt, u2k2 fiii, v2i2) = -7= e "i —= e "2 2'2 = _— e27ri(fci/1 /m + ^2/2/^2) _ (1.15.14) This also emerges directly by writing the (u\ki, M2fc21 in terms of {viit, «2fc21> which involves (mi^I^i/j); and then writing the (vu1,U2k2\ ^11 terms of the («i«i,«2«2|) which introduces (u2k2\v2i2). These references to compatible and incompatible physical properties direct attention to something that has remained implicit so far: We began with the measurement of a single physical property. In general, however, to such a property A\, we can add another compatible property, A-2. A selective measurement of both is symbolized by |a^Oi II02O21 = |«2a2 ||ai°i I • (1.15.15) We continue in this way until we reach a complete set of compatible physical properties: Ai,A2, ■ ■ ■ ,^4a'(A), such that any additional property will be incompatible with at least one member of the complete set. The symbol of such a complete measurement is K^'i I • • • \a'K(A)a'K(A) I = |«'a'| with «' = {a[,a2,..., a'K(A) j . in any order (1.15.16) Atoms selected to have the particular set of values a' are said to be in the state a'. From this point on, the entire construction of the measurement algebra and the geometry proceeds as before, the only difference being that the outcome of a measurement is not a single number but a collection of numbers. Thus
1.15 Quantum degrees of freedom 79 J2\a'a'\ = l, \a'a'\ = \a')(a'\, (1.15.17) and K(A) (a'\a") = 6{a',a") = ]J Sia'^a'j) ■ (1.15.18) i=i Now we reverse things and construct the symbols of measurement of one quantity from that of complete measurements. Thus Kai|= £ \a'a'\ ' (1.15.19) a2>-" <aK(A) for the measurement that selects atoms with the values a[ of Ai is the less selective complete measurement that selects a[ and any value of a'2, a'3, ..., a'KiAy Of course £|°i°i| = £la'a'l = 1 ' (1.15.20) a'^ a' and, e.g., Ax = XXK«i| = £|a')ai(«'| • (1.15.21) a'j a' Notice that now \a\a\ I is not a simple product of vectors. That is emphasized by contrasting tr{|o'o'|} =tr{|a')(a'|} = (a'|a') =1 (1.15.22) with trflaXl^trJ ^ la')(a'l \ = £ 1 = m(a[) , (1.15.23) \a2>-'- >aK(A) J a2'"' 'aK(A) the multiplicity of a\, the number of a' states that have a'j in common. As a result, we have, equivalently, tr {4i} = 5>im(°i)=£0i- (1.15.24) Of course, we could start, with any other quantity, £?i, in general incompatible with the set Ai,A-2,..., and supplement it by compatible properties to form the complete B set, with its sets of values, &', and vectors (&'|, |&'). What the incompatible A and B sets have in common is the total number
80 1. Measurement Algebra of states, which is the dimensionality of the geometry. That follows from the alternative evaluations: £5>'|&W>=£i=<4)] a' b' a' £5>|a'>W>=£l=»(*) > = n . (1.15.25) b' a' b' We meet these ideas in a more mathematical way in the context of the eigenvalue problem. Consider an n dimensional space, with a coordinate system |a'}. Then given a Hermitian operator Bly find its eigenvalues and eigenvectors, that is, solve Bi|) = |)&i. (1.15.26) Introducing the a coordinate, system, we get the system of equations J2(a'\Bi\a"}iP(a")=iP(a')b\ (1.15.27) a" for the components of the eigenvector, (a'| )='</>(«') • (1.15.28) When written as ^[(o'|Bi|o") -6i<J(o',o")]^(o") =0, (1.15.29) a" it is familiar that the equations have no solution other than tp(a") = 0 unless det[(o'|Bi \a") - b[6(a',a")] = 0 (1.15.30) or, more explicitly, '(oi|jBi|oi) -&i ••• (oi|jBi|o„) det = 0 (o„|jBi|oi) ••• (o„|jBi|o„) -&i_ which is an equation of degree n for b[: {-b[)n + (-b'^-hr {Bx} + ---+detBl=0. For a simple example, take n = 2, Bx = ax, A = az: = (-¾)2 + (-¾) tr {<Jx}+detax (1.15.31) (1.15.32) det -¾ 1 1 -¾ = 0 = -1 = b[2 - 1 = o, (1.15.33) implying b[ = ±1.
1.15 Quantum degrees of freedom 81 Given a particular root b[, one returns to the linear equations for the tp(a"Ys and finds them, to within a constant factor. That is fixed, to within an arbitrary phase, by normalization, £|<Ma')|2 = l- (1-15.34) a' Two such eigenvectors associated with different roots, or eigenvalues, are orthogonal. That follows from the alternative evaluation (b[ \BX \bl) = b[ (b[\b';) = (b[ |&i>'i' (1-15-35) or (&i-&i')(&iK>=° (1.15.36) so that K / &'/ : (b[\b'0 = J2^(«')>*»(«') = ° • (!-15-37) a' Now suppose that not all the n roots are distinct, that some of them are multiple. For example, suppose that n = 4, and B\ obeys B\ = 1, so that there are only two eigenvalues: +1, —1. Then one or both must be multiple, as illustrated by the determinant (&' — 1)2(&' + 1)2. Let the multiplicity of b[ be 771(¾) = 2,3,..., so that there are 771(¾) linearly independent solutions of (1.15.27). These vectors, \b[, l),... , ^1,771(¾)) are not automatically orthogonal, but they can be made so, as illustrated in two Euclidean dimensions by the projection: -w So now we have an orthonormal system: \b\,l), I = 1,... ,771(6^). Find a second Hermitian operator B2 that commutes with Bi, B\B2 = -B2-B1, and BiB2|&i,Z) =B2B1\b'1,l) -B2|&'i,0&i • (1.15.38) This says that B-2\b\,l) is a £?i eigenvector with the eigenvalue b\, and must therefore be an linear combination of the 771(¾) known vectors: ™(6'i) B2\b[,l}= Y,Kk)(b'i>k\B*K1} (1.15.39) fc=i where the appropriateness of the matrix element labeling is checked by multiplication with (b'},k\.
82 1. Measurement Algebra Within this m(&'1)-dimensional space we ask for eigenvectors of B? ^1^1,) = 1^,)½ (1.15.40) or with (&i,l|&i, ) = ipb'.il), 53 <6i,fc|B2|6i,0^i(0 = ^(¾)½ (1.15.41) /=i leading to det [(&i,fc|B2|&l,j) — «J(fc, O62] =°- (1.15.42) If all the roots of this equation differ, it provides us with the orthoiiormal vectors 161,62)- ^ not' we nnd a third commuting operator £?3, and so on. The U, V operators illustrate this discussion. For a system with n states, the two Hermitian operators implicit in U and V, such that Un = Vn = 1 and Uk = Vk = e27nfc'n for fc = 1,... ,n, each have n distinct eigenvalues, so the multiplicity of any eigenvector is one. In this situation the single operator U, or V, is already complete. If n is a prime integer, that's where it rests. But if n = nin-2, we can introduce U\,V\ of period rii, and the commuting [/2,½ of period n-2- The eigenvalues of JJ\ or V\ are only rii < n in number and must be multiple. We can form various complete sets of compatible properties U\,U2\ U\,V2\ V\,V2\ V\,U2 leading to the orthonormal vectors 1^1^,112^2)) |Mifci,^2/2), etc. If fix and/or n2 is still factorable, this process can be continued, ultimately ending with the various irreducible systems in which the number of states is a prime integer: v = 2,3,5, 7,11,13,... . These we call quantum degrees of freedom. 1.16 The continuum limit 1.16.1 Heisenberg's commutation relation With the exception of v = 2, all v's are odd. Then it's convenient to choose the k in uk = vk = e(2m/u)k ag fc = 0,±l,±2,... ,±^—. (1.16.1) Now let's exhibit the Hermitian operators in U and V. For write
1.16 The continuum limit 83 U = eie<? , V = ei€P , q' = qk , p' = pk , qk=pk=ek = 0, ±e, ±2e,...,± (n/e - |c) . (1.16.3) Recall that V'Uk = e^fklUkVl (1.16.4) or y-ljjkyl _ e-^kljjk (1.16.5) and Jjkyljj-k _ e-~klyi ^ (1.16.6) which now appear as e-ikpeikeq eikp _ p-ifce/e eikeq _ eike(q - h) (1.16.7) and eifceg ei/ep e—ikeq _ e-ikelt ^lltp _ eik(p — he) _ (1.16.8) With le = q', ke = p' they become e-iq'p jp'q jq'p = jp'(q - q') (1.16.9) and ev'qeWPe-ip'q = eiq'(p-p') } (1.16.10) respectively. We see here unitary transformations on a function of an operator, a function that is defined by a power series. So, with U referring to either unitary operator and A to either q or p, consider U-1AkU = U~1AAA---U = U'1 AUU-1 AU ■ ■ ■ = (U^AU)k , (1.16.11) that is, the unitary transform of the function is the function of the transformed operator, U~lf(A)U = f{U~lAU) • (1.16.12) This is quite generally true - for any operator function f(A) and any unitary operator U-
84 1. Measurement Algebra In the examples (1.16.9) and (1.16.10), then v'b-iq'p^q'p} = jP'(q-q\ eq I pe J = JQ(P-P) . (1.16.13) It is tempting to identify the respective exponents, but it is not generally correct because of the periodicity of the exponentials. However, there is a limit in which this is right: v -> oo, e -> 0, with qualifications. It helps to first draw the circle of periodicity (¾ = pk = he): -n/e+ |e n/e— \e In this limit, as e -> 0,1/e -> oo, any finite portion including 0 is indistinguishable from a continuous straight line: 0 1 But to avoid the fact that, if the line is continued indefinitely in both directions, the two ends ultimately meet, one must implicitly restrict all applications to physical situations where the values of q and p, although possibly very large, remain finite. Now the periodicity does not come into play, and the values of q' and p' form a continuum, which permits power by power identification. So e-tfP q^P^q-q', piP 9 „ p-iP pe-W<l=p-p'. (1.16.14) Deferring for a moment the interpretation of these equations we again use the continuous nature of q' and p' permitting a power by power comparison. We want just the first power: (1 - iq'p + ■ ■ ■) q (1 + iq'p +•••) = <?-<?' (1.16.15) *- ^, ^ = q + iq'(qp-pq) + --
and 1.16 The continuum limit 85 (1 + ip'q + ■ ■ ■) p (1 - ip'q + • • •) = p - p' (1.16.16) v „ ' = P + ip'(qp-pq) + --- both of which yield the statement of non-commutativity 1 qp — pq = i or T[q,p] = l. (1.16.17) This is the essence of Heisenberg's* discovery of non-commutativity in 1925. To the compact formulation in the form of the commutation relation (1.16.17), Born* contributed substantially. The action of the unitary operators on vectors: (uk\Vl = (uk+i\ , Uk\vt) = |wfc+«> (1.16.18) can be relabeled as (q'\e^P = (q'+q"\ , e^'^p') = \p' + p") , (1.16.19) after we employ (1.16.2) and (1.16.3) and identify ke = q', le = q" in uk - eie(fce) = eiel' , yi = ei(fe)p = eiq"P ? uk+l = ex(ke + k) = ek(g' +g") ^ (1.16.20) and ke = p", le = p' in jjk _ ei(ke)q _ eip"q vi = eie(/e) - eieP , vk+l = eHl*+ke) = eHP'+p") _ (1.16.21) We'll find later, in Section 2.1, that (uk\ -> \/e(</|> |^/) -> \p')\ft, but the additional numerical factors simply cancel here. Here we see, for example, unitary operator e1^ p acting on a left eigenvector of q to produce another such eigenvector with a displaced eigenvalue. The statement that (q'\elcl p is a q eigenvector with the eigenvalue q' + q", or equivalently that it is an eigenvector of q — q" with the eigenvalue q', is (q'\eiq"p(q - q") = </(g'|ei<?"P = W\(l^q"P (1.16.22) *Werner Heisenberg (1901-1976) fMax Born (1882-1970)
86 1. Measurement Algebra or eti'p(q-q") = qetf'P, (1.16.23) which is just the previous operator equation in (1.16.14) (with q" -> q'). As we see, when there is a statement about q, there is an analogous one for p. That originates in the symmetry (see Problem l-43a) U -> V, V -> J/""1, or, with U = eie<7, V = ekP, q-^p, p->-<?. (1.16.24) Indeed i = qp~pq —> p(-q) - (-q)p = qp - pq (1.16.25) and (with q' -> p') e~'1l'PqeiJP = q~ql , I etfqpe-ip'q=p-p> . (1.16.26) If we apply this to (q'\eicl"P = (<?' + q"\, we get (p'|e_iP"« = (p' +p"\, the adjoint of e^'^lp') = \p' +p"). 1.16.2 Schrodinger's differential-operator representation Now let's convert the q vector equation to numerical form by multiplying by a right vector I ): (q'\j*"p\) = (q'+q"\). (1-16.27) We assume I ) to be such that (q' +q"\ ) is a continuous function of q", permitting expansion of both sides in powers of q": (q'\{l+iq"p+---)\ ) = (q'\ )+q"^i(q'\ > + '" (1.16.28) or ^1) = 1^1)- (1-16-29) This differential-operator representation of the effect of p is the essence of Schrodinger's discovery of wave mechanics (1926).
1.16 The continuum limit 87 To show the equivalence of this with non-commutativity, consider 1 d (q'\(qp-pq)\ )=q'(q'\p\)- 7^7(«'M ) ,i_d__ i_d_ , i dq' i dq' (q'\ ) = (q'\i\ ) (1.16.30) as it should. The substitution q -> p, p -> — q gives us with the adjoint MolH^W < M = 1£< l»'> ■ The relation (1.16.29) and (q'\f(q)\ ) = f(q')(q'\) (1.16.31) (1.16.32) (1.16.33) are combined in W\f(qM\) = f{q\\~)w\) (1-16.34) provided f(q,p) can be built up from the basic structures by addition and multiplication. That is, if the differential operator representation is correct for fi(q,p) and f-i{q,p), as it is for f(q) and p, then it is also correct for fi(q,P) + f-2(q,p) and A(<7,p)/2(<7,p). Indeed (q'\ [/i(<Z,p) + f2(q,P)} | ) = (q%(q,p)\ ) + <<z'|/2(<Z,p)| ) and , , , 1 d \ ,/,19 1 d (1.16.35) W\h(q,p)f2(q,p)\ ) - /i (/.7^7) <«'|/2(«.P)| ) The corresponding statements about p eigenvectors, W(«.p)| ) = /(^)(^1). (1/(^)1^) = /(7^)(1^. (1.16.37) are obvious analogs.
88 1. Measurement Algebra Problems 1-1 What is the root mean square speed of silver atoms passing through the slits of the Stern-Gerlach apparatus after evaporating in the oven at 2 227°C? 1-2 The torque /x x B that results from the action of the magnetic field B on the magnetic moment /x changes the atomic angular momentum vector J in accordance with d 7 — J = ijxB. dt ^ Suppose that /x = jj and let the homogeneous field B point in the z direction. How do the components of J change in time? Assuming that Jz = 0 initially, after what lapse of time will J reverse direction? Express this as a distance d traveled in the field at the mean speed v. Take 7 = 1.76 x 107 and \B\ = Bz = 0.01 [cgs units] and find d. 1-3 On page 30, speaking of the classical anticipation of the outcome of the Stern-Gerlach experiment, the text says: "the Ag atoms certainly have their magnetic moments (m.m.) distributed uniformly in all directions, which is to say that the distribution of /j,z should be uniform between the limits n and —/x-" Justify the latter conclusion. 1-4 The Stern-Gerlach experiment shows that the possible values of /j,z are ±/x. What then would be measured for /i2z, /^, ^, and M2 = nl + $ + A ? How does this compare with the classical picture in which /x should have some specific direction in space? 1-5 A certain textbook, seeking to define the addition of measurement symbols, |o'o'| + |o"o"| , a' / a", says: "This is the case of the Stern-Gerlach experiment followed by a screen with two holes." Comment on this assertion. 1-6 Prove that (a' / a") [|a'a'| + |a"a"|]2 = |a'a'|+|a"a"|. 1-7 Evaluate the commutator [|o'o"|,|o'"o"'|] . Check that your answer has zero trace. Why should that be so?
Problems 89 1-8 Show that the vectors (af\ = \{a')(a'\ , p) = \a") [Afc")]-1 also satisfy (a7 p) = <5(a', a") for arbitrary numerical values of the A (a'), and verify that IF><*l = i- a' Similar statements apply to the vector sets (b'\ and \b"), with numbers A(b'). Find what restrictions on A(o') and A(&') are required so that Use this arbitrariness - this freedom of definition - to argue that (a'\b'\ cannot itself be a probability, whereas (a'\b')(b'\a') can be, and indeed is. 1-9 Concerning page 43: Show that the apparently more general possibility for real numbers, <&>'>= 7<a'|6'>, p(a',&') = 7[(a'|&'>]2 with 7 real and positive, can be reduced to what was considered by an acceptable redefinition of the a vectors, or the b vectors. 1-10 An initial Stern-Gerlach measurement selects the + beam. A second + selection is made on this beam, in a direction differing by angle e. A third + selection is made on that beam in a direction again differing by angle e. This change in direction is repeated n times in all to produce the final + selection at an angle 0 = ne relative to the initial direction. In the limit as e -> 0 and n = 9/e -> oo, what fraction of the initially chosen beam emerges from the final selection? 1-11 Write Mb, the symbol of a non-selective B measurement, as a function of operator B. What kind of operator is this Mj? 1-12 Physical property s has the possible values +1,0, —1. What algebraic equation does the symbol s obey? Exhibit the measurement symbols | + + |, |00|, | — | as functions of s. i I s ne = i
90 1. Measurement Algebra 1-13 Given that (ax cos <p + ay sin ip) = 1 for all angles ip, and that az = —\axay , deduce all the algebraic properties of ax, ay, az. 1-14 Evaluate a ■ aa■ba ■ c in two ways indicated by (cr ■ a a ■ b) a ■ c and a ■ a {a ■ ba ■ c). 1-15 Evaluate the commutator of cr ■ a and cr ■ b, and also their anticom- mutator. 1-16 The magnetic moment in the Stern-Gerlach experiment is /x = /j,a. What can you say about the possible outcomes of a measurement of /x2? Compare with Problem 1-4. 1-17 Use the fact that ax = e"1iav azei:iay to find the eigenvectors of ax in terms of the az eigenvectors. What is the probability interpretation of these ax eigenvectors? 1-18 Exhibit the matrices for cr ■ a and cr ■ b. Then use matrix multiplication to find the matrix for cr ■ a cr • b, thereby checking the multiplication law for the ct's. 1-19 The Pauli matrices in (1.10.3) refer to the eigenvectors of az. What are the equivalent matrices referring to eigenvectors of ax, or of av. 1-20 Stern-Gerlach measurements are made on atoms that are symbolized by the unit vector I ^, with a'z components i\)(a'z = ±1)- Write the expectation values of ax, ay, az (comprising the three-dimensional vector (cr)) in terms of the wave function tp(a'z), which is arbitrary apart from the statement of normalization. Prove that (cr) is a unit three-dimensional vector. 1-2la Verify the commutator identity [X,YZ} = Y[X,Z} + [X,Y]Z .
Problems 91 What is the analogous version of [XY, Z] ? What does the phrase "the commutator [X, Y] is linear in X and in Y" imply about [X\ + X-2, Y\ + Y-2\ ? l-21b Check that [X, [Y, Z]} + [Y, [Z, X]] + [Z, [X, y]] = 0 . What statement about three-dimensional vectors emerges from the choice X = a ■ x , Y = cr -y , Z = a ■ z ? Verify it. l-22a State the adjoints of the measurement symbols X*X , X*Y + Y*X, i(X^Y-Y^X), and of the eigenvector equation (a'\A = a'(a'|. l-22b X is a non-Hermitian operator. What kind of operators are X^X and XXt? Verify that A=$(X + X*) , B = %(X-X*) are Hermitian, which gives the construction X = A + \B . Apply your results concerning X^X and XX^ to learn the adjoint nature of (l/i)[A,B}. 1-23 Hermitian operator A has eigenvalues a' and eigenvectors |o'); Hermitian operator B is arbitrary. Show that the matrix elements (a'|i?|a") obey the sum rule 53 (a" - a')\(a'\B\a"}f = |(o'| [B, [A,B]] \a') . a" 1-24 Use the matrix significance of the trace to prove that tr {XY} = tr {YX} . What is tr {[X, Y}\? Evaluate tr {f(A)} in terms of the numbers f(a').
92 1. Measurement Algebra l-25a Use the construction of the |&'&'| by means of the la'a'l to show that E!&,&'hEK«'! h1]- b' a' l-25b By transforming from b vectors to a vectors prove that E<W> = E<a'lA1a'> [=trW]- 1-26 The trace has been defined as a numerical property of an operator, not by any particular matrix realization. As a check of this show that there is a unitary operator U such that the a matrix of U~~1XU is the b matrix of X. Conclude from tr {U~~1XU} = tr {X} (why?) that the trace is independent of the matrix realization used to compute it. 1-27 Show that tr{XfX} = Y, \(a'\X\a")f >° [=0onlyifX = 0] a' ,a" which is the squared length of operator X expressed in terms of components relative to the orthonormal operator set |o'o"|. Verify the inequality tr {X*X} tr {yty} > |tr {X^Y} f . When does the equal sign hold? l-28a The unit symbol and the three Pauli operators are used to define another set of four Hermitian operators, -Boo = |(1 + <?x + <?v + <*z) , Bw = o-xB00ax , Bqi = <jzB00oz , B\\ = OzVxBqqVxVz ■ Verify that they are indeed Hermitian. and find the eigenvalues of Bjt (j,k = 0,1). Show that tr {Bjk} = 1 and tr {BjkBj,k,} = 26jj>6kk> . l-28b The Bjk's are complete, which is to say that any function X of Pauli's vector operator cr can be written in the form
Problems 93 X = YlxikBik ■ j,k Justify this statement, and find what is needed in Xjk = tr {(?)Ar}. Are the coefficients Xjk unique for a given X? Write 1, ax, ay, az as sums of the Bjk's. l-28c Operators X and Y are specified by their respective coefficients Xjk and yjk. Express the traces tr {X}, tr {A^A}, and tr jytA} in terms of these coefficients. 1-29 Work out the value of j 1 + {ax + <Ty)/V2 1 + {-ax + ay)/V2 \ { 2 2 J' What physical meaning can you give to this number? l-30a Expectation value is a linear concept: ((X + Y)) = (X)+(Y) , (XX)^X(X), and the expectation value of the Hermitian operator representing a physical quantity is real: {^>* = {^> , (B)* = (B) , ... , or, more generally, ({A + iB)) * = ((A - iB)) , that is: (A) * = /x*\ . Also, (1) = 1. Define the probability operator P (frequently called density matrix) by \a'a"\) = (a"\P\a') (does ( \b'b" | ) have the same form?) so that the expectation value of X= J2(a'\X\a")\a'a"\ is (X)= ^(a'|A|a")(a"|F|a') = tr{AF} . a' ,a" Prove that P is Hermitian and that tr {P} = 1.
94 1. Measurement Algebra l-30b Each |o'o'| symbolizes a physical quantity with the non-negative values 0 and 1. So (\a'a'\\ = (a'\P\a')>0. If the Hermitian operator P is regarded as symbolizing some physical property, conclude that one can write p=j2p'\p'p'\ with p'>°>xy=i- p' p' Recognize that (X)P = YiP'(X)P' p' is a weighted average of the more familiar (X)p,=tv{X\p'p'\} = (p'\X\p'}. What happens if P2 = PI 1-31 Property B is measured and atoms selected with B = b'. What is the corresponding probability operator PI What is it for a non-selective B measurement? 1-32 Stern-Gerlach measurements are made to determine the expectation values of ax, ay, az for atoms emerging from a certain source. Show that P=I(l + <<r)-<r) is the probability operator characterizing these atoms. Compare with Problem 1-20. l-33a The determinant det {X} of operator X can be defined by the differential statement ,5 log det {X} = tr{X~~lSX} together with det {1} = 1. If this is to produce a unique det {X} the differential expression must be integrable, which means that <M2logdet{X} = <52<5ilogdet{X} , where SiX and S2X are independent infinitesimal variations of X. (You will need an expression for 5X~~l, the variation of the inverse of operator X).
Problems 95 l-33b What is S(XY) in terms of SX and SY? Use this to prove that det {XY} = det {X} det {Y} . What is the relation between det {AX} and det {X}, where A is a number? 1-34 In Section 1.5 we anticipated that the probabilities A 3—>B a b c a b c p(+, 1,-) and p(+, 1,+) C show constructive and destructive interference, respectively. In Section 1.12 we found the required probability amplitudes. Demonstrate that this works out as expected. (For simplicity take the three directions to be in the x, z plane, with azimuthal angles ip = 0.) 1-35 U and V are unitary operators. Show that UV and VU are also unitary. l-36a Any member of the 2x2 measurement algebra - any such operator - is a linear combination of 1 and the three ct's. Consider U = uo + in • a , where the number u$ is real. Under what restrictions on uq and the numerical vector u is U unitary? l-36b Unitary operators U and V are given by the numerical parameters Mo, u and w0, v, respectively. What are the parameters for UV and VU? Verify that they obey the conditions for a unitary operator. l-36c Satisfy the unitarity conditions on uo, u in such a way the U is seen to be an exponential function of i times a Hermitian operator. l-36d By what simple modification of Uq, u does one remove the restriction that Mo is real? 1-37 With what restriction on the number A is 1 + iAcr • n 1 — iAcr ■ n (where n* = n, n ■ n = 1) a unitary operator? Write it in the form of Problem l-36a; in the form of Problem l-36c. 1-38 Evaluate e?0* e?°» e?CT~ .
96 1. Measurement Algebra 1-39 Show that the adjoint relation between eigenvectors of unitary operator U, <«T = K> > is consistent with the eigenvector equation (u'\U = u'(u'\ . 1-40 Operator U permutes vectors 1¾) cyclically [cf. (1.14.3)]. Construct the eigenvectors \ui) = ^2\a,k)(ak\ui) k by solving the eigenvector equation, thereby determining the eigenvalues ui and, with the normalization requirement, the (ofcluj). 1-41 Use Ul\ak) = \ak+,) to arrive at U = /J\am)Sm-n,l(an\ ■ Prove (1.14.14) in this way. 1-42 Non-Hermitian A commutes with its adjoint, AA* = A''A . (Operators with this property are called normal.) Show that A can be understood as a (complex) function of a Hermitian operator B, A = f(B) = MB)+if2(B) , ^ = f*{B) = fx{B)-\f2{B) , with real functions /i^. What is the relevance thereof in the context of (1.14.15)? l-43a Unitary operators U and V reciprocally defined as in Section 1.14: Check that the substitution U -> V, V -> U~~l leaves the algebraic properties of U and V intact. Then find what happens to the reciprocal definitions of U and V by permutation of eigenvectors.
Problems 97 l-43b Unitary operator W has the defining property W\vk) = \uk) for k = 1,2,..., n . Show that W\uk) = \vn~-k) , and then w~1uw = v, w-lvw = u-1. l-43c Proceed from n n -. W = y^\ukVk\ = y^lukUkl -,—j—r \vkVk\ and find the coefficients Wki in n w=J2 wkiUkVl . k,l=\ 1-44 Show the consistency of the uv transformation function (1.14.32) with the algebraic properties of U and V by evaluating and comparing the matrix elements (uk\UV\vi) , (uk\VU\vi) . 1-45 Recall [cf. (1.14.10)] un-l = JJ(u-u') u' with v! = Uk = eS2m'n> for k = 1,... ,n. By taking the logarithm of this equation and expanding in powers of 1/u, prove that Y,(U'V =° for 0< r<n. 1-46 Use the cyclic property of the trace, tr {XYZ} = tr {YZX} = tr {ZXY} , for an alternative demonstration of (1.14.55), that is tr {UkV1} = 0 for 0<k<n or 0<l<n .
98 1. Measurement Algebra 1-47 Evaluate p(u',v') by using the construction I ' 'I \u u I and the similar one for |u'u'|. 1-48 Apply the result of Problem 1-45 to prove the completeness of the \u'u'\ measurement symbols, \u u I = 1 . u' l-49a Consider F = f(U,V) = J2fkiUkV = Y,lkiVlUk . k,l k,l Express the coefficients fkl in terms of the //y's. l-49b Show that an F which commutes with U, or U~1FU = F, can be only a function of U, and similarly that an F which commutes with V can only be a function of V. Conclude that if F commutes with U and V, it must be just a numerical multiple of the unit symbol 1. 1-50 Prove that tr {|a'a"|t|a"Vl'|} = S(a',a'")S(a",a'v) , an orthonormality statement for the n2 measurement symbols, regarded as vectors in operator space. Then show the analogous property for the ri2 powers of U and V, that is ^tr {(UkVl)*Uk'v1'} =6kl,6w . 1-51 Demonstrate that J2\a'a"\*X\a'a"\ = ltr{X} a' ,a" and then prove the analog = tr{|«V||«V|} i
Problems 99 ~ ^2(UkV1)^ X UkVl = 1 tr {X} by, for example, applying Problem l-49b, or by considering X = UrVs■ 1-52 Use both versions of Problem 1-51, for n = 2, to show that i(X + cr. Xa) =ltr{X} . Check this for X = 1,(7¾. 1-53 Construct (q'\p'), apart from the numerical factor, by using 1-54 Convert JP'lpe-'w'l =p-p' into [»• e-ip'q] - l]Le-w'q i dq Check the generalization to ld/(«) [p,f(q)\ i dq by introducing the g Schrodinger description. Write the analog with q and p interchanged. 1-55 Arrange e-i/(9)eVpei/(9) in different ways to show that = ei[/(g + g')-/(g)]eVp where
100 1. Measurement Algebra Apply this to f'{q) = —(p'/q')q to recover familiar statements, and find new ones for f'(q) = nq'2/2. 1-56 Generalize the results of Problem 1-54 to arrive at d d Q-f{q,p)=i[p,f{<i,p)], ■^-f(q,p) = ^[f(q,p),o\ ■ Then show that the order of differentiation doesn't matter, iPlqf{q>P) = lqlpf^P) ■
2. Continuous q, p Degree of Freedom 2.1 Wave functions The completeness of the U or V vectors is expressed, for example, by the summation l = £)|ufc)(ufc|. (2-1-1) But, in introducing % = ke and proceeding to the limit e -> 0, where q'(= qu) becomes a continuous variable, the summation must be replaced by an integration. Accordingly we write (uk\ = ^(q'\ , \uk) = |g')v/e , (2.1.2) and 1 = Zy>cfa'l> with e = Qk+i-qk = Aq' , (2.1.3) and, in the limit, arrive at 1=/ \q')dq'(q'\, (2.1.4) i-oo which states the completeness of the q vectors. This symbolic statement implies the numerical one /oo (l\q'} dq' (q'\2) . (2.1.5) =Mq'Y =M4) In particular, for a single vector, /OO /"OO iPiq'T dq'iP(q') = dq'\iP(q')\2 -oo J — oo = 1 , (2.1.6) if it is a unit vector.
102 2. Continuous q, p Degree of Freedom We get to the same point from the probability interpretation of \(uk\ )\2 = e\(q'\ > |2-+d</!</,(</) I", (2.1.7) being the probability dP that a q measurement carried out on the system represented by the vector I ), or the wave function ij>(q'), shall yield a value in the range dq' about q'. [We owe this probability interpretation of Schrodinger wave functions to Born.] The total probability of any outcome must be unity: //"OO dP = dq'\iP(q')\2 = 1 . (2.1. J — oo The necessary existence of this integral requires that tp(q') -> 0 as \q'\ -> oo. For a physical system, truly infinite values of q' do not occur. The qp transformation function is produced by relabeling the uv transformation function (1.14.32), H««> = ^W)^= ^==^{kt){h) , (2.1.9) or (^) = 4=^, (2.1,10) What about (utlui) = <5;y? Dividing by e = -Je-Je we get {q'\q")=5^f1^6{q'-q"), (2.1.11) which introduces Dirac's delta function, where (e -> 0) ?7«", q'-q" /o: %W) = o, q< = q" ; q'-q" = 0: S(q' - q") = oo . (2.1.12) One more property of the delta function comes from ^2 (uk\ui)^(ui) = ip(uk) , ^eS(q'-q") f 5{q'-q")dq"^{q")=^{q') (2.1.13) for arbitrary ^(ut)- Inasmuch as S(q' — q") = 0 if q" / q', only the value of 4>(q") for q" = q' occurs in the integral, and we learn that /OO /"OO 6(q' - q") dq" = / dq' 8{q' - q") = 1 . (2.1.14) -OO J — OO
2.1 Wave functions 103 Let's see how all this fits together, ^2 {uk\vi)(vi\um} = (Uk\um) = 5km ~> £<5(</ - q") , ^(^W\p')^)(^(p'\q")^) /oo (q'\p')dp'(p'\q")=6(q'-q"), (2.1.15) -oo which is formally correct as a combination of f\p')dp'(p'\ = l and (q'\q") = 6{q' -q"), (2.1.16) which state the completeness of the p vectors and the orthonormality of the q vectors. But how does it work numerically? We have (q>\p>) = --L jM , (p'\q") = -L e-W (2.1.17) so that />oo 6(q' - q") = i" ^- eW " «") . (2.1.18) J-oo 27r But the integral from —oo to oo does not converge! i^oo 27r Jo ^ dp' d sm(p'(q'-q")) I Jo 7T dp' q' — q" 1 sm(P(q'- q")) = lim l , „ '> = ? . (2.1.19) P->oo 7T q' — g" Evidently the difficulty is that we have not made explicit the restriction that P = oo is not allowed. To see that this is only an apparent problem, let's restate what has been done in terms of the individual {q'\p') and (p'\q') transformation functions: (q'\)= [WW(p'\)i (p'\} = J(p'\q')dq'(q'\), (2.1.20) or /•OO J ' ^(q')= J e^'P^ip'), :„'„' ¢¢)=1 e^P-1^(q'). (2.1.21)
104 2. Continuous q, p Degree of Freedom If we eliminate t/>(p')> f°r example, we get f°° ■ i i <\r>' f°° ■ i a ¢{4) = e^P^ e-V « <tf W') (2.1.22) J -OO ^ J-oo which is Fourier's* integral theorem: 'any' function ip(q') can be written as a linear combination of the functions e15 p , —oo < p' < oo, with the indicated coefficients. Should we interchange the integration and integrate first overp', we come to the integral for 6(q' — q"). But what happens if we use it as is? For example, ^') = ^7le"^'2 (2-L23) is a normalized wave function since F d</ e~q'2 = sfi , (2.1.24) the basic Gaussian* - integral. Its Fourier transform ^)=/%^^) J-oo V^Ti" 1 f°° .ii i T1/4 J-oo 4,/ h e-w ^v')\-w- 71" J-oo 7TV4 is obviously also normalized. And then 1 1 /2 e"P (2.1.25) 1 ] r°° ■ i i i/2 7T1/4 sJllX J-oov v ' 1 in'2 = ^Jl^iq > (2-L26) which is back where we started, without the slightest difficulty. [More about such Gaussian Fourier integrals later, in particular in Sections 5.4 and 6.5.] It is clear that, for a physical ip(q'), the actual range of p' is quite finite, so no harm should be done in leaving P large but finite in (2.1.19). Accordingly, let's check that, in this sense, *Jean Baptiste Joseph Fourier (1768-1830) fKarl Friedrich Gauss (1777 1855)
2.1 Wave functions 105 ./-00 re q' ~q" Put q" = q'+X/P, dq" =dX/P: w)=r——^'+^/p) • (2-l28) ./-00 i" X We are content here to argue that, because of destructive interference (sin x) for large x, the relevant x values are limited by |x| ~ 1, and for sufficiently large P the replacement ip (q'+ ~ 1/P) -> ^{q') is permissible, a statement of continuity. Then ./-00 1" X = 1 as it should. More generally, the restriction to possibly large but finite values of p' is expressed by the more precise definition 6«-<r) = £%K&)&«-*') (2.1.30) e-> 0 where K(ep') -> 0 as \p'\ -> 00 for given e (i.e., lep'l -> 00) , (2.1.31) K(ep') -> 1 as e -> 0 for given p' (i. e., ep' -> 0) , and the eventual limit e -> 0 is reserved until all integrations are performed. The example just discussed is (0 for elp'l >l) K(tf) = { , c ,' } with c= 1/P. (2.1.32) II for e\p I < 1 J Another example is ff(cp') = e"elp'l (2.1.33) so that (e > 0, eventual limit e -> 0 understood) W " «") = f ^ e"*' [eW - 5") + e-W - Ol JO 27T L J = Reirdp'e-[-itf-?")b' *" ./0 1 1 = Re 7T e - %' - q") 1 e ,^ -7-; ^ ?• 2.1.34
106 2. Continuous q, p Degree of Freedom Indeed JY - q" / 0: 5{q' - q") -> 0 \ q' - q" = 0: 5{q' - q") -> oo and (substitute q' — q" = ex and x = tan $) /OO />00 -I <#*(<?'-<?") = / <V- — -oo ./-oo ^ (q> "V J, a2 I r^l2 ~ / ' ^ J-jr/2 (9' - <?") + as e -> 0 1 /"c 7T /-< (2.1.35) dx x2 + 1 Yet another, Now, K \ep' \ 1 as ep' -> 0 . (2.1.36) (2.1.37) oo j„/ ■„/1 -„/\ dp'sin(^ep')cb/fq/„q//) 2tt |ep' OO OO A^' nin f ^ <~vJ f°° dp' sin(iep') 1 2c 1 1 for q' -q" > -|el -lforg'-g"< -|ej " 0 for q' - q" > \e , 1 for -\t<q'-q"<\e 1 for q' - q" > \e -1 for q' -q" < \e 2^ ' (2.1.38) 0 for q' - q" < -\e , which is graphically represented by 6(q' - q") ] q' - q"
2.1 Wave functions 107 Evidently all conditions are satisfied as e -> 0. What we have here is the continuum version of the initial introduction of the delta function as the discrete (l/e)S(q',q"), which is 1/e for q' — q" = 0, and zero for \q' — q"\ = e,2e,... . Now notice this: (q'\q")=S(q'-q") q'^q" ■■ (q'\q")=0, states the orthogonality of the q eigenstates, BUT q' = q" ■■ (q'\q') = ™\ (2.1.39) (2.1.40) We have lost the normalization to unity. What we have here is a reminder that an exact value of q' is an overidealization; any measurement of q, with its continuous spectrum, will locate it within a certain range, which can be very small, but never zero. The simplest example of such states is fq' + h^q l',Aq') = dq" Jq'-hAq' 1 tIO or ^q',Aq'{q") 1 fl for \q" -q'\<\Aq' VW I 0 for \q"-q'\ > \Aq' ^here f d«"K',4,'(g")l =l, with a graphical representation of the integrand given by (2.1.41) (2.1.42) (2.1.43) \^AAq") I 1/Aq' q'-\Aq'l q' + \Aq' What is the p description of this state? We evaluate [this is, of course, essentially the inverse of the Fourier transform in (2.1.38)]
108 2. Continuous q, p Degree of Freedom -ipq ^q',Aq'{q") f°° , 1 ll>q',Aq'(p') = / (V'—==■ ./-oo V^T 1 fi'+hAi' ■ , // = -F=F= / dg" e-* « yJlisAq' Jq>^^Aq> • s\n(^p' Aq') 1 y/2nAq' -ip'q'! and get I2 = ^ ' 2tt |VV,4?'(p') which is graphically represented by }^q\Aq'{p')\2 < uniform for |p'| -C 1/Zig' sin{±p'Aq') -1 2 |p'Z\g' (2.1.44) (2.1.45) 2tt 'IbW The total probability is (substitute x = -p'Aq') f dp' |</y,zv(p')| 2 /1°° 1 -/ W<fc/ n Jo 2 sin(|p'Z\g') |p'Z\g' 1 . (2.1.46) = dx [(sinx)/x] The q spectrum - that is: the totality of eigenvalues - is sharply limited, whereas the p spectrum is quite diffuse: for values of \p'\ 3> 1/Aq' the probability density oscillates about (sin2 -> |, effectively) \lpq',Aq'(P')\ ' 2> 1 1 TiAq' p' 1 „/2 ■ (2.1.47) The decrease to zero as \p'\ -> 00 is sufficient that the total probability exists (and equals one), but the average of p' does not,
2.2 Expectation values and their spreads 109 /2 P It /OO dp'p'2|^</',/l<7'(P')|2 = OO (2.1.48) This unphysical behavior reflects the unnatural nature of the sharply limited q spectrum. It invites our attention to physical states for which both the q and p spectra are effectively bounded. 2.2 Expectation values and their spreads For this purpose we examine the expectation or average values of q and p in the physical state | ): <<?> = < M>> <p> = <H> and the mean square deviation from these averages: (Sq)2 = (\(q-(q)f\) = (1\1), (5p)2 = (\(p-(p)f\) = (2\2), where <1| = < \(q-(q)), |2>=(p-(p»|>. Now we consider (<5g)2(,5p)2 = (l|l)(2|2)>|(l|2)|2. (2.2.1) (2.2.2) (2.2.3) (2.2.4) The latter inequality is the unitary counterpart of the 3-dimensional Euclidean statement a • abb > (a • b) , or cos2 8 < 1. To prove it, think of the vector lv> = W-l'>§$ such that <2|1/2> = <2|1>-<2|2>||^=0. (2.2.6)
110 2. Continuous 5, p Degree of Freedom We know that (l/2|l/2) > 0 where the equality holds only if |l/2) = 0, that is, when |2) is a numerical multiple of |l); when they are parallel. Now, as a consequence of (2.2.6), (l/2|l/2) = (l|l/2) = (1|1) - l^L > 0 , (2.2.7) which is the inequality stated in (2.2.4). So (6q)2(6p)2>\(\(q-(q))(p-(p))\}\2 (2.2.8) where the equal sign applies only if (p — (p)) | ) is parallel to (q — (q)) | ), (P-<P»|>=M<?-(<Z»|>- (2.2.9) Now for the product of Hermitian operators A and B, one can use identity (1.2.25), AB = ±{A,B} + ±\[A,B], (2.2.10) which is a non-Hermitian combination of the two Hermitian operators {^4, B} and \[A, B], The diagonal matrix elements of a Hermitian operator - expectation values - are real. So, with A = q — (q) and B — p— (p), the absolute square occurring in (2.2.8) is the sum of the squared real and imaginary parts: (SqSp)2 > [( \±{q-(q),p-(p)}\ )}2 + h( \\{q,p]\ >f (2.2.11) >0 =1 or 5q8p>-~. (2.2.12) This is Heisenberg's Uncertainty Principle of 1927 (in Weyl's* form of 1928); we'll prefer to speak of Heisenberg's uncertainty relation. The equality holds only if (p-<P»|> = Ate-<g»|>, < I (P-<P» = A*< I (<?-<<?» (2.2.13) and 0 = ( \\{q~{q)^-{p)}\) = \(X + \*){5qf , (2.2.14) so that A = 17 with 7 real. Notice that (\(p-{p)f\)^l2(\(q-(q)f\), v v , V v , = (Spf ={6qf *Claus Hugo Hermann Weyl (1885-1955)
2.3 States of minimal uncertainty 111 and therefore T=sH(sr2*>*- (2-215) which anticipates that 7 must be positive. 2.3 States of minimal uncertainty To study the optimal situation, 5q5p = |, let's use (q'\ components: W\(p-(p))\) = W\(i-(i))\ ) (2-3-1) or (~-<P>)W) = i7(3'-te>W)- Therefore (2.3.2) dlogiP=^ = {i(p)-1(q'-(q))]d(q'-(q)) (2.3.3) and tP = C e1 <P> («' " <«» e" H (9' " ^)2 . (2.3.4) After determining the value of the integration constant C from the normalization of tp, 'P-7(g'-(g»2 /OO /"OO dq'\^\2 = \C\2 / dg'e -OO J — OO |2 /^ ,~,2 = |C|2W-= |C|2\/2^<5g, (2.3.5) (Note that 7 > 0 is essential here.) we arrive at ^0 = (2jr)-i-i=ei<P>(«'-<«>)e"'4SF"e^><«>, (2.3.6) Vdq where the phase constant is introduced in anticipation of a symmetry property. The substitution q -> p, p -> — q gives us directly 1 (p'-(p>)2 ^(p') = (2jt)-4 -i= e1^) (P' " <P» e W e"* <P> <«> , (2.3.7) V<5p which should also emerge as
112 2. Continuous q, p Degree of Freedom dq' 1>iP "■/ -ipV -ip </>(</) '(«) /■ip-i(p'-(p))(«'-<9'» / 2tt 1 («'-(9>)2 x (27T)"i-p=e" *(*«>a e2<P)(«) (2.3.8) or, after completing a square, ^) = e-i (?) (p' - (p» e^ (p) (q) 1 / d«' _-17^ b' - («) + 2i (p' - (p» (¾)2]2 ■/■ : e 4(^9)^ VSq J V2tt x (27T)"ie"(59)2(p"^2 = (2tt)" i y^ e"1 <g> (P' - <P» ,e- W <? ~ <P»\ e~ * <P> <?> ■ (2.3.9) Now we can see the precise sense in which wave functions such as (q' \p") — (2tt)" 2 JQ V and (j\p») = 6(p' -p") are idealizations. First, let's write ¢(01) = ^Ae^{p)(q)(rismf Sq 1 e^'P" with p" = (p> , where ^(p') = [(2tt) i 7^ e" 2 <P> ^ e"1 <«> (p' " p")] <J (p' - p", <5g) , (2.3.10) tt"! - (p'r/',>2 <J(p'-p"» 2<5p 4(<Sp>2 (2.3.11) obeys (substitute p' — p" = 2x<5p) /OO 1 /"OO „ dp' <5 (p' - p", 6p)= -= dx e~x = 1 . (2.3.12) -oo V71" ./-00 For Sp -> 0, one has S (p' - p", <5p) -> <5 (p' - p") and e"1 <«) (P - P ) -> 1 follows. Also in that limit, Sq = l/Sp -> oo, and then e~(l' ~ (l)f/(26<l)2 -> 1. This leaves, to within a common (arbitrarily small) factor, just the wave functions (q'\p") and (p'|p"), respectively. More practically, suppose Sp is sufficiently small that (q) Sp <C 1, or [since l/(2«Jp) = Sq]
2.3 States of minimal uncertainty 113 (q) < Sq , and that the excursions of q' about (q) are limited by W ~(q)\<Sq. (2.3.13) (2.3.14) This situation is essentially indiscernible from Sp -> 0, 5q -> oo. Yet the value of S (p' — p", Sp) for p' —p" — 0 is not infinite (~ l/Sp) and the integral of |?/>(p')| is one, in contrast to that of [S (p' — p")] . And the Gaussian function of q' drops below unity for \q' — (q) | ~ Sq, resulting in unit value for the integral of |?/>(</') I' , in contrast to that of |(g'|p")| , which is infinite. Let's return to the characterization of the state of minimal uncertainty, SqSp = |, namely (p-(p))\) = ifq(l-(l))\} or, equivalently, Sq {q)+iP- <P> Sp ) = o Notice that (q-(q)\' (P-(P)\ \V2Sp ) and (q) p-(p) 1 1 2SqSp i i L V2Sq ' V2Sp So there is no loss in generality on redefining q and p q-(q) , P~ (p) y^2Sq V2Sp so that (p) o. o (Sqf (Sp)2 P 1 2 1 2 (2.3.15) (2.3.16) (2.3.17) (2.3.18) (2.3.19) (2.3.20) and also \[q,p] — 1 hold for the redefined variables, and | ) is characterized by V2 (q + ip)\)=0. (2.3.21)
114 2. Continuous q, p Degree of Freedom So | ) appears as an eigenvector of the non-Hermitian operator y, with eigenvalue 0. Henceforth we write |0) for this state. But we can also regard |0) as an eigenvector of the Hermitian operator y^y, with the eigenvalue zero. Certainly multiplication on the left by the operator y] = ^{q-ip) (2.3.22) gives 2/^0) = 0. (2.3.23) And this is equivalent to the original statement, since the consequence 0 = <0|s/ts/|0> = (l/|0»t (s/|0» (2.3.24) says that the vector j/|0) is of zero length: j/|0) = 0. The Hermitian operator introduced here is I yfy = 2(«-ip)(« + ip) = |(<?2+p2-t[<7,p]) |(/ +/-1) . (2.3.25) Notice that yyf = 5(g + ip)(«-ip) = |(/ +p'2 + yta,/ 2 (/+/ + 1) , (2.3.26) so vv*-y*v=[v,v*] = i. (2.3.27) As a reminder, for this state ip0(q') = n-*e-%q' , ip0{p') = n~* e~&'2 , (2.3.28) which are (2.3.6) and (2.3.7) for (q) = (p) = 0 and Sq = Sp= l/%/2. 2.4 States of stationary uncertainty The minimum value of 2SqSp = 1 is attained for the unique state |0). Are there other states for which 25q5p is neither a maximum nor a minimum, but is stationary for small changes of the state?
2.4 States of stationary uncertainty 115 2SqSp If we again use the permissible simplification (q) = (p) = 0, we have (SqSpf = (\q2\)(\p2\); <|) = 1. Now consider a small change, Z\( | and A\ ), leading to (a) Z\(«Mp)2 = (A( I) [g2(«5p)2 + p2(<5g)2] | ) +(|[g2(<5p)2+pW](z\| ))=0 where (b) A{ \ ) = (z\( |) | ) + ( | (z\| )) = (2.4.1) 0. (2.4.2) At this point we adopt (5q)2 = (Sp)2, another permissible simplification. Then (a) is satisfied in consequence of (b) if «L±£|> = A|>, <|^^ = <|A. (2.4.3) We verify later that (q2) = (p2) follows from this. Let TV = y*y so that [cf. (2.3.25)] 2 2 Then, with A = n + |, the eigenvector characterization of | ) reads N\ ) = | )n, ( \N = n( | . (2.4.4) (2.4.5) We know that n = 0 is an eigenvalue. What are the others? First observe that yN-Ny= (yy* - y^y) y = y, (2.4.6)
116 2. Continuous q,p Degree of Freedom an immediate consequence of the commutation relation (2.3.27), or y(N -l)=Ny, yN = (N + l)j/, (2.4.7) of which the adjoints are {N - l)j/t =y*N, Nyt = j/t (N + 1) . (2.4.8) What these equations say is that if n is an eigenvalue, then so are n + 1 and n — 1, in general. Thus Ny\n) = y{N - l)|n) = y\n){n - 1) (2.4.9) and (n\yN = (n\{N + l)y = (n + l)(n\y . (2.4.10) So y\n) = \n - l)Cn , (n\y = Cn+l{n + l| , (2.4.11) where the relationship of the coefficients follows from (n - l\y\n) = Cn , (n\y\n + l) = Cn+1 . (2.4.12) Then we have \Cn\ ={n\y^y\n)—n so that Cn = \/n , (2.4.13) = N if we agree on Cn > 0, and therefore y\n) = \n - l)Vn , (n~l\y = ^fn{n\ (2.4.14) and (n|2/f = ^/n(n - l|, yf \n - l) = |n)v/n • (2.4.15) From the fact that n=(n\y*y\n)= (y|">)t (»|n>) >0 (2.4.16) we conclude there must be a non-negative least value no of n, which must be such that y\rio) = 0. Of course, from the above equations, it follows that no = 0; the eigenvalue spectrum of TV is n = 0,1, 2,... .
2.4 States of stationary uncertainty 117 The observations that (n|j/|n) = \/n + l(n + l|n) = 0 (n\y2\n) = (n\yy\n) = Vn+T{n + l\n - 1)^ = 0, (2.4.17) where 1 lf = ^ + *)a = ^+i^, (2.4.18) tell us that (q)n = 0, (P)n=0, ((qp + pq)) n = 0 , {(q2 - P2)) n = 0 , (2.4.19) which imply (Sqfn = (Sp)l , (2.4.20) as promised at (2.4.3). From the eigenvalue equation W2 +P2)\n) = \n)(n+ |) (2.4.21) we get (|{q2 +P2))n = (q2) n = (Sq)l = (5qSp)n = n + \ , (2.4.22) which gives all the stationary values of the uncertainty product 5q5p. The eigenvectors for eigenvalue n are constructed as (n\ = (n - 1\4= = /„ _ 2|—L=-^ = • • • = (Ol^L (2.4.23) and In) = ^4-10). (2.4.24) Vn! These symbolic constructions are realized numerically by the wave functions 1 / rl \ U (2»n!)-3^__J (,'|0) -1 / d\" i i„'2 (2nn!) 5/g'___j ,r-4e"29 (2.4.25)
118 2. Continuous q,p Degree of Freedom and ^(P') = (p'\n) = (p'\^\0) (_i)»(2»n!)— ^p'-—J <p'|0> 1 / d \™ 1 1 '2 = (-i)n (2n n!)~2 IP' - T7 ) *"" e_fP • (2.4.26) These wave functions should be connected by MP') = J <V —j=r- 1>n{q'). (2.4.27) Note that / <tf e-W (V - A) ... = _i (y _ |.) J dq< e-W • • • ; (2.4.28) the n-fold repetition of the transformation, combined with the truth of this relation for n = 0 produces the general verification of (2.4.27). 2.5 Hermite polynomials Now observe that ^£=^(-^)--^ (2-5I) from which follows and ^)^(-^-^ 7r~"4 l /2 / (i \ n /2 Define the nth Hermite polynomial Hn(g') by d \n /2 /2 -—J e"« = H„(g')e~g , (2.5.4) giving
2.5 Hermite polynomials 119 _i Mq')^-ILl^Rn{q')e-'12l'2 (2.5.5) \/ L nn\ and _i MP') = ("i)n4=Hn(p')e"^'2 • (2.5.6) V2"n! The first few polynomials are Ho(g') = l, E3(q') = (2q')3-Q(2q'), H, (q') = 2q' , H4 (q1) = (2q')4 - 12(2g')2 + 12 , H2(g') = (2g')2 - 2 , H5(g') = (2q'f - 20(2g')3 + 60(2g') , (2.5.7) the highest power being (2q')n, generally, and H„(-g') = (-l)"H„(<z')- (2.5.8) Note the differential recurrence relation ~d7 (Hn(9'}e"9'2) = Hn+1 {q'}e"9'2 (2'5-9) or H„+i (<?') = (--^ + 2q'^j Rn(q') (2.5.10) from which all polynomials can be constructed successively. Another differential recurrence relation is found by differentiation: ^(H„(,0e-'2)=(-ir(A)n(-V)e-'2 -2g'Hn(g')e"9'2 +2nHn_1(g')e"9'2 (2.5.11) or d d-Rn(q') = 2nRn^(q'). (2.5.12) The two relations are combined in d ( d dq> V dq> + 2<1' I Eniq,) = 2{H + mn{q,) (2'5'13) which is the differential equation ^-2g'^7+2n)Hn(g') = 0. (2.5.14)
120 2. Continuous q, p Degree of Freedom Of course, this must be a consequence of the eigenvector equation (2.4.21) as represented by \(v''2-^a)M<l') = (n + \)M<l')- (2.5.15) i„'2 Indeed the replacement of ipn(Q') by e 2 9 Hn(g') gives, according to (2.5.16) just the previous equation (2.5.14). Here's another way of getting at the structure of the polynomials n Hn(g') = 2nn («'-%), (2.5.17) fc=i where the % are the n roots of the polynomial of degree n. Write AH„(g') = 0n(g')H„ (<?'). (2.5.18) so that Then, since M*) = ^8^(^ = £^- (2'5-19) ^Hn(g') = (JjMJ) + lMl')f) Hn(g') (2.5.20) we get the first-order non-linear differential equation ^Ml') + [Mq')f - 2q'Mq') + 2n = 0 . (2.5.21) The insertion of the general form of <j>n gives ~ Y, , , 1 V2 + Y, -7^ rl--2V^-+2n = 0. (2.5.22) The identities y^--n = yg,',(g,'gfe)=y:-^- (2.5.2¾
2.5 Hermite polynomials 121 and fjq'-qkq'-qi ^ {q'-%)2 ^«'-«h'-qi eLj l \ l M x<7'~% q' -qi/ Qk -qi 2Y.-r—Y.-^— (2'5'24) u q ~^i^k)qk-qi then lead to % = Y —z~ • (2'5'25) /(^)qk ~qi Two general results that follow immediately are £<?* = £—T-=° (2.5-26) M< * ~ Ql and Yq^Y.^^1^^"^ =V--l>; (2.5.27) the first is a consequence of the symmetry property (2.5.8): if % is a root, so is -qk. Let n be even, n = 2r, so there are r distinct roots in magnitude, both signs occurring. Then + Y + -- (2.5.28) ,, ,,, ,qk-qi ujrf , % + % 2% l(itk)=\ l(z/tk)=\ or 1 = 2 V -2-^--5+i for ^ = 1,2,..., r (2.5.29) from which follows k=\qk r =-^^- ^5'3°) The first non-trivial examples are: n = 2, r = 1, where qf = | is immediate and gives H2 = 22(q'2 - q'2) = (2q')'2 - 2; n = 4, r = 2, where
122 2. Continuous q, p Degree of Freedom 1 1 ..1 1 -q\-qy 2q( ' q\-4 2q\ so that 1 1 (2.5.31) and 2=2? + 2^ °r ^+^ = 4^¾2 (2.5.32) h-h or (92-922)2 = 8g2g22. (2.5.33) q\ - q\ 2¾2 2qx This gives q\ and q\ as |(1 + x) where (x — 1)2 = 8x, that is ii = 1-)/1- «i = § + >/§• (2'5'34) So = 24 (V4 - 3g'2 + |) = (2</)4 - 12(2g')2 + 12 ■ (2.5.35) If n is odd, n = 2r + 1, there are r pairs of opposite signed roots, and zero. Now qk= JT —L_ + - + JT —L_ + J- (2.5.36) and With n = 3, r = 1 we have g2 = § and H3 = 23</ (</ - |) = (2</)3 - 6(2</). For n = 5, r = 2, 1 = 2^—^ + o^' 1 = -ta + oi. (2-5-38) 1 3 1 _ 1 3 1 -g2^f + 2^' 1~~2qni + 2£ so that here 3 (ft2 + 922) = 4¾2¾2 , 3 (qi - qi)2 = 8¾¾ , (2.5.39) which gives q\ and gf as |(1 + x) where (x — 1)2 = |x, <?? = I - \fl - <?2 = § + \/f > (2.5.40)
2.6 Completeness of stationary-uncertainty states 123 and finally H5 - 2 9 I « -2 + V2J15 ~ 2 _ V 2 = (2</)5 - 20(2</)3 + 60(2</) . (2.5.41) All polynomials in (2.5.7) are thus written in the form (2.5.17). 2.6 Completeness of stationary-uncertainty states How do we know that the vectors In), (nl, are complete, that there are no other eigenvalues of N than n = 0,1, 2,... ? That is asking whether oo J2\nn\ = l. (2.6.1) n=0 To answer, let's begin by exhibiting the \nn\ as algebraic functions of N. The symbol 1001 should have the value 1 if A^ is 0, and zero otherwise. Notice that a function of the number n — 0,1,2,... that has this behavior is ^-^0^:::5,2,... ^ Accordingly, replacing n by N and using the binomial expansion: Similarly, and So 2 ^-^-MJS^w..., ^-4) ^-^-^={J£;=M,3.... <2-6-5> 00 1 /Vl fc=o 2kl(N-2-k)l \nn\=Y(-lf : (2.6.6) 1 ' ^-^v ; nl kUN -n-k)l y ' k=0 v
124 2. Continuous q,p Degree of Freedom Now try 1 /V! ^ (N-k)\ M =l^y-1) fci(JvTIJ!^n!(iV-fc-n)! n=0 fc=0 v ' n=0 v ' = (1 + 1)' k\(N-k)V E ^(-^2^ = (2 - 1)- = 1- = 1 , (2.6.7) fc=0 all right. How is this related to the operator constructions of (2.4.23) and (2.4.24), where (!&o><o|J^=Vv.i mm = ^=-10)(014= = ->(v*)nWvn (2-6- and EH = E ^)100^ s exp (^ l00M? (2-6-9) n=0 n=0 The notation is intended to indicate the insertion of 1001 between the powers of yt, on the left, and y, on the right. It helps to notice that y^y = TV, and (j/t) V = V*Ny = yh(N -1)= N(N - 1) (2.6.10) followed by (j/t) V = y*N(N - 1)2/ = j/tiVj/(iV - 2) = ^(/V - 1)(/V - 2) = iV(iV-l)(iV-2); (2.6.11) in general <tft)V=(tf^bJ!- (2-6-12) Then we can write looi = V(-D* — = Y(-i)k{y ) y 11 ^[ ' k\(N-k)\ ^[ ' k\ = exp (-j/t; J/), (2.6.13) a basic ordered exponential function. And now oo ^|nn| = exp(j/f;exp (-yu,y) ;yj = l (2.6.14) n=0
2.7 Eigenvectors of non-Hermitian operators 125 because the two ordered exponentials are inverses of each other. That would be obvious if we did not have to worry about the non-commutativity of y^ and y, but in fact, since all j/t's stand on the left, and all j/'s on the right, that non-commutativity never enters. This is illustrated by the first terms expMjexp (-y*;y) ;y) = e~~v ;v +y^ e~~v >vy + -y* e~~v >vy2 -\ = (1-2/^+2^^^ + --) + yHl~yfy + ---)y + -3/^(1 + • • • )v2 + • • • = 1 . (2.6.15) 2.7 Eigenvectors of non-Hermitian operators Nevertheless, things would be more immediately apparent if we could replace the operators by numbers, by introducing eigenvectors and eigenvalues of y and j/t: y\y') = \y')y', (/^ = /(/( (2.7.1) for then </|(El""l)b'> = <J/t'|eI'tVe-»tV|j/') n = (/|l|2/'). (2.7.2) We know that there are zero eigenvalues, 2/(0) = 0, (0|j/t = 0. (2.7.3) So we have only to change the eigenvalues: 0 -> y', 0 -> j/t . That sounds familiar [we use the adjoints of the relations stated in (1.16.19)]: \q'+q") = e~-[l"P\q'), (p'+p"\ = (p'\ e^"* , (2.7.4) illustrating operators that change the eigenvalues of q and p. These relations are consequences of [q,p] = i. But [?/,«/*] = i. suggesting an analogy: q -> y, p -> ij/t. Accordingly we expect q' -> 0 , q" -> y', p -> ij/t: |j/') = e»V |o) , ' , ', (2.7.5) p' -> 0 , p" -> iyt , <? -> y: (y* \= (0\eV V . Unlike |g'), (p'l, these vectors are in adjoint relation.
126 2. Continuous q, p Degree of Freedom First, notice that the adjoint of y\y'} = \y')y' (2.7.6) is |2/')V=2/'*|2/')t> (2-7.7) so |2/')t = (/| « / = y'*- (2-7.8) Clearly the eigenvalues of non-Hermitian y = 2~ 2 (q + ip) are complex numbers. Even if we thought they were real, we would see that y = emy, 2/t = e~~lay^, [y,tf] = 1, which are also a perfectly good pair of non- Hermitian operators, have complex eigenvalues: y' — elay'. So it follows that (e»V 10)^ = (01^ ^ / = 2/'*- (2-7.9) Now let's verify what the analogy suggests: y \y<) =yeyfy'\0} = e»V ( e~V^ y e»V ) |0). (2.7.10) Then consider A fe-yfy'yeyfy'^ = e-v^y (tf^v') + U-^v'{-y^))y^v = e-yfy'{y,yt}eyfy' = 1, (2.7.11) = 1 so e-yfy'yeyfy' =y' + y (2.7.12) which incorporates the obvious integration constant. Then, since e»V(y'+y)|0) = e»V |0>j/' = |j/'>J/' , (2.7.13) I 0 we have checked that y\y') = \y')y'■ The eigenvector is 1 ' *-. -Ja\ %/n! ' ' ^.1 ' %/nT n=0 v n=0 v
2.7 Eigenvectors of non-Hermitian operators 127 which employs (2.4.24), and taking the adjoint gives (y'* -> y* ) n=0 v from which we get Apart from the numerical factor, this is the analog of ^^e-W. (2.7.17) Notice the special examples (0\y') = l, </|0> = l. (2.7.18) The construction of the y, j/t eigenvectors in terms of n states is conveyed by the components (n\y') = &¥. , </|n> = i^> . (2.7.19) Evidently we have arrived at y', and j/t , wave functions that represent the abstract n vectors of (2.4.23) and (2.4.24). Indeed, from the latter we get (/|n) = (/|^|0) = ^i(/|0) . (2.7.20) n\ yn\ = 1 And we can understand the statement (j/t |0) = 1 as the numerical realization of a vector property, namely 2/(0)=0. (2.7.21) First, notice that ^7(/1 = ^7(0|e^ = (0|e^S = W\y, (2.7.22) the analog of i±(p>\ = (p>\q. (2.7.23)
128 2. Continuous q, p Degree of Freedom Then A(/|0) = (/|2/|0)=0 (2.7.24) and (j/t'|0) = (0|0) = l. (2.7.25) Having recognized these wave functions, note that n=0 Vn- Vn- n=0 repeats the properties of completeness for the n states. What is the q' description of the \y') vectors, that is: what is {q'\y')^ We construct differential operators according to ^W\ = W\n ^,\y') = y'\y') (2.7.27) the latter being the adjoint of the earlier statement about (j/t I in (2.7.22) or the analog of {d/dq')\q') = —ip\q'). So j%W) = U\W)=U\{^V-«)\lt) = (/2y' -q')(q'\y') (2.7.28) and = (V2q'-y')(q'\y'}, (2.7.29) leading to (q'\y') = Ce~y +^l'y' - \v'2 . (2.7.30) The complex conjugate of this is (/J,/) = C* e"K2 + ^l'yf' ~ Wf . (2.7.31) We can determine the integration constant C in several ways. First, put y' = 0, which must give us the known wave function ^0(g') =7r"ie~29' , so that C = tt"4 . (2.7.32)
2.7 Eigenvectors of non-Hermitian operators 129 Or, we can use the completeness of the q' states: = \c\2 [dq'e-i'2 + ^i'(yf' + y") - ^yf'2 + y"2) = \C\2Jdq>e^' - (y''+v")/V2f eyV ) (2.7.33) _I so that C = 7T 4 again (to within a here undetermined phase). As an application of the transformation function (q'\y') that touches a number of bases, introduce the n states via (q'\y') = J2(q'\n)(n\y') so that n=0 1 1 /2 . /K I I 1/2 7T 4 e 2^ T,Mi') ,Ay'Y n=0 (2.7.34) (2.7.35) This appears as a generating function for the jpn, which are selected by the power of y'. Think of this as a Taylor* series expansion and identify , t >\ 1 -1 ( d ipniq)- -7=t 4 — Vn! \d2/ e 2 d I(y'-V2^2 ^2 es^ »' = 0 1 _I I„'2 V^! Vd(-\/2g'). 7T"4 1/2 / d\" .J! :e2y I --^ ) e %/2nn! d</ n 4 , i/2 (2.7.36) which agrees with the previous result (2.5.5), indeed. To say more about the physical interpretation of the states \y'), (j/t |, we must notice that, with the exception of |0), these vectors are of finite, but not unit length, t' /* yT =y Thus a normalized vector is (yl'\y')=e\y'\2>l (2.7.37) * Brook Taylor (1685-1731)
130 2. Continuous q, p Degree of Freedom — II 'I2 \q',p') = \y')e 2 W I with 1 V = ^=(<z' + y/) (2.7.38) (frequently called a coherent state, a term coined by Glauber*) and an expectation value in this state is computed as ,/12 (X)q,tP, = e-\v'\ </|X|2/>. Since (y — y') \y') = 0, we have (y)q',P'=y' or (q)(l,tpl=q' and (p) q,y = p' , and from <(2/-2/')2)«',P'=0, ((^-^),^=0, ((y*-yt')(y-y'))q>,p>=o, we get (2.7.39) (2.7.40) (Sq)2 = (Sp) 2 _ 1 and SqSp= I (2.7.41) (2.7.42) The y eigenvectors are all minimal-uncertainty states, which we have already discussed in Section 2.3. To see the connection we return to our earlier results, with just the simplifications 5q = 5p (= l/\/2): Sp 7=- = 1: Sq (p-(p))\) = i(q-(q))\) or q + ip (q)+i (p) ) = 0 \/2 %/2 which is (y-y')\ )= 0, with q' = (q), p' = (p). Also [cf. (2.3.6)] as compared with 1 •-""" -g')e"l(9""9')2eigV (2.7.43) (2.7.44) (2.7.45) (q"\q',p') = W'\y')e^y'^ ^-i e_ ig"2 + q"(q'+ip') - I(g' +ip')'2 - \{q'2 +p'2) n-Up'tf'-*)etfp'e-W ""2 )2. (2.7.46) they are the same. *Roy Jay Glauber (b. 1925)
2.7 Eigenvectors of non-Hermitian operators 131 Given vectors like the \q',p') and (q',p'\ we naturally ask about completeness and orthogonality. Let's look first at (1"\1',P')(<1\P'W") = *-* eW -«"'> e~W ~~l'f e-W" "9')2 (2.7.47) where y |„' „'\^<V .r/\n"'\ = 7r~~h(n" - n"M e"^" ~ ?')2 and then / (9"k'^')^(9'.P'k'") = ^^(9"-9'")e"(9 ~~q> (2.7.48) / <^V>^<9Vk'''> = <%" - 9'") = <«"|1|«'"> (2.7.49) which is a completeness statement in appearance: 1 = f\q',p')^(q',p'\. (2.7.50) Indeed, as learned in Problem 2-29, there are many different completeness statements for the \q',p') vectors: they are over-complete. But, now consider (q',p'\q",p") = e~~^y'f (yl'\y") e-Wf ea*r ^')e-|(9W)V^'-p")\ (2.7.51) = exp(yf y") j ii i„n\ 11„i This equals one for q',p' = q",p", but it is not zero for q',p' / q",p"■ It only becomes small for appreciable values of \q' ~ q"\ or \p' —p"\. But that is as it should be; the state (q',p'\ describes a situation in which the average values of q and p measurements are q' and p' , but the individual results have spreads of Sq — Sp = \j\pl. A change of q' or p' that is small on this scale is not a significantly different state. A change that is large, resulting in very small values of one or both Gaussian functions, produces a different state. The completeness expression suggests that one can approximately associate a state with each range Aq', Ap' such that Aq1An' -2-Z- = 1 . (2.7.52) It is not necessary to identify individual states if we ask for probabilities in a range Aq'Ap'/2n 3> 1. It is in this sense that we assert, from [tp(q',p') = (q',p'\ ) for unit vector | )] 1 = /^I^V)|2, (2.7.53)
132 2. Continuous q, p Degree of Freedom that dg'dp', , , |2 -z—\il>{q,p)\ in (2.7.54) is the probability that q and p measurements of minimal uncertainty will yield values q',p' in the interval dq', dp' . 2.8 Classical limit As an example, consider ipn(q',p') = (q',p'\n) = e'2lV I (yt |n) 11 'i2 (2.8.1) tV aATT and \M4M = (»r)7 n! I 2 / "..I nl with 77 /2 . /2 <7 +F (2.8.2) We also have (polar coordinates) q' = sjfr] cos p' = \/2t7 sin dg'dp' _ d0 2tt ~ 2tt dry -> dry , (2.8.3) so one quantum state corresponds approximately to Ar\ = 1 (more about this in a moment). As it should, /^!*•"•"!' = / ^--- /*oo n Also notice that El^>')|2 = E^ =^ = 1, (2.8.4) (2.8.5) n=0 n=0 which could have been anticipated:
2.8 Classical limit 133 oo n=0 n=0 = (^,^1^,^) = 1, (2.8.6) where the completeness of the \nn\ measurement symbols and the normalization of the \q',p') vectors enter. For a given n the maximum value of \tpn\ occurs at r\ such that ^log|^„|2 = -^[nbg„-„] = ^-1 = 0, (2.8.7) or, at r\ = n. The value at the maximum is According to Stirling's* asymptotic expansion for large n, n! ~ V2^inne"n (1 + ^- + • • • J , (2.8.9) [the displayed terms give l! =* 0.9990, 2! ^ 2 x 0.9995, 3! =* 6 x 0.9997] and l^nl2^ J— * (2.8.10) V 27rn 1 + i2^ at the maxiimim. The average values of r) and rj2 are , . f°° A ^ + 1 _„ (n + 1)! fa = / (fa -i-y- e ^ = ^ r^- = n + 1 , ^\ = rdritlle-V = t+31 = (n + 2)(n + l) I J0 n\ n\ = fa)2+fa), (2.8.11) giving the dispersion (Sri)'2 = (fa - fa))2) = (??2) - fa) 2 = fa) (2.8.12) or Sri = Tfa) . (2.8.13) So, for very large n, the relative dispersion of r\ approaches zero, *L = 1 =_1 /s Vv^for n » 1 , M \/fa) v/nTI I -> 0 for n -> oo. "James Stirling (1692-1770)
134 2. Continuous q, p Degree of Freedom To see what the probability density does, we put r\ = n(l + v), and use the leading approximation of (2.8.9): enlog [n(l + v)\ — n— nu l^nl2^ — V2nnnn e n ■*- n log (l+i/) — nv \l2ixn e"l-2 '2-KTl (2.8.15) because log (1 + j/) = j/ — \v'2 for |j/| <C 1 , which is a consistent approximation. Now, jdV\^n\2 <* J°° dvy^e->2 = 1 (2.8.16) and, since n—>oo -iw2 |oofori/ = 0 lim W^e""" ^^r1"-"' (2.8.17) • ■-- »' 27T (0 for j/ / 0 , v ; we see that lim dr]\ipn(q',p')\2 ^dv8{p) . (2.8.18) n->oo The exact asymptotic correspondence of rj = \(q' + p' ) with n identifies this as the classical limit, in which the non-commutativity of q and p, and the equivalent uncertainty 5q5p = |, is unobservable. Now let's return to (2.8.6) and proceed to integrate over r\ from 0 to rj: 00 .fj Y\ / dV\<pn(q',p')\2=li. (2.8.19) n=0J° Let 77 2> 1. For any n -C rj, the integral covers the entire \i/)n\ distribution and produces a unit contribution. For large values n, comparable with rj, we can write, approximately, dn\il;n\2 ^dv8{v) = dr18{r1-n), (2.8.20) so those values of n < rj make a unit contribution, whereas those greater than rj do not contribute. The result is to identify asymptotically the count of the number of states having n <rj with rj. It is in this sense that increasing rj by one, Ar] = 1, adds one additional state.
2.9 More about stationary-uncertainty states 135 2.9 More about stationary-uncertainty states We first found the spectrum of N = y^y, n = 0,1,2 ... and then we constructed the eigenvectors. Here is a method that gives the eigenvalues and eigenvectors (expressed by wave functions) together. Consider (v*'\e~PN\v") with/?>0, (2.9.1) which we proceed to differentiate with respect to /3: -~{y''\e-pN\y") = {y''Wy^N\y")- (2.9.2) Now (see Problem 2-32) yf(N) = f(N + l)y, (2.9.3) so ye-PN=e-PNe-Py {2QA) and </\y*V e^N \y"} = <j/t V e"^ e~Py \y") = yl'e-Py"W\e-PN\y"). (2.9.5) Now the differentiated equation is (d/3 -> d/3) ^¾ = - W e" V = d[/ e-Py><} (2.9.6) or W\e-PN\y")=ye~PS'; (2.9.7) the constant of integration has already been picked to satisfy (v*'\e-pN\y") -+ {V]'\y") = eyf'V" for /3 -+ 0. (2.9.8) Now if we introduce the eigenvalues of TV (temporarily called N') and the eigenvectors, we get 1^7 fri, Vn! Vn! from which we learn, once more, that N' = n = 0,l,... , (2.9.10) and <,'»-&£. WO = fiff- (2.9.11) n! ' ' \/n\
136 2. Continuous q,p Degree of Freedom Problems 2-1 It has been stated, at (2.1.28) and (2.1.29), that lim r^sj^x Check this by explicitly evaluating the integral for the example where k is a constant, say positive. 2-2 Show that, for any e, /oo dq'6(q'-q") = l -OO for 5{q'-q") of the form (2.1.30), provided that K(ep') obeys (2.1.31). Check this explicitly for the example K(x) = 1/(1 + x2). 2-3 Evaluate r Veip'(g'-g")e-|(ep')2 7-oo 2?T and verify that it has the delta function properties in the limit e -> 0. 2-4 Differentiate x6{x) = 0 to establish xJx6(yx) = -*(*) ' Find an analogous expression for x2(d/dx)2S(x). 2-5 In a similar way, proceed from f{x)S{x -a) = f(a)6{x - a) to show •^dx^X ~ ^ = -^cT^ ~ a) ~ d^(a^(x ~ ^ and the corresponding expression for f(x)(d/dx)2S(x — a).
Problems 137 2-6 Begin with (1.14.46), that is tr{/([/,F)} =-£/(«>'), v u' ,v' with the semicolon indicating the UV ordering of (1.14.43), and arrive at 2-7a Consider the normalized matrix elements f( , ,, (q'\f(g,p)\p') f( , ,, {p'\f(q,p)\q') (q'\p') {p'\q') and show that f(Q,p) = h(q;p) = h{p;q) ■ Given f,(q',p'), what is /2(p',g')? 2-7b Find the qp ordered form of f(q,p) = pq2p by using [q,p] = i, and also by evaluating fi(q',p'). Repeat for the pq ordered form. 2-8a Evaluate U(q',p')= e^q e'^e"^' in various ways to arrive at U(q',p') = jtfp-P'9) = etfPe-tfQeffl = e-'^9 ^P e~^p' 2-8b Proceed from (cf. Problem 1-51) hr{(UkV1)* Uk'vl'} = 6kk,6lv with k,I, k', V = 0, ±1,±2,... , ±\{v - 1) and derive tr {U(q',PyU(q",p")} = 2n8{q' - q")S{p'-p") . 2-8c Use Problem 2-6 to prove that tv{U(q',p')} = 2nS(q')S(p'). Can you produce the result of Problem 2-8b from this?
138 2. Continuous q, p Degree of Freedom 2-9a Verify qU(q',p') + U(q',p')q = 2i-^U(q',p') , and find the analogous statement for pU + Up. Use them to show that the reflection operator is such that Rq — —qR , Rp — —pR . 2-9b Show that R is both Hermitian and unitary. What, then, are the eigenvalues of R'! 2-9c Use Problems 2-7 to find the qp ordered form of R. Can you then get the pq ordered version immediately? 2-10a Use the Taylor series expansion of e~"^Me*-" to prove that e-BAeB = A+[A,B]+±[[A,B],B]+'- . Note how this simplifies if [A, £?] commutes with B. 2-10b Recognize in the results of Problem 2-8a a realization of the following statements, which apply when [A, B] commutes with A and B : eA + B = eAeBe-±[A,B) = eBeAe±[A,B] 2-10c Provide a direct proof of this statement by starting with *e\(A + B) = {A + B)e\(A + B) and then writing, for example, e^' +-°) = e*BX(\). 2-11 Extend Problems 2-10b and 2-10c to the situation where [A,B] commutes with B and [[^4,jB],^4] commutes with A. 2-12a As an alternative to the symmetrical v -> oo limit of Sections 1.16 and 2.1, reconsider
Problems 139 ^2(uk\vm){vm\ui) = Ski m and define 27rfc ,, 2ttI 9= , <t> = , v v with k,l = 0,±1,±2,... ,±\{v- 1) . Show that in the limit v -> oo one then gets oo J2 ^eim^-^ =8(<t>-((>') with -n<ct>,ct>'<n m——oo 2-12b Identify thus vectors {<j)\ and \m) such that (<t>\m)= — ^m /2tt with -ir<(j><ir and m = 0,±1, ±2,... , and verify the relations &(j)(m\4>)(<f)\m') = <5TOiTO, , -7T OO m=— oo Explain why these statements express the orthogonality and completeness of the vector sets. 2-12c Define Hermitian operators # and M in accordance with &<j>\4>)4>{<j>\ , M= ]P |m)m(m| . -* m=-oo State the analogs of (1.14.28) and (1.14.33) in terms of the unitary operators eim and e1^ . [You will need to introduce the convention that the vectors (0| and (<j>'\ are identical if <j> — <j)' = 2n. Why?] 2-13 Begin with the inequality |E l^(«')^(a")-^2(a')^i(a")|2>0 a',a"
140 2. Continuous q,p Degree of Freedom and arrive at (l|l)(2|2)> |(l|2)|2 along with the condition for the equal sign to hold. 2-14 Hermitian operators A and B have expectation values (^4), (B) and spreads 5A, SB, respectively. Define X = SB(A- (A)) +iSA(B- (B)) and exploit (cf. Problem 1-27) (x*x\ > 0 , (xxi) > ° to establish 6A6B>^\i[A,B]\ along with the condition for the equal sign to hold. This is Robertson's* general form (1929) of Heisenberg's uncertainty relation. Apply it to the pairs (A,B) = (q,p) and (A,B) = {aX)ay). 2-15 The result ^4|l) of applying a Hermitian operator A to a normalized vector |l) is, quite generally, of the form A\\)= |l)a+|2)& where |2) is a normalized vector orthogonal to |l). Show that a and |6| are the expectation value and the spread of A, respectively, if |l) specifies the state of the quantum system. What changes if A is non-Hermitian? 2-16 By exponentiating a small multiple of the Hermitian operator A, one gets a unitary operator exp(ie^4), |e| -C 1, that differs little from the unit symbol 1. Evaluate the expectation value (exp(ie^4)) to second order in e, and demonstrate that |(eieA)|2 = l-£2(M)2+0(£4). Apply this to the situation of Problem 1-10. 2-17a In Section 2.4, the "permissible simplifications" (q) = (p) = 0 and Sq = dp are made. As a justification, consider the general situation, in which the expectation values (q), (p), as well as 'Howard Percy Robertson (1903-1961)
Problems 141 {q2) = (q)'2 + (Sq)'2 , {p2) = (p)2 + (Sp)\ (h(qp + pq)) = (q)(p)+A are arbitrary except for the restriction (why?) (5qSp)2>\ + A2. Three consecutive linear transformations, (i) -► (i)=u.^ (i) u2=(™* sini) (i), \pj \pj \pj v~sm^cos^/ \pj (1)-09=^ (I) *=(?/) • are then chosen such that <Q>=0, <P>=0, (Q2) = (P2), (|(QF + FQ)) = 0. Verify first that each tranformation preserves the fundamental commutation relation, that is &P] = [U}=[Q,P} = [q,p}=i- Then note that (5) = 0, <p) = 0 by construction, so that (5) = 0, (p) = 0, (Q) = 0, (P)=0. Next, determine the angle <f> such that (liqp + pq)) =0. Finally, find the value of k that ensures (Q2) = (P2)- Can you thus justify the simplifications (q) = 0, (p) = 0, (q2) = (p2), and ((qp + pq)) = 0? 2-17b Express (<5g)2, (<5p)2, and A in terms of 0, k, and e = 2(Q2) = 2 (P2) > 1. What do you get for (SqSp)2 - A2? Conclusion?
142 2. Continuous g, p Degree of Freedom 2-17c Verify that the unitary operators U\, U2, U3 are given by Ui(q,p)= jtoW-vW , U2(q,p) = e-Vtf +^2)/2 , U3@,P) = ei#c(«P + P«)/2. Show that Ui(q,p) = Ux(q,p) , U2(q,p) = U2(5,p) , U3(?,?) =U3(Q,P), both by an explicit check and by a general argument. 2-17d The total transformation CM?)-"©- involves £/ = £M<z,p)£M5,p)tf3(5,P). Without invoking the explicit forms of the C/^'s, demonstrate that [/ = [/3(g,p)t/2(g,p)t/i(g,p). This expresses U in terms of the original variables g and p. Can you write C/ as a function of the final variables Q and P? 2-17e If the second transformation is left out (cj> = 0), which value of k ensures (Q2) = (-P2)? Argue that this ¢ = 0 case is sufficient to justify the simplifications (q) = 0, (p) = 0, <5g = <5p. 2-18 Verify the sum rule of Problem 1-23 for A = y^y and B = ay^ + a*y with arbitrary complex a, 2-19a Use the definition (2.5.4) to derive n=0 a generating function for the Hermite polynomials, and recognize it in (2.7.35). 2-19b Rederive (2.5.8), (2.5.10), and (2.5.12) directly from this generating function.
Problems 143 2-19c Employ the generating function to prove the orthogonality relation r°° 2 / dq e~« Hn (q)Em (q) = 6nm y^2nnl . J — oo 2-20 Rearrange the binomial expansion (2.6.3) to show that JV|00| =0, and, similarly, that (N ~-n)\nn\ = 0 . 2-21 What are the analogs of Problem 1-54 for the operators y and j/t? Combine them with 1001 = exp(—y^;y) to prove that 2/|00|=0, \W\y*=Q. Now check that |oo| |oo| = |oo|. 2-22a Prove that e-V;» = (i_A)JV, N = y*y. For A -> 1 these are expressions for 1001. Put A = 1 - e-P and get use this to identify the eigenvalues and eigenvectors of TV. 2-22b Discuss the interpretation of this relation for /3 <C 1 and for /3 -> oo. 2-23 Here is yet another method for finding the eigenvalues and eigenvectors of N = y^y. First show that, for any vector | ), the wave function ^{y^ ) — (y^ | ) is an entire function of j/t , that is: ip(y^ ) is analytical everywhere in the complex y^ plane. Then use this global analyticity to get
144 2. Continuous q, p Degree of Freedom the eigenvalues and eigenvectors of N. [Dirac calls tp(y^ ) the Fock* representation of the vector I ). In today's terminology, however, the set of probability amplitudes tpn = (n| ) constitutes the Fock representation, and the stationary-uncertainty states |n) are commonly termed Fock states.] ~\yf;y = P-Xyu,y( 2-24 Show that ye-Av,v= e-*v;y(i-A)j,, and state the adjoint. Then verify that r= e-2yu>y = (-i)yfy is the reflection operator of Problems 2-9. 2-25 Exhibit yky^ for k = 1, 2,... as polynomials in TV. Use them to prove that eW = (i _ a)"""1 for |A| < 1 . Replace the A in Problem 2-22a by —A/(l — A) and conclude that 3Ay;yt = 1 rJ^yf;y 1-A Check that both sides give the same result when applied to |0). 2-26 By differentiating with respect to j/t, and y, show that Y, the equivalent ordered operators of Problem 2-25, is such that y 1-A ' y 1-A Accordingly, generalize Problem 2-25 to 2-27a In the continuum limit, the analog of operator W in Problems 1-43 W= f dx\q' = x)(p' *Vladimir Alexandrovich Fock (1898-1974)
Problems 145 Find W|p'), W|g'), and state the analogs of the various forms for W in Problem l-43c. 2-27b Show that Wy = iyW , y^W = W%f . Use them to express W as a function of j/t and y. 2-28a The operator 7=\(Ox + icry)y + yf \(crx - iay) is a Hermitiaii combination (show this) of y,y^ with the non-Hermitian Pauli operators \{ax ± ltJy)- Find the eigenvalues of 7. [Hint: What is 72?] Then express the eigenvectors of 7 in terms of the common eigenvectors In, ±) of yiy and az. 2-28b Repeat for 7 = \{ax + iay)y + y^\{<Jx — wy) + ^z- 2-29 Verify that /^k')0)ei(5'2 + 2W+p'2)(0)P'| = l J 47T ' ' holds for the coherent states of (2.7.38). How does this tell us that the \q',p' = O) vectors are complete by themselves, and also the \q' = 0,p') vectors? Can you find more completeness statements for the coherent states? 2-30 Use (2.9.7) and the completeness statement (2.7.50) to get *{*-0N} = fc^e-^W'le-nv') yJ -y 1 1- e' Is this result expected from the spectrum of TV? Employ the completeness relation of Problem 2-29 for an alternative derivation. 2-31 Use the result of Problem 2-30 to evaluate (N) = tr {NP} with P = tr{e-PN} Does this P obey the general conditions imposed on the probability operator (cf. Problems 1-30)?
146 2. Continuous q,p Degree of Freedom 2-32 Exploit the completeness of the |nn| measurement symbols to prove that yf(N) = f(N + 1)2/ and f(N)yt = V*f(N + 1)- 2-33 Evaluate tr { e~P } by differentiating with respect to /?, then applying the result of Problem 2-32, followed by the use of the cyclic property of the trace. You will need a boundary condition for /? -> oo. Does your final result agree with that of Problem 2-30? 2-34 Use the result of Problem 2-30 to find tr | e-^} with TV = y\yx + y\y.2 , where 1 and 2 refer to different degrees of freedom. From your result derive the eigenvalues of this TV along with their multiplicities. Check the latter by an elementary argument. Can you supply the corresponding answers for ;/ degrees of freedom, V 2-35 In view of the cyclic property of the trace (cf. Problems 1-24 and 1-46), is it true that tr{j/j/t}-tr{j/tj/}=0 ? In particular, give the explicit numerical forms of the two sides. Then, compare with the explicit numerical forms of (A > 0) tr j^e-Ay+jyJ.tr j^e-Ay+yJ Is this zero? What is the moral? 2-36 Why haven't we talked about a vector obeying 2/t| )=0? Answer by multiplying on the left by y, and also by using the q' description to construct the wave function representing the vector. Nevertheless, there is a sense in which |j/t ^ and (j/'j can be defined. Consider \y'\>0: (y'\= / d/' e"^ V(yt"\, Jo with the integration path extended to oo so as to produce convergence. Then (check this)
Problems 147 (y'\(y-y')= / ^"£» Jo owt /o dyi = -<tftW = 0|; t" / // no£ zero, but a constant vector. So 1 <s/'l = <o|-r—. <s/'|s/"> = -rLi7- y y y y As a check of consistency, compare <2/W> = wv 2/' - 2/" with <J/'|j/|j/"> = J/'<J/'|j/">-<0|j/") 1 = y'-y" y'-y" Also, from (prove this) we get ~W = W Vn! -•—v—' vn! = W or Then (2/» (2/0 nn+i ' HO = ^- n=0 n=0 (2/0* 2/' - 2/" for \y'\ > |j/ '|.
3. Angular Momentum 3.1 Infinitesimal unitary transformations Physical properties or combinations of them are symbolized by operators X, Y,... obeying algebraic relations, X+Y — Z, XY = Z; states are symbolized by vectors ( |, | ), with algebraic relations, X|l) = |2), |l) + |2) = |3), all this subject to the adjoint relations, such as A^ = A for a Hennitian operator and (ll-X't = (2|. There are numbers formed by the vectors and operators: (l|2), (l|X|2), or equivalent traces, e.g., tr{X|2)(l|}. Suppose one systematically redefines all vectors and operators: (1 = (1^, 1) = ^1) X = U~1XU (3.1.1) where W = U' l is a unitary operator. Then all algebraic adjoint and numerical relations are maintained: X + Y XY |1) + |2) = X|l) = -z\ -- z |3> |2> A*=A ( X + Y = Z XY = ~Z ]iy+|iy=|3> X\l) = |2) 3f = A (3.1.2) and <lt=T>. (^) = W = 0]x^>. <!|* 12) = (1^12)- (3.1.3) Thus the symbolic and numerical representations of the atomic phenomena permit the freedom of unitary transforms, in much the same way as there is a freedom of coordinate systems. Indeed the two are linked: the change of description produced by introducing a new coordinate system is represented by a unitary transformation. U = 1 is the identity transformation. We are interested in transformations that differ infinitesimally from the identity: U = 1 + iG , (3.1.4)
150 3. Angular Momentum where G is an infinitesimal operator of order e; in f/t = 1 - iGt , U*U = (1 - iGf)(l + \G) = 1 + i(G - Gt) + G*G (3.1.5) the product G^G is of second order, oc e2, and is neglected. Therefore Gt = G ; (3.1.6) the generator G is a Hermitian operator. In what follows we use a notation based on analogy with this familiar change in coordinate system: x 0 0 5x x = x — Sx The displacement of the origin by 8x changes coordinate x into x — 5x. So, we write X-l(XG-GX) , (3.1.7) \{X,G\. (3.1.8) 3.2 Infinitesimal rotations Consider an infinitesimal rotation of the coordinate system, about the z axis through the angle dtp, \)-S\) = \)=U-l\) = (l-iG)\), (\-H\=V\=(\U = (\(l + iG), X-6X = X = U~lXU = (1 - iG)X(l + iG) and get <5|)=iG|), <5(| = -(|iG, SX
3.2 Infinitesimal rotations 151 The components of any vector V are changed to VX=VX+ SifVy , Vy = Vy- 6ipVx , VZ=VZ, (3.2.1) together: V = V - Su x V , (3.2.2) where <5o> here is the product of dtp with the unit z vector. Here then SV = Su x V . (3.2.3) The infinitesimal nature of G is clearly set by 5<p; we write G = 6cpJz =6u-J (3.2.4) for this generator of infinitesimal rotations. What name do we give to the vector J? How about angular momentum? The appropriateness for this classical mechanics terminology (which is all that we borrow) will become clear later. We have learned that Su xV = l[V,Su;-J} (3.2.5) which supplies the commutators relating the components of any vector, with the components of J. As one way of expressing this, we multiply by numerical vector a and get (<5a> -> b) l[aV,bJ]=axbV. (3.2.6) Thus, parallel components of V and J commute, [Vk,Jk]=0, (3.2.7) perpendicular components have commutators proportional to the component of V in the third direction: 1 rT/ T , _ J +Vi if jkl is a cyclic permutation of xyz, . . jl j> kl~~y__yl if it's an anticyclic permutation, K ■ ■ ) as illustrated by \[VX,JV] = Vz , j[Vv,Jx] = -Vz . (3.2.9)
152 3. Angular Momentum These relations must also apply to vector J: l[a J,bJ} = axbJ (3.2.10) thus •Jx'Jy ~ JyJx = ^'>z , (o.Z.LL) for example, or J x J = iJ . (3.2.12) The answer to Problem l-36a, [a-cr,b-cr] = 2ia x b a , (3.2.13) gives us an example of an angular momentum vector J: J = \o. (3.2.14) We began with vectors, but even simpler is a scalar: S. It is unchanged by the rotation, SS = 0, so 0 = 1{S, 5u ■ J) or [5, J) = 0 . (3.2.15) The scalar product of two vectors is a scalar. Are these characterizations consistent? We are asking whether it is true that ' 5(VX ■ V2) = SVt -Vi + Vf 6V2 ? (3.2.16) which amounts to 6u x Vi ■ V2 + V1 ■ 6u x Vg = 0 . (3.2.17) = Vi x 5u> ■ V-2. Put more generally: Does [XY, Z) = X[Y, Z) + [X, Z)Y (3.2.18) hold? Yes, see Problem l-21a. 3.3 Common eigenvectors of J2 and Jz The square of the angular momentum vector, J2, is a scalar, so [J2,J] = 0. (3.3.1)
3.3 Common eigenvectors of J2 and Jz 153 In particular J2 and Jz commute; they are compatible properties. But we cannot add Jx, or Jy to this list; they commute with J2, but not Jz. We now ask for the eigenvectors of J2 and Jz: J2\J2',J'Z) = \J2',J'Z)J2' , JZ\J2', J'z) = |J2', J'Z)J'Z - (3.3.2) It's convenient, to introduce the non-Hermitian operators (frequently called ladder operators) J-\- zr Jx f Wy , J— ™ Jx l^y ™~ J-\- [o.O.O) such that [Jz, J+] = lJy + Jx = J+ i {Jz,J^=iJy-Jx = -J~- (3.3.4) We also have J+J- = (Jx + iJy){Jx — iJy) = Jx + Jy — i[Jx, Jy] = J J z + Jz 1 J^J+ = (Jx - Uy)(Jx + Uy) = fx+Jy+ l[JX, Jy] = J2-J2~JZ, (3.3.5) so that [J+,J_] = 2JZ. (3.3.6) For simplicity, write J'z — m, and, for reasons we see in a moment we shall write J2 = j(j + 1) with j > 0 by convention. Then from JzJ-y — JjrJz ™ J-y Or JzJ-y ™ J-\~yJz + 1) (o.O.Y) we get JzJ+\j,m) = J+(JZ + 1)|.7,m) = J+\j,m)(m + 1) . (3.3.8) Now J2, commuting with J, commutes with J+, so that J2J+\j,m) = J+J2\j,m) = J+|i,m)j(j + 1) , (3.3.9) which says that J+\j,m) = \j,m + l)Ajm (3.3.10) with a numerical factor Ajm to be found; if j, m are eigenvalues, so, in general, are j,m + 1. But this increase in rn cannot go on forever. Note that
154 3. Angular Momentum /• It t I- \ J (J+b''m)) (J+\j'm)) >°. ,,,m (j,m|J_J+|j.m) = < V ' / \ ' ) (3.3.11) [j(j + l)-m(m + l) , so to (to + 1) cannot exceed j(j + 1). In fact since the largest value m of to is such that J+1j, m) = 0, we have just m = j (3.3.12) (—j — 1 is no£ m). Incidentally, the above relation also tells us that j(j + 1)- m(m + 1) = (j - m){j + to + 1) = \Ajm\2 (3.3.13) where -4jm = {j,m + l\J+\j,m) = (j,m|j_|j,m + l)* . (3.3.14) Now, starting with JZJ_ = J_(JZ~1) (3.3.15) have and implying where JzJ_|j,m) = J-\j,m)(m - 1) (3.3.16) J2J„|j,to) = J_|j,m>i(j + 1) (3.3.17) J-\j,m) = |j',to - l)Bjm (3.3.18) Sim = (j,m- l\J-\j,m) = A*jm^ . (3.3.19) This tells us that \Bjm\2 = {j+m)(j + l-m) . (3.3.20) So, in general, if j, to are eigenvalues, so are j, to — 1. But the decrease of to cannot go on forever, or positive \Bjm\ would turn negative. Indeed, the lowest value to of to is such that J_ | j, to) = 0, or to = -j . (3.3.21) The descent from m = j to to = —j occurs in n = 0,1,2,... steps. Therefore 2j — n,
3.4 Decomposition into spins 155 j = \n =0,|,1, |,2,... . (3.3.22) For each n, or j, the possible values of m are m = j,j- 1,..., -{j-l),-j, (3.3.23) which are n + 1 = 2j + 1 in number, j 0 £ 1 | 2 ... 2 2 (3.3.24) 2j+ 1 12 3 4 5 In particular j = i 2j + l = 2, m = ±|, is clearly the Stern-Gerlach system: ' =1 = 1(1 + 1)- arize: •J2'=i(i + l) , J = \n, J'z=m=j,...,-j, To summarize: J+ |j, m) = | j, m + l>\/(j — m)(j + m + 1) , J'-li.m) = \j,m-l)y/(j +m)(j -m + 1) , (3.3.25) where the usual phase convention is adopted, namely that Ajm > 0 and Bjm > 0- 3.4 Decomposition into spins The combinations 3 - m = n_ = 0,1,... , j+m =n+ =0,1,... , (3.4.1) direct our attention to another formulation. First note that j = |(n++n_) = n , m = |(n+— n__) . (3.4.2) Then, relabeling the states, we get J+|n+, n_) = |n+ + l,n_ — l) v n_(n+ + 1) , X-|7i+,n„) = |n+ - l,n+ +l)i/n+(n_ + 1) . (3.4.3) Doesn't this set the bells ringing? Recall (2.4.14) and (2.4.15): j/|n) = |n - l)v/n , y^n) = \n + l)\/n + l \ (3.4.4) It is as though there are two independent systems, y+,y+, and y_,y_, with composite states |n+,n_) = |n+)|n_), and
156 3. Angular Momentum J+ = j/+j/_ , J- = yly+ . (3.4.5) In addition, Jz = \{y+V+ ~ V-V-) , J'z=m= \{n+ - n_) , (3.4.6) and [j = \/j(j +1) +f-i] \/-/2 + I " | = £(j/+J/+ +J/lj/_) • (3.4.7) Notice also that Jx = \{J+ + J-) = | (2/+2/- +2/12/+), Jy = ±(J+-J^) = i(ij/lj/+ -ij/+J/_) . (3.4.8) Then, if we introduce the 2-component objects y=(y+\ «t = r«t „t we find that y_ , , 2/ = (2/+,2/1), (3.4.9) J = y^<ry, y/J2 + \-^ = ^y, (3-4.10) where the cartesian* components of vector cr are the Pauli matrices of (1.10.3). The explicit construction of the eigenvectors that follows from 171) = -^^10) with 2/10) = 0 (3.4.11) VTl! is (j ±m = n±) \j,m) = -^±L= ^-/ |0) with J|0)=0, (3.4.12) V(j+m)!(j-m)! where |0) = \j = 0, m = 0) = |n+ = 0, n_ = 0). This state |0), with angular momentum equal to zero (notice there is no conflict with the commutation relations if all components of J have the value 0) is unchanged by rotations: <5|0) =i<5o>-J|0) = 0; (3.4.13) it is rotationally invariant. For j = |, to = ±|, we get |U>=2/+|0), ||,-|) =2/l|0). (3.4.14) *Rene Descartes (1596-1650)
3.4 Decomposition into spins 157 We can say that, y\,,o = ±1, acts on the |0) state to create a j = | state of angular momentum, the one with m = |ct. That tells us why the a matrices appear in the construction of J. For {\MJ\hm')\m=\a,m-=\a- = <°k£Ws/M°> • (3-4-15) <t" ,<t"' Now the commutator properties of y and j/t are summarized by [y*, vU = &** ; [2/., vA = 0 ; [yl yU = 0 . (3.4.16) So ya„,yl\0) = (<W< +^,^,,,)10) = |0>«W , (3.4.17) the adjoint of which is (a' -> a, a'" -> a") {Q\yayl,=5aa„(Q\. (3.4.18) Therefore (i,ia|j|!,ia'> = <a|i<T|a'> (3.4.19) which of course says that, for j = |, J = \a. The basic angular momentum, j = |, is called a spin. The construction for |j, m) in (3.4.12) can now be described by saying that angular momentum j, rn is produced by creating n — (j + m) + (j — rn) = 2j spins, where the combination of j + m +|-spins and j — m —\-spins gives for J'z: +|(i+m)-|(i-m)=m. (3.4.20) For thoroughness we ought to verify directly that J = E^<ffIHff'K' (3-4-21) <7,<7' obeys the J x J = iJ commutation relations. Consider then [vlva-iyl'Va"] = viivl'Va' + ^a'a")ya„, - ^,,(<5<x"'<x + yiya.,,)ya, = SsvvUa- - Sv'rvl.Vr, , (3-4.22) where the cancellation occurs because the order of multiplication of y+ and 2/1, or 2/+ and 2/-, does not matter. Notice that this commutation relation has the same structure as that of the spin measurement symbols (Stern-Gerlach)
158 3. Angular Momentum [I f\ I I' ///ll C I Hl\ C I II l\ /O A O0\ |ctct|,|(TCT J = Oa>a»\crcr \ — <V"<r' \<r o \ ■ (3.4.23) Of course, the latter also have the multiplication property I /11 II ///1 r I ///1 /-q a o/l\ \aa \\(J (J | = Oo-v" |<7<7 | (3.4.24) whereas [see (3.4.16) above] yiva-vl'Va- = s*'*"yUai» + yiyl»y„>y„>» ■ (3.4.25) However, as applied to one spin, ya,y<T,„\n = 1) = 0 and (n = l|j/tj/t„ = 0 , (3.4.26) and they are the same. So (cr'\, which, acting to the right, annihilates the spin of type a' is replaced by ya> which, acting to the right, annihilates a spin of type a'. Similarly, for |ct) and yl. This kind of extension from one system to any number of similar systems is usually (somewhat erroneously) called second quantization. Now, we know that the product of two operators (here, for one spin) is represented by the product of their matrices, the commutator of the operators by the commutator of the matrices. So, since the algebra of the commutators is the same, J commutators are given by J x J = E V*M 1° x \°\°')V* = iJ . (3-4-27) = i|<r indeed. 3.5 Angular momentum of a composite system In recognizing the significance of J as a collection of n spins we took it as intuitively obvious that J'z is the sum of the individual ±|'s. Now let's look more generally at the angular momentum of a composite system. Consider two parts, with angular momenta J% and J2, which of course commute. An infinitesimal reference frame rotation acts similarly and independently on the two parts U = UiU2 = U2U1 = (1 + W« • Jx)(l + W« • Ja) = 1 + i6u> ■ (Ji + J2) . (3.5.1) Thus, the total angular momentum is J = J1+J2. (3.5.2) We verify that
3.5 Angular momentum of a composite system 159 J X J = (Ji + J2) X (Jx + J2) = J\ X J i + Jt X J2 + J2 X Jt + J2 X J2 = iJi =0 =iJ2 = iJ . (3.5.3) How are the possible values of j [in J2 = j(j + 1)] determined by j\ and j2? Here's a simple counting argument. The largest value of m = m\ + m2 is to = ji +j2, which is unique. That is, only (7711,7712) = (./1,./2) will produce it. Hence the largest value of j is imax = Ji + j-2', it has to = j\ + j2 and h +h ~ 1. • • •, -(./1 + J2 -1), -(ji + ./2)- Now consider to = ji + j'2 - 1; it can be produced in two ways: (mi, m2) = (ji - 1,./2), and (toi, to2) = (ji, j'2 - 1)- One such state is already contained in j = j\ + j'2, m = j\ + ji ~ 1- Hence there must be another state for which to = j\ + j'2 — 1 is the maximum value, thus it has j = j\ + J2 — 1. For to = j\ + j2 — 2 there are (in general) three ways: (mi,m2) = (ji -2,j2),(ji -1,J2 — 1), (j 1, J2 -2). Two of these accompany j = j\ + j% and j = j\ + j'2 — 1, so there must be a third state for which m = j\ + j'2 — 2 is maximum; it is j — j\ + j'2 — 2. There are only a finite number of states; where does this stop? Consider to = j\ + j2 — Ay. (7711,7712) = (ji - Ay j-2), • • • , (./1,./2 - 4/); 4/ + 1 in number. Suppose that j2 < j\. Then if 4/ = 2j2, the values of 77½ range over the full gamut, m-2 = J2, • • • , ~J2- And this means that if we now increase Aj by unity, to 2j*2 + 1, we will not get an additional state; the lowest value of j is jm[n = j\ +J2 — 2j2 = I Ji — j21, the latter holding generally. We can verify this result, h +32 >j> |j'i — J2 i , 3>m>-3 , (3.5.4) by counting the total number of states. It is (j\ > j2) 31+32 J2 (2j + 1) = (2ji + 1) x (2j2 + 1) , (3.5.5) 7= ji — J2 aver, value no. of terms as it should. To illustrate this consider two spins, j\ = j2 = |. Then we start with m =,/ = 1: |l, 1) = |++) , i.e., mi = 7712 =+| (3.5.6) which, one should notice, is perfectly symmetrical between the two spins: 1 <r+ 2 leaves this vector intact. Now we use J-\j,m) = \j,m)^(j +m)(j-m + l) [cf. (3.3.25)], here for j = 1, J_|l,l)=|l,0)>/2; J_|l,0)=|l,-l)>/2 (3.5.7)
160 3. Angular Momentum to construct the other states of j = 1. For this we use the j = | properties J- \h I) = 12' 2) ' -^~ 12'— 2) = ° - (3-5.8) So, from |l, l) = 1++)) we get 11,0) = -^(^.+72-)1++) = ^(1-+) + 1+-)) - (3-5-9) which is a unit vector, and is symmetrical in 1 and 2. Then = 1(1-) + 1-)) = 1-), (3.5.10) which we could have written directly (there is only one way to get m = —1), although we did have to see how the phase came out. Here are the 2j + 1 = 3 states of j = I +1 = 1. The only other possibility among the 2x2 = 4 states is the 2j + 1 = 1 state of j = | — | = 0. This m = 0 state is the combination orthogonal to |l,0), which is (to within a phase factor) 10,0) = -^ (1-+)-1+-)) - (3.5.11) Under the interchange 1 ■H- 2, this vector turns into its negative - it is anti- symmetrical. The symmetry properties of the 3 + 1 states can be expressed by an operator. First we note that ^=^)=(^:5: (3-5-i2> or so the permutation operator P12, which tests symmetry or antisymmetry under the exchange 1 ■<-» 2 is -P12 = (!<T! + |cr2)2 - 1 = § + 2<T! - cr2 + I - 1 = £ (1 + <ri • <r2) - (3.5.14)
3.5 Angular momentum of a composite system 161 This operator is such that p12K,<4') = K>2)- (3-5-15) For n spins, the state with m = j = |n is !+ + ■•■+) (3.5.16) n which is totally symmetrical in all n spins. All the other m's for j = in, produced by successive application of J„, which is symmetrical in all the spins, are also symmetrical. It is that total symmetry that picks out the maximum value of j. And that is what is displayed by the construction [cf. (3.4.12)] \jt m) = -j | y\yl ... ylyl ... |o) , (3.5.17) V(j +m)l(j -to)! v - v ' j + m j — m for all these operators are commutative, and have no association with any individual spin. The repetition of an infinitesimal rotation, say about the z axis, produces a finite rotation: U(<p) = (1 + i6ipjz)(l + i6<pJz) ---(1+ iSipJz) v ' <fi/5ip S> 1 factors = litn (l+iS(pJz)'p/Sv = elLpJ* ■ (3.5.18) Stp->0 The first use we put this to is a more elegant derivation of the addition of two angular momenta. Begin with the trace r - t -, _i_ • ei(j + l)¥>_ P-w sin ((i + |V) (3.5.19) sin(j¥>) and then consider J = Ji + J%, so eiipJz = eivjlz eivj2z (3.5.20) This operator has the matrix (diagonal elements) (ji, mi; j2, to2 | ellfiJz \ji, mi; j2, m2) =(ji, toi | elipJlz \ji, mi) x(J2,m2|ei(^|i2,m2), (3.5.21)
162 3. Angular Momentum leading to the trace for given j\,J2, the sum over m,\ and m-i'- trtAJ^e^'} =tr<j0 {e^-Jtru,) { e1^ } = J2 e[(mi+m*te. (3.5.22) Now *<*) {^j-} *<*> {^-}=sin((V^} sin((V iv) u; I J u ; I J sin(^) sin(£y>) _ - COS ((/l + J2 + l)f) + COS ( | J] - j2 | ¥>) 2sin2(|<^) J1+J2 53 [cos((i + 1)(/5) - cos(j»] J=Ul-j2| or [i + l = (i + |) + i,i = (i + |)-|] ji+h ^(n){^J^}tHh){e^}= Y, 2 sin2 (iy>) in((j + |V) (3.5.23) sin sin(iy>) J1+J2 i J2 J2 eimv (3-5-24) i=|ji-J2l m=-] showing that the totality of m,\, m-2 values can be classified by total angular momentum eigenvalues: j>m> -j, ji + j2 >j> \ji — is i ; (3.5.25) see (3.5.4). Putting ip = 0 repeats the alternative, equivalent, count of states J1+J2 (2j1 + i)(2j2 + i)= J2 (2i + 1); (3-5-26) j=\h-h\ see (3.5.5). 3.6 Finite rotations. Eulerian angles By changing the coordinate system, the rotation around the z axis becomes the rotation about the direction given by unit vector n. Then if we call
3.6 Finite rotations. Eulerian angles 163 the rotation angle 7 we get the unitary operator representing an arbitrary rotation U = el^n • J . (3.6.1) Notice that three angles are required to specify this rotation, two to define n, and 7. Other ways of choosing the three angles can be useful. The one introduced by Euler uses three successive rotations of the coordinate system. 1. Rotate through angle (f> about original z axis: z, z Ux = e^J* , J = UilJUy 2. Rotate through angle 0 about new y axis: (3.6.2)
164 3. Angular Momentum U-2 = eWlv = U^1 e[ejv Ui , = U2 7 = U.2'llU.2 = U-2~lU{lJUiU2 = U^U^JUM 3. Rotate through angle ip about new z axis: z,z (3.6.3) y,y U3 = e1^ =^-1^1 e|^£/2£/, = (U^)'1 (U3U2U{) , = u3 1 = VZ'11V3 = [(UsU^y1 (u2Ui)] [(£/2^1)-^(^2^1)] x [(tWr'O/s^t/i)] (3.6.4) or J = U'lJU with U = U3U2U1 . (3.6.5) We have applied a sequence of transformations, each defined by its action on the previous coordinate system: U = UXU2U3 (3.6.6) and ended with the product of the three transformations defined on the same system, the original system, applied in the reverse order U = U3U2Ui . (3.6.7) Another way of getting at this begins with a (two-dimensional) Euclidean analogy:
3.6 Finite rotations. Eulerian angles 165 The final relation between vector and coordinate system is the same whether the vector is fixed and the coordinate system is rotated (counterclockwise) or the coordinate system is fixed and the vector is rotated (clockwise). Now consider, say, (a'| ) where (a'| is a coordinate system for vectors such as | ), and change the coordinate system (as will be induced by a change of reference frame) (a'\->(a'\ = (a'\Ui . (3.6.8) Then (a'| )-> (a'\Ui\ ) (3.6.9) and we have in effect retained the coordinate system but changed the vector: \)->Ui\). (3.6.10) Now make a second change of coordinate system, (a| -> (a'[£/2, so that (a'\ ) —► (a'\Ui\ ) —► (a'|£/2tfi| ) (3-6.11) which shows quite clearly that for any number of successive changes, the net operator is U = ---U3U2U, So, for Eulerian angles, W,M) eii>jz ei9Jv ei<f>jz (3.6.12) (3.6.13) See Problem 2-17d for another example illustrating these matters. The dependence on the three angles lets us produce differential-operator realizations for the three components of J. Consider (w| = ( |f/(w). Then, since 1 d U(u) = U{u)Jz , (3.6.14)
166 3. Angular Momentum we get 1 f\ t — (uj\ = (uj\Jz. (3.6.15) 1 d(j) ' ' Next ~U(u) = UWe-W'JyfM3* (3.6.16) 1 Ou which introduces one of the two statements about a z rotation: Jx (¢) = e-W* Jx e^J* = Jx cos <f> + Jvsm<f>, Jy (¢) = e-^J* Jy e^Jz = Jy cos (f)-Jxsm(t). (3.6.17) Geometry aside, one can check this by producing differential equations: -Ja(0) = e~^Jn[Jx,Jz] J*J>=Jytt) , Jy(cf>) = e-W'iiJy, Jz] J*J' = -Jx{cj>) . (3.6.18) Define J±((j>) = Jx{<t>) ± iJy{<j)), which obey ^J+(0) = -iJ+(0), ^J_(0)=iJ_(0), (3.6.19) so J+(0) = e_i^J+ , J_(0) = e^J_ , (3.6.20) and Ja(0) = I[J+(0) + J_(0)], J„(0) = ^[J+(0) - J_(0)] give (3.6.17). So we have learned that 1 d - —(w| = (oj\ (-^ sin¢ + Jy cos0) . (3.6.21) Finally, I Amw) = e^'J* el6Jy j*j' = U(u) e'W' e~idJvJz el6Jy e1^ i dtp (3.6.22) or with x —} z, z —} y, y —} x, <j) -> 8 in (3.6.17), t-^jU(uj) = £/(w) e~^J* (Jz cos6» + Ja sin6») e^J* = C/(w) (^ sin^cos^ + Jy sin 0 sin (^) + J2 cos#) , (3.6.23)
3.6 Finite rotations. Eulerian angles 167 leading to ld_ i dip and then to 1 fl 9 sin 9 \ i dip (u I = (u) | (Jx sin 0 cos <p + Jy sin 0 sin <p + Jz cos 9) cos 9 1 9 (w | = (uj I (Jx cos <p + Jy sin 0) (3.6.24) (3.6.25) Equation (3.6.24) could have been anticipated. Angle ip refers to rotations around the new z axis, which points in the direction given by spherical coordinates 9, (p. This is just the component of J that appears here. Now multiply the 9 equation (3.6.21) by ±i and add to the above; the outcome is =±i0 9 1/19 ,19 =^ + ~^~e ttt ~ cos^T"^r. d9 sin 9 \ l dip i dq> (w| = {uj\J± We can also construct a differential operator representing J2: J = J_ J+ -¥ Jz + Jz i (3.6.26) (3.6.27) so or (oj\J2 (lo\J2 ^1 96>2 x e^ d9 sin9 \i dip i d(j> d_ _1_ 9$ sin ( 1_9_ i dip cos 9 1 9 + I- — i d<p COt0d9 19,, 1 ( 92 sin26> V^2 2cos0 — — — 9(^ 90 dip'2 (3.6.28) <W|. (3.6.29) We make just one application of this differential equation, for which it is helpful to write 92 9_ d92 +cot0dO ~ ^[0d9 19..9 Sm()d9 (3.6.30) We are going to integrate over all angles, combining the familiar solid angle sm9d9d¢ with dip. What is the range of integration? It is 0 —> 7r for 9, set by the vanishing of sin#. But for </> and ip, appearing in e1^^, e"^, the fact that J'z can be ±|, ±|,..., in addition to integers, leads us to integrate over
168 3. Angular Momentum the range <j) : 0 -> An; jp : 0 -> An in order to attain periodicity. So if we define the normalized "volume" 1 11 f doj = -sin8d0— dd>—dtp , / duj = 1 , 2 An vAn ^ J (3.6.31) the result of integrating the differential equation obeyed by U(ui) is to remove all d/d(p, d/dip terms which have no net change, and, as for the 0 derivative we get r d ( d sin0leu o. In short, we learn that which tells us that / dujU{uj)J2 =0 / dujU{uj) = |00| (3.6.32) (3.6.33) (3.6.34) the symbol of the selection of the j = 0 state, the one with J2 = 0. As for whether the numerical factor is right, note that 10011001 = |00| I dojU{oj) . (3.6.35) But the selection of J2 = 0, for which J = 0, means: U(u) = 1, and f du) = 1, which shows that the right side is just 1001. More simply put: 1001 has the value one for a state with J2 = 0. 3.7 Rotated angular-momentum eigenvectors We are given vectors <J,m|=(0| y+ yi- \/{j+m)\(j-m)\ What are these vectors in the rotated frame, (j,m;u\ =(j,m\U(u) = (0 = (0 —j + m—j — in \/(j+m)!(j-m)! 1 + m j — m y+ yt y/{j + m)\(i-m)\ U(w) (3.7.1) (3.7.2)
3.7 Rotated angular-momentum eigenvectors 169 In the last version we have used (o|t/(w) = (0| and defined y = U~1yU. (3.7.3) [In the notation of (3.6.2)-(3.6.5) we'd write y, rather than y.} To evaluate the latter, let's differentiate the form (3.6.1) for U(u), -&V = * #- e"i7H ' J y ^ -J = U~l[y,n-J]U. (3.7.4) l 07 107 Now so and a' a" ^ = S(a,a') = E(CTln • \a\a")ya" = (n ■ \cry)a , (3.7.5) x^y = n-^y (3-7-6) y-=<hn-2°y=Uy, (3.7.7) where £/ is the 2x2 matrix to the j = \ version of U. When we use Eulerian angles, U is W= e^5ff*eW5ff»e^ff*, (3.7.8) which is the matrix a\a' I I7/I <\ I e^cosfe^ e^sinfe '2 \ \ - e 2 sin I e 2 e 2 cos f e 2 y We now see that the (j, m;uj\ are given as a linear combination of the ( j, m' |, (j,m;w| =V(i,m;wkm')(i,m'| , (3.7.10) , » v ' = (j,m\U(uj)\j,m') by expanding
170 3. Angular Momentum Tl3 +m j -m j + m'j - m! /+ V~ = = Y(j,m\U{u)\j,m') /+ V~ (3.7.11) in which i± I » ,r . </ _, 2/+ = e 2 | cos - e 2 y+ + sin - e '22/, _ _;i ( 9 \± 9 _\$_ \ 2/_ = e '2 I -sin-e'2j/+ + Cos - e Lay„ J . (3.7.12) We should be able to simplify the jp and <j> dependence, because (j,m\U(u})\j,m'} = (j,m\e^Jz eWJv e[^J^\j,m') = eim^(j,m\ eWJy \j, m'} eiro'* . (3.7.13) Indeed, we see on the left the factor (e4)J + m(e--4)J-m=^ (3.7.14) and also just the combinations ^zy+, e l2j/_ occur, as required for e'-'V^-V:-' = (e'f^)^™' (e-Vr*"' - (3.7.15) So it suffices to put ip = 0, cj> = 0, and get / 9 -9 \3 +m / -s 9 \j — m (cos 22/++ sin 22/-) (-sin 2?/++ cos|j/-j \Aj +m)!(j-m)! j + m' j — m! ^> „ y/(j + m')l(j - m')\ = UU) ,(6) As an example consider j = 1. Then 1/0 . 9 \2 m = 1 : — I cos-2/+ + sm-2/_ 1 = cos2 °-^ + V2 cos °- sin | y+J/_ + sin2 ^ (3.7.17) or ^i(,V (0) = cos2 - , E/g (0) = v^ cos - sin | , £/¾ (0) = sin2 | (3.7.18)
3.7 Rotated angular-momentum eigenvectors 171 and w = 0 : ( cos - y+ + sin - 2/- 1 I - sin ^2/++ cos o y~ 1 y2 1 2/2. = —jz sin# —!= + cos #2/+2/- H—■= sin# -j= (3.7.19) v2 v2 v2 v2 or <V (*) = - ^= sin 0 , £#0> (0) = cos fi , Ui% (9) = -^ sin 9 (3.7.20) as well as m = —1 : 1 / . 0 0 — I - sin - 2/+ + cos - 2/- . 2 9 yi r- . 9 9 , = sin - —j= — V 2 sm - cos - 2/+2/- + c°s 2 v 2 2 2 0_2/?l 2V2 (3.7.21) or £/1¾ (0) = sin2 0 , £/!V)O(0) = -v^ sin °- cos | , £/¾^ (5) = cos (3.7.22) What we have here is an example of the probabilities relating measurements of angular momentum, or magnetic moment, in two different directions related by angle 9: P <(m,m';0) = \(j,m;w\j,m')\2 = [u^m,(9) (3.7.23) The table m 7 = 1: TO 1 0 -1 1 cos4f 1 sin2 9 sin4f A 0 2 cos2 ! sin2 ! cos2# 2 sin2 ! cos2 § -1 sin4! | sin2 0 cos4! (3.7.24) summarizes the j = 1 probabilities. Evidently, p(m,m') =p(m',m) , >Jp(m,m') =1 , Y^p(to,to') = 1 . m m' (3.7.25)
172 3. Angular Momentum For another specialization put m = 0, which requires that j be an integer: 1 = 0,1,2,... . Then (m' -> m) c°s ~j\)+ + sin -j/_ I I - sin -y+ + cos -y. V(J+m)!(J-m)! (3.7.26) Notice that the left side is multiplied by (—1) when y+ -> j/_, 2/- -> —y+. So the right side is also equal to ' „1 + m( ., \l - m y/{l+my.(l-m)l E (-^r<im(o) I + m I -m y/(l + m)\{l-mji (3.7.27) m —> -m and AD t0) KLV) = (-l)mU^m{9) (3.7.28) follows, for which Uq / = —Uq ij in (3.7.20) is an example. To get an expression for U^m(6), put 2/+ = sin 0, y^=t-cos0 (3.7.29) so that cos w-y+ + sin "-y_ = (t + 1) sin "- , 0 0 0 sin -2/+ + cos -y_ = (t - 1) cos - , and (-01¾^ = E,*')^^^ V(i + m)!(J-m)! which has the form of a Taylor series expansion. So (sm#) — -i—-—'— V ' \dtj 2lV. (3.7.30) (3.7.31) t(D U^>m(0) (sin 0) t = cos ( l + m {l-rn)\ (3.7.32)
3.7 Rotated angular-momentum eigenvectors 173 or U^(0)-^^^(^0)^(^) ^,, ^ ■ (3.7.33) (!-m)r ; Vdcos67 2^! In particular, Uq 0 is given by Legendre's* polynomial, as illustrated by U$ (9) = cos 9 = Px (cos 9) . (3.7.35) Is it accidental that we run into Legendre's polynomial here? Hardly. To see why this happened, let's restore the angle <j>: y+ H- y+ A , y_^y_ e-if , yl++ myl~ m -+ e1™^ "V-" m . (3.7.36) We now meet the I power of 9 x± . 9 _{± \ f . 0 ii 6» _ii cos - e 2 y+ + sin - e 2 y_ 1 i — sm - e 2 y+ + cos — e 2 y_ 1-1 = - - sin 9 e^y'l + - sin 9 e^yt + cos 9 y+y^ 1 = - [ sin^cos^ (y'i - y\) + sin # sin ^ (-ij/'i - \y\) + cos9 2y+y^ ] — x/r =vx —y/r —vy —z/r =vz where r,6,(j> are the spherical coordinates of point r [cf. the illustration in (1.12.2) for unit vector n = r/r], and the cartesian components of vector v are as indicated, " = (yl -y+,-iyl-iyl,2y+y-) ■ (3.7.38) Notice that v2 = (yl - yl)2 + (iyl + iy2+)2 + (2j/+j/_)2 = 0 , (3.7.39) f is a complex vector of zero length, a null vector. Its components are presented, in terms of y = I + j and its transpose yT = (y+,y_), in matrix form: *Adrien Marie Legendre (1752-1833)
174 3. Angular Momentum vx = yT{-Oz)y , vy = yT{-i)y , vz = yTaxy (3.7.40) or " = yT -Oycry . (3.7.41) l Now we have (r --)1 _ f jy ,. , /IT A.+ mvL-m ,, 7 42, where we have written U(%(0) eim* = JJJL-Ylm{0, ¢). (3.7.43) Consider the Laplacian* of the left side: V2(r • v)1 = V • Z(r • i/)' _ lv = 1(1 - l)(r • i/)' _ 2 ^2 = 0 , (3.7-44) it is a solution of Laplace's equation. Solutions that are homogeneous in r, of degree I are called solid harmonics; the r Yim(9,(j>) are solid harmonics [the term spherical harmonics is applied to Yim(6,<j))]. It is familiar that Legendre's polynomial gives the spherical harmonics that are independent of <j>; to = 0. Incidentally, the factor introduced in defining Yim(0,4>) by the generating function (3.7.42) is such that the orthonormality statement / &QYlm(0,4>)*YVrn.{0,4>) = 8lv8mm. with dtf = sin0d0(ty (3.7.45) has a simple appearance. For a proof based on (3.7.42), see Problems 3-14. The quantity v has been identified as a vector, but is it really? Suppose one replaces y by y = Uy then 1 1 v = {Uy)TTaycr(Uy) = yTUTTaycrUy . (3.7.46) Next, the effect of transposition on the Pauli matrices (1.10.3) is given by °"J = <*x , <?y = -°y , o-J = az (3.7.47) or crT = -<Jvcr<Jv . (3.7.48) *Marquis de Pierre Simon Laplace (1749-1827)
Then, with we have or giving 3.7 Rotated angular-momentum eigenvectors 175 U = e^n ' 2a = cos J + in • <r sin 2 (3.7.49) t 7 7-i ' ~ - - iffyn • crffy sin - — 7/ - £/r = cos — — iayn ■ cray sin — = ayU lay (3.7.50) U1<Ty=<TyU-1 , (3.7.51) V = yT\ayU'l<rUy. (3.7.52) We recognize in U~~1aU the transformation that produces er. Specifically, for infinitesimal transformations, we have W=l+W«-io-, (3.7.53) and yielding _ 1 1 <T = (1 — iduj ■ —cr)cr(l + iScv ■ -or) 1 = a - —[or, <5o; • or] = a - Su x a , (3.7.54) 17 = v - <5a> x v ; (3.7.55) f is a vector. Having seen how things work for m = 0, let's return to the general situation, using the same substitutions. After using (3.7.29) and (3.7.30) in (3.7.16), [(* + 1) sin fH' +"»[(*-!) cos |F ■ m y/{j + m)\(j -m)7 = E(Bi°*y + "»' ^-""^" ^,,, "£»' W ' (3-7"56) ^—' ./ i +m' ' 1-m/ VX/ + m')!C/-m')! we get
176 3. Angular Momentum Uu) , (6) = 77i.m' \ ' (j+m')!(smf) (cos |) (j -m')\ 2J + m'y/(j + my.{j-m)\ d ^i-m' [(cos0 + l)'' + m(cos0-l)''-m] . (3.7.57) ^ d cos 6 As we see by writing -^ (sin|) (cosi) = (tan^j (sine)~m (3-7-58) we recover the known form for m = 0. Note the simple example which can be immediately read off from the original expansion. Given two angular momenta, J\ and J2, what is the state of zero total angular momentum? As we see from the restriction |ji — j'2 | < j, j — 0 is only attainable for j\ = j'2 (we have already met j = 0 for ji = j2 = f). So (omitting jx = j2) (0| = ^2 (0|mi,TO2)(ml,TO2| , (3.7.60) 7711,7712 which must obey (0|(Ji+J2)=0. (3.7.61) The z component of this relation y^ (0|mi,m2)(mi +m2)(m1,m2| = 0 (3.7.62) 7711,7712 tells us that only values obeying m — rti\ + m2 = 0 (3.7.63) occur. Therefore, in a simplified notation, (°l = ^2 c™(mi "m\ ■ (3.7.64) m Now consider 0= (0|(Ji_ + J 2-) = Y^cm{m,-m\(Ji- +J2-) m = ^Cmj^/O'i -m){ji +m + l)(m+ 1,-m| (3.7.65) m + VO2 + m)(j2 - m + l)(m, -m + l| | ,
Problems 177 so that (j2 =Ji) 0 = ^2V(Ji ~m)(ji + m + l)(cro+cro+i)(m + l,-m| , (3.7.66) m implying cm+i = —cm. Therefore cm alternates in sign as m changes by unity, or which already incorporates the normalization factor for a sum of 2jj + 1 orthogonal unit vectors. For the example j\ = |, we get <0| = -i= [<+,-|-<-+|] , (3.7.68) which is just the previous result in (3.5.11), apart from a minus sign. Problems 3-1 Use the y\ , y_ description to find differential operators that represent the components of J. Apply the latter to the appropriate angular momentum wave functions to arrive at the known results for J|j,m). 3-2 There are two angular momenta, j\ = 1, J2 = o- Begin with the total 3 2 angular momentum state m = j = § in terms of the individual momenta, and then construct the other states of j = |. Next find the m = j = | state as the one orthogonal to the m=\, J = § state. Finally, construct the other j = | state and verify that it is orthogonal to the m = —|, j = § state. 3-3 What is the j = m = 2 state for the two angular momenta j\ = j-2 = 1? Find the four other states with j = 2. How would you get the states with j = 1 and j = 0? Construct all of them. 3-4a Iso (topic) spin T: The nucleon is a particle of isospin T = |; the state with Tz = \ is the proton (p), the state with T-j = — | is the neutron (n). Electric charge is given by Q = \ +T3. The n meson, or pion, has isospin T = 1, and electric charge Q = T3, so there are three kinds of pions with different electric charge: T3 = 1 (n+), T3 = 0 (n0), T3 = -1 (nr). Consider the system of a nucleon and a pion. The electric charge of this system is Q = | + T3. Check that a system of charge 2, T3 = |, is p + 7t+, according to the isospin assignments. Now, if the system is in the state T = |, T3 = \, what is the probability of finding a proton? What is the accompanying n-meson? Answer the same questions for T = |, T3 = — |.
178 3. Angular Momentum 3-4b The unstable particle A, with T = §, decays into T = § states of one nucleon and one pion. What fraction of such events for A+ produces a 7t+, a K°, an"? 3-4c Suppose nucleon 1 and nucleon 2 (charge Q = 1 + T% for this system) are in the T = 0 state. What is the probability that nucleon 1 is a proton? 3-5 The states of angular momentum j = 1 can be constructed from two J = 2 spins, in a symmetrical state. Use these two-spin states to evaluate the probabilities p(m,m',0) connecting measurements of components of J in two directions related by angle 8. Do this for m = m' = ±1 and m = —rn' = ±1. Then apply probability normalization to find all other probabilities. Independently evaluate p(Q,0,8). [The question refers specifically to two spins, not an application of a general result.] 3-6a For j = 1 evaluate tr {Jk} for k = x,y,z , tr {JfcJj} for k,l = x,y,z , tr {JlJf} for k,l = x,y,z . 3-6b Use measurement symbols and the trace formula to evaluate the probability for finding (Jz cosO + Jx sin#)' = 0 if J'z = 0 is prepared. 3-7a For j = |, two orthogonal cartesian components of J, such as Jx and Jz, are complementary. Show that they are not for j = 1. 3-7b How about two non-orthogonal components, such as Jz and e-J with some unit vector e? 3-8a Suppose J = |<Ti + \cri-, then what are the associated eigenvalues of (<Ti ■ a-i) ? Use them to show directly that (ci -cr-i)2 = 3-<ri a-2 and that the permutation operator Pr2 of (3.5.14) obeys /¾ = 1. 3-8b Use the algebraic properties of CT\ and cr-2 to verify that C1F12 = -P12C2 or a-2 = Py2 criPu , and CT2P12 = -P12C1 or <n = P^1 a2Pi2 ■ How does (3.5.15) follow?
Problems 179 3-9 For J = \<T\ + \<r-2, show that J2, Jy, J2 commute with each other. Find their common eigenvectors (sometimes call Bell* states), expressed in terms of the common eigenvectors of (J\z and o-iz- How are they related to the common eigenvectors of J2 and Jz ? 3-10a System 1 has arbitrary angular momentum J\, system 2 has J2 = \cr, so that j = j\ ± | for total angular momentum J = 3\ + J2- Show that the eigenvalues of a ■ J\ are given by (<T-J,)' = ji for j = jx + i , ^-O'l + l) for J=ii-|, and verify algebraically that (cr ■ Jxf + cr ■ Jx - J! (ji + 1) = (cr ■ Ji - ii) (cr ■ Ji + ji + 1) = 0 . Then show that cr -Ji+ji+1 ji - cr ■ Ji ancj — — 2ji + 1 2ji + 1 are measurement symbols for j = j\ + \ and j = j\ —\, respectively. 3-10b Employ cr ■ Ji = az Jlz + \{ax + iay)(Jix - U\y) + \{Ox -'U7y)(Jlx +iJlj,) to show that cr ■ Ji + ji + 11 ! u I. .rl v h + m + \ 2jl+1 lTOi = m-*>m2 = s> = Ij=ji + ^TO)y 2il+1 ii - cr ■ Ji , j u , . . ! , /ii - m + I 2ft+1 lTOl = ™ ~ *'m2 = 2) = b = ^ - g,"»)y 2ii + 1 ■ Use them to check the orthogonality of \j = j\ + \,m) and \j = j\ — \,in). 3-10c Evaluate the expectation values of (az) 1 and show that the j~ji±^,m same result is obtained for the z component of the projected operator J2 J 3-11 Express jn of (3.6.1) in terms of the Eulerian angles <j>, 9, and ip. *John Stewart Bell (1928-1990)
180 3. Angular Momentum 3-12 Give alternative evaluations for e^Jy e—inJz e—iirJy emJz by applying the two different unitary transformations describing finite rotations that appear here. Repeat with Conclusion? 3-13 Justify the appearance of [U^m,]2 rather than \U(£m, |2 in (3.7.23) by showing that these matrix elements are real. 3-14a Use arguments that exploit the vector nature of null vector u to argue that i t' t' i i \ where the eventual application to (y'+ ,y_ ... \y'+,y'_) is understood so that v and ft can be regarded as numerical complex zero-length vectors. Then find the value of c/ by a special choice for f, such as vx = 1, vy = i, vz = 0. 3-14b Now show that l i t \l+m i t \l-m [y u) -2(20-jL {l + m)l (,_m)I and compare this with what (3.7.42) gives for the integral in Problem 3-14a. You should arrive at (3.7.45). 3-15 Recognize the orthogonality statement for Legendre polynomials, \f dxPl(x)Pl,(x)= S" 2 y_! ,w ' w 21 + 1 as a special case of (3.7.45). Use Po(x) = l, Pi(x)=x, P2(x) = |x2-i to check this explicitly for 1,1' = 0,1,2. 3-16 Show that f2 i'Krf*'-).
Problems 181 Which choice for v does then imply the integral representation P/(cos#) = / —— (cos# + i sin 0 cos </») Jo 27T of Legendre's polynomial? Use it to derive the familiar generating function ^ oo VI - 2xt + f2 ^ w which can serve as a convenient alternative definition of the Legendre polynomials.
4. Galilean Invariance 4.1 Generators of infinitesimal transformations The freedom of choice for reference frames includes more than rotations: one can displace the origin, translate it by a constant vector; or one can let that translation grow proportionally with time; the two frames are in relative motion at constant velocity. We'll consider only relative speeds that are small on the scale of the speed of light; see Problems 4-3 and 4-4 for other circumstances. Then time has an absolute significance (Galilean*-Newtonian relativity) apart from the freedom of displacing its origin. The infinitesimal transformations of these types are displayed by the space-time changes t = t - St , r = r — Sr , with 6r = Se + Scv x r + 5vt, (4-1-1) where 6t is a constant, as are the vectors Se, 6u), Sv. The accompanying unitary operator is U = \+\G (4.1.2) where, now G = 6e-P + 6u-J + 6v-N-8tH + 6<p, (4.1.3) and we want to recognize that we always have the freedom of a phase transformation. The names for the generators are derived from classical mechanics: P: linear momentum vector, J: (already familiar) angular momentum vector, H: energy; Hamiltonian (or Hamiltont operator), JV: no classical name, perhaps booster? But now we have to notice something. If we write [/ = 1+ iG, it is clear that G is dimensionless - it is given by pure numbers. But Se-P, the product *Galileo Galilei (1564-1642) fSir William Rowan Hamilton (1805-1865)
184 4. Galilean Invariance of length [L] by momentum [ML/T], or [ML2/T] - or equally well -StH: time [T] times energy [ML2/T'2}, not to mention 5ui-J: angle (dimensionless) times angular momentum [ML2/T] - has dimensions, those of action. It is clear that up to now we have been employing natural atomic units, not the arbitrary units of macroscopic physics. So, if we wish to use the latter, we must include a conversion factor: U =l+i^G, G = Se-P + --- + HSip, (4.1.4) where h, the unit of action, is (27T)"1 times Planck's* constant h. Experiment tells us that h= — = 1.05457 x HT27 erg sec = 0.658 212 eV fs (4.1.5) 27T (leV = 1.602177 x HT12 erg, electron-volt; lfs = HT15s, femto-second). It is important to recognize that the order in which these transformations, even infinitesimal ones, are made is important, in general. To use a familiar situation consider rotations. Compare 1,2: r -) r — 5\<jO x r -> r — Silj x r — 5-2^ x (r — 5\W x r) = r - 5\u x r - 6-2w x r + <52o> x (<5iO> x r) (4.1.6) with 2,1: r -> r — <52o; x r — S\U} x r + d\U x (<S2u> x r) . (4-1-7) The result of performing 1,2 and then the inverse of 2,1 is r -> r + 6210 x (61U) x r) —Siio x (<feo> x r) = <Siw x (r x 52w) = r - (6\U) x <S2uj) x r = r - <5[12]o; x r , (4.1.8) i.e., another rotation described by ^[12]^ = Siio x <52w = —8-200 x 8\ui . (4.1.9) From the viewpoint of unitary transformations we are saying that tWi / U1U2 and (UiU^UiUi = Um , U^ = (£/2^1)-^1^2 = £/[21] (4-1-10) which for infinitesimal transformations becomes *Max Karl Ernst Ludwig Planck (1858-1947)
or since 4.1 Generators of infinitesimal transformations 185 U[12] = 1 + iG[12] = 1 + ^(G1G2 - G2GX) (4.1.11) G[i2] = ^[Gi,G2], (4.1.12) ^ = (1 + -^)(1 + 1¾) 1(G2+Gl)-^ = 1 + ^+^)--^¾ (4.1.13) and ^ = (1 + ^)(1 + 1¾) l(Gl+G2)-l = l + ^(Gi+G2)--^GiG2. (4.1.14) And so we have 1 rz[6\U3 ■ J, <52o> ■ J] = (6\U} x <52o>) ■ J + M[12]y ■ (4.1.15) Now the only possibility for the scalar 6[r2}<p is a multiple of 61U) ■ <52o>, which is symmetrical in 1 and 2, not antisymmetrical. Hence S^r2]f = 0. Then, written as ^[J,6u- J] = Su> x J , (4.1.16) we recognize the characterization of a vector under rotations. This immediately tells us that the analogous considerations for the vectors P, JV, and J will yield ?-[P,6u-J] =Su> xP , ?-{N,6u>-J}=Su>xN, (4.1.17) whereas, for the scalar H, Uh,6cvJ}=0. (4.1.18) How about translations? As r -> r -S\e -> r — ^e — <52e (4.1.19)
186 4. Galilean Invariance indicates, we have S[V2]e = 0 (4.1.20) and ^[6ie-P,62e-P] = HS[12]ip, (4.1.21) where the only possibility of S^2jip oc 6\e ■ S2e shows that ^[6ie-P,62e-P] = 0, (4.1.22) or [Pk, Pi] = 0 and P x P = 0 . (4.1.23) Similarly, [Nk,Ni]=0, NxN = 0. (4.1.24) But when we come to ^[Se -P,SvN} = hSip = MSe ■ Sv (4.1.25) (dimension of M: mass) we can no longer conclude that Sip = 0 since two different vectors are involved. So j|[Pfc,JVj] = M<Jw. (4.1.26) With regard to transformations that include time displacement, consider t -> t - Sit -> t - Sit - S2t, r -> r -5xvt-+ r-Sivt-S2v(t-S1t) , (4.1.27) so that (1, 2) x (2,1)_1 leaves us with a net displacement S{i2]e = SivS2t - S2vS!t (4.1.28) which will have no counterpart in displacements or rotations. So ?-[6vN,-6tH\=6v6t-P+h6<p (4.1.29) or ±[N,H] = -P, (4.1.30)
4.1 Generators of infinitesimal transformations 187 whereas ^[6u-J,-6tH]=Q and hse ■ P, -StH] = 0 (4.1.31) imply [J,H] = 0 and [P,H]=0. (4.1.32) The commutators involving J are the response to rotations, distinguishing vectors and scalars. Now let's look at the P commutators, the response to translations. Prom the P equation in (4.1.17) we get hj,8e-P) =5eJ = 5exP, (4.1.33) and since also and of course j[P,5e-P} = 5eP = Q, (4.1.34) I- [N, Se-P]= SeN = -MSe (4.1.35) ^-{H,Se-P} =6eH = 0. (4.1.36) Both J and JV show a response to translation which can be expressed by a vector R such that SeR= ^-{R,5e-P) =6e, ^[Rk,Pl] = 6kl. (4.1.37) So 6e(J-RxP) = 0, Se{N + MR) = 0 (4.1.38) and we write J = RxP + S , (4.1.39) where the components of S commute with those of P, [Sk,Pi]=0. (4.1.40)
188 4. Galilean Invariance Since R is a vector we must have ^z[R,Su- J} = 5u xR = ^r[R,Su} xR-P + Su -S] (4.1.41) which is certainly satisfied if [Rk,Ri]=0 or RxR = 0 and [Rk,St]=Q. (4.1.42) Also, since JV generates a displacement proportional to t it must contain Pt, or N = Pt- MR . (4.1.43) In particular, for t = 0, N = -MR, and RxR = 0 follows from NxN = 0. Inasmuch as ii and P are vectors, so is L = RxP (4.1.44) and, in view of }- [L, Su ■ J] = ^- [L, Su-L} = SuxL (4.1.45) one has which implies that We see that Lx L = ihL (4.1.46) S x S = ihS . (4.1.47) J = L + S (4.1.48) is the decomposition into external or orbital angular momentum L, and internal or spin angular momentum S. We have now recognized that the system as a whole is described by position vector R, momentum vector P, which for each direction in space constitute a q,p set of operators; ^{Rk,Pl} = Skl, [Rk,Rl} = 0, [Pk,Pl}=0. (4.1.49) Accordingly all these operators have continuous spectra and have a classical limit.
4.1 Generators of infinitesimal transformations 189 Notice also that RL = RRxP = RxRP = 0 (4.1.50) which means that a rotation about the direction R has no effect, has zero quantum number, 6{ | =i( |<5o>-£ = 0 if SuxxR. (4.1.51) But zero is an integer and therefore all possible values of Hn L'2 =1(1 + \)K2 are integers, / = 0,1,2,... . (4.1.52) Now look at the information we have about H: [J,H}=0, [P,H}=0, ±[N,H] = -P. (4.1.53) The first says that H is a scalar, the second, according to (R'\P=^VR,(R'\ (4.1.54) (R components are compatible) says [P,H} = jVRH = 0, (4.1.55) H does not depend on R; the third is ~{Pt-MR,H] = -P (4.1.56) or But, according to (P components are compatible, too) (P'\R = ihVP,(P'\ (4.1.58) we have ^{R,H}=VPH = ^P (4.1.59) or p2 H=2M+ H'mt With VpHint = ° ■ (4.1.60)
190 4. Galilean Invariance 4.2 Hamilton operator for a system of elementary particles For us an elementary particle is defined as one without internal energy, or at least with inaccessible internal energy under the given circumstances. For atomic structure discussions the elementary particles are electrons and nuclei. For nuclear physics discussions, they are protons and neutrons, and so on. Let each elementary particle be described by independent variables ra, pa, sa and mass ma. Then we construct P,J,N additively a a N = Yi(Pat-mara)=Pt-MR, (4.2.1) where and indeed M = j>a, « = E^p» (4-2-2) ^'^^[E^'Epk = Ei7-^^ = ^- (4-2-3) = 1 =Ski We write Y,r"x Po = Y, [R+(ra ~ R)]xpa a a = RxP + Y,(ra-R)x(Pa- ~P) , (4.2.4) internal variables since Ytma(ra-R)=0 and £(pa - ^P) = 0 , (4.2.5) a a and get S = £ [(^ - fl) x (Po - ^P) + sa] (4.2.6) for the total internal angular momentum.
Problems 191 If the constituents were isolated from each other we would have a More general we write a with ^^^{V.-^P)\V, (4.2.9) where V, the potential interaction energy, is a scalar function of the internal variables and the sa and possibly others. Problems 4-1 Verify explicitly that L = Rx P obeys the angular momentum commutation relations (4.1.46). Can you think of a reason, based on the vector structure of L, for the fact that any component of L/H has only integer values? 4-2 Show that L-S commutes with J, X2, and S2. Then find the eigenvalues of L ■ S. 4-3 Einsteinian* relativity. Replace the first line in (4.1.1) by t = t <5eo ^Sv ■ r , c cl where c is the speed of light, and the Galilean form is formally recovered in the limit c -> oo if (l/c)<5eo -> St is understood. Show that the commutators are the same, with two exceptions: I- [Se -P,SvN] = (m + \h) Se ■ Sv and 1 1 — \Sxv ■ N,S2v ■ N] = —-S\v x S2v ■ J . in J c2 4-4 In consequence of these modified commutation relations, what needs to be altered in the equations introducing R and S? 'Albert Einstein (1879-1955)
192 4. Galilean Invariance 4-5 Photons have only spin angular momentum +1 or —1 along their direction of motion. (Incidentally, helicity is a more fitting term than spin under these circumstances.) A light beam is deflected through the angle 9. To what extent can you anticipate the dependence of the deflected beam's intensity on angle from the spin properties of a photon? [Hint: Recall Problem 3-5.]
Part B Winter Quarter: Quantum Dynamics
5. Quantum Action Principle 5.1 Equations of motion Consider infinitesimal displacements of the time origin, t = t-6t, (5.1.1) and the implied unitary transformation, U = 1 + ~Gt with Gt = -6tH, (5.1.2) n where H, the Hamiltonian operator, depends upon a set of variables for the system, va(t), and possibly on t itself. When we shift the origin, the variables are redefined, va(t)=va{t) (5.1.3) or where so that va{t) =va(t + St) = va(t) + 6t—va(t) = va{t)-6va(t), (5.1.4) Sva(t) = ^[va(t),Gt], (5.1.5) -Stj-tva(t) = ^[va(t),-6tH]. (5.1.6) This gives us the equations of motion ^va(t) = ±[va(t),H]. (5.1.7) More generally, consider any F(v(t),t), where v(t) stands for the collection of all va(tys. Then
196 5. Quantum Action Principle U~lFU = F-SF = F(Wlv(t)U, t) = F(v(t),t) = F(v{t + St),t) . (5.1.8) We write F(v(t + St),t) = F(v(t),t) +8t(J-t- ^j F(v(t),t) (5.1.9) where d/dt is the total time derivative and d/dt refers to the explicit (or parametric) t dependence, so d/dt — d/dt differentiates the time dependence implicit in the dynamical variables v(t). This gives SF = ^-st^ = -st{^-m)F (5-L10) or This is Heisenberg's equation of motion. It says that there are two contributions to the change in time of the arbitrary operator F: its explicit time dependence - the dF/dt term - and its dynamical time dependence, given by (i/i)"1 times the commutator of F with the Hamilton operator. The v equations of motion (5.1.7) are, of course, included since dv/dt = 0. By their nature, dynamical variables have no explicit time dependence. For vectors (... ,t\, with the ellipsis indicating a time independent set of quantum numbers, we have (■■■,t\ = (...,t\(l + ±Gt) =(...,t\-6(...,t\ (5.1.12) or 6(...,t\ = ^(.-. ,t\Gt = ^(...,t\StH. (5.1.13) To be more precise about (... ,t\, from the v(t) select a complete set of commuting operators, vca{t), so that, at time t, they can all be assigned numerical values, collectively denoted by v'c: (v'c, t\vca(t) = v'ca(v'c, t\ . (5.1.14) The unitary operator (5.1.2) turns these equations into (v'c,t\ vca(t) =v'ca (v'c,t\ , (5.1.15) = vca (t + St) so (v'c,t\ is the left eigenvector of the vca(t + St) with eigenvalues v'ca,
5.2 Conservation laws 197 JrtJ\ = (v'c,t + 6t\=(v'c,t\+6t%-(v'c,t\ . (5.1.16) dt Accordingly, which is S(v'c,t\ -^^(^1 = ^^(^)^)6^ (5-1-17) m^(v'c,t\ = (v'c,t\H, (5.1.18) Schrodinger's differential equation of motion. Its adjoint is -ih^\v'c,t) = H\v'c>t) . (5.1.19) These equations of motion come together in i«,t|F(»(l),t)|<,() = «• 'I {kHF+!;F+k^-lhFH) !»"•'> = <«;,*|—(«(*), *)K',*>. (5.1.20) If F does not depend explicitly on £, ft(v'c,t\F(v(t))\v'l,t) = 0; (5.1.21) the number (^,£|jF|w",£) is unchanged by the unitary transformation, applied to operators and vectors, that represents a change of t. 5.2 Conservation laws Let's see some general consequences of Heisenberg's operator equation of motion (5.1.11). First, let F = H: dH dH 1 rrr rrl dH ,rn_ In particular, if H does not depend explicitly on time, dH/dt = 0, that is, if H maintains its form under a time displacement, we have £ = o, (,2.2) which expresses the conservation of energy: H is a constant of the motion.
198 5. Quantum Action Principle Next, take F = Se ■ P: A 1 —Se-P = --[H,Se-P] = -SeH (5.2.3) where SeH is the change in H produced by a coordinate displacement. If H is unchanged by such a displacement, if it depends only on relative (internal) particle coordinates, the ra — R of Section 4.2, then the linear momentum is conserved, AP = o. (5.2.4) Similarly, F = Su: ■ J gives ^-6u ■ J = ~5„H , (5.2.5) and if H is unchanged by an infinitesimal rotation, if it is a scalar, the angular momentum is conserved, 1/ = 0. (5.2.6) Finally, consider F = Sv ■ N = Sv ■ Pt — Sv ■ MR, for which —Sv -N= ±L(6vPt) + r^lSv ■ N,H] dt dty ' ihl ' J = Sv-P-SvH. (5.2.7) For H = P2/(2M) + H-mt, as in (4.2.8), with an interaction Hamiltonian Hint that depends only on internal variables and is therefore unaffected by the transformation, whereas SVP = It [P, -Sv ■ MR] =SvM , (5.2.8) as one would expect, one gets SVH =^-- MSv =Sv-P , (5.2.9) M so -j-N = 0 . (5.2.10) The interpretation is the obvious one: In conjunction with the momentum conservation it establishes ±N=±(Pt-MR)=P-M±R = 0, (5.2.11) and tells us that the system moves with constant velocity P/M.
5.3 Sets of q,p pairs of variables 199 These isolated statements come together in the recognition that they are all concerned with unitary transformations between equivalent frames of reference. Given some physical state, | ), it is represented or described by the wave function (v'c,t\ ). Another description of the same state, in a different reference frame, is (v'c,t\) = (v'c,t\U(t)\). (5.2.12) But this is also the description, in the original frame, of the different physical state U(t)\ ). Both states must obey the Schrodinger equation of motion ih^(v'c,t\ ) = (v'c,t\H\ ) (5.2.13) and ih^t(v'c,t\U(t)\) = (v'c,t\HU(t)\) (5.2.14) which, in view of ihjt ((«;, t\U{t)) = (v'c, t\HU(t) + ih(v'c, t\ jU , (5.2.15) is only possible if jtU(t) = 0 , (5.2.16) so that for U = 1 + \G we get AG = 0. (5.2.17) Here is the basis for all the conservation laws, of H, P, J, N- y ■"■ i " > 5.3 Sets of q, p pairs of variables In R and P, [i2*,i%] = 0, [P*,Pj] = 0, ^[Rk,Pi] = Ski (kj = 1,2,3) (5.3.1) and, more generally, in the ra and pa, 1 [rka,rib] = 0 , \pka,Pib] = 0 , r^[rka,Pib] = Sab6ki , (5.3.2) we see independent q,p pairs
200 5. Quantum Action Principle 1 [Qa,qp] = 0 , \pa,pp] = 0 , r^[qa,pp] = Sa0 . (5.3.3) More than this, particle spins sa, as with any angular momentum, can be represented as [recall (3.4.10), (3.4.21)], /^j^XXH^kV (5-3-4) where, for each y, y^ pair, y 2 a,a' expresses them in terms of a Hermitian q,p pair; or one can use y, ihy^ as a non-Hermitian q,p pair, i [VoaiVa-b] = ° . [i%L.!%!'(,] = 0 , ^ [yaa,ihylb] = 5ab5aa, . (5.3.6) So, with great generality, we consider sets of q,p variables, and a Hamilto- nian operator that is a function of these sets: H(q,p, t). We recall the lessons of Problems 1-54, 1-55, and 1-56 - which are immediately generalized to sets of q,p pairs: [the basic commutation relations (5.3.3) are special cases] - and use them to get equations of motion for the qa{t), pa(t)- d 1 dH -riQa = 7T[qa,H\ = —- , at m opa 5* = >..*] = -^- d«) And now we can understand the origin of the classical Hamilton's equations of motion; they are already true at the fundamental quantal level! Next, we give the symbolic Schrodinger equation ^t{v'c,t\) = {v'c,t\H\) (5.3.9) a more explicit form. As an example (the most usual one) of a complete set of commuting operators, at time t, we pick the set of qa(t)\ any pp(t) must fail to commute with one of the operators {qp, of course). So we have an equation of motion for the wave function (q',t\ } = ^(q',t). (5.3.10)
5.3 Sets of g, p pairs of variables 201 Again recall that for a single q,p pair (one degree of freedom) (q'\eliPl" = (q' + q"\ (5.3.11) [cf. (1.16.19)] which, for q" -> Sq, and extended to any number of pairs, is d {q'\\Gq = (q'l^P^a = E^a7<«1 (5-3-12) rv rv *Qt or [cf. (1.16.29)] (l'Apa(t) = j^(q',t\, (5.3.13) which makes explicit that all operators and vectors here refer to a common time t. As before, we have the generalization [cf. (1.16.34)] {q',t\F(q(t),p(t)) = F (V, j^) (q',t\ , (5.3.14) if F depends algebraically on the p's. So, applied to the Hamiltonian, we get the numerical differential equation for the wave function: ififv^',0 = H (V, Y^7»*) M9',t) ■ (5.3.15) This is Schrodinger's differential equation of motion for q wave functions (frequently just called the Schrodinger equation). Equally well (q -> p,p -> —q) we have a Schrodinger equation for p wave functions, ih^(p',t) = H Aft A,j/ A ip(p>,t) ; (5.3.16) as a rule this is useful only if H(q,p,t) depends algebraically on the g's. But we can always construct one wave function from the other. Recall, for one degree of freedom, that the qp transformation function is [cf. (2.1.17)] <gy) = -i=eS«'p\ (5.3.17) V27rft where we check the appropriateness of \j\fh by verifying that J(q'\p')dp'(p'\q") = S(q'-q") = [~ &j<p'/*)W-<r)t (5.3.18) which is correct, the previous variable p' being replaced by p'/ft in (2.1.18).
202 5. Quantum Action Principle For n degrees of freedom, <«'|p'> = (<«i|-Rl)(|pi>-K>) = n<«-K> = (2^71 e^E-^-, (5.3.19) where, indeed, for example, JW) (dp^ipY) = Hl(q'a\p'a)dp'a(p'a\q'<:) = S(q' -q") . (5.3.20) = dpi • • • dpn =*(«;-«;') So, or and <«',«| >=y<g>'> (dp') <P'»*|>» (5-3-21) ^'' ^ = / (2^7* e* Eq ^ (dp#) ^ ° ' (5-3-22) ^-') = /(2^7^^^ WW.*) ■ (5.3.23) 5.4 Wave functions for force-free motion As the simplest illustration, consider a single particle of mass M (or system of particles without reference to internal motion), P2 H = —r . (5.4.1) Here p is constant in time, and it is natural to be interested in the states with various p values, |p"). In this situation we easily find the p wave functions: (p',t\p")=iPP„(p',t), (5.4.3) where |p") refers to t = 0. The Schrodinger equation (5.3.16) reads
5.4 Wave functions for force-free motion 203 i^P" (p', 0 = f^ <V' (p\ *) , (5-4.4) = E(p') so <V' (p,«) = e" W)*,J(p' - p") , (5.4.5) which incorporates the initial condition ^P»(P',0) = <P'|P"> = «J(P'-P") (5.4.6) and obviously describes the fact that a p measurement at any time will certainly yield p", and the particle has the definite energy E(p') = E(p"). Now we construct [three degrees of freedom, n = 3 in (5.3.22)] *"(r'',)' piW> I e±"r'"' W)W -p")^iEip']t that's a wave function! Standard notation for plane waves is j(k-r-wt)^ (548) and standard terminology calls k the propagation vector or wave vector and ui the angular frequency, related to the wavelength A and the frequency v by 27T 1 |fe| = — = -, u = 2nv. (5.4.9) A A We also note that A = \/(2ir) is the reduced wavelength; the inverse wavelength A"1 = |fe|/(27r) is called wave number, and \k\ is the reduced wave number. So (omitting primes) we have Planck's energy-frequency relation, E = hv = hu , (5.4.10) and de Broglie's momentum-wavelength relation, p = hk, |p|=£ = /i. (5.4.11) Naturally, the wave function (5.4.7) obeys the Schrodinger equation (5.3.15), fo§iMr,t) = ^(jVj <</>pM), (5.4.12) ~^E ->p2/(2M)
204 5. Quantum Action Principle and could have been found in this way also. While we're at it, let's find yet another kind of wave function for this simple system: (r',t\r"} = (r',t\r",t = 0), (5.4.13) which we get as (r',t\r") = J(r',t\p')(dp')(p'\r") = f ML e\r' p' e-i[p'2/(2M)]<e-ip' r (2-Khf <- -^{W-{r'-r")-i^p'2) = f ML e'i^W \P' - (M/t)(r' - r")]2 e|f (r' - r") J (¾¾3 6 = -—— eh2t(r r ) . (5.4.14) \2mhtJ y ' This time transformation function is conceptually important but, as a wave function, it is too idealized for direct physical interpretation. For the following, notice that in all the above, based on the x, y, and z motions are independent, as in j=l for example. So consider just the X\ = x, p\ = px = p motion in one dimension. Suppose that, at t = 0, we have a minimum-uncertainty state |<5), with (x) = xo, (p) = Po, and, of course, SxSp = \h. According to (2.3.6) and (2.3.7), the initial wave functions are of Gaussian shape, {x,t = Q\8)=^{x)={L?LjLeiP0{x~x0)eH^)\iK XoPo /OO -j --=,eixP&p^s(p) (5.4.17) -oo V2nh and (x -> p, p -> —x)
5.4 Wave functions for force-free motion 205 (p,* = o|«J> = MP) = ^7^- e~rMP~P°) 6-(¾1) 5p e 2ft Poxo = (2/tt)4 ^^(p-^^lKp-^f^lyo _ (5.4.18) The time dependence of the latter wave function is immediate: MP,t) = (p,t\5}= e-*EW%(p) . (5.4.19) In a first step, we then get tps(x,t) = (x,t\6) = (x\p)dp(p,t\6) = /^MZe|p(^-^o)e-[i(p-po)^]2e-i^e^oPo !2nh VSp (5.4.20) The exponent in this integrand is quadratic in p, with the p'2 term given by 2 e(t) V h ' where p5x\ i p2 1 /p5x (5.4.21) 1 , . ht The identity 2M(6x)2 MSx hM (5.4.22) (5.4.23) is useful when we complete a square to bring the whole exponent of (5.4.20) into the form 1 exponent = — p - e(t)p0 _ i_ x - xp e(t) [ h/Sx 2£[) Sx i i p20 fx-xo-pot/MY + ^x0P0 . (5.4.24) The p integration is now immediate and produces
206 5. Quantum Action Principle , 2 = (2/tt) t [e(t)«Jp/^] * e^°(K _ x°"> e-* 5&* (5.4.25) The corresponding probability distribution |^(x,0| =^r-e 2l **<*) j (5.4.26) [make use of Kee(t) = |e(£)l *° Se* it] is a Gaussian at all times and is, of course, properly normalized, &x\%l)S{x,t)\=l. (5.4.27) It exhibits the mean position (x(t)) =xo + j~t, (5.4.28) which grows linearly in time and thus confirms that the particle moves with constant velocity po/M, and the t dependent spread in position, 6x(t) = \e{t)\~~l6x = ^{Sx)2 + (tdp/M)2 , (5.4.29) which shows that the Gaussian distribution (5.4.26) broadens in time. The spread in momentum is time independent, Sp(t) = Sp , (5.4.30) of course, since I^(P,*)|2 = I^(P)|2- (5-4.31) This means that [Sx(t)Sp(t)]2 = Qfi)2 |£p2 = Qfi)2 + (W*)2 (5.4.32) so the uncertainty product increases in time, eventually linearly, 6x(t)6p(t)^Ml if ^»fi. (5.4.33) One should also observe that, at any t > 0, there is a minimum value possible for 5x(t):
5.5 Quantum action principle 207 2 ,, ,2 I ( fit V [**(*)] = (Sx)2 + (fe)2 ymJ = (**-^V + £>£- (5-4-34) V Sx 2MJ M - M y ' The minimum of Sx(t) occurs at t = T if (Sx)2 = \KI'jM or Sx/6p = T/M] in this optimal circumstance [<5x(£)] only doubles in time T. Not surprisingly, this time constant appears in e(t) of (5.4.22), £=(l+«/rrl with T = M~, (5.4.35) op and therefore also in 6x(t), Sx{t)/5x = y/\ + (t/T)2 > vWr , (5.4.36) which tells ns that T sets the time scale for the spreading of the Gaussian probability distribution (5.4.26); the lower bound is the one of (5.4.34), the equal sign holding for t = T. Relations such as (5.4.29), (5.4.30), (5.4.33), or (5.4.36) are more generally true than the particular initial state of minimum uncertainty suggests. This is the subject of Problem 5-8. 5.5 Quantum action principle In classical mechanics, Hamilton's equations of motion are deduced from an action principle. Is there an action principle in quantum mechanics? Yes. Here is the derivation. Consider the transformation function relating q states at infinitesimally different times: (</, t + dt\q",t) = (q',t\ (l - ±dtH(q(t),p(t), t))\q" ,t) (5.5.1) and focus, not on what this equals, but how it changes when everything on which it depends is infinitesimally varied. Recalling £rW,t\ = j:W,t\pa{t), §-t(<l\t\ = ~(q',t\H(q(t),p(t),t) = ~(q',t\H(t) , (5.5.2) we begin with &q',t+dt(q',t + dt\ = ~(q',t + dt\ (J2pa(t + dt)6qa(t + dt) - H(t + dt)S(t + dt)) (5.5.3)
208 5. Quantum Action Principle and Sq»,t\l"> *) = ~ji (£*>«(*)*?<*(*) - H(t)St) \q", t) (5.5.4) a which are combined in S'(q\t + dt\q", t) = i(g', t + dt\ (^ [pa(t + dt)«J<fc(t + dt) - pa{t)Sqa(t)] - [H(t + dt)6(t + dt)-H(t)6t])\q",t) , (5.5.5) where 6' - <V,t+dt +<V,*- Using d 3M pa(t+dt) =pa(t) + dt—pa(t) = pa(t) - dt— , ff (t + dt) = H{i) + d*— = ff (t) + d£— , (5.5.6) and neglecting consistently all second-order changes, we rewrite (•••) and get (■■■) =52pa(t)[6qa(t + dt)-6qa(t)] - di^l^t + dt) — a a v - HSdt - dt—5t dt =6'{j2Pa(t)[qa(t + dt)-qa(t)]-dtH(t)) dpa no- - £ SPa (t) [qa (t + dt) - qa (t)] +dt ^ —- 6pa = dqa=dt§£ =<*'(£>*(*)[<Za(* + dt)-<fc(t)] -dtff(t)) ■ (5.5.7) Here, as in (5.5.5), <5' is the injunction to vary everything except the dynamics, that is: except the form of the Hamilton operator. As for changing the form of H, symbolized by 6", we have S"(q', t + dt\q", t) = 6"(q',t\ (l - ±dt H) \q", t) = -j.(q',t\dt6"H\q",t) = -Uq',t + dt\dtS"H\q",t) (5.5.5 which fits right into the structure of 8' (■ ■ ■). So, with 6 = 8' + 6",
5.5 Quantum action principle 209 6(q',t + dt\q",t) = i(g',t + dt\6Wt+dt,t\q",t) , (5.5.9) where Wt+dt,t = dtL{t) with L = Y,P^-H . (5.5.10) a Note that the order in which pa and dqa/dt are written does not matter after variation, for 5pa and 5qa are numbers (times the unit symbol 1), which commute with all operators. Now we consider two consecutive infinitesimal time intervals: t -> t+dt -> t + 2dt: and compute S(q',t + 2dt\q",t), using the previous result for each time interval: S(q',t + 2dt\q",t) = J \(q',t + 2dt\SWt+Mt<t+dt |q, t + dt)(dq)(q, t + dt\q", t) + [(q',t + 2dt\q,t + dt}(dq)~(q,t + dt\5Wt+dt,t\q",t) so that = ±(q',t + 2dt\(6Wt+2dt,t+dt +SWt+dt,t)\q",t) , (5.5.12) SWt+2dt,t = SWt+2dt,t+dt + SWt+dt,t ■ (5.5.13) We see that the structure is maintained, with the appropriate W produced additively from the constituents. The evident generality of this lets us jump immediately to the statement for any finite time interval, the Quantum Action Principle: Siq'Mq"^) = ^(q',h\SW12\q",t2) with W\2 /"12 = / dtL(t) . (5.5.14) Jti W, with the dimension of H - action - is the action operator; L is the La- grangian, or Lagrange* operator, first met in (5.5.10). 'Joseph Louis de Lagrange (1736-1813)
210 5. Quantum Action Principle 5.6 Principle of stationary action The time transformation function (g', t\ \q", t2) depends on the choice of final and initial state, specified by the vectors (q',t\\ and \q",t2), and upon the form of the Hamiltonian operator that guides the time evolution. For a given Hamiltonian, the only freedom of change is of initial and final states, for which we write (J<g',*i|=^',*i|Gi, 8\q",t2) = ~G2\q",t2), (5.6.1) where the generators G\ and G2 are infinitesimal Hermitian operators constructed form the physical variables at the respective times. This gives 5{q'M\q",t2) = ^',*i|(<?i - G2)\q",t2) (5.6.2) from which we conclude that SW12 = Gi-G2. (5.6.3) This is the Principle of Stationary Action. It asserts that the infinitesimal variation of Wy2 - which, according to (5.5.14) depends upon the variables at all values of t between t\ and t2 - in fact involves only variations at the end points, t\ and t2, and so is stationary with respect to variations at any intermediate time. We now want to recognize that, conversely, the equations of motion and the commutation relations follow from this single, fundamental dynamical principle. In connection with 6t\ and 6t2, it is more convenient to regard t as a function of a variable r, t = t(r), where r is not varied, but the form of the function is, St = St{r) , 6ti = St{n) , St2 = 8t{T2) . (5.6.4) So, with the reference to r as integration variable left implicit, we have Wvi = / (5>«dfc - Hdt) . (5.6.5) Notice that, e. g., Sdqa{t) = S[qa(t + dt) - qa{t)] = 6qa (t + dt) - 6qa (t) = dSqa (t) (5.6.6) so, symbolically, <5d = d<5, (5.6.7)
5.6 Principle of stationary action 211 and SWvi = / (Z) (SP«d(l« + P^Qc) ~ SHdt - HdSt) ''2 a = / (Z (^«d«« ~ dP«%*) ~ SHdt + dHSt) ■'2 a + / d(j2PaSqa - HSt) . (5.6.8) ^2 a The last term refers to the end points only, ,.\ i d{^pa5qa-H8t) = (^pa8qa-H8t) . (5.6.9) Jin, n, 2 The stationary action principle therefore requires that the first integrand on the right-hand side of (5.6.8) vanishes at all intermediate times: The specific nature of the g's and p's is now introduced by asserting that the 8qa and Spa are numerical multiples of the unit operator or, better, that they commute with all operators q and p. Then we can infer that dt and arrive at where dH 0pa ' dpa dH dH dt dqa ' dt 5Wr2 = Gi - G2 G= ^2pa5qa - H5t dH dt (5.6.11) (5.6.12) (5.6.13) at each boundary. Prom the interpretation of Gt = ~HSt (5.6.14) as the generator of infinitesimal time displacements, we infer the Heisenberg operator equation of motion and the Schrodinger vector equation of motion. The interpretation of Gq = ^2paSqa (5.6.15)
212 5. Quantum Action Principle as the generator of infinitesimal q displacements, *<F=hF>G<] or jfc = hF^> (5-6-16) produces the basic commutators k[9a'Pf,] = % = SaP ' k^M = We = ° • (5'6'17) We do not, in this way, get [qa, qp] = 0. But, of course, the latter is implicit in the choice of compatible q states, and the interpretation of the effect of Gq on these states: Sq(q',t\ = (q',t\^Gq (5.6.18) or ~^r(q',t\=(q',t\Pa(t). (5.6.19) On the other hand, we could have initially chosen as the complete set of compatible physical properties - commuting operators - the totality of the p's. Then we would have been led to the action principle <*<p',*i|p",*2> = ^,^(^ d*£P)|p",*2> (5.6.20) ^here L. = - V dt lp = -J2^t^-h (5-6-21) is so labeled to distinguish it from a the time derivative term of Lp is produced from that of Lq by the substitution q -* P) P -* ~q, and conversely. And since the operator equations of motion maintain their form under this substitution, d* " dPa ^ dt - dqa ' {b-b-26} they are also produced by the new action principle. What is different is G now appearing as
5.7 Change of description 213 G = GP + Gt, Gp = -Y, Q°8pa ■ (5-6-24) The operator significance of Gp, 1 riW 1 ^=^,] OT fl^i^' (5-6-25) gives ^'^] = ff=^ md ^[&'^1 = ^=0' (5-6-26) Thus, all the fundamental commutators are produced when both Lagrang- ians, Lq and Lp, are employed. 5.7 Change of description Although we used the q -> p, p -> —q substitution to recognize that Lp produces the same equations of motion as Lq, the more fundamental observation is that Lq-Lp= —^2paqa, (5.7.1) for, in general, if L-L=^-w (5.7.2) the Lagrangian L produces the same equations of motions as L: Wl2 - Wl2 =Wl-W2, (5.7.3) and the stationary action principle applied to W\2 gives the same result as W\2, since the difference refers to the boundary, not the interior of the time interval. The remaining boundary terms are (d - G2) - {Gi - G2) = Swx - 6w2 (5.7.4) or G-G = 5w (5.7.5) at the terminal times t\ and t2. In the example of Lq and Lp we have Gq~Gp = ^2pa6qa + ^2QaSPa = s^2qaPa ' (5.7.6)
214 5. Quantum Action Principle which leads directly to the transformation function connecting the q and p descriptions: W> = ^W\(Gq-Gp)\p'} = l(sJ2^P'a)W\p') (5-7-7) or (q> \p>) = I ei £« <£p« , (5.7.8) where the multiplicative constant emerges from the requirement that l(q'\p,)(dp')(p,\q") = S(q'-q"), (5.7.9) as in (5.3.20). 5.8 Permissible variations We began knowing all about p's and g's. Now let's turn it around and take as our starting point the quantum action principle: <5(1|2} = ^(1|<W12|2) with W12= f dtL (5.8.1) where a with no a priori knowledge of operators p and q. For brevity we introduce the notation A.B = ]-{A,B} = ^(AB + BA) (5.8.3) for the symmetrized product of two operators; the symmetrization ensures that the products pa . dqa are Hermitian. What additional input is required to infer the specific properties of the operators p and q? And how wide is the class of permissible variations? For a given H we have the stationary action principle, 5Wr2 = d - G2 = I (^2 iSP* ■ d<7« ~ dP* • ^«1 ~ SHdt + dHSt) ^2 a + I d{^pa . 8qa - H8t) (5.8.4)
5.8 Permissible variations 215 [(5.6.3) and (5.6.8) with symmetrized products] with the inference that <"'•£(>*■■%-%■«>) + %«■ <«5' a To this point the Sqa and Spa are unspecified operators. We now assume that among the possibilities are operators 6qa, $pa that commute with the g's and p's. Then all is as before: dH _ dqa dH _ _ dpa dH _ dH dpa dt ' dqa dt ' dt dt and well (5.8.6) Gq = 5>«<%> (5-8-7) a GP = -J26paia (5-8-8) if w = E«fc -Pa in (5.8.2). Returning to the general expression for SH, we can now write (5.8.9) The commutation relations of the g's and p's are maintained if these variations appear as an infinitesimal unitary transformation, <%* = rr[qa,G] = j— , ifi Opa $Pa = rz\pa,G] = - — , \h dqa dH 1 5H = ^—5t+r^{H,G] (5.8.10) or *™-?(£•£-£• £)-««>• <-" But we know from Problem 5-10a that this identity of commutator and symmetrized Poisson* bracket holds, for arbitrary H, only if G is less than cubic in the g's and p's. Another point to recognize is that one must have * Simeon Denise Poisson (1781-1840)
216 5. Quantum Action Principle BC G = £>*• 5qa = Y.P«- a". (5-8-12) for instance, which requires that the p dependent part of G - which produces Sqa - must be linear in the p's; likewise, G must be linear in the q's. Here we recall that, in general terms, a function / of a number of variables, say xi,... ,xn, is linear if /(Axi,... ,Ax„) = A/(xi,... ,x2) ; (5.8.13) we put A = 1 + SX, f(xx +6\xx,... ,xn +6\xn) = f(x) + 6\^2xk-— k = f(x)+SXf(x), (5.8.14) and conclude that 5>£- = / (5-8-15) is characteristic of a linear function. Of course, Gq = 2_\Pab(la with numbers Sqa (5.8.16) a is linear in the p's and q's and thus satisfies the linearity requirements on G. But now we see an additional possibility: a G that is linear in the p's and linear in the q's, therefore quadratic in the q's and p's, G = ^Pa ■ 9apqp , Ha = q— = ^Oapqp , %J = -g— = - ^PaOap , (5.8.17) where the gaps are numbers. We shall make use of this possibility later, in Section 8.3. Problems 5-1 Dynamical variables va have a dynamical, but no parametric time dependence. By contrast, probability operators (see Problems 1-30) do not change in time at all, P(v(t\),t\) = P{v{t-2),t-2)- Conclude that
Problems 217 which is frequently called the von Neumann* equation. Show that the dynamical variables can refer to any common time in this equation. 5-2a Descriptions at different times t and t' are related to each other by the unitary evolution operator U, (...,t\ = (...,t'\Ut,t,(v(t)), which has a parametric dependence on both times and a dynamical time dependence through the dynamical variables v(t) of which it is a function. Show that U obeys the equations of motion ih~Ut,P = HU , ih^-Ut,t> = UH , at at where H = H(v{t), t) is the Hamilton operator. Note the fundamental difference between taking the partial and the total time derivative of the evolution operator. 5-2b Show first that the replacement v(t) -> v(t') does not change Ut,p as a whole, that is: Ut,f(v(t)) =Ut,f(v(f)) , and then verify the group property Ut,f(v(t))=Up,f(v(f))Ut,t.(v(t)) = Ut,f(v(t))Ut.,f(v(t)) or tersely: Ut,t» = Ut,t>Ut>,t>>- 5-3 Show that the spread 8A of observable A and the spread 6H of the Hamilton operator H obey the inequality SASH >~ 5-4 A slit is opened for the time interval St and an emerging beam of particles moves along the x axis at the average speed _ dE dp What is the length Sx of the beam? How large is Sp, the unavoidable spread of momentum in the beam? Show that the related spread of energy, SE, is such that SESt>h. IAA _ 8A\ \dt dt J *John (Janos) von Neumann (1903-1957)
218 5. Quantum Action Principle 5-5 Stern-Gerlach experiment, spin |fi: z magnetic moment, ±fi; field in- homogeneity, (d/dz)Bz(z); time in field, t. For the experiment to work, the positive, or negative, z momentum acquired in time t must be large compared to (half) the inherent spread of z momenta in the beam, Spz. Write out this condition. Now, Heisenberg's uncertainty relation requires a minimum spread of z values, 6z. That implies a corresponding spread in ¢, the angle of rotation of the angular momentum s in the xy plane, during time t. Write down <p, according to the torque equation based on /x = 7s [if this doesn't come to mind, derive it quantum mechanically from the Hamilto- nian H = —jszB2(z)]. Evaluate dip. What known non-classical property of Stern-Gerlach measurements follows from the resulting inequality obeyed by 8(j)l 5-6 For the Stern-Gerlach experiment on an atomic doublet, let E be the energy difference between the two states during the time interval St that the atoms are in the magnetic field Bz(z). What is the spread, 6E, produced by the width of the beam, 5z? By using the condition for the experiment to succeed, show that 6E6t>h. 5-7a One degree of freedom: Hamilton operator H = p2/(2M); minimum uncertainty state |<5) at time £ = 0. Solve the equations of motion and evaluate [6x(t)]2 = ([x(t)-(x(t))s]2)g and [6p(t)]2 . 5-7b According to (5.4.34), the inequality [Sx(t)}'2 > Ht/M holds. Give numerical values to this lower limit of 5x(t) for an electron at the times t = 10"16s; Is. For an object of mass lg, how large would t have to be in order that Sx(t) > 1 A ? 5-8a Reconsider the situation of Section 5.4, mass M moving freely along the x axis. But now do not assume that there is a minimum uncertainty state at t = 0; rather think of any arbitrary initial state. Solve the Heisenberg equations of motion to find the expectations values (x(t)), (p(t)), ([x(t)J2), (\p(t)]2}, and 5 ([x(t)p(t) +p(t)x(t)]) in terms of their initial values. Use them to show that 6x(t) = Sx ^l~2tt0/T2 + {t/T)2 with T as in (5.4.35) and *o = ~Zwp [f (W+Px) - (x) (P)] ■ What is the physical significance of £q ?
Problems 219 5-8b How does the restriction in Problem 2-17a follow? [What is 5q dp there is 5x5p/h here, of course.] 5-9a Recall Problem 2-8a and show that U(q',p')U(q",p") = e~l^l'p"~l"p'^U(q' + q",p'+p") , U(q",p")U(q',pr) = eWp"-4'p')u(q' + <?">P' +p") . 5-9b Any F(q,p) can be written as F(q,p) = J^-f(q',p')U(q',p') and similarly G(q,p) = J ^^9(q",p"W(q",p") ■ To justify this assertion express f{q',p') in terms of the normalized matrix elements of Problem 2-7a. 5-9c Prove that e~hDFG= e2DGF where (introducing H) ^ ». ( d d d d D = n dqF dpa dpF dqG J in which d/dqp, for example, means differentiation with respect to q in F only. 5-10a Introduce the notation [cf. (5.8.3)] F.G= 1(FG + GF) = UF>°} for the symmetrized product of F and G, and use it to rewrite the result of Problem 5-9c as where ' ; h dq dp dp dq
220 5. Quantum Action Principle is the symmetrized Poisson bracket operator. Recognize that ±[F,G] = (F,G) if either F or G is less than cubic in q or p, or if both F and G are less than cubic in q, or in p. 5-10b Make a direct evaluation - that is [F1F2,G] = Fx[F2,G] + {Fx, G]F2 et cetera - of (ifi)"1 [g2,p2] and (i/i)"1 [g3,p3]- Compare the results with the predictions of Problem 5-10a. 5-11 In Section 5.5 we found S(q',t1\4',t2) = ^,t1\SW^)\q",t2) with the action operator 4M) = /2(Epa.^-M), a and in Section 5.6 we observed that r-2 wip = / (- ^2qa ■dpa ~Hdt)' is needed in Now consider and w*i^M = ^*i|^i(rV>fe>- 8(4,^,^) = ^,^^,12) Kp'M^" ^) = ^^^^^^)- Show that the appropriate action operators are a
Problems 221 and a = Wi(2pp)-E^(*2)-««(*2)- respectively. 5-12 Show that <5o> ■ J, the generator of infinitesimal rotations, is of the bilinear form in (5.8) if J = r x p + s is constructed from the position vector r, the momentum vector p, and the spin vector s of the quantum object considered.
6. Elementary Applications Let's see the action principle at work solving problems. We'll consider the one-dimensional motion of a particle (position x, momentum p, mass M) without any force acting on it, exposed to a constant force, and under the influence of a linear restoring force. 6.1 Time transformation functions 6.1.1 Free particle The Hamilton operator H=^ (6.1.1) gives 6(x',t\x") = ~(x',t\ [p(t)6x' - pSx" - St^} \x") (6.1.2) where p(t)=p, x(t) = x+^t, [x,x(t)]=~ (6.1.3) are immediate consequences of the Heisenberg equations of motion dx{t) __ p(t) dp{t) dt M ' d* 0 ; (6.1.4) here and below we write \x") = \x",t = 0), p = p(t = 0), and x = x(t = 0) for brevity. So S(x', t\x") = ~(x', t\ [y (x(t) - x) (Sx' - Sx") - 6t~ (x(t) - xf] \x") (6.1.5) with
224 6. Elementary Applications ,2 , ,.,,2 2 (x(t) - x) = (x(i)) + x - x(t)x - xx(t) = x(t)x + iht/M v2 „ ,,, 2 i^ = (x(t)) -2x(t)x + x2- — . (6.1.6) Now that [• • •] of (6.1.5) is in a form where x and x(t) stand next to their respective eigenvectors, we can introduce eigenvalues and get first S(x',t\x") i \M. , „... , . „, StM. , „.2 ..1 St -r r—r- = r —(x x )(Sx' - Sx") - ^--(x - x'r + ift- — (x',t\x") h[t y n ' t2 2 y ' 2 t and then Aflx'-x-r + logi + logC <x',«|*»>=^eiS<«'-*">\ Vt (6.1.7) (6.1.S The value of the multiplicative integration constant C follows from the initial condition: (x',t\x")-t(x'\x") = 6(x'-x") for£->0. (6.1.9) Now d(x'-x")(x',t\x") = C 2-K\h (6.1.10) is actually independent of t so that it must equal unity, and we get C=\/A, <*', Ax") = J^Md* - *")2 ; (6.1.11) V 2-K\h ' \ ' I / V 2-kM k ' we have seen this time transformation function before as the factors in (5.4.16). 6.1.2 Constant force Next consider the Hamilton operator P2 H = —r-r — Fx with constant F. 2M The Heisenberg equations of motion dx{t) _ p(*) dp(t) dt ~ ~W ' ~d~T F, (6.1.12) (6.1.13)
6.1 Time transformation functions 225 are, of course, those of motion under the influence of constant force F. They are solved by p(t)=p+Ft, x(t)^x+^t+^~t2 with the consequence [*> *(*)] = ^ • (6.1.14) (6.1.15) Note that this commutator is F independent. The quantum action principle states here that 6(x',t\x") = -(x',t\ p(t)6x'-p6x"-6t(j£j-Fx c") . (6.1.16) We bring it into a more useful form by expressing [• • • ] in terms of x and x(t), with x(i) to the left of x in products so that they stand next to their respective eigenvectors. First, we use (6.1.14) to establish p{t) = ~ [ x{t) - X + M ( _ Ft2\ (6.1.17) t V~v/ ~ ' 2MJ ' where we note that initial and final quantities are interchanged, x(t) <+x , p{t)^p (6.1.18) on letting t «-» —t. That would be more evident had we continued to label the two times as ti and t%, with t = t\ — t2- Then we are just talking about the interchange of labels 1 and 2. Second, we then use (6.1.15) to get p2 M 2M m 1 FH = W2{W)]2-^)x + x2)-l--~F[xit)^x\+m Therefore, the [• ■ •] of (6.1.16) turns into r i M (6.1.19) ,,Fi2\ M(, „ Ft2\ , X H — OX ii -1 1 or. 2MJ t x — X 2MJ St j M . , 2 ifi 1 , FH2 „\ \W^~X) -2t-~2F{x-x) + jM~Fxj AM< < ^2 -tl c 1^, , „, F2t3\ [2t )^^- + -^^)--^ hS{x',t\x") h , , , „, = --r 1—r- = -ologix ,t\x ) i (x',t\x") i N ' ' (6.1.20)
226 6. Elementary Applications and the time transformation function (x',t\x") M 2n\ht LM.It,' - r"\2 i pfx'+x" i F^f e» 2t vx x I eft 2 e ft 24m (6.1.21) follows. The value of the integration constant C, as required by (6.1.9), is the one given in (6.1.11); it is accounted for in (6.1.21). 6.1.3 Linear restoring force: Harmonic oscillator Now consider the Hamilton operator H = m + \"^ with constant ui. The Heisenberg equations of motion dx(t) p(t) dp(t) dt M ' dt -Moj'2x{t) (6.1.22) (6.1.23) are those of a harmonic oscillator; it's motion under the influence of a linear restoring force: proportional to the distance x(t) and directed toward the equilibrium position x = 0. The familiar solutions of (6.1.23) are 1 x(t) = xcos(iot) + -—-ps'm(ujt) p(t) = pcos(ujt) - Mujxsm{ujt) , with the consequence \x, x(t)] = —-— sin(u)t) . 1 J Muj Here, the quantum action principle says 6(x' ,t\x") = —(x',t\ 1 n ' p(t)Sx' - p5x" - St (t. \1M 2 Muj'2 + ———: (6.1.24) (6.1.25) (6.1.26) and we express the operators in [• • • ] in terms of x and x(t): First we have Muj p ■ \x(t) — xcos{ojt)\ sm(uji) (6.1.27) where we again note that t ■<-» —t interchanges initial and final quantities, and then
6.2 Short times 227 p2 Muj2 /r .^,-,2 .,^ ^ 22/ ^\ ifecos(w^) £- = ^-—- x £) -2x £)xcosw£ + x2cos2 w£) --^—. ; ' , (6.1.28) so that in (6.1.26) [... 1 ^ J^L. \x< cos(^) _ x»l fe' + J^_ rx» cos(^) _ a-'] 6x" l J sm(ut) sin(w£) \2sin2(w0 v n 2 sin(wt)/ = *L w^ J(x2 + x,2)cosM)"2x,x"]-ifilog./^-l \2sin(w0 ; v ; J 6VsinM)J = ~5\og{x',t\x") . (6.1.29) Accordingly, we get (x^^'')=C¾/^^e^^¾^'2 + K''2)C0S^-2KV'] . (6.1.30) For w£ <C 1, this becomes (6.1.8), and therefore the t -> 0 limit of (6.1.9) requires that the integration constant C has the value given in (6.1.11), and so we arrive at the time transformation function (x> t\x") ~ , MU 1 c*™^* + * )cos^) - 2x'x"} (6 1 31) {X,t\x ;-y27TifisinM)e • ^-^) We note that the ambiguity of the square root of isin(w£) is only apparent; see Problem 6-12 for details. 6.2 Short times We have just seen that we regain the free particle as t -> 0; there is no time for the force to act. Suppose now that t is small, wt<l, but not approaching zero. It is helpful to first rewrite the exponent with the aid of the identities (x'2 + x"'2) cos(ujt) - 2x'x" = {x1 - x")2 cos2{\ut) - {x1 + x")2 sin2(Ac^) , sm(iot) = 2sm(^iot)cos(i;U)t) , (6.2.1) so that i i i n\ / Mu} LMnL(r'- r"\2 cotl^tot) (x',dx") = W-——T-7—eft 4 (x x I <™(2wt) 1 Mw-{x' + x"ft:m{\wt) x e~^~T(x +x > ^(2^) . (6.2.2)
228 6. Elementary Applications Now we use, for wtCl, cot(-ojt) = (^ojt)"1 + terms of relative order (ojt)2 tan(~ojt) = \u)t + |(|w£)3 + terms of relative order {ojt)4\ . (6.2.3) We keep an extra term in the expansion for the tangent because it is multiplied by (x1 + x")2, which can be large whereas (x' — x")'2 is necessarily small, of order ht/M = [H/(Muj)]ujt. Indeed, we now shift the origin to a point x, x'->x + x', x"->x + x" with |x'|,|x"| < \x\ ; (6.2.4) then U>t\x") s JL. eif (*' - *"? e-i^C^2 + *(*' + *")](¥ + #) . N ' ' ' V 2-k'M (6.2.5) Recognize that the potential energy and the force at the point x = ~x are V=\Mu2x2 and F = ~Mu2x , (6.2.6) respectively, so that the last exponent in (6.2.5) becomes = 4w +iFt*±£ 4^3 (6.2.7) and the approximation i',ii = \hr~r^Tti- ' e^*-^-e nwl1 e nVt 6.2.8) 1 ' V 27rira obtains. Except for the last factor this is the transformation function (6.1.21) for constant force F. In the latter discussion the potential energy V = —Fx is zero at x = 0. Here, at x = 0, which is the physical point x, the potential energy is V of (6.2.6). Recall for a moment that the unitary operator for an infinitesimal time displacement is 1 + j^(-dtH). If the system is conservative, H independent of t, then the repetition of this a number of times equal to t/dt gives the unitary operator for a finite displacement of t, / • \ t/st lim [1-UtH) = (TTitH . (6.2.9) (5(-)-0 \ h J This means that
6.3 Harmonic oscillator: Energy eigenvalues 229 (x',t\x") = (x'\e-ktH\x") . (6.2.10) So, if we change H by adding a constant V, the transformation function changes by the phase factor exp {~j{Vt^ as observed. Incidentally, although we derived the small-£ approximation (6.2.8) for the oscillator potential, all that enters is the potential and the force at the reference point x, V(x) = \Mj2x2 = \Mu2[x~ {x -x)}2 = V-F(x-x) + ±Moj2{x-x)'2 , (6.2.11) where V = V(x), F=~^{x), Mu2=~(x). (6.2.12) ax ax1 Upon using these equations to define V, F,cv2 for an approximate description of an arbitrary potential energy in the vicinity of position x, we get a "local- oscillator approximation" for any V{x). Note that we have simplified the pre-factor in (6.2.2) by putting sin(w£) -> cot. However, the next term in the expansion sinx = x — |x3 can be included, giving an additional factor of 1 , , t2 d2V 1 + l2^ =1+12Md^- See Problems 6-16 for some more details. 6.3 Harmonic oscillator: Energy eigenvalues Let's return to the oscillator result (6.1.31) and set x' = x", and integrate. This is /OO l-OO . . dx'(x',t\x,} = / dx' (x'\e~nm\x') =tv le-km\ -oo J—oo = r M Li^e-iM^'2tan(!^). (6.3.1) We have done such Gaussian integrals before, but here we have a situation in which the integral does not converge if tan(i<x>£) = 0, that is Lot = 0, ±2n, ±47T,.... Therefore we regard it as a limit in which t -> t — ie, with e -> +0. Indeed -to tan(|w(* - ie)) = tan(|w£) - ie—^—- (6.3.2) gives the convergence factor
230 6. Elementary Applications 1 Mw2e r / 1 -i\1-2 /2 e-I-8™[COS(2^)] * _ (^33) Now we get [the identity 2 sin(ujt) tan(-ujt) = (2sin(^w£)) is used] , ,,, , Muj j nh tr y v ' V v z ' 1 1 (6.3.4) V 27Tiftsin(w^) y iMwtan(iw^) 1 _ 1 2isin(|w£) e|w£_e-|urt At this point we must remember that t is actually t — ie, so | e lw(* le) | = e—ea; < 1, for which reason we write — -lot °° °° tr { e-iHt\ = G 2_[ut = e-2wt J2 e~imJt = E e~ft(n+ ^ ■ e n=0 n=0 (6.3.5) We know that [designating an eigenvalue of H by E, and its multiplicity by m(E)} trle-im\^^m{E)e~iEt (6.3.6) E and conclude E=(n+±)hu, n = 0,1,..., m(£) = 1 . (6.3.7) Of course, we know all about the n + | part. If we take //=4 + ^ (6.3.8) 2M 2 v ; and introduce the dimensionless variables g,p, —-q, px = Vrnftup, rrfoPz] = T[g,p] = l , (6.3.9) into the Hamilton operator (6.1.22) we have H = hu£^-, where (?-±±-\=n+% (6.3.10) are the eigenvalues found in Section 2.4. Indeed, the stationary-uncertainty states In) discussed there, then characterized on purely kinematical grounds, are now recognized to be the energy eigenstates of a harmonic oscillator, thus acquiring an equivalent dynamical characterization. We'll return to this in Section 6.5.
6.4 Free particle and constant force: State density 231 6.4 Free particle and constant force: State density What happens when we carry out the trace calculation for the free particle and the constant force? If we set x' = x" for the free particle we get from (6.1.11) <*'*'> = & <6A1» which may not seem so informative. It helps to recall that -i= e^P' dp' e-sfw* ~= e~iP'x" , (6.4.2) -oo v2nh V2-kH (x',t\x')= I ^e4fi' (6.4.3) so that It1 .t.\x'\ = /_00 2nh which, on integration, gives back the stated result. Now (dropping primes) with E = If ' (6A5> which makes evident that the spectrum of H is 0 < E < oo. We also see a correspondence between quantum states and classical phase space, an integrated range of dx dp equal to 2nh corresponding to one state. [In three dimensions it would be an integrated (dr) (dp) equal to (2nH)3.] A more precise way to put it considers a large finite range of x (J dx -> L). and recognizes that, counting positive and negative p together, dp -> 2dV2ME = dE \j~ , (6.4.6) so *{'-*"%= L iEtnf-f^m-' <6-«> evidently, m^-Lr-f <"-8> is the number of states per unit energy range over distance L, so
232 6. Elementary Applications d .,_,. 1 /2M ,„„^ is the state density: the number of states per unit energy range and unit x range. Similarly we rewrite the constant-force result (6.1.21), / .i \ I M lFxt -i£^ (X, £ a;) = \ 7r-rr-e* e ft 24m N ' ' ' « 2ttiht What do we do with f^e-i(^-4e-ill. (6.4.10) J_00 2-Kh i F2t3 i 3 1 / F2 \ 3 e-ftW=e-^, T=_W_J ? (6.4.11) Write it as a Fourier integral: e-P / —OO where /OO daei(TTAi(a) , (6.4.12) -00 /00 J . ; 3 1 rOO — e~laT e~3T =- dr cos (ar + |r3) (6.4.13) -00 2t tt ./0 is Airy's* function. It obeys a simple differential equation, easily established by partial integration, d2 a -r ; f°° dT -utt 2 -ir3 d^Ai^=-.L2^e r e 3 dr _iCTT. 9 _iTs e ri-^-e sr .^ 2tt dr = crAi(cr), (6.4.14) and is normalized to unit integral, /00 do-Ai(cr) = l, (6.4.15) 'OO as we see by potting r = 0 in (6.4.12). Now we find that tr \e~^Ht} = / dx(x,t\x) J —OO (6.4.16) "Sir George Bidell Airy (1801-1892)
6.4 Free particle and constant force: State density 233 With and varying p. we get pdp E= -£-- Fx- -^r(HMF)i (6.4.17) 2M 2My ' y ' dE , p=±\J2ME + 2MFx + a{hMF)3 . (6.4.18) So tr|e~iH*}= fdEe~iEt f daAi(a) I dx M nh ^2ME + 2MFx + a{hMF)\ (6.4.19) where the integration is over those values of E, a, and x for which the argument of the square root is positive. The integral (6.4.15) ensures that, for F = 0, this is the free-particle trace of (6.4.7), as it must be. Now for F > 0, we take the x integration from where the argument of the square root vanishes to a large positive value L, I L dx M nH '2ME + 2MFx + a(hMF)i = f -^=^~d^2ME + 2MFx + a{HMF)i 1 /2ML L E a ( K2 Y = ^^Y+fl + YlKmf) ■ ^4-20) With L so large that l \E\ ( h2 \ 3 L»y and L»(—J (6.4.21) and noting that only a values of order unity contribute significantly (see Problem 6-29 for a refined treatment), this gives trje-^l =/ dEmL(E)e~JhEt with mL(E)^^2-^ (6.4.22) and the state density
234 6. Elementary Applications lm^^te- (6A23) where the range of E is -oo < E < oo, for unlike F = 0, where 1/y/E appears in (6.4.8) to exclude E < 0, there is no restriction on E here. The nature of the energy spectrum for a constant force is easily understood. Given the Hamiltonian (6.1.12), one gets an equivalent Hamiltonian by the displacement x -> x + x$ H=^-Fx-Fx0. (6.4.24) Clearly the spectrum of p2/(2M) — Fx can have no lower limit, for if it did, by choosing Fxq > 0 one gets an even lower value. Similarly for an alleged upper value. 6.5 Harmonic oscillator: Energy eigenstates The importance of the Hamiltonian in describing time evolution naturally directs attention not only to the eigenvalues of H but also its eigenvectors. All that information is in the time transformation function. Let the, possibly multiple, eigenvectors for energy E be labeled by E, 7, so that e~iHt = Y,\E^)e~^Et(E^\ ■ (6-5-1) Then, for example, (q',t\q")=(q'\e~im\q") = Yj{q'\E,1)e~iEt{E,1\q") (6.5.2) B,^ ' " ' = 4>E,7(q') =4>E,',(q")* identifies their q wave functions. Hence knowledge of a time transformation function gives one all the eigenvalues and wave functions of H. As we have seen, the trace is a special application that focuses entirely on eigenvalues: ti{e~iHt}= Jdq1 (q'\e~iHt\q') = '£e-TiEt f 6q'\iPeM) Y^ e~iEt = Y,m(E) e~^Et ■ (6-5-3) E,j E
6.5 Harmonic oscillator: Energy eigenstates 235 Let's illustrate this first with the discrete spectrum of the oscillator, starting with [dimensionless variable q = ^/Muj/Hx] tf.^ys^.'i*") \/27ri sm(ujt) 3l^][(g'2+g"2)cosM)-2gV'] : ea^ nSe 2- l(q'2 + q"2) sjl - e~2iujt x e-(l - e"2^)-1^'2 + q"2 - 2^6-^) _ (6.5.4) The last factor, a Gaussian in two variables, is usefully presented as a double Fourier integral. We stop for a moment to look at this n-dimensional Euclidean Fourier integral j(du) e~uTAu + 2iwT^ = f(du) e~uTAu + [uT* + [^u (6.5.5) where u and <j> are real n-component columns (uk = u*k, 4>k — ¢1), A is a real, positive, symmetric n x n matrix (Aki = Mh = A*kl), and their products in the exponent stand for the obvious, namely uTAu = ^^UkAkiui , ur<j> ~ ^2ui4>i = <j>Tu . (6.5.6) k i Complete the square in the exponent, then substitute u -> u + iA~l(f> to turn the integral into f(du) e~(w ~ iA-^)TA(u ~ M" V) e-4TA- V = f(du) e-^Au^A-1^ (657) Now change the coordinate system to the eigencolumns of matrix A, f(du) e~uTAu = n (duk e~A^i = r\.rf = _^L. (6.5.8) J Y J Y V Ak VdetA The result is f(du) e-Tiw + 2ittl> = -^L e-<^~ V . (6.5.9)
236 6. Elementary Applications The two-dimensional Gaussian we are presented with in (6.5.4) is <TA~\ 1 - A2 1 - A2 A 1 V~ 1 - A2 1-A2 J j n,^ /1 A\ ( q' W)(xi with A = e" -kot and we note that det | ^ ] = 1 - A2 . (6.5.10) (6.5.11) Therefore J^p-(g'2 + 9"2-2AgV')/(l-A2) yT^A2 = - r dUldu2 e~u* ~ u* ~ 2XuiU'2 + 2i(W]9' + u^") , (6.5.12) 1" J-oo which we rearrange as oo l\\\n ,-00 1 A (|A) dMidw2 (-2iwi)n(-2kt2) n „—«1 — u2 + 2iuiq' + 2iu2q" n=0 00 (|A)n ( d \n ( d \nl n=0 n! \ dq' J \ dq -) - /°°dMldM2 e"u? - ^ + 2iW + W / "" J — oo J2 J I2 = EAniHn(«')Hn(g")e^'2-5"2, n=0 (6.5.13) where we recognize the Hermite polynomials of (2.5.4). In summary, oo <«'.%"> = E^(«')e"^n^»(«")* (6-5-14) n=0 where £n = tiu (n + |) (6.5.15) are the energy eigenvalues of (6.3.7), and Mq,)=J^nn(q,)e-L2l'2 V2nn! are the wave functions of (2.5.5). Mx") = (jy) " MQ') (6-5.16)
6.6 Free particle and constant force: Energy eigenstates 237 All this fuss to get back to a known result! Surely there are simpler ways? Indeed there are, not surprisingly involving non-Hermitian operators. But before going into that, in Section 7.1, let's look at the wave functions for a free particle and a particle under constant force. 6.6 Free particle and constant force: Energy eigenstates For the free particle we know that [this is (6.4,2)] , , ,, r e*3* i £_* e~ipx' (x,t\x') = / -== doe »w' =- (6.6.1) which we now rewrite in terms of E = p2/(2M), dE = (p/M)dp. We have only to recognize that for a given E, there are two values of p, p = ±\^2ME. (Henceforth we use p to mean the positive value.) That gives us the form /•oo (x,t\x')=y] d£fe,7(i)e""l£tfei7(i')* (6.6,2) ^- + Jo 7=± where yj2-Kk V dE = ^m±e±^v2ME (6.6.3) V2ttH represent the two states of the same energy, in which the particle is moving to the right (+) or the left (-). Prom Problems 6-6 we know, for constant force F > 0, that Now so that where (p,t\p') = e n&MF 8(p-p' ~ Ft)eh&MF . (6.6.4) 5{p-p'-Ft) = f" -i^-eiftP-P'-F*) (6.6.5) /00 dE7pE(p)e~^Et^B(p')* (6.6.6) -OO ^E(p) = -7=L=eT?F(Ep-6M). (6.6,7)
238 6. Elementary Applications What happens as F —> 0? Considered as a function of p, iPe(p) oscillates infinitely rapidly in this limit, except if which can only occur for £? > 0. This gives us back the free particle spectrum. We leave to Problem 6-31 the task of verifying that the expected wave functions emerge. Given iPe(p), we get iPe(x) as —4= e*3* dp , e^EP -oo V^fl V2nhF 1 1 ~~ 2-kH^/f 6M ) / dpeWF(E + Fx^c-i^fM . (6.6.9) J — oo The variable change p=(2MhF)*T , (6.6.10) gives V y ' \tfVFj J-oo 2tt / 9/l/f \ 3 = (fiVfj HVMF/h2)*{-z-E/F)). (6.6,11) They are orthonormal and complete, as they should be (see Problem 6-25). Apart from the normalization constant, we could have recognized the appearance of the Airy function from the Schrodinger differential equation for an energy eigenstate, {q'\H{q,p)\En) = h(<J ^){q'\En) = {<j\En)E (6.6.12) (frequently called the time-independent Schrodinger equation) or H 9 il>E,M = 0, (6.6.13) *-"('■?£) which here is E + JMd^ + Fx]lpE{x) = 0- (6'614) Comparison with [this is (6.4.14)]
6,7 Constant force: Asymptotic wave functions 239 (6.6.15) indeed shows that a= {2MF/h2)* (-x-E/F) should be the argument of the Airy function in (6.6.11). (6.6.16) 6.7 Constant force: Asymptotic wave functions It is clear in a general way from the differential equation (6.6.15) that Ai(<r) will be oscillatory for a negative and sufficiently large, but non-oscillatory and of exponential behavior if a is positive and sufficiently large. We want to be a bit more precise about these asymptotic behaviors. Consider first large negative a values, — a 3> 1- In Ai(o) = -Re(f drei(Hr-tr3)N) (6.7.1) there is a stationary value of the phase: "-U^-r1 W\ — T so that r = \a\ = (6.7.2) which in fact is a maximum value 1 d'2 (, , 1 2drn|a|T-3r l=-T" (6.7.3) Expanding about this value up to quadratic terms, that is: t —> V—a + t , \a\T F = 1(-^ -ar (6.7.4) gives Ai(a) ^ - Re 7T ^ -Re 7T dre 3 &-*)'< n &-<?)* :6 3V W—a = 7T 5(-CT) Icosj 3(-^)2 -J (6.7.5) which is the leading term in the asymptotic expansion for — a 3> 1.
240 6. Elementary Applications For a = 0, 1 I /'"" >^3 Ai(0) = -Re( / dTe~sr (6.7.6) or, with the substitution t = 3^ e sj/3, 1 / Z*00 Ai(0) = -Re^-ie"""^ / dyy-i e"""2'J = (-§)! = 2.678 939 EI (-1), = 0,560. Now we turn to large positive a values, a ^> 1. In 1 / r°° -i l 3^ Ai(a) = -Re I At e-1(ffr + 3CT - (6.7.7) (6.7.8) it's well to notice that, for convergence at r = oo, r should have a negative imaginary part, i. e., the integration path should run below the real axis, such as for example. Next, notice that the exponent still has a stationary point: d ( 1 , . ar + ~t3 ) = a + t2 = 0 holds for r = —\\fa . At \ 3 (6.7.9) So let's choose this contour of integration: Imr -Vct ->Rer
6.7 Constant force: Asymptotic wave functions 241 The first part, which we parameterize by r = —iv with v : 0 —> \/a, does not contribute: ^ Re j J ^(-1)(1^ e~^v " 3«3) )=o. (6.7.10) The second part, where r = —\\fa + u with u : 0 —> oo, is, for a 3> 1, Ai(^lRe(£ w Jo ~!ffi P-v^2 dw e 3CT e 3 /»0O dw e — \/<TU = |>A/V^ 1 „1 _1 -Vl (6.7.11) which is the leading term in the asymptotic expansion for a 3> 1. Now if we recall (6.6.16) and introduce the meaning of <r, the asymptotic forms of iPe(x) become ,,. , (2M/H2) tPE(X) ^7T~5 V > 2\4 (E + Fx)* for E + Fx > [ti2F2/(2M)] cos(~(2MlKY~{E + Fx)i-\) (6.7.12) and i/>e{x) =S l__i (2M/?»y c_f frM/fi^lig + fsl* 2 |£ + Fa;|i for - (£ + Fx) > [S2F2/(2M)] * . (6.7.13) A simpler and more general presentation of these results comes from the recognition that E+Fx = E—(—Fx) is the classical kinetic energy p2/(2M), so that p(x) = y/2M(E + Fx) (6.7.14) is the classical momentum at position x for energy E, and that d da; 3F {E + Fx) J = (E + Fx)* = p(x) V/2M (6.7.15) or i o rx 2M—{E + Fx)i= dx'p(x'). (6.7.16) 3-^ J-E/F
242 6. Elementary Applications All this applies for E + Fx > 0, and E + Fx>0: ipE(x)^ J-^—coa(l [ dx'p(x')-^) (6-7-17) y irhp(x) ^^ip=o 4/ repeats (6.7.12). With E + Fx < 0, p(x) is imaginary; this region is classically forbidden. But the wave function does not vanish, although it decreases rapidly as one penetrates into the region. With (6.7.18) we have |p(a:)| = ^/2M{-E - Fx) pX px / dx'\p(x')\ =(2M)i / dx'(-E-Fx)k Jp=0 J-E/F -(2M)5— {-E-Fx)* or E + Fx<0: t/,E{x)*i-. ——e*U 2 l/ nnpyx) hILodx'\p(x')\ (6.7.19) (6.7.20) which repeats (6.7.13). For the physical interpretation of these asymptotic wave functions, we first write 2 \irhpj incident reflected for x > -E/F ^if^yeuio^iP(*')i for X<_E/F. 2 \irnpj (6.7.21) transmitted Matters are as sketched in this figure: ncident V = classically forbidden classically allowed
6.8 WKB approximation 243 The incident wave moves to the left in the classically allowed region; it falls upon the boundary of the region, where E — V(x) = E + Fx = 0. There, a wave of equal amplitude is generated, moving back into the allowed region: the reflected wave. And, an exponentially attenuating wave moves on into the classically forbidden region: the transmitted wave. The transmitted and reflected wave functions, Vtrans. and Vrefl., are both obtained from the incident wave function 1 I 2M \* -ifi/JU^pM-i) *-w = iU*oJ ' K r-° " ("-22) by simple changes of p(x). The replacement p(x) -> eJ \p(x)\ = i\p(x)\ (6.7.23) turns Vine. mtO Vtrans., and the replacement p{x) -> enip{x) = -p{x) (6.7.25) turns it into i/Vefl., ^■^H^oJ eV 4J=^-- (6'7-26) Note that, owing to the phases ±n/4 in the incident and reflected wave functions, no additional phase factors are needed. These rules for turning Vine, into '0trans. and -0refl. are known as connection formulas. 6.8 WKB approximation The remark has been made that these results are more general than the particular potential V(x) = —Fx. To justify this, consider the Schrodinger equation or ^ + |rb(z)]2W*)=0, \p(x)]2=2M(E-V(x)); (6.8.2)
244 6. Elementary Applications ±p{x) being the classical momentum at x. In the light of what we have seen we look for a solution of the form ipE(x) = A(x)e i<j>{x) (6.8.3) with real amplitude A(x) and real phase <j>{x). For convenience, we denote differentiation with respect to x by primes and have da; da;2 so that ^E(x) = ^'E = A'e'f + A^'e'f , ^E(x) = ^ = A" ei(t> + 2A'i# e^ + A\cj>" e^ - Aft2 el A" - Aft2 + ^p2A = 0 , 2A'<f/ + Acj)" = 0 The second equation, multiplied by A, says that da; (A2<j>') = 0 or A C (6.8.4) 1.5) (6.8.6) with integration constant C [clearly, "constant" means "independent of x" here; C may (and will) depend on E]. Differentiation of (6.8.7) A' 1 <(>" A 2 (j)' gives A" (A'\ (A'V 1 (cj>"\ 1 and then we have that 2 1 2 ldf 1 U"^ 9 H2P 2 da; 4>< 4 \<p t This is useful only as an approximation, beginning 0th order : </>' = ±-p n and going on to 1st order : </>' = ±4 ^ ± /(¾ -\as) 1 lKdj/ in n 4 p da; p %p )'■ with 1 + 4 (f) 2 ) ■ (p!>C (?)' 2" 5.8) (6.8.9) (6.8.10) (6.8.11)
6.8 WKB approximation 245 In terms of \{x) = p(x) (6.8.12) [sometimes called the local (reduced) de Broglie wavelength] this is 2 i / i \ 2" 1 order: n 1 /\ d Y ! /\dP (6.8.13) Thus the leading term is a good approximation wherever the momentum changes only by a small fraction in a distance of A, that is: where X(x) dp(x) dx «p(i) is obeyed. That implies da; C IE-VI (6.8.14) (6.8.15) or H2 {dV/dxf 2M IE-FI3 «1. (6. Obviously this fails in the vicinity of any x — Xo for which E — V = the situation where, for small x — xq, 8.16) 0. In dV E-V(x)^F(x-x0) withF=—— (x0) da; the approximation is good for \x - x0\ > ti 2 \ 3 MF (F > 0 is assumed) provided of course that , d2V \x — Xq\ dx2 «ir, which requires d2V dx'2 0-(^0) „fMF\ <<F(^) (6.8.17) (6.8.18) (6.8.19) (6.8.20)
246 6. Elementary Applications Under these circumstances we have ¢^ JLe±iJxdx'P(x') (6821) Vp in the classically allowed region where [p(x)] > 0, and the previous discussion, with the simple connection formulas among tpinc., "0refl.5 ''/'trans., applies and gives back just the results found in the special example V(x) = —Fx, apart from the overall normalization constant. In the quantum literature this approximation method is identified with the initials WKB, thereby referring to work of 1926 by Wentzel,* Kramers,* and Brillouin,* although it had already been given explicitly by Jeffreys5' in 1924, and had a long history stretching back to Green^ and Carliniii (work of 1837 and 1817, respectively). The asymptotic forms work only sufficiently far to the left and right of the point xq where E — V = 0. It would be nice to have a unified approximation that includes the region of transition between classically allowed [x > xq, p(x) positive] and forbidden [x < x0, p(x) imaginary] regions. For that, return to the Schrodinger equation (6.8.2) and define 1 fx , / / /s 2, . 3 ., f a(x) < 0 if x > xq , ,„„„„, - dx'p(x') = -(-a) 3 with \ (6.8.22) n JXo 6 yo{x) > 0 it X < Xq , so -dxp = -v^da . (6.8.23) H Now, introducing a scaled momentum x(x) m accordance with „2 _ „„,2 X = pH-o , p1 = -oXl , (6.8.24) we have -fi;r = xr > (6-8-25) da; da and the Schrodinger equation appears as :^-X~A ax2)ip = 0 or l j a., da da J \\ da da Xt-XT"-^X2V = 0 or (IAXA_CTU = 0. (6.8.26) Write %I> = X~^ (6-8.27) *Gregor Wentzel (1898-1978) fHendrik Anthony Kramers (1894-1952) ^Leon Brillouin (1889-1969) §Sir Harold Jeffreys (1891-1989) 'George Green (1793-1841) l!Prancesco Carlini (1783-1862)
6.8 WKB approximation 247 and use (primes denote differentiation with respect to a) Ti d ±l d lx' X daX da 2 x 1 d d ~1~X^~=X 2 X da da X d \c _I ( d da^2x)\da 2 X .'* 2VX/ 4Vx X2 to get da2 ^ 2 V X 1 /W Ux; 4 U 1/) = 0. (6.8.28) (6.8.29) As before, (x'/x)' and (x'/x)2 wiU be relatively small (compared to ct) in the asymptotic regions. But now, near xq, where p{z)*i ^/2MF(x - x0) if x < xo , { i^2MF(x0 - x) \i x>x0 , a{x) = (2MF/S2)5(i0-i), x(z) = (2ftMFp, (6.8.30) the scaled momentum x is a constant and those extra terms are negligible. Thus, we have an everywhere valid approximation \da2 -ctW> = 0, solved by — C— u tp oc Ai(er) , -0 oc -—r~ Ai(cr) . (6.8.31) (6.8.32) If, for convenience, we set the proportionality constant to 2^/n, and knowing the \a\ 3> 1 forms of the Airy function [cf. (6.7.5) and (6.7.11)], we now find the asymptotic form, for — a > 1, VP \3 47 Vp(x) cos s/>'"M-i) ("■ as expected, and analogously, for a 3> 1, </> = 1 -*-* l_j/;0^'Ip(^)i O ?3 — -IT2 6 3° = y/W)\ 33) (6.8.34)
248 6. Elementary Applications But now we also know the value of ip at x = xq, for example, to which the asymptotic forms incorrectly assign oo owing to the diverging 1/ ^y\p(x)\ amplitude factor. For that we need the limit of (—ct)3/p5 = X~~f as a; —> xo, when a —> 0 and p —> 0. This is available in (6.8.30), and so 1 2585 Mzo) = 2^VxM Ai(0) = -——i- , (6.8.35) (2fiMF)6 if we again take 2-^7? for the proportionality factor in (6.8.32). Incidentally, we shouldn't fail to notice the physical significance of the factor 1/\Jp{x) in the asymptotic form, as given in l -i(i/!dx'pM-|) ''/'inc. oc —7== e vftJ*o v, (6.8.36) with its implication |^nc.|2oc-7-^, (6.8.37) p(x) and the associated (relative) probability of finding the incident particle in the interval da; about x, |2 ^ da: darl^incl oc T7^- oc dt . (6.8.38) dt Here is the very sensible result that the probability of being found in a given stretch is proportional to the time that the particle spends in that stretch. Notice however that we have considered only the incident wave. The same result would emerge if we had only the reflected wave. But in fact both are present and in reality dx\^\2 oc -^-4cos2 Q fdxp-^ . (6.8.39) So, superimposed on the particle factor \p(x)]~~1dx oc dt, is the wave factor describing the interference between incident and reflected wave. Indeed, at certain points, the probability per unit length is zero; at other points it is four times that of the incident wave alone. On the average it is twice that of IV'inc. I , representing the additive contribution of I V-'inc. I and |^refl. | . 6.9 Zeros and extrema of the Airy function The construction of tp from Ai(cr) redirects attention to the latter function in its finer details. Let us specifically ask where Ai(cr) has zeros:
6.9 Zeros and extrema of the Airy function 249 Ai(an)=0, n = 1,2,3,..., (6.9.1) and where it has extrema, maxima or minima: Ai'(ff„)=0, n= 1,2,3,...; (6.9.2) here and below primes denote differentiation with respect to a, the argument of the Airy function. For the purpose of determining the a 's and a 's we return again to the amplitude-phase construction, this time in real form: Ai(o-) = Acos^ , Ai' = A' cos <p - A<j>' sin <j>, Ai" = A" cos 4> - 2A'4>' sin 4> - A(f>'2 cos 4> - A4>" sin <j> . (6.9.3) The differential equation (6.4.14), Ai" = crAi, is satisfied by setting the coefficients of cos <fr, sin(/> equal to zero, A" - A0'2 = a A , 2A!(t>' + A'fi' = 0 . (6.9.4) The integration constant in A(AV)=0, ^V = ~, (6.9.5) follows from the asymptotic form (6.7.5), and approximations for <f>(o) are iteratively found from ^=-a+^L, (6.9.6) The initial approximation is WKB: ignore A"/A, so that 1st approximation: 0'= ->/=?, <^=|(-a)l-J, A=-^r, (6.9.7) o 4 (—0)* where, again, we make use of information available in (6.7.5) about the sign of <f>' and the behavior of <f> in the WKB regime (—a 3> 1). In this approximation the zeros a of Ai(cr) are given by cos (0(a)) = cos (| (-a) f - ^) = 0 , (6.9.8) so that 2 forn = 1,2,3,... , (6.9.9) -On ^ 37T T n —
250 6. Elementary Applications and the comparison with the exact values, -cti =* 2.3203 = 2.3381 x 0.9924 , -a2 = 4.0818 = 4.0879 x 0.9985 , -5¾ = 5.5172 = 5.5205 x 0.9994 , (6.9.10) shows that this simplest approximation is remarkably accurate. In the approximation where A is slowly varying, the extrema of A cos < are given by <j>{o) = 0,n,2n ... , so that 3?r T n"4 for n = 1,2,3,. (6.9.11) Here the comparison with exact values, -cti =* 1.1155 = 1.0188 x 1.095 , -¾ ^ 3.2616 = 3.2482 x 1.0041 -¾ = 4.8263 = 4.8201 x 1.0013 (6.9.12) reveals very good agreement, except for n = 1. We improve the approximation for <f>(o) by using the 1st value for A"/A in (6.9.6), A^_ A 1 16 (-a)2 ' (6.9.13) giving so and —a 2 approximation: q From (j>{a) = 7r/2,37r/2,57r/2,. of (6.9.9), 1 + 1 16 (-ct)3 1 + 1 32 (-a)3 _ 2 3. 7T = 3(-^-4 48 (-")- (6.9.14) (6.9.15) (6.9.16) 3?r ( 1 T n-4 , we then get a corresponding refinement 3.9.17) 3 5 H 48 [37T ( 1\] — n L 2 V 4jj 3 Not surprisingly, the comparison with the exact zeros of Ai(<r),
6.9 Zeros and extrema of the Airy function 251 -cti =* 2.320 251 + 0.019 349 = 2.339 600 = 2.338 107 x 1.000 638 , -5¾ = 4.081810 + 0.006 252 = 4.088 062 = 4.087 949 x 1.000 028 , -5¾ - 5.517164 + 0.003 422 = 5.520 586 = 5.520 560 x 1.000 005 , (6.9.18) shows a substantial improvement. Concerning the zeros of Ai', we note that the extrema of (—0')~2 coscj) are determined by H0! sin 6 + - 2 wy cose or. since is small, 2(</>')2 1 Ai. 1 d 1 = 0 1 1 4 (-a)! sin ' + 1 2 {4>'Y With <f>(o) of (6.9.16), the requirement 1 = 0. > + 2 wy = 0, 7T,27T, . . is met by -on — pTr [2 (a 3\] I" 4J] 3 7 48 "3?r [2 (a I" 3"ll 4jj (6.9.19) (6.9.20) (6.9.21) (6.9.22) (6.9.23) and the comparison with the exact zeros of Ai', -fi =* 1.115460-0.117206 = 0.998255= 1.018 793x0.9980, -f2 - 3.261626 - 0.013 708 = 3.247 917 = 3.248 198 x 0.999 914 , -¾ = 4.826 316 - 0.006 261 = 4.820 055 = 4.820 099 x 0.999 991 , (6.9.24) shows good agreement for n = 1 and very good agreement for n > 1. If we now denote the zeros a\, ef2, .. • of the Airy function by a\, 03, 05, ..., and the extrema cti, a2, ■ ■ • by cto, 02, 04, ... , the second-order approximations (6.9.17) and (6.9.23) are compactly presented as —o-m. = r3-rr — L 4 ( lX m + - v 2; J 3 _ fl 1/ ,m\ 777+0-1 \48 8V ' ) \3tt ( 1\ — m + - L 4 V 27] 3 for m = 0,1,2,... (m odd: zeros; m even: extrema). (6.9.25)
252 6. Elementary Applications The solid curve in 0.8 displays the function Ai(a) that, for negative a, has these zeros, maxima, minima, equals 0.3350 at a = 0, and decreases rapidly for a > 0. The dashed curve shows the leading asymptotic forms (6.7.5) and (6.7.11). 6.10 Constant restoring force We get a direct application of these results if we ask for the energy values in the so-called linear potential: energy V(x) = F\x\ , which corresponds to a constant restoring force, dV „ ,, i-F forz>0, —-r— = — b sgn(£) = < dx B w \+F forz<0. The Schrodinger equation for an energy state is here (6.10.1) (6.10.2) (6.10.3)
6.10 Constant restoring force 253 This differential operator is even in x; it is unchanged by x —> —x: ft'2 d2 £+2Md?-f|11 i>(-x) = o. (6.10.4) If both tp(x) and ip(—x) are solutions, so also are their even and odd sums VWen(z) = -j[iAx) +tp(-x)] , 1p0dd(x) = -j[^{x) -1p(-x)] , (6.10.5) which are characterized by tpeven(-x) = 1peven(x) , "0odd(~^) = ~1podd(x) , (6.10.6) so that, taking the continuity of ip and dip/dx for granted, d da; </Wen(0) = 0 , </>odd(0) = 0. (6.10.7) For each kind of function it is sufficient to consider only the region x > 0, or x < 0. We use the latter: V •f- boundary condition >x which gives us back the situation already studied, V = —Fx, now with a boundary condition at x = 0, appropriate to even or odd functions. The solution, with V = —Fx, is a; s -i [2MFY ( E x < 0 : ip\x) ^ Ai(ct) Wltn — ct = ( I ( a; + — Therefore, in accordance with (6.10.6) we have VW*) <x Ai((2MF/ft2)*(|a:| -F/F)) , </>0dd(*) <xsgn(a;)Ai((2MF/ft2)*(|a:| - F/F)) , and the boundary conditions (6.10.7) require (2M_\ 3 f -^ , -<f2 , Vn2F2; \-Si, -a2, for Veven for Vodd -<70, -CT2, •■• for Veven -^i , ~^3 , • • • for Vodd (6.10.8) (6.10.9) (6.10.10)
254 6. Elementary Applications so that E™ = ^a™)(r^fX for m = 0,1,2,... (6.10.11) are the energy eigenvalues for the linear potential (6.10.1). The second-order approximation for — am in (6.9.25) gives ( 2M V \h2F2) H>m — 3?r ( 1 T [m + 2 'W + J 3?r / 1 Tr+2 (6.10.12) We write fa, fa, fa, ... for the even solutions, and fa, fai, fa, ... for the odd solutions. The (positive) constants cm in x>0: faa{x)=cm(2MF/n2Y ki(am + (2MF/h2Yx^ (6.10.13) are determined by the normalization: /OO />00 dx\lpm(x)\2 = 2 / dz|V>m(z)|2 -oo ./0 /•OO / da[Ai(a)]' = 2ci so C«7. I ^ />oo !/ da[Ai(CT)]; J <Tm (6.10.14) (6.10.15) where, as we recall, Ai(<rm) = 0 for m odd, Ai'(<rm) = 0 for m even. Now, differentiation of the differential equation (6.4.14) obeyed by Ai(<r), da2 Ai(<r) = a Ai(er) , gives Therefore da' Ai'(a) = aAi'{a) + Ai(a) . (6.10.16) (6.10.17) [ Ai(a)]2 = Ai(a)^ Ai'(a) - Ai'(a)^ Ai(a) ^(Ai(a)Ai"(a)-[Ai'(a)]2) ^([Ai'(a)]2-a[Ai(a)]2) da i "da (6.10.18)
6.11 Rayleigh-Ritz variational method 255 and A" = /0° d<T [Ai(^)]2 = ([Ai'(am)]2 -am[Ai(am)]2) (-CTm)[Ai(CTm)l for m even, 2 (6.10.19) [Ai'(aro)]' for m odd. Let's ask how well the leading (WKB) approximations represent the normalization constants cm. Proceeding from Ai(a) S 7r-3(-a)-' cos^-(-a)-3 - -J , Ai'(a)^7r-J(-a)isin(|(-a)-t-J), we first get [Ai»]2-a[Ai(a)]2S V=^ and then with (6.9.25) On* (!) 3tt / 1 — m+- 4 l 2 = 7r3(6m + 3)' (6.10.20) (6.10.21) (6.10.22) The comparison with exact values, co = 1.21954 = 1.30784 x 0.9325 , ci = 1.01549 = 1.00841 x 1.0070 , c2 = 0.93261 = 0.93634 x 0.9960 , c3 = 0.88175 = 0.88046 x 1.0015 , c4 =* 0.84558 = 0.84666 x 0.9987 , c5 = 0.81777 = 0.81727 x 1.0006 , (6.10.23) shows that the error is well below 1% except for m = 0. 6.11 Rayleigh-Ritz variational method The one state that did not fare too well with the WKB approximation is, not surprisingly, the lowest energy state, m = 0, which has a wave function without oscillations: Ai(<r0 + \q\ 'ipo(x) oc Ai(a0 + \q\) with q = {2MF/h2)ix . 0 1 (6.11.1)
256 6. Elementary Applications Here is a method directed specifically at that state. Consider any Hamil- tonian for which the spectrum is bounded below: H' = E > E0 ■ (6.11.2) In the present circumstance, H = p2/(2M) + F\x\, it is clear that H' > 0; there is a lowest energy state. Generally, we have (H - E0)' = E - E0 > 0 , (6.11.3) so that, for any state | ), the expectation value of H — E0 is positive, ((H-~Eo)) = YJ(E-Eo)p(E)>0, (6.11.4) where the equal sign holds only if | ) = \H' = Eq~). Equivalently, (H) > E0 , (6.11.5) so that for any | ), {H) provides an upper limit to Eq. One then tries to minimize (H) to get a good value. In the quantum literature, this is known as the Rayleigh*-Ritzt variational method. It is often convenient to write a normalized (real) wave function as ¢(1)/^// dx'[,«x')]'- (^ + fW> = Writing / Here then 1 dx 2MV-*fj)(*i^+*FM* ! dxip2 converts x - this - ( n2 Y \2MFJ q into /d. and dV> £o=,W ^° >E0. (6.11.6) (6.11.7) dq + \q\f I >£o (6.11.8) dqij}1 where the range of q is, say, 0 —> oo and (dip/dq) (0) = 0. 'John William Strutt, Lord Rayleigh (1842-1919) tWalther Ritz (1878 4909)
Problems 257 Now we must pick a suitable trial wave function ip(q). It should be a maximum at q = 0, and it must decrease rapidly for large q. Suppose we try (having some knowledge of its shape) 4>{q) = e"lAg5 , (6.11.9) where A is an adjustable parameter. Then we get Jo°° d<? (A2<? + g) e" 4 % a. 3~fAg2 f^~'¥ > £0 (6.11.10) or, with (A2 + ¢=(-) **, (6.11.11) (6.11.12) where (|)! = 0.892980, (-|)! = 1.354118, and (f)<(-§)! = §7r/sin(|7r) = 271-/35 illustrates a property of the factorial function. We now pick A to minimize this: ~ (At +\~i) = t\\-1\~\ =0 (6.11.13) qA *> / o o or A2 = J, A3+A^I = 4- (6.11.14) 2 2i Therefore 3§ (I)! r -, ^0^^--7^^7 = 1^288 | = 10188x 1-0098 I . (6.11.15) The approximation is correctly in excess and remarkably close considering the simplicity of the trial wave function. Any more general choice will yield a lower and better answer. Problems 6-la One degree of freedom, Hamilton operator H = p2/(2M) + V{x) with arbitrary V(x). The probability for finding the particle between x' and x'+dx' is dx' p(x',t) with the probability density
258 6. Elementary Applications p{x',t) = (6(x{t)-x')) . Show that the continuity equation is obeyed by this p and the probability current density j(x',t) = ±(p(t).5(x(t)-x')) . Generalize this to motion in three dimensions. 6-lb Now consider the probability density for momentum p, P(p',t) = (5(p(t)-p')) and find the associated probability current density needed in §iPV,t) + ±M,t) = o. What is the three-dimensional analog? 6-2a One degree of freedom, Hamilton operator H — p2/(2M) — Fx with constant force F. State and solve the equations of motion. As in Problem 5-7a, again consider the minimum uncertainty state |<5), at t = 0, and evaluate [Sx(t)}2, [Sp(t)}2. 6-2b Now consider an arbitrary initial state and repeat Problem 5-8a. Why could you have anticipated that T and to do not depend on F ? What changes when the force is time dependent, F(t) 1 6-3a One degree of freedom, Hamilton operator H = p2/(2M) + \Mu2x2 with constant frequency u. Same questions as in Problem 6-2a. In addition, use the solutions to verify that x(t) and p(t) obey the required commutation relations. Also prove that 5x(t)5p(t) > \h. Under what circumstances does the equality sign hold for all tl 6-3b Again lift the restriction of an initial minimum uncertainty state. Follow the strategy of Problem 5-8a and find corresponding expressions for 5x(t) and Sp(t). 6-4 Three degrees of freedom, Hamilton operator H = p2/(2M). For N = p(t)t — Mr(t) and constant v, evaluate (r',t\e*v-N\p') and interpret the result.
Problems 259 6-5a One degree of freedom: Prove that dp [Hint: Recall Problem 1-55.], and illustrate it with - FX = e""»6MF (-Fx) eh&MF . 2M v ' 6-5b Use this to show that ; , 2 . j 3 ; j 3 e~l Um — -^ ^j* = e—ft 6mf eft eft smf = e ft e s2M'e k m i1 b e ft om -i^! i-^lFi2 -^^-t i-Fxt Recognize in these results an example of Problem 2-11. 6-6a Apply a statement of Problem 6-5b to demonstrate that (p',t\p") = e~I6mf <5(p' -p" -Ft) efi&MF ; for i? = p2/(2M) — Fx. What does this become in the limit F —> 0? 6-6b Verify that this {p',t\p"} is the solution of the appropriate p Schro- dinger equation and its initial condition. 6-7 Apply another statement of Problem 6-5b and arrive at 1 i-r'(n1 + Ft~\ —i-2—t -IsLlFt2 — 1 = —■ eft^'e ft2Mle ft m ir b e ft 2,3 (x',t\v') = / enx ^p + ^1) e ft 2M* e ft m 2-^ e » 6« for H = p2/(2M) — Fa;. Then use the action principle to produce another derivation of this result. 6-8 Apply the third result in Problem 6-5b to get {p',t\x'). Check that this is produced from (x',t\p') by complex conjugation combined with the substitution t —> —t. Why should that be so? 6-9 Use the action principle, for H = p2 j(2M) (one degree of freedom), to evaluate SiV.w. £iW.*W). jjsf<*.<i*">-
260 6. Elementary Applications Are these results correct according to the known forms of the respective time transformation functions? 6-10 A question analogous to Problem 6-9, with H = p2/(2M) - Fx, and a;<*',(|j>')=?. ^',(1*")=?. 6-11 One degree of freedom, Hamilton operator H~2M d\(x,t) V dx 2 d\(x,t) dt Does the force M(d/dt)2x depend on the "gauge" \{x,t)l Use the quantum action principle to find the A dependence of the time transformation function {x',h\x",t2). 6-12 Concerning the apparent ambiguity of the square root in (6.1.31): Follow the spirit of the discussion in Section 6.3 and write ' l = e-W (l - e-ee-2i^V 2i sin{ujt) V J t > 0, e ->• 0 Then show that, for e > 0, the right-hand side is - in a natural way - a continuous function of ut. Take the limit e —> 0 and state explicitly what you get for kn < wt < (k + l)n with k integer. 6-13a One degree of freedom, Hamilton operator H = p2/(2M) — F(t)x with time dependent force F(t) acting between t = 0 and t = T. Consider variations SF(t) of the force and use the quantum action principle to find first SF(p',T\x',0) and then (p',T|a;',0). 6-13b Repeat for (x1,T\p',0). 6-13c Finite time transformations are effected by the unitary evolution operator U, (..., T\ = (..., 0\U; see Problems 5-2. Regard U as a function of x = x(t = 0) and p = p(t = 0) and get the pa>ordered form of U from (p',T|a;',0) and its £p-ordered form from (x',T|p',0). Show that they can be written as U = e-ifMTe-bAx^xAp^ijr^dt^dt'm^F^/M = eft \xAv e-±PAx ^i^f^T e4 /0T dt /0T dt' f(t)t>F(t')/M
Problems 261 where << and t> are the earlier and later one of the times t and t', respectively, and Ap= j dtF(t), Ax = -J AtiiF^) are convenient abbreviations. What is the physical significance of Ap and Ax'l 6-13d As a check, use either form for U to verify that x(T) = WxU and p(T) = UipU. 6-13e Write U as a single exponential of a Hermitian operator, rather than a product of exponentials. [Hint: Problem 1-55.] Verify that you get the right answer for time-independent F. 6-14a Consider a Ag atom that passes through a succession of Stern- Gerlach magnets that are intended to first split a beam of atoms in two and then reunite it - a Stern-Gerlach interferometer. For simplicity, treat the longitudinal motion, along the y axis, as classical: y —> vt; ignore the x motion; and assume that only the z component of the magnetic field is relevant and that all z values of interest are sufficiently small to justify the approximation fj, B = fia B =* n<jzBz(y, z) = /uctz Bz(y,0) + z-^(y,0) y~^vt Show that the Hamilton operator H = p2/(2M) — fj, ■ B is then effectively reduced to where the precession frequency Q(t) and the force F(t) vanish before t = 0 and after t = T. Prior to entering the magnets at t = 0 the atom is in a ax = 1 spin state and its spatial properties are specified by a certain probability operator P(z,pz). Use the findings of Problems 6-13 to show that | (<t(T)) |2 = |tr {P(z,pz) e2[(P*Az ~ 2^>*)/fi} |2 where Az and Apz are defined analogously. 6-14b Ideal reunification would be achieved for Az = 0 and Apz = 0. Supposing now that the experimenter is satisfied by | (<r(T)) | > 0.9, say, how large are the tolerable uncontrolled errors in Az and Apz1 [Hint: Problem 2-16.1 Conclusion?
262 6. Elementary Applications 6-15 One degree of freedom, Hamilton operator H = p2/(2M) + Vo — Fx + ^Mu2x2 with Vo, F, u2 constant. Use the known transformation function for Vo = 0, F = 0, u2 / 0 to get x ' ' ' y 2TTihsm{ujt) x e-i(Mu3)-' (F - Muj2xf[tM.{\ujt) - iwt] where 5? = 5(2;' + #"), £ = #' — #". Check the limiting situations of vanishing F or vanishing uj2 . What happens for uj2 < 0? 6-16a One degree of freedom, Hamilton operator H = p2/(2M) + V(x) with (rather) arbitrary potential energy V(x). The particle does not travel far during short time intervals. With this in mind, use the local-oscillator approximation for V(x), that is dV 1 .,d2y V{x) ^ V(x) + (x-x) — (x) + 2^-5)¾-^) with reference point x = \{x' + x") half-way between the initial and final positions x" and x', and the result of Problem 6-15 to obtain a short-time approximation for the time transformation function. You should get (x',t\x")^ J—j-enn h+±u}2t2) e hUMuj « e *Vt e kum s v ' free particle where V = V(x), F = -$£{x), M^S2 = 0(x), and | = x' - x". 6-16b Generalize this to motion in three dimensions with Hamilton operator H = p2/(2M) + V(r). 6-17a Put x' = x" = x in Problem 6-16a and use the resulting approximation to (x\ exp(—j^Ht)\x) to show that tr{/(H)}s /H? JdaM(a)f(^ +7 -^[(H2/M)(F2 -2p2^)}*) is the corresponding approximation for the trace of a function of H. 6-17b Now use properties of the Airy function and partial integrations to exhibit the leading quantum correction to the seiniclassical phase space integral:
Problems 263 tr{/(#)}= /^/(^-+7)- [*£*£#!£/»(*?- +v) 1M " 7 27rft"'V2M / 7 2nh 24 -7 V2M / semiclassical value leading quantum correction where primes denote differentiation with respect to the argument of /( ). 6-18 Reconsider the multiplicity of energy states for constant force. Concerning the phase space integral in (6.4.16): Suppose one integrates first over x, from -oo to oo, and then limits the p integration to |p| < P. Demonstrate the equivalence of the resultant spectral density with that displayed in (6.4.22). 6-19 For function f(H) of Hamilton operator H = p2/(2M) - Fx with constant force F show that Why did we not make use of this when evaluating the trace of (6.4.16)? 6-20 Prove that /oo />oo daaAi(a) = 0, / da a2 Ai(a) = 0 , -OO J —OO whereas />oo da a3 Ai(a) = 2 . 6-21 Work out Ai'(O) analogously to Ai(0), and conclude that -Ai(0)Ai'(0) = ^-. [You will need a fundamental property of the factorial function.] 6-22 We know that /OO da Ai(a) = 1 . -OO What are the individual values of ^00 pO / da Ai(a) and / da Ai(a) ? JO J-oo
264 6. Elementary Applications 6-23 Here is a theorem about Ai(<r): 1 /1°° 1 [Ai(^)]2 = ^/ drAi(r) V2~2/3r _ a 2 In what sense does this make a true statement about [Ai(cr)] for — a ^> 1, if one regards — a as large compared to the significant values of r? 6-24 For a proof of the theorem in Problem 6-23 first show that 0 if a < 0 , >0, r°° ( 0 if a and use this and (6.4.13) to establish j_ r 2tt 722/v dr Ai(r) f°° dx r°° dy_e_j_x3_ixy2_iax 2n i22/v V2-2/3r - a J-oo 2?r J-oo 2?r Now introduce new integration variables in accordance with x = T\ + T2, y = |(n — T2) and head home. 6-25 Show that both the orthonormality and the completeness of the iPe(x) wave functions (6.6.11) imply / da Ai(<r — cti) Ai(<r — ct2) = S(ai — a2) , which is, therefore, a completeness and orthogonality relation for the Airy function. Check it directly. 6-26a Use the defining integral representation of Ai(cr) to demonstrate that /00 da Ai(a)Ai(-a) = 2~5Ai(0) . -OO 6-26b Extend the argument to establish /00 dr Ai(a + r) Ai(a - r) e2ifcr = 2~s Ai(2§(a + k2)) . -00 6-26c As a check of consistency, derive a differential equation for this integral by first showing that d2 d2 da2 dr2 [Ai(er + t) Ai(er — r)] = 4crAi(cr + r) Ai(cr — t)
Problems 265 6-27 According to Problem 6-23, 0-2/3 />oo j Evaluate this integral and arrive at [Ai(Q)]2 = -^^^6 (-!)«• Does this check out numerically? 6-28 Verify that another consequence of Problem 6-23 is -Ai(0)Ai'(0)=--/o -^Ai'(r). Evaluate this integral to recover the result in Problem 6-21. 6-29 Replace the unbounded x integral in (6.4.19) by the bounded one of (6.4.20), then perform the a integration and the x integration (in this order) to arrive at { />00 e-iHt\ = / dEmL(E)e~TiEt ' L J~oo where mi(E)={§ky oai' w]2-°w<>)]2)^ =(2MF/h2) s (-E/F-L) Show that (6.4.22) obtains under the circumstances of (6.4.21). 6-30 One degree of freedom, Hamilton operator H = p2/(2M) + ~Moj2x2 with constant frequency w. Use the action principle to find {p',t\x'). Can you identify the momentum wave functions ipn{p')^ If you find it convenient, introduce dimensionless variables. 6-31 In order to perform, for E > 0, the limit F —> 0 in (6.6.7) first write p = ±\/2ME + p, put aside all phase factors that do not depend on p (why is this allowed?), and get V^i27rSF Then show that * eTiW-E/(2M)p2 ^ (2E/Myh(p) as F -> 0 , yf^i2nHF
266 6. Elementary Applications and note that p3/F ~ y/F in this limit. Verify that your answer agrees with the ipE,±(p) obtained from (6.6.3). 6-32 The n oscillator state (dimensional variables used) has classical turning points at q = ±\/2n + 1 (check this). Use the WKB approximation for the wave function in the classically forbidden region q > \/2n + 1, retaining the two leading terms in an expansion of the exponent for q ^> \/2n + 1. How does your result compare with the asymptotic limit of the known wave function? 6-33 You know the WKB approximation when the classical region is on the right. Find, in any manner, the WKB wave function for a classical region on the left. Consider a potential energy V(x) with two classical turning points: energy E x0 Xi ->x Write the WKB wave function for x > xq, and for x < x\. To within a possible minus sign these must be the same in the common region xq < x < x±. Conclude that \ f ' Ax sj2M{En - V{x)} = n + i for n = 0,1,2,... ™ Jxo determines the WKB approximations for the energy eigenvalues En. 6-34a Consider the oscillator potential V{x) = \MJ1x1 and use the result of Problem 6-33 to work out the possible energy values. Compare with the known result. 6-34b Repeat for the linear potential V{x) = -F|a;|. 6-35a For a very different derivation of the WKB energies of Problem 6-33 return to the approximate trace evaluation in Problem 6-17b. Apply it to fE(H) 1 if H <E , 0 if H > E , so that tr {/#(#)} is the count of energy eigenstates below energy E. Equate the semiclassical value of tr {fE„(H)} with n + | (why is that reasonable?) and show that this reproduces the WKB quantization rule.
Problems 267 6-35b Now use the leading quantum correction of Problem 6-17b to improve upon the WKB rule. Find the implied corrections for the oscillator potential V{x) = \Muj2x? and for the linear potential V(x) = F\x\. Compare the latter with (6.10.12). 6-36 Consider a family ip\(q) = ip{^0) of scaled trial functions in (6.11.8). Which value of A gives the lowest upper bound for £0? Use this to arrive at a scale-invariant version of (6.11.8). Try it for tp(q) = exp(—qa) with a- 2. 1 o 2' 4>
7. Harmonic Oscillators 7.1 Non-Hermitian operators Now we turn to the treatment of the oscillator using non-Hermitian operators, with an eye toward more general dynamical circumstances. For simplicity we use dimensionless variables q,p, and the non-Hermitian variables y, y^ closely related to them, rather than dimensional x,px: h 1 , t . px =VhMujp —> p = --= (yf - y) , ^[x,Px]=\[q,p]=l, [v,V*]=l, (7.1.1) and express the energy in frequency units. ^=^ + ^^ = ^(1^ + ^) = fa(y1y+%)—Kjjy1y, (7.1.2) where, in addition, the irrelevant constant |fiw = |w is subtracted, so that the eigenvalues of H are now u, 2w, 3u, ... . Since [2/,1^] = i analogous to [q,p] = i, it must be possible to use, in addition to Lagrangian L=p^-H(q,p,t) (7.1.3) the Lagrangian and the generators L = iy^-H(y,v\t), (7.1.4) Gy=iyUy, Gy,=-iy5y1 (7.1.5)
270 7. Harmonic Oscillators as the analogs of Gq = p5q and Gp = —qSp. In accordance with Section 5.7, the condition is that L-L=±W, (7.1.6) with, at any t, 6w = (Gq+Gt)-(Gy+Gt) = (p5q-H6t)-{iyl8y-H8t). (7.1.7) The transformation of interest here does not involve t: H = H, so that Gt = -H5t = -H6t = Gt, (7.1.8) and we have p5q - \yHy = 5w(q, y) . (7.1.9) Now so p = i{q-V2y);, y^ = V2q-y, (7.1.10) Sw = i(q — v2y)Sq — \{y2q — y)Sy = ^(V+ ^-1^</y) • (7.1.11) This is used directly in finding the transformation function {q'W), S(q'\y'} = i(q'\(Gq - Gy)\y') = i(q'\Sw\y'} = i(q'\y')s(^q'2+1-y'2-iV2q'y^ , (7.1.12) giving (q'\y') = n~i e-y2 - \y'2 + ^l'y' (7.1.13) which is the long known result (2.7.30) including the constant that normalizes the vector |y' = 0). With the Hamilton operator of (7.1.2), the non-Hermitian equations of motion are ft-w<-^ <»■"> and
7.1 Non-Hermitian operators 271 ^ = -- = -^. (7.1.15) The simplification here is evident, the equations of motion are solved immediately: y[t) = e-^ty , y\t) = eiwV . (7.1.16) Now consider the time transformation function (yt ,t\y"), <5</, t\y") = 1(/, *| [(G„, + Gt) (*) - G„] |y") = i<2/+', *| [-tf/y W - <Wv - itfV] \v") , (7.1.17) where [•••]= -uS/e^y -<$WW e'iuty - iy1{t) e'[ut6y" 4- 4- 4-4- V" / V" / -> <S [-i/e-^V'l , (7.1.18) so which satisfies the initial condition (/, t\y") -> <2/+'|2/"> = e»fV' for t -> 0 . (7.1.20) Now let's use the time transformation function as </,*|y") = </|e-itff|y") (7.1.21) which is immediate from the power series telling us that H'=nu with n = 0,1,2,... (7.1.23) are the energy eigenvalues and (/^)=1^, (n|y'')=i!LL (7.1.24)
272 7. Harmonic Oscillators are the wave functions of the energy eigenvectors. This, of course, we already know. Indeed, we have seen the essential mathematical details in Section 2.9; that earlier treatment is recovered upon replacing ut by —i/?. Further, we know that the transformation function (7.1.13), <«V> = 7T~i e~K2 + V2«V - W2 = £ MD^S- , (7.1.25) n=0 v produces the Hermite polynomial form for (q'\n) = ipn(q')- It's also possible to use this generating function to arrive at asymptotic forms of the r/Vi (<?')• We leave this matter to Problem 7-3, and turn to a study of more general dynamics. 7.2 Driven oscillator Consider the time dependent Hamiltonian H =ujyly + K(t)*y + K(t)yt . (7.2.1) It gives the equations of motion 1¾ = ^ = «.+«(«) <™» and -i^=<V+„(*)*, (7.2.3) which describe the system as driven by external forces. Clearly we can solve these first-order differential equations. But it is important to realize first the boundary conditions that accompany them. The time development of the system is given by (v*',h\y",h)K (7.2.4) where k is written to recall the presence of the external forces, as distinguished from (/c = 0) (vI'M^-y^1-*^. (7.2.5) Accordingly, it is natural to ask how (l|2)K changes as we turn on the forces, as given by 6K(1\2)K = i(l\\- f dt (6it*y + 5Ky1)(t)\\2) (7.2.6)
7.2 Driven oscillator 273 where, of course, (l| = (y* , tA and |2) = \y", £2). Now we see that we should find y(t) in terms of the given y{t2) —> y", and y^ (t) in terms of the given So, begin with (7.2.2) or, equivalently, ±(eiUy(t))=-ieiUK(t) (7.2.7) giving which is elujty(t) = elujhy(t2) -i / d*' elujt «(*') , (7.2.8 y(«) = e-^*-*2)^) -i /'d*' e^'-*)K{t') Jt2 (7.2.9) For the y\t) equation, it suffices to take the adjoint of the above, while replacing t2 by t\: y1(t)=y1{ti)e~iu(-tl ~'} -i f ' M n[l!)* <T™V-*) . It will be helpful to introduce Heaviside's* unit step function 'l iovt>t', V(t-t') = 0 for t < t' , (7.2.10) (7.2.11) so that f At' e-^-Vittf) = t dt'vit-t'je-^-t'Kit') , h2 Jt2 f 1 At' K(t')* e~-'lU){t' ~ *) = J' At1 K{t')*V(t' - t) e-iw(*' ~ *) . (7.2.12) h Jt2 Thus, the integral in (7.2.6) effectively becomes / At (6k*y + 5kiJ) -> Jt2 f1 Atdn(t)* e-iw(< —<2)y« _i J1 dt> v(t-t!')e-iw(* ""*')«(*') Jt2 L -^2 /" + / At n2 f g-iwCij -t)_{ Ttf K(t')*ri{t' -1) e_iw(*' - *) •/*2 <5k(£) (7.2.13) 'Oliver Heaviside (1850-1925)
274 7. Harmonic Oscillators or, on exchanging t f* t' in the last (double) integral, rti / At (5K*y + 5Ky1) -> Jt2 S \f dtK(t)* e~iu}t eiut9-y" + e-^rf' f At At' K{t)*r]{t - t') e~iw^ ~ *')/e(f) d* eia;iK(t) /2 (7.2.14) This gives immediately the time transformation function t' i. L" +„\« -»2/T e {y\h\y"MK = ew f -iw(ti -*2)„» i»t' e-iw*i /^ di eiwiK(t) x e w J2 x e~' J2 i/21dtK(t)*e-ia;ieia;i22/" x e- £ At At' K(tyV(t - f) e~™(t - *')«(f) (7 2 15) We note that the first factor is (yt , £t |j/", £2), the khO time transformation function of the not-driven oscillator, and the last factor is (0,ti\0,t2)K, the y" = 0 —> yt — 0 transition amplitude of the driven oscillator. The equivalent form </, *i \V", hy = </, tl |,», i2) e-i[/ e"-'>7 + 7* e^V] <0,,, |0, t2>- (7.2.16) with 7=/ AteiujtK(t), 1* = I At n{t)* e"'lujt (7.2.17) reflects these observations. 7.2.1 Time-independent drive First, let's run a check on this. We wrote n(t), but that includes n independent oft: H — ujy^y + K*y + Ky^ I |2 ( t K\ ( K\ \k\ = uj(yi + -) ly + -) - L-L- = cj(yl +X*)(y + X) -u\X\2 with A = k/cj . (7.2.18)
7.2 Driven oscillator 275 The operators in the latter version are mutually adjoint and obey the commutation relation [v + \,V* + A*] = 1. (7.2.19) Therefore we immediately see the spectrum: I I2 El =nu- — = (n- \X\2)u , (7.2.20) just lowered by a constant. The eigenvectors are clearly given by / I , \*\n |n,A)= [V ,— ' |0,A) , (y + A)|0,A)=0. (7.2.21) Vn! Now y + A= e-VjeV , (7.2.22) so that yeA»f|0,A)=0 (7.2.23) or -Ay 0) |0,A) = ' '=== , (7.2.24) ^(Ole-A^e-A^lo) where the denominator ensures proper normalization; it's explicit value is . oo ,,,2n <0|e~A ye~AyT|0) = J]^r = e ' (7'2-25) n=0 Accordingly, |0, A> = e™2 lA'2 e~~A2/t|0> (7.2.26) which is conveyed, along with |n,A), by the wave functions |2 (/ln,A)=(l/ +A) e-Ve-||A|-. (7.2.27) Vn! Does the time transformation function (7.2.15) produce these results for K,(t) = k (= wA)? We observe that 7 = / ' At eiu}tK = -iA (eiwil - eiw*2) , 7* = f ' d^*e~ia;i = iA* (Via;il - e^*2) , (7.2.28)
276 7. Harmonic Oscillators and fU dtdt' K*v{t - *') e"^ - *')« = -iu\X\2 p At (l - e^' " *2)) T+ife-^-lV -'no A (7.2.29) where T = t\ — fe- Therefore, xei|A|2[o;T-i(e--T-l)] = eio;|A|2Te-»t'A (»f + A*) e"1^^" + A) -A*y" e-|A| ^(/MK^-'MO (7.2.30) n=0 is the appropriate time transformation function where, indeed, E* is the energy of (7.2.20), (y^ \n, A) are the wave functions of (7.2.27), and (n, \\y") are their adjoints. 7.2.2 Slowly varying drive Next, suppose that n(t) changes slowly from 0 at t2 to wA at t\, u(h - h) » 1 1 d2K(t) w2 dt2 < l«COI d/c dT I >£ = 0 for £ < t2 and t > t± . (7.2.31) What happens? We evaluate the integrals needed in (7.2.15) with the aid of partial integrations. First note that
•K*) = | d_ dt ' eiu}t —«(*) ' eiujt —n{t) - + eiujt dK (iw)2 dt{t} ufi dt^\ 7.2 Driven oscillator 277 eiujt d2 f ^ + (iuy dt*K{th (7.2.32) so that and /' Jwt piwti 7 = / dt e1U}ln(t) 2 «(*!) = -iAeia;tl / dtK(t)*< lUJ —\u>t ~ j \ * e~iujti (7.2.33) (7.2.34) The same approximation is used in / dt dt' n(t)*ri(t - t') e"iuj^ " *')/e(f) ■>2 -\w{t-t'\ -K(t) + ^ J' dt [-MK{t)\2 + K{t)*^K{t)] . e-iw(*-*')dK Wz dt (*) (7.2.35) With i«wi2 = ^[*k*)i2]-*^i««i2 (7-2-36) and ^0 = 551-^ + 5 •w-lo-^-c) (7.2.37) real imaginary we write this as dt dt' n(t)*T){t - £') e"1^* - * )/e(f) S-i^|K(*i)|2+27^|K(*i)|2-i? • j. I \ I2 , 1 I \ |2 -, = —lwi] |A| + -|A| - 10 , (7.2.38) where the real phase 0 is given by
278 7. Harmonic Oscillators r At l ( tAn Ak [ K — ;—K, 2uj I At At t ( .Ak Ak* wl di At (7.2.39) Therefore, the time transformation function (7.2.15) is here approximated by )uh n=0 x e = (»t'|„,A> ,ino;i2 w / Vnl (7.2.40) or = (n|»"> OO (/,^1^2^= e^ £ (/|n, A) e-1^'1 x eiS«i2(n|;(/") . (7.2.41) n=0 = <Sft',*i|«,A> =<n|»",*2> But, in general we can introduce energy states at t2 and different energy states at t\\ 00 WMv"MY= E </.*ihA>(n,A|n'><nV',*2> (7'2'42) from which we learn that or (n,A|n')= e^<J(n,n') (7.2.43) |(n,A|n')| =8(n,ri). (7.2.44) So the system is to be found, with certainty, in the state of the same quantum number n, although the energy changes by — |/c(£i)| /u) = — uj\\\ in going slowly from k = 0 at time t2 to k = wA at time £1. 7.2.3 Temporary drive Now think of a situation in which k is zero before t\ and after t2, being turned on and then off, as illustrated by
7.2 Driven oscillator 279 where k = 0 for t < ti and t > t\ (7.2.45) but otherwise arbitrary. As a first example, put y^ = 0, y" = 0, when (7.2.15) gives (0,h\0,t2y= e- £ dtdt' "mi*-*') e~M* ~'V) . (7.2.46) Then the answer to the question, If the system is in the n = 0 state at £2, what is the probability that it is still in the n = 0 state at t\, despite the intervening k disturbance?, is p(0,0)" = e J2 /21 dt dt' it(tyr)(t - t') e-iw(* ~ *')«(*') _ e-2Re/21dtdt' ••• In taking the complex conjugate, also interchange t and £': (7.2.47) f dtdt'--- = f dtdt1 K(t)*r](t'-^e-'^-^Ktt1) . (7.2.48) Then the exponent in (7.2.47) is 2Re(Y d*d*' ..-)= f dtdt'K(t)*[ri(t-t')+r](t'-t^e-^^-t^Kit') . (7.2.49) But jj(* - *') + jj(f - t) = 1 + 0 for £ > £' 0 + 1 for t < t' : =1. (7.2.50) and we get
280 7. Harmonic Oscillators 2 Re (j dtdt' ■■■ J = f dtdt' n(t)* e~[ujt e[ujt'K,(t') = |7|2 (7.2.51) and p(0,0)re= e^lTl2 < 1, (7.2.52) with 7 as in (7.2.17). Next, only y" = 0; the system starts in n = 0, and we want the probability p(n,0)K = |(n,£i|0, £2)*! of finding any n at t\: n=0 v (7.2.53) Therefore, (n,h\0,t2)K = e-'mujt'^T-(0, *! |0, *2 }" (7.2.54) /n! and (l^l2)" l7|2" I I* p(n, 0)* = -i r^-pfO, 0)" = ii!— e~l7l (7.2.55) nl nl where, indeed, ]Tp(n,0)K= e^l e~lTl =1. (7.2.56) n=0 Also note that |7| is the mean value of n, 00 co 1 |2n ^np(n,0r = (nr = ^-LLe-|Tl = |7|2 , (7.2.57) n=0 n=l *■ '' so that one has a Poisson distribution of probabilities: p(n, 0)K = ^n\ ) e~<n)K . (7.2.58) n! Now we turn to the general situation and recall (7.2.16), 00 {vVMv\t2)«= ]T </|n> x(nM\ri,h)" x (n'\y") = (yfy/Vri. ={y")ni^. = ye-^ ^Ve-^'e-^^e-ire^V'^^ICt,)- . (7.2.59)
7.2 Driven oscillator 281 First we recognize that if we redefine /-> eluJhiu, y" -> e-'lujtHv and at the same time write k _ 0-'muti / n\ni\K An'ut2 (n,h\n',h)K= e~ma;tl(n|n') (7.2.60) (7.2.61) which is just making explicit the time dependences associated with the initial and final energies, we get, more simply, the generating function for the probability amplitudes (n|n')K, e—uv + u*y + v~/*= ST^ u" -n+ri \n\n / v" n,n'=0 \ I / v Then note a symmetry; The replacements 7* 7 u —> —v , i> —> —u 7 7* leave the left side unchanged. Therefore (7.2.62) (7.2.63) f* ""•„+„< ("I"')" ""' = f* [(7V7H"'.»+»-(n'ln)-[(7/7*H" „,tl0^ <0|0>- v^7! „tl0 V^T <0|0>- ^ (7.2.64) or (n'\n)K = (7*/7)n_n'(«!«')" (7.2.65) with the consequence p(n',n)K = |(n'|n)*|2 = p(n,n')K . (7.2.66) Now return to (7.2.62) and pick out the term proportional to vn /\/n7I: „^(7*-ti)n = y, ti".n+n,(n|n')" and, as a Taylor series in u, (0|0)« %/nTn7! \<W or (x = |7| - «7; m = 0 : x = I7I ) pw(4^:=_^=(„ir7n^ei7i2fd\ xn e x x=\j\2 (7.2.67) (7.2.68) (7.2.69)
282 7. Harmonic Oscillators We meet the Laguerre* polynomial of order n (integer), index a (arbitrary) L<£\x) = —aTa ex ( — ) xn+a e~x (7.2.70) so that M^>1 = ./Z(_i7*)»'--»L(n'-n>(|7|2) (0|0)« Vn'P 7 ; " U71 > ^(-^""-"'l^-^ItI2). (7.2.71) A convenient way to write the implied probability is by means of the larger (n>) and smaller (n<) of n and n', P(n,nT = ^(l7|2)n>~n< [L^>-»<)(|7|2)]2 e~M2 (7.2.72) J and, in particular, for n = n1 p(n,n)" =[LW(|7|2)]2e-|Tl2. (7.2.73) As a polynomial, L„ (x) is T (a)(-T\ _ _LT-a / _£ 1 ) Tn+a so A=0 T («) _ i Lj = 1 + a — £ , E,_ .¾ (n+a)! ,i" ,, (-1) („._ fcM ffc + „v IT ' (7.2.74) L2° = ^(1 + a)(2 + a) - (2 + a)z + ^2 , (7.2.75) for example, and also (n + a)l Tf_„w, {-x)n L^=^Kf' ^B)(*) = ^-> (7.2.76) which exhibit the first and last term of the series. The situation is particularly simple for H2 < 1 and Z\n = n-n'/0. (7.2.77) 'Edmond Laguerre (1834-1886)
7.2 Driven oscillator 283 Then the leading factor is (|7| ) and [L(»>-«<>(0)]2(|7|2)|4n| | |2|zln| 1/1 if An ¢0, (7.2.78) p(n,n')"a- n> n> n (|4r»|!)' for which p(n + l,n)*-(n + l)|7|2, p(n-l,n)*^n|7|2, p(n + 2,n)"~ i(n + l)(n + 2)|7|4, p(n-2,n)^-^n(n-l)|7|4 (7.2.79) are simple examples, for An = ±1 and An = ±2. Note that p(n — l,n) = p((n — 1) + 1, n — 1) and p(n — 2, n) = p((n — 2) + 2. n — 2) hold, as required by (7.2.66). For the probability for An = 0, p(n,n)K, we must use the first terms of the power series, l4°)(z) = 1 - nx + ~n(n - l)x2 + ■■■ (7.2.80) To first order in |7| <C 1, we get p(n,n)K^ (l-n|7|2) (l - |7|2) =* 1 - (2n + 1)|7|2 , (7.2.81) and we note that p(n,n)R +p(n — l,n)K + p(n + l,n)K = 1 + terms of order |7|4 , (7.2.82) a basic check of consistency. These results for small |7| hold only if n and n' are not compensatingly large, that is: (7.2.81) assumes that (2n + 1)|7| <S 1, which is not true for n|7| > 1. Let's return to (7.2.67) in the equivalent form e"T (l - £)"' = V ^Mu»-»'i»-»'M^ (7.2.83) V W ^0Vn! <0|0>« and consider the regime [7] <C 1, n' ^> 1, for which * \ n' 1 - — ) = en> log (1 ~ 7*/M) S e-n'Y/u _ (7.2.84)
284 7. Harmonic Oscillators We also concentrate on the situation where we have \An\ <C n'. Then, for example, 1 n! (n' + An)l (n')An and This suggests we put a /n7! V n! variable change «"""' S e. Indeed, 7= Me u f u > with ,ia -iat An 1 v^7 and get /in \ I I (7.2.85) (7.2.86) (7.2.87) (7.2.88) (7.2.89) We recognize here the generating function of Bessel* coefficients, Bessel functions of integer order: oo e5*(*-l) = Y, tmJm(z), (7.2.90) m= — oo so, with account for (0|0)re = 1, (■ \ An -ie1QJ JAn(2y/^\j\) (7.2.91) and p(n' + An,n'Y S [j^v^M)]' • (7.2.92) For the essential probability check, we note that [t —> 1/t in (7.2.90)] OO -j e-5* (*-«)= J2 ^rn(z) (7.2.93) m= — oo and therefore "Priedrich Wilhelm Bessel (1784-1846)
7.2 Driven oscillator 285 oo 1= Y, tm~m'Jm(z)Jm>(z). (7.2.94) m,m' = —oo Picking out the t independent terms establishes the sum rule oo l = y, [J»wi2 > (7-2-95) m=—oo so that, indeed 00 r i2 5>(n,n')K= £ [Jz\„(2^|7l)j =1- (7-2.96) n An= — oo The substitution t -> -tin (7.2.90) tells us that Jm(z) = (-l)mJ_m(*) (7.2.97) or p(n' + An, n'Y = p(n' - An, n')K , (7.2.98) an approximate version for n' 3> 1 of (7.2.66). We shall develop further properties of the Jm in Problems 7-11 and 7-12. For now let's notice that n' 3> 1 is not the classical limit, that requires also that \An\ 3> 1 and \fn1\^\ > 1. Let's put t = e1^ in the generating function (7.2.90): eIz(e*-e-* = eizsM= g eim*Jm(z) (7.2.99) or l which we get [j4n(2V^M)]' f2" A(t> izsin /o 2tt ~ J 2^^TTe <t> e-i"# -\An{4>~ — <J<m\Z) •> <t>') ei2^ |t| (sin 0 - (7.2 - sin^') (7.2 .100) .101) For \An\ > 1 and y/n/lj] 3> 1, the important region is <j>' — ¢. So write <f>' = <f> - 'ip, \i/j\ <^1, and <f> - 0' = -0, sin<j> — sin(j>' = ipcos0 to get ptn' + ^n')^^ / d^e L IM VJ . (7.2. 102)
286 7. Harmonic Oscillators The tp integration produces a delta function, 1 f2n / \ p(n' + An,n')K £ -/ d<j> 6 hVn* \j\ cos 0 - An) , (7.2.103) and the remaining <j> integral is non-zero only if \An\ < 2y/ri/\'y\. Substitute x = cos (fr and arrive at p(n' + An,n')K * - [ , dx S (2vV|t|z - An) n J-\ V1 - x2 ^ ' 0 if \An\ >2vV|t| , -[4n'|7|2-(Z\n)2]~^ if \An\ < 2^71 ■ (7.2.104) Consistent with the actual approximations, the total probability is unity: 5>(n' + An, n'T = E 1^=.* An An n ^4n'\j\2 -(An)2 7Ti-2V^|7l yjAn'^-v2 1 . (7.2.105) A deeper understanding of this classical statement as the limit of a quantum statement is gained by solving Problem 7-10. 7.3 Remarks on Laguerre polynomials A few words about the mathematical properties of L„ '(x) are in order. First, notice that \Jg)(x)=l-x-<*(!L-i\ x<*+" = ±x-«(A-i\ xxa~l+n ™ y n! \dx J n! \dx J = L-M)fA_iyitt-i+n n! \da; / + (^1)1^(^-1) x°+"" (7il) or ^)(3:)=^-1)^) H-L^Or), (7.3.2) an algebraic recurrence relation in n and a. Another version IS
and then 14^) = 1^( in S-' + iK'M 7.3 Remarks on Laguerre polynomials 287 --1 xaXn = ±(—-l + SL) xn (7.3.3) dx / n! V da; x / x / d a n! I dx x n+l I-1!^1 a — 1 n! V dx x n+l -n+l (7.3.4) or giving ~ ~ x + a) LW = (n + 1)1,^ = (n + 1) (l^ - L^) (7.3.5) (7.3.6) + i ' x~ - x + n + 1 + a J LW = (n + 1)L^ a differential recurrence relation in n. The equivalence of the two forms, for integer a, n\ (n + a)'. (~-x)aL^(x) = L^(x) = 1 (n +a)V dx n+a i"e * (7.3.7) n+a n+a tells us that (—l)a a integer : h[a) (x) = -—~ ex , , ™ v ' n! VdaT = (zl>!fA_i n! \dx Prom the latter we learn that A __ A L(a) = _L(a + l) dx v n a differential recurrence relation in a, or with (7.3.2) d xne'x dx T (a) _ T (a) _ T (a + 1) _ _T (Q + 1) un ~ Lln un — un~\ (7.3.8) (7.3.9) (7.3.10) which, although proved here for integer a, is generally true. The combination of the last result with [n —> n — 1 and a —> a + 1 in (7.3.5)] x~-x + a + l) 14,1+1) = nl4tt) is the differential equation d2 . 1 . d x-r-z + (a + 1 - x)- h n ax1 dx Li") = 0 (7.3.11) (7.3.12) the Laguerre equation.
288 7. Harmonic Oscillators 7.4 Two-dimensional oscillator We now turn to the oscillator in two dimensions. The Hamilton operator is H = u (ip? + \q\) -\u,+u, (\pl + \q\) - \u = u{y\y1 + yly2) , (7.4.1) which is specialized in that oji — 0J2 = oj ; (7.4.2) that is: this two-dimensional oscillator is isotropic. Clearly the energy eigenvalues are Eni,n-2 = n\u + n20J = Noj (7.4.3) where N = ni+n2=0,l,... . (7.4.4) With the exception of N = 0, when ni = n2 = 0, these eigenvalues are multiple, or degenerate: N is produced by m = N, n2 = 0;nj = N — l,n-2 = 1; ... ; ni =0,n2 = N; there are AT +■ 1 choices in all, so that m(N) = N+1 (7.4.5) is the multiplicity of energy Nu. Closely associated is the freedom to change the variables by a two- dimensional rotation: I7i = 2/i cos a + 3/2 sin a , 1/1=2/1 cos a + j/] sin a , j/2 = —j/i sin a + y2 cos a , y\ — ~v\ sin a + y\ cos a . (7.4.6) One checks immediately that the commutation relations are preserved, that, is [Vk>Vi] = [vlM] =° and [VkM]=ski for k,l = 1,2, (7.4.7) or, more fundamentally, that the Lagrangian maintains its form: £(2/,2/) = i £2/^24 -"Y,VkVk = L = L(v^) ■ (7-4-8) k k Two-dimensional rotations are conveniently expressed by 2/+ = -^(2/1-12/2), 2/+ = -^(2/1+12/2). 2/- = -/| (2/1 + 12/2) , yl = -^(2/1 - 12/2) , (7-4.9)
7.4 Two-dimensional oscillator 289 and again the form is maintained L =i Yl vt~fiV* - u Yl vtv* (7A1°) cr=± cr=± and, therefore, so are the commutation relations, as demonstrated by [2/+,2/+] = \[vi -12/2,2/1 +12/2] = I[2/1-2/1] + I[2/2-2/2] = l, [2/+,2/1] = |[2/i -12/2,2/1-12/2] = 5 [1/1,1/1] -\[v*,v\] =0, (7.4.11) for example. The effect of a rotation is simply y+ = eiUty+ , y+= e iU-y\. , y_ = e~iay_ , yl = eia2/l , (7.4.12) and the maintainance of the commutation relations is transparent: [1/+,2/+] = [eiay+, e-iayl] = [y+,y\.] = 1 , (7.4.13) for example. Consider an infinitesimal rotation, V±=V±- Sy± = (1 ± i6a y±) (7.4.14) with 1 1 P\r^ 6y± = Ti6ay± = -[y±,G] = - —T (7.4.15) 1 1 9y± so that the generator is G = fo L3 where L3 = j/^j/+ - j/Ly_ . (7.4.16) Alternatively £3 = 2/1-2/+ ~ 2/I2/- = 1(2/12/2 ~ 2/22/1) = ¢1½ ~ ?2Pi (7.4.17) as checked, for example, by Sq1 = T[qi,daL3] = —6aq2 , Sq2 = T[q2,SaLz] = 6aqx . (7.4.18) Evidently, these are infinitesimal two-dimensional rotations indeed, and we recognize in L3 the two-dimensional version (or third component) of the three-dimensional orbital angular momentum vector L — q x p.
290 7. Harmonic Oscillators Using the ya=± variables, the energy is En = En+tU_ = u(n+ +n_), still with multiplicity m{N) = N + 1, of course. But now the different states of common N are labeled by the eigenvalue m of the angular momentum L3, (7.4.19) L3' = that is N = 1 N = 2 N = 3 m — n+ — n~ = N m = 1,-1 m = 2,0,-2 m = 3,1,-1,- - 3 2n_ = N,N- [m(N) [m(N) [m(N) 2,,.. ,-iV, = JV+1 = 2] = JV+1 = 3] = JV+1 = 4] (7.4.20) the count of states is correct. Question: What are the q wave functions for the state of definite energy and angular momentum? Of course, we know for one degree of freedom that (q\n) 1 V2nnl and then for two degrees of freedom: H„(g)e-592 (7.4.21) (qi,q2\ni,n2) = (<7i \ni)(q2\n2) 1 V^nini!2"2n2! Hni(gi)H„2(g2)e-5(9f +«!) , (7.4.22) which are states of definite energy .Ejv = oj(rii +71-^), but not states of definite angular momentum L3. For that we go back to the generating function in one degree of freedom (7.1.25), so for two degrees of freedom (dropping primes) we have (qi, <?2 (2/1,2/2) = it 2 e Wl + 92) + \/%i2/i + 922/2) - f (2/? + 2/22) ; (7.4.23) which we proceed to rewrite in terms of y± (numbers!) and also introduce polar coordinates, <?i q-2 q\ ± i<?2 pcos< psinc; pe ±\<j> (7.4.24) So <&+<& P2 , \ (2/1 + 2/1) = 2/+2/- , V%l2/1 + q2y2) = p(y+ e1^ + y^ e™1^) , (7.4.25)
7.4 Two-dimensional oscillator 291 and <P,^+,y-)=7r-ie"^-+^+ei0 + y-e i0)e-V n+,n_-=0 2/+ 2/- ^+! yn-J (7.4.26) Now we remember (7.2.62) and (7.2.71), which amount to -uv + «7 + «7 _ *= £ n,n'=0 Jf__H_in+n' /!Ll^_j^n-n'T.(n-n') VnlVnJl V n! (-i7)B-BLjTB'(l7r), (7.4.27) a generating function for Laguerre polynomials. So the substitutions 2/+-> w , 2/--> v , 7—>pe n —> n+ = A^ + m N -m n —> n_ = (7.4.28) give us the wave functions 1/^^(/9,0) = -7=(-1) =me1; /f JV-mM I 2 /■ mT(m) . 2-, -i/92 (-JV+myl LI(W-m)(P )e 2^ Z7T (7.4.29) where the familiar azimuthal wave functions (2w) 2 elmcP are multiplied by the radial wave functions PN,m{p) = PN,-m(p) ]V-|m| = (-1) >V 2 /■ |m|T(|m|) >l) ,2^P-^2 (/) e (7.4.30) These wave functions are, of course, complete and orthonormal. Expressed in polar coordinates, the area element is dqidq2=dppd(j) , (7.4.31) and orthonormality takes the form SNN'Srnrn' = / dppd(j)1pNtrn(p, (f>)*1pN>,m> (p, </>) = ^' dPPPN,m(p)PN>Mp) r^e<m'-m)<l> (7.4.32) Jo Jo 2n = S„
292 7. Harmonic Oscillators which reduces to />0O / dppPN,m(p)PN',m(p) = $NN' ■ (7.4.33) JO To present this most simply as a property of Laguerre polynomials, we write N =\m\ + 2np , n„ = 0,1,... (7.4.34) (np is the radial quantum number) and p2=x, \m\ = a = 0,1,... (7.4.35) to get (np —> n) dx xa e~xL^ (x)L^ (x) = 8nn, ^-^ . (7.4.36) F Jo This orthogonality statement about Laguerre polynomials is true for arbitrary values of a. The simplest example is n = n' = 0 : f Jo dxxae"x = al, (7.4.37) Euler's familiar integral representation of the factorial function. What is the form of the Schrodinger equation we have solved? Here, the Hamilton operator is that in (7.4.1), and so a wave function -0((/1 ,¾) for the energy E = w(|m| + 2np) obeys If d' d' \ 1,, 9X + 2n> + l + ~2\M + W~*{qi+® ¢ = 0. (7.4.38) We know, however, that such a wave function appears naturally not in rectangular but polar coordinates. That brings up the question of expressing the Laplacian differential operator in curvilinear coordinates, both in two and three dimensions. Consider a three-dimensional coordinate system, ui,u2,u3 - that is: a parameterization of the position vector, r = r(ui,U2,u3). Infinitesimal changes of the coordinates, , dr , dr , dr dr = d«i -— + du2T, h d«3-— OU\ OU2 ou3 = d«i h\e\ + d«2 h'ie.2 + dw3 /i3e3 , (7.4.39) define the (local) set of unit vectors e*, k = 1,2,3, and the positive metrical functions hk(ui,U2,u3), k = 1,2,3. We take for granted that the coordinate system is orthogonal and right-handed, so that e.j ■ eu = Sjk for j,k = 1,2,3 and e\ x e-i = e3 . (7.4.40)
7.4 Two-dimensional oscillator 293 Then ds2 = dx2 + dy'2 + dz2 = h\du\ + h\du\ + h\Au\ is the (squared) distance of neighboring points, (dr) = da; dy dz = /11¾¾ d«i d«2 d«3 = h is the volume element, and (7.4.41) (7.4.42) Vr/> 15 15 13,, h\ OU\ /12 OM2 AI3 OM3/ (7.4.43) is the gradient of 'tp(r) = tp(ui,U2,u3). To find V2-0, we apply Green's identity I (dr) V ■ Vr/> = / dS ■ Vr/> , (7.4.44) which relates the volume integral (over V) of the divergence of vector Vr/> to the surface integral (over the boundary 5 of volume V) of this vector field, to a small volume bounded by two surfaces each of constant u\, constant «2, and constant «3, and we learn that the Laplacian differential operator is given by VV = t d h d d h d d h d du\ h\ du\ du2 h\ dui du% h\ du% = h2h3/hi ip (7.4.45) with its obvious simplification in two dimensions. For two-dimensional polar coordinates p, <j>, for example, we have and hp = 1 , /i0 = p , h = p d d did 2ji2 dp^dp > pd<f> ds1 = dp1 + p'dcj) d2 Id Id2 dp2 p dp p2 dcp2 (7.4.46) 1 (4)! + ^ \d4>J (7.4.47) Then, with </> ^PN,m{P) , -TT^ = -m2xP (7.4.48)
294 7. Harmonic Oscillators we get (JV = \rn\ + 2np) -,2 J P(p) = 0 (7.4.49) Id m dp2 ' pdp p ^ + ^-^+2^1+^0 + 2^2 in which |m|, freed from the relation to <f>, is arbitrary; np is a non-negative integer. The differential equations easily suggest the general character of the wave function. For large p the dominant terms in (7.4.49) are $-P2)P = ° (7"4-5°) and a suitable (bounded) asymptotic solution is P(p) oc e~~y2 , (7.4.51) for which % = -<*■ 0 = ^-^ = ^- <7452» For small distances the dominant terms are Kdp2 pdp p so that P(p) oc p±m [of which P oc p|ml is bounded] (7.4.54) since Then, on writing P{p)^p\m^-'12p\{p1) (7.4.56) we get a differential equation for L(x = p2) that is the Laguerre equation (7.3.12). Let us also note the form of the differential equation with the first derivative removed. For that, the identity d2 „ ( -a& a\ ( d "X 2 ,2 +--^-^)^ = ^ (7-453) P T^P -\P —,P) = I — + dp2 \ dp J \dp p d2 2a d a2 - a dp2 p dp ,2+^+^2- (7A57)
7.5 Three-dimensional oscillator 295 or + 2a d a — a + dp2 p dp p P =d? is useful. So, with a = ~ and P(p) = —u{p) in (7.4.49) we get, dp2 m2 - i ^ + 2|m| +4np + 2-p2 u(p) = 0 and (7.4.33) turns into the orthonormality statement / &PU\m\,ni,{p)u\mlnli>(p) = £„„„;, </ 0 for the radial wave functions «|m| '"'^ l ' \j (np+\m\)lP n» [P ' (7.4.58) (7.4.59) (7.4.60) (7.4.61) (7.4.62) 7.5 Three-dimensional oscillator How about the isotropic oscillator in three dimensions? Its Hamilton operator is 3 H = X>^* + ql ~ l) = U\{p2 + q2) ~ lU ■ (7-5-1} k=i This time, for a change, we begin with the Schrodinger equation, !+i*-i«"+i)*<«>=o, (7.5.2) and, as a convenient way of exploiting the obvious freedom of three-dimensional rotations, we introduce spherical coordinates: qi — psine/ cos< (h—p sin 9 sin (, fts = p cos 9 , (7.5.3)
296 7. Harmonic Oscillators for which hp = 1 , he = p , hff, = p sin 0 , ft = p2 sin ( so that ds'2 = dp2 + p2d92 + p2 sin2 6&<j>2 , (dg) = dgi dg2 dg3 = dpp2 A9 sin# &<j>, (7.5.4) (7.5.5) and -p2 -> V2 = p2 sin ( 9 2 . 9 d p2sin0 d d p2 sin9 d —p sm#—+ — ^ qq+ T^^r. ff2 2 5 1 dp2 p dp p2 1 d . .6 :Sin6»— + -T-2 cw sin /r sin ( 1 52 sin e 89 (7.5.6) Alternatively, it is clear that we must deal with angular momentum, three- dimensional angular momentum, and its differential-operator' representation, i=qxp->qx-rV = -tV x q (primes omitted), and specifically L2 -> —{q xV)x(qxV) = VxqqxV = V ■ [q x (q x V)] = V • (q q ■ V - q2 V) = (9 • V + 3)qf • V - p2 V2 - 2q • V (7.5.7) (7.5.8) or, with q ■ V = /9-^-, 5 rZ , ( d \2 > d 2^,2 2 ^ , o U 2^,2 ^ "^ (V +^"^V =Pdp-2+Pdp"PV so that 52 2 d /92 0/92 /9 Op (7.5.9) (7.5.10) which identifies the angular derivative term in [— p2x ]V2 of (7.5.6) as the differential operator representing L2. Symbolically, we write (L1) = —7—7;— sin V 'dm 0 1 d2 sin 6» 56» 56» sin2 6» ■ and, in the same vein, (L) =1^ (7.5.11) (7.5.12)
7.5 Three-dimensional oscillator 297 In Section 3.7, we have already talked about solid harmonics, plYim(9, ¢), which satisfy Laplace's equation: V2(p'l/m) = 0. But so (L2)d]ffYim = 1(1 + l)Yim I and, of course, (L3)diff y,m = ml|m I (7.5.14) and the Yim(6,(j)) are the wave functions of orbital angular momentum with (as we already know) integral values of I. Accordingly [recall (7.4.58) with a = 1 here], we write ^(p,M) = -"(p)lWM) (7.5.15) P and get the radial differential equation d2 1(1 + 1) 2E dp2 p2 u + — +3-/9 u(p) = 0 . (7.5.16) It has exactly the same form as the two-dimensional equation (7.4.60), with the correspondence two-dim, oscillator three-dim, oscillator \m\ 1+ \ (7.5.17) 2|m|+4n„ 2E/u + 1 and, therefore, with \m\ —> I + |, we get IE to or + l = 2(Z+|)+4np (7.5.18) E = Ncj with N = l + 2np = 0,l,... . (7.5.19) The energy states are degenerate (except for N = 0, where I = np = 0 is unique). The multiplicity is m(N) = \(N + l)(N + 2) since we have for even N N I N 1=0,2,... m=-l 1=0,2,... N -4-2 1 = -f- x(N + l)=~(N + l)(N + 2), (7.5.20) number average
m(N) = J] (2/ + 1) = ^±i x (N + 2) = ±(JV + 1)(N + 2) . (7.5.21) 1=1,3,... 298 7. Harmonic Oscillators and for odd N N* <9i+v = a 2 v ' 2 number average In Problem 2-34 we got this from the alternative form — = JV = m +n2 +n3 . (7.5.22) In view of the orthonormality (3.7.45) of the spherical harmonics, the orthonormality of the wave functions / (d<Z)VVm(<Z)*-0Vm(<Z) = (7.5.23) where (dg) as in (7.5.5) and ^n„im{q) = -ts,j(p)3MM) , (7.5.24) reduces to />0O / &PUnpl{p)U<l{p) = ^nPn' (7.5.25) •/0 which is automatically satisfied by the two —> three-dimensional construction «n» = (-l)n'M^f^/+1L^\p)e-hP2 . (7.5.26) We easily check the normalization for the simple situation np = 0 : / ^(^ = -^-/ <W(^)('+*>e ?\('+h) P-p2 1 ^0° 77-^/ <W'+*>e-* = l, (7.5.27) indeed. Problems 7-1 Concerning the Lagrange operator (7.1.4): Find the additional term needed in the action operator 12 =y [ly^dy - H(y,y\t)dt] + (?) Wl2 = '2
Problems 299 such that 6W12 = {Gy1+Gt)1-{Gy + Gt)2 as required by the action principle applied to (yt ,t\ \y",t2)- [Hint: Problem 5-11.] 7-2 Dimensionless variables: H = \p2, [q,p] = i. Construct {q',t\y"}, for which it is sufficient to consider time variations, in conjunction with the known initial value (q'\y")- Recall that the normalized minimum uncertainty state is [y" = (q" + ip")/V2] \q"y)=\y")e-W\\ State {q',t\q",p") and work out the probability density |(g',£|g",p")| ■ Check that it is normalized in q', give its physical meaning, and make contact with the ips(x,t) of (5.4.25). 7-3 Consider the generating function (7.1.25) for the oscillator wave functions ij}n(q)- Show first that i/>n(q) = vn\t ~nl^e-^(q\n) :i with t > 0 . y = te1f Then establish the circumstances under which the <j> integral allows for a stationary-phase evaluation. Conclude that ipn(q) is exponentially small for q2 > In and find the corresponding approximation for q2 < 2n. Does the extreme semiclassical approximation 1^,((/)|2 eJ^2"-*2) for?2<2n> { 0 for q2 > 2n , emerge in the limit of very large n? 7-4 The driven oscillator: The driving force n(t) can be turned on, and off, smoothly as in K(t)=e-ime-^2 , or abruptly as in K(t)= e-'li2tri(±T - \t\) . Evaluate 7 of (7.2.17) for both choices. Give a rough graphical comparison of the two 171 , as a function of fl —u, for the situation
300 7. Harmonic Oscillators \tt — L)\ = — , (J ^> — . T T How significant is the change, in these circumstances, if complex e""1 * is replaced by real 2cos(/?t)? 7-5a Dimensionless variables; Hamilton operator H = 0J±(p2+q2)-fq, which describes an oscillator subjected to a constant force. Find the energy eigenvalues and q wave functions in terms of those for the undisturbed oscillator (/ = 0). Then find the analogous p wave function relationship. 7-5b Write the ground state q wave function for the disturbed oscillator (/ 7^ 0). Then apply the (q'\y') generating function of undisturbed oscillator wave functions to answer this question: If the system is in this //0 ground state, what is the probability that a measurement of 0J^(p2 + q2) will have the outcome nw? 7-6 Dimensionless variables; Hamilton operator with real parameter Q. Apply perturbation theory to get the leading change in the energy values for small fi. Can you find the exact eigenvalues? [Hint: Introduce q and p.] 7-7 Write the equations of motion for the Hamilton operator of Problem 7-5a. Evaluate I dt'q(t') Jo in terms of p(t) and p. Then use the action principle to get TfVWY thereby obtaining {p',t\p"Y in terms of {p',t\p"\f~°. Recognize here the same relations found in Problem 7-5a for energies and p wave functions. 7-8a Driven oscillator: The solution (7.2.9) of the equation of motion gives y(h) in terms of 2/(^2) and k. Construct the operator N\ = 2/(^1)^2/(^1)- What is its expectation value for a system initially in the nth state? Interpret this in terms of energy fed into the system by the driving force.
Problems 301 7-8b Evaluate the analogous expression for Nf and interpret the result in terms of the dispersion of the energy transfer. 7-9 Dimensionless variables; Hamilton operator (7.2.1); minimum uncertainty states \q',pf) of (2.7.38). Write out (yf = y'*) (q',p',ti\q",P",t2y = e-H^T (/,^1/,^)^-11^12 and check that I/ / /,1// II , \k|2 -|u'pia;il - j/'p^*2 +i-Y|2 \(q,p,h\q,P ,h)R\ = e \y e V e +17l = e_|(/_(,''e-iw(*i-*2)+i7e-iwti|2 _ Why does this combination of y(t\), 2/(^2) values look familiar? Verify the probability normalization j^~-\(q',p',h\q",P",t2y\2 = 1. 7-10 Driven oscillator: In the classical limit, the probability distribution becomes infinitely sharp. What is the final energy of the oscillator, in terms of the initial energy, I7I, and the phase angle between the initial motion and the external force? If the phase angle is unknown, what probability distribution emerges for the final energy? 7-11 Use the generating function of Bessel coefficients to write out the infinite power series for 3m(z). [Although thus derived for integer m, it holds generally.] Write the analogous generating function and power series for the related function lrn(z) = PmJm(i*) . Apply both generating function and power series to verify that J-mCO = (-l)mJm(^) for integer m. What is the analogous statement for I_m(^)? 7-12 Take the Laguerre polynomial generating function (7.4.27) and, with the substitutions u = VX e^ , v = V\ e-1* , H = v^ , arrive at the alternative generating function
302 7. Harmonic Oscillators (As)*° ^ } (n+a)!L" (*j ' which, although thus proved for integer a, is true generally. 7-13a Concerning Laguerre polynomials: Return to Problem 2-26 with /(2/f) = {v^)a, a arbitrary; f(y) = ya, a = 0,1,2,... . Place / = x, y = d/dx, and let the equivalent forms operate on e~~x. For arbitrary a, arrive thereby at yet another generating function: e~i^ = Y^ A™L(a)M (1-A) n=0 with L,W(x) = ^x-aex (-^-) xn+ae-~x n! \dz, as in (7.2.70), and, for non-negative integral a, at the same expansion with the equivalent form f-na / H \ n+a Recognize that the latter is equivalent to ■*'<*>-^(e-O 7-13b Derive the recurrence relations (7.3.9) and (7.3.10) directly from the generating function in Problem 7-13a. 7-13c Use the power series expansion for L„'(x) to prove that (7.3.10) is true for arbitrary a.
8. Hydrogenic Atoms 8.1 Bound states Now we are going to do a nice little trick: turn one kind of dynamical system into another one. Begin with the differential equation (7.4.60) that determines the energy eigenstates of the two-dimensional isotropic oscillator, and put dp2 + 2|m| +4n„ + 2- p2 u(p) = 0 (8.1.1) 2Ar with A > 0 (8.1.2) Then and therefore d d i—— d i—— d -1 r— r—-h — l—\ (8.1.3) d2 j__d_ dr2 2r dr m2 - ■!■ A __4+^(H+2Bp + l)_A2 i(p) = 0 . (8.1.4) Here we have the a = \ case of (7.4.58), and the function change u(p) = C(\r) *u(r) , (8.1.5) with a (positive) proportionality constant C to be determined later, gives dr2 m 1 A(|m|+2nr + l) 4r2 + -A2 u(r) = 0 , (8.1.6) where the radial quantum number np is renamed nr. Now, for any spherically symmetrical potential V(r), the radial Schrodin- ger equation for a particle of mass M is
304 8. Hydrogenic Atoms K2 E~^2MV2 + V^ u(r)Ylm(e,4)=0 or d2 1(1 + 1) 2Mrr/ , 2ME v L V(r) A w(r) = 0 . (8.1.7) (8.1.8) The evident correspondence between the two equations tells us that we're now dealing with a 1/r potential which we identify with the attractive Coulomb* potential between the electron charge —e and a nuclear charge Ze Ze^_ r V(r) = -— , (8.1.9) the potential energy of a hydrogenic atom. Then we have the correspondence two-dim, oscillator three-dim. Coulomb 1(1 + 1) \{m2 - 1) A(|m| +2nr + 1) A2 2M ~w 2M ~W -Ze2 (-E) (8.1.10) and so first |m|2 -> 41(1 + 1) + 1 = (2/+1)2 , that is |m|-»2J + l and then Ze'2M Z X(2l + 2nr + 2) -> 2—^7— , that is A -> , rr noo where n = nr + I + 1 is the principal quantum number (or energy quantum number), and h2 a0 = Me2 (8.1.11) (8.1.12) (8.1.13) (8.1.14) is known as the (first) Bohr radius. Therefore K2 e2ao n2aQ (8.1.15) or * Charles-Augustin de Coulomb (1736-1806)
8.1 Bound states 305 Z2e2 -En =^2^- with n= 1,2,3,...; (8.1.16) these are the Bohr energies. Except for n = 1, when I = 0 and nr = 0, the energy states of a hydrogenic atom are degenerate. In general their multiplicity is n—1 n—1 m(n) = £(2J + 1) = Y, [(' + I)2 - J2] = «2 = 1,4,9,... . (8.1.17) 1=0 1=0 Here are some numbers pertinent to atomic physics. If M is the electron mass and e the elementary charge, M = 9.10939 x 1(T28 g , e = 4.80321 x 10"~10 esu , (8.1.18) then the Bohr radius is a0=0.5292 A (8.1.19) [1 A = 10~~8 cm, Angstrom* unit]; it sets the atomic length scale, and Me4 e2 ^==-- = 13.606 eV , (8.1.20) Inr 2ao called Rydbergt energy, sets the energy scale: E„ = — R^Z2/n2. Corresponding atomic scales for frequency, wave number, time, and velocity are given by R00/(2ttH) = 3.2898 x 1015 Hz , Roo/(2TThc) = 109 737 cm"1 , h/(2ROQ) = hao/e2 = 0.0242fs , ^/W^JM = e2/h = 2.188 x 108 cm s""1 , (8.1.21) respectively. Now, what about the wave functions of the hydrogenic system? We have the relation (8.1.5) between the oscillator u(p) and the Coulomb u(r), but need to determine the proportionality constant. Since />0O />0O 1=/ dp[u(p)]2=C2 / &y/2Xr{\r)~-*[u(r)f (8.1.22) Jo Jo " v ' we find i i l dr~[U(r)]2] =2* (-) (8.1.23) * Jonas Anders Angstrom (1814-4874) fJanne Rydberg (1854-1919)
306 8. Hydrogenic Atoms which gives C in terms of the average value of 1/r in the particular hydrogenic state, which, at the moment, we don't know. On the other hand, we could have turned it around: 1 = J™ dr [u(r)}2 = ^ jT ^f^Hpf (8.1.24) gives ,.1 . . />oo C = 2^A-i(p2)2 with (p2} = dpp2[u(p)]2 . (8.1.25) Do we know, or can we easily find, (p2)? Sure! First, for one degree of freedom: In Section 6.3 we observed that the oscillator energy states are the stationary-uncertainty states of Section 2.4, and there we had found, in (2.4.22), P2) =(q2) =n+|, (8.1.26) n giving back M^H«^+(«V>) n+|+n+i-l = 2 2 = n . (8.1.27) For the two-dimensional oscillator, we have, then, P2) =(q2i+ql) =ni+Tt2+l=|m|+2np + l (8.1.28) = N = \m\ + 2np so that with (8.1.11) and (8.1.13) //) ->(2/ + l) + (2np +1) = 2(/ +nr+ 1) = 2n. (8.1.29) /n I nr Then, since A = Z/(nao), we have C = 2i(a0/Z)in, (8.1.30) giving I\ =-£-; (8.1.31) r/„ n2a0 more about this in Section 8.2.
8.2 Parameter dependence of energy eigenvalues 307 So now we are told that (8.1.2) and (8.1.5) with (8.1.12) and (8.1.30) turn the oscillator u(p) of (7.4.30) into the hydrogenic wave function /+i /n*-_N Zr (8.1.32) or 97. with x = . (8.1.33) nao The simplest example is, of course, nr = 0, n = I + 1, «—(') = (-)'^(-)"«--. (»■•■") \nao/ V (2n)! V«ao/ in particular, Rio(r) = ( — J 2e «o (8.1.35) for the lowest-energy state, the ground state in which n = 1, I = 0. As a check of consistency, we evaluate the normalization integral and find, indeed, / dr r2 [#„,„_!(r)f= / dxx2-±-x2n~2 e~x = 1 . (8.1.36) 8.2 Parameter dependence of energy eigenvalues The incidental evaluation (1) =4- <<"»> W„ n «0 directs attention to a general question: What information follows by knowing the dependence of energy eigenvalues E on various parameters A? Consider the eigenvector equation [E(A)--ff(A)]|E,7> = 0 (8.2.2) and its adjoint (E,-y\[E(\)-H(\)]=0. (8.2.3)
308 8. Hydrogenic Atoms Differentiate with respect to A: ^m_nm^) + [m.am^)=0, „,4) and multiply with (E,j\ to get ^ = (^)- This is frequently called the Hellmann*-Feynmant theorem. We'll reconsider these matters in somewhat more detail in Section 9.6. As a first example, consider the one-dimensional oscillator for which "=& + ¥* = '+*■ <»*» where T = p2 /(2M) is the kinetic energy and V = \Muj2x2 is the potential energy. We recall that the dependence of the nth energy eigenvalue upon the mass M (no dependence) and the frequency u (linear dependence) is given by E=(T) + {V) = (n + \)hu> . (8.2.7) Differentiation with respect to M gives bh Fit? MdM = "~T + V ' MdM = ° S° that (T) = {V) ' (8-2-8) and differentiation with respect to u gives u = 2V , w- =E so that 2{V)=E. (8.2.9) OUJ OUJ Together they say (T) = (V) = \E , (8.2.10) which gives the correct sum of (T) and (V), as it should. Now try three-dimensional hydrogenic atoms, v2 Ze2 H = T + V withT=-4r, V = , (8.2.11) 2M r and Z2e4M "Hans Hellmann (b. 1903) fRichard Phillips Feynman (1918-1988)
3.3 Virial theorem 309 We differentiate with respect to the mass M, fij-f be MdM=~T> MdM=E S°that W = ~E, and with respect to the nuclear charge Z, rdH ~dZ V , Z— = 2E so that (V) = 2E , which give the correct sum, (T) + (V) = -E + 2E = E The result in (8.2.14), presented as 1\ Z2e2 -Zez n2ao is the known n2ao (8.2.13) (8.2.14) (8.2.15) (8.2.16) (8.2.17) 8.3 Virial theorem There is a related transformation in which we change the scale of the p's. Consider a three-dimensional system H = ^ + V(r)=T + V, L=P.^-H and the infinitesimal transformation 5r = 5X(t)r, 5p = -5X(t)p. Then the induced change of the Lagrangian is 5L=p- r—6X + 26XT-6Xr-W at g's and (8.3.1) (8.3.2) d_ di (5Xp-r)+Sx\i-^-(p-r)+2T-r- w] , (8.3.3) and the stationary action principle, 5W\2 =G\ — G2, applied to 5X gives Gx=5Xpr (8.3.4) and
310 8. Hydrogenic Atoms ^-lp-r) =2T-r- VV , (8.3.5) dt v which is known as the virial theorem. Note that G\ is indeed a generator: Sr=l[r,Gx} = ^=6\r, 1 BC Sp=-\p,Gx} = -^ = -5XP. (8.3.6) The importance of the virial theorem lies in the remark that for a state of definite energy E, a stationary state, the expectation value of the time derivative dF/dt of an operator that has no parametric time dependence - that is F = F(r,p) or dF/dt = 0 - vanishes: '^ ={E\±(FH-HF)\E)=0. (8.3.7) E E When applied to F = p ■ r, this implies 2 (T) = (r ■ W) (8.3.8) for these expectation values in a stationary state (= eigenstate of the Hamilton operator). If V(r) is of degree n, that is V oc r™, then r-VV = r^-V = nV (8.3.9) dr and 2(T) = n(V) . (8.3.10) In conjunction with (T) + (V) = E, then n 2 (T) = —E , (V) = ^-2E . (8.3.11) Thus, for the oscillator, n = 2 and (T) = (V) = \E ; (8.3.12) and for hydrogenic atoms, where the Coulomb potential has n = —1, (T) = -E , (V) = IE , (8.3.13) as seen in Section 8.2. From the viewpoint of the one-dimensional radial motion described by u(r), we have Pr — j~§p and
f^t + 'lLtp* 3.3 Virial theorem 311 (8.3.14) so that effectively H^Hl = £ + <(< + ^ + y(r). 2M 2Mr2 (8.3.15) Then, with A = /in (8.2.5), dl ~ M ^+2J\r2 (8.3.16) For the Coulomb example, E = — |(Z2e2/a0)(nr +1 + 1) 2, we get dE Z2e2 1 H2 „ ,,/1 5/ a0 n3 Af or (8.3.17) (8.3.18) a20 «3 (/ + |)' whereas, for the three-dimensional isotropic oscillator, .E = fkj(l + 2nr) and or dE * ^n i\/1 -=^ = -(/ + 5)(^ Mu 1 ft / + (8.3.19) (8.3.20) A more subtle kind of average occurs when, beginning with the Schrodin- ger equation (8.1.8), d2 1(1 +1) 2M ^dr2 we differentiate, d2 1(1 +1) 2M + ^(E-V) u(r) = 0 , dr2 + -^(E-V) _2^1)+^ (8.3.21) u (8.3.22) (primes denote r derivatives), then cross-multiply both equations to get d r3 + ft2 w2 = dr (uu" - «'2) (8.3.23) leading to
312 8. Hydrogenic Atoms -21(1 + 1)^ + 2^. (V') = [u'(0)f = [R(0)]2 . (8.3.24) Here we made use of uu" —> 0 as r —> 0, which is an immediate consequence of u(r) oc r'+1 for r > 0. For I / 0, (8.3.24) is the three-dimensional isotropic oscillator, V = ^Mu}2r2, thus has and for the Coulomb field we find (I / 0) 1\ - l M7r* /1\ - Z * l (R 1 27) r3 / 1(1 + 1) ft2 \ r2 / a30 n* 1(1 +1)(/ + 1)- ^^ '> Now, for I = 0, we have 2M ft2 For the three-dimensional oscillator thus 2 (V) = [#(0)]2 . (8.3.28) 2(ir) <r> = ^°)r> (8-3-29) and for the Coulomb potential The latter is quite simple; does it indeed follow from [1 = 0 and r = 0 in (8.1.33)] ^0(0) = (-1)-(1)^^(^(0)7 (8.3.31) Yes, since according to (7.2.76) 4^)(0) = (li±^! so that 4^(0)= (-^.!=n, (8.3.32) as required. As for the three-dimensional oscillator, relation (8.3.29) says [dimensionless variables and I = 0 in (7.5.26)]
.4 Parabolic coordinates 313 (P) x(r) (r-+0) 1 = t2- np\ 2 K + l) nj^tH n„! K + l)! ("p + I)-' n„! Hi P- 2- («p + 5)! I (l\ m See Problem 8-16 for an explicit check. (8.3.33) 8.4 Parabolic coordinates The wonders of the Coulomb potential do not cease with the connection to the oscillator. Despite the evident spherical symmetry of the problem, there is another useful coordinate system, one with a preferred direction - parabolic coordinates (£ > 0, rj > 0, 0 < 4> < 2n): x = 2y7;?7 cos < 77 = 4 \/x2 + y2 £ = 4 £ = i 2 3 in terms of which the length r of the position vector r is = \/x2 +y2 + z2 = Ji&j + (£ - 77)2 = £ + 77 and £ = 5(7- + 2), r]=\{r-z) emphasize the privileged role of the z direction. With H It + V (8.4.1) (8.4.2) (8.4.3) ^=m/—> h0 = 2V&i, /i = 2(£ + r7) (8.4.4) £ y v in (7.4.39) and (7.4.42), we have [cf. (7.4.41)] ds2 = =^(d£)2 + =^77)2 + 4tv(dcP)2 (8.4.5) and [cf. (7.4.45)]
314 3. Hydrogenic Atoms _o 1 2(£ + r7) 1 d „d d d d£+n d e + r? d£ ^d£ dr) 'drt dcp 2£v d<p 1 d2 d_ d d_ d_ d£^d£ + d^T1dr] + 4£Vdcj>- 2 " Both this Laplacian differential operator and the Coulomb potential V-- — Ze2 £ + r? (8.4.6) (8.4.7) have £ + rj in the denominator and so, after multiplying with (£ + rj), the Schrodinger equation V + £<S-V) ¢ = 0 reads [(2M/H2)Ze2 = 2Z/a0 is used] d d Id2 2ME d d Id2 2ME 2Z d£^d£ + 4£d4? + W- ^ + dr]ridr] + 4:qd^ + H2 V + ~^~ (8.4.8) (8.4.9) First, it is clear that we can classify states by the eigenvalues L'z = mh of Lz [factor (2tt)-* eim<t> in the wave function]. Then the differential operator is additive in £ and rj, so we write ^ = .1^4^)¼ V2?r V£ v^ (8.4.10) and get the pair < af equations " d2 m2 - 1 2ME 2Zi" [de2 4e ' n2 +a0e. "d2 m2-l 2MB 2Z2 dry2 4rj2 H2 aorj «(£) u{rf) = 0 = 0 (8.4.11) with Z\ + Z2 = Z. We set along side these the differential equation for u(r), that is (8.1.8) with (2M/H2)V = -2Z/(a0r), dr2 r2 where we know that 2ME (Z/a0)2 1(1 +1) 2ME 2Z h2 a0r t(r) = 0 H2 (nr+l + 1)2 ' or, for our immediate purposes nr,l = 0,1,2,... (8.4.12) (8.4.13)
8.4 Parabolic coordinates 315 nr +1 + 1 = -- Z/a° , (8.4.14) We see the correspondence: \m\ ** 2/ + 1. And so, for the £ and rj equations, using k\ = 0,1,... and fc2 = 0,1,... as analogs of nr = 0,1,..., we get 1 , in 1 , i\ ^1/ao , ,1/1 1 , -,\ Z2/a0 fci + M m +1) = —7 , . fc2 + 4( TO + 1) = -7 ,, 2VI ' 7 ^-2ME/h2 2V y ^-2ME/h2 (8.4.15) which, on addition (Zi + Z2 = Z), give h+h + M + l^-^f^, (84.16, so that Zi _ fci+ |(|m|+ 1) Z2 _ fe+ |(|m|+1) (8.4.17) and Z2e2 2n2ao £ =-^12—' «= 1,2,..., (8.4.18) the familiar Bohr energies, of course. What is the multiplicity of states of given n counted this way? Note that fcj, fc2 = 0,1,..., whereas rri = 0, ±1, ±2,.... From determining the multiplicities of oscillator energy states in two and three dimensions, we already know, for non-negative integers that m + n2 = N : m(N) = N + 1 , n1+n2+n3 = N: m(N) = -(N + l)(N + 2) , (8.4.19) from which, if n3 = 0 is excluded (n'3), we get m+n-2+r^ = N : m(N) = -{N + 1)(N + 2) - (N + 1) = -N(N+l). (8.4.20) So, for ki+k2 + \m\=n-l (8.4.21) we have n states for m = 0 and twice \n{n — 1) states for m / 0, giving the multiplicity m(n) = n + 2 x -n(n - 1) = n2 , (8.4.22) which is, of course, the answer we found earlier in (8.1.17).
316 8. Hydrogenic Atoms 8.5 Weak external electric field The utility of the parabolic coordinate system begins to appear when we ask for the effect on the energy levels of an applied homogeneous electric field F, which we take to point along the z axis. Thus the potential energy of an electron (charge —e) is now V Ze2 + eFz (?-'*) (8.5.1) which, in parabolic coordinates reads V + eF(£ - n) 1 electrostatic potential [Ze2-eF(e-r,2)] Thus the factorization of the wave function persists, now with (8.5.2) d2 m 1 2ME 2Zi IF _) 1 £ 4£2 h2 ao£ eao d m drj' 1 2ME 2ZX IF ~T^>— H P5 1 1 V 'irf- nr aorj eao «(0 = o, u{r]) = 0 . (8.5.3) Before continuing let's notice what plays the role of the effective potential yeff for the two motions. Drawn for m = 18, F = \eja\ and Z\ = Z2 = 42 with abscissas linear in £2 and rj2, respectively: (2Ma2/H2)VeS(Q >£/ao {2Ma20/h2)E we see that vea(0 - 2M 4C2 2 _ 2Zi 2F aoC eao (8.5.4) binds the electron to the vicinity of the location of its minimum, whereas
1.5 Weak external electric field 317 Vefffa) 2M m2 - 1 2Zi 2F 47y2 a07? ea0 (8.5.5) (2Ma2/n2)yeff(r?) 4* >v/ao (2Ma20/H2)E becomes arbitrarily large negative for large 77 values. In the latter situation there is a certain probability of finding the electron outside the atom, if ionizing the atom. That, however requires very strong electric fields to be effective, because one must overcome the exponential attenuation in the classically forbidden region (recall the discussion in Section 6.7); we are now interested only in weak fields. Incidentally, we note that the £ equation in (8.5.3) is known as the uphill equation, and the 77 equation as the down-hill equation - names that are obviously suggested by the shapes of the effective potentials. Think of the F in the u(£) equation as a parameter and differentiate, anticipating that E and Z\ will change (it is Z that is fixed) 2M dE ~WdF ea0 (0 2 dZi a0 dF Of course, if we only change Z\, we get 2M dE _ 2/1 ~WdZl ~ ~a^\i so that dE dE dZx dZi dF = e(0 ; (8.5.6) (8.5.7) (8.5.8) we could have written the left side directly - it is the change of E produced by the explicit dependence on F. Similarly dE dF dE dZ2 dzVW -e(ri) (8.5.9)
318 8. Hydrogenic Atoms We are going to apply these results for F = 0, to get the linear dependence of E on F. First, for the £ motion we known that E = - Zl 1 with the consequence and similarly 2«o [fc1 + I(|m|+l) dE Z^8Z[ = -2E> dE |2 ' (8.5.10) (8.5.11) (8.5.12) Then, multiplying (8.5.8) and (8.5.9) by Z\ and Z2, respectively, and adding (F = 0): BF f) (Zx +Z2) — - 2E—(Z1 + Z2) = eZx <£) - eZ2 (r,) (8.5.13) = Z = 0 so dE dF F=0 (8.5.14) F=0 = z^z^^-Z^)k2 According to Problem 8-ld, for the Coulomb field in spherical coordinates, <»•>„» = f[3«2-J(J+ 1)] • (8.5.15) So, with the correspondences 21 + 1 4+ \m\ , Z «-> Zit2 , n «-> fc1>2 + \{\m\ + l) , we get (8.5.16) a0 Z^v)kl,\m\ - 2 Z2(V) 3 fci + m\ + 1\ mz — 1 *2»|m| ~~ 2 3 fc2 + I i 1 \ z 2 1 m + 1 \ mz — 1 and dE OF _ e 3ao F=0~ Z~2~ (fci -fc2)(fci +^2+ |m| + 1) (8.5.17) (8.5.18) follows.
8.6 Weak external magnetic field 319 This exact statement for F = 0 gives an approximate statement for the change of energy produced by a weak electric field; 6EnMM = ~-Fn{h - k2) . (8.5.19) This is an example of first-order perturbation theory, as applied to the linear Stark* effect. The quantum numbers k\ and k2 range individually from 0 to n — 1, so that k\ — k-2 goes from n — 1 to — (n — 1), giving in all 2n — 1 equally spaced values. For n = 2 with its n2 = 4 states we get 6E kx = 1, k2 =0, m = 0 : SE = 3ea0F/Z , 0-- fci =0, k2 = 0, m = ±1 : 6E = 0, kY = 0 , k2 = 1, m = 0 : 6E = -3ea0F/Z . F=0 F>0 So the n'2-fold degeneracy is not removed; there are only 2n — 1 different energy states. What in general is the multiplicity of a given energy, a given k\ — k2l Suppose k\ > k2. Then we get the same ki — k2 by successively increasing k\ and k2 by 1, until we reach k\ = n — 1 (n — k\ values including the original) or by successively decreasing k\ and k2 by 1, until we reach k2 = 0 (k2 additional values), giving in all m(n,ki,k2) = n — \ki — k2\ (8.5.20) for the multiplicity of the Stark-shifted energies. The total number of states is n—1 n—1 ]T (n-|fc|) =n + 2^(n-fc)=n2 , (8.5.21) A=-(n-l) k=l — {n — l)n the multiplicity we know from (8.1.17) and (8.4.22). 8.6 Weak external magnetic field This naturally raises a question: What would a homogeneous magnetic field do to the remaining degeneracy? It is time to appreciate generally how a charged particle is acted on by electric and magnetic fields that are given. 'Johannes Stark (1874-1957)
320 8. Hydrogenic Atoms The Lagrangian of a free particle of mass M is [Here and below we'll always understand products of potentially non-commuting operators as symmetrized, so p ■ dr should be read as \p ■ dr + \dr ■ p.] We have already recognized in the context of Coulomb and constant electric fields that a particle of charge e (now either positive or negative) has the Lagrangian dr 1 L =^-2^-^ (8-6-2) where <t>(r) is the electrostatic or scalar potential representing the electric field, E = -VO. Now we want to turn on a magnetic field. The important thing to appreciate is that a magnetic field interacts with motion, as described by the velocity v. Perhaps you say that you know the velocity: v = pjM'l But no. Just as the presence of an electric field changes the meaning of energy: — >— + e$ (8.6.3) kinetic, potential energy the presence of a magnetic field changes the meaning of momentum, p is no longer Mv. For that reason it behooves to introduce v as an independent variable. Consider this L: L(r,p,v) =p- \-T7-v) + TjMv2 -e<D(r) = p- — -H(r,p,v), (8.6.4) with the Hamiltonian H = p-v-~Mv2 +e<i>. (8.6.5) The Hamilton-fleisenberg equations of motions are dr _dH dt dp _^^ = eV(|)=-c£, (8.6.6) dt or and, since there is no time derivative of v, simply BJ-f 0 = — = p - Mv . (8.6.7) ov
8.6 Weak external magnetic field 321 So we identify v with velocity, dr/dt, learn that momentum p is Mdr/dt and get the force equation If we wish we can simply accept that v = p/M and come back to H=Mt+e®- (8-6'9) But now we want to add an interaction that depends on v; the simplest possibility is v multiplied by a new vector. We shall in fact write 1 e H =pv - -Mv2 + e$(r) - -A(r) ■ v , (8.6.10) where c = 2.99792 x 1010 cm/s is the speed of light. What are the new equations of motion? Rather than (8.6.6) and (8.6.7) we get now dr _dH _ d* " ~dp~~V ' dp dH e ,,, at or c f)J-f p 0=-— = p-Mv--A(r). (8.6.11) ov c In the second equation, V does, of course, not differentiate the independent variable v; nevertheless, since we'll shortly accept the third equation as defining wasa function of p and r, it is expedient to use the vector identity VA(r) ■ v = v x (V x A) + v ■ VA (8.6.12) and write the second equation in the equivalent form ~=eV$--t)x(VxA)--wVi4. (8.6.13) at c v c So the first equation in (8.6.11) tells us that v is still the velocity dr/dt, but the third says that p has changed: dr p p = M~- + -A(r) . (8.6.14) kinetic, potential momentum And the force equation? Differentiate,
322 8. Hydrogenic Atoms and combine it with (8.6.13) to get Q 7* 6 M-— = - eV$ + -v x [V x A(r)] at2 c J + - [w • VA - -rMr)] ■ (8.6.16) To evaluate j^A(r), we need H as a function of p and r, which is H = (p- -A] -v - -Mv2 + e<I> = -Mv2 + e<I> v c / 2 2 2M L(p^£yl)2 + e$. (8.6.17) Now ^71=^^),^1 = (71,^), (8.6.18) the symmetrized Poisson bracket, because A is independent of p and H is less than cubic in p. So d , dA dH A , AtA = l*-lip=V-VA> (8^19) the term in the second line of (8.6.16) vanishes, and we get d2v e M—r = eE+ -v x B at1 c with E = -V<J> and B = V x A (8.6.20) which exhibits the dynamical action of the magnetic field B, as it is constructed from the vector potential A. The appearance of the Lorentz* force in (8.6.20) tells us that we have indeed found a way of incorporating magnetic fields. Thus the Lagrangian (8.6.4) with the Hamilton operator (8.6.10) is appropriate for a charged particle moving under the influence of both an electric and a magnetic field. We are interested now in a homogeneous field B. What is A1 The answer is not unique. If A is a possible potential, so is A + VA, A arbitrary, for V x (A + VA) = V x A = B . (8.6.21) (This is the freedom of gauge transformation.) Suppose for example that B points in the z direction, so "Hendrik Antoon Lorentz (1853-1929)
3.6 Weak external magnetic field 323 — A - — dy dz' X — ~E^_AZ — Ay — (J , One simple solution is Another is By = -7T Ax — — j4z = 0 , oz ox Bz = iLAv ^h,Ax = B- (8-6-22) Ax = Az=0, Ay = Bx. (8.6.23) Ay = Az = Q, Ax= -By . (8.6.24) A third is the average of these two Az = 0 , Ax = -^By , Ay = ^Bx : A = l-B x r ; (8.6.25) it is the most natural form, being three-dimensional and not singling out arbitrary x or y directions. The first two appear as A = ^-Bxr + V (±l-Bxy) . (8.6.26) If we use the natural form in the Hamiltonian (8.6.17), we get =Mp-YcBxr)2+e^ H = 2 =£f+e*-^cB-rxp+^Bxr)2- (8-6-27) Note that the term linear in B is proportional to the orbital angular momentum L = r x p. We now apply this to hydrogenic atoms. The charged particle is the electron, so e —> —e and e > 0 denotes the elementary charge of (8.1.18). Then, in conjunction with the nuclear Coulomb field and a weak electric field along the z axis, consider a weak magnetic field along the same z axis. We omit the quadratic B term and get H=^-^+eFz + W-cBL" (8-6'28) This is easy! For any state specified by the magnetic quantum number m, L'z = mh, we simply get the additional energy (known as the energy shifts of the normal Zeeman* effect) 'Pieter Zeeman (1865-1943)
324 8. Hydrogenic Atoms eh mB = m/j,0B , (8.6.29) 2Mc where ^ = -^- = 5.788 382 x 1(T9 eV G"1 (8.6.30) 2Mc v ; is called Bohr magneton. So the energy values in weak parallel electric and magnetic fields are E = -^Ar + | ^Fn(h - k2) + miioB . (8.6.31) Electrically degenerate energy values, those of constant k\ — k-2, have, according to (8.4.21), here written as |m| = (n-l + fci-fc2)-2fci , (8.6.32) ■v- given different values of \m\, in general, or, for a given value of \m\, opposite values of m. The degeneracy is completely removed. 8.7 Insertion: Charge in a homogeneous magnetic field Now consider just the magnetic field B, so that <t> = 0 in (8.6.17), and the Hamilton operator is 1 1 / P \2 — (p - -A\ with V x A = B = Bez . (8.7.1) First, look at the commutator relations for v = M_1 {p — |A): v>(v = w(p-lA)X(p-ecA) = -Jpl (pxA + Axp) \H e_ . ihe „ ,0 „ _. M2 c M2c v ; That is 1 ell TEJ-Vx'v^ = ~W~cB ' tf\.v*>v*\ = -tf}.vv>v*\ = 0> (8-7-3) which shows a (1 + 2)-dimensional split, H=\Mv2z+\M{vl+i,l) . (8.7.4) = P2Z/(2M)
8.7 Insertion: Charge in a homogeneous magnetic field 325 For the electron, e = — |e|, the variables q,p introduced by give and then h\e\B k\e\B ,<,„-, wp' v« = vikq (8-7-5) }[<Z,P] = 1, (8.7.6) gives us the energy spectrum _ p' lei HB . ,, with - oo <p'z < oo and n = 0,1,2,... . (8.7.8) Indeed u)\ = \e\B/(Mc), called cyclotron frequency, is the rotation frequency of v in the x,y plane: the pair dvx dv. is just dt = -uivy , ^- = ulVx (8.7.9) d? dp -=ulP, - = -Uiq (8.7.10) and, of course, we knew this all along since (8.6.20) says ^--^Bx^-^Bxt,. (8.7.11) dt Mc Mc K ' The energy (omitting the free z motion, or setting p'z = 0) is that of a one- dimensional oscillator. What is the multiplicity of those energy states? We get a clear picture by, in contrast with the above, working with a particular gauge: Ay = Bx, Note that y does not appear, so
326 8. Hydrogenic Atoms a state can be specified by an eigenvalue of py, and H "> m + V (* " il*0 with - °° < 4, < °° (8-7-14) exhibits the one-dimensional oscillator. Clearly each energy state is infinitely degenerate, corresponding to the independence of the energy of p'. How does this work out with A = \B x r? Here t ,. =UJ2 & + Pi) + ^T,B L* + ^7^ ix' + V2) with ^ = 5wi • P'z=0 (8.7.15) We recognize here the two-dimensional oscillator with an extra energy term u)2Lz. It is convenient to use the quantum numbers (y+y+)' = n+ = 0,1, 2,... and (y_y_)' = n_ = 0,1,2,..., for then W'2,UZ = hu>2(n+ — nJ) as we've seen in (7.4.17). So the eigenvalues are E — hx>2{n+ + n_ + 1) + fk02(n+ - n_) = hio2(2n+ + 1) = /iwi (n+ + |) . (8.7.16) This applies for e = —|e| < 0; for e = |e| > 0 we'd have E = fiwi(n_ + |). The degeneracy comes from the dependence of the energy on only one of the integers n+,n^. To describe the degeneracy more physically note that the equation of motion (8.7.11) is At At so {-WcB*r), (8.7.17) Av _ A / e ~ A~t\M e „ e V+M-cBXr^M-cBXr° (8^18) obeys Aro/At = 0. Indeed -^-Bx(r-r0) (8.7.19) describes the rotational motion around the fixed point ro with frequency uj\. What are the properties of ro = (#0,2/0)? First n°te -Bxr0=Mt)+-Bxr=p+-4 (8.7.20) p-(e/c)A= =2A
8.7 Insertion: Charge in a homogeneous magnetic field 327 so e eB e eB —By = vo , p„ H Bx = — 2c y c y ' Fy 2c c Px~^By = ——ye py + — Bx = —x0 , (8.7.21) and conclude eB"2 c that is [x0,vo] = [px - YcBy^v + YcBx] = -[nlB ' (8-7-22) [-o>W,] = -i^ = ^; (8.7.23) e = — e it is not possible to specify precisely the center of the motion. The coordinates xq and j/o are subject to a Heisenberg uncertainty relation, 5x^Aik- (8-7-24) Also observe that 1 / e ,\2 1 feBx 2 2M 5 Mlo\ =r20 1 lei e2J32 2M (p* +P^ ~^2Mcf Lz + W£ = UJ2 or so 2JWW2 (8.7.25) \mu\tI = ^{pI+ P2y) + \Mul {x2 + 2/2) - u2Lz , (8.7.26) -Mw2ro = f^2(n+ + n_ + 1) - fiw2(n+ - n~) = hui (n_ + |) . (8.7.27) Thus the second quantum number, the one that does not appear in the energy (8.7.16), specifies the required distance of the orbit center from the origin rt = 2]A- («_ + |) • (8.7.28)
328 8. Hydrogenic Atoms That, of course, was obvious once we saw (8.7.23); the eigenvalues of x'q + y% are those of the oscillator, suitably scaled. Incidentally, one can present E similarly: H = lMv* = ^[-^-Bx(r- r0)]2 = yw?(r - r0)2 , (8.7.29) and the comparison with (8.7.16) gives (r-r-o)2' = 2^(n+ + !) . (8.7.30) It has now long been clear that the energy degeneracy simply means that the center of motion can be anywhere, although not precisely specifiable. One has the option of specifying r^, but the location on that circle is unknown, or of, for example, specifying Xq (that is the gauge Ay = Bx) but yo is unknown, or, most physically of using the minimum uncertainty states, the eigenvectors of Xq + iyo. The latter gives us an easy way to answer the practical question: If there is only a finite (macroscopic) area A available, how many states are there of a given energy? We recall that / does that counting, where from (8.7.23) we have the correspondence q -> y/MtJi x0 , p -> y/Muii y0 , (8.7.32) so fdg'dp' ^ Mm [,,,, \e\B J^fT-^^Hjdx^= *^;A (8-7-33) is the desired number. Another more special way takes the area A to be a circle. Then the largest radius and the largest n_ are such that l-A = 2-|-n_,max. (8.7.34) Therefore number of states = n_jmax = —-—-— . (8.7.35) 8.8 Scattering states Back to the unfinished Coulomb problem - unfinished because we have only considered the E < 0 states, the bound states. How about the E > 0 states?
8.8 Scattering states 329 They describe an electron, not bound, but coming from far away where it has positive kinetic energy, ^Mv2 = p2 /(2M), and negligible potential energy, — Ze2/r —> 0. Such a particle approaching the nucleus will have its direction changed while receding from the nucleus - it is scattered. Suppose that the particle, far away, is moving along the z axis with velocity v = p/M. Its wave function will be essentially the plane wave en p • r en pz Akz .,, p Mv with k = — — -^- n n — . (8.8.1) The wave approaching the center of force at the origin will produce a new wave representing the scattered particle which will essentially emanate from the neighborhood of the origin, seen far away as a spherical wave e . The two parts of the wave, crudely elkz and e1 , appear in parabolic coordinates as Mi - V) Mt + v) (8.8.2) which suggests that the complete wave function may have the form i/> = eik^G(v) , (8.8.3) certainly independent of <f> because of the axial symmetry of the physical situation. Let's try it in (8.4.9) with 2ME/H2 = k2, dip/d<j> = 0: d_ d_ 2 d_ d_ 2 2Z d£, d£ drj drj ao -» -k2(, + ifc ^G(r,) = 0 (8.8.4) so that d2 Id ,? 2Z/a0+ik h \-k + —-— drj2 j) drj rj G(v) = 0 (8.8.5) or, again, with G = \/l/r]u(r]) drf2 Arf The analogous radial equation, ^2_ _L_ ~dr2+ Ar2 d2 1 t ,2 , 2Z/a0 + ik ~\ ;—;r + K H u{rj) = 0 k2 + 2Z a0r t(r) = 0 (8.8.6) (8.8.7) is (8.1.8) with I = -\, (2M/H2)V(r) = -2Z/(a0r), and 2ME/H2 k = Z/(nao); it has the bound state solution [cf. (8.1.32)] u(r) oc r^V- ' Z/(nao) i{2nr)e~ (8.8.8)
330 8. Hydrogenic Atoms which indicates the correspondence r —> 77 , Z —> Z + \ika0 , k —> ik . So and with /? «(,)«,*L%(4oo)(2ife7)e-i*' Gfa) oc L_i/3 (2i/by) e^7? Z _ Ze2M _ Ze2M _ Ze2 Ze2 ka0 kh2 Hp Mv2 Hv (8.8.9) (8.8.10) (8.8.11) (8.8.12) Now notice that the sign of i in k —> ik is chosen so that G(ri) oc e~ikrl for Z -> 0 [when ^\z/(kao) -> 40) = l] (8.8.13) as it should. All very well but what is the complex extension of the Laguerre polynomial that appears here? Recall that L(°)(x) = lex (JL\ x»e-x (8.8.14) is the definition for integer v. But we can extend it by using a contour integral: L<°>(*) = At tv%-1 2ri~(t-xy+1 >Ret (8.8.15) where a cut must connect t = x and t = 0 for non-integer v values. For v = —i/3 and x = 2ikr], this gives us L%(2ikv) = e 2ikri — e-H^- 1 ^,,,,,-.. r2wr . (t-2ikr,y^- (8.8.16) A convenient way to take the contour is specified by having the cut extend from t — 2ikr] to t = 2ikr]+oo and then from t = 00 to t = 0. Then, integrating along the cut:
8.8 Scattering states 331 >Ret (8.8.17) splits the integral into two parts. For part A we write t = 2ikr] + u with u : oo —> 0 —> oo and evaluate it for krj > 1, giving lL-%<H.-/l^-"<2*>«r,s^ =* p-i/3 log(2i&77). 2: e t fc7?» 1 27Ti f° e~u f°° Ldu^+Jo du 2^,^0 (e2mu) i0[f+log(2kr,)]J_(e-2^_1) [°°duu-i+ye- 2iri V J J0 1 - e-2^ = efe^l0g(2b,)(i/))!; 27T/3 (-l+i/3)! (8.8.18) and for part B we find [L%(2iknj\ z =,2i/c?? i szri , x1-'/3 27ri / 37T1 \ ( e~2kr1\ [/° y oo dt e-^r1'3 + e2^ / d* e-^r"3 / Jo /^^^.-^^108(2^) e2^-l(_i/?). 2fc?7 ■e 2 ^ e' 27T/3 (^-1)(-1/3)1 (8.8.19) So L%(2ifc„) = [L(0J3(2ifc7?)]A + [L(0J3(2ifc7?)]B ■i/Jlog(2*,) +/3^ei/Jlog(2*,,)(zl£) 3-,3 e^-l e 2 27T/3 (i/?)! 2fc?7 (i/?)! (8.8.20) If we omit the rj independent pre-factor, we have, for kr) 3> 1,
332 Hydrogenic Atoms G oc e-ikV e-ifKog(2kr,) +/?^! ei/31og(2^) e-2iarg((i/3)!) 2krj incident scattered "^(^lnc+^scatt.) and, with the scattering angle 9 introduced in accordance with = e (8.8.21) z — r cos ( ??= \(r~z) ,2 1/j 2( = r sin" 2" ' wave vector I 6 y* Of t/)Scatt. >X,y wave vector kez of V'inc. (8.8.22) we write the incident and scattered amplitudes as T/W = jkze-\f] log (2kr sin*(±e)) e'"" e „iAx /? ^scatt. —' ^» . 9/1 ^\ r 2fcsin2(|0) si/31og(2fcrsin2(|e))e-2iarg((i/3)!) _ (8_823) Note that although we anticipated that the incident particle is represented by just e , it actually feels the long-range effect of the slowly decreasing Coulomb potential; that is equally true of the outgoing spherical wave representing the scattered particle. The asymptotic form tp = tp\nc. + ipsc&tt. identifies the respective relative probability densities for the incident and scattered particles: \t» = 1 l<A l scatt. /? 2ksmza0) (8.8.24) These are also the relative fluxes - density times speed - because the asymptotic speed is the same; it is the direction that has changed. One speaks of the differential cross section per unit solid angle da/dO, where da is the ratio of the scattered flux into the solid angle dO to the incident flux da = l^scatt.lVd/2 I^inc. | = d/2 0 2fcsin2(|6») ,8.25) or, with p/k = Ze2/(Mv2),
Problems 333 a^Gfb) ^¾^ (8-8-26) It is independent of H and is identical with the classical Rutherford* cross section. Problems 8-la One-dimensional oscillator, dimensionless variables: Evaluate **>.• (**>.■ <«^2 by noting, for example, that (q4) is the squared length of the vector q2\n). Check these independent calculations by using them to find q +p 8-lb An example of a more general method for such calculations: Check that eiV2A«=eiA»teiAye-lA2. recall (or, better, show) that Vk\n) = \n -fc>^—^; then deduce that ei^Ag\ =L(0)(A2)e-lA2_ I n Use the initial terms of the A expansion to recover (g2) „ and (q4) n- Anybody for(</6>„? 8-lc For the two-dimensional oscillator, using dimensionless variables, evaluate (p4\ n+)n_ and express it in terms of N and m. 8-ld Use the correspondence between the three-dimensional Coulomb u(r) and the two-dimensional oscillator u(p) to show that 'Lord Ernest Rutherford, Baron of Nelson (1871-1937)
334 8. Hydrogenic Atoms na0 \P )n+,n„ (r)n,l = 2Z (p2\ Apply the known oscillator values and the connection between the two systems to arrive at ^,, = ||[3n2-/(/+ 1)] 8-2 Two-dimensional oscillator: Express p2 = q\ + q\ in terms of y+ and y_, introduced in accordance with y± = 2^^(yi Tife), and rederive (8.1.28). 8-3 The two-dimensional Coulomb problem is defined by the Hamilton operator 1 Ze2 2M ^ yy' ^/X2 + yl What are the energy eigenvalues and their multiplicities? 8-4 The non-relativistic Hamiltonian for a spinning electron (charge e = — | e |) in a magnetic field is (very nearly) 2M V c J 2Mc Verify that ■*<,.*. "^[-(H*)]'- Consider a homogeneous field along the z axis. What is the energy of the ground state in the circumstance (pz — §ylz)' = 0? What are, more generally, the energy eigenvalues for (pz — ^Az)' = Kkl [They are called Landau* levels.] 8-5 For a free particle, the relation between momentum p and the relativists energy W = Mc2 + E is W2 = p2c2 - {M(?)2 , or E = W-Mc2 = W^=a-p2MTE^a-p- [Note the connection with Problem 8-4, in the non-relativistic limit, when a magnetic field is introduced: p —> p — ~A.] Introduce the electrostatic 'Lev Davidovich Landau (1908-4968)
Problems 335 energy V (replacement E —> E — V) to arrive at this approximate relativistic correction to the Hamiltonian for a state of energy E: H = (T-p2M+(E-V)/c^-p + V What is the resulting energy shift for the ground state of a hydrogenic atom? 8-6a The three-dimensional relativistic Schrodinger equation for a spinless particle moving in the potential V(r) is (£_V)2_^(|v) ~-{Mc2)2 r/) = 0. Insert V = —Ze2/r and find the bound state energy values. [A comparison method is suggested.] 8-6b An approximate procedure for Problem 8-6a begins \£\ <C Mc2, and arrives at the modified Schrodinger equation '-r-mfr + 2Mc2 r/) = 0. Use first-order perturbation theory to find the approximate energy shift. Check that your result is indeed an approximation of what you got in Problem 8-6a. 8-7a Regard p2/(2M) as the leading term in the expansion of cyV + (Mc)2 - Mr2 in powers of p2. Show that the next term in this expansion gives a correction of AH = —a2R0 4 (JL \(h/a0) to the Hamilton operator, where a = e2/(Hc) = 1/137 is Sommerfeld's* fine structure constant. 8-7b Find the resulting shift of the ground-state energy in first-order perturbation theory. For this purpose evaluate the ground-state expectation value of p4 = (p2)2 by a variety of methods, indicated by * Arnold Sommerfeld (1868-1951)
336 8. Hydrogenic Atoms (p4) = yW)P'4|^(p')|2 = /(dr')^(r')* (W'2)%(r') = f(dr') |n2V'V(r')|2 = ( [2M (Ze2/r - Z2^)] Justify these statements. 8-8 State the ground-state eigenvector equation as an integral equation obeyed by the ip(p') of Problem 8-7b. Then verify that it is obeyed. 8-9a Since r is positive by its nature, one cannot apply the WKB quantization rule of Problem 6-33 to radial Schrodinger equations of the form (8.1.8) immediately. A suitable change to unrestricted variables has to be done first. Show that r = r0ex/ro, u(r) = e~^x/r°^(x) (with arbitrary reference length ?*o > 0) turns the normalization integral for u(r) into the one for tp(x), and that the resulting version of (8.1.8) corresponds to an effective one-dimensional Hamilton operator given by ^ = 7^7 + ^+^ -.^1=- As in Problem 6-35a set the semiclassical value of tr {ri(Enrii — Hi)} equal to nr + | (nr: radial quantum number) and arrive at «r + \ = ^ /dr ^2MEnr,i - h2(l + \fjr2 - 2MV(r) . The transition from 1(1 + 1) in (8.1.8) to (I + |)2 here is known as Langer's* correction. 8-9b Apply this to the three-dimensional oscillator, V(r) — \MiJ1r1. How do the approximate WKB energy eigenvalues compare with the exact ones? 8-9c Repeat for the three-dimensional Coulomb problem, V(r) = —Ze'2/r. 8-9d These applications are easier if you first verify the integrals T2 r 1 / dr === = -7r(n + r2) , Jrx v(r-2-r-)(r-n) ^ T2 dr 1 __ 7r Jri r 7(r2--r)(r-ri) yfnxi ' / — y[r-2 - r)(r - n) = -7r(n + r2) - 7rv/r7r2~, ri r ^ for r2 > ri > 0. *Rudolph Ernest Langer (1894-1968)
Problems 337 8-10 Use arguments analogous to the ones in Problem 8-9a to find the WKB approximation to the energy eigenvalues for two-dimensional motion in a rotationally symmetric potential V(x\, x2) = V(p),p = {x\ + x\\) 2. What do you get for V(p) = \M(J1 p2? 8-11 The triton nucleus of a tritium atom (3H, Z = 1) undergoes a /3 decay and we assume that the created electron (and also the neutrino) escapes very quickly. Before the decay, the atom is in its hydrogenic ground state. What is the probability that, after the decay, the resulting 3He+ ion (Z = 2) is found in its ground state as well? 8-12 Non-degenerate second-order perturbation theory. We know that <*-<Mf-£)i*>-°. from which follows and d\ K ' ax' ; E*E'-> Wikw-Bhtwlw- It is consistent with if one chooses 9 (E\E) = 0 dX Evaluate -ivsE and use it to write out the perturbation expansion for the eigenvalues of H = H0 + Hi = H0 + Atfi A=l as — ai + ld2E A = 0 2 dX A = 0 + Apply this to the example of Problem 7-6.
338 8. Hydrogenic Atoms 8-13 Can you prove that, in general, (^)^(7-)-1 ? When does the equality hold? Is the inequality satisfied for hydrogenic atoms? Compare the two sides of the inequality for the nr = 0 states, at large n. Conclusion? 8-14 The generator G\ of (8.3.4) is not Hermitian as it stands. Why is this irrelevant as long as only infinitesimal scale changes are considered? After symmetrization we have Gx = 8\\{p ■ r + r ■ p) = 8\T which is Hermitian. Find the explicit effect of finite scale transformations r -> eft re ft , i x r —ix r p-> eft pe ftAi . How does a scale change affect the orbital angular momentum L = r x p? 8-15 Show that r = \r\ and F constitute a pair of complementary observ- ables. Which quantum degree of freedom are they associated with? [Hint: Consider the unitary operator (r/ro)m, where ?*o > 0 is an arbitrary refer- j_\r ence length and k is any real number, and compare its product with eft with their product in reverse order.] 8-16 Three-dimensional oscillator: For nr = 0,1 and arbitrary I, calculate the expectation values (r-2), (r^3), (r), by direct integration using the radial wave functions (7.5.26). Compare with what is stated in (8.3.20), (8.3.26), and (8.3.33). 8-17 Hydrogenic atoms: Use the simple form of the radial function w(r), nr = 0, to evaluate (rk)n, nr = 0. Check that (r), (r^1), (r-2), (r^3) are correctly reproduced. For what value of r is [w(r)]2 a maximum? Can you give an approximate form for [w(r)]2 near the maximum, when n 3> 1? 8-18 Parabolic coordinates: eim0 i i ^=c7^7f(€);Ar(??)- With radial normalization: />oo />oo / d£ [U(0]2 = / d^ [uin)}2 = l, ./0 ./0
Problems 339 and three-dimensional normalization: |(dr)H2 = l, what is C'l Use §f = {ez) to rederive (8.5.14). 8-19 In (8.6.10) the scalar potential and the vector potential are assumed to depend only on position r but not on time t. Now lift this simplifying restriction and consider <J>(r,£) and A(r,t), so that the Hamilton operator acquires a parametric time dependence. Show that the Lorentz force (8.6.20) emerges correctly with electric field E = —dA/dt — V$ and, as before, magnetic field B = V x A. 8-20 Three-dimensional Coulomb problem. Evaluate the expectation values (r2)ni. [Hint: Multiply the u differential equation (8.3.21) by 2r3^ - 3r2u and integrate.] Check your value for the ground state, n = 1, by using its wave function directly. What is the value of x, the diamagnetic susceptibility [magnetic energy: — \\B2\ originates in the quadratic B term of (8.6.27)] for n = 1? 8-21 A particle of charge — |e|, mass M, moves in the homogeneous magnetic field B, which is directed along the z axis, and also in the electric field derived from the scalar potential . = _•„(,._ £^), V^0 where Q > 0. What are the energy values of this system? Their multiplicities (excluding special relations)? [Hint: Look at Schrodinger's three-dimensional equation in the natural gauge. Do not take Q to be too large.] 8-22 Concerning the incident wave tpinc. of (8.8.23): The surfaces of constant phase kz — {3 log(fcr — kz) are not planes of constant z, as for /3 = 0. To get an impression of the mild distortion caused by the logarithmic term, consider a distant point on the z axis (r = — z = R, kR 3> 1, Ze2/R <C Mv2) and find the points in the z = 0 plane of the same phase. How large is their common r value?
PartC Spring Quarter: Interacting Particles
9. Two-Particle Coulomb Problem 9.1 Internal and external motion In Chapter 8 we treated hydrogenic atoms as if their nuclei were infinitely massive. Let us now refine the analysis and consider two particles with masses ni\ and m2 and charges Z-^e and Z2e, respectively, r, ^,m2,Z2e mi,Zie. /T ' so that H^f + sL + ^L (,.U) 2l7li 2m2 17"i - V2\ is the Hamilton operator. Introduce the position vector of the center of gravity, or center of mass: R = miri+m2r2 , (9.1.2) mx + m2 the relative position vector: r = n-r2, (9.1.3) the total momentum: P = Pi+P2, (9.1.4) the relative momentum: m2Pl - 77liP2 p = . (9.1.5) mi +m2
344 9. Two-Particle Coulomb Problem Then 7*i = it -I r , r9 = it r , mi + m2 r/it + to2 TOi „ TO9 „ Pi = -*•—P + p, p2 = -2_p_p. 9.1.6 TOi + TO2 TOi + TO2 The reason for the particular factors becomes evident from the Lagrangian if we introduce the new variables: dr-i dr2 / To! \ /dil m2 dr Pi --77- +P2- -rr = P + P) --77- + ; 777 dr at \rrii + to2 J \ at mi+TO2d£ + ( m2 P p] ■ (** - TOl dr \mi + to2 / \ dr rn-t + m2 dt _ dil dr . =P -^+"-dt- (9-L8) This tells us immediately that just as r i, pi and r2, p2 describe 3 + 3 = 6 degrees of freedom, so do the pairs R, P and r,p. This says that all commutators are zero, except i[i**,P*] = l, i[r*,p*]=l. (9.1.9) In the Hamiltonian (9.1.1), we have the kinetic energy where is the total mass and 2toi 2m2 2M 2\i M = mi+m2 (9.1.11) / 1 1 \ TOiTO2 . M= —+ — = 3-^- 9.1.12 \TOi TO2/ TOi + TO2 is the reduced mass; and the electrostatic Coulomb potential energy ZxZ2e2 Z^e2 \r\ -r2\ (9.1.13) depends only the distance r = \r\ between the particles. So, the Lagrangian splits in two,
9.1 Internal and external motion 345 L = Lext + Lint , (9.1.14) where the external part d 7? P2 Lext = P ■ ^ - ±- (9.1.15) describes the whole system, which has mass M and moves as a free particle, and the internal part dr fp2 , Z1Z2e2\ dr Lint = p ■ Tt - (- + -^ j = p • - - Hint (9.1.16) describes the relative motion of the two particles. For hydrogenic atoms, composed of an electron (particle 1) and the nucleus (particle 2), one has mi = mei , m2 = mnuci = M — mei , M, mnuci > mei ; mei(M-mei) A* = ^ = md (l-»SroeI(l-J5sL). (9.1.17) V M / V mnuci / Another example is positronium, electron and positron, which have opposite charge and equal mass, so M = 2mei , n = |mei • (9.1.18) The Bohr hydrogenic energy values, for [mnuci =] M = oo, are [put Z —> ~ZXZ2 and M -> mei in (8.1.16)] ^ = -^- <"-19' We have only to replace the electron mass mei by the reduced mass /j,: where .Boo is the energy in the limit mnuci —> oo. For positronium, this means E = |£co (9.1.21) with Z = 1 in £^. For scattering we found the cross section (8.8.26), for which the obvious extension is [Z -> -Z\Z2, M -> /j,, 0 -> 6]
346 9. Two-Particle Coulomb Problem where v is (still) the relative speed of the two particles when they are far apart. Because this is the description of the relative motion, it is directly applicable only when there is no center-of-gravity motion, that is: when the total momentum vanishes, P = p\ + pi = 0. Under these conditions, p\ = —p2 = p, the momentum relations of the collision are displayed as (all momentum vectors are of equal length) lout 2 out and 0 is the angle through which p, the relative momentum, is turned. 9.2 Rutherford scattering revisited The differential cross section involves just the ratio /3/k [recall (8.8.25)], whereas the asymptotic form of the wave function refers to both [J and k. Notice that /?, initially Ze2/hv, is ,, = -*£- (9.2.) with v the asymptotic relative speed, and now fc = ip = J^v , (9.2.2) the relative momentum and the relative velocity being related by the reduced mass. We recall the asymptotic form [cf. (8.8.23)] of the wave function (6 is still used here) ip (xe ifcz -i/31og(2ATsin2(i0)) Akr + ll I ei/3 log (2fcr sin2(10)) -2iarg((i/3)!) (g 2 3) We used parabolic coordinates to get this; now we reexamine it using spherical coordinates, which are more useful when deviations from the
9.2 Rutherford scattering revisited 347 Coulomb potential come into play as, for example, in proton-proton scattering where, with increasing energy, nuclear forces begin to play a role. Let's start by putting Jkz _ \krcost (9.2.4) into spherical coordinates. The azimuthal angle <f> does not occur here, so the complete set of angle functions are just Legendre's polynomials P/(£ = cos#); see Section 3.7. Accordingly, we write Akr cos 6 = J2(2l + l)iljl(kr)Pl (cos 6), (9.2.5)' 1=0 where the factor i' ensures that ji is real. In fact, if we take the complex conjugate and at the same time let £ —> —¢, which leaves e C intact, i'P(C) changes into (-i)'P;(-C)=i'P/(C) (9.2.6) so ji must be real. Upon recalling the orthogonality relation of Problem 3-15, we note that ji is given explicitly by the integral = \f dCPtQi&rC. (9.2.7) i ji(kr) = For our present purposes, the asymptotic form (i = b » 1) will do: If1 d elx<^ ilh{x) = ~ / dCP;(C): d( ix P;(l)— -P;(-l) — IX IX dC^ K d -i ix d( S _L[eia; - (-1)' e-[x] = il— (rl eix - i' e~[x) 2\x Tax \ ) p«(0 or ji{x) = sin (x — \nl) (9.2.8 (9.2.9) So, AkrcosO ^ g(2/ + jy^LMp^eosj) ;=o kr (9.2.10) is an asymptotic expansion of the plane wave (9.2.4) in terms of Legendre's polynomials.
348 9. Two-Particle Coulomb Problem The wave function for the Coulomb potential in this axially symmetric situation can be expressed as oo </>M) = Yl(21 + l)ilRi(r)Pi(cos9)e[Sl , (9.2.11) where the real radial function Ri obeys the Schrodinger equation d2 2d /(/ + 1) 2/i ( Z1Z2e2^1 dr2 r dr ,2 + ^ Ri(r) = 0 (9.2.12) or, with Ri(r) _ ui(r) E- p2 h2k2 1\i 2\i ZxZ2e2jx H2 = Pk (9.2.13) more compactly dr2 1(1 + 1) + fc2 + 2^ «/(r) = 0 . (9.2.14) The correspondence with our earlier discussion of hydrogenic bound states [(8.1.8) with (2M/h2)V(r) = -2Z/{a0r) and 2ME/H2 = -Z2/(na0)2} is 21,2 k2 PZk 2„2 n^a. (9.2.15) or 0k a0 with clq /iez and Z -zxz2 (9.2.16) so —i/3 and ifc nao (9.2.17) where either sign of i would do. Recalling the bound-state wave functions (8.1.33) we thus note that \na0 J ™ ' \na0 J Zr q nao -> {2kr)lh^\_l{2ikr)e~'lkr (9.2.18) As in Section 8.8 we use a contour integral to generalize V-\-Qt x"e'x (9.2.19)
9.2 Rutherford scattering revisited 349 to non-integer v values, " [ ' [ ' v\ J 2m (t - x)"+«+i with the contour as depicted in (8.8.15), so (9.2.20) Ri oc eikr(2kr)1 I — r'l0-l-l(t - 2ifcr) / 27T1 This looks better with the substitution t-¥ t + ikr , namely 1/3-/-1 e-t Ri oc (2kr)1 <f> ~{t~ ifcr)i/3_'-1(* + ikr) -1/3-/-1 e-t (9.2.21) (9.2.22) (9.2.23) First let's check that we have the physically acceptable solution as r —> 0, where, of r' and r^'^1, the first is chosen. Look at the integral for r = 0 : dt A0-1-1.-10-1-1 -t _ . t, i t — 27T1 -t At e" 2^1^+2 1 (21 + 1)! Vd* 1 2/ + 1 ,-* £ = 0 (2/ + 1)! (9.2.24) So indeed Ri behaves correctly, as r', for r —> 0. Notice the symmetry of the integrand under i —> -i, indicating that it is intrinsically a real function. Now we want the asymptotic form, kr > 1. The singularities are at ±ikr, so, with an eye on e , we choose the closed contour In I ikr ( ikr ( it <- part A -* : <- s -> part B —> 00 >Ret (9.2.25) which is, of course, the contour of (8.8.17) after the shift (9.2.22).
350 9. Two-Particle Coulomb Problem For part A we write t = ikr + u with u : oo —> 0 —> oo and get, for kr 3> 1, ,—ikr , „—lfcr r rO Ri(r)\ =(2Jkr)'(2iJkr)-i/3""'_1 / dww J A 27T1 /^ .,10-1-1 e-u e + / du(e2'riu)^-'-1e-u Jo -*kr. e-i/31og(2fcr) eI^j-« 1 {-2nP _ 1)(i/? _ j _ 1}! = -2e-7r/3sinh(7r/3) = J_ e-i[*r + /31og(2fcr) + Inl] fi-I^ 1 ^^ (i/? _ j _ 1}, _ ikr it (9.2.26) Let's notice here a property of the factorial function: z\(-z)\ = -^-r or (* - l)!(-s)! = —f— . (9.2.27) Then, with z = —i/3 + / + 1, we get (i/3 - Z - 1)! (-i/3 + /)! = . , ,* t^y = (-1)'+1 . I) -, (9.2.28) sin (tt(/ + 1 - i/3)) v y sinh(7r/3) or sinh(7r/3) (i/3 - / - 1)! = (-1)'*1 {^+l)l ■ (9.2.29) This gives R,(r)] = _A_ e-i[*r + /31og(2fcr) - |tt/ + arg((/ - i/3)!)] e"7^ (9.2.30) Similarly, we parameterize part B by t = — ikr + u with u : oo —> 0 —> oo and get aMl = ^Lei[fcr + /31og(2fcr)-i7rf + arg(q-i/3)!)]_e'27r/3 (9.2.31) which, as anticipated, is just the complex conjugate of the part-A contribution, so , -f 7T/3 sinfkr + /3 log(2fcr) - W/ + arg((/ - i/3)!)) *<r>aKf=wr— * -• ^ How do we find the factor of proportionality that is independent of r?
9.2 Rutherford scattering revisited 351 Let's return to the expansion (9.2.5) and consider large kr, where (9.2.9) applies, 1 fCf 1 ft°T* ^ak-^w- (9-2-33) From the point of view of radial motion, e~ describes a spherical wave moving in, and e describes a spherical wave moving out. Were it not for the very long range nature of the Coulomb potential, we could argue that the effect of the interaction is to produce an additional outgoing wave; the incoming wave far away is still unaware of the presence of the interaction potential. While not entirely true, this is still the dominant aspect of the Coulomb interaction (see Problem 8-22). For the incoming part of ijj = 5^(2/ + lJi'e^'i^rJP^cosfl) (9.2.34) i to match that of e , where e lRi —> ji(kr), it is clear that the factor in front of (ifcr)^1 sin(fcr -I ) in (9.2.32) must be unity, so that the looked-for proportionality factor is -e**P\(l - i/?)!| . (9.2.35) Then ilRi(r)inC, = -(-lj'-^e-^^^e-1^^-^)1) . (9.2.36) The factor e"1^ g' fcr' is the long-range effect, it is simply a slowly varying phase that is independent of I. But we must remove e_larg*-' ~~ ^ ''', which is the purpose of e ': 5t = arg((/ - i/3)!) . (9.2.37) Now, in fact, the incoming wave of the known solution does have the factor e-^M2**-). But it also has e_i/31os(sin2(le)). That, however, must be implicit in our now completely determined wave function: oo ,/, = ]T(2Z + l)i' e^'Ri(r)Pi(cose) , 1=0 with Ri = -r- sin(fcr + /3 log(2fcr) - \itl + St) , (9.2.38) being too subtle to show up in these crude asymptotic expansions.
352 9. Two-Particle Coulomb Problem Let's use this scattering wave function to answer the following question. The incident part of the wave is of unit amplitude; or the (relative) probability density of the incident beam is |</>inc.|z = e^e-ipiog... 2 = 1_ (9_2.39) '2 „ifcz -i/3 log... What is the probability density at the origin, where the particles are in contact? We know that only I = 0 survives for r —> 0 where Ri our1. So ^(0) = eiSoRo(0) (9.2.40) where [this is the proportionality factor (9.2.35) with the sign change required by (9.2.24)] i?0(0)= e^KZ-i/?)! | (9.2.41) or |#))|2 = 6^(-1/3)1(1/3)1 = e^^L • (9.2.42) There are two situations here, attractive interaction, ZXZ2 < 0, /3 > 0 : \ip(0)\ = ~—~ > 1 , 1 — e ZnP repulsive interaction, ZXZ2 > 0, /3 < 0 : |?/>(0)|2 = ,1, — < 1 • gZ7T jp j — -J^ (9.2.43) Not surprising. Particularly interesting is the situation -/3= Zx Z2e2/(Hv) > 1 (repulsion), or v <C ZiZ2e2/H, where |^(0)|2S27r|/3|e"27r!^l «1, (9.2.44) which is a semiclassical situation; the probability of penetrating the classically forbidden region is small, but not zero. One might wonder to what extent the semiclassical WKB description can reproduce this result, but we shall not explore this territory. 9.3 Additional short-range forces Now suppose that when the colliding particles come quite close, additional forces come into play; for example, in proton-proton collisions nuclear forces, which are short range, become important at sufficiently high energies. This
9.3 Additional short-range forces 353 effect first begins with I = 0, where contact is possible. An example of such a potential is and, as a preparation, let's begin with a simpler model version that omits the Coulomb potential: V(r) 4\ -Vn ro ->r so that the Schrodinger equation is [R(r) = (kr) 1u(r)} /d\2 _ f [ME + V0)/h2]u = k2u for r<r0 \dr) U~\ (2fj,E/K2)u = k2u for r > r0. So, with attention to w(0) = 0, C sin(fix) for r < ro , u(r) = sin(fcr + A) for r > ro . (9.3.1) (9.3.2) The real amplitude parameter C and the real phase parameter A are determined by the continuity condition at r0 of the wave function: and its derivative: Csin(/cro) = sin (kro + A) kC cos(fiTo) = k cos (kro + A) (9.3.3) (9.3.4)
354 9. Two-Particle Coulomb Problem We get Kcotfixo = kcot(kro + A) (9.3.5) to determine A, A = -kr0 + cot-1 f - cot(fcro)) = -kr0 + tan"1 ( - tan(ra*o) I , (9.3.6) and then C, C = k cos(kro + A) sm(kr0 + A) k cos(kt*o) sin(Kro) A picture may help to see what's going on here: V(r),u(r) >r (9.3.7) where the dashes are the extrapolation of the outside wave inside; the extrapolated wave does not vanish at the origin, which means that the phase of the outside wave is shifted (this is A, the scattering phase shift) relative to what it is for Vo = 0. The same thing happens for the Coulomb potential and short-range force, that is flo(r) oc — sin (kr + /3log(2fcr) +S0 + A), Kr as compared to pure Coulomb, 1 Ro(r) ~ ^- sin (kr + /?log(2fcr) + 60) ■ (9.3.6 (9.3.9) So, to keep the given incoming spherical wave, we must delete the I = 0 contribution eiS°Ro(r) and add e{(S° + A">Ro(r) : <p = Vcoui. + e1^ +^)j?0(r) - eiSoR0(r) and then, recalling that So = arg((—i/?)!) = — arg((i/3)!), (9.3.10)
9.4 Scattering of identical particles 355 i(kr + (1\og(2kr) + 250) Vscatt. = /(0) (9.3.11) with /(*) ^_^ei/31og(sin2(Ie)) leiAs[nA (9 3 12) 2fcsin2(|<?) fc V ; Now the differential cross section in the center-of-mass frame, where © is the scattering angle, is ^=r2|^cat,|2 = 1/(0)12 /?\2 1 sin2Z\ 2k J sin4(i<9) fc2 + ^8]¾)008^^^10^81112^^) • (9-3-13) The first term is the Rutherford cross section (9.1.22); the last, interference, term is purely quantum mechanical. In the classical limit, /3 —> oo, it will oscillate very rapidly. 9.4 Scattering of identical particles It is time to point out that the above discussion really applies only to two distinct particles, the proton and the deuteron, or the deuteron and the tri- ton, for example, but not to two identical particles: proton and proton, for example. What's special about identical particles? Think of the symbolic Schrodinger equation for particles labeled 1 and 2, with those numbers also used to represent analogous choices of physical quantities for each particle: mj^(l,2,*|=(l,2,*|tf(l,2). (9.4.1) The statement that they are identical particles means that the assignment of names is purely arbitrary; it makes no difference to the dynamics: #(1,2) =#(2,1). (9.4.2) Therefore, interchanging them we also have mj^(2,l,*| = (2,l,t|tf(l,2). (9.4.3) The clear inference is that the indistinguishable states (1,2, t\ and (2, \,t\ are really the same states, always remembering the phase freedom:
356 9. Two-Particle Coulomb Problem (2,1,*|= e^(l,2, t\ (9.4.4) with if constant. Then interchanging 1 «->■ 2 twice, <1,2,*| = e^(2,l,t| = (e^)2(l,2,t|, (9.4.5) tells us that e11*9 = ±1 are the actual possibilities. For e11*9 = +1 we have wave functions that are symmetrical in the particle labels, (l, 2| = (2, ll, and one says that such particles obey Bose*-Einstein (BE) statistics; for e11*9 =-1 the situation is that of Fermit-Dirac (FD) statistics with antisymmetrical wave functions, (l,2| = -(2, ll. The importance of this for scattering is suggested in the center-of-mass diagram 1,2 out 2,1 out illustrating that after the collision there is no way of knowing whether the particle moving up is particle 1, deflected through angle 0, or particle 2 deflected through angle w — 0; the wave functions for both contingencies must be used with due attention to the symmetry of the wave function, to the statistics of the particle. How do we find the statistics of a particular kind of particle? Consider the scattering of two 4He nuclei (a particles). The amplitude /((9) describes the scattering process in which particle 1 is detected moving up; f(w — 0) describes the process in which it is particle 2 that is detected moving up. The combination of the two, produced by the interchange of labels 1 and 2 in the final state is f{0) ± f(n - 0), respectively, that is % = |/(0) ±f(n -0)|2 for /BE statistics, dO ' J K n \ FD statistics. v ' In the situation of pure Coulomb scattering the amplitudes /(0) and /(^-0) are available in (9.2.3), where the last factor is a 9 independent, and therefore now irrelevant, phase factor. Accordingly, the modification of the Rutherford cross section is given by "Satyendranath Bose (1894-1974) fEnrico Fermi (1901-1954)
9.4 Scattering of identical particles 357 da _ ( p __ (2e)5 dD \2k 2\m /3 log (sin2 (|6))) ei/31og(cos2(f6>)) 2 sin2(|<9) cos2(i0) 2e^2 fiv2 1 1 2cos(/?log(tan2(|6))) sin4(|6>) cos4(|0) sin2(i6>)cos2(|e>) (9.4.7) In this so-called Mott* cross section, the first two terms would be the classical result for identical particles; the third is purely quantum mechanical. Notice what happens for right-angle scattering, that is: 0 = \k = n — 0, where f(0) = f(n-0): (da/d«)quant. _ I/ ± /I2 _ / 2 for BE statistics, (da/dO) class. 21/r 2 I 0 for FD statistics. (9.4.5 In fact, 2 is observed, at low enough energies that nuclear forces are ineffective: 4He is a BE particle. It's time to mention something we took for granted about 4He; it has no spin. But other particles do have spin. And the requirement of symmetry or antisymmetry refers to all the degrees of freedom of a particle, position and spin. The spin states of the two particles, each of spin s, can be separated into symmetrical states and antisymrnetrical states. We already know that for s = \, the (2s+l) x(2s + l) = 4 states consist of three symmetrical states and one antisymrnetrical one (see Section 3.5). In general, if you have two variables, each taking on n values, the number of antisymrnetrical combinations is |n(n—1), and the number of symmetrical ones is |n(n —l)+n = |n(n + l) correctly adding to n2. Thus the fraction of spin states that are symmetrical or antisymrnetrical is (n = 2s + 1) f s + 1 1 symmetrical fraction 1 _ |ri(n ±l)_n±ll 2s + l>2' antisymrnetrical fraction \ n2 2n | s 1 2s + l< 2 (9.4.9) as a check put s = | and get the respective fractions of | and |. In a collision with all spin states equally probable, the fraction of symmetrical spin states will have the scattering amplitude f(0) ± f(n — 0), for the respective BE/pj) statistics, whereas the spin antisymrnetrical fraction will have the scattering amplitude f(0) T f(n —0). So -.1/(0)12+ !/(»-6)|'±^-rRe(/(e)-/(T-e)) (9.4.10) classical *Sir Nevill Francis Mott (1905-1996)
358 9. Two-Particle Coulomb Problem and, for 0 = ~n. (da/d/2)quant. (da/dO) class. 1± 1 0 = ln 2s+ 1 s + 1 s + Y > 1 for BE statistics, l» + 5 Y < 1 for FD statistics, (9.4.11) which shows how, in principle, the statistics and the spin can be determined. It is an empirical fact, one now understood theoretically, that there is a connection between spin and statistics: BE statistics: s = 0,1,2, FD statistics: s so, in fact the possibilities are 13 5 2' 2' 2' ' (da/dO) quant. (da/d/2)cla88. e ¥ o 4 6 Z' 3' 5 ■ ■ 13 5 2> 4' 6' ' ■ for BE statistics, for FD statistics. Note that these are reciprocal sequences. (9.4.12) (9.4.13) 9.5 Conserved axial vector We can't leave the two-body Coulomb problem without presenting it in the following way, which uses the notation of the hydrogenic atom. We begin with recalling the basic dynamical properties, dt d —r : dt d dV H = 1 = -P xp) = P2 2(j, d dtP 1 = ~p X Ze2 r — P~ -Ze Ze2 r x r = 0 . (9.5.1) We say that L is a constant of the motion. Can we find another vector constant of the motion? In this, we do not explicitly pay attention to the order of multiplication of operators, but verify its correctness later. Consider dt (p x L) = 7-r x (r x p) nZe2 ( dr\ -rxfrx^J \iZe 2 > dr rr ■ ■(, , dr dt dt) - rdr/dt u = nZe1 1 dr r dr\ r dt r2 dtJ = 5 ("z^) (9.5.2)
9.5 Conserved axial vector 359 Therefore, (pxL-nZe2-) =0 (9.5.3) d_ dt identifies a constant vector; understanding that px L needs symmetrization, we write it as A = - - ^^ (pxL-Lxp) . (9.5.4) [Although the same letter is used, there shouldn't be any danger of confusing this A with the vector potential of Section 8.6.] Notice that both H and A, the axial vector, are less than cubic in p. Hence, dt ~ih[A,Hi~{A,H)~ dr dp dp dr ' { ' where the symmetrization of the Poisson bracket is left implicit. One easily checks that the individual terms are only linear in p so that the symmetrization removes any contributions linear in H, and H2 does not occur. Therefore if it is zero classically, it is zero. For the next step we need the commutation relations between p and L, the generator of rotations, for which Tfi\Px,Ly}=pz, — \py,Lx} = -pz (9.5.6) are familiar examples. Now look at the z component of p x L + L x p: (PX L + L XP)Z = PXLy " PyLX + LXPy ~ LyPX = \Px,Ly}-\py,Lx} = 2ihpz (9.5.7) so that px L + Lxp = 2ihp (9.5.8) summarizes the commutation relations (9.5.6) compactly, and r 1 r [iZez --^Hxp + ifip) (9.5.9) r iiZez are alternative ways of writing A.
360 9. Two-Particle Coulomb Problem |2r What is A ? Use both versions of (9.5.9) and write 1 liZe- ■{—L x p + ihp) \iZe- - (p x L - ihp) 1 (t r r r\ =1 =-r I — ■ p x L — ih— p — L x p ■ —h ihp ■ — uZez \r r r r> 1 \2 —rr ) (-(L xp) ■ (p x L) +ihL xpp+ihpp x L +ti2p2) ^fiZe* J \ ■> ^ ' - *, ' / + = 0 = 0 (9.5.10) and note the identities ih 2ft2 ( r r \ .^h— r in I p. . p = j/j- v . _ — \ r r / i r r (£ x p) ■ (p x L) = L ■ [p x (p x £)] = L-(pp-L-p2L) = -pzL = 0 2 r2 (9.5.11) So yl2 = l- 2 !-2 ,^ 1 :(£2 + ^) + /iZe2 r (pZety ;P2(L2 + h2) or #= - nZ2e4 1 - yl2 (9.5.12) (9.5.13) 2 i2 + ft2 ' a relation among the constants of the motion, if, Jt, A, which must exist since there are only six variables (count the components of r and p, for instance). Notice that bound states (H' < 0) have A2 < 1, whereas scattering states (H1 > 0) have A > 1. This is consistent with the geometrical significance of the axial vector A; see Problem 9-14. What are the commutation relations among the six components of L and A? We already know that Tr[Lx,Ly} = Lz ,... and —[Ax,Ly]=Az,. or, compactly, L x L = ihL and AxL + LxA = 2iUA (9.5.14) (9.5.15) What is A x A? It is a vector that is a constant of the motion and therefore it must be a linear combination of L, A, and L x A.
9.5 Conserved axial vector 361 Now consider the transformation r _> „r ; p _> „p ; (9.5.16) which leaves the commutation relations and the Hamiltonian intact. (We have seen the one-dimensional version of this unitary reflection in Problems 2-9 and 2-24). Under this transformation L-> L , A-+-A, AxA->AxA, (9.5.17) therefore only L is possible on the right side: AxA = \hCL , (9.5.18) where C is a scalar constant of motion. To determine its value, look at = AyCLz + CLzAy - AzCLy - CLyAz (9.5.19) or, with (9.5.12), ~ [AX,A2] =(Ax L)XC - C(L x A)x = ^,^-^] inhere tt [Ax, L j — LyAz + AzLy LzAy — AyLz = (Lx A)x ^{Ax L)x . (9.5.21) Therefore, C = -2H/(nZ2e4) in (9.5.18) and we have O TJ AxA = ih-^—-L . (9.5.22) fj,Zze* Let's pretend that we don't know as yet the eigenenergies of H and let's write aZ2e4 1 H'= ^^-- with i/>0 (9.5.23) for the value of H in a subspace of given bound-state energy H' < 0. Then AxA = ir*\L md A2 = x-h ihL2 + x) > (9-5-24)
362 9. Two-Particle Coulomb Problem and we define Jl<2 = \(L±hvA) , (9.5.25) so that L = Ji + J2 , HvA = Ji - J2 ■ (9.5.26) What are the commutation relations for Ji and J2? First, " 7^ [X, ftlMj,] + — [ftlM,,,, Lj,] J = - (Lz - Lz - HvAz + TwAz) = 0 , (9.5.27) so J1 and J2 commute. Then j^[^,^i»] =4(-¾^1¾] + T^[hvAx,hvAy] + -[LxMAy} + -[KvAx,Ly}) = ~(LZ+LZ + %vAz + %vAz) = Jlz , (9.5.28) so J1 x J\ = \KJ\ and similarly J2 x J2 = iriJ2 • (9.5.29) Ji and J2 are two independent angular momenta! There is one constraint, however. Notice that r fj,Ze2 A-L = --L -(pxL-iy-I = 0. (9.5.30) Therefore = n2v2-L2 ~n2 j2 = j2 = ^(z2+ftW) = ^2(^-l). (9.5.31) The eigenvalues of j\ = J\ are j(j + l)H2 where j = 0, |, 1,... or 2j + 1 = n = 1,2,3 So v2 = 4j(j + 1) + 1 = (2j + 1)2 = n2 (9.5.32)
9.5 Conserved axial vector 363 and we find, once again, the Bohr energies It is clear that these energy eigenvalues are degenerate, corresponding to the 2j + 1 = n values for m\ and mi independently: the multiplicity is n2, as we know. Alternatively we can label the states by the eigenvalues of L = JX+J2 : \j\ -j2\<l<ji+ h • (9.5.34) = 0 = 2j = n - 1 n-l The multiplicity computed this way, Yl (21 + 1) = n2, is, of course, the same. Notice that n = 1 is j = ji = j2 = 0 so that the ground-state vector I ) obeys Ji| } = 0 and J2| ) =0 (9.5.35) or L\)=0 and A\)=0. (9.5.36) For the wave function ip(r) representing | ), we have r x ~V4>(r) = 0 and (-+ -¾^ V ) Mr) = 0 (9.5.37) l \r fiZe1 J which say that and then </>(r) = m , ( ^ + |- ) V(r) = 0 , (9.5.38) fc2 </,(r) = ^(0)e~Zr'a° with a0 = — . (9.5.39) The (positive) value of r/>(0) is, of course, determined by the normalization, /r°° irn^ (dr)|^|2 = 4tt[^(0)]2 / drr2e~2Zr/ao = ^[iP(0)}2, (9.5.40) = 2(Ia0/Z)3 and we arrive at the familiar ground-state wave function 3 ^n=l{r)=n~i(J-y e-Zr/a0 (95>41)
364 9. Two-Particle Coulomb Problem What is VVt=i(p)? We can find it by Fourier transformation - indeed, this is part of Problem 8-7b - but why not directly from (A + i^'xi;)l>-(7 + ^')l>-0 (95-42) or (- + i— J | ) = 0 with p0 = HZ/ao = \iZe2jh , (9.5.43) using the p description in which r —> ifi^-? The problem is: how to handle 1/r. But | ) is the n = 1 energy eigenstate: 2 n2 (9.5.44) p2 Ze2 2n r 1 (hZ\\ , p2 > = -£^J !> = -£!> so 'l> = ^2+«l> = ^+«l> which gives [r(p2+p2) + 2inp]| )=0. And now we see that (r —> iS^fe) 5 dp (Pz+Po)+2p 1pn=l (p) = 0 or So, (p2+pg)-1^(p2+Pg)Vn=l(p)=0. (9.5.45) (9.5.46) (9.5.47) (9.5.48) ^n=l (p) OC -rj- ip2 + Po) ,2\2 (9.5.49) and, as expected for I = 0, the wave function depends only on the length p of vector p; there is no directional dependence. Normalization determines the modulus of the proportionality constant, but its phase cannot be chosen freely anymore because this choice was made earlier when we opted for a positive ipn=i(r). Therefore, we find the p = 0 value from ♦"•-^/(sSr*''' (9.5.50)
9.6 Weak external fields 365 here: i>n=i(p = 0) = 47r(27rft)-2 7r~2 ( — j / drr2e a<> "">/ Jo with the consequence ao, = 1 f2oo\§ = I /2/ * ■K\hZ) n \p0/ 2 /77 1 = 2(a0/Z)3 (9.5.51) When verifying, as a check, that the normalization is correct, 4 2pl //>oo ^ (dp)|Vn=j(p)|2 = 47ry dpp2- r2(p2+pg)4 16 fl IT ■ [ d**t(1-*)»= — x^, (9.5.53) P2 + Po = Po/t = 3(1( !|!/3! we meet, in a typical context, Euler's beta function integral [ At tn(l - t)m = /^ 1M , (9.5.54) Jo n!m! (n + m + 1)! here for n = §, m = |; and (—|)! = \/n is worth memorizing. 9.6 Weak external fields Now let's impose a weak electric and a weak magnetic field (homogeneous in space and time). The small change in the internal Hamiltonian is 8H=-e'r-E- ^'^ LB , e = —lei . (9.6.1) 2c This is essentially the leading correction of (8.6.27), for e = |e|, except that effective coupling strengths -- e' for the electric interaction, (e/M)' for the magnetic interaction - appear; see Problem 9-18 for details. First we review, in a little more detail than in Section 8.2, the basics of first-order perturbation theory. We recall that [E(X)-H(X)]\E, 7)=0, (9.6.2)
366 9. Two-Particle Coulomb Problem implies and note that, on multiplication by a small 5\, this becomes (E, j\SH\E,i) = 5E(E,7')%,7') • (9.6.4) As told by the appearance of the Kronecker symbol ($(7,7'), the correct choice of degenerate states is that which diagonalizes the matrix of 5H for these states. In short, they are the eigenvectors of 6H in this space of degenerate states. Finding the eigenvector of L ■ B is easy; what to do about r ■ El Here, we want to remember something, namely that , dF, , 1 (F,7| —|F,7') = (E,j\-(FH - HF)\E,j') = 0 (9.6.5) for any operator F that has no parametric time dependence, dF/dt = 0. Now look at the axial vector: r 1 (p x L — L x p) r 2/iZe2 r d 1 -(rxl-lxr) r dt 2Ze2 = ^ , (9.6.6) where the dotted equal sign states equality up to a total time derivative, and at the unperturbed Hamiltonian: p2 Ze2 H=£ =T + V (9.6.7) 2/j, r so that - = -J^rV • (9.6.8) r Zez If we can effectively replace V by H, we shall exhibit r in terms of A/H and time derivatives that do not contribute. For this we use the virial theorem: —r ■ p = 2T + V = 2H - V (9.6.9) where it should be remembered that (although it does not matter) r ■ p is really \{r -p + p ■ r), and 11 \ 1 f dr dr \ d 1 , ,n „ _. 2(*-P + P-*-)=^(r-dt + dt'rJ=dt2"r ; (9-6-10)
9.6 Weak external fields 367 so d2 1 V=^2H+^~nr2 . (9.6.11) Now, then r 1 , ,rt 2 r d' 1 2 _=_r(_v) = __rH+___^r 2 „ 1 d2r 2 Ze2 2 r or, in conjuction with (9.6.6), = Ap = -Ze2r/r3 = -4^rH-\T- (9.6.12) So f^^-^rH. (9.6.13) 3 Zp2 r = |^A (9.6.14) and SH^.l^A.E.^lMlL.B. (9.6.15) 4 -if 2c In effect, we have replaced the position vector r in (9.6.1) by a suitable multiple of the axial vector A. Since A commutes with the unperturbed Ham- iltonian (9.6.7), this equivalent version of 6H is fit better for a perturbative evaluation. For the degenerate states of principal quantum number n, Ze2/(—H') = 2n2ao/Z, this becomes 5H^^n2e'A.E^^BlL.B Z Zj ZC 3ne' (e/M)' /nfii«x =---—nnA-E — 1-1- L-B, (9.6.16) 2p0 2c where we must also observe that, for such states, nhA=J1-J2, L=J1+J2 (9.6.17) [v -> n in (9.5.26)]. So
368 9. Two-Particle Coulomb Problem SH^-^e.^-jj-MMLb-^+J*) + 2po 3ne' 3ne' "2p7 2c is + ^bV* #■ 2c (e/M)' 2c B ■ J2 (9.6.18) We see that the physically significant directions are of these two combinations of E and B, not of E and B individually. The eigenvalues of Ji and J2 along these directions are m\h and n^/i, where both m\ and rn-2 range independently from j = \{n — 1) to —j = —|(n —1), which are (2j + l)2 = n2 states: <5£L + ™E+WMYB E-^^-B 2c 2po 3ne' 2po fimi ftm2 (9.6.19) In general, the degeneracy is removed, the exception occurs where E and B are perpendicular, when the two combinations have the same length, and only the difference m\ — 77¾ appears. Note that for B = 0 and F = \E\ we get SE = ^^-FUm, - m2) = -^^(^1 ~ ™a) ; (9.6.20) /po 2 Z this agrees with the parabolic coordinate result (8.5.19) - there e was |e| - which was stated in terms of k\p = m-i,2 + |(n — 1) = 0,1,... ,n — 1 . Problems 9-1 Verify directly from the definition that p = fj,v with v =V\ — V2- 9-2 Verify explicitly the commutation relations between all components of R, r, P, p. 9-3a The binding potential of some two-atomic molecules may be approximated by V(r) = V0 \ r I r where Vq and 7*0 are phenomenological constants. Find the energy eigenvalues (relative motion) associated with this potential.
Problems 369 9-3b For many large molecules one has HV0rl > H2 . For the corresponding small parameter, give the three leading terms of the energy eigenvalues. 9-4 Particle 2 is initially at rest. Particle 1, of equal mass, and initial momentum p'/, collides with it. What are the initial values of P and p? The collision turns the relative momentum through the angle 0. What is the angle 6 between the initial momentum p" and the final momentum p[, which is the scattering angle when particle 2 is initially at rest? Supplement your analytical derivation by a diagram that makes the result evident. 9-5a Use the construction (3.7.34) of Legendre's polynomials P/(£) to show that ... xl (. d2 \ sin a; M = ¥T\{1 + w) "V for the radial wave function defined in (9.2.5). State, in particular, the explicit forms of jo (x) and j i (x). 9-5b Note that (why?) (V2+fc2) e^rcosd =Q What differential equation does ]i(kr) obey as a function of r; as a function of x = krl What would you conclude from the differential equation about the behavior of ]i(x) for small x; for large xl 9-5c Through an appropriate transformation of a differential equation and comparison of asymptotic forms, recognize that expresses ]i{x) in terms of a closely related Bessel function. [The j; are known as spherical Bessel functions.] 9-6a One of the definitions of the factorial function is n=l where Euler's constant 7 = 0.5772--- is fixed by 1! = 1. Check that the relation z\ = z (z — 1)! is obeyed.
370 9. Two-Particle Coulomb Problem 9-6b Use this construction of z\ to prove the relation stated in (9.2.27). 9-7 The spherically symmetrical potential V{r) falls off faster than 1/r for large r; the radial wave function for positive energy E, Ri(r), is asymptotically 1 kr Ri(r) = — sin(fcr — ^nl + 5t) . Explain why the correct wave function for a particle (relative motion) incident along the z axis, with wave function e is oo il>(r,0) = £(2i + l)i'e^i?,(r)P;(cos^) . Find the scattering amplitude /(#): </wt. = -^-/(0) ■ What is the differential scattering cross section per unit solid angle? What is the total cross section? 9-8a Wave function ip(r,t) obeys the Schrodinger equation ■t,d i 7?2 - — V2 + V(r) The probability density p and the probability current density j of Problem 6-la are then given by (check this) p = %!)*%!), j = -RfiffTV^i They satisfy the continuity equation which justifies the interpretation of p and j as particle density and particle flux vector, respectively. 9-8b In a state of definite energy the flux vector of Problem 9-8a obeys /dS-j = 0, JS for any closed surface S. Why? Consider the asymptotic scattering wave function (time factor omitted)
Problems 371 ., eifcr \l> oc e1KZ + /(0) with z = r cos # . By integrating over a sphere of large radius (fcr 3> 1), conclude that the total cross section is given by Air a=TIm(/(0 = 7r)) . Verify this so-called optical theorem for the phase construction of /(0) and a you found in Problem 9-7. 9-9a Use the operator equation of motion to show, for any operator F not explicitly dependent on time, that (£',7'|F(*)|£;",7") = e^E'^E"^(E'n'\F\E",7") . Is this consistent with the fact that (-B',7'; t\F{t)\E",~f"\ t) is independent oft? 9-9b Perturbation theory: H = Hq + H\, Hi small. Use the action principle to show that the small change that Hi produces in the time transformation function is <J(l|2) = -i(l|y AtHi{t)\2). 9-9c With Hi not an explicit function of t, and the initial and final states eigenstates of Ho, that is {l\2) = {E',i;ti\E",i';t2) show that the time transformation function in Problem 9-9b produces the factor ^(E1 - E")U _ ^(E1 - E")t2 -\h E' - E" 9-9d Demonstrate that in the limit of large T = ti — t2, i(E' - E")ti _ UE' - E")U E' - E" ->^T5(E' -E"). At this point you should conclude that the probability per unit time for E",i' ->£;',7' is
372 9. Two-Particle Coulomb Problem j-5(E? -E")\(E',7'|^|£;",7")|2. In the quantum literature this is known as Fermi's golden rule. 9-10a Scattering: Let H0 = p2/(2M), Hx = V(r). The wave function describing a momentum state [compare with <j)p of (10.8.12) below] is The average flux of particles in the initial state is F = v\ipp»\2 with v=\p"\/M. Introduce momentum-space spherical coordinates to write (dp') in terms of dE' and the solid angle dO, The differential cross section is defined by , probability per unit time for all deflections into dO da — — . fluxF You should end up with This is known as the (first-order) Born approximation. 9-10b Let V be the screened Coulomb potential V(r) = — e~r/a ; r what is d<r/d/2? What do you get in the limit a —> oo? 9-11 The differential cross section (center-of-mass frame) for the scattering of two particles that are distinguishable in principle, but not in practice, is |/(0)|2 + \f(n -0)|2 , the "classical" part in (9.4.10). Consider the scattering of two spin-| FD particles. Apply the above to the scattering of particles with opposite spins; use anti symmetrical wave functions for parallel spins. Compare your result with that of Section 9.4. 9-12 Carry out the Fourier transformation required to get the hydrogenic ground-state wave function ipn=i(p) from the known ipn=i(r)-
Problems 373 9-13 Verify the statement about linear p dependence at (9.5.5). 9-14 The polar equation of a conic of eccentricity e is a(l-e2) r = —- — . 1 — e cos 6 State the geometrical significance of a and 9. Then use the axial vector A to arrive at this form of the classical Kepler* orbit. Draw a Kepler ellipse and indicate A. So, why is A called axial vector? 9-15 Axial vector: Use the result for the time average of r, along with the structure of A, to rederive the expression for (r)n t given in Problem 8-ld. 9-16 The hydrogenic state mi = rn2 = \(n — 1) obeys (why?) {Jix+UiV)\ ) = 0, (J2x + iJ2y)| ) = 0 , or (Ax+iA„)\)=Q, (Lx+iLy)\) = 0. Check that, to within a numerical factor, the solid harmonic with m = I, a solution of Laplace's equation, is (x + iy)1, so the required wave function is proportional to (x + iy)n~lf(r). Use the axial vector condition to find f(r). Exhibit the normalized wave function. [These states are known as circular Rydberg states, in particular for large principal quantum number n.] 9-17 The n = 2 hydrogenic state, of multiplicity 4, is described by two spin-Jj's. We know that two spins have three symmetrical states (I = 1) and one antisymmetrical state (I = 0). Explain why (to within a constant) Az\n = 2,l = l,m = 0) is \n = 2,1 = 0,m = 0). 9-18 The two-particle system of: (1) charge e, mass m; (2) charge — Ze, mass M — m; is acted on by weak, homogeneous, electric and magnetic fields. Consider the internal motion and arrive at the effective electric charge e' for the electric dipole moment, and the effective ratio (e/M)' for the magnetic dipole moment. Apply your results to the situations: (a) m/M C 1 (to first order in m/M) as in a hydrogenic atom; (b) M = 2m, Z = 1, as in positronium. 9-19a Consider the four degenerate hydrogenic states for n = 2 that are labeled by angular momentum. Construct the 4x4 matrix of —e'Fz. What are its eigenvalues and eigencolumns? Compare with the known result for the linear Stark effect. "Johannes Kepler (1571-1630)
374 9. Two-Particle Coulomb Problem 9-19b Repeat the above for the matrix of —e'Fx. Were your eigenvalue answers to be expected? 9-19c Extend the last calculation to the matrix of —e'Fx — j- (e/M)'LzB. Does it work out correctly?
10. Identical Particles 10.1 Modes. Creation and annihilation operators We now begin the construction of an algebraic theory of identical particles known (inaccurately) as second quantization. A brief reference, in the context of angular momentum, already appears on page 158. Recall that, for a single system, physical property / is displayed relative to the complete set of measurements A, 53|o')(o'| = l, (10.1.1) a' as / = E l«')(«'|/|«")(«"l = E (a'\f\a")\a'a"\ . (10.1.2) a\a" a',a" Now consider a collection of identical systems, henceforth called particles, which are labeled k = 1,2,... , n. Then /*= E(°*l/*K)lo'o"l*' (1(U-3) ^ _ _ ^ a',a" v , , = {a'\f\a") inasmuch as the relation between / and A is the same for each of the identical particles. Linear momentum, angular momentum, kinetic energy, are examples of a one-particle physical property, described by a one-particle operator that is constructed additively: n n ^ = E/*= E<a'l/K>EK«"I* • (101-4) A=l a',a" A=l * v ' = |a'a"| = |a"a'|t This generalization of the measurement symbol \a'a"\ is distinguished from the original by the property
376 10. Identical Particles Y,\a'a'\ = Yl \a a I a' LA—1 = YlY,\a'a'\k = (10.1.5) A—1 a' = 1 We also note the product n n I I II I I III lv\ \ ^ \ ^1 I II I I /// /y/l a a a a = > > \a a U a a b a=i;=i El I II I I /// iv\ . V^ 1///11/// /i/| |aa |A|a a |/ + 2_Jaa 1*1° a |* a^; = <S(a",a'")|a'a"'|* = 5]|a'a"|A|a'"a"/|; + <J(0")0'")|o'a','| (10.1.6) a^/ and, similarly [a' «->■ a'", a" «->■ a'", &«->■£] I /// /y/| I I //1 \ N /// /1/11///1 , XI I lv\\ HI "! |a a ||aa | = >|a a |; |a a |fc+o(a,a )|a a | a^/ (10.1.7) The two k / I terms are the same, because operators associated with different particles, different degrees of freedom, commute. Therefore, on subtracting the two, we get [! I II! ! in iv\\ r/ n ni\\ I iv\ r/ I iv\! /// //! /-in 1 c^ I a a |, I a a | = o(a , a )|a a | — 0(a ,a )|a a | . (10.1.8) Note that on putting a' = a" and summing, we get, consistently; [_ ! /// /i/!l ! in iv\ ! /// iv\ n dr\ 1 m n, |a a | = a a — a a | = 0 . (10.1.9) Also, !//!!// /Ml £/ / //\! / //! £/ / //\! // /! n /in -i m\ a a , a a | =o(a,a ) a a — o(a , a ) a a| = 0. (J 0.1.10) In order to distinguish between the states of the n particles, and those of a single particle, we henceforth call the latter modes. Accordingly, we describe I i n I \ N i n\ \a a I = 2-j\a a r (10.1.11) A=l as symbolizing a measurement in which one of the n particles is removed from the a" mode and put into the a' mode (this is sinistral reading - right to left) or, in factored form: \a'a"\ = ( Create an 1 x ( annihilate ^ - .,,^W„» a! particle J I an a" particle J = </>(a') W) , (10.1.12)
10.1 Modes. Creation and annihilation operators 377 where tp(a')^ and i>{a") are generalizations of |a') and (a"\, respectively, as introduced in Section 1.3 (reading was dextral then). One calls the -0(a')t's creation operators, the tp(a")'s annihilation operators, and either kind ladder operators. Note that the construction (10.1.12) is consistent with the relation ! I II ii I II I I \a a \' = la a \ (10.1.13) The meaning we give to ip(a"), that, applied to a state with n particles, In), it produces a state with n—1 particles, and that ip(a')^, applied to a state with n—1 particles produces a state with n particles, frees us of the restriction to a specific number of particles. From now on, n is any non-negative integer, and 53|o'o'| = n -> 53^(o')V(a') = N (10.1.14) a' a' where N is the total number operator, with eigenvalues N' = n = 0,1,2,... . (10.1.15) It is natural to extend this to ^(o')V(o') = N(a') , (10.1.16) the operator for the number of particles of mode a'. That is consistent since [N(a'),N(a")]=0 (10.1.17) as a consequence of (10.1.10), so that it is possible to specify simultaneously all the eigenvalues N(a')' = n(a') = 0,1,2,... . (10.1.18) Indeed, a state is characterized by that collection of eigenvalues: |{n(a')}) = |ni,n2,...) (10.1.19) where ni, n2,... represent some ordered labeling of the modes and the number of particles occupying them. Such states with a definite number of particles in each mode are frequently called Fock states. The physical meaning of r/>(a') and N(a") is conveyed by ,,,,,,1, u / n(a"Ma')|{n}> for a" / a'1 [[n(a')-l\ip(a)\{n}j for a" = a J = iP(a')[n(a") - 5(a' ,a")}\{n}) = V(a') [N(a") - 6(a',a")]\{n}) (10.1.20)
378 10. Identical Particles which is the operator relation N(a")il>(a!) = il)(a!)[N(a")-6(a',a")] (10.1.21) or ty(a'), W(a")] = 6(a',a")7P(a') . (10.1.22) The analogous treatment of the creation operator %p(a!)^ gives N(a")iP(a'y = V(a')f [W) + S(a',a")} (10.1.23) or [N(a"),iP(a'y] =<J(a',o")^(o')t (10.1.24) which, as it should be, is the adjoint of the ip relation (10.1.22). We can check this against the general commutation relation (10-1.8), now written as [iP(a')^(a"),^(a'")^(a'v)} =S(a" ,a'")iP(a')^(a'v) -6(a',a'v)iP(a'")^(a") . (10.1.25) Putting a'" = a'v specializes it to [il>(a')^(a"),N(a'")} = <5(a",a">(a')V(a") - <S(a',a"Xa')W) (10.1.26) where, indeed, [^(o0V(o'0.^(o''0]=^(o0U^(o''),^(«"')]+Wo')t»^(o''')]^(o") = <5(a",a"')V(a') W) - <J(a',a"Xa') W) • (10.1.27) For the decisive step consider the creation of two additional particles in either order VKaOW^lM); ^(a'OW^IW)- (10.1.28) The states are physically indistinguishable, the vectors can differ only by a phase factor, ^(o')V(o")t =C(o',o")^(o")V(o')t (10.1.29) with |C(a',a")|2 = l, C(a',a")* = -^-. (10.1.30)
10.1 Modes. Creation and annihilation operators 379 Reversing the order twice is no change at all, so that C(a',a")C(a",a') = l. (10.1.31) Then taking the adjoint of (10.1.29) and interchanging a' and a" gives </,(aXa") = C{a",a'Yil>(a")il>{a') . (10.1.32) Now look at [il>(a'),N(a")] = ^(o')^(o")V(o") -^(o")V(o")^(o') = (iP(a')iP(a"y -C(a",a')^(a")V(o'))^(o") = 5(a',a")ip(a") (10.1.33) from which we infer that *l>(a')il>(ay -C(a",a')il>(a''?il>(a')=8(a',a'') . (10.1.34) Next, put this in the general commutation relation (10.1.25), requiring ^(o')V(a")^(o"')V(a"0 =V>(a')f [C(a"V>(a"')ty(a") + 6(a",a'")}^(a'v) = <5(a",a">(a')W) + C(o',a"')C(o'",o")^(o"')V(o')V(o")^(o"0 (10.1.35) and likewise ^(a'")^(a'v)iP(a')^(a") =5(a' ,a'v)^(a'")^(a") + C(a',a'v)C(a'v,a")^(a'")^(a')^(a")iP(a'v) . (10.1.36) So, (10.1.25) is satisfied provided that C(a',a'")C(a'",a") = C(a',a'v)C(a'v,a") . (10.1.37) This relation and the ones in (10.1.30) and (10.1.31) can only hold if C(a', a") is of the form C(a',a") = ±j<P(.«')-Ma") . (10.1.38) Only the sign is relevant, however, because, if y(a') is not zero, it can be made so by a redefinition of the annihilation and creation operators: V(o') = Mar>Nil>(a') , V(a')f = V(oOf e-^^O^ . (10.1.39)
380 10. Identical Particles Note the commutation relation expressing the significance of tp, -0*: e-^Va')' = </<(«')' ^{N + 1} (10.1.40) which generalize (10.1.22) and (10.1.24). So V(o') V(o") = el^a')NxP{a')e^a")NxP{a") = e^(a') + V(a")^e^(a")^(o')^(o") = ±ei^(a') +V(«")^eiy(a')</,(a»)</,(a') = ±V(a")V(o') (10.1.41) and V(a')W)fT W)fW) = ei^(a')iV^(0')^(o")te-i^(a")iV T eM«')^ e-^(a')^(o") V(o') e1^") <rW{*")N (10.1.42) or V(a')>(a")t:F V(a")fV(a') = eW^^a'MQt -C(a",a')V>(a")V(a')j e^"")* = S(a',a") = 6(a',a"). (10.1.43) In summary, there are two types of identical particles: those with BE statistics (r/>'s commute), called bosons - and those with FD statistics (r/>'s anticommute), called fermions. The table BE statistics FD statistics [l>(a')Ma")]=0 {^(o'),^(o")}=0 [</>(a')f, </>(«") + ] =0 {^(a')t,^(«")+} =0 [^(a'),^")*] =<J(a', a") ty(a'), </>(«")+} = <J(a',a") (10.1.44) lists corresponding commutation relations. For later use we introduce the notation { \A, B}= AB- BA for BE symbols, \{A,B}] = { \ \ (10.1.45) Ll JJ \{A,B} =AB + BA for FD symbols, ' then [{^(o')^(«")}]=0, [{^)^(0^=0, [{^(a'),^")*}] =<J(o',o") (10.1.46) replace the three pairs of (10.1.44).
10.2 One-particle and two-particle operators 381 10.2 One-particle and two-particle operators The BE system we recognize as the non-Hermitian operator (y, y^) description of a number of degrees of freedom, and consistently, the eigenvalues of N(a') are indeed N(a')' = (V(a')V(a'))' = "(<*') = 0,1,2,... . (10.2.1) How about the eigenvalues of N(a') for FD statistics? Here N(a') (1 - N(a')) = ^(a')ty(a') (l - ^(a')ty(a')) = V(a')f ^(a')V'(a') ^(a'V = 0 , (10.2.2) = 0 so that N(a')' =n(a')= 0,1. (10.2.3) If two particles occupied the same mode, that situation would be symmetrical; it cannot occur for FD statistics. This is Pauli's Exclusion Principle: one particle in a mode excludes any additional ones. A degree of freedom with just two outcomes of measurement? That's also familiar: spin |. The construction tl> = |(cti - ia2) , Vf = Wi + i*2) (10.2.4) gives O] = tp + ifi , ct2 = ir/> — itp^ , a3 = —<j\Oi = ip^tp — tpip^ , (10.2.5) for which the basic algebraic relations, o-j = 1 , cr2 = 1 , er3 = 1 , {*i,ct2}=0, {a2,a3}=0, {a3,ai}=0, (10.2.6) are verified easily, and N = ^^ = \{^^} + \[^^} =| + |a3 (10.2.7) has eigenvalues N' = | + | = 1 and N' = \ — \ = 0, indeed. But what about the fact that FD operators for different modes, which should be different degrees of freedom, anticommute rather than commute? This is considered in Problem 10-2, and also here, using the commuting ct's associated with different degrees of freedom. Let the modes be numbered, so that a" > a' means that a" is later in the sequence than a'. Consider the constructions
382 10. Identical Particles </>(«) = </>(«')* = II CT3(a"') a'"(>a') II CT3(a"') a'"(>a') o (°) . ^1 + i^2 , ,,. o (°) • (10.2.8) Clearly if one multiplies operators for a common mode - that is ip(a')tp(a') or %j)(a')%l){a'y or ^(a'^^a')* - all is as before, because II ^(O a'"(>a') Now consider a' < a", for instance, and look at = II Ma'")]2 = l- o'"(>o') (10.2.9) </>(«>(«") versus </>(«")</>(«') = II CT3(a"') o'"(>o') II CT3(a'") a'"(>a") 0\ - 1(72 («') CTl — 102 («") II CT3(a"') o'"(>a") II CT3(a"') o'"(>o') <J\ - 102 («") (10.2.10) CTl - 1CT2 («') (10.2.11) where one wants to move the left-hand \{p\ —\oi) over to join its right-hand partner. In doing so, \{p\ — i02)(a') moves through the product ]fj0-3(0'") with a!" > a" > a', so that a' is not in this product and nothing happens. On the other hand \{<J\ — io-2)(a") passes through the product with a'" > a' which, since a" > a', includes <73(a"), and therefore there is a sign reversal: ^(o")^(o') =-^(a')^(a") or {^(a'), 4>{a")} = 0 (10.2.12) Similarly, the other anticommutators in (10.1.44) are verified. All this gives any one-particle operator (10.1.4) the construction f= Y.^a'y(a'WW) (10.2.13) Here is the origin of the term "second quantization". Suppose we consider a single system (particle) and write down the expectation value of / in the state described by the wave function ip(a'). It is (/) = <|/| )= E(l«')(»'l/K)(«"l) (10.2.14) = 4,(a'y = V(a")
10.2 One-particle and two-particle operators 383 F, which refers to any number of particles looks as though it has been produced by elevating the wave function (first quantization) into an operator (second quantization). In Section 3.4, we have already met an example of this construction in another notation (dimensionless variables): J = yt 1 = £ y{a')^a'\\a\a")y{a") (10.2.15) with [y(a),y(a')}=0, foV^V)] = 0 , [y(a),yHa')]=6(a,a'), (10.2.16) which, we now recognize, says that any angular momentum is a BE collection of spin-1's. In this connection we noted then that J obeys the proper commutation relations because \<r does. That is an example of the statement [F,G] = ]T il>(ay(a'\[f,g]\a")il>(a") , (10.2.17) which follows from the commutation properties of the ip(a'Y, ip(a"), a restatement of the commutator Yjfc>Yl 9i = ^2[fk,gk] ■ How about the individual product FG? On the one hand, k I k k^jtl (10.2.18) (10.2.19) a sum of a one-particle operator and a two-particle operator, and, on the other, FG=Y,^y(a'\f\a"Ma") a',a" x J2 VKOV'Ma'XO. (10.2.20) Here we encounter (10.1.35) [with C(a',a'")C(a"',a") = (±1)2 = 1] ^>(o') V(o")^(o'") V(o"0 = S(a",a'")7P(a')^(a'v) + ^(a')V(o"')V(o"0^(o") (10.2.21)
384 10. Identical Particles so FG= $>(a')V|/0|«'Xo") a' ,a" + E ^{a')^{a'y{aW')^"\g\a'v)^{a'v)^'). (10.2.22) a',... ,afv A one-particle operator has a single %[' (and a single ?/>*); only one particle is required for it to contribute. A two-particle operator has two r/>'s (and two r/^'s); at least two particles are required for it to function. More generally, a two-particle operator is F=lY,f"1 with hi = fik- (10.2.23) 2 z k^l Now so where r \ ^ ! I I! ! in ill! £ \ iv iv\ ! // //! /a; = 2_v |aa|*|a a \ifki\a a \i\a a \k a',... ,a" = E fon(0*|/*K)K)J0'01*K'°1' (10.2.24) = (a ,a \f\a ,a ) = (a'",a'\f\a'v,a") F=\ E ("'«'"|/|o"o")Elo'o"l*lo'"o'1' (10-2-25) a',...,a" k^l =^(o')V(o")^(o"')V(o"0 - <J(o",o"')^(o')V(o"0 =V(a')V(a'")tV'(a"/)V'(a") (10.2.26) and the general form of a two-particle operator emerges as F=\ E V'(o')V(a'")t(o'a'"|/|a"a"/)V(a"/)V(a")- (10.2.27) a',...,a" Seeing this, it's easy to appreciate that a fc-particle operator is ' 0(^,...,0(2^) x (o^,...,0^-^1/10^),...,0^)) x^').-^). (10.2.28)
10.3 Multi-particle states 385 10.3 Multi-particle states What are the eigenvectors (10.1.19) of states with specified numbers in the various modes? As an obvious generalization of the BE angular momentum treatment, we have with ^(o')|0) = 0 for all a'; (10.3.2) the vector |o) describes the state with no particles, the vacuum. We see that ^(a')f \{n}) = \{n + l(a')})y/n(a') + 1 (10.3.3) and that ('</> -> d/dtp^) *l>(a')\{n}) = \{n - l(a')})Vn(arj , (10.3.4) in words: {tp(a'y I f increases by one 1 ,, ,. 'o {«}) { , , } r/>(a ) J I decreases by one J the number of particles in mode a . (10.3.5) These statements, of course, combine into tyty)(a')|{"}>=|{n}>n(a'), {^)(a')\{n}} = |{n})(n(a') + l) . (10.3.6) The same construction applies to FD statistics, except that - with the restriction n(a') = 0,1 - the factorial n(a')\ = 1 is always unity. Also the order of multiplication is significant, some standard order must be adopted. In effect, \{n})= Y[ ip(ay\0) with ^(o')|0) = 0 (10.3.7) (n(a') = l) is simply a product, over occupied modes. Now one has <Ka')|{n}> = |{n- l(a')})(-l)n(<a')y^0 _j\{n- l(a')})(-l)n(<a,) for n(a') = 1, ~ 1 0 for n(a') = 0 ; (10.3.8)
386 10. Identical Particles and i/>(ay\{n}) = \{n + l(a')})(-l)"(<a'Vl ~ "(a') f 0 for n(a') = 1, = < , , „ (10.3.9) ( |{n + l(a')})(-l)n(<a > for n(a') = 0 . The sign depends on n(< a'), the number of particles in modes a" < a', that is: V(a")f stand to the left of </>(«')* in (10.3.1), n(< a') = 5Z n(a") • (10.3.10) a"(<a') 10.4 Dynamical basics So far, this has been kinematics, the study of operators and vectors, without reference to time. Now to dynamics. The basic variables are ip(a',t) and ip(a',ty. How do they fit into the action principle? That is quite evident for the BE situation because the ip(a',t) and ip(a',ty are just examples of non-Hermitian variables combining q,p type variables. We know that L = ^2ihi/)(a',t)^il)(a',t)-H (10.4.1) from which we derive the equations of motion and *&«>v = m!h* (10A2) d BH V'«' = -siM' (ia4-3) which are adjoints of each other, and the generators G = Gi,+Gt=- Y^ityi0'',tf5ip(d\t) - HSt. (10.4.4) a' If the Hamiltonian is just a one-particle operator H= Y,^(a',ty(a'\h\a")ip(a",t) (10.4.5) we get the equation of motion
10.5 Example: General spin dynamics 387 J,(n' +\ = ih^-7p(a',t) = ^2{a'\h\a")il)(a",t) , (10.4.6) compactly written as ih^-Tp = hip , (10.4.7) a linear operator equation that is essentially a Schrodinger equation. Let's illustrate this with the example of angular momentum, considered as a BE collection of spin-|'s. 10.5 Example: General spin dynamics As an example we treat the motion of an arbitrary spin S in a time varying magnetic field B(t) to which the magnetic moment jS couples. The Hamilton operator H = -jS-B(t) (10.5.1) has a parametric time dependence through B(t). Recalling that for any angular momentum 5J = (iH)~l[J, 5u> ■ J] = 5u> x J, we find the equation of motion ^ = ±_[S,H]=jSxB; (10.5.2) the right-hand side is, of course, the torque on the magnetic moment jS in the magnetic field B. The conservation of S2 suggests to solve (10.5.2) for the given value of s [S2 = s(s+l)H2; s = 0, \, 1,... ], but it is much simpler, and more systematic, to deal with all s values at once. We regard arbitrary 5asa BE collection of spin-1's, ~S = y^ay, V = (l+_) > V* = (vW-) > (10-5.3) which uses the notation of Section 3.4. The Hamiltonian (10.5.1) then appears as H =-ttyy^a ■ By ; (10.5.4) it is of the one-particle form (10.4.5) with ip(a', t) —> y±. From the BE viewpoint, the equations of motion are d 1 BH '*»=w=^"-B' (ia5-5) which are just two (±) linear operator equations.
388 10. Identical Particles The linearity of the equations of motion says that y(h)=Uy(t2) (10.5.6) where U is a 2 x 2 matrix, and therefore y(*i)f = [Uyfa)]* = y{t^UT* (10.5.7) which involves the complex conjugate (*) of the transposed (T) matrix. Explicitly these are Vafa) = E Ufa'Va'fa) > cr'=± v*fa? = E v*>fa)HuTK>* = E w^w<r'(*2)t ■ (10.5.8) tr'-± a' = ± The commutation relations require that [y<rfa),y<r>(tiV] =6,,,,, = [E^^-^.E^^H^X,^] a" a'" = E u™>>[y*>>fa),yUfa)W*)„,,,„, a",a"1 * *~~ ' = Ocr"cr'" = {UUT*)aa, (10.5.9) which says that U is a unitary matrix, UUT* = 1 . (10.5.10) The dynamical evolution of the system is described by the time transformation function (yt tt\ \y" ,t2)- It suffices to consider the dependence on y^ , for example, &{v*'M = \{v*'MGv< (10.5.11) where Gyt =-ih5yl'y (10.5.12) [see (7.1.5)], so 6{v*'Mv"M = W'MWy{h)\y",t2) = </,*i|«s/Wy(*2) \y",t2) I y" (10.5.13) or <51og(2/t',i1|2/",i2) = (5[2/tW] • (10.5.14)
10.5 Example: General spin dynamics 389 The result, </,ti|y",t2)=e»' *'Uy" (10.5.15) is correctly normalized because (j/* = 0| is the state of zero angular momentum, {y* — 0|S = 0, and nothing happens. All information is contained in the 2x2 matrix U, produced by solving (10.5.5) and writing the solution in the form (10.5.6). As a specific example we choose the rotating field B = Bi (ex cos(ut) + ey sin(wi)) +B0ez so that cr ■ B = £?i (ax cos(wi) + ay sin(wi)) + B0az Accordingly we write = B1e~i(T"faxeia"f+B0az and get or y(t) = e iSra*y(t) i-TT + 2CTz) y= -i2^Biax + B°a^y i-^2/= ~7^[Biax + (Bo +uh)oz]y Now, with B and 9 defined by B = JB* + (B0 + oj/j)2 , Bsin6 = Bi, Bcos0 = Bo+u/j (10.5.16) (10.5.17) (10.5.18) (10.5.19) (10.5.20) (10.5.21) we have
390 10. Identical Particles Biax + (B0 + ujh)az = e~l2avBaz e1^ . (10.5.22) So with we get y= e~{2avy (10.5.23) i±y = -yB±azy (10.5.24) and then %(ti) = e^sM'1 - *2)f(i2) . (10.5.25) To put it together, = Uy(t2) (10.5.26) with W= e"1 2 CT^(ii -^)61 2 ^ , (10.5.27) where Z7(T) = e_i2CT»eiTS2^Tei2CT» = cos(i7jBT) + isin(i7jBT)(CT2 cos 6» + ctx sin 6») . (10.5.28) The explicit matrix elements of (U++ U+^\ "=(»-+»-) (ia5'29) are U++ = e-i^T[cos(|7BT) +icos6» sin(|jBT)] , U— = ei5a;T[cos(|7JBT) - i cos6» sm(^jBT)] = U*++ , W+_ = e_i5w(*» +**)isine sin(|7J3T) , W„+ = 6^^(^+^)1^6) sin(|7J3T) = -U%_ (10.5.30) with T = t\ — <2-
10.5 Example: General spin dynamics 391 Now, applying the lessons of Section 3.4, the expansion </,ti|y",*2> = e^ uv" = J2 ^(V^V'T (10.5.31) ra=0 introduces the wave functions for all the spin states (2/ \s,m) (yl'r + m(yl)s~m with that is \/(s + m)\(s ~ m)! (s,m\y") = ^±L (V-> (10.5.32) X ' ' y/(s + m)l(s-my. s + m = n+ and s ~m = n_ , (10.5.33) n = 2s = 0,1,2,... . (10.5.34) So, for a spin s, we have m which gives all the required propability amplitudes in terms of the four matrix elements of U. For s = |, where </|U> = 2/!'. (Vr\h,~h) =»!',••• (10.5.36) we have immediately (\,\a;h\h\<?';h)B =Uarr, . (10.5.37) In particular, the probability of the transition \ ++ — \ in time T is p(|,-I) = |W+^|2= |W_+|2=sin2esin2(i7BT). (10.5.38) Notice that it oscillates in time, with sin2 (^jBT) ranging between 0 and 1. The maximum value is given by 2 p(I -1) < sin^ = ( ^ I = ,,, , "~iy, , D,2 (10.5.39) 'I -I) < sin'* = ('^V = ^ which reaches unity for
392 10. Identical Particles oj = —jBo (10.5.40) the condition of resonance between the rotating field and the rotation of the angular momentum produced by field Bo. Also, at resonance, the time required to change the probability from 0 to 1 is AT = 1B1 (10.5.41) So, the smaller B\, the sharper the resonance, half-width: 7B1 = Aw, but the longer it takes to build up the transition: AT = n/Au. See Problems 10-4 for s > \. 10.6 General dynamics For BE systems, with commutation relations [</>(a'),</>(a")]=0, [^(oO^a'O^O, (10.6.1) one uses infinitesimal variations 5ip(a'), 5ip(a'y that commute with all operators ip, if)1. This maintains the commutation relations, as [^(00,^(0^=0, (10.6.2) for example. Therefore, analogously, for FD systems, with anii-commutation relations, one uses operator variations that anticommute with all ip, -0*: {<ty>(oO,^(a'0}=0, {67P(a'),7P(ay}=0, ... . (10.6.3) Do such completely anticommuting quantities exist? Recall that n ct3(o"o a'"(>a') n ^(o"o a'"(>a') (cti - i<72)(a0 , (cti +io-2)(aO • (10.6.4) What anticommutes with each and every one of these operators? The product ]>3(o'0 (10.6.5) of all ct3's.
10.6 General dynamics 393 We now want to recognize that dynamics, as described by the Lagrangian L = J2 'Mia', i)f ^-4>{a', t) - H , (10.6.6) a' for BE systems, also includes FD systems. The distinction is implicit in the nature of the 6ip, 5ip^. Begin with Wi2= j dtL= [(y^ih^di/t-Hdi) (10.6.7) and get 6Wi2= f dQri/M/^ - HSt) ^ a' + j (JT iH5ip*dip - Y, Vidip*6ip - 5Hdt + dH6t) •* a' a' =GX - G2 ■ (10.6.8) So G = G^ + Gt with G^=ih^2i>(a',tfSiP(a,t) and Gt = -HSt. (10.6.9) a' The significance of G^ as the generator of variations of the tp is conveyed by 61>(a',t) = ^[ip(y%G^\ = Y,W,t),tl>(a",t)^(a",t)] , a" 0= i[^(y'^,G^] =5][^(o'»*)t»^(o"»*)t^(o"»*)] ■ (10.6.10) a" For BE statistics, the 6ip commute with all ip, ip^, so that [ip,^Sip] = [ip,ip1]5ip , (10.6.11) for instance, and we conclude that ty(a',t), #»",*)*] = S(a',a") , [ip(a',t)l ip(a",tf] = 0 . (10.6.12) With FD statistics the 6ip anticommute with all ip, ip\ as in [ip,ip^Sip] = ipip^Stp — ip^Siptp = tptp^Stp + ■tp^tpStp = {ip,^}Sip, (10.6.13) for example, and
394 10. Identical Particles ty(o\t),^(0",t)t}=<S(0',o"), {V(a',i)t,V(a",i)t} = 0, {ip(a',t),ip(a",t)} = 0 (10.6.14) follow. The other inference from the action principle is a' (10.6.15) which is set against the significance of Gt = —H5t in producing the time derivative of any F(ip,ip\t), dF dF 1 ^ = -3^¾^ <"""•> Again we get from both Now let F=G^: A Fi 1 -Gv, - ^G^ = -[G^, #] = -fytf , (10.6.18) so S+H = -\h ]P ma'; ^ S^a', t)-i^4>(a', t)* (± - |) 6^a',t) . a' a' ^ ' (10.6.19) Thus consistency requires that ---) <ty(0',i) = - [Sip(a',t),H] = 0 , (10.6.20) dt dt) ^v ' ' ihl which is trivial for BE systems, but for FD systems requires that the Hamilton operator H be an even function of ip and ifi. Of course, that's the kind of H we've been talking about, made of one-particle operators [?/>*■•• -0] and two-particle operators [tp^tp^ ■ ■ ■ tptp}. Using an obvious notation for left and right derivatives, the equations of motion for both types are ih-r-ip(a ,t) = drv ' ' a^(o',t)t ' -^^ = 4¾)5 (la6-21) which are mutually adjoint.
10.7 Operator fields 395 10.7 Operator fields Discrete indices are nice, but we want to describe particles that move in three- dimensional space and are specified by position r (as well as, e. g., spin). So, understanding the possible presence of, but not writing until needed, discrete spin indices, we replace ip(a', t) —> ip(r, t). We are now dealing with operator- valued functions of space and time - operator fields. With E -> /(dr) t10-7-1) a1 J the Lagrangian becomes L = y(dr) i^(r, i)f J^(r, t) - H , (10.7.2) leading to the generator Gv,= ndr)ihil>(r,tf8il>(r,t) , (10.7.3) and SiP(r,t) = ity(r,t),G^] = J(dr')[il>(r,t),4(r\t?6il>(r\tj\ , 0= |^(r,i)t,G^ =|(dr') [iP(r,t)\iP(r',t)UiP(r',t)] , (10.7.4) or, using the [{ }] notation introduced in (10.1.45), for the two statistics: H{r, t) = J (dr') [{^(r, t),^(r', i)f}] «ty(r', t) , 0 = J(dr') [{iP(r,t)\ip(r',tf}]5iP(r',t) , (10.7.5) which yields the commutation relations [{^(r,t),ip(r',rf}]=6(r-r'), [{</>M)t, ^(/,0^=0, [{^(r,i),^(r',t)}]=0, (10.7.6) the continuous analogs of (10.1.46). As for the Hamilton operator H, here is a one-particle term: #(1) = [(dr')(dr")il>(r',tf(r'\h(r,p,t)\r")il>(r",t) , (10.7.7) or with
/ s , 396 10. Identical Particles (r'\h(r,p,t)\r") = h(r', ?V,*)<S(r' - r") , (10.7.8) somewhat more simply #(1) = f(dr)ip(r,tfh(r,^V,t}ip(r,t) . (10.7.9) A two-particle term (the model is potential energy pairs) is //(2) = I /"(dp') ■ • • (dr") V(r', i)f </>(*•"', i)f x(r',r'"|t;(ri-r2)|r",r"/) x 4>{r'v,t)4>(r",t) , (10.7.10) or with (r',r'"\v(ri - ^Ir'V} = 5(r' - r")5(r'" - r'v)v(r' - r'") , (10.7.11) more compactly Hl2) = i f(dr)(dr') i/>(r, i)f^(r', i)fv(r - rV(r', t)^(r, t) , (10.7.12) and the symmetry v(r — r') = v(r' - r) replaces /^ = /¾¾ of (10.2.23). The action principle gives (omitting the 5t contribution) 5H = ih (dr) JVCr, i)T —-^ ^- dip(r, t) Define, in 6H(ip ',*)=/ (dr) <5^(r,i)1 <5,i/ + 6rH 8il>(r,t)1 5ip(r,t) -H(r,t) (10.7.13) (10.7.14) the left functional derivative, with respect to ip\ and the right, functional derivative, with respect to ip. That gives the equations of motion in the form .^d t , 8XH in-—%b(r,t) - -r— -r -ift^M)'--^- d_ "at" (10.7.15) H{r,t) ' the continuous analogs of (10.6.21). For H - H^ + H^2\ it is a matter of inspection that + f(dr') 7p(r', tfv(r - r')i/>(r', t) ip(r, t) , (10.7.16)
10.8 Non-interacting particles 397 giving the ip equations of motion h^(r,t) = h(r,jV,t) + f(dr')il>(r',tfv(r-r')il>(r',t) il>(r,t). (10.7.17) 10.8 Non-interacting particles Let's look first at the simplest problem: v = 0, h = p2/(2M), a collection of non-interacting particles: d ., , 1 fh. N 2 mat^r'*) = 2M^TvJ ^(r'*}- (10-8-1} What is the relation between ip(r,ti) and r/>(r',<2)? We know all about that; the fact that tp is now an operator and not a numerical wave function changes nothing. So, ''Kr,h) = f(dr') (r,h |r\h)i/>(r',h) , (10.8.2) where {-Mr'M) = j ^eiP-^-')e-^^-t,) (10.8.3) with „2 EL 2M EP=~ (10.8.4) is the time transformation function of (5.4.14). We want to study the dependence of the time transformation function (%p^ ,ti\'tp",t2) on the quantum numbers ipi . Recall (10.7.3), GV = \h f(dr) ip(r, tfSip(r, t) . (10.8.5) Then G^t is produced as Gv-t = Gv, - <J (ih f(dr) i/>(r, t)ty(r, t)) = -ih f(dr) 6ip(r, tfip{r, t) . (10.8.6)
398 10. Identical Particles So = {^\h\ j{Ar)5^{r)'^{r,h)\^",h) (10.8.7) where, using (10.8.2), ^(r,ti)|^",t2) -> \4>",h) f(dr') (r,ti\r',h)il>(r')" (10.8.8) gives V'log<^'»*ik"»*2)=V f|(dr)(dr')V(r-)t'<r,i1|r',i2)V(r')") (10.8.9) and then <^UK,*2> = e/^X^M'VikW^T . (10.8.10) Again, this is properly normalized because (0, t\ | is the state of no particles, the vacuum, which stays the vacuum. To draw the physical consequences of this expression it helps to exhibit (r, ti \r', £2) in discrete form, as produced by breaking the /(dp) integral into a sum over small (dp) cells (we keep the same notation), so now = 5Z^p(r)e~^p(<1 ~i2VP(r')* (10.8.11) where *<" = V<s$*ei,"r- (I0-812) Then define |(dr)</>(r)tVP(r)=<, /(dr')^p(OW)" = < (10-8.13) and get, with T = ti — £2,
10.8 Non-interacting particles 399 = Y[e1'*e~*E*TM P CO = nE^(<^f' dO.8.14) P np=0 = (^fe4«,VW u\nv Consciously thinking of BE statistics, we get ( t^"p -**-*- ' •* .. /n l .. /n l p np=0 V P V P = 5Z(^'|{n})e"^({n})T<{n}^") , (10.8.15) {«} where £(W) = 53^^ ' nP = 0,1,2,... (10.8.16) p is the energy of the multi-particle state specified by {n}, and n\nv p ■ ,.,. i <^'iw>=n 4^r. <wr>=n %?r (io-^) p /np! " VnP! are its wave functions. These are the evident energy eigenvalues and the familiar oscillator wave functions, now for (infinitely) many degrees of freedom. How does it work out for FD statistics? To this point we have not specified the statistics and taken for granted that one can work with eigenvectors and eigenvalues of the tp and tp^ in either situation. But what does, say, M^^WWr (10.8.18) mean for FD statistics? We have wk'> = </<p|</<>;< = !</<»;< (10.8.19) and therefore the algebraic properties of the tp's must be obeyed by the eigenvalues ip': K,</y}=0. (10.8.20) The ip'p, and the ipP , are a set of totally anticommuting numbers, analogs of the totally commuting numbers of BE statistics. [Totally anticommuting
400 10. Identical Particles entities were foreshadowed by Grassmann* about 1840.] It is this total anti- commutativity, which includes (^)2 = 0 and (</>P')2 = 0 , (10.8.21) that assures the FD property: np = 0,1. Now we return to (10.8.14) and note that, for FD statistics, the expansion of the exponential terminates with the linear term, =n[i+<e_i^T<] p = E(^t'lW>e~«£;({n})T({n}|</>''> , (10.8.22) {«} and we see that <^'|{n})({n}^")= II K<) • (10-8-23) (np = l) As an example take np = 1 for modes 1,3, 7 and np = 0 otherwise, then (np=1) =(-i)2WVlVi'€T4V' = (-i)2+V?VjVlVi'^'^' = (4Vivr)w>3V"). (io-8-24) which involves an even number of sign changes. In general then, adopting a standard multiplication order: J7, and its reverse f] , we have <^'iw>=if OpT ' <{n}i^">=n wt - (io-8-25) p p where one could, for uniformity, include 1/' ^np\, which is one. The consistency of these results with the interpretation of the ip and ip^ as creation and annihilation operators can be checked. Note that for the vacuum state (^t'|0) = i, (0|tf>") = l. (10.8.26) *Hermann Giinther Grassmann (1809-1877)
10.8 Non-interacting particles 401 So, for both statistics, / ,/\nP <^'iw>=nT%W» - <^'inT^=Ho> (io-8-27) 'n0l and therefore i<"»=nT4£io> (10.8.28) as required by the creation operators significance of the tp^. Similarly and p v p (wi=(oin p V P (</>p)"p p V P T ' (10.8.30) which left vector is indeed the adjoint of the preceding right vector, inasmuch as f reverses the multiplication order. The (j>p(r) are the eigenfunctions of h = p2/(2M) 1 fh 2M V cpp(r) = Eptj>p(r) (10.8.31) Equally well, for any single-particle energy h(r,p), say h = p2/(2M) + V(r), one can introduce eigenfunctions (r| E,... \ = </>a(?*) m accordance with h(r, y v)^(r) = EaMr) (10.8.32) such that (P|e-^1e^|r') = (r,ti|r',t2) = Y,Mr) e~TiE«(tl -*2) Mr')* (10.8.33) = (r|a) = (a|r') and all goes as before. In particular, for FD statistics, T (n„=l) (10.8.34)
402 10. Identical Particles with ¢1= J(dr')^(r')Ua(r') (10.8.35) and ipa |0> = y (dr) Mr')* Mr) |0) = 0 , (10.8.36) = 0 for which the 1-particle state |la) = J (dr') ^(r')^a(r') |0> , (10.8.37) and the 2-particle state la, lb) = y(dr") WyMr") y(dr') ^(r')Va(r') |0) (10.8.38) 1„ are examples. What is the effect of ip(r) on these states? First iP(r)\la) = f (dr')iP(r)iP(r')1 Mr')\0) = 6(r - r') - iP(r'^ip(r) -> 6{r - r') = ^o(r)|0) (10.8.39) which is most reasonable: the wave function (j>a(r) represents ip(r) for a 1-particle state. Next </>(r) |1Q, 1») = y (dr") ^(r)^(r")f &(r") ^(r-r")-^")1^) Xy(dr')V(r-')Va(r')|0) > v , = Mr)\la) - J\dr")iP(r")Ub(r")ip(r)\U) = |0>^(r) = ^(r)|l0)-^a(r)|li). (10.8.40) Here are two ways of annihilating one particle, with both sides antisymmetrical in the a, b labels. Clearly this is general as illustrated by tp(r) | 10,1(,, lc) which is antisymmetrical in any pair of indices and unchanged by cyclic (even number) permutations. So
Problems 403 4>{r)\la, U, lc) = Mr)\la, lb) + Mr)\lc, la) + <pa{r)\U, lc) • (10.8.41) Now try two annihilations: 4>{r)^{r')\la,lb) = 4>{r)[<t>b{r')\la) - Mr')\U)} = (Mr)Mr') - Mr)Mr')) |0) , (10.8.42) properly antisymmetrical in a, b and in r,r'. Here we see a 2-particle wave function. Similarly for three particles, using cyclic symmetry: </>(r)</>(r')|la, h, lc) = {Mr)Mr') ~ <t>c(r)cj>b{r')) |la) + (^(r)</>a(r')-^(r)</»e(r'))|U) + {0a(r)Mr') - Mr)0a(r'))\lc} , (10.8.43) and so forth for states with 4, 5, ... particles. Problems 10-la BE statistics: Evaluate W)f\{n}), O/KaOTlW)' and for a' / a" tf(aW')|{"}>> 4>(a'Ma.")i\{n}), </>(a')V(a")|W> . 10-lb FD statistics: Same questions (only a' / a", of course). 10-2 FD statistics: Show that, for a' / a", the operators V(o') = (-!)">"</<(«) , >(a") = (-l)w>-^(a") are commutative. Here, the operator N>ai, for example, counts the number of particles in all modes after a' in some standard ordering. What is the connection with the construction given in lecture? 10-3 Verify (10.3.8) and (10.3.9). 10-4a Spin s in the rotating magnetic field: What is the probability that, in time T, the transition m = —s —> m = s happens? What is it for m = s —> m = — s! 10-4b Same set-up, for integer s. What is the probability that m = —s —> m = 0; that m = s —> m = 0; that m = 0 —> m = ±s?
404 10. Identical Particles 10-4c Again, for s = 1. Use the information available from Problems 10-4a and 10-4b and find the probability that m = 0 —> m = 0 in time T. 10-5 Concerning the spin dynamics of Section 10.5: Consider a magnetic field that changes slowly from B(t < 0) = B0ez to B(t > T) = —B0ez. The initial state of given s has m = s. Find the final state for T — 2t B(t) = B0-^r-ez and for B(t) = B0(ez cos(irt/T) + ex sm(nt/T)) . 10-6 For both statistics, Gj, =ih^2^(a',t)H^(a',t) , a' G^ = ihY.Hia',tfil>(a'\t) . a' Use the known FD commutation relations to check that G¢t does the expected things. 10-7 Operator F is of degree n in tp and ifi. Show that ^F=±[F,G^] for FD systems, implies that neven: ^^,^ = -^^=--^^, 10-8 Given the Hamiltonian H=Y, ip(a',tf(a'\h\a")iP(a",t) a',a" + \ E Ha,t)^(a"',ty(a',a"'\v\a",a'v)^(a'v,t)^(a",t), a',...,a" what are the equations of motion of a BE system, of a FD system? 10-9 Use (10.8.2) and the equal-time commutation relations (10.7.6) to find, for non-interacting particles, the commutation relations at unequal times.
11. Many-Electron Atoms 11.1 Hartree-Fock method We now apply the methods of Chapter 10 to a study of neutral atoms with Z electrons. As basic physical approximations we take into account only the electxostatic nucleus-electron interactions, and we treat the nucleus as an infinitely massive point charge (of strength Ze). In the Hamilton operator H = H™ + H^ + i|(dr)(dr')^(r)V(r')t |^7|^(r')^(*-) (11.1.1) the one-particle term H^ represents the kinetic energy of the electrons and the nucleus-electron Coulomb interaction energy, and the two-particle term H{2) is the energy of the electrostatic electron-electron interaction. They are of the general forms (10.7.9) and (10.7.12) with ft(P,?V)=--^-V2- — and v{r-r') = -^-, (11.1.2) v i ) 2mei r \r — r'\ respectively. All conceivable multi-electron states are described by eigenvectors of the total number operator with eigenvalue Z, f(dr)i/>(r?il>(r)\ ) = | )Z . (11.1.3) We are interested in the ground state, treated approximately, under the assumption that all electrons are occupying different modes - electrons are FD particles, fermions. So the approximate ground state is is)=nT^i°) (n-1-4) a where the a's label Z different modes. In general, the expectation value of H in any state is larger than the true ground-state energy Eg,
406 11. Many-Electron Atoms |2 ( \h\ ) = 53E'\(E'>---\ )1 > E^1(^---1 >l = Ee■ (n-1-5) £V So the best choice of approximate state is the one that minimizes (H). We are, of course, heading toward an application of the Rayleigh-Ritz variational method of Section 6.11. To begin, we note that a ^(0|g> = Els- la>(±)<A(r') (11.1.6) a T with the sign depending on the conventional order implicit in F] . So <g|^(r)ty(r')|g> = E^(r)*^(r') ' (1LL7) a and therefore the expectation value of the one-particle term is <g|ff (1) |g> = 5] J(dr) Mr)* (-^V2 - ^) &,(r) • (H.1.8) Similarly we have ^(r>(r)|g> = 53|g - la - lb)(±)ab [Mr')Mr) - <t>a(r')<t>b(r)] (11.1.9) a<b and then (g|</>(r)W)W)</>(r-)|g) = E^»(r)^(r') ~ ^(*-)^(*-')]* a<b x [<Pa{r)Mr') - <t>b{r)<Pa(r')} (11.1.10) or, with £a<6 -> \ J2a^b, (g|^(r)V(r')V(r')^(r)|g) = 53[^a(r)^(r')]*[^(r)^(r')-^(r)</»a(r')] (11.1.11) and so we get <g|ff(2>|g> = ™E /(dr)(dr')^(r)>6(r'r a^6 2 x |737m [^(*-)^(*-') - ^W^(r')] • (11.1.12)
11.1 Hartree-Fock method 407 Since <j>a{r)(j>b{r')-(j>b{r)(t>a{r') = 0 for a = b, the replacement Yla^b ~> Ea,j is permissible; it does not change the value of (g|ii"(2) |g). Accordingly, the energy estimate is (remember: it's an upper bound on the true ground-state energy) a J + ^/(dr)(dr') [MryMr'yv(r-r')Mr')Mr) -<j>a(ryMr'yv(r-r')cj>a(r')Mr)] (11.1.13) with h and v of (11.1.2). Of course, the single-electron wave functions are orthonormal, /< (dr)MryMr) = 8ab. (11.1.14) Now look for the minimum of E by varying the <j)*a and the <j>a: ^ = ^ /(dr) 6Mr)* (hcj>a(r) + £ |(dr') [&(r')*i>(r - r')^(r')^0(r) -^(r')*v(r-r')^a(p')^(*-)]) + {its complex conjugate} , (11.1.15) subject to the constraint (11.1.14), that is '(dr) [Hlh + KHb] =0, (11.1.16) from which we conclude that /< h + E [(dr')Mr'yv(r ~ r')<t>b{r') &(r) b J - Y, /(dr') &(r')Mr - r')<j>a{r') Mr) = £ £a,bMr) (11.1.17) v „ ' exchange terms with energy parameters £aj, to be determined self-consistently. These are the Hartree*-Fock equations. Things are simpler if, as an approximation, we omit the so-called exchange terms. Note that, because they involve <j>b{r')*<j>a(r') the a and b wave functions must overlap which limits their contribution, particularly for Z > 1. "Douglas Rayner Hartree (1897-1958)
408 11. Many-Electron Atoms Then the <f>a can be chosen as the eigenfunctions of a single effective potential V(r): (~2^V'2 + V) K = £a<f>a (11.1.18) with these are called Hartree equations. Clearly, here we meet the average electron density «W=El^(r)|2' [(dr)n(r)=Y.l = Z'> (11-1-20) b J b the latter statement repeats (11.1.3) in the present context. The total energy is 2 7e2 r h2 ^2 Ze 4 Adr)(dr.)=fe>2£> , (11.L21) or 2n(r)n(r') £ = J] j(^Wa (-^- V2 + y^a -\ |(dr)(dr')e2 r — r' (11.1.22) The first term sums the independent-particle energies £a and counts the interaction energy twice, and the second, negative, term subtracts the doubly counted interaction energy once. Here we have looked for the best choice of the <j>a. Another procedure is to accept that the <j>a are the wave functions of a common effective potential and look for the best choice of V. This can be done including exchange, but we omit it here. So we start with the energy expression (11.1.21) and choose the (j>a$ such that (11.1.18) holds, which gives E = E^« - /(dr) (V + ~~) n(r) + y J(Ar)(dr' n(r)n(r') (11.1.23)
11.1 Hartree-Fock method 409 To vary V we recall that so that 6£a = (SV) = J(dr)6V\cj>a\ (11.1.24) 5Y,£* = J(dr)SV Y, l^l2 = f(dr)6Vn . (11.1.25) Then SE= [(dr)SVn- f (dr) + [(dr)Sne2 f (dr') Ze2 6Vn+ [V+ )6n r n (r') or SE = |(dr) 5n T, Z(? 2 -V +ez r /(dr') »(r') (11.1.26) (11.1.27) where we recognize that E is actually a functional of the electron density n. In view of the constraint f(dr)6n = 0, (11.1.28) 6E = 0 then implies that [• • •] in (11.1.27) is constant. Now, since V(r) -> V(r) + Vq requires £a —> £a + Vq for consistency and does not lead to a change of the energy (11.1.23), we can agree on the natural convention that V(r) —> 0 as r —> oo, and then this constant vanishes. Therefore, the best choice for V is such that V{r) = -— + e2 j(dr')-^- , (11.1.29) r J \r — r'\ which is (11.1.19), indeed. The set of equations (11.1.18), (11.1.20), and (11.1.29) must be solved jointly, and iterative methods suggest themselves for this purpose. Before continuing notice the implication of Poisson's equation: -V2 ( V + ^-) = 47re2n (11.1.30) namely that
410 11. Many-Electron Atoms Ze2 - /"(dr) (.V + ^~\ n + ~ /"(dr) n(r) f (dr'), n(r') \r — r" ^V + Ze2/r Ze2\ 1 , _2. / Ze2 v y + 47re'2 Ze2 (11.1.31) which is just the negative electrostatic energy of the electrons. Indeed *=E&-£?/<*•> V [V + Ze1 (11.1.32) which is a functional of the effective potential V, gives back the condition to determine V: = /"(dr) 5Vn- —^ f (dr) VSV ■ V (V + = J(dr)t Ze^ r A-ne2 in the form of Poisson's equation. n+^V2(V+^ r (11.1.33) 11.2 Semiclassical treatment: Thomas-Fermi model Hartree's program involves making an initial choice ofV(r), solving Schrodin- ger's equation to find the wave functions, then constructing the electron density, to find a new V(r), and so on. Noting that the effective potential is spherically symmetric, V(r) = V(r), since nothing distinguishes one spatial direction from the others, we can give an approximate version of this by using the WKB approximation of Problem 8-9a nh Jri dr\2mel[£~V(r)] V (I + I) nr + 1 2 ' (11.2.1) in which the integration is over the classically allowed region where the square root has a positive argument. Individual energies are labeled £„r+i j+i and we get the Z electrons by filling up the energy states from the bottom, up to the energy —£: z = 5>(-£„-0 (11.2.2)
11.2 Semiclassical treatment: Thomas-Fermi model 411 with oo oo ^- = 2^1(21 + 1)- , (11.2.3) a nr=0 1=0 where 2/+1 is the multiplicity of orbital angular momentum /, and the factor of 2 is the spin multiplicity. Accordingly, it is convenient to write E as r E = Y,(£a + CM-£a-C)-CZ-^J(dr) v(V + „2m2 (11.2.4) where Ei — (Z is the sum term of (11.1.32). The purpose of the step function ri(—£a — 0 is> °f course, to select the occupied states, the ones with £a < — ¢, the actual value of £ being determined by the count of states (11.2.2). Inasmuch as we do not know V(r), yet, we can hardly go at this directly, that is: beginning with V, find the £Ur+i ;+i and work out E. As a step toward an approximate way of finding V, we replace these summations by equivalent integrations. First recall that J_ y eim^-^)=<$(0-^) for -7r<<M'<7r, (11.2.5) m™ —oo which is generalized to (<fi — (p* —» 2ttx, — oo < x < oo) oo oo £ e2*im*= ]T 5(x-p), (11.2.6) both sides being periodic in x with period 1. We use two examples of this so-called Poisson sum formula oo oo £ e2,ri*(A-i)= ^(A-I-J), (11.2.7) &=--00 lz^ — OQ OO oo £ ^-1)= Y, &{y-\-nT) . (11.2.8) j = ~-00 rir^ — OO Then we can write nr,l=0 /.oo °° = / di/dA/(i/,A) £ <J(i/-|-nr)<J(A-|-/) ^° n,.,J=-oo = j" Avd\f(u,\) V e27ri^-2) + fc(A~5)] . (11.2.9) j,k= — oo
412 11. Many-Electron Atoms In particular, for the contribution E\ in (11.2.4), oo = 4 /"di/ dA X(£„,x + Ov(-£v,x - C) E e2?d Ci (" ~ 5) + * (A ~ s)] . Jo ■ j~ (11.2.10) All the terms here are integrals of oscillatory functions, except j = k — 0. We pick this out and for historical reasons call this highly semiclassical approximation the Thomas*-Fermi (TF) approximation, />00 E?F = 4 / di/ AdA (£„,A + CM-^,a - 0 (11.2.11) ./0 where, this is (11.2.1), n Jri "dr^^-V)-^. (11.2.12) In essence, the sum over discrete quantum numbers nr,l in (11.2.4) is approximated by an integral over continuous quantum numbers v, A. As written, (11.2.12) gives an implicit definition of £„a- It is more convenient to read it as function v(£, A), v(f,A) = ijr"dr^(f-V)-^, (11.2.13) and to switch the integration variable from v to £. This is done with the aid of a partial integration. Note that di/ {£ + Qt,(-£ - C) = d[(£ + Qr,(-£ - ¢)1/] - vA[(£ + Qv(-£ - ¢)] (11.2.14) vanishes at both integration limits and therefore, with d[(£ + Qrj(—£ - Q] = d£rj(-£ — Q, -C /•oo [ — (, / dv(£ + CM-£-Q = - A£u{£,\), (11.2.15) ./0 ^-00 which uses that v = 0 obtains for £ —> — 00, so that -C /.00 /.-i, EjF =-4 dAA / d£i/(£,A). (11.2.16) Jo ./-00 'Llewellyn Hilleth Thomas (1903-1992)
11,2 Semiclassical treatment: Thomas-Fermi model 413 In view of \v{£,\)=\M~J dr ~ — ( — [£- V(r)] -- 3 2mei \ ft2 5 5 1 dX d£ 4?r2 / •(dr; 2\ li2 /2mel 47rdrr2 ( -^ ) ^- ( ^ [£ - V(r)] - 15 J 2mei V h2 ^2X2 (11.2.17) we then get Eib = ~ 157T2 ^j^)(2rn4~V(r)-C]f. (11.2.18) Recalling (11.2.2) and the definition of Ex in (11.2.4), we have BE, in general, and in the particular TF context Z=3^/(dr)[2mel(-y"C)]S- Next we look at variations of V as in (11.1.25), here: /"(dr) SVn=SvYJ£a= Sv (E1 - (Z) = SVE1 £ constant since the induced changes of ( do not contribute, 6C(E1-(Z)=0. So, the TF approximation for the electron density is n(r)==3i[2mei(""y(r)""c)]: which is consistent with (11.2.20), (dr)n(r) = Z . /< Now Poisson's equation (11.1.30) says that -V2 V + Ze< 1 = ^e2^2^[2m4~v~0Y (11.2.19) (11.2.20) (11.2.21) (11.2.22) (11.2.23) (11.2.24) (11.2.25) a non-linear differential equation to find V(r), subject to the boundary conditions
414 11. Many-Electron Atoms rV -> —Ze for r -> 0 and V -> 0 for r -> oo . We introduce an auxiliary function / by -v-c -f with / -> 1 as r -> 0 (11.2.26) (11.2.27) and note [recall the Laplacian differential operator in spherical coordinates; cf. (7.5.6)] -V-C- 6e- r W£„-.,) = ££, Then (11.2.25) appears as d2/ 4?r 2me]Ze 2\ 2 /* 3n \ r dr2 Z 3ir2hz [a0 — H2/(meie2) is Bohr's radius once more] and with r = aZ~~zx this reads if d2f(x) _ [f(x)Y dx2 x? with /(0) = 1 1 /"?7T \ 3 a=-(—J a0=0.88534a0 (11.2.28) (11.2.29) (11.2.30) (11.2.31) (11.2.32) The important point of this TF differential equation is that it is universal, independent of Z\ its solution is the TF function fix). The boundary conditions for large x need some discussion (this is for neutral atoms). At the edge of the atom, the electron density must drop to zero, so (11.2.23) requires y + C = o and the total potential must vanish (no net charge): V = 0 and therefore ( = 0 . Also, according to Gauss's theorem, Z = 1 47r2e2 4irr _d_ dr V + Ze2 edge rf_dV_ e2 dr edge (11.2.33) (11.2.34) (11.2.35)
11.2 Semiclassical treatment: Thomas-Fermi model 415 so av dr = 0 edge All this says that we must have, 'at the edge', / = 0 and /'=^=0. da; (11.2.36) (11.2.37) For neutral atoms then, the edge is at x = oo. We begin at x = 0, where /(0) = 1, start with some downward slope, /'(0) < 0 and proceed according to /" = j \fjjx > 0, which says that the function is always curving upward. There are three qualitative possibilities according as —/'(0) >,=,< B where B is the correct value of —/'(0). The plot / 1 • 0.5- 0- V 2 6 ""-.. ,-f'(0)<B -/'(0) = B 10 r/'(0) > B shows that /(0) = 0 occurs at finite x, where —/' / 0, if —/(0) > B; and that /' = 0 occurs at finite x, where / / 0, if —/(0) < B. Of course, the value of B, the initial slope of the TF function, is found by numerical integration. But we want to a get a feeling for it, as one can by seeing its connection with the all important energy, which is ETF =- 1 1 157T2 me\H3 -^/(dr) (dr)[2mei(-V -C)]f- CZ Ze2xl2 VIV+ — (11.2.38) in the TF approximation. With ( = 0 and V(r) = ~(Ze2/r)f(Zir/a) = — (Z3e2/a)f(x)/x this is
416 11. Many-Electron Atoms Zie2 ETF = -: 2/0 dx(Tx) +5/0 d^"2/2 (11.2.39) since 7 47re2 j(dr) v y + Ze2 ./0 f Jo dx dxx 2/d/-1 \dx a; d/V d (/-1)2 da; / da; a; da.fyV-^-1)2 da; s=0 = 0 Clearly, for the correct /, 5 = ^+^ dxf'2 + ^l dxx~ifi=-ETFi 5 in / a must be stationary. Indeed />oo 5S= dx Jo r^f+x^fhf =fsf\7 (11.2.40) (11.2.41) J dXdx:f'6f + J dx-J/ [-/" + a:-i/5J=0, (11.2.42) when /" = j\[JJx and /(0) = 1, so that /(00) = 0, 5f = 0 at x = 0 and a; = 00. We learn more by considering a scale change: f(x) —> /(Ax), 5(A) = A- / dxf'2 + X~^ dxx~ifi 1 Jo 3 Jo (11.2.43) where we must have ^S<A» 1 f°° 1 2 Z*00 = 0 or -/ da;/'2---/ da;a;^/i=0, (11.2.44) A=l ^ i0 -^ 0 ,/0 so />0O rt />0O / da;/'2 = -/ da;a;~2/t for correct / . (11.2.45) /0 5 io
11.2 Semiclassical treatment: Thomas-Fermi model 417 Also />0O />0O J />oo J dxf = J dar—(//')-y ds//" />00 = -/(0) - / dxx~if% for correct / . (11.2.46) For the correct / then, / d^/'2 = ./0 and This means that a 2 Z*00 -£ , / dza;-*/* ' ./0 5=(i + |)B = |B. -ETF=Z$--B a 7 (11.2.47) (11.2.48) (11.2.49) where jB is the stationary value of S. Before using the latter connection to get an approximate value for B let's see another way of understanding this result. We return to (11.2.39) and, inasmuch as we are, in the TF limit, no longer restricted to integer values of Z, consider d/dZ. Because the functional (11.2.39) is stationary, induced changes of V and £ do not contribute and only the explicit dependence on Z counts: _y(dr)v^ + —)vr-c OZ 4?r r Ze*\ 21 or with and C = 0 = i/(drHy + rrJv^"c (1L2-50) V2- = -4n6(r) (11.2.51) ^ = -(^ + ^)(0). (11.2.52) There is a simple way of understanding this: OE/dZ is the change in E produced by placing an additional unit positive charge at the nucleus. That is the negative of the change in E produced by an additional negative unit charge at the nucleus. But the latter is just the interaction energy of an electron at r = 0 with the rest of the electrons, which is (V + Ze2/r) (0).
418 11. Many-Electron Atoms Now (C = 0) V + Ze2 r->o a x = zie-B from which follows dETF „4 e2 dZ = -Z*—B a (11.2.53) (11.2.54) and then (11.2.49). Now let's return to the calculation of B according to (11.2.48) and (11.2.41). In Problem 11-5 we learn that the stationary value of S is a minimum, so that jB < S and the equal sign holds only if f(x) solves the TF equation. All trial functions are, of course, subject to the boundary conditions /(0) = 1 and /(oo) = 0. As a simple example we consider /(*) = 1 (1 + Az) so that, with (11.2.43), (11.2.55) ;J3<ASi + \~*S2 (11.2.56) where * = 2* 7*00 L (T dx 1 a2 + x) 2a+2 2 2a + l (11.2.57) and [substitute x = 1/t — 1 and recognize Euler's beta function integral (9.5.54)] S2 = CO cy ;V2 2/" / dttia- Jo j(i-*r 5V (|a-l)! (11.2.58) Upon differentiating (11.2.56) we find that the optimal choice for A is A (|S2/Si)* which gives the estimate ?*<§(2SiSS)* (11.2.59) or **KF' V2a + 1 (|a-l)! (11.2.60)
11.2 Semiclassical treatment: Thomas-Fermi model 419 For a = 1,|,|,2 this gives B < 1.5960,1.5908,1.5910,1.5940, respectively. Certainly, then B < 1.5908. Numerical integration gives 5 = 1.588071 (11.2.61) for the (negative) initial slope of the TF function. That is: to three significant figures, B = 1.59 is correct, and the variational estimate for a = | is less than 0.2% in excess. Now to the binding energy: -ETF = ~BZi- = %^-Bzl- = 0.7687Zi- , (11.2.62) 7 a 7 a/ao a® ao or measuring the energy as a multiple of the atomic energy unit e2/ao, —ETF 1 = 1.537 Zi (atomic units) . (11.2.63) 2Z The division by \Z2 is convenient because values then range, over the periodic table, from 1 to about 6. The points (or rather little circles) in 'o 25 50 75 100 125 Z
420 11. Many-Electron Atoms report binding energies (in atomic units) as calculated by Hartree-Fock methods, and in agreement with experiment where that is available (up to about Z = 20). This collection of points cries out for a smooth connecting curve, and as a first step toward this goal we have superimposed on the 'experimental' points the smooth curve that corresponds to the TF energy (11.2.63). The general pattern is right, but one may get the impression that the two curves are diverging for large Z; in fact they are not, and more important, the fractional error is decreasing: It begins at about 50% for Z = 1, and has dropped to about 15% for Z = 100; it is about 25% for Z = 27, Zll'A = 3. 11.3 Correction for strongly bound electrons Can we improve ETF without getting involved with the oscillating terms of E\ in (11.2.10) [because there is no sign of oscillations in the data just plotted]? For that it helps to understand where the TF approximation must break down. The clue is in the local nature of nTF(r) = 3i^^2meiy(r)]l (11-3-1} [( = 0 in (11.2.23)]. The wave functions that actually give n(r) certainly involve V(r) within a wavelength or so of the given point. This would be unimportant if the fractional change of V in a wavelength is small: dr C IVI with A = - = -==^- (11.3.2) [the virial theorem (8.3.10) states that kinetic energy j?2/(2mei) and potential energy V are of the same order of magnitude] or « ^ = -/½ • (11.3.3) Observe that at typical TF distances, r and V are of the order Z~3ao and Zse2/ao, respectively, and this criterion becomes — (z%—) 2 « (a0e2y* or Z* » 1 . (11.3.4) Here is the essential parameter of the TF approximation. Now suppose that r <S Z~~5a0; then V = —Ze2/r and for TF validity we must have 11 o-o , ~7=T<~r= or r»^, (11.3.5) that is: r must be much larger than the radius of the first Bohr orbit for the nucleus with charge Ze. So
11.3 Correction for strongly bound electrons 421 — <r < —r Z Zi (11.3.6) which must be considered satisfied for r as small as Z~2/3ao (assuming Z* > 1, of course) or energies of the order Ze2/(Z~2/3a0) = Z$e2/a0 but certainly fails for distances of the order Z~la^ or (binding) energies of the order Z2e2/ao. Strongly bound electrons, those for which £ ^> Zie2/a0 are a source of error for the TF approximation. To correct this we regard Ei as a function of ( and write (( = 0 eventually) £i(0=[£i(O-£i(Cs)]+£i(Cs) (11.3.7) with £s of the order Z^e2 /ao, thereby separating the contribution from the strongly bound electrons, Ei((s), for which the TF treatment is in error, from the rest, -Ej(£) - Ei((s), whose TF value can be trusted. Since r C a0/Z~s for the strongly bound electrons, they move in the potential V = -- Ze2 (1-*;*') Ze2 „e2 * + B — Z3 , r a (11.3.8) which is essentially the Coulomb potential of the nuclear charge, and therefore the calculation of -Ei(Cs) can be done exactly. Of course, for consistency - contributions of oscillatory terms are omitted from Ex (Q - we want only the non-oscillatory part of Ei((s). First, we find the TF value of Ex (Q - Ex (Q as [£i(( = 0)-£i((s)]TF 1 ^/(dr)[2mel(~y)]§ + 157r2 mei 1 =»/*"> 157T2 meih3 The latter integral covers r < rs with 2 17 1 2mei [-(s-B — Zi + , ^e2 4 Ze2 (s+B-Z* = a rs (11.3.9) (11.3.10) and has the value 4?r 1 157T2 me\hz (2melZe2y f Jo dr r2 [ r rs 32 Z2e2 fmele 157T ao K2 -a0 Zrs 2a0 f Jo dtr*(i-ty- = 1 2Z 2^,2 a0 Zrs 2a0 2Z 2JI a0 = 5tt/16 (11.3.11)
422 11. Many-Electron Atoms where we introduce an effective continuous quantum number ns rs = —nt or (S + B — Z> = 5. Z a ZaoUg (11.3.12) Then, summing the Bohr energies with multiplicities 2n2, £q(£s) is 2a0n ^,^ v^„?/ Z2e2 „e2 4 \ /1 1 Z2e2 / 1 , 1 \ "n I 2* ~T ~^ 1 2 2 LnsJ a0 £ -i + (11.3.13) where [«sj is the largest integer that does not exceed ns. So, the correction for the strongly bound electrons adds 2 „2 a0 L«sJ + E -1 (11.3.14) to the energy, but this still contains oscillatory contributions which we must identify and then discard. The challenge here is to separate the summations into smooth and oscillatory parts. Begin with the Poisson sum formula (11.2.6) in the form E 8(x — n) = 1 + 2 2_, cos(2irmx) (11.3.15) and integrate x from e to ns with 0 < e < 1, »ns 00 LwsJ / da; E S(x ~ n) = £ l = LnJ = ns + - ]T sin(27rmns) 1 m sin(27rme) m (11.3.16) The left-hand side does not depend on e, and so the e terms on the right-hand side must be constant. Putting e = 5 gives the value stated. But, wouldn't we get zero for e = +0? No; it is essential to realize that the summation is not
11.3 Correction for strongly bound electrons 423 zero for infinitesimal positive e. Indeed, it has contributions for sufficiently large m that an integration can be used: I f H2-(+°» . I f ^sin(2™(+0)) = 1 f m= 1 dt . 1 7sm*=2- (11.3.17) So |_nsJ = tis — | smooth 1 ^-^ s'm(2irmns) oscillatory (11.3.18) »ns 0 12 3 4 5 identifies the smooth and oscillatory parts of \ns\. More generally, we multiply (11.3.15) by xk, k = 1,2,... and use successive partial integrations to get LnsJ „Ms oo ^2 nk = / dxxk ^2 S(x - n) rk+l n~~ oo k AI'Kmx , '- + 2Re"T(-l)A^f-^ V * ."",, . fc + 1 ^-^v y fc! ^-^ (i27rm) !+*-•» ' " m=lv ' J j=0 With 2E COs(7Tto) E (-i)r ^ (2?rm)2 2?r2 ^ m2 m=l v ' m=l this gives for k = 1 1 "24 LnsJ oo E« = o^2-^+2E sin(27rmns) cos(27rmns) ns—^ i + 2nm (27rm)2 (11.3.19) (11.3.20) (11.3.21) smooth and for k = 2 [ns\ J2 n2 = -ns3 +2 ^ n=l m=l smooth oscillatory , sin(27rmns) cos(27rmns) sin(27rmns) 27rm + 2ns (2?rm)2 (2?rm)3 oscillatory (11.3.22)
424 11. Many-Electron Atoms Accordingly, the smooth part of (11.3.14) is 2 „2 Z2e a0 ;ns + 3"s ^z2 a0 (11.3.23) The net result is independent of ns, as it should be, and tells us that the correction for strongly bound electrons adds |Z2 atomic units to the energy. Therefore (11.2.63) is changed to -E \z2 1.537Zs -1 The second curve in (atomic units) . (11.3.24) -E \2? shows the remarkable improvement that these corrections for the innermost electrons bring about.
11.4 Quantum corrections and exchange energy 425 11.4 Quantum corrections and exchange energy How about the remaining discrepancies? Where to look? Problem 11-3 shows that TF is a classical limit, with ETF given by a classical phase space integral (including the spin multiplicity of 2), E™(O = 2J^0j^(H(rtp) + C)v(-H(r,p)-C) , (11-4.1) the leading semiclassical approximation to E^tiiiH + CH-H-O] (11-4-2) with the single-particle Hamilton operator E(r,p)=4+v(r)' (ii'43) That suggests looking at the first quantum corrections to EjF. All we need is the three-dimensional generalization of the one-dimensional result of Problem 6-17b, which requires that we recognize as the essential ingredient. So, the leading quantum correction to EjF is hn(-h) h = H(r,p) + C E*» --2 f ldr) (dp) -^- V2V —hn(-h) 1 ~ J (2nh)3 24me. dh* V[ ' = 2/MM_^v2yif-^-y(r)-() (11.4.5) ./ (27rft)3 24mei V 2mei w V v ; or, after evaluating the momentum integral, Er = 24^h /(dr) VV \-2m^-V - 0] h ■ (11-4-6) Notice the appearance of V2y, not V2(y + Ze2/r) where V2V{r) = A-KZe25(r) - 47re2n(r) . (11.4.7) But we must not forget the special treatment of strongly bound electrons. What we want is [£i(0-£i(Cs)]qm (11-4.8) where E\{C,S) is computed from the potential Vs — —Ze2/r + (B/a)Zs such that V2T4 = 4:nZe25(r). So, the necessary correction for the strongly bound
426 11. Many-Electron Atoms electrons removes the virtual 6(r) contribution, and we get the quantum correction A«mE = dk /(dr) v' (y + T~) \-2m^-v -0V- (n-4-9) Since this is a small correction to the TF energy, we evaluate it consistently for the TF potential that obeys the differential equation (11.2.25), A^E = - J-^ /(dr) [2mel(-y - ¢)]2 ^pr/(dr-)[-V(r)-(]2. (11.4.10) 2 "9^ With the TF parameterization r — Z ^ax, V + ( = —Zs (e2/a)f(x)/x this becomes ^qm-B — - —2 -Zi— r dx[f(x)]2 . (11.4.11) ^0 «0 Jo MfP 4 For the neutral-atom TF function f(x), the integral is 0.615 435, and .Se2 Z\qmJB = -0.04907^3 — . (11.4.12) a0 Next we look at exchange, for which purpose we return to (11.1.13), pick out the exchange term, and make the spin summation explicit: £ex = ~\ E /(dr)(dr') E Mr, *)*&(»", a')*v(r - r')^(r', a')Ur, a) . a,b <r,<r' (11.4.13) Since, for example, 2_,(t>a{i',a)*(j)b(r\<j) = 0 if modes a,b have different spin (11.4.14) a only the a, b pairs of the same spin (net factor of 2) contribute, so that Eex = -^1^)^)1:.^(^)^(^^(^)^^) (n.4.15) with the Coulomb interaction potential v(r — r') of (11.1.2). In the semiclas- sical TF limit the wave functions are those of (10.8.12),
11.4 Quantum corrections and exchange energy 427 Mr) = J^^eJP-r with \p\ < y/2ma(-V - Q = P (11.4.16) and we have , f , (dp) (dp') e~i(P-P') -(r-r') Eex = -e2 / (dr)(dr')7^¥?^4 = r. . (11.4.17) y (2nn)J (^irh)6 \r — r'\ We first integrate over r', where only the H/P vicinity of r contributes, and meet the Fourier transform of the Coulomb potential, |(dr')- -jr(p-p')-(r-r') (P ~p) J \r — r'\ \ J (p-p1)2 J K ' V / |r-r'| v v ' = 47T(5(r' — r 4irH2 {p-p'f ' Then (11.4.17) becomes (11.4.18) F - 4?re f(dr) f (dp)(dp,) ni4 1Q^ E---(W¥j{dr)J Jp-^W (11A19) where |p|, |p'| < P(r) = i/2mei(—V(r) — Q specifies the range covered by the two momentum integrations, so that [see Problem 11-11] _ 4?r e ±Jp.v — (2.)4^/^)^(-^)-0]2 i^l/(dr)hy(r)_c]2 (1L420) and, recalling (11.4.10), we have Therefore -Eex + AqmE = 9 11 „ „ 11 /3ttV (11.4.21) ~3 Z"00 9 s P2 / ds [/(^)]2 Zf -, Jo ao = 0.2699 (11.4.22)
428 11. Many-Electron Atoms and together with the terms of (11.3.24) we arrive at rp —*mooth = 1.537Zs - 1 + 0.540Z~i (atomic units) . (11.4.23) 2Z Now we have a third curve in the comparison with the 'experimental' data, 0I 1 1 1 u_ J 0 25 50 75 100 125 Z a curve that runs right through the little circles! (Except that it stops 7.7% above Z = 1.) That the third curve be visible at all, two thirds of the circles are removed, only those for Z = 1,2,3,6,9,... ,120 are displayed, with a larger diameter than in the figures on pages 419 and 424. In three steps we've reached the goal of understanding the smooth Z dependence of the Hartree-Fock energies, first seen in the figure on page 419. 11.5 Energy oscillations Do we stop with this success? Not quite. We have got to see the contributions of oscillatory terms. When should they show up? They are there to enforce the
11.5 Energy oscillations 429 integer nature of nr and I, or if you like, the discreteness of the individual electron. In contrast, the TF treatment regards the electrons as infinitely divisible, as if their total number were continuous. The implied error must be roughly the ratio of the actual unit of charge, e, relative to the total charge (for neutral atoms) Ze: 1/Z. Relative to the TF energy (11.2.62), ET* oc Z7/3, this is Zs, one order below where we are. Can we anticipate anything about the structure of these oscillatory terms? Recall (11.2.10), E,=AJdvA\\ {Sv,x + CM-^,a - OE e2?d [j{V~ *} + k(X~ ^ • (11.5.1) j,k We might expect that a dominant contribution comes from regions where A approaches its maximum value (= A), a region of constructive interference. Now, in (11.2.12), * = !/*: it J r dr /2mP h2 r2{£ -V)-\2 , we have v > 0 and £ < -Q , (11.5.2) (11.5.3) so that the largest A occurs for the smallest ^, that is: v ~ 0, and the largest £, that is: £ = 0 for neutral atoms (( = 0). In fact A2 = Max 2mei -w1 '■[-V(r)] ,2a ZV3—Max{xf(x)} a0 z1/a (—) yxma.^f(xma.x) (11.5.4) From the numerically known f(x) it turns out that xf(x) is maximum at a^max = 2.104, and from the value of / there, /(xmax) = 0.2312, one gets A = 0.928Z3 . This suggests looking at — Eos i~E) ~ ("-^smooth) Zf (11.5.5) (11.5.6) as a function of Z s. That is done in this figure (atomic energy units used once more):
430 11. Many-Electron Atoms -En Zi 0.08 0.04 0.00 0.04 - - - " e/ <*b A° ° '* ' / V ° /\ / 1 "J \ / 1*7 1 rA ';^l'S : / / \ / jvi ) p*/^i - •'A A " / - 3 zi 4 where Hartree-Fock data is marked by little circles, and the larger circles display real experimental data (including a smooth relativistic contribution). There are the oscillations! And their period is indeed close to Zs. The result of calculation, with leading terms only, is the continuous curve. It refers only to oscillatory terms - a smooth term of order Zs in the energy is not included [nor is the relativistic contribution whose dominant term is ~Z2(Za)2, a = 1/137 being Sommerfeld's fine structure constant]. With that in mind, the agreement in magnitude and structure is impressive. Problems 11-la For a variational estimate of the ground-state energy of two-electron atoms, Hamiltonian: 2 2 2mei 3=1 i=i ■r2\ Sffkin. : Hn-i iHe. consider states with zero total angular momentum (antisymmetric spin state) and symmetric spatial wave functions of the form ^(ri >ri\ _ q fe-Z(ar[ + (3r2)/a0 + e-Z(f3r[ + ar'2)/a0\ with r[ = \r[\, r'2 — \r'^\ and a,/3 > 0. Determine the (positive) normalization constant C. Then verify that the expectation values of the three pieces of H are given by
Problems 431 (Hi kin. '*- = U2 fflo 2 a2+P2 , 0 „/ 2 (apy + a + /?, {Hn.e)/-=-Z2(a + P) I fflo {He-e) /v0 = lz(^h — —+— 1 + Why could you have anticipated that a«j9 changes nothing? 11-lb Now, for simplicity, put a = /3 and find the a value for which (H) is minimal. Is this estimate good enough to explain the existence of the stable negative hydrogen ion H-? 11-lc To facilitate the optimization of both a and /?, introduce new parameters x, y in accordance with \{a + P) y = Va/3 You should find that (H) is a quadratic function of y, so that - for any given x > 1 (why this restriction?) - the optimal y can be found immediately. Then optimize x. Is the estimate now good enough? 11-ld Compare both estimates with experimental values: -£ = 0.52776, 2.9038, 7.2804, 13.657,22.036 xe2/a0 for Z = 1,...,5. ll-2a Non-interacting electrons filling v closed Bohr shells. Energy: Z2 2 ' 2 ) = ~^V (atomic energy units). E = V In , ^ \ 2n Show that, for N electrons, n=i Verify that, for sufficiently large N,
432 11. Many-Electron Atoms and -£ = Z2[(§A0M + IL(iA^+...] = (§)*S*-^ + &(f)*S*+-- t N = Z ll-2b The asymptotic expansion of v as a function of N obviously holds only for large N. Nevertheless, put N = 2 (!) and compare the numerical value produced by the first three terms with the actual value of v. Similarly for JV = 10. 11-3 The classical limit of tr {f(q,p)}, n degrees of freedom, is / »/fe p) J (27Tft)" J{q,P) Consider / = (H(r,p) + ()r](-H(r,p) - £) so that tr {/} = X>« + Ov(-£a - 0 = ^1 ■ a Then (n — 3 plus spin--) £i,cias, =2/^2^ (H(r,p) + C)v(-H(r,p)-C) ■ Verify, for H = p2/(2mel) + V(r), that E1M = /(dr) (-JL) -L, [W-V - 0] § = E7 - ll-4a Show that a solution of the TF differential equation (it is the asymptotic form for x ^> 1) is (Ax)' Then demonstrate that if f(x) is any solution of the diffei'ential equation, so also is X3f(Xx). [Of course, the boundary condition /(0) = 1 cannot be maintained.!
Problems 433 ll-4b Prove that 5f, a small difference between two solutions, obeys d2 stt ^ 3 ff(xV l Show that this 6f equation is satisfied by 3/ + #gj. How could you have anticipated that? 11-5 With the convention that /2 = 0 for / < 0 show that If* i! = fi + yhh -h)+ f'df (^ - /)^/¾. J h Use this to prove that the stationary value of the / functional S of (11.2.41) is a global minimum. ll-6a The variational estimate (11.2.60) is an upper bound on B, the initial slope of the TF function. To get a lower bound, take as the looked-for function. Then show that S=--y dxx^[g,(x)y-g(0)--j dx[g(x)]2 equals |i? for the correct g(x); that S is stationary at the correct g, if trial functions are restricted by g'(x) > 0 and g(x —> co) = 0; and that the stationary value is a global maximum. ll-6b Now try, for example, g(x) = -Ai e"^^ with A1?2 > 0 , and optimize Ai and A2- Compare the resulting lower bound on B with the actual value (11.2.61). 11-7 Use the asymptotic form given in Problem ll-4a to test for the failure of the TF approximation at large r. ll-8a Positive TF ions: N < Z. Modify the arguments presented for the neutral atom to show that f(x) still obeys the TF differential equation but with the following boundary conditions at 7*0 = aZ^sx0, the edge of the atom: f(xo)=0, ^0^^°) = l ~ ~z ■
434 11. Many-Electron Atoms Of course, /(0) = 1, as for N = Z. What can you say qualitatively about —/'(0) in relation to B = 1.588, the value for neutral atoms? ll-8b Show that N = f(dr) n = Z f ° dx x* [f(xj\ Then use the differential equation and boundary conditions to prove that the last form is indeed equal to N. ll-9a TF atoms: energy and scaling. Consider E = E\ - C,N + E-2 with 1 1 and ^ = -1^^^/^^,(-^-0] and recall the physical boundary condition lim rV(r) = -Ze2 . Use the scaling transformation V{r) -> XV(Xr) , ( -> AC, Z -> Z to show that -i£i-CiV + £2 =0 or £:=1¾. Then use the different transformation V(r) -> A4y(Ar) , £ -> A4( , Z -> A3Z to prove that 7(E1 + E2) - 4(N = 3Z-^E(Z, N) or 3r„ d E=l[ZQZE(Z>N)-CN] ll-9b Prove that d dNE(Z,N) = -C,
Problems 435 and so arrive at (z^ + nm)e(~z>n) = Ie(~z>n)- First consider neutral atoms, N = Z, and use the above to prove again that E(Z,Z) <x Zs. Then demonstrate that, in general, E(Z,N) = zie What can you say about the function e(N/Z) forN = Z and for N <C Z1 11-10 The TF density is proportional to r_5 for very small r. With the method that is used in Section 11.3 derive n(r = 0), the electron density at the site of the nucleus, with corrections for the strongly bound electrons taken into account. You will need the density of a closed Bohr shell at r = 0. How do you get this from (8.3.30)? 11-11 In the following integral, p and p' range over the interior of a sphere with radius P. Prove that /(dp) (dp'), l /|2=(27T)2P4. J \P—P\
12. Electromagnetic Radiation 12.1 Lagrangian, modes, equations of motion We now turn to another application of the methods of Chapter 10: the electromagnetic field interacting with charged particles. Aiming at applications in atomic physics, where particle dynamics is predominantly non-relativistic, we continue to use the non-relativistic description of the particles developed in Chapters 4-11. The Lagrangian consists of a sum over the particles (masses ma, charges ea, positions ra, momenta pa, velocities va) and an integral over the Lagrangian density of the field: * = £ P v I + ~mv2 - e$(r, t) + -v ■ Air, t) at I 2 c + /<*>£ -E- 1-—A + V®) -B-VxA+~(B2-E2) cdt (12.1.1) (symmetrized products understood). For a single particle, we have already seen the particle part in Section 8.6. We check the field part from its consequences: varying the electric field E and the vector potential A, which are paired in the time-derivative term, gives two equations of motion, SE: E = --^-A-V®, c at 8A: V xB-- — E= — > eava5(r-ra) = — j cot c t—' c (12.1.2) where j is the electric current density of the moving charges; whereas varying the magnetic field B and the scalar potential ¢, which have no time- derivative terms in the Lagrangian, gives two constraints, 5B : <5<J>: B = V xA, V ■ E = 4?r Y^ eaS(r - ra) = 4irp , (12.1.3) where p is the electric charge density.
438 12. Electromagnetic Radiation Using these constraints, we can eliminate the fields B, <t> that do not obey equations of motion. For B, this is easy, accept B = V x A as a definition. For ¢, we turn to the 5<£> equation in (12.1.3) and split E into a longitudinal and a transverse part, E = E\\+ E± with V x E\\ = 0 and V ■ E± = 0 where we might as well write E\\ = -V$ , ensuring Vx£|=0, and get Poisson's equation for ¢, ,, P(r',t) -V2<1> Similarly A = A,, +A± 4irp, <f>(r,t) = /(dr') with V x Tin = 0 and V ■ A± = 0 (12.1.4) (12.1.5) (12.1.6) (12.1.7) but here the freedom of gauge transformations, A —> A + VA (see Section 8.6 and Problem 12-1), says that the longitudinal part A^ is arbitrary and can be chosen to vanish: -Am =0 = a± , V -A = 0 . (12.1.8) This is the radiation gauge. It retains only the gauge-invariant transverse part of A. Incidentally, (12.1.5) in the E equation of (12.1.2) requires dA^/dt = 0 for consistency. In the radiation gauge then, note that the time-derivative term involves transverse fields only, / (dr) 1 47T ,-, is; ^E----A c at I = (dr) 1 47T c at (12.1.9) because a partial integration removes the virtual E\\ contribution; that the integral of E2 splits in two, that this longitudinal contribution is the electrostatic interaction energy of the charges, E? /(*)»|f-/W')^- = 5/(*)^ eae& 2 J V A ' r-r' 2^\ra-rb\ a.b (12.1.11)
12.1 Lagrangian, modes, equations of motion 439 where the a = b term will be omitted; and that _eaeb It a, b 5><&(r,*)]a = /(dr)p<I> = £!-r J „t Ir« -n\ (12.1.12) is twice this interaction energy. So the Lagrangian reduces to ^ = E (17* \ 1 P ^E eae& 2 *-? |ra-r6| + /(dr) _J_£ ^ j (V x A)" + jgj 4?rc X dt 8?r In addition to terms that refer solely to the particles, p.{--v +-mv i'part. — 2_j a or solely to the radiation field Afield = / (dr) dr \ . 1 2 -\Y.- eae& 2 "~7l \ra-n\ ^J^E d A (V X A)2 + E2± 4ttc X <% 8?r (12.1.13) (12.1.14) (12.1.15) there is a single interaction term, Lmt. = E -»<•(*) • ^(*•<•(*)» *) = /(dr) -j(r, *) • A(r, t) . (12.1.16) a ° J C The two equivalent ways of writing L-mt. emphasize the particles or the radiation, respectively. Now note that, owing to Stokes's* theorem and the transverse nature of A, 0 < /(dr) (V x A)2 = /(dr) [4-(Vx(Vx A))] = f(dr)A- (-V2)j4, (12.1.17) so that the negative Laplacian —V2 is a positive differential operator. It is expedient to introduce transverse vector eigenfunctions of —V2: r2 c 2Aa(r)=(^-)2Aa(r), V • Aa(r) = 0 (12.1.18) (with ua > 0 by convention) that are complete (as transverse functions, see Problem 12-3 for details), and orthonormal: 'Sir George Gabriel Stokes (1819-1903)
/< 440 12. Electromagnetic Radiation (dr) Aa(r)* ■ Ap(r) = 8afl . (12.1.19) Another important property is (dr) Aa(r) ■ A0(r) = 0 if uia ? uip , (12.1.20) /< which holds since A*a obeys the eigenfunction equation (12.1.18) with the same u value as Aa. To handle the infinitely many degrees of freedom of the radiation field, we express the transverse fields E± and A in terms of these mode functions as Hermitian operators, -A(r, t) = Y, \— (M*)^a(r) + ya(t)lAa(ry) , C *—* w ^j \ / a " ^±(r, *) = J] >/27r«wa (iya(*)Aa(r) - ifcWU^r) j . (12.1.21) Then, to within additional time derivatives, and, to within an additive constant, / (dr) (V x Af + E\ _^ ^ ^^ _ (12_L23) So we arrive at ^i^+EN^-^J ■ (12-L24> 2 ^ |ra -r6 = £fle From the structure of this Lagrangian one reads off the commutation relations and equations of motion. Of course, operators referring to different degrees of freedom commute, and within each degree of freedom, the only non-vanishing ones are jjjhfc.P*] = 1 , h\V,{Hy^ = [y'y*] = l ■ (12.1.25) Varying the y^'s produces the equations of motion of the j/Q's,
12.2 Effective action 441 {di ~ W<* ) Va = V fioT 5Z eaVa ' Aa(ra)* ' (12.1.26) and variation of the j/Q's gives the adjoint equations, ~{dt ~ Wa) V« = ~V fo7 ^ eot,° ' ^°^ " (12-1-27) ' a Upon introducing current components, ja(t) = $>at;a(*) "^a (*•<•(*))* = f(dr)j(r,t).Aa(r)* , ja(ty = Y,e*v*® '^"M*)) = f(dr)j(r,t)-Aa(r) , (12.1.28) a "^ (remember: products are symmetrized if necessary), the interaction Lagrang- ian has the compact appearance, /2irh W = E^fc+4) , (12-1.29) a and the equations of motion read The electromagnetic modes are described by driven oscillators - driven by the electric current of the charges. 12.2 Effective action First consider the situation in which the radiation field is not driven because no charges are present. Then we just have a collection of independent oscillators, and from Section 10.8 we know that </,*l|y",*2) = n<Wa'»*l|Wa»*2) E</lw>e^(M)(il~i2)<wh/'>> a2'2-1) {«}
442 12. Electromagnetic Radiation where \{n}) is the vector describing the state in which there are na light quanta - photons - in each mode a, with the multi-photon energy E({n}) = Y^f^ana (12.2.2) and the multi-photon wave functions (VT a«}i»">=n n\na iv'i) (12.2.3) When the charged particles are present we examine the transformation function (/,...,^,...,^) = (112) (12.2.4) where the dots represent some choice of description for the particles. Keeping in mind the structure of the Lagrangian L — Lpart. + Lint. + Lfleld (12.2.5) we see that, if we vary particle variables only, which appear in Lpart. and Lint.; <5part.(l|2) — -r(l|<5part. / dt (Lpart. + Lint.J |2) (12.2.6) where <W.Lint. = J2 \ TT [Wia Va + vt <Wt JJ ■ (12.2.7) V LUrv L J Now, the solution of the driven oscillator equations of motion (12.1.30) are [compare with (7.2.9) and (7.2.10)] Va(t) = e-^V-^vM + i\[^-a[M V(t- ne-^-^Ut') , fc(<)f = yQ(*i)t e-^^*1 - *) + i^-j\t'ja^ e^C ~ %{t' - t) . (12.2.8) As a consequence of the initial specification of the ya, ya{ti>) —> y"a, and the final specification of the y^a, ya(hV —> yt > one piece of the action variation ^part. Lint. IS
12.2 Effective action 443 J-2 ~~ V WQ L + v, He-^ofr-'U^Mt)] Jpart. fldt y]^va-A'(ra(t),t) J2 „ C (12.2.9) where -a1 (r,t) = Y,\^\A^^iUa{t^t2)y^+yi'^1Ua{tl^t)A»(ry} ■ C V b^a L -» (12.2.10) The second piece of <5part. /2 dt L;nt. is I'dtdt'iT— \6paxt.ja(t)Ht - *')e_iWa(* -*')ja(*) ./2 „ wa L = 6, part. (12.2.11) f At dt> i y; ^ja(t)^(t -11) e-^t* - *')ja(t') So, for a given effective field A'(r,t), we have an effective particle action dt[Lpart. + Y,~va ■ A'(ra(t),t)] a + ['todt'iY^—j^vit-Ve-^-^Mt), (12.2.12) which determines the particle part of the time transformation function, W* 5(...,h\...,t2)A =-(...,^1^11...,*2> (12.2.13) where A', that is: y"a and y]^ , held fixed. The complete time transformation function is given just by „t' - /„t' i, \„" (yl ,...,h\y",... ,t2) = (y1 ,h\y" ,t2) (... ,h\... ,t2)A . (12.2.14) v v ' v v ' photons particles under only the influence of photons
444 12. Electromagnetic Radiation 12.3 Consistency check Of course we should check what must be true: that this also contains the influence of the charged particles on the photons. So consider ^(1(2) = (11^(^)12) dyl = (1| [e-*"<*(*i - ^ ya(h) + i^^fdt e"^^ ~ *)ja(*)] |2) I V —a J2 y'L (12.3.1) and compare it with d dyl ^[(/,^,^)(...,*i|...,*2>A'] d-rr{y^My",t2)){...M\...M)A' 'it I \dyk + W,h \v"M)-?-j{... ,h I-.-, h)A' . (12.3.2) dyl As for the first part, we have —7{y1 , h\y",h) = -—eWf> y? dyl dyl = e -iua(t! -t2)„»/„.t' v'L{v] M\y",h) ■ (12.3.3) For the second part we use the action principle to vary A'(r,t), specifically by an infinitesimal change of yl : 5A, (...,h\...,t2)A' = U...,h\ f dt^2^va-6A'{ra(t),t)\... ,t2)A' , ^2 a C (12.3.4) where, for the present purposes, c \ u 5A> = £™ Syf e-'^(tl ~ *)Aa(r)*. (12.3.5)
12.3 Consistency check 445 So, ^ [l dte-™*^-*)(... ^¢)1... ,t2)A' , (12.3.6) and the two versions are the same, according to the clearly consistent interpretation of a particle matrix element, (l\ja(t)\2) = (y^,tl\y",t2)(...,t1\ja(t)\...,t2}A' . (12.3.7) We have carried out variations of particle variables and photon variables. But what about t\ and t21 Consider, for example, the t\ Schrodinger equation m^-(l|2) = (l|#|2). (12.3.8) Is this exactly reproduced by — {...,tl\...,t2) +W,h\y ,t2)~ ^ ? (12.3.9) It simplifies our task here to adopt the Lagrangian viewpoint, in which va = -~, (12.3.10) so that and [recall the definition of ja in (12.1.28)] dtLint. = ]T\/ ^jT [yaeadra ■ Aa(ra) + y\,eadra ■ Aa(ra)*] , a V Wa a (12.3.12) along with dt Lfieid = Yl [ihy* dy<* ~~ dt ^JaUcJ . (12.3.13) a We identify H as the coefficient of — d5t in 5 [dt L]. Thus, we see that
446 12. Electromagnetic Radiation a v 7 a^b ' a 0[ a s v / — ilpart. The ti Schrodinger equation now reads more explicitly ^-(/,...,*i|y",...,*2> (12.3.15) where we can introduce y,y^ eigenvalues: +ivlrIdte~~'luJa{tl~*)iaw' (12-3-16) and recognize immediately that the purely photonic part, ^J2^y»e~~iuJa{tl~~t2)y«(l\2) > (12-3-17) a is just what emerges from the t\ derivative of (/,^1^,¾) =eSa^'e"6',a('I"'2)^. (12.3.18) The only question is whether -^<y'>--- ,*l|[-Hpart. + i]T y/torfk^yi' j'dt e-™°(tl ~ *)ja(*)] |</",... ,*2) (12.3.19) equals (2/^,^1^,^)-(... ,*i |— ,*2>A' - (12.3.20) This is also the question: Is the t\ Hamiltonian associated with the particle function (... ,^ |... ,t2)A given by Hf = tfpart. + iTV^^ayi' f dt e-M*» - *)ja(t) ? (12.3.21) •/2
12.4 Pree-space photon mode functions 447 To answer this, we look back at the effective action W^ in (12.2.12) where At E>-^-E^^EV^-hw^ ,(1-¾)^ (12.3.22) and, for example, Atii = Yleadra ' Aa{ra) (12.3.23) Observe first that the t variation of the p ■ ■ ■ j term in W^ contains overt references to St but not to ^St; no contribution to H^ here. Of course, £part. gives if part., as before. That leaves the terms of (12.3.22). Again there is St, but not ^St. Problem? No. There is explicit dependence on the final time t\ in the effective field A'. The relevant part of W{% is [dt Yjeava-YJ\f^H^St1)yi'e-i^^-t)Aa(rar ■Sh (12.3.24) A' which displays just the required contribution to ii\ . 12.4 Free-space photon mode functions Let's be more explicit about the Aa(r), the photon mode functions, for the situation of unbounded space. Then, there are no boundary conditions to watch in addition to the defining properties stated in (12.1.18) and (12.1.19). The basic solutions are plane waves, sifc'r with \k\ = (12.4.1) which are given a trans verse-vector character by polarization vectors e^x (with A = 1.2 for linear polarization or A = ± for circular polarization, for instance) subject to k ■ ek\ = 0 and e^A* ■ ek\> U,\> , (12.4.2) and are normalized by reference to small wave number cells,
448 12. Electromagnetic Radiation AkX{r) = ekX I I {Ak) eik'r. (12.4.3) a/(27t)3 (<5fc) J 5k) Indeed, in view of [ {6k) ( {dk') [{dr)e-^k"k,^r = {2n)a{6k)6k.k, (12.4.4) J{5k) J{5k') I , = (27r)35(fc-fc') their orthonormality is easily verified, /(dr) Ahx{rT ■ Ak'Ar) =e- ■ ek'x W7PTWT x / (dfc) / (dfc') f{dr)e-[(k"k'yr J{Sk) J{5kl) J = Sk,k>S\,\> ■ (12.4.5) As long as the range of r is restricted, |Z\fc-r|<l, (12.4.6) where Ak measures the size of a {5k) cell, it suffices to write [now using (dfc) rather than {Sk)} AkX{r)=ekJ^eik-r. (12.4.7) This is, of course, just the trans verse-vector analog of the scalar mode function (10.8.12). With this choice we have {u = c|fc|) + e%xe-ilk-r-u(t-t^vQ, (12.4.8) where it might be clearer to associate the explicit il512 dependence with the corresponding eigenvalues, that is ^[fc-r-wf^giwfc. e~-i[k-r-ut]y^er-iuh ^ (1249) As for the non-local in time interaction term, WW-ioc = f dtdt'lT— Utfrt-^e-^-^U*), (12.4.10)
12.5 Physical mass 449 we have ;«(*)*->$>«„(*) W|^eifc -MO, JaW^e'MV'elJ^e-*-^) , (12.4.11) so that W12>n.loc = ^ d* d*' J] I j^ (J2 eava(t) - ekX eifc " MO) x ir,(t - *') e~ia;(* ~ *') (J2 e»»»(*') ' 4a e"ifc ' r6(i'}) ■ (12.4.12) 12.5 Physical mass To begin our applications, let all y" = 0 (no photons initially) and all y^ = 0 (nor finally), so that the effective field vanishes: A'(r,t) = 0, and consider just one particle (mass m0, charge e, position r, momentum p, velocity v). Then x ir,(t - *') e-iw(* ™ e)v(t) ■ e*kX e-ifc " r(*') , (12.5.1) where the time-non-local term accounts for the net effect that results from the emission of a photon (at the earlier times t') and its subsequent reabsorption (at the later times t). In this situation we can well anticipate that the whole system moves with constant velocity, and constant momentum, as described non-relativistically by an effective Hamiltonian * = £' d^=°< (12-5-2) so that [r = r(t = 0)] v = — , r(t)=r+~t = r + vt . (12.5.3) mm
450 12. Electromagnetic Radiation Then we encounter the operator product v ■ ekX eifc ■ (r + vt) e~ik " (r + vt'^v ■ e*kX , (12.5.4) where, for example, v ■ eu\ and elk\r +vt) commute because ek\ and k are perpendicular [k -ek\=0, cf. (12.4.2)]. Now, one knows, in various ways, that eAeB ^eA + B+\[A,B] + --- ? (12_5>5) where (recall Problem 2-10b) the series terminates if, as here, [A,B] commutes with A and B: A = ik ■ (r + vt) , B = -ifc ■ (r + «*') , U,i3l = [fc-r,fc--^*'l + [*:• — t,k-r] = -ih—(t-t') . (12.5.6) 1 J L m m m v v • v v • So _ifc ■ (r + vt) .-ifc ■ (r + vt') = „ifc ■ w(i - t') P-»fi£(* - «') = eifc.t,(i-Oe-"^(*-*'), (12.5.7) which presents us with the time integral f1 At At' e-i["-k-v + fw2/(2mc2)] (t - t')irj(t _ f!) (12 5 8) If we restrict attention to photons of non-relativistic energy, Hio^mc2 , (12.5.9) the ratio of the third to the first two terms is Two 2mc2 « 1 , (12.5.10) and we neglect the third term (while noting the potential for a relativistic treatment). For the second term we have \k-v\ < -\v\ =u^A , (12.5.11) c c which is neglected relative to uj for non-relativistic motion, |v| Cc. So, the exponent in the integrand of (12.5.8) is effectively equal to — icj(t — t'), and we are left with
12.5 Physical mass 451 f dt dt' e_iw(* ~ *')iJ?(* -t')= f At f dt' i e^* ~ *') = [tldt±(l-e-i»(t-t*)) '*2 fii (12.5.12) Concentrate first on the secular term, the one growing linearly with the duration T — ti — t2- That contribution to W\i is / d^2 E / j£^jv ■ e*^ ■ eix ■ (12.5.13) Inasmuch as there is no explicit dependence on k, the polarization sum is over two of three orthogonal directions, so on the average Ylv-e^v-e*^->h2 ' (12.5.14) and (dfc) -> 4?rfc2dfc = 4?r^dfc . (12.5.15) This gives for the additional action term [ dte2~v2-^rdk= [ dt-Smv2 (12.5.16) J'2 3 -K& J2 2 where so that 4 P2 /•27r/Amin 8 p2 /A ■ 37T i 2 /•27r/Amin Rp2/\ ■ ■~ dk = |li^H£l (12.5.17) ■ c2 y0 3 c2 e2 <5m < m if Amin > —5- , (12.5.18) requiring a restriction to the non-relativistic situation to stop the linearly divergent integral [which, as Weisskopf* and Furryt noticed, is only logarithmic, relativistically, when w gets replaced by u + hk2/(2m)}. This piece of the action adds directly to f2 dt ^m0v2 to effectively change m0 into m = m0 + 5m, (12.5.19) the "renormalized mass". 'Victor Frederick Weisskopf (b. 1908) fWendell Hinkle Furry (1907-1984)
452 12. Electromagnetic Radiation 12.6 Infrared photons Now we turn to the non-secular, transient term of (12.5.12), which is produced from (12.5.13) by the substitution I '^[..•l->[•••! (-^) (e-iwT-l) , (12.6.1) /2 V ^ so, with (12.5.14) and (12.5.15) „2 Wta../d*|^.'(-±)(.-^-l) £f ^(o— -1). <!«,) _13 7TC3 For a state of definite momentum (velocity), this is a numerical addition to the action, one that produces a change in the time transformation function by the multiplicative factor 2eV fdu / -iwT_1\ jrWi2,„-sec = e3 ntlc3 J u) V 7 dw . „ „ 2eV /"dw 3 Tr/kr / w (12.6.3) The factors e"""^ , e""1^ +a; )T, ... clearly indicate the presence of one or more photons. What's going on? The initial choice y'^ = 0, "setting the initial field equal to zero", denies the existence of the magnetic field associated with the uniformly moving charge. In effect, we have set v = 0 at time t-2. Then at time t2 + 0, the magnetic field springs into being, as though the velocity v comes into being instantaneously. That, as we know, produces radiation. Indeed, for long wavelength, 'infrared' photons, it is known that the relative probability of emitting a photon in the range duj is as one sees above. In fact 2t?V^ (12.6.4) 2 e2v2 e 3?r he3 , e 3tt he3 -—, ... (12.6.5) are the absolute probabilities for emitting no photons, one photon, ... ; that these probabilities add to unity is immediately apparent:
12.6 Infrared photons 453 nac6 J u V ' = 1 lim e37rfic3 i w V J =1 . (12.6.6) One may wonder why these factors are probabilities, rather than probability amplitudes. That is because they contain both the probability amplitude for emission (at time t%) and the complex conjugate probability amplitude for absorption (at time tx). The absolute squared amplitude is the probability. One can live with this description, but it would be more physical to make the magnetic field explicit from the beginning. For that purpose, go back to the Lagrangian (12.1.24), with L-mt. in the form (12.1.29), and the ya equations of motion (12.1.30). If there were no time dependences in the current components ja, the steady-state solution for ya would be Accordingly, let us, more generally, redefine ya, y'a Va -> — J^-Ja ■ (12.6.7) ua y nu>a U!a V tV-da Then, for example, — — V CJa whereas L-mt. becomes Ei%4^ -> 5>»«^« + E iv irAiv« (12.6.1 e&-e§^. <"*.) E JW «»■ + S»J»! + 2 E ,¾. i (12-6-10) a a a the sum of the two is just - Y, ^c,yiya + Yl -2&ia ■ (12.6.11) a a a No interaction between photons and charges? It is there, in ^v^dt'«+£^dt'°- (2-6-12)
454 12. Electromagnetic Radiation Thus, we now have a new interaction Lagrangian L;nt. (shift the time derivative to p), and a new particle Lagrangian Lpart,, w -> E [p ■ (S -«) - \^2} „ - 5 E ^¾ + E 5^«. (12.6.14) setting aside, as of only marginal interest at the moment, the particle contribution involving the i{P f^j — 4^ p j) terms of the last sum in (12.6.12); we'll remember about them later, on page 458. What is the additional particle term, for just a single particle? It is, with the single-particle versions of (12.4.11), f (dfc) e2 2 2 2 e2 f2*/x™« i (12.6.15) producing, for each particle, the mass renormalization (12.5.19). How about the interaction between different particles? That is contained in / S ^ E E e«e*eA ■ "a elk ■r" e~ik ■ r»ebe*kX ■ vb A a, b -> E ^ j ^wVa ■ e*A ■Vb eik'{ra ~rb) • (12-6-16) where, with 1 denoting the unit dyadic, E e<^A = 1 - I? , (12-6.17) A=l,2 which, of course, states the geometrical fact, exploited already in (12.5.14), that this dyadic sum is the projector to the plane perpendicular to k. This then gives 1 v-^ eae& 1 f (dfc) r 1 I i/( a^b J r = ra -rb (12.6.18)
12.7 Effective Hamiltonian 455 where /Me*., = 4 r~dfc^ = ^ (mU9) J k1 J0 kr r [note that this is essentially the inverse Fourier transform of the Coulomb potential that we've seen in (11.4.18)], and />0O / dk [sin(fcr) — kr] Jo f/;-M-)-H^(-i =2l,rAk'j^pi = ^r< (12.6.20) Jo K where the equivalent replacement in the first step subtracts an r independent term, which is spurious, since this is differentiated eventually, _ _ T-rvb-r va-vb va-rvb-r no«oi\ va-Vvb-Vr = va-V = 3 . (12.6.21) V|eifc-r Thus, 1 /,(dfe)r 1 _ _ Vg -Vb _ 1 TVg -Vb _ Vg ■ T V b " T1 _ 1 fttg 'Vj Vg ■ V V b " 7» 1 r 21 r r3 J ~ 21 r r3 J ' (12.6.22) which gives us the magnetic-energy interaction contribution to Lpart., 1 V^ eaeb va ■ Vb + Va ■ ~ Vb ■ ~ 4x: known as Darwin* term. 12.7 Effective Hamiltonian (12.6.23) r = ra -rb The new form of the interaction Lagrangian L;nt. in (12.6.13) is obtained from the old one in (12.1.29) by the substitution *Charles Galton Darwin (1887-1962)
456 12. Electromagnetic Radiation _L_d_' Ja ~? ~T,Ja > ua At .t i d ,t -7° wa dr a (12.7.1) This implies no essential change in the A' term because, for example, eluJatja and e \UJat j_A_ ua At Ja (12.7.2) differ by a time derivative. But there is a significant change in the time-nonlocal interaction term of the effective action (12.2.12), which now reads wWic dtdt'i^^V-O^^-^^ •u% At i'^'E/||5[sX>-<"-- d*' jk -ra(t) Note here that, for instance, At At (12.7.3) (12.7.4) so that the magnitude of the second term, compared to the first, is of the order I ^-1 I M I \V\ U)\V\ V I di <1 (12.7.5) because one expects the important values of u to be of the order of j ^ j /1 v \. For the simplicity of writing in what follows let the system be a hydrogenic atom, so there is effectively only one moving charge. Then the non-local in time supplement to the particle action becomes Wl2,n-loc — h ^ J (2tt)2w3 At xi^-Oe-^-O^l.e^e-1*-^). (12.7.6) We should try to find an effective Hamiltonian H in a self-consistent way. It begins by writing fottl e-ifc • r(t') = e-jrH(t - 0 SM*) e-ifc • r(t) ejrH(t - t') {u ? ?) At' At
12.7 Effective Hamiltonian 457 and by supposing that the system is initially in the E0 eigenstate of H, so that on the right side, H —> -Bo- Then the essential terms of the t' integral are fAt'ie-Ln^ + H-E0)(t-t') = f^2 dr{fi-i(J*, + H - E0)r Ji-2 JO (12.7.8) h hu + H -E0 which has the form l-ie-'1XT (12.7.9) x x with x = u + (H — E0)/H and r = t — t2, and equals ir for x = 0. Now we must recognize that r, which is of the order of the total measurement time T = t\ — t2, is a very long time on the atomic scale, particularly if we are interested in energy measurements or the details of transition processes. This means that the rapidly oscillating x function e~~1XT will generally not contribute to x integrals, except in the neighborhood of x = 0. Now, breaking (12.7.9) into real and imaginary parts, 1 - cob(st) + . sin^r) ^^ /: we observe that f\^,rd^=I for XoT>1> (12.7.n) ' ~xo ^ J—oo y so that ^^ -> nS(x) , (12.7.12) x whereas [l — cos(xt)]/x, which vanishes at x = 0, is l/x with the singularity at x = 0 removed: the principal value (V). In summary then, 1 _ e-ixr 1 ► V- + mS{x) , (12.7.13) x x effectively. With only a single integral left, we can present the change 6Wi2 in the action in terms of an addition 5H to the Hamiltonian, dv ~dt '" = -•* £/£$;?«« x ej» — . (12.7.14)
458 12. Electromagnetic Radiation Now, for any function of p and r, eife ' r/(p, r) e_ifc - r = f(p - hk, r) . (12.7.15) Is this change of the electron momentum p by a photon momentum hk relevant in the non-relativistic regime we are considering? No, it isn't because the important photon energies Two are of the order of atomic energies, so that \hk\ = hu/c is very small compared to \p\ ~ (atomic energy)/?;. Thus, the exponential factors can be discarded in (12.7.14), and then the polarization sum can be done once more, drc**[-te*-dF->3dr[-Jdt' (12-7-16) A=l,2 and we arrive at 2 e2H f du dv r 1 xt i dv /du dv r 1 ._ tt „,ii ^■hjTir^ + ^ + ^4 (12.7.17) This addition to H from Wi2,n-ioc is not the whole story, however, because the terms put aside on page 454 must be added. They give an energy contribution ^ul{Jadt dt3a) -ft(vekXeik-r)e'ik-re*kX-v] /du . / dv dv \ /.„ „ .„x ^ <*■*-*■*)> (12-7-18) ~* 3?r c3 where the intermediate steps are analogous to those between (12.7.3) and (12.7.6) as well as (12.7.16). Putting the pieces together, the first-order correction in the effective Hamilton operator, 8Hi-iSH2, (12.7.19) is not a Hermitian operator but has an imaginary part, CTT 2e2H f du dv TT , dv ,-,n~nn\ 5H2 = -— I — — ■ 8{hu) + H-Eo)—, 12.7.20 3 c3 J u dt dt
12.8 Energy shift 459 in addition to its real part 2 e2 6Hi =- 3?r cA J u [dt \ fkJ + H \ dv H -E(J~dt 1 / dv dv \ 2 V dt d£ ) (12.7.21) 12.8 Energy shift Let's first look at the energy change (E0\5Hi\Eo) induced by this real part. A simplification is achieved by using 1 [v, H] = ^ [v, hu + H - E0] = I [v, 0} = i[0, v] (12.8.1) dv dt ihL ' J ihu l to rewrite the integrand (principal values understood): dv 1 dv 1./ dv dv \ , v ^,-1/^, ^, \ + \v ■ [yfl - fiv) + \ (fiv - vfi) = nvn~lvQ -\v2n -\nv2. 4- 4- (12.8.2) As indicated, the fi's to the very left and the very right will stand next to (Eq\ and \E0), respectively, and therefore they can be replaced by their eigenvalue u. So dv 1 dv dt' n~dl + and we have 2 e2 1./ dv dv \ , / 1 1 \ -it)-- --v)->w2v (— v (12.8.3) 2 V dt dt J \fi ujJ v ' 6Hi =- 37TC3 2 e2fi : p_ Two + H — E0 uj i). 37r m2,e with p = meiv. Owing to the occurring subtraction, 8H\ contains no contribution proportional to v2; and, indeed, it shouldn't since mass renormalization is already taken care of by the extra term in (12.6.14), as we've seen at (12.6.15). And so the mass appearing in (12.8.4) is the physical mass of the electron. Mindful of the (E0\ ■ ■ -\E0) context, we note that (principal value understood),
460 12. Electromagnetic Radiation 5Hi = — - ^-^7 I Auiui 37r m^c3 2 e2H 37r m2x6 f - / dojup- 1 H + Hiu--Eo 1 p are equally good, and so is half their sum. Now for any operator A, and L J l for H = p2/(2me]) + V(r), so that ,P 1 P, H + tkj-E0 and (take the adjoint) 1 ■p->--l?W 1 fai H + ho: — Eq V P _H + hu-E0 This gives e2ft P -> -p <Jffi -» - iirm\xc? Jo dwi VV 1 37rmg,c3 VV ■ log p fiio„ H + Hui- E0 1 H + hw- E0 P vv\ H-En p - p ■ log filOr, \H-Eo\ (12.8.5) (12.8.6) (12.8.7) (12.8.: (12.8.9) VV (12.8.10) which stops the non-relativistic integration where it certainly breaks down, at /wmai of the order of meic2, the relativistic energy associated with the electron mass me\. Suppose we replace the logarithmic operator \og(hwmax/\H — Eq\) by some effective numerical value log(meic2/AE) . Then we get ^^log(^)(-i)(VV.p-p.Vv) (12.8. 11) =\hV2v and, with -Zp2 v2v = v2- - ^-^2^ = 4irZez6{r), (12.8.12)
12.9 Transition rates 461 we have '^sss?1* Oaf) ***'<'> t \ 2 / 2 " \ ry 2,... (melC 3 he \me\c) \ AE Ze'log -nH<S(r), (12-8-13) so that which involves the wave function ^/¾ (r) at r = 0, that is: at the site of the nucleus. We know that, for I = 0 and principal quantum number n, hK(0)|2 = -(—V (12.8.15) [(8.3.30) in conjunction with |loo| = (47r)-1]) and with the recognition that the ratio of the electron's Compton* wavelength h/(me\c) and the Bohr radius ao = h2/(meie2) is Sommerfeld's fine structure constant a, H ^0=^ = 7^ (12-8-16) meic/ 137.036 we get Bethe'st result 8Enfi = („,01^1«, 0) = ^^ log (¾) - - (12.8.17) This upward displacement of the I = 0 states is known as the Lamb* shift. Together with relativistic effects of order a2 it gives a complete account of hydrogenic fine structure. 12.9 Transition rates Having dealt with the real part 8H\ of (12.7.21) we now turn to the imaginary part SH2 of (12.7-20). What is its significance? Look at the (Eo,ti\E0,t2) probability amplitude. The "energy shift" —i {5H2) contributes to the time factor e ft the real factor e~\{5H2)T (12.9.1) 'Arthur Holly Compton (1982-1962) fHans Albrecht Bethe (b. 1906) *Willis Eugene Lamb (b. 1913)
462 12. Electromagnetic Radiation implying that the probability for persistence of the state after the elapse of time T = t\ — t2 is -l(5H2)T _ -7T e ft = e (12.9.2) so that 7 is the probability per unit time for the system to leave the state. In other words, 7 is the decay constant of the unstable system associated with spontaneous emission. We have thus 2.,,. 4e2 ,dt; fm dw ... TT dv , _, . 7=-(^^0) = 3^01^ -S^ + H-E0)-\E0) (12.9.3) with />00 J 1 / ^S(hiu + H-E0) = - wv(Eo-H) J0 U) tjQ— ii E(<E0) (12.9.4) where the ellipsis indicates further quantum numbers in case of energetic degeneracy. So _ fV y^ i_ 3 c3 ^ E0-E E(<E0) <*.-\%W (12.9.5) which is a sum over all states of energy below Eq. Alternative versions are obtained from 3^ = 5^ = (*)>.*],"] (12'9-6) and v = p/mei; they read 4 e2 y^ Eq-E 1' ~ 3hc *-*> h E(<E0) 4 e2 y^ E0-E ~ 3 he 2-*> h E«E0) _4 /e2\3 Y- ^o 3 \hc) f-" (E,- (E,- -E H \V\F \ 1 cl / 2 1 p If \ ]meic< fEo-E^ \ e2/«o , r (£,..-1-1¾) ao (12.9.7) and illustrate that one can equally well use transition matrix elements of r or p for the evaluation of 7.
12.9 Transition rates 463 The individual terms in (12.9.5) obviously represent the rate at which transitions are made with the emission of a photon of (angular) frequency u = Ue0 - E) . (12.9.8) This is the quantum analog of the classical Larmor* formula for the rate of radiation of energy. £(£)' = '■ In fact, the quantum rate of radiation of energy is obtained by multiplying each transition rate by the appropriate Two = Eq — E, giving 4e P ^ 3c3 E(<E0) dv i (E,...\^\Eo) dt 2 (12.9.10) For sufficiently excited states, the sum of matrix elements to lower energies becomes equal to the sum to higher energies, so that the replacement £ -+5 E <■"■»> 2 E(<E0) all E is permissible, and -!£<*-i(S),w-£((S)')ik. «—» This recovery of the classical Larmor rate is an example of how classical physics is contained in quantum mechanics as a limit. How about a direct derivation of the transition rate? Return to (12.2.14), {^\...,h\^,...,t2) = {y1'My',t2){---M---^)A' ■ (12-9.13) If we want the probability amplitude for one photon finally, given none initially, we are looking for a term with but a single y^a = (y^ jlQ)- That can only come from (■■■)■■ -)A since (yt ; ^ jo, t2) = 1 ■ So we want 9 ' ■ ' ■ \A'\ _ A/ . I 9 urA'\ *\A'=o -7(... M\.--M)A U=0 = jj<- .tilT-pWtf |... ,fc>' oy'a n dyhc (12.9.14) where [cf. (12.3.1); here one particle only, for simplicity] *Sir Joseph Larmor (1857-1942)
464 12. Electromagnetic Radiation d 4< = f dvt h = J dtev(t) 2-kH dyl J2 V w, where the so-called dipole approximation, e-'*>*(ti -1) Aa(ry A (rY-Ji^le e"ifc'r a[ ' ~ V (2?r)3 a (dfc) (27T) 3 °a ' (12.9.15) (12.9.16) is applicable since typical photon wavelengths A = |fe \ ~ are much larger than internal atomic distances r, so that |fe ■ r\ < r/X <C 1- Then the probability of the transition Eq —> E,... is 2 dt(E,...\ea-v(t)\E0)e[u}t 1 (dfe) 2irh 2 h2 (2?r)3 (12.9.17) where iHt\ (E,...\v(t)\E0} = (E,...\enmve~nHt\E0) = (E,...\v\E0)e^E"E^ which gives, see Problem 9-9d, r dtei(E-E0 + hu)t = T2nH6(E - Eo + hu) (12.9.18) (12.9.19) This is proportional to the duration T = t\ - t2, and so the probability per unit time is 1 (dfe) 2^e22nh6{E^Eo + nio)^ea.v^Eo^ /z2(2?r)3 u, " ""'""v" ~u ' ~~>i\-r<« "i~u/i ■ (12.9.20) We sum over the two polarizations and integrate over fe, and get the rate for transitions E0 —> E, ,2 -^- <J(«w + E - £0) e2 - \(E,. ..\v\Eo)\ 4 e2 Eq-E 3 fie ft (E,...\-\Eo) v(Eo-E) (12.9.21) The sum of these partial rates is the total rate 7 of (12.9.7), which shows the consistency of the two ways of calculating 7. If there had been n photons of this kind present initially, and we ask for the probability of one more, the additional y^ factor multiplying ('</ jn) gives \/n + l(yt jn _|_ 1^ so that the probability (per unit time) has the factor n+ 1: (n + lfoE<-E0 = njE^E0 + 1e<~e0 ■ (12.9.22) In addition to spontaneous emission (rate ^e<~e0) there is stimulated emission (rate wye<~e0), stimulated by the n photons already present. A similar discussion for absorption gives a rate proportional to n, the number of photons available.
12.10 Thomson scattering 465 12.10 Thomson scattering As the last application, we consider a situation in which one photon (wave vector k', polarization A') is present initially, another photon (fc, A) finally, and there is a single charge (electron) that is at rest, p = 0, initially and finally: scattering of a photon by a charge at rest. To extract the amplitude V'scatt. = (Ua,P = (Mi|lfe'v,P = 0,t2) we differentiate twice d d (12.10.1) tps, dyl dy'i (y* ,p = 0,h\y",p = 0,t2) ,r -n ,,"- = 0, y" = 0 So, where with a = kX , /3 = fc'A' , a ^ /3 ^scatt. = <0,p = 0,^(^(0,^ = 0,*2> \ oya /=0, y" = 0 or, equivalently, *= L(*,)w(t2r+^|^ ,f (12.10.2) (12.10.3) (12.10.4) (12.10.5) ■■ o, »" = o corresponding to the choice of order in which we differentiate. In the long-wavelength limit we have in mind, the dipole approximation (12.9.16) is appropriate. Then (T = t\ —12 again) ya(h) = ya(t2)e-'luJ*T + iJ^J1 dt e^'1 -*)et,(t) - Aa(r(t))* ^-T^Mr' (12.10.6) and with „,,„ = , - ^ -, £ ^1^ [* e-i-« - «,, ua> V (27T)3 2V + ^;e-i^(ti-t)e*,] (12.10.7)
466 12. Electromagnetic Radiation we get dy'p V ^a y (2tt)3 y w0 y (2tt)j V "ie] / a e/3 ■ (12.10.: So the transition probability is i/ i2- fe2 V ^2?r (dfe«)i^M ^ * v Vmel/ u«w,i (2irY (27r)J " p/ /" dte^" 2 (12.10.9) = 27rT5(o;a — up) and is the transition rate (now writing ka,oja = k,u and kp,ujp = k',u'). The 5 function states the expected: the scattering is elastic, the scattered photon has the frequency of the incident one. We are interested in the differential cross section da for scattering into solid angle dO, so we put 2 i (dfc)=^-^d/2, (12.10.11) integrate over the frequency uj of the scattered photon, and divide by the incident flux c(dfe')/(27r)3, giving (2 \ ^ ^J (eJ-ev)2d/2. (12.10.12) If we have no knowledge of the polarization of the incident photon, we must take the average of the two A' possibilities, (el ■ ex,f - \ £ K ■ evf = \ - ^^ , (12.10.13) A' = l,2 l l k and if we don't discriminate between the two polarizations of the scattered photon, we must sum over the two possible A values,
Problems 467 K ■ ex,f -> £ (cj ■ ev)2 = 1 - ^¾^ • (12.10.14) k2 A=l,2 Taken together, they amount to where 0 is the scattering angle, k ■ k' = kk' cos0. So the polarization- insensitive version of (12.10.12) is da 1 ( e2 x 2 ) (1 + - cos2 6») . (12.10.16) d/2 2 \meic2 The total cross section, -/^^-f^V^fl+iU^f^V, (12.10.17) is the classical Thomson* cross section for light scattering by small obstacles. The remark made above about the Larmor formula applies here too: classical physics is contained in quantum mechanics as a limit. Problems 12-1 That there are not equations of motion for all field variables of La- grangian (12.1.1) is implied by the ambiguity, or freedom, associated with gauge transformations, 1 d A -» A + VX , <!>->$ —A, cot E-+E , B->B , where X(r,t) is arbitrary. Show that an infinitesimal gauge transformation gives SXL = / (dr) /< 1 . _r. 1 d ' -J-VSX + p-jr-SX c cot Then apply the principle of stationary action to obtain the continuity equation Explain why this states the local conservation of electric charge. *Sir John Joseph Thomson (1856-1940)
468 12. Electromagnetic Radiation 12-2 Use vector identities to show that F±(r) = |(dr') 47r|r^r/|V' x [V x F(r')} are the longitudinal and transverse parts of vector field F(r). 12-3 The completeness relation of the transverse mode functions Aa of (12.1.18), Y,Aa(r)Aa(rr=5±(r,r'), a is in terms of the transverse delta function, a dyadic with the properties V-8±(r,r')=0, V • 8±(r,r') = 0 , and |(dr')8±(r,r')-F(r')=Fx(r) for any F = Fy + Fx- Show that / (dfc) Pifc • (r - r') (■, _ tf. J (2tt)3 V k< where 1 is the unit dyadic, has these properties. 12-4 The mode expansion (12.1.21) constructs the transverse fields A and E± from the mode functions Aa and the non-Hermitian variables 2/a,;y„. Reverse it and express ya, j/£ in terms of A and E±. 12-5 Justify (12.1.22) and (12.1.23). 12-6a Given a set of orthonormal transverse mode functions AQ(r), show that the set of functions defined by B„(r) = -Vxia(r) are mode functions that could be used equally well. Can you express the Aa(r)'s in terms of the BQ(r)'s? 12-6b Recognize that B(r, i) = Vx A(r,t) = ^ V^tkua (ya(t)Ba(r) + ya(t^Ba(r)*) .
Problems 469 Then verify that the radiation-energy-density operator is U^~ (E\ + B2) = ^X>£SJ[ (A*a -A0 + B*a- Bp) yiy0 ~ 2 iAa ' A0 ~ B» • Bp) VcVp -\(K'A0-B*a-B*0)yivl], where the same terms are omitted as in (12.1.23), the terms that would give rise to a non-zero energy density of the vacuum. Check that I (dr) u = Yl ^ocviva 12-6c The energy-flux-density operator, the analog of the classical Poynt- ing* vector, is S = ~E± xB . Air Give its mode expansion (usual omissions). Use it to prove that ^U + V-S=-j±-E±; this is the quantum analog of Poynting's theorem. 12-7a Apply the principle of stationary action to the Lagrangian (12.1.13) and find the fundamental field commutation relations [a-A(r,t),b-E±(r',t)] , where a and b are arbitrary numerical vectors. Then verify that the mode expansion (12.1.21) is consistent, provided that y,y^ have their usual commutation relations. 12-7b State the commutation relations [a-B(r,t),b-E±(r',t)} . 12-8a Suppose the current components ja(t) in (12.1.30) derive from a classical electric current j, so that they are numerical quantities, not operators. Assume further that j = 0 for t < fe and t > t\, and find the vacuum persistence amplitude (0,t\ |0,£2)J- "John Henry Poynting (1852-1914)
470 12. Electromagnetic Radiation 12-8b Now consider j = j< + j> with j< = 0 for t > T and j> = 0 for t < T, where T is an intermediate instant, t\ > T > t2- Use the identity <0,*i|0,*2>J"<+J'> =^(0^i|{«},T)J'>({n},T|0,t2)J'< {«} to find ({n},ti\0,t2)j and (0,*i \{n},t2)j. 12-8c Give an independent derivation by first constructing (y* ,tx 10,^2)J and (0,ti ly",t2)-'; then compare. 12-9a For given wave vector k, adopt a coordinate system in which k = ke^. Then the real unit vectors specify linear polarization, and ek+ = 2"2 (ei + ie2) = i 1 efc„ = 2"2 (iei + e2) = -j=\ 1 specify circular polarization (or, better, helicity). Verify explicitly that E exA = E ek\elx = 1 - TT = [ 0 1 0 A=l,2 A=± ^ V0 0 0, 12-9b Repeat for the pair of polarization vectors Ska = ei cos # + e2 e11*5 sin $ , e^b = ^2 cos •& — e\ e~11? sin •& . How are they related to the ek± pair? Polarization of which kind is specified by eka,ekb? 12-10 Charged particle in an isotropic three-dimensional oscillator potential; initial state \n'x,n'y,nz) = |2); final state (nx,ny,nz\ = (l|. Find the transition rate 714-2 ■ Then sum over all possible transitions to get the decay rate of the initial state. 12-11 Hydrogenic atoms: Concerning matrix elements of the type In, l,m\r\n',1',m') ,
Problems 471 find a selection rule that states which elements are not automatically zero. [Hint: Consider r —> —r.] 12-12a Hydrogenic atoms: Find the decay rate of the states with principal quantum number n = 2, angular quantum number 1 = 1, magnetic quantum number m = 1 or m = 0 or m = -1. Why is it sufficient to consider one m value? 12-12b Repeat for n = 3, I = 0,1,2. 12-12c Repeat for arbitrary n,l = n—l,m = I, the circular Rydberg states of Problem 9-16. 12-13 The Thomson cross section (12.10.17), an area, is a multiple of a squared length, e2/'(me\(?), the so-called classical electron radius. What is its relation to the electron's Compton wavelength and to the Bohr radius?
Index Angstrom unit 305 Angstrom, Jonas Anders 305 absorption 464 action operator 209 action principle (see also quantum action principle) 209 adjoint (see also measurement symbol, adjoint ~s) 54, 91 - matrix representation of an ~ 57 -- of a measurement symbol 54 - of a number 55 - of a product 54 - of a unitary operator 67 - of right and left vectors 54, 66 Ag atom see silver atom Airy, Sir George Bidell 232 Airy function 238 asymptotic form 239, 241 - completeness of ~s 264 - differential equation 232 - extrema 248-252 - Fourier integral 232 - integrated ~ 232, 255, 263-265 - orthonormality of ~s 264 - plot 252, 255 - value at 0 240, 263, 265 - zeros 248-252 a particle 356 angular frequency 203 angular momentum 151,183,200 - as a BE collection of spins 383, 387 - commutator with scalar operator 152 - commutator with vector operator 151 - conservation of ~ 198 - decomposition into spins 155-158 - differential-operator representation 165-167, 296 - eigenvectors 152-155 - rotated s 168-177 - external ~ 188 - for the Stern-Gerlach system 155 - internal ~ 188, 190 - of composite systems 158-162 - orbital ~ 188, 289, 297 - spin ~ 188 annihilation (see also destruction) 37,41,43 annihilation operators 377, 400 anticommutator 35 anticommuting numbers 399 anticommuting quantities 392 atomic energy unit 419 atomic scales 305 atomicity 7,12,46 axial vector 359 - geometrical significance 360 BE (Bose-Einstein) 356 BE statistics 356 - and spin 358 - commutation relations 380 Bell, John Stewart 179 Bell states 179 Bessel, Friedrich Wilhelm 284 Bessel coefficients 284, 301 Bessel functions 284 - generating function 284, 285, 301 - power series 301 - spherical ~ 369 - sum rule 285 /3 decay 337 Bethe, Hans Albrecht 461 Bethe's Lamb shift result 461 binomial expansion 123,143 Bohr, Niels Henrik David 6,17, 74 Bohr energies 305, 363, 422 Bohr magneton 324 Bohr orbit 420 Bohr radius 304, 414, 461, 471 Bohr shells 431,435 Bohr's complementarity 17, 74
474 Index booster 183 Born, Max 14, 85 - and Heisenberg's non-commutativity 85 Born approximation 372 Born's probability interpretation 102 Bose, Satyendranath 356 Bose-Einstein see BE boson 380 boundary condition 253 bra 38 Brillouin, Leon 246 Carlini, Francesco 246 causal theory 1-3,15 causality 2 center of gravity 343 center of mass 343 - cross section in ~ frame 355 - position vector 343 change of description 213-214 charge in electric and magnetic field - Hamilton operator 321 - Lorentz force on ~ 322 charge in homogeneous magnetic field - commutation relations 324 - count of states 328 - energy eigenvalues 325 - Hamilton operator 324 classical analogies 17 classical electron radius 471 classical kinetic energy 241 classical limit 73,132-134,188, 285, 432,463,467 classical momentum 241, 244 classically allowed region 243, 246, 410 classically forbidden region 242, 243, 246,317 coherent state 130 - (over-)completeness of ~s 131 column (see also column vector) 59, 60 column vector 59-63 commutator 35 - identities 90, 91 - trace of a ~ 91 complementarity 17,69-76 complementary properties 74 complementary quantities 76 completeness - of p vectors 103 - of q vectors 101 - of U and V vectors 101 - of Airy functions 264 - of coherent states 131 - of measurement symbols 33, 98 - of minimal-uncertainty states 131 - of radiation field mode functions 439 - of stationary-uncertainty states 123-125,128 composite system 76 Compton, Arthur Holly 461 Compt on wavelength 461,471 connection formulas 243, 246 conservation - local ~ of electric charge 467 - of angular in omentum 198 - of energy 197 - of linear momentum 198 conservation laws 197-199 constant force - asymptotic wave functions 239-243 - energy eigenstates 237, 238 - their completeness 264 - their orthonormality 264 - equations of motion 224 - Hamilton operator 224 - state density 233 - time transformation function 226 constant restoring force 252 continuity equation 258, 467 continuum limit 82-87 Copenhagen school of physics 6 correspondence - between hydrogenic bound states and scattering states 330 - between two-dimensional oscillator and three-dimensional Coulomb problem 304,333 - between two and three-dimensional oscillator 297 Coulomb, Charles-Augustin de 15, 304 Coulomb force field 15 Coulomb potential 304 - Fourier transform 427, 455 - in parabolic coordinates 314 - long-range effect of ~ 332 - virial theorem 310 Coulomb problem see hydrogenic atoms creation 41,43,54 - as though ~ of an atom 36 - symbolic act of ~ 38
Index 475 creation operators 377, 400 cross section - differential ~ 332, 372 - in center-of-mass frame 355 - Mott ~ 357 - Rutherford ~ 333, 355 - Thomson ~ 467, 471 current components 441 curvilinear coordinates 292-293 - distance 293 - gradient 293 - Laplacian 293 - volume element 293 cyclotron frequency 325 Darwin, Charles Galton 455 Darwin term 455 de Broglie, Prince Louis-Victor 17, 29 de Broglie wave 29 de Broglie wavelength 245 - local ~ 245 de Broglie's momentum-wavelength relation 203 degenerate states - in perturbation theory 366 degree of freedom - quantum ~ 76 --82 delta function 102,136 - as a limit 105 - identities 136,264 - transverse ~ 468 delta symbol 36, 50 density matrix 93 Descartes, Rene 156 destruction (see also annihilation) 36,54 - as though ~ of an atom 36 - symbolic act of ~ 38 determinant - of an operator 94 - of an operator product 95 deterministic theory 1-3,15 - statistically ~ 15 deuteron 355 diarnagnetic susceptibility 339 dipole approximation 464, 465 Dirac, Paul Adrien Maurice 38 Dirac's delta function see delta function division algebra 35 down-hill equation 317 driven oscillator - boundary conditions 272 - constant drive 274-276 -- energy eigenstates 275 - energy eigenvalues 275 -- time transformation function 276, 278 - wave functions 275 - equations of motion 272 - Hamilton operator 272 - slowly varying drive 276-278 - temporary drive 278-286 - time transformation function 274 dyadic product 38 dynamical principle 210 dynamical variables 196 - time dependence of ~ 196 effective Hamiltonian 449, 456 - first-order correction 458 effective potential 316, 408 eigenvalue 68 - multiplicity of an ~ 79, 81 - of non-Hermitian operator 125-132 - problem 80 eigenvector 68 - common ~s of J2 and Jz 152-155 - equation 96 - of non-Hermitian operator 125-132 - orthogonality of ~s 81 Einstein, Albert 4,17,191 Einstein relation 22 Einsteinian relativity 191 electric charge density 437 electric current density 437 electric field 46, 437 - force on test charge in ~ 46 - hydrogenic atoms in ~ 316-319 - longitudinal part 438 - transverse part 438 electromagnetic field 437 electromagnetic radiation - absorption 464 - commutation relations 440, 469 - constraints 437 - current components 441 - effective field 443 - effective Hamiltonian 449, 456 - first-order correction 458 - effective particle action 443 - energy density 469 - energy flux density 469 - equations of motion 437, 440 - field Lagrangian 439 - infrared photons 452
476 Index - interaction Lagrangian 439, 441, 454 - Lagrangian 437,439 - longitudinal fields 438-439 - mass renormalization 451,454,459 - mode expansion 440, 469 - multi-photon energy 442 - multi-photon wave functions 442 - orthonormality of modes 448 - particle Lagrangian 439, 454 - photon mode functions 447 - spontaneous emission 462, 464 - stimulated emission 464 - time-non-local effective action 448, 449, 456 - transverse fields 438-439 electron 20 electron charge 304 electron density 408 electron-electron interaction 405 electron-volt 184 electrostatic interaction energy 438 electrostatic potential 316 elementary particle 190 emission - spontaneous ~ 464 - stimulated ~ 464 end-point variation 210 energy 183 - conservation of ~ 197 - potential ~ 191 energy eigenstates - constant force 237, 238 - free particle 237 - harmonic oscillator 230 - linear potential 253 energy eigenvalues - charge in homogeneous magnetic field 325 - from time transformation function 230, 234 - harmonic oscillator 230, 271 - hydrogenic atoms 305 - linear potential 254 - parameter dependence of ~ 307-309 - two-dimensional oscillator 288 energy quantum number 304 equations of motion 195-197 - constant force 224 - driven oscillator 272 - electromagnetic radiation 437, 440 - for identical particles 386, 394 - for operator fields 396 - for the evolution operator 217 - free particle 223 - Hamilton's ~ 200, 207 - harmonic oscillator 226 - Heisenberg's ~ 196, 211 - non-Hermitian variables 270 - probability operator 217 - Schrodinger's ~ 197,211 ergodic theorem 75 Euclid of Alexandria 38 Euclidean geometry 61 Euclidean space 38 Euler, Leonhard 64 Euler's beta function integral 365 Euler's constant 369 Euler's factorial function integral 292 Euler's identity 64 Eulerian angles 162-169, 179 evolution operator 217 - equations of motion 217 - group property 217 exchange terms 407 exclusion principle 381 expectation value 48, 93,109-111,140 - of a unitary operator 140 factorial function 292, 350, 369 - Euler's integral 292 FD (Fermi-Dirac) 356 FD statistics 356 - and spin 358 - commutation relations 380 Fermi, Enrico 356 Fermi's golden rule 372 Fermi-Dirac see FD fermion 380 Feynman, Richard Phillips 308 fine structure constant 335, 430, 461 first quantization 383 flux - incident ~ 332, 466 - scattered ~ 332 Fock, Vladimir Alexandrovich 144 Fock representation 144 Fock states (see also stationary- uncertainty states) 144,377 force-free motion see free particle Fourier, Jean Baptiste Joseph 104 Fourier's integral theorem 104 free particle 202-207 - energy eigenstates 237 - equations of motion 223
Index 477 - Hamilton operator 223 - state density 232 - time transformation function 224 frequency 203 function - of a measurement symbol 50 - of an observable 50 functional derivative 396 Furry, Wendell Hinkle 451 Galilean invariance 183-191 Galilean-Newtonian relativity 183 Galilei, Galileo 1,183 gauge transformation 322, 438, 467 - infinitesimal ~ 467 Gauss, Karl Friedrich 104 Gauss's theorem 414 Gaussian Fourier integral 104, 235 generating function 129 - for Hermite polynomials 142 - for Laguerre polynomials 291,301, 302 - for Legendre polynomials 181 - for probability amplitudes 281 - for spherical harmonics 174 generator 150 - for operator fields 395 Gerlach, Walther 29 Glauber, Roy Jay 130 Grassmann, Hermann Giinther 400 Green, George 246 Green's identity 293 half-width 392 Hamilton, Sir William Rowan 183 Hamilton operator (see also Hamilto- nian) 183 - charge in electric and magnetic field 321 - charge in homogeneous magnetic field 324 - constant force 224 - driven oscillator 272 - for FD systems 394 - for many-electron atoms 405 - for operator fields 395 - for system of particles 190-191 - free particle 223 - harmonic oscillator 226 - identical particles 386 - three-dimensional oscillator 295 - two-dimensional oscillator 288 Hamilton's equations of motion 200, 207 Hamiltonian (see also Hamilton operator) 183 harmonic oscillator - dimensionless variables 230, 235, 269 -- driven ~ see driven oscillator - energy eigenstates 230 - energy eigenvalues 230, 271 - their multiplicity 230 - equations of motion 226 - Hamilton operator 226 - non-Hermitian variables 269 - three-dimensional isotropic ~ see three-dimensional oscillator - time transformation function 227, 271 - two-dimensional isotropic ~ see two-dimensional oscillator - virial theorem 310 -- wave functions 236, 271 - - asymptotic form 272, 299 Hartree, Douglas Rayner 407 Hartree equations 408 Hartree-Fock equations 407 - exchange terms 407 Harvard University VII Heaviside, Oliver 273 Heaviside's unit step function 273 Heisenberg, Werner 14, 85 Heisenberg's equation of motion 196 Heisenberg's non-commutativity 85 - and Max Born 85 - and Schrodinger's wave mechanics 87 Heisenberg's uncertainty principle 110 Heisenberg's uncertainty relation 110 - Robertson's general form of ~ 140 helicity 192,470 helium (4He) nucleus 356 helium (3He+) ion 337 Hellmann, Hans 308 Hellmann-Feynman theorem 308 Hermite, Charles 53 Hermite polynomials 118-123, 236 - differential equation 119 - generating function 142 - of low order 119 - orthogonality 143 - recurrence relations 119 - roots of the ~ 120
478 Index Hermitian conjugation see adjoint hydrogen ion (H~~) 431 hydrogenic atoms - axial vector 358-365 - geometrical significance 360 - circular Rydberg states 373, 471 - decay rate 471 - energy eigenvalues 305 - their multiplicity 305,315,319, 363 - fine structure 461 - ground-state wave function 363, 365 - in external electric field 316-319 - in external fields 365-368 - effective coupling strength 365 - in parallel electric and magnetic field 323 - radial expectation values 306,311, 312,318,333,338,339,373 - scattering states 328-333 - Schrodinger equation 314 - - in parabolic coordinates 314 - Stark-shifted energies 319 - their multiplicity 319 - virial theorem 310 - wave functions 307 hyperon 19 identical particles - annihilation operators 377 - commutation relations 380 - creation operators 377 - dynamics 386-387,392-394 - equations of motion 386, 394 - Hamilton operator 386, 394 - ladder operators 377 - Lagrangian 393 - measurement symbol 375 - modes 376 - occupied ~ 385 - multi-particle operators 384 - multi-particle states 385-386 - wave functions 399 - non-interacting ~ 397-403 - number operator of a mode 377 - one-particle operators 375, 381-384 - scattering of ~ 355-358 - Schrodinger equation 387 - single-particle energy 401 - time transformation function 397 - total number operator 377, 405 - two-particle operators 381-384 incident wave 243 (in)compatible properties 74, 78 independent systems 78 independent-particle energies 408 interference 41-46 - between incident and reflected wave 248 - constructive ~ 46 - destructive ~ 46 internal energy 190 internal variables 190,191 inverse - of a product 65 - of a unitary operator 65, 67 - of an ordered exponential 125 isospin 177 Jeffreys, Sir Harold 246 Kepler, Johannes 373 ket 38 kinetic energy 320 - of two-body system 344 kinetic momentum 321 Kramers, Hendrik Anthony 246 Kronecker, Leopold 36 Kronecker's delta symbol see delta symbol L —> r see left to right reading ladder operators 153, 377 Lagrange, Joseph Louis de 209 Lagrange operator see Lagrangian Lagrangian 209 - for BE systems 393 - for charge in electric and magnetic field 322 - for charge in electric field 320 - for FD systems 393 - for identical particles 393 - for operator fields 395 - with velocity dependence 320 Lagrangian density 437 Laguerre, Edmond 282 Laguerre equation 287, 294 Laguerre polynomials 282 - contour integral representation 330, 349 - differential equation 287 - first and last series terms 282 - generating function 291, 301, 302 - in hydrogenic wave functions 307 - of complex degree 330, 349
Index 479 - of low order 282 - orthogonality 292 - power series 282 - initial terms 283 - recurrence relations 286, 287 Lamb, Willis Eugene 461 Lamb shift 461 - Bethe's result 461 Landau, Lev Davidovich 334 Landau levels 334 Langer, Rudolph Ernest 336 Langer's correction 336 Laplace, Marquis de Pierre Simon 174 Laplace's equation 174, 297, 373 Laplacian differential operator - in curvilinear coordinates 293 - in parabolic coordinates 314 - in polar coordinates 293 - in spherical coordinates 296 - vector eigenfunctions 439 Larmor, Sir Joseph 463 Larmor formula 463, 467 left to right reading 36, 38, 41, 53 Legendre, Adrien Marie 173 Legendre's polynomial 173, 174, 347, 369 - expansion of plane wave 347 - generating function 181 - integral representation 181 - of low order 180 - orthogonality 180 light quantum 5 linear momentum (see also momentum) 183 - conservation of ~ 198 linear potential 252 - energy eigenstates 253 - energy eigenvalues 254 - WKB approximation 255 linear restoring force see harmonic oscillator local-oscillator approximation 229, 262 long-wavelength limit 465 Lord Rayleigh see Strutt, J. W. Lorentz, Hendrik Antoon 322 Lorentz force 322, 339 m.m. see magnetic moment magnetic dipole (see also magnetic moment) 30 - precession of ~ in magnetic field 34 magnetic field 30, 322, 387, 437 - energy of magnetic dipole in ~ 30 - force on magnetic dipole in ~ 30 - of uniformly moving charge 452 - precession of magnetic dipole in ~ 34 - rotating ~ 389 - torque on magnetic dipole in ~ 88, 387 magnetic moment (see also magnetic dipole) 29, 30, 38, 69, 387 - average ~ measured 40 - energy of ~ in magnetic field 30 - force on ~ in magnetic field 30 - torque on ~ in magnetic field 88, 387 many-electron atoms - binding energy 419, 424, 428 - exchange energy contribution 426-427 -- innermost electrons 424 -- oscillatory terms 428-430 - quantum corrections 425-426 - relativistic contribution 430 -- smooth part 428 - TF approximation 419 - effective potential 408 - electron density 408 - energy - as a functional of the density 409 -- as a functional of the effective potential 410 - Hamilton operator 405 - independent-particle energies 408 - strongly bound electrons 420-425 - total number operator 405 mass 186,190 - reduced ~ 344 - total ~ 344 mass renormalization 451, 454, 459 matrix representation 56-57 - of an adjoint 57 Maxwell, James Clerk 3, 29 Maxwell's equations 29 measurement - disturbance by ~ 46-48 - less selective ~ 33, 42 - non-selective ~ 44-46, 74 - selective ~ 31, 32, 34, 42, 44, 50, 78 - successive ~s 32,35,38-41,53 - that accepts everything 32, 42, 44, 48 - that rejects everything 32
480 Index measurement symbol (see also operator) 32-36,73 - addition of ~s 33, 88 - adjoint ~s 53-55 - as product of right and left vectors 37 - completeness of ~s 33, 98 - distributive law of multiplication of ~s 34,36 - for identical particles 375 - for physical property 48-50 - algebraic equation obeyed by ~~ 49 - function of a ~ 50 - Hermitian ~s 53-55 - multiplication of ~s 32 - non-commutativity of ~s 53 - order of multiplication of ~s 35 - self-adjoint ~s 54 - trace of a ~ 57 minimum-uncertainty state (see also uncertainty, states of minimal ~) 204 modes and particles 376 momentum (see also linear momentum) 188 - relative ~ 343 - total ~ 343 - vector 188 Mott, Sir Nevill Francis 357 Mott cross section 357 neutrino 337 neutron 177 Newton, Sir Isaac 1, 29 Newton's laws 29 non-Hermitian variables - equations of motion 270 - generators 269 - Lagrangian 269 nuclear charge 304 nuclear forces 347, 352, 357 nucleon 177 nucleus-electron interaction 405 null symbol 32 number operator 377, 405 observable 48-50 - function of an ~ 50 operator (see also measurement symbol) 67 - acting on vector 68 - compatible ~ pairs 78 - determinant of an ~ 94 - eigenvalue of an ~ 68 - eigenvector of an ~ 68 - fundamental pair of ~s 74 - inverse of an ~ 65, 94 - matrix representation of an ~ 56-57 - non-Hermitian ~s y, y^ 114 - normal ~s 96 - period of an ~ 70 - represented by differential ~ 87 - symmetrized product of two ~s 214,219 - that permutes cyclically 69, 71 - unitary ~ bases 69-76 - unitary ~s 67-76 - as function of Hermitian ~s 71 - reciprocal definition of two ~~s 72,77 -- that differ little from 1 140 -- that permute rows or columns 77 operator fields 395-397 - commutation relations 395 - equations of motion 396 - generator 395 - Hamilton operator 395 - Lagrangian 395 operator space 98 optical theorem 371 ordered exponential function 124-125 orthogonality - of eigenvectors 81 - of Hermite polynomials 143 - of Laguerre polynomials 292 - of Legendre polynomials 180 - of minimal-uncertainty states 131 - of the q eigenstates 107 orthonormality - of Airy functions 264 - of measurement symbols 98 - of radial oscillator wave functions 291, 295, 298 - of radiation field mode functions 439 - of spherical harmonics 174 parabolic coordinates 313, 316, 329 particle 375 - identical ~s see identical particles particles and modes 376 Pauli, Wolfgang 51 Pauli matrices 56, 58,156 Pauli operators 50-53, 56
Index 481 - Hermitian property of ~ 55 Pauli vector operator 53 - two arbitrary components 63-67 Pauli's exclusion principle 381 period - of an operator 70 periodic table 419 permutation operator 160, 178 perturbation theory 319,337,365 - for degenerate states 366 phase angle - imaginary ~ 47 - random ~ 47 phase space 231, 262 photoelectric effect 5 photon 5, 20,192, 442 7C meson 19,177 pion 177 Planck, Max Karl Ernst Ludwig 7, 184 Planck's constant 7,184 Planck's energy-frequency relation 203 plane waves 203 - as photon mode functions 447 - expansion in Legendre polynomials 347 Poisson, Simeon Denise 215 Poisson bracket 215, 220 Poisson distribution 280 Poisson sum formula 411, 422 Poisson's equation 409, 413, 438 polar coordinates 132, 290, 293 polarization - circular ~ 447, 470 - linear ~ 447, 470 polarization sum 451, 454, 458, 464 polarization vector 447 position vector 188 - of the center of mass 343 - relative ~ 343 positron 22 positronium 345,373 potential energy 191,320 potential momentum 321 Poynting, John Henry 469 Poynting vector 469 Poynting's theorem 469 principal quantum number 304 principle of stationary action (see also stationary action principle) 210 - as fundamental dynamical principle 210 probability 38-41,53 - as a fraction of possible outcomes 40 - as absolute square of a ~ amplitude 43 - for compound measurement 43 probability amplitude (see also wave function) 41-46, 62, 68 probability current density 258 probability density 257, 258 probability operator 93,145 -- equations of motion 217 - for non-selective measurement 94 - for selective measurement 94 propagation vector 203 proton 177,355 quanta 12 quantized field 18-22 quantum action principle (see also action principle) 209, 214 - permissible variations 214-216 quantum electrodynamics 23 quantum number - energy ~ 304 - magnetic ~ 323 - principal ~ 304 - radial ~ 292 radial quantum number 292, 336 radiation gauge 438 Rayleigh-Ritz method 255-257, 406 reading dextrally see left to right reading reading sinistrally see right to left reading reduced mass 344 reduced wave number 203 reduced wavelength 203 reflected wave 243 reflection operator 138,144 relative momentum 343 relative motion 345 relative position vector 343 relative speed 346 relative velocity 346 renormalization 23 renormalized mass 451 resonance 392 right to left reading 53 Ritz, Walther 256 Robertson, Howard Percy 140 rotation 183,187
482 Index - finite ~ 161-168 - infinitesimal ~ 150-152,158,184 row (see also row vector) 59, 60 row vector 59-63 Rutherford, Lord Ernest 6, 333 Rutherford cross section 333, 355, 356 Rydberg, Janne 305 Rydberg energy 305 scalar potential 437 scale change - infinitesimal ~ 309 - - generator of —* 309 scattering - elastic ~ 466 - of identical particles 355-358 - of photon by charge at rest 465 - of two BE particles 356 - of two FD particles 356 - proton-proton ~ 347 - Rutherford ~ 328-333, 346-352 - Thomson ~ 465-467 scattering amplitude 370 scattering angle 332 scattering phase shift 354 Schrodinger, Erwin 14, 29 Schrodinger equation 29, 201 - differential ~ for p wave functions 201 - differential ~ for q wave functions 201 - differential ~ of motion 197 - for identical particles 387 - radial ~ 303 - relativistic ~ 335 - symbolic ~ 200 - time-independent ~ 238 Schrodinger's differential equation of motion (see also Schrodinger equation) 197 Schrodinger's wave mechanics 86 - and Heisenberg's non-commutativity 87 second quantization 158, 375, 382, 383 short-range forces 352-355 silver atom 30, 31 solid harmonics 174, 297, 373 Sommerfeld, Arnold 335 Sommerfeld's fine structure constant 335, 430, 461 spectrum 108 speed of light 183,191,321 spherical Bessel functions 369 spherical coordinates 60,173, 295 spherical harmonics 174 - generating function 174 - orthonormality 174 spherical wave 351 spin 157,188 - general ~ dynamics 387-392 - wave functions for ~ states 391 spin and statistics 358 spontaneous emission 462, 464 spread 109-111,140 Spur see trace Stark, Johannes 319 Stark effect 373 - linear ~ 319 state density - constant force 233 - free particle 232 state specification 2-17 state vectors 36-38 stationary action principle (see also principle of stationary action) 210, 214 stationary-uncertainty state 144, (see also uncertainty, states of stationary ~) 230,306 statistics and spin 358 step function 273 Stern, Otto 29 Stern-Gerlach apparatus 88 - rotated ~ 38 Stern-Gerlach experiment 29-31, 50, 69, 88, 218 Stern-Gerlach interferometer 261 Stern-Gerlach measurement 68 - successive ~s 38-41,89,95 stimulated emission 464 Stirling, James 133 Stirling's asymptotic expansion 133 Stokes, Sir George Gabriel 439 Stokes's theorem 439 strongly bound electrons 420-424 - and break-down of TF approximation 421 - energy correction due to ~ 424 Strutt, John William 256 sum rule - for Bessel functions 285 - for matrix elements 91 symmetrized product 214 Taylor, Brook 129 Taylor series 129,138,172,281
Index 483 test charge 46 - force on ~ in electric field 46 TF (Thomas-Fermi) 412 TF approximation 412 - binding energy 419 - corrected for strongly bound electrons 424 - exchange energy correction 426-427 - quantum corrections 425-426 - break-down of ~ 420 - electron density 413, 435 - at the site of the nucleus 435 - energy 415,435 - scaling transformations 434 strongly bound electrons 420-424 TF differential equation 414 - boundary conditions for ions 433 TF function 414 - asymptotic form 432 - initial slope 415,419,433 - integrated ~ 426 - plot 415 theory - causal ~ 1-3,15 - deterministic ~ 1-3,15 - statistically deterministic ~ 15 Thomas, Llewellyn Hilleth 412 Thomas-Fermi see TF Thomson, Sir John Joseph 467 Thomson cross section 467, 471 Thomson scattering 465-467 three-dimensional oscillator - decay rate 470 - energy eigenvalues 297 - their multiplicity 298 - Hamilton operator 295 - orthonormality of radial wave functions 298 - radial expectation values 311-313 - transition rates 470 time 183 time dependence - explicit ~ 196 - implicit ~ 196 - of dynamical variables 196 - of probability operator 217 - parametric ~ 196 time displacement 186,195 time transformation function 204 - and energy eigenvalues 230, 234 - and wave functions 234 - constant force 226 - driven oscillator 274, 276, 278 - free particle 224 - harmonic oscillator 227, 271 - identical particles 397 - short times 227-229 total mass 344 total momentum 343 trace 57-59,74 - cyclic property of the ~ 97 - in terms of matrix elements 58 - of a commutator 91 - of a product 59 - of the unit symbol 58 transformation function 68 - as generating function 272 - between Hermitian and non- Hermitian variables 270 transition matrix elements 462 translation 183,187 - infinitesimal ~ 185 transmitted wave 243 transverse delta function 468 trial wave function 257 tritium 337 triton 337,355 two-body system - center-of-mass position 343 - Hamilton operator 343 - kinetic energy of ~ 344 - Lagrangian 344 - external part 345 -- internal part 345 - reduced mass 344 - relative momentum 343 - relative motion 345 - relative position 343 - relative speed 346 - relative velocity 346 - scattering 346-358 - total mass 344 - total momentum 343 two-dimensional oscillator - energy eigenvalues 288 - - their multiplicity 288 - generating function for wave functions 290 - Hamilton operator 288 - orthonormality of radial wave functions 291,295 - radial expectation values 306, 333 two-dimensional rotation 288 - generator of infinitesimal ~ 289
484 Index UCLA VIII uncertainty - Heisenberg's ~ principle 110 - Heisenberg's ~ relation 110 - states of minimal ~ 111-114 - states of stationary ~ 114-118, 135 uncertainty principle 110 uncertainty relation 110, 140, 218, 327 unit symbol 32-35, 50 - trace of ~ 58 unitary geometry 59-67 - dimensionality of a ~ 80 - metrical relations of a ~ 67 - unit vectors of a ~ 62 unitary transformation 67-69 - infinitesimal ~ 140, 149-150 - generator of ~~ 150,183 - order of ~~s 184 - of an operator function 83 - of operators 149 - of vectors 149 - sequence of ~s 140-142, 164 up-hill equation 317 vacuum 20,385,398,400 vacuum persistence amplitude 469 vector - adjoint ~s 54 - completeness relation for ~s 37, 44 - left ~ 38, 43 - orthonormal ~ set 69, 71 - right ~ 38, 43 - state ~s 36-38 - unit orthogonal ~s 37 vector potential 322, 437 - for homogeneous magnetic field 322-323 - longitudinal part 438 - transverse part 438 virial theorem 310, 366 - Coulomb potential 310 - harmonic oscillator 310 - hydrogenic atoms 310 volume element 293 - in curvilinear coordinates 293 - in spherical coordinates 296 von Neumann, John 217 von Neumann equation 217 wave - incident ~ 243, 332 - plane ~ 329 - photon mode function 447 - reflected ~ 243 - scattered ~ 332 - spherical ~ 329 - transmitted ~ 243 wave function (see also probability amplitude) 46,59-63,68,69 - azimuthal ~ 291 - even ~ 253 - for spin state 391 - hydrogenic ~ 307 - odd ~ 253 - radial ~ 291 - trial ~ 257 wave number 203 - reduced ~ 203 wave number cells 447 wave vector 203 wave-particle duality 17 wavelength 203 - Compton ~ 461, 471 - de Broglie ~ 245 - reduced ~ 203 Weisskopf, Victor Frederick 451 Wentzel, Gregor 246 Wentzel-Kramers- Brillouin see WKB Weyl, Claus Hugo Hermann 110 WKB (Wentzel-Kramers-Brillouin) 246 WKB approximation 243-248, 266 - for energy eigenvalues 266,410 - - Langer's correction 336 - linear potential 255 - scaled momentum in ~ 246 - unified ~ 246 Zeeman, Pieter 323 Zeeman effect - normal ~ 323
T^mf^ JULIAN SCHWINGER Quantum Mechanics ulian Schwinger, who shared the 1965 Nobel Prize for physics with Richard Feynman and Sin-Itiro Tomonaga for his pioneering work on quantum electrodynamics, had a considerable influence on the conceptual development of modern quantum field theory. In addition to being an extremely productive researcher he was also a brilliant teacher, and this book demonstrates his outstanding ability to expose a difficult subject in a clear and concise style. In marked contrast to many textbooks on quantum physics, his approach is inductive rather than deductive. The I SUN ^-s to ti U'.S 8 , "V ,,,' ft / //): /Av inv. 5/) r /»g e r. de