Author: Lions J-L.   Dautray R.  

Tags: mathematics  

ISBN: 3-540-19045-7

Year: 2000

Text
                    Robert Dautray. Jacques-Louis Lions
.


Mathematical
Analysis and
, Numerical Methods
for Science and
Technology


Volume 2
Functional and Variational Methods





Mathematical Analysis and Numerical Methods for Science and Technology ,
Robert \.Dautray Jacques-Louis Lions \ '/ Mathell1atical Analysis and NUll1erical Methods /1 for Science and Technology v 1,;:/l' /l)£f h Q v, Volume 2 Functional and Variational Methods , With the Collaboration of Michel Artola, Marc Authier, Philippe Benilan, Michel Cessenat, Jean-Michel Combes, Hehne Lanchon, Bertrand Mercier, Claude Wild, Claude Zuily Translated from the French by Ian N. Sneddon Springer- Verlag Berlin Heidelberg New York London Paris Tokyo  {;; 
Robert Dautray Ecole Poly technique F-92128 Palaiseau Cedex, France J acques- Louis Lions College de France 11 place Marcelin Berthelot F-75005 Paris, France Title of the French original edition: Analyse mathematique et calcul numerique pour les sciences et les techniques, Masson, S.A. (Q Commissariat a l'Energie Atomique, Paris 1984, 1985 With 20 Figures Mathematics Subject Classification (1980): 35-XX, 41-XX, 42-XX, 44-XX, 45-XX, 46-XX, 47-XX, 65-XX, 73-XX, 76-XX, 78-XX, 80-XX, 81-XX ISBN 3-540-19045-7 Springer-Verlag Berlin Heidelberg New York ISBN 0-387-19045-7 Springer-Verlag New York Berlin Heidelberg Library of Congress Cataloging-in-Publication Data Dautray, Robert. [Analyse mathematique et ca1cul numerique pour les sciences et les techniques. English] Mathematical analysis and numerical methods for science and technology / Robert Dautray, Jacques- Louis Lions; translated from the French by Ian N. Sneddon. p. cm. Translation of: Analyse mathematique et calcul numerique pour les sciences et les techniques. Bibliography: v. 2, p. Includes index. Contents: - v. 2. Functional and variational methods. ISBN 0-387-19045-7 (U.S.: v. 2) 1. Mathematical analysis. 2. Numerical analysis. 1. Lions, Jacques-Louis. II. Title. QA300.D34313 1988 515-dc19 88-15089 CIP This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broad- casting, reproduction on microfilms or in other ways, and storage in data banks. Duplication of this publication or parts thereof is only permitted under the provisions of the German Copyright Law of September 9, 1965, in its version of June 24, 1985, and a copyright fee must always be paid. Viola- tions fall under the prosecution act of the German Copyright Law. (Q Springer-Verlag Berlin Heidelberg 1988 Printed in Germany Typesetting: Macmillan India Limited, Bangalore Printing and binding: Konrad TriItsch GmbH, Wiirzburg 2141/3140-543210 - Printed on acid-free paper 
Introduction to Volume 2 This second volume (which contains Chaps. III to VII) begins by introducing some fundamental techniques: first of all, Fourier series (Chap. III) and the Fourier transform (Appendix on Distributions) and by way of complement, the Hankel and Mellin transforms (the Laplace transform comes in Chap. XVI, Vol. 5); finally, for its importance in numerical applications, the method of the fast Fourier transform. Chapter IV introduces the Sobolev spaces which play a decisive role in the theory of partial differential equations as well as in approximation procedures. These chapters III and IV both make use of the theory of distributions, which is the subject of an Appendix (which can be read independently). Chapter V is a study of the linear differential operators in a sufficiently general context. This setting goes a little beyond the strict needs of the applications indicated in this work, but the generality introduced allows us better to make the essentials more clear.We highlight the role of characteristics and the classi- cal classification of linear differential operators into elliptic, parabolic and hyperbolic operators. For boundary value problems, a possible formulation is obtained by the use of unbounded operators (the "domain" of the unbounded operator correspond- ing, for example, to the boundary conditions, supposed homogeneous); this is why it has seemed reasonable to include (in Chap. VI) a review of the principal concepts relating to operators in Banach or in Hilbert spaces. The last chapter of this volume, Chap. VII, introduces the very powerful variational methods, which, together with the Sobolev spaces, play the most important role throughout the theory (but which naturally are not the only ones, and, besides, are not always applied!). We give below the authors of various contributions, chapter by chapter. Chapter III: M. Cessenat, B. Mercier, C. Zuily. Chapter IV: M. Artola, M. Cessenat, C. Zuily. Chapter V: P. Benilan. Chapter VI: M. Artola, M. Cessenat, J.-M. Combes, C. Wild. Chapter VII: M. Authier, M. Artola, P. Benilan, M. Cessenat, H. Lanchon, B. Mercier. Appendix. "Distributions": M. Artola, M. Authier, M. Cessenat. Finally we mention the partial contributions of the following assistants of the Commissariat a l'Energie Atomique: Messrs. Batail, Gambaudo, Giorla, Sznit- man, Verwaerde. 
VI Introduction to Volume 2 Practical Guide for the Reader Designation of subdivisions of the text: number of a chapter: in Roman numerals number of part of a chapter: the sign 9 followed by a numeral number of section: a numeral following the above number of a sub-section: a numeral following the above. Example. II, 9 3.5.2 denotes Chapter II, Part 3, section 5, sub-section 2. The reader wishing to become acquainted rapidly with the mathematical and numerical essentials of the subject will find them in V ols. 1 and 2 omitting at afirst reading 99 5, 6,7 and 8 of Chapter II (vol. 1) and 994, 5 of Chapter V (vol. 2). These parts are distinguished by an asterisk at the appropriate part of the text, and also in table of contents. 
Table of Contents Chapter III. Functional Transformations Introduction Part A. Some Transformations Useful in Applications 9 1 . Fourier Series and Dirichlet's Problem 1. Fourier Series 1.1. Convergence in L 2 (If) 1.2. Pointwise Convergence on If 2. Distributions on If and Periodic Distributions 2.1. Comparison of' (If) with the Distributions on 1R 2.2. Principal Properties of ' (If) . 3. Fourier Series of Distributions 4. Fourier Series and Fourier Transforms. 5. Convergence in the Sense of Cesaro 6. Solution of Dirichlet's Problem with the Help of Fourier Series 6.1. Dirichlet' s Problem in a Disk . 6.2. Dirichlet's Problem in a Rectangle. The Mellin Transform 1. Generalities 2. Definition of the Mellin Transform 3. Properties of the Mellin Transform 4. Inverse Mellin Transform. 5. Applications of the Mellin Transform 6. Table of Some Mellin Transforms . 2. 3. The Hankel Transform . . 1. Generalities 2. Introduction to Bessel Functions 3. Definition of the Hankel Transform . 4. The Inversion Formula 5. Properties of the Hankel Transform 6. Application of the Hankel Transform to Partial Differential Equations 1 4 4 4 5 5 7 7 9 10 14 15 17 17 20 24 24 26 28 30 32 40 40 40 42 47 48 50 53 
VIII Table of Contents 6.1. Dirichlet's Problem for Laplace's Equation in R.. The Case of Axial Symmetry. . . . . . . . . . . . . 6.2. Boundary Value Problem for the Biharmonic Equation in R., with Axial Symmetry . . . . . . . . . 7. Table of Some Hankel Transforms . . . . Review of Chapter III A . . . . . . . . Part B. Discrete Fourier Transforms and Fast Fourier Transforms 9 1 . 9 2 . 9 3 . 9 4 . 9 5 . Introduction . . . . . . . . . . . . . . . . . Acceleration of the Product of a Matrix by a Vector . . . . The Fast Fourier Transform of Cooley and Tukey . . . . . . . The Fast Fourier Transform of Good-Winograd. . . . . . . . Reduction of the Number of Multiplications . . . . . . . . . 1. Relation Between the Discrete Fourier Transform and the Problem of Cyclic Convolution . . .. .... 2. Complexity of the Product of Two Polynomials . . . . . . . 3. Application to the Cyclic Convolution of Order 2 . . 4. Application to the Cyclic Convolution of Order 3 . . . . . . 5. Application to the Cyclic Convolution of Order 6 . . . . . . Fast Fourier Transform in Two Dimensions. .. ... 96. 9 7 . Some Applications of the Fast Fourier Transform . . . . . 1. Solution of Boundary Value Problems . . .. ..... 2. Regularisation and Smoothing of Functions. ..... 3. Practical Calculation of the Fourier Transform of a Signal 4. Determination of the Spectrum of Certain Finite Difference Operators and Fast Solvers for the Laplacian . ..... Review of Chapter III B . . . . . . . . . . . . . . . . . . . . Chapter IV. Sobolev Spaces Introduction . . . . . . . 9 1 . Spaces Hi (Q), Ir(Q) . . . . . . . . . 9 2 . The Space Ir (R.n). . . . . . . . . . . 1. Definition and First Properties . . . . 2. The Topological Dual of HS(R.n) . . . 3. TheEquation(-L1+k 2 )u==f in R.n,kER.\{0} . 93. Sobolev's Embedding Theorem . . . . . . . . . 9 4 . Density and Trace Theorems for the Spaces Hm(Q), (m E N * == N\ {O} ) . . . . . . . . . . . . . . . 53 55 57 57 59 59 62 64 66 69 69 71 72 73 75 77 78 78 81 83 84 91 92 92 96 96 98 100 100 102 
Table of Contents IX 1. A Density Theorem. . . . . . . . . . . 102 2. A Trace Theorem for Hi (R.) . . . . . . . . . . . . . . 107 3. Traces of the Spaces Hm(R.) and Hm(Q) . . . 113 4. Properties of m- Extension . . . . . . . . . . 114 9 5 . The Spaces H-m(Q) for all mEN . . . . . . . . 120 96. Compactness . . . . . . . . . . . .. ...... 123 97. Some Inequalities in Sobolev Spaces . . . . . . . . . 125 1. Poincare's Inequality for H6 (Q) (resp. H'O (Q)). . . . . . 125 2. Poincare's Inequality for Hi (Q). . .. ... ... 127 3. Convexity Inequalities for Hm (Q) . . . . . . . 133 98. Supplementary Remarks . . . .. ... 138 1. Sobolev Spaces W m , P (Q).. ... . . . . . . 138 1.1. Definitions ....... ... ...... 138 1.2. So bolev Injections . . . . . . . 139 1.3. Trace Theorems for the Spaces Wm,P(Q) ...... 140 2. Sobolev Spaces with Weights ... ........ 141 2.1. Unbounded Open Sets . . . . . . . . . . . . . 141 2.2. Polygonal Open Sets . . . . . . . . .. .... 141 Review of Chapter IV . . . . .. ... . . . . . . 142 Appendix: The Spaces HS(r) with r the "Regular" Boundary of an Open Set Q in R.n . . . . . . . . . . . . . . . . . . . . 143 Chapter V. Linear Differential Operators Introduction . . . . . . . . . . . . . 91. Generalities on Linear Differential Operators . .. .... 1. Characterisation of Linear Differential Operators .... 2. Various Definitions . . . . . . . . . . . . . . . . . . . 2.1. Leibniz'sFormula . . . . . . . . . . .. ... 2.2. Transpose of a Linear Differential Operator . . . . . . 2.3. Order of a Linear Differential Operator ...... 3. Linear Differential Operator on a Manifold . . . . . . . . 4. Characteristics . . . . . . . . . . . . . . . . . 4.1. Concept of Characteristics . . .. ........ 4.2. Bicharacteristics . . . . . . .. ........ 5. Opera tors with Analytic Coefficients. Theorems of Cauchy-Kowalewsky and of Holmgren . 92. Linear Differential Operators with Constant Coefficients 1. Study of a l.d.o. with Constant Coefficients by the Fourier Transform . . . . . . . . . . . . . . . . 148 149 149 152 152 153 154 155 157 157 159 163 170 171 
x Table of Contents 1.1. Existence of a Solution of Pu == f in the Space of Tempered Distributions . . . . . . . . . . 171 1.2. Example 1: The Laplacian . . . . . .. ... 173 1.3. Elliptic and Strongly Elliptic Operators . .. ... 174 1.4. Hypo-Elliptic and Semi-Elliptic Operators . . . 177 1.5. Examples . . . . . . . . . . . . . . . . . . . . . 179 1.6. Reduction of Operators of Order 2 in a Homogeneous, Isotropic "Medium" . . . . . . . . . . . . . . 181 2. Elementary Solutions of a l.d.o. with Constant Coefficients. . 182 2.1. Introduction . . . . . . . . . . . . . . . . . . . . 182 2.2. Elementary Solutions in !/' Examples . . . . . . . 184 2.3. Elementary Solution with Support in a Salient Closed Convex Cone: Hyperbolic Operator . . . . . . . . 189 3. Characterisation of Hyperbolic Operators . . . . . . . . . 191 3.1. Characteristics of a l.d.o. with Constant Coefficients. . . 191 3.2. Algebraic Characterisation of Hyperbolic Operators . 194 3.3. Hyperbolic Operators of Order 2 . . . . .. ... 198 4. Parabolic Operators. . . . . . . . . . . . . . . . . . . 202 3. Cauchy Problem for Differential Operators with Constant Coefficients . . . . . . . . . . . . . . . . . 204 1. Cauchy Problem and the Elementary Solution in {0' (JRn x JR +) 205 2. Propagation in Hyperbolic Cauchy Problems . . . . . . . . 209 3. Choice of a Functional Space: Well-Posed Cauchy Problem. . 214 4. Well-Posed Cauchy Problem in!/' . . . . . . . . . . . . 217 5. Parabolic and Weakly Parabolic Operators . . . . . 221 6. Study of the Particular Case P == a/at + Po . . . . . . . . . 223 6.1. Analysis of One-Dimensional Case . . . .. ... 223 6.2. Case in which Po is Strongly Elliptic . . . 224 6.3. Schrodinger Operator . . . . . . . . . .. ... 225 7. Well-Posed Cauchy Problem in {0': Hyperbolic Operators. 226 4. Local Regularity of Solutions*. . . . . . . . . . . . 230 1. Characterisation of Hypo-Ellipticity. . . . . . . . . . 230 1.1. Necessary Condition for Hypo-Ellipticity . . . . . . . 230 1.2. Algebraic Transformation of the Necessary Condition for Hypo-Ellipticity. . . . . .. ........ 232 1.3. The Principal Result . .. ........... 233 2. Analyticity of Solutions . . . . . .. ... ... 234 2.1. Statement of Results . . . . . . . . . . . . . . . . 234 2.2. Estimates of Analyticity .. ........... 237 2.3. Generalisation: Gevrey Classes .. ... ... 240 3. Comparison of Operators . . . . . . . . . . . . . . . . 241 4. Local Regularity for Operators with Variable Coefficients and of Constant Force . . . . . . . . . . . . . . . . . 245 5. Construction of an Elementary Solution . . . . . . . . 247 
Table of Contents 9 5 . The Maximum Principle * . . . . . . . . . . . 1. Prerequisites . . . . . . . . . . . . . . . . . 2. Parabolic Maximum Principle and Dissipativity. . . . . . . 3. Characterisation of Operators P Satisfying Maximum Principles . . . . . . . . . . . . 3.1. The Weak Maximum Principle . . . . . . . . 3.2. The Comparison Principle . . . . . . . . . . . . . . 3.3. The Strong Maximum Principle. . . . . . . . 3.4. The Principle of the Strong Parabolic Maximum . . . . Review of Chapter V . . . . . . . . . . . . . . . . . . . . . Chapter VI. Operators in Banach Spaces and in Hilbert Spaces XI 250 250 252 259 259 261 263 265 268 Introduction 269  1. Review of Functional Analysis: Banach and Hilbert Spaces 270 1. Locl1y Convex Topological Vector Spaces. N ormed Spaces and Banach Spaces 270 2. Linear Operators 274 3. Duality . 281 4. The Hahn-Banach Theorem and its Applications 282 4.1. Problems of Approximation 282 4.2. Problems of Existence . 283 4.3. Problems of Separation of Convex Sets 285 5. Bidual, Reflexivity, Weak Convergence, Weak Compactness 285 5.1. Bidual 285 5.2. Reflexivity . 286 5.3. Weak Convergence 287 5.4. Weak Compactness 289 5.5. Weak-Star Convergence 290 6. Hilbert Spaces 291 6.1. Definitions 291 J 6.2. Projection on a Closed Convex Set 295 6.3. Orthonormal Bases 299 6.4. The Riesz Representation Theorem. Reflexivity 302 7. Ideas About Functions of a Real or Complex Variable with Values in a Banach Space 304 7.1. Weak Topology 304 7.2. Weak Differentiability . 304 7.3. Weak Holomorphy 305 9 2 . Linear Operators in Banach Spaces 305 1. Generalities on Linear Operators 305 1.1. Domain, Kernel and Image of a Linear Operator 305 
XII Table of Contents 1.2. Nullity and Deficiency Indices . . . . . . . . 1.3. Basic Properties of Linear Operators . . . . . . . 2. Spaces of Bounded Operators . . . . .. ...... 2.1. Introduction . . . . . . . . . . . . 2.2. Various Concepts of Convergence of Operators . 2.3. Composition and Inverse of Bounded Operators . . . . 2.4. Transpose of a Bounded Operator .... . . 2.5. Some Classes of Bounded Operators ..... 2.6. Some Ideas on Functions of a Real or Complex Variable with Operator Values; Families of Operators . . . . 3. Closed Operators . . . . . . . . . . 3.1. Definition and Examples ............ 3.2. Basic Properties . . . . . . . . . . . . . . . . 3.3. The Set g- (X, Y) of Closed Operators from X into Y 3.4. Transpose of a Closed Operator . .. ...... 3.5. Operators with Closed Image. .. ...... 93. Linear Operators in Hilbert Spaces . . . . . . . 1. Bounded Operators in Hilbert Spaces . . . . . . . . . 1.1. Adjoint Sesquilinear Form . . . . . . 1.2. Hermitian Operators . . . . . . 1.3. Orthogonal Projectors. . . . . . 1.4. Isometries and Unitary Operators. 1.5. Hilbert-Schmidt Operators. . . . . . . . . . . . 2. Unbounded Operators in Hilbert Spaces 2.1. Adjoint of an Unbounded Operator. . 2.2. Symmetric Operators . . . . . . 2.3. The Cayley Transform. . . . . . . . . . . . . . 2.4. Normal Operators . . . . . . . . . 2.5. Sesquilinear Forms and Unbounded Operators . Review of Chapter VI . . . . . . . . . . . . . . . . . . . . . Chapter VII. Linear Variational Problems. Regularity Introduction . . . . . . . . . . . g 1. Elliptic Variational Theory . . . . . . . . . 1. The Lax-Milgram Theorem . . . 2. First Examples . . . . . . . . . . . 2.1. Example 1. Dirichlet Problem .... . . . . . · 2.2. Example 2. Neumann Problem 3. Extensions in the Case in which V and H are Spaces of Distributions or of Functions . . . . . . . . . . . . . 4. Sesquilinear Forms Associated with Elliptic Operators of Order Two . . . . . . . . . . . . . . . . . . . . . 307 307 310 310 313 316 322 325 332 334 334 336 339 342 346 348 351 353 353 354 355 357 361 361 361 362 366 367 374 375 375 376 378 379 380 383 384 
Table of Contents g2. 5. Sesquilinear Forms Associated with Elliptic Operators of Order 2 m . . . . . . . . . . . . . . . . . . . . . . 6. Miscellaneous Remarks . . . . . . . . . . . . . . . . . 7. Application to the Solution of General Elliptic Problems (of Dirichlet Type) . . . . . . . . . . . . . . . Examples of Second Order Elliptic Problems . . . . . 1. Generalities . . . . . . . . . . .. ... ... 2. Examples of Variational Problems. .. ........ 2.1. Mixed Problem. . . . . . . .. ... ... 2.2. Non-Local Boundary Conditions . . . . . 3. Problems Relative to Integro-Differential Forms on Q + r 3.1. Problem of the Oblique Derivative . . . . . . 3.2. Robin's Problem . . . . . . .. ... ... 4. Transmission Problem. . . . . . . . . . . . . . 5. Miscellaneous Remarks . . . . . . . . . . . . . 6. Application: Stationary Multigroup Equation for the Diffusion of N e u tr 0 n s . . . . . . . . . . . . . . . .. ... 7. Application: Statical Problems of Elasticity. . . . . . . . . 7.1. Introduction. . . . . . . . . . 7.2. Variational Formulation. . . . . . . 7.3. Korn's Inequality. . . . . . 7.4. Application to Problem (2.39) 7.5. Inhomogeneous Problem 8. Statical Problems of the Flexure of Plates . . . g3. Regularity of the Solutions of Variational Problems . . . . . . 1. Introduction . . . . . . . . . . . . . . . 2. Interior Regularity . . . . . . . . . . . . . . . . . 3. Global Regularity of the Solutions of Dirichlet and. Neumann Problems for Elliptic Operators of Order 2 4. Miscellaneous Results on Global Regularity . . . . . . . . 5. Green's Functions . . . . . . . . . . . . . . . . . . . 5.1. Case of the Laplacian in a Bounded Open Set Q with Dirichlet Condition. . . . . . . . . . . . . . . 5.2. Some Other Particular Examples . . . . . . . 5.3. Green's Functions in a More General Setting. . . . . . Review of Chapter VII . . . . . . . . . . . . . . . . Appendix. "Distributions" g 1. Definition and Basic Properties of Distributions . 1. The Space  (Q) . . . . . . 1.1. Definition . . . . . . . . . . . . . XIII 387 389 391 393 393 394 394 397 398 398 400 400 405 407 411 411 412 414 418 418 420 425 425 426 433 437 441 441 445 451 456 457 457 457 
XIV Table of Contents 1.2. Elementary Properties of the Convolution Product of Two Functions. . . . . . . . . . . . . . . . . . 458 1.3. A Procedure for the Construction of Functions of f0(Q) . 459 1.4. The Notion of Convergence in f0 (Q) . .. .... 460 1.5. Some Inclusion and Density Properties . . . . . . . . 461 2. The Space f0' (Q) of Distributions on Q . . . . . . . . 463 2.1. Definition of Distributions and the Concept of Convergence in f0' (Q). . . . . . . . . . . . . . . . . . 463 2.2. First Examples of Distributions: Measures on Q 464 2.3. Differentiation of Distributions. Examples . . . 467 2.4. Support of a Distribution. Distributions with Compact Support. . . . . . . . . . . . . . . . . . 474 3. Some Elementary Operations on Distributions . . . . . . . 476 3.1. Product by a Function of Class rc oo . . . . . . . . . . 476 3.2. Primitives of a Distribution on a Interval of JR . . . . . 477 3.3. Tensor Product of Two Distributions . . . . . . . 480 3.4. Direct Image and Inverse Image of a Function and of a Distribution by a Function of Class rc oo 481 4. Some Examples .. . . . . . . . . 482 4.1. Primitives of the Dirac Measure . . . . . . . . . 482 4.2. A Division Problem (Case n == 1) . . . . . 483 4.3. Derivative of a Function of JRn Discontinuous on a Surface . . . . . . . . . . . . . . . . 484 4.4. Distributions Defined by Inverse Image from Distributions on the Real Line . . . . . . . . . . . . . . . . . . 485 g2. Convolution of Distributions . . . . . . . . . . . . . . . . 492 1. Convolution of a Distribution on JRn and a Function of f0 (JR n) . . . . . . . . . . . . . . . . . . . . . . . 492 2. Convolution of Two Distributions of Which One (at Least) is with Compact Support . . . . . . . . . . . . 494 3. Distributions with Convolutive Supports. . . . . . . . 496 4. Convolution Algebras . . . . . . . . . . . . . . . . 497 93. Fourier Transforms . . . . . . . . .. ... ... 500 1. Fourier Transform of L I-Functions .. ...... 500 2. The Space g (JRn). . . . . . . . . . . . . . 502 3. F ourier Transform in L 2 . . . . . . . . . . . 506 4. Fourier Transforms of Tempered Distributions . . . 506 5. Fourier Transform of Distributions with Compact Support 509 6. Examples of the Calculation of Fourier Transforms . . . . . 510 7. Partial Fourier Transform . . . . . . . . . . . . 513 8. Fourier Transform and Automorphisms of JRn: Homogeneous Distributions . . . . . . . . . . . . . . . 518 8.1. Fourier Transform and Automorphisms ofJRn . . . 518 8.2. Homogeneous Distributions . . . . . . . . . 519 
Table of Contents XV 9. Fourier Transform and Convolution. Spaces (DM elRn) and (D (JR n) . . . . . . . . . . . . . . . . 520 9.1. The Space (DM(JRn) (= (DM) . . . . . . . .. ... 520 9.2. The Space (D . . . . . . . . . .. ...... 521 10. Fourier Transform of Tempered Measures . . . . . . . . . 523 11. Distribution of Positive Type. Bochner's Theorem. . . . 525 11.1. Functions of Positive Type . . . . . 525 11.2. Distributions of Positive Type . . . . . . . . . . 525 12. Schwartz's Theorem of Kernels. . . . . . . . . . . . 527 13. Some Distributions and Their Fourier Transforms. . 532 Bibliography. . . . . 533 Table of Notations 538 Index . . . . . . . 551 Contents of Volumes 1,3-6 557 
Chapter III. Functional Transformations Introduction In this Chap. III, we study in Part A, in 91 Fourier series which we use to solve boundary value problems on particular open sets of [Rn. These examples serve as an introduction to the following chapter (devoted to Sobolev spaces) in showing that, for the solution of the problem L1 u == 0 in Q c [Rn, bounded, u == g on oQ, the boundary of Q, to have finite energy (a condition which we shall point out in Chap. IV is u E H 1 (Q)), it is necessary that g satisfies a certain condition, which we shall see later to be that g is in the Sobolev space Ht(oQ). In 92 we study the Mellin transform, which is a particular form of the Laplace transform, suitable for solving the Dirichlet problem for a cone in [Rn. In 93 we treat the Hankel transform which arises naturally in the study of Fourier transforms of functions in [R 2 , invariant under rotation, and which thus enables us to solve problems possessing that symmetry. As a preamble, we study the properties of Bessel functions of the first kind which form the kernels of the Hankel transforms. In Part B we next study the discrete Fourier transform and the fast Fourier transform which are the indispensible numerical complements to the study of Fourier series. Some particular examples of applications are given in 96. There are other transforms, useful in applications, than those studied in this chapter, notably the Fourier transform featured in the Appendix "Distri- butions" and the Laplace transform, recalled and systematically used in Chap. XVI. We should also cite the Hilbert transform discussed in the Appendix "Singular integral operators" of Chap. XIII. * An essential motivation in the use of transforms in the theory of linear partial differential equations, treated in subsequent chapters, is that a suitably chosen transform can change a linear differential operator p(D)1 into a multiplication 1 Here P(D) includes the boundary conditions. 
2 Chapter III. Functional Transformations operator which we then say has "diagonalised" the operator P(D). This is a concept which will be developed in Chap. VIII, where we shall find other useful examples, such as the functions of Legendre, Hermite and Laguerre and the polynomials of Tchebycheff. The Fourier and Laplace transforms change linear differential operators with constant coefficients into polynomials in the variables in the transform space. Similarly the Mellin transform changes the operator r into multiplication by ar the variable in the transform space. Finally, the Hankel transform of order v, changes into a multiplier the a 2 1 a .v 2 a 2 1 a operator a 2 + - _ a - 2 with r E ] 0, 00 [. As the operator a 2 + - - occurs in r r r r r r ar the Laplacian J of [R2 expressed in polar coordinates or in J of [R3 expressed in cylindrical coordinates, the 'Hankel transform is useful in the solution of problems of the type J u = f, f given in [R2 (or for an iterated power of the Laplacian, say Jnu = f) or of problems in an open set D of [R3, (with D the whole space [R3 or the half-space D = [Rt = {(Xl' X2, X 3 ); X 3 > O} with OX 3 an axis of symmetry) such that 2 Ju = f (f given in D), (g given on aD). u oG = g with f and g independent of X 3 . * The diagonalisations mentioned are linked to the invariance of the problems to be treated, by a group: group of translations of [Rn for Fourier series and the Fourier and Laplace transforms, group of displacements of the plane [R2 - and hence of rotations about a fixed point of [R2 -for the Hankel transforms, group of homothetic transformations for the Mellin transform. * Functional transforms are of great use in the theory of partial differential equations to obtain analytical solutions which can yield important qualitative (and sometimes quantitative) results-behaviour at infinity or in the neighbour- hood of singularities. Applications are given elsewhere throughout this work; the present chapter serves as introduction and gives some fundamental proper- ties which will not be taken up again elsewhere. One difficulty about the use of functional transformations, often underlined, is that when the analytical calcu- 2 The Hankel transform is equally useful in the corresponding evolution problems involving au a 2 u - - L1u = f, or - - L1u = f(see Chap. XV). at at 2 
Introduction 3 lation becomes so complicated as not to yield a solution in closed form (as is often the case) the "explicit" formula does not lend itself easily to numerical computation. This view must be finely balanced, the example of the fast Fourier transform (introduced in this chapter) having led to a complete re-assessment of numerical methods in the representation of functions by Fourier series 3 . Another difficulty of functional transformations :Y is that such a transformation is useful only if it possesses an inversion formula; the domain of validity of these inversion formulae implies the introduction of "ad hoc spaces". For the Fourier transform, for example, these are L 2 or Schwartz's space of tempered distri- butions. These spaces lS g- are thus "attached" to :Y. We shall see that for a given physical problem f!JJ, considerations of invariance (e.g. of the energy) lead to us "attaching" to this problem a "functional space" lS [1jJ; these are generally Sobolev spaces constructed on L 2 or other spaces such as Schauder spaces 4 . Hence we see that a difficulty could arise if lSf!/J and lS g- did not coincide. The attitude which can be adopted, and which seems the best, is first to show that the problem f!JJ is "well-posed" within the functional framework lS f!/J; if then a functional transformation :Y leads to supplementary conditions, we can carry out the calculations in a formal way, already assured of their convergence in lSf!/J. This is the approach which will be adopted in the later chapters of this book 5 . 3 As well as the use of "spectral" methods for the numerical analysis of certain models. 4 Spaces of m-times continuously differentiable functions, all of whose m-th derivatives satisfy Holder conditions. 5 This is slightly similar to the position taken for elliptic boundary value problems in the last chapter: we begin by considering the existence and uniqueness of weak solutions, then later prove supplementary properties, notably regularity. 
Part A. Some Transformations Useful in Applications  1. Fourier Series and Dirichlet's Problem 1. Fourier Series Suppose that lr is the unit circle in the complex plane. Each function f on [R, periodic with period 2n, may be identified with a function F on If by the formula (1.1) F (e it ) == f(t), \;/ t E [R. In what follows, we write f in place of F. Notice that the functions t  e int (for n E Z) which are periodic functions (of period 2n) on [R are thus identifid with the polynomials z  zn(n E Z) on If. The functions e int obviously don't belong to either Ll([R) or to L2([R) (since they are periodic). On the other hand, they are in L 1 (0, 2n) (resp. L 2(0, 2n)); we shall say also, (by abuse of language) that they are in L 1 (If) (resp. L 2 (If)). We recall that the space L 2(0, 2n) is a Hilbert space 6 for the scalar product f 21t (f, g) = 0 f(t)g(t) dt. In the remainder of this Chap. III, the functions considered will be taken to have complex values. If f E L l(lf) the Fourier coefficient of order n off is defined by  1 f 21t . (1.2) in = 2n 0 e-mtf(t)dt, n E 7L, and the Fourier series of f is by definition the series (1.3) Sf(t) == L in e int , t E If. nEZ The problem which poses itself naturally is that of the convergence of this series, and if it converges, the value of the sum. In general, the series (1.3) will not converge in each point t E If towardsf(t) since a modification of.fin one point, or more generally on a set of measure zero, does 6 A survey of Hilbert spaces is given in Chap. VI, 1. 
1. Fourier Series and Dirichlet's Problem 5 not modify the value of the coefficients of Fourier. Nevertheless, in a certain number of cases which we shall recall below, the convergence takes place in certain functional spaces. 1.1. Convergence in L 2 (T) Recall that a sequence {j;,}, n == 1,2,... of elements of L2(lr) converges in L 2 (lr) towards f, if lim f 2"1!n(t) - f(tWdt = O. noo 0 For f E L 2 (lr) eL l (lr), we can define the Fourier coefficients off We have the theorem: Theorem 1. Suppose that f and g are two elements of L 2 (lr), with corresponding Fourier coefficients]" and g n . 1 ° The sequence of functions SN,M(t) M L in e int n= -N converge to f in L2(lr) (the same for g) when Nand M  00. 2° The series L in gn is summable and we have the relation of Parseval nEZ (1.4)  f 2" f(t) g(t) dt = L J.. gn' TC 0 nEZ I n particular _ 2 1 f 2" If(tWdt = L 1J..1 2 . TC 0 nEZ Theorem 1 expresses that the set {e int j(2TC)t, n E Z} is an orthonormal base (complete orthonormal system) of L 2 (lr). The Fourier series off is then the development off on this base. Remark 1. Theorem 1 does not say anything about the pointwise convergence of the Fourier series off 1.2. Pointwise Convergence on T We could believe that if f is a continuous function on lr, it is a limit 7 in each point of lr of its Fourier series. This is false in general. There exist complicated 7 In effect, there results from a theorem of Weierstrass that each continuous function on lr, is the limit (uniformly on lr, hence on each point of lr) of a sequence of functions {fl'i} of the form N fN(t) = L IN,n e int , with fN,n E C. n = - N 
6 Chapter III. Functional Transformations examples of continuous functions whose Fourier series diverge in some points. 8 It is necessary to make supplementary hypotheses on f9. Here is a sufficient condition for pointwise convergence on u. Theorem 2. Let f be a real function on u, continuously differentiable save in a finite number of points, at which f and its derivative admit limits to the right and to the left. Then the Fourier series off converges at every point of u and its limit is equal to f(x) iff is continuous at the point x and to {f(x+) + f(x -)} /2 if f is discontinuous at the point x E u. In particular, iff E qj1 (u), its Fourier series converges towards it in each point. This result is a consequence of the following fact 10 : iff satisfies the hypotheses of Theorem 2, then r 2 f f( ) sin A,( d = { f(O + ) if I == (0, a) An I t t t f(O-) if I == (- a, 0) with a > O. In effect, a simple calculation shows that for 0 < x < 2n, SN(X) = f e inX ]" =  f 1[-txf(X + 2t) sin(2 + t)t dt n = - N n - -1.. x SIn t 2 and we apply Dirichlet's lemma to the function t -+ /2 f(x + 2t). SIn t Remark 2. Under the hypotheses of Theorem 2, we have seen that for all t E U,   Ok 1 ;oo k=Nhe' t = 2 (/(t+) + f(L)), (k Elf). It is important to note that it is false in general that M  1 N.OO k=Nheikt = 2(/(t+) + f(L)) (k Elf). F or example, consider the function f of the Fig. 1. We have: in = f 21[ f(t)e-int dt = -  f 1t: (  - l ) sinntdt 2n 0 non 1 -- if n=l=O nin o if n == O. 8 See Y osida [IJ, p. 72; theorem of condensation of singularities (application of Baire). 9 We shall show the uniform convergence of the Fourier series whenfis of class rc 2 (see Example 2). 10 Known generally under the name of Dirichlet's lemma (see Chap. XV,  1; see the references Zygmund [IJ, Schwartz [2] and Sneddon [1]). 
1. Fourier Series and Dirichlet's Problem 7 f -I Fig. 1 Hence M  M - N . 1 L fk e ikt = - . wIth N = 1 + _ 2 + k= -N t=O l1r 1 +- N' Hence (JM - (IN does not converge when N, M  00. o Remark 3. If Xo is a point of discontinuity off, then we cannot have uniformity of the convergence of N SN(X) = L e inx in n =-N to f(x) for x in a neighbourhood of a discontinuity Xo (and for N  00). This is often called the Gibbs phenomenon. o 2. Distributions on T and Periodic Distributions (See Appendix "Distributions" for the general properties of distributions utilised below). We denote by CC 00 (If) the space of infinitely derivable functions on If. (CC 00 (If) is identified by (1.1) at the space denote CCfn(!R) of periodic functions of period 2n and infinitely derivable on IR). This space is provided with the topology of the uniform convergence of the functions and their derivatives. The dual space of CC 00 (If) will be called the space of distributions on If and will be written ' (If). As a consequence of the identification of CC oo (If) with CCfn(IR), '(lf) can be identified with the dual of the space CCfn(IR). 2.1. Comparison of ' (If) with the Distributions on IR 2.1.1. Distribution U on If, Image of a Distribution u with Compact Support on IR. Suppose that u is a distribution on IR with compact support: u E ' (IR). We define a distribution U on If by 
8 Chapter III. Functional Transformations ( 1.5) _ it < U, <p> - < u, <p(e ) >, \;/ <p E CC oo CIf); U is called the distribution image of u by!he mapping H: t E IR  e it E If. We have also defined by (1.5) a mapping H: u E ' (IR)  U E ' (If). If u is a function with compact support and integrable, U is given by ( 1.6) i f 21l(k+ 1) < u, qJ> == u(t)qJ(e it ) dt == L u(t)qJ(e it ) dt  keZ 21lk f 21l == L u(2nk + t)qJ(eit)dt. o keZ Thus U(t) is a function and ( 1.7) U(t) == L u(2nk + t). keZ 2.1.2. Distributions f on IR, Inverse of a Distribution F on T. At present let F E L 1 (If) (defining thus a distribution on lr). Then the function f given by the formula (1.1) from F belongs to Lloc(lR) and thus also defines a distri- bution on IR. We can write for each qJ E  (IR) (1.8) i f 21l(k + 1) <f, qJ > == F(e it ) qJ (t) dt == L F(e it ) qJ(t) dt  k E Z 21lk == f 2" F(e it ) [ L qJ(2nk + t) J dt. o kEZ In remarking that the function qJ defined on [0,2n] by ( 1.9) qJ(t) == L qJ(2nk + t)  kEZ can be considered as a function CC OO on If, we obtain (1.10) <f, qJ > == < F, qJ > which also givesffrom F. This formula can be generalised by continuity to the case in which F is a distribution on If. The distribution f on IR defined also by (1.10) is called the inverse distribution of F by the mapping H: t --+> e it . Such a distribution f on IR satisfies the relation (1.11) <f, qJ(x - 2n) > == <f, qJ >, \;/ qJ E (IR). The distributions on IR satisfying (1.11) are said to be periodic distributions of period 2n. We notice that we can identify, by the formula (1.10), the periodic distributions with the distributions on lr. Thus the mapping H*: F E '(lr) --+> f E '(IR) 
1. Fourier Series and Dirichlet's Problem 9 defined by (1.10) is injective and has for image in £&'(IR) the space of periodic distributions on .11 As a consequence of the identification of £&' (If) with the dual of CCfn(IR), each periodic distribution f can be considered as acting continuously on the oo functions on IR, which are periodic, of period 2n. Remark 4. We can use the sections 2.1.1 and 2.1.2 to define a periodic distribution ii from any distribution u on IR with compact support, by ii = H* oH(u) by means of the formulae (1.5) and (1.10); ii is given by (1.7) (with u replacing U)12. F or example, to Dirac's £5-distribution on IR, corresponds Dirac's periodic distribution -- £5 - L £5(2nk + t) = L £5 2nk . kEZ kEZ o 2.2. Principal Properties of £&' (If) The convolution of distributions on T is defined by using the group property of If (compact and commutative). We use the same method as in the Appendix "Distributions" to define the convolution of two distributions. In particular the convolution off E £&' (If) and a regular function ({J E CC 00 (If) is given by (1.12) (f * qJ )(s) = <I, ({Js> where ({Js is defined by qJs(t) = qJ(s - t). The Dirac £5-distribution on If defined by < £5, ({J > = ({J (0), V ({J E CC 00 (If) satisfies (1.13) £5 * u = u * £5 = u, Vu E £&'(If). The usual continuity properties of the convolution (on IR), are likewise satisfied. We call attention again to the following two properties: 1) For each periodic distribution u, there exists a positive integer k, and a continuous periodic function f such that u = (1 - L1 )kf13. 2) Each periodic distribution is a tempered distribution 14 . 11 The mapping H* cannot possibly be the inverse of ii, for as a result of (1.11) there cannot exist a non-null distribution which is both periodic and of compact support on IR. 12 it is called the periodic transform of u. The mapping H* a H : u E $' (IR)  it can be extended to more general distributions. (See Schwartz [1 J, p. 230). 13 See Schwartz [IJ, p. 230. 14 See below, Theorem 4, and Schwartz [1J, p. 253. 
10 Chapter III. Functional Transformations 3. Fourier Series of Distributions We have recalled in Sect. 1.1 the development offunctions in Fourier series. We shall see here that distributions on lr also have developments in "Fourier series" and we shall study them. If u E !!fi' (lr), the Fourier coefficient of order n, (n E Z), of u is defined by A 1 . u == - ( u e -mt > n 2n ' (1.14) n E z. , Un is well-defined since e - int E CC 00 (lr). The Fourier series of u is by definition L Un e int . nEZ We must study in what sense this series converges to the distribution u E !!fi' (lr). We begin with an example which enables us to solve the general problem of the convergence of the Fourier series of u. Example 1. Let u == £5, the Dirac £5-distribution on If defined above. Its Fourier coefficient of order n is Sn == 1/2n. We now show that the Fourier series L e int / 2n converges to £5 in !!fi' (lr). nEZ , F or all <p E CC 00 (lr), we have 15 / f e int,<fJ ) = f <e-int,<fJ) \ 2n n = - N 2n n = - N ( 1.15) N L cPn, n =-N cPn being the Fourier coefficient of order n of the function <po Since <p E CC oo (If), N the sequence SN == L cPn converges to <p(O) as N -+- 00; hence the Fourier n= -N series L  e int converges to b in f?fi'(lf} nEZ 2n o More generally, we have the Theorem 3. The Fourier series of a distribution on If, converges to that dis- tribution in !!fi' (If). Proof By virtue of the continuity of the convolution we can write for each u E !!fi' (If): 1  . u == £5 * u == - i....J emt * u. 2n nEZ N N 15 Because L e int = L e - into n=-N n=-N 
1. Fourier Series and Dirichlet's Problem 11 Now by using (1.12) we have (e int * u) (s) == (u t , ein(S-t» == e ins (u, e - int) == 2nu n e ins , t with the result that: u(s) == L une inS . nEZ o Example 2. Let us show by way of an application that iff E CC 2 (lr), its Fourier series converges uniformly to f Indeed, to begin with,  1 f 211:. 1 f 211: . n 2 fn == - n 2 e - wtf(t) dt == -  e - wtf" (t) dt, 2n 0 2n 0 and so III  cjn 2 , which implies that the Fourier series off is absolutely and uniformly convergent. Its sum g is a continuous function. We observe that we did not need distributions to establish that; but it still remains for us to show that f == g, which is most easily done by, in the present case, consideringf (and its Fourier series) as a distribution on lr. The Fourier series off converges to fin .@'(lr) according to Theorem 3. We must therefore havef == g almost everywhere, and therefore everywhere since bothfand g are con tin uous. o Example 3. Suppose that g E .@' (lr) satisfies the condition 16 ( 1.16) (g,l) == 0 Let us show that there exists f E .@'(lr) such that dfj dt == g. Let us denote by gn the Fourier coefficients of g. Then we shall have go == 0; let us put def f== A L g n int -e . , nEZ, n # 0 zn t E lr. Then f E .@' (If) and dfj dt is equal to g in the sense of distributions. o Remark 5. If g E .@'(lr) does not satisfy the condition (1.16), there does not exist any f E .@' (lr) whose derivative is g. This can be verified by the use of Fourier series, or by observing that differentiation in CCOO(lr) is such that d . f . h df h dt 1 == 0; hence 1 g IS such t at g == dt ' t en <g, 1> = ( , 1 ) = - (f, ; ) = O. f 21t 16 If g is a function, g E L ' (If), (1.16) can be written <g, 1> = 0 g(t)dt = O. 
12 Chapter III. Functional Transformations Thus the condition (1.16) is a necessary and sufficient conditionfor g E '(lr) to be the derivative of a distribution on lr. To the contrary in the case of distributions belonging to ' (IR), differentiation is not a surjective operation of ' (lr) onto ' (lr). o We now give a further example, analogous to Example 3, which we shall use in applications. Example 4. Suppose that g E ' (lr) is such that < g, 1> == 0; then there exists fE'(lr) such that d 2 fjdt 2 == g. In fact, in '(lr), we have g == L gn e int , t E lr. If we definefby nEZ* f == - L ( g ) eint nEZ* n thenfis an element of '(lr) and dfjdt is equal to g in the sense of distributions in ' (lr). Remark 6. Differences between distributions on lr and distributions on the open interval JO, 2n[. By adding the points 0 and 2n (identified) to the open interval ]0, 2n[, we are able to construct a compact set isomorphic to lr. The difference between the two types of distributions is therefore due to the properties that this compactification introduces; this will be seen especially by the discontinuities of the function at 0 and 2n, due to the identification of the points 0 and 2n. It is this which we shall develop below. (1) We must not confuse the distributions on lr with the distributions on ]0, 2n[. In particular, if u is a continuous function on ]0, 2n[, having limits at 0 and 2n denoted by Uo and U 21P then the derivative du j dt in the sense of distributions on lr and the derivative in the sense of distributions on ]0, 2n[ differ by (uo - U2n)<5: 16a ( dU ) ( dU ) - == - + (u o - u 2n )<5 dt '(lr) dt '(]O, 2nD where <5 is the Dirac distribution on lr at 0 (see Sect. 2.2). (2) If u E L2(JO, 2n[) and if dujdt (differentiation in the sense of '(JO, 2n[)) is such that duj dt E L 2(JO, 2n[), we .shall see in Chap. IV that these conditions imply that u E ceO ([0, 2n J). Let {un} be the F ourier coeffiients of u and { v n } the Fourier coefficients of (1.1 7) v = (  ) f0'(]O,2n[) 16a See Schwartz [1], IV, 17. 
1. Fourier Series and Dirichlet's Problem 13 (where the derivative is taken in the sense of £C'(JO, 2n[); these coefficients satisfy: L lu n l 2 < 00 nEZ and L Iv n l 2 < 00. nEZ Let us now examine the derivative of u in the sense of £C' (lr). We have seen that . ( dU ) "" A. - ==  inune lnt . dt '(lr) n E Z Hence, taking account of (1.17), we have . A A 1 lnU n == V n + 2n (u o - u 2n ) Vn E 7L. Thus the develop]11ent of u E H l satisfies ( 1.18) L lu n l 2 < 00 nEZ and L n 2 1u n l 2 < 00 nEZ if and only if U o - u 2n , i.e. if and only if the limiting values of u at 0 and 2n are eq ual. We observe that if (1.18) is valid, then it is possible to verify directly that u(t) == L une int E CCO(lr), nEZ and so u E CCO([O, 2nJ) with u(O) = u(2n). This follows from the Cauchy- Schwarz inequality: L lunl < ( L n2IunI2 )( L  ) . nEZ nEZ nEl. 11 o The question now arises (as a sequel to Theorem 3): how do we characterise a sequence {cn}nEZ such that the C n are the Fourier coefficients of order n of a distribution u E £C' (lr)? This is the object of Theorem 4. The necessary and sufficient condition for a sequence {c n } n E Z to be that of the Fourier coefficients of a distribution u E £C' (lr) is that the sequence {c n } satisfies 1 . ICnl - 0 1m 2k - Inloo(l + n) A sequence {c n }, (n E 7L and C n E C) satisfying the above formula is said to be of slow growth. The set of these sequences is denoted by 0' 17. For the proof (easy) we refer to Schwartz [lJ p. 225, Theorem 1. for k sufficiently large, (k E IR). 17 Notice the analogy between 0' and g"(IR), the space of tempered distributions. 
14 Chapter III. Functional Transformations Theorem 4 allows the characterisation of the distributions on If. In the same way, the functions "regular" on If (more precisely oo (If)) can be characterised by their Fourier coefficients: If f E  00 (If), then its Fourier coefficients in satisfy (1.19) lim Inlklll = 0 Inloo Vk E IR18. A sequence {l}, (n E 7L,l E C) verifying (1.19) is said to be of rapid decay and the set of these sequences is denoted by {).19 4. Fourier Series and Fourier Transforms (1) First of all, we poillt out the following result: The Fourier transform f of a 2n-periodic distribution is equal to the sum of Dirac distributions at the points y = 0, + 1, + 2, . . . , + n, . . . '" whose weights are given (to within 2n) by the Fourier coefficients fn of the development off in the Fourier series: J = I l2nb n nE"Z where b n denotes the Dirac-distribution at the point n. Proof We recall that we have chosen (see Appendix "Distributions") for the definition of J the Fourier transform of a function f E [/' (IR) or L 1 (IR) J(y) = fn;!!(t)e-iYtdt o and that we have 1 1 '" f(t) = - f(y)e iyt dy. 2n  Suppose that b x is the Dirac-distribution on IR concentrated at y = x E IR. We know that (see Appendix "Distributions") b x is the transform of the function 1 . t  _e lxt 2n x, t E IR. Consequently the Fourier transformation of the 2n-periodic distribution defined by f(t) = L le int nEl 18 Notice that this result on the functions of  ex) (u) implies, by duality, Theorem 4, on the distributions of '(u). 19 Notice the analogy between d and Sf(IR), the space of functions of rapid decay. 
91. Fourier Series and Dirichlet's Problem 15 is given by J = L J:,2nb n . nE"Z The use of Fourier series is thus a (very practical) particular case of the use of Fourier transforms in problems- involving periodic functions. (2) On the other hand, just as the Fourier transformation 'diagonalises"20 the operator - d 2 /dx 2 acting in L 2(!R) (see Appendix "Distributions") the mapping f E L 2 (0, 2n)  C!:}nEz E [2(Z) (with.!: the Fourier coefficient off in [2(Z) the set of square summable sequences) "diagonalises,,20 the operator - d 2 /dx 2 acting in L 2(0, 2n) with the periodic boundary conditions u(O) = u(2n), u' (0) = u' (2n). This aspect of diagonalisation of an operator with the help of a transformation will be met with in 99 1, 2, 3 of this Chap. IliA and will become plain in Chap. VIII. 5. Convergence in the Sense of Cesaro By definition, we say that a sequence {An} (n integers, n  0, An E C) converges to A in the sense of Cesaro if Ai + A 2 + + An A = lim n-+oo n It is immediate that a convergent sequence converges in the sense of Cesaro to its limit, but the converse is, in general, false: the sequence {an = (- 1 r} converges to 0 in the sense of Cesaro but it does not converge in the usual sense. The notion of convergence in the sense of Cesaro is a notion of convergence in mean. Being given a trigonometrical series L Cke ikt , we introduce the sequence of nE"Z partial sums Sn(t) = n '" c e ikt . f...J k , k = - n the convergence in the sense of Cesaro of this sequence (in a suitable functional space) reduces then to the convergence of CAt) = mtokXm Ck eikt - kln(l - 1I )Ckeikt. 20 See the introduction to this Chap. III. 
16 Chapter III. Functional Transformations Observe first of all that when C n converges in flfi' (If), its limit has the C k as its Fourier coefficients 2 I 2 1t Cn(t)e-iktdt = (1 - 1I )Ck for n  Ikl, and so C k = lim  f 2" Cn(t)e -ikt dt, n -+ 00 2rc 0 Vk E Z. Using Theorem 3, we deduce: A trigonometrical series L cne int converges in the sense of Cesaro in flfi' (If) if and nE"Z only if it converges in the usual sense in flfi' (If). Let us consider now f EL I (If) and its Fourier coefficients  1 f 211: . A = - f(t)e-lktdt. 2rc 0 We have: 1 f 211: + n Sn(t) = 2n 0 f(s)un(t - s)ds with un(t) = kIn e ikt , and Cn(t) = 2 I 2 " f(s)vn(t - s)ds with A classical calculation shows that: ( ) _ uo(t) + . . . + un(t) t - . n sin(n+} un(t) = . t Sln- - 2 1 vn(t) = - n sin C ; 1 )t 2 . t sIn 2 In particular we have ( 1.20) 1 f 211: n + 1 vn(t)  0 and _ 2 vn(t)dt = rc 0 n We then deduce: Proposition 1. Suppose that f EL I (If) (resp. f E O(lf)), then the Fourier series of f converges in the sense of Cesaro to fin L 1 (If) (resp. o (If)). Proof. We have to show that 1 f 211: lim - ICn(t) - f(t)ldt =0 n -+ 00 2rc 0 ( res p . lim supICn(t) - f(t)1 = 0 ) . n-+oo tElf 
91. Fourier Series and Dirichlet's Problem 17 We know that for f sufficiently regular (e.g., fE 2(1"), after Example 2), the Fourier series converges uniformly21 to f and so in particular Sn converges uniformly to f, and a fortiori, C n converges in L 1 (1"), (resp. o (1")) to f Now, from (1.20) we have 1 f 21t 1 f 21t n + 1 1 f 21t _ 2 _ 2 f(s)vn(t - s)ds dt  _ 2 If(s)lds. rc 0 rc 0 n rc 0 ( 1 f 21t n + 1 ) resp. sup _ 2 f(s)vn(t - s)ds  suplf(t)1 . tElf rc 0 n tElf We thus deduce the result from the fact that regular functions are dense in L 1 (1") (resp. o (1")). o Remark 7. For f E LP(1") with 1 < p < 00, by using the method above, we obtain the convergence of the Fourier series offin the sense of Cesaro in LP(lr). In fact, for p == 2 we know (see 91.1) that lim +m L he ikt - f(t) k = - n L 2 ==0 , n,m-+oo which is much stronger than convergence in the sense of Cesaro. It is possible to show (see Zygmund [1]) that this result is still valid for 1 < p < 00. On the contrary, it becomes false for p == 1. o Remark 8. The Cesaro mean Cn(t) is much more regular than Sn(t), and constitutes a very useful practical. method to "regularise" a weakly regular function, and in particular, a discontinuous function, and thus to remove the Gibbs phenomenon mentioned in Remark 3. (See later, part B of this Chap. III). 6. Solution of Dirichlet's Problem with the Help of Fourier Series 6.1. Dirichlet's Problem in a Disk Suppose that D is the disk with centre 0 and unit radius in [R2 and whose boundary is 1". We seek a function u satisfying { L1 u == 0 in D (1.21) u == g on lr, where g is a function (real or complex) whose regularity we shall make more precise later. This problem has been treated in Chap. II, 94 by a method making 21 In the sense of the norm of the sup. 
18 Chapter III. Functional Transformations use of Poisson's integral formula. Here we make use of Fourier series. Using polar coordinates (r, fJ) we put (1.22) u(r, fJ) = L un(r)e inlJ . nE"Z The function u being real analytic 22 in D is a fortiori continuous at 0 which Imposes ( 1.23) un(O) = 0, for n #- o. On the other hand, the boundary condition u(l, fJ) = g(fJ) = L gn einlJ nE"Z implies (1.24) u n (l) = gn, 'tin E 7L. Finally, the equation L1u = 0 written in polar coordinates is a 2 u 1 au 1 a 2 u ar 2 + -;: ar + r 2 a fJ2 = 0, from which we have  ( u +  u' - n 2 u ) e inlJ = 0, f....J r n r 2 n nE"Z from which again we obtain for n E 7L the sequence of differential equations 1 n 2 u + - u - 2: Un = 0, r r' whose general solutions are: un(r) = cnr:t n, uo(r) = Co log r + c, The conditions (1.23) and (1.24) then lead to un(r) = gn r1nl uo(r) = colog r + go. We recognise in uo(r) the elementary solution 23 of the Laplacian; we see that L1u = 0 in D (and not only in D\{O}) imposes the condition Co = O. Finally for n#-O for n = o. ( 1.25) u(r, fJ) = L gn r + Inl e inlJ . nE"Z 22 See Chap. II, 2. 23 See Chap. V, 2. 
 1. Fourier Series and Dirichlet's Problem 19 Remark 9. (1) The formula (1.25) has a meaning even for very irregular g's, for example g E' (lr) which implies that the sequence of gn's is of slow growth (Theorem 4) with the result that (1.25) is convergent. We have even seen in Chap. II, 96 that we can choose for g an "analytic functional". (2) We have encountered the expression E(u) (corresponding in many appli- cations to an energy of u) def 1 f 1 r 1 r 2n ( au 2 1 au 2< ) E(u) = 2 DIVul2dx = 2Jo Jo or + r 2 of) rdrdf) f l f l1 E(u) = n L lu(r)12rdr + n L n 2 -lu n (r)1 2 dr, nEZ ° nEZ ° r from which (1.26) i) E(u) = n L Inllgnl 2 . nEZ So that the energy of the solution u of the problem (1.21) is finite, it is necessary and sufficient that g be sufficiently regular to satisfy the condition (1.26) ii) L Inllgnl 2 < 00. n We shall see in Chap. IV that this condition characterises the Sobolev space H t (lr). Let us give a counter example to this situation: g(f)) = f  e i (p!)6, p=lP 1 This series is normally convergent since L 4 < + 00 and converges in rcO(lr) p P to g which is a continuous function. The solution u of the Dirichlet problem (1.2.1) is then given by 00 (p!) U = '" r i(p!) ()  e , p = 1 P but this is not a solution with finite energy since L Inllgnl 2 = L (p!)  = + 00, nEZ P P o Remark 10. The interesting aspect of Fourier series in these problems (1.21) is that they reduce the original problem - with a partial differential equation in two vari- ables - to a simpler one, that of solving a system of uncoupled ordinary 
20 Chapter III. Functional Transformations equations. We have thus reduced a problem in two dimensions to a se- quence of problems in one dimension. More generally, we can reduce a problem in three dimensions ([R3) to one in two dimensions ([R2) for a problem formulated in an open set Q c [R3 possessing an axis of revolution. Passing to cylindrical coordinates (r, X3, fJ) and putting: u(r, X3, fJ) == L un(r, x 3 )e in8 , nE"Z we reduce the Dirichlet problem in Q to a sequence of Dirichlet problems in OJ E [R2 where OJ is the meridian of Q. In the same way, when we have an open set Q E [Rn which is of the form Q == OJ x [R, with OJ c [Rn -1, putting x == {x', x n } with x' E OJ, the use of the Fourier transform in X n allows a problem posed in Q to be reduced to one posed In OJ. o 6.2. Dirichlet's Problem in a Rectangle We consider a rectangle Q == ]0, a[ x ] - -tb, -tb[ in the plane [R;1, X2. We propose to solve, by means of Fourier series, the Dirichlet problem: i L1 u == 8 a2 + 8 a2 = 0 in Q (1.27) Xl X2 u == g on a Q, where g is a function given on aQ. The boundary 8Q is made up of the segments: ro == {O} x]- bj2, + bj2[ r a == {a} x ] - b j2, + b j2 [ r'+ == ]0, a[ x {bj2} r'_ == ]0, a[ x {- bj2} with the result that we can write g == gl + g2, X 2 n r'+ Q fa - a Xl b +- 2 fo 0 b - - 2 r'_ Fig. 2 
 1. Fourier Series and Dirichlet's Problem 21 where: gl == 0 on r 0 u ra g 2 == 0 on r'+ u r'- . The solution u of (1.27) is then U == U l + u 2 where U i is the solution of ( 1.27)' L1 U i == 0 in Q, U. == g . on 8Q l l , i == 1, 2. Now by the change of variables: , b Xl == X2 + 2' , X2 == Xl a 2' Q is transformed to: ( 1.28) Q' == JO, b [ x J - a12, aI2[, and the problem (1.27) to: 8 2 u' 8 2 u' _ 8 '2 + _ 8 ,2 == 0 Xl X2 in Q' , U' == g' on 8Q', with U'(X, x;) == u(x 1, X 2 ), g'(X, X;) == g(Xl' X2). In particular, this transformation changes (1.27)' (i == 2) to (1.28) with the result that it suffices to solve (1.27)' for i == 1. In other words we are led to solve (1.27) taking g == 0 on r 0 u r a' i.e. to solve i) L1 U == 0 in Q ii) u(O, X2) == u(a, x 2 ) == 0 for X2 E [ - bj2, bl2J iii) U(Xl' + bj2) == qJ:!:(Xl) for Xl E [0, aJ, ( 1.29) where qJ + and qJ _ are functions given on ]0, a[. Writing ( 1.30)  . nnxl U (x l' X 2) == f...J an (X 2) SIn n=l a where 2 f a . nnxl a n (X2) == - U(Xl' X 2 ) SIn dx l , a 0 a (1.31) and hence in particular, by the condition on the boundary r':!: + def ( b ) 2 f a . nnx 1 ex;; == an + - == - qJ:!: (Xl) SIn dx l . 2 a 0 a Now, we have  ( " ( nn ) ) . nnx l Au(x 1 , x 2 ) = 1..- an (x 2 ) - ----;; 2 a n (X2) SIn a ' with the result that L1 U == 0 in Q can be written: a; - ( '; r an = 0 in ] - b12, b12[, (1.32) 
22 Chapter III. Functional Transformations that is to say (1.33) nnx 2 nnx 2 a n (X 2 ) = Anch + Bn sh . a a Thanks to (1.31) the coefficients An' Bn are given by nnb nnb + An ch 2a + Bn sh 2a = an nrcb nrcb An ch 2a - Bn sh 2a = a;; , that is to say: An= (ach n:b ) -1 J: [<p+(X 1 ) + <p-(xdJsin nna X1 dX 1 Bn= (ash n : b r 1 f: [<p+(xd - <p_(xdJsin nnax1 dx 1 0 Conversely, the formula (1.30) where an is given by (1.33), (1.34) determines a solution of (1.29). Remark 11. When u is of the form (1.30), the energy E(u) (see Remark 9) is given by (1.34) (1.35) i) 1 i 00 f +Q. [ n 2 rc 2 J E(u) = - I Vul 2 dx =  L 2  la n (x 2 )1 2 + la(X2)12 dx 2 . 2 Q 4 n = 1 _Q. a 2 Integrating by parts and using (1.32) we obtain rc  [ nnb nnb 2 2 J E(u) = 2  sh- 2 ch- 2 (An + Bn) . n=l a a nrcb nnb Taking account of(1.34) and ofthefact that sh 2a  ch 2a as n --+ 00, we find that the solutions of finite energy (1.35) ii) E(u) < + 00 are such that (1.35) iii) 00 ( f a nnx ) 2 L n qJ :!: (:X 1) sin 1 dx 1 < + 00. n=l 0 a We shall see that the condition (1.35) iii) is equivalent to saying that the boundary data qJ + and qJ _ are in the space of traces of the functions with finite energy. o 
1. Fourier Series and Dirichlet's Problem 23 Remark 12. Application of the Dirichlet problems (1.21) and (1.27). The mathematical models (1.21) or (1.27) can be found in an electrostatic problem (see Chap. I, 4.5.1) U representing the potential in the interior Q of a hollow cylinder in [R3, (a dielectric occupying the domain Q), of circular or rectangular cross-section, g representing the potential applied to the surface of this cylinder: E(u) is here the energy seen in Chap. lA, 4: we have seen the definition for - the energy W of the whole domain in Chap.IA (4.42) (general case of evolution), - the energy WOK of a compact set K in Q in Chap. IA (4.114) (stationary case), - the energy Wi per unit length of a cylinder in Chap.IA (4.141) (stationary case ). Using the system of natural units (see Chap. lA, 4) the expression E(u) is precisely the electromagnetic energy per unit length of the cylinder: E(u) = r C dXi dX 2 dX 3 = Wi J Q x [0, 1 ] where S, the energy density is defined in Chap. lA, (4.jO). We have pointed out in Chap. lA, 4, that in problems in which the domain Q == Q x [R is modelled by an infinite cylinder in [R3, we often have to require that the energy per unit length be finite, so have the condition (1.26)ii) or (1.35)iii). o Remark 13. The use of Fourier series can equally well be used to determine the spectrum of the Laplacian (see Chap. VIII) in the rectangle Q in [R2 with homogeneous Dirichlet boundary conditions. To simplify what follows let us suppose that Q == JO, n[ x JO, n[. The question is to find u and A satisfying (1.36) { i) L1 u + AU == 0 In Q c [R2 A E C. ii) u == 0 on 8Q Putting then (double development in series of sines )24: 00 00 U(Xi' x 2 ) == L L u nm sin nX i sin mX2 n=im=i we find that equation (1.36)i) then becomes L [-(n 2 + m 2 ) + AJunmsinnXi sinmx 2 == O. n,m 24 The functions (sin nx 1 , sin mx 2 ), with m, n integers, form an orthogonal base in the Hilbert space L 2(Q). 
24 Chapter III. Functional Transformations from which we deduce [A -(n 2 + m 2 )Ju nm = 0, Vn,mEN*, so that u mn = 0 unless A = n 2 + m 2 . We deduce that there is a denumerable infinity of isolated eigenvalues: 25 A = n 2 + m2 nm , n,mEN*, having the associated eigenfunctions 25 U nm (Xl' X2) = sin nXl sin mX2. The preceeding method can be used equally well to solve the Dirichlet problem { -L1u = f on Q U = 0 on oQ, by developing f in a double series of sines. o 92. The Mellin Transform 1. Generalities The Mellin transform appeared for the first time in the famous memoire of Riemann [lJ on the theory of numbers. The reason is the following: if we consider the Dirichlet series 00 a cP(s) = L -i n = 1 n and if we introduce the function f(x) = L an e -nx, then 1 f oo <1>(s) = r(s) 0 x s - 1 f(x) dx in which there makes the appearance of the Mellin transform off defined by Mf(s) = LX) x S - 1 f(x) dx. If we associate with f the function on [R: F(t) = f(e t ) 25 See Chap. VIII. 
2. The Mellin Transform 25 then Mf(s) = f 00 e ts F(t) dt We work here with s complex. We recall that the (bilateral) Laplace transform is defined by .2F(p) = foo e- pt F(t)dt. We therefore have Mf(s) = 2 F( - s). From this we deduce the inversion formula of M. In effect 1 f C+ioo F(t) = lni c _ i 00 e pt .2 F(p) dp with c a convenient real value (and with appropriate hypotheses on F). Hence 1 f C+ioo F(t) = _ 2 . e- st Mf(s)ds nl C - i 00 from which 1 f C+ioo f(t) = -----=- x -s M f(s) ds. 2nl c-ioo The connection thus put in evidence with the Laplace transform by the change of variable t --+ e t = x gives the principal property of the Mellin transformation from the point of view of applications to partial differential equations. While the Laplace transformation diagonalises the operator dj dt (giving multiplication by p), the Mellin transformation diagonalises the operator xdjdx = A (giving multiplication by - s): M(Af)(s) = - sMf(s). [Notice otherwise that Af = sfis equivalent to f(x) = x- s .] Now (and naturally these formulae are not "accidental": they all result from in variance properties) if u satisfies L1u = 0 in the plane, then u, being expressed in polar coordinates, satisfies (after multiplication by r 2 ) v 2 u vu v 2 u r 2 vr 2 + ra;: + ve 2 = 0 1.e. v 2 u A 2 u + ve 2 = o. 
26 Chapter III. Functional Transformations Let us now consider the following problem: in the angular domain 0 < r < 00, - a < () < a, we seek u == u(r, ()), solution of { (i) L1 u == 0 (ii) u(r, a) == u(r, - a) == g(r). (2.1 ) We then introduce the Mellin transform in r of u: U(s, e) = Mr(u(r, e)) = IX) r s - 1 u(r, e) dr. The problem (2.1) becomes then a 2 u ae 2 + S2 U = 0, - IX  e  IX (2.2) U ($, + a) == M g(s). The explicit solution of this problem, in which s is a parameter, is elementary: the two boundary values of U being equal (we could equally well treat the case in which u takes distinct values for () == a and for () == - a!), U is even in (), thus cos (s ()) U(s, e) = () Mg(s) cos sa and hence the solution of (2.1) is expressed by 1 f C + i 00 cos S () u(r, ()) == _ 2 . r- s Mg(s)ds. 1Cl c - i 00 cos sa D Let us now pass to a more precise expose of the properties of the Mellin transform. 2. Definition of the Mellin Transform Definition 1. The Mellin transform of the function with complex values g(x), with x E [R + is given by the formula M(g)(s) = t'X) x S - 1 g(x)dx, (2.3 ) sEA C (:26 , this formula having a sense if, for example: x --+ X S - 1 g(x) EL l (0, + 00) (and also in other cases as shown in the examples which follow). 26 The formula (2.3) only holds for the part of g defined on [R +, we are not concerned to know whether or not g takes a value or not on [R -. 
2. The Mellin Transform 27 Example 1. Let us calculate M (e - ax), for a > o. For all SEe satisfying Res> 0, we have M(e -/XX)(s) = L'X) x S - 1 e -/Xx dx = IX -s LX) x s - l e -x dx = IX s res). Example 2. The functions cos x and sin x possess Mellin transforms: the integral L'X) xs I cos x dx is convergent for Re s E ]0, 1[. Let us write foR x S - 1 cosxdx = fol xS-Icosxdx + fIR xs-Icosxdx. Th function X S - 1 cos x is in L 1 (0, 1) when Re s > 0 and integrating by parts we have fR xS-Icosxdx = [xSlsinx]f - (s - 1) f" x,s-2sinxdx and x S - 2 sinx E L 1 (1, + (0) when Res < 1. Let us proceed in the same way for the function sin x. To calculate the Mellin transforms of cos x and sin x, let us integrate the holomorphic function F(z) = zS-1 e -z round the contour r (see Fig. 3). Denoting by r 1 (respectively r 2) the small (respectively the large) circle we deduce from Cauchy's formula that o = f F(z)dz = fR x S - 1 e-xdx + f F(z)dz r Je r 2 + t (ixy- I e-ixidx + t, F(z)dz E R Fig. 3 The integrals over r 1 and over r 2 tend to zero when B --+ 0 and R --+ 00. The integrals on (B, R) being f.R x S - 1 e -x dx, and is f.R x S - 1 e -ix dx = e tis1t f.R x S - 1 e - ix dx, 
28 Chapter III. Functional Transformations it follows on making B --+ 0 and R --+ 00 that for Re s E J 0, 1 [ fo'" xS-le-ixdx = e- tis1t r(s) from which we deduce that (2.4) M(cos x)(s) = cos(  sn )ns) M(sinx)(s) = sin(sn )ns) SEe and such that Re S E JO, 1[. D 3. Properties of the Mellin Transform Let us at first give some simple formulae, easy to verify,fbeing a function with complex values on IR or IR +: (see Definition 1 for the conditions on f) d M(Log xf)(s) = ds M(f)(s) M(xaf)(s) = M(f)(s + li) li E C (2.5) def M(fJ(s) = a- S M(f)(s) if h(x) = f(ax), a > 0 1 ( s ) def M(fP)(s) = /3 M (f) /3 if fP(x) = f(x P ), /3 > 0 27 . Let us calculate now the Mellin transform of the derivative of a functionf We have M(f')(s) = L"'!'(X)XS-1dX = [xS-lf(x)] - (s - 1) L'" x S - 2 f(x)dx. If we suppose that lim x s - 1 f(x) = lim x s - 1 f(x) = 0, x-+O x-+oo li < Re(s) < /3 and that M (f)(s - 1) exists in this band, we shall have (2.6) M(f')(s) = - (s - l)M(f)(s - 1), SEe and li < Re (s) < fJ. 27 Notice that there are many other interesting properties of the Mellin transformation, and def f oo ( x ) dy notably that it transforms the convolution integral h * h = 0 h ; h(Y)  to a product: M(h*h)) = M()).M(h)). 
2. The Mellin Transform 29 More generally, let us suppose that lim X s - n + k f(k) (x) = lim X s - n + k f(k) (x) = 0, xo xoo Re (s) E J li, Jj[ and k = 1, 2, . . . , n - 1, and that M(f)(s - n) exists in this band, then (2.7) M(f(n»)(s) = (- l)n(s - l)(s - 2) . . . (s - n) M(f)(s - n). As an application of these formulae, we see that M( x  }s) = sM(f)(s). (2.8) This equation expresses the fact that the Mellin transform diagonalises the d operator 11 = x dx . Remark 1. Let us place ourselves in the space of regular functions with compact support on JO, 00 [ provided with the scalar product f 00 dx cpt/J-. o x The completion of this space is the Hilbert space L 2 (0, 00; dxjx). Then the operator A = xdjdx is "antisymmetric", i.e. satisfies f 00 dx f 00 dx (Acp)t/J- = - cp(At/J)-, o x 0 x vcp,t/J E f0(JO, oo[). We can show that the operator iA is essentially self-adjoint 28 in L 2 (0, 00; dxjx) if its domain 28 is D(iA) = f0(JO, 00 C), and that its "proper generalized func- tIons" are the functions x  x - s (for Re s = 0). D Let us denote equally (with domains of validit y 29 eventually different) (2.9) M( t f(t)dt }s) M ( LX) f(t) dt }s) 1 = - - M(f)(s + 1) s 1 = - M (f)(s + 1). s 28 See Chap. IA 6 (Appendix), Chap. VI and Chap. VII for all these notions. 29 See the conditions indicated for (2.6). For the two relations (2.9) to have a sense simultaneously, it is necessary tha. t f + 00 o f(t)dt = O. 
30 Chapter III. Functiona Transformations All of these formulae are frequently used in applications. In particular, let us consider the Laplacian L1 in IR 2 in polar coordinates. We have ( 02 1 0 1 ( 2 ) (2.10) L1 u = or 2 + -;: or + r 2 oe 2 u. Taking the Mellin transform with respect to the variable r, we obtain with the aid of (2.7) (2.11) M(L1u)(s) = [  + (s - 2f ] M(u)(s - 2, 0). 4. Inverse Mellin Transform One of the interesting aspects of the Mellin transformation (as with the trans- formations of Fourier or of Hankel etc.) is that it possesses an inverse. The existence of this inverse is furnished by the Theorem 1. Let g(s) be afunction which is holomorphic in {s E C; Res E ]li, P[}. Suppose that (i) lim g(s) = 0 uniformly in {s E C; Res E]li + B, P - B[} I Ims I -+ 00 (2.12) V B such that 0 < B < t(P - li), (ii) L I g (s) I db < + 00 if s = a + ib, V a E ] IX, P[. Then the function g is the Mellin transform of a function g which is given by g(x) = 2 1 . f x-Sg(s)ds, 1Cl r (2.13) xEIR+ , in which r = {s E C; S = y + i l5, l5 E IR, y fixed, y E ] li, P [} . We then have (2.14) g(s) = LX) xS-1g(x)dx, ReSE]IX,p[. We can show (see Widder [1]) that a function g on IR+ is the inverse Mellin transform of a function g with the above properties if and only if (i) g E Lc(]O, + 00 [), (ii) for all (J E]li, P[, there exists A((J) E IR+ such that Ig(x)1  x-a A ((J), Vx E]O, 00[. 
2. The Mellin Transform 31 So for 0 < Cl < /3, g(x) --+ 0 for x  + 00 for Cl < /3 < 0 g(x) --+ 0 for x  0 + for Cl < 0 < /3 g(x) --+ 0 for x --+ 0 + and for x --+ 00. Under the conditions of Theorem 1, we see that the inverse transformation M- 1 of the Mellin transform M is M- 1 :g   f x-Sg(s)ds 21Cl r Proof of Theorem 1. Let us put x = e - t in (2.13), and def G(t) == g(x) = g(e -t). Then (2.13) can be written G(t) = 2 1 . f ets g(s)ds, 1Cl r t E . As a consequence of the hypotheses (2.12), g(s) is the Laplace transform of the function t --+ G(t) (see introduction to the 92 and Schwartz [3J pp. 196-206; see also Widder [lJ, p. 265, theorem 19a) g(s) = L'''oo e-stG(t)dt Cl < Re s < /3. With the change of variable x = e- t , we find g(s) = too xs 1 G( - log x)dx = too xs 1 g(x) dx i.e. (2.14)30. 30 Notice (see Widder [IJ, p. 248) that in a framework more general than that of Theorem 1, if g is a function on IR+ such that I'?' X'-l g(x)dx converges absolutely for Res = (J and if 9 is of bounded variation in a neighbourhood of x E [R +, then -I f (1+iT 1 lim ---:- g(s)x -sdx = -{g(x +) + g(x - )}. T -+ 00 2nl (1 - iT 2 
32 Chapter III. Functional Transformations 5. Applications of the Mellin Transform Dirichlet's problem in a wedge Let us study the following Dirichlet problem (see Davies [lJ): find u satisfying (2.15) a 2 u au a 2 u (i) r 2 L1u = r 2 a;: + r or + oe 2 = 0, 0 < r < 00, - a < e < a, .. { 1 (11) u(r, + a) = g(r) = 0 o < r < 1 r > 1 . 1. 4 where a IS a given angle 0 < a < 2 1t (see FIg. ). o 9=Ct. 8= -(1 Fig. 4 We have seen in Chap. II, that in the case of an unbounded open set, we must have conditions at infinity on the unknown u to have uniqueness of the solution of the Dirichlet problem. We suppose that when r --+ 00, u(r,O) 1'-1 r- P with {3 > 0 and that in addition, u(r, 0) is bounded for r --+ O. The Mellin transform with respect to r of the function u, say U, exists then 31 and we have (2.16) (i) [ dd;2 + S2 ] U(s, e) = 0, - a < e < a (ii) U (s, a) = U (s, - a) = M g(s) with (2.1 7) i 1 1 Mg(s) = r s - 1 dr = -. o s 31 At least for Re S E JO, P[. 
2. The Mellin Transform 33 We deduce from this (see (2.2) et seq.) U (s, 0) =  . cos(sO) . S COS(Sli) (2.18) To calculate the inverse Mellin transform of U(s, 0) we shall utilise a certain number of preliminary results, the final result being (2.34) below. Proposition 1. Let f(x) be a function of (0, + (0) in  possessing a Mellin transform (Mf)(s) for s E {s E C; Re s E JS1' S2[}. We suppose that: (2.19) f(x) is extended to a holomorphic function F (z) in a sector largzl < {3, o < {3 < 1C of c. (2.20) lim ZS F(z) = 0 for Res E Ja, b[ c JS1, S2[ and 32 largzl < {3. Izl....oo (2.21) lim ZS F(z) = 0 for ReSEJa, b[ c JS1' S2[ and 32 largzl < {3 for z....o each sub-interval Ja, b[ ofJs1' S2[. Then for each 0 sucb that 101 < {3 { M r [ReF(re i6 )](s) = M(f)(s).cossO,33 (2.22) Mr[Im F(rei6)](s) = - M(f)(s). sin sO, s such that Re s E]S l' S2 [. Proof of Proposition 1. Let us integrate the holomorphic function F(z) along the contour re = r 1e U r 2e U C U C e shown in Fig. 5. e <  0 E r 2£ R Fig. 5 We have f F(z)zS - 1 dz == 0, Fe 32 Uniformly for largzl < {3. 33 M r denotes the Mellin transform with respect to the variable r. 
34 Chapter III. Functional Transformations therefore f.R F(re i9 )(re iO y-l eiO dr = f.R f(x)x S - 1 dx + t F(Z)ZS-l dz - f F(z)zS-ldz c/; f.R F(re i9 )r S - 1 dr = e - is9 {f.R f(x)x S - 1 dx + f F(z)zS-ldz - f F(Z)ZS-ldZ } . C. c/; Let us make R tend to infinity and G tend to zero and let us use the hypotheses (2.20) and (2.22). The integrals on C and C 8 then tend towards zero and we obtain the desired formulae (2.22). D Lemma 1. We have (2.23) M C  x }s) = si:ns for 0 < Re s < 1 Proof Let us integrate the function ( e - i 1t z)S - 1 j(z) == 1 + z ' Re(s) E JO, 1 [ along the contour described by Fig. 6. r == C 1 U C R U C 2 U CEo C R Fig. 6 
2. The Mellin Transform 35 The theorem of residues implies that tf(Z)dZ = 2ni(Resf)( - 1), (2.24) where (Re sf)( - 1) is the residue of f at the pole z == - 1. Letting R  00 and 8  0, we see that the integral off on C Rand C e tends to zero. In effect f f 2n - a f(z)dz == f(Re iO ) i Re i8 de C R a from which r f(z)dz  r 21t Rlf(Re i6 )ld8; J C R J 0 we then use the fact that lim Rlf(ReiO)1 == 0 when Re(s) E JO, 1[. We proceed R-+oo in the same way for the integral on C e . On passing to the limit in (2.24), we obtain: (2.25) LX) f(x)dx - L'" f(e 21ti x)dx -= 2ni(Resf)( - 1); now (remarking that f is a multivalued function) 2 . (e - i n x) s - 1 _ (e i n x) s - 1 . X s - 1 f(x) - f(e nl x) == == 2i SIn sn 1 ' 1 + x + x from which (2.26) roo X S - 1 dx == 2in (Resf)( - 1) J 0 1 + x 2i sin (sn) n sin (sn)' since (Re sf)( - 1) == 1. D Remark 2. The preceding method of calculation generalizes easily to that of calculating the Mellin transform of a meromorphic function g(z), bounded at infinity and with no pole on the positive real axis. If z 1, Z2, . . . , Zk are its poles, we obtain (2.27) n k M(g)(s) = . ( ) L (Re sf) (Zj), SIn n s j = 1 where f(z) == (e- in z)S-l g (z). D 
36 Chapter III. Functional Transformations Application to the calculation of the inverse transform of the function U defined by (2.18): Let us apply the Proposition 1 with M (f)(s) = 1 ; we must find s cos srx f == M - 1 ( 1 ) and verify the conditions (2.19), (2.20), (2.21) of the Pro- s cos srx position 1. According to the inversion theorem,f(x) exists. To calculate f(x) we shall proceed by successive reductions: from (2.23) and (2.5), we have (2.28) M( 1 ; x 2 }S) = 2sintns) ' Re(s) E ]0, 2[ and from (2.9) (for Re s E JO, 1[): M(f" 1 t t 2 }S) = M C : x2 )(1 + s) = 2ssin{t:(S + l)} ' which is equivalent to M ( - 2 1 n - Arctgx ) (S) = 2 n 1 · S cos (-ins) Setting p == nj(2rx), (where rx is the angle of the wedge), we have from (2.5) (2.29)M G n - Arctgxp}S) =  2()C:s()' sEC'ReSE]0'p[, 1 n Now -- == rx so p2 ' (2.30) 2 M ( !n _ Arctgxp ) (S) = 1 . n 2 s cos (srx) In inverse form we have (2.31 ) M -1 ( 1 ) (x) == 1 s cos (srx) We use Proposition 1 with 2 - - Arctg x p . n 2 1 F(z) == 1 - n Arctgz P , IArgzl < In. We are able to verify the conditions (2.20), (2.21) with the equality: (2.32) 2 1 zP + i 1 - -Arctgz P == - -Log n i n zP - i' 1 IArgzl < 2 n , 
2. The Mellin Transform 37 in choosing the branch of the logarithm for which Arctg x E ]0, t n[, x  O. It follows then as a consequence of (2.22) and (2.31) that (2.33) u(r, (}) = Ms-l (U(s, (})) = Re (1 -  Arctgr P e ip6 ), from which finally, using (2.32), we have (2.34) u(r, e) == 1 1 A 2rP cos pe 1 . f [ 1] rct g 2 rEO " - n l-r P 1 A 2rPcospe 1 . f 134 rctg r > - 2 1 ' n r P - n where p == 20: . D We have thus verified by this example that even if the boundary value (g on aO) is discontinuous (see (2.13)), the solution of the nonhomogeneQus Dirichlet problem, on 0, admits a re oo (and even analytic) solution in the interior of O. We are able to verify that the obtained solution (2.34) of the problem (2.15) is a solution of local energy Ec(u), finite about the origin, where def 1 f (2.35) Ec (u) == _ 2 I Vul 2 dx KnQ for each compact set K of [R2. This condition, useful in electromagnetism (see Roubine [1] and also Chap. II,  8.7) is called Meixner's condition. D Remark 3. There is not, in general, uniqueness of the solution of the problem (2.1) without supplementary conditions at infinity and at the origin; in the present case of an angular sector, under the supplementary conditions { (i) u(r, e) 1"'.1 r - P for r  00, (with f3 > 0), (ii) u(r, e) bounded for r  0, (2.36) we are going to show (result already announced above) that there is uniqueness of the solution of the problem (2.1) if it exists. We shall equally show that there is uniqueness of the solution of the problem (2.1), (if it exists), under the conditions 34 We thus verify that u satisfies the required conditions at the start: u(r, 0) '" r - P for r  CIJ with f3 = p and u(r, 0) bounded for r  O. 
38 Chapter III. Functional Transformations (2.37) (i) u(r, e) bounded for r  00, (ii) u(r, e) has finite local energy at the origin, i.e. that there exists a compact neighbourhood K of the origin such that (2.35) is finite. Proof (1) To study the uniqueness of the solution of(2.1), we must find if (by taking the difference of two solutions), the problem (2.38) { ) L1 u == 0 in Q, (11) U 10Q == 0 admits a non-null solution. As a consequence of the invariance of this problem (2.38) by the homothety of the centre 0, we need consider only the solutions of (2.38) such that 35 (2.39) u (r, e) == r S cp ( e), where s is a real constant and cp a function of e (real or complex). Substituting (2.39) into (2.38) and passing to polar coordinates we obtain the equation 0 2 U 1 ou 1 02U 2 A + + ( s {() + {()" ) rS - 2 ' LJ U == or 2 -; or r2 oe 2 == 'f' 'f' showing that (2.40) cp" + S2 cp == o. In addition, (2.38)ii) carries over to (2.41 ) cp( - a) == cp(a) == O. The boundary value problem (2.40), (2.41) has solutions: (2.42) cp ( e) == co sse with (2.43) n s == (2k + 1) 2a ' k E 7L. 35 In effect if u is a solution of (2.38) then v(r, 8) = t" u(Ar, 8)..1. -, - 1 dA is also a solution of (2.38) with (2.39). 
92. The Mellin Transform 39 Thus, the problem (2.38) admits the solutions (2.44 ) u(r, 8) == r S cos s 8, with s given by (2.43). Without supplementary conditions to (2.1), there is no unIqueness of the solution of (2.1). (2) Let us suppose now that the supplementary condition (2.36)(i) or (2.37)(i) is imposed; then the solutions (2.44) verifying it are such that (2.45) s < o. The condition (2.36) (ii) at the origin implies s > 0, and so, only u == 0 is a solution of (2.38) if (2.36) is verified; the solution of (2.1), (2.36) is unique, if it exists. The condition (2.37)(ii) leads to f I Vul 2 dx == f [ au 2 +  au 2 ] rdrde == f r2s-1 (S2cp2 + cp'2)drde K K ar r ae K ought to be finite and hence 2s - 1 > - 1, i.e. s > o. We reach the same conclusion for (2.37) as for (2.36). o Remark 4. (1) Even if the given function g (see (2.1)) is continuous on the boundary ao, every solution of the problem (2.1) is not necessarily a solution of (2.1) in the classical or quasi-classical sense (see Chap. II, 93): the solution u of (2.1) is not in general continuous at the origin; in the particular case in which g is null on ao (see (2.38)(ii)), only the solutions (2.44) with s > 0 are continuous at the origin. (2) In the case of the problem (2.15), in which g is discontinuous at the points r == 1 of ao, we have seen that the solution (2.34) of (2.15) is not continuous at the point of its boundary r == 1, e == + (X. Thus u is a solution in a generalized sense (see Chap. II, 94, Perron's method) of the problem (2.15). o For the resolution of problems more general than (2.1), for example in a wedge or an open set with angular points, the reader should refer to Chap. VII and to Grisvard [1], [2] and Kondratiev [1]. It is thus possible to study the Dirichlet problems (2.1) which arise in electro- statics, heat conduction, elasticity, etc. . . with a "corner" geometry. The Mellin transform is used in theoretical physics to study the asymptotic behaviour of Feynman integrals (see Calan [1]) and to study diffusion ("scattering") in quantum mechanics (Velo-Wightman [1]). 
40 Chapter III. Functional Transformations 6. Table of Some Mellin Transforms (Ref. Colombo [1]) f(x) M (f)(s) cosx n r(s) cos-s 2 o < Re s < 1 SInx n r(s)sin-s 2 o < Re s < 1 e- X r(s) r(s)r(a - s) r(a) Re s > 0 (1 + x)-a o < Re s < Re a (1 + X)-l n o < Re s < 1 SIn sn e iax . 1t r(s)e 'SI o < Re s < 1 as n Log(1 + x) - 1 < Re s < 0 s SIn sn e - ax\ a > 0 1 s -a-Ir(s) 2 1 + x cos () 1 + 2x cos 8 + x 2 n cos s(} s =1= kn, k E Z SIn sn x sin () n sin s(} s =1= k n, k E Z 1 + 2x cos () + x 2 SIn sn J v(x) ( s + V )j ( V - S ) 2 s - 1 r 2" r 2" + 1 V < Re s < V + 2 For further transforms, see: F. Oberhettinger, Tables of Mellin Transforms, Springer-Verlag Berlin Heidelberg 1974 3. The Hankel Transform 1. Generalities The Hankel transform is a functional transform which diagonalises the operator d 2 1 d v 2 L - + 2 ' W ith v E . - dx 2 X dx -  (3.1) 
3. The Hankel Transform 41 That such a "simple" transform exists results from two facts: (i) If we work in the space of regular functions with compact support on J 0, + 00 [ provided with the scalar product Loo xcpljJ dx (i.e. the space L 2(0, 00, /l) for the measure d/l(x) = x dx), then the operator L is symmetric: (3.2) LOO x(Lcp )ljJdx = Loo xcp(LIjJ)dx as is easily verified; (ii) we seek then the "eigenvalues" of L, i.e. the functions w l such that (3.3) LWl = AWl and we thus arrive at the Besselfunctions (naturally, all this demands to be made precise; see,Chap. VIII and Naimark [1J). We thus result in the Hankel transform of order v defined by (3.4) £v(f)() = Loo f(x)Jv(x)xdx in which J v(x) is a solution of (3.5) so that LJ v = - J v (3.6) LJv(x) = - 2 J v(x). For v  - t, the inverse operator is expressed by the same formula and the transformationf  Yfv(f) is an isometry of the space L 2 (0, 00; xdx) onto itself. D This Hankel transform has a very rich structure and, like the Fourier transform, can be found by starting from other considerations. First of all the Hankel transform can be constructed in the body of group theory (see Miller [1J). The Hankel transform is linked to the Fourier transform of functions of n invariant under rotations; this is what permits (for example) the extension of the Hankel transform to tempered distributions on J 0, 00 [. A presentation of this connection is given below in Proposition 1. The Hankel transform ap- pears equally in the theory of the transformations of differential operators; see Lions [4J. An "analytic" presentation enters into the theory of "general transformations" introduced by Titchmarsl1 [1J, in which we seek functions k(A) such that if $I() = Loo k(x)f(x)dx then X -1 = x. 
42 Chapter III. Functional Transformations Certainly, the basic tool for the Hankel transform is that of the Bessel functions which are involved in many problems of mathematical physics, and to the essentials of which we turn in the next section. D 2. Introduction to Bessel Functions The Hankel transform involves Bessel functions; this is why in this Sect. 2, we are introducing these functions and giving several of their properties. These func- tions are useful for the resolution of Laplace's equation in cylindrical co- ordinates and will be considered again in Chap. VIII. Let us consider the differential equation in the unknown u (with complex values) (3.7) d 2 u +  du + ( 1 _ v 2 ) u == 0 X E fR\ {O} and v given in fR. dx 2 x dx x 2 One of the methods of solution of this type of equation is to seek solutions in the form of a series of the type 00 u(x) == xll I anx n , n=O a E fR. Introducing this function into equation (3.7) and equating to zero the co- efficients of the different powers of x, we obtain (a 2 - v 2 )a o == 0 [(a + 1)2 - v 2 Ja 1 == 0 [ (a + n)2 - v 2 J an + an _ 2 == 0, n  2 where the first members are respectively the coefficients of xll - 2, xll - 1, X n + 1l - 2. On supposing that a o #- 0, the first of these equations gives us that a== + v Taking a v, we obtain as one solution of (3.7) 00 (_1)n(x/2)v+2n Jv(x) == I n = 0 n!r(v + n + 1) which is called the Bessel function of the first kind (of order v). If v is not an integer, the function J -v (x) which is also a solution of (3.7) is independent of J v. To prove this, it is sufficient to calculate the Wronskian 36 of these two functions; we find that it is of the form A/x with A == - (2/n) sin vn. The general solution of (3.7) is then (3.8) u(x) == C 1 J v(x) + C 2 J - v(x), X E fR\ {O}. 36 By definition, the Wronskian of two functions q; and t/! is the function w = q;' t/! - q;t/!'. If w = 0, <p and t/! are proportional. 
93. The Hankel Transform 43 In the case when v is an integer, v == n, say, it is easy to show that 37 J-n(x) == (- 1)nJ n (x). In this case, another solution of (3.7) independent of I n may be found; it contains a logarithmic term and will not be used in what follows. Recurrence relations. Bessel functions satisfy the following relations: d v dx [x J v(x)J == XV J v - 1 (x), d dx [x - v J v (x) J - - x - v J v + 1 ( x), (3.9) (3.10) (3.11 ) d def V - d Jv(x) == J(x) == J v - 1 (x) - -Jv(x), x x v J  (x) == - J v + 1 (x) + - J v (x), x (3.12) (3.13) x x vJv(x) == 2 JV+1(x) + 2 J v - 1 (x). These relations are easy to establish by using the expression (3.8) for J v (x). I ntegral representations of Bessel functions. Let us consider the function 1 f 1 K(Z_1-) K ( x ) == - z - v - e 2 z dz v 2in c ' x > 0 where C is the contour described in Fig. 7 and v a given real number. 1m z c Re z Fig. 7 This integral is convergent and the change of variable u == t z x gives (x/2)V f - Kv(x) == . u- v - 1 eUe 4udu. 21n c This function can be extended to complex values of x and this is an integral function of x. Let us replace e -x 2 J4u by its Taylor development and integrate term by term: it becomes K ( ) == (x/2)V  (_1)k(X/2)2k f - (v + 1 + k) u d v x 2 . i..J k , u e u, In k = 0 . c 37 See Dieudonne [lJ and Watson [1]. 
44 Chapter III. Functional Transformations which gives 00 (_1)k(Xj2)v+2k K ( x ) =  v kok!r(v + k + 1)' Comparing with (3.8) we see that Kv(x) = Jv(x), i.e. 1 f 1 (z-.l) ( 3.14 ) J ( x ) = - z- v- e 2 z dz v 2in c ' We are going to deduce a second integral representation of J v (x). For that, we replace the preceding contour C by the contour C 1 of th€ Fig. 8 where 8 --+ O. Let us return to the expression (3.14) for J V' We write first of all the contribution of the circular part of C 1; putting z = e i8 , it takes the value (in the limit 8 --+ 0) x > O.  I n e- i(v + 1)8 e(eiO-e-iO) ie i8 de 2 . , In -n Rez Fig. 8 that is to say 1 I n . 8 . . 8 1 f n - e - lV e lXSm d8 = - cos(v8 - x sin 8)d8, 2n - n n ° as the integral between - nand n of sin(v8 - x sin 8) is zero. The contribution of the straight parts is obtained by putting z = exp(t - in) (for the upper straight part) and z = exp (t + in) (for the lower). It becomes (in the limit 8 --+ 0) 1 I o . [ 1 J  e- vt + lnv exp - - x(e t - e- t ) dt 2In 00 2 1 f oo . [ 1 J +  e - vt - lnv exp - - x(e t - e - t) dt 2zn ° 2 sin nv f 00 = - exp[ - xsht - vtJdt. n ° 
3. The Hankel Transform 45 Finally, we obtain for x > 0 (3.15) 1 I n sin n v I 00 lv(x) = - cos(v8 - xsin8)d8 - exp[ - xsht - vtJdt. non 0 In particular for v = n E N, we have 1 f + n . .. 1 I n . (3.16) 1 n (x) = - e - w8 e lX sm 8 d8 = - cos (n8 - x SIn 8)d8. 2n - n n 0 Application to the Fourier transform. Let fEY (1R2) be a regular function of "'- two variables (of rapid decay). Writing g = f, its Fourier transform. Suppose that the two functions f and g are expressed in polar coordinates f(x) = f(r, 8) with x = {r cos 8, r sin 8}, g() = g(p,<p) with  = {pcos<p,psin<p}. Let us decompose f and g in Fourier series (see 9 1 ) (3.17) f(r, 8) = L fn(r) e in8 nE"Z (3.18) g(p, <p) = L On (p )ein<P. nE"Z Let us seek the relations between the functionsfn and On" We have the Proposition 1. Suppose that g is the Fourier transform offwherefE Y(1R2) and let (3.17) and (3.18) be their Fourier series. Then we have the relations .n f 00 "'- 1 f.,(r) = 211: 0 gn(p)J n(r p) p dp, gn(P) = (- i)n 211: Loo J,,(r)J n(r p)r dr, where 1 n is the Bessel function of the first kind of order n. Proof By the definition of the Fourier transform we have (3.19) g(p, q» = Loo L21t e - ipr cos (9 - «J) f(r, 8)r dr d8, and by the inversion formula (see Appendix "Distributions") (3.20) f(r, 8) = (2:)2 LOO L21t e ipr cos (9 - «J) g(p, q» p dp dq>. 
46 Chapter III. Functional Transformations (3.21 ) Taking account of  1 (1.2) 1 f 21t J;,(r) = 2n 0 f(r, e) e - inO de. Substituting from (3.20) into (3.21), we obtain  1 f 21t . ( f ro f 21t . . ) in (r) = 3 e - wO e lpr cos (0 - ((J) L gm(P )elm({J pdp d<p dO. (2n) 0 0 0 mEZ Making the change of variable <p - 0 = t/J in the integral with respect to <p and making use of the periodicity of the function e ipr cos '" + im"" we obtain  1 f 21t . f 00 f 21t . . in(r) = 3 I el(m-n)OdO elPrCOS"'gm(p)elm"'pdpdt/J; (2n) mEZ 0 0 0 now f 21t ei(m - n)O de = { 0 f m =1= n. o 2n If m = n As a consequence J;,(r) = (2:)2 too t 21t e ipr cos t/I rln(P )eint/l p dp dljJ. On the other hand, putting t/J = 0 - tn, we obtain 1 f . ./,' ./, 1. 1t f 21t. . fj . fj _ 21t e,prcos'I' e'n'l'dljJ = -e- m2 e,prStn0e'node. 21c 0 2n 0 As a result of (3.16) we have J;,(r) = e ;:'i too rln (p)J -n (rp)p dp. An analogous calculation would show that rln(P) = ein'i (2n) tJ;,(r)Ln(rp)r dr. Thus the following transform is introduced naturally f -+ too f(p)Jn(rp)pdp and its inverse g -+ too g(r)Jn(rp)rdr. This leads to the Hankel transform which forms the subject of the Sect. 3 below. 
3. The Hankel Transform 47 3. Definition of the Hankel Transform Let us first of all give a formal definition which will be made precise later: Definition 1. The Hankel transform of order v E IR of the function f: IR + --+ C, is defined by (3.22) J¥'v(f)() = LX) f(x)Jv(X) x dx, where J v is the Bessel function of the first kind of order v. Example 1. Let us calculate the Hankel transform of order v of the function f(x) = x V (a 2 - x 2 )1L Y(a - x), j1 > - 1, x, a E IR + , where Y is the Heaviside function defined by Y() = { 1 if  > 0 o if  < o. After (3.8) we have 00 (_ 1)n(xl2)v+2n Jv(x) = b 'T( + + 1) , n - 0 n. v n from which we have J¥'.(f)() = f (- nn(/2)' + 2n f a x2v + 2n + 1 (a 2 - X 2 )" dx. n=On!r(V + n + 1) 0 On the other hand f: X 2v + 2n + 1 (a 2 - X 2 )" dx = a 2 (v +" + n + 1) f01 tV + n(1 - t)" dt 1 = 2 a 2 (v + fl + n + 1) B(v + n + 1, j1 + 1), where B denotes the usual Beta function which, we recall takes the value B ( ) = r(p)r(q) p, q r (p + q). It thus becomes 00 ( _ n)n(j2)v + 2n a 2 (v + fl + n + 1) r(v + n + 1)r(j1 + 1) J¥'v (f)() = n f:o n!r(v + n + 1) ' 2r(v + Jl + n + 2) 00 ( 1 ) n ( aJ: j 2 ) V + fl + 1 + 2n = r (j1 + 1) afl + v + 1  - fl - 1 2 fl L S . n = 0 n! r(v + j1 + n + 2) 
48 Chapter III. Functional Transformations Finally (3.23) Yfv(xV(a2 - X2)fl Y(a - x))() = r(/1 + l)a fl + v + 1-fl-12fl JV+JL+ l(a). In particular, we have: (3.24) aV + 1 Yrv[X V Y(a - x)]W =  J v + da), x, , a E IR + . o We shall see the consequences of these calculations later. On the other hand, we shall give at the end of this part (Sect. 7 below) a table of the principal Hankel transforms. 4. The Inversion Formula In this section we shall exhibit a large class of functions which possess a Hankel transform and give a reply to the following question: if gv is the Hankel transform ofafunctionf, isfthe Hankel transform ofgv? The reply will be positive for the functions of this class: we shall have Yfv 0 Yfv = I. We shall give a sufficiently general theorem to cover this problem; the proof of this theorem is long and, in the end, of little importance for those whose interest is in applications in respect of the result itself. This is why we give only some applications of the theorem, referring the reader to Sneddon [1] for the complete proof. Theorem 1. Let f: IR + --+ IR be a function such that x --+ xi f(x) belongs to L 1 (0, + (0). Then for each real number v satisfying v > - t, Yfv (f) defined by (3.22) exists and f oo 1 o Yrv (f)()Jv(x) d = :2 [f(x +) + f(x - )], where f(x +) [resp. f(x -)] is the limit at the right [resp. at the left] off at the point x in each point x in the neighbourhood of which f is of bounded variation. Corollary 1. Let f: IR + --+ IR be a continuous function at the point Xo and such that x --+ xi f(x) EL l (0, + (0). Then for each real number v satisfying v > - t, Yfv(f) defined by (3.22) exists and LX> Yrv(f)()Jv(xo) d = f(x o ). Let us go back to the case of Example 1 38 . We have seen that Yf v (x v (a 2 - X2)fl Y(a - x))() = r(/1 + l)a JL + v + 1  - fl- 1 2 fl J fl + v + 1 (a), 38 See Davies [1]. 
3. The Hankel Transform 49 and we can apply Corollary 1 to the function whose Hankel transform we have just calculated. This gives (for x =1= a, x > 0, v > - t, Jl > - 1: f oo -I-L _ x V (a 2 - X2)1-L Y(a - x) o  J " + v + d a)J v (x ) d - r (Jl + 1) a" +v + 1 2" ' or again, on setting a = Jl + v + 1, we obtain: f oo ;;-a +v + 1 J ( a;; ) J ( x;; ) d;; = x V (a 2 - x 2 )a-v-1 Y(a - x) . o  a S v  S r (a _ v )aa 2 a - v - 1 For example for a = 1, v = 0 we have: f oo Y(a - x) J 1 (a) J 0 (x) d = , o a x, a,  E IR + , X =1= a. Example 2. We shall prove that o (3.25) [ v ] fixve-ax :Yfv (a 2 + 2 r + 3/2 (x) = . ( 3 ) , a > 0 2V + 1 ar v + - 2 wi th a, , x E IR + and v > t . Because of Corollary 1, it is enough to calculate the integral F(a,) = too XV + 1 e -ax Jv(x)dx = :Yfv(XV e ax). For this, let us use the definition 00 (_ 1)n(xj2)v+2n JJx) = f: Ir( + + 1) ' n-O n. v n to obtain 00 ( 1 ) n ( ;; j 2 ) v + 2n f 00 F(a,) = L S x2v+2n+1e-axdx. n = 0 n!r(v + n + 1) 0 I t is easy to see that fo oo x2v+2n+le- ax dx = r(2v + 2n + 2)a-2V-2n-2, and hence that 00 ( - 1 ) n ( ;; j 2 ) v + 2n r ( 2v + 2n + 2 ) F (a,) = L  a - 2 v - 2n - 2. n=O n! r(V + n + 1) Now, we have r ( 2v + 2n + 2) ( 3 ) = n -1/2 2 2v + 2n + 1 r v + n + _ 2 . r(V + n + 1) 
50 Chapter III. Functional Transformations It hence becomes F(a,) = f (- ,It (/2t + 2nn-1/222v + 2n + 1 r ( v + n +  ) a-2V - 2n - 2 n=O n. 2 gIvIng F(a,) = a2v-2v2v+ In-112 f [-(/a)2rr( + n + 3/2) . n = 0 n. From the binomial formula (1 - x)-a = f r(a + n) x n Ixl < I" n=O n!r(a) , we, deduce that F(a,) = a-2V-2v2v+ln-1/2r(v + )(l + (/a)2)-v-3/2, and finally that F (a,) = n - 1/2 2 v + 1 r (v + ) a C (a 2 +  2) - v - 3/ 2 . From this we obtain (3.25) on going back to the definition of F(a, ). 5. Properties of the Hankel Transform First of all, let us quote two simple relations which are easy to establish. Let a > 0 and letf:  +   be a function satisfying (for example) the hypotheses of Theorem 1. Let us denote by fa the mapping x  f(ax). Then (3.26) 1 Yfv(fa)() = 2 Yfv(f)(/a), a  E +. Also: (3.27) v G f}) = ;v {v-l(f)() + v+1(f)()}, EIR+,V > 1 2' this formula resulting from the recurrence relation (3.13) connecting the J v . The Hankel Transform and Differentiation. Suppose that f is a function of class 1 possessing (together with its derivative I') a Hankel transform. What relation exists between Yfv(f) and YfV(f')? We have 
3. The Hankel Transform 51 (3.28) Yr v(f')() = LX) f' (x)J v(X) X dx f oo d = [f(x)x Jv(x)]g' - 0 f(x) dx (xJv(x))dx. First of all, the quantity xtJv(x) is bounded in the neighbourhood of the origin (for v > - !), from (3.8) andfbeing supposed continuous at zero, we have lim xf(x)Jv(x) = o. xo On the other hand, the quantity ft J v(t) is bounded (see, for example, Whittaker- Watson [1]) and we can write 1 C Ixf(x)Jv(x)1 = fi fi If(x)IRIJ'(x)1  fi filf(x)l, x,  E IR + . It is sufficient for the vanishing of the bracket In (3.28) therefore that lim fi f(x) = o. We have in this case x 00 f oo d Yrv(f')() = - 0 f(x) dx (xJv(x))dx. U sing the recurrence relations concerning J v we write d x dx (xJv(x)) = 2v [(v + 1)J v - 1 (x) - (v - l)J v + 1 (x)]. We deduce finally the formula (3.29) Yrv(f')() = - { v  1 Yrv _ 1 (f)() - v  1 Yrv +1 (f)W },  E !R + . Proceeding as above, that' is to say integrating by parts and supposing that the functionf is sufficiently regular and is such that all the brackets appearing in the calculations are zero, the reader will be able to verify the following fundamental formula (3.30) Yrv(f" + : f' - :: f)W = - e Yrv(f)(). ( d2 1 d v 2 ) The Hankel transform diagonalises the operator  + __ d - 2 . We shall dx x x x see that the Hankel transform (for suitable v) simplifies the study of Laplace's 
52 Chapter III. Functional Transformations equation in polar coordinates in {R2. In effect, let us solve Laplace's equation in {R 2 : L1u = f, where f is known in polar coordinates in the form f(r, 8) = I l,(r) e in8 . nEZ Let us put in the same way u(r, 8) = I un(r) e in8 . nEZ We have seen in 9 1 that Laplace's equation can be decomposed into the differential equations 1 n 2 "'- Ail A, A f, Un + - Un - 2 Un = n. r r Taking the Hankel transform of order n of this equation we obtain, from equation (3.30), the equations 1 "'- Yl'n(un)() = - 2 Yl'nCf,,)(), which by the inverse Hankel transform of order n gives the transforms Un' Remark 1. Let us notice the analogy between the use of the Fourier or Laplace transforms to resolve def Lv = - v" = g, and the use we shall make of the Hankel transformation to resolve: def 1 v 2 Lvv = - v" - - v' + 2 v = g. r r This is linked to the properties of the kernels of the two transformations: the kernel G = e iax of the Fourier transform (or G = e ax for the Laplace transform) satisfies - LG = a 2 G' , the kernel G v = J v (ax) satisfies LvGv = a 2 G v (the functions J v (ax) are generalized eigenfunctions of Lv 39). 39 See Chap. VIII. 
3. The Hankel Transform 53 Parseval's Relation. The following result is fundamental: Theorem 2. F or v  - t, the mapping f --+ ytvf (3.31 ) is an isometry of L 2 (0, 00; xdx) onto itself. Stated otherwise: iff and 9 are in L 2 (0, 00; xdx) we have f: f(x)g(x) x dx = f: Yt'J(x)Yt'vg(x)x dx. By density, it is sufficient to show this formula to be true for functions f and 9 which are regular and have compact support; the result then follows from Theorem 1 (for v > -t). The reader will find a proof in Titchmarsh [1], Chap. 8, Sect. 8.18. Example 3. Let us take, for example, f(x) = XV Y(a - x), g(x) = XV Y(b - x), a, b > 0, v  1 "2. We have seen previously that aV+ 1 Yt'v(f)() = TJV+l(a), bV+ 1 Yt'v(g)() = TJV+l(b). (3.32) Parseval's relation then gives (abt + 1 f co Jv+ 1 (a)Jv+1 (b) d = (min (a, b))2v+ 2 . o  2v + 2 D 6. Application of the Hankel Transform to Partial Differential Equations The Hankel transform is well suited to problems in 1R3 having symmetry about an aXIS. 6.1. Dirichlet's Problem for Laplace's Equation in R. The Case of Axial Symmetry We set ourselves the problem of solving: (i) L1u = 0 in Q = IRt = IR X1 x IR x2 x IRx; (ii) u(x 1, x 2 , 0) = g(r) (for X 3 = 0) with r = (xi + X)1/2 (iii) lim u(x 1 , x 2 , x 3 ) = 0 X3  00 (3.33) 
54 Chapter III. Functional Transformations in which u takes real values and in which g is a given real function of r. Taking account of the invariance of the data under rotations about the axis OX 3 let us adopt cylindrical coordinates r, 0, Z such that Xl = rcosO, X Z = r sin e, X 3 = Z (r > 0, e E (0, 2n), Z > 0). Equation (3.33)i) becomes 8 z u 1 8u 1 8 z u 8 z u (3.34) Au = 8r z + -;: 8r + rZ 80 z + 8Z z = O. As a result of the invariance of the data in (3.33) under rotation about OX 3 let us seek the solutions independent of 0, from which. 8u 80 = 0 8 z u 80 z = O. The problem (3.33) becomes: find a (real) function u(r, Z) on  + X  + satisfying 8 z u 1 8u 8 z u (i) 8r z + -;: 8r + 8Z z = 0 (ii) u(r, 0) = g(r) (3.35) (iii) lim u(r, Z) = O. zoo As we have seen Chap. II, the condition at infinity (3.33) (iii), [or (3.35 (iii)J is necessary to ensure the uniqueness of the solution. In addition to guarantee the interior regularity of the solution of (3.33) we shall need to have that lim u(r, Z) exists for Z > o. rO Let us take the O-order Hankel transform of (3.34) which can be written (on denoting by u and g the Hankel transforms of u and g), (3.36) : - 2U = 0, which yields, with (3.35) (ii) and (iii) u ( , Z) = g ( ) e - z;; . The inversion formula then gives (3.37) u(r, Z) = f: g() e - z J o(r) d. Example 4. sIn r g(r) = - r f oo { (1 - Z) -1/Z g() = 0 J o(r) sin r dr = 0 0<<1  > 1. 
93. The Hankel Transform 55 The solution of the problem (3.33) is thus u(r,Z) = I e-Z(1 - 2)-1/2Jo(r)d. 6.2. Boundary Value Problem for the Biharmonic Equation in R, with Axial Symmetry We wish to treat the problem (3.38) (i) 11 2 u = 0 In [R (ii) U(Xl' X 2 , 0) = g(r) on IR ( ... ) au ( ) 111  Xl' X 2 , 0 = 0 on uX 3 alR (iv) lim U(X l , X 2 , X 3 ) = 0 X3  00 (with the notations at the beginning of the Sect. 6.1). As in the Sect. 6.1, we pass to cylindrical coordinates (r, 0, Z) and look for a solution u(r, Z) of the equation (3.39) ( :r 22 +  :r + a2 ) ( :r 22 +  :r + a2 )u = O. (as a result of (3.38) (i)). On taking the zero order Hankel transformation of (3.39) and (3.38) (ii), (iii), (iv) we obtain (3.40 ) ( a2 - 2)2 U = 0 u(, 0) = g() au az ('O)=O lim u(, Z) = 0 zoo for Z  0 4 °, of which the solution is u(, Z) = g()(1 + Z)e-z. Example 5. Let us consider the case when the given boundary value is g(r) = (1 + r 2 )-t; in this case g () = yt ° (g) () =  - 1 e -  (see table below) so that u ( , Z) =  - 1 (1 +  Z) e - (Z + l) 40 Denoting by ii, 9 the Hankel transforms of order zero of u, g. 
56 Chapter III. Functional Transformations from which u(r, Z) = f (1 + Z)e-(z+ l)Jo(r)d u(r, Z)= J'to(-! e-(z+ l)) + ZJ'to(e-(z+ 1)). Finally (see table) r 2 + (Z + 1)(2Z + 1) u(r, Z) = [(Z + 1)2 + r2 ]3/2 . We verify that u(r, Z) is analytic, real in the half-space Z > 0 and we verify also that the decay condition at infinity is such as to ensure that the inverse transform exists. o Remark 2. The Hankel transform and the Fourier transform in IRn. In the case in which a function F(x), (x E IRn) is invariant under rotation in IRn let "F(x) = F(r)" \Ix, (r = Ixl); the Fourier transform P can be written as a function of quantities invariant under rotation r = Ixl and p = I yl, (y being the variable conjugate to x, y E IR n)  (2n )n/2 f oo n F(p) = F(r)r 2 J E--l (pr) dr 41 . n 2 p2-1 0 The transformation r1-1F(r)  (2n)-1p1- 1 pep) is in effect the Hankel trans- formation of order t n - 1 42 . The preceding problem correspond to n = 2 i.e. to the order o. Remark 3. Numerous properties of Hankel transformations of integer order may be deduced directly from the properties of the Fourier transformation in 1R2 (see Appendix "Distributions"). In effect (see Proposition 1) each function (or each tempered distribution) u on 1R2 of the form (in polars) u(r, 0) = In e - inO fer) has for Fourier transform (in polars) ff u(p, qJ) = u(p, qJ) = (- i)n e - inlp J't n u(p). (i) The formula ff ff u(x) = (2n)2 u( - x) leads to the reciprocal formula J'tn(J'tnf) = f(see Theorem 1). 1 (ii) Parseval's relation II u II i2(2) = (211:)2 II u II i 2(2) leads to: f If(r)1 2 rdr = f I£J(p)12p dp 4i See Schwartz [IJ, p. 259, and Appendix "Distributions" for the definition of the Fourier transformation used here. 42 The transformation F  F is called Hankel-Bochner in Wolf [1]. 
Review of Chapter IlIA 57 (see Theorem 2) for eachfE L 2 (0, + 00; rdr), hence YfnfE L 2 (0, + 00; pdp). (iii) If rf EL l (0, + (0), then U EL l ([R2), so U E rcO([R2), and U  0 for p  + 00; hence YfnfE rcO([O, + oo[) with Yfnf(p)  0 for p  00 and for p  o. Under the hypothesis of Theorem 1: ,-t f E L 1 (0, + (0), since I r t J n(r)1  K, K ( f OO 1/2 ) K l£nf(p)1  JP 0 r f(r)dr = JP IIfll£,(o,+ oo,JY<fr)' We then deduce only that Yfnf E rcO(]O, + oo[) with Yfnf(p)  0 for p  + 00. o 7. Table of Some Hankel Transforms (Ref. Colombo [1J) f(x), x > 0 Yfv(f)( ),  > 0 x V (a 2 - X2)1l Y(a - x) a > 0 r (Jl + 1 )a Il + v + 12 1l  - Il - 1 J Il + v + 1 (a) v > -1 Jl> -1 X S - 1 (_ v-I < s < v + 1) 2S-S-1 r«v + s + 1)/2)/r«v - s + 1)/2) x v-I e - ax a > 0 2 v n-' r(v + DV(a2 + 2)-V-1/2 V > 0 x v e - ax a > 0 2v+1n-1/2r( v + )a.(a2 + 2)-v-3/2 V > -1 X-I e - ax a > 0 (a 2 + 2)-1/2(  )' V > - 1 a + Ja 2 + 2 e- ax a > 0 v(a2+2)-3/2{a + v J a 2 + 2}.{a + J a 2 + 2}-V C Y+1 v > - 2 x 2 xVe-;;;- a * 0 "2 a 2 ve-ta22 v > - 1 SIn ax t2 - :2)-t, O<<a - a > 0 v = 0 x  > a Review of Chapter IlIA The problems which we have treated in 991, 2, 3 are of the following type: to find U satisfying { P(D)U = f in 0 c [Rn, with boundary conditions on cO where P(D) is a linear differential operator with constant coefficients. 
58 Chapter III. Functional Transformations By a suitable change of variable, this problem is reduced to the following problem: (1) Case of Fourier Series. To find u' satisfying { P' (D)u' == f' in [Rd x Qo, ;.vith d condtions o n the boundary of Qo and periodic conditions In [R (perIod a - (a 1 , . - . ad)). We suppose in addition that P'(D) is invariant under a translation a as well asf'. Then the use of Fourier series diagonalises the part of the operator P' (D) acting on [Rd. (In the examples treated in 91, the operator P' (D) is completely diag- onalised and the problem reduces to the calculation of Fourier series). (2) Case of-the Mellin transform. To find u' satisfying { P'(D)u' == f' in [R+ x Qo with conditions on the boundary of [R+ x Qo- We suppose that P'(D) andf' are invariant under homothetic transformations in [R +. Then the Mellin transformation diagonalises the part of the operator P' (D) acting on [R +. (3) Case of the Hankel transform. To find u' satisfying { P'(D)u' == f' in [Rd x Qo with conditions on the boundary of [Ra x Qo. We suppose in addition that P'(D) and f' are invariant under rotations in [Rd_ The Hankel transform applied to the above equation diagonalises the part of the operator acting on [Rd. (In the examples of 93, we have d == 2 and Qo == [R+). 
Part B. Discrete Fourier Transforms and Fast Fourier Transforms 1. Introduction Suppose thatf is a 2n-periodic function with complex values. Its development in a Fourier series is given by (1.1) f(x) = L cje ijX . jEZ From the orthonormality of the functions e ijx for the scalar product 1 f 21t (h, g) = 2n 0 h(x)g(x) dx there results the expression (1.2) c j = (f, eijX) = ;n J:" f(x)e-ijX dx for the coefficients C j. Let Sn be the finite-dimensional space generated by the (2n + 1) basis functions ( ijX ) e -njn. The function};, E Sn, obtained by truncating the development (1.1) to .order n is thus the best approximation to f in Sn (in the sense of the L 2- norm ). It is difficult to calculate since we must evaluate integrals to calculate the coefficients c j . On the other hand, the function gn of Sn of the form +n gn(x) == L aje ijX j= -n which coincides withf at the N = (2n + 1) points 2kn Xk == N' o  k  N - 1, is such that a j satisfies the relations (1.3) n L a je ijXk = f(Xk) j= -n 
60 Chapter III. Functional Transformations (the function gn is in fact the interpolate off in Sn); these relations can be written in the form N-l L ale2i1tkl/N = Zk 1=0 def def with Zk == f(xk) and a l == a j for I = j (mod N). On using the relation (1.4) 1 N-l - L e 2i1t (I- j)k/N N k=O { I if I = j <5 I } . = 0 otherwise we conclude that (1.5) 1 N-l .. a. = - " e- 2l1t }k/N z } N  k, k=O o  j  N - 1. To begin with we make a certain number of remarks which will not be appropriate in the sequel. Remark 1. Here, we have N = 2n + 1. The case N even corresponds to the choice of Sn as the space generated by ( e i j X) j , which has dimension 2n. -n+ljn o Remark 2. The coefficients a j and c j are related by the formula (1.6) a j = L C j + 1N , fEZ Ijl  n. In effect, for /l E 7L we can write /l = v + IN where Ivl  nand I E 7L' , as a consequence f(x) = L cfle iflX = L L cv+1Nei(v+1N)X, J1EZ fEZ Ivl  n so Zk = L L Cv+1Neivxk fEZlvln since NX k = 2kn. We deduce (1.6) by using (1.5) and the relation (1.4). o Remark 3. Estimate of the error for II f - gn II. Let us define the norm: ( ) 1/2 Ilflls = ,L (1 + j2)Sl c j l2 JEZ 
91. Introduction 61 (this is the norm of the Sobolev space H(O, 2n) of periodic functions defined on the interval }O, 2n[). We then have for 0  m  s the majorants Ilf - fnllm  C 1 n m - s Ilflls, Ilf - gn 11m  C 2 n m - s Ilflls where C l' C 2 are positive constants independent of n and off. These results have been established by Kreiss-Oliger [1J in the case m=O and by Canuto-Quarteroni [1J in the general case. o Remark 4. The numerical quadrature formula 1 f 21C 1 N - 1 _ 2 f(x)dx  - L f(x k ) noN k=O (1.7) is exact for fES N - 1 S2n. We see that if we evaluate the coefficients c j by the formula (1.2) with the help of the integration formula (1.7), we obtain in effect the coefficients a j . o Definition 1. We call the mapping eN  eN which associates the (a j )f=-ol with the (Zk)k=O,..., N-l according to the formula (1.5) the discrete Fourier transform. The inverse transformation can be expressed In an analogous and even simpler form, since (1.8) N-l Zk = L ajw j \ j=O where w = e 21Ci / N is the principal root of order N of unity. We see that the calculation of the Zk with respect to the aj, and reciprocally, implies a priori O(N 2 ) operations (i.e. complex additions and multiplications) since it is enough to carry out the product of art N x N matrix by a vector. In fact, we shall see two different ways which enable us to reduce this number to O(N log2 N) from which we derive a considerable saving of computing time when performing calculations involving large values of N. This is what we call the fast Fourier transform. The first, studied in 93, is that due to Cooley and Tukey and applies in the case in which N = 2 m (or 3 m ). The second, studied in 4, is that of Good and Winograd and concerns the case where N is a product of factors which are relatively prime. Both rest on the principle of the acceleration of the product of a matrix by a vector, when the vector satisfies certain decomposition properties (2). We study in 5 principles of algebraic complexity which permit the acceleration 
62 Chapter III. Functional Transformations of the Good-Winograd transformation and thus render it competitive with that of Cooley- Tukey. We study the fast Fourier transformation in two dimensions in 9 6 . Finally in 9 7 we give some applications. 2. Acceleration of the Product of a Matrix by a Vector Just as we have seen in 91, the problem of the discrete Fourier transformation (direct or inverse) comes down to evaluating the product of an N x N matrix by a column vector with N components. Let a == Bz i.e. choosing 0 and N -1 in place of 1 and N as bounds of variation for the indices: N-l a j == L bjkz k , k=O o  j  N - 1. If the matrix B were arbitrary we should expect 0(N 2 ) operations, or more precisely, N 2 multiplications and N 2 additions. We shall show that we can carry out the product of the matrix B by an arbitrary vector z in many fewer operations, on condition that the matrix B satisfies some decomposition properties. For pE N, we denote by I(p) the set def I(p) == {O, 1, . . . , p - 1} of integers less than p. Let us suppose that N == n 1 x n 2 x ... x n r is a product of r factors; these decomposition properties will, in an essential way, make use of the fact that the product set I (n 1) x I (n 2 ) x ... x I (n r ) can be put in bijection with I(N). An example of such a bijection is in the case in which N == lor and n 1 == n 2 == . . . == n r == 10, the mapping which, with a number of r digits, associates the r digits of its decimal expansion: for example 983 -+ {9, 8, 3}. Let us observe besides that the procedure is not unique since we can agree to put them in a different order: For example: 983 -+ {3, 8, 9}. 
9 2 . Acceleration of the Product of a Matrix by a Vector 63 Thus let N = n 1 x n 2 x ... x n r . Then we can be given a bijection f: l(n 1 ) x l(n2) x . . . x I(n r )  I(N) {j l' j 2, . . . , jr}  j = f(j l' j 2, . . . , jr)' In the general case we can equally well be given a second bijection g :1(n 1 ) x l(n 2 ) x ... x I(n r )  I(N) { k l' k 2, . . . , k r }  k = g ( k l' k 2, . . . , k r ) which does not necessarily coincide with f We suppose that there exist na x na matrices B(a), a = 1, 2, . . . r whose elements are denoted by bj:C, ja, ka E lena) and such that (2.1 ) r b Ok = bl k ) X b l 2 k ) X . .. X b l r k ) = TI b l a k ) J 111 22 rr /Xa a=l when j = f (j l' . . . , j r ) & k = g (k l' . . . , k r ). Proposition 1. If B enjoys the decomposition property (2.1), then the product of the matrix B by a vector z with N components can be brought about by N(n 1 + n 2 + . . . + n r ) additions and as many multiplications. Proof Let us put z(r) = Z k k 1 , k 2 , 0 0 0 , k r for k = g(k 1 , . . . , k r ); z(r) is thus a list with r entries equally of dimension nIx n 2 x . . . x n r . By recurrence, suppose that we know z(a) (a E N * fixed, a  r), a list of r entries equally of dimension n 1 x n2 x . .. x n r . We shall calculate z(a - 1) by the following formula na - 1 (2.2) z(a-1) 0 0 =  ba) z(a) . . . k 1 . . 0 , ka-l1a, 0 0 0 ,lr i...J laka. k 1 ,. 0 . , kal a + l' . . . , lr' ka = 0 in other words, k 1 , . . . , k a - 1 , ja+ 1, . . . , jr being fixed in l(n 1 ), . . . , l(n a - 1 ), l(n a + 1)' . . . , l(n r ), it is enough to calculate the product of the matrix B(a) by a vector with na components which accounts for n; additions and as many multiplications. Taking everything into account, the calculation of z(a - 1) starting from z(a) costs (n 1 x . .. x na _ 1 X na + 1 X ... x n r ) x n; = N na additions and multiplications. We remark then that if _ (0) a o - z. . J 11, . 0 . , Jr 
64 Chapter III. Functional Transformations for j = !(jl, j2, . . . , jr), then (2.3) N-l a j = L bjkZ k k=O after (2.1). The calculation of a from Z accounts therefore for N(n 1 + n 2 + . . . + n r ) additions and as many multiplications. o In the condition (2.1), the matrix B is in fact a tensor product B(a). The limiting case N = 2 r shows therefore that we can hope in this favourable case reduce the number of operations to 2N log2 N. Remark 1. The preceding result is valid if the coefficients of the matrix B are of the form (2.4) r b. = TI ba). . k J JI1.,JI1.+1,...,Jr'l1. a=l for j = !(jl,'" ,jr) and k = g(k 1 ,..., k r ); it suffices in effect to replace (2.2) by (2.5) Z(a-l) _ k 1 ,...,kl1.-1'jl1.'....'jr - nil.  ba) . . z(a) . . f..J JI1.,JI1.+1, . . . ,Jn k l1. k 1 ,..., kl1.,JI1.+1" . . ,Jr' kl1.=O There again the number of operations to calculate the N quantities z(a-l) is Nn from which we deduce the total number of O p erations is a' N (n 1 + n 2 + . . . + n r ). In this case B is no longer a tensor product of na x na matrices. 3. The Fast Fourier Transform of Cooley and Tukey Let N = n 1 x n 2 x . .. x n r be.given and let w be the principal root of unity of order N. We recall (see  1) that if Z E eN is given, its discrete Fourier transformation is the vector a E eN such that 1 N-l . a. = -  w - Jk Z J N f..J k, k=O o  j  N - 1. We shall show that the matrix B whose elements are given by 1 'k b jk = N w - J (3.1) satisfies the decomposition property (2.4). 
3. The Fast Fourier Transform of Cooley and Tukey 65 For that, we choose the bijections f and g in the following way: Let def ma = nix ... x na - 1 the product of the first (a - 1) factors of the number N (we agree to take ml == 1); we put r (3.2) g(k 1 ,..., k r ) == L kama (with ka E I(na) for a == 1,. . . , r). a=l On the other hand, let def m = na + 1 X . . . x n r the product of the last (r - a) factors of the number N (we agree that m == 1); we put r (3.3) f(jl'... ,jr) == L jam (withjaE1(na) for a == 1,. . ., r). a=l Proposition 1. With the choice (3.2), (3.3) of the bijections f and g, the matrix B defined by (3.1) (thus with its inverse defined by (1.8)) satisfies the decomposition property (2.4). Proof The proposition follows from the fact that w N == 1 and that w P == w q when p = q (mod N). (Congruence properties thus play an essential role). In effect, for j == f(j l' . . . , j r) k == g (k 1, . . . , k r ), we have, in making use of (3.2) r jk == L ka(jma) a=l and from (3.3) jma == (j 1 m'l + . . . + j a m + . . . + j r m) ma . Now when f3  a - 1. We deduce that mam'p = 0 (mod N) jma = (jam + . . . + jrm)ma (mod) and that Wjmrl == w(jam + ... + jrm)ma. Finally, we have shown that r w jk == n wjkamrt a=l 
66 Chapter III. Functional Transformations satisfies the property (2.4) with def ba) . == wjkama = w(jam + . . . + jrm)kama. Ja, . . . , In ka D Remark 1. The number of operations which we indicate in Proposition 1 of 92 is an upper bound of the number of operations that will be realistically made: in the case N = 4 we predict 16 additions and 16 multiplications, the matrix of the w jk is made up in this case only of elements equal to + 1 or to + i, and there is no multiplication whatever to make. D 4. The Fast Fourier Transform of Good-Winograd This concerns the case in which N = n 1 x n 2 x . .. x n r and where the integers n 1 , n 2 , . . . , n r are relatively prime. We recall first of all the Chinese remainder theorem (see Knuth [1J vol. 2,94.3.2). Theorem 1. Suppose that n 1 , n 2 , . . . , n r are relative prime integers and that N = n 1 x n 2 x , . . ., x n r . Then for all r-tuples of whole numbers u 1 , U2, . . . , U r satisfying o  U a < na, a = 1, 2, . . . r, there exists a unique u satisfying o  u < Nand umodn a = u a , \fa. D We recall that the statement U a = u mod na is equivalent by definition to U a = u(modulo na) and 0  U a < na 43. Because of this Theorem 1, there exist therefore numbers qa, 1  a < r such that { 1 if a = f3 qamodnp = b aP = 0 h . ot erWlse. We verify immediately that the relation between u and the U a is U = Ctl Uaqa) mod N. 43 We recall that U a == u(mod na) means that 3ka E Z such that U - U a = naka. With the condition o  U a  na, U a is therefore the remainder of the euclidean division of U by na. 
4. The Fast Fourier Transform of Good-Winograd 67 Application to the discrete Fourier transform. Theorem 1 sets up a bijection h: I (n 1) X . .. x I (n r )  I (N) such that hUb'" ,jr) = (jaqa)mOdN. We shall show that the matrix B of the discrete Fourier transform satisfies the decomposition property (2.1) with (4.1 ) f = g = h. In effect, let j = h(jl,...,jr) k = h (k 1, . . . , k r ), then, if I = jk, as a consequence of the properties of modular arithmetic, we have I = h(11' 1 2 , . . . , lr) with 1(X = (j(Xk(X) mod n(X. We deduce that r w jk = Wi = fl W1llqll (X=1 from the expression (4.1) of the bijection h. Now by definition of q(X, we know that q(X is a multiple of n 1 . . . n(X-l ncx+ 1 . . . n r ; as a consequence l(Xq(X - j(Xk(Xq(X (mod N) so w'llqll = wjllkll qll . We have finally demonstrated that the matrix B = ((b jk )) satisfies the property (2.1) with (4.2) b(X) = wjllkllqll Jllkll . That establishes, as we have in 2, that the number of operations is not greater than N (n 1 + n 2 + . . . + n r ). This number can be reduced as can be seen by remarking that the passage from z«(X) to Z«(X-1) in the formula (2.2) is in fact a n(X-point discrete Fourier transform. In effect let us fix k l' . . . , k(X - 1 , j (X + l' . . . , j r and put Z k = Z«(X) . . and A. = Z«(X-1) . . II - k 1 ,...,k ll ,)Il+1,...,Jr ill - k1,...,kll-1,JIl,...,Jr. We see from (4.2) that (4.3) nCl-l I k' A. = OJ IlJIlZ k ill Cl kCl=O 
68 Chapter III. Functional Transformations def where OJ == wq. Now (4.3) is clearly a formula for the discrete Fourier transform with na points. This results from OJnrx = w qrxnrx == 1. since qa is a multiple of n 1 . . . n a -l na+ 1 . . . n r . We can thus hope to reduce the number of operations below N(n 1 + n 2 + . . . + n r ) by the condition of being able to optimise each of the elementary na-point discrete Fourier transforms by another method. Letfa be the "cost" of a na-point Fourier transform; we can thus drop down to N ( fl + . . . + f,. ) operations44. n 1 n r For example for N == 5040 == 5 x 7 x 9 x 16, it is enough, in optImlslng them to write algorithms for Fourier transforms with 5, 7, 9 and 16 points. Although 16 == 2 x 2 x 2 x 2 we are not able to further decompose in the Good-Winograd formalism since the factors are no longer relatively prime. On the other hand to optimise the 16-point discrete Fourier transform which will be used many times (in fact 5 x 7 x 9 times) nothing prohibits us appealing to the algorithm of Cooley- Tukey which will yield a number of operations much less than 16 x 16. It is the same for the 9-point Fourier transform since 9 == 3 x 3. With n 1 == 5, n 2 == 7, n 3 == 9, n 4 == 16 we therefore have fl  5 x 5, f2  7 x 7, f3  6 x 9, f4  8 x 16, from which we deduce that the number of operations is less than or equal to (5 + 7 + 6 + 8)N == 26 N operations 45 while 2N log2 N > 24N. We see that if we succeed in optimising the Fourier transforms at 5 and at 7 points, we can perhaps succeed in achieving a number of operations comparable with or eventually better than that of the Good-Winograd method. One of means used for that is to try to minimize the number of multiplications (to the detriment of the number of additions). 44 In effect in the proof of Proposition 1, the passage from z(a) to z(a-l) accounts for (n 1 x ... x na-1na+l'.' n r ) x fa o perationsinplaceof(n 1 x ... xna-1na+l x ... n r ) x n;, from which follows the result by summation over (1,. 45 Here an operation = one multiplication + one addition of complex numbers. 
5. Reduction of the Number of Multiplications 69 5. Reduction of the Number of Multiplications Orientation. We have seen that the principle of the fast Fourier transformation of Good-Winograd is to decompose the problem into many discrete Fourier transforms at a small number of points. We study in this part the method of optimising these elementary Fourier transforms which are destined to be used a large number of times. We show in the first place that the calculation of the N-point discrete Fourier transform can lead to a problem of cyclic convolution of order n = N - 1. We show subsequently that the problem of cyclic convolution of order n is equivalent to evaluate the coefficients of the product of two polynomials modulo the polynomial un - 1. 1. Relation Between the Discrete Fourier Transform and the Problem of Cyclic Convolution The relation (1.8) can be written in the more detailed form 46 Go 1 1 1 1 Zo G 1 1 W w 2 N-1 Zl W G 2 1 w 2 w 4 W2(N - 1) Z2 . . . . . . .. . .. . . . . ... ... .. .. . .. ....... .. . . . .. . .. . . . . . G N - 1 1 N-1 W 2 (N-1) : W(N-1)(N-1) ZN-1 W where we see that we can already eliminate the multiplications by one by calcula ting C 1 W w 2 N-1 Zl w (5.1) C 2 w 2 w 4 W2(N - 1) Z2 ........ ............................... C N - 1 N-1 W2(N -1) W(N - l)(N - 1) W ZN-1 to begin with. By using the relation w N = 1, we see that the table of the exponents of w is none other than the multiplication table of the integers modulo N; (see Fig. 9 in the case N = 7). Now, it happens 47 that when N is prime, we can reduce, after a suitable permutation of the rows and of the columns, this table to a permutation matrix of the integers 1 to N - 1. 46 a j and Zk have been permuted with respect to the formula (1.8). 47 Remark due to C.M. Rader and which results from the properties of the multiplicative group 7L/ N7L which has N - 1 non-zero elements and is a cyclic group. 
70 Chapter III. Functional Transformations 1 2 3 4 5 6 (a) 1 2 3 4 5 6 2 4 6 1 3 5 =><: 3 6 2 5 1 4 3 6 2 5 1 4 2 4 6 1 3 5 4 1 5 2 6 3 3<:: 6 5 4 3 2 1 5 3 1 6 4 2 4 1 5 2 6 3 6 5 4 3 2 1 5 3 1 6 4 2 1 3 2 6 4 5 3 2 6 4 5 1 2 6 4 5 1 3 6 4 5 1 3 2 4 5 1 3 2 6 5 1 3 2 6 4 Fig. 9 In this fashion, in the case N == 7, we can write (5.1) in the following form: c 1 Xo Xl X 2 X3 X4 Xs 2 1 C3 Xl X 2 X3 X 4 Xs Xo 23 c 2 X 2 X3 X 4 Xs Xo Xl 2 2 C6 X 3 X 4 Xs Xo Xl X 2 26 C 4 X4 Xs Xo Xl X2 X3 24 Cs Xs Xo Xl X2 X3 X 4 2s which is a particular case of the following multiplication problem: b o Xo Xl X2 X n -1 Yo b 1 Xl X2 X3 . Xo Y1 (5.2) b 2 . X2 X3 X 4 . Xl Y2 .................. ......... ............. . b n - 1 X n -1 Xo Xl X n -2 Yn-1 called the problem of the "cyclic convolution of order n". Now it turns out the coefficients b i defined by the formula (5.2) are none other than the coefficients of the polynomial. t ( u) == (x 0 + Xl U + . . . + X n - 1 Un - 1 ) ( Y n - 1 U + Y n - 2 U 2 + . . . + Y1 U n - 1 + YO) mod (un - 1). 
5. Reduction of the Number of Multiplications 71 To evaluate them, we shall use some results of theory of complexity due to Winograd [1] concerning the minimal number of multiplications to evaluate the product of two polynomials. 2. Complexity of the Product of Two Polynomials In the first place we state the polynomial form of the Chinese remainder theorem m Theorem 1. Suppose that p is a polynomial of degree n such that p == TI Pa, a=l where Pa are polynomials which are relatively prime; then there exist polynomials qa such that qa mod Pp == b ap , lJ., f3 == 1, 2, . . . , m and such that for every pol ynomial we have the identity tmodp = Ctl taqa)mod P (5.3) def where t a == t mod Pa, lJ. == 1, 2, . . . , m. D Corollary 1. Methodfor evaluating the product of two polynomials rand s modulo p, where p == P1P2 . . . Pm' the polynomials Pa being relatively prime. Let ra == rmodpa Sa == s mod Pa for lJ. == 1, 2, . . . , m; we then calculate t a == rasa mod Pa and t == rs mod p will be given by the formula (5.3). It must be clearly seen that when we wish to calculate the product of two polynomials rand s, the polynomial p occuring in Theorem 1 is arbitrary, and we can choose its coefficients in a set G of numbers as simple as possible (for example, powers of 2) such that the multiplications by numbers of G are negligible (in machine time) before the multiplications between the coefficients of the polynomials rand s (which are complex numbers in the applications which we have in view). We denote by d r the degree of r and by d s the degree of s. We denote by C r (resp. c s ) the coefficient of the term of highest de gree of r (resp s). We recall the following result due to Winograd [1]. Theorem 2. The minimal number of multiplications to evaluate the product of the two polynomials rand s is m == d r + d s + 1. 
72 Chapter III. Functional Transformations This optimal number is obtained by two possible ways only. 1st method: We write the identity rs = (rs) mod p m with p(u) = Il (u - ga) and ga E G, and we use Corollary 1 with Pa(u) = a=l u - ga. 2nd method: We observe that rs = (rs) mod P + crcsp m-l where p(u) = Il (u - ga) is of degree m - 1, & ga E G. a=l There again, we use Corollary 1 to evaluate t = (rs) mod p. o Remark 1. We see that the optimal number of multiplications is obtained when the polynomials Pa are all of first degree: Pa(u) = u - ga. In this case we verify that ra = rmodpa = r(ga). The interpretation of the first method is particularly interesting; it reduces to determining the polynomial t as being the polynomial of degree (m - 1) interpolating the m values r(ga)s(ga) at the m points ga. o 3. Application to the Cyclic Convolution of Order 2 The question is to evaluate ( b O ) = ( XO Xl )( YO ) b 1 Xl Xo Yl in effecting a minimal number of multiplications. (5.4) Theorem 3. The cyclic convolution of order 2 described in (5.4) need only involve two multiplications of the following kind: Calculate t 1 = (xo + X 1 )(Yl + Yo) t 2 = (xo - x 1 )(Yo - Yl), b o = (t 1 + t 2 )/2 b 1 = (t 1 - t 2 )/2. Proof We remark that as in 95.1 then (b o + b 1 u) = (xo + X 1 U)(Y1 U + Yo)mod(u 2 - 1) 
9 5 . Reduction of the Number of Multiplications 73 and we apply the Corollary 1 with Pl = u - 1, P2 = u + 1. We remark that ql = !(1 + u) and q2 = !(1 - u). o In this elementary case, we can also verify the result directly, but the method used here is systematic. 4. Application to the Cyclic Convolution of Order 3 In the case of the 4-point Fourier transform we are led to the cyclic convolution of order 3, i.e. to calculate ( bo ) ( Xo Xl X2 ) ( Yo ) b l = Xl X2 Xo Yl b 2 X2 Xo Xl Y2 with the least possible multiplications. We have the result: (5.5) Theorem 4. We can evaluate b o , b l , b 2 with only four multiplications, by proceeding in the following way: we calculate mo = (xo + Xl + X2)(YO + Yl + Y2) m l = (xo - X2)(YO - Yl) m 2 = (Xl - X2)(Y2 - Yl) m 3 = (xo - Xl) (Yo - Y2), (5.6) (5.7) { CXo _ m l - m2 CX l - m l - m3 and then we have (5.8) i bo _ (mo + 2a o - ad13 b l - (mo + 2cx l - cx o )/3 b 2 = (mo - CX o - cx l )/3. o Proof As we have seen in 5.1 the calculation of b o , b l , b 2 reduces to evaluating the coefficients of the polynomial t = (xo + Xlu + X2U2)(Y2U + Y l U 2 + Yo)mod(u 3 - 1) which are precisely b o , b l and b 2 . We again apply Theorem 1 with P = u 3 - 1 Pl = U - 1, P2 = u 2 + u + 1 and we put 2 r = Xo + Xlu + X2 U S = Y2U + Y l U 2 + Yo 
74 Chapter III. Functional Transformations from which r 1 = rmodpl = r(1) = Xo + Xl + X 2 r 2 = rmodp2 = r - X 2 (U 2 + u + 1) = (xo - x 2 ) + (Xl - x 2 )u and in the same way Sl = Y2 + Y1 + Yo S2 = (Yo - Yl) + (Y2 - Y1)U from which t 1 = (xo + Xl + x 2 )(Yo + Yl + Y2) t 2 = ((xo - x 2 ) + (Xl - x 2 )u)((Yo - Y1) + (Y2 - Y1)u)mod(u 2 + u + 1). Suppose known the coefficients of t 2 which is a polynomial of degree:::::;; 1: t 2 = Ci o + Ci 1 u; it remains to determine t = l mod (u 3 - 1) where l = t 1 ql + t 2 q2 - a o + a 1 u + a 2 u 2 + a 3 u 3 and q 1 = (1 + u + u 2 )/3 q 2 = (2 - u - u 2 )/3 satisfying qa mod Pp = b ap . We shall have t = a o + a 1 u + a 2 u 2 + a 3 u 3 - a3(u 3 - 1) = (a o + a 3 ) + a 1 u + a 2 u 2 . On the other hand 1 = - (t 1 3 F inall y l = t 1 !(1 + u + u 2 ) + (et o + et 1 u)!(2 - u - u 2 ) 3 3 1 1 2 1 3 + 2Ci o ) + 3(t 1 + 2Ci 1 - Cio)U + 3(t 1 - Ci o - Ci 1 )U - 3 Ci1U . b o = a o + a 3 = (t 1 + 2Ci o - Ci 1 )/3 b 1 = a 1 = (t 1 + 2Ci 1 - Ci o )/3 b 2 = a 2 = (t 1 - Ci o - Ci 1 )/3 which establishes the formulae (5.8) with mo = t 1. It remains now to calculate the coefficients Ci o and Ci 1 of the polynomial t 2 with respect to those of the polynomial t 3 - (co + c 1 u)(d o + d 1 u) where Co = Xo - X 2 , d o = Yo - Yl, C 1 = Xl - X2, d 1 = Y2 - Yl. 
5. Reduction of the Number of Multiplications 75 For this we use the second method of Corollary 1 on remarking that t 3 = t 3 mod u (u + 1) + c 1 d 1 U (u + 1 ). To calculate [3 = t 3 mod u(u + 1), we proceed as in 94.3 on remarking that [3 = t 3 (0)(u + 1) - t 3 ( - 1)u = t 3 (0) + (t 3 (0) - t 3 ( - 1)) u t 3 (O) = Co do and t 3 ( - 1) = (co - c l ) (do - d l ). We thus have [3 = cod o + (co do - (co - c l )(d o - d l ))u and t 3 = [3 + c l d l u(u + 1) = cod o + (cod o + c l d l - (co - c l )(d o - d l ))u + c l d l u. Finally t 2 = t 3 mod(u 2 + u + 1) = t 3 - (c l d l )(U 2 + u + 1) = (cod o - C l d l ) + (cod o - (co - c l ) (do - d l ))u which results in a o = Co do - c l d l = (xo - x 2 )(Yo - Yl) - (Xl - X 2 )(Y2 - Yl) a l = Co do - (co - c l )(d o - d l ) = (xo - x 2 )(Yo - Yl) - (xo - x l )(Yo - Y2) which establishes the formulae (5.6) and (5.7). 5. Application to the Cyclic Convolution of Order 6 We have seen in 95.1 that the 7-point discrete Fourier transform lead to a problem on the cyclic convolution of order 6 involving the matrix which we denote by X and state explicitly in the formula ( 5.2). We shall show that this problem leads to effecting combinations of cyclic convolutions of orders 2 and 3. From Theorem 1 of 94, we can decompose each index i satisfying 0  i < 6 into i = {i l , i 2 } i mod 2 and i 2 = i mod 3 and where i = (3i l + 4i 2 ) mod 6. Now we remark that the elements of the matrix are such that where i l Xij = x(i+ j) mod 6' o  i, j < 6. We then put Xili2hj2 = Xij' for i = { i l' i 2 } and j = {j 1 , j 2} . 
76 Chapter III. Functional Transformations On putting k = (i + j) mod 6 we have further k = (3k l + 4k 2 ) mod 6 where k l = (i l + jl) mod 2, and k 2 = (i 2 + j2) mod 3; thus X.. = lJ Xk = X 3 kl + 4k2 X 3 (il + h) mod 2 +4(i2 + h) mod 3 . This shows that we can, for example, decompose the matrix X into four 3 x 3 sub-matrices which will be indexed by i l andjl and which will be each of them matrices of cyclic convolution but of order 3 only. To see it more easily, we can permute the indices of(bi)oi<6 and (Yj)Oj<6 in the formula (5.2) in such a way that we find the indices i l and i 2 in ascending order i = (((3i 1 + 4i 2 ), i 2 = 0, 1, 2), i 1 = 0, 1). The formula (5.2) then becomes b o Xo X 4 X 2 X 3 Xl Xs Yo b 4 X 4 X 2 Xo Xl Xs X 3 Y4 b 2 X 2 Xo X 4 Xs X 3 Xl Y2 b 3 X 3 Xl Xs Xo X 4 X 2 Y3 b l Xl Xs X 3 X 4 X 2 Xo Yl b s Xs X3 Xl X 2 Xo X 4 Ys in which we see clearly the appearance of the four blocks of 3 x 3 cyclic convolution, and which we can write in the form ( 5.9) (:) (: :) (:) which shows that the evaluation of the b i leads to a cyclic convolution of order 2. We can thus make use of the method indicated in 5.3 to solve (5.9) and which consists in calculating t l = (X o + Xl)Cyo + Yl) t 2 = (Xo - Xl)(Yo - Yl)' then b o = !(t l + ( 2 ), b l = !(t l - ( 2 ). In addition to calculate t l and t 2 which are in effect two cyclic convolutions of order 3, we shall use the method detailed in  5.4. We thus arrive at 8 multipli- cations instead of the 36 which we would expect a priori. Conclusion. We have shown by examples that it is in effect possible to reduce noticably the number of multiplications to derive the Fourier transforms 
6. Fast Fourier Transform in Two Dimensions 77 involving a (small) number of points, by using some sharp results of the theory of complexity. These elementary Fourier transforms are subsequently used in the fast Fourier transform of Good-Winograd described in 94, which we thus come to render faster than that of Cooley- Tukey. 6. Fast Fourier Transform in Two Dimensions We denote by Q the square of side 2n: Q = JO, 2n[ x ]0, 2n[ and by f a function defined on Q and 2n-periodic in each of its arguments Its development in a Fourier series is given by f(x, y) = L CjkeiUx+ky). j,kE 7L We call again Sn the finite-dimensional space generated by the functions ei(jx + k Y ), - n  j, k  n which is now of dimension (2n + 1)2. Putting N = 2n + 1, and  = 21n 0 1 l -,   2n; N we look for gn E Sn interpolating f at the points {b m} i.e. such that gn(l' m) = f(l' m)' o  1, m  2nt If we put Zlm = f(b m) ( )  aj " k e i(jx + ky) gn X, Y = i...J - n  j, k  n we see that the (2n + 1)2 quantities a jk satisfy  a. e21t (ji + km) = Z i...J jk 1m' jk from which   Z e- 21t (j'1 + k'm) =   a. e 2 ;1t [0 - j')1 + (k - k')mJ N 2 i...J 1m N2 i...J jk I, m j,k,l,m . = L a jk (  L e 2 (j - n 1 ) (  L e 2' (k - k')m ) jk N 1 N m = Lajkbjj' b kk , = aj'k' jk where we have made use of the relation (1.4). 
78 Chapter III. Functional Transformations Finally, the problem of calculating the discrete Fourier transform in two dimensions leads to evaluating (6.1 ) a. =  "z W + (jl + km) Jk N 2 i...J lm , l, m - n ::( j, k ::( n where 2in W = e N We see that this problem reduces to the evaluation of 2N discrete Fourier transforms in one dimension since we can write (6.2) 1" °1 a jk = N 2 Lz w J b lk where we have put (6.3) b -" km lk - i...J W Z lm . m In effect for k fixed, (6.2) is a discrete Fourier transform in one dimension (and N points). It is the same for (6.3) with 1 fixed. We thus obtain an algorithm involving O(N 2 1og 2 N) operations. 97. Some Applications of the Fast Fourier Transform The principal interest in the fast Fourier transform is that it allows the economical calculation of the coefficients in the development in Fourier series of a function, known point by point, or more exactly of its interpolate. The most of the applications of Fourier series are therefore applications of the fast Fourier transform. 1. Solution of Boundary Value Problems Let us consider the Poisson problem in one dimension with boundary values periodic on the interval ] - n, + n[, i.e. - u" = f on ] - n, n[ u( - n) = u(n), u'( - n) = u'(n), f: u(x)dx = 0 
7. Some Applications of the Fast Fourier Transform 79 in which f is a periodic function which is sufficiently regular (for example, let f +1t f E H( -n, +n)48 with s  0) satisfying -1t f(x)dx = 0). To resolve this problem we can proceed in the following way: We determine, by means of the fast Fourier transform, the coefficients a j of the interpolate gn off in Sn as defined in  1; let +n gn(x) = L a j eijx j= -n with a o 0, and put +n U = " 11 . e ijx n  J ' j= -n with a. A J Uj = -=2' ] j =1= 0, and 110 = O. I t is verified immediately that - u = gn on J - n, n[ u n ( - n) = un(n), u( - n) == u(n). We then have the convergence of Un to U as n  00. More precisely, with the notation of Remark 3, 1, iffEH( - n, + n), s  0, we have Ilu - u n l1 2 ::( Ilf - gn 110 ::( cn-sllflls. For a regular solution, we are thus able to obtain accuracy to about ten significant figures with roughly ten terms, so the method is much more precise than a finite difference method. By using considerations of parity, we can also, in the same manner, resolve the Neumann or the Dirichlet problem with boundary conditions, homogeneous on the interval JO, n[. This method is easily generalized to the case of the Laplacian on a square in two dimensions or in a cube in three dimensions. Case of non-constant coefficients. We consider the problem - (au')' = f on J - n, n[ u(- n) = u(n), u'(-n) = u'(n) f:: u(x) dx = 0 where f satisfies the previous conditions and a = a(x) is a variable coefficient which is regular and periodic. For example a(x) could be piecewise 2 and (7.1 ) 48 See Remark 3, g 1. 
80 Chapter III. Functional Transformations continuous on [ - n, + nJ and strictly positive on [ - n, + nJ. The simplest method then consists of looking for Un E Sn with (7.2) n un(x) = L Uj e ijx j = -n and requiring (collocation method) that the equation (7.1) be satisfied at the N = 2n + 1 points X k = 2kn/N, - n ::( k ::( + n. We are thus led to solving a problem of the type Au = f, where u (resp.f) denotes the column vector with the values of un(x) (resp.f(x)) at the points X k , a problem in which the N x N matrix is neither djagonal nor sparse. We can solve this problem by the aid of an iterative method, remarking that, if v E [Rn is given, the calculation of the matrix product Av does not necessitate O(N 2 ) operations but again O(N log2 N) if the fast Fourier transform is used many times 49 . The effectiveness of the method, in this case, depends strongly on the regularity of the coefficient a(x) and of the solution of the problem. We can treat in the same manner the non-homogeneous Dirichlet and Neumann problems. The method applies equally with the same reservations to first order equations with periodic boundary conditions. Case of non-periodic (or non-homogeneous) boundary conditions. We consider, for example, the solution of the non-homogeneous Dirichlet problem (7.3) -(aux)x = f on J - 1, + 1[ (7.4) u( - 1) = a, u( + 1) = p. where f E reo ([ - 1, + 1J). We then look for (collocation method) the approximate solution Un as being the polynomial of degree n which satisfies equation (7.3) at the (n - 1) points In Xj = cos-, n j = 1,. . . , n - 1 and which satisfies the boundary conditions (7.4)50. Taking N - n + 1, we are therefore led to a linear system of the type Au =f of which the N x N matrix A is neither diagonal nor sparse. 49 See Gottlieb-Orszag [1]. 50 As a matter of fact, we could equally in this case apply the method of Fourier series leading to homogeneous boundary conditions through the change of the unknown function: w(x) = u(x) - (i(P - a)x + i(P + a)). 
g7. Some Applications of the Fast Fourier Transform 81 To solve it, we would make use of an iterative method at each iteration of which it would be necessary to form the product of the matrix A by a vector v. We can form this product in O(N log2 N) operations by noticing that, if v == (v j) and p is the polynomial of degree n such that p(x j ) == v j , 0 ::( j ::( N, we can decompose pinto Tchebycheff polynomials n p(x) == L a j 1] (x) j=O (where 1] (x) == cos(jO) for x = cos 0), the coefficients a j being calculable in O(N log2 N) operations with the help of the fast Fourier transform. We subsequently determine p' (x j) by using the recurrence formulae satisfied by the Tchebycheff polynomials. We then carry out the same operation for the polynomial q(x) = a(x)p' (x), and hence we have determined (Av)j == q'(x j ). This method gives excellent results for regular solutions (see Gottlieb--Orszag [1J). 2. Regularisation and Smoothing of Functions Let us consider the problem { - BU + U e == U on ] - n, n[ u e ( - n) == ue(n), u( - n) == u(n), where U is a function given on [ - n, + nJ; (if U is not periodic nor dis- continuous, the problem is in effect a singular perturbation problem). We know thatifuEL 2 ( - n, n), U e -4 U in L2( - n, n) when B -4 0, hence U e is a function which regularises U (since U e is more regular than U for all B > 0). Now this problem is very easy to resolve by the method presented in 91. If we replace U by its interpolate (7.5) +n U == " u . e ijx n  J ' j = - n we will have (7.6) u == J A U j 1 + Bj2' +n U == L uj e ijx . j = - n 
82 Chapter III. Functional Transformations And the solution of this problem involves only O(N log2 N) operations. We can obviously employ smoothing functions other than those proposed in (7.6), and which consist essentially in cutting out the "high frequencies". Regularisation by convolution. For regularisation, we can also use the convol- ution by a regularising kernel p(X);51 for example p(x) = e- nx2 . We then put u e = Pe * u where Pe(x) = (l/B) p(X/B) (we know that U e  u when B  0). If u j denote the coefficients of the development of u in a Fourier series, and u ej those of U e we have Lemma 1. If u is a periodic distribution, then Uej = Pe(j)u j where Pe denotes the Fourier transform of Pe. Proof We have u,(y) = f p,(z)u(y - z) dz. Thus: Utj = 2- f 2 t' e-ijYu,(y)dy = 2- r f 27t e-ijy p,(z)u(y-z) dzdy 2n 0 2n J  0 (f p,(z)e- ijz dz )u j P e ( j) U j . o As Pe(w) jJ(BW), the rule for transforming Fourier coefficients is thus, U ej = p(Bj)U j . For example, when p(x) = e- nx2 , we have p(w) formation rule can be written e - co 2 /4n and the trans- (7.7) A 10 2 P A U ej = e- 4rr U j . Obviously, in practice, we replace u by its interpolate u and the coefficients u j are thus determined with the help of the fast Fourier transform. 51 We recall that the convolution w = u * v of two functions u and v is defined by w(x) = t u(x - y) v(y) dy, cJI functions defined on [0, 2n[ being extended by periodicity over the whole of [R. 
g7. Some Applications of the Fast Fourier Transform 83 Remark 1. We can generalize the method to the two-dimensional case (on a square). D 3. Practical Calculation of the Fourier Transform of a Signal Let g(t) be a signal (non-periodic gEL l(lR) n O(lR)) whose Fourier transform g(w) = f g(t) e -iwt dt we are seeking to eval ua te. If we measure the values (7.8) Zk - g(t k ), - N  k  N of the signal at the (2N + 1) equidistant points t k - k L1 t, then a practical problem that arises is the determination of the values of the Fourier transform at a certain number of points. This problem is obviously ill- posed, since two signals gland g 2 taking the same values at the instants t k can have very different Fourier transforms (for example if gl oscillates between the t k and g2 does not oscillate). What we shall show is that there exists a signal g(t) satisfying (7.8) whose Fourier transform takes the values a j at equidistant points where the a j are the discrete Fourier transforms of the Z k and are given by the formula: a j = L1 t L w - jk Z k k where w = e2nilN. Lemma 2. Let g(t) = L1t L zkf(t - t k ) k where sin at d f(t) = , an nt n a =-' L1 t' then, on the one hand g(t k ) = Zk, - N  k  N, and on the other hand g(W j ) = a j , N  j  N, where 2n W . = J .L1 w a nd L1 w = - J' N L1t" 
84 Chapter III. Functional Transformations Proof We first of all verify that sin a(t l - t k ) { 0 1 if k = 1, L1t f(t z - t k ) = = b lk = a(t l - t k ) otherwise. On the other hand, we know that J(w) = { 1 if Iwl  a, o otherwIse and if frJ.(t) - f(t + a) l(w) = eiwrJ.J(w). As a consequence, for Iwl  a, we have g(w) = Jr(  Zk e -iwt k ) from which the result follows since wjt k = jk  . D Remark 2. If we take into account an infinity of samples (L1 t fixed, N  00), then the condition g(w) = 0 for Iwl >  L1t would suffice to determine the signal g uniquely (see Papoulis [lJ, p. 51). D Intuitively we could clearly derive this result from Lemma 2 since when N grows, L1w diminishes, and g(w) is known at an increasing number of points in the interval [ - n/ L1 t, n/ L1 t J. 4. Determination of the Spectrum of Certain Finite Difference Operators and Fast Solvers for the Laplacian Just as Fourier series enable us to find the spectrum of the Laplacian, so also does the discrete Fourier transform enable us to find the spectrum of the 3-point scheme or of the 5-point scheme for the Laplacian in. dimension 1 or 2. 1 st 3-point scheme for the Laplacian. Let us consider the one-dimensional Poisson problem with periodic boundary conditions { - u" = f on J - n, n [ u( - n) = u(n), u'( - n) = u'(n). (7.9) we know that this problem has a solution defined, to within a constant, when fEL2( - n, n) satisfies a condition which we shall determine. 
g7. Some Applications of the Fast Fourier Transform 85 Letf = f1 + f2 wheref1 is odd andf2 is even; decomposing U in the same way into an odd part and an even part, we see that U 1 is the solution of the Dirichlet problem (7.10) { - u = f1 on ] 0, n [ u 1 (0) = u 1 (n) = 0 and that U 2 js the solution of the Neumann problem (7.11 ) { - u = f2 on ] 0, n [ u(O) = u(n) = 0 which admits a solution, to within a constant, when I " 12 dx = 0, which is equivalent to I " 1 dx = O. o -n Discretisation. When we discretise the problem (7.9) by the method of finite differences 52 we end up with the system of equations (7.12) 1 - h 2 (Uj+ 1 - 2u j + U j - 1 ) = fj, - n ::( j ::( n, where I h 1 X j + "2 jj = h xj_/(x)dx, with In x. = - for - n  j ::( n, J n ' h = n , n and u j is an approximation of the solution of the problem (7.9) at the point x j , - n ::( j ::( n. We put Uj = uj' for j = j' + 2n, . , - n ::( J ::( n. We can again split the approximate solution (u j ) into an odd part (v j ) and an even part (w j) by putting 1 v j = 2 (u j - U - j), 1 W j = 2 (u j + U - j ); 52 See Chap. XIII. 
86 Chapter III. Functional Transformations we see that (v j ) satisfies (7.13) 1 - h 2 (Vj+ 1 - 2v j + Vj 1) = 1)1)53, 1:( j :( n-l, V o = v = 0 n , which is the 3-point scheme applied to the Dirichlet problem; on the other hand W j satisfies - :2 (W j + 1 - 2W j + wj-d = 1)2)53, 0:( j :( n, W-i = Wi and W n +i = W n - i , which is the, 3-point scheme applied to the Neumann problem. The spectrum associated with the 3-point scheme (7.12) will be determined by the eigenvalue problem (7.14) 1 - h 2 (u j + 1 - 2u j + U j - 1) = Au j 0::( j ::( 2n. We put N = 2n, W = e iTt / n , and (7.15) 1 N-i A _ " - jk Uk - - i....J W Uj N j=O which is the discrete Fourier transform of u j . Observing that 1 N-i " - jk N io W u j + i k A W Uk' 1 N-i - L W- jk Uj _ i = W-kU k , N j=O we see that (7.14) leads to (7.16) 1 k k - - (w - 2 + W - ) Uk = AUk. h 2 Putting (7.1 7) 1 k _ k 2 ( kn ) A = - - ( w - 2 + W ) = - 1 - cos- k h 2 h 2 n ' we must have A = Aka for ko given, or else Uk = 0, Vk. The eigenvalues of the 3- point scheme are therefore the Ak defined in (7.17). 53 Here we denote by f(1) and f(2), respectively, the decomposition off into odd and even parts. 
7. Some Applications of the Fast Fourier Transform 87 To determine the corresponding proper subspace, we have to observe that A = Aka => Uk = 0 for Ikl -=I- ko and hence that the subspace is of dimension 2 for ko -=I- 0, n. Suppose that (x, 13 E C and that we look for the eigenvector such that A U ka = (X, U ka = 13 and U = 0 for Ikl -=I- ko; we have 1 n-1 _ '" jk A U j - - i..J W Uk' N k=-n from which we deduce 1 . k O k U j = - ( (XW J a + f3w - J a). N Choosing (X = - 13 we obtain the odd eigenvectors which are the eigenvectors of the Dirichlet problem (7.13), namely (k ) 1. ( jk o ) v . a = - SIn - n , J n n 1  ko  n - 1. Choosing (X == 13, we obtain the even eigenvectors which are therefore those of the Neumann problem; there are likewise (n - 1) to which must be added the two eigenvectors associated with the simple eigenvalues Ao and An' so that, finally, we have W(k a ) ==  cos ( jko n ) , J n n o  ko  n. Remark 3. Let A be the matrix of the 3-point scheme defined in (7.12) and F the matrix defined by jjk = w-jklJN (i.e. the matrix of the discrete Fourier transform with the factor 11 IN added). The matrix F satisfies FF* == I so it is a uni tary matrix. The significance of the preceding result is that F diagonalises A, i.e. that the matrix D = F AF * is a diagonal matrix (with a zero element on the diagonal, it would demand special attention). We see thus that to solve the linear system Au == f it suffices to calculate] == F f then to solve D11 == j, and finally to calculate U == F - 1 11 to obtain an extremely fast method of solution (involving O(N log2 N) operations). But if we wish simply to solve the Dirichlet problem (7.13) a simple Gaussian elimination would be faster. o Remark 4. Numerous other properties of the matrix F are demonstrated in Auslander- Tolimieri [1J. 
88 Chapter Ill. Functional Transformations In particular, we have F 4 = I, which shows that the eigenvalues of F are equal to + 1 and + i. The dimensions of the corresponding proper subspaces are: m+ 1, m, m, m - 1, if N == 4m , m+ 1, m, m, m, if N=4m+ 1, m+ 1, m + 1, m, m, if N = 4 m + 2, m+ 1, m + 1, m + 1, m, if N = 4 m + 3. D 2nd 5-point scheme. An analysis of the same type can be developed when we wish to solve the Poisson problem with periodic boundary conditions on the square Q == ] - n, n [ x ] - n, n [: - Au = f on Q u( - n, y) == u(n, y), au au ax ( -n, y) == ax (n, y), au au oy (x, - n) = oy (x, n), -n  y  n (7.18) u(x, - n) = u(x, n), - n  x  n. In udxdy = 0 withf given,fEL 2 (Q) and LfdXdY = O. Splitting f into f1 + f2 + f3 + f4 with 1 fl (x, y) = 4 [f(x, y) + f(x, - y) + f( - x, y) + f( - x, - y)] 1 f2(X, y) == 4 [f(x, y) - f(x, - y) + f( - x, y) - f( - x, - y)] 1 f3(X, y) == 4 [f(x, y) + f(x, - y) - f( - x, y) - f( - x, - y)] 1 f4(X, y) = 4 [f(x, y) - f(x, - y) - f( - x, y) + f( - x, - y)] i.e. with f (i = 1, 2, 3, 4) such that f1 is even in x and in y f2 is even in x and odd in y f3 is odd in x and even in y f4 is odd in x and in y 
7. Some Applications of the Fast Fourier Transform 89 and decomposing U in the same way we see that - AU a = fa' (X = 1, 2, 3, 4 on Q. The boundary conditions for each of the U a being dictated by considerations of even-ness and odd-ness, on Qo = ]0, n[ x ]0, n[ we see that U I is a solution of a Neumann problem and U 4 of a Dirichlet problem, while U2 and U 3 are solutions of a mixed problem. The discretisation by finite differences of the problem (7.18) leads to the system of equations (7.19) 1 - h 2 (j+1,k + Uj,k+l - 4u jk + Uj-l,k + Uj,k-l) = jj (where h = n/n and u jk is an approximation to the .value of U at the point {jh, kh}) whose matrix is called the matrix of the 5-point scheme. We determine the spectrum of the 5-point scheme by using, this time, the two- dimensional discrete Fourier transform described in 96: take A and U jk to satisfy 1 - h 2 (Uj+l,k + Uj,k+l - 4u jk + Uj-l,k + uj,k-d = AUjk' (7.20) We put w = e in1n , N = 2n and u =  '" U. w-(jl+km) 1m N i...J )k , jk (7.21) with the result that  '" U. W-(jl + +km) = W:tl U N i...J ):t I, k 1m jk  '" U. W -(jl +km) = W:tm U N i...J ),k:tl 1m' jk and the relation (7.20) becomes 1 I I - h 2 (w + w m - 4 + w- + w-m)U 'm = AUlm' Let us put Aim = - :2 (Wi + w m - 4 + w- I + w- m ) = 2 (2 cas 1: - cas n ) which is therefore the eigenvalue of multiplicity 4 for l, m -=I- 0, n. The associated proper sub-space is given by A U 1m = (Xl' A U 1 , -m = (X2, A . U-1,m = (X3, A U-I, -m = (X4 and Ul', m' = 0 for {l', m'} -=I- { + l, + m}. 
90 Chapter III. Functional Transformations Choosing a 1 == a 2 == a 3 == a 4 we find an eigenvector even in x and in y and thus an eigenvector of the Neumann problem. Choosing a 1 == - a 2 == - a 3 == a 4 we find an eigenvector odd in x and in y and thus an eigenvector of the Dirichlet problem. With a 1 == -a 2 == -a 3 == -a 4 and a 1 == a 2 == -a 3 == -a 4 , we should find the two eigenvectors of the two mixed problems. To pass from u to U we use the inverse transformation U. == "wjl+kmu Jk N i...J 1m. 1m For the Dirichlet problem, we therefore find for the eigenvector associated with the eigenvalue Aim: U jk == (wjl+km - w- jl+km - W j1 - km + w- jl-km) N == (wjl - w-j1)(W km _ w- km ) N 4 a 1. j l. km == - - sIn n - sIn n - , N n n 1  l, m  n - 1. For the Neumann problem, we find U jk == (wjl+km + w- jl+km + W j1 - km + w- jl-km) N 4a 1 jl km == -cosn-cosn-, 0  l, m  n. N n n Application to the Solution of Problem (7.18). In an analogous way to the Remark 3 we remark that the unitary matrix F of the two-dimensional discrete Fourier transform (7.21) diagonalises the matrix A of the 5-point scheme and the matrix D == F AF * is a diagonal matrix. As previously, to solve the problem Au == f we calculate J == Ff, then solve Du == J and finally obtain F -1 A U == u. The number of operations is O(N 2 10g 2 N) to be compared with the O(N 4 ) operations of Gauss' method. The method of the fast Fourier transformation is thus an extremely efficient way of solving Laplace's equation on a square, appertaining to the family of solutions of elliptic problems called ''fast solvers". It is so fast that, when we solve a Dirichlet or a Neumann problem on an arbitrary domain Q it may be useful to immerse this domain in a square and use 
Review of Chapter IIIB 91 a uniform mesh in the interior and in the exterior of Q. The matrix of the discrete problem will only be a perturbation of the matrix of the 5-point scheme which we shall use as an auxiliary operator (see Proskurowski-Widlund [1] and O'Leary-Widlund [1]). All of this can be easily generalized to the three-dimensional case for which we can define a fast solver involving O(N log2 N) operations when the domain is a cube. Review of Chapter IIIB The discrete Fourier transform is the linear mapping which associates with the N values of a trigonometrical polynomial at equidistant points, the N coefficients of its development in Fourier series. The fast Fourier transform is a sophisticated numerical method which allows us to effect the passage of the values to the coefficients and the inverse passage in o (N log2 N) operations. We have seen that two distinct routes allow us to arrive at this remarkable result: that of Cooley and Tukey which applies principally in the case that N is a power of 2, 3 or 4, and that of Good and Winograd which allows us to envisage the case in which N is a product of relatively prime factors. Such a reduction in the number of operations -it would need O(N 2 ) operations for an arbitrary linear mapping-is accompanied by a great numer of practical applications which are in effect numerical applications of the method of Fourier serIes. We have reviewed a good number of topics: solution of particular boundary value problems, smoothing of functions, Fourier transform, and finally "fast solvers" for certain schemes of finite differences. 
Chapter IV. Sobolev Spaces Introduction Dirichlet's principle (established, partially, in Chap. III) has already shown the interest which is attached to the introduction of the spaces of functions v E L 2 (Q) all of whose partial derivatives of the first order-taken in the sense of distributions - are in L 2 (Q); this is the Sobolev space H 1 (Q) which we can endow with the structure of a Hilbert space. This space, and all the spaces constructed by a similar procedure, play an absolutely fundamental role in the analysis of partial differential equations. Our object is to introduce here, always with the minimum of pre-requisites, the Sobolev spaces which are indispensable; (other spaces will be useful notably in linear and non-linear problems and we shall introduce them at the appropriate time). From this point of view the introduction of the spaces HS([Rn), (s non- integral), as early as 92, may come as a surprise; in fact the space of traces of functions of H1(Q) on the boundary r of Q is Ht(r) (as we shall see in 9 4 ) and Ht(r) is defined by a collection of charts from Ht([Rn - 1); it is then both natural and clearer (and does not introduce any additional complication) to introduce HS([Rn) with s arbitrary. This is thus the object of 92. In 3, we give one of the Sobolev embedding theorems (the simplest, proved by the Fourier transform) and, in 94, we give the theorem of traces. That enables us to define, in a clear way, a number of closed sub-spaces of H1(Q), particularly H(Q), its dual H-1(Q) and, more generally H-m(Q). In 6, we study the compactness properties of the injection between different Sobolev spaces for bounded domains Q c [Rn; these properties are fundamental in the study of stationary problems in Chap. VII and in the study of spectra of operators in Chap. VIII. Finally in 7, we shall establish certain a priori inequalities in Sobolev spaces, which are essential for the variational methods of Chaps. VII and VIII. 1. Spaces Hi (Q), Hm(Q) In what follows we shall adopt the following notation: Q will denote an open set in [Rn , a == (a l' . . . , an), a j E N, j == 1, 2, . . . , n 
1. Spaces H 1 (Q), Hm(Q) 93 a multi-index and we shall use the notation Da. = 01 . . . on, where OJ = ,,0 . uXj If T is a distribution DC1.T will denote the derivative of order I a I == a 1 + a 2 + . . . + an of T in the sense of distributions (see App. "Distributions"). Definition 1. We denote by H 1 (Q) the space of distributions identifiable with functions of L 2(Q) whose partial derivatives are identifiable with functions of L 2(Q).1 We shall adopt the following notation H 1 (Q) = {u E L 2 (Q); ::i E L 2 (Q) More generally we introduce Definition 2. For mEN, we denote the following spaces I = 1, . . . , n }. Hm(Q) == {u E '(Q); DC1. u E L 2 (Q) lal  m}. Then for m == 0, we have HO(Q) == L 2 (Q) and for m == 1, we recover the space introduced in Definition 1. We endow Hm(Q) with the scalar product (u, v)m == L f DC1.u(x)DC1.v(x) dx 1C1.Im Q (1.1) and the norm associated with this scalar product (1.2) Ilullm,Q == ( L f I DC1.U(X)12dx ) 1/2 == ( L I IDC1. U lli 2 (Q» 1/2. 1C1.1 m Q 1C1.1 m Remark 1. In the theory of partial differential equations, we make use also of the spaces Wm,P(Q), mEN, p  1, obtained by replacing L 2 (Q) by LP(Q) in the above definitions. We pass now to the study of the spaces Hm(Q). Proposition 1. (i) If m  m', Hm(Q) is contained, with continuous injection, in Hm' (Q). (ii) Hm(Q) endowed with the scalar product (1.1) is a Hilbert space. Proof Part (i) of Proposition 1 is obvious as it is easy to see that if m  m', u E Hm(Q) implies that u E H m ' (Q) and that II u 11m' Q  II u 11m Q. , , 1 The functions (or distributions) considered can take either real or complex values. Here we suppose (implicitly by the notation) that they are complex valued. 
94 Chapter IV. Sobolev Spaces To prove part (ii), it is sufficient to show that Hm(Q) is complete for the norm II U Ilm,o. Let (Un)nE N be a Cauchy sequence in Hm(Q); then for all a, lal  m, the sequence (Daun)n EN is a Cauchy sequence in L 2(Q). This space being complete, we have: Un  U in L 2 (Q) and Dau n  U a in L 2 (Q), lal  m As the injection of L 2 (Q) into !?2'(Q) is continuous, this implies that un  U in !?2'(Q). Now the operator D a is a continuous mapping of !!fi' into itself and so Dau n  Da u in !!fi'(Q); from vThich U a == Da u for lal  m and consequently U E Hm(Q) and Un  U in Hm(Q). We thus have the following inclusions, for all mEN: !?2(Q) c Hm(Q) c H m - 1 (Q) c . . . c H 1 (Q) c HO(Q) == L 2 (Q) c !?2'(Q). We shall now study the particular case Q = lR n ; this case is interesting as use of the Fourier transformation in L 2 (lR n ) will enable us to have another description of the space H m . If U denotes a function or a tempered distribution on lR n , we shall, throughout this chapter, denote by 11 the Fourier transform of u. If U E g(lR n ), 11 E g(lR n ) is defined by (see Appendix "Distributions"). u() = r u(x)e -ix dx. Jn If U E g'(lRn), 11 will be defined by duality. Proposition 2. Let m be a positive integer or zero, then: (i) Hm(lR n ) c g' (lR n ), (the space of tempered distributions). (ii) Hm(lR n ) coincides with the space of tempered distributions U such that (1.3) (1 + 112)m/211EL2(lRn) where 11 is the Fourier transform of u. (iii) The norm 1\ u II m, n is equivalent to (1.4) lulm,[Rn = (tn (1 + 112tlu(OI2d Y/2. Proof Part (i) of the Proposition is obvious since we have the inclusions Hm (lR n ) c L 2 (lR n ) c g' (lR n) . Part (ii) of the Proposition: We show first of all the following inequalities: For all a, 0 < I a I  m, there exists a real constant C, C > 0, such that (1.5) .n ljI2a:j  ( 1 + .f ljI2 ) m  C ( 1 + L ,n ljI2a:j ) J = 1 J = 1 ° < lal  m J = 1 v E lR n . 
1. Spaces H 1 (Q), Hm(Q) 95 The first inequality results from the fact that: a l + a2 + . . . + am  m and the second from the binomial formula. Let us now demonstrate the assertion (ii). (a) If u E Hm(lR n ), Dau E L 2 (lR n ) for I a I  m, and so the Fourier transform of Dau, which is equal to (i)au belongs to L 2 (lR n ). Then by the right-hand inequality of (1.5) we have: L r Wu(Wd = r ( 1 + L Ii ljIZaj ) lu()12d lal  m J [Rn J [Rn 0 < lal  m j = 1  C- 1 Ln (1 + 1lz)mlu(Wd; so (1 + 112}!-mu() E L 2 (lR n ) and in addition the above inequality implies lul;;'.n  C 1 L r lau(W d  Czll u 11;;'.n lal  m J [Rn which proves a part of the assertion (iii). (b) If (1 + 112}!-mu() E L 2 (lR n ), then U E L 2 (lR n ) since (1 + 112)tm  1, and the first part of (1.5) implies that LJjlljIZajlu(Wd  C Ln (1 + 1lz)mlu(Wd < + 00 that is to say au E L 2 (lR n ), so i9u E L 2 (lR n ), which implies that Dau E L 2 (lR n ) for lal  m or again that u E Hm(lR n ). Finally Ilull;;'.n  C' L r .Ii ljIZajlu(Wd  C"lul;;'.n lal  m J [Rn ] = 1 which finishes the second part of (iii) and the proof of Proposition 2. o Example 1 1. The space g(lR n ) of oo functions with rapid decay is contained in Hm(lR n ) for all mEN. 2. The inclusion g(lR n ) c Hm(lR n ) is strict. For this it is enough to consider the following example: n = 1, u(x) = 1/(1 + x 2 ). We can easily verify that u  Hm(lR) for all m, but u  g(lR). o The result (ii) of Proposition 2 suggests a way of defining new spaces, containing as particular cases the spaces Hm(lR n ). Indeed, if Definition 2 when Q = lR n , requires m to be an integer, (1.4), on the contrary, can be extended to non- integral values of m. This will be the object of the next section. 
96 Chapter IV. Sobolev Spaces 2. The Space HS(IRft) 1. Definition and First Properties Definition 1. For s E [R0, HS([Rn) is the space of tempered distributions u, such that (1 + 1  1 2 )s/2 . U E L 2 ([Rn),  E [Rn (with 11 the Fourier transform of u). We furnish HS([Rn) with the scalar product (2.1) (u, v)s = r (1 + I  12)'u(). v() d Jn and the associated norm is (2.2) lIulls = (Ln (1 + 112)SIU(WdYI2. Remark 1. For s == mEN, the space HS([Rn) coincides - as a result of Prop- osition 2 of 1- with the space Hm([Rn) of the Definition 2 of 1 and the prehilbertian structures defined by the scalar products (1.1) and (2.1) are the same. D Let us state some properties of these spaces. Proposition 1. For each real number s, the space HS([Rn) of the Definition 1 satisfies the following properties: (i) HS([Rn) is a Hilbert space. (ii) If S1  S2 then H S l ([Rn) c H S 2([Rn) and the injection is continuous. Proof (i) It is sufficient to establish that HS([Rn) is complete:. Let {jj} be a Cauchy sequence in HS([Rn); we then deduce that {{ 1 + I 1 2 } tjj} is a Cauchy sequence in L2([Rn) which is complete and so (1 + 112)tSJj  g in L 2 ([Rn). Hence g E g'([Rn) and since (1 + 112)-ts is of slow growth, (1 + 112)-tSg E g'([Rn); thus there exists fE g'(lRn) such that (1 + 112)-tSg == J Then f E g' ([Rn) and (1 + 1  1 2 )S/2 J E L 2 ([Rn) i.e. f E HS([Rn) and jj converges to fin HS([Rn). (ii) This results from the fact that (1 + I  1 2 )S2  (1 + 1  1 2 )S 1 if s 1  S 2 . D Remark 2. We thus have the following inclusions: for all 8 1  S2  o. g([Rn) C H S 1([Rn) C HS2([Rn) C . . . c HO([Rn) == L2([Rn) C H- S 2([Rn) C H- S l([Rn) C g'([Rn). D I + 
2. The Space Hs(n) 97 We now turn to a result which is useful in the theory of partial differential equations and which emphasizes the importance of the spaces HS(lR n ). Theorem 1. Every distribution with compact support in lR n is in HS(lR n ) for a certain s E lR. For example b o , the Dirac distribution at the origin in lR n belongs to HS(lR n ) whenever s < -tn. Corollary 1. Every distribution in lR n is locally in HS(lR n ) for a certain s E lR, i.e. V T E ' (lR n ) V qJ E (lRn) 3s E lR: qJ T E HS(lR n ). The corollary follows immediately from Theorem 1 since qJ T is a distribution with compact support. Proof of Theorem 1. We know (see Schwartz [lJ) that every distribution with compact support can be written as a finite sum of derivatives of functions of L 2 . There exists therefore a positive number k such that T == L Dafa, lal  k The distribution T being of compact support, is in Y'(lRn) and we have: fa E L 2 (lR n ). T() == L (i)a l(). lal  k Consequently 2 k  I a I   (1 + !I )-2! T()!  L (1 + 1!2)kI2 Ihl  c L Ifa! lal  k lal  k and L fa E L 2(lR n ) which clearly proves that f E H-k(lR n ). lal  k We could equally well observe that D a maps L 2 (lR n ) into H-lal(lR n ), from which again the result follows. o The theorem which follows, concerning the approximation of functions in HS(lR n ) by more regular functions is important in the measure to which it allows us to extend to HS(lR n ) certain properties valid for functions of (lRn). Theorem 2. The space HS (lR n ) is a normal space of distributions, which means that (lRn) is dense in HS(lR n ). Proof Let f E HS(lR n ); thenf E Y' (lR n ) and (1 + I  1 2 )t s j E L 2(lR n ). Since (lRn) is dense in L 2 (lR n ), there exists a sequence (qJj) E (lRn) which converges in L 2 (lR n ) to (1 + 112)tsjwhenj  00. Let us define the sequence (gj)jEN by gj() == (1 + 112)-S/2qJj()' which has a sense since (1 + 112)-S/2qJj E (lRn) c Y(lR n ). 
98 Chapter IV. Sobolev Spaces Hence gjEY(lR n ) and by construction (1 + 112)SI2gj converges to (1 + 112)sI2 Jin L 2 (lR n ); stated otherwise gj tends to fin HS(lR n ). The theorem follows then from the following two facts:  (lR n ) is dense in Y (lR n ) and Y(lR n ) is contained in HS(lR n ) and the injection is continuous. D Remark 3. We must guard against believing that Theorem 2 always remains valid when we replace lR n by Q, an arbitrary open set (whose complement has positive capacity) and with s a positive integer. Indeed, we shall see later that if m  1 and if Q is a regular open set, not identical with lR n , (Q) is not dense in Hm(Q); we shall give later appropriate results on the approximation of functions of Hm(Q). D Remark 4. We can also define the space HS(lR n ) by interpolation between L 2 (lR n ) and Hm(lR n ), (see Lions-Magenes [lJ, Chap. 1). o 2. The Topological Dual of HS(lR ft ) This section is devoted to the characterisation of the spaces of continuous linear forms on HS(lR n ). We first of all notice that a general result of functional analysis informs us that if E c F are two topological vector spaces such that E is dense in F, then the dual of F can be identified with a sub-space of the dual of E, which we write F' c E' (see Chap. VI and Choquet [lJ, Bourbaki [2J). Theorem 2 assures us that (Hs(lR n ))', the dual of HS(lR n ), satisfies (Hs(lR n ))' c '(lRn), that is to say that the dual of HS(lR n ) is a space of distributions. In fact, we even have (Hs(lR n ))' c Y'(lRn) c '(lRn) since Y(lR n ) is also dense in HS(lR n ) (see the proof of Theorem 2). Theorem 3. For all s E lR, the dual of HS(lR n ) coincides in '(lRn) (algebraically and topologically) with H - S (n). 2 The theorem means that there exists a linear mapping T: (H s )' --+ H-s which is bijective and such that T and T- 1 are continuous. Proof First of all the characterisation of continuous linear forms on a Hilbert space H, applied to the case in which H = HS(lR n ) allows us to say that: for all F E (H s )', there exists ({JF E H such that for all u E H S we have (2.3) F(u) = (u, <fJF)s = r (1 + 112)s/2U()(1 + 112y/2cPF() d Jn 2 This theorem which enables us to identify the dual of Hs(n) allows us also to identify the antidual of Hs(n) (i.e. the set of continuous anti linear forms on Hs(n), see Chap. IV) with H-s(n). 
2. The Space HS(IRn) 99 where (,)s denotes here the scalar product in H s(lR n )3 and II CfJ F II s == II F II (HS)" As def g == (1 + 112 )s/2  CfJF E L 2 (lRn) C Y'(lRn) ( denoting the Fourier transform- ation), 'we have (2.4) def TlpF ==  {(I + 112)s  CfJ F } E Y'. We show that TlpFE H-s(lR n ). Indeed, we have { (I + 112)-s/2(TlpF) = (1 + 112)-s/2{(1 + 112r/2g} = constant. gEL 2 (lR n ) def where g() == g( - ). On the other hand II TlpFII-s == 11(1 + 112)-s/2TlpFIIL2 == CII(l + 112)s/2CfJFIIL2 == CIICfJFlls. From this we have (2.5) II Tlp F II - s == C II F II (HS)' . Let us show that (2.6) Tlp F -== F in '(lRn). For this, we denote by <, > (resp. « ,») the duality ',  (resp. Y', Y), and let u E (lRn); then < Tlp F' U > = « Tcp F' U » = «  - 1 Tlp F'  U » == « (1 + 112)-s/2 -l TlpF' (1 + 112)s/2  U » and (1 + 112) -s/2 ff- 1 TlpF == gEL 2(lR n ) and so (T"'F' u> = f (1 + 112)'/2  u()(1 + 112)SI2  CPF() d; using (2.3) we see it becomes < Tlp F' U > == F (u ) Consequently, we obtain (2.6). We consider now the mapping T: Vu E (lRn). { (HS(lR n ))'  H-s(lR n ) F  T(F) == TlpF' 3 Corresponding to the norm (1.4). 
100 Chapter IV. Sobolev Spaces From (2.6) and (2.5) we deduce that the mapping T is injective and continuous. To show that it is surjective, let v E H-s(lR n ) and put qJ == ff{(l + 112)-s/2.(1 + 112)-s/2ffv} EHs(lR n ). Now let F be the element corresponding to qJ in (H s )', that is to say qJ == qJp. The preceding calculations show that T(F) == TlpF == v. Then T is bijective and continuous and hence, by Banach's theorem,4 an isomorphism. D 3. The Equation (-A + k 2 )u = fin IR ft , k E IR\{O}. As a first application of the introduction of the spaces HS(lR n ), let us consider the partial differential equation in ' (lR n ) (2.7) ( - j + k 2 )u == f in lR n . Theorem 4. Let s E lR. For all f E H S (lR n ), there exists a unique u in H S + 2 (lR n ), solution of equation (2.7) with k E lR, k i= o. Proof Indeed (2.7) is equivalent, after diagonalisation by the Fourier trans- formation, to ( I  1 2 + k 2 ) U == f I.e. 1  (2.8) U = 112 + k 2 f If f E HS(lR n ) then (1 + 112s/2J E L 2(lR n ) and so, from (2.8) (1 + 112)(s+ 2)/2 U E L 2 (lR) and conversely, from which the theorem follows. D The formula (2.8) shows in addition that u depends continuously in H S + 2 (lR n ) on fin HS(lR n ): the operator ( - j + k 2 ) establishes an isomorphism of H S + 2 (lRn) on HS(lR n ) for k E lR\ {O}. 3. Sobolev's Embedding Theorem Theorem 1. Let s E lR and kEN, satisfying s > !n + k. We then have: HS(lR n ) c Bk(lR n ) def {u E CCk(lR n ); lim IDau(x)1 == 0 lal::( k} lal -+ 00 with continuous injection. 4 See Chap. VI, 91. This theorem is a consequence of the closed graph theorem. Notice that the continuity of the mapping T- 1 thus results directly from (2.5). ' 
93. Sobolev's Embedding Theorem 101 Before giving the proof of this result, let us make several remarks. Remark 1. (1) If n == 1; we have H3(lR) c CC 2 (lR). As a consequence, if in Theorem 4 of 9 2 , f E H 1 (lR), the solution u of ( - L1 + k 2 ) U == f is in H 3 (lR) C CC 2 (lR); it is thus a classical solution. (2) If n == 2, k == 0; we must have 8 > 1 for H S (lR 2 ) c BO (lR 2 ). (3) We can find functions u E H 1 (lR 2 ) which are unbounded in lR 2 (in particular u  BO (lR 2 )); for example u(x, y) == qJ(x, y) [log(x 2 + y2 )]a where 0 < rx < t, qJ E  (lR 2 ), qJ == 1 in a neighbourhood of the origin. This shows that the condition 8 > tn + k cannot be improved. Proof (a) Let us show the result for k == o. Suppose that u E HS(lR n ); then if u == ff u, we have: (3.1) r lu()1 d = r (1 + 112)s/2(1 + 112)-s/2Iu()1 d Jn Jn ( r d ) 1/2  Iluli s In;!n (1 + 112)S . As 28 > n, this last integral is convergent and thus U E L 1 (lR n ). Now in Y'(lR n ), u == .? - 1 U and so u E BO (lR n ) (see Appendix "Distributions"). We observe that since U E L 1 (lR n ), then u(x) is given by 1 r A . u(x) = (2nt J n;!n u()e' x d, and u satisfies the inequality (3.2) lIuIIBo(n) == sup lu(x)1  (2 1 ) n r lu(Old  (2 C ) n lIulls, x En n J n n using (3.1) with ( r d ) 1/2 C = In;!n (1 + 112y . This implies that the injection H S  BO(lR n ), for 8 > tn, continuous 5 (b) General case: we use the following (easily verifiable) fact: u E HS(lR n )  Dau E Hs-1al(lR n ) c Hs-k(lR n ) Vlexl  k. 5 We shall denote generally, in what follows, by  a continuous injection. 
102 Chapter IV. Sobolev Spaces Then, from (a), since s - k > tn, we have Dau E BO(lRn) Vlexl  k or again u E Bk(lRn). D Corollary 1. The functions which belong to the space HS(lRn) for all s E IR are infinitely differentiable, or again n HS(lRn) c CC OO (IRn). SE !R D Here, now, is a result which compares the space Hi (IRn) with other functional spaces: the spaces LP (IRn). Theorem 2. 6 When n > 2 we have the inclusion H 1 (lRn)  LP(lRn) for all p 1 1 1 satisfying 2  p  p where p is given by: - = _ 2 - -. p n Remark 2. The exponent p given above is the best possible in the sense in which there does not exist p > p such that Ilu IILP  ell u IIHi for all u E (lRn). D Remark 3. This result generalises to the spaces of functions of Hm(lRn) for m < tn; we have Hm(R n )  LP(lRn) . 1 11m wIth -  -  - - -. 2 ::;/ p ::;/ 2 n D 4. Density and Trace Theorems for the Spaces Hm(Q) (m E t\J* == t\J \ {O}) 1. A Density Theorem We have already proved (see Theorem 2 of 2) that (lRn) is dense in HS(lR n ); (Q) is not in general dense in Hm(Q) for m E N*. Definition 1. Space (Q). Let Q be an open set in IRn; (Q) denotes the space of restrictions to Q of the functions of  (IRn). We shall prove 6 Classical result due to Sobolev (see Sobolev [IJ). See proofs and complements in Adams [IJ, Gagliardo [1]. 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 103 Theorem 1. If Q is a sufficiently regular open set 7 , f0(Q) is dense in Hm(Q) for m E N*. Proof We shall establish the result for m == 1. (1) The case Q == IRn follows immediately from Theorem 2, 2. (2) The case of the "half-space": Q == IR. Let x == (Xl' . . . , X n ) E IRn; we put ( ' ) . h ' - ( ) fTl)n - 1. X == X, X n WI t X - Xl'...' X n - 1 E If\\ , then (4.1 ) IR == {(X', X n ) E IRn; X n > O} is the "open half-space" with boundary olR == r defined by (4.2) r == {(x', x n ) E IRn; X n == O} - R n - l (CCOO-variety of dimension n - 1), with the result that (4.3) IR == {(x', x n ) E IRn; X n  O}. We shall show (in three stages) that the functions of f0 (IR ) are dense in H l (IR ). 1 st stage: Truncation. We shall first establish the Lemma 1. The space of the functions of H l (IR ) with compact support is dense in H l (IR). Proof of Lemma 1. Let <P E f0(lRn) satisfying {O l <p(x) == if Ixi  1 if Ixi  2 Ix I == (lx l l 2 + . . . + I X nI 2 )1/2, and define <Pk by qJk(X) = qJ G ). k E N*. For v E H l (IR) we put V k == <Pk v; then { (i) supp V k C supp <Pk n IR is compact (4.4) (ii) v k E H l (IR). oV k O<Pk OV 2 In effect, V k and -;- == ;;- v + <Pk -;- E L (Q) (i == 1,. . . , n) as <P E f0(lRn) is uXi uX i uXi bounded, together with its derivatives, on IRn. Now IV k - vli 2 (1R",.) :::; 2 r IvI 2 dx -+ 0, Jklxl2k when k  00. 7 See (3) of the Proof of the Theorem and Remark 1. 
104 Chapter IV. Sobolev Spaces Similarly, av av <P - 4> - in L2(1R), when k 4> 00. k ax. ax. " " On the other hand, as there exists a constant C i (independent of k) such that a<p sup _ a (x)  C i (i == 1, 2, . . . , n), XE [Rn Xi we have: a<Pk C. - v  - k " IvIL2([R) 4> 0 a Xi L 2 ([R ) whenk  +00. Itfollowsthatv k 4> v(whenk 4> oo)inHl(IR)andhencethe lemma follows. D 2nd stage. It follows from the first stage that -we can assume, without loss of generality, that U E H l (IR) with compact support. Suppose that h > 0 (finally to tend to zero) and that Uh(X) == u(x', X n + h); then, as a result of the continuity of translations in L 2 , the restriction of U h to IR denoted by U h I [R tends to U in H 1 (IR) when h 4> O. Thus if we introduce l1h E eel (IRn) with { 1 if X n > 0 l1 h (x) == 0 l . f h X n < - , the function W h defined by W h == l1 h . U h satisfies (4.5) { i) W h E H l (IRn), supp W h compact; ii) W h I!R 4> U in H 1 (IR) when h 4> O. 3rd stage: Regularisation. Because of the second stage it is sufficient to approximate Wh (h fixed) by functions of (lRn). We shall introduce to this effect a "regularising" sequence of functions {p j} in the following manner. Suppose that p E (lRn) is such that p  0, r p(x)dx = 1; J [Rn we put P j(x) == jn p(jx) (p j 4> b In g' (IRn)). 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 105 We then define qJ j by <pix) = (Pj*wh)(x) = r pj(x - y')wh(y')dy' Jn supp qJj c within a compact neighbourhood of order 1jj of the support of Who (4.6) Now (for the properties of the convolution, see Schwartz [1] or Appendix "Distributions") if VEL 2(lRn) and supp v is compact Pj* v  v in L 2 (lRn) av - E L 2 (lRn) ax. l and if a av av - (Pj*v) = Pj*-  - in L 2 (lR n ) ax. ax. ax. l l l with the result that qJj  W h in H i (lRn). But the restriction operator r of Hi (IR n ) to Hi (IR) is continuous (since r is linear and II ru" Hl()  "U II Hl(n)); it follows that (4.7) rqJj = qJjl  rW h = whl in H 1 (IR). Hence the point (2) of the theorem. o (3) Case of arbitrary Q (sufficiently regular). We indicate briefly the main lines of the proof referring to Lions- Magenes [1] for the details. (i) By truncation, we can always reduce the problem to considering a function with compact support, with the result that we may suppose (4.8) - Q compact. (ii) We then use the definition of the variety r = aQ by local maps. This supposes a certain regularity in r, for example CC OO . Fig. 1 
106 Chapter IV. Sobolev Spaces There exists a finite family of bounded open sets (!) j, (j = 0, 1, . . . , p) - p covering Q such that U (!) j ::) r and for each j = 1, 2, . . . , p a diffeomorphism j= 1 qJ j (qJ j, qJ j-l of class CC OO ) which sends (!) j into the open cube Q = (] - 1, + 1 [)n of IRn. n h +1 Q+ 1: - - y' Q- -1 y Fig. 2 qJ j sends (!) j n Q in to Q + = {( y', Y n) E Q; Y n > O} (!)j n r into E = {(y', Yn) E Q; Yn = O} (!)j n CQ into Q- = {y E Q; Yn < O}. If (!)i n (!)j i= 0, then there exists a homeomorphism Jij of class CC oo , with strictly positive Jacobian, of qJi((!)i n (!)j) onto qJj((!)i n (!)j) such that qJj(x) = Jij(qJi(X)), \Ix E (!)i n (!)j. {(!)b qJi}' i = 1,2, . . . , p, is then a system of local maps defining r. Now let {(Xi} be a partition of unity subordinate to the covering {(!)i n Q} of Q P ((Xi E (lRn), supp (Xi C (!)i n Q, L (Xi(X) = 1, \Ix E Q); i=O Each element U E H 1 (Q) can be written in the form p (4.9) u(x) = (Xo(x)u(x) + L (Xi(X)U(X); i= 1 (Xo U is with compact support coained in Q and can, by regularisation, be approximated by functions of (Q). To approximate (XiU, (i = 1,, . . . , p), we define with the aid of the CC OO diffeomorphism qJi a continuous mapping of H 1 ((!)i n Q) into H 1 (Q+) with the result that the image of (XiU is approximated in H l(Q +) by functions of (Q +), 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 107 which is permis,ible because of the result for the half-space. By the inverse diffeomorphism, (Xi u is approximated by functions of  ((9i n Q) in H 1 ((!) i n Q) and we show, using (4.9), that u is approximated in the sense of H 1 (Q) by functions of (Q), from which the theorem follows. D Remark 1. We have supposed that r is a variety of class re oo (i.e. qJi and qJi 1 are of class re OO ) and that Q is locally on one side of r. The result for H l (Q) is valid if r is of class re l (i.e. qJi and qJi- l are of class re l ) and for Hm(Q), m > 1 if r is of class rem (i.e. qJi and qJi- l are of class rem). We can in fact weaken the hypotheses on r. For that, see Necas [1]. o Definition 2. We denote by H'O(Q) the closure of (Q) in Hm(Q), m E N*. 2. A Trace Theorem for H 1 (IR ) Statement of the Problem. Let u be a continuous function on IRn. If x == (x', x n ), then we can fix the coordinate X n == x and we obtain the function v: x' 4> v(x') == u(x', x) which is continuous on IRn-l. We say then that the function v is the trace ofu on the hyperplane X n == x. We cannot operate as simply with an element of H l (IRn) when n  2 as such an element is not in general represented by a continuous function, as a result of Sobolev's theorem (Theorem 1 of 3). We shall show that we can define, for a function of H l (IR n ), an extension of the notion of the trace on the hyperplanes X n == x; after a final change of variable, we may suppose that x == 0 and we shall show that we can define a notion of trace for the functions of H l (IR) on the boundary r == {(X',Xn)E IRn; X n == O} = IRn-l. For u E (IR), we can define a mapping (4.10) Yo: u 4> YoU such that x' 4> You (x') == u(x', 0); Yo, thus defined is linear from  (IR ) to  (IRn - 1 ). We shall prove the Theorem 2. (i) The linear mapping Yo defined by (4.10) is continuous from (IR) furnished with the topology of H l (IR) into (lRn-l )furnished with the topology induced by Ht(lRn-l). 
108 Chapter IV. Sobolev Spaces (ii) It can thus be extended to a continuous linear mapping from Hi ([R) into Ht([Rn-l), again denoted by Yo, which is surjective. (iii) The kernel of Yo, denoted by Ker Yo is given by (4.11) Keryo = H6([R+). Remark 2. The mapping Yo is called a trace map of order zero. D Proof of Theorem 2. Point (i) We have to show the existence of a constant C such that we have { IIYoullHl/2(n-l)  CliullHl(), for all u E ([R) ( 4.12) where You (x') = u(x',O), x' E [Rn-i. We then use the partial Fourier transform with respect to x', given by J(', x n ) = r e-i<',x'> f(x', xn)dx'. J n - 1 For u E ([R) the norm II u II Hl(n) is equivalent to II u II 1 given by IIulii = f +oo r (1 + 1'12)lu(', xnWd'dxn o J n - 1 + f + 00 dX n r a o u(', X n ) 2 d' o Jn-l X n = r (1+ 1'12) f +oo lu(',xnWdxnd' Jn-l 0 i f +oo a 2 + _ a u(', x n ) dXn d' n - 1 0 X n the norm IIyou IIHl/2(n-1) is equivalent to IIyou II /2 given by IIY o ulli/2 = tn-l (1 + 1'12)1/2Iu(',OWd' with the result that the problem reduces to establishing an inequality of the type (4.13) IIY o uili/2  Cllulli C constant > O. We shall use Lemma 2. Let v E H 1 ([R+), [R+ = ]0, + 00 [. Then v is (the class of) a con- tinuous function on [R + and we have (4.14) Iv(O)1 2  2IvIL2(+) .Iv' IL2(+)). 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 109 Proof of the Lemma. From Theorem 1, (IR + ) is dense in H 1 (IR +) and it is "enough to establish (4.14) for v E ( IR+ ) (the continuity follows, since 0 does not . playa particular role) d dt Iv(t)1 2 = 2 Re v(t)v'(t) from which, on integrating from 0 to + 00, we deduce Iv(OW = - 2Re L+oo v(t)v'(t)dt:::; 2 L+oo!V(t)"V'(t),dt. Using the Cauchy-Schwarz inequality we derive (4.14). D We shall apply Lemma 2 to the function v (depending on the parameter ') defined by v(Xn) = u( ', x n ), U E (IR ). This gives ( f + 00 ) 1/2 ( f + 00 lu(', OW :::; 2 0 lu(', xnW dXn 0 a 2 ) 1/2 DX n u(', X n ) dXn ; from which we deduce (1 + 1'12)1/2Iu(', 0)1 2 ( f +oo ) 1/2 (f +00 :::; 2 (1 + 1'12) 0 lu(', xnWdx n 0 a 2 ) 1/2 DX n u( ', X n ) dXn and using ab  t(a 2 + b 2 ) (1 + 1'12)1/2Iu(', 0)1 2 f +oo f +oo :::; (1 + 1'12) 0 lu(, xnW dXn + 0 a D u(', X n ) 2 dXn. X n Integrating over IRn-1 we obtain (4.13) with C = 1. D Point (ii) Because of the density of (IR) in H 1 (IR), the mapping Yo extends to a linear mapping Yo E 2(H 1 (IR), Hi(lRn-1)). As r = a(IR) = IRn-1, YoU E Ht(r) and hence we can define the trace of u E H 1 (IR) on the boundary r. Let us now show that the mapping Yo is surjective. For this let a E Hi (IRn-1); we shall show that we can construct <p E H 1 (IRn) such that Yo <p = a. 
110 Chapter IV. Sobolev Spaces We consider a function tjJ of class CC OO with (4.15) { tjJ is defined in IR +, tjJ(O) = 1 supp tjJ is compact; we define cP(., x n ) by " cP(', x n ) = a(')tjJ[(l + 1'12)txn]' a E Ht(lRn-1) with the result that, by the inverse partial Fourier transform, we can write (4.16) <p = Ra where R (which is a convolution operator) is called a lifting operator, or more briefly a lifting. We shall verify that R E 2(Ht(lRn-1), H1(1R)) is a right inverse for Yo. Now in an obvious way yo<p = yoRa = a as <;;;ip = a. In addition, we have immediately: ( 4.1 7) 11(1 + 1'12)1/2cPIIL2()  c lIaI11/2; further a _ a cP(', X n ) = a(')(l + 1'12)1/2tjJ'[(1 + 1'12)1/2Xn] X n and (4.18) f +oo a o cP(', X n ) 2 dXn = la('W(1 + 1'12)1/2 f +oo 1jI'2(t)dt. o X n 0 Integrating (4.18) over IRn-1 we find from (4.17) that (4.19) IIcPll1  c lI a 1l 1 / 2 from which we have point (ii). o Point (iii) - Ker Yo = H a(IR). It is immediate that Yo is a null operator on (IR) which, by definition is dense in Ha(IR). The definition of Yo in H 1 (IR) shows then that Yo is a null operator on Ha (IR ). We shall now show the converse { if U E H 1 (Q), Yo U = 0, then U E Ha(Q), (4.20) where Q = IR. As a result of the density of functions of H 1 (Q) with compact support in Q = IR we can suppose that (4.21) U E H 1 (Q); Yo U = 0, supp U compact. , We shall show that U is the limit in H 1 (Q) ofa sequence of functions Urn E Ha(Q) which will imply U E Ha(Q) since Ha(Q) is closed in H 1 (Q) by its definition. 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 111 For this we introduce the function (!)rn defined by 1 if X n  21m mX n - 1 if X n E [  ,  ] (!)rn(Xn) o otherwise and then define Urn by Urn = (!)rn u ; we verify immediately that Urn E {v E H5(Q); Yo v = O} and we have supp Urn C IR. By regularisation with the aid of the functions Pj (used in the proof of Theorem 1) withj > m, we can, for m fixed, approximate Urn by functions of ilR), which proves that we have Urn E H 5(1R) for all m. It remains for us to show that Urn 4> U in H 1 (IR). Now IU m - UlhD;!'+) = r f ;. 1((I)m - 1)u1 2 dx' dXn J n - 1 0  2 f ;. r Iu(x', xnW dx' dXn -+ 0 o J n - 1 when m 4> + 00. In the same way a au au _ a ((!)rnU) = (!)rn _ a 4> _ a In Xi Xi Xi L 2(1R) for i = 1, 2, . . . , n We have 1. a ((!) (!) ' au _ a rnU)= rnU+(!)rn- a . X n X rn aU au . (!)rn _ a 4> _ a In L 2(1R) and we shall show that (!)u 4> 0 in L 2(1R) X rn X n I (I);" u I hD;!".) = r dx' f 2/m m21 u(x', X n ) 1 2 dXn. J n - 1 l/rn But for almost all X' E IRn-l, X n 4> U(X', X n ) is in H 1 (]0, + CIJ E), so is represented, for almost all X', by a continuous function with respect to X n with u(x', 0) = 0, (see Lemma 2 and point i)). We can then write f x n au 2 f x n au 2 lu(x',xn)12 = _ a (x',y)dy  X n _ a (x',y) dy a.e. in x' o X n 0 X n 
112 Chapter IV. Sobolev Spaces so lu(x ' , xnW  x n r ;. u (x', y) 2 dy for J 0 uX n 2 X n  -. m Thus 1(i);"ulhlR",)  r dx ' m 2 r XndXn f ;. u (x', y) 2 dy J [Rn - 1 J In 0 u X n  _ 2 3 r dX' r ;. u _ (x', y) 2 dy  0 with 11m, J [Rn- 1 J 0 uX n from which we deduce point (iii) of the Theorem. o Application. Solution of the Dirichlet problem for the operator - A + k 2 in a half space. We seek u = u(x', x n ) satisfying { -Au + k 2 u = 0 in '(lR), (4.22) u(x',O) = g(X/), x' E IRn-l = r = alR with g given and with k E lR \ {O} given. As a result of Theorem 2 it is permissible to look for u satisfying (4.23) u E H l (lR) with g given in Ht(lR n - l ). We have: Theorem 3. For g E Ht (lR n - l) given, there exists u E H 1 (lR) satisfying (4.22). Proof We use, as in the proof of Theorem 2, the partial Fourier transform with respect to x' with the same notation; (4.22) becomes i) ( - ::?; + p + 1/12 ) u(', x n ) = 0, ' E IRn-l, X n E IR+ ii) u(/, 0) = g('), ' E lR n - l . (4.24 ) The differential equation (i) of (4.24) is a linear differential equation of the 2nd order with constant coefficients with respect to X n but depending on the parameter '. Every solution satisfying (4.24)ii) is given by ( 4.25) { u(', x n ) = g(')exp( - xn J k 2 + 1'12) + 2Bsh(x n J k 2 + 1'12) B, an arbitrary function of '; X n  U (', x n ) having to be in H l (lR) (which is a condition at infinity for u(', .)) it is necessary that we take B = O. 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 113 Hence the only possible solution of (4.24) is giv en by (4.26) u( ', x n ) = g( ') exp( - X n J k2 + I' 1 2 ). It remains to notice that u E H1() (corresponding to (4.26)). We easily verify, as in point ii) of Theorem 2 that (4.26) defines a lifting of g in the space H 1 () from which the theorem follows. D 3. Traces of the Spaces Hm(IR) and Hm(Q) (1) Traces of the Spaces Hm() Definition 3. Let u E ( ). We call the trace of order j, (j EN)" of u on the boundary o() = linear mapping Yj defined by { Y j : u -+ Y j u, j = 1, 2, . . . , n (4.27) oj oj Yju(x') = ------; u(x', 0) =  u(x', xn)lx = 0' x' E n-1. ux J ux J n n n n-1 the , Using the density of ( ) in Hm(), and the methods of Theorem 2, we establish the Theorem 4. Let mEN * (i) The mapping I'm: U -+ Ym U = (YoU, Y1 u, . . . , Ym-1 u) defined on () ex- tends by density to a continuous linear surjection of Hm() onto m-1 11 H m - j-t(n-1). j=O m-1 (ii) There exists a continuous lifting of 11 H m - j-t(n-1) into Hm() and we j=O have: ( 4.28) Ker I'm = H(). Proof We refer to Lions-Magenes [1]. (2) Traces for Hm(Q) (Q a "sufficiently regular" open set). If Q is a "very regular" open set (r a variety of dimension N - 1 and class OO), respectively "regular"8 (r a variety of class m) we can give (with the aid of local maps defining r) a definition of HS(r) for s E  (resp. s  m (see Lions- Magenes [1], Necas [1]). For example for u E H 1 (Q) with oQ of dimension n - 1 and piecewise of class 1, we can define YoU as an element of Ht(r). 8 We suppose in addition that Q is situated, locally on one side of r. 
114 Chapter IV. Sobolev Spaces We shall now state a result for the proof of which we refer to Lions-Magenes [lJ Necas [lJ and P. Grisvard [1J. Let n be the exterior normal to r == aQ, the boundary of a sufficiently regular - aju a a 1 open set Q. For U E '@(Q) we write Yju = on j / YoU = ulr, where on = on i is the derivative along the exterior normal to r. We have the Theorem 5. Let u be a "very regular" open set 8 in lR n and let mEN *. (i) The mapping "1m: U  Ym U == (You, . . . , Ym-1 u) defined on !?fi(Q) with values in [22(r)Jm has an extension by density to a continuous linear surjection from Hm(Q) m-1 onto TI H m - j-t(r). j=O m-l (ii) There exists a continuous lifting of TI H m - j-t(r) into Hm(Q) and we have j=O ( 4.29) Ker "1m == H'l;(Q). Remark 3. The relation (4.29) means that for m == 1 (4.30) H6(Q) == {u E H 1 (Q); ul r == O}. and for m > 1 (4.31) H'(;(Q) = {u E H I (Q);  r = 0, j = 0, 1, . . . , m - 1 }, aou with --;-0 == u I r. un r o 4. Properties of m- Extension Definition 4. We say that an open set Q c lR n possesses the m-extension property mEN, if there exists an operator P such that { i) P E 2(H m (Q), Hm(lR n )) ii) Pu == u a.e. in Q, Vu E Hm(Q). (4.32) We note the following two propositions. Proposition 1. Let V c Hm(Q) be a dense subspace of Hm(Q) and let P: V  Hm(lR n ) be a continuous linear operator of V into Hm(lR n ) when provided with the topology induced by Hm(Q) and such that we have (4.33) Pu == u in Q for u E v. Then P has a unique extension into an operator of m-extension from Hm(Q) to Hm(lR n ). 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 115 Proof We know that there is a unique extension of P into a continuous linear operator of Hm(Q) into Hm(n); again let us denote this extension by P. It remains therefore to show that Pu = u a.e. in Q for u E Hm(Q). This follows from the fact that { the convergence of a sequence in Hm(Q) implies the convergence ( 4.34) a.e. of an extracted sub-sequence. o Proposition 2. Let Q be an open set and Ql and Q2 such that ( 4.35) Q = Ql U S u Q2; S = a 1 n a 2 is the interface between Ql and Q2' assumed regular, for example of class m+ 1, mEN. Then let U'E m(Q) be such that ( 4.36) ul.aj E (Qj)' } = 1, 2. Under these conditions we have ( 4.37) U E Hm+ 1 (Q). Proof We shall establish the result for m = 0, leaving the task of adapting the proof for the case m  1 to the reader. Fig. 3 Let Vi E L 2 (Q) be defined by its restrictions to Q j, j = 1, 2: a Vi l.a- = _ a (u In), i = 1,. . . , n; 1 x. 1 l to show that u E H 1 (Q) it is enough to show that the continuity of u across the interface S implies that au . ' ( ) Vi = _ a In  Q. x- l 
116 Chapter IV. Sobolev Spaces Let qJ E (Q); we have <Vi,qJ) = f viq>dx = t f q>;-(uIQj)dx. il J = 1 ilj u X i But on integrating by parts over Qj' we have f 0 '+ 1 f f OqJ qJ -;- (u Inj) dx = ( - ) J qJ (u I .Qj) ni dS - (u I ilj) -;- dx n. uXi S n. uX l ' 1  where (n i ) denotes the unit vector of the normal to S directed from Q1 towards Q2; thus <Vi' qJ) = f (UI.Ql - U.Q2)CP . nidS - f u q> dx S il UX i from which the result follows since u E O(Q) and UI.Ql - ul.Q2 = 0 on S by hypothesis. D We shall now establish the Theorem 6. The open set Q = [R possesses the m-extension property, mEN. Proof i) Case oim = 0 In this case, for an arbitrary open set Q it is enough to use the canonical extension P defined by "'" { u inQ Pu = U = o otherwise and we have II Pu II L 2 (n) = II u II L 2 (n) = II u II L 2 (Q) . (ii) For m  1 we shall make use of Proposition 1 with V = ([R). Let u E ([R) and ( ' ) { X' = (x l' . . . , X n - 1 ) X = x,y y = X n . We then put (4.38) ( ) 1 u (x', y) if y  0 Pu x', y = m L CXku(x', - ky) for y < 0 k=l where the CXk are constants chosen in such a way that Pu E CC m - 1 (Q). For that we write the continuity of derivatives in y up to order m - 1 on the boundary r = {(x', y) E [Rn; y = O}; we obtain: oj oj  Pu(x', 0_) =  u(x', 0+), j = 0, 1, . . . , m - 1; uyJ uyJ 
4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 117 but as u E ([R ), we have oj oj m. oj  u(x', 0+) =  u(x', 0) = L (- k)Jrx k  u(x', 0), uyJ uyJ k = 1 uyJ which will be satisfied if we. impose the conditions (4.39) m L (- k)jrx k = 1, j = 0, 1, . . . , m - 1. k=l The system (4.39) is a system (of van der Monde) which admits one, and only one, solution (rxk)k = 1,2, . . . , m; Pu is thus of class ce m - 1 in [Rn and satisfies the hypotheses of Proposition 2, so Pu E Hm([Rn). We then show easily that we have: ( 4.40) II Pu IIHm(n)  y II U IIHm(), y constant > 0, from which we have Theorem 6. o Remark 4. The extension thus defined by (4.38) is called extension by reflexion. o By the methods indicated in the proof of Theorem 1 (point (3)), we can, in red ucing to the case of the half-plane, prove the Theorem 7. Let Q be a bounded "regular"9 open set in [Rn (r of class ce m , m E t\J *). Then Q possesses the k-extension pro perty with 0  k  m. (See Lions-Magenes [1J). Remark 5. Other open sets than those of Theorems 6 and 7 possess the m:- extension property. We cite for example (1) an open cube of [Rn; more generally Q = {x = (x l' . . . , x n ) E [Rn; ai < Xi < b i 1 = 1, 2, . . . , n} where ai' b i E [R (2) Q = [Rn+ = {x E [Rn; Xi > 6 i = 1, 2, . . . , n} "the positive. quarter- plane" . We can show in a manner similar to that used in Theorem 1 that ([Rn+) is dense in Hm([Rn + ). In addition, we use successively extensions by reflexion relatively to each variable, the extension P is then obtained by P = p n oP n - 1 oP n - 2 0 ... oP 1 where Pi is an extension by reflexion relative to Xi' 9 With again Q locally on one side of r. 
/ 118 Chapter IV. Sobolev Spaces For example, for [R2 +, we begin by extending to the half-plane y > 0 by using Px (which is called the Babitch extension in x). Then we use Theorem 6 which ensures the extension in y; thus Pu = Py 0 Pxu, UE([R2+). D Remark 6. Let Q be an open set of [Rn. We can define for all s E [R .Y(S(Q) = {u E '(Q), :3 U E HS(lR n ) such that u = U I Q}; provided with the quotient norm II u II yeS(Q) = inf II U II Hs(n), U U I.Q= U .Y(S(Q) is a Hilbert space. For all s E [R, we have by definition .Y(S([Rn) = HS([Rn). Now if Q i= [Rn, for s = mEN (4.41 ) .Y(m(Q)  Hm(Q); we have the following result: Proposition 3. If Q has the m-extension property, then (4.42) .Y(m(Q) = Hm(Q) with equivalence of the norms. Proof Let u E Hm(Q), then Pu E Hm(lR n ) and Pu = u a.e. in Q which proves that u E .Y(m(Q). In addition II u II yem(Q)  II Pu IIHm(n)  ell uIlHm(Q), which proves that (4.43) Hm(Q)  .Y(m(Q), which together with (4.41) gives (4.42). D Theorem 8 (Extension of H(Q), mEN). Let Q be an open set of [Rn. The canonical extension u  u where ( 4.44) u(x) = {(x), xEQ xQ is a continuous mapping from H(Q) to Hm(lR n ). 
g4. Density and Trace Theorems for the Spaces Hm(Q) (m E N* = N \ {O}) 119 Proof As !0(Q) is dense in H(Q) by definition, it is obvious that for u E !0(Q), U E !0(n) and that we have II U IIH1(n) = II U IIH 1 (Q) from which the result for m = 1 follows by Proposition 1 10 . The case m  2 can be treated similarly. D Orthogonal decomposition of Hi (Q). As H(Q) is a closed subspace of the Hilbert space Hi (Q), this latter space is the direct sum of H(Q) and its orthogonal .Yr(Q) ( 4.45) H 1 (Q) = H(Q) (f) .Yr(Q). An element h of H 1 (Q) is then in .Yr(Q) if and only if (h, V)Hl = 0, as !0(Q) is dense in H(Q). That is to say r - n r ah av JQ h(x)v(x)dx + Jl JQ OX; (x). OX; (x)dx = O. Vv E !0(Q) (4.46) The equality (4.46) is equivalent to the fact that, in the sense of distributions, we have ( 4.47) -L1h + h = O. From (4.45) and (4.47), we deduce that for all u in H'(Q), there exists U o E H(Q) and h E .Yr(Q) such that (4.48) { u = U o + h -L1u + u = -L1u o + U o . Let us make (4.48) explicit in the case in which Q is a regular bounded open set. We use the notation E = {f = -L1u + u, u E H 1 (Q)}; then from (4.48) { - L1v + v = f 'if fE E, 3v E H 1 (Q) such that Yov = 0; thus we have solved the Dirichlet problem for the operator ( - LI + I). We notice similarly that the orthogonal of H(Q) in H 1 (Q) is, for example, when Q is bounded, of infinite dimension if n  2 and of dimension 2 if n = 1. 10 Proposition 1 remains true if Hm(Q) is replaced by Ho(Q). 
120 Chapter IV. Sobolev Spaces 5. The Spaces H-m(Q) for all mEN We have seen in 4 that, by definition, (Q) is dense in H(Q) when the latter is endowed with the norm of Hm(Q). Consequently, the dual of Hm(Q) i.e. the space of continuous linear forms on H(Q), is a sub-space of '(Q). We can introduce the Definition 1. For mEN, we denote by H -m(Q) the dual of the space H(Q). This notation is motivated by the fact that when Q = n, we have H(Q) = Hm(lR n ) and that Theorem 3 of 2 affirms that the dual of Hm(lR n ) coincides with the space H -m(lR n ). Our notation is thus coherent. Proposition 1. Properties of the spaces H -m(Q). (i) H -m(Q) is a Banach space!! when it is provided with the norm I F(u)1 II F 11- m Q = sup · lIullm,n 1 II u 11m. Q u#O (ii) if m > m', H -m' (Q) c H -m(Q) with continuous injection. (5.1) (i) is a general fact concerning the dual of a normed space and (ii) follows from the density of H in H' if m > m' since we have: (Q) c H(Q) c H' (Q) and that (Q) is dense in H m ' (Q). D The following theorem provides a complete characterisation of H -m(Q). First of all let us give the Definition 2. For mEN, let Em be the space of distributions T E '(Q) which can be written in the following (non unique) form (5.2) T = L D rx frx where frx E L 2 (Q), Irxl m provided with the topology induced by II TII2 - inf L IIfrxll 2 . fIX' 'L DlXflX = T Irxl  m Theorem 1. H-m(Q). The space Em coincides algebraically and topologically with . 11 H -m(Q) is even a Hilbert space for this norm. 
g5. The Spaces H-m(Q) for all mEN 121 Proof (a) If T E '(Q) and T = L Dafa, then for all qJ of (Q) we have lal  m < T, qJ > = L < D a fa, qJ > = L (- 1 )Ia l < fa, D a qJ >; lal  m lal  m consequently <T, qJ> - L (- l)la l f fa(x)DaqJ(x) dx lal m Q so, by virtue of the Cauchy-Schwarz inequality ( ) 1/2 1< T, qJ >1  L II fa lIi2(Q) . II qJ IIHm(Q). lal  m This shows that the mapping qJ  < T, qJ > is linear and continuous on (Q) provided with the norm of Hm(Q). Since @(Q) is dense in H(Q),_this mapping extends to a continuous linear mapping T of H(Q) into C, i.e. T E (H(Q))'. We therefore have a mapping: { Em  _(H(Q))' T  T. - - Ihis mapping is injective for if T 1 = T 2 , their restrictions to (Q) coincide; but T i I9(Q) = T i , so T 1 = T 2 . This proves that Em can be identified with a sub-space of (H(Q))'. (b) Conversely if F E (HQ)', the Hahn-Banach theorem 12 implies that F can be written in the non-unique form (5.3) F(u) = L f ga(x).Dau(x)dx where ga E L 2 (Q), lal :::s m Q for all u E H(Q). Indeed, it is enough to consider the mapping u  (Dau )Ial  m of H'(Q) into (L 2 (Q))N where N is a suitable integer. This is an isometric linear mapping; we can thus consider H(Q) as a closed sub-space of (L 2 (Q))N. From the Hahn-Banach theorem, if F E (H Q)', F extends to a continuous linear form on (L 2 (Q))N. The equality (5.3) then follows from the description of continuous linear forms on a Hilbert space (here L 2 (Q)N). Considering the restriction of F to (Q) which is dense in H(Q), we obtain from (5.3) F = L (- l)la l D a g a where ga E L 2 (Q), lexl  m. lal m D 12 See Chap. VI. 
122 Chapter IV. Sobolev Spaces Having defined the spaces H -m(Q), we are now able to generalize Theorem 4 of 92 to the case in which we are considering an open set Q i= [Rn. Theorem 2. The operator L = - A + I (where I is the identity) is an iso- morphism of H(Q) onto H- 1 (Q). Proof First of all if u E H (Q), Lu = u + .f ,,0 ( - u ) E H -1 (Q) l = 1 uX i uX i by Theorem 1. Thus L sends H(Q) into H- 1 (Q). We recall next that the scalar product on H(Q) is given by (U,V)l = .f f u v dx + f u.vdx. l= 1 QUX i uX i Q If F E H -1 (Q) = (H(Q))', because of the characterisation of continuous linear forms on a Hilbert space, here H(Q), we have (5.4) 3u E H (Q): F(v) = (u, V)l, Vv E H(Q). In particular if v E (Q) we have, denoting by <,) the duality , ': _ n \ au av ) F(v) = <F, v) = (u, V)l = <u, v) +,L -;-,-;- l = 1 uX i uX i that is to say < F, v) = < - Au + u, v) Vv E (Q) which shows that in the sense of the distributions on Q, we have (5.5) -Au + u = F and as u E H 5 (Q), (5.5) proves that ( - J + I) is surjective. In addition, from (5.4) we have II u 116(Q) = F(ii)  II F IIH-l(Q) lI u IIH6(Q) from which (5.6) Il u IIH6(Q)  II F IIH- 1 (Q) = II - Ju + u II H-l (Q). We deduce from (5.6) that - J + I is injective. In addition ( - A + I) is continuous because IF(v)1 II -Llu + ull w l(Q) = sup II II II v 111  1 V 1 v*O sup II V 111  1 v*O I<u, v) I II V 111  II U II Hb(Q). Finally, the continuity of G = ( - J + I) - 1 follows from the application of Banach's theorem or from the inequality (5.6). D 
g6. Compactness 123 6. Compactness We recall that a linear operator A from a Banach space E into a Banach space F is said to be compact if A transforms any arbitrary bounded set of E into a relatively compact sub-set of F. We shall see the importance of the compactness of the operator in L 2 {Q), G == (- L1 + AI) - 1. Later we shall see that this property follows from certain results on compactness in Sobolev spaces-results which form the object of the present section. Theorem 1 (Rellich's Theorem). Let Q be a bounded open set in n; thenfor all m in N, the injection of H+ 1 (Q) into H{Q) is compact. Proof Let B be a bounded set in H + 1 (Q); we have to show that for each sequence in B, we can extract a convergent sub-sequence in H{Q). Let {Uk)k c.B, then IlukIIH+l  C where C is independent of k. Since a ball in a Hilbert space is weakly compact, we can extract from the sequence (Uk), a subsequence, denoted by (v j ), weakly convergent in H+ 1 (Q) to v. Putting w j == v j - v, we can assume v == o. Let us denote by v j the extension of v j by zero outside of Q. Then, from Theorem 8 of 94, we have (6.1 ) V j E Hm+ 1 (n) and II v j IIH(J2) = II V j II Hm([Rn) and h.ence, denoting by :F the Fourier transform, it follows that Il v j I11;;'(Q) = r (1 + 112)mlffvj(Wd. J [Rn We put J 1 . j = r (1 + 112)mlffviWd JII  R J 2 . j = r (1 + 112)mlffviWd, JII > R so II v.111m ( Q ) == J 1 . + J 2 .. J 0 ,J ,J Let us estimate the term J 2. j = r (1 + I  1 2 ) - 1 (1 + I  1 2 )m + 11 ff V j(  W d JII > R  (1 + R 2 )-1 f {1 + 112)m+ 11:F Vj{)12 d Rn from which we deduce J 2, j  (1 + R 2 ) - 1 II v j 111 + 1 ( Q) . 
124 Cha pter IV. So bo lev Spaces Therefore 8 > 0 being fixed, there exists Ro such that for R  Ro, we have (6.2) J 2 ,j  8/2. In what follows, R is a fixed number, R  Ro. Let us now estimate the term J l,j. Since the open set Q is bounded, we have L 2 (Q) c L 1 (Q) and hence for all  E [Rn: ffVj() = r vj(x)e-i<x'>dx = f vj(x)e-i<x'>dx. Jn Q Let qJ E ([Rn) be equal to 1 on the compact set Q, then (6.3) ffvi) = r Vj(x)<p(x)e-i<x'>dx = (v j , <p(x)e-i<x'»L2(1Rn). Jn By hypothesis, when j  00, v j converges weakly to zero in L2([Rn) and as qJ(x)e-i<x,> E L 2 ([Rn), (6.3) implies (6.4) :F Vj()  0 for all  E [Rn. j -+ CX) On the other hand, from (6.3) and by virtue of the fact that (v j ) C (Uk) C B (6.5) I:F Vj()1  II V j IIL2(n). II qJ IIL2(n)  ell qJ IIL2(n). Let us consider J 1 ,j = r (1 + 112nffvj(Wd. JII  R From (6.4) and (6.5) we can apply Lebesgue's theorem and conclude that, 8 > 0 being given, for j sufficiently large (6.6) J 1,j  8/2. From (6.1), (6.2) and (6.6), the sequence v j converges to zero in H(Q). This completes the proof of Theorem 1. Remark 1. (1) The result of Theorem 1 does not, in general, persist if Q is not bounded, or if we replace H(Q) by Hm(Q). (See Theorem 2 below.) (2) In particular we have: the injection of H5(Q) into L 2 (Q) is compact when Q is bounded. D We shall now turn our attention to the spaces Hm(Q) and give a sufficient condition on the open set Q for the analogue of Theorem 1 to be valid. Theorem 2. Let Q be a bounded open set in [Rn possessing the (m + I)-extension property. Then the injection of H m + l(Q) into Hm(Q) is compact. 
g7. Some Inequalities in Sobolev Spaces 125 Proof Let B be a ball such that Q c Band V1 E (B), V1= 1 on Q. Denote by P the operator extending Hm+l{Q) into Hm+l{[Rn). We have p '" { Hm+l{Q)  Hm+l{[Rn)  H+l{B) u  Pu  V1{Pu). From the Leibnitz formula DC1:{fg) = L CDPfDC1:-Pg, P we verify easily that the mapping u  V1{Pu) from Hm+l{Q) into H;::+l{B) is continuous. On the other hand, the mapping G: H{B)  Hm{Q), which is the operation restricting to Q a function defined on B, is continuous. Finally, the mapping H: H + 1 (B)  H{B), which is the identity, is compact by Theorem 1. Denoting by j the injection of Hm+ 1 (Q) into Hm{Q), we can decompose it in the following manner: . I Hm+l{Q)  Hm+l{[Rn)  H+l{B)  H{B)  Hm{Q) J: u  Pu  V1{Pu)  t/J{Pu)  u. Now the composition of a continuous mapping and a compact mapping is compact and so j = Go H 0 V1 0 P is compact. This establishes the theorem. o 7. Some Inequalities in Sobolev Spaces 1. Poincare's Inequality for H(Q) (resp. H';(Q». Definition 1. Let  E [Rn, I  I = 1 and ] a, b [ c [R with mesJ a, b [ = d > 0, d =1= 00. Putting (7.1) B d ( ) = {x E [Rn; a < x.  < b} (x .  = X 1  1 + x 2  2 + . . . . + X n  n ) we say that Bd{) is a band of [Rn of width d in the direction . We shall prove the Proposition 1. Let Q be an open set in [Rn such that there exists a band Bd{) with (7.2) Q C Bd{). 
126 Chapter IV. Sobolev Spaces Then for all u E H5, we have Poincare's inequality 2 d 2 2 (7.3) luI L 2(Q)  2IVul(L2(Q»n ( IVu1fp(Q»n = Ig radu lfL2(Q))" = t f  = 1 Q au 2 ) OX; dx . Proof !0(Q) being dense in H5(Q) it is enough to establish (7.3) for u E !0(Q). Let u E !0(Q); we can then write u(x + t) = I Vu(x + crO'  dcr, from which, on using the Cauchy-Schwarz inequality, we deduce (7.4) lu(x + tW  (t - a) r IVu(x + crW dcr. But if P denotes the hyperplane of fRn orthogonal to  and iff E L 2 (Q) (fbeing naturally extended by 0 outside of Q) we have L If(yWdy = r dt t If(x + tWdx, with the result that from (7.4), we deduce 2 (b - a)2 r 2 lulp(Q)  2 JQ IVul dx, Vu E !0(Q), D which is the stated result. (7.5) Corollary 1. Let m E Nn and u E H(Q), with the hypothesis (7.2) on Q. Then ( d2 ) m lulz2(Q)  C m 2 L IDaulz2(Q)' lal =m C m == constant. Proof The proof is immediate by iterating the inequality (7.3) and using the density of !0(Q) in H(Q). D Corollary 2. Let Q be an open set in fRn contained in a band of finite width d. Then (7.6) u  ( i _ n l a au 2 ) 1/2, ( resp.: u  ( L I DauIZ2(Q» ) 1/2 )  Xi L2(Q) lal = m 
97. Some Inequalities in Sobolev Spaces 127 is a norm on H(Q) (resp. on H(Q)) equivalent to the norm II u 111 = ( luli2(Q) + f. a au 2 ) 1/2 i = 1 Xi L2 (Q) ( resp. II u 11m = ( L I Dau 1£2(Q) ) 1/2 ) . o  lal  m Proof From (7.3) we have immediately f. a au 2  II u II i  ( 1 + d 2 2 ) . f. au 2 i = 1 Xi L2(Q) l = 1 aX i L2(Q) and if m E N*, from (7.5) we have L IDaul£2(Q)  II u II  c ( 1 + d 2 2 +... + ( d22 ) m ) L ID au l£2(Q) lal = m lal = m from which the result follows. D Remark 1. If Q is a bounded open set in [Rn, it is obviously contained in a band of finite width. D 2. Poincare's Inequality for HI (Q) We shall now prove Proposition 2. Let Q be a bounded connected open set in [Rn such that we have: (7.7) the injection of H 1 (Q) into L 2 (Q) is compact 13 . Then we have (7.8) for u E H 1 (Q) such that fQ u(x)dx = 0 n au 2 lul£2(Q)  P(Q) L . i=l aX i L2(Q) M ore generally, we have for u E H 1 (Q) (7.9) . 2 1 f n au 2 lub(Q) - mesQ  u(x)dx 2  P(Q) ; 1 ax; ;J&   L2 (Q) The best constant in (7.8), (7.9) is called Poincare's constant. 13 This takes place if Q is sufficiently regular; for example if Q possesses the I-extension property. 
128 Chapter IV. Sobolev Spaces Proof First of all we notice that (7.9) follows simply from (7.8). In effect if u E Hl(Q) we put u = 1 f u(x)dx mes Q Q then the function - v=u-u satisfies V E H 1 (Q), fa VdX = 0, av ax- 1 au ax- ' 1 i = 1, 2, . . . , n with the result that (7.8) gives (7.9) since Ivl£2(Q) = lul£2(Q) - mes Q .luI 2 . It is therefore enough to prove (7.8). For that, let us suppose that (7.8) is not valid; then we can find a sequence {u m } with Urn E H 1 (Q) for all m, L umdx = 0 II um"lll = 1 for all m and a sequence {(Xm} of positive real numbers with (Xm -+ + 00 when m -+ + 00 such that (7.1 0) m 1  lu m l£2(Q)  (Xm L i = 1 au 2 m aXi L2(Q) As Ilum"l = 1, we can extract from {um} a sub-sequence again denoted by {um} which converges weakly in Hl(Q) to u. As a result of (7.7) we can suppose that {u m } converges strongly in L 2(Q) to u, with the result that (7.11 ) lu m IL2(Q) -+ luI L 2(Q). Then from (7.12) and as (Xm -+ + 00 for m -+ 00 we have: au 2 m = 0 aXi L2(Q) , (7.12) lim i = 1, 2, . . . n. m + 00 Since aU m au '- -+ - In '(Q), ax. ax. 1 1 l = 1,. . . n, (7.12) implies that 
97. Some Inequalities in Sobolev Spaces 129 (7.13) au _ 0 - , ax. l 1 = 1,. . . n. Since Q is connected, we deduce from (7.13) that u(x) = U o = constant, a.e. in Q. But the set of the f such that f E L 2 (Q), I/dX = O. is closed in L 2(Q). Thus o = L um(x)dx -+ LU(X)dX = uo(mesQ) = 0 so U o = O. Then on the other hand, from (7.11) and (7.12) II Urn II f  0 for m  + 00, and on the other hand II Urn II f = 1 for all m, which is a contradiction, giving (7.8). D Remark 2. The open sets Q such that (7.9) is valid are called N ikodym open sets. (see Deny-Lions [IJ). A bounded connected open set having the I-extension property is thus a Nikodym open set. A more general situation is given by the bounded open sets having the cone property (see Deny-Lions loco cit.). D Remark 3. It follows from Proposition 2 that u --+ ( f a au 2 ) 1/2 is a i=l Xi L2(Q) norm equivalent to the norm II . 111 on the sub-space V o (closed in H 1 (Q)) defined by (7.14 ) V o = {u E H 1 (Q); L u(x)dx = o}. D Remark 4. For simplicity suppose that Q is a bounded connected open set in [Rn with a sufficiently regular boundary r (for example, satisfying the I-extension property). Suppose also that r = r 1 u r 2' with the capacity of r 2 > 0, and that (7.15) V r2 = {u E H 1 (Q), ulr2 = O}; V r2 is a closed sub-space of H 1 (Q). . 
130 Chapter IV. Sobolev Spaces Then from (7.9): u  Ct 1 ou 2 ) 1/2 OXi L2(Q) is a norm on V r2 "equivalent to the norm induced by H 1 (Q). In effect if u E V r2 n OU Z satisfies L = 0, (7.9) implies that u is constant in Q and of zero i= 1 OX i L2(Q) trace on r z so u = o. D Remark 5. The inconvenience of the proof of Proposition 2 is that it estab- lishes only the existence of the constant P(Q), but does not make it precise. We shall see that it is possible to relate the best constant P(Q) with the second eigenvalue (or the first non-zero eigenvalue) of the Neumann problem in Q. Let us put ourselves in the situation of Proposition 2. We see from Remark 3, the sesq uilinear form n I ou ov a(u, v) = L _ 0 _ 0 dx i=l Q Xi Xi is continuous on V o x Vo, (V o defined by (7.14)) and coercive 14 on Vo. If therefore (7.16) fEL 2 (Q) with Lf(X)dX = 0, the problem of finding u E V o satisfying (7.18) a(u, v) = Lf. iidx for all v E V o admits a unique solution (see Chaps. VI and VII). Since from (7.18), the antilinear form v -+ a(u, v) is continuous on V o for the topology of L 2 (Q), u satisfies (7.17) (7.19) -Ju E L 2 (Q) (-Llu, v)g(Q) = IQJVdX for all v E V o . The functions u satisfying (7.19) are then the functions of H 1 (Q) with Laplacian in L 2(Q) and zero normal derivative (in a generalized sense) on the boundary of Q, with the result that (7.19) is equivalent to the Neumann problem (7.20) -Ju = f ou = o. on r 14 See Chap. VI. 
97. Some Inequalities in Sobolev Spaces 131 In general, the solutions of (7.20) are determined only up to a constant as au - L1 u = 0 - = 0, U E H 1 (Q) are equivalent to u = constant in Q; here the , an r condition u E V o fixes the constant (equal to zero). We now use the spectral theory for the Laplacian in the Neumann problem, for which we refer to Chap. VIII. We know that the eigenvalues of the operator - L1 (relative to L 2(Q)) in the Neumann problem form a discrete set, as from the hypothesis (7.7) this operator has a compact resolvant 15 . Denoting the n-th eigenvalue by An' we know that we have (7.21 ) A 1 = 0 < A 2  A3  . . . An  . . . An  + 00 n  +00 and that the sub-space corresponding to A 1 = 0 has dimension 1. From the Min-Max formula 16 , we have (7.22) . a(u, u) A2 = mIn 2 ' UEVO luI L 2(Q) u*o with the result for all u E V o n au 2 A 2 I u li2(Q)  L i= 1 8Xi L2(Q) and for arbitrary u E H 1 (Q) 2 2 1 f au 2 luIL2(Q) - mes Q lul  A 2 . 1 aX l ' . L2(Q) 1/ A 2 being the best possible constant. Thus, we have the Corollary 3. Under the hypotheses of Proposition 2, the Poincare constant P(Q) = 1/ A 2 , A2 being the second eigenvalue (i.e. the first non-zero eigenvalue) of the Neumann problem. o (7.23) Remark 6. If n  3 we have (Sobolev inclusion)17 H 1 (Q)  L 2 *(Q) 1 1 1 - -- 2* 2 n' if Q is a bounded open set such that (7.23) is valid (Q is thus a Sobolev open set) we have the following result on compactness: 15 See Chaps. VI and VII. 16 See Chap. VIII. 17 This result should be compared with Theorem 2 of 93. 
132 Chapter IV. Sobolev Spaces (7.24) { :or all 8 > 0, (1 < 2* - 8) the injection of H1(Q) into L2*-E(Q) lS compact D We have in addition the following result, a proof of which will be found in Stampacchia [lJ: Proposition 3. Suppose that Q is a bounded convex open set oflR n , (n  3). Then there exists a constant K == K (n) such that we have for all u E H 1 (Q): d n ( n au 2 ) 1/2 (7.25) lu - UI L 2*(Q)  K(n) Q L _ a mes i = 1 Xi L2(Q) with u = 1 Q f u(x) dx, d == d(Q) == diameter of Q. mes Q From Holder's inequality, we deduce: (7.26) lu IL2(Q)  lu IL2*(Q) (mes Q)l/n with the result that (7.25), (7.26) give again Poincare's inequality (7.9), with (7.27) d n P(Q) = K(n) (mesQ)l-l/n where K(n) depends only on n. More generally (see Gagliardo [lJ, Nirenberg [2J), we have for Q bounded and sufficiently regular: (7.28) luILP(Q)  C IVul(L2(Q»rI lull;(), u E H 1 (Q), fnUdX=o . 1 a (1 _r a) wIth n > 2, - == - + , r  1, a E JO, 1 [ where C depends on Q, p 2* r p, r, n. (7.28) still remains valid for U E H 6(Q) under the condition f n U dx = 0, then C depends only on p, r, n. An inequality of this type important for applications is { luIL4(Q)  CIVultL(Q»2IullQ)' Vu E H6(Q), (7.29) n == 2. (for other inequalities of this type see Gagliardo and Nirenberg, loco cit.). D Remark 7. Suppose that Q is a bounded connected open set in IRn which is "very regular" (r == aQ a variety of dimension n - 1 of class oo and Q locally on one side of r). 
9 7 . Some Inequalities in Sobolev Spaces 133 (i) We put: (7.30) n Ilulltr = L i = 1 au 2 f + r IYo(u)12dr aXi L2 (Q) (u E H 1 (Q)) where dr is the surface measure on rand u --+ You the trace map of order o. We know that Yo sends H 1 (Q), continuously and surjectively, onto H 1 / 2(r)  L 2(r). Then by an argument similar to that in the proof of Proposition 2, we obtain { there exists C > 0 such that (7.31) lIulll  Cilulll,r for all u E H1(Q). It follows that Ilul1 1 ,r is a norm equivalent to IIul1 1 on H 1 (Q). (ii) Now let (7.32) V r = {UEH1(Q), fr}'oUdr = a}; V r is a closed sub-space of H 1 (Q). An argument similar to that in the proof of Proposition 2 again shows that (7.33) U  Ctl au 2 ) 1/2 aX i L2 (Q) is a norm equivalent to II u 111 on V r . 3. Convexity Inequalities for Hm(Q) Firt of all we introduce the notation (7.34) i) [uJo = luI L 2(Q) ( ) 1/2 ii) [uJp = L ID(Xuli2(Q) , I(XI = p We shall establish the following result which states a property of intermediate derivatives: P E N*. Proposition 4. Let Q = !R n or Q =1= !R n but having the m-extension property \:1m E N*. For all m E N*, there exists a positive constant K = K(m, Q) such that { [uJr  K. [uJb-  . [uJ  for all u E Hm(Q), (7.35) \:Ir satisfying 0  r  m. Proof (i) First of all we observe that Proposition 4 is trivial for r = 0 or r = m and remark that it suffices to prove it for m = 2. In effect let us suppose that Proposition 4 is true for m, 2  m  mo and show that it is true for m = mo + 1. 
134 Chapter IV. Sobolev Spaces Let U E H m o+ 1 (Q), and r with 1  r  mo. Since au E Hmo (Q) (i = 1,..., n), we have from the recurrence hypo thesis ox. l [ 0 ] [ 0 ] mo + 1 - r [ 0 ] r - 1 U U mo U1iiQ a Xi r - 1 ::( K a Xi 0 a Xi mo and also [ OU ] 1 1 1 -  K[uJo -r[uJ OX i 0 from which we deduce Proposition 4 for m = mo + 1. (ii) Let us prove the proposition for m = 2. If Q is an open set possessing only the 2-extension property, there exists an extension operator P, with (7.36) P E 2(H 2(Q); H.2(IRn)) Pv = v on Q for all v E H 2 (Q) Pv = 0 outside a compact set containing Q in its interior. Hence, it is enough to establish the result for m = 2, Q = IRn, and by the density of (n) in H 2 (IRn) to establish it for u E  (IR n ). By the Fourier transform u  U the result is immediate starting from the inequality: ( n r ) 1/2 ( r ) 1/2 Jl Jn IYlu(Wd ::( c Jn lu(Wd x ( ?: r (ijfIU(Wd ) 1/2 l,j J IRn from which we deduce: ( n i OU 2 ) 1/2 ( 1 ) 1/2 ( ) 1/2 .L _ 0 dx  C lu(x)1 2 dx L ID(XUli2(n) l = 1 IRn Xi n I(XI = 2 and hence the result. D Corollary 4. Let Q be an open set of IRn having the m-extension property. Then for all E > 0, there exists a constant K (E) (depending on E, Q, m) such that for all u E Hm(Q) (7.37) [u Jm-1  E[U]m + K(E)[U Jo. Proof We use Proposition 4 and Young's inequality (7.38) 1 1 1 1 ab  - a(X + - b P - + - = 1 "(1 [3' (1 [3 . 
7. Some Inequalities in Sobolev Spaces 135 Thus for a > 0, (from (7.35) with r = m - 1): [UJm-l  K [uJ-1)/m[u]6/m K  [ aex [ u ] ] 1/ [ u ] liP " m (aex)l/ 0 1 KP  s[uJm + P (s!X)/i/a [uJo from which we deduce (7.37) with 1 Km K (a) = m ( m a ) m - 1 (-+ + 00 m-1 (Where !X = m : l ' P = m ) when a -+ 0). D Remark 8. It follows from the corollary that (7.39) Illulll m = ([u] + [U];)1 / 2 is an equivalent norm to Ilu 11m on Hm(Q). D Remark 9. An estimate in L 00 (Q). Above, we have given estimates for luI L 2(Q) or for luI L 2*(Q), u E Hm(Q), (1/2* = 1/2 - l/n, for n > 2). It is sometimes possible to obtain an estimate for lulvXJ(Q). We shall restrict ourselves, by way of an example, to consider a type of inequality useful in theory of non-linear Schrodinger equations. The method of proof is more useful than the result, since it is capable of numerous variants. Proposition 5. Let Q be a bounded open set in !R 2 with a sufficiently regular boundary (i.e. possessing the 2-extension property). Then, there exist constants C 1 and C 2 such that (7.40) luILOO(Q)  C 1 + C 2 11 u 111, J log(1 + II u 112) (where II u II m denotes the usual norm in Hm(Q), m = 1 or 2). Proof 1 st stage: We begin by reducing to the case Q = !R 2 ; since Q enjoys the 2- extension property, there exists an extension operator P satisfying P E 2(H 2 (Q), H2(!R 2 )), Pv = v on Q for all v E H 2 (Q), Pv = 0 outside a compact set containing Q in its interior, P E 2 (L 00 (Q), L 00 (!R 2) ). 1 8 18 See 4, No.4. For r = 8Q sufficiently regular, we construct P (using local maps) locally by reflexion in the boundary, then we truncate the result to reduce it to functions with compact support. 
136 Chapter IV. Sobolev Spaces If we establish the inequality for Q = IRn, then applying the result to Pv, we deduce the result for Q. 2nd stage: Hence let Q = IR2 and U E H 2 (IR2) with u a compact set in IR2. Since (IR 2) is dense in H 2 (IR2), we can suppose that u E (IR 2). Let u be the Fourier transform of u defined by A 1 i . u(y) = - u(x)e- 1X . Y dx (7.41) 2n 2 x . y = Xl Y 1 + x 2 Y 2' X = (x l' X 2), Y = (y l' Y 2); then 1 i A . u(x) = - u( Y )e1X,y dy, 2n 2 so 1 A lu Iv XJ  2n lu ILl 1 i A 1 _ 2 (1 + IYI)lu 1 1 + 1 1 dy n lylR Y 1 i 2 A dy + _ 2 (1 + Iyl )lu 1 1 + I 1 2 n lylR Y I Y 1 2 = YI + Y. (7.42) From the Cauchy-Schwarz inequality applied to (7.42) we deduce that lulp ::;;; (t2 (1 + lyl)211Wdy Y/2(f: 2 1 lt ( : rf drY/2 . + (t2 (1 + ly12)211Wdy Y/ 2 (t+ oo 2 (1 :d2)2 Y/2 From the definition of the spaces H m (IR 2 ) by the Fourier transformation, we deduce from this last inequality that: (7.43) 1/2 1 lul L CXJ  C 2 11 U 111 (log(1 + R)) + Clil u 112. 1 + R for all R > 0; the constants C i (i = 1, 2) being independent of u and R. We now choose R (depending on u) to be: (7.44 ) from which (7.43) implies lulLCXJ  C l + C 2 11 u 111 J log(l + II u 112). R = II u 112; D 
7. Some Inequalities in Sobolev Spaces 137 Remark 10. We draw attention to Lemma 1. Let fE l(!R) with f' E LOO(!R). Then for u E H 1 (Q), with Q an arbitrary open set in !R n , we have: (7.45) fou(= f(u)) E H 1 (Q) and (7.46) grad(fo u) = (f' 0 u) grad u. Proof Let U m E 1 (Q) with U m --+ u in H 1 (Q) when m --+ + 00; we have f If(um) - f(u)1 2 dx  [suplf'()IJ2 f IUm - ul 2 dx --+ 0 when m --+ 00, Q E Q so f(u) E L 2 (Q) and it is obvious that f' (u). grad u E (L 2(Q) )n. In addition (7.47) f If'(um)Djum - f'(u)Dj(u) 1 2 dx  [sup f'()J2 f IDjum - Djul2 dx Q E Q + fa'f'(U m ) -f'(uWID j uI 2 dx. Now there exists a sub-sequence of (um)-again denoted by (um)-which con- verges, almost everywhere in Q, to u. Sincef' is continuous f'(u m ) --+ f'(u) almost everywhere in Q, so the second term of (7.47) tends to zero when m --+ 00 and Djf(u m ) = f'(um)Dju m --+ f'(u )Dju. As, on the other hand, Djf(u m ) converges to Djf(u) in the sense of distributions we have Djf(u) = f'(u)Dju from which the lemma follows. D Remark 11. Let u be a function and put (7.48) u+ = sup(u,O), u- = sup( -u, 0) with the result that u = u+ - U ; lul=u++u-; we have the Proposition 6. Let u E H 1 (Q) with Q an arbitrary open set in !R n . Then (7.49) u+,u-,Iul E H 1 (Q). Proof I t is enough to prove the result for u + . First of all it is obvious that u + E L 2 (Q). 
138 Chapter IV. Sobolev Spaces Let us evaluate grad u+. For this purpose we introduce, for all 8 > 0 the function _ { ( 2 + 8 2 ) 1 1 2 - 8 if  > 0 J.() - 0 if   O. For all 8 > 0, Ie is of class 1 (lR) with bounded derivative. From Lemma 1 it follows that for U E H 1 (Q) Ie 0 U E H 1 (Q). Thus for cp E !0(Q) f f uD.u (1. 0 u )DiqJ dx = - qJ (2 1 2)1/2 dx, n u>O u + 8 ( Di =  ) AX. l i == 1, 2, . . . , n. Making 8  0 we obtain f il U + D i qJ dx = from which it follows that we have - f cpDiudx u>O D + _ { DiU, .u - l 0 , u - > 0, u  0 l == 1, 2, . . . , n and so Diu+ E L 2(Q), l == 1, 2, . . . , n. o Remark 12. The results of Lemma 1 and of Proposition 6, with a similar proof, are valid for u E W 1 ,P(Q)19 P ): 1. For p ): 1, we should take _ { (P + 8P)11P - 8, Ie() - 0 ,  > 0 (  o. o 8. Supplementary Remarks 1. Sobolev Spaces Wm,P(Q) 1.1. Definitions. For m an integer, 1  p  00 and Q c lR n we define def Wm,P(Q) == {u E LP(Q); D a U E LP(Q), 0  lal  m} 19 See 8 following. 
8. Supplementary Remarks 139 wherefor a == {a1,a Z '...' an} E Nn, we put alIXlu DIXu == and lal == a 1 + a z + . . . + an. axl ax2 . . . axn This is a Banach space for the norm ( ) l II u Ilm,p == L IDIXUlfp(Q) , OIIXIm and in the case p == 00 1  p < 00, Ilu 11m, 00 == max IDIXUILw(Q). o  IIXI  m In the particular case p == 2, we have Wm,Z(Q) == Hm(Q) which is the Sobolev space (which is also a Hilbert space) studied in this chapter. The results concerning the spaces Wm,P(Q) are analogous to that obtained for the spaces H m(Q). On the other hand the methods of proof are very different as the use of the Fourier transform is excluded (even in the case Q == [Rn). The definition of the spaces WS'P(Q) for non-integral values of s, can be given by interpolation between LP(Q) and Wm,P(Q) (see Lions-Magenes [lJ) or by suitable differential quotients (see Necas [lJ or Adams [lJ). We can verify that in the case p == 2, the spaces W S , Z (Q) thus obtained coincide with the spaces HS(Q) defined as the space of the restrictions to Q of the functions of HS([Rn), itself defined by means of the Fourier transform (see Lions-Magenes [lJ). In a manner analogous to Ho(Q), we define Wo'P(Q) as the closure in Wo'P(Q) of the space 92(Q) of infinitely differentiable functions with compact support. For q > 1 we then define w-m,q(Q) to be the dual of the space Wo'P(Q) where p is the conjugate exponent of q, (i.e. l/p + l/q == 1). Since 92(Q) is dense in Wo,P(Q), this is a space of distributions. We verify that differentiation maps Lq(Q) continuously into w-1,q(Q). More generally for lal == m, the operator DIX maps Lq continuously into w-m,q(Q), for 1  q  00. The spaces Wm,P(Q) provide an essential tool for the solution of certain non- linear problems. zo They can be useful for linear problems, due in part to the richness of the results concerning injections between So bolev spaces. 1.2. Sobolev Injections We can prove (Adams [lJ) that if Q is a regular Z1 open set of [Rn (8.1) Wm,P(Q) C O(Q) when mp > n, with continuous injection. 20 See Lions [6]. 21 See the "minimal" conditions of regularity in Adams [1]. 
140 Chapter IV. Sobolev Spaces If in addition Q is bounded, the injection is compact. Likewise, we can show that, ifmp < n, and p  q  np/(n - mp) (8.2) Wm,P(Q) C Lq(Q) the injection being continuous. If, in addition, Q is bounded, the injection is again compact if q < np /(n - mp). Remark 1. In the limiting case mp = n, (8.2) is always true, but with p  q < + 00. In particular, if n = p = 2, we find that H1(Q) = W 1 ,2(Q) c Lq(Q) for all finite q  2; (the result is not true in the limiting case q = (0). This result complements Theorem 2 of 93 where we have Q = [Rn and n > 2.. o Remark 2. As an example of their use, let us show that the Sobolev injections, introduced above, enable us to obtain precise information on the regularity of elliptic problems 22 , owing to the results on regularity of Agmon-Douglis- Nirenberg [1]. For example, if u is the solution of the Dirichlet problem { - L1u = f in Q, u == 0 on r = aQ since the Laplacian is an elliptic operator of order 2, the results of Agmon-Douglis-Nirenberg [1] shows that if f E LP(Q) with 1 < p < 00 and r is regular, then u E W 2 ,p(Q)23. We deduce that auf aX i E W1,P(Q), i == 1, 2, . . . , n. From (8.1), if p > n, then auf aX i E O(Q), so that u E  1 (Q). 23 1.3. Trace Theorems for the Spaces Wm,P(Q) Using methods anal6gous to those of 94, we can establish trace theorems for the spaces w m , P (Q). m-l The mapping from Wm,P(Q) to n Wm-k-1/p,p(Q) defined by k=O u  yu = {you,.... ,Ym-1 U } (where Yju == aju/ an j on r is the trace of the normal derivative of order j of u on r) is continuous and surjective. In particular, a function of Wm,P(Q) admits a trace in Wm-1/p,p(r) (we recover the results of the case p == 2 of 94). 22 See the general definition of an elliptic operator with constant coefficients in Chapter V. 23 For these results, and for others of this type, see Chap. II, 6. 
8. Supplementary Remarks 141 2. Sobolev Spaces with Weights 24 2.1 Unbounded Open Sets In this case of unbounded open sets, we know there is no unique solution of the Dirichlet problem, except if we impose on the solution conditions of growth at infinity (see Chap. II). We are thus lead to look for the solution in Sobolev spaces with weights where the "weight" is adjusted in such a way as to ensure that the solution satisfies reasonable conditions of growth, and that at the same time, there is existence. For example if Q' is the complement of Q, where Q is a regular bounded open set in [Rn, a "good" space in which to look for the solution of the exterior Dirichlet problem for the Laplacian is the space {u; (1 + ,,2 )a u E L 2(Q'), ::i E L 2(Q') } with a == - t. (The function (1 + r 2 )a is called a weight function and depends solely on r == Ix I.) We refer to Chap. XI for the statement of results concerning. such Sobolev spaces with weights, and their use. 2.2. Polygonal Open Sets The regularity results of Agmon-Douglis-Nirenberg applied to the problem of the Laplacian { - L1u == f in Q u == 0 on r == 8Q and which state that iff E LP(Q), then u E W 2 ,P(Q) are not true in general in the case in which Q is a polygonal open set. On the other hand, they are true for such open sets in Sobolev spaces with suitably chosen weights (see Grisvard [1]). The regularity results of Agmon-Douglis-Nirenberg being valid outside of the neighbourhoods of corners, it is enough to consider the regularity in the neighbourhood of a given corner. We can thus reduce the problem to the case in which Q is a cone with vertex the origin. We denote by pm,p the space of functions u defined on Q such that rial - m Dau E LP(Q), Vial  m. We then have the following regularity for the Dirichlet problem: f E pm,p(Q) => U E pm + 2,p(Q) (see Grisvard [1]). 24 Kufner [IJ should be consulted. 
142 Chapter IV. Sobolev Spaces Review of Chapter IV This chapter is essential for the sequel. In effect the space H 1 (Q) is in some sense the optimal functional space for the study of the Dirichlet problem (for bounded Q). From the theoretical point of view, there is not always existence of a solution in a space more regular than H 1 (Q), (for example if Q is a non-convex polygonal open set, the solution is not in H 2 (Q)). On the other hand, if we work in a space larger than H 1 (Q) the derived solutions do not, in general, make sense physi- cally (e.g. may lead to infinite energy). On the numerical plane (see Chap. XII), we shall see that it is much easier to construct sub-spaces of finite dimension included in H 1 (Q) and verify acceptable approximation properties, than in other smaller spaces such as, for example H 2 (Q) or l(Q). In effect, if (QJi = 1, . . . , N is a partition of Q, and if the restriction of u to each of the Qi is regular, it is sufficient that u be globally o on Q for u to be in H 1 (Q). Now the conditions for fitting together the Qi in such a way that u be globally continuous are easy to impose. Likewise, it must be noticed that the solution u of the Dirichlet problem - L1u == f on Q, u == 0 on r == oQ is according to its regularity, a solution of the following three problems: (1) find u E H 2(Q) n H 6(Q) such that In (Llu + f)v dx = 0, 'rIv E L 2(Q); (2) find u E H 6(Q) such that In VuVvdx = In fvdX , 'rIv E H6(Q); (3) find u E L 2 (Q) such that In (uLlv + fv)dx = 0, 'rIv E H 2 n H6(Q). To approach numerically each of these three problems it is a question of constructing a family of spaces U h approaching the space U described by the solution and a family of spaces V h approaching the space V described by the test function. In the first case (U == H 2 n H 6(Q)), it is U h which is difficult to construct, and in the third case (V == H 2 n H6(Q)), it is V h which is difficult to construct as the approximation of H 2 (Q) necessitates the use of finite elements of class  1 (see Chap. XIII). 
Appendix 143 Finally, case (2) represents a compromise in which U and V are each relatively easy to construct; it is enough to make use of finite elements of class ceo. Another determining advantage is that (2) will lead, for the limiting problem, to the solution of a linear system with a positive definite symmetric matrix, contrary to the two other problems (1) and (3). I t is the point of view (2) which will particularly be developed in Chap. VII, concerned with the numerical approximation to the solution of the problem. Appendix. The Spaces HS(r), with r the "Regular" Boundary of an Open Set Q in (Rft We make the following hypotheses: (H.1) The boundary r of Q is an infinitely differentiable variety of dimension n - 1, Q being locally on one side of r 25 (H.2) The open set Q is bounded. Under the hypotheses (H.1) and (H.2) there exists: (1) a finite family of "local maps", that is to say of couples (CD i, ({Jdi = 1 to N, such that (a) (CD di = 1 to N is a family of bounded open sets covering r N r c U CD i , i = 1 (b) (({Jdi = 1 to N is a family of diffeomorphisms 26 of (CD d onto the (cylindrical) open set Q of [Rn defined by (1) Q == {y,y == (Y',Yn), Iy'l < 1 and -1 < Yn < 1}. with (2) ({Ji(CD i n Q) == Q+ clef {y == (y', Yn) E Q, Yn > O} and so (3) ({Ji(CD i n r) == L clef {y == (y', Yn) E Q, Yn == O}, and with the following compatibility conditions: if CD i n CD j -# 0, there exists a diffeomorphism Jij, with positive Jacobian, from ({Ji(CD i n CD j ) onto ({Jj (CD i n CD j ) such that (4) ((Jj(x) == Jij(({Ji(X)), '\Ix E CD i n CD j ; 25 We say that Q is a variety with boundary of class f{jOO. The regularity f{jOO is not always necessary; it does simplify the exposition appreciably. 26 Here this is to say: (fJi is infinitely differentiable from (9i onto Q and has an inverse (fJi- 1 = t/Ji itself infinitely differentiable from Q onto {!)i' We also occasionally call it a f{jOO diffeomorphism, the f{jOO regularity not always being implied. 
144 Chapter IV. Sobolev Spaces (2) "a regular partition of unity on r subordinate to ((!) di = 1 to N" that is to say: a family of real functions (Cidi = 1 to N is defined on r such that: (a) Ci i is an infinitely differentiable function on r with values in [0, 1]. (b) Cii has compact support in (!) i (\ r (c) N L Cij = 1 on r. j = 1 o Let dy be the positive measure on r, called the measure element of area of the boundary r or again the canonical measure (See Dieudonne [2], Chap. XXIII) (this measure exists under the conditions H.1)). (I) First of all let us recall the (intrinsic) concept of the integral of afunctionf on r. If x = (Xl' . . . , x n ) denote the coordinates of a point x E Q in a system of cartesian axes, and if x E (!)i, then x = <fJi- 1 (y) (with Y E Q+) is obtained by prescribing n real functions such that (Y1, . . . , Yn) E Q +  X k E [R: we denote them X k (Y1' . . . , Yn). We denote by: (5) J7(Y') the value for Y = (y', Yn = 0) of the determinant obtained by starting from the matrix 8x 1 8x 1 - - 8 '...'8 Y1 Yn 8x n 8x n - - 8 '...'8 Y1 Yn and suppressing the last column and the line with index k. We also write k ( 8(X1' . . . , x k , . . . , x n ) ) Ji(y') - . 8 (y l' . . . , Y n - 1 ) Yn = 0 Let us put (6) ( n ) 1/2 J;( y') = k f: 1 (J 7(y'))2 . Then if f is a function given on r with support in (!)i, the integral of f on r is defined by: (7) f f(x)dr = r f(x)dr def J f(<fJi-1(y', 0))J i (y')dy'.27 r J((l nr E The integration off on r is thus, in this case, reduced to the integration of the function y' = .f(<fJi- 1 (y', 0)) on the ball L of [Rn-1. 27 See also Necas [IJ p. 120. 
Appendix 145 The general case of a functionf given on r is treated by means of the partition of unity (aJi = 1 to N (see (c)). We put (8) L/(X) dr = jt 1 f /(X)lX j (X) dr 27 and each of the terms on the right side of (8) is meaningful, as a consequence of (7). This definition is intrinsic in the sense that it depends neither on the "maps" nor on the partition of unity considered. We shall not recall here the definition of the space ' (r) of the distributions on r; we shall content ourselves with making reference 'to Schwartz [1] and also Hormander [1], pp. 27-28. (II) Sobolev Spaces HS(r), s E IR. Letfbe a function (real or complex) given on r. We define the function (}ifon IRn- 1 by (9) ((}if)(y') == { (IXJ)(CPi-l(y" 0» if I y'l  1 o otherwIse. We then have the Definition 1. The Sobolev space HS(t) of index s (with s real and arbitrary sign) is defined by (10) HS(r) == {f; (}if E HS(IR,-l) Vi == 1 to N}. Proposition 1. Provided with the norm (11) IlfIIW(r) = Ct 1 118jflliI'(IR;,_,)Y/2 HS(r) is a Hilbert space. Remark 1. The norm (11) depends on the particular choice of the system of "local maps" (£Pi' CPJi= 1 ton and of the partition of unity. On the other hand we can show that: (a) the algebraic definition (10) is independent of the choice of the system of local maps and of the partition of unity; (b) the different norms (11) defined by different systems of local maps and partitions of unity are all equivalent, with the result that the topology of the space HS(r) is also independent of the system of local maps and of the partition of unity. o Remark 2. We have used the regularity of the function y'  Ji(y') defined by (6) and the fact that Ji(y') -# 0, Vy', and that J i . is a bounded function to eliminate the weights J i in the definition (10). o 
146 Chapter IV. Sobolev Spaces Remark 3. In the case s = 0, the formula (11) leads to a norm of the space L 2 (Q) which is not intrinsic. o Remark 4. Generalisations. More generally still, we can define the Banach spaces WS,p(r), for all p such that 1  p  00 and all s E [R, by WS,p(r) = {f, flif E WS,p([R,-l), i = 1 to N} with the norm { N } l/P II f II W',P(F) = Jl II OJ II ''''P(IR,-l) with "analogous" properties to those of the spaces HS(r). o We say that the space HS(r) is defined by localisation and diffeomorphism. (III) General properties of the spaces. The principal properties of the spaces HS(r) are brought together in Proposition 2. (1) The space (r) is dense in HS(r), s ): o. (2) Identifying HO(r) with its dual, we have (HS(r))' = H -S(r). For the proof of this proposition, we refer to Lions-Magenes [1]. Remark 5. On the regularity of the open setfor defining HS(r). If the open set Q does not satisfy the regularity conditions (H.l), the trace theorems are less general. Let us indicate here a result in the (particular) case in which Q is an open set having the property called "k uniform-regularity"28 - essentially, the family qJi considered previously is of class k as well as the inverse qJ i- 1 (see Adams [1]). Then Theorem 5 of 4 is true up to the order m = k. o Remark 6. Another definition of HS(Q). We consider a field of first order differential operators Ab . . . , An-l operating on r, regular and independent; (these are the differential operators in [Rn tangential to r). We then define H 1 (r) = {v, A 1 v, . . . , An _ 1 vEL 2 (Q ) } provided with the usual norm of the graph; we define, for m an integer, Hm(r), (m > 1), in the same way. We then define HS(Q) for s real in [0, m] by interpolation between Hilbert spaces - then for s < 0 we define HS(r) as the dual of H -S(r) when L 2(r) is identified with its own dual. We recover (to an equivalence of norm) the previous spaces H S . 28 Or again Q a variety with boundary of class k. 
Appendix 147 Remark 7. Yet another definition of HS(r). We consider the Laplace-Beltrami operator - J r on the variety r. We consider - J r as a positive definite self- adjoint operator in L 2 (Q), of domain D( -J r ). We then define HS(r) == D((-J r )s/2) SE[R provided with the norm of the graph II v II H S ( r) == (II v II £2 ( n + II ( - J r )Sl 2 v II £2 (r) ) 1 1 2 ; we again recover the preceding spaces (with an equivalence of norm). The method of Remark 6 generalises to the spaces WS,p(r); that of the Remark 7 leads to different spaces of the WS,p(r) when applied in LP(Q) (and with a suitable definition of ( - L1 r)s/2). 
Chapter V. Linear Differential Operators Introduction In this chapter we study general linear differential operators, whose represen- tation is given by P = LaaDa where the aa are functions of x = (x l' Xl' . . . , X n ) or constants (it is the case of constant coefficients which is principally studied here). This study, presented here, often goes beyond the needs of the problems of physical origin which form the principal object of this work. But on the other hand, the questions which are raised are of a sufficiently fundamental character and the methods brought into play to derive solutions are also sufficiently basic, that it is more natural (and at times more simple) to consider these problems in a general framework. In this spirit, we first of all reply to the question: what are the linear and local operators; these are, precisely the linear differential operators (that is, in ad- dition, linked with the theorem of Schwartz kernels). We introduce the concepts of characteristic and of bicharacteristic and we prove (1) the theorems of Cauchy-Kowalewsky and of Holmgren. In 2, in the case of constant coefficients, we study the different types of operators: elliptic, parabolic and hyperbolic by studying the support and the regularity of an elementary solution. An elliptic operator is one with an elementary solution which is analytic in IRn\ {O}; a parabolic operator admits an elementary solution which is merely oo in IRn\ {O}, but with support in a half-space; finally, a hyperbolic operator admits an elementary solution with support contained in a salient cone. In 3, we study the Cauchy problem for hyperbolic or parabolic operators, and we show that, according to the case, it is well-posed in !/' (weakly parabolic operator) and in ' (hyperbolic operator). The study of the hypoellipticity and of the hypo analyticity of operators is the subject of 4, allowing us to establish results on local regularity for the solutions of Pu = fin '(Q). Finally, in 5 various formulations of the maximum principle are studied. We should consult especially, in parallel with this Chap. V and as a complement to it, the book of Chazarain-Piriou [lJ. 
91. Generalities on Linear Differential Operators 149 1. Generalities on Linear Differential Operators 1. Characterisation of Linear Differential Operators Let Q be an open set in [Rn. For j = 1,. . . , n, Dj = %x j are the operators of partial differentiation with respect to the jth coordinate x j; these operators of /(Q) commute with each other (see Appendix "Distributions"), which allows us to define D a = D a l D an 1 ... n for all (J., = ((J.,1, (J.,2, . . . , (J.,n) E r\jn, the product denoting the composition of the operators. The operators Dj and hence D a possess the property of diminishing the support (see Appendix "Distributions" 91.2.4): 'if u E ' (Q), supp Dju C supp u. This property is similarly satisfied by multiplication by a function, or more generally by a distribution: being given a E &(Q)(resp. '(Q)) 'if u E '(Q)(resp. &(Q)), supp(au) c suppu. More generally, it is satisfied by every operator of the form (1.1) P = 'LaaDa where the sum is taken over all (J., E r\jn, the coefficients aa being elements of &(Q) (resp. ' (Q)) and the family (aa) being supposed to be locally finite, that is to say: (1.2) for all compact K c Q, {(J.,; supp aa n K i= 0} is finite This property (1.2) enables us to define Pu as elements of ' (Q) for all u E ' (Q) (resp. &(Q)) and we have (1.3) suppPu c suppu. It is obvious that the mapping u ---+ Pu is a continuous linear mapping from '(Q) (resp. &(Q)) into '(Q) and hence a fortiori from (Q) into '(Q). We notice that for a linear mapping P the property (1.3) is equivalent to: for every open set U c Q, Ul = U2 on U => PU 1 = PU2 on U, that is to say that the trace of Pu on U depends only on the trace of u on U; we say that P is a local operator. It is worthy of note that every local continuous linear mapping is of the form (1.1). More precisely we have: Theorem 1. Let P be a continuous linear mapping from (Q) into ' (Q) be local, i.e. P satisfies (1.3)for all u E (Q). Then there exists a unique family (aa)C( E N n of 
150 Chapter V. Linear Differential Operators locally finite elements of' (Q), that is to say satisfying (1.2), such that P is given by ( 1.1 ). This theorem leads us to frame the following definition: Definition 1. Every local continuous linear mapping from (Q) into '(Q) is called a linear differential operator (abbreviated to l.d.o.), on Q. From Theorem 1 P may be written in a unique way in the form (1.1): The distributions a(X are called the coefficients of P. We associate with P the formal series (with coefficients in '(Q)) P() = La(X(X where (X == 1,. . . , n with the result that P = P(D). This definition of a linear differential operator made its appearance at the beginning of this chapter so that the concept of a linear differential operator is at one and the same time algebraic and functional: this duality will appear more and more clearly in the sequel. We now state a characterisation of the l.d.o. with coefficients in G(Q): Theorem 2. Let P be a local linear mapping of(Q) into '(Q). Thefollowing properties are equivalent: (i) for all u E (Q), Pu E (Q); (ii) P extends to a continuous mapping of' (Q) into itself; (iii) P is a l.d.o. with coefficients in G(Q). Further, when these are satisfied, the continuous extension of P to ' (Q) is unique; we still denote this extension by P; it is local linear and given by (1.1). We observe that in Theorem 2, we do not assume, a priori, that P is continuous, which could be a hypothesis, difficult to verify a priori. In fact we have the following result (see Peetre [1] for a proof): Lemma 1. Let P be a local linear mapping of (Q) into ' (Q). Then there exists a set A of isolated points of Q sueh that the restriction of P to (Q\A) is continuous. We shall prove these two theorems in the course of this Sect. 1, in proportion as we shall adduce some consequences of Definition 1 and shall give some formulae relating the operator P = L a(XD(X to its coefficients. First of all, we observe that by making use of the local character of distributions (see Schwartz [1]), we have that a local linear mapping P of (Q) into '(Q) admits one and only one loeallinear extension of G(Q) into' (Q); we again denote this extension by P: for u E G(Q), Pu is defined by { for every open set V c c Q, Pu = P(au) on VI where a is an arbitrary element of (Q) satisfying a = 1 on V. 1 U C c Q signifies that U is relatively compact in Q (i.e. [j is compact in Q). 
91. Generalities on Linear Differential Operators 151 Let us suppose that Pu = L aaDa E ' (Q) for all u E (Q), then it is still true for all u E G(Q). We can then recover the coefficients by means of the recurrence formula (1.4) ap = ( p - L aa Da ) up a<p where a  {3 means that a j  {3j(j = 1,. . . , n), a < {3 means that a a -# {3 and {3! = ({31!) . . . ({3n!). We deduce immediately the x P up(x) = {J! '  {3, Lemma 2. Let (aa) be a locally finite family of elements of '(Q); put P = L aa Da . On the other hand, let Va be a subspace of ' (Q) satisfying (Q)Va C Va; let us put V = {u E '(Q): (Q)u C Va}. Then P maps (Q) into Va iff the coefficients aa belong to v. Proof of Theorem 1. The uniqueness follows from Lemma 2 with Va = {O}. To prove the existence we prescribe P a continuous linear mapping of Et1(Q) into Et1'(Q). We can associate (one-to-one) with it the continuous bilinear form on (Q) x (Q), a(u, v) = (Pu, v> From the theorem of Schwartz kernels (see Appendix "Distributions"), we can associate (one-to-one) with it the distribution T on Q x Q such that a (u, v) = (T, u @ v >. Suppose now that P is local which may be expressed by a(u, v) = 0 for all u, v E (Q) with supp u n supp v = 0, or in other terms by supp T c {(x, x); X E Q}. We now consider an open set V c c Q. From the local representation theorem 2 we deduce that there exists a finite family of measures on Q x Q (which we denote by (/la,p)) satisfying f sUPP JIIZ,P C sUPP T l T = LLDD/la,p In '(V x V). By the construction of T, we have for all u, v E (V) (Pu, v) = LL( _l)IIZI+IPI f f DlZu(x) DPv(y) dJIIZ,p(x, y), 2 This theorem (see Schwartz [1], pp. 82 et seq.) states: Let T E £C'(Q) be a distribution and let U be a relatively compact open set in Q, then there exists a finite number of measures J1rJ. on Q, whose support is contained in a neighbourhood of tJ such that T = L DrJ. J1rJ. on U. We use here a stronger theorem: If the support of T is contained in a regular closed set F, then we can choose measures J1rJ. such that supp J1rJ. C F (see Schwartz [1], p. 99). In the present case, the support of T E £C'(Q x Q) is contained in the diagonal of Q x Q which is closed and infinitely regular. 
152 Chapter V. Linear Differential Operators But supp /la,p C {(x, x); X E Q}; thus there exists a measure va,p on Q such that /la,p is the image under x E Q ---+ (x, x) E Q x Q of the measure va,p and we have (1.5) <Pu, v) = II(-l)I"I+IPI f D"u(x) DPv(x) dv".p(x). Hence, given u E (U), we have Pu = L I (_1)la l DP(va,pDa u ). We develop DP(va,pDau) by Leibniz's formula and regroup; we see that there exists a finite family (aa) of elements of ' (Q) such that P = IaaDa on (U). A priori this family depends on U; but if (b a ) is a family corresponding to an open set V C C U, then from the uniqueness property we have aa = b a on U n V for all a from which the theorem follows. o To prove Theorem 2, we need several tools which we now introduce. 2. Various Definitions 2.1. Leibniz's Formula We have already used Leibniz's formula which we recall: a! Da(uv) = I DPuDa- p v P  a P! (a - P)! and which is valid for u E &(Q) and v E '(Q). Being given a formal series P() = Iaaa we introduce the derivatives of P() a lPI a' p(P>() = - P() clef L . aaa-p. ap a P (a - P)! If (aa) is a locally finite family of elements of ' (Q) we can associate with that family the l.d.o. a' p(P) = P(P)(D) = I . , aaDa - p. a  P (a - P). Leibniz's formula can then be written (1.6) 1 P(uv) =  - DPuP(P)v  P! . 
91. Generalities on Linear Differential Operators 153 Further, we can introduce the formal series of the derivatives of the coefficients of P: a lPI (DP P)() clef p P() = L (DP a cx ) cx. aX cx  P In general (DP P)(D)u i= DP(P(D)u). In effect, by Leibniz's formula DP(P(D)u) = L L P! (Dka,,)(DdP-k u ). cx kpk!({3 - k)! Suppose that we are given a second l.d.o. Q = L bpDP and that P or Q have P their coefficients in &(Q). We can then define the composition QP which is a l.d.o. on Q. Regrouping the terms in the preceding formula, we deduce that: the formal series associated with the product QP is ( 1.7) 1 (QP)() = L k! Q(k)()(Dk P)(). 2.2. Transpose of a Linear Differential Operator Let P be a continuous linear mapping from f0(Q) into f0'(Q). We can then define the transpose tp of P: it is the continuous linear mapping of f0(Q) into f0'(Q) defined by <tpu, v) = <Pv,u) 'ifu, v E f0 (Q). We have already seen that the property: "p is local" is expressed by: < Pu, v) = 0 'ifu, v E f0(Q) with supp u n supp v = 0; so P is local iff t P is local. In other words: the transpose of a l.d.o. is a l.d.o. Let us suppose that P = L acxD cx and let us determine tp. For all u, v E f0(Q), we have <t Pu, v) = L < acxDcx v , u) = L ( _1)lcx l <DCX(acxu), v). So, for u E f!fi. (Q), a' tpu = L(-1)lcx I DCX(a cx u) = L L (_1)lcx l , . , (DPacx)(DCX-P u ). cx P  cx 13 . (a - 13) . In other words, the formal series associated with the operator t P is (1.8) ( - 1)IPI CP)() = L P! (DP p(P»)( - ). Using this formula and the fact that P is the transpose of tp, we deduce that P has its coefficients in &(Q) iff the same is true of tp. On the other hand the operator P can have regular coefficients (but not completely regular, i.e. in &(Q)) 
154 Chapter V. Linear Differential Operators without the same being true for tp; for example if a is only of class m on Q with m < 00, there will exist a E Nn such that the transpose of the l.d.o. aaDa has coefficients which are not continuous. This justifies our considering l.d.o. with coefficients which are distributions. Proof of Theorem 2. Suppose that (iii) is satisfied; the formula (1.1) shows on the one hand that P maps (Q) into itself, and on the other hand defines a local continuous linear extension of P to ' (Q); finally, because of the density of (Q) in ' (Q) (see Appendix "Distributions"), we have that a continuous extension of p to '(Q) is unique. It thus remains for us to demonstrate the implications (ii) => (iii) and (i) => (iii). If (ii) is satisfied, P is in particular a l.d.o. and, because of the reflexivity of (Q), tp maps (Q) into itself. Applying Lemma 2, we see that tp has coefficients in & (Q) and therefore so also does P. Finally, let us suppose that (i) is satisfied. As a result of Lemma 1, there exists A, a discrete part of Q such that the restriction of P to (Q\A) is continuous. Applying Theorem 1, we see that there exist aa E '(Q\A) such that Pu = LaaDau in '(Q\A) for all u E &(Q\A). Since A is discrete, we find on using the formula (1.4) repeatedly that aa E &(Q). o 2.3. Order of a Linear Differential Operator Let P = LaaDa be a l.d.o. on Q. If P is non-null, there exists a E Nn\ {O} such that aa =1= 0 on Q. We then define the order m of P by m = sup {Ial; aa =1= 0 on Q}. We notice that this order could be infinite. If m is finite, we call p. = L aa Da . lal=m the principal part of the operator P. We shall characterise the order of P without appealing to the coefficients. Being given cp E &(Q) we have Delp = elp Dcp. We deduce first of all: let , XO E en and suppose that cp(x) = . (x - xO) for all x E U, an open set in Q, then ( 1.9) P() = e-lp Pelp on U. Now let'! E e and a E Nn with lal > 0; we see easily by recurrence on lal that e- tlp Dae tlp is a polynomial in '! with coefficients in &(Q) and of supports contained in supp Dcp, without constant term, of degree less than or equal to lal and e- tlp Dae tlp = (Dcp)a,!la l + We then deduce 
9 1 . Generalities on Linear Differential Operators 155 Proposition 1. Let P = L aa Da be a l.d.o. on Q. a cP E (Q) being given, the mapping Plp: TEe -+ e- tlp Pe tlp E '(Q) is a polynomial with coefficients in ' (Q), of degree less than or equal to m and Plp(T) = P.(Dcp)Tm + . . . + ao, m = max {Ial; supp Dcp n supp aa i= 0} where p.() = L aaa. lal = m I n particular, the order of P is the least upper bound of the degrees of the polynomials P lp for all cp E (Q). and We leave as an exercise to the reader the proof of the Proposition 2. A l.d.o. on Q with coefficients in & (Q) is of finite order iff it maps the space (Q) of distributions of finite order into itself. For this, it will prove useful to make use of the following lemma which is a consequence of the formula (1.8): Lemma 3. Let P be a l.d.o. on Q. The order m of the transpose t P is equal to that of P. If m is finite, then the principal part of t P is (1.10) CP). = (_1)m p.. We deduce: a l.d.o. on Q symmetric (resp. antisymmetric), that is to say tp = P (resp. tp'= - P) has an even order (resp. odd). 3. Linear Differential Operator on a Manifold The definition we have taken for a l.d.o. on an open set Q in [Rn can easily be generalised to the case of a differentiable manifold. To avoid having to appeal to the theory of distributions on a differentiable manifold and having to enter into considerations of regularity, we shall assume that all the differentiable manifolds, diffeomorphisms, etc. . . , are of class rrl 00 . Definition 2. Let M be a differentiable manifold. We call every local linear mapping P of rrlo (M) into itself a linear differential operator of class rrl oo (abbreviated to l.d.o.) If U is an open set of M, the restriction of P to rrlo (U) is a l.d.o. on U; it is the trace of P on U. By a classical argument using the partition of unity P is null iff P is null in the neighbourhood of every point of M. Thus P is completely determined when it is determined in the neighbourhood of each point of M, and P extends in a unique manner to a local linear mapping of rrl OO (M) into itself. Let h 
156 Chapter V. Linear Differential Operators be a diffeomorphism of M onto another manifold N. We define the image h*P of P by h: it is the mapping: vEqj(N) --+ hP(vo h)Eqj(N). where for u E qjOO (M), hu == h(u) clef U ° h -1. Obviously h*P is a l.d.o. on N. In particular if h is a diffeomorphism of M onto itself we say that P is invariant under h if h*P == P, in other terms if hand P commute P(hu) == h(Pu) Vu E qj(M). Let us suppose now that M is of dimension n and that X is a map of M, that is to say a diffeomorphism of an open set U of M on an open set Q of [Rn. The image under X of the trace of P on U, denoted by px, is a l.d.o. on Q (with coefficients in &(Q)). Thus there exists a unique locally finite family (aa) of elements of qjOO (U) such that (Pu)(X) == L aa(x)(DaXu)(X(x)) Vu E qj(U), Vx E U. The functions aa are the coefficients of P in the map X and the operator XD j clef X;l D j : u E qj(U) --+ (Djx u ) ° X E qj(U) is the derivative along thejth coordinate curve of the map x. Let qJ E qj(M). By Proposition 1, with the help of a partition of unity, we see that Plp: ! E C --+ e-tlpPe tlp E qjOO(M) is a polynomial in ;: with coefficients in qjOO (M). We define the order of P as the upper bound of the degrees of Plp when qJ runs over qj(M). If a map is defined on U the order of px is equal to the order of the trace of P on U. Example. The Laplace-Beltrami Operator. The Laplace-Beltrami operator L1(1 \ on the sphere " clef S n - 1 " { I'Jl)n 2 2 I }  == == X E lJ\\ ; Xl + . . . + X n = has been defined in Chap. II, 91.3 as the restriction to the sphere of the Laplace operator L1. More precisely let us denote by a the variable of sn-1. L1(1 is defined by (1.11) (L1(1v)(a) = (L1u)(a) where for all v E OO(sn-l), u(x) = v C:I } In particular since it is so for the Laplacian 3 and sn-l, L1(1 is invariant under every orthogonal transformation of sn - 1. 3 See Proposition 6 of 91. 
9 1 . Generalities on Linear Differential Operators 157 The mapping h: x -+ (lxi, x/Ix!) is a diffeomorphism of [Rn* on JO, oo[ X sn-l; it is the transformation to polar coordinates; the image of the Laplacian .1 by h is the Laplacian in polar coordinates: (1.12) a 2 ar 2 + n - r 1 a 1 _ a + 2.1(1' r r that is to say, for every function: v E rc oo (]O, 00 [ x S n - 1) a 2 v n - 1 av 1 (.1u)(rlT) = a 2 (r, IT) + _ a (r, IT) + 2 (.1(J v)(r, IT) r r r r (1.13) where u is defined by u(rlT) = v(r, IT). 4. Characteristics 4.1. Concept of Characteristics Let us consider a l.d.o. P on an open set Q of IRn which we shall suppose to be of fini te order m: P = L aa Da with principal part p. = L aa Da . lal m lal =m First of all, let us choose one of the variable X n . We can write a k P = L Qk _ a k X n where a lal - k Qk = L aa a al a a n -l lal  m Xl' .. X n - 1 an=k does not contain the differential operator a j ax n . The greatest k such that Qk =1= 0 is called the parcial order of P with respect to a/ax n . It is immediate that the partial order of P with respect to ajax n is strictly less than m, the total order of P, if and only if the distribution a(O, 0, . . . , m) is null, that is to say iff p.(O, . . . , 0, 1) = 0 in '(Q), where, let us recall, for a l.d.o. P = Laa Da , P() = Laaa,  E IRn. We shall say that, in this case, the vector  = (0,0, . . . , 1) is a characteristic vector of P or, again, the hyperplanes {x; X n = constant} are characteristic hyperplanes of P. More generally, let us consider M a manifold of dimension n - 1 ::1nd h a diffeo- morphism of Q on an open set U of M x IR (in the preceding particular case, M = IRn-l and h: x E Q -+ (x',x n ) E IRn-l x IR where x' = (Xl"..' Xn-l)). For simplicity, let us suppose that the data are rc oo and that U = M x I = {(x', t); x' E M, tEl} 
158 Chapter V. Linear Differential Operators where I is an interval of. We can consider the l.d.o. h*P which can be written LQk(t)ak/at k where Qk(t) is a l.d.o. on a manifold M with CX)-coefficients de- pending in a CX) way on tEl. The greatest k such that Qk =1= 0 is the partial order of h*P with respect to a/at; it is immediate that it is always less than or equal to m and that it is strictly less than m iff Qm = o. Now let us denote by <p: Q -+ I the t-component of h: h(x) == (x' (x), <p(x)). We have (see Proposition 1), e-UPPet({J==TmP.(D<p)+ . . . == h -1 (e -tt(h* P)e tt ) == Tm(h -1 Qm) + . . . and so Qm == hp.(D<p). We deduce that the partial order ofh*P with respect to a/at is strictly less than m iff P.(D<p) = o. We say that the level surfaces {<p(x) == constant} are characteristics of P. In fact, we shalllocalise the definition of a characteristic. We shall suppose that p. has coefficients which are sufficiently regular; for simplicity, let us suppose that p. has CX) -coefficients. We use the notation p.(x,) == L a(X(x)(X, I(XI =m X E Q,  E n. Definition 3. Being given XO E Q, we say that a vector  E en,  =1= 0 is a characteristic of P at XO if p.(XO, ) == O. The set of  E en such that p.(XO,) == 0 is called a characteristic cone of P at xo. Being given <p E 1 (Q) with D<p(xO) =1= 0, we say that the surface {<p(x) == <p(XO)} is a characteristic of P at XO if p.(XO, D<p(xO)) == o. We should not1ce that the property for a surface S == {<p(x) == <p(XO)}, regular in xO(D<p(xO) =1= 0) to be a characteristic of P at XO is geometrical: it expresses that the normal vectors to S at XO are characteristics to P at xO; in particular it is independent of the function <p representing S in the neighbourhood of xo. The concept of characteristic takes its full sense in the framework of differential geometry. We shall content ourselves here with stating certain properties which we shall find useful subsequently. First of all, let us sharpen the concept of a characteristic. Let us consider  E en,  =1= 0 to be a characteristic vector of P at xO. Since p. is homogeneous in , we have Euler's formula " ape ° . ° L- aj (x ')j = mP (x ,) = O. 
 1. Generalities on Linear Differential Operators 159 We say that  is a simple characteristic of P at XO if (1.14) ape L aj (xO,) > 0, that is to say, if there exists 1'/ E n such that ape L aj (xO,) 1Jj -=I- 0, which again expresses that t = 0 is a simple root of p. (XO,  + t1'/). Otherwise we say that  is a multiple characteristic of P at Xo . We observe, from Euler's formula, that a vector  E en,  =1= 0 is a multiple characteristic of P at XO ape iff aj (xO,) = 0 for j = 1,. . . , n, since this implies that r(xO,) = O. Using Taylor's formula, we see that  E en,  =1= 0 is a multiple characteristic of P at XO iff p.(XO, + 1'/) = 0(11'/1 2 ) when 11'/1  o. A regular surface S embedded in Q is a characteristic of P if it is a characteristic of P at every point XES. It is a simple characteristic of P if at every point XES, the normal vectors at x to S are simple characteristics of P at x. We shall show how the search for simple characteristic surfaces is linked to the solution of an ordinary differential system: the Hamiltonian system. 4.2. Bicharacteristics We suppose here that the coefficients of p. are real (of class CC OO ) and we look for the real characteristic surfaces of P. First of all, let us consider <p E CC 2 (Q) real with D<p(x) =1= 0 for all x E Q and suppose that the level surfaces {<p(x) = constant} are characteristics of P, that is to say that ( 1.15) p.(x, D<p(x)) = 0 for all x E Q. Differentiating (1.15), we obtain for k = 1, . . . , n ape ape a 2 <p (1.16)  (x, D<p(x)) + L ): (x, D<p(x)) a a (x) = 0, \Ix E Q. UX k Uj x j Xk Let us fix XO E Q and consider the solution x(t) of the differential system dX j ape . (1.17) dt = aj (x, Dcp(x)) for ) = 1, . . . , n; x(O) = xO. Putting (t) = D<p(x(t)), we find from (1.17) and (1.16) that dk a 2 <p dx. ape - d (t) = L a a (x (t)) - d J (t) = -  (x (t), D<p(x(t))). t x j X k t UX k 
160 Chapter V. Linear Differential Operators In other words (x(t), (t)) satisfies the differential system dx. op. J a(j (x, () dt (1.18) dk op. - - -(x,) dt OXk for j = 1, . . . , n for k = 1,. . . , n, and the equation (1.19) p.(x,) = O. We observe that the curve x = x(t) is contained in the surface {<p(x) = <p(XO)}, since from Euler's identity d o<p dx. - d <p(x(t)) = L  (x(t)) - d J (t) = mp.(x(t), D<p(x(t))) = O. t uX. t J Finally, we observe that if the surface {<p(x) = <p(XO)} is a multiple characteristic of P at xO, that is to say if the normal to the surface at XO is a multiple characteristic of P at xO, we have by definition op. ° ° a(j (x , Dcp(x )) = 0 for all j = 1,. . . , n and the solution of (1.17) is trivially x(t) = xo. On the contrary, if the surface {<p(x) = <p(XO)} is a simple characteristic of P at xO, from the uniqueness of the solution of (1.17), it is a simple characteristic at every point of the curve x = x(t); in this case we have dx/dt =1= 0 for all t, that is to say that the curve x = x(t) is regular 4 . The differential system (1.18) is called the Hamiltonian system of the operator P. Let us consider a solution (x(t),  (t)) of the Hamiltonian system; we have d op. dx. op. d. dt p.(x(t), ((t)) = L ax. (x(t), ((t)) d/ (t) + aJ'. (x(t), ((t)) d/ (t) = 0, J SJ that is to say that p.(x(t), (t)) is constant. In particular, if (x (t), (t)) satisfies the equation (1.19) for a value t = to, it satisfies it for all values of t. We notice finally that to impose on the curve x = x(t) the condition to be regular, reduces to imposing on (x(t), (t)) the condition of satisfying the inequality op. L a(j (x, () > O. (1.20) By definition, we shall call a bicharacteristic solution of P every solution (x(t), (t)) of the Hamiltonian system of P (1.18) satisfying (1.19) and (1.20), that is 4 A regular (parametrised) curve (of class rem) is a mapping t -t x(t) (of class rem) such that dx(t)/dt i= 0 for all t. 
91. Generalities on Linear Differential Operators 161 to say every solution (x(t), (t)) of the Hamiltonian system (1.18) such that (t) =1= 0 is a simple characteristic of P at x(t). As we have seen, this reduces to the same thing as saying that (x(t), (t)) is a solution of the Hamiltonian system passing through the pair (XO, o) satisfying (1.19) and such that the curve x = x(t) is regular; the curve x = x(t) is called a bicharacteristic curve of P. Thus we come to present the Proposition 3. Being given <p E 2(Q), real and satisfying (1.15) and XO E Q with o = D<p(xO) a simple characteristic of P at xO, the solution (x(t), (t)) of the Hamiltonian system (1.18) satisfying x(O) = xO,  (0) = o is bicharacteristic; in addition x(t) is the solution of (1.17) and (t) = D<p (x (t)). Let us summarize the proof: Proof. Being given <p E 2(Q) satisfying (1.15), XO E Q and o = D<p(xo) a simple characteristic of P at xO, let x(t) be the solution of (1.17) with x(O) = xO; we have dx/dt =1= 0 for all t (if dx/dt = 0 for t = to, then x(t) = x(t o ) would be the solution of (1.17) passing through (to, x(t o )) and so x(t) the solution of (1.17) passing through (to, x(t o )) would be constant, which is contrary to dx/dt(O) =1= 0). Now putting (t) = D<p(x(t)), we have that (x (t), (t)) is the solution of the Hamiltonian system satisfying x(O) = xo, (O) = o (this is the calculation leading to (1.18)). Hence the solution of the Hamiltonian system passing through (xo, o) is a bicharacteristic. D Proposition 4 below shows that every bicharacteristic solution can be defined in this way. In particular in a change of variable y = h(x) where h is a diffeo- morphism from Q onto an open set ofn, if(x(t), (t)) is a bicharacteristic solution of P, (y(t), 1'/ (t)) defined by y(t) = h(x(t)) 1'/(t) = (t). Dh(x(t))-l is a bicharacteristic solution of h* P. Indeed this result, of which we shall make use in the proof of Proposition 4, is easily shown by remarking that the principal of Q = h* P is defined by Q.(h(x),1'/) = p.(x,1'/. Dh(x)). In effect using Proposition 1 and the definition of h* P, Q. (h(x), 1'/) IS the coefficient of t n in the polynomial e- t 1].h(x) Pe t 1].h(x) = t n p.(x, 1'/. Dh(x)) + . . . . This shows that the notion of a bicharacteristic curve is a geometrical concept like that of a bicharacteristic surface. Proposition 4. Let XO E Q, o E n, o =1= 0 and 1'/0 E n be such that op. L iJj (XO, o)1'/o,j =1= 0, p. (XO, o) = 0, 
162 Chapter V. Linear Differential Operators On the other hand let tjJ E OO(D) be such that DtjJ(xO) = o. Then in a neighbour- hood Do of xO, there exists a unique function cp E oo (Do) such that the level surfaces {x E Do: cp(x) = constant} are simple characteristics of P satisfying the initial conditions Dcp(xO) = o, cp(x) = tjJ(x) for all x E Do such that (x - xo). 110 = o. Proof Let us put Ho = {x'; x' .110 = O}. From the implicit function theorem we see that, in the neighbourhood of 0 in H 0' there exists in a neighbourhood of o in , a unique solution t = t(x') of p.(XO + x', DtjJ(xO + x') + t11o) = O. Let us put (x') = DtjJ(xO + x') + t(x')110 and consider for all x' in the neigh- bourhood of 0 in H o , the solution (x(x', t), (x', t)) of the Hamiltonian system satisfying the initial condition, x(x',O) = XO + x', (x', 0) = (x'). By construction p. (x(x', 0), (x', 0)) = 0 and so (see above) (1.21) p. (x (x', t),  (x', t)) = o. Considering cp a solution of the problem defined in a neighbourhood Do of xO, we have for all x' in the neighbourhood of 0 in H o , Dcp(xO + x') = DtjJ(xO + x') + i(X')110 where i(x') is a continuous function of x' with i(O) = 0; we deduce that t(x') = t(x') and hence in the neighbourhood of 0 in H 0' Dcp (XO + x') =  (x') and x(x', t) is the solution of (1.17) with the initial condition XO + x'. We deduce that (1.22) cp(x(x', t)) = tjJ(XO + x'). To complete the proof of the Proposition, we show that (1.22) defines a solution of the problem. We notice, first of all, that the function t(x') is OO and hence so is  (x'); using regularity theorems on differential systems, we see that the functions x(x', t) and (x', t) are OO in the neighbourhood of 0 in Ho x . From the invariance of bicharacteristic solutions under change of variables, we can always suppose that 110 = (0, . . . , 0, 1) and hence that x' = (x, . . . ,x) and : (XO, o) "# O. The Jacobian at 0 of the field h: (x', t)  x(x', t) is ap./an (XO, o). We deduce from the inverse function theorem that h is a diffeomorphism in a neighbour- hood of 0 in H ° x  onto a neighbourhood of XO in D. This shows that the relation (1.22) defines a function of class OO in the neighbourhood of XO in D. We still have to show that (1.23) Dcp(x(x',t)) = (x',t). 
 1. Generalities on Linear Differential Operators 163 Indeed, in making the change of variable defined by h, we can now suppose that x(x', t) = XO + (x', t), that is to say that x j (x', t) = xJ + xj for j = 1,. . . , n - 1, We thus have xn(x', t) = x + t. ; (x(x', t), (x', t)) = j (x', t) = 0 for ) = 1,. . . , n - 1 : (x(x', t), (x', t)) = dn (x', t) = 1. Using Euler's formula ap. L -  (x(x', t), (x', t)) j(x', t) = mp.(x(x', t), (x', t))) = 0, a . J we deduce that (1.24) n(x', t) = O. Finally, we have for j = 1,. . . , n - 1 a o =  p.(x(x', t), (x', t)) uX'. J = r (x (x', t), (x', t)) + L a a P  . (x (x', t), (x', t)) k (x', t) uX. k uX. J J a. - 0/ (x', t), and thus atjl (1.25) j(x', t) = j(x') =  (XO + x') for j = 1,. . . , n - 1. uXj This clearly proves (1.23), since in this change of variable cp(x) = tjI(XO + x'). Remark 1. Supposing that the coefficients of p. are analytic (real or complex), the reader will be able to develop the whole of this Sect. 4.2 to bicharacteristics in the complex domain by continuing the coefficients to an open set of eN. 5. Operators with Analytic Coefficients. Theorems of Cauchy-Kowalewsky and of Holmgren We now suppose that the coefficients aa of P are analytic on Q. We have already seen, in Remark 1 above, that we can extend the theory of bicharacteristics to 
164 Chapter V. Linear Differential Operators the complex domain. Here we shall be interested in existence and uniqueness theorems in the neighbourhood of a surface which is not a characteristic. Theorem 3 (Cauchy-Kowalewsky). Suppose that cp is a real analytic function on Q such that (1.26) p.(x, Dcp(x)) =1= 0, \Ix E Q. Being given Do a relatively compact open set in D, there exists b > 0 such that for each Xo E Qo and for all functions f and g analytic on Q, there exists one and only one solution u which is analytic on th.e open set where U = {x E Q; dist (x, rxo) < b} r X O = {x E Do, cp (x) = cp (x 0 ) } 5 of the Cauchy problem (1.27) Pu = f on U { (u - g)(x + tDcp(x)) = O(t m ) when for all x E U with cp(x) = cp(XO), where m is the order of the operator P. tO (1.28) It should be observed that if cp(x) = X n - c the initial condition (1.28) becomes aku akg (1.29) ---;z (x) = --;:k (x) for k = 0, . . . , m - 1. aX n UX n Indeed, in the neighbourhood of each point xO, we can make a change of variables y = h(x) such that hn(x) = cp(x) - cp(XO). This change of variable is analytic with the result that h*P has analytic coefficients. Since alp = p.(. ,Dcp) is non-zero, we can always divide P by alp' that is we can suppose that am h P = - -  baD y a. * O y m n  lalm an < m To demonstrate the uniqueness of the analytic solution of (1.27), (1.28) we observe that for b chosen sufficiently small the connected components of U intersect the surface {x E U; cp(x) = cp(XO)}. Then, by analytic continuation, the uniqueness reduces to the following lemma: Lemma 4. Suppose that ko  0 and that cp E m+ko(Q) with real values satisfy- ing (1.26), f E kO(D), g E m+ko(D) and u E m+ko(D) 5 The point Xo E Qo is introduced only to express that {x E Qo, <p(x) = c} i= 0 when c = <p(Xo). 
 1. Generalities on Linear Differential Operators 165 a solution of the Cauchy problem (1.27), (1.28) (with U = Q). Further, suppose that for all x E Q with <p(x) = <p(XO), f(x + tD<p(x)) = O(t ko ) and Dag(x + tD<p(x)) = O(t ko ) as t  0, for all a with lal  m. Then Dau(x) = Dag(x) for all a with lal  m + ko and x E Q with <p(x) = <p(XO). F or the moment we assume this lemma and show how to prove the existence of a £5 > 0 such that we can solve the Cauchy problem (1.27), (1.28) for all XO E Qo and all prescribed f, g analytic on Q. We observe that the change of variables y = h(x) in the neighbourhood of XO is uniform with respect to XO E Qo. By uniform, we mean that there exists a fixed neighbourhood U o of 0 in [R such that for all XO E Qo, XO + U ° c Q and we can define the change of variables h in the neighbourhood of XO in XO + U o , such that h(xO + U o ) contains a fixed neighbourhood V o of 0 in [R and that the coefficients b a of h* P are uniformly bounded on V o ' We are then led to the following special case of the Cauchy-Kowalewsky theorem: Lemma 5. Suppose that Q is a neighbourhood of 0 in [Rn and that M > O. There exists a neighbourhood U of 0 in Q such that for every family (aa)lal m,a < m of functions analytic on Q with laal  M, all functions f and g analytic on Q, there. exists on U an analytic solution u of the Cauchy problem emu --;-m = L aa Dau + f on U, UX n lal  m an < m aku Ok g k k on {x E U; X n = O} for k = 0, . . . , m - 1. ax ax n n We shall not prove this lemma, referring for example to p. 483 of Treves [1] (see also Hormander [1], p. 116). Using the above change of variable y = h(x) and applying Lemma 5 to the neighbourhood V o of 0 in [R, we thus see that there exists W o , a neighbourhood of 0 in [R such that for all functions f, g analytic on Q and for all XO E Qo, there exists an analytic solution Uxo of the Cauchy problem (1.27), (1.28) on the neighbourhood Uxo = XO + W o of xO. We can always choose W o = B(O, (5). Taking account of the fact that D<p(x) =1= 0 for all x E Q, we can always find £5 > 0 with the result that Xl, x 2 E Qo, <p(x l ) = <p(x 2 ), U X l n U x 2 =1= 0  3xO E Qo n U X l n U X 2, <p(XO) = <p(x l ) = cp(x 2 ), and by the uniqueness of the analytic solution in the neighbourhood of xO, Xl, x 2 E Qo, <p(x l ) = <p(x 2 )  U x l = U x 2 on U x l n U x 2. 
166 Chapter V. Linear Differential Operators This shows that for all Xo E Qo, we can define u, analytic on the open set U of the theorem by: u == u x , for all x E Qo such that cp(x) == cp(Xo) which is the solution of the Cauchy problem (1.27), (1.28). D Proof of Lemma 4. Since it is the question of a local result, we can always suppose that, by a local change of variable, we have cp(x) == X n - x. We can also suppose that p. (. , Dcp) - 1 after dividing P by p. (. , Dcp) ;/= 0, that is to say (1.30) amu Pu == p - L aa Dau == f X n lalm an < m Finally replacing u by u - g andfby f - Pg, we can take g - o. Using (1.29) we thus have (1.31) Dau(x) == 0 for all a with an < m and x E Q with X n = 0, Referring to (1.30) we obtain amu --;- (x) - f(x) == 0 for all x E Q with X n == 0, ux m n and so (1.31) is true for all a with an  m. Differentiating (1.30) with respect to X n , we obtain the result by recurrence on k o . D This basic recurrence method will be developed in 3 within the framework of spaces of distributions. The Cauchy-Kowalewsky theorem is a local existence and uniqueness theoremfor analytic solutions of the Cauchy problem. In fact, it is not necessary to assume analyticity to obtain uniqueness: this is the basic uniqueness theorem for the existence of the continuation of solutions outside of the characteristics:- Theorem 4 (Holmgren's Theorem). Suppose that cp E 2(Q) with real values satisfying (1.26) and that Qo is a locally compact open set in Q. Then there exists J > 0 such that for every solution u E- [!fi' (Q) of Pu in [!fi' (Q) and all Xo E Qo, u == 0 on {x E Q; cp(x) > cp(Xo)} => u == 0 on {x E Q; dist(x, Qo) < J, Icp(x) - cp(Xo) < J} We shall reduce this result to the following proposition which is interesting in itself: Proposition 5. Suppose that cp is a real-valued analytic function on Q satisfying (1.26) and u E @'(Q) a distribution with compact support in Q. If we define Q+ == {x E Q; cp(x) > O}, then Pu == 0 on Q + => U == 0 on Q + 
 1. Generalities on Linear Differential Operators 167 Proof of Proposition 5. The transpose operator tp has analytic coefficients and by Lemma 3 CP).(x, D<p(x)) = (-l)m p.(x, D<p(x)) ;/= O. We consider Qo a neighbourhood of supp u, locally compact in Q and denote by J o the positive constant in the Cauchy-Kowalewsky theorem corresponding to tp, <p and Qo. We choose J > 0 such that for all Xo E Qo, U = {x E Q; dist (x, Qo) < J, l<p(x) - <p(Xo) < J} C {x E Q; dist (x, {z E Qo, <p(z) = <p(Xo)}) < J o } which exists since min ID<p(x)1 > O. Denoting Qo max {qJ(x); x E supp u} by c we have to show that Pu = 0 on Q + => C  O. Let us suppose to the contrary that: Pu = 0 on Q + and c > O. There exists Xo E Qo such that 0 < <p(Xo) < c < <p(XO) + J. Free to decrease it we can always suppose that Qo C {x E Q; <p(x) < <p(Xo) + J}. From the Cauchy-Kowalewsky theorem, for every fanalytic in Q, there exists a solution w, analytic in U, of t Pw = f on U w(x + tD<p(x)) = O(t m ) as t -4 0 for all x E U with <p(x) = <p(Xo). We shall now define the function v on Qo by ( ) = { w(x) if x E Qo, rp(x) > <p(Xo) (we then have x E U) V X 0 if x E Qo, <p(x)  <p(Xo). Now the distribution u, having compact support, is of finite order ko (see Appendix "Distributions"). If we suppose that (1.32) f(x + tD<p(x)) = o(t kO ) when t -4 0 for all x E U with <p(x) = <p(Xo) we shall have by Lemma 4 Daw(x) = 0 for all a with lal  m + ko and x E U with <p(x) = <p(Xo) and so v E m+ko(Qo)' In addition then: tpv = g on Qo, 
 168 Chapter V. Linear Differential Operators where g is defined by ( 1 33 ) g( x ) = { f o (X) if <p(x) > <p(Xo) . if <p(x)  <p(Xo). Since g is of class ko in a neighbourhood of supp u we can consider < u, g> = < u, t Pv > = o. since Pu = 0 on D + a neighbourhood of supp u n supp v. To achieve this, let us show that u = 0 on {x E Q, <p(x) > <p(XO)}, which will imply the contradiction c  <p(XO). For this we take ( E  ( {x E Do, <p(x) > <p(XO)}) and consider fAx) = (<p(x) - <p(xO))k o +1(2n)-n/2 f p(x - ey)e- I12 dy where: p(x) = (<p(x) - <p(XO))-(k o +1)((X). Since f p(x - ey)e- 1Y / dy = e- n f p(y)e IX:,12 dy the function Ie is analytic on Q. Since { <p(x + tD<p(x)) - <p(XO) = O(t) as for all XED with <p(x) = <p(XO), the function Ie satisfies (1.32). Since p E oo (D) and f e- 1Y / dy = (2n)"/2 (<p(x) - <p(XO))-(k o + 1) Ie (x) -4 p(x) In so Ie -4 ((x) = (<p(x) - <p(XO))ko + 1 p(x) In and ge -4 ( in ko(Q) where ge is defined by equation (1.33). We deduce that <u, ,> = lim <u, ge> = o. Proof of Theorem 4 (Holmgren's Theorem). There exist strictly positive con- stants c and 11 such that for all XO E Qo t-40 oo (D) oo (Q) \Ix E D with Ix - XO I < 11 p.(x,) ;/= 0, and  E IR n with  _ D<p(xO) ID<p(xO)1 < c. 
 1. Generalities on Linear Differential Operators 169 We can always suppose that Q' = {x E Q; dist (x, Qo) < Yf} is locally compact in Q with the result that there exists C such that (1.34) qJ(x)  qJ(XO) + DqJ(xO). (x - XO) - CIDqJ(xO)llx - XO 1 2 \Ix E Q', Xo E Q. Free to reduce the value of Yf > 0, we can always suppose that (1.35) CYf CYf < 1 and 1 - CYf c  -; 2 we fix XO E Qo and set DqJ(xO) o = IDcp(xO)I ' t(x) = o. (x - XO), x'(x) = x - XO - t(x) o. Since Ix - xOl 2  Ix - xOI (It(x)1 + Ix'(x)l), we have from (1.34), (1.35) that Ix - XO I < Yf, c qJ(x)  qJ(XO) => t(x) < "2 Ix' (x)l. We now put tjJ(x) = t(x) + C(28 - (Ix' (X)12 + 8 2 )1/2) where 8 > 0 is chosen so that the set K = {x; I x - XO I < Yf, qJ(x)  qJ(XO), tjJ(x)  O} is compact in the ball B(xO, Yf) = {x; Ix - xOI < Yf}. This is always possible, since if x E K, we have C c(lx'(x)1 - 28)  t(x)  "2 IXl (x)1 and so Ix'(x)1  48 and t(x)  28C (it is enough to take 8 = Yf/ J20 since c < 1). The function t/1 is analytic on B(xO, Yf) and c I x' (x) I IDrf1(x) - ol = (lx'(xW + 8 2 )1/2 < C so that it satisfies (1.26) on B(xO, Yf). Let us consider ( E (B(xO, Yf)) such that ( = 1 in the neighbourhood of K. Being given u E '(Q) solution of Pu = 0 in '(Q) with u = 0 on {x E Q; qJ(x) > qJ(XO)}, the distribution v = (u E lS'(B(xO, Yf)) and satisfies Pv = 0 on {x E B(xO, Yf); tjJ(x) > O} since v = u in the neighbourhood of K :::> {x E B(xO, Yf) n supp u; tjJ(x) > O}. 
170 Chapter V. Linear Differential Operators Using Proposition 5 v =0 on {x E B(xO, tl); tjJ(x) > O}, and so also u = 0 on {x E B(xO, tl); tjJ(x) > O}. Pu tting J = t C8, we have, since c < 1, 8 C8 Ix - xOI < J => Ix'(x)1 < 2 and t(x) > - 2 => tjJ(x) > o. This completes the proof. o Remark 2. If q>(x) = o. x with o E n, I o I 1, we have C = 0 in (1.34); hence we can take J = Ctl/2J20 where c, tl > 0 must satisfy tl < dist (Qo, 8Q) and p. (x,) =1= 0 for all x E Q with dist (x, Qo) < tl and  E n with I -ol < c. o 2. Linear Differential Operators with Constant Coefficients The operators D j are invariant under translations on n; i.e. (Dj u)(x + y) = (Dj u( . , y))(x) x, y E n. It is thus the same for all1.d.o. on n with constant coefficients. Conversely, let P be a locally linear mapping from (n) into '(n), invariant under translation, that is to say P(u(. + y)) = (Pu)(. + y), Vy E n, Vu E (n), where for a distribution f E ' (n), f(. + y) is the distribution defined by < f(. + y), v > = < f, v (. - y) > Vv E (n). Then P is a l.d.o. on n with constant coefficients. Indeed, from Lemma 1 of 1, there exists a non-empty open set U such that the restriction of P to (U) is continuous. From the invariance under translation, P is therefore a l.d.o. on n. Now, the operators D a being invariant under tlanslation, the coefficients of Pare invariant under translation, i.e. are constants. We summarise these properties in: Theorem 1. The l.d.o. on n with constant coefficients are the local linear mappings, invariant under translation, from (n) into ' (n). We propose to present in this section several useful techniques for the study of l.d.o. with constant coefficients. 
2. Linear Differential Operators with Constant Coefficients 171 Definition 1. A l.d.o. on n with constant coefficients is an operator P = P(D) of '(n) where P() is a polynomial with coefficients in C. The order m of P is the degree of P(); the principal part of P is the l.d.o. with constant coefficients p. = P.(D) where p.() is the homogeneous part of degree m of P( ). The transpose t P of P is the l.d.o. with constant coefficients tP(D) = P( - D) (see (1.8)). The composition of two l.d.o. with constant coefficients P = P(D) and Q = Q (D) is the l.d.o. defined by the product P() Q (); in particular P and Q commute. The reader will easily verify the following. Proposition 1. A l.d.o. on a connected open set of n is with constant coefficients iff it commutes with the operators D j f(Jr j = 1, 2, . . . , n. From (1.9), for all  E cn, we have P() = e-lpPelp with <p(x) = .x. In particular, we deduce the: Proposition 2. If P is a l.d.o. with constant coefficients, then u(x) = e' x is a solution of Pu = 0 iff P() = o. 1. Study of a l.d.o. with Constant Coefficients by the Fourier Transform 1.1. Existence of a Solution of Pu = fin the Space of Tempered Distributions An essential tool for studying the l.d.o. with constant coefficients is the Fourier transform. We have in fact (see 3 of Appendix "Distributions") for u E /7 (resp. /7'), Dj u E /7 (resp. /7') and 15;?i(y) = iy j u(y)6. We then define the characteristic polynomial of P by (2.1 ) p(y) = P(iy). 7 We have immediately the: Proposition 3. Suppose that P = P(D) is a l.d.o. with constant coefficients. For all u E /7 (resp. /7'), we have Pu E /7 (resp. /7') and P'U = pu where p is the characteristic polynomial of P defined by (2.1). 6 We denote by u the Fourier transform of u (see Appendix "Distributions"). 7 We have P(D) = p(D/i); for this reason various authors (see Hormander [1]) reserve the notation 1 a D for the operator - -. i ax 
172 Chapter V. Linear Differential Operators Otherwise stated, being given u and f E g": Pu = f <=> pu = J From this we deduce the Theorem 2. Suppose that P is a l.d.o. with constant coefficients and characteristic polynomial p, and that (2.2) p(y) i= 0, Vy E n. Then, for all f E g", there exists one and only one u E g" such that Pu = f. Proof. From Proposition 3, this is the same as showing that for all g E g" there exists one and only one v E g" such that pv = g. From (2.2), p -1 = lip is in $ and so for all g E ' there exists one and only one v E ' such that pv = g. We thus have only to show that under the hypothesis (2.2) p-1g E g" for all g E g", i.e. that p - 1 E (!J M (see 3.9.1 of Appendix "Distributions"). Now, being given pEe [y J, there exists C E  and b E Q) such that inf {I p(y) I; y E n, 1 y I = L} I"'V CL t5 when L  00. This follows from Lemma 1 below with A (L, Y l' . . . , Y n) = (yi + . . . + y - L2)2 B(L, Y1, . . . , Yn) = - {(Re P(Y1' . . . , Yn))2 + (Imp(Y1' . . . , Yn))2}. Now, when p satisfies (2.2), we necessarily have C > 0, and so there exists a constant C' > 0 such that Ip-1(Y)1  C'(l + IYI2)-t5 1 2, Vy E n. Finally, for all (X E Nn, D a p -1 = PaP -Ial where Pa E C [y J, and so there exist constants C a > 0 and b a E  such that ID a p-1(y)1  C a (l + IYI2)-t5 a I 2, Vy E n, that is to say, p -1 E (!J M. We have made use of the following lemma for a proof of which we refer the reader to Hormander [lJ, p. 276: Lemma 1. Let A and B be two polynomials, with real coefficients in r + 1 real variables L, L l' . . . , L,. For aUL E , let us put m ( L) = sup {B ( L, . . . , L,); L k E  and A ( L, L l' . . . , L,)  O}. Then, ifm(L) = + 00 for L sufficiently large, there exist C E  and b E Q) such that m(L) I"'V CL t5 when L  00. Remark 1. We can, indeed, show (see Treves [lJ, p. 313) that for a l.d.o. with arbitrary constant coefficients, for all f E g", there exists u E g" such that Pu = f. 
2. Linear Differential Operators with Constant Coefficients 173 The uniqueness of the solution u E [//' of Pu = f is obviously equivalent to the property U E [//' and Pu = 0  u = 0 which by the Fourier transform becomes v E [//' and pv = 0  v = 0; since the Dirac distributions are in [//', this again is equivalent to the condition (2.2). Hence, the conclusion of Theorem 2 holds iff P satisfies (2.2). On the other hand, we have seen in the course of the proof of Theorem 2 that the condition (2.2) implies the existence of a constant (j E  such that (2.3) Ip(y)1  c(1 + lyI2)O/2 Vy E n with c > O. When such a condition is satisfied, by the definition of Sobolev spaces H U = {u E [//'; (1 + lyI2)u/2Iu(y)1 E L2} (see Chap. IV), we have for all (j E  (2.4) U E [//' and Pu E HU  U E H U + o . D 1.2. Example 1: The Laplacian The Laplace operator or Laplacian, L1 = Di + D + . . . + D;, has -IYI2 = - (yi + y + . . . + y;,) for its characteristic polynomi:al. The operator P = A - L1 thus has characteristic poJynomial p(y) = A + lyl2 and we find that P satisfies (2.2) if and only if A   - = J - 00, OJ and then P satisfies (2.3) with (j = 2. We thus have: if A E e \  - , the restriction of A - L1 to [//' is an isomorphism of [//' which maps H U + 2 onto HU for all (j E . In the particular case A = 0, P is (apart from sign) the Laplacian. The condition (2.2) is not satisfied, and the kernel of the operator P = - L1 is the set of harmonic functions, and P is not an isomorphism of [//' into itself. The surface L1 () = i + . . . + ;, = k is a sphere in en: the operator L1 is the standard type of elliptic operator. The Laplacian is invariant under every isometry of the Euclidean space n: such an isometry is composed of translations, symmetries and rotations. It is the same for all operators P = L a k L1 k . Conversely, we have the Proposition 4. Suppose that P is a local linear mapping of (n) into ' (n), the invariant under the isometries of Euclidean space n. Then P = L a k L1 k . Proof Being invariant under translation P is a l.d.o. with constant coefficients: let p be its characteristic polynomial. An orthogonal transformation h of n 
174 Chapter V. Linear Differential Operators commutes with the Fourier transform in f// (see 3.8 of Appendix, "Distri- butions"). Hence we have for all u E f// h    A ( * P) u = hP(h - uJ = h(p(h - u)) = (hp)u. Since P is invariant under h, pu = (hp)u for all u E f// and so hp = p. This being true for every orthogonal transformation of n, p is a polynomial in (YI + . . . + Y;) and so P = p(D/i) is a polynomial in L1. D Let P = L a k L1 k be order 2m and let us suppose that (2.5) L akA k i= 0, V A  o. Then the zeros A 1 , . . . , Am of the polynomial LakA k are in e\-. Since P = am ( L1 - A 1) . . . (L1 - Am), we deduce: Proposition 5. Under the hypothesis (2.5) the restriction to f//' of P = L a k L1 k of order 2m is an isomorphism of f//' which maps H a + 2m onto H a for all (j E . 1.3. Elliptic and Strongly Elliptic Operators Let us consider first of all a l.d.o. with constant coefficients of order 2 on n; it can be written 0 2 0 P = L L a jk + L a j - + a j k oxjox k j oX j where (ajk) is a symmetric matrix. Let  E en and cp(x) = (. x). For u E .@'(n), we have Pu = el,OQ(e-l,Ou) with Q=LLajk aa +L ( 2Lajkk+aj ) - (} a +P(). j k Xj X k j k x j In particular: if I ajk I i= 0, there exists a (unique)  E en such that Pu = el,OQ(e-l,Ou) with cp(x) = . x and Q = p. + P(). Now let us make a change of coordinate axes in the affine space n, x' = hx where h = (h jk ) is a regular real matrix. We have (}2 h*r = LLajk a 'a' with ajk = LLalrhjlhkr' j k X j Xk 1 r Let us suppose that (a jk ) is real, or, more generally that (Re a jk ) and (1m a jk ) commute. Then there exists a (real) orthonormal basis of eigenvectors of (a jk ) and so there exists a regular real matrix (h jk ) such that a'jj = e iOj for 
2. Linear Differential Operators with Constant Coefficients 175 j = 1,. . . , m, a}k = 0, for j i= k or j = k = m + 1,. . . , n where m is the rank of (ajk) and 0 1 , . . . , Om the arguments of the non-zero eigenvalues of (ajk). In particular, we deduce the Proposition 6. Let P = LLajko2joXjOXk + Lajojox j + a. Suppose that the matrices (Re ajk) and (1m ajk) commute and that the eigenvalues of(ajk) are all non- zero and have the same argument O. Then there exists a change of coordinate axes in the affine space n, x' = hx such that h* p. = e iO 11 x'. We have thus, in this case, reduced P in the affine space n to A - 11. Such an operator is said to be strongly elliptic. More generally we have Definition 2. Let P be a l.d.o., with constant coefficients, of order m, and with principal part p.; we say that P is O-strongly elliptic 8 (0 E j2nZ) if (2.6) Re(e iO P.(i)) > 0, 'V E n\ {O}. Proposition 7. Let P be a l.d.o., with constant coefficients, of order m and O-stronglyelliptic. Then there exists Ao E [R such thatfor all A E C with Re A > Ao, the restriction to [//' of the l.d.o. e - iO A + P is an isomorphism of [//' mapping Ha + m onto Ha for all (J E . Then we should observe that the order m of P is even and that the transpose tp of P is also O-strongly elliptic. Proof of Proposition 7. The characteristic polynomial of the l.d.o., e - iO A + P is P;.(y) = e-iO() + q(y) + eiOi m p.(y)) where q is of degree strictly less than m. Let us put c = min {Re(eiOi m p.());  E n, I I = 1}; by the homogeneity of p. Re(eiOimp.(y))  clylm, 'Vy E n. We have lim Re q(y) - clylm = - 00. Iyloo We define Ao by AO = max {Re q(y) - clylm; y E n}. If Re A > Ao, there exists c' < c and A' < Re A such that: Re q(y)  c' I ylm + A', 'Vy E n, and so Ip;.(y)1  Re A + Req(y) + Re(eiOimp.(y))  (Re A - A') + (c - c')lylm. 8 We say also that P is strongly elliptic if there exists 8 E [R such that P is 8-strongly elliptic. 
176 Chapter V. Linear Differential Operators Thus p A. satisfies (2.3). o The strongly elliptic operators are a special case of elliptic operator: Definition 3. We say that a l.d.o. with constant coefficients P is elliptic if its principal part p. satisfies (2.7) p.() i= 0, v E n\ {O}. Remark 2. Ifn = 1, every operator is elliptic, and it is strongly elliptic if and only if its order is even. If n  2, an operator with real principal part is elliptic iff it is strongly elliptic: indeed, the open set n\ {O} being connected, p.(n\ {O}) is an interval of ; to say that p. satisfies (2.7) is thus equivalent to saying that p. satisfies (2.6) with () = 0 or 7r. That this is no longer true for an operator with complex principal part is shown by the following example: Example 2. The Cauchy-Riemann Operator. The Cauchy-Riemann operator in the plane 2 = {(x, y)} is a def  (  + i ) 2 ox oy with characteristic polynomial 1 p(, r,) = 2(i - r,). We have L1 = 400 where o def  (  - i ) . 2 ox oy This decomposition of the Laplacian into the product of two first order l.d.o. is no longer possible for n  3 since the polynomial i + . . . + ; is then irreducible in C [fl' . . . , nJ. Identifying 2 and C, a distribution u E ' (0) is a holomorphic iff OU = 0 and then OU is the derivative of u. We deduee: Vu E ' (0), ou = 0 in /(O)  u analytic on O. This property is a 'particular case of the hypo-analyticity of o. More generally, a l.d.o. P with constant coefficients is said to be hypo-analytic if, for all u E ' (0), Pu analytic on 0 implies that u is analytic on O. As we shall see later (see Corollary 1 of4), the hypo-analyticity of a l.d.o. P with constant coefficients is equivalent to the ellipticity of P. It follows immediately that o is elliptic. o 
2. Linear Differential Operators with Constant Coefficients 177 1.4. Hypo-Elliptic and Semi-Elliptic Operators Definition 4. We say that a l.d.o. P with constant coefficients is hypo-elliptic if for every open set 0 of n U E 9)' (0), Pu E $(0)  U E $(0). We shall show later (see (4.8) of 4) that a l.d.o. P with constant coefficients is h ypoelliptic iff { there exists c: n  + with C(y)  0 when Iyl  00 such that (2.8) Ip(k)(y)1  C(y)lp(y)1 Vy E IR n , Vk E Nn, Ikl > 0, where p(k) = a1k1p/ayk, p being the characteristic polynomial of P. Being given a l.d.o. P with constant coefficients, we can consider P() to be a polynomial in  1 with coefficients in C [2' . . . , nJ. We define the partial order ofP with respect to a/ax l as the degree of P() with respect to l' Ifm l is this partial order, amI p=p -+..'+P m1  ml 0 UX l where P m1' . . . , Po are operators on n-l = {(x 2 , . . . , x n )} with P m1 =1= O. A consequence of (2.8) is: if P is hypo-elliptic, then P m1 is of order zero. Indeed, if not, there would exist k' E Nn-l, with Ik'i > 0 such that p? = a, a non-zero constant. When IYll -* 00 we should then have p(O,k')(Yl,O, . . . , 0) "'-1 ilk' I +m1 ayTl, p(y,O, . . . ,0) = O(yT1) which contradicts (2.8). For an elliptic operator, all the partial orders of P are equal to the order m of P. As a matter of fact, the principal part of P is amI p. = Q - + . . . + Q ml  m1 0 UXl where Qml'. . . , Qo are the homogeneous parts of order m - m l , . . . , m respectively of the operators P m1 , . . . , Po. If m l < m, then all these operators will be of order  1 and so p. (1, 0, . . . , 0) = Qml (0, . . . , 0) + . . . + Qo(O, . . . , 0) = O. We now give a sufficiently general class of hypo-elliptic operators. Proposition 8. Let P = L a(1D(1 be a l.d.o. with constant coefficients on n. We denote by m l , . . . , m n the partial orders of P with respect to a/ax l , . . . , a/ax n respectively. Being given a = (a l , . . . , an), we denote by la: m I the sum (Xl/ml +. . . + an/m n and suppose that 
178 Chapter V. Linear Differential Operators (i) aa == 0 for all a E N n with la: ml > 1, (ii) the characteristic polynomial po of po == L aa Da satisfies la:ml =1 pO(y) i= 0 Vy E n\ {O}. Then P is hypo-elliptic. Proof For all t > 0, ya == zatla: ml with Yi z.==- l t l/mi . Taking t == IY11 m1 + . . . + IYnl mn , we have I z 1 1 m 1 + . . . + I z n I mn == 1. (a) Since pO(y) == L i lal aaya, pO(y) == tpO(z) and so la: ml =1 I p O (y ) I  c ( I y 1 1 m 1 + . . . + I y n I m n ) with c == inf {lpO(z)l; IZ11m1 + . . . + IZnlmn == 1} > 0; (b) I yal  ca(1 Y11 m1 + . . . + I Yn Imn)la: m l , with c a == sup {I zal; I z 1 1 m 1 + . . . + I z n I mn == 1} . Hence on the one hand Ip(y)1  IpO(y)1 when Iyl -+ 00, while on the other hand, for all k E Nn with Ikl > 0, Ip(k)(y) I == o( I pO(y) I) when I y I -+ 00. The proposition follows from (2.8). D Definition 5. We say that an operator satisfying the hypotheses of Proposition 8 is semi-elliptic. We observe that an elliptic operator is a semi-elliptic operator whose partial orders with respect to %x 1 , . . . , %x n are all equal. We now prove Proposition 9. Let P be a l.d.o. with constant coefficients of order m and with principal part p.. Then P is elliptic iff it is hypo-elliptic and satisfies (2.9) L ape () =I 0, Oj v  E n \ {O}. 
2. Linear Differential Operators with Constant Coefficients 179 Proof Indeed if P is elliptic, it is semi-elliptic and hence hypo-elliptic. On the other hand, from Euler's formula, p. being homogeneous of degree m, op. r(() = m L(j O(j (() and so if L lop./oj()1 vanishes, the same is true of p.(). Conversely, let us suppose that (2.9) is satisfied and that P is not elliptic, that is to say there exists  E n \ {O} with p.() = O. Then we have for y = t/i with t E , p(y) = P(t) = O(ltl m - 1 ) when I tl -+ 00 op oP m-l op. L oYj (y) = L O(j (t()  It I L O(j (() when It I -+ 00. so P does not satisfy (2.8) and hence is not hypo-elliptic. o There exist other classes of hypo-elliptic operators than those of semi-elliptic type (see Hormander [1J, p. 103). The heat operator is an example of a semi- elliptic operator which is not elliptic. 1.5. Examples In Chaps. XIV to XVIII, we shall encounter evolution operators, that is to say operators involving a space variable x E n and a time variable t E , so the operators are in the space n + 1. Example 3. The heat operator. First of all we denote the heat operator by o P = - - L1 o ot x whose characteristic polynomial is p(y, s) = yi + . . . + y; + is. The surface P o(, T) _ T - I  1 2 = A is a paraboloid of revolution in C n + 1; the heat operator is the standard type of parabolic operator. Considering o P, = A + - - L1 A ot x we have: P A, satisfies (2.2) iff Re A > 0; and then P A, satisfies (2.3) with (j = 1. Indeed if Re A > 0 IpA,(y,s)1 = {(lyI2 + ReA)2 + (s + ImA)2}1/2  c(1 + lyl2 + Is!); 
180 Chapter V. Linear Differential Operators we can then apply Theorem 2. But we can remark that for u E ' (n + 1), P;.,u = e{f.l-;")t P I1 (e(;"-I1)t u ). We thus have the result: Proposition 10. Being given A E C, C > - Re A andfE '(n+l) with e-ctfE g", there exists one and only one u E ' (n + 1) with e - ct U E g" such that ( A + :t - L1x) u = f We observe finally, and above all, that the heat operator is semi-elliptic, and hence hypo-elliptic. Example 4. The Schrodinger operator. We now consider more generally the operator a - - wL1 with W E C* at x Its characteristic polynomial w(YI + . . . + Y;) + is vanishes on n + 1 for Re w(YI + . . . + Y;) = 0, s + 1m w(YI + . . . + Y;) = o. If Re ill =f:. 0, a/at - wL1 x is semi-elliptic and therefore hypo-elliptic. In fact it has all the properties of the heat operator. If Re W = 0, the characteristic polynomial of a/at - wL1 x vanishes on the paraboloid in n+ 1 with equation s + (1m W)(YI + . . . + Y;) = 0: it is not semi-elliptic; neither is it hypo-elliptic as ap/as = i and so p does not satisfy (2.8). With a trivial change in the time, we have the Schrodinger operator 1 a -- - L1 i at x which is of a special type. We note that its transpose is equal to its conjugate. Example 5. The d' Alembertian. The wave operator or d' Alembertian is a 2 o = at 2 - L1x whose characteristic polynomial is p(y,s) = YI + ... + Y; - S2. The surface 0 (, T) = T 2 - (I + . . . + ;) = A is a hyperboloid of revol- ution in cn + 1: the d' Alembertian is the standard type of hyperbolic operator. In the case n = 1 the d' Alembertian factorises:- a 2 a 2 ( a a ) ( a a ) at 2 - ax 2 = at - ax at + ax 
2. Linear Differential Operators with Constant Coefficients 181 but such a decomposition is no longer possible for n  2. The d' Alembertian satisfies (2.9) and is not elliptic: it is therefore not hypo-elliptic. 1.6. Reduction of Operators of Order 2 in a Homogeneous, Isotropic "Medium" We have seen that a l.d.o. of order 2 on n invariant under the isometries of n is of the form a(A - L1). Let us now consider a l.d.o. P of order 2 on n+ 1 which we assume is: (i) invariant under every isometry of the space-variable; (ii) invariant under every translation of the time. Then P is invariant under translation on (x, t) and so has constant coefficients. Its characteristic polynomial p(y, s) is invariant under isometries on the variable y (also called orthogonal transformations); being of degree 2. p(y,s) == ko + k 1 is - k 2 s 2 + k(YI +... + y;). Therefore 8 8 2 P == ko + k 1 8t + k 2 8t 2 - k L1x. We may suppose that k i= 0 and Ik 1 1 + Ik 2 1 i= 0, otherwise P reduces to an ordinary differential operator or to ko - kL1x. We shall reduce P to a simple form. Case 1: k 2 i= 0 Let us put c == - k 1 /2k 2 , A == (k 2 /k)!. Then making the change of scale x' == AX in the space variable and changing the independent function from u to v where v(x', t) == e-ctu(x, t), we have t ( 82 V '0 ) (Pu)(x, t) == Ce C a o v + 8t 2 + e l L1x'v (x', t) with C = k 2 , a o = : - ( ::J 2 and e = ar g (- :J . We have thus reduced the operator P to 8 2 iO Q == a o + 8t 2 + e L1 x' . We distinguish two cases: Elliptic case: From the Definition 3, the operator P is elliptic iff k 2 / k   + , so iff k 2 i= 0 and () i= n(2n)9. We observe that then P is strongly elliptic and that if the principal part of P is real, then () == 0(2n) and Q == a o + L1(X',t). Hyperbolic case: If k 2 i= 0 and P is not elliptic, we have k 2 /k E  +, that is to say () == n(2n). Then Q == a o + D(x',t). 9 We make use of the notation k(2n) (resp. k(n)) to denote the equivalence class of k modulo 2n (resp. n). 
182 Chapter V. Linear Differential Operators Case 2: k 2 = 0 We then have k 1 :j:. O. Let us put c = -ko/k, A = Ik 1 /kl and make the change of scale t' = At and the change of function v(x, t') = e-ctu(x, t); we then have with (Pu)(x, t) = Ce ct ( e i8 ;;, - LlxV) (x, t') k 1 C = k and e = arg k' In other words, P is reduced to Q = e iO a/at' - L1x, which we have already classified in two categories: Parabolic case: if k 1 / k €f irR, i.e. if e =1= 1n(n), the operator P is semi-elliptic and possesses the same properties as the heat operator. We remark that if P is real then e = 0 or e = nand P is reduced to the heat operator or its transpose. Finally, we notice that we pass from - a/at - L1 to a/at - L1 by inversion of the time t' = - t. Schrodinger case: if k 1 /k E irR, i.e. e = 1n(n), the operator P is reduced to the Schr6dinger operator or its conjugate which also is its transpose. To sum up we have: Proposition 11. A. l.d.o. of order 2 in rR n + 1, invariant under isometry on the space and under translation in the time can only be one of four types: elliptic, hyperbolic, parabolic or of Schrodinger type. If it is real, it reduces to one of the three operator s: a o + L1(X, t), a o + D (x,t), a at L1x. 2. Elementary Solutions of a I.d.o. with Constant Coefficients 2.1. Introduction A second fundamental tool 10 for the study of l.d.o. with constant coefficients is the convolution (see Appendix "Distributions"). We have, in fact, the Proposition 12. Suppose that P is a l.d.o. with constant coefficients and that u, v are two distributions with convolutive supports 11 . Then Pu, v and u, Pv respectively have convolutive supports and we have P(u * v) = Pu * v = u * Pv. 10 From the Fourier transform, and moreover linked to it. 11 See Appendix "Distributions", 2.3. 
2. Linear Differential Operators with Constant Coefficients 183 The Dirac b-distribution being the unit element for the convolution it is interesting to give the: Definition 6. Suppose that P is a l.d.o. with constant coefficients. We call an elementary solution 12 of P ev:ery distribution E such that P E = b. If E is an elementary solution of P andfa distribution with support convolutive wit that of E, then P(E * f) = E * Pf = f In other words, the operator E * is a right inverse and a left inverse of the restriction of P to the space of distributions with support convolutive with that of E. By virtue of this, the existence of an elementary solution yields existence and uniqueness results: (a) iff is a distribution with support convolutive with that of E, there exists a solution of Pu = f; (b) if u is a distribution with support convolutive with that of E, then Pu = 0 => u = O. In particular, P is of order zero iff it admits an elementary solution with compact support. The necessary condition being obvious, let us suppose that P admits an elementary solution with compact support; in particular, then for every dis- tribution u, Pu = 0 => u = O. But if  E en is a solution of P() = 0, then u(x) = e. x is a non-zero solution of Pu = O. Hence P() :j:. 0 for all  E en, which implies that P is of order o. We observe that if P is hypo-elliptic, then every elementary solution of P is  00 on [Rn\ {O}. It is interesting to note that the converse is true; more precisely we have the Proposition 13. Suppose that P admits an elementary solution E of class  00 on [Rn\ {O}. Then P is hypo-elliptic. Proof We shall make use of the following property of the singular support of the convolution of two distributions. Being given u E fl)' (Q) the singular support ofu, (denoted by supp sing u), is the complement of the largest open set Q' c Q such that the restriction of u to Q' is  00: the fact that this largest open set exists results from the local character of a distribution and the nature ofoo. We then have 13 that, given u, v two distributions on [Rn, with convolutive supports. supp sing u * v c supp sing u + supp sing v. Since E is oo on [Rn \ {O}, supp sing E = {O}. Now let us be given u E £&'(Q) such that Pu E oo(Q). Being given Q' relatively compact in Q, let us be given cp E £&(Q) such that cp = 1 on Q'. Then cpu E C' (Q) and we have P(cpu) = cpPu + v where v = 0 on Q'. Since P(cpu) has compact 12 Or, again, fundamental solution. 13 See Chazarain- Piriou [1] ' 
184 Chapter V. Linear Differential Operators support, we have qJU == E * P(qJu) and hence suppslngqJu c suppsing E + suppsingP(qJu) - supp sing P( qJu) == supp sing v. The last equality comes from the fact that qJPu E £&(Q). Now qJU U on Q' and hence supp sing U n Q' == supp sing qJU n Q' c supp sing v n Q' 0, which shows clearly that U is oo on Q'. o We have an analogous result for analytic functions; we shall show in fact (see Corollary 1 of 94) that every elliptic operator is hypo-analytic and conversely. In dimension n  2, the open set IRn\ {O} is connected. Hence an analytic function on IRn\ {O} cannot vanish on a non..empty open set. An elementary solution of an elliptic operator on fRn with n  2 cannot vanish on a non-empty open set. We shall see later that it is not the same in the general case: the existence of elementary solutions with support in a half-space or a salient cone and the regularity oo of this elementary solution on [Rn, enables us to generalise the classification into parabolic and hyperbolic operators. 2.2. Elementary Solutions in !f'. Examples We shall first of all, by examples, make precise the elementary solutions in !/'. We observe that, since b E !/' and g == 1, a tempered distribution E E !/' is an elementary solution of P iffpE == 1. We have seen in Remark 1 that there always exists an elementary solution in !/' and that it is unique iff p does not vanish on [Rn. We now give some examples of elementary solutions in !/'. Example 6. Elementary solution in !/' of the operator p;. == A - 11. We seek elementary solutions in !/' of the operator p;. == A - 11 where A E C. The characteristic polynomial is P;.(y) == A + lyl2. We notice, first of all, p;. being elliptic, that every elementary solution is oo (even analytic) on [Rn\ {O}. Now if E is an elementary solution, the same is true of F defined by F(x) =  f E(IX1(f)d(f, (fn where d(f is the surface measure on the unit sphere in IRn and (f n is the total mass of d(f. In addition if E E !/', so also does F. Thus to seek the elementary solutions of p;. in !/' reduces to looking for the radial 14 elementary solutions in !/' and the solutions U E !/' of p;.u == O. 14 That is to say, depending only on Ixl. 
2. Linear Differential Operators with Constant Coefficients 185 We distinguish three cases: (a) A E C \ IR-. Then PA does not vanish on IRn. Thus there exists one and only one elementary solution E A of P A, in !/' given by ( 1 ) 1 k n - 2 f 00 n - 3 E A = ff- 1 = - e-klxlt(t2 - 1) dt A + I y 1 2 (j n (n - 2)! 1 where k is the square root of A in C with positive real part. (See Chap. II, 9 8 , (8.109) for the explicit expression for E A ). This solution is obviously radial; it is in fact the only radial elementary solution. (b) A = O. We have the elementary solution of _11 15 (see Chap. II. 9 2 ): 1 for n = 1, E(x) = - 2: Ixl 1 1 E(x) = -Log- 2n Ixl 1 E(x) = ( _ 2) I In-2 ; n (jn XI for n = 2, for n  3, this solution is in !/' and is radial. To within an additive constant, it is the only radial elementary solution. To within a harmonic polynomial, it is the only elementary solution in !/' (see Chap. II, 9 2 ). Finally, we notice that for n  3, the above elementary solution is the only elementary solution tending to zero at infinity. (c) A < O. Then P A = - (11 + k 2 ) with k > O. In the case n = 1, an elementary solution of d 2 /dx 2 + k 2 is ( ) _ sinklxl Ex - k . In the case n  2, the explanation of a radial elementary solution is more critical (see Chap. II, 9 8 ). We observe, however, that such a solution is not unIque. In the case n = 1, U E g" and ( dd: 2 + k 2 ) U = 0 -= u(x) = acoskx + b sin kx with a, bE C. Finally we notice the link between the elementary solutions of - (11 + k 2 ) and those of the wave operator a 2 /at 2 - 11x. Being given, E !/(IR 1), we define the Fourier transform in t by fft(: (x, k) E n +1  f e - ikt ((x, t) dt. 15 These elementary solutions have been used frequently in Chap. II; they were used to define the Newtonian potentials. 
186 Chapter V. Linear Differential Operators Being given F E g" (IRn + 1), we define fft F E g" (IRn + 1) by < JFtF, ,> == <F, fft'> for all 'E g'(lRn + 1). We verify, as in the case of the global Fourier transform, that fft is an isomorphism of 9"(lRn+l) onto itself and that (  ) = -ikF and b == b(x) Q9 1 (k); hence FE g"(lRn+1) is an elementary solution of a 2 / at 2 - L1 x iff E ==  F satisfies -(L1 + k 2 )E == b(x) Q9 l(k) in 9.&'(lRn+1). o Example 7. Elementary Solutions in g" of the operator P == L1\ kEN *. Let us denote by E'k-1 an elementary solution of L1 k - 1 in IR n 16. Then an elementary solution of L1 k in IRn may be found as a solution of the equation (2.10) L1E'k == E'k-1. (a) We shall seek the solutions of L1f == g depending only on the distance from the origin i.e. f == f(r), g == g (r). From the expression for the Laplacian in polar coordinates we must have n - 1 f" (r) + f' (r) == g(r) r (2.11 ) where g(r) is of the form r A . If A :j:. - 2 and A :j:. - n we find that the required s-olution is: (2.12) r A + 2 f(r) = (A + 2)(A + n) + C, where C is an arbitrary constant. If g(r) == r- 2, then (2.13) 1 f(r) == n _ 2 Log r + C, and if g(r) == r A Log r with A :j:. - 2 and A :j:. - n, we find that r A + 2 Log r (n + 2A + 2) A+2 (2.14) f(r) = (A + 2)(A + n) (A + 2)2(A + n)2 r + c. (b) We return to the determination of the E'k. We know (Example 6 1 7) that E == C n r 2 - n n  3. 16 With E = b. 17 Recall that C n = -{(n - 2)O"n}-1. 
g2. Linear Differential Operators with Constant Coefficients 187 We then solve equation (2.10) with k == 2, or (2.11) with g(r) == C n r 2 -n; we find that f(r) is given by C 4-n n r f(r) = 2(4 - n) ' We verify thatf(r) E Lfoc(lRn) so thatf clearly defines a distribution on [Rn. We must show further this distribution, denoted by n :j:. 2, n :j:. 4. C r 4 - n En _ n 2 - 2(4 - n)' n :j:. 2, n :j:. 4 satisfies in the sense of distributions L1E == E which is easily shown with the aid of Green's formula. Recommencing the preceding process, we shall find: C r 6 - n E n _ n 3 - 2.4(4 - n)(6 - n)' n :j:. 2, n :j:. 4, n -# 6. If n is odd, the exponent 2k - n will never be equal to - 2. We therefore obtain the formula En == C r 2k - n k n,k (2.15) . C n C k == . n, 2.4 . . . (2k - 2)(2k - n) . . . (4 - n) If n is even, formula (2.15) is true only for 2k  n - 2; for 2k  n, the exponents 2 - n, 4 - n arrive at the value - 2 and from (2.13) the following stage will lead to the appearance of a logarithmic term and to the use of the formula (2.14). In this case we find En E- _ 1 2 C -2 n, n - 2 r E "2 Cn, n - 2 (n _ 2) Log r En k C n ,n-2 r 2k - n Log r (n - 2) (2k - n) n(n + 2) . . . (2k - 2) == b r 2k - n Lo g r + pn ( r2 ) n,k k (k  ) where p is a polynomial of degree  k - !n such that L1 k P(r2) == O. For all n  2 we thus obtain for the elementary solution of L1 k in n: 
188 Chapter V. Linear Differential Operators (2.16) { C n k r2k-n, n odd or 2k < n En - ' k - b r 2k - n Lo g r n even and n  2k 18 n,k , ". Example 8. Elementary solution in [/' of the Cauchy-Riemann operator. The function E, locally integrable on [R2, defined by 1 E(x,y) = n(x + iy) - 1 ( 0 0 ) is the elementary solution of 0 == - - + i - . 2 ox oy Proof The fact that E E Lfoc([R2) comes from the fact that Ii dx dy f l f 21t P dp de == == 2n 19 B(O, 1) I x + iy I 0 0 p . On the other hand, we have for all q1 E Et1([R2) - - -1 II oqJ(X y) <oE,qJ) == -<E,oqJ) == - '. dxdy. n x + zy From Lebesgue's theorem, we then have  ) 1 1 . If OqJ(X, y) d d < u E, qJ == - - 1m . x y. n f -+ 0 x 2 + y2  8 2 X + z Y Passing to polar coordinates x == r cos e, y == r sin e, we see that the operator 0 transforms to 1 i8 [ 0 i 0 ] :2 e or +  oe . Hence, we find, on putting ip(r, e) == qJ(r cos e, r sin e), that 1 f + 00 f 21t ( 0 - . 0 - ) < 8E, qJ) == -- 2 lim qJ +   (r, e) dr de == lim If' n f-+O fOur r UU f-+O 1 f 21t (f +000CP ) i f +00 (f 21t O ip ) dr If == - - - (r, e) dr de - - - (r, e) de -, 2n 0 f or 2n f 0 oe r 1 f 21t i f + 00 dr If == _ 2 ip(B, e) de - _ 2 [ip(r, 2n) - ip(r, 0)] -, non f r and hence that < aE, qJ) == cP(O, e) == qJ(O,O) == < £5, qJ ). D The function E(x, y) == {n(x + iy)} -1 being oo on [R2 \ {O}, the operator 8 is hypo-elliptic (see Proposition 13). We know moreover from the theory of 18 This elementary solution does not satisfy the recurrence relation (2.10); we can verify that (2.16) is certainly true for n = 2. 19 B(O, 1) denoting the unit ball in [R2. 
2. Linear Differential Operators with Constant Coefficients 189 holomorphic functions that a solution u(x, y) of au == 0 is a holomorphic function of the complex variable z == x + iy and hence an analytic function of (x, y). The function E(x, y) is a tempered distribution. Every elementary solution in /7' of a is of the form E(x, y) + L ak(x + iy)k. kEr In fact, the polynomials L ak(x + iy)k are the only entire functions of x + iy defining a tempered distribution on [R2. D 2.3. Elementary Solution with Support in a Salient Closed Convex Cone: Hyperbolic Operator A cone in [Rn (with vertex the origin) is a part C of [Rn such that AC c C for all A > O. Let C 1 and C 2 be closed cones, with common vertex at the origin, such that C 1 n ( - C 2 ) == {O} then C 1 and C 2 are convolutive: indeed if they were not, there would exist sequences (x) and (x;) of C 1 and C 2 respectively such that the x + x; are bounded without x and x; being so; we can always extract a sub-sequence such that An == Ix I --+ 00 and x / An --+ x. We then have x; / An --+ - x and so x E C 1 n (- C 2 ), which is a contradiction since Ixl == 1. We say that a closed convex cone C with vertex 0 is salient if C n (- C) ==. {O}. More generally, we say that a cone C with vertex 0 is salient if its closed convex envelope is salient: this is equivalent (see, e.g. Rockafeller [1]) to the existence of a closed half-space H such that H n (- C) c {O}. We then have H + C c H. Let £&'(C) be the space of distributions with support in C. We know (see Appendix "Distributions" 92.4) that it is a convolution algebra, that is to say that f E £&' (C), E E £&'(C) => E * f E £&'(C). We can thus state: Proposition 14. Let P be a l.d.o. with constant coefficients and C a salient closed convex cone. The restriction of P to £&' (C) is a bijection iff there exists an elementary solution of P in £&' (C). Then this elementary solution is unique. If H is a closed half-space of [Rn such that H n (- C) c {O}, the restriction of P to £&' (H) is a bijection, and for u, f E £&' (H). Pu == f <=> u == E * f Proof Suppose that P has an elementary solution E E £&'(C). Being given H a closed half-space such that H n (- C) == {O}, for all f E £&'(H), the sup- ports of f and E are convolutive and E * f E £&' (H + C) == £&' (H). Hence u E £&' (H) --+ Pu E ' (H) is a bijection with inverse f E '(H) --+ E * f E £&'(H). We should obtain the same result by replacing H by a closed cone C' such that C' n ( - C) == {O} and C + C' == C', in particular for C' == C. Finally, since 
190 Chapter V. Linear Differential Operators b E ' (C), if P is a bijection of ' (C) onto itself, there exists a unique E E ' (C) such that P E == b. D Definition 7. Let C be a salient closed convex cone. We say that a l.d.o. with constant coefficients is hyperbolic with respect to C if it admits an elementary solution with support in C. We remark that if P and Q are two hyperbolic operators with respect to a salient closed convex cone C, the same holds for the composition PQ and the el- ementary solution of PQ in '(C) is the convolution of the elementary solutions of P and Q in '(C). Example 9. Ordinary Differential Equation. Let m d k P == L a k d k ' (am -# 0) k=O X be a l.d.o. with constant coefficients on , and Ai' . . . , Am the zeros of the m polynomial L akA k . Then k=l P = am ( d - Ai) . . . ( dd x - Am );  + is a salient closed convex cone of  and for A E C, EA(x) == Y(x)e AX (where Y is Heaviside's distribution), is the elementary solution in '(+) of djdx - A. Hence P is hyperbolic with respect to + and has elementary solution in '(+) (am)-l E A1 * . . . * EArn. It is the only elementary solution of P with support in  +, even, moreover, with support bounded from below. But there exist other elementary solutions with a support which is not bounded from below: for example, we saw above the elementary solutions in [/' of P == A - d 2 jdx 2 . It is interesting to note in this connection that an ordinary differential operator is at one and the same time hyperbolic and elliptic and moreover also parabolic according to the definition given later. Example 10. Decomposable Operator in the Real Domain. Let a == (ai' . . . , an) E n, A E C. The operator a a l _ a + Xl a +a--A n aX n has elementary solution E a ,;.: q> E g)(lR n ) -4 LO q>(ta) eAt dt which is a measure with support on the half-line  + a. 
2. Linear Differential Operators with Constant Coefficients 191 We say that a l.d.o. P is decomposable in the real domain if P = Ao Ii ( a kl ';') 0 +... + a kn ';')0 - Ak ) with Ak E C, a kj E fR. k=1 UX 1 UX n We can always choose 8 1 , . . . , 8m E { -1, + I} such that the closed convex cone C generated by the vectors 8 1 a 1 , . . . , 8m am is salient, where a k == (a k1 , . . . , a kn ). Then 8 1 . . . 8m E = AO E" a, .e, A, * . . . * Eemam.emAm is an elementary solution of P in ' (C): P is hyperbolic. In particular, if a 1 , . . . , am are linearly independent, we can complete them to form a basis in fR n ; let x' == hx be the change of variables in this basis, which leads to 8 1 == . . . == 8m == 1 and then we have <E,ip) = f(hip)(X'l"'" x;", 0,...,0) Y(X'l)'" Y (x) e A 1 X'l + ... + Am X dx'1 . . . dx. Applying this to the d' Alembertian in fR2 0 2 0 2 ( 0 0 ) ( 0 0 ) o == ot2 - ox2 == ot - ox ot + ox ' we see that the operator 0 is hyperbolic with respect to the cone {t > Ixl} with elementary solution t Y(t + x) Y(t - X)20. Making use of a characterisation which we shall show later we shall see that this is true for the d' Alembertian in a space of any dimension whatever. D 3. Characterisation of Hyperbolic Operators 3.1. Characteristics of a l.d.o. with Constant Coefficients We consider p. == I aaDa the principal part of the l.d.o. of order In, with lal=m constant coefficients, P == I aaDa. Following the definitions of 1.3,  E cn, lalm .: :j:. 0 is a characteristic vector of P if p. () == O. 20 We observe (see Treves [3J, p. 47) that 0 is also hyperbolic with respect to the cones (t  -Ixl}, {x  Itl}, {x  -Itl}. 
192 Chapter V. Linear Differential Operators The vector  is a simple characteristic if L ar m -1= O. aj A regular surface S is a characteristic surface of P if the normal vectors at each point of S are characteristic vectors of P. In particular if  is a characteristic vector of P, the hyperplanes {x: . x = c} are characteristic surfaces of P. We notice that the translate of a characteristic surface is again a characteristic surface (these properties follow from the fact that the coefficients of Pare constants). We give some examples. It follows from the definition that a l.d.o. with constant coefficients is elliptic iff it does not have real characteristic vectors or, again, iff it does not have real characteristic surfaces. The heat operator and the Schrodinger operator, more generally the operators (a/at - wL1 x ) in [Rn+ 1 with W E C*, admits as its only real characteristic vectors the vectors (0, . . . , 0, A), (A E [R*); these are multiple characteristics; the only real characteristic surfaces are hyperplanes {t = to}. The wave operator D = 0 2 / at 2 - L1 x in [Rn + 1 has real characteristics namely all the vectors (, + I I) with  E [Rn,  =1= O. These vectors are simple characteristics. Aside from the real characteristic hyperplanes, the surfaces t = to + Ix - Xo I and t = to - Ix - Xo I for x =1= xo, are characteristic sur- faces. This example shows that we can have characteristic surfaces which are not hyperplanes. The Hamiltonian system (see 1.4) of a l.d.o. with constant coefficients reduces to dX j ape eft = a;:.. (), 'J d;:' . =O dt ape whose solutions are: (t) = o, xj(t) = xJ + aj (o) t. The bicharacteristic curves are thus the straight lines ape Xj(t) = xJ + aj (o) t where o is a simple characteristic vector of P. For example in the case of the wave operator D, the bicharacteristic curves are the straight lines in [R:t 1 : x = Xo + (t - to)  ° wi th  ° E [Rn, o =1= o. D A l.d.o. with constant coefficients is obviously included in the class of those with analytic coefficients. We can therefore apply to it the theorems of Cauchy- Kowalewsky and Holmgren. We note, first of all, that apart from the local result of the theorem of Cauchy-Kowalewsky (see 91.5) we have a global result: 
g2. Linear Differential Operators with Constant Coefficients 193 Proposition 15. Given  E [Rn,  =I 0 such that p.() =I 0, f and g analytic functions on [Rn, and Xo E [Rn, there exists a unique analytic function u on [Rn which is a solution of the Cauchy problem Pu = f on [Rn, (u - g)(x + t) = O(t m ) as t  0, '\Ix E [Rn with .(x - Xo) = 0, where m is the order of P. Proof From Lemma 5 of 1.5, there exists a neighbourhood U of the origin such that for all Xo E [Rn, there exists a solution Uxo analytic on Xo + U of the Cauchy problem PUxo = f on Xo + U, (uxo - g)(x + t) = O(t m ) as t  0, '\Ix E Xo + U with  . (x - Xo) = o. Since U is independent of xo, we obtain the result by analytic continuation. D We shall now make use of Holmgren's theorem to characterise the hyperbolic operators. First of all, being given H a closed half-space of [Rn, we say that P is hyperbolic with respect to H if it is hyperbolic with respect to a closed convex cone C such that H n (- C) = {O}. We know (see Proposition 14) that it then admits one and only one elementary solution in ' (H). Suppose that H = {x; . x  O} with  E [Rn,  =I 0; we shall show later (see Corollary 3 of 3) that if P is hyperbolic with respect to H, then  is not a characteristic vector of P. We shall assume this result whose proof makes use of a completely different technique and establish the following characterisation: Proposition 16. Let H ° = {x: o. x  O} be a closed half-space of IRn with o E [Rn, o =I O. Then P is hyperbolic with respect to H ° iff it satisfies the two conditions: (i) o is not a characteristic of P; (ii) P admits an elementary solution with support in H 0. Proof We have assumed for the moment the fact that the hyperbolicity of P with respect to Ho implies (i). The necessary condition is thus obvious. Now suppose that p.(o) =I 0 and that P has an elementary solution E E '(Ho). We can always suppose that Iol = 1. Let us consider Co > 0 such that p.() =I 0, '\I E [Rn with I + ol < Co 
194 Chapter V. Linear Differential Operators and apply Holmgren's theorem; using Remark 2 of 1.5, we see that for every neighbourhood Do of X O E H 0 n (- H 0) relatively compact in [Rn \ {O}, E = 0 on {x E [Rn; dist (x, Do) < £5, Io' xl < £5} h s;: Co 11 . h 0 . f . d . ( n {O} ) were u = MA WIt 11 > satls Ylng 11 < 1st o, . 2yt20 In other words, for all X O E H 0 n (- H 0), E = 0 on {x E [Rn; Ix - xOI < clxOI} . h I Co WIt Co = MA' 2yt20 and so suppE c {x E Ho; o' x  clxl}, and this shows that P is hyperbolic with respect to H o. o (2.17) 3.2. Algebraic Characterisation of Hyperbolic Operators Let us be given o E [Rn, Iol = 1 and put Ho = {x:o'x  O}. We shall show in 3 (see Proposition 11), that a l.d.o. with constant coefficients P is hyperbolic with respect to Ho iff { there exist To E [R, C E [R + such that for all (', z) E C n X C, o., = 0 and P(zo + i') = 0 => Rez  To + Cllm'l. In addition then, P is hyperbolic with respect to the cone {x + to; o.x = 0, Ixl  Ct}. The technique used to prove this result is entirely different from those used in this section. We shall use this characterisation to show (see Corollary 3 of 3) that if P is hyperbolic with respect to Ho then p.(o) i= O. We shall likewise make use of this characterisation to show (see Corollary 2 of 3) that P is hyperbolic with respect to H 0 iff it is so with respect to - H 0 and hence that P is hyperbolic with respect to a salient closed convex cone C iff tp is hyperbolic with respect to C. We shall simplify the algebraic condition (2.17). It is immediate that it implies (2.18) { there exiss "0 E  such that P(To + l) = 0 => T  To. In fact, putting (J = o., , =  - (Jo' z = T + i(J, and applying (2.17) we have, since o., = 0 and' E [R1\ that P(To + i) = 0 => T = Rez  To + Cllm'l = To. 
g2. Linear Differential Operators with Constant Coefficients 195 We can expand P(To + i() into TmoPmo(() + . . ., (Pmo =1= 0) where pmo'... are polynomials in (E Ho n (-Ho). We shall prove in 9 3 (see Proposition 7) that the property (2.18) implies the existence of an elementary solution of P with support in H 0 if it is assumed that Pmo(() i= 0 for all ( E H 0 n (- H 0); this condition is obviously satisfied if p.( o) -:j:. 0, since then mo == m and Pmo - p.(o). As for the characterisation (2.17), this existence theorem is established by a technique totally different from those of this section. Assuming these results, we can now state, taking account of Proposition 16, the following algebraic characterisation of hyperbolicity: Proposition 17. A l.d.o. with constant coefficients P is hyperbolic with respect to H 0 iff it satisfies the following two conditions: (i) o is not a characteristic of P, (ii) P satisfies (2.18). When P is homogeneous, i.e. when P == p., the condition (2.18) reduces to P.(To + i) == 0 => T == 0, V(, T) E [Rn+ 1 which, using homogeneity and decomposing  == (Jo + ( with o. ( == 0, we rewri te as: For all y E [Rn with o. Y == 0, the polynomial in Z E C, p.(y + zo) has all its zeros real. Since the condition p.( o) -:j:. 0 expresses the fact that the polynomial p.(y + zo) is of degree m in z we obtain the particularly simple charac- terisation contained in: Corollary 1. A homogeneous l.d.o. p. of order m and with constant coefficients is hyperbolic with respect to H 0 iff for all y E [Rn with o. Y == 0, the polynomial p.(y + zo) has m real zeros (distinct or multiple). In particular, to within a multiplicative constant, p. has real coefficients. We observe that the principal part of a hyperbolic operator is again hyperbolic. In fact P.(To + i) == lim 8 m P ( TO + i ) e-+ 0 8 SO P.(TO + i) i= 0 implies P((T/8) o + i/8) i= 0 for 8 sufficiently small and hence, if P satisfies (2.18), T/8  To for 8 sufficiently small, from which T == 0 21 . The converse is, in general, false as we shall see in the examples below. On the 21 We make use of the fact that if a family of polynomials Pc;(z) of degree m converge to p(z) of the same degree {z E C; p(z) = o} C {z E C; 3£k  0, Zk  z such that Pc;k(zd = O}. 
196 Chapter V. Linear Differential Operators contrary, it is true if the zeros of p.(y + ao) are simple, that is to say if the real characteristics of P are simple. More exactly, let us prove: Proposition 18. Let P be a l.d.o. with constant coefficients satisfying: { for all y E [Rn, y =1= 0 with o. Y = 0, the zeros of the polynomial p.(y + zo) are real and distinct. Then P is hyperbolic with respect to H o. We say then that P is strictly hyperbolic (or hyperbolic in the sense of Petrovsky) with respect to H 0 . We notice that P is strictly hyperbolic with respect to H 0 iff P is hyperbolic with respect to H 0 and all its characteristics are simple; also P is strictly hyperbolic with respect to H 0 iff it is so with respect to - H 0' and iff it is true of its principal part. Proof of Proposition 18. Let us suppose that there exist (rk' k) E [Rn+ 1 such that P(LkO + ik) = 0 and Lk  00, when k  00. Putting ak = k. o, Yk = k - akO' Zk = ak - iLk and Ak = ILkO + ikl = IYk + ZkO I, we have Ak  Lk and so Ak  00 when k -+ 00. We can always suppose, after extracting a subsequence, that Yk Ak -+ Yo and Zk - Z Ak 0 and we shall have O.Yo = 0, Iyo + zool 1. Now there exists a constant C such that IP()I  Ip.()1 - CIlm-l, and hence p. ( Yk + Zk o ) = A;mlp.(LkO + ik)1  CA;l. Ak Ak In the limit p.(yo + Zo o) = o. We have necessarily Yo =1= 0, since p.(o) =1= 0 and Iyo + zool = 1. Using the hypothesis and the implicit function theorem, there exists U a neighbourhood of Yo in {y; o. Y = O}, r > 0 and a: U  [R such that { for all y E U, Z = a(y) is the unique zero of p.(y + zo) in the disk {z; I Z - Z 0 I < r}. 
g2. Linear Differential Operators with Constant Coefficients 197 In other words, there exist a constant c > 0 such that { p.(y + zo)1  clz - (J(Y) I for all Y E U & Iz - Zo I < r. For k sufficiently large, we have Yk/AkEU and I(Zk/Ak) - zol < r, so C  Ak r c: + : o)  C Zk - Ak Cl c: ) and from the definition of Zk Zk - Ak Cl c: ) 2 = Cl k - AkCl C: ) - iT k 2 ( Yk ) 2 Cl k - Ak Cl Ak 2......... 2 + Tk -;;::; T k , which contradicts the hypothesis that Tk -+ 00. Finally we prove the Proposition 19. Suppose that P with constant coefficients is hyperbolic with respect to H o. Then the smallest closed convex cone containing the support of the elementary solution of P in ' (H 0) is c = {x E Ho; .x  0 for all  E ro} where r 0 = { E [Rn; p.( + !o) i= 0 for all T  O}. (We call C the support cone of the elementary solution of P in ' (H 0)). Proof First of all we observe that, being given  E [Rn we have that the polynomial p.( + zo) has all its zeros real, since on writing  = (Jo + Y with (J = o.; p.( + zo) = p.(y + (z + (J)o)' Hence roE { E [Rn such that the zeros of the polynomial p.( + zo) are all strictly negative}. We deduce by continuity, that r 0 is an open set of r = { E [Rn; P · ( ) i= O} and to n r = r 0 . We denote by E the elementary solution of P in '(Ho). The smallest closed convex cone containing supp E being by definition salient is the intersection of the closed half-spaces H = {. x  O} such that supp E c {1]. x  O} for every 1] in the neighbourhood of . Putting r = { E [Rn,  i= 0; supp E c {t].x  O}, \/1] in the nbd. of} we observe that it is an open convex cone and that from Proposition 18, r n roe r. Finally we remark that o E r 0 n r, and that r 0 is star-shaped with respect to o' since being given  E r 0 and A E JO, 1J, r(A + (1 - A)o + To) = Amr( + (+ 1 A )o)#O, \/T  O. 
198 Chapter V. Linear Differential Operators By connectedness we deduce that r 0 == r; as a matter of fact r 0 is a connected open and closed set in r, so the connected component of o' r is a connected open set in r so contained in ro; since r is closed in ro, it is equal to roo This proves the Proposition. D Remark 3. We have shown that ro is an open convex cone and that ro is the connected component containing o of the open set r == { E [Rn; P · ( ) -1= O}; we see also that the support cone of the elementary solution of P in f0'(Ho) depends only on the principal part of P. D 3.3. Hyperbolic Operators of Order 2 We first of all consider the d' Alem bertian D == 0 2 / ot 2 - .1 x on [R 1. It is homogeneous of order 2; for all y E [Rn, y -1= 0, the polynomial Z2 - y2 has two simple real zeros z == + I y I. Hence D is strictly hyperbolic with respect to [Rn X [R+ == {(x, t); x E [Rn, t > O}. Now ro == {(y, s) E [Rn+l; (s + T)2 -1= y2, VT  O} is the open cone {(y, s) E [Rn+ 1; S > I yl}. Hence the support cone of the elementary solution of D in f0' ([Rn X [R +) is {(x, t); t  Ixl}. We shall now consider the hyperbolic operators (with constant coefficients) of order 2 on [Rn. We shall establish the Theorem 3. Every hyperbolic operator of order 2 on [Rn can be redu.ced either to 0 2 - -;2 + a o UX I or to 0 2 _ 2 - .1 ( ) + a o , with s  n - 1.  X2,...,Xs+l UX I Proof (1 ) We consider first the case of a homogeneous l.d.o. of order 2. 0 2 P 2 == L L ajk 0 0 with ajk == akj E C. j k X j X k We know (see Corollary 1) that if it is hyperbolic, it has, within a multiplicative constant, real coefficients. Hence ajk == Aa}k where (a}k) is a real symmetric matrix. Considering the eigenvalues of this real symmetric matrix, we can rank them (AI' . . . , An - /11' . . ., - /1s, 0, . . . , 0) with Aj, /1k > 0; free to change A to 
g2. Linear Differential Operators with Constant Coefficients 199 -A, we can suppose that 0  r  s  n and we have 0 < r + s  n. There is a change of variables in the affine space [Rn, x' = hx such that 0 2 0 2 0 2 0 2 h* A - 1 P 2 =  X ' 12 + . . . + _ 2 - 2 -... - a ,2 . u aX aX+1 X r + s We show that P2 is hyperbolic iff r = 1 or (r = 0 and s = 1), which proves the stated result in the homogeneous case. To simplify the notation, without loss of generality, since P 2 is hyperbolic iff h* A -1 P2 is hyperbolic, we suppose that 0 2 0 2 8 2 0 2 (2.19) P = - + . . . + - - - . . . - 2 axi ax; ax;+ 1 ax;+s with o  r  s  n, o < r + s  n. From Corollary 1, P 2 is hyperbolic with respect to {o. X  O} iff { for all Y E [Rn with  0 . Y = 0 the polynomial P 2 (y + z 0) has two real zeros. We have P 2 (Y + zo) = P 2 (Y) + 2z( Y1 + . . . +  Yr - + 1 Yr+ 1 - +sYr+s) + Z2 P2(0)' and hence the condition can be written { P2(0) =I 0 and for all Y E [Rn with o. Y = 0 (2.20) (Yl +... + Yr - +1Yr+l -... - +sYr+s)2  P2(O)P2(Y). We notice that this implies that r = 1 or (r = 0, s = 1). In fact if r  2, since s  r, for all A  0, there exists Y E [Rn such that ;:0 1 Y 1 + . . . + ;:0 Y _;:0 Y + . . . + ;: Y - 0 S Sr r - Sr+1 r+1 S+s r+s - , Yj = 0 for j > r + s, yi + . . . + Y; = 1, Y; + 1 + . . . + Y; + s = A. Substituting in (2.20), we obtain the contradiction P 2 (  0) =I 0, o  (1 - A) P 2 (  0 ), V A  o. If r = 0 then necessarily s = 1. More precisely, we show that if P 2 is hyperbolic with respect to {o. X  O} with P 2 (o) < 0 (which will be the case if r = 0) then s = 1. In fact if s  2, then using Y E [Rn such that Y1 = . . . = Yr = + 1 Yr+ 1 + . . . + +sYr+s = Yr+s+ 1 . . . = Yn = 0 and Y 2 + ... + Y 2 - 1 r+1 r+s - , 
200 Chapter V. Linear Differential Operators and substituting in (2.20), we obtain the contradiction P2(O) < 0  - P2(O)P2(Y) = P2(O). C I . f 02 . onverse y, 1 r = 1, P 2 =  - L1(X2' . . . , X s + 1 ) IS the d'Alembertian in uX l (X2, . . . , X s + 1; Xl)' hyperbolic with respect to the cone {X E [Rn; Xl  (x + . . . + X;+ 1 )1/2, X s + 2 = . . . = X n = O} and if r = 0, s = 1, P 2 = - 0 2 joxi is hyperbolic with respect to the semi- infinite straight line {(Xl' 0, . . . , 0); Xl  o}. (2) Considering now the general case of a l.d.o. 0 2 0 P = IL>jk a a + 2 Ib j -;- + 2a, j k x j X k uX j if P is hyperbolic, it has the same principal part 0 2 P 2 = I I ajk 0 0 . j k X j X k To within a multiplicative constant and a change of variables in the affine space [Rn, we can therefore suppose 0 2 0 2 P 2 = - - or P 2 = - - L1 ( oxi oxi X2,..., xs+d. We now show that P is hyperbolic iff b j = 0 for all j > 1 in the case p 2 = - 0 2 j oxi and all j > s + 1 in the second case. Using then Q = e -q> Peq> with qJ(x) = - b 1 Xl 0 2 in the case P 2 = -  uX l and qJ(x) = b 1 Xl - b 2 X 2 - . . . - b s + 1 Xs+ 1 in the second case, we shall have (see Sects. 1.3 and 1.6) Q = P 2 + a o reduce to one of the stated forms. Let us recall first of all (see Proposition 19 and Remark 3) that if P is hyper- bolic with respect to {o. X  O}, the connected component roof o in r = {; P2() =I- O} is an open convex cone and P is hyperbolic with respect to {  . X  O} and {. X  0 } for all  E r 0 . In the case P 2 = - 0 2 joxi, r has two connected components: the open half- spaces {1 > O} and {1 < O}; hence if P is hyperbolic, it is so with respect to {Xl  O}. 0 2 In the case P 2 =  - L1(X2,..., Xs+ d with s  2, r has three connected com- uX l ponents: the cone {1 > ( + . . . + ; + 1 )1/2} and its opposite and the open 
2. Linear Differential Operators with Constant Coefficients 201 set {I <  + . . . + ; + l} which is not a convex cone; hence, in the same way, if P is hyperbolic, it is with respect to {Xl  O}. Finally in the case P 2 = 02joxI - 02jox, r has four connected components: the cones { +  l, + 2 > O}. Hence in this case we can, for the moment, only affirm that if P is hyperbolic, necessarily it is with respect to one of the half- spaces {Xl  O} or {X2  O}. Now to complete we suppose that P is hyperbolic with respect to {Xl  O}. Then, from Proposition 17, there exists a constant 'to such that P(il + 't, i2' . . . , in) = 0 => 't  'to for all  E [Rn. Suppose, e.g., that P 2 0 2 -- OXI A(X2,...,X s +d with s + 1 < n and b n =1= o. We show that, given 't > 'tci = max(ro,O), there exists l' 2' n E [R such that P(i 1 + 't, i2' 0, . . . , 0, in) = O. This contradicts the hypo- thesis of hyperbolicity. The other cases are shown in the same way. We have P(il + 't,i2,0,...,O,in) = 't 2 +  - I - 2Imbnn,+ 2Rea + 2i( 1 't + Re bnn + 1m a). If b n =1= 0, there exists A E IR such that ( Re bnA + 1m a ) 2 . + 2 ImbnA  .2 + 2Rea; the values n = A, l = - T-l(RebnA + 1m a), and 2 = (I + 2lmbnn - 't 2 - 2Rea)l/2, give the solution of the problem. o To complete the analysis, we observe that being given an operator P of order 2 with real principal part P2 = L L a jk a a; , and without multiple charac- j k Xj X k teristics that is to say lajk I =1= 0, then we can distinguish three cases. Case 1: the eigenvalues of (a jk ) all have the same sign; then P is elliptic and can be reduced to A + ao. Case 2: the eigenvalues all have the same sign, except for one of the opposite sign; then P is strictly hyperbolic and can be reduced to D + a o . Case 3: which arises only if the dimension n  4 when P can be reduced to A - A + a o with 2  r  .1 n (x 1 , . . . , x r ) (X r + 1, . . . , X n ) 2 We then say that P is ultra-hyperbolic (but it is not hyperbolic). 
202 Chapter V. Linear Differential Operators 4. Parabolic Operators There exist numerous classes of operators which have elementary solutions with support in a closed half-space. We already have the hyperbolic operators. If we now consider the operator in IRn + 1, a p = at - wAx, with w E C* and -tn::::; arg w ::::; tn, it has the elementary solution, with support in IR n x IR+ == {(x, t); t  O} (see 93.6.2): Y(t) ( Ixl2 inargw ) E(x, t) = (4n Iwl t)"!2 exp - 4wt - 2 . In the case - tn < arg w < tn, this elementary solution is in CfjOO (IRn + 1 \ {O}); furthermore, we have already seen that in this case the operator P is hypo... elliptic (see Sect. 1.5). In the case larg wi == tn, i.e. to within a change of the time, the Schrodinger operator, we have IE(x, t)1 == Y(t)/(4nlwlt)n/2 and E is not continuous at any point (x, 0). A fortiori, E  Cfj 00 (IRn + 1 \ {O}) and hence P is not hypo-elliptic. For w = - i (hence for tbe operator :t + i Ax ). we find the "usual" formula: Y(t) ( . nn ) ( .IX I2 ) E(x, t) = (4nt)"!2 exp 14 exp -1"4t, t  0 and iE(x, t) is thus the elementary solution ofthe operator ( :t + Ax), and for w = + i (hence for the operator :t - iAx). we find E(x, t) = (4:X!2 exp ( - in ) exp (i 1:2 ). In a general way we have: Definition 8. Let P be a l.d.o. with constant coefficients on IRn and H a closed half-space in IRn. We say that P is parabolic with respect to H if it has an elementary solution in f!fi' (H) n Cfjoo (IRn \ {O}). We know that then P is hypo-elliptique (see Proposition 13). In the case n == 1, every operator is parabolic with respect to IR + and IR -. In the case n  2, a parabolic operator is not elliptic, since it admits an elementary solution which vanishes on a non-empty open set. We shall see that it is no longer hyperbolic. This results in: 
2. Linear Differential Operators with Constant Coefficients 203 Proposition 20. Let P be a i.d.o. with constant coefficients, parabolic with respect to H = {o. x  O}. Then there exist constants c > 0 and C > 0 such that { 'VTo  0, 'V(, T) E IRn x [- To, 00[, P(TO + i) = 0 => ITI + II  Ce Cto . We deduce the Corollary 2. Suppose that n  2. If P is parabolic with respect to H = {o. x  O}, then o is a characteristic of P. In particular, P is not hyperbolic. Proof of Corollary 2. If o is not a characteristic of P, from Proposition 16, P will be hyperbolic with respect to H and so also with respect to - H and so, since P is hypo-elliptic, P will also be parabolic with respect to - H. Using the Proposition, we see that there will exist C such that { for all (, T) E IRn + \ P(To + i) = 0 => ITI + II  c. Since n  2, there exists y E IRn+ 1 such that o. Y = 0 and Iyl > c. Then the polynomial P(zo + iy) does not have zeros in C, for if z = T + ia, P(zo + iy) = 0 => Iyl  ITI + lao + yl  C. Hence P(zo + iy) is constant in z, that is to. say that P is independent of o and hence a fortiori p.(o) = o. o Proof of Proposition 20. Let E be an elementary solution of P in '(H) n rcoo(lRn \ {O}). Assume that we are given p E  with p = 1 in the neighbourhood of the origin. We have P(pE) = b + t/J where t/J E  and supp t/J c H. Taking Fourier transforms we have (2.21 ) /""--... A. P pE = 1 + t/J where p is the characteristic polynomial of P. Since pE and t/J have compact /""--... A. supports, pE and t/J extend to holomorphic functions in cn and the equality (2.21) is still true for these extensions. We consider ( =  - iTo with (, T) E IRn x [- To, 00 [ where To  O. We have Re ( - i( . X) = - T  0 . x  T 0  0 . x for all x E H. Hence I(j tfr(OI = r e - i, x Dj if; (X) dx  eC<o r IDj if; (X) I dx, J n J n with c = sup {o. x; X E supp t/J}. 
204 Chapter V. Linear Differential Operators Since I'Ll + II  CoL I(il with Co depending only on Iol, -A. P('Lo + i) = p(() = 0  t/!(() = - 1  I'Ll + II  Ce to with C = Co L f'Djl/t' dx. o Remark 4. We have adopted as the definition of a parabolic operator that due to Hormander [1J (p. 152). We should however note that there exists a whole set of operators having some of the essential properties of the heat operator and that this definition of a parabolic operator does not enable us to recover all the essential properties of the heat operator. This will appear clearly in 3. o 3. Cauchy Problem for DitIerential Operators with Constant Coefficients Throughout this section, we shall take P to be a l.d.o. with constant coefficients on IR . We can consider P(, 'L) as a polynomial in  with coefficients in C [J, 1.e. am p=p -+...+P m atm 0 where Po, . . . , Pm are l.d.o. with constant coefficients on IRn. We suppose that m is the order of P with respect to a/at, i.e. P m =f=. O. Let] to, t 1 [ be an interval of IR with to > - CfJ (resp. t 1 < + CfJ). For kEN, the spaces k(]to, t1[;')22 and k([to' t 1 [; ') (resp.k(]to, t 1 ]; ')) are spaces of distributions on Q = IR n x J to, t 1 [ and we have for all dku aku u E k(Jto, t 1 [; '), dt k = atk in '(Q). Being given u o , . . . , U m - 1 E ' and f E '(Q), we shall consider the direct or forward (resp. backward) Cauchy problem on]t o , t 1 [ U E m-1([to, t 1 [; ') (resp. m-1(Jto, t 1 J; ')) Pu = f in '(Q) : (to) = Uk (res p . : (tt)=U k ) for k=O,..., m - 1. 22 Unless the contrary is stated, the notation , ', lff, lff' etc., will be reserved for distributions on [Rn. 
9 3 . Cauchy Problem for Differential Operators with Constant Coefficients 205 The operator P being invariant under a translation, we can always suppose that to == 0 (resp. t 1 == 0). Then we pass from the backward Cauchy problem to the direct Cauchy problem by reversal of the time, t' == - t; the image of P is , a ( 1 ) m P am P == Po - P 1 at' + . . . + - m a t'm . We can therefore restrict ourselves to the study to the direct Cauchy problem on JO, T[ with 0 < T  00. With concern for suplicity, we shall suppose that T == 00 and denote by PC ( Uo, . . . , Urn _ 1 ; f) the direct Cauchy problem on J 0, 00 [with the initial data uo, . . . , Urn - 1 and the second member f 1. Cauchy Problem and the Elementary Solution in q}'«(Rft x (R+) We notice that IR + == [0, 00 [ and ' (IR n x IR +) is the space of distributions on IR n + 1 with support in IR n x IR + . The operator Dt == a/at is an isomorphism of the space ' (IRn x IR +) onto itself. The inverse operator D t - 1 is the convolution with F 0 == b(x) @ Y(t)23 whose support {O} x IR + is convolutive with IRn x IR + (see Appendix "Distributions", 2). Given u E  ([0, 00 [; ,@'), Y u : ( E .@ ( IR n +1) -+ LX) < u ( t), (( . , t) > d t E IR is a distribution with support in IR n x IR + and the mapping u  Yu is a continuous injection of  ([0, 00 [; ,@') into ' (IR n x IR + )24. Suppose now that u E r([o, 00[; '); integrating by parts we have or dku d r - k - 1 dru ot' Yu = k"5;, dt k (0) (8) dt,-k-l J(t) + Y dt' . In particular, considering clef { d k U } ([O, 00 [; ') == U E r([o, 00 [; '); dt k (0) == 0 for k == 0, . . . , r -1 23 Recalling that b is the Dirac distribution (at the origin) in [Rn, Y is the Heaviside function (on [R), we see that F 0 is the distribution on [Rn + 1, (Fo,O = IX> (0, t)dt. 24 This is the "extension by 0" on [Rn X ] - 00, 0[. 
206 Chapter V. Linear Differential Operators we have iyr dru - Yu == Y - Vu E cer o([ O, 00 [ ; !!fi' ) . at r dt r ' Passing to the inverse operator we see that D t- r maps yce ([ 0, 00 [; !!fi') onto Yce ([0, 00 [; !!fi'). Let us takef E ce ([0, ex) [ ; C') and g E ce ([0,00 [; !!fi'). Then the supports of Yf and Yg in [Rn+ 1 are convolutive and we have Yf * Yg == Y(f * g) wheref * g is the element of ce ([0, 00 [; !!fi') defined by (f* g)(t) = LI(t - s) * g(s)ds = LI(S) * g(t - s)ds. IffE ce 1 ([0, 00 [;C') (resp. g E ce 1 ([0, oo[;!!fi')), then 1 * 9 E CC6 ([0,00 [; f0') and :t (f * g)(t) = 1(0) * g(t) + ( : * 9 }t) (resp.!(t) * g(O)+(I*  }t)). By recurrence, we deduce that (a) Iff E ce ([0, 00 [; C') g E ceo ([0, 00 [; !!fi') then d r + s drf dSg f * g E ce+s ([0, 00 [; !!fi') and dt r + s (f * g) == dtr * dt S ' (b) If, in addition,fE ce r + 1 ([0, oo[;C') (resp. g E ce s + 1 ([0, oo[;!!fi')) then f * g E ce+s+ 1 ([0, 00 [; !!fi '). We now state: Proposition 1. Let Eo E ceo - 1 ([0, 00 [ ; !!fi '). Then YEo is an elementary solution of P == Pm am / atm + P m-1 a m - 1 / at m - 1 + . . . Po iff fo * Eo is a solution of PC(O,...,O;fo),wherefo(t) = 6 25 . In addition, the functions Ek defined by dE k Ek+ 1 == (it + P m-kEO for k == 1,. . . , m - 1 (3.1) E 1 == PmEO' are in ce m ([0, ex) [; !!fi') and for all Uo, . . . , u m - 1 E C' and f E ce ([0, 00 [; C'), the function u defined by (3.2) u(t) == L Em-k (t) * Uk + (Eo * f)(t) is a solution of PC(uo, . . . , u m - 1 ; f). 25 That is to say that fo is the C'.)ilstant mapping: t E [R +  b E ', Fo = Yfo = b(x) Q9 Y(t). 
3. Cauchy Problem for Differential Operators with Constant Coefficients 207 I n particular, Em _ k is a solution of PC (0, . . . , b, 0, . . . ; 0) where b occupies the (k + 1 )-th place (for k = 0, . . . , m - 1). Proof. We have D t - l YE o = Yfo * YE o = Y(fo * Eo). Now since Eo E CC'O-l ([0, 00 [; !0'), fo * Eo E CC '0 ( [0, 00 [; !0') and PY(fo * Eo) = YP(fo * Eo). Hencefo * Fo is a solution of PC(O,. . . ;fo) iff Yfo = YP(fo * Eo) = PY(fo * Eo) = PD t - l YE o = D t - l PYE o , i.e. PYE o = DtYfo = b, thus YE o is the elementary solution of P. Then for fE CC([O, 00[; C'),f* Eo E CC'O-l ([0,00 [; !0') and d m - l ( dm-l ) dt m - l (f * Eo)(O) = f * dt m - l Eo (0) = O. On the other hand PY(f* Eo) = Yf* PYE o = Yf hence P(f* Eo) =f In !0'(Q). In other words f * Eo is a solution of PC(O, . . . , 0; f). We observe that if f E CC l ([0, 00 [; C '), then f * Eo E CC'O ([0, 00 [; !0 '). Now u is a slution of PC (u o , . . . , u m - l ;f) iff ii = u - w is a solution of PC(O, . . . , 0; f) with t k _ w(t) = L k! Uk and f=f- Pw. Hence for Uo,..., U m - l E C' and f E CC ([0, 00 [; C'), the function u = w - Eo * Pw + Eo * fis a solution of PC(u o , . . . , u m - l ;f). Writing out w - Eo * Pw, we have (w - Eo * Pw)(t) = L Em-k(t) * Uk' with t k Ek = fm-k - Eo * Pfm-k where h(t) = k! b. We notice thatfm-k and Pf-k E CCOO([O, 00 [; C'), so Eo * Pfm-k E CC'O([O, 00 [; !0') and Ek E CCm([O, 00 [; !0'). Finally let us show that the Ek are defined by (3.1). We have d m - l fm-l E CC'O([O, 00 [; C') and dt m - l fm-l (0) = b, from which PYfm-l = Pm b + YPfm-l and YE l Yfm-l - YE o * (PYfm-l - Pm b) = YPmEO. 
208 Chapter V. Linear Differential Operators For k = 1, . . . , m - 1 dE k dt (t) = Ek+ 1 (t) - Eo(t) * Pfm-k(O) which, since Pfm-k(O) = P m - k b, dE k Ek+1 = dt + Pm-kEO' D becomes This Proposition 1 shows that we can solve the Cauchy problem for data in C', if we know an elementary solution in ' (IR n x IR +), having in addition a certain regularity. We shall now show that when Pm is of order 0, this regularity is automatically satisfied: Proposition 2. Let us suppose that Pm is of order 0 and that E E ' (IRn x IR + ). am am - 1 Then E is an elementary solution of P = Pm + P m - 1 a -1 + . . . + Po ut m t m iff Pm E = YE 1 where E 1 is a solution of PC(O,..., 0, b; 0). Also E 1 E OO([O, CIJ [; '). Proof. We may suppose that Pm = 1. Suppose first of all that E 1 is a solution of PC(O, . . . , 0, b; 0). Then am Ok E 1 E O-1([0, CIJ [; ') and  m E 1 = - L P k  k E 1 E ([O, CIJ [; '). ut k<m ut Hence E 1 E m([o, CIJ [; ') and by recurrence E 1 E OO([O, CIJ [; '). On the other hand, we have d m - 1 E PYE 1 = dt m - 11 (0) (8) b(t)  YPE 1 = b, so YE 1 is an elementary solution of P. Suppose now that E is an elementary solution of P; to prove that P mE = YE l' with E 1 a solution of PC(O, . . . , 0, b; 0) it follows from Proposition 1 that it suffices to show that E E Yo-1 ([0, CIJ [; '), or also that for 0 < T < CIJ and Q a bounded open set in IR n , (both arbitrary), E E Yo-1 ([0, T]; '(Q)). Now from the local representation theorem 26 , there exists r E N such that Dt- r E E Yo-1 ([0, T]; '(Q)). Suppose, to the contrary, that the smallest r E N such that Dt- r E E Yo-1 ([0, T]; '(Q)) 26 See footnote 2 in 1. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 209 is non zero. We should then have D m - 1 D -r+1 E - D -r D m E = D-r b - '\' D k P D-r E t t - t t t f...J t k t k<m which is in Yrc([O, T]; {0'(Q)) and so D t- r + 1 E E D t- m + 1 yrc ( [0, T]; {0' ( Q) ) = Yrc - 1 ([0, T]; {0' ( Q)), which is contradictory to the choice of the smallest value of r. o 2. Propagation in Hyperbolic Cauchy Problems We suppose here that P admits an elementary solution E = YE o with Eo E rc-1 ([0, CIJ [; C'). We shall see later that it is equivalent to the hypothesis that P is hyperbolic with respect to [Rn x [R+ (see Proposition 11). We remark only that, from Proposition 1, this implies that Pm is of order 0, since d;tl_o (0) is an elementary solution of Pm in C'. We deduce that the functions satisfy Ek E rc 00 ([0, ex) [; C') with, for all t E ]0, ex) [ supp Ek(t) C K(t) with K(t) x {t} = supp E n ([Rn X {t}). For all U o , . . . , U m - 1 E {0' and f E rc([O, ex)]; {0'), the formula (3.2) defines a solution U of PC(u o , . . . , u m - 1 ;f) which is, in fact, in rcm([O, ex) [; {0'). Using the property of the support of a convolution, w e have, for all t E ]0, CIJ [, supp u(t) c ( u sUPPU k + K(t) ) U ( U suppf(s) + K(t -S) ) . O<s<t Being given an open set Q of [Rn, let us put Q(t) = Q - K(t); hence Q ( t) x { O} = (Q x {t} - s u pp E) n [R n X {O}. Also let us put Q(t) = U (Q - K(t - s)) x {s} = (Q x {t} - supp E) n [Rn X ]0, t[. O<s<t Then Uk = 0 on Q(t) => (supp Uk + K(t)) n Q == 0 f= 0 on Q(t) => Q n ( U suppf(s) + K(t - S) ) = 0. O<s<t Finally, if U E rc m ( [0, ex) [; {0'), U is necessarily given by (3.2) with dku Uk = dt k (0) and f = Pu. 
210 Chapter V. Linear Differential Operators In effect, this reduces to showing that if u E CC'/; ([0, CIJ [; !0') with Pu == 0, then u == 0; but if u E CC'/; ([0, CIJ [; !0'), P Yu == YPu and so Yu == Y(Pu * Eo), from which u == 0 if in addition Pu == O. Then we have Proposition 3. Let u E CC m ( [0, CIJ [; !0'), Q be an open set of IRn and t E ] 0, CIJ [; let us suppose dkujdtk(O) == 0 on Q(t) for k == 0,. . ., m - 1 and Pu == 0 on Q(t), then u(t) == 0 on Q. The open set Q(t) is called the domain of dependence of Q x {t}; the value of u(t) on Q depends only on the values of Pu on the domain of dependence Q(t) and on that, for k  0, . . . , m - 1, of okuj ot k on the base of the domain of dependence, Q(t) x {O}. Q o domain of dependence Q(t) x base of domain of dependence Q(t) Fig. 1 The problem of the propagation of the singularities, that is to say the study of the singular support 27 of u(t), is more delicate. In the case f = 0, we have from the property of the singular support of a convolution (see Dieudonne [IJ, Chap. XXIII, 9 2 3.12.11) supp sing u(t) c U supp sing Uk + Ks(t) where Ks(t) == supp sing E n IR n x {t}, as supp sing Ek(t) c Ks(t). In the case f =1= 0, the difficulty comes from the fact that being gIven g E CC([O, T]; !0'), SUPP sing (f: g(s) ds ) is not, in general, contained in U supp sing g(t), OtT 27 See the definition in 92, in the proof of Proposition 13. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 211 since g(t) E CX)(Q) for all t E [0, T] does not imply that f: g(t) dt E 'fl 00 (.0). The theory of the singular spectrum 28 of a distribution enables us to describe the phenomenon of the propagation of singularities in the case of general f (see Hormander [2]). We shall consider only the following particular case: Proposition 4. Let u E m ([0, 00 [; ') and let Q be an open set of [Rn and dku t E JO, 00[. Suppose that dt k (0) E 'floo(.os(t))forallk = O,...,m - 1 and that D Pu E Ll(Qs(t))for allll E N n , then u(t) E CX)(Q), where Qs(t). x {O} == (Q x {t} - supp sing E) n [Rn X {O} Qs(t) == (Q x {t} - suppsingE)n [Rn x ]0, t[. Proof I t remains to prove that under the hypothesis D  f EL l (Qs( t)) for all rl E N n where f == Pu E  ([0, 00 [; '), we have If(S) * Eo(t - s)ds E 18"(.0). Let Ql be an open relatively compact set of Q; there exists Pl E ([Rn X [0, t]) such that SUPPPl c (Q x {t} - suppsing E) n [Rn X [O,t] andpl == 1 onQl == (Ql x {t}-suppsingE)n [Rn x ]O,t[. From the hypothesis, we have P 1 f EL l (0, t; ) and so, since I Pi (s)f(s) * Eo(t - s)ds E f». Eo E ([O, t J; g'), Now let Q2 be an open relatively compact set in Ql and P2 E  ([Rn X [0, tJ) such that Q2 x {t} - supp P2 C Ql and P2 == 1 in the neighbourhood of supp sing E in [Rn x [0, t]. We have (1 - P2)E o E Et1([Rn x [0, tJ) and so I (1 - Pi (s))f(s) * (1 - P2(t - s))Eo(t - s) ds E f». 28 Called also "wave front set". 
212 Chapter V. Linear Differential Operators Finally, for all s E [0, t], (supp(l - Pl(S)) + SUPPP2(t - s)) n 02 - 0, hence In short I (1 - p1(s))f(s) * P2(t - s)Eo(t - s)ds = 0 on O 2 , I f(s) * Eo(t - s) ds E &'(0 2 ), D Example 1. Domains of Dependence. (1) We have seen that in the case n = 1, the wave operator D admits the elementary solution 8 2 8 2 --- 8t 2 8x 2 1 . ) 1 E = 2 Y(t + x) Y(t - x) = YE o wIth Eo(t = 2 X[-t,t] where X[-t,t] is the characteristic function of the interval [ - t, t]. We then have supp E = {(x, t); I x I  t} and supp sing E = {(x, t); I x I = t}. Hence the domain of dependence of 0 x {t} is the trapezoid Q(t) = {(x, s); d(x, 0) < t - s} when the singularities propagate along the lines t = x + c and t = - x + c. Q Q supp E supp sing E domain of dependence domain of dependence of regularitie5- Fig. 2 (2) In the case n  2, the d'Alembertian D = 8 2 /8t 2 - L1x admits the elementary solution E = YE o where Eo(t) = tn+1 G(i) with G = $'-1 Si;1 (see Remark 4 of Section 7). 
3. Cauchy Problem for Differential Operators with Constant Coefficients 213 The exact value of G is more delicate to express, but let us recall only (see Chap. XIV) that { {x; I x I  1} if n is even suppG = {x;lxl = 1} ifnisodd,n  3. supp sing G = {x; Ix I = 1}. We deduce that the domain of dependence of Q x {t} is: for n even, the truncated cone {(x, s); d(x, Q) < t - s}, for n odd (n  3), the conical crown u {(x, s); Ix - xol = t - s}. xoEil Finally, the singularities are propagated along the conical surfaces t = Ix - xol + c. (3) Making use of the reduction of a hyperbolic operator of order 2 (see 2, Theorem 3) 8 2 8 2 8 2 8 8 p = '" 2 + 2 L a j a a - L L a jk a a + b -;- + L b j -;:;- + a, ut j Xj t j k Xj Xk ut j UXj we see that, when P is homogeneous or strictly hyperbolic, considering the elementary solution E of P in flfi' ([n X ) and denoting by r the ellipsoid {  E IRn;  ajkjk + 2 L ajj = 1 }, for n = 1 and n even, supp E = {(x, t); .X  t, V E r} for n odd (n  3), supp E = { (x, t); sup . x = t } Er for arbitrary n, supp sing E = { (x, t); sup . x = t } .  E r 
214 Chapter V. Linear Differential Operators In a general manner, we can show (see Hormander [1], p. 137), that being given P strictly hyperbolic with respect to Ho = {x; o. x  O} with elementary solution E E flfi' (H o ), suppE c {x + to;o.x = O,.x  t VEr} suppsingE c { X + to;o.x = O,sup.x ='t } , Er where r = {; o. = 0 and r(o -) = O}, with p. denoting the principal part of P. 3. Choice of a Functional Space: Well-Posed Cauchy Problem Definition 1. Suppose that Olf is a vector space of distributions on [Rn. We say the direct Cauchy problem (associated with P) is well-posed in Olf if for all U o , . . . , U m - 1 E Olf andf E ([O, 00 [; Olf) there exists one and only one solution of PC(u o ,..., u m - 1 ; f) In m( [0, 00 [; Olf). We suppose that Olf is one of the classical spaces flfi'!/'{!)M''" {!),!/',flfi' (see Appendix "Distributions"), or more generally that it satisfies the following three conditions: (i) Olf * ' c Olf and so Olf is invariant under translation and derivation; (ii) ' n Olf is dense in Olf; (iii) we can apply the closed graph theorem. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 215 Remark 1. This long series of possible choices of spaces Olf (which, we hasten to add, can be largely extended to cover the indispensible applications!) can present discouraging aspects for the reader. The guide lines are the following: (i) The physical nature of the problem already fixes certain directions. The very nature of the problem can furnish the functional framework. It is so in the case of the Schrodinger equations. In other cases, we distinguish for example according to the speed of propagation of the phenomena: if the speed of propagation is finite, no condition of growth at infinity is to be imposed and it will be appropiate to work in the spaces  or '; if the speed of propagation is infinite, then conditions of growth at infinity must be imposed and it is always useful to make a preliminary analysis of the situation in !/'; the dual spaces certainly appear inevitably and the spaces (!) M, (!)  are indispensible from the technical point of view. (ii) The spaces introduced here are not Banach spaces and their analogues on (unbounded) open sets Q of [Rn lend themselves very badly to the introduction of boundary conditions; it is one of the reasons which make the introduction of the Sobolev spaces wm,p (Q) (and also the Schauder spaces) absolutely indispensible. (iii) For numerical analysis, the most useful spaces are those "as close as possible" to the Euclidean spaces, that is to say, the Hilbert spaces. It is one of the reasons for the basic use of H m spaces. D We denote by ill the space of distributions e such that the convolution e * .: , E ' n Olf  e * , extends in a continuous mapping from Olf into itself. If Olf =::: ,,' or ', then ill = ' (see Appendix "Distributions"); if Olf = !/, {!)M, (!) or !/', then ill = (!) (see Appendix "Distributions"). From (i)-and (iii)-we have' c ill. We can show, for example by using the theorem of kernels 29 , that the mapping which makes e correspond to the convolution e * . is a bijection of ill onto the space of continuous linear mappings of Olf into Olf, invariant under translation (or commuting with the derivations); this enables us, in the general case, to define naturally a topology on ill and to verify that ill also satisfies the properties (i), (ii), (iii) above; we deduce als9 that ill * ill c ill. We notice that in the case in which the Dirac distribution is in Olf (resp. in the dual Olf' of Olf), in particular if Olf = ', (!), !/', ' (resp. , !/, (!)M, ), then the inverse of the mapping e  e * . is the mapping T E !£ (Olf, Olf)  Tb (resp. CT£5)( - .)). We do not then need to appeal to the theorem of kernels. We shall now prove: Proposition 5. With the above hypotheses and notations, the Cauchy problem is well-posed in Olf iff P admits an elementary solution in y-l ([0, 00 [; ill). Then 29 See Appendix "Distributions". 
216 Chapter V. Linear Differential Operators this elementary solution is unique 30 , and it is in 00([0, 00 [; 1/1). In addition Pm is an isomorphism from Olf onto Olf. Proof. First of all, let us show the sufficient condition. Suppose that YE o with Eo E -1 ([0, 00 [; 1/1) is an elementary solution of P. We use Proposition 1. Firstly E 1 = Pm Eo is the solution of PC(O, . . . , 0, £5; 0); so d m - 1 em = dt m - 1 Eo(O) is an elementary solution of Pm; since em E 1/1, Pm is an isomorphism of Olf onto Olf with inverse isomorphism em * .; it is also an isomorphism of 1/1 onto 1/1. Then, since d m Eo "d k Eo. . Pm d m = - i...J P k d k IS In  ([0, ex) [; 1/1), t k<rn t Eo is in m([o, 00 [; 1/1) and by recurrence Eo E  00 ([0, 00 [; 1/1). The same is true of the functions Ek defined by (3.1 ). We can then define U by (3.1) for all U o ,..., Urn - 1 E Olf and f E  ([0, 00 [; Olf); it is a solution of PC(u o , . . . , u m - 1 ; f) in m([o, 00 [; Olf). For uniqueness, let us consider u, a solution in m([o, 00 [; Olf) of PC(O, . . . , 0; 0); there exists a sequence Uk of elements of([O, 00[; Olf n g') such that Uk  U in m([o, 00[; Olf). We have YU k = YU k * PYE o = Y(Pu k * Eo), from which Uk = PU k * Eo, but PU k  0 in ([O, 00 [; Olf) so PU k * Eo  0 in ([O, 00 [; Olf) and U = O. For the necessary condition, we observe that if the Cauchy problem is well posed in Olf, the mapping U  Pu is a bijection from ([O, 00 [; Olf) onto ([O, 00 [; Olf); by the closed graph theorem, this is therefore an isomorphism with inverse isomorphism P- 1: f E ([O, 00 [; Olf)  U solution of PC(O, . . . , 0; f) in m([o, 00 [; Olf). Considering for all , E Olf, h(t) = " the mapping , E Olf  P - 1 h E  ([0, 00 [; Olf) is continuous and invariant under translation. Hence (P -1 h)(t) = F(t) *' where F E ([O, 00 [; 1/1). Since PF(t) * , = 'for all t E [0, 00 [ and all , E f!fi we have by density PF(t) == PF(t) * £5 == £5 for all t E [0,00 [; 30 In y -1 ([0, ex) [; 1/1). 
3. Cauchy Problem for Differential Operators with Constant Coefficients 217 that is to say that F is the solution of PC(O, . . . , 0; 10 )31 and so Y  is the elementary solution of P. D This proposition shows in particular that it is equivalent to saying that the Cauchy problem is well posed in f2, $, $' or f2' (resp. in !/, (!)M' (!) or !/'). The first case will correspond to hyperbolic operators (see Sect. 7 below): let us study first of all the second case by the method of the spatial Fourier transform. 4. Well-Posed Cauchy Problem in [/' We are interested in the solutions of PC(u o , . . . , u m - 1 ;f) in m([o, co [; !/'). Then we must have uo, . . . , U m -1 E!/' and f E ([o, 00 [; !/'). Under this hypothesis we define Vk = Uk and g = xf where x is the spatial Fourier transform which to f E ([o, 00 [; !/') makes correspond xf E ([O, 00 [; !/') defined by /'-... (xf)(t) = f(t) for all t E [0, 00 [. I t is immediate that  x is an isomorphism of  ([0, 00 [; !/') onto itself and that if u E r ([0, 00 [; !/'), then x U E r([o, 00 [; !/') and d r dru -u=-. dtr x x dt r Hence if u is a solution of PC(u o , . . . , U m - 1 ;f) in m([o, 00 [; !/'), then v = xu is a solution of the ordinary differential problem dmv Pm dt 2 + . . . + POV = g (3.3) dkv dt k (0) = Uk for k = 0, . . . , m - 1, where Po, . . . , Pm are the characteristic polynomials of Po, . . . , Pm. Conversely, if v is a solution of (3.3) in m([o, 00 [; !/'), then u = ; 1 V is a solution of PC(u o , . . . , U m -l; f). 31 Where fo has been defined in Proposition 1; with the above notation it is k with' = b. 
218 Chapter V. Linear Differential Operators To study the differential problem (3.3), we introduce the notation U = {y E [Rn; Pm{Y) =1= O}; it is a dense open set of [Rn. For all Y E U let us consider (3.4) So{Y, .) the elementary solution in '([R +) of L Pk{Y) d d k k k t Sl{Y'.) = Pm{Y)SO{Y'.) d Sk+l{Y'.) = dt Sk{Y,.) + Pm-k{Y)SO{Y'.) for k = 1,...,m - 1. We recall that for k = 1,. . . , m, Sk{Y' .) is the solution of the ordinary Cauchy problem d m Pm{Y) dt m Sk{Y'.) + . . . + PO{y)Sk{Y'.) = 0 d l dt l Sk{Y'.) = bm-k,l for 1 = O,...,m - 1. In particular from the classical theory of ordinary differential equations, Sk E  00 (U x [0, 00 [). We deduce that (3.3) admits one and only one solution in m ([O, 00 [; ' (U)) given by m - 1 I t v{t) = L Sm-k{., t)V k + So(., t - s)g{s)ds; k = 0 0 (3.5) from which we may state:. Proposition 6. Under the hypothesis Pm{Y) =1= 0, V Y E [Rn, there exists at most one solution of PC{u o , . . . , U m - 1 ; f) in m{[o, 00 [; g') and there exists one iff the function v given by (3.5) is in  m{[o, 00 [; !/'); this solution is then u = ; 1 v. Let us now establish the Proposition 7. The Cauchy problem is well posed in !/' iff P satisfies the following two conditions: (3.6) Pm{Y) =1= 0, Vy E [Rn, { there exists To E [R such that L Pk{Y)Zk =1= 0 for all (3.7) (y,z) E [Rn X C with Rez > To. Then the elementary solution in Yo-l ([O, 00 [; (9) is given by E = ; 1 So, where for all Y E [R n, So{Y,.) is the elementary solution in ' ([R +) of L Pk{Y) dkjdt k . Proof. From Proposition 5, the condition (3.6) is necessary for the Cauchy problem to be well-posed in !/' (see Remark 1, 2). Let us now suppose that this 
3. Cauchy Problem for Differential Operators with Constant Coefficients 219 condition is satisfied; then by the spatial Fourier transform (Proposition 6), the Proposition reduces to: So E Yoo([O, 00 [; (!)M) iff P satisfies (3.7). First of all we shall prove the condition to be necessary. Let us suppose So E Y 00 ([0, 00 [; (!)M) and take P E f!fi ([R) such that p == 1 on [0, 1J, p == 0 on [2, 00 [ . Then ' d k L Pk(Y) dt k pSo(Y, .) = b + t/J(y,.), where t/J E £0([1, 2J; (DM)' In particular there exist (J E [R and C > 0 such that It/J(y, t)1  C(1 + lyI2YT/2, V(y, t) E [Rn+l. Now making use of the Laplace transform we have for z E C: (LPk(y)Zk) ff (pSo(Y, . ))(z) == 1 + fft/J(y,. )(z). But if Re z  0 12t/J(y,.)(z)1  fetRezlt/J(y,t)ldt  C(1 + lyI2)'1 1 2 e -Rez. Hence if z is a zero of L Pk(Y)Zk with Re z  0, (3.8) (J Re z  Log C + 2 Log(l + lyI2). But making use of Lemma 1 of 2 with A(T,T 1 ,T2'Yl.,Yn == I LPk(y)(T 1 + iT 2 )k12 + (IYI2 - T)2 B(T,T 1 ,T 2 ,Y1.,Yn) == T 1 we know that (3.9) sup{Rez;zzeroofLPk(y)zk}  clYIJ when Iyl  00. From (3.8) we have c  0 and b  0, from which (3.7) follows We now show the condition to be sufficient. We fix 0 < T < 00. We shall show that the functions v == So and v == S 1 == PmSO satisfy there exist (J E [R and C > 0 such that alai +k o y" otk v(y, t)  C(l + I yI2)"/2, V(y, t) E [Rn X [0, T]. (3.10) for all lJ, E N nand kEN. First of all we know that Sl(Y'.) == E Z1 *. .. * E Zrn where Zl, Zm are the zeros distinct or not of LPk(y)Zk and Ez(t) == Y(t)e zt (see 
220 Chapter V. Linear Differential Operators 2.2.3. Example 9). On the other hand we easily verify by recurrence on m that for Zl,... 'Zm E C with Rez k  To, t m - 1 IE Z1 * · . . * EZm(t)1  (m _ i)! e'ot, Making use of the hypothesis (3.7), we hence see that v = S 1 satisfies (3.10) for l/., = (0, . . . , 0) and k = O. To set up a recurrence relation, we make use of the following three relations. First of all (see proof of Theorem 2 of 2) we see from (3.6) that there exist c > 0 and b E  such that \It E  +. (3.11) IPm(Y)1  c(l + lyI 2 )-J/2, \ly E n. (see (2.3)). Then, for j = 1, . . . , n a apk 1 1 Ok (3.12) -;- Sl (y, t) = - L -;- (y) So(y, t - s) -;-k Sl (y, s)ds uYj uYi ° ut aS 1 . I . f as v = ---;-- IS a so utIon 0 uYj dkv apk d k S 1 dkv L Pk dt k = - L OYj dt k ' dt k (0) = 0 for k = 0, · · · , m - 1. Finally (3.13) a ( a ) k - m + 1 at S 1 (y, t) = - k m Pk(Y) at So(Y, t) which we obtain by integrating d m Sl/dtm = - L Pk d k So/dt k . k<m Then, being given r E Nand k O EN, let us suppose that v = S 1 satisfies (3.10) for alll/., E N n with Il/., I  rand k  ko; from (3.11) it will then be the same for v = So; then from (3.13) v = Sl satisfies (3.10) for alll/., E N n with Il/.,l  rand k  k O + 1; by recurrence v = S 1 and v = So hence satisfy (3.10) for alll/., E Nn with Il/.,l  r and all kEN. Applying then (3.12), v = Sl satisfies (3.10) for all l/., E N n with Il/., I  r + 1 and kEN; by induction, since the recurrence hypothesis is true for r = 0 and k O = 0, v = Sl and v = So satisfy (3.10) for all rx E N nand kEN. D Remark 2. Adapting the proof of Proposition 2, the reader will be able to show that, under the hypothesis (3.6), every elementary solution of P in f2' (+; !/') is in fact in Yo-l ([0,00 [; (9M). D We shall later make the condition (3.7) explicit for an operator P = a/at + Po (m = 1). Let us first of all apply Proposition 7 to the characterisation of parabolic operators. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 221 5. Parabolic and Weakly Parabolic Operators We recall (see Definition 8 of 2) that P is parabolic with respect to n x + if it is hypo-elliptic and admits an elementary solution in ' (n X  +). The result of Proposition 20 of 2 shows that there then exist c > 0 and C > 0 such that for all To  0, (y,Z) E n X C Re z  - TO and Izi + Iyl > Ce C 7: 0 => L Pk(Y)Zk =t o. We remark first of all that if P is hypo-elliptic, then Pm is of order 0 (see 2, Sect. 1.4). We therefore have as an immediate corollary of Proposition 7 (of this 3) and of Proposition 20 of 2: Corollary 1. Suppose that P is hypo-elliptic. Then the following three assertions are equivalent: (i) P admits an elementary solution in ' (n X  + ); (ii) the (direct) Cauchy problem associated with P is well-posed in [/'; (Hi) P satisfies (3.7). As a consequence, if P is parabolic with respect to n x  + it admits an elementary solution, and a single one, in Yo-l ([0, 00 [; (D). There is even uniqueness of this elementary solution in '( +; [/') (see Remark 2) and this solution is in yoo([O, 00 [; (D) (see Proposition 5). As a matter of fact, from Proposition 20 of 2, P satisfies a great deal more than (3.7); we deduce thaJ the elementary solution of P in y-l([O, 00 [; (D) is much more regular. (See Remark 3 below) We shall prove:- Proposition 8. Suppose that the Cauchy problem is well-posed in [/' and denote by E the elementary solution of P in Yo-l ([0, 00 [; [/'). Then the restriction of E to  n X ] 0, 00 [ is in  00 (] 0, 00 [; [/) iff P satisfies (3.14) { for all To E , there exists C such that LPk(y)Zk =t 0, for all (y, z) E [Rn X C with Iyl > C and Re z > To. Proof We know that Pm satisfies (3.6). By the spatial Fourier transform, the conclusion expresses that So E 00(]0, 00[; [/) iff P satisfies (3.14). We then adapt the proof of Proposition 7, whose notation we follow. For the necessary condition, being given p E () with p = 1 on [0,1] and p = 0 on [2, 00 [, d k L Pk(Y) dt k (, So(Y, .)) = b + t/J(y,.) with t/J E '@([1, 2]; Y). In particular f't/J(y, t)ldt -+ 0 when Iyl -+ 00. 
222 Chapter V. Linear Differential Operators U sing the Laplace transform, since, with (Re z) - == max(O, - Re z) f e - tz t{I(y, t) dt  e 2 (Re z) - f I t{I(y, t) I dt, we see that ,if z is a zero of L Pk(Y)Zk, we have - 2(Rez) -  Log f I t{I(y, t)l dt, and therefore that P satisfies (3.14). For the sufficient condition, using Lemma 1 of 2 we have sup{Rez; z zero of L Pk(Y)Zk}  clyl when Iyl  00; but using the hypothesis (3.14), we necessarily have c < 0 and b > O. In other words there exists To E , Co > 0 and b o > 0 such that L Pk(Y)Zk == 0 => Re z  To - Co IYI; we then prove as in Proposition 7, that v == So and v == Sl satisfy, for all a E Nn and kEN, there exist C > 0, T E  and c > 0 such that alai +k o ya ot k v(y, t)  Ce(! - cIYI'O)t, V(y, t) E [Rn X [R + . Hence So E ((j 00 (]O, 00 [; g). o It follows immediately from Proposition 20 of 2, that a parabolic operator with respect to n x  + satisfies (3.14). For this reason we frame the Definition 2. We say that P is weakly parabolic with respect to n x  + if it satisfies the hypotheses (and the conclusion) of Proposition 8. We shall see that there exist weakly parabolic operators which are not hypo- elliptic and therefore not parabolic (see Sect. 6.1). Remark 3. Suppose that P is weakly parabolic with respect to n x  +. Then the functions Ek defined by (3.1) are in ((joo(] 0, 00 [; g). Since g' * g C (9M, we deduce that, for U o , . . . , U m - 1 E g', the solution of PC(u o , . . . , u m - 1 ; 0) in m([ 0, 00 [; g') is in ((j 00 (]O, 00 [; (9 M); assuming greater regularity at infinity in the initial data, we shall obtain greater regularity at infinity in the solution: for example, if U o ,..., U m - 1 E U H(J, then u E ((j00 ( ]O, 00[; n H(J ) , etc. . . (JE (JE 
9 3 . Cauchy Problem for Differential Operators with Constant Coefficients 223 The inhomogeneous problem is more difficult; we shall not deal with it here. The properties of such operators are developed in theory of differentiable or regularising semi-groups. (See Chap. XVIII). D 6. Study of the Particular Case P = :t + P 0 We suppose that P = a/a t + Po, where Po is a l.d.o. with constant coefficients on [Rn and with characteristic polynomial Po. We then have So(y, t) = Y(t)e-tPo(y) and can rewrite (3.7) as (3.15) inf {Re Po(y);y E [Rn} > - 00. Hence the direct Cauchy problem associated with P = a/at + Po is well-posed in !/' iff Po satisfies (3.15); then the elementary solution in Y  ([0, 00 [; (D ) is E = Yff;l e -t Po . Inverting the time we see that the backward Cauchy problem is well posed in !/' iff sup {Re Po(Y); Y E [Rn} < 00. Since a real polynomial is bounded iff it is constant it follows that the direct and backward Cauchy problems are both well- posed in !/' iff Re Po is constant. The condition (3.14) can be written (3.16) lim Re Po(Y) = + 00. Iyl-+oo Hence, P = a / at + Po is weakly parabolic with respect to [Rn x [R + iff Po satisfies '(3.16). 6.1. Analysis of One-Dimensional Case To analyse these conditions, we consider the case n = 1: we then have a k Po = La k ox k ' Po (y) = L akikyk, Re Po(Y) = Re a o + 1m a 1 Y - Re a2y2 - I ma3y 3 + . . . We then distinguish several cases: (a) deg Re Po is odd: then sup {Re Po(Y); Y E [R} = - inf {Re Po(Y); Y E [R} = + 00 so neither the direct nor the backward Cauchy problem is well-posed in !/'. (b) Re Po is constant: then both the direct and backward Cauchy problems are well-posed in !/'. (c) deg Re Po is even and 0 < deg Re Po < deg Po: then P is weakly parabolic with respect to [R x [R + or [R x [R - according to the sign of the leading 
224 Chapter V. Linear Differential Operators coefficient of Re Po; but P is not parabolic. In fact, when n = 1, a / at + Po is hypo-elliptic iff deg Re Po = deg Po > O. This follows from the criterion for hypo-ellipticity (see (2.8) and also 94.1): the characteristic polynomial of a / at + Po is is + Po(Y); hence a/at + Po is hypo-elliptic iff lim I is + Po (y ) I - 1 = 0 Isl+lyl-+oo and lim Ipba)(y)I/lis + Po(y)1 = 0 Isl+lyl-+oo Va E Nn, a =1= o. The first condition reduces to the second if deg Po > O. As for the second condition, since lis + Po(y)1 = ((s + 1m Po(y))2 + (Re Po(y))2)1/2, it reduces to 1 . Ipba)(y)1 = 0 1m , Iyl-+oo IRepo(y)1 When n = 1, this states clearly that deg Re Po = deg Po. (d) deg Re Po = deg Po is positive and even: then P is parabolic. We deduce the hypo-ellipticity by the above argument. We may also remark that P is semi-elliptic and apply Proposition 8 of 2. Finally, we note that if Po has real coefficients, then Re Po(Y) = a o - a2y2 + a4y4 - . . . ; Va E r\jn, a =1= O. hence the case (a) does not arise, the case (b) corresponds to: Po has only terms of odd order, the case (c) corresponds to: Po is of odd order but admits terms of even positive order, the case (d) corresponds to: Po is of even positive order. In the above case (d), the operator Po is 0 or n-strongly elliptic following the sign of the leading coefficient of Re Po. 6.2. Case in which Po is Strongly Elliptic Suppose now that, for arbitrary n, Po is O-strongly elliptic, the case n-strongly elliptic following by an inversion of the time. Then Po satisfies (3.16) (see the proof of Proposition 7 of 2) and P = a / at + Po is semi-elliptic, hence hypo- elliptic (see Proposition 9 of 2); in short, P is parabolic with respect to n x  +. When Po is homogeneous, (3.16) can be written Repo(Y) > 0, VYEn\{o} that is to say Po is O-strongly elliptic. Hence, we have:- Proposition 9. When Po is homogeneous, the following assertions are equivalent: (i) Po is O-strongly elliptic, (ii) P = a / at + Po is parabolic, (iii) P = a / at + Po is weakly parabolic. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 225 But these equivalences do not hold in the general case, as is shown by the case (c) above and the example a 4 a 2 Po = _ a 4 - _ a 2 Xl X 2 in  2 which is not elliptic, but nevertheless P = a/at + Po is parabolic with respect to [R2 x  + since on the one hand Po satisfies (3.16), while on the other hand P is semi-elliptic, hence hypo-elliptic. When Po is O-strongly elliptic, the operator P = a/at + Po possesses more properties than do those of the class of parabolic operators: principally the elementary solution in y ([ 0, 00 [; g') is analytic with respect to X E n for all t > O. This can be shown easily in the case Po = - wL1 x with Re w > 0, since the elementary solution of a/at - wL1 x is defined on n X ]0, 00 [ by ( IX I2 in arg W ) E(x, t) = ff;l e-wlyl2t = (4nlwlt)-n/2 exp - 4wt + 2 . In the general case, this will result from the hypo-analyticity with respect to X of the operator a/at + Po when Po is O-strongly elliptic (see Corollary 2 of 94). We must not confuse analyticity with respect to x with analyticity in ] 0, 00 [ into g. Proposition 10. The operator P = a/at + Po admits an elementary solution in y  '0 - I ([0, 00 [; g ') analytic in ] 0, 00 [ into g iff Po satisfies (3.17) there exists c > 0 such that lim Re Po(y) + c 11m Po(y)1 = + 00 y-+oo (where Po(Y) is the characteristic polynomial of Po). Proof It is enough to prove that (3.17) is equivalent to the analyticity of e- tpo in ]0, 00 [into g, that is to say that e- ZPO E g for all z in a neighbourhood of ]0,00 [ in C. Now e- zpo E g iff lim Re (zPo(Y)) = + 00, Iyl-+oo that is to say, supposing Re z > 0, iff lim Re Po(Y) + (tg arg z) 1m Po(Y) = + 00. Iyl-+oo The result follows. The property (3.17) is, in particular, satisfied if Po has real coefficients satisfying (3.16); for example in 2, Po = a 4 / ax,! - a 2 / ax. This analyticity property and its consequences are generalised in the theory of analytic or holomorphic semi-groups (see Chap. XVII A. 9 6 ). 6.3. Schrodinger Operator Finally, we return to the case Re Po constant, i.e. to the case in which both the direct and backward Cauchy problems are well posed in g'. When Po = -wL1 x ' 
226 Chapter V. Linear Differential Operators this corresponds to Re OJ == 0, i.e., to within a change of the time variable, to the Schrodinger operator (1/i)8/at - L1x. Then for all U o E !/' and fE (; !/'), there exists one and only one solution of the bilateral Cauchy problem 32 , U E  (; !/') au + P f 1 . n rM' ( /Tl)n+l ) -;- 0 U == ;;LJ If\\ ut I u(O) == Uo and this solution is given by u(t) = E(t) * U o + L E(t - s) *f(s)ds where E(t) == ff -; 1 e- tpo . We note that here E is clearly continuous from  into !/'; the elementary solution is YE. Now if Uo E L 2 and f - 0, using Parseval's formula (see formula (3.15) of Appendix "Distributions") we have /""-.. II u(t) IIL2 == (2nt 12 11 u(t) IIL2 == (2nt 12 II e-tpouo.11 L2 == e -tRe Po (2n)nI2 II U O IIL2 == e -t Re Po II U o IIL2. Hence the function t  e t Re Po II u(t) IIL2 is constant. This property and its con- sequences are generalised in the theory of one-parameter semi-groups on a Hilbert space ( see Chap. XVII). In the case of the Schrodinger operator a / at - iL1x we have i sign t ( X2 ) E(x, t) == (4n I tl) -n12 e- nn - TtT . We see directly that E E OO(*; lP M ) and that E is analytic on n X *. These properties enable us to classify the operators- a/at + Po satisfying Re Po constant and more generally the operators am P == Pm _ a +.. . + Po t m for which the bilateral Cauchy problem is well-posed in !/'. 7. Well-Posed Cauchy Problem in p)': Hyperbolic Operators We now return to the general case of an operator am P == P m - + . . . + Po with Pm =1= o. at m 32 That is to say in p as opposed to the direct t  0 used in Definition 1. 
3. Cauchy Problem for Differential Operators with Constant Coefficients 227 From Proposition 5, the Cauchy problem is well posed in , C, C' or ' iff P admits an elementary solution in y-l ([0, 00 [; C'). In particular, since C' c (!)o the Cauchy problem will then be well posed in !f'. By the spatial Fourier transform, the Cauchy problem is therefore well posed in ' iff it is well posed in !f' and So E y-l ([0, 00 [; ffx C'). We shall then make use of the characterisation of ffxC' by the Paley-Wiener theorem (see 3, Sect. 5 of Appendix" Distributions"): a tempered distribution is in C' { { I x I  R}) iff its Fourier transform can be extended to an en tire function F on en satisfying { there exist C > 0 and (J E  such that I F ( () I  e R I 1m  I (1 + I ( 1 2 YJ / 2 , V ( E en. We shall establish the following characterisation: Proposition 11. The following assertions are equivalent: (i) P is hyperbolic with respect to n x  + (ii) the direct Cauchy problem associated with P is well posed in ' (iii) Pm is of order zero and P satisfies (3.18) { there exist To E  and C > 0 such that for all ((, z) E en x e, LPk(()Zk = 0 => Rez  To + CII:n(I. Then P is hyperbolic with respect to the cone {(x, t); Ixl  Ct}. Proof (i) => (ii). Let E be the elementary solution of P in  '(n X  +). There exists C > 0 such that supp E c {(x, t) E n+ 1; Ixl  Ct}. We now prove that E E y-l ([0, 00 [; C'); for this we adapt the proof of Proposition 2. We fix T E ] 0, 00 [; from the local representation theorem, we know that there exists r E N such that D t- r E E y - 1 ([0, T]; C'); we suppose r =1= 0 and prove that Dt- r + 1 E E y-l ([0, T]; C'). Since D m - 1 P D- r + 1 E = D-rb - " DkpD-rEE Y ([ O T ] .C' ) t m t t  t k t ", k<m we have P mDt- r + 1 E E y-l ([0,00 [; C'). Now we have also su pp D t- r + 1 E c {(x, t) E  n + 1; I x I  C t } ; using an elementary solution em of Pm in ' (see 4.5 or Hormander [1] for its existence), the distribution F m = em (8) b(t), elementary solution of Pm in g'(n+ 1) has support, convolutive with that of Dt- r + 1 E; hence Dt- r + 1 E = F m * P mDt- r + 1 E E y-l([O, 00 [; C'). D (ii) => (iii) We have seen that the Cauchy problem is well posed in !f' and that So E y 00 ([0, 00 [; ffx C'). On the other hand, from Proposition 5, Pm is an 
228 Chapter V. Linear Differential Operators isomorphism on ' and hence of order zero. We can then, for all ( E en, consider F 0((, .) to be the elementary solution in ' (+) ofL Pk (()d k /dt k : for all t E , F 0(' , t) is the holomorphic continuation of So(' , t). We then follow the proof of the necessary condition in Proposition 7. Given p E () with p = 1 on [0, 1] and p = 0 on [2, ex) [, we have for all 'E en d k L Pk (() d t k (p F 0 ( (, . )) = lJ + F ((, . ) where F (. , t) is the holomorphic continuation of t/J(. , t) with t/J E ([1, 2]; ff x S '). Using the Paley-Wiener theorem we find that there exists R, C, (J such that IF ((, t) I  C(l + I (1 2 YJ/2 e R11m I, V((, t) E en x . By the Laplace transform we deduce that if Re z  0, (J LPk(()Zk = 0 => Rez  Rllm(1 + LogC + 2 Log(l + 1(1 2 ); which, by an application of Lemma 1 of 92, completes the proof. o (iii) => (i). Since (3.18) trivially implies (3.7), the Cauchy problem is well posed in !f'. Now from (3.18), with the above notation t m - 1 I F ( r t ) I  e ( 1: 0 + C I 1m ( I) t o S, "(m - 1)! and hence by the Paley-Wiener theorem, Eo(t) = ff; 1 So(. , t) has support in {x; I x I  C t }. o An important consequence of the characterisation (iii) is Corollary 2. P is hyperbolic with respect to n x  + iff it is hyperbolic with respect to n x  - . Proof Since Pm is of order 0, we can always take Pm = 1. Given (E en, we denote by Zl,... 'Zm the zeros ofLPk(()zk. We have Zl + . . . + Zm = - Pm-l ((), (3.19) and so from (3.18) { - Re Pm-l (()  m(To + Cllm (I) Re Zk  - (Re Pm-l (() + (m - l)(T o + Cllm (I)). It is enough to show that (3.19) implies that Pm-l is of degree  1. Then Pm-l (() = a o + ia. ( and we have, from (3.19), Ima.Re( = - Repm-l(() + Rea o - Rea.lm(  T + C/llm(1 
3. Cauchy Problem for Differential Operators with Constant Coefficients 229 so 1m a = 0 and Re Pm-l(()  Re a o + IRe alllm(l. To show that Pm-l is of degree  1 we can always suppose that n = 1 since degpm-l = sup {degree in A of Pm-l(AY); Y E n}; we then have Pm-l (() = i(ar(f + . . . + a o ) with a r =I 0 and - Repm-l(() = Llakll(lksin(Ok + kO) with Ok = argak,O = arg(. Taking ( = ne iO / n , we find that: L lakln k sin ( Ok + k: ) is uniformly bounded when n -4 r:f) for 0 bounded. We deduce that k  1. D In fact, by an argument, a little more elaborate, we can prove: Corollary 3. Suppose that P is hyperbolic with respect to n x  +. Then m is the total order of P. In other words, the hyperplane {t = O} is not a characteristic of p. 33 It is this essential result that we stated and used systematically in 92 for the charac- terisation of hyperbolic operators. Proof We can always, as above, reduce to the case n = 1; we then have (see Hormander [1], p. 275), for 1(1 large L Pk( ()Zk = Pm(()(z  Z 1 (()) . . . (Z - Zm( ()). in which the functions z(() = Zk(() can be developed in Puiseux series for large 1 (I, that is to say, if Zk =1= 0, z(() = a o ((l/V)CT + a 1 ((l/V)CT-l + . .. with v E N *, (J E lL, a o =I O. Taking ( = TeiO/r: with 0 E , T > 0, we have for large T ( (JO ) CT - 1 ( ((J - 1 ) 0 ) Rez k (() = laoITCT/VCOS 0 0 +  + la 1 1!-----;-cos 0 1 + VT + ... where Ok = arg a k , (k = 0, 1, . . . ). Now, from (3.18) and Corollary 2, Re Zk(() is bounded as T -+- 00 uniformly for () bounded. We deduce that (J  v. Therefore Zk(() = O(() and p(() = o((m-k) as I (I -+- 00. This shows clearly that the total order of P is m. D 33 Unlike the parabolic case. 
230 Chapter V. Linear Differential Operators Remark 4. Using the spatial Fourier transform, we can calculate explicitly the elementary solutions in ' (n X  +) of a hyperbolic operator. F or exam pIe, let us consider the d' Alem bertian 0 = a 2 / at 2 - L1 x; the elementary solution in ' (n X  +) is Y ff; 1 So where YSo(Y,.) is the elementary solution of d 2 /dt 2 + lyl2 given by So(Y, t) = sin(lylt)/lyl and hence E(x, t) = Y(t)ff: 1 Si;f' t = t! G(  ) with G = ff: 1 Si;tl . o 4. Local Regularity of Solutions* 1. Characterisation of Hypo-Ellipticity We consider P a l.d.o. operator, with constant coefficients on n, of order m and with characteristic polynomial p. We study here the local regularity of U E ' as a function of that of Pu. In particular, we shall characterise the hypo-ellipticity (resp. the hypo-analyticity) of P; we recall the definition: for every open set Q of n and all U E '(Q), Pu E C(Q) (resp. Pu analytic on Q) => u E C(Q) (resp. u analytic on Q). 1.1. Necessary Condition for Hypo-Ellipticity Given a non-empty open set Q E n we consider % = {u E '(Q); Pu = O}. This is a closed sub-space of '(Q), stable under derivation. Suppose that P is hypo-elliptic, then % is contained in C(Q): from the closed graph theorem,34 for all Frechet spaces ff 1 , ff 2 with C(Q) C+ ff i C+ '(Q), the topologies induced by ff 1 and ff 2 on % are identical. Afortiori, for example, the topologies induced by Lloe(Q) and H foe (Q) are identical; that is to say that being given Ql, a non-empty relatively compact open set in Q, there exists an open set Q2 relatively compact in Q and a constant C such that (4.1 )  f IDjul2 dx  C 2 f lul2 dx, J ill il2 Vu E %. 34 See Bourbaki [2]. The closed graph theorem in the context of Banach spaces is treated in Chap. VI. 
94. Local Regularity of Solutions 231 Now, given ( E en, we see that u(x) = e'x is in % iff P(() = o. We have f le.xI2dx = f e2Re.xdx  IQ2Ie2c2IRel. n2 n2  f IDjul2dx = '(l2 f e2Re.xdx  1(12IQlle-2clIRel, J nl nl where IQi I is the measure of Qi and C i = sup {Ixl; x E Qi}. Substituting in (4.1) we obtain 1'1  C C:: Y/2 e(C1 +c2)IRe,I, In short, if P is hypo-elliptic, then { for every constant C 1 there exists a constant C 2 such that (4.2) P(() = 0, IRe (I  C 1  1(1  C 2 . We shall show later (see Theorem 1) that this condition is sufficient. We-remark that we have used only the particular case of hypo-ellipticity: { there exists a non-empty open set Q of [Rn such that (4.3) u E Lloc(Q) and Pu = 0  U E Hloc(Q). It is not obvious a priori that this property implies the hypo-ellipticity of P nor even that % is contained in S(Q). It is interesting to note that we can obtain the necessary condition (4.2) under the following hypothesis which is a priori much weaker than (4.3): (4.4) { there exists a non-empty bounded open set Q of [Rn such that {u E S(Q); IlullL oo  1, Pu = O} is relatively compact in Lloc(Q). As a matter of fact, given ( E en let us consider u(x) = e.x-cIRe1 where c = sup {Ixl; x E Q}. Using (4.4) and the compactness criterion in L 1, given Qo a non-empty open set relatively compact in Q, there exists 8: [R +  [R + , continuous with 8(0) = 0 such that for ( E en with P(() = 0 and all h E [Rn with Ihl < b = dist (Qo, aQ) f u(x + h) - u(x)1 dx  8(lhl), no that is to say le P - 11 r e ReCx dx  B(lhl)e clRe ,I, Jno This clearly implies (4.2). Indeed if (4.2) is not satisfied, there exists a constant C 1 and a sequence ((k) such that P((k) = 0, IRe(kl  C 1 and I(kl  00, for k  00. 
232 Chapter V. Linear Differential Operators We have 11m (kl  00; we can therefore suppose that 11m 'kl > n/2b with the result that we can apply the above relation to , = 'k with n 1m 'k h = 211m C k l 2 ; We then have I e . h - 11 = 11 - ie Re  . h I  1 and so I Q ol e-eCl  ( n ) eCl "G 211m (k Ie, a contradiction in the limit. o 1.2. Algebraic Transformation of the Necessary Condition for Hypo-Ellipticity First of all we introduce the function d: [Rn  [R + defined by d(y) = distance of iy from {( E en; P(() = O}, which, in terms of the characteristic polynomial becomes d(y) = inf {I(I; ( E en, p(y + () = O}. The condition (4.2) can then be written (4.5) lim d(y) = + 00. !y!oo But, using Lemma 1 of g2, we see that there exist C E [R, b E Q such that inf {d(y); I y I = T} ro...I CT b when T  00 so that (4.5) is equivalent to { there exist C > 0 and b > 0 such that (4.6) 1 + d(y)  c(l + y2)Jf 2 , Vy E [Rn, We shall transform these conditions by expressing them by means of derivatives of p. We observe, first of all, that by Taylor's formula 1(llk! Ip(y + () - p(y)1  L - k ' Ip(k)(y)1 k*O . from which (4.7) I p(y)J  L d(lkl J p(k)(y)l. k*O . 
9 4 . Local Regularity of Solutions 233 We deduce that (4.5) is implied by { there exists c: IRn  IR + with lim c(y) = 00 such that (4 8) Iyloo . c(Y)lp<k)(y)1  Ip(y)l, Vy E IRn, Vk E f\jn \ {O}. Let us now prove the inequality: (4.9) d(y)lk1Ip<k)(Y)1  2 m lkl ! Ip(y)l, Vy E IR n , Vk E f\jn. By Cauchy's inequali ty 35, this is a consequence of: (4.10) 1'1  d(y) => Ip(y + ()I  2 m lp(y)l. To prove (4.10) we can always suppose that 0 < 1'1  d(y); then by the definition of d, the zeros Zj of the polynomial g(z) = p(y + z() all satisfy d(y)  Izj(l = Izjll(l and so Izjl  1. We then have Ip(y + ()I Ip(y)1 g(1) = n 1 _   2 m g(O) Zj from which (4.10) follows. From (4.9), the condition (4.6) hence implies { there exist C and b > 0 such that (4.11) blkl Ip<k)(y)1  C(1 + y2)--Y-(1 + Ip(y)l), Vy E IRn, Vk E f\jn. In short, the conditions (4.2), (4.5), (4.6), (4.8) and (4.11) are equivalent. 1.3. The Principal Result We have the Theorem 1. Let P be a l.d.o. with constant coefficients on IRn. Then P is hypo- elliptic iff it satisfies the equivalent conditions (4.2), (4.5), (4.6), (4.8), (4.11). Proof It remains to prove the sufficient condition. For that, we use the condition (4.11). Suppose we are given an open set Q of IR n and u E '(Q) with Pu E C(Q). Restricting ourselves to an open set relatively compact in Q, we see from the local representation theorem, that we can always suppose that u E (Q), a space of distributions of finite order 36 . Since (Q) = U Hfoc(Q) and C(Q) = n Hfoc(Q), 35 f(O = p(y + 0 being holomorphic on 1" ::::; d = d(y) Ikl! If(k)(O)1 ::::; - sup If(')I. d Ikl II d 36 See, for example, Schwartz [1], p. 85. 
234 Chapter V. Linear Differential Operators the theorem follows by induction over r E Z: p(k)U E H 1 +lkl)b(Q) V k E r\jn \ {O} => p(k)U E Hlkl)b(Q) To prove this implication, we suppose that p(k)U E H 1 + Ikl) b (Q) V k E r\j n\ {O} and we take p E f0(Q). From Leibniz's formula, 1 P(k)(pU) = L, D a pp(k+a)u, rx. V k E r\j n\ { O} . Vk E r\jn, we have, from the hypothesis, p(a)u E Hlc(Q) for all rx E r\jn (for rx = 0 this comes from Pu E S(Q)). Hence P(pu) E H rb . Since pu E H rb , we have ............... (1 + y2)rb/2(1 + Ip(y)1) I pul E L 2 . On the other hand, using (4.11), we have ............... ............... 1(p(k)(pu))1 = Ip(k)11 pul  C(1 + y2)lk l b/2(1 + Ip(y)1) I pul, hence P(k)(pU) E H(r+lkl)b. Finally for rx =1= 0, p(k+a)u E Hl+lk+aDb c Hlkl)b. Hence 1 pp(k)U = p(k)(pU) - L ,Dapp(k+a)u E H(r+lkl)b. a*Orx. D 2. Analyticity of Solutions 2.1. Statement of Results We consider Q, an open set in [Rn, and G a linear subspace of [Rn. We say that u is analytic in S(Q) with respect to G if the mapping  E G  u(. + ) is analytic in S(Q) in the neighbourhood of  = O. In other words, u is analytic in S(Q) with respect to G iff u E S(Q) and { for all compact K of Q and all rx E r\jn, there exists a constant C such that I(. DY Dau(x)1  (CIlrY Vr E r\j*, Vx E K, V E G 37 . It is clear that u is analytic in Q iff u is analytic in S(Q) with respect to G = [Rn. In the contrary case, it is a question of a partial analyticity with respect to certain variables. 37 In the case of one variable, the classic condition for the analyticity off(x) in the neighbourhood of Xo is (If(r)(x o )llr!)l/r ::::; C; but from Stirling's formula, ('r!)l/r  rle as r -+ 00, this condition is therefore equivalent to If(r)(xo)1 ::::; (CrY, (with another constant C) 
4. Local Regularity of Solutions 235 We shall prove the following result: Proposition 1. Suppose that P is a l.d.o. with constant coefficients on [Rn. Consider the set G of the  E [Rn satisfying { there exists a constant C such that (4.12) I'; .YI1klp(k)(y)  C(1 + Ip(y)l), Vk E f\,jn, Vy E IRn. This is a linear sub-space of[Rn and for every open set Q of[Rn and u E 9(Q) we have Pu analytic in 9(Q) with respect to G => u analytic in 9(Q) with respect to G. We say that P is hypo-analytic with respect to G. Before proving this proposition, which will be a little long, let us state and prove the converse: Proposition 2. Let P be a l.d.o. with constant coefficients on [Rn, and  E [Rn. Suppose P is hypo-elliptic and that there exists Q an open set of [Rn and X O E Q such thatfor all u E 9(Q) with Pu = 0, thefunction t  u(X O + t) is analytic in the neighbourhood oft = o. Then  satisfies (4.12). We note that, as for hypo-ellipticity, the property of hypo-analyticity with respect to G is implied by quite a small part of this property. We note the i immediate corollary: Corollary 1. A l.d.o. with constant coefficients on [Rn is hypo-analytic iff it is elliptic. Proof of Corollary 1. From Propositions 1 and 2, we have to prove that P is elliptic iff (4.12) is satisfied for all  E [Rn. Now, (4.12) is satisfied for all  E [Rn iff { there exists a constant C such that (4.13) lyI1k1Ip(k)(y)1  C(1 + Ip(y)l), Vk E f\,jn, Vy E IR n . Since " I Yl lkl l P (k) (y)l ............ I 1 m th i.-J -;:::; Co Y wi Co = k L Ip(k)1 > 0 Ikl=m and L lyI1k1Ip(k)(y)1  C o (l + Iylm) with Co < 00, k (4.13) is equivalent to { there exists a constant C such that Iylm  C(l + Ip(y)l), Vy E [Rn that is to say, P is elliptic. o 
236 Chapter V. Linear Differential Operators Let us give another example: Corollary 2. Suppose that Po is a l.d.o. with constant coefficients on [Rn. The operator P = a/at + Po is hypo-analytic with respect to IRn x {O} iff Po or ( - Po) is O-strongly elliptic. Then, we recall (see Proposition 9 of 3), P is parabolic with respect to IRn x IR + or IRn x IR- and its elementary solution in Y([O, 00[; g') is analytic with respect to x for all t > O. Proof of Corollary 2. We have a1kl +1 alas' p(y, s) Ip(y, s)1 2 = (s + 1m Po(y)1 2 + (Re PO(y))2 1 if 1 = 1 k = 0 , o if 1  1, 1 + I kl I p)(Y)1 if 1 = 0, k =1= 0; > 1 so (4.12) is satisfied for (, 0) E IRn+ 1. I . Yllkllp)(Y)1  C(l + IRe Po(y)l), \lk E N n , \ly E IRn. Repeating the argument of Corollary 1, we find that (4.12) is satisfied for all (, 0) E IRn+ 1 iff Iylm  C(l + IRe Po (y)l), that is to say, iff Po or ( - Po) is O-strongly elliptic. D Proof of Proposition 2. We consider % = {u E Lloc(Q); Pu = O}. This is a closed sub-space of Lloc(Q) and hence is a Frechet space for the induced topology. By hypothesis % c S(Q) and for u E %, the function g(t) = u(x o + t) is analytic in the neighbourhood of 0; in other words, there exists C such that I( . DY u(xO)1 d r g (0)  (C ) r  r, d( \lr E N*. For C E IR, we consider %c = {u E %; I( . Dru(xO)1  (Crr, \lr E N*}. By hypothesis % = u % c. Now % c is an equibrated convex set, closed in %, since the topologies induced by S(Q) and Ltoc(Q) on % are equivalent by the closed graph theorem. From Baire's theorem 38 , there exists therefore, C E IR such that %e contains a neighbourhood of zero in %; by homogeneity, there exists therefore a compact set K c Q and Co E IR such that 38 For Baire's theorem, see Chap. VI, 91 or Dieudonne [2], Y osida [1]. 
94. Local Regularity of Solutions 237 I(  . D)' u(xO)1  (Cr)' Co L JuJ dx, Applying this to u(x) = eC(X-x O ), we obtain Vu E..AI, Vr E N*. I. 'I r  (CrYCoIKI ecIRel, V( E en with P(() = 0, Vr E N*. Given ( E en with P(() = 0, we apply this result with r the smallest integer greater than IRe 'I. We obtain I.(I  C 1 (1 + IRe(!), V( E en with P(() = 0 where C 1 = Ce c sup (C o IK!)l/r < 00. r E N* Given Y E IRn, we consider ( E en with P(() = 0 such that d(y) = I iy - (I; we have IRe (I  d(y) and hence I . yl  C 1 (1 + d(y)) + II d(y); using (4.9), we deduce (4.12). D 2.2. Estimates of Analyticity We now prove Proposition 1. We fix  E IRn satisfying ( 4.14 ) I . YI1k1Ip(k)(y)1  1 + Ip(y)l, Vk E Nn, Vy E IR n . Given an open set Q of IRn and K a compact subset of Q, we shall estimate the quantities II( . Dy u IIL2(K) as a function of the quantities II( .D r ) Pu II L 2(,Q). Prop- osition 1 will then be deduced easily. We shall split these estimates into several lemmas. Lemma 1. There exists a constant C such that for aU8 E JO, 1J, v E Hm({lxl < 8}) and  satisfying (4.14) (4.15) L 8- 21kl r IP{k)(.DvWdx  ( C ) 2 L8 -2 I k l r IP{k)VJ 2 dx. k*O JIXI<t:/2 8 k JIXI<t: Proof First of all we remark that there exists a constant Co such that (4.16) 1.YI2 L 8-2Ikllp(k)(y)12  ( Co ) 2L8-2IkIIP{k)(yW. k*O 8 k Indeed, if I . yl  1/8, the inequality is trivial with Co  1. Now if 8- 1  I. yl, using (4.14), we have 1 I . yl8 -Ikll p(k)(y)1  - (1 + I p(y)!) 8 
238 Chapter V. Linear Differential Operators and the ineq uali ty is true with 1 + A Co  (N - 1) sup ( ) 1/2' ).  0 I I pk 1 2 + A 2 Ikl=m where N is the number of k E Nn with Ikl  m. Let us now take P E  with supp P c {Ixl < 1} and P = 1 on {Ixl  t} and put Pe(x) = p(X/E) and V e = PeV. On {ixi < tE} we have p(k)( . Dv) = PeP(k)(. Dv) = p(k)(. Dv e ). Hence, using Parseval's formula (see Appendix "Distributions") r IP(k)(.DvWdx :::;: f IP(k)(.DVe)12dX = f IP(k)(Y)Ve(Y).YI2dY. J Ixl < e/2 We denote by 'E the first member of(4.15); using the above inequality, (4.16) and Parseval's formula, }; :::;: ( o r  B - 21kl f lP(k) V, 1 2 dx. Now, from Leibniz's formula 1 1 (1 ) 1/2 IIP(k)v e II L 2  I, II(D a Pe)(p(k+a)v)IIL2  I, IID a PellL OO IP(k+a)vI 2 dx ; a rx. a rx. Ixl<e now (Dap,)(x) = B-1al(Dap)( ; ). so IIDapenLOO  E-laIIiDapIlLOO. We deduce that 'E  ( C O ) 2 NII IDa"LOO B- 21 k+ a l r IP(k+a)vI 2 dx, E k a rx. J Ix I < e from which the result follows by regrouping the terms. D Lemma 2. There exists a constant C such that for  E n satisfying (4.14), for all r E N*, Q an open set of n, K a compact sub-set of Q, u E Hm+r-1(Q) and E E JO, r J with E  dist(K, 8Q) ( 4.17) ( r ) 21kl f I - IP(k)(.DruI2dx k*O E K ( Cr ) 2(r- j) f ( Cr ) 2r ( r ) 21kl f  - 1(.D)jPuI2dx + - I - IP(k)uI 2 dx. O<J<r E {J EkE {J 
94. Local Regularity of Solutions 239 Proof First of all, let us prove the result for r == 1. Putting Qo == {x' E Q; dist (x', K) < 18} we have, for W EL l (Q) r dx' r Iw(x + x')1 dx  b'/3 f Iwl dx Jno Jlxl <8/3 K and f dx' r Iw(x + x')1 dx < b 2 '/3 i Iwl dx no J Ixl < 28/3 n where b 8 is the measure of the ball {Ixl < 8}. We now apply (4.15) with 28/3 and v(x) == u(x + x') with x' E Qo, and integrate over x', using the above inequalities. We then obtain (4.17) (in the case r == 1), with as constant 2 n / 2 (3/2)mc, where C is the constant of Lemma 1. Let us now prove it for arbitrary r E N*. We consider for j == 1,. . . , r { . (r - j + 1) 8 } Qj == X E Q; dlst(X, K) < r and Kj = {x E Q; dist(x, K) < (r  j)8 }. Applying the inequality (4.17) with r j == 1, Qj, Kj, u j == (. D)j-1 U and 8j == 8/r, we have ( r ) 21kl i ( Cr ) 2 ( r ) 2Ikl i L - IP(k)(.D)juI2dx  - L - IP()(.D)j-1uI2dx. k=t=O 8 K. 8 k 8 . J &J Multiply these inequalities by (Cr/8)2(r- j) and sum over j == 1,. . . , r. Since Qj c K j - 1 for j == 1, . . . , r - 1, Kr == K and Q1 c Q, we obtain (4.17) in the general case. D Proof of Proposition 1. First of all, in using the equality S(Q) == {u E Lfoc(Q); Dau E Lfoc(Q), Va E N n }, and remarking that Go == { E [Rn; II  1,  satisfies (4. 14)} is a neighbourhood of 0, bounded in G, we see that u is analytic in S(Q) with respect to G iff { for every compact K of Q and a E Nn, there exists C such that II( . DY Dau IIL2(K)  (CrY, Vr E N*, V EGo. Hence let us fix a compact set K in Q, a E N n and take Qo to be an open neighbourhood of K relatively compact in Q. Using u E S(Q), we see that Co = (IIP(k)Daulli2(Qo)Y/2 < 00 
240 Chapter V. Linear Differential Operators and, using the analyticity of Pu in S(Q) with respect to G, that 1 C 1 = sup sup - II(. Dr PDaulll(Qo) < 00. r E N*  E Go r Finally we put G = inf(1, dist(K, oQo)) and Co = ( L I p(k) 1 2 ) 1/2 > O. Ap- Ikl=m plying (4.17) with  EGo, r E N*, Qo, K, Dau, G, we have c6 (  ) 2m f I(. Dr D a ul 2 dx::::; L ( Cr ) 2(r- j) (Cd)2 j + ( Cr ) 2r (  ) 2m ci';; G K 0 <j<r G G G from which 1 sup sup - II( . Dr Daullllf(K) < 00. r E N*  E Go r D 2.3. Generalisa tion: Gevrey Classes More generally, given Q an open set of IR n , G a linear sub-space of IR n al}d b > 0, we say that u is in the Gevrey class of order b in S(Q) with respect to G 39 if { for each compact set K of Q and each (X E N n , there exists a constant C such that I(. Dr Dau(x)1 ::::; (CI\rl/0Y, Vr E N*, Vx E K, V E G. In taking up again the proofs of Propositions 1 and 2 we can prove (see Treves [1J, Hormander [1J, pp. 113-114): Proposition 3. Given b > 0, let us consider the set G lJ of the  of IRn satisfying { there exists a constant C such that I . YIOlkllp(k)(y)\ ::::; C(1 + Ip(y)1) Vk E Nn, Vy E IR n . ( 4.18) It is a linear subspace of IRn and for every open set Q of IRn and u E S(Q), Pu is in the Gevrey class of order b in S(Q) with respect to Go => U is in the Gevrey class of order b in S(Q) with respect to Go. Proposition 4. Given b > 0 and  E IR n , suppose that P is hypo-elliptic and that there exists an open set Q of [Rn and X O E Q such that for all u E S(Q) with 39 The Gevrey class of order <5 in (Q) with respect to G = [Rn is the set of u E (Q) satisfying { for all compact K of Q, there exists C such that IDau(x)1  (ClaI 1 / O )la l , Va E Nn, Vx E K It coincides with the space  Mk(Q) defined in Lions-Magenes [1], Vol. 3 (definition 1.3) for M k = kk/". 
94. Local Regularity of Solutions 241 Pu = 0, the mapping t  u(Xo + t) is in the Gevrey class of order £5, i.e. { there exists a constant C such that I ( . Dyu(xO)1  (Crl/y, Vr E N*. Then  satisfies (4.18). Example 1. Suppose that P is semi-elliptic of total order m  1. Let us denote by m j the order of P with respect to Dj. We then have (see the proof of Proposition 8 of g2) 1 + Ip(y)1  c o (1 + IYll m1 + . . . + IYnl mn ) with Co > 0 and Ip(k)(y)1  C o (1 + IYll m1 - k1 + . . . + IYnl mn - kn ). Hence m. IYjl n: 1k1Ip(k)(Y)1  C(1 + Ip(y)l). We deduce that for £5 > 0, G  is the sub-space generated by the base vectors e j such that mj/m  £5. D In the general case, the mapping £5  G is decreasing. We note that if £5 > 0 and G =1= {O}, then £5  1 since from (4.18), for  E C coll/jm.t'jm < C(l + p G ) ) < C(l + Am), VA > O. Finally, if P is hypo-elliptic, from (4.11) we see that there exists £5 > 0 such that G = n. In other words, if P is hypo-elliptic, there exist, possibly after change of coordinates, £5 1 ,. . . , £5 n > 0 such that for all £5 > 0, G  is the subspace generated by the base vectors e j such that £5 j  £5. The reader will then be able to show that then u is, for all £5 > 0, in the Gevrey class of order £5 in S(Q) with respect to G iff { for every compact set K of Q, there exists a constant C such that I D il u (x) I  C I ill ex 1/  1 . . . exn / n , V X E K , Vex E N n. The class r(Q) thus defined, satisfies u E f0'(Q) and Pu E r(Q) :z> U E r(Q). 3. Comparison of Operators We have seen, throughout the course of this study of local regularity, the importance of the function defined on n X  + by 
242 Chapter V. Linear Differential Operators (4.19) p(y, t) def (  t 2Ik1 Ip(k)(yW Y/2. We call p the weight function associated with P. We note that there exist c > 0, depending on P such that (4.20) ct m  p(y,t)  c- 1 (1 + t 2 + y2)m/2, V(y,t)E IR n x IR+. Proposition 5. Let P and Q be two l.d.o. with constant coefficients on IR n ; then the following assertions are equivalent: (1 . ) h . 0 h h I q (y ) I t ere eXlsts t > suc t at sup _ ( ) < 00 Y E [Rn P y, t ( . . ) h . 0 h h q(y, t) 11 t ere eXlsts t > suc t at sup _ ( ) < 00 Y E [Rn P y, t ( ... ) fi 11 0 q(y, t) 111 or aT> , sup sup _ ( ) < 00. t  T Y E [Rn P y, t Definition 1. When the equivalent conditions of Proposition 5 are satisfied we say that Q is weaker than P, denoted by Q >- P. If P >- Q and Q >- P, we say that P and Q are of the same force. Finally we say that Q is dominated by P, denoted by Q  P, if for all A E C, P and P + AQ are of the same force. Proof of Proposition 5. We have trivially that (iii)  (ii)  (i); let us now show that (i)  (iii). First of all, by Taylor's formula (4.21) Ip(y + ()I  C1P(y,I(I), Vy E IR n , V( E cn with C 1 = ( L (  ) 2 ) 1/2 Olklm k! . We now consider ((a) a family of distinct points of {( E cn; 1'1  1} indexed by A = {a E N n ; lal  m}. We then have (4.22) p(y, t)  C 2 sup Ip(y + t(a)1 aEA where C 2 depends only on the family ((a). Indeed, from Lagrange's interpolation formula (see e.g. Laurent [lJ, 95.4), there exists a family of polynomials (Pa)aEA such that for every polynomial P of degree  m p = LP((a)Pa. Applying this result to the polynomial (  p(y + t() we have p(y + t() = L p(y + t(a) Pa(() from which by differentiating with respect to ( at the point ( = 0 we obtain tiki p(k)(y) = L p(y + t(a) pk)(O); 
4. Local Regularity of Solutions 243 we deduce (4.22) with C 2 == L L Ipk)(O)I. lal m Ikl m We can always suppose P and Q to be of order m and take (a E IR n . Let us suppose that Iq(y)1  Cop(y, to); we then have for all t > 0 q(y, t)  C 2 sup! q(y + t(a)1  C 2 Co sup p(y + t(a, to) a a  CCosupsuplp(y + t(a + to(p)1  CCOCIP(y,t + to) a p  QC o c 1 (1 + t; r p(y,t). o We note first of all that P is elliptic of order m iff p(y, 1)  c(l + y2)m/2 with c > O. Hence, using (4.20), we find that all elliptic operators of the same order mare of the sameforce; in addition they are stronger than (resp. dominate) all operators of order  m (resp. < m). The reader will be able to show that in a general way, semi-elliptic operators of the same partial orders m 1 , . . . , m n are of the same force and that they are stronger than (resp. dominate) the operators P == L aaDa (resp. la:ml  1 P == L aa Da ) (see Proposition 8 of 9 2 ). la:ml < 1 An elliptic operator is of the same force as its principal part. More generally, we have the ( 4.23) Proposition 6. Suppose that the principal part p. of P satisfying ape I Oj m # 0, VElRn\{o} then P and p. are of the same force. This result stems from Proposition 9 of 92; the proof makes apparent the property of domination: in effect, this result expresses that p. dominates all operators of order less than m. Proof By homogeneity, we have p.(y, t) == tmp. ( , 1 )  t m ( I opO (  ) 2 + I I p O(k)1 2 ) 1/2 t j aYj t Ikl = m  ct(tm-1 + !ylm-l) 
244 Chapter V. Linear Differential Operators with {( ) 1/2 ( op. 2 ) 1/2 } C = min L Ip.(k)1 2 , inf  a () > o. Ikl=m 11=1 ] yj On the other hand, there exists C such that for all (y, t) E IR n x [1, 00 [  Ip(y, t) - p.(y, t)1  p - p.(y, t)  C(t m - 1 + IYlm-1) so for t > max(l, Cjc), (1 -  ) p. (y, t)  p (y, t)  (1 +  ) p. (y, t), from which the result follows. o We refer to Hormander [lJ for a more systematic study of the comparison of operators. In particular, it is proved that Q  P iff 1 . q(y, t) 1m sup ,.., = o. t -+ 00 Y E [Rn P ( y, t) Let us, meanwhile, prove the Proposition 7. Let P and Q be two i.d.o. with constant coefficients on IRn of the same force. Then P is hypo-elliptic iff Q is. Proof. Suppose that P is hypo-elliptic. Then from the characterisation (4.11), there exist b > 0 and C such that blkl Ip(k)(y)1  C(l + y2)-T(1 + Ip(y)I), which shows that p(y, (1 + y2)d/2)  NC(l + Ip(y)1) where N is the number of elements kEN h such that I k I  m. Since Q -< P we deduce that for a certain constant C q(y, (1 + y2)d/2)  C(l + Ip(y)I). Now since P -< Q for a constant C 1 + Ip(y)1  Cq(y,l); so for a certain constant C q(y, (1 + y2 )d/2)  Cq(y, 1). We deduce that L [(1 + y2)lk l d - C 2 Jlq(k)(y)1 2  (C 2 - 1)lq(Y)1 2 , k=l=O from which the hypo-ellipticity of Q follows. o 
4. Local Regularity of Solutions 245 4. Local Regularity for Operators with Variable Coefficients and of Constant Force We consider an open set Q of IR n and P = L aa Da a 1.d.o. on Q with variable coefficients aa which for simplicity we assume to be in $ (Q). Definition 2. We say that P is of constant force if for all xl, x 2 E Q, the operators with constant coefficients L aa(xi )D a and  aa(x 2 )D a are of the same force. Example 2. We say that P is elliptic if its principal part p. satisfies p.(x,) =1= 0, Vx E Q, V E IRn\ {O}, that is to say if, for all x E Q, the 1.d.o. with constant coefficients L aa(x)D a is elliptic of order m. Since all the elliptic 1.d.o. with constant coefficients of the same order are of the same force, an elliptic operator is of constant force. o Example 3. Let Q be an open set of IRn+ 1 and P = a%t + Po a 1.d.o. on Q with coefficients in $(Q), where Po = L aa Da depends only on the deriva- lal m tives with respect to the space variables D a = aa / ax a . We suppose that for all (x, t) E Q, { a(x, t) =1= 0; we put e(x, t) = arg a(x, t) L aa(x, t)D a is e(x, t)-strongly elliptic. Then P is of constant force. This comes from the fact that if Po and Qo are two e-strongly elliptic 1.d.o. with constant coefficients on IRn, then a/at + Po and c/at + Qo are of the same force. o We now state the Proposition 8. Let P =  aaDa be a l.d.o. on Q with coefficients in $ (Q) (resp. analytic on Q). We suppose that P is of constant force and that for x E Q, the l.d.o. \vith constant coefficients L aa(x)D a is hypo-elliptic (resp. hypo-analytic). Then P is hypo-elliptic (resp. hypo-analytic), that is to say: U E !»' (Q) and Pu E $ (Q) (resp. analytic on Q)  u E $ (Q) (resp. analytic on Q). We shall give a proof of the hypo-ellipticity property: for a proof of the hypo- analyticity property we refer the reader to Hormander [1], pp. 178-180 and Lions-Magenes [1]. Proof. We take Xo E Q and put Po =  aa(xO)D a . We show that in the neighbourhood of xO, Pou is  00. It is obvious that the set of 1.d.o., with con- stant coefficients, weaker than Po is a finite-dimensional linear space; let us consider Pi' . . . , P N 1.d.o. with constant coefficients weaker than Po such that 
246 Chapter V. Linear Differential Operators Po, . . . , P N is a base in this linear space. In the neighbourhood of X O we can, after division by a function of $ (Q) which nowhere takes the value 0, hence suppose that P = Po + L crPr with C r E $ (Q) and cr(X O ) = o. Using the hypo-ellipticity of Po we see that there exist C and b > 0 such that its characteristic polynomial Po satisfies (4.11). Given (J E IR, suppose that there exists Q(:T' an open neighbourhood of X O in Q such that Pou E H1:c(Qa): this is always true for (J sufficiently small. Using Pr « Po in the proof of Theorem 1, we see that Pr(k)U E H Ik 1 t5(Qa) V k E N n and r = 0, . . . , N. Given p E !» (Qa) we have pPou = pPu - L crpPr u . pPru = Pr(Pu) - L - k 1 Dkpp?)U, so k =1= 0 ! Now: where Po(pu) = f - L c;Pr(pu) f = pPu - L .2, Dkp ( P6 k )U - L CrPr(k)U ) E Ha+t5 k=l=ok. r and c E !» with c = C r in the neighbourhood of supp p. But, using Pr « Po, we see that there exists C r such that IPr I  C r (1 + Ipo I); we deduce that IIcP,pulll.H = fi''2*p;Ptl,2(Y)(1 + y2)"Hdy  C,2 ff (2(y - y'W(1 + IPo(Y')1)21(y'W(1 + y2)"Hdydy'. Now (1 + y2)a+t5  C(1 + y'2)a+t5(1 + (y _ y')2)la+t5 1 so there exists a constant C r such that II c;PrPu II H a + b  C r II c II Hla+bl (II pu II H a + b + II Po(pu) II H a + b ). Taking the support' of p sufficiently small, we can always suppose that C = L C r II c II Hla+bl < 1, and hence that: 1 IlpullHa+b + IIPo(pu)IIH a + b  (1IpuIIHa+b + IlfIIH a + b ). 1 - C We obtain the stated result by induction. o 
4. Local Regularity of Solutions 247 5. Construction of an Elementary Solution We suppose again that P is a l.d.o. with constant coefficients; we put p(y) == p(y, 1 ). We shall construct an elementary solution E of P satisfying ( 4.24) /'..... ppE E L OO Vp E 5». This property expresses a local regularity of the elementary solution which we shall construct. Remark 1. (1) If P is elliptic, (4.24) can be written /'..... (1 + y2)m/2pE EL oo , VpE5» or agaIn (4.25) pDrxE E % -1 L OO , Vp E 5», Va E N n with lal  m. (2) The reader will verify that in the case of a semi-elliptic operator, (4.24) expresses that pD rx EE%-1L OO , VpE5», VaEN n with la:ml  1. We should notice that this local regularity is the best that we can hope for, at least in the neighbourhood of zero (see Treves [1], Hormander [1]). o We now take Y1, . . . , Yr a finite sequence of elements [Rn such that ( 4.26) p(y)  C sup inf Ip(y + eiOYj)l, . Vy E [Rn. 1jreE[R Such a sequence always exists: it can even be shown (see Hormander [1], p. 68) that given N the dimension of the space of l.d.o. with constant coefficients of order  m, and Y1, . . . , YN a sequence of N distinct elements of [Rn, there exists a constant C such that (4.26) holds with the sequence of the N(m + 1) elements 'kYj/m; k == 0,..., m,j == 1,... ,N} for each operator of order  m. \'le notice also that (4.26) is satisfied with the single element Y1 == 0 iff Ip(k)(Y)1  CI p(y)l, Vk E N n , and Y E IRn. This is the case for P == )" - L1 with )" E C \ IR -. It is also the case for P = A + :t - L1 with Re}, > O. Having chosen Y1, . . . , Yr satisfying (4.26), we consider the closed parts: Fj == {y E [Rn; p(y)  Clp(y + eiOYj)1 ve E IR}. By hypothesis uF j == [Rn. We shall denote by ((Jj the characteristic function of Fj (F 1 u . . . U F j - 1 ). Remember only that 
248 Chapter V. Linear Differential Operators (4.27) ({J l' . . . , ({Jr are measurable functions from IR n into [0, 1] such that ({J 1 + . . . + ({J r = 1 and for j = 1, . . . , rand Y E IR n ((Jj(Y) > 0  p(y)  Clp(y + eiO Yj)1 \Ie E IR. Given u E , we then define (4.28) E(u) = (2n) -n-l  f ({J. (y) dy i 21t (ff it )(y  eiO Yj) de '1] 0 p(y + e l8 Y j) where it(x) = u( -x). We note first of all that the integral is convergent and that E: u  E(u) is a distribution on IR n . Indeed from (4.27) and (4.20) L ((Jj(Y) C -1.    Cc , j Ip(y + eiOYj)1 p(y) on the other hand for all kEN 1 ( ff it ) (y + eiO Y j ) 1  (1 + Y 2 ) - k II (1 - L1)k ( e - ie i6 Y j x U ( - x) II L 1 . Now oW Ck(u) = sup II (1 - L1 )k(e- Ze YjX U ( - x)) II L1 8 E [R 1 jr is a continuous semi-norm on , and for k > n12, M k = f (1 + y2)kdy < 00. Hence, for all u E , the integral defining E(u) is convergent and IE(u)1  (2n)-n c -l CMkCk(U). We then state the Proposition 9. Given elements Yl, . . . , Yr of IRn satisfying (4.26) and ({Jl, . . . , ({Jr satisfying (4.27), the formula (4.28) defines an elementary solution of P satisfying /""'-.... pE E!/' and ppE E L OO for all p E $ such that p(x)ch(yjx) E!/ for all j = 1,. . . , r. Note that in particular E satisfies (4.24). The property stated in the proposition expresses, in addition to the local regularity of E, its behaviour at infinity. Proof First of all, E is an elementary solution of P. Given u E , we have (PE, u) = (E, tpu) and CPu)( -x) = (Pu)(x); 
9 4 . Local Regularity of Solutions 249 so (PE, u) = (2n)-n-l  f Ipj(y)dy fit ($'i't)(y + eiOyJdB. But for v E , i 21t '>i .0 f . i . dz '>i u(y + e t Yj) dlJ = e-tYxv(x) dx e -tZYjX  = 2nv(y). o I Z I = I lZ Hence using I ([Jj = 1, (PE, u) = (2n)-n f ($'i't)(y)dy = «(j, u), this being true for all u E , PE = b. Now we fix P E $ such that p(x)ch(yjx) E!/ for j = 1,. . . , r. 40 Taking u(x) = p(x)v( -x) with v E , we have ($'u)(y + ei6 y J = f e-ixty+ei.yJp( -x)v(x)dx = p;)(y) . i6 with PO,j(x) = e- te YjX p( -x). Using (4.28) and (4.27), we thus have I(E, u)1 = IE(u)1 :( (2n)-n-1C fit We shall show that /"--.... Po, j v dlJ. p L1 (4.29) /"-.... P o .V , ] A V :( Mo - L1 P L1 "'" p where M 0 depends only on v. We shall then have shown that for all u of the form pv with v E , A v I(E, u)1 :( (2n)-nM  P L1 where M = CM or does not depend on v. In other words for all w of the form ff -IV/p with v E, I (pE,  (pw) 1 :( M II w II L1. The Proposition then follows from the density in L I of the set of the w's of the form /F-I!V/p with v E. To prove (4.29) we observe first of all that the functions e -ie i6 Yjx ch(yjx) 40 Note that the conditions p E ff, p(x)ch(yjx) E g imply that p = {p(x)ch(yjx)} ch(y j x)-l E g. 
250 Chapter V. Linear Differential Operators all lie in a bounded set of (!) M. Hence, from the hypothesis, the functions Po, j all lie in a bounded set of f//. It is therefore the same for the f;}ctions P;;; in particular M = sup f (l + y2)m/21 p(Y)1 dy < 00. OE[R 1 jr /'--... /'-... A Now we have PO,jV = PO,j * v and by Taylor's formula p(y')  C(l + (y - y')2)m/2 p(y), so that  I Po, J .v(y)1 f "v(y')  C (1 + (y - y')2)m/2 Po .(y - y') '" dy' p(y) , J p(y') from which  P o .V , J A V  MC - P L1 P L1 o 5. The Maximum Principle* 1. Prerequisites We assume that we are given an open set Q of IR n . We recall the most classic form of the maximum principle (see Chap. II, 2, Proposition 8): 1 Suppose that Q is connected and that u is harmonic on Q; then either u is constant on Q, or infu < u(x) < sup u, Vx E Q (said otherwise: u attains neither its maximum nor its minimum at any point of Q). Note the link with Harnack's inequality (see Chap. II, 2, Proposition 12) which we express in the following form: 1 Suppose that Q is connected and that K c Q is compact; then (5.2) there exists a constant C such that for every positive harmonic function u on Q max u  C min u. K K (5.1 ) Since if u is harmonic, the same is true of Au + J1 so (5.1) leads in effect to the property: { suppose that Q is connected and that u is positive and harmonic on Q, then there exists x E Q, u(x) = 0 => u = 0, which is a trivial consequence of (5.2). 
95. The Maximum Principle 251 An immediate consequence of the maximum principle (5.1) is the uniqueness of the solution of the Dirichlet problem for the Laplacian: ! Suppose that Q is bounded and tha there exists a function u harmonic on Q and continuous on Q, then u = 0 on aQ => u = 0 on Q. (5.3) We recall (see Chap. II, 94.6) that a sub-harmonic function on Q is a mapping u: Q  [- 00, 00 [ u.c.s. satisfying ! for ever open set Qo relatively compact in Q and every function U o E  (Qo) harmonic on Qo, u  U o on aQo => u  U o on Qo. We have the maximum principle (see Chap. II, 9 4 . 6 ): (5.4) ! Suppose that Q is connected and that u is sub-harmonic on Q, then either u is constant on Q or u(x) < sup u, Vx E Q. We shall refer to this result as the principle of the strong maximum in contrast to the following principle of the weak maximum which it trivially implies: (5.5) Suppose that Q is bounded and that u is sub-harmonic (resp. harmonic) on Q and u.c.s. (resp. continuous) on Q, then (res p . and min u = n: u ). max u = max u an (where U.c.s. means upper semi-continuous.) Now if u E 2(Q), then u is sub-harmonic on Q iff L1u(x)  0, Vx E Q41. Another weak form of (5.4) is therefore the local property: { Suppose that u E  2(Q) and x E Q, then (5.6) u(x) = max u => L1 u(x)  O. which we can verify directly. We shall see that this property is in direct relation with the maximum principle for the heat operator (ajat - L1) and the dissipativity of the Laplacian for the uniform norm. 41 More generally, moreover, given U E Llc(Q), there exists u: Q ---+ [ - 00, 00 [ sub-harmonic such that u = u almost everywhere on Q iff L1u  0 in !0'(Q) (see Chap. II, 9 4 . 6 ). 
252 Chapter V. Linear Differential Operators 2. Parabolic Maximum Principle and Dissipativity We take T E ]0,00 [and denote by Q the product Q x ]0, T[. By the parabolic boundary of Q we mean the part: apQ = Q x {O} u aQ x [0, T]. We put m,l(Q) = l(]OT[; (Q)) n (]O, T[; m(Q). Given a locally compact space X(X = Q, Q, Q, etc. . . . , not necessarily bounded), we define o(X) = {u E (X); VB > 0, {x E X; lu(x)1  B} is compact in X}; it is a closed sub-space of b(X) = {u E (X); u bounded on X}42 We shall prove the Theorem 1. Let P be a l.d.o. with (real) continuous coefficients on Q of order m and c E IR. The following assertions are equivalent: (i) P is of order m  2, that is to say a 2 a P = L L ajk a a + L . aj ax . + a o j k X j X k J J and satisfies (5.7) L L ajk(x)j k  0 j k and ao(x)  c (ii) for all u E m(Q) and x E Q u(x) = max u  0  Pu(x)  cu(x); Vx E Q, V E IRn; 42 If X = Q is a bounded (resp. non-bounded) open set of [Rn, o(X) identifies with {u E (Q); u = 0 on oQ (resp. and lim u(x) = O)}. Ixl-+oo If X = Q, then o(X) = (Q) (resp. {u E (Q); lim u(x) = O}). Ixl-+oo Finally, when Q is bounded, {u E (Q); u = 0 on opQ} identifies with o(X) with X = Q x ]0, T]; if Q is non-bounded, for X = Q x ] 0, T], o(X) = {u E (Q); u = 0 on opQ and lim u(x, t) = 0 uniformly w.r.t. t}. Ixl-+oo Note also that o(X) is the closure in b(X) of the space of continuous functions with compact support in X. 
95. The Maximum Principle 253 (iii) for all u E m(Q) n o(Q) and A > 0 with AC < 1, II U IIL oo  (1 - AC)-l II u - APU IIL oo ; (iv) for all u E m,l(Q) n ([O, T]; o(Q)), lIu(T)llv" :::; eCTllu(O)llv" + I T ec(T-t)  (t) - Pu(t) L'" dt; (v) the coefficient ao = Pi satisfies ao(x)  c, Vx E Q, and for all u E  m, 1 (Q) au - -  Pu on Q and u+ E o(Q\apQ)  u  0 on Q;43 at (vi) for every bounded open set Qo contained in Q and u E m, 1 (Q 0) n  (Q 0)44, au _ a Pu on Qomaxu(T)=maxeC(T-t)u+(x,t). t BpQo We shall prove the theorem according to the following scheme: (v)  (iv)  (iii)  (ii)  (i)  (ii)  (v) and (vi) 11 (vi) Proof. Proof that (v)  (iv) Given u E m,l(Q) n ([O, T]; o(Q)), let us consider ii(x, t) = u(x, t) - (e ct II u(O) II L'" + I ec(t-s)  (s) - Pu(s) L'" dS). We have U E m,l(Q), u+ E o(Q\apQ), au au au at (t) - Pii(t) = at (t) - Pu(t) + (a o - c)(u(t) - ii(t}} - at (t) - Pu(t) L '" Using ao :::; c on Q, we find that  :::; Pii on Q. Hence, using (v) we have ii :::; 0 on Q and, by continuity u(x, T) :::; e cT II u(O) II L'" + I T ec(T-t)  (t) - Pu(t) L'" dt. Replacing u by -u we hence obtain (iv). o 43 u+ = max u(x, 0). 44 Where Qo = Qo x ] 0, T[. 
254 Chapter V. Linear Differential Operators Proof that (iv)  (iii) Given U o E CCm(Q) n CCo(Q) and A > 0 we consider u(x, t) = et;;.uo(x). We have u E m,l(Q) n CC([O, T]; CCo(Q)) and OU at - Pu = A -1 etlA(u o - APU o )' (iv) then gives e T 1.1. II U o II L'" ::::;; e cT II Uo II L'" + (IT A - 1 e(tl A) +c(T - t) dt) II U o - APU o II L"" Supposing AC < 1, we then have (e T /;' - eCT)lIuollLoo  (1 - Ac)-l(e T / A - eCT)llu o - APuoII L oo from which we deduce (iii) on dividing by e T / A - e cT > o. o Proof that (iii) or (vi)  (ii) We take u E CCm(Q) and Xo E Q with u(Xo) = max u  o. We put v(x) = p(x)(u(x) + 8), where 8 > 0 and p E (Q) satisfy suppp C {x E Q;u(x)  - 8}, o  p(x) < p(Xo) = 1 for all x E Q \ {XO}, and DClp(xO) = 0 for all a E N n with 0 < lal  m. We have V E m(Q) n o(Q), o  v(x) < v(XO) = u(XO) + 8 for all x E Q \ {XO}, and Pv E o(Q), Pv(XO) = Pu(XO) + 8a o (xO). Let us suppose (iii). Given A > 0 with AC < 1 we consider x A E Q such that I v(x A) - APV(X A) I = II v - APV II L OO . We have II V IIL oo = v(XO)  v(x A ) = Iv(xA)1  II v IIL oo - 2A II Pv IIL oo . Hence v(x;.) --+ v(XO) when A --+ O. Using the fact that v(XO) is a strick maximum of v, we find that x;. --+ XO when A --+ O. Now we have v(x A ) - APV(X A )  v(XO) > 0 when A --+ 0 and so for A sufficiently small v(x A) - APV(X A) = II v - APV II L 00. 
g5. The Maximum Principle 255 From (iii) we then have ( ) v(x;J - II v - APV IIL oo Pv X A = A v(x A ) - (1 - AC) II V IIL f 0 )   CV,X . A Passing to the limit we find that Pv(XO)  cv(XO), that is to say Pu(XO)  cu(XO) + e(c - ao(xO)). Letting e --+ 0, we obtain (ii). Now let us assume (vi). Suppose that Pu(XO) > cu(XO); then we show that this leads to a contradiction. For e sufficiently small, we shall also have Pv(XO) > cv(XO); by continuity there will exist r o E ]0, dist(xO, 8Q)[ and c' > c such that Pv  c'v on Qo = {Ix - xOI < r o }. Putting w(x, t) = ec't v(x), we have w E m,l(Qo) n (Qo), ;  Pw on Qo, max ec(T-t)w + (x, t) = max ( e CT Il!.ax v(x), ec'T max V(X) ) OpQo no ono < ec'T v(XO) = max w(T). which contradicts (vi). o Proof that (ii) <=> (i). We assume (ii) and first of all show that I a(lD(l is of order m  2. Given XO E Q and a E N n with lal > 2, we take A E  and P E (Q) satisfying o  P  1, P = 1 in the neighbourhood of xO, { a! } supp p c x; Ix - xOllal-2  21AI with the result that the function ( Ix - xOl 2 A ) u(x) = p(x) 1 - 2 + -(x - xO)a a! satisfies u E (Q), u(XO) = 1 = max u and 8 2 u Du(xO) = 0, a a (XO) = b jk , DPu(xO) = b ap for IPI > 2. Xj X k Using (ii), we have Pu(XO) = c(XO) + Aa(l(xO)  c, where c(XO) = I ap(xO)DPu(xO) IPI  2 
256 Chapter V. Linear Differential Operators is independent of A. Varying A over , we deduce that a(l(xO) = O. We shall show now that P satisfies (5.7). Given XO E Q, we consider for A  0 and  E n the function 1 u(x) = A - 2(.(x - XO))2; we have u E t!(Q), u(XO) = max u = A, Du(xO) = 0, 00 2 0 u (XO) = - j k' x j Xk Using (ii), we have Pu(XO) = Aao(xO) - I L ajk(xO)jk  AC, that is to say (5.7). We now assume (i) and take u E CC 2 (Q) and XO E Q with u(XO) = max u  O. The eigenvalues A l , . . . , An of the real symmetric matrix (aij(xO)) are all real non-negative. We consider (e'l' . . . , e) an orthonormal basis of associated eigenvectors; with respect to the coordinates (X/l' . . . , x) in this basis P can be wri tten P _ "", 0 2 ", a - i...J i...J a jk a ' a ' + i...J a j - a ' + a o X j Xk x j where a}k(xO) = Ajb jk . Since u attains its maximum at xo, au ° 02U - a ' (x ) = 0, _ 0 ,2 (XO)  0; hence x j x j ° " ( 02U ° , ° au 0 ) ° 0 ° Pu(x ) = i...J Aj OX'.2 (x ) + a j(x ) ax'. (x) + ao(x )u(x )  cu(x ). J j o Proof that (ii) => (v) and (vi). First of all, we assume (ii) and show that it implies (v). Let us take au - u E m,l(Q) with at:::;; Pu on Q and u+ E o(Q \opQ); Suppose, on the contrary, that there exists (xo, to) E Q such that u(XO, to) > O. Given c' > c, consider v(x, t) = e-c'tu(x, t); we have v+ E CCo(Q/opQ) and v(XO, to) > 0; hence there exists (x, "i) E Q x JO, toJ such that v(x, t) = max v. We then n x ] 0, to] have ov/ot(x, t)  0, and, using (ii), Pv(x, t)  cv(x, t); but OV _ c' t ( au ) ' , - - Pv = e - - Pu - c v  - c v at at ' 
5. The Maximum Principle 257 so (c' - c)v(x, t) < 0, which contradicts c' > c and v(x, t)  v(X O , to) > O. Now we shall prove (vi). The hypothesis (ii) is local, so we can apply the result (v) to every open set Qo contained in Q. We suppose Qo is bounded and take - au u E m, l(QO) n  (Qo) with at  Pu on Qo; we put: m = max e-ctu(x, t) and v(x, t) = u(x, t) - mect; OpQo we have v E m,l(Qo)n(Qo)' V  0 on opQo, ov au t - - Pv = - - Pu + ( a - C ) me C  o. at at 0 , applying the preceding result, we deduce that v  0 on Qo. D Remark 1. The property (v) is a parabolic comparison principle. Note that when Q is bounded, the condition u+ E o(Q\opQ) is implied by u E (Q) and u  0 on opQ; this is no longer the case when Q is not bounded, this hypothesis including a condition at infinity. D When Q = n and P is the Laplacian, we have for u E 1([0, T]; !7') (see 3) (5.8) u(T) = E(., T) * u(O) + LT E(., T - t) * (  (t) - AU(t)) dt, where E is the elementary solution of the heat operator: Y(t) _ E(x, t) = (4ntt/ 2 e 4'. Since E  0, we deduce { Vu E 1([0, T]; !7') with ou/ at  Au in '(n x] 0, T [) and u(O)  0 in ', we have u(T)  0 in '. The condition at infinity is here very weak. We sall prove now the parabolic comparison theorem under the following sufficiently genral conditions: Proposition 1. Let Q be an open set of n X ]0, T[, a ( 02 A ) .P = _ a - LLa jk a a + La j - o + a o t j k Xj Xk j Xj a l.d.o. with (real) continuous coefficients on Q and u, v E 2, l(Q) n (Q). We suppose LLajk(x, t)jk  0 and ao(x, t)  c, V(x, t) E Q, V E n. j k 
258 Chapter V. Linear Differential Operators When Q is not bounded, we assume in addition that Ia jj , j Ilajl, j u - v have an upper bound in Q. Then fflu  fflv on Q and u  v on opQ  u  v on Q. When Q is an arbitrary opn set of n x J 0, T [, the parabolic boundary 0 pQ of Q is the complement in Q of {(XO, to) E Q; 3r > 0, {Ix - xOI < r} x Jt o - r, toE c Q} and rem, l(Q) is the space of the u E re(Q) such that ou/ot and D u are continuous on Q for all a E Nn with lal  m. Proof of Proposition 1. By linearity we can always reduce to the case v = O. In the case when Q is bounded, the proof is identical with that of the implication (i)  (v) in Theorem 1; Q not necessarily being of the form Q x JO, T[ it is enough to remark that there exists a sequence (Qn) of open sets, relatively compact in Q U opQ, covering Q. We consider now the case Q unbounded; replacing u by e-ctu, we can always suppose that c = O. Taking (XO, to) E Q and being given R > 0, we consider QR = Q n {Ix - xOI < R} x JO, T[ and M UR(X, t) = u(x, t) - 2 (Ix - xOl 2 + 2(c + c'R)t) R where M = supu, c = sup Ia jj , c' = sup Ilajl. Then QR is a bounded open set of [Rn x J 0, T [, U R E re 2 ,1(QR) n re(QR)' 2M ( " " 0' 1 ° 2 ) fflu R = 2u + R 2 i...Jajj - c + i...Jaj(Xj - X j ) - c R + 2 aolx - x I  0 OpQR C opQ U {Ix - xOI = R} x [0, TJ, U R  U  0 on opQ, U R  u - M  0 on {Ix - xOI = R} x [0, TJ. Applying the result in the bounded case, we have that UR  0 on QR, and hence, in particular, that ° 2M , ) u(x , to)  ---z(c + c R to. R In the limit when R --+ 00, we therefore have u(XO, to)  O. o ---- 
g5. The Maximum Principle 259 3. Characterisation of Operators P Satisfying Maximum Principles 3.1. The Weak Maximum Principle We take P to be a l.d.o. of order m with (real) continuous-coefficients on an open set 0 in n. When 0 is bounded, we say that P satisfies the weak maximum principle on 0 if45 Vu E rcm(O) n rc(O) with Pu  0 on 0, max u = max u. aD We prove the Proposition 2. (a) Suppose that for every open set 0 0 , relatively compact in 0, P satisfies the weak maximum principle on 0 0 , then P is of order m  2 without a zero order term: (5.9) 8 2 8 P = LLa jk 8 8 + La j ;;- j k x j X k uX j and satisfies (5.10) LLajk(X)jk  0, j k VXEO, VEn. (b) Let P be given by (5.9) satisfying (5.10); suppose 0 is bounded and (5.11) there exists cp E 2(0) such that Pcp(x) > 0 for all x E Q. Then P satisfies the weak maximum principle on o. Proof Under the hypothesis of (a), we show that for u E rcm(O) and Xo E 0 u(xo) = max u => Pu(xo)  o. The conclusion of (a) will then be a consequence of the implication (ii) => (i) of Theorem 1. As we have seen in the proof of the implication (iii) or (iv) => (ii), we can always suppose that u(x) < u(xo) = max u for all x #- xo. Given r with r E JO, dist(xo, 80)[, we apply the weak maximum principle on Or = {I x - Xo I < r}; since max u < u(xo), there exists X r E Or such that aQo Pu(xr) < 0; letting r --+ 0, we obtain Pu(xo)  o. We now prove (b). We suppose that there exists u E rc 2 (0) n rc(Q) such that Pu  0 on 0 and m = max u > max u. Replacing u by u - max u, we can aD aD always suppose that maxu = o. We consider 0 0 = {x; u(x) > tm}: this is an ail 45 From Proposition 2 we see that operators satisfying the maximum principle are generators of diffusion processes. 
260 Chapter V. Linear Differential Operators open set, relatively compact in Q; hence there exists 8 > 0 such that maxcp < m/48. We then have max(u + 8cp) > im > max(u + 8cp) which Do Do aDo contradicts, by Theorem 1, the fact that P(u + 8cp) > 0 on Qo. o Remark 2. The necessity of the conclusion in (a) of Proposition 2 is linked to the localisation of the hypothesis: we can obtain maximum principles for l.d.o. of order greater than two. For example the operator P = A - A 2 satisfies: Vu E CC 2 (1R n ) n g'(lRn) with Pu  0 in '(lRn) and u non-constant on IR n , u(x) < max u(x'), Ix' - xl = r In fact P = (1 - A)A, and the operator 1 - A is a bijection of g' onto itself so (1 - A) - 1 conserves the order; hence if u E g'(lRn) with Pu  0 in '(lRn), we have Vx E IR n , Vr > O. Pu E g'(lRn) and Au = (1 - L1) - 1 Pu  0 in '(lRn); if u E CC 2 (lR n ), we deduce that u is sub-harmonic from which the conclusion follows. We observe that if h is a diffeomorphism of IR n onto a bounded open set Q, then the operator Q = h*P satisfies the weak maximum principle on Q: in effect if v E CC 2 (Q) n CC(Q), then u = v 0 h E CC 2 (lR n ) n g'(lRn) and Qv  0 on Q if Pu  0 on IR n . o Remark 3. Suppose that P is given by (5.9) and satisfies (5.10). First of all if it has constant coefficients it satisfies (5.11) if and only if it is non-null. In effect if P does not satisfy (5.11), for all  E n LLajkjk = 0 and Ibjj = 0 j k j then P is null; (it suffices to apply the negation of(5.11) to cp(x) = + .x and cp(x) = e. x in using (5.10)). N ow let us consider P with variable coefficients. We suppose that its co- efficients are not all zero at Xo E Q, so that it satisfies (5.11) and hence the weak maximum principle in the neighbourhood of Xo. But the weak maximum principle on Q is not a local property. The operator 2 + d 2 d P = (1 - x) dx 2 - x dx satisfies the weak maximum principle in the neighbourhood of every point of IR but does not satisfy the weak maximum principle on any open interval contain- 
95. The Maximum Principle 261 ing [ -1, 1] since the function u(x) = - (Ixl - 1) + P is of class rc 2 for p > 2 and satisfies Pu  O. Finally we note that when P satisfies the weak maximum principle on every open set, relatively compact in Q, it satisfies it on Q. In particular, if P satisfies (5.11) on every open set, relatively compact in Q, it satisfies the weak maximum principle on Q. It is, in particular the case if P satisfies either (5.12) there exists  E IR n such that Laj(x)j =1= 0, Vx E Q, j or (5.13) there exists  E IR n such that LLajk(x)jk =1= 0, Vx E Q. j k In the first case, it follows by continuity that Laj(x)j has a constant sign, for j example positive; we take cp (x) = . x. In the second case, we deduce, again by continuity, that for every open set Qo, relatively compact in Q, there exists C( E IR such that C(2LLajk(x)jk + C( Laj(x)j > 0, Vx E Qo; j k j take cp(x) = ea(.x). o 3.2. The Comparison Principle We say that a l.d.o. P satisfies the comparison principle on Q if { Vu E rcm(Q) with Pu  0 on Q, u + E rc o(Q) => u  0 on Q. It is immediate that, when Q is bounded, if P satisfies the weak maximum principle on Q, then it satisfies the comparison principle on Q; the converse is true if the coefficient of order 0 is null on Q. We state the (5.14) Proposition 3. (a) Suppose that for every open set Qo relatively compact in Q, P satisfies the comparison principle on Qo, then P is of order m  2: 8 2 8 P = LLa jk a 8 + Laj + a o j k Xj X k j uXj satisfying (5.10). (b) Let P be given by (5.14) with its coefficients satisfying (5.10); suppose that (5.15) there exists cp E rc 2 (Q) such that inf cp > 0, and Pcp(x) < 0, Vx E Q. Then P satisfies the comparison principle on Q. 
262 Chapter V. Linear Differential Operators Proof. To prove (a), we show as in Proposition 2 that for u E m(Q) and Xo E Q u(xo) = max u => Pu(x o )  ao(xo)u(x o ); which, with the help of Theorem 1, completes the proof. To prove (b), suppose there exists u E 2(Q) with Pu  0 on Q, u+ E o(Q) and max u > O. Considering v = uj cp, we have v E  2(Q), V + E  o (Q), max v > 0 and Qv  0 on Q, where with 0 2 0 Q = LLb jk a a + Lb r : 1 + b o j k x j X k j uX j ocp b jk = <{Jajk, b j = <(Ja j + 2 La jk ax k ' b o = P<{J. Considering Xo E Q such that v(xo) = max v, we have from Theorem 1, Qv(Xo)  bo(xo)v(xo) < 0 from which a contradiction follows. o Remark 4. We can develop comments about localisation and globality for the comparison principle similar to those made above for the weak maximum principle. However, we shall content ourselves with studying the condition (5.15) in the case of constant coefficients: hence, let P given by (5.14) and satisfying (5.10) have constant coefficients a jk , a j on Q. We distinguish two cases: (1) Weakly parabolic case: that is to say when P satisfies there exists .E IR n such that LLajk jk = 0 and Lajj =1= 0 then whatever the value of a o E IR, P satisfies (5.15) and hence the comparison principle; it is enough to take cp(x) = eC.x with c E IR such that c Lajj < a o . In this case we have constructed a parabolic comparison theorem (see Prop- osition 1). (2) Weakly elliptic case: that is to say, P satisfies v E IR n , L La jk jk = 0 => La j j = O. j k j Considering the sub-space H' of IR n orthogonal to { E IR"; fajkjk = o}, and for all x E Q, the open set of H/:Qx = {x' E H'; x + x' E Q}, we are brought back to the (strongly) elliptic case: LLajkjk > 0, j k v E IR n \ {O}, which we now assume. 
95. The Maximum Principle 263 We consider the operator Q defined by Qv = e -lfJo P( e lfJO v), where CPo (x) = eo.x with 2 Lajk + a j = 0, j = 1,. . . , n k with the result that 0 2 Q = LLa jk + b o with b o = a o - LLajkJ . j k oxjox k j k We suppose that Q is bounded: then P satisfies the comparison principle on Q (resp (5.15)) iff the same is true for Q. To within an affine change of coordinates we can therefore reduce the problem to P = L1 + a o . We then consider A(Q) = inf {Llgrad ul 2 dx; U E '@(Q), Lu 2 dx = 1}. We know this is the first eigenvalue of -,1 on Q with the Dirichlet boundary conditions (see Chap. VIII). We can then conclude: ) If a o < (Q), then by continuity, there exists Q', an open neighbourhood of Q such that a o < A(Q'). Considering u E CCo(Q'), u =J= 0 such that - L1u = A(Q')U on Q', we can always suppose that u > 0 on Q'. Then the restriction cp of u to Q satisfies inf cp > 0 and sup Pcp < O. That is to say, P satisfies (5.15) on Q. (b) If a o  A(Q), considering u E CCo(Q), u =J= 0 such that - L1u = A(Q)U, we know that u has a constant sign on Q (see Chap. VIII); we can always suppose that u < 0 on Q. We then have Pu = (a o - A(Q))U  0 on Q. Hence P does not satisfy the comparison principle on Q. We leave to the reader the task of discussing the case when Q is unbounded: we remark only that when a o  0, P satisfies the comparison principle on Q since it satisfies the strong comparison theorem (see Remark 5). o 3.3. The Strong Maximum Principle We now study the strong maximum principle: Proposition 4. Suppose that P given by (5.9) satisfies ( 5.16) LLajk(X)jk > 0 Vx E Q, V E [Rn\{o} and that Q is connected. Then P satisfies the strong maximum principle on Q: { Vu E CC 2(Q) with Pu  0 on Q and u non-constant on Q u(x) < sup u, Vx E Q. Proof. Suppose that u E CC 2 (Q) is non-constant with Pu  0 and attains its maximum m on Q. Then Qo = {x E Q; u(x) < m} is an open set of Q and 
264 Chapter V. Linear Differential Operators aDo n D =1= 0. Let Xo E Do be such that dist(xo, aDo) = ro < dist(x o , aD) and Yo E aDo with Iyo - xol = ro; we have Yo E D, u(Yo) = m and the open annulus B = {!r o < Ix - xol < ro} is relatively compact in Q. We consider the function I 0 1 2 2 v(x) = u(x) - m + e(e- Cl x - x - e- Clr o ) where e > 0 and ex > 0 are determined in order that v  0 on aB and Pv;;: 0 on B that is to say, since v = u - m  0 on Ix - xol = ro, if e(e- ClrU4 - e- Clr 5 )  m - max u(x), ro Ix - xol = "2 and, since Pv ;;: ePe- Cllx - xOl 2 = 2exee- Cllx - x ol2 ( 2ex LLajk(Xj - xJ) (x k - x2) j k - L(ajj + aixj - xJ))) if r C ro cex _ 2 ;;: Co + 1 2 where c = mil!. LLajk(x)jk > 0 XEB j k II = 1 and Co = m:x ajj(x), C 1 = m:x ( fa j (x)2 Y/2. Ap:"'lying the weak maximum principle on B, with v  0 on B, i.e. in particular for all t E J!, 1[, 2 2 2 e(e- Clt ro - e- Clr o )  U(yO) - U(tyo + (1 - t)xo). Since u attains its maximum at Yo, Du(yo) = O. Dividing both sides of the above inequality by (1 - t) and letting t  1 we obtain 2 2exee - Clr o  0 which gives a contradiction. o Remark 5. Suppose that P given by (5.14) satisfies (5.16) and ao  0 on D. Then P satisfies the strong maximum principle on D: for all u E 2(D) with Pu ;;: 0 on Q, if u attains a positive maximum on D, then u is constant on D. In fact if u(xo) = max u > 0, then on Do = {x; u(x) ;;: O}, we have (P - ao)u ;;: 0 and u(x o ) = max u, which contradicts Proposition 4. We can, in fact, show by Do 
5. The Maximum Principle 265 taking up the proof of Proposition 4, that u cannot attain a non-negative maximum (see Gilbarg and Trudinger [1]). o We note the particular case which explains the importance of the Laplacian: Proposition 5. Let P be a l.d.o. on IR n invariant under isometries of IR n . Then the following assertions are equivalent: (i) P = cL1 with c > 0; (H) P is non-null and there exists an open set Q of IR n such that for all u E 8(Q) and all x E Q u(x) = max u => Pu(x)  0; (iH) for every bounded open set Q of IR n , P satisfies the weak maximum principle on Q. , (iv) for every connected open set Q of IR n , P satisfies the strong maximum principle on Q. Proof It is immediate that (iv) => (Hi) => (H). The implication (ii) => (i) follows from Theorem 1 and from the invariance under isometries (see Proposition 4 of 2). The implication (i) => (iv) follows from Proposition 4. o 3.4. The Principle of the Strong Parabolic Maximum We can, in the same way, characterise the heat operator a/at - L1 on IRn+l. We profit by establishing a principle of the strong parabolic maximum. Proposition 6. Suppose that P is a l.d.o. on 1Rt 1 invariant under isometries in the space variable and under translations in the time variable. Then the following assertions are equivalent: (i) P = coo/at - cL1 with Co > 0, c > 0; (H) P is not independent of a/ax and for every bounded open set Q of IR n and o < T < 1, P satisfies the principle of the weak parabolic maximum on Q = Q x ]0, T[, that is to say Vu E 8(Q) (\ rc(Q x [0, T[) with Pu  0 on Q, sup u = sup u; Q Op Q (Hi) for every connected open set Q of IR n and 0 < T < 00, P satisfies the principle of the strong parabolic maximum on Q = Q x ] 0, T [: Vu E 8(Q) (\ rc(Q x ]0, T]) with Pu  0 on Q and u non-constant on Q, u(x, T) < sup U, Q Vx E Q. 
266 Chapter V. Linear Differential Operators Proof It is immediate that (iii) => (ii). Now the principle of the weak parabolic maximum implies the weak (elliptic) maximum principle: from Proposition 2, when it satisfies (ii), P is of order less than or equal to two without a term of order zero. Using the invariances 0 2 a P = - cLi + k ot 2 + Co at . (see Sect. 1.6, 2). It is easy then to see that k = 0 and Co > 0: given a bounded open set Q in IRn, we consider Ao = A(Q) the first eigenvalue of - Li in Q with Dirichlet boundary conditions (see Remark 4) and Uo E o(Q) such that - Liu o = AoU o on Q. For cp E (J 0, T[) we have P( cp(t)uo(x)) = (kcp"(t) + coCP' (t) + AOCCP (t))u o (x); we can assume U o > 0 on Q, hence P(cp(t)uo(x))  0 on Q iff kcp" + coCP' + AoCCP  0 on JO, T[. The hypothesis (ii) thus implies { for all cp E (JO, T[) (\ ([O, TJ) kcp" + coCP' + AoCCP  0 on JO, T [ and cp(O) > 0 => cp(T)  cp(O), which is satisfied only when k = 0, Co > 0 and AoC ;;: O. From this we have the implication (ii) => (i). The implication (i) => (iii) is a simple consequence of the following lemma. Lemma 1. Let B =.{Ix - xol < r}, Q = B x JO, T[ with Xo E IRn, 0 < T  2  < 00; and let.u E 2,1(Q) (\ l,O(Q) with u ;;: 0 and au/at;;: Liu on Q then 2n (5.17) L (r 2 - Ix - Xo 12)u(x, 0) dx  E(,TT) u(xo, T) where (5.18) E(r, t) = (4nt)-n/2 e -r 2 /4t. Proof of Lemma 1. We can always assume that Xo = O. Let us consider p(x, t) = E(lxl, T - t) - E(r, T - t). We have p > 0 on Q,p = 0 and op/on < 0 on oB x JO, T[, op E(r, T - t) P E C(Q), at + LJp = (r 2 - 2n(T - t» 4(T _ tf > 0 on Q and p(., t) --+ b when t --+ T (we recall that Y(t)E(lxl, t) is an elementary solution of the heat operator (see 2, Sect. 4 and 3.6.2)). We then obtain, 
95. The Maximum Principle 267 u(O, T) - Ip(X, O)u(x, O)dx = LT dt I (u  + P : )dX  LT dt I (pAu - uAp)dx = LT dt IB(p : - u :: )d{)  O. Remarking that ( r 2 - l xI 2 ) E(r T) p(x,O) = E(r, T) e 4T - 1 ;;: (r 2 - Ix1 2 ) 4'T . we achieve the stated result. o Completion of the Proof of Proposition 6. Considering u E rc 2 , l(Q x ] 0, T[) n rc(Q x ] 0, T]) with au/at  Au, and applying Lemma 1 to sup u - u, we see that if u(xo, to) = sup u in (xo, to) E Q x ]0, T[ then u(x, t) = sup u for (x, t) E Q x ]0, T[ such that _ Ix - xol 2 o < t < to, B(xo, Ix - xoD c Q and (to - t)  2n . By connectedness, we deduce that if u (xo, T) = sup u at Xo E Q, then on Q x ]0, T[, u = supu. o Remark 6. Lemma 1 has a certain link with Harnack's formula. We note however that it is impossible to obtain for positive solutions of au/at = Au on Q a Harnack estimate max u  C min u K K where K being a compact subset of Q, C would depend only on Q and K. Indeed if this were true, by applying it to a neighbourhood Q of K = B(0,1) x {1}, and to the functions u(x, t) = E(lx + Xo I, t), there would exist a constant C such that e1xo + xI2 -Ixo + x'1 2  C V x E IRn Vx x' E B ( O 1 ) -....;;::;, 0 " , which is obviously false. We remark, likewise, that if the principle of the weak parabolic maximum implies the weak maximum principle, the principle of the strong parabolic maximum does not imply the strong maximum principle, which is not satisfied by the heat operator. Finally we observe that we can establish a strong maximum principle for strongly parabolic operators with variable coefficients see Lions-Magenes [1], vol. 2, p. 91). 
268 Chapter V. Linear Differential Operators Review of Chapter V This chapter has introduced many fundamental concepts and methods: the concepts of characteristic and of bicharacteristics, the theorems of Cauchy-Kowalewsky and of Holmgren, the elementary solutions (of which examples and applications are gIven, notably to the Cauchy problem), local regularity properties: infinitely differentiable regularity (hypo-elliptic operators): analytic regularity (elliptic operators), the maximum principle in its different forms. All of these properties are used frequently throughout this work. Apart, perhaps, from the general notion of a hypo-elliptic operator, all the other concepts currently enter into very many applications. We have restricted ourselves to the classical theory of distributions; all the spaces considered are sub-spaces of spaces of distributions. It might be important to go outside of this theory and make use of ultra-distributions, or of spaces of analytic functionals. We refer the reader to Hormander [3], vol. 1, Chap. 9. 
Chapter VI. Operators in Banach Spaces and in Hilbert Spaces Introduction In the preceding chapters we have been led to consider a great number of functional spaces: m(Q), (Q), '(Q) constructed on an open set Q of IRn and which are locally convex topological vector spaces whose topology is not defined by a norm or a semi-norm, and Banach and Hilbert spaces, like the spaces LP(Q) and the Sobolev spaces Hm(Q), whose topology is defined by a norm. The interest of the normed spaces (and very particularly of the Banach and Hilbert spaces) is very great for applications particularly because of the large number of physical quantities which need to be measured globally and it is interesting to have a physical model which shows it clearly. It is for this reason, for example, that the space H 1 (Q) is sometimes called the energy space, since it serves to model, among others, problems on systems with finite energy. We have therefore gathered together in this Chap. VI a group of results concerning complete normed spaces, as well as the theory of linear operators in these spaces, which we shall use throughout the rest of the work 1. We have given some proofs to help the reader, but many of the results are given without proof, with a reference to the bibliography (which is extensive) on the subject. The detailed proofs are those which, from the methodological point of view, are the most important for numerical analysis.  1 gives the essential results of the fundamentals of functional analysis. 2 studies the theory of bounded, or closed, linear operators in Banach spaces. 3 gives a succinct account of the Vrincipaleslts concerning the Hilbert theory of linear operators. 1 Furthermore, certain results or concepts of this chapter have already been used previously! This chapter is specially directed to users who wish to review rapidly the essential ideas on topological vector spaces. 
270 Chapter VI. Operators in Banach and Hilbert Spaces  1. Review of Functional Analysis: Banach and Hilbert Spaces 1. Locally Convex Topological Vector Spaces. Normed Spaces and Banach Spaces Let E be a vector space over the field IK (IK =  or C). Definition 1. E is a topological vector space, if E is furnished with a topology compatible with the structure of the vector space, i.e. such that the mappings { Ex E --+ E: (x, y) --+ X + Y IK x E --+ E: (A, x) --+ AX (1.1) are con tin uo us. We shall use the notation: + = ]0, + 00[, + = [0, + 00[; then Definition 2. A map x --+ f(x), E --+ + is called a semi-norm if it satisfies the following properties: (1) fis sub-additive, i.e. \/x, y E E,f(x + y)  f(x) + f(y); (1.2) (2) f is positively homogeneous of degree 1, i.e. \/x E E, \/ A E IK, f(AX) = IAlf(x). We deduce from (1.2) (2) thatf(O) = O. Definition 3. A semi-norm is called a norm if (1.3) f(x) = 0 => x = O. Example 1. Let Q be an open sub-set of n and K c Q compact. If U E O(Q) (the space of functions continuous on Q), jK defined by (1.4) jK(U) = sup lu(x)1 XEK is a semi-norm on O(Q). D Example 2. Let x = (Xl' . . . , x n ) E IKn; for p  1 (1.5) x -4 IIxll p = Ctl Ixkl P YIP is a norm, such that (1.6) lim Ilxll p = IIxll oo = max Ixkl. poo k D 
 1. Review of Functional Analysis: Banach and Hilbert Spaces 271 Example 3. If LP(Q) (Q an open set of [Rn) denotes the space of the (classes of) functions with measurable complex values in Q and such that f Q Iv(xW dx < + 00 (p :;?; 1), then ( 1. 7) V -4 (f Q Iv(x)IP dx YIP is a norm on LP(Q). D Definition 6. We say that E is a locally convex topological vector space if E is (1) We say that "p is stronger than q" (p >- q) if there exists a constant C > 0 such that (1.8) q(x)  Cp(x) \/x E E. (2) p and q are said to be equivalent if at the same time p >- q and q >- p. Example 4. In [Kn (and, more generally, in every finite-dimensional vector space) all the norms are equivalent. Definition 5. Iff is a semi-norm (resp. a norm), we call: an open semi-ball (resp. ball) of centre Xo and radius p associated with f, the set ( 1.9) def Bf(xO,p)=={XEE;f(x - xo) < p}; closed semi-ball (resp. ball) of centre Xo and radius p associated with f, the set (1.10) Bf(xo, p) = {x E E;f(x - xo)  p}. We denote by Bf the open unit semi-ball (resp. ball), i.e. Bf = {x E E,f(x) < I}. Then the open unit semi-ball (resp. ball) of centre Xo is the set Xo + Bf. Definition 6. We say that E is a locally convex topological vector space if E is endowed with a topology which possesses the following property: every neighbourhood of Xo contains the set Xo + BB f relative to a semi-norm f, continuous on E and for suitably chosen B > O. The space E is then said to be separated if the set: (1.11) {x E E; infp(x) = O} = {O}, P the inf being taken over all the continuous semi-norms on E. The simplest example of a separated locally convex topological vector space is then given by: 
272 Chapter VI. Operators in Banach and Hilbert Spaces Definition 7. Suppose that p is a norm on a vector space E, then the pair (E, p) is called a normed vector space. We generally denote the norm p by 11.11 E or 11.11. The distance between two elements x, y of a normed vector space E is then defined by (1.12) def d(x, y) = IIx - yll and we note that this distance is invariant under translation d (x + y, z + y) = d (x, z) for all x, y, Z E E (we verify indeed that the axioms of distance are satisfied: (i) d(x, y) = 0 implies x = y (ii) d(x, y) = d(y, x) (iii) d(x, z)  d(x, y) + d(y, z)). Thus a normed space is a metric space, and the open balls, centred at the origin, form a basis of neighbourhoods of the origin. In a metric space (E, d), a sequence (xn)nEN converges to x when n  00 if lim d(x n , x) = o. n 00 In a normed space (E, II . liE)' the convergence relative to the distance defined by the norm will be described as strong convergence in E. A Cauchy sequence in a metric space is a sequence {xn}nEN which satisfies (1.13) { V8 > 0, 3N(8) E N d(x n , x m ) < 8. A metric space is said to be complete, if and only if, every Cauchy sequence is convergent. A complete normed space is called a Banach space. A complete metrizable 2 locally convex topological vector space is called a Frechet space. such that (m, n) E N x N, m, n > N e , implies Example 5. Suppose that KeIRn is compact. The space rcO(K) endowed with the norm lIuli oo = suplu(x)1 is a Banach space. Example 6. It is the same for the space rck(K), k E N*, endowed with the norm (1.14) k Iluli k = L IIDjuli oo . iii =0 . ail + j2 +. . . + jn DJ = .. . . ax{l ax2 . . . axn 2 A topological space is said to be metrizable if its topology can be defined by a distance (or metric). 
91. Review of Functional Analysis: Banach and Hilbert Spaces 273 Example 7. Let Q be an open set of n. Then O(Q) and k(Q), k E N* are not Banach spaces. However, if {Kn}nEN is a sequence of compact sets of Q such that (1.15) 00 Q = U Kn, n=O we can define the sequence of the semi-norms Pn such that ( 1.16) Pn(u) = sup lu(x)l. XEK n Then, endowed with the distance (1.1 7) 00 Pn(u - v) d( u, v) = L an 1 (_) , n = 0 + Pn U v where {an}nEN is a sequence of positive terms such that L an is convergent, the space O(Q) is a Frechet space. In a similar way, we can define on k(Q) a Frechet space structure by defining the sequence of semi-norms {P}nEN: ( 1.18) k p(u) = L Pn(Dju) j=O where Pn is defined by (1.16), and the associated distance by a relation similar to (1.17). D Example 8. Let Q be an open set in n. The spaces g(Q), (Q), (Q) which are respectively: the space of continuous functions with compact support in Q, the space of functions of class k with compact support in Q, the space of functions of class oo with compact support in Q are neither Banach spaces nor Frechet spaces, but inductive limits of Frechet spaces (spaces IE ff) for which we refer the interested reader to Schwartz [1], Horvath [1] or Treves [2]. D Example 9. Q being an open set in n, LP(Q), 1  P < + 00 endowed with the norm: IIflle(Q) = (Llf(X)lPdxY1P, and Loo(Q), given the norm IIflloo = sup. ess If(x)l, are Banach spaces. XEQ Likewise, Sobolev spaces constructed from LP(Q) are Banach spaces. D Definition 8. Let F be a subset of a topological vector space E; we say that F is dense in E if every neighbourhood of a point of E contains a point of F. 
274 Chapter VI. Operators in Banach and Hilbert Spaces If E is a Banach space, this can be translated by (1.19) \:Iu E E, \:18 > 0, 3v E F such that Ilu - v II < 8. In an equivalent way to (1.19), we see that F is dense in E if for all u E E there exists a sequence {un} nE r\J Un E F for all n, such that Un  u in E. Example 10. Let Q be an open set in [Rn; .f0(Q) is dense in LP(Q), 1  p < + 00, but .f0(Q) is not dense in L 00 (Q). D 2. Linear Operators Let E and F be two topological vector spaces; A is a linear operator from E into F if (1.20) { A(x + y) == A(x) + A(y) for all A(2x) == 2A(x) for all x, Y E E, 2 E IK, x E E; A is a continuous operator if (1.21) x  y in the topology of E implies: (1.22) A(x)  A(y) in the topology of F. We denote by 2 (E, F) the vector space of continuous linear mappings of E into F. If E == F, we write 2(E) def 2(E, E). When E and Fare normed spaces, the idea of a continuous linear operator coincides with that of a bounded operator, i.e. an operator such that the image of a bounded set of E is a bounded set of F. We cite (without proof) the Proposition 1. Let E and F be two normed spaces. Then A E 2(E, F), if and only if there exists a constant M > 0 such that (1.23) IIA(x)IIF  M IlxilE for all x E E.  "Under the conditions of Proposition 1, we can then put (1.24 ) IIAII = sup IIAxllF = XEE Ilxil E X =1= 0 sup IIAxII F . 3 Ilxil E  1 We verify that the mapping A  IIA II defined by (1.24) is a norm on 2(E, F). A natural question is: When is 2(E, F) a Banach space? We have the 3 We write, indiscriminately A(x) or Ax. We shall keep this last notation for circumstances in which there will be no risk of confusion. 
 1. Review of Functional Analysis: Banach and Hilbert Spaces 275 Proposition 2. Suppose that E is a normed space and F a Banach space. Then 2(E, F) is a Banach space. Proof Let {An} be a Cauchy sequence in 2(E, F) endowed with the norm defined by (1.24). By definition { \:IB > 0, 3N t E N such that m, n > Nt implies (1.25) IIAn - Amll < B. It follows then from (1.25) that: (1.26) n, m > Nt implies IIAn(x) - Am(x)IIF  BllxlIE. Hence {An(X)}nEN is a Cauchy sequence in the Banach space F, and so there exists W x E F such that lim An x == W x . n -+ 00 We can verify easily that the mapping x  W x is linear and we put W x == A (x). Then (1.26) implies: (1.27) lim IIAn(x) - Am(x)IIF == IIAn(x) - A(x)IIF  c;llxIIE' m-++oo and for n sufficiently large (1.28) IIA(x)IIF  IIAn(x) - A(x)IIF + IIAn(x)11  Bllxli E + IIAn II IIxII E . But, since {An}nE N is a Cauchy sequence in 2(E, F), n  IIAn II is bounded, i.e. there exists a constant C > 0 such that II An II < C < + 00. From (1.28) we deduce IIA(x)IIF  C 1 11x11 E , (C 1 constant> 0), \:Ix E E. Thus A E 2(E, F). Likewise, (1.27) implies { c; > 0, 3N t such that n > Nt => IIAn - All < B, which signifies that An  A in 2 (E, F); hence Proposition 2 follows. D We shall now prove the following important theorem, known as the Principle of Uniform Boundedness Theorem 1 (The Banach-Steinhaus Theorem). Suppose that E is a Banach space and F a normed space. Let {Ai}ieI be afamily (not necessarily denumerable) of continuous linear operators from E into F. If (1.29) sup IIAi(X)IIF < + 00 for each x E E, ieI 
276 Chapter VI. Operators in Banach and Hilbert Spaces then (1.30) sup IIAi II < + 00. iel Remark 1. The proof of Theorem 1 rests on an important property of complete metric spaces - known as Baire's property - which we shall first of all, recall and prove. D Lemma 1 (Baire's lemma). Suppose that (X, d) is a complete metric space and that X n' n  1 is a sequence of closed sets in X. We suppose that 4 o X n == 0 00 for each n  1. Then the interior of U X n is empty: n=l ( 0 xn ) O = 0. n=l Proof of Lemma 1. Let On == ex n which is then a dense open set. We shall 00 prove that 0 == n On is dense in X which is equivalent to the conclusion of the n=l lemma. Let Q be a non-empty open set of X; we shall show that Q nO=/:. 0. For this, we choose Xo E Q and ro > 0 such that 5 : B(xo, ro) c Q We next choose Xl E B(xo, ro) n 0 1 and r 1 > 0 such that ! B(Xl' r 1 ) c B(xo, ro) n 0 1 , ro o < r 1 < 2' which is possible since 0 1 is a dense open set. Step by step, we construct, by recurrence, two sequences {x n } and {r n} such that ! B(xn+ 1, rn+ 1) C B(x n , r n ) n On+ 1, n  0, r n o < rn+ 1 < 2. 4 Y denotes the interior of Y c X. 5 For all x E X, and r > 0, we denote, in the whole of this section, by B(x, r) (resp. B(x, r)) the open (resp. closed) ball B(x, r) = {x' E X, d(X', x) < r} (resp. B(x, r) = {x' E X; d(X', x)  r}). 
1. Review of Functional Analysis: Banach and Hilbert Spaces 277 It follows that {xn} is a Cauchy sequence in X which is complete. Hence X n  x. As X n + p E B(xn, r n ) for all n  0 and all p  0, a passage to the limit as p  + 00, shows that X E B(xn, r n ), \:In  o. In particular x E Q n 0, from which the lemma follows. D Remark 2. In numerous applications (of which the proof of Theorem 1 is one) Baire's property is used in the following form. Let X be a non-empty complete 00 metric space and let {X n}n  1 be a sequence of closed sets such that U X n == X. o n=l Then there exists no such that X no i= 0. D Proof of Theorem 1. For all n  1, let us put X n == {x E E; \:Ii E I IIAixliF  n}; as for all i E I, Ai E 2 (E, F), X n is closed. From (1.29), we have 00 U X n = E. n=l o It follows then from Lemma 1 (Remark 2) that Xno i= 0 for a certain no  1. Then let Xo E E and r > 0 be such that B(xo, r) c Xno. We have IIAi(xo + rz)IIF  no, \:Ii E I, \:Iz E B(O, 1). We deduce that 1 2no IIA i "  -(no + IIAix o II F)  - r r from which (1.30) follows. D An immediate corollary of the Banach-Steinhaus theorem is then Corollary 1. Suppose that E and F are both Banach spaces, and that {An}n 1 is a sequence of operators with An E 2(E, F )for each n, such that for all x E E, An(x) converges, when n  00, to a limit, denoted by A(x). Then, we have: (i) (ii) (iii) sup IIAnl1 < + 00, n A E 2(E, F) II A II  lim II A nil. Proof (i) follows immediately from Theorem 1. (ii) There exists, therefore, a constant C > 0 such that IIAn(x)IIF  Cllxll E , \:In, \:Ix E E. 
278 Chapter VI. Operators in Banach and Hilbert Spaces From this we deduce IIA(x)IIF  Cllxll E , \:Ix E E. As, in addition, A is obviously linear we obtain (ii). (iii) follows, finally, from IIAn(x)IIF  IIAnll.llxII E , \:Ix E E. D Theorem 2 (The Open Mapping Theorem). Suppose that both E and Fare Banach spaces. If u E 2 (E, F) is surjective, then u is an open mapping (that is u sends open sets of E onto open sets of F). Proof (i) It is enough to show that there exists a constant c > 0 such that (1.31) u(BE(O, 1)) ::> BF(O, c). Indeed, let Q be an open set of E and let us show that u(Q) is an open set of F if (1.31) holds. Let Yo E u(Q) with the result that Yo == u(xo) with Xo E Q and let r > 0 be such that BE (xo, r) c Q (i.e. Xo + BE (0, r) c Q). We then have Yo + u(BE(O, r)) c u(Q). Now, from (1.31) we have u(BE(O, r)) ::> B(O, rc), and as a consequence B(yo, rc) c u(Q); Q being an open set in E is the union of open balls; it is thus the same for u(Q) which is thus an open set in F. (ii) We prove now that (1.31) holds under the hypotheses of the Theorem, in two stages. First stage. Let u be a linear operator and surjective from E onto F. Then there exists c > 0 such that (1.32) u(BE(O, 1)) ::> B(O, 2c). 00 Proof Let X n == nu(BE(O, 1)); as u is surjective, we have U X n == F and from o n=l Baire's property there exists no such that X no i= 0; it follows that o u(BE(O, 1)) i= 0. Let c > 0, Yo E F be such that B(yo, 4c) C u(BE(O, 1)); 
 1. Review of Functiona] Analysis: Banach and Hilbert Spaces 279 In particular Yo E u(BE(O, 1)) and, by symmetry, we have also - Yo E u(BE(O, 1)), from which, by addition B(0,4c) C u(BE(O, 1)) + u(BE(O, 1)). Since u(BE(O, 1)) is convex, we have u(BE(O, 1)) + u(BE(O, 1)) == 2u(B E (0, 1)), from which we deduce (1.32). D Second stage Let u E 2 (E, F) satisfying (1.32). Then we have (1.31)' u(BE(O, 1)) :::> B(O, c). Proof Fix Y E F with IlyliF < c. We seek x E E such that IlxilE < 1 and u(x) == y. From (1.32), we know that \:Ie; > 0, 3z E E with 1 IlzIIE <:2 and Ily - u(z)IIF < e;; choosing e; == tc, we obtain Z1 E E with 1 II z 111E < :2' c II y - u(z1)IIF < :2. In the same way, by considering y - U(Z1) in place of y and with 8 == ic, we obtain Z2 E E such that 1 II z 211E < 4' c Ily - U(Z1) - u(z2)IIF < 4. By iteration of this procedure, we obtain a sequence {zn} nE N* with 1 IlznllE < 2 n ' c II y - U(Z1 + . . . + zn)IIF < 2 n ' \:In  1. If we write X n == Z 1 + . . . + Zn, it follows that {x n } nE N* is a Cauchy sequence in E which is complete. We thus have X n  x when n  00, and x satisfies Ilxil E < 1 and y == u(x), since u is continuous, proving the theorem. D We deduce immediately from Theorem 2 the Corollary 2. Let E and F be two Banach spaces. Suppose that u E 2(E, F) and in addition that u is a bijection from E onto F. Then u- 1 E 2(F, E). 
280 Chapter VI. Operators in Banach and Hilbert Spaces Proof From (1.31) we see that for all x E E such that lIu(x)IIF < c, we have IIxliE < 1. By homogeneity, IlxllE  (l/c) lIu(x)IIF' \:Ix E E, so u- l is continuous. D Remark 3. Let E be a vector space furnished with two norms II 111, II 112; we suppose that each of the normed spaces (E, II 111 ), (E, II 112) is a Banach space and that there exists constant c > 0 such that IIxll2  cllxll l , \:Ix E E. Then the two norms II 111 and II 112 are equivalent (i.e. there exists c' > 0, with II x 111  C' II X 112, \:I x E E). Indeed, this follows immediately from Corollary 2 with E == (E, II 111)' F == (E, II 112) and for u the identity. D Theorem 3 (The closed graph theotem). Suppose that E and F are two Banach spaces, and that u is a linear mapping of E into F. Then u E 2(E, F), if and only if, its graph (1.33) G == {(x, Y) E E x F; y == u(x)} is closed in the product space E x F. Proof (i) If u is continuous, it is obvious that G is closed. (ii) Let us show that G closed entails u E 2(E, F). Let us consider the two norms Ilxlll == IIxllE + lIu(x)IIF IIxl12 == IlxiI E . on E. As G is closed, E endowed with the norm II 111 is a Banach space. In addition, IIxll 2  IIxll l , so by using Remark 3, we deduce that these two norms are equivalent: hence there exists a constant c > 0, such that Ilxlll  cllxll 2 . Therefore lIu(x)IIF  c'llxIl E . D The closed graph theorem remains valid for topological vector spaces more general than the Banach spaces. See Schwartz [2J, Martineau [lJ. D 
\  1. Review of Functional Analysis: Banach and Hilbert Spaces 281 3. Duality We take E to be a topological space over the field tK (tK == lR or C). Definition 9. (i) A linear form on E is a linear mapping of E into tK (ii) An antilinear form on E is an antilinear (or semi-linear) mapping of E into C (f antilinear or semi-linear means: f(x + y) == f(x) + f(y) and f(2x) == 2f(x), \Ix, y E E, 2 E C) (iii) We call 2(E, tK), the space of continuous linear (resp. antilinear) forms on E, the dual (resp. antidual) of E and denote it by E'. If fEE', the value off on the vector u will be denoted by (1.34) f(u) == < f, u), where < . , . ) denotes (according to the case) the bracket in the duality (resp. antiduality) between E' and E. If E is a normed space with norm denoted by II II, then the dual norm II 11* is defined by (1.35) 1< f, x)1 11111* = f Ilxll . x =1= 0 As tK is complete, we deduce from Proposition 2: (1.36) E' endowed with the norm II. II * is a Banach space. We note that we have ( 1. 3 7) I<f, u)1  Ilfll*.llull, \luEE, fEE', which is a kind of Cauchy-Schwarz inequality. Example 11. Let Q be an open of lR n , E == LP(Q), 1  p < + 00. Then E' == L q (Q) with p - 1 + q - 1 == 1. D Example 12. See in 5 of Chap. IV, the duality of Sobolev spaces. o Example 13. '(Q) is the dual of (Q) but is not a Banach space. (See Appendix: "Distributions".) o Example 14. In the same way g'(lRn) is the dual of the Frechet space g(lR n ) but is not a Banach space. D 
282 Chapter VI. Operators in Banach and Hilbert Spaces Example 15. The space lP, 1  p < + 00 of sequences x == {X n }nEN*5a of complex numbers, endowed with the norm II x II P == ( L IX n I P ) l/ P , nEr\J* is a Banach space. Its dual is the space lq with p-l + q-l 1, 1  p < 00. D 4. The Hahn-Banach Theorem and its Applications The most important property of locally convex topological vector spaces is expressed by the Hahn-Banach theorem. Theorem 4 (Analytic form of the Hahn-Banach theorem). Let: E be a vector space, M a vector sub-space of E, p a semi-norm on E, f a linear form on M. , we suppose that (1.38) I < J: x> I  p (x), \:Ix E M. -- Then there exists a linear form f defined on E such that (1.39) { l(x) == f(x) 11(x)1  p(x) \:Ix E M, \:Ix E E. -- We say that f is an extension of f to E. Proof Consult any text on functional analysis for a proof; see the bibli- ography, e.g. Dunford-Schwartz [1J, Treves [2J etc. . . . D The applications of the Hahn-Banach theorem are very varied and numerous. We shall indicate briefly three important types of problems in which this theorem is applied: (1) problems of approximation, (2) problems of existence, (3) problems of separation. '" 4.1. Problems of Approximation We shall consider, for example, a normed space E, but it is possible to consider a locally convex topological vector space whose topology is defined by a family of semI-norms. Sa In the course of the text, we sometimes also take the set N in place of N* as the index set for the space IP. 
91. Review of Functional Analysis: Banach and Hilbert Spaces 283 Let M be a closed vector sub-space of E and M 0 a vector sub-space of M. Suppose that we wish to prove that every element of M is the limit of elements belonging to Mo. (The reader can imagine that E is a space of functions, M a closed sub-space of E formed by the solutions of a functional equation, and M 0 a known special class of solutions of this equation). In other words, suppose that we should wish to show that M 0 is dense in M. To this end, we can use the Corollary 3. Let E be a locally convex vector space 6 , M a closed vector subspace of E, and M 0 a subspace of M. Then M 0 is dense in M iff every continuous linear form on E which vanishes on M 0 vanishes on M. Proof See Schwartz [1J. D We can similarly deduce from the Hahn-Banach theorem the following corollaries. Corollary 4. Let E be a normed vector space, M a vector sub-space of E and m' EM'. Then there exists x' E E' such that ( 1. 40) { <x', x) == < m', x), IIx'lI* == Ilm'II*. \:Ix E M Corollary 5. x' E E' with Let E be a normed vector space, x E E, x i= O. Then there exists (1.41) { II x' II * == 1 < x', x) == II x II . Corollary 6. Let E be a normed vector space, M a closed vector sub-space of E, Xo E E, Xo i M. Then there exists x' E E' such that (1.42) { <x', x) == 0 \:Ix E M < x', xo) i= 0 ( == 1 for example). We recall now that a topological space is said to be separable if it contains a dense denumerable set. Then Corollary 7. Let E be a normed space. If E' is separable, then E is also separable 7 . 4.2. Problems of Existence Let E, F be two locally convex spaces (e.g. spaces of differentiable functions) and E', F' their respective duals (e.g. spaces of distributions). Suppose also that u is a linear mapping from E to F (e.g. u == P(D)), a linear differential operator. 6 For example, a normed space or a space of distributions. 7 See Y osida [IJ, p. 126. 
284 Chapter VI. Operators in Banach and Hilbert Spaces Then if we are given y' E F', it is always possible to define y' 0 u E E'. Indeed the mapping y'  y' 0 u is linear from F'  E'. We thus have Definition to. The mapping of F'  E' defined by y'  y' 0 u is called the transpose of u and is denoted by t u . The application of the Hahn-Banach theorem will enable us to prove the possibility of the existence of a solution y' E F' of the equation (C) tu(y') == y'o, when y is given in E', (e.g., if u == P(D), a differential operator, and y == b, then y' is an elementary solution of P(D)). In such a situation, it is possible to make use of the following consequence of the Hahn-Banach theorem. Corollary 8. Let E, F, t u , y be given as above with the condition: y(x) == 0 for all x E E satisfying u(x) == O. Let 1m u be the image of u in F, and suppose that the linear form (defined on 1m u): u(x)  y(x) == <y, x) is continuous when 1m u is endowed with the topology induced by F. Then there exists y' E F' such that tu(y') == y. Remark 4. Generally, the solution y' of the Corollary 8 is not unique. Indeed, if z' is a solution of the homogeneous equation (C 1) tu(z') == 0 (i.e. z' E Ker t u == N eu) == kernel of t u ), it is obvious that y' + z' is again a solution of (C). From the Hahn-Banach theorem, it follows that the equation (C 1) has solutions, different from zero, if and only if 1m u is not dense in F. o We have the Corollary 9. Let E, F, u, t u be defined as above. The following two conditions are equivalent: t.. (i) 1m u is dense in F; (ii) t u is injective. Corollary 9 is often used to show that certain spaces are spaces of distributions. Example 16. (i) (Q), G LP(Q), 1  p < + 00, where Q is an open set of IRn, i denotes a i continuous injection, and i((Q)) is dense in LP(Q). Then the transpose ti of i is an injection. We have LP' (Q) G ' (Q), q 1 1 -+-==1. P p' 
9 1 . Review of Functional Analysis: Banach and Hilbert Spaces 285 (ii) (n) G g(n), i(()) is dense in g(n)); then i g' (n) G ' (n). t. l (iii) In a similar way, with Q open in n, (Q) G H5(Q) i (Sobolev space, see Chap. IV) i((Q)) is dense in H5(Q); then H-l(Q) G '(Q) where H-l(Q) = (H5(Q))'. ti We note also that Corollary 4 has been of use to us in Chap. IV to give us the structure of an element of (H5(Q))' = H- l (Q) with the help of HO l (Q) r7 (L 2 (Q)) n + 1 d fi d b ( ) -; e ne y u -4 u, ::/ . . . , ::n . o 4.3. Problems of Separation of Convex Sets We recall that a set K of a vector space is said to be convex if (1.43) x, y E K implies (1 - t)x + ty E K, Vt E [0, 1]. It is easy to prove that the closure of a convex set is again a convex set. We then have the Corollary 10. Let E be a normed pace on , M a closed convex set in E, x E E, x if: M. Then it is possible to find TEE', T#-O and rx E  such that (1.44) T(x) < rx  T(y) for all y E M. We shall give an application of this corollary in 95.3 below, (Corollary 11). 5. Bidual, Reflexivity , Weak Convergence, Weak Compactness 5.1. Bidual The dual of a normed space E being a Banach space E', E', will, in turn, have a dual which is a Banach space, called the bidual of E, and is denoted by E". In general it is not possible to identify E" with E. We shall now study the relation between E and E"; we denote by II . II * * the norm in E". We have the 
286 Chapter VI. Operators in Banach and Hilbert Spaces Proposition 3. Let E be a normed space. There exists J E fLJ(E, E") such that (i) J is injective, (ii) II J(x) II ** = IlxilE for all x E E. Proof Let x E E. For all x' E E' we have I<x', x)1  Ilx' II * .llxIlE' The mapping J(x):x' -+ J(x)(x') = <x', x) is a continuous linear form on E', so J(x) E E", Vx E E. On the one hand, we have I<x', x)1 IIJ(x)II** = sup II '11 :::; IIxllEo x' E E' X * Thus But it is possible to find (Corollary 5 of the Hahn-Banach theorem) y' E E' with { < y;, x>_ = IlxilE lIyll* - 1. I J(x)(y') I I J(x) (y')1 sup  y'EE' Ily'lI* lIy'lI* II J(x) II ** = Ilxil E . It follows that (1.45) II J(x) II ** Ilxil E . It is easy to see that x -+ J(x) is a linear mapping; (1.45) shows that it is continuous and injective. o 5.2. Reflexivity We see from Proposition 3, that it is possible to identify E with the vector subspace J(E) c E". We then have Definition 11. We say that a normed space E is reflexive if we have ( 1.46) J(E) = E". From this definition there follows Proposition 4. If E is a reflexive Banach space, then its dual E' is a reflexive Banach space. We also note the following proposition whose proof we omit (see Brezis [1]). Proposition 5. Let E be a reflexive Banach space, M a closed subspace of E. Then M is reflexive. Example 17. (1) Let Q be an open set of n. For p satisfying 1 < p < + 00, LP(Q) is a reflexi ve Banach space. (2) The dual of L 1 (Q) is L 00 (Q), but L 1 (Q) is not reflexive. 
91. Review of Functional Analysis: Banach and Hilbert Spaces 287 (3) The Sobolev spaces Wm,P(Q), 1 < p < + 00, mEN are reflexive Banach spaces; again with 1 < p < + 00, mEN, W,P(Q) is a closed sub-space of Wm,P(Q) which, by Proposition 5 above is therefore reflexive. D 5.3. Weak Convergence Let E be a topological vector space, E' its dual (or antidual). Definition 12. We say that a sequence {xn} nE 1\P X n E E, Vn, converges weakly in E to x if we have (1. 4 7) lim <x', x n ) - <x', x) for all x' E E'. n-+oo We denote this variously by w x n  x or X n ----+' x or x n -+ X weakly. Remark 5. It follows from the Hahn-Banach theorem that if the weak limit of a sequence exists it is unique. D Remark 6. In a normed space (E, II liE)' we call strong convergence the convergence in the sense of the norm, i.e. Ilxn - xllE -+ 0 as n -+ 00, which we denote by X n -+ X (without qualification). Then if X n -+ x, we have IIxn II -+ Ilxll (which follows from the inequality Illxnll E - IlxliEI  Ilxn - xIIE)' D Remark 7. If X n -+ X in (E, II liE)' then X n -+ X weakly in E. (We have in fact ( 1.48) I<x', x n ) - <x', x)1  Ilx'I*"x n - xII E ). D Remark 8. If E is finite-dimensional the concepts of strong convergence and of weak convergence, defined above, coincide. D Remark 9. If E is not finite-dimensional, the converse of the properties contained in Remarks 6 and 7 above is false. Let us verify that by an example: Let Q = ]0,1[; E = L 2 (Q) = E' (see Sect. 6 below), x n : t -+ j2sin(nnt), n E N*; (1.49) IIxnli E = (I 2Sin 2 (nnt)dtY / 2 = 1 for all n E N*. 
288 Chapter VI. Operators in Banach and Hilbert Spaces In addition, for all y E L 2(Q) = E', we have: (1.50) <y, xn> = J2 f y(t)sin(nnt)dt -. 0 when n -. 00, [indeed, if y E  (Q), (1.50) holds, for after an integration by parts constant I<y, xn>1   0 when n  00, n and we deduce (1.50) for Y E L 2 (Q) because of the density of (Q) in L 2 (Q)]. Thus, X n  x = 0, weakly in L 2(Q); however IIxnli E  1 =1= 0 = IlxllE and {x n } does not converge strongly to o. o We note the following very important property of weak convergence. Proposition 6. Let E be a Banach space. Every sequence {xn}nEN weakly convergent to x in E is bounded. In addition, we have: (1.51) IIxllE  lim inf IIxnlI E . n-+oo Proof (i) We define An E 2(E', !K) = E" by An = J (xn) where J is defined in Proposition 3, and we note that II An II = II J(xn)11 ** = Ilxn liE' from (1.45). From the hypothesis, IIAn(x')1I is bounded for each x' E E'; then, from Theorem 1, we obtain: 3C > 0, IlxnllE < C < + 00, \In. (ii) We see from Corollary 5, that there exists x' E E' such that <x', x> = Ilxil E and Ilx' II * = 1. By hypothesis, we have <x', x n >  <x', x> = IIxII E , and I<x', xn>1  Ilx n II; so lim <x', x n > = IIxllE  lim inf Ilx n II, n-+oo from which the result follows. o We shall now prove the very important: Corollary 11. Let E be a normed space on , K a convex subset of E. Then, if K is closed in the strong topology, it is also weakly closed and conversely. 
91. Review of Functional Analysis: Banach and Hilbert Spaces 289 Proof (i) K strongly closed => K weakly closed. Let {x n }, (xn E K, Vn), be a sequence which converges weakly to x. We wish to prove that x E K. Suppose that x if: K; then (Corollary 10) there exists a couple (T, ex) with TEE' ex E  , T (y)  ex for all y E K T(x) < ex. In particular T(xn)  ex, Vn and T(x) < ex, so it IS impossible to have lim T(xn) == T(x) - hence a contradiction. n-+oo (ii) K weakly closed => K strongly closed. This follows immediately from the fact that strong convergence implies weak convergence. D 5.4. Weak Compactness We shall state, without proof, the following important result concernIng reflexive Banach spaces. Theorem 5. Let E be a reflexive Banach space, {xn}nEN a bounded sequence in E. Then it is possible to extract from {xn} n a subsequence which converges weakly in E. For proof, see Brezis [1], Theorem III, 16. Remark 10. Theorem 5 expresses a result on weak compactness. It is some- times stated in the following form: "The unit ball in a reflexive Banach space is weakly sequentially compact." We note that the result of Theorem 5 is also a necessary and sufficient condition for reflexivity. D Remark 11. In applications, we frequently encounter the following situation. A priori estimates lead us to consider a sequence { xn} n EN, II X n II E  C < + 00. Then, from Theorem 5, we see that it is possible to extract from the sequence {Xn}n E N a sub-sequence {xn'} which converges weakly to x in E. Let us now suppose that it is possible to prove that the limit x is independent of the extracted sub-sequence, for example because we know in advance that the solution of our problem is unique. Then the whole sequence converges weakly to x as is shown by: Proposition 7. Let E be a reflexive Banach space and {Xn}nEN a sequence of elements of E. We suppose that 
290 Chapter VI. Operators in Banach and Hilbert Spaces (i) IIx n liE  C < + 00 for all n E N, (ii) the sequence {x n , n EN} possesses a unique weak limit point x. Then the sequence {x n }  x weakly in E. Proof Suppose that the sequence {x n } does not converge weakly to x. Then it is possible to find 8 > 0, x' E E' and a sub-sequence, {x m } say, of {x n } such that (1.52) I<x', X m - x)1 > 8  0 for all m. But {x m } is a bounded sequence of E and from Theorem 5, there exists a sub- sequence {xm'} of {xm} which converges weakly to a certain element of E which, by hypothesis, can only be x, and we have lim <I, X m - x) = 0 for all fE E', m-+ 00 which contradicts (1.52). o 5.5. Weak-Star Convergence Definition 13. Let E be a normed space, E' its dual. Weak-star convergence is defined in the following manner: we say that a sequence {x}n EN, X E E', Vn, is weak-star convergent to x' E E' if (1.53) lim <x, x) n-+ + 00 <x', x) for all x E E. We denote this convergence by (1.54) x  x' weak-*. Weak convergence in E' is the following: x  x' in E' if (1.55) <x, x)  <x', x) for x E E". Weak-star convergence is therefore a new concept only when E is not reflexive. Example 18. Let Q be an open set of n and take E = LP(Q), 1 < p < + 00, E' = LP' (Q); weak convergence in E (resp. in E') is the following: J;,(resp.f)  f(resp. f') in E(resp. E') weakly if f/ngfdx(resp. f/9dX) -4 f/ gfdX (resp.llgdX) for all g' (resp. g) belonging to E' (resp. to E). o 
91. Review of Functional Analysis: Banach and Hilbert Spaces 291 Example 19. Let Q be an open set in n. The dual of L 1 (Q) is L 00 (Q); fn -+ f weakly in L 1 (Q), if: f /ng dx -4 f /g dx for all gEL 00 (Q). But (L 00 (Q))' does not coincide with L 1 (Q) as L 1 (Q) is not reflexive (unlike LP(Q), 1 < p < + 00 ). Thus we can define weak-star convergence in Loo(Q): .in -+ f in L oo(Q) weak - *, if f/;,gdX -4 f/gdX for all g E L 1 (Q). o We have for weak-star convergence the analogue of Theorem 5 and of Prop- osition 7. Theorem 6. Let E be a separable normed space. Then from every bounded sequence in E', it is possible to extract a sub-sequence which is weak-star convergent in E'. Proposition 8. Let E be a separable Banach space and {X}nE N a sequence of elements of E'. We suppose that (i) IIxIIE  C < +00, \In E N; (ii) the set {x, n EN} possesses a unique weak-star limit point x'. Then x -+ x' weak-* in E'. 6. Hilbert Spaces 6.1. Definitions In the space n the scalar product is linked to the Euclidean norm as follows: if x = (x 1, . . . , x n ), Y = (y 1, . . . , Y n) E n, n (x, Y) = L XiYi and i = 1 ( n ) 1/2 8 Ilxll = Jl Ixd 2 . In the same way in en we define the hermitian scalar product by: n (z, z') = L ZiZ, i= 1 ( n ) 1/2 Ilzll = if: 1 I Z il 2 8 We often use the notation Ixl for II x II. 
292 Chapter VI. Operators in Banach and Hilbert Spaces where z = (Z l' . . . , Zn), , ( , , ) t0n Z = Z l' . . . , Zn E "--' . Hilbert spaces are particular Banach spaces which are generalisations of the spaces [Rn and c n . Definition 14. Let E be a vector space on C. A sesquilinear form on E is a mappIng { (x, y) -+ a(x, y) ExE-+C wi th the properties (1.56) (i) a(x 1 + x 2 , y) = a(x 1 , Y) + a(x 2 , y) a(x, Yl + Y2) = a(x, Yl) + a(x, Y2) (ii) a(Ax, y) = Aa(x, y) (iii) a(x, AY) = Aa(x, y). Thus the mapping x -+ a(x, y) is a linear form for y fixed while the mapping y -+ a(x, y) is anti- (or semi-) linear form for x fixed. Definition 15. A hermitian form a( . , . )9 on E is a sesquilinear form satisfying ( 1. 5 7) a(x, y) = a(y, x) for all x, Y E E. Remark 12. (1) A hermitian form is a symmetric sesquilinear form. (2) If E is a real vector space, the notion of a sesquilinear form becomes that of a bilinear form and the notion of a hermitian form becomes that of a symmetric bilinear form. o Definition 16. A sesquilinear form a( . , . ) is said to be non-degenerate if it satisfies the following conditions: { if x E E is such that a(x, y) = 0, Vy E E, then x = 0 if y E E is such that a(x, y) = 0, Vx E E, then y = o. (1.58) We are here especially interested in sesquilinear forms which are positive definite, that is to say which satisfy (1.59) Vx E E, x =1= 0, a(x, x) > o. In particular, it is easy to prove that a positive definite sesquilinear form lS hermitian. In addition, a sesquilinear form is a hermitian form iff we have (1.60) Vx E E, a(x, x) E , 9 In the sequel, we shall often use the notation a(x, y) or a(u, v) to denote a( . , . ). 
91. Review of Functional Analysis: Banach and Hilbert Spaces 293 and it is obvious that a positive definite sesquilinear form is non-degenerate. We say further that a sesquilinear form is non-negative (or again, positive) if it satisfies: (1.61) Vx E E, a(x, x)  o. We then have the Proposition 9. Let a( . , . ) be a non-negative sesquilinear form on E. Thenfor all x, y E E we have the Cauchy-Schwarz inequality: (1.62) la(x, y)1 2  a(x, x) a(y, y). Proof The sesquilinear form a(x, y) being non-negative: o  a(x + 2y, x + 2y) = a(x, x) + 2Re {Ia(x, y)} + 121 2 a(y, y), V 2 E C. The choice a(x, y) . A = t I )1 (t E IR) If a(x, y) of- 0, a(x, y leads to a trinomial of the second degree in t which is positive for all t, and hence to (1.62). If a(x, y) = 0, (1.62) is obvious. o It follows immediately from Proposition 9 that { (i) if a( . , . ) is non-negative, x  (a(x, x)}t is a semi-norm on E. (1.63) (ii) if a( . , . ) is positive definite, x  (a(x, x)}t is a norm on E. Definition 17. Let E be a vector space over IK = C (or ) and a( . , . ) a sesquilinear (or bilinear) positive definite form on E. (i) The pair (E, a( . , . )) is called a complex (or real) pre-Hilbert space, (ii) A pre-I-Iilbert space which is complete is called a Hilbert space. A Hilbert space is a particular case of a Banach space. When (E, a( . , . )) is a Hilbert space, we say that a( . , . ) is a scalar product on E. In the sequel, we denote such a scalar product by ( . , . )E. Conversely, being given a normed space (E, II. liE) we can ask the question as to whether or not it is a pre-Hilbert space, i.e. does there exist a positive definite sesquilinear form a( . , . ) such that (a(x, x))t = Ilxll E ? The reply is given by Proposition 10. The norm II II E is a Hilbert norm iff the following relation holds: 1 Ilxlli + Ilyili = "2 [llx + ylli + Ilx - ylliJ ( 1. 64) (the parallelogram equality). Proof If (1.64) holds, we define a(x, y) by (1.65) a(x, y) = Re a(x, y) + i 1m a(x, y), 
294 Chapter VI. Operators in Banach and Hilbert Spaces with ( 1.65)' 1 Re a(x, y) = 4 (II x + YII - Ilx - Y II) 1 1m a(x, y) = 4 (II x + iy ll - Ilx - iy ll). The converse is immediate. The hermitian form we are seeking is thus given by the formula (1.65) with (1.65)'. o Examples 20. (1) IR n and en are Hilbert spaces. (2) [2 = {x = {xn}n E N*; X n E C, L IXn 1 2 < + 00 } n is a Hilbert space with the scalar product 00 (x, Y)l2 = L XnYn, n=l (Xn, Yn E C), and the associated norm ( 00 ) 1/2 Ilxll,2 = Jl Ix n l 2 . (3) Let Q be an open set in IRn. L 2 (Q) = the space of (the classes of) functions, measurable in Q for the Lebesgue measure, and the square of whose modulus is integrable on Q, is a Hilbert space for the norm IIull L 2 = (f.a'U(xWdX Y/2, and the scalar product (u, V)L2 = f.a u(x) V (x) dx. (4) The Sobolev spaces Hm(Q) (already seen in Chap. IV) are Hilbert spaces. o Definition 18. In a pre-Hilbert space E, two elements x and yare said to be orthogonal when (x, Y)E = O. In this case we have ( 1. 66) IIx + Y IIi = Ilxlli + II Ylli. 
91. Review of Functional Analysis: Banach and Hilbert Spaces 295 6.2. Projection on a Closed Convex Set Theorem 7. Let K be a non-empty closed convex set in a (real or complex) Hilbert space E and let x E E. Then there exists a unique element Xo E K such that ( 1. 67) Ilx - Xo liE == inf Ilx - IIE' EK Proof (a) Existence of Xo. We may always suppose that x == o. Let d == inf IIIIE; there exists a sequence {xn}n E N*, X n E K such that EK d == lim II X n II E . noo With the aim of proving that {xn} is a Cauchy sequence in E, we note that !'(x n + x m ) E K (since K is convex) with the result that 11!(x n + xm)IIE  d, \:1m, n. Then, from (1.64), it follows that Ilxn - X m IIi == 2(llx n Iii + Ilxm Iii) - Ilxn + X m Iii  2(llx n Iii + Ilx m IIi) - 4d 2 , from which we deduce that IIxn - X m liE  0 when nand m  00. Thus {xn} is a Cauchy sequence in E which is complete. Then X n  Xo in E and, as K is closed, Xo E K, and we have lim II X n II E == II x 0 II E == d. noo (We note that the Hilbert space structure of E is brought into play by the utilisation of (1.64)). (b) Uniqueness of xo. Suppose that there exist two elements xo, Xl E K with Xl =1= Xo, IIxo liE == II Xl liE == d. Then we have d Xo + Xl  2 1  2 (1lx o liE + Ilxlll E ) == d, E from which Xo + Xl 2 == d. , E 
296 Chapter VI. Operators in Banach and Hilbert Spaces and from (1.64) we obtain II 11 2 2 ( II 11 2 I IxlIIE 2 - IIxo + 2 XlIIi ) = 0, Xo - Xl E = Xo E + from which we have Xo = Xl. o Definition 19. The element Xo defined in Theorem 7 is called the projection ofx on K. We write (1.68) Xo = Px. Theorem 8 (Characterisation of the Projection). Let X E E be given. The following three propositions are equivalent: i) Xo E K, IIx - Xo liE  IIx - yliE for all y E K; ii) Xo E K, Re (x - Xo, Y - XO)E  0 for all y E K; iii) Xo E K, Re (x - y, Xo - Y)E  0 for all Y E K. Proof (a) (i)  (ii). Let Y E K and let Y = (1 - t)xo + ty E K, t E JO, 1J IIx - xolli  IIx - Ylli = IIx - xolli - 2tRe(x - Xo, Y - XO)E + t 2 11y - xolli, from which 2Re(x - Xo,y - XO)E  tlly - xolli; Letting t  + 0, we obtain (ii). (b) (ii)  (iii). If Re (x - Xo, Y - XO)E  0 we can write o  Re(x - y + Y - Xo,y - XO)E = Re(x - y,y - XO)E + lIy - xolli; from which (iii) follows. (c) (iii)  (ii). Let y E K and Y = Xo + t(y - xo) E K, t E JO, 1J. Re(x - Y,x o - Y)E  0  - tRe(x - Xo,y - XO)E + t 2 11 y - Xo IIi  0; from which tlly - xolli  Re(x - Xo,y - XO)E; we obtain (ii) by letting t  + o. (d) (ii)  (i). We have Re(x - Xo, Y - XO)E = Re(x - Xo, Y - x + x - XO)E  0 Re(x - Xo, Y - X)E + Ilx - Xo IIi  0; 
 1. Review of Functional Analysis: Banach and Hilbert Spaces 297 from which we deduce IIx - Xo 11 2  Re(x - xo, x - Y)E  IIx - Xo liE IIx - Y liE by the Cauchy-Schwarz inequality; this proves (i) and hence the theorem. o Remark 13. The projection P defined by (1.68) is a contraction mapping or a contraction 10. Indeed if x, y E E have projections Px, Py on K we have: Re(x - Px, z - PX)E  0, Re (y - Py, z - PY)E  0, Vz E K, Vz E K; we choose z = Py in the first of the above inequalities, z = Px in the second, and add the two inequalities to obtain Re(x - Px, Py - PX)E + Re(y - Py, Px - PY)E  0, from which we derive IIPx - Pylli  Re(x - y, Py - PX)E; by use of the Cauchy-Schwarz inequality, we obtain (1.69) IIPy - Pxll E  Ilx - yiIE' which proves that P is a contraction. o (1.70) A closed vector sub-space of E being a particular example of a closed convex set, we obtain the Proposition 11. Let E be a Hilbert space; V a closed vector subspace of E. Let u E E be given. Then there exists one (and only one) element U o E V such that { (u - U O , CP)E = 0 Vcp E V, U o = Pu. Proof (i) Let U o = Pu, where P defined by (1.68), is the projection on V; then from Theorem 8 (ii) Re (u - Pu, v - PU)E  0, Vv E V. Here we can choose v = Pu + cP, cP E V, from which we have + Re(u - Pu, CP)E  0, Vcp E V, 1 0 We call a mapping A a contraction if "Axil  !Ixll, \Ix E X, a Banach space. 
298 Chapter VI. Operators in Banach and Hilbert Spaces so (1.71 ) Re (u - Pu, CP)E == 0, Vcp E V. (ii) Now if we replace cP by + icp in (1.71) we obtain in to tal: 1m (u - Pu, CP)E == 0, (u - Pu, CP)E == 0, Vcp E V; Vcp E V. o Remark 14. Let E and V be as in Proposition 11. We introduce the notation (1. 72) V-1 == {x E E; (X,V)E == 0, Vv E V}. Proposition 11 shows that u - Pu == u - U o E V -1. Conversely, if u - V o E V -1 then V o == Pu == U o . Every element u E E can thus be written in the unique manner { u == v + v-1, v == Pu. We say that P is the orthogonal projection or the orthogonal projector 11 on V. If Q denotes the orthogonal projector on V -1 (we note that V -1 is always closed, even if V is not), (1.73) can be written as (1.73) V E V, v-1 E V-1 (1.74) u == Pu + Qu, which is a very important property of Hilbert spaces which distinguishes them among all the Banach spaces. We say that E is decomposed into the direct sum of the orthogonal spaces Vand V -1 and we write ( 1. 7 5) E == V Ef) V-1; V being closed, it is easy to verify that (V -1)-1 == V and we can state the Theorem 9. Let V be a closed sub-space of a Hilbert space E, and let P be the orthogonal projection on V. Then P E fi?(E) == fi?(E, E) and we have (1.76) { P == p2 (Pu, V)E == (u, PV)E for all u, v E E. Conversely, every P E fi?(E) satisfying (1.76) is the orthogonal projector on V == P(E). Proof (i) We easily verify that P is linear by using (1.70). The continuity of P follows from the fact that P is a contraction. Obviously, p2 == P and if Q denotes the orthogonal projector on V-1 11 The word "orthogonal" can be omitted when there is no risk of ambiguity. 
 1. Review of Functional Analysis: Banach and Hilbert Spaces 299 (Pu, V)E == (Pu, Pv + QV)E == (Pu, PV)E == (Pu + QU, PV)E == (U, PV)E, from which (1.76) follows. (ii) Conversely, let us put V == P(E); P E 2(E) implies that V is a vector subspace of E. We shall prove that we have: ( 1. 77) U E V <=> U == Pu. First of all if u E V, then by definition, there exists v E E such that u == Pv. Hence (1.76) implies u == Pv == p 2 v == Pu. Now if V rn E E and if PV rn  w when m  00 we have (by continuity of P) P PV rn  Pw when m  00, or again (p2 == P): PV rn  Pw. Thus V is closed. Let P be the orthogonal projection on V, Q the orthogonal projection on V1-. We shall prove that P == P. If v E V we have v == Pv from (1.76) and Pv == v, so Pv == PV. If v E V 1-, Pv == o. From (1.76), JPv, PV)E == (v, P 2 V)E == (v, Pv) == 0 as v E V1-, Pv E V; hence Pv == 0 and Pv == PV. If v E E, we decompose v into Pv + Qv and use the preceding two cases; the theorem is therefore proved. o 6.3. Orthonormal Bases (1) Sets of orthonormal vectors Definition 20. A set {Xl}lE A' (A an index set), of elements Xl E E, a Hilbert space, is called an orthonormal family in E if the elements Xl satisfy { 1 if A == A' (1.78) (Xl' Xl')E == b l , l' == 0 if A i= A'. We note that the Xl are linearly independent. n Indeed, if L CXiX li == 0, CX i E IR or C then: i= 1 CX j == ( .f OI:i X Ai' X Aj ) == 0, l= 1 E ] == 1, 2, . . . , n. 
300 Chapter VI. Operators in Banach and Hilbert Spaces Conversely, by using the Schmidt orthonormalisation procedure, we can from any sequence Xl' . . . , X n , . . . of linearly independent elements of E, construct an orthonormal sequence of elements Wi' . . . , W n' . . . defined in the following manner: Vi = Xl' (1. 79) nl (X n , VJ 2 E V i V n = X n - f...J i= 1 II Vi liE W n V n , Vn. IIvnll E (2) Generation of a Hilbert space Definition 21. A family {Xl}AE A is said to be total (or complete) in the Hilbert space E if (1.80) (v, X..)E = 0 } 0 for all A E A  V = . Let us denote by V A the closure in E of the vector sub-space generated by the family (x l } lEA' Let P A be the orthogonal projection on V A and Q A the orthogonal projection on V. We have: for all X E E, X = PAX + QAx, and so (QAX,Xl) = 0, VA E A; QAX = 0 for all X E E. X = PAX, Vx E E Thus with the result that E - VA. In other words: the vector sub-space generated by a total family is dense in E. A Hilbert space is said to be separable if it contains a denumerable total family. It follows then from the Schmidt orthonormalisation procedure that if E is separable, then E contains a denumerable total orthonormal family. Definition 22. A total orthonormal family is called an orthonormal base. Remark 15. We can prove, with the help of Zorn's lemma, that every Hilbert space (=1= {O}) contains an orthonormal base. o Remark 16. Suppose that E is separable, and let {wm}m E N* be an orthonormal base. Let u E E; we can approximate u by a sequence Urn = L (X W n where at n most a finite number N (m) of the coefficients (X are non-zero. 
 1. Review of Functional Analysis: Banach and Hilbert Spaces 301 If PN(m) is the orthogonal projection on the sub..space of finite dimension N(m) generated by W l' W z , . . . , W N (m)' we have II PN(m)U - uIIE  lIu m - ulI E , with the result that PN(m)u  u when m  00. Thus, if Pm denotes the orthogonal projection on the vector sub..space V m == {wl,...,w m },wehave (1.81 ) Pmu  u in E when m  00. Let us put Pmu we have: m L Un. W n . From the characterisation (1.70) of the projection, n=l (u - Pmu, Wj)E == 0, ) == 1,. . . , m, which leads to (1.82) U j == (u, w j ), } == 1,. . . , m. The coefficients u j given by (1.82) are sometimes called Fourier coefficients relative to the orthonormal base {wn}n E N*. Thus (1.83) m Pm U == L (u, W j ) E W j . j= 1 In addition (1.84) IIPmulli m L I (U, W j ) E 1 2 . j= 1 Since Pmu  u in E we have: (1.85) 00 L I (u, w j )E1 2 - II ulli; j=l (1.85) is called Parseval's relation. If u E E, we then denote (1.86) 00 00 u - L UjW j == L (u, Wj)EWj, j=l j=l the sense of the series in (1.86) being lim II Pm u - uliE == O. moo o Remark 17. In the preceding remark, the mapping u  {uj}j E N*, is an iso.. metry from E onto 1 2 , from (1.82)-(1.86). o 
302 Chapter VI. Operators in Banach and Hilbert Spaces Remark 18. Let {Wn}n E N*, be a denumerable non-total orthonormal family in E, and let V be the closure of the vector sub-space generated by {wn}n E N*, then E == V ffi v-I-, V-L =1= {O}. Let P be the projector on V, Q that on V -L. If Pm is the projector on V m == {w 1 ,..., W m }12 then, from Remark 16, Pmu  Pu in E. We have for all u E E { u == Pu + Qu, Qu =1= o. Iluili == IIPulii + IIQulli, Thus IIPulli  Iluili. As a result we have the Bessel-Parseval inequality ( 1. 8 7) 00 L I(u, w j )El 2  lIull j= 1 which is an equality only when u E V. o 6.4. The Riesz Representation Theorem 13. Reflexivity Let E be a Hilbert space, E' its dual. We denote by < . , . ) the duality between E' and E and II . II * the dual norm of II. liE. We then have the very important (see 3): Theorem 10 (Riesz' theorem). Iff E E', there exists a unique element u f E E such that we have (1.88) { <I, v) == (v, Uf)E \Iv E E Ilfll* == IIU f liE. Conversely, every element u E E defines an element fu E E' such that (1.89) { <fu, v) == (v, U)E Ilfull* == Iluli E . \Iv E E Proof (i) The converse of the theorem is obvious. (ii) The direct theorem falls into two parts: (a) Uniqueness ofu f . If <I, v) == (v, U})E == (v, U})E for all v E E, we have (v, u} - U})E == 0, \Iv E E, 1 _ 2 so U f - U f . 12 This denotes here the vector space generated by WI' . . . , W m . 13 Also called the Riesz-Frechet theorem (see Brezis [IJ). 
1. Review of Functional Analysis: Banach and Hilbert Spaces 303 (b) Existence of U f. If f == 0, U f == 0 answers the question. If f =1= 0, let N == Ker f == {v; V E E, <I, v) == O} (the kernel off); N is a closed vector sub-space of E. If N.l is the orthogonal sub-space of N in E, then we easily see that dim N.l == 1. Let U o E N.l and put (1.90) U f == < I, U o ) U Ilu o Iii 0, U f E N.l. Then: (1) if v EN, (v, U f ) E == 0 == < I, v), (2) if v E N.l then v == AU o (because dim N.l == 1) (v, Uf)E == A<f, u o ) == <I, AU o ) == <I, v) (3) if v E E == N Ef) N.l, V == V N + vN.l and we have again from (1)-(2) above: <I, v) == (v, u f )E. In addition II fll* == sup I <I, v)1 IIvll E = 1 sup I( v, U f )E I  II U f II E, Ilvil E = 1 11111*  \f, IluIIE ) = CluIIE ' U f t IIU f liE; so II f II * == II U filE' completing the proof of the theorem. o Remark 19. (1 ) We can therefore identify a Hilbert space with its anti-dual. (2) Denote by E' the anti-dual of E. There exists, in an obvious way, a variant of the Riesz theorem for the antidual E'. (3) It is also possible to identify E and E" and the Riesz theorem therefore shows that a Hilbert space is a reflexive Banach space. In particular, the unit ball in a Hilbert space is weakly sequentially compact. It follows, again from the Riesz theorem, that a sequence {un}n E N converges weakly to U in E if (un, V)E  (u, V)E' \Iv E E or (v, Un)E  (v, U)E, \Iv E E. We then show easily that if a sequence {Un}n E N satisfies the conditions { i) Un  U in E weakly; ( 1.91) ii) "un liE  IluIIE when n  00, 
304 Chapter VI. Operators in Banach and Hilbert Spaces then ( 1.92) Un  u in E when n  00 o 7. Ideas About Functions of a Real or Complex Variable with Values in a Banach Space Let X be a Banach space, and Q a part of IR or C. We consider a mapping t  x(t) of Q into X; the ideas of limits, continuity, differentiability and integrability (in the Riemann sense) of continuous functions with values in X, endowed with the strong topology, are supposed to be known already; there are in addition few differences in the case when X is finite dimensional. We give here the analogous concepts when X is endowed with the weak topology. 7.1. Weak Topology A mapping t -+ x(t) defined in a neighbourhood of to in IR or C is said to be weakly continuous at to if: Vx* E X', the mapping t -+ <x*, x(t) > is continuous at to. It follows from Theorem 1 (or from Proposition 6) that if x is weakly continuous at to, then IIx(t)11 is bounded in the neighbourhood of to. 7 .2. Weak Differentiability A mapping t  x(t) is said to be weakly differentiable at to E IR if: Vx* E X', the mapping t -+ <x*, x(t) is differentiable at to. If this derivative has the form <x*, u), u E X is called the weak derivative of x at to. We call attention to the particular property: Proposition 12. If the mapping t -+ x(t) is weakly differentiable on ]a, b[ an interval of IR and if its weak derivative vanishes identically, then x is constant on ]a, be. Proof Indeed, by hypothesis Vx* E X', t -+ <x*, x(t) is constant on ]a, be; from which <x*, x(t) = <x*, x(t ' ), Vt, t ' E ]a, be. This being true for all x* E X', we deduce that: x(t) = x(t ' ), Vt, t ' . o 
g2. Linear Operators in Banach Spaces 305 7.3. Weak Holomorphy Let Q be an open set in C; a mapping z  x(z) on Q with values in C is said to be weakly holomorphic if: Vx* E X' the mapping z  (x*, x(z) is holomorphic on Q with values in C. However, the following theorem, which we shall find useful in 2, shows that it is pointless to distinguish between the concepts of weak or strong holomorphy. Theorem 11. Let Q be a domain in C; if the mapping z  x(z) is weakly holomorphic on Q, then it is holomorphic in the sense of the norm (i.e. it is strongly differentiable ). Proof For x* E X', we can apply Cauchy's integral formula to the function z  (x*, x(z) to obtain * - 1 f (x *, x (() ) (x , x(z) - _ 2 . ( d(, Vz E Q, In r - z where r is a circle with centre at z and contained in Q. Let us then form: / x*, x(z + h) - X(Z) ) _  <x*, x(z) > =  f <x*, x(0) d(. \ h dz 2in Q (( - z - h)(( - Z)2 Since x is weakly continuous on r and hence bounded I(x*, x(()1  Cllx* 11 *. Consequently the first member converges to 0 uniformly on the ball IIx*I!*  1 in X' when h  o. It follows that x(z) is strongly differentiable, hence holomorphic. o 2. Linear Operators in Banach Spaces 1. Generalities on Linear Operators 1.1. Domain, Kernel and Image of a Linear Operator Let X and Y 14 be two Banach spaces; in a general way, by a linear operator A from X to Y, we mean a linear mapping defined on a sub-space 15 D(A) of X and with values in Y. 14 Throughout this g2, X and Yare Banach spaces. 15 We use throughout g2 the terminology "sub-space" for "vector sub-space". 
306 Chapter VI. Operators in Banach and Hilbert Spacs The sub-space D(A) is called the domain of A. If D(A) == X, A is defined everywhere. If D(A) is dense in X, A is said to be densely defined. The particular case Y == IR or C corresponds to the linear forms of  1. Example 1. X == Y == IP (1 :( p :( (0). We define the operator "translation to the right" Ad defined by x == (x 1, X 2, . . . , x n , . . . )  Ad X == (0, Xl' . . . , X n + l' . . . ). (d for "droite"). It is a linear operator defined on the whole of I p. In the same way, the operator "translation to the left" Ag defined by x == (Xl,X2,...,Xn,...)Agx == (X 2 ,X3'...'X n + 1 ,...). (g for "gauche") is an example of a linear operator defined on the whole of I p. D Example 2. X == Y == C(j0 ([0, 1 J). Taking for the domain D(AM) == C(jl ([0, 1J), we define a differentiation operator AM == d/dx "in the classical sense" in the space C(j0 ([0, 1 J), by: du A M u == - dx' Vu E C(jl ([0, 1J). We can also consider AM as an operator (defined everywhere) from X == C(jl([O, 1J) into Y == C(jO([O, 1J). D Let A: D(A) c X  Ybe a linear operator. The sub-space R(A) of Y defined by R(A) def {y E Y; there exists x E D(A) such that y == Ax}, is called the image of A; it is also denoted by 1m A. Let E be a sub-space of X; we shall denote by A(E) the image of E n D(A) by A. In particular we have R(A) == A(D(A)). We define the kernel of A, denoted by Ker A, or ker A or again by N(A) to be the sub-space of X def { } N(A) == x E D(A), Ax == 0 . Finally, we call the sub-space of X x Y defined by G (A) def {{x, y} E X X Y, the graph of A. X E D(A), y == Ax}. 
g2. Linear Operators in Banach Spaces 307 We verify that a subspace G of X x Y is the graph of a linear operator iff (2.1) G does not contai an element of the form {O, y}, with y -# o. 1.2. Nullity and Deficiency Indices We recall that if E is a vector sub-space of the Banach space X we define its orthogonal E.l by E.l def {f EX', < f, X> == 0, V X E E} ,. If E.l is of finite dimension 13, we say that E is of finite codimension 13 in X. We then define the defect (or the deficiency index) fJ(A) of an operator A to be the codimension of its image R(A) in Y (with the convention that fJ(A) == + 00 if R(A) is of infinite dimension). We call the dimension cx(A) of the kernel N(A) the nullity (or the index of nullity), with the convention that cx(A) == + 00 if N(A) is of infinite dimension. Example 3.. Again taking up the operators Ag and Ad defined in Example 1, we see easily that R(Ad) == {xEIP,x == {0,X 2 ,X 3 ,...}} is of codimension 1, so fJ(Ad) == 1; on the other hand the kernel of Ad is reduced to 0 so CX(Ad) == 0 (Ad is injective). In the same way, we have R (Ag) == IP, so fJ(Ag) == 0 (Ag is surjective)16; the kernel of Ag is N (Ag) == {x E 1 P, x == {x 1 , 0, 0, . . . }} and so cx(Ag) == 1. o Example 4 (see Example 2). The operator djdx is surjective from re 1 ([0, 1]) onto reO([O, 1]), and its kernel is the set of constant functions. o 1.3. Basic Properties of Linear Operators 1.3.1. Extension, Sum and Composition of Operators. Let A and B be two linear operators from X to Y with D(A) c D(B) with Ax == Bx for all x E D(A). We say then that B is an extension of A (this idea generalises to linear operators the extension of linear forms envisaged in Theorem 4,  1). Conversely, A is called the restriction of B to D(A). We observe that there is an inclusion of graphs: G(A) c G(B). If A and B are two linear operators from X to Y, we define the operator A + B by (A + B)x == Ax + Bx and its domain by D(A + B) == D(A) n D(B). 16 We note that, with the definition given above, P(A) = 0 does not lead to the conclusion that A is surjective but to the conclusion that R(A) is dense. 
308 Chapter VI. Operators in Banach and Hilbert Spaces Let A be a linear operator from X to Y, B a linear operator from Y to Z; we define the composition operator BA as the linear operator from X to Z with domain A -1 (D(B))17, such that (BA) x clef B(Ax) for all x E A -1 (D(B)). Remark 1. In the preceding definitions, the domain of the operator A + B or that of the operator BA can find itself reduced to {O}. o Let A be a linear operator from X to Y, with N(A) = {O}, that is to say injective; we can then define its inverse A -1: Y  X to have domain R(A) and to be such that x = A - 1 Y <=> Y = Ax. We then have the following properties (2.2) (i) the image of A -1 coincides with the domain of A: R(A -1) = D(A), (ii) the operator A -1 A coincides with the identity on D(A), (iii) the graph of A - 1 is GS(A), the inverse graph of A. Finally, if A and B are respectively injective linear operators from X into Yand from Y into Z, we have (2.3) (BA) - 1 = A - 1 B- 1 . 1.3.2. Quotient Ope!ator. Let A be a linear operator from X to Yand N(A) its kernel. Denote by X = XjN(A) the quotient space provided with the norm II x II = in( II x II. XEX where x is the equivalence of x. This is a Banach space. __ We then define the quotient operator A by: -- Ax = Ax, -- \Ix E X; -- A is injective by construction. 1.3.3. Transposed Operators. We denote by X' and Y' the respective duals of X and of Y and consider two operators: A: D(A) c X  Y and B: D(B) c Y'  X'. def 17 For E c Y, we define A -1(E) = {x E D(A); Ax E E}. 
g2. Linear Operators in Banach Spaces 309 We say that A and B are two mutually transposed operators if (2.4) (y', Ax )y' x y == (By', X )x' x x' "Ix E D(A), Vy' E D(B) ( we say also that B is a transpose of A). The following basic properties are immediate: If Bl c B then A and Bl are also mutually transposed. On the other hand: it follows from (2.4), that if A and B are mutually transposed, the two subspaces G(A), the graph of A and G S ( - B) the inverse graph of ( - B), are orthogonal in the duality of X x Y and X' x Y'. 1.3.4. Examples. Let us take up the body of examples already described above to illustrate the definitions which we come to introduce. Example 5. In the situation of Example 1, taken up again in Example 3, we verify easily that: (i) the products of Ad and Ag are given by: Ag Ad == 1 on X == Z p AdAg: x == {Xl' . . . , X n , . . . }  {O, X2, . . . , X n , . . . }, (ii) Ad is invertible, and in addition: A - l A d C g' (Ai 1 is the restriction of Ag to R(A d )). (iii) Ag is such _that N(Ag) == {{Xl' 0, . . . ,0, . . . }, Xl E [R}l8 IS closed, the quotient space X can be identified with {{O, X2, . . . , X n , . . . }} == R(A d ), and we have Ag ( x) == {x 2' X 3' . . . , X n + l, . . . } == Ag (X) == Ag {O, X 2' . . . , X n , . . . }; (iv) finally as far as concerns duality, we identify X' with zq where p-l + q-l == 1 for 1 < p < + 00, then the operators Ad in ZP and Ag in zq Ag in ZP and Ad in zq are mutually transposed. o Example 6. In the context of Example 2 for X == C(j0 ([0, 1 J), we introduce successively the different operators of differentiation: dx AM defined on D(A M ) == C(jl([O, 1J) by AMx == - dt 18 Or C if we are working in the complex field. 
310 Chapter VI. Operators in Banach and Hilbert Spaces and the different restrictions corresponding to the particular boundary con- ditions: * Ak on D(A k ) = {x E c&l ([0, 1]); x(O) = kx(1)}, with in particular: Ao on D(Ao) = {x E c&l ([0, 1]); x(O) = O} Aoo on D(Aoo) = {x E c&l ([0, 1]); x(1) = O} * Aoo on D(Aoo) = {x E c&l ([0, 1]); x(O) = x(1) = O} * and finally Am on D{Am) = '@(]O, 1 [) functions infinitely differentiable and with compact support on ]0, 1 [. In these conditions we verify easily that: we have the sequence of inclusions Am C Aoo c Ak C AM' Vk. , the operator AM is not injective and hence not invertible, but on the contrary Ak is invertible iff k -# 1, its inverse being defined everywhere by Ak 1 y(t) = 1  k [L y(u)du + k f y(u)du 1 but the restriction Aoo is invertible only on the domain D(AoOl) = {x E O([O, 1 ]); J: x(t) dt = O}, and its inverse is defined by AOOl y(t) = L y(u) du; finally Am sends .@(] 0, 1 [) into .@(] 0, 1 [) but IS only invertible on .@(JO, 1 [) n D(AoOl). D Remark 2. We see, right now, by this example the necessity of making precise the domain at the same time as the operator; the datum of an operator A is that of its graph G(A). D 2. Spaces of Bounded Operators 2.1. Introduction Let X and Y be two Banach spaces and recall that 2(X, Y) is the set of all bounded linear operators from X into Y. We have seen (Proposition 2 of 91) that this is a Banach space. 
2. Linear Operators in Banach Spaces 311 After having studied some examples of bounded linear operators we study in this section the various concepts of the convergence of operators (uniform, strong, weak). We study also the composition and invertibility of bounded operators, then the spectral radius and the resolvant set. Finally we make precise the notions of continuity, of differentiability and of weak holomorphy, introduced in 1, in the case of functions of complex (or real) variables with values in a space of operators. Example 7. Every linear operator defined on a finite-dimensional space is bounded. o Example 8. We recall the operators Ad and Ag of Example 1: they are bounded on IP with, in addition IIAdlllP == IIAglllP == 1; indeed, \Ix E IP: IIAdxl1 == Ilxll, IIAgxl1  IIxll (the equality holding, for example, for the sequences en == {O, . . . , 0, 1, 0, . . . }). o Example 9. "Infinite matrices in 1 p." Let X == 1 P and let {a h , k} N* x N* be a given double sequence, real or complex. For every x == {Xl' . . . , x n , . . . }, we can associate the sequence Y == {Yn}nE N* where (2.5) clef 00 Yn == Lan, kXk k=l (when this expression exists). We shall show that, if 00 L lah,kl  M' k=l (2.6) 00  I I  M "  a h , k" , h=l the definition (2.5) allows to define an operator A (denoted by the "infinite" matrix (ah,k)) x  Ax == Y, bounded on X with 1 1 1 1 IIAII  M'li M"li where - + - == 1. p q Indeed, (2.5) gives 00  1 1 IYhl   lah,kllilah,kllilxkl; k=l 
312 Chapter VI. Operators in Banach and Hilbert Spaces we apply Holder's inequality to obtain p IYhI P :::; [ktllah,kl]'[ktllah,kllxkIP 1 We can then form 00 00 00 p I IYhl P :( M'q I I lahkllxkl P , h=1 h=1 k=1 the convergence being assured by Fubini's theorem; inverting the order of the summations, we obtain: 00 00 00 I IYhl P :( M'f I Ixkl P I lah,kl. h=1 k=1 h=1 By the hypothesis, we find from this: 00 00 I IYhl P :( M'fM" I Ixkl P , h=1 k=1 and the result by: II Ax II :( M' M"t 1/ x II, \/x E X, (the modification is immediate for p == 1 or p == + (0). o Remark 2 bis. The displacement operators introduced in Example 8 and taken up later are particular cases of infinite matrices with: .f d . I h . h { 1, k == h - 1, lor Ad' ISp acement to t e rIg t, ah k == h . 19 , 0 ot erWIse ; . { 1, k == h + 1, for Ag, dIsplacement to the left, a h k == h . , 0 ot erWIse. o Example 10. Integral 0 peratars. Let Q be an open set in IRn, Q' an open set in IR m and k: Q' x Q  IR or C a given mapping. We can prove, as previously, that if (2.7) to! I k(t, u)1 du < M, Vt E Q', and f Q' Ik(t, u)1 dt < M ' , Vu E Q 19 Hence, in particular, ah,k = 0 if h = 1 (otherwise h - 1 = 0, which is forbidden). 
2. Linear Operators in Banach Spaces 313 then the mapping of LP(Q) into Lq(Q'), x M y, defined by . y(t) def to! k(t, u)x(u) du, Vx E U(Q) allows to define a bounded operator A: LP(Q)  Lq(Q') such that y == Ax. We then have: 1 1 IIAII < M1/qM,1/p for - + - == 1. q p D Remark 3. Let us consider the operator Ao defined in Example 6; we verify that its inverse given by y == AD" 1 x with y(t) = J: x(u)du is a simple particular case of an integral operator with k(t, u) == X[O,t](u) (characteristic function of the interval JO, t[). Incidentally, we have shown that the solution of the equation AoY == x that is to say, y == AD" 1 x is unique and depends continuously on the given x since AD" 1 is a bounded operator. In the more general context of partial differential equations, when the problem is well-posed, the solution depends continuously on the data. The mapping which associates the solution of the problem with the data is a fundamental example of a bounded operator. We have had numerous examples in the previous chapters: for example the mapping which with f E L 2(Q) associates the solution u of the Dirichlet problem { - L1u == fin Q, an open bounded set in IR n , grad u E L 2(Q)n, u==o on oQ, defines a bounded operator of L 2 (Q) into L 2 (Q). D 2.2. Various Concepts of Convergence of Operators We say that a sequence of linear operators {An} C 2(X, Y) IS uniformly convergent if it converges in norm, that is to say if (2.8) lim II An - A II == 0, n-+oo where II II is the norm in 2(X, Y) defined in (1.24). u We shall also denote this by A == u - lim An or by An  A when n  00. n-+oo 
314 Chapter VI. Operators in Banach and Hilbert Spaces In the context of partial differential equations, if A denotes the linear mapping which associates with the data g, the solution of the problem, and An the solution of a modified problem (for example the problem approximated by the method of finite elements studied in Chap. XII), a result of the type (2.8) will be particularly interesting, for it will 'prove that the convergence of the solution of the modified problem to the solution of the exact problem is independent of the data. This concept of convergence is too strong in many applications, so we also introduce weaker notions of convergence. In the context, raised above, of partial differential equations they will corre- spond to a convergence, which contrary to uniform convergence, does depend on the data. We say that the sequence of operators {An} converges strongIy 20 to A if (2.9) \/x E X, Anx  Ax, when n  00; in other words, if for all x E X, Anx converges strongly to Ax in Y. We shall then write s An  A or A == s - lim An. n-+oo In the same way, we say that the sequence of operators {An} converges weakIy21 to A if (2.10) \/x E X, w Anx  Ax when n  00; otherwise said, if for all x E X, Anx converges weakly to Ax in Y. We shall then write w An  A, or else A == w - lim An. n-+oo It is obvious that uniform convergence implies strong convergence, and that strong convergence implies weak convergence: u s w An  A => An  A => An  A. The converse properties are not, in general, true except in the case of finite dimension. Example 11. Let X == Y == IP and let the sequence of bounded operators Ak be defined by x == {x m }  Akx == Yk with Yk == (x k + 1, . . . , X n + k + 1, . . .) 20 Note the difference with strong convergence previously defined in a Banach space. 21 Do not confuse either this notion with weak convergence in 2(X, Y) in the sense of Definition 12 of  1. 
2. Linear Operators in Banach Spaces 315 (these are the iterated powers of the operator Ag - see Remark 2bis - in the sense of the composition of operators which will be studied later). This sequence of operators Ak satisfies: II Ak II == 1, \I kEN. However: s-lim Ak == 0; as a matter of fact, \I x E lP, IIYkl1 = C=+l IXhl P YIP  0 as k  00 D We shall now call attention to the following fundamental theorem on the properties of weak convergence as related to convergence in norm. Theorem 1. Every weakly convergent sequence {Ak} E 2(X, Y) is uniformly bounded. Proof By definition, we know that \Ix E X, the sequence {Anx} c Y is weakly convergent and hence bounded by Proposition 6 of 1. Consequently, \Ix E X the set {Anx; n EN} is bounded in Y. From the Banach-Steinhaus theorem of 1 (Theorem 1), we deduce that II An II  c, \In E N. D The result is a fortiori true if the sequence An converges strongly. Corollary 1. If a sequence {An} C 2(X, Y) converges weakly or strongly to A then A is bounded and (2.11 ) II A II  lim inf II An II. n-+oo Proof If we suppose weak convergence, we are led, thanks to Proposition 6 of  1, to: II Ax II  lim inf II Anx II, \Ix E X which to all intents and purposes is the n-+oo case of strong convergence for which the result is immediate. D Corollary 2. The space 2(X, Y), X and Y being Banach spaces, is sequentially complete in the sense of strong convergence. In addition, if Y is reflexive, then 2(X, Y) is weakly sequentially complete. Proof (i) The first part of the Corollary follows directly from Corollary 1 of  1. (ii) Let (An) be a Cauchy sequence in 2(X, Y) for the topology of the weak convergence of operators. Then, for each x E X, (Anx) is a Cauchy sequence in Y for the weak topology, hence converges weakly in Y since Y is reflexive (see Bourbaki [2J, Chap. IV, 3.3, p. 88). Let Ax be this limit; the mapping 
316 Chapter VI. Operators in Banach and Hilbert Spaces X E X  Ax E Y is linear; in addition by applying the Banach-Steinhaus theorem (twice), we verify that this mapping is bounded, from which the corollary follows. 0 Remark 4. Let V be a dense subset of X. We can prove that if a sequence of operators (An) is bounded and satisfies \/x E V, An x  Y E Y when n  00 then the sequence (An) converges strongly to an operator A E 2(X, Y). o Remark 5. (1) It may be shown that if < I, Anu) converges uniformly to < I, Au) for all u and f such that II u IIx :( 1 and IIfil Y' :( 1, then An converges uniformly to A. s (2) If An  A, then the sequence (Ax) converges uniformly to Ax in Y for x in any compact set in Y. w (3) If An  A, then for given x E X, the sequence <I, Anx) converges uniformly to < I, Ax) for f in any compact set in Y'. o The above remarks explain why these three concepts of convergence coincide in a finite-dimensional space which is characterised by the fact that the bounded closed sets are compact (which is not true in a space of infinite dimensions). o 2.3. Composition and Inverse of Bounded Operators 2.3.1. Definition. In the case of bounded operators, the composition of two operators is easier to define, since the domain of a bounded operator is a complete space. Let us consider two operators A E 2(X, Y), B E 2(Y, Z); the following correspondence: X I-+' Ax I-+' B(Ax) defines everywhere a bounded operator c: X  Z, denoted by BA .and satisfying (2.12) II C /I  II B II II A II (which follows immediately from the definitions of the norms). Example 12. Let us take up again the material of Example 9 with X = Y = Z = IP and two infinite matrices A = {ah,k}N* x N* and B = {bh,k}N* x N* satisfying (2.6). 
2. Linear Operators in Banach Spaces 317 We can verify that we generalise also the product of finite-dimensional matrices: the composition BA (also called the "product") is associated with an infinite matrix C == (c h , k) such that 00 Ch,k == I bh,jaj,k, Vh, k. j=1 on checking that Ch, k exists thanks to the hypotheses (2.6) on A and B and that C itself satisfies these same hypotheses. D Example 13. In the same way if we consider the situation of Example 10 for integral operators, we obtain analogous results. Let Q, Q', Q" be open sets in IRn, IRn', IRn" respectively; x == LP(Q), Y == LP(Q') and Z == LP(Q"); A E 2(X, Y) and B E 2( Y, Z) defined by the respective kernels k on Q' x Q and k' on Q" x Q' satisfying the hypotheses (2.7). Then the composition BA E 2(X, Z) is defined by a kernel kIf on Q" x Q, where: k"(v, u) == i k'(v, t)k(t, u)dt Q' and which satisfies (2.7). D The study of the passage to the inverse of an operator A E 2 (X, Y) is limited to the case in which A -1 E 2( Y, X). That when A -1 is an unbounded (but closed) operator and that of an injective, closed, unbounded operator A admitting a bounded inverse will be studied in Sect. 3. 2.3.2. Continuity of the Composition Proposition 1. Let X, Y and Z be three Banach spaces; the mapplng (A, B)  BA of 2(X, Y) x 2( Y, Z) into 2(X, Z) is continuous. Proof The result is immediate by forming classically B' A' - BA == B'(A' - A) + (B' - B)A, thanks to (2.12). D We now examine continuity in the sense of strong convergence: Proposition 2. We consider two sequences {An} C 2(X, Y) [resp. {Bn} C 2(Y, Z)] converging strongly to AE2(X, Y) [resp. BE2(Y, Z)], then: the sequence {BnAn} C 2(X, Z) converges strongly to BA in 2(X, Z). 
318 Chapter VI. Operators in Banach and Hilbert Spaces Proof Let us form BnAn x - BAx = Bn{An - A)x + {Bn - B)Ax, We can then, thanks to (2.12) derive the inequality: II BnAn x - BAx II  II Bn" "{An - A)x" + II{Bn - B)Ax II; Vx E X. the result comes from the definition (2.9) and from the fact that the sequence {Bn} is uniformly bounded in !e (Y, Z) (see Theorem 1 and Corollary 1). D These results on continuity are false in general for weak convergence, as is shown in the following example: Example 14. In X = L2{), let us define the sequences {An} and {Bn} by: (Anx)(t) = eintx{t), 2 (B n x) (t) = e - int X ( t) V X E L ( ); manifestly: AnBn = BnAn = I, Vn. Now: w w An  0, Bn  0; indeed, if we identify X with its anti-dual, the weak convergence, which can be written: t e:!:intx(t)Y(t)dt  0 is a consequence of the Riemann-Lebesgue lemma. Vy E X, D 2.3.3. The Algebra 2{X). In the case in which X = Y, the preceding results show that the composition operator provides the space !e{X) = !e{X, X) of bounded operators of X into itself with an algebraic structure; !e{X) is thus a Banach algebra 22 . A continuous linear operator taking X into itself is also called an endomorphism. For endomorphisms we use the terms "product" or "com- position" indiscriminately. An important property is then to be able to introduce power series of operators which allows us study inversion in !e{X) (see, likewise, the definition of etA for A E !e{X) in Chap. XVIIA, 1). For this, we must introduce the notion of the spectral radius: 22 A normed algebra A is an algebra provided with a norm x --1- II x II such that " xy II  "x II. II y II, V x, YEA A Banach albegra is a complete normed algebra (i.e. one whose underlying normed vector space is a Banach space). 
S2. Linear Operators in Banach Spaces 319 Proposition 3. Let A E 2(X), then the following limit exists: 23 def (2.13) r(A) == lim II An Ill/n == inf II An Ill/n n -. ex) and the quantity r(A) is called: the spectral radius. 'rhe proof is similar to the tinite-dimensional case; in particular we have r(A)  II All. Example 15. Displacement operators in lP (see Example 1). The iterated powers of the operators Ad and Ag are defined by (see Example 11): x == {x n }  Ax == {O,... O,x l ,.. .}, i in k-th place X {x n }  A;x == {X k + l ,... 'X n + k + l '.. .}, sa tisfying II A II == II A; II == 1, \lkEN; conseq uently: r(Ad) == r(Ag) 1. (We remark that in the finite-dimensional case, these operators are nilpotent 24 with the result that their spectral radius is zero.) Example 16. Integral operators. In the context of Example 2, let us consider the operator A on reo ([0, 1 J) defined by Ax(t) = I x(u)du, Vx E O([O, t]). A classical calculation (iterated primitive) shows that n I t (t - u)n-l (A x)(t) = _ 1)' x(u)du, ° (n . \Ix E reo([O, 1J); in this case, r(A) == O. Such operators are said to be quasi-nilpotent. D 2.3.4. Invertibility of Operators in 2(X). We study at present the invertibility of operators in the class 2(X). The result which follows is particularly interes- ting when we have to solve an equation of the form (I - A)u == f; 23 An denotes the operator AA. . . A applied n times. 24 A nilpotent operator is one which raised to a certain power is the null operator: for example the matrix ( :) is nilpotent. 
320 Chapter VI. Operators in Banach and Hilbert Spaces (it shows, in fact, that u is the limit of the sequence un defined by un + 1 == Au n + f with U O == 0). Lemma 1. (The "Neumann series"). Let A E 2(X) be such that r(A) < 1; then the operator (I - A) is invertible in 2(X), and the operator (I - A)- \ (where I is the identity in 2(X)) satisfies: 25 (2.14) (I - A)-l 00 LAn. n=O Proof The hypothesis r(A) < 1 and the definition of r(A), ensure that the series (2.14) converges in norm in 2(X). The result then follows from the passage to the limit in the classic identity in 2(X): N I - A N + 1 == (I - A) L An n=O when N  +00. Application. Let A E 2(X); then for all A E C such that IAI > r(A), the oper- ator (AI - A) == (A - A) is invertible in 2(X). It is enough, in effect, to apply Lemma 1 to the operator A(I - AI A); in this case: 00 An ( A - A ) - 1 == " -  1n + 1 n=O /t, IAI > r(A). For IAI  r(A), we can ask if the operator (A - A) is invertible (or, what comes to the same thing, if the equation AU - Au == f admits a solution). This leads us, here and now, to give definitions which will be recalled in Chap. VIII. Definition 1. The set of numbers A E C for which the operator (AI - A) is invertible in 2 (X) is called the resolvent set of A and is denoted by p(A); its complement in C is the spectrum of A and is denoted by a(A). For A E p(A), the operator (AI - A) -1, denoted by R(A, A) is called the resolvent operator (or, simply, the resolvent) of A. We have thus proved the following result: Proposition 4. The resolvent set of a bounded operator A of 2(X) is not empty: it contains the set {A E C; IAI > r(A)} (in an equivalent manner a(A) == {A E C; IAI :( r(A)}. For IAI > r(A): (2.15) 00 An R(A, A) = no An+ 1 . Lemma 2. The set J(X) of the linear operators of X into X, bounded, invertible in 2(X), is an open set of 2(X) (provided with the topology of the norm). 25 With AO = I. 
2. Linear Operators in Banach Spaces 321 Proof If A E J (X), B E 2(X), we can apply Lemma 1 to: (A - B)-1 == (I - A- 1 B)-IA-l if II A-I B II < 1, in particular, if: II B II < 1/ II A-III. Consequently, for A E J(X), the ball with centre A and of radius 1/ IIA -111 is contained in J(X). o Theorem 2. The passage to the inverse in 2 (X): A  A-I is continuous: this is a homeomorphism 26 on J(X), provided with the topology induced on J(X) by the topology of the norm on 2(X). Proof Let A E J(X) be fixed. Let A' == A - B where B is such that II B II < 1/ II A-I II; A' is invertible from Lemma 2. Let us calculate the inverse, with the help of Lemma 1: A,-I = Lo (A-IBt]A- I . Let us then form ( 2.16) A,-I - A-I = [n1 (A-IBt]A- I . There follows the inequality (2.17) II A' - 1 - A-I II II A-I 11 2 II B II  1 - II A-I IIII B II ' and hence the result, if II B II is taken sufficiently small. o 2.3.5. Passage to the Inverse in !£ (X, Y) Theorem 3. The passage to the inverse A  A -1 is a continuous (non-linear) mapping of 2(X, Y) into 2(Y,X)for the norm. Proof The proof is similar to that of the previous theorem: if A and A' == A - B belong to 2(X, Y), we can write: A' == [If - BA-IJA, since A is invertible in 2(X, Y); [I y - BA -1 ] is invertible in 2( Y) as long as II B II < 1/ II A -1 II. The majoration of II A-I - A' - 1 II is then analogous to (2.17). o 26 Homeomorphism: this is a bijectionf: Y ---1- Z (where Yand Z are two topological spaces) such that the image by f of an open set in Y is an open set in Z, and conversely. In other words f is bicontinuous. 
322 Chapter VI. Operators in Banach and Hilbert Spaces Example 17. In the context of Example 16, let us again take up the integral operator A on O([O, 1J) by: (Ax)(t) = f x(u)du, VXE'li'°([O,l]). We have seen that reA) = 0; it follows that the resolvent set of A is C* C\ {O}. Following (2.15) we can then calculate: (R(A, A)x)(t) = xi t ) + ;2 I exp C  u )X(U)dU, \:I A # 0, \:I x E o ([0, 1 J ). o Remark 6. The results on continuity obtained in Theorems 2 and 3 are, in general false for strong convergence. Thus, for example, in Hilbert spaces (see  3), the identity (hence invertible) is the strong limit of degenerate operators 27 . However, as an exercise, the reader using the same method as that for Theorems 2 and 3 will be able to prove: Proposition 5. Let {An} C 2(X, Y) be a sequence which converges strongly to A and satisfies: An is invertible with A; 1 uniformly bounded \:In E N; and A is invertible. Then the sequence {A; 1 } converges strongly to A -1 in 2( Y, X). o 2.4. Transpose of a Bounded Operator 2.4.1. Definition. Let X and Y be two Banach spaces with duals X' and Y' respectively. We have seen in  1 (see Definition 10) that we can associate with every linear operator A: X  Y, defined on all of X, a transpose, denoted by tA and defined by (2.18) <tAy,x)x'xx = <y,Ax)y,x y, \:Ix E X, Y E Y'. Theorem 4. Let A E 2(X, Y) be a bounded operator; its transpose tA has the following properties: (i) tA E 2( Y', X'), (ii) II tA II = II A II : the mapping A  tA is thus an isometry of 2(X, Y) into 2( Y', X'). Proof Let us prove that the transpose tA of A E 2(X, Y) is bounded. Indeed IltAy'lI* = sup l<tAy',x)1  IIAlllly'lI* /lx/l < 1 (as a result of using (2.18)). 27 A degenerate operator is one whose image is finite-dimensional (see Sect. 2.5.2.4). 
2. Linear Operators in Banach Spaces 323 Thus it follows that tA E 2(Y', X') with (2.19) II tA II  II A II. We now show the equality of the norms. For this, let us introduce the bi- def transpose of A, t = tCA) E 2(X", Y"); if we identify X with a sub-space of X" by the injection: x -+ x" defined by (see Proposition 3,  1): <x",f) = <I, x), Vf EX', it appears that t is an extension of A. In fact: let us write successively the following equalities: V x E X, Vy' E Y' < ttAx", y' ) <x", tAy') (definition of ttA by (2.18)) <tAy', x) = <y', Ax) (by (2.18)) < (Ax)", y' ); consequently Ax is identified with (ttA)x" in Y" and A, considered as an operator from X" into Y", of domain X therefore admits ttA as an extension, which implies that (2.20) II A II  II ttA :: . We deduce then, from (2.19) and (2.20), the set of inequalities: II tA II  II A II  II ttA II  II tA II and finally: II A II II tA II. o Remark 7. If X and Yare reflective, so identifiable with their biduals X" and Y": ttA = A , VA E 2(X, Y) and the mapping A -+ tA is involutive (a mapping B is said to be involutive if B composed with B is equal to the identity). o 2.4.2. Examples Example 18. "Infinite matrices". Let us take up again the situation of Ex- ample 9 with X = Y = IP for 1 < p < + 00 and an operator A defined by the infinite matrix (a h , k)N* x N* satisfying (2.6); we shall prove easily that tA is defined by the infinite matrix (ak,h)N* x N*, the transposed of A 28 . 28 Transposed matrix: let A = (ak,h) be a matrix, its transposed is the matrix tA = Cak,h) with t ak,h = ah,k. 
324 Chapter VI. Operators in Banach and Hilbert Spaces That generalises the finite-dimensional case, in particular: - if 1 < p < + CIJ, IP is reflexive and ttA == A; -for the displacement operators (see Example 1) tAd == Ag, t A == A 29 g d. o Example 19. Integral operators. We verify in the same way for an integral operator A == LP(Q)  LP(Q') defined as in Example 10 by a kernel k:(t, u) E Q' x Q  C that the operator tA is defined by the kernel tk:(u, t) E Q x Q'  C, adjoint of k and defined by tk(u, t) == k(t, u), Vu E Q, t E Q'. o 2.4.3. Continuity for Other Topologies. From Theorem 4, the mapping A  tA is an isometry and hence a continuous mapping of 2(X, Y) into 2( Y', X') for the norm of these spaces; the continuity result is false, in general, for strong convergence. However, it is true for weak convergence if X is reflexive. We then have Proposition 6. If X is reflexive, let {An} C 2(X, Y) be a sequence weakly convergent to A, then the sequence {tAn} converges weakly to tA in 2 ( Y', X'). Proof Passing to the limit on the two members of the equality defining tA we have: <tAny', x) == < y', Anx), Vx E X, y' E Y'; the second member converges to < y', Ax) == <tAy', x), so Vy' E Y', w* tAny'  tAy' (weak*-convergence, see 1) from which the result follows if X == X". o 2.4.4. Algebraic Properties. We verify easily that the transpose of the sum is eq ual to the sum of the transposes: (2.21) teA + B) == tA + tB. We now study the properties of the transpose of BA for two operators Band A. 29 The operators Ad and Ag in the member on the right being taken in [q. 
92. Linear Operators in Banach Spaces 325 Proposition 7. Let X, Yand Z be three Banach spaces and let A E 2(X, Y) and B E 2 ( Y, Z) then (2.22) t(BA) = tA tB in 2(Z', X'). Proposition 8. If A E 2 (X, Y) is invertible and if A - 1 is in 2 ( Y, X), then tA is likewise invertible and CA)-l is in 2(X', Y') and satisfies: (2.23) CA) -1 = teA - 1). Proof It is sufficient to apply Proposition 7 to: AA- 1 = I y and A- 1 A = Ix. D 2.5. Some Classes of Bounded Operators In this section, we shall examine some classes of sufficiently general bounded operators which we shall take up again in 3 in the particular case of Hilbert spaces. 2.5.1. Projections. An operator P belonging to 2(X) and idempotent (i.e. such that p2 = P), is called a projection. We verify immediately that if P is a projection, so also is I - P. We deduce (with the notation of 2.1.1) that def N = N(P) = R(I - P) (2.24) def M = R(P) = N(I - P), and in particular that M and N are closed in X. The identity x = Px + (I - P)x, valid for all x E X, shows that MffiN=X that is to say that every element can be decomposed into the sum of an element of M and an element of N and in a unique way. In this case we say that M and N are supplementary or that M is a supplement of N. Conversely, if M and N are supplementary closed sub-spaces of X, then for every x E X, there exist a unique X M E M and a unique X N E N such that x = XM + X N . We can thus define a projection operator P on M (along N) by def Px = X M , \:Ix E X, but this operator is not necessarily continuous. 
326 Chapter VI. Operators in Banach and Hilbert Spaces If M is a closed sub-space of X, we say that M' admits a topological supplement if there exists a projection operator P continuous on X onto M, and we say that N = N(P) is a topological supplement of M. Example 20. (1) Every sub-space M of afinite-dimensional Banach space admits a topological supplement. (2) Every sub-space M of finite co-dimension admits a topological supplement. (3) In a Hilbert space, every closed subspace M admits a topological supple- ment. This follows from Theorem 9, 1. o Remark 8. We should point out that nevertheless, even in reflexive spaces,  we can construct closed sub-spaces which possess no topological sup- plement. A remarkable result is that every Banach space not isomorphic to a Hilbert space possesses closed sub-spaces with no topological supplements (Lindenstrauss- Tzafriri [1]). o Example 21. Let X be a Banach space, x E X and f E X' given. Then the operator P defined on X by Pu = <I, u)x, is a projection if and only if <I, x) = 1. In these conditions: \:Iu E X R(P) = Cx N(P) = (Cf)-L n X. o Example 22. Let X = o([ -a, +a]), a > 0, M[resp. N] the sub-space of even [resp. odd] functions, then the projection P on M along N [resp. the projection (I - P) on N along M] is defined by 1 Px(t) = 2 [x(t) + x( - t)], \:Ix E X, t E [ - a, + a] 1 [resp. (I - P)x(t) = -[x(t) - x( - t)]]. 2 o Let us return to the general case: - If P E 2(X) is a projection, its transpose tp E 2(X') is a projection. -Let P l ,P 2 E 2(X) be two projections; we say that P l  P 2 , if we have P l P 2 = P 2 P l = Pl. This defines an order relation on the set of 
92. Linear Operators in Banach Spaces 327 projections of X. We verify that if P and Q commute 30 , we have the relations sup{P,Q} = P + Q - PQ inf {P, Q} = PQ. Other properties will be pointed out in 3 in the context of Hilbert space. 2.5.2. Compact Operators 2.5.2.1. Definitions. A bounded linear operator A E 2(X, Y) is said to be compact (or, again, completely continuous) ifit transforms every bounded set of X to a relatively compact part 31 of Y. Then, from every bounded sequence {xn} in X, we can extract a sub-sequence { x nk } such that {Ax nk } converges in Y. The compact operators form an extremely important class of bounded oper- ators. In particular, we shall see in Chap. VIII that a compact operator possess a purely point spectrum. We shall deduce in Chap. XVII asymptotic properties for the semi-groups generated by compact operators, or by operators with compact inverses. In the context of partial differential equations, it happens in many applications that the operator, which associates with the data the solution of the problem, is not only continuous, but compact. For example, if Q is a bounded open set in [Rn, the operator which associates with f E L 2 (Q) the solution U E H5(Q) of the Dirichlet problem -L1u = f w s is compact in L 2 (Q). This means that even only iffn  fin L 2 (Q), Un  u in L 2 (Q). We shall see numerous other examples. Example 23. If one of the spaces X or Y is of finite dimension, then every operator A E 2(X, Y) is compact. In particular, I x the identity on X is compact iff X is finite-dimensional. o Example 24. Let Q and Q' be two compact sets of [Rn and [Rn' respectively, k an integral kernel continuous on Q' x Q satisfying the conditions (2.7) (see Example 10). Then: A is a bounded compact linear operator mapping X def L l(Q) into Y def O(Q'). In effect, from IAx(t) - Ax(t')1 :( t Ik(t, u) - k(t', u)llx(u)1 du, V t t' E Q' , , 30 Operators P and Q are said to commute if PQ = QP. 31 A relatively compact part is a set whose closure is compact. 
328 Chapter VI. Operators in Banach and Hilbert Spaces it follows that IAx(t) - Ax(t')1  IIxll max Ik(t, u) - k(t', u)l. uefJ Hence, since k is uniformly continuous on Q' x Q, the second member can be made as small as we please with It - t'l, uniformly on all II x II  M It follows that: - the family {Ax; x such that II x II  M} is equicontinuous, -in addition, under the same conditions, it is equibounded. Consequently, from Ascoli's theorem (see Schwartz [2]) {Ax; II xii  M} IS relatively compact in O(Q'). D 2.5.2.2. The Space .ff(X, Y). We denote by .ff(X, Y), the set of compact operators of 2(X, Y). Theorem 5. The set .ff(X, Y) is a closed vector sub-space of 2(X, Y) (in the sense of uniform convergence). Proof Let us denote by S the unit ball of X; it is enough to prove that: (i) if A' and A" belong to .ff(X, Y), from each sequence {x n } of S, we can extract a sub-sequence {x nk } such that {(A' + A")x nk } converges in Y. For that, it suffices to extract from {x n } a sub-sequence {x n ' } such that {A' X n ' } converges in Y; then from this we extract a new sub-sequence {x n "} such that {A" x n "} converges in Y. The sub-sequence X n " = x nk answers the question. (ii) if {An} C .ff(X, Y) is a sequence of compact operators converging to A in 2(X, Y), then A(S) is totally bounded 32 , hence relatively compact: Let B > 0 be given and let us choose n(B) such that II A - An II  B12. Since An is compact, An(S) can be covered by a finite number of balls B(Yk, !B), k = 1, . . . , p(B). For all XES, there exists k such that B IIAn x - Yk II < 2; from which II Ax - Yk II  IIAx - Anxil + IIAn x - Yk II < B. Consequently, we have proved that for all B > 0 there exists pEN and {Yk}k=l,...,P such that P A(S) c U B(Yk, B), k=l from which the result follows. D 32 A set M in a complete metric space is said to be totally bounded if VB > 0, M can be covered by a finite family of balls of radius B, with centres belonging to M (see Y osida [1]). 
92. Linear Operators in Banach Spaces 329 We point out an important algebraic property whose proof is easy: Proposition 9. Every product on the left or on the right of a bounded operator by a compact operator is compact. Corollary 3. .ff(X) def .ff(X, X) is a two-sided ideal of .P(X). Corollary 4 (See Example 23). A compact operator on X can be invertible in 2(X) only if dim X < + 00. Example 25. In many examples (see Chap. VII) we encounter the following situation in which A is a closed linear operator 33 from X into X, with domain D(A), such that (i) A is bijective, (ii) D(A) c; X is compact. Then, the operator A - 1 is not only continuous (from the closed graph theorem, see Theorem 3,  1) but compact (indeed A - 1 transforms the bounded sets of X to bounded sets of D(A) which are relatively compact in X from (ii). A concrete example of this situation is furnished by the case in which X = L 2(Q), Q being a bounded open set in [Rn, A = - L1 and D(A) = {v E H(Q); L1v E L 2 (Q)}. o 2.5.2.3. Transpose of a Compact Operator Theorem 6. (Schauder). Let A be a linear operator, continuous from X into Y, compact (A E .ff(X, Y)), then its transpose tA is compact CA E .ff(Y', X')). Proof Let S (resp. T') be the unit ball of X (resp. Y'); {y} a sequence in T'. Let us consider the sequence of mappings y E Y -+> < y, y >; it is equibounded, equicontinuous: I < y, y> - < y, z> I  II y - Z II , Vy,ZE Y. Therefore, from Ascoli's theorem, this sequence is relatively compact on every compact set in Y (in particular on A(S) by hypothesis). There exists then a subsequence {yk} uniformly convergent on A(S): or, as < y, Ax> = < tA y, x>, Yx E S, the sequence {tAy} converges uniformly on the unit ball of X, hence in X'. We have therefore proved that tA is compact. Conversely if tA is compact, then so is 33 See later in Sect. 3 of this 92. 
330 Chapter VI. Operators in Banach and Hilbert Spaces ttA, and hence if S" is the unit ball of X", ttA(S") is totally bounded. The same is true for A(S) identified with a part of ttA(S"). It follows that A(S) is compact and that A E %(X, Y). o 2.5.2.4. Operators of Finite Rank or Degenerate. An operator A E 2(X, Y) is said to be degenerate if its image R(A) is of finite dimension. The rank of A, denoted by rank A is defined by rank A = dim R(A); this being finite such an operator is compact. From Theorem 5, if A E 2(X, Y) is the limit (in the sense of uniform conver- gence) of operators An E %(X, Y) of finite rank, then A E %(X, Y). We note that, in general, (Banach's approximation problem) the converse is false: a compact operator cannot, in general, be approximated in the topology of 2(X, Y) by operators of finite rank, if Y is arbitrary (Enflo [lJ). [This converse is however true when Y is, for example, a Hilbert space, see Brezis [lJJ. The degenerate operators form a sub-space of %(X, Y), in general not closed; however the algebraic properties analogous to those of Proposition 9 are still valid, if we replace "compact" by "degenerate". In addition, every degenerate operator A admits a canonical representation in terms of a base {Yl, . . . , Ym} of R(A). Indeed, for every x E X, there exists a sequence of scalars {Xl'. . . , x m } de- pending on x and on A such that m Ax = I XiYi. i = 1 As this sequence depends linearly on x, we can associate with Yi a linear form x, (i = 1,. . . , m) \:Ix E X, These linear forms are bounded as: Xi = <x,x). Ixil  CIIAxll (equivalence of norms on R(A)) We shall write symbolically (2.25) m A = I < x , · ) Y i . i = 1 If we introduce the operator tA, transpose of A, then, by definition, m m <tAy', x) = <y', Ax) = L <y', Yi) <x, x) = L < <y', Yi)X, x), i=l i=l \:I x EX, y' E Y'. 
2. Linear Operators in Banach Spaces 331 This shows that RCA) is generated by the x, i == 1,. . . , m and hence that (i) tA is degenerate. (ii) rank tA == rank A. (iii) In addition, the decomposition of tA can be written (2.26) m tA == L <.'Yi)X i = 1 Conversely if tA is degenerate, the same is true of A considered as the restriction of ttA itself degenerate. We have thus proved Proposition 10. An operator A E 2(X, Y) is degenerate iff its transpose is, and in this case rank A == rank tA. 2.5.2.5. Nuclear Operators. Definition 2. Let X and Ybe two Banach spaces and let T E 2(X, Y). If there exist sequences {x} eX', {Yn} c Y, {cn} c C such that sup II x II * < + 00 sup II Yn II < + 00 00 L Icnl < + 00 n=O and that we may write (2.27) \:Ix E X, m Tx == lim L C n <x, x) Yn' m-oon=l then T is called a nuclear operator. Proposition 11. A nuclear operator is compact. Proof From Theorem 5, it is enough to show that T is the uniform limit of the operators of finite rank {Tm} defined by m Tm x == L c n < x, x) Y n . n=l Let us form: II Tx - Tmxll  00 L j=m+ 1 00 cj<xj,X)Yj  C L j=m+ 1 Icjlll x II. As a result of the hypothesis on the sequence {C n }: II T - Tm II  o. moo o 
332 Chapter VI. Operators in Banach and Hilbert Spaces 2.6. Some Ideas on Functions of a Real or Complex Variable with Operator Values; Families of Operators I Here we make precise the results for vector-valued functions pointed out in 1 in the case where the Banach space is a space of operators. Semi-groups, studied in Chap. XVII, form a very important particular case. 2.6.1. Continuity. Let A: IR -+ 2(X, Y); t -+ A(t) be a mapping defined on a neighbourhood of to E IR or C with values in 2(X, Y). If we express the idea of the limit t -+ to following the convergences defined in 2.2.2, we say that the function t -+ A(t) is uniformly continuous at to if lim "A (to + h) - A(t o ) II = O. h.....O In the same way, we say that it is strongly continuous at to if \Ix E X, lim II A(t o + h)x - A(to)x II = 0, h.....O and weakly continuous if \Ix E X, y' E Y' , lim < y', A(t o + h)x - A(to)x> = o. h.....O Some useful properties. If the mapping A is uniformly continuous at to, then the same is true of the mapping t -+ II A(t) II. This property does not hold, in general, for the other types of convergence; however: if the mapping A is weakly continuous (and afortiori if the mapping A is strongly continuous), then t -+ II A(t) II is locally bounded 34 , and lower semi- continuous (see Theorem 1 and its Corollaries 1 and 2). 2.6.2. Differentiability For a function t -+ A(t) from IR into (X, Y), (and without here going into the details) we can define its uniform derivative by A'(t) = u - lim[A(t + h) - A(t)J its strong derivative by \Ix E X, A'(t)x = lim _ h I [A(t + h)x - A(t)xJ, h.....O and its weak derivative in a similar way. 34 A measurable function which is bounded on the bounded sets is said to be locally bounded. 
9 2 . Linear Operators in Banach Spaces 333 2.6.3. Integration We shall limit ourselves here to continuous families t  A(t) of [a, b] (a and b finite) into 2(X, Y) (provided with the uniform, resp. strong, topology). We introduce: B(t) = f A('r) dT. If A is strongly continuous, the above integral is defined by B(t)x = f A(T)xdT, Vx E X; In every case II B(t) II  f II A(T) II dT and d dt B(t)x = A(t)x, Vx E X. 2.6.4. Holomorphy We can also define in different senses: complex differentiation, and integration along a curve in C of a mapping A:C  2(X, Y); z 1-+ A(z). But for the concept of holomorphy on a domain of C, this is not useful (as in 1.7): the notions of strong, weak and uniform holomorphy coincide. Therefore let z 1-+ A(z) be a mapping, A defined on a domain D c C with values in 2(X, Y). Suppose that the mapping A is weakly holomorphic on D, i.e.: Vx E X, Vy' E Y', the mapping z  <y',A(z)x) is holomorphic on DeC. Then by a proof similar to that of Theorem 11 and  1, thanks to the fact that II A(z) II is locally bounded, we can show that the mapping A is u-holomorphic, that is to say is differentiable in the sense of the norm at every point of D. Hence, we have Theorem 7. Every mapping A: z  A(z) weakly holomorphic in a domain D c C with values in 2(X, Y) is holomorphic in the sense of the norm. Properties. The mapping A admits a development in Taylor series about every point Zo E D: 00 (2.28) A(z) = A(zo) + L An(z - zo)n n=l uniformly convergent on every disk centred at Zo and contained in D. Con- versely, if a series expansion of this form converges weakly on an open disk Do, it converges uniformly on every disk D c Do (with strict inclusion). The radius of convergence of the series (2.28) is given by Iz - zol < [lim sup II An Ill/n]-l 
334 Chapter VI. Operators in Banach and Hilbert Spaces from the Cauchy-Hadamard theorem (see Saks-Zygmund [1J, p. 93). Example 26. Let us consider the case in which An clef An, with A E 2(X). The radius of convergence of the series (2.28) is given by (r(A)) - 1 and with the notation of Definition 1, we deduce that R(A, A) = (AI - A) -1 is holomorphic on p(A). D 3. Closed Operators 3.1. Definition and Examples Definition 3. A linear operator A: X  Y with domain D (A) is said to be closed if for every sequence {x n } c D(A) satisfying { Xn  x in X (strong) (2.29) AX n  y in Y (strong) when n  00, then x E D(A) and y = Ax. Proposition 12. Let X and Y be two Banach spaces and A: X  Y have domain D (A); then the following assertions are equivalent: (i) A is closed; (ii) the graph G(A) is closed in X x Y; (iii) the sub-space D(A) is complete for the "norm of the graph" II IID(A) defined by (2.30) clef II x II D(A) = II x II X + II Ax II y . We remark that a sequence {x n } satisfying (2.29) is convergent in D(A) for the norm of the graph. We sometimes write x = A - lim X n . n..... 00 The closed operators are thus the operators with closed graphs. Bounded operators are closed; but the converse is, in general, false. We frequently study closed operators whose inverses are bounded; this is the case in partial dif- ferential equations: the operator ( - L1) defined on H 2 (Q) n H5 (Q)35 is a closed (unbounded) operator of L 2 (Q) into L 2 (Q) whose inverse is bounded. It is the operator which is involved in the study of the Dirichlet problem. We shall encounter numerous other examples. Example 27. On X = rcO([O, 1J) the operator AM defined on D(A M ) = rc 1 ([0, 1)J by x  dxjdt is closed. 35 For Q a regular bounded open set in [Rn. 
92. Linear Operators in Banach Spaces 335 In fact: let {Xn } be an AM-convergent sequence, i.e. {Xn} C rc 1([0, IJ) such that X n  x In rcO([O, 1]) dXn ° dt  Y In rc ([0, IJ). From classical theorems on uniform convergence it follows that x is differ- entiable on [0, IJ and that dx/dt == y on [0, IJ. In addition the norm of the graph is equivalent to the norm on rc 1([0, IJ) defined by: dx II X 111 == II x IIccO([o, 1J) + dt CC O ([O,1J). D Remark 9. Closable operators. Let the operator A: X  Y be linear, with domain D(A). We say that is closable if it ad mits a closed extension. We verify immediately that A is closable iff the closure G(A) of the graph of A is a graph. In other words (see  2.1), A is closable iff (2.31 ) X n E D(A), X n  0 and AX n  y implies y == O. The (closed) operator A whose graph is G(A) is called the closure of A. For example, the differentiation operator Am' defined in Example 6 (1.3.4) with domain D (Am) == !!2 (J 0, 1 [) is closable. Indeed, if X n E !!2 (J 0, 1 [) is such that X n  0 and dXn/dt  y in rcO([O, IJ), it follows, from classical results on uniform convergence, that Xn(t) = J: n (U)du -+ 0 = f>(U)dU, from which we deduce that y == O. Vt E [0, IJ, D Remark 10. A closed operator A:X  Yand with domain D(A), can be considered as a bounded operator from its domain D(A) into Y, with D(A) provided with the norm of the graph. This results immediately from the definition (2.30) of the norm of the graph. Example 27 gives us an illustration of this remark. D Finally, the closed graph theorem stated in  1 (Theorem 3) shows that a closed operator A: X  Y, defined throughout X is bounded. In conclusion, we shall limit ourselves in the sections below to the study of unbounded closed operators (in so far as they are such or are extensions of closable operators). Indeed we can now make precise the result: an unbounded closed operator does not admit a bounded extension; in particular its domain is 
336 Chapter VI. Operators in Banach and Hilbert Spaces not closed (in many applications the operators are densely defined). The data of an unbounded operator are then either those of its graph or of the pair (A, D(A)). We find again the importance of the data of the domain (see, in particular, Example 6). Example 28. Case of a closed, but unbounded, operator. In Example 27, we have proved that the operator of differentiation AM is closed on its domain 1([0, 1J) O([O, 1J). We note that it is unbounded; indeed, let us consider the sequence {x n } C  1 ([0, 1 J) defined by xn(t) == t n for which II X n II == 1, but dXn = n, showing that the sequence {Amxn} is not bounded in O([O, 1]). dt D Example 29. Case' of an unbounded closable operator. Let us return to Example 9: the operator Am with domain D(Am) == (JO, 1 [) is closable, but is not closed. The closure of the graph G(Am) c  60([0, 1 J) x g([O, 1 J) (where we denote by g([O, 1J) def {X E 0([0, 1J), X(O) == x(1) == O}  60 ( [0, 1 J) def {X E  1 ([ 0, 1 J), x (0) == x ( 1) == x' (0) == x' ( 1) == O}.) defines an operator Ago, restriction of Aoo (see Example 6) with the domain D(Ag o ) == 60([0, 1J). D Example 30. We leave to the reader the task of proving, beginning with classical theorems on uniform convergence, that the operators Aoo and Ak (k arbitrary) of Example 6 are likewise closed. D Let X and Y be two Banach spaces. We denote by (X, Y) the set of closed linear operators mapping X into Y. We shall study the properties of this set (which is not a vector space) in Sect. 3.3. In Sect. 3.4, we shall study the transpose of a closed operator, of domain D(A) dense in X, the only category of closed operators for which we can define a transpose in a unique manner. Finally, in Sect. 3.5, we study more particularly the operators whose image is closed. 3.2. Basic Properties Proposition 13. Let A be a closed operator from X into Y with domain D(A). Then its kernel N (A) is closed. 
92. Linear Operators in Banach Spaces 337 Proof In effect, if { Xn} c N (A) is a sequence which converges to x EX: then AX n = 0, \:In. Since A is closed, we have from (2.29): x E D(A) and Ax = 0 so X E N(A). o ,..., def As a consequence, X = XjN(A) is a Banach space, for the norm defined by (2.32) II x II def inf II x II , ,..., \:Ix E X XEX ,..., (see Sect. 1.3.2). We can then define the quotient operator A on: (2.33) D(A) = {x E X; X E D(A)} with values on Y given by ,..., Ax = Ax (if x EX) ,..., We recall that A is, by construction, injective, and hence invertible. Propositio 14. Let A: D(A) c X  Y be such that N(A) is closed. Then A is closed iff A is closed., Proof Let us suppose that A is closed. Let {xn} c D(A) be a sequence such that ,..., x n  x in X, ,..., AX n  y in Y. As Ilxn - xii inf Ilxn - x -  II,  E N(A) there exists n E N(A) { Xn - n  X A(x n - n) = AX n  y. Since A is closed: { X E D(A) Ax = y, it follows that {  E D(A) Ax = y ,..., and hence that A is closed. ,..., Conversely, let us suppose that A is closed. Let {xn} E D(A) be a sequence such that X n  x in X, AX n  y in Y; 
338 Chapter VI. Operators in Banach and Hilbert Spaces then -- { Xn -+- X in X AX n = AX n -+- Y In Y with {  E D(A) Ax = Y (by hypothesis). Therefore { XED(A) Y = Ax so A is closed. D Remark 11. Let A: X -+- Y be a closed operator satisfying !lAx II  Mllxll, Vx E D(A), then D(A) is closed in X. The converse is true and an unbounded closed operator cannot have a closed domain. D Example 31. "Unbounded infinite matrices." Let X == Y = IP and suppose that we are given a double sequence {ah,k}N* x N* as in Example 9; with each x = {Xl'. . ., Xn," .} we associate a sequence {Yn} defined by 00 Yh = L ah,kxk k=l (whenever this expression exists). We then define an operator A on the domain by D(A) = {x E IP; {Yn} = Y E [P} Ax = y. The properties depend on the double sequence {ah,k}' For example, let us consider the case of a "diagonal matrix" in which a h , k = h 2 b h , k Vh, k E N* (where bh,k is the Kronecker symbol). Then the operator A associated with the sequence {a h , k} is densely defined 36 and closed. D Example 32. Differentiation in LP(JO, 1 [)37, 1  p < 00. Let X = LP(JO, 1 [) = Y; let us introduce on X the differentiation operators analogous to those in Example 6 on the space rcO([O, 1J). 36 Indeed Coo c D(A) where coo is the set of sequences which are null from a certain rank. 37 We also denote this space more simply by LP(O, 1). 
2. Linear Operators in Banach Spaces 339 First of all let AM be the (maximal) operator defined by dx AMx(t) = dt (t) (the differentiation being taken in the sense of .@' (]O, 1 [)) on dx D(A M ) = {x E LP(]O, 1 E); dt E LP(]O, 1 [)} = WI, P(]O, 1 E). As these functions are almost everywhere equal to absolutely continuous functions we can, by introducing boundary conditions on D(A M ), consider restrictions of AM of the type considered in Example 6. D(A k ) = {x E D(A M ); x(O) = kx(I)}, with the particular cases: k = 0, with x(O) = 0 k = 00, with x(l) = 0; D(Aoo) = {xED(AM);x(O) = x(l) = O} and D(Am) = '@(]O, 1 E). Then, these operators are densely defined; their domains contain D(Am) dense in x. The operator AM is closed; indeed, let {x n } C X be such that x n --+- X in LP(O, 1), hence in .@'(]O, 1 E); dX n dt = AMx n --+- Y if follows that y = ; in ,@'(]O, t[), and hence x E D(A M ). We should prove, by using the trace theorems seen in Chap. IV, that the operators Ak (for all k) and Aoo are closed. D 3.3. The Set ff (X, Y) of Closed Operators from X into Y Naturally the properties seen in the preceding sections for bounded operators do not extend at all. For example (X, Y) is not in general a vector space. Indeed, let A E (X, Y) be unbounded, with domain D(A). Let us put B = - A; then the operator (A + B) defined on D(A) satisfies A + B = o. It cannot be closed since, from Remark 11, D(A) is not closed. Hence if A and B are closed unbounded operators, the operator A + B is not, in general, closed. 
340 Chapter VI. Operators in Banach and Hilbert Spaces However, we can prove the following results: Proposition 15. If A E !E(X, Y) (i.e. is bounded) and if B is closed, then A + B is closed. Proposition 16. If A: X  Y is closed, if B: Y  Z is closed and if B- 1 : Z  Y is continuous, then BA: X  Z is closed. (The verification of these results, which is simple, is left to the reader.) Proposition 17. Let X and Y be two Banach spaces and let A be a linear operator from X to Y with domain D(A) c X and image R(A) c Y. We suppose that the mapping A is injective. Then, the operator A of domain D(A) is closed iffits inverse A-I (of domain R(A)) is. Proof The result comes immediately from the property of the graphs of A and A-I G(A- l ) == GS(A) (inverse of the graph of A), with the result that if one is closed so is the other. D Corollary 5. If A E !E(X, Y) (i.e. is bounded) and if the mapping A is injective, then the inverse A-I of A is a closed operator. Corollary 6. If A E 9' (X, Y) (i.e. is closed) and the mapping A is injective, then for its inverse A -1 to be bounded it is sufficient that the image R(A) satisfies R(A) == Y. (These two corollaries follow from the closed graph theorem: see 1, Theorem 3.) Example 33. Returning to Example 31 in the case of the "diagonal matrix" on lP defined by x == {Xl'. . . , Xn,. . . }  Ax == Y == {Xl' 2 2 x 2 ,. . ., n2xn, . . . }, we see that: A is invertible and its inverse A-I is bounded on lP; in fact, A is injective and A -1 is defined everywhere on lP by Y = {Yl,...,Yn,"'} -+A-1y = x = {Yl""' ; ""}' and is manifestly bounded. It follows from Proposition 17 that the operator A is closed. D Example 34. In the context of Example 32, we can prove that the results of Example 6 seen on rcO([O, IJ) remain valid on X == LP(JO, 1 C), 1  p < 00, that is to say that: (1) The operator AM == djdt, from D(A M ) == W l ,P(JO, 1[) into LP(JO, 1[) is not invertible. 
92. Linear Operators in Banach Spaces 341 (2) The operator Ak is invertible iff k -=I- 1 and the inverse of Ak defined everywhere on X by (k -=I- 1) (2.34) A;ly(t) = t  k [f>(U)dU + kf y(u)du J is bounded on LP(]O, 1 E). (Here again, it follows from Proposition 17 that Ak is closed.) (3) The operator Aoo is invertible on D(Aol) = {y E LP(]O, t[); f>(t)dt = O} and defined by: AOOI y(t) = t y(u) duo D We see in another example that, for operators from X into Y with the same "expression", but defined on different spaces, we can have invertibility or not. Example 35. Let us consider again the differentiation operator djdt on IR in the following spaces: We define X P as the space LP(IR) for 1  p < 00 and X 00 not as La) (IR) but as the space of uniformly continuous and bounded functions on IR. In all of these cases djdt is densely defined; we denote by Ap the operator on X P' 1  p  00, defined by: for P E [1, 00 [ dx D(A p ) = {x E LPIR); dt E LP(IR)} dx Apx = dt wi, P(IR)} and for p = + 00: D(Aoo) = {x E Xoo; : E Xoo} dx A 00 x = dt . (1) On Xoo,Aoo is not invertible: N(Aoo) contains the constant functions. (2) On Xl' Al is invertible, but All is not densely defined; indeed R(A I ) is not dense in X I: Let us consider the continuous linear form: <p:y E LI(IR) = Xl f--+ flRy(t)dt; this form is not null, but it vanishes on R(A I ). 
342 Chapter VI. Operators in Banach and Hilbert Spaces (3) On X p' 1 < p < + 00, Ap is invertible and A; 1 is densely defined: let us show that the orthogonal of R(A p ) in Lq(IR), 1jp + 1jq == 1 reduces to {O}. Let y E Y' == Lq(IR) such that L y(t)  (t)dt = 0, Vx E D(A p ) = Wl,P(IR). Then for all x E !0(IR), denoting by <, ), the duality !0'(IR), !0(!R) we have: \ y,  ) = \ -  , x ) = 0 and hence, dyjdt == 0, y == constant; now y E Y' so Y == o. o Remark 12. As far as the topological properties of the set ff(X, Y) are concerned, we can provide ff(X, Y) with a metric space topology by a Hausdorff distance on the graphs of the operators (which are closed in X x Y) (see Kato [lJ). o  Remark 13. Starting from the operator A associated with an operator A E ff(X, Y) (see Sect. 3.2), we can define a number y(A) by  y(A) clef inf II Ax II = inf II A_x II ; x E D(A) II X II x E D(A) II X II y(A) is called the reduced minimal modulus of A (with the convention that y(A) == 0 if A - 1 is unbounded, and y(A) == + 00 if A -1 is null). The interest in this number is that y(A) > 0 is equivalent to saying that A has a closed image. Indeed, ify(A) > 0, A -1 (which is closed by Propositions 14 and 17) satisfies the hypotheses of Remark 11, so D(A -1) == R(A) == R(A) is closed. Conversely, if R(A) is closed, then A -1, which is an operator defined on the whole of !3(A) and closed, is from the closed graph theorem (see 1) bounded on R(A) in X, so that y(A) > o. o 3.4. Transpose of a Closed Operat-or Let X' and Y' be respectively the duals of X and Y and let A be an operator from X into Y. If A is bounded (on X), the passage to the transpose has been studied in 2.2.4; if A is not defined everywhere, a fortiori if A is unbounded, the situation is much less simple. Only the case of operators densely defined (sufficiently frequent in applications) is relatively easy. Recall (2.1.3.3) that two operators A: X  Y and B: Y'  X' are said to be mutually transposed if (2.35) <y', Ax )y, x y == <By', x )x' x x, X E D(A) c X, y' E D(B) c Y'.. 
2. Linear Operators in Banach Spaces 343 As indicated in Sect. 1.3.3 if B 1 is a restriction of B:B 1 c B, then A and B 1 are likewise mutually transposed. The problem of finding, for a given A, a transposed operator, denoted by tA, which is maximal in the sense of the inclusion of operators and such that A and tA are mutually transposed, is treated by the: Theorem 8. Let A be a linear operator from X into Y, the domain D(A) being dense in X, then there exists a transposed operator, denoted by t A, closed and maximal, defined in a unique fashion on its domain (2.36) DCA) = {y' E Y'; 3x' E X' such that <y', Ax) b y : tAy' = x'; <x',x)Vx E D(A)} c Y' tA will be called, more briefi y, the transpose of A. Proof Let us construct the operator tA in the following way: DCA) = {y' E Y'; X  <y', Ax)y, x y is continuous on D(A) for the topology in.duced by X}. Following the hypothesis of the density of D(A) in X, this mapping has an extension which is continuous on the whole of X defining x' E X'. On the other hand, always as a result of the density hypothesis, the correspondence y'  x' is unique; in addition it is linear and hence defines an operator tA: DCA) X'. This is maximal in the sense of the inclusion of operators: indeed, let B satisfy (2.35); Vy' E D(B), the mapping x  <By',x) is continuous on X; hence D(B) c DCA). In addition, following the density of D(A) in X, By' = tAy'. We therefore certainly have B c. tA. Finally, let us prove that tA is closed: its graph is closed as GS( - tA) closed in X' x Y'. G(A)-1- is o Example 36. If we take up again Example 31 in the particular case of the diagonal matrix in IV (1  p < 00) A: x = {x l' . . . , x n , . . . }  {X 1, . . . , n 2 x n , . . . } 38, we verify immediately that tA: x = {X 1, . . . , x n , . . . }  {X 1, . . . , n 2 x n , . . . } in lq with l/p + 1/q = 1 39 . 38 With, we recall, D(A) = {x E [P, Ax E [P}. 39 With, likewise, DCA) = {x E [q, tAx E [q}. 
344 Chapter VI. Operators in Banach and Hilbert Spaces Example 37. Let us take up again the different operators of differentiation (see Example 32) AM' Ak Aoo in LP(]O, 1 C), 1 < p < + 00, and denote by B M , B k , Boo their homologues in Lq(]O, 1 [) with lip + l/q = 1. We shall show that tAM = -Boo (in the same way we shall have: tA k = -Bilk' k finite or not; tA oo = -B M ). AM and - Boo are mutually transposed, indeed: r y(t) : (t)dt = [y(t)x(t)JA - r x(t)  (t)dt = - r x(t)  (t)dt \Ix E D(A M ), \ly E D(Boo); hence tAM:::) -Boo. Conversely: for y* E DCA M ) let us define x* = tAMy*; by definition I x*(t)x(t)dt = I y*(t) : (t)dt, Vx E D(A M ). Let us put z(t) = J: x* (u) du, that is to say dz dt (t) = x*(t), t E ]0, 1[ z(O) = o. Integrating by parts, we find that I (y*(t) + z(t)) : (t)dt - z(l)x(l) = O. Now the set { : ' x E D(A M ), x(l) = o} is identical with X. Hence z(t) = -y*(t), "it E ]0,1[. In addition: z( 1) = O. Finally: y* is absolutely continuous, y* = -x* with y*(O) = y*(l) = 0; dt from which: y* E D(Boo) and x* = tAMy* = -BooY*, and the inclusion tAM c Boo. D Remark 14. (1) Let A and B be two mutually transposed operators, then, if the domain of one is dense, the other is closable. 
2. Linear Operators in Banach Spaces 345 Indeed, let us suppose that D (A) is dense, then, from the preceding theorem BetA is therefore closable (as the restriction of a closed operator). The verification of the implication "D(B) dense in Y' implies that A is closable" is left to the reader (see Kato [1], p. 168.) (2) If the Banach spaces X and Yare reflexive, A closable and of dense domain, then tA is closed, of dense domain, and ttA is the closure of A. In particular, if A is closed, tfA = A (see Kato [1], p. 168). For example, we know that the operator Am of Example 32 admits Aoo for its closure. In a manner similar to Example 37, we can show that tAm = -B M . Applying the preceding result, we find that Aoo = tB M . D We now study the existence of an inverse for the transpose of an operator A. Proposition 18. Let X and Y be two Banach spaces, and let A E (X, Y) have domain D(A) dense in X, and such that its inverse A -1 E !.t(Y, X) (i.e. exists and is bounded). Then the inverse of the transpose tA of A, CA) - 1 say, exists, is likewise bounded (CA)-1 E !.t(X', Y')), and satisfies CA)-1 = t(A -1). Proof From Theorem 4, if A -1 E !.t( Y, X) then t(A -1) exists in !.t(X', Y'). For y' E DCA) and y E Y: (t(A- 1 )tAy',y) = (tAy',A- 1 y) = (y',AA-1y) = (y',y); so (2.37) t(A -1 )tAy' = y', Vy' E D(A). For x' E X', X E D(A): (t( A - 1 )x', Ax) = (x', A - 1 Ax) = (x', x ), so t(A -1 )x' E DCA) and by definition: (2.38) tACA -1 )x' = x', Vx' EX'. The relations (2.37) and (2.38) show clearly that tA IS invertible and that CA ) - 1 = t( A - 1 ). D Example 38. Let I denote the interval] 0, 1 [ . We can verify directly the results of Proposition 18 on the differentiation operators A k , k =1= 1, defined on LP(I) (see Example 32). 
346 Chapter VI. Operators in Banach and Hilbert Spaces Thus, for k = 0 we have seen in Example 37 that tA o = - Boo on the space Lq(I) with l/p + l/q = 1. On the other hand Ao 1 : y f-+ f> (u) du on U (I), -B;;,t:Yf-+ fY(U)dU on Lq(I). From Fubini's theorem we have I (J: y(u)du )X(t)dt = I y(u) (r x(t)dt ) du \Ix E LP(I), Y E Lq(I), that is to say I (Ao1y)xdt = I y(-B;::,tx)du; from which it follows that Ao 1 = t( - B/) in agreement with Proposition 18. o Remark 15. Let A E ff(X, Y) be a closed operator with dense domain, then A and tA have the same reduced minimal modulus (see Remark 13), i.e. y(A) = yCA) (Kato [1], p. 234). In particular, if A has a closed image, then tA has a closed image. o 3.5. Operators with Closed Image This is the sole class of closed operators which we shall study in.the context of Banach spaces (for Hilbert spaces, see 93). We shall first of all give the properties of the inverse of an operator with closed Image. Proposition 19. Let X and Y be two Banach spaces and let A E ff (X, Y) have domain D(A) and image R(A). Then, the following properties are equivalent. (i) The operator A is injective, and the inverse of A, say A -1, is bounded on R( A) (provided with the topology of Y) in X. (ii) There exists a positive real number m, such that (2.39) IIAxl1  mllxll, \Ix E D(A). (iii) R(A), the image of A, is closed in Yand N(A), the kernel of A, is reduced to {O}. Proof We give a quick demonstration: (i) => (ii). By definition, if A -1 exists R(A) -+ D(A), is bounded: IIA- 1 yll  mllyll, \ly E R(A), 
2. Linear Operators in Banach Spaces 347 and the result (2.39) follows, since Vy E R(A), 3x unique, such that y = Ax, X E D(A). (ii) => (iii). In an obvious way: N(A) = {O}. In addition, if {x n } c D(A) is such that AX n  y in Y, for n  00 (2.39) implies that {x n } is a Cauchy sequence in X so X n  x for n  00. The result then comes from the fact that A is closed. (iii) => (i). This is a consequence of the closed graph theorem (Theorem 3 of 9 1 ) applied to A - 1. o Application. We have defined in 92.2.3 the notions of the spectrum a( A), the resolvent set p(A) and the resolvent operator R(A, A), for a bounded operator A E !.t(X). We can extend these notions to closed operators. Thus, the set p(A) is defined to be the set of complex numbers A E C such that the operator AI - A is invertible, in other words that it has the entire space X for its (closed) image and that it is injective. Then, from Proposition 19, the operator R(A, A) = (AI - A)-l is bounded: A E p(A)  R(A, A) E !.t(X). We shall return in Chap. VIII to a study of these concepts which form part of spectral theory. However, we point out the following: Proposition 20. Let X be a Banach space and A a closed linear operator in X, with domain D (A), whose resolvent p( A) is not empty. Then,for every polynomial Pn of degree n on C the (linear) operator Pn(A) in X with domain D(Pn(A)) = D(A'1) is closed. Proof. The result is trivial for n = 1. We prove the result by induction, and suppose it to be true for n - 1: Pn(A) = (A - AoI)Pn-1(A) + CI, AO E p(A). Since (A - AoI) is closed, since (A - AoI)-l E !.t(X) and since P n - 1 (A) is closed, it follows from Proposition 16 that P n (A) is closed. On the other hand x E D(Pn(A)) iff x E D(Pn-1 (A)) = D(A n - 1 ) (induction hypothesis) and P n - 1 (A)x E D(A); 
348 Chapter VI. Operators in Banach and Hilbert Spaces we show then, by induction, that An-Ix E D(A), that is to say that D(Pn(A)) = D(A n ). D We now study the properties of the transpose of an operator with closed image. Proposition 21 (The closed image theorem). Let X and Y be two Banach spaces and let A E ff(X, Y) have domain D(A), dense in X, and image R(A). Let tA be its transpose. Then there is an equivalence between the two assertions: "the image R(A) of A is closed" and "the image ReA) oftA is closed". In addition, under this hypothesis, R(A)-L = NeA), ReA) = N(A)-L ex(A) = peA) and exCA) = p(A)40. The proof, which is simple, is left to the reader (see Y osida [1], p. 205). Remark 16. A closed operator, with closed image, whose indices of nullity and deficiency ex and 13 are finite, is called a Fredholm operator. Operators of this type are used, for example, in perturbation theory. D 3. Linear Operators in Hilbert Spaces Introduction. Hilbert spaces being particular Banach spaces, all the properties concerning operators in Banach spaces will be valid in Hilbert spaces. However the fact that the norm of a Hilbert space is derived from a scalar product given by a continuous sesquilinear form, is going to imply particular properties among which pride of place is given to Riesz' theorem which plays a fundamental role, as it gives the structure of the dual (resp. of the anti-dual) of such a space. Let V be a complex Hilbert space and denote by (( . , . )) the scalar product and by II. II the norm in V. If V' denotes the anti-dual of V with norm denoted by II . II * we recall Riesz' theorem. (3.1) { for all ! E V', there exists (unique) u E V such that ((u, v)) - <f, v), \Iv E V, with (3.2) Ilull Ilfll*; 40 See above (2.1.2) the definitions of the indices of nullity and deficiency rx and p. 
3. Linear Operators in Hilbert Spaces 349 (3.1) and (3.2) show that there exists an (isometric) isomorphism of V onto V', that is to say J E 2(V, V'), J- 1 E 2(V, V) such that { Ju = f, U E V IIJull* = Iluli. (3.3) We know that, if we wish, we can then identify V and V'. A classic example of this situation is then the following: we take V = HA (Q), Q a bounded open set in [Rn. We know (see Sobolev spaces, Chap. IV) that provided with the scalar product n f au oi} f - ((u, v)) = L  _ 0 dx = Vu Vvdx, i = 1 Q uX i Xi Q V is a Hilbert space for the norm lIull = Ctl fQ ::; 2 dx Y/2 = (LIVU I2 dX Y/2, From (3.1), VfE H- 1 (Q), there exists unique u E HA(Q), such that ((u, v)) = <f, v), Vv E HA(Q). It is easy to see by taking v E  (Q) < - j u, v) = < f, v ), V V E  ( Q), with the result that,  (Q) being dense in HA (Q), (3.3) is interpreted by (3.4) - j u = f E H - 1 ( Q), U E HA(Q) II - JU llw 1 (Q) = (f Q IVU I2 dX Y/2; (- j) appears here then as an isometric isomorphism of HA (Q) onto H- 1 (Q) (so (-11) E 2(HA(Q), H- 1 (Q))), i.e., is a bO,unded operator and (3.4) is the resol- ution of a Dirichlet problem. Besides, as HA(Q) c; L 2 (Q), the injection having a dense image, it follows from the Hahn-Banach theorerrl that the transpose of this injection is an injection (see 1, Corollary 9 and Example 16) and we have (L 2 (Q))' c; H- 1 (Q), with the result that if, thanks to Riesz' theorem, we identify L 2 (Q) and its anti- dual (L 2 (Q))', we have (3.5) HA(Q) c; L 2 (Q) c; H- 1 (Q); we may then wish to consider (-j) as an unbounded operator in H = L 2 (Q), associated with the Dirichlet problem, where H is provided with the norm 
 350 Chapter VI. Operators in Banach and Hilbert Spaces lul = (fQ1U(xWdX Y/2 and with the scalar product (u,v) = fQ u(x) v(x) dx. The domain D ( - L1) is then D( -L1) = {u E H6(Q); L1u E L 2 (Q)} and we can show that, if the boundary of Q is sufficiently regular, we have (3.6) D( -L1) = H6(Q) n H 2 (Q). From (3.1), (3.4) and (3.5), ( - L1) is an isomorphism of D ( - L1), provided with the norm of the graph, on H = L 2 (Q). Thus, in this example, with the sesquilinear form ((u, v)) continuous on V x V, we associate an op.erator - L1 E 2(H6 (Q), H- i (Q)) which appears as the exten- sion to V of an unbounded operator in H = L 2 (Q), (denoted again by -L1) with domain D( -L1) = H 2 (Q) n H6(Q) dense in V (and in H). This example shows also that, conversely, to the operator ( - L1) considered as the unbounded operator in L 2 (Q) associated with the Dirichlet problem, with domain D ( - L1) defined by (3.6), we can make a correspondence with a sesquilinear form u, v  ((u, v)) by ( (u, v)) = ( - L1 u, v ) for u, v E D ( - L1) x D ( - L1 ). This sesquilinear form can again be defined on H6(Q) x H6(Q) and even on H i (Q) x H i (Q). We have indicated how Riesz' theorem applied to V = H6 (Q) allows us to recover (- L1) as an operator with domain defined by (3.6). The argument leading to this result is no longer valid if we consider (( u, v)) to be defined on H i (Q) x H i (Q). We shall see in Chap. VII that, by using the methods developed in this section, the consideration of ((u, v)) defined on Hi (Q) x Hi (Q) leads to an unbounded operator ( - L1) associated with a domain different from (3.6). In the light of what has gone before, we can pose the following natural questions: Q1. Let Vbe a real (resp. complex) Hilbert space, V' its dual (resp. antidual) and a(u, v) a bilinear (resp. sesquilinear) form, continuous on V x V. Does the form a(u, v) define an isomorphism A E 2(V, V')? Q2. Is this isomorphism the extension to V of an unbounded operator in a Hilbert space H such that V  H (V dense in H) with D(A) c V? Q3. Given an unbounded operator A in a Hilbert space H, with domain D(A), can we associate with it a sesquilinear form with domain V dense in H, such that A extends to an isomorphism of V into V'? We shall reply to this type of question in this section (3). We begin by studying the linear operators, bounded or continuous, from one Hilbert space X into another Y by defining the idea of an adjoint, and then generalising it to unbounded operators. Next we reply, or give the elements of reply to the questions Q.1, Q.2, Q.3 posed above. 
g3. Linear Operators in Hilbert Spaces 351 1. Bounded Operators in Hilbert Spaces Sesquilinear forms and bounded operators. Let X and Y be two complex Hilbert spaces. We use the notation: ( . , .)x (resp. I. Ix) the scalar product (resp. the norm) in X, (., .)y (resp. 1.ly) the scalar product (resp. the norm) in Y. Here we identify X and Y with the antiduals X' and Y' respectively. We recall the 41 Definition 1. (1) A sesquilinear form on X x Y is a mappIng of X x Y  c: {x, y}  a(x, y) satisfying the conditions: (3 7) { (i) for fixed y E Y, X  a( x, y) is linear on X; . (ii) for fixed x E X, Y  a(x, y) is anti-linear (or semilinear) on Y. (2) A sesquilinear form on X x Y is said to be continuous or bounded on X x Y if there exists a constant C > 0 such that (3.8) la(x, y)1 < Clxlxlyly, Vx E X, Vy E Y; the smallest constant C for which (3.8) holds is called the norm of a and is denoted by II a II. We denote by !/(X x Y)(!/(X) if Y = X) the set of continuous sesquilinear forms on X x Y. We then have the: Theorem 1. There exists an isomorphism of !/(X x Y) onto 2(X, Y) which associates with a sesquilinear form a(x, y) the operator A E 2(X, Y) defined by (3.9) (Ax, y)y = a(x, y), Vx E 'X, Y E Y and we have (3.10) II all = II A II. Proof Let x E X be fixed; the mapping y  a(x, y) is then a continuous antilinear form on Y. From Riesz' theorem, there exists y* E Y such that a(x, y) = (y*, y)y, Vy E Y. ThemappingA:x  Ax = y*islinearandcontinuous(asa(x,y) E !/(X x Y) and satisfies (3.9)). The proof of the converse is immediate, as is that of (3.10). D 41 See g1.6, Definition 14. 
352 Chapter VI. Operators in Banach and Hilbert Spaces Example 1. "Infinite matrices" on 1 2 . We consider the particular case in which X = Y = 1 2 (on C); we take the double sequence {a hk }, a hk E C, Vh, k with (3.11 ) 00 L la hk l 2 < + 00. h, k = 1 F or all x = {x k }, Y = {Yk} E 1 2 , we put 00 a(x, y) = L ahkxkYh; h, k = 1 now, by use of the Cauchy-Schwarz inequality, it can be easily seen that ( 00 2 ) 1/2 ( 00 ) 1/2 la( x, y)l:::;; h f: 1 i>hkXk . h 11 Yh 1 2 ( 00 ) 1/2 ( 00 ) 1/2 ( 00 ) 1/2 1 a( x, y) I:::;; h.  1 I a hk 1 2 . k  1 1 X k 1 2 . h  1 I Y h 1 2 . ( 00 ) 1/2 Thus a(x, y) defines A E 2(X), II A II:::;; h,  1 1 ahk 1 2 with: x  Ax = x = (Xl' . . . , X h , . . .) 00 X h = L ahkxk. k=l (3.12) Example 2. Integral operators. We consider (see also Example 10 of 92) X = L 2 (Q), Q c !R n Y = L2(Q'), Q' c !R m and the kernel k: Q x Q'  C with ff Ik(t,u)12dtdu< +00; Q' x Q we consider a(x, y) = ff k(t, u)x(u)y(t)dtdu where x E L 2 (Q), Y E L2(Q'). {1' x Q It is easy to verify that I a(x, y) 1  ( f 1 k(t, U )1 2 dt d u ) 1/2IXIL2(Q) 'Iy IL 2 (Q'); Q' x Q a(x, y) thus defines A E 2(L 2 (Q), L 2 (Q')) given by: Ax = x with x(t) = fQ k(t, u)x(u)du. o 
9 3 . Linear Operators in Hilbert Spaces 353 1.1. Adjoint Sesquilinear Form Let a E !/(X x Y). We define the adjoint sesquilinear form a* E !/( Y x X) by (3.13) a*(y, x) = a(x, y), for y E Y, X E X. Riesz' theorem shows that we have: (3.14) a*(y,x) = (A*y,x), Vy E Y, "Ix E X, where A* E 2(Y, X) is called the adjoint of A. o Example 3. Returning to Example 1, A* is defined by (lk,h and (3.15) 00 a*(y, x) = L (lk,hYkXh. h, k = 1 o Example 4. We take up again Example 2. If A is associated with the kernel k on Q' x Q, then A * is associated with the kernel k* on Q x Q' where k* satisfies: (3.16) k* (u, t) = k( t, u), Vu E Q, t E Q', and a*(y, x) = f r k(t, u )x(u)y(t) dt duo QJQI o 1.2. Hermitian Operators We suppose here that Y = X. If A E 2(X) is such that A = A* where A* E 2(x) is the adjoint of A, we say that A is a hermitian operator 42 . In this case, the sesquilinear form associated with the operator A satisfies the condition of hermitian symmetry43. (3.17) a(u, v) = a(v, u), Vu, VEX. If X is a real Hilbert space, we say that A (resp. a( u, v)) is symmetric. Example 5. In the context of Example 1; a(x, y) = a*(x, y) and A = A* iff (3.18) - ah,k = ak,h o 42 We also sometimes say self-adjoint, but the term self-adjoint is often reserved for the case of unbounded operators. 43 See 91.6, Definition 15. 
354 Chapter VI. Operators in Banach and Hilbert Spaces Example 6. Returning to Example 2, we see that the integral operator A is hermitian if its kernel is: (3.19) k(t, u) = k(u, t), a.e. t E Q', U E Q. D We then have the Proposition 1. Let A E 2(X), A = A*. Then: (3.20) 1 (i) a(x, x) E!R, \:Ix E X, (ii) II A II = sup (Ax, x). Ixl  1 Proof (i) a(x, x) = (Ax, x) = a(x, x) E!R, \:Ix E X. (ii) Let a = sup (Ax, x) = sup a(x, x); Ixl  1 Ixl  1 then we have from Theorem 1: a  II A II. To prove the opposite inequality, we form for u and v E X, A E !R I(A(u + AV), u + Av)1 = I(Au, u) + 2A Re(Au, v) + A 2 (Av, v)1  alu + Av1 2 , from which there follows 14ARea(u, v)1  a(lu + Avl 2 + lu - Av1 2 ) = 2a(lul 2 + A 2 IvI 2 ); f\ taking A = lul/lvl, we obtain IRe ( Au, v) I = I Re a (u, v) I  a I u II v I ; - observing that we can replace v by iv, we obtain la(u, v)1  alullvl, \:Iu, v E X, from which it follows that II A II  a, by Theorem 1. Hence we have (3.20).  D 1.3. Orthogonal Projectors In 92 we have defined the projection operators on Banach spaces as idempotent operators and in 91 the orthogonal projection on a closed vector sub-space of a Hilbert space. If V is a closed vector sub-space of the Hilbert space H, then \:Iu E H, there exists a unique element v = Pu E V called the orthogonal projection of u on V, characterised by (see proposition 11 of 91) (3.21 ) (u - Pu, v) = 0, \:Iv E V. 
93. Linear Operators in Hilbert Spaces 355 We verify easily (see Theorem 9 of 1) that (3.22) p2 == P , P == p* , P E 2(H). In addition, (3.22) characterises a projection operator or a projector on N(P) == Ker P, called the orthogonal projector since the image R(P) of P and N (P) are orthogonal. We can prove also that if {P n}n E N is a sequence of orthogonal projectors which converge in 2(H) to P, then P is an orthogonal projector. 1.4. Isometries and Unitary Operators A linear operator A E 2 ( X, Y) is said to be isometric if (3.23) lAxly == lxix, Vx E X (we say also that A is an isometry). In an equivalent way, we can define the isometries as the operators A E 2(X, Y) such that (3.24) (Au, Av)y == (u, v)x Vu E X, Vv E X. Indeed, (3.24) implies (3.23) trivially. Conversely, if the condition (3.23) is valid, we have 1 2 1 2 1 2 == 2 1u + vlx - 2 1ulx - 2 1vlx == Re(u, v)x. 111 Re(Au, Av)y == -IA(u + v)l - -IAul - -IAvl 222 Hence, if we change v to iv, we obtain (3.24). We say that A is unitary if A is an isometry and if, in addition, R (A) == Y. Proposition 2. An operator A E 2(X, Y) is an isometry iff A * A coincides with the identity of X, where A * E 2 ( Y, X) is the adjoint of A. In this case, A A * is then equal to P the orthogonal projection of Y onto R (A ). Proof The first point follows from (3.24). Indeed, for u E X ( A u, A v ) y == (A * A u, v) == (u, v) x' Vv E X; from which A * A u == u. On the other hand, from (3.23), the image R (A) of A is closed, and so we can define the orthogonal projection on R (A ). Then let Z E R(A) and z == Ax, with x E X. We have AA*z == AA*(Ax) == A(A*Ax) == Ax == z, and AA * coincides with the identity on R(A). 
356 Chapter VI. Operators in Banach and Hilbert Spaces On the other hand, AA* is obviously null on R(A)-L = N(A*), the kernel of A*, from which it follows that AA * = P. D Corollary 1. An operator A E 2(X, Y) is unitary iff A* A = Ix AA * = I y where I x and I y denote the identities of X and of Y. In particular, A * is also unitary. We also verify the following properties: (1) If Al and A 2 are isometric, AIA2 is isometric. (2) If {An} is a sequence of isometric operators converging strongly to A, then A is isometric. Remark 1. We encounter operators A such that (3.25) A*A = Px AA * = P y where P x and P yare orthogonal projections into X and into Y (such operators are called partially isometric). In this case A is unitary on R(Px) into R(Py) and null on R(Px)-L. Let { An} be a sequence of operators satisfying (3.25) and converging strongly to A. Then if { A:} converges strongly to A *, A also satisfies (3.25). D Example 7. Let X = [2; the operator of displacement to the right A : x = {x n } --+- {O, X I' x 2 , . . . , X n + I' . . .} is isometric but not unitary. D Example 8. The Fourier transform (see Appendix "Distributions"). Let X = Y = L 2([Rn); with u E L 2([Rn), we associate its Fourier transform u(y) = f e-ix'Yu(x)dx. [Rn The mapping IF so defined admits for inverse IF - I defined by I A 1 f . A 1 * A (IF- u)(x) = - elx.Yu(y)dy = - (IF u)(x). (2n)n [Rn (2n)n We therefore have that the mapping (2n)- tn IF is unitary. D Example 9. Fourier Series (see Chap. IlIA, 1). Let X = L 2 (0,2n) and Y = [2(Z). 
3. Linear Operators in Hilbert Spaces 357 The mapping ff which with u E L 2 (O, 2n) associates the sequence {Uk}k E d: defined by A 1 f 21t . Uk == - e - lkO U ( fJ) dfJ 2n 0 is unitary (to within a factor (2n)-t). In fact, the inverse mapping is given by the mapping {Uk}  u defined by u( 8) == L uke ikO , kEd: and satisfies Parseval's relation (see Chap. IliA, 9 1 ) 1 f 21t - lu(8)1 2 dfJ == L lu k l 2 . 2n 0 kEd: o Remark 2. We shall see other examples of isometric or unitary operators in the sequel: (a) isometric operators and the Cayley transform, (b) partially isometric operators and self-adjoint extensions of symmetric oper- a tors, (c) unitary semi-groups of class o (see Chap. XVIIA, 9 4 ). o Remark 3. If inf (Ax, x) > 0, the hermitian operator or the associated Ixl ;?: 1 hern1itian form 44 are said to be positive definite; then the hermitian form a( u, v) defines on H a scalar product equivalent to the initial scalar product. o 1.5. Hilbert-Schmidt Operators We recall that a continuous linear operator T: X  Y is compact if the trans- forms by T of the bounded sets of X are relatively compact in Y. Example 10. The compactness of the injection HS(Q) <=+ L 2 (Q), s > 0, for Q sufficiently regular, implies that of several inverse operators of closed operators A of domain D(A) c HS(Q) for a certain s > 0 (see Chap. IV, Chap. VII and Chap. VIII.) o The spectral properties of compact operators will be studied in detail In Chap. VIII. We study here a class of special compact operators: the Hilbert-Schmidt operators. Let us suppose that X is a separable Hilbert space; the space X therefore admits 44 For the hermitian form, that is in accord with the definition of 1. 
358 Chapter VI. Operators in Banach and Hilbert Spaces an orthonormal base {en}n E rj, that is to say such that { I n==m (en, em) == 0 th . o erWlse and that the series L (u, en}e n converges to u, for all u E X. In particular, we nEN have, for all u E X, the relation: (3.26) lul 2 == L I(u, e n }12. n E N Let T E 2(X, Y}; we say that T is a Hilbert-Schmidt operator if def ( ) 1/2 IITII2 == L ITenl < +00. nEN The quantity II TI12 thus defined is a norm called the Hilbert-Schmidt norm. Let us verify that it is independent of the choice of the base (en). Let (fm) be an orthonormal base in Y. We then have, by applying the analogue of the relation (3.26): (3.27) I Tenl == L I(Te n ,fm}yI2 == L I(e n , T*fm}xI 2 ; mEN mEN from which, on summing over n E N, and interchanging Land L 45 , we have n m L I Tenl == L L I(e n , T*fm}li == L I T*fml, nEN mENnEN mEN which shows at the same time that the two members are independent of the orthonormal base chosen. II T 112 == II T * 112 where T* E 2( Y, X) is the adjoint of T. We introduce the space 2 2 (X, Y) == {TE2(X, Y}: IITI12 < +oo} of Hilbert-Schmidt operators. We shall show that it is a Hilbert space for the scalar product (T, S) 2 == L (Ten, S en )y . n E N We state as a preliminary the following properties whose verification is left to the reader 46 . 45 Which is permissible for series with positive terms. 46 See Chap. VIII and Aubin [1], p. 261. 
3. Linear Operators in Hilbert Spaces 359 Proposition 3. (i) IfTE 2 2 (X, Y) and BE 2(Z,X), then TB E 2 2 (Z, Y) with II TB 112  II TI12 II B 112(Z, Y). (ii) If T E 2 2 (X, Y) and A E 2( Y, Z). then AT E 2(X, Z) with IIATI12  IIAI12(x,z)IITI12. (iii) For T E 2 2 (X, Y), we have inequality (3.28) II TII  II T 112. Proposition 4. The space fE 2 (X, Y) is complete for the norm II. II (so it is a Hilbert space). Proof Let us consider a Cauchy sequence {Tn} c: fE 2 (X, Y) for the norm II. 112; it is also a Cauchy sequence for the norm of 2 (X, Y) (see (iii) above); hence it converges to T E 2(X, Y). We now show that Tis Hilbert-Schmidt. For every prescribed 8 > 0 and for all I, we have: 1 L I(Tm - Tp)ekl  8 2 , k=l for m and p sufficiently large (independently of l); hence for p  00 1 L I(Tm - T)ekl  8 2 , Vl. k=l It follows that II Tm - TI12 < 8 for m sufficiently large, and hence that TE 2 2 (X, Y); in addition T = lim Tm in the sense of the norm 11.112. m 00 D Proposition 5. The Hilbert-Schmidt operators are compact: 2 2 (X, Y) c: x(X, Y). 00 Proof In effect since L I Tek I  < 8 2 for n sufficiently large, denoting by Tn k=n+l the operator of finite rank defined by { Tnek _ Tek, (3.29) T"e k - 0, k  n k > n, the inequalities II T - Tn 11 2  II T - Tn II   8 2 , n sufficiently large, show that T, the uniform limit of the sequence {Tn} of compact operators is compact. D 
360 Chapter VI. Operators in Banach and Hilbert Spaces Remark 4. The space 2 2 (X, Y) is not closed in ff(X, Y) (nor in 2(X, Y)) for the norm II. II of !e(X, Y). In fact, with the help of spectral theory, we can show (see Chap. VIII) that every compact operator is the uniform limit of operators of finite rank on a Hilbert space; now, there exist compact operators which are not of Hilbert-Schmidt type (see Remark 5 below). In particular in !e(X), !e 2 (X) = !e 2 (X, X), the space of Hilbert-Schmidt operators on X is a two-sided ideal not closed in X but a Banach algebra for the norm II . II 2. o Example 11. Integral operator of Hilbert-Schmidt type. Let A: L 2 (Q)  L 2(Q') be the integral operator considered in Example 2 as arising from a kernel k satisfying (3.30) IIkll 2 = II Ik(t,u)12dtdu < 00; Q' x Q it is of Hilbert-Schmidt type. As a matter of fact, if { xn} and {x} are two complete orthonormal systems in L 2 ( Q) and L 2 (Q'), {X n @ x} is also acorn plete orthonormal system in L 2 ( Q x Q'). II A II = n; I(Ax n , x;"W n; I IQ'xQ k(t, u)Xn(u)x;"(t)dtdu 2 = II Ik(t, u)1 2 dtdu = IIkll 2 . Q'xQ This example is used especially in Chap. VIII. o Remark 5. Conversely, as stated previously, all compact operators are not necessarily Hilbert-Schmidt operators. More precisely, we show that a compact opertor A: X  Y is of Hilbert- Schmidt type iff there exist orthonormal bases { en} n E N and {1m} mEN in X and Y respectively and a sequence of scalars {An}n ENE 1 2 such that Ax = L An(e n , x) fn. n EN (Indeed, in this case we have ( ) 1/2 " A 112 = L I An 1 2 < + 00.) n EN (In the case X = Yand where the two bases {en} and {fn} are identical the An are the eigenvalues of A.) o 
3. Linear Operators in Hilbert Spaces 361 Remark 6. We shall see later (see Chap. VIII) another category of compact operators: the operators of the trace class (or nuclear operators). D 2. Unbounded Operators in Hilbert Spaces 2.1. Adjoint of an Unbounded Operator As in the general case of Banach spaces, we encounter the diverse notions of closed operator, with closed image, and of transposed operator. In the context of a Hilbert space (on C) the notion of adjoint operator arises in a natural way (because of a scalar product which uses the antidual rather than the dual). Let X and Y be two complex Hilbert spaces as in 93.1 and with the notation of 93.1. Let A be a linear operator with domain D(A) c X into Ywith D(A) dense in X. The closed operator A * : Y  X defined by (3.31 ) D(A*) == {y E Y; 3x* E X such that (y, Ax)y == (x*, x), \Ix E D(A)} and (A *y, x)x == (y, Ax )y, \Ix E D(A) and y E D(A*), is called the adjoint of A. We can verify that the graph of A and the inverse graph of - A * are orthogonal in X x Yand supplementary. 0 Finally, X and Y being reflexive, the domain of A * is likewise dense and, on the   other hand, if A is closed, it coincides with the double adjoint or biadjoint A **. We also have the orthogonality relations (3.32) N(A *) == R(A)-1- and N(A) == R (A *)-1-. If in addition R (A) is closed, then (Prop. 21, 92) R (A *} is closed and we have the reciprocal orthogonality relations (3.33) R ( A) == N ( A * )-1- and R ( A *) == N ( A )-1- . 2.2. Symmetric Operators We consider Y == X. Definition 2. An unbounded operator A, with domain D(A) dense in X, is said to be symmetric if one of the following three equivalent conditions is satisfied: (i) A * :::> A, (ii) (Au, v) == (u, Av), (iii) (Au,u) E IR \lu, v E D(A), \lu E D(A). 
362 Chapter VI. Operators in Banach and Hilbert Spaces If, in addition, A = A* (hence with D(A) = D(A*)), the operatorA is said to be self-adjoint. 2.3. The Cayley Transform An interesting tool enabling the passage from self-adjoint operators to unitary operators is the Cayley transform which will be useful in Chap. VIII (spectral decomposition) as for problems of evolution (see semi-groups, Chap. XVIII). We remark first of all that if A is symmetric, then (3.34) I(A + il)vl 2 = I Avl 2 + Iv1 2 , Vv E D(A); we deduce immediately that if A is symmetric, the operator A + ail, (8 = + 1) is injective and hafi a closed image. It follows that the operator (3.35) U A = (A - il)(A + il)-l is well defined on D( U A) = R( A + iI). This operator U A is called the Cayley transform of the operator A. Its interest lies in the fact that it allows us to characterise self-adjoint operators. First of all, let us note the C", Theorem 2 (von Neumann). Let A be a<c:)Symmetric operator whose domain D ( A) is dense in X. Then:" ./ (i) its Cayley transform is a closed isometry from R( A + il) into R( A - il); (ii) the operator (1 - UA) is invertible; (iii) A = i(l + UA)(l - UA)-l, D(A) = R(l - UA); (iv) R(l - UA) is dense in X. Proof (i) This point follows from (3.34) which shows that the operators (A + il) are isometries on D(A) provided with the norm of the graph: hence U A is similarly an isometry of R(A + il) into R(A - il); these spaces are closed, hence U A is closed. (ii) Let us prove that (1 - U A) is invertible: let y E R(A + il) be such that (1 - UA)y = 0; writing (3.36) { Y = (A + il)x, x E D(A) UAy = (A - il)x it follows that (3.37) 1 x = 2i (1 - UA)Y = 0 1 Ax = 2 (1 + UA)y = 0; from which y = o. 
3. Linear Operators in Hilbert Spaces 363 (iii) The expression for A as a function of V A follows immediately from the preceding calculation: A == i(/ + V A )(/ - VA)-l on D(A) == R(/ - VA). D Let us now examine the inverse correspondence: Theorem 3 (von Neumann). Let V be a closed isometric operator such that R(1 - V) is dense in X. Then, there exists one and only one closed symmetric operator A whose Cayley transform is V. Proof. (i) Let us first show that (/ - U) is invertible: for that we assume that there exists y such that (1 - V)y == O. For all Z E R(/ - V), let z == (/ - V)x; we form: (y, z) == (y, x) - (y, Vx) == (Vy, Vx) - (y, Vx) == (Vy - y, Vx) == O. As R (1 - U) is dense in H it follows that y == O. (ii) Let us put: A == i(/ + V)(1 -V)-l on D(A) == R(/ - V); then: (a) A has a dense domain; (b) A is symmetric; indeed, for x == (1 - V)u and y == (1 - V)v in D(A), let us form (Ax, y) == (i (1 + V)u, (1 - V)v) == i [( U u, v) - (u, V v ) ] == ((1 - V)u, i(1 + V)v) == (x, Ay). The Cayley transform of the operator A, so constructed, coincides with V; indeed, let x == (1 - V)u then Ax == i(1 + V)u. A simple calculation shows that (3.38) { 2iu == (A + i1)x 2iVu == (A - i1)x. Bringing together these results with (3.36)47 we have D(V A ) == {2iu; u E D(V)} == D(V) and V A (2iu) == 2iVu == V(2iu), from which: V == V A. To complete the proof, let us show that A is closed; let {Un} c D(A) be a sequence such that X n == (1 - V)u n  x with AX n == i(1 + V)u n  y.   Witten for arbitrary Y E R(A + iI). 
364 Chapter VI. Operators in Banach and Hilbert Spaces From (3.38), the sequences {Un} and {V Un} converge, and V being closed: Un ---+- U U Un ---+- V U. It follows that x n ---+- X = (I - V)U AX n ---+- y = i(I + V)u = Ax from which we have the stated result. D We now state the theorem on the characterisation of self-adjoint operators. Theorem 4. A symmetric operator A is self-adjoint iff (3.39) R (A + iI) = R (A - iI) = X. Proof (i) The condition is necessary. If A is self-adjoint, therefore closed. R(A + aiI)1- = N(A - ail) = {O}, a = + 1, and since R(A + ail) is closed, R(A + c;iI) = X. (ii) The condition is sufficient. Let us suppose that (3.39) holds; we shall show that A* c A , which together with A c A * (A symmetric) will imply A = A *. Let y E D(A *); there exists x E D(A) such that (A* + iI)y = (A + iI)x = (A* + iI)x; from which (A* + iI)(y - x) = 0 and y - x E N(A* + iI); but N(A* + iI) = R(A - iI)1- = {O}, so y = x from which the result follows. D From Theorems 2, 3, 4 we deduce the Corollary 2. A symmetric operator A: D (A)  X ---+- X is self-adjoint iff its Cayley transform VA = (A - iI)(A + iI)-l is unitary. Proposition 60 If A is a self-adjoint operator in X, with domain D(A) which is injective, then A-I is likewise self-adjoint. Proof (i) First of all, (A - 1)* exists as D (A - 1) = R (A) is dense in X; indeed: R(A)1- = N(A*) = N(A) = {O}. 
3. Linear Operators in Hilbert Spaces 365 (ii) A -1 is symmetric. Indeed, let x, y E R(A). Then there exist u, v E D(A) such that x = Au, y = Av and we have: (A - 1 x, y) = (u, A v) = (A u, v) = (x, A - 1 Y ). (iii) Now, let us prove that A -1 satisfies (3.39) R(A- 1 + iI) = (A- 1 + iI)R(A) = + i(A + iI)A- 1 R(A) = + i(A + iI)D(A) = X. Hence the result follows. D In a similar fashion we can prove the Proposition 7. If A' is self-adjoint in X, then An is self-adjoint in X. Finally, we draw attention to the following result due to von Neumann which describes an important class of self-adjoint operators. Theorem 5. Let X and Y be two Hilbert spaces and let B: D(B) c X  Y be a closed operator whose domain is dense in X. Then B* B is self-adjoint in X. Proof We have already pointed out that in the Hilbert space X x Y provided with the natural scalar product, the subspaces G(B), graph of B, and G S ( - B*), the inverse graph of - B* are orthogonal and supplementary. (i) Let us prove that B* B satisfies the conditions of Theorem 4, remarking that in this theorem, the choice of + i and - i can be replaced by a E  provided (see Chap. VIII) that - a does not belong to the spectrum of the operator A. Here (B*Bx, x)x = (Bx, Bx)y = IBxl  0, x E D(B*B) c D(B); hence taking a = + 1, we shall show that: (3.40 ) R(I + B*B) = X. We note that for all Z E X, we have the following decomposition: (Z,O) = (x, Bx) + (-B*y, y)48 xED (B), Y E D(B*), so (3.41 ) { Z = x - B*y o = Bx + y, it follows that x E D(B* B) and that Z = x + B* Bx from which (3.40) follows. (ii) We now establish that D(B* B) is dense in X: Let Z E D(B* B) in X; with the decomposition (3.41), we have in particular o = (z, x) = Ixl 2 + (B*Bx, x) = Ixl 2 + IBxI 2 , from which x = Z = O. The Theorem 5 follows. D 48 Do not confuse (,), which here denotes a pair, with the scalar product. 
366 Chapter VI. Operators in Banach and Hilbert Spaces Remark 7. Self-adjoint extensions of a symmetric operator. Being given a symmetric operator A with domain D(A) dense in X, we can propose to study the existence of symmetric opeators Al (eventually self-adjoint) extending A. In these conaitions A C Al c Ai c A*. Such strict extensions may not exist. A symmetric operator will be said to be maximal symmetric if it does not admit of a strict symmetric extension. We note tha t: (a) a maximal symmetric operator is closed, so A == A **, (b) every closed symmetric extension of A is an extension of A **, (c) if A * == A **, then A * is the sole symmetric extension of A; the operator A is then said to be essentially self-adjoint: its closure A == A ** == A * is a self- adjoint operator. We can give conditions bearing generally on the Cayley transform in order that an operator admits symmetric extensions. For that we refer to Akhiezer- Glazman [1J, Kato [1J and the bibliographies contained in. We note only that a closed symmetric operator is self-adjoint iff its deficiency indices 49 are (0, 0) and that a closed symmetric operator admits strict self- adjoint extensions iff its deficiency indices (m, n) satisfy m == n > O. 2.4. Normal Operators Definition 3. Let N be a closed operator in X with domain D(N) dense in X; N is said to be normal if it commutes with its adjoint: N* N == N N* 50 Examples of normal operators are given by: self-adjoint or hermitian operators, bounded unitary operators, functions of hermitan or self-adjoint operators (see Chap. VIII). The interest of normal operators rest more especially in the fact that compact normal operators admit a particularly simple spectral decomposition and that many operators occurring in applications have compact normal inverses (see Chap. VIII, Spectral decomposition). We now give the important result on the characterisation of normal operators: Proposition 8. An operator N: D(N) c X  X, closed and densely defined, is normal iff: (3.42) { (i) D(N) == D(N*) (ii) INxl x == IN*xlx, \/x E D(N). 49 When A is a closed symmetric operator, the deficiency index f3(A  I) (defined in 2.1.2) for  E C remains constant in each of the half-planes Im > 0, Im < 0; we then denote by the deficiency indices of the symmetric operator A the pair of these constant values. 50 Note that this means that D(N* N) = D(N N*). 
3. Linear Operators in Hilbert Spaces 367 Proof Since N is closed, then, by Theorem 5, N N* and N* N are self-adjoint. (1) The condition is sufficient: (ii) is equivalent to (Nx, Ny) = (N*x, N*y), \Ix, y E D(N) = D(N*). Then if x E D(N* N), and Nx E D(N*), the mapping y  (Nx, Ny)x = (N* Nx, Y)x = (N*x, N*y) is continuous and antilinear, so: N*x E D(N**) = D(N), hence x E D(N N*); in addition: (NN*x,y)x = (N*Nx,y)x, from which N* N x = N N* x. We have thus proved the inclusion: N* N c N N*. (We can, in a similar way show that N N* c N* N.) (2) The condition is necessary: if N is normal: \ly E D(N) dense in X from which (x,N*Nx) = (x,NN*x), \lxED(N*N) INxl x = IN*xlx. We prove that the equality is true for x E D(N) and also D(N*) (which are therefore equal) in passing to the limit since D(N N*) = D(N* N) is dense in D(N*) and D(N) for the norm of the graph (see Kato [1]). D 2.5. Sesquilinear Forms and Unbounded Operators Let V be a complex Hilbert space. We denote by ((., .)) (resp 11.11) the scalar product (resp. the norm) in V. Let (u, v)  a(u, v) be a continuous sesquilinear form on V x V, that is to say satisfying (3.43) { there exists a constant C > 0 with I a( u, v) I  ell u 11.11 v II for all u, v E V. From Theorem 1, we can associate with a(u, v) a continuous linear operator .91 E 2(V). With the adjoint sesquilinear form of a defined by (3.44 ) a*(u, v) = a(v, u) is then associated .91* E 2(V), the adjoint of .91, satisfying: , ((.91 u, v)) = ((u, .91* v)) for all u, v E V. 4- Now let V' be the antidual of V, which we do not here identify with V. If !/( V) denotes the set of the continuous sesquilinear forms on V x V, we then have the analogue of Theorem 1. 
368 Chapter VI. Operators in Banach and Hilbert Spaces Theorem 6. There exists an isomorphism of !/ (V) onto 2 (V, V') which associ- ates with a sesquilinear form a(u, v) the operator A E 2(V, V') defined by: (3.45) a(u, v) = <Au, v), \lu, v E V « , ) antiduality between V' and V) and we have (3.46) II a II = II A 11(v, V'). Proof The proof is similar to that of Theorem 1. We observe. that for u fixed in V, the mapping v  a(u, v) defines u* E V' such that a(u, v) = <u*, v), \lu E V, the correspondence u  u* being linear. D The operator A * E 2(V, V') defined by a*(u, v) satisfying (3.44) is the adjoint of A defined by a(u, v) (see (3.14)). We verify easily that we have { N(A) = R(A*)1- 51 (3.47) N(A *) = R(A)-L, orthogonality being taken in the sense of the antiduality between Vand V'. We shall now give a condition ensuring that A (resp. A *) are isomorphisms of V onto V' which corresponds to (3.1) and replies to the question Q.1 of the introduction. Definition 4. The sesquilinear form a(u, v) E !/(V) is said to be V-coercive 52 if it satisfies { there exists a constant a > 0 such that (3.48) Rea(u, u)  allul1 2 , Vu E V. We then have the Theorem 7 (The Lax-Milgram .Theorem). Let a(u, v) be a continuous sesqui- linear form on V x V which is V-coercive and let A be the associated operator. Then A E 2(V, V') is an isomorphism of V onto v'. Proof (i) a(u, v) being continuous V-coercive we have: a II v 11 2  Re a( v, v) = Re < A v, v)  II A v II * II v II , so (3.49) IIAvll*  all vii, \Iv E V. (ii) As Re a*(u, v) = Re a(u, v) = Re a(u, v), we have, in a manner similar to that in establishing (3.49), that (3.50) IIA*vll*  all vii, \Iv E V. 51 See (3.32). 52 We say V-coercive or V-elliptic. 
3. Linear Operators in Hilbert Spaces 369 (iii) It follows from (3.47), (3.49) and (3.50) that A (resp. A *) is injective with an image which is dense, and closed in V'. Hence we obtain the theorem. o Remark 8. Theorem 7 is valid, with the same proof, under the hypothesis, la(u, u)1  all u11 2 , a > 0, Vu E V. o Remark 9. Theorem 7 means that { 'iff E V there exists a unique U E V such that a(u, v) - <I, v), Vv E V, (3.51) or again that { Vf E V', there exists a unique u E V such that (3.51)' Au =f We shall see numerous applications of this theorem in Chap. VII. o Remark 10. In many situations (e.g. concerning evolution equations, semi- groups, Laplace transform) we are not required to find u satisfying Au = I, but do have to find u satisfying f given in V' , Au + AU = f This leads to introduce two Hilbert spaces instead of the single space V (V I is intrinsically linked with V). Thus let us consider the following situation: We introduce two Hilbert spaces V and H with (3.52) { V C; H the continuous injection C; having its image dense in H. Then we can identify H with its antidual H', and therefore (3.53) V c; H c; V'; we denote by (. , . ) (resp. 1.1) the scalar product (resp. the norm) in H. Instead of denoting by <I, v) the scalar product in the antiduality between V I and V, we denote it by (I, v) which is justified by the identification of H with H', the antiduality limiting itself in H to the usual scalar product. 
370 Chapter VI. Operators in Banach and Hilbert Spaces Hence we now consider the problem: Find u E V such that we have: (3.54) a(u, v) + A(U, v) == (f, v), Vv E V A being a given complex number. We are thus led to modify Definition 4. Definition 5. The sesquilinear form a(u, v) is said to be V-coercive with respect to H if there exist Ao E  and a > 0 such that (3.55) Rea(u, u) + A o lul 2  allul1 2 , Vu E V. Corollary 3. If a(u, v) satisfies (3.55) the problem (3.54) has a unique solution for all A E C satisfying (3.56) Re A  Ao. Proof The proof is immediate from the observation that A + AI is defined by the sesquilinear form a;.(u, v) == a(u, v) + A(U, v) which is continuous on V x V and V-coercive in the sense of Definition 4 if (3.56) holds. o Unbounded operator A defined by the triplet (V, H, a(u, v)). Let us consider V, H two Hilbert spaces in the situation (3.52) and let a(u, v) E Y( V) be V-coercive in the sense of Definition 4. If in (3.51), we take f E H, there exists unique u E V such that Au == f E H. Let (3.57) D(A) == A- 1 (H) == {u E H; Au E H}. We shall see that we can consider A restricted to A -l(H) as an unbounded operator in H with domain D(A), Theorem 7 signifying that AlA -l(H) is an isomorphism of D(A), provided with the norm of the graph, onto H. We prove the Proposition 9. Under the preceding hypotheses: (i) We have the following characterisation of D( A): { space of the u E V such that v  a(u, v) (3.58) D(A) == is continuous on V for the topology of H (with an analogous characterisation of D( A *)); (ii) D(A) (resp. D(A *)) is dense in H; (iii) the restriction of A E 2(V, V'), denoted again by A, which domain D(A), is closed (same result for A *). Proof (i) If W denotes provisionally the space described by the second member of (3.58), we have by takingfE H in (3.51): <Au, v) == a(u, v) == <I, v) == (I, v), Vv E V, 
3. Linear Operators in Hilbert Spaces 371 with the result that la(u, v)1 == ICt: v)1  Ifl.lvl, Vv E V, so U E Wand D(A) c W. The converse is immediate. (ii) Let us suppose that there exists f E H with (u, f) == 0, Vu E D(A). Using (3.51) with A replaced by A *, we see that there exists unique u* E D(A *) such that A *u* == f Thus: (u, f) == (u, A *u*) == (A *u*, u) == a*(u*, u) == a(u, u*) == (Au, u*) == 0 But (Theorem 7 and (3.57)) A maps D(A) onto H; u being arbitrary in D(A), Au == y is arbitrary in H with the result that (y, u*) == 0, Vy E H, which implies u* == 0 so f == o. (iii) Let Un E D(A) with Un  u in H and Au  fin H. We put fn == Au n ; fn  f in H and in V' from (3.53). Let us introduce A -1 E 2(V', V); then Un == A -lfn converges to A -if in V hence in Hand u = A-if Thus u E D(A) and Au = f o Remark 11. (i) We have (3.59) D(A) is dense in V. To see this, it is enough to note that if there exists f E V such that ((u, f)) == 0, Vu E D(A), then introducing d* E 2(V) there exists u* E V such that d*u* = f, and ((u, f)) = a*(u*, u) == a(u, u*) == (Au, u*) = 0, Vu E D(A), from which u* = o. (ii) We note also that v  (Rea(v, v))!- is an equivalent norm to the norm on V, with the result that V is the closure of D(A) for this norm. o Let us now prove the Proposition 10. Under the preceding hypotheses, the operator A * with domain D( A *) defined by the triplet { V, H, a* (u, v)} is the adjoint of the unbounded A with domain D(A). - - Proof _Let A be the adj2int of A. We shall show that A = A*, that is to say that D(A) = D(A *) and Au == A *u for u E D(A *). 
372 Chapter VI. Operators in Banach and Hilbert Spaces Let v E D(A); then the form u  (Au, v) is continuous on D(A) for the topology of H and we have -- (Au, v) == (u, Av). Then, let v* E D(A *) be the unique solution of -- A *v* == Av. , we have (Au, v) == (u, A *v*) == a* (v*, u) == (Au, v*), so (Au, v - v*) == 0 for all u E D(A), and hence -- v - v* == 0 and v E D(A*), Av == A*v. -- -- We thus have D(A) c D(A *) and as D(A *) c D(A) is obvious, the proposition is proved. o Consequence: If a E g(V) is V-coercive and hermitian (a(u, v) == a*(u, v)), then the unbounded operator A with domain D(A) defined by (3.58) is selfadjoint. Remark 12. The operators A and A* E 2(V, V') (defined by a(u, v) and a*(u, v) E g(V) and V-coercive) are the extensions to V of the unbounded operators denoted again by A and A * defined respectively by the triplets (V, H, a) and (V, H, a*). Numerous examples of this situation are given in Chap. VII. o Remark 13. An operator A (unbounded in H) is said to be accretive if for all u E D(A) we have (3.60) Re(Au, u)  O. A self-adjoint operator which is accretive is said to be posltlve. Thus, an unbounded operator A defined by a triplet (V, H, a(u, v)), with a E g(V) and V- coercive, is accretive, and if a(u, v) is V-coercive with respect to H, then A + AI is accretive for suitable A E . More generally, a sesquilinear form a(u, v) is said to be regularly accretive if: (3.61 ) (i) for all G > 0, there exists a( e) > 0 such that Re a(v, v) + elvl 2  a(e)11 v 11 2 , \Iv E V (ii) 11m a(v, v)1  f3 Re a(v, v), \Iv E V. The unbounded operator A defined by the triplet (V, H, a(u, v)) (then said to be "regularly accretive") is accretive. If A is accretive with domain D( A) dense in H, we can seek all the accretive 
3. Linear Operators in Hilbert Spaces 373 extensions __ of A; A is said to be maximal accretive 53 if there does not exist an operator A with (3.62) -- A :::J A, where A is accretive and where the inclusion is strict. o Remark 14. Sectorial operators. An unbounded operator in H with domain D(A) is said to be sectorial if there exists 8 E [0, tn[ such that (Au, u) E S ( 8 ) lul 2 ' (3.63) Vu E D(A) where (3.64) S(8) == {A E C; ArgA  8} is the sector in the complex plane of angle 28, vertex the origin and symmetric with respect to the real axis. In particular, the operator A is accretive. We should also notice that the accretive self-adjoint operators (A == A*, (Au, u)  0, Vu E D(A)) are in particular sectorial operators with 8 == o. The sectorial and accretive operators plays an important role in the theory of semi-groups and we shall give (see Chap. XVII on semi-groups) a character- isation of the m-accretive operators. o Remark 15. We corne to see how it is possible to reply to the questions Ql and Q2 of the introduction. Let us give some indications of how to reply to Q3. Suppose that A is an unbounded operator in H defined on 1/ c H, 1/ without topology and also that A is strictly accretive on 1/ (i.e. Re(Au, u) > 0, u i= 0, U E 1/). Then we can associate with A, the sesquilinear form a(u, v) defined by: (3.65) which satisfies a(u, v) == (Au, v), Vu,v E 1/, (3.66) Re a(u, u)  0, Vu E 1/, a(u, u) == 0 =? U == o. Then (3.67) II v II == (Iv1 2 + Re a(v, V))1/2 gives 1/ a structure of a pre-Hilbert space. Let V be the completion of 1/ for this norm. 53 Or m-accretive. 
374 Chapter VI. Operators in Banach and Hilbert Spaces If V is also contained in H (that is to say if the injection 1/  H extends to a continuous injection of V  H), then the sesquilinear form a(u, v) which extends by continuity to V (in fact V x V) to a sesquilinear form again denoted by a(u, v), defines with the help of the triplet (V, H, a(u, v)) an accretive operator (again denoted by A) with domain D(A) :::::) 1/. If the operator A (defined on 1/) is also sectorial, then we can prove (see, for example, Kato [1J) that V is contained in H, and the sesquilinear form a(u, v) extended to V also satisfies: (3.68) 11m a(u, u)1  f3 Re a(u, u), Vu E V, and (V, H, a(u, v)) defines an operator, again denoted by A, regularly accretive 54 , with domain D(A) :::::) 1/, satisfying (3.61). We do not here dwell further on these questions. For a deeper study of accretive operators and their extensions, we refer to Chap. XVII, semi-groups, Chap. XVI, Laplace transform, and to Kato [1J. Review of Chapter VI In  1 of this chapter, we have given successively the fundamental results of functional analysis indispensable to the reader for the sequel. We have directed the study on normed spaces, but many of the results extend to more general spaces. Tbe Hahn-Banach theorem plays here a considerable part notably in the theory of duality and is used time and again in the course of this work (Sobolev spaces, Appendix "Distributions", . . .). The use of the property of weak compactness of the unit ball of a reflexive Banach space (hence of a Hilbert space) is of constant use for passing to the limit in the majority of the constructive methods of solution of variational problems (Galerkin method, finite difference method, . . .). 2 is the study of linear operators, bounded or not, and of their adjoints in Banach spaces; it shows the importance of closed operators. 3 specialises the properties encountered in 2 to Hilbert spaces, which are very special Banach spaces; certain of the properties are simpler in this con text. The concepts of self-adjoint, normal, compact and unitary operators are funda- mental for the understanding of later chapters, particularly VII and VIII. Finally it is noted that if the data of an operator A is in fact the data of its graph, it is sometimes more advantageous, when this is possible, to define this operator by a suitable sesquilinear form. This is the point of view which has been taken up in 3, in Sect. 2.5, and which paves the way to the following chapter, Chap. VII. 54 The operator A is then also sectorial, with the same angular sector 8(8) (see (3.63), (3.64)) as the operator A, given initially). 
Chapter VII. Linear Variational Problems. Regularity Introduction The first part of this chapter ( 1 and 2) is devoted to the study of the variational method of solving boundary value problems involving elliptic operators, as, for example, the Dirichlet problem or the Neumann problem for the Laplacian operator or more general operators. Let V be a real Hilbert space, a a continuous bilinear form on V x V, and L a continuous linear form on V; we consider the following problem: Find u E V such that a(u, v) = L(v), \Iv E V. This is called the abstract variational problem. We recall (see Chap. VI, 3, Definition 4) that the bilinear form a is said to be V- elliptic (or V-coercive) if there exists a > 0 such that a(v, v)  allvl1 2 , \Iv E V. Questions of existence and uniqueness are treated with the help of the Lax- Milgram theorem. We next give two very simple examples (Sect. 2, 1), then complements in the case (Q) eVe H c '(Q) (Sect. 3, 1). We give some results on V-ellipticity for elliptic operators of order 2m with the help of Garding's inequality in Sect. 5,  1. Several examples are given in 2. Finally, the more delicate question of the regularity of the solutions is taken up in 3 of which is enough to accept the statement of the results for the reading of subsequent chapters of this work. 91. Elliptic Variational Theory The Hilbert spaces which we shall consider here will be supposed a priori to be complex except when express mention is made to the contrary. We denote by V' the antidual of V. 
376 Chapter VII. Linear Variational Problems. Regularity 1. The Lax-Milgram Theorem Let V be a complex Hilbert space, a(u, v) a sesquilinear linear form continuous on V x V, and LEVI, a continuous antilinear form on V. l We consider the following problem: { Find u E V such that a(u, v) == L(v), \Iv E V, called the abstract variational problem. We prove the following existence result: (1.1) Theorem 1 (Lax-Milgram Theorem). Under the preceding hypotheses, if in addition the sesquilinear form a is V-elliptic, i.e. if there exists a > 0 such that (1.2) Rea(v, v)  allvlj, \Iv E V, then the problem (1.1) admits a unique solution. Remark 1. This follows immediately from Chap. VI, 3, Theorem 7, the operator A y defined below (like At) being an isomorphism of V onto VI. We give here a different proof. Proof The sesquilinear form a being continuous, there exists M > 0 such that ja(u, v)1  Mllull y Ilvll y . We introduce the operator Ay: V  V defined by (Ayu, v)y == a(u, v), \Iv E V where (. , .)y denotes the scalar product in V. The existence of Ayu follows from the Riesz theorem 2 , the antilinear form v  a(u, v) being continuous on V. We verify immediately that A y is bounded: I(Ayu, v)yl IIAvuliv = sup II II ::( Mllull v . veY v y v* 0 (1.3) On the other hand, again from the Riesz theorem, there exists b E V such that (b, v)y == L(v), \Iv E V. Setting, for u E V, Tu == u - p(Ayu - b), the problem reduces to finding the fixed points of T. 1 See Chap. VI, for the definition of these terms and the notation used. 2 See Chap. VI,  1, Theorem 10. 
 1. Elliptic Variational Theory 377 We have II TU2 - TUlll == II(U2 - u 1 ) - P(A y U 2 - AYUl)ll II U2 - u l l1 2 - 2p Re(a(U2 - Ul, U2 - U 1 )) + p211Ay(U2 - ul)ll  (1 - 2pa + p 2 M 2 )11 U2 - ulll. Hence T is a strict contraction in V for 0 < p < 2a/ M 2 , and as a result admits a unique fixed point u. The problem (1.1) therefore admits a unique solution. D Remark 2. (1) The preceding proof is constructive in the sense that it gives a recurrence relation u n + 1 == Tun converging strongly to the solution of the problem. (2) The operator Ay constructed in the proof is thus an isomorphism of V into V. D Remark 3. In the case in which the sesquilinear form is hermitian, that is to say (see Chap. VI, 1, Definition 15) a(v, u) == a(u, v), \I u, V E V, we verify that the solution u of the problem (1.1) is also the solution of the optimisation problem: find u E V satisfying (1.4) J(u) == inf J( v) veY where J: V   is the quadratic function defined by 1 J(v) == 2a(v, v) - ReL(v). Indeed, the gradient of J at u, J'(u) say, is given by3 <J'(u), v) == lim - e 1 [J(u + ev) - l(u)] 8-+0 = ![eRea(U,V) + e; a(v,v) - eReL(V)] == Re (a(u, v) - L(v)) == 0, \Iv E V. The optimality condition J'(u) == 0 for the solution of the problem (1.4) is thus equivalent to Re (a(u, v) - L(v)) == 0, \Iv E V 3 The bracket <, > denotes the antiduality V, V'. 
378 Chapter VII. Linear Variational Problems. Regularity which is clearly equivalent to (1.1) since v can be chosen to be complex. We note that the function J is strictly convex if a is V-elliptic. ,.Jj D Remark 4. Let Vand H be two Hilbert spaces with V  H, V dense in H. If V' is the antidual of V, we can identif y 4 Hand H' and we have (1.5) V  H  V'. Again let a(u, v) be a sesquilinear form. V-coercive with respect to H;5 let us suppose that the injection V  H be in addition compact 6 (see especially Chap. IV for examples of this situation). We then have the following alternative: (1) ker A {O} and A is an isomorphism of D(A) onto H;7 or (2) ker A i= {O}, then ker A is of finite dimension, and the problem Au == f withf E H given, has solution only iff belongs to the image of A (equal to the orthogonal to the kernal of A * which is finite-dimensional). This result is a consequence of the Fredholm alternative (see Chap. VIII, 9 2 ). Indeed, the equation Au == f is equivalent to (A + AoI)u - AoU == f, that is to say (for - Ao E p(A)8) U - Ao(A + AoI)-lu == g where g == (A + AoI)-lf It is sufficient then to verify that (A + AoI)-l is compact; now this follows from the fact that (A + A o I)-l is an isomorphism of H onto D(A), (see Chap. VI, 9 3 ) that D(A) c V and that the injection of V into H is compact. D 2. First Examples We give, for the present, two elementary examples of variational problems. 4 See Chap. VI, 991 and 3. 5 I.e. a(u, v) + Ao(U, v) is V-coercive for suitable Ao. 6 The bounded sets of V are relatively compact in H (see Chap. VI). 7 For the definition of D(A) see Chap. VI, (3.57). 8 p(A), the resolvent set of A is defined in Chap. IV, 9 2 .2.3. 
1. Elliptic Variational Theory 379 2.1. Example 1. Dirichlet Problem Let Q c n be a bounded open set. The space V == H J (Q) is a Hilbert space 9 for the scalar product (u, V)l def f ( t DiU DJ5 + Uv ) dx. Q l=l The sesquilinear form a(u, v) def f ( t DiU DiV ) dx Q l=l is V-elliptic, from Poincare's inequality (see Chap. IV, 9 7 ). LetfE L 2 (Q) be given and L(v) def f./ v dxo The variational problem (1.6) f ( t DiU DiV ) dx == f fv dx, Q l=l Q \Iv E H6(Q) therefore admits a unique solution u E H6(Q). Just now let us interpret the solved problem: let us choose v == cp where cp E (Q) is arbitrary; we have < -L1u,cp) == f ( of. DiUDi<fJ ) dX == <f,cp) {J l=l for all cp E (Q), from which it follows that - L1u == f in '(Q). As on the other hand u E H6(Q) we have (with an interpretation which depends on the regularity of the boundary r of Q, see Chap. II and IV) 'You == ul r == 0, the solved problem is the (homogeneous) Dirichlet problem for the Laplacian. Conversely, if u is the regular solution of the classical Dirichlet problem { -L1u =f u == 0 on In Q 10 , r, the Green-Ostrogradski formula f (L1u)v dx == f a au v dr - f of. DiU Div dx Q r n Q l=l immediately implies that (1.6) holds. 9 au See Chap. IV, 4. We put DiU = -. aX i 
380 Chapter VII. Linear Variational Problems. Regularity We note that everything that comes to be said remains valid, without change, withfE H- l (Q) (instead offE L 2 (Q)). Remark 5. If we wish to solve the inhomogeneous problem { -Jw == f w == g on r In Ql0,ll where g E Ht(r) is given, we can proceed in the following way. The trace map Yo: H l (Q)  Ht(r) having a continuous lifting (see Chap. IV, 4), there exists z E H l (Q) such that YoZ == g. We then make the change of the unknown varia ble w==u+z with the result that u is the solution of the homogeneous Dirichlet problem: { -Ju ==f+ Jz inQ u == 0 on r. The second member f + Jz is then in H- l (Q), but this leaves us free to choose L(v) == <f + Jz, v), where the bracket <,) denotes the duality between H- l (Q) and H6(Q). Obvi- ously, L is continuous on H6(Q). o 2.2. Example 2. Neumann Problem Let Q c IRn be a bounded open set 12 . The space V == H l (Q) is a Hilbert space 13 for the scalar product (u, V)l == f ( .f D i U DJ5 + Uv ) dx. [} l=l We choose the sesquilinear form a(u, v) to be equal to the scalar product (u, V)l, and take L(v) def f /v dx + f /v dr where f E L 2(Q), gEL 2(r) are given. The antilinear form L being visibly continuous on H l (Q), (thanks to the trace theorem proved in Chap. IV, 4), the preceding result (Theorem 1) gives the 10 With "regular" (Lipschitzian) boundary. 11 In addition we suppose here that Q is situated locally on one side of r. 12 Supposed regular (r Lipschitzian), and locally on one side of r. 13 See Chap. IV,  1. 
1. Elliptic Variational Theory 381 existence and uniqueness of the solution of the problem (1.7) f ( t DiUDiV + UV ) dX = f fiJdx + f giJdr [} l=l [} r Vv E H 1 (Q). It remains now to interpret the problem so resolved. If we choose iJ = qJ E (Q), (1.7) can be written < - J u + u, qJ > = < f, qJ > where the bracket denotes the duality between '(Q) and (Q); from this we have since qJ is arbitrary in (Q): (1.8) -Ju + u = f in '(Q). We shall prove that (1.7) also implies the boundary condition: au l . n H - 1/2 (r). Y1 U = - == g an r ( 1.9) For that, we use the following results: Lemma 1. Let H(J, Q) def {v E H 1 (Q); Jv E L 2 (Q)} be provided with the norm IlvIlH(LI,Q) = (1Iv ll1I l(Q) + II JVlli2(Q)Y/2, The trace mapping Yl :(Q)  (r) (or k(r)) defined by14 au Yl u == - an r has an extension, again denoted by Yl, which is a continuous mapping of H(J, Q) into H-t(Q). Proof Let u E (Q). For w E Ht(r), we put l(w) = f r  wdr. From Theorem 4 of 4 of Chap. IV, there exists v E Hi (Q) having w for its trace and such that II V IIH 1 (Q)  ell w II H l/2(T). au 14 The mapping u  - sends (Q) into the functions (r) if r is a oo-variety. In general an r au - E k(r) if r is (k + 1) times differentiable. Here, we can suppose r to be Lipschitzian. an 
382 Chapter VII. Linear Variational Problems. Regularity (1.10) Using Green's formula 15 we can write l(w) == f Juvdx + f t Di UD J5dx. [J [J z=l From Schwarz's inequality, we therefore have l(w)  IluIIH(A,Q) Il v ll H l(Q)  Cli u II H (A,Q)II w II H l/2(T) which proves that I E H-t(Q) and, in addition, that the mapping 'Y1: u E (Q)  I E H- 1 / 2 (r) is bounded on (Q) provided with the norm of H(J, Q); it can therefore be extended by density16 to a continuous linear mapping of H(J, Q) into H-t(r). o Corollary 1. For u E H(J, Q) and v E H 1 (Q) we have the generalized Green's formula ( 1.11) ( 'Y 1 U, v> == f J u . 8 dx + f t D i U DJj dx [J [Jz=l where (,) denotes the duality between H-t(r) and Ht (r). o Let us now prove that (1.9) is valid if u is the solution of the problem (1. 7). From (1.8), we see that u E H(J, Q). The generalized Green's formula (1.11) implies that t/ -Au + u)vdx + ('Y1 U ,V) = f/ VdX + f/ vdr , for all v E H 1 (Q). From (1.8), we thus have ('Y1U - g,8) == 0, \Iv E H 1 (Q) from which it follows that 'Y1 U == 9 in H- 1 / 2 (r). The interpretation of the variational problem (1.7) in terms of a boundary value problem is therefore the following: { - Ju + u == f in '(Q) 'Y1U == 9 in H- 1 / 2 (r) i.e. the (inhomogeneous) Neumann problem for the operator ( - J + I). Conversely, it is immediate that every solution of the "classical" Neumann 15 Also called the Green-Ostrogradski formula; see Chap. II. 16 Indeed it can be proved that (Q) is dense in H(J, Q) (see Lions-Magenes [IJ). 
1. Elliptic Variational Theory 383 problem - J u + u == f In Q au - == g an on r 17 satisfies (1. 7). We shall see other, less "standard", examples of variation problems in 2 which is devoted to examples. 3. Extensions in the Case in Which V and H are Spaces of Distributions or of Functions We shall consider the special case in which (Q) eVe H c /(Q),18 where Q is an open set in [Rn. In addition we suppose that all the injections are continuous and that (Q) is dense in H. That occurs in particular in the examples treated in Sect. 2, with H == L 2(Q). We observe that (Q) is not, in general, dense in V (e.g. (Q) is not dense in H 1 (Q)), and hence V'is not, in general, identifiable with a sub-space of /(Q). Let a(u, v) be a continuous sesquilinear form on V x v. For fixed u E V, we consider the mapping cp  a(u, cp), (Q)  c; this is a continuous antilinear form on (Q). Thus, there exists Pu E ' (Q) such that (1.12) < Pu, cp > == a(u, cp), Vcp E (Q), where the bracket <, > denotes the antiduality between /(Q) and (Q). We have thus defined a continuous linear mapping P: V  /(Q); (if (Q) were dense in V, the operator P would coincide with the operator sf: V  V I defined by < sf u, v > == a (u, v) 1 9). We then denote by A the restriction of P to { D(A) == {u E V; Pu E H, a(u, v) == (Pu, v), Vv E V} (1.13) == {u E V; 3y E H such that (y, v) == a(u, v), Vv E V}. 17 For Q with sufficiently regular (Lipschitzian) boundary r. 18 Here we denote by /(Q) the antidual of (Q). 19 The bracket <, > denotes here the antiduality between V' and V. 
384 Chapter VII. Linear Variational Problems. Regularity In other words D(A) is the set of the u E V such that the antilinear form v  a(u, v) defined on V can be extended to a continuous antilinear form on H. Consequently (1.14) D(A) == D(A) and A coincides with the operator A defined by the triplet (V, H, a(u, v)). We have seen that, if the sesquilinear form is V-elliptic, the operator A is an isomorphism of D(A) onto H. Let G == A -1 be the inverse isomorphism. We consider now the restriction of G to (Q). We see from the inclusion of H in '(Q) that G is, in particular, an operator from (Q) into '(Q). From Schwartz's kernel theorem (see Appendix "Distributions") we know such a mapping is defined by a distribution Gx,y E '(Qx x Qy) by means of the formula ( 1.15) < Gu, v> == < G x, y, u(x)v(y) > for all u E (Q) and v E (Q). In this formula, the first bracket denotes the antiduality between ' (Q) and (Q), and the second bracket, the duality between '(Qx x Qy) and (Qx x Qy). The distribution G x, y so defined is called the Green's kernel of the variational problem (1. 16) a (u, v) == (I, v), V V E V. We shall see in the examples the reasons for this name. We shall return in 3 to the study of the regularity properties of Gx,y. 4. Sesquilinear Forms Associated with Elliptic Operators of Order Two We take Q to be an open set (bounded or not) of n and H == L 2 (Q). Let V be a closed subspace of H 1 (Q) containing H6(Q); we consider on V the sesquilinear form: (1.17) a(u, v) def . t f aij(x)DjuD i V dx + f ao(x)uv dx. l, ) = 1 [} [} We make the following hypotheses: ( 1.18) (i) a ij , ao E L OO(Q) (ii) Re L aiji[j  a(I112 + . .. + lnI2) a.e. x E Q, for i, j all  E C where a > 0 is given (independent of x) (iii) Re ao(x)  f3 a.e. x E Q, where f3 is a prescribed positive constant. 
1. Elliptic Variational Theory 385 It is then immediately obvious that the sesquilinear form a(u, v) is V-elliptic if fJ is strictly positive. If fJ == 0, the sesquilinear form a(u, v) is only V-coercive relative to L 2 (Q) If the matrix of the coefficients aij is hermitian (note that in this case, a(u, v) is \. hermitian if ao E ), the condition (ii) means that the matrix (aij) is positive definite and that its smallest eigenvalue is bounded below by a. Remark 6. If the coefficients aij are real, the condition (1.19) L aij(x)ij  a(I112 + . . . + lnI2) ij for all real i (V  E n), is equivalent to (1.18) (ii). Indeed, if we put 1 b.. == - ( a.. + a.. ) IJ 2 IJ Jl we have L bij(X)i j == L aij(x)i j' ij ij V  E n. Hence the (symmetric) matrix (b ij ) has its smallest eigenvalue uniformly bounded below by a if (1.19) holds. We deduce that L b ij (X)i j  a(liI2 + . . . + lnI2) ij and hence that (1.18) (ii) holds, since Re" a..J:."J". == " b..J:."J". L lJlJ L IJlr Let us now consider a slightly more general sesquilinear form of order two: for example (1.20) b(u, v) = a(u, v) + L r (bi(x)(Di u)i! + b; u Dii!) dx, i J Q where a(u, v) is defined as in (1.17), and where b i and b are given functions of x E Q. Proposition 1. Under the hypotheses (1.18), if in addition bi' b E LOO(Q), i == 1,. . . , n, then the sesquilinear form b( u, v) defined in (1.20) is V-coercive relatively to L 2(Q). Proof. Let us denote by (,)m and II . II m the scalar product and the norm in the space Hm(Q), for m a positive integer (in particular, since HO(Q) == L 2(Q), II 11o is the norm of L 2(Q) and (,) == (, )0). From (1.18), a(u, v) is V-coercive relatively to L 2 (Q), that is to say (1.21) R e a( v, v) + a II v 116  a II v II i, V V E H 1 (Q). 
386 Chapter VII. Linear Variativnal Problems. Regularity The coefficients b i and b being bounded on Q, there exists a constant Co such that I ( biD i u, u) I  CO II u II 0 II U 111 I(bu, Diu)1  Collullollull l for i == 1,. . . , n and arbitrary u E HI (Q). From the inequality 1 8 . afJ  -a 2 + _fJ2, valId for all 8 > 0, we have 28 2 8 2 Co 2 Colluolilluil l  2 Collull l + 2;llullo, from which, by choosing 8 == aj(2C o n), we deduce the inequality C 2 2 " b b ' a 2 on 2 Re '7 [( iDi U , u) + ( i U , DiU)]  - 2 lIull l - 2 a lIuli o from which, by addition with (1.21), we derive the lower bound ( C2 n 2 ) a (1.22) Re b(u, u) + 2 : + ex lIul15  2"llulli VUEH 1 (Q) which proves the desired result. o Remark 7. The preceding result is still true if (1.18) (iii) holds with fJ < 0 finite (the coefficient of II u 116 in the first member of (1.22) is then increased by - fJ). o Remark 8. The operator P:V  '(Q) defined by (1.12) with the help of the sesquilinear form a(u, v) given by (1.17) is the elliptic operator of order two: - L Di(aijD j ) + ao. ij The operator Q: V  '(Q) defined with the aid of the sesquilinear form b (given by (1.20)) by (1.12) contains in addition the first order terms L (biD i - D i b). i o Remark 9. If we wish to solve the problem (1.23) b(u, v) == L(v), \Iv E V H6(Q) eVe H'(Q) and H == L 2 (Q) with where LEV' is given, it is not sufficient to know that b is V-coercive relatively to 
 1. Elliptic Variational Theory 387 H. From Remark 4, this problem can indeed not have a solution (from the Fredholm alternative). In the particular case where b(u, v) is given by (1.20) and V == H6(Q), (the coefficients aij, ao, b i and b being given once for all), we wish to know for what class of open sets w c Q, we can guarantee that the form b is not only H6(W)- coercive relatively to L 2 (w), but is H6(w)-elliptic. The answer is the following: there exist two positive constants Co and p such that Re(b(u, u))  Co Iluili for all u E H6(W), on condition that the open set w is regular and that the diameter of OJ is less than p. In other words, when placed in V == H( w) for w sufficiently small, the problem (1.23) admits a unique solution. This result is a consequence of Poincare's inequality (see Chap. IV, 97) which expresses that Ilullo,w  yp L IIDiUllo,w, VUEH6(W) i where y is a positive constant and p is the diameter of w. Recalling the estimates of Proposition 1, we have: a ( 2n2C2 ) R e b( u, u)  "2 f II DiU II . '" - ex 0 + f3 II u II . '" (where we suppose as in Remark 7 that fJ can be negative). We therefore have [ a ( 2nC )] Re b(u, u)  "2 - y2p2 ex + f3 f IIDiull,,,, Whence we have the stated H6(w)-ellipticity of b, since p is sufficiently small. o 5. Sesquilinear Forms Associated with Elliptic Operators of Order 2m We consider throughout a bounded open set Q of n and H == L 2(Q). Let P(x, D) be a linear differential operator, with coefficients in L 00 (Q), of order 2m where m is a positive integer (see Chap. V, 91). The expression (Pu, v) def t (P(x, D) u (x)) . v(x)dx has a meaning for all u, v E oo (Q). We shall call a sesquilinear form 20 associated with the elliptic operator P(x, D) or 20 Do not confuse this with the notion of the sesquilinear form associated with a selfadjoint operator A in a Hilbert space H (see Chap. VI, 3). 
388 Chapter VII. Linear Variational Problems. Regularity a Dirichlet form, every sesquilinear form a(u, v) defined on OO(Q) x OO(Q), of the type a(u, v) =  t Aj(x, D) u (x) (Bj (x, D)v(x)) dx, where the Aj and Bj are differential operators of order m and such that a( u, v) == (Pu, v), Vu, V E (Q). Remark 10. Several Dirichlet forms a(u, v) with different formal expressions (but certainly identical on (Q)) can be associated with the same elliptic operator P. For example, in Q c 1R2, if P == - j, we can associ ate with it a 1 (u, v) = t grad u. grad v dx, or equally well a 2 (u, v) = t ( :x + i :y )u( :x + i :y )v dxdy. In the same way in Q c IRn, we can associate with P == j2 a 3 (u, v) = t Llu Llvdx or just as well i a2u a 2 v a4(u, v) == L a a a a dx. Q i } . X. x. x. x. · l } l } We now give a coercivity result for the sesquilinear forms associated with elliptic operators of order 2m. Let a(u, v) be a sesquilinear form of the type (1.24) a(u, v) == I f aaP(X) Da u DP v dx, lal m, IPI m Q where a {ai' . . ., an}, fJ == {fJi" . ., fJn} are multi-indices, and a1a1u Dau == a x a 1 . . . a x n an ' lal == a i + a 2 + . . . + an. We say that the sesquilinear form a(u, v) is uniformly strongly elliptic 2i on Q if 21 Note that the terminology adopted in Chap. II, 98 in the case where m = 1 is a little different from that used here. 
 1. Elliptic Variational Theory 389 there exists a constant C > 0 such that for all x E Q and for all  E IRn we have Re L aap(x)aP  CI12m, lal = IPI =m (1.25) 0 - aap E rc (Q) lal = IPI = m, aap E L 00 (Q), otherwise. We then have the following result due to Garding: Proposition 2. Let Q be a bounded open set and a(u, v) a sesquilinear form of the type (1.24) whose coefficients satisfy (1.25); then the following two properties are equivalent: (i) a(u, v) is uniformly strongly-elliptic on Q, (ii) a(u, v) is H(Q)-coercive relatively to L 2(Q): i.e. there exist Co  0, C 1  0 such that Re a( v, v) + Co 1\ v 116  C 111 v \I , \tv E H(Q). For the proof of this result, we refer to Garding-Lions [1]. For the more general case in which H(Q) eVe Hm(Q), we refer the reader to Agmon [1]. Remark 11. In a similar fashion to Remark 9, we could show that the restriction of the sesquilinear form to H(OJ) for OJ with diameter p sufficiently small is H(OJ)-elliptic and not only H(w)-coercive relatively to L 2(W). D 6. Miscellaneous Remarks Before developing other classes af examples, we shall make some remarks. Remark 12. The variational method applies neither to the Dirichlet problem for the totality of elliptic operators, nor to the boundary value problems arising in application. Referring to pp. 200 et seq. of Vol. 1 of Lions-Magenes [1] for a study of the domain of application of this method, let us merely give an example which shows that if a linear differential operator P is elliptic it does not necessarily belong to the variational class (V,H = L 2 (Q), a) where a(u,v) is a sesquilinear form V-coercive with respect to H, associated with the linear differential operator P. Example 3 (See Seeley [1], Lions-Magenes [1 ], Vol. 1, p. 203). In 1R2, let Q be the bounded open set defined in polar coordinates (r, 0) by Q = {(r, 0); rc < r < 2rc, 0  0 < 2rc} and let P be the elliptic operator (in the sense of Chap. V) defined by P = _ ( ei6 ) 2 _ e 2iO ( 1 +  ) ao or 2 . 
390 Chapter VII. Linear Variational Problems. Regularity Then the problem ( 1.26) { (P + A)U == 0 ulr == 0 admits, for all A E [R, the non-null solutions u == sin r cos(fJe - iO) U == sin r sin (fJe - iO); when A -# 0, fJ E C is such that fJ2 == A; for A == 0, we have u == SIn r u == e- iO sin r. Let us suppose that there exists a Hilbert space V contained in L 2(Q), containing E.0(Q), and a sesquilinear form a(u, v) on V x V, V-coercive (with respect to L 2(Q)) such that the boundary value problem is the Dirichlet problem { Pu == f ulr == O. The sesquilinear form a(u, v) would satisfy Re a(u, u)  Cllu" - Ao"u11, and the sesquilinear form a).Ju, v) defined by a)..(u, v) == a(u, v) + A(U, v)o would hence be V-elliptic for all A such that Re A  Ao, and the problem (1.26) would therefore have a unique solution, namely the null solution, in con- tradiction to the found results. o Remark 13. The preceding examples, and most of those of this  1, have referred to elliptic operators (with which we associate coercive forms on Sobolev spaces Hm(Q)). But it must be remarked that ellipticity only plays a role on account of the choice for V of a space linked with the Sobolev spaces. Indeed, suppose, for example, let P be the square of the wave operator (which, obviously, is not elliptic!) Pu == D(Du); let Vbe the completion of E.0(Q), with Q a regular bounded open set, for the norm Ilull == (Du, DU)1/2; the sesquilinear form a(u, v) defined by a(u, v) == (Du, D v) is a V-elliptic form, the boundary value problem associated with d defined by 
 1. Elliptic Variational Theory 391 the Lax-Milgram theorem (Theorem 1) starting from a(u, v): d: V  V' can be interpreted as a "Dirichlet problem" for P. For example, for n = 2, after a change of coordinates transforming P to the form D;D;, the problem obtained is Pu = f ulr = 0 au = 0 av r where au def au au oi' r (t) = x' (t) ox (x(t), y(t)) - y' (t) oy (x(t), y(t)) if r admits the parametric representation (x(t), y(t)). We can generalize what has gone before to every operator P satisfying { <PqJ,ifJ)  0 IlqJllo  CII PqJllo for all qJ E (Q). The variational method thus has a domain of application very much greater than that of the elliptic problems, provided that V is adapted to the operator studied. But the difficulty of applying the method will be trans- ferred to the study of the space V. o 7. Application to the Solution of General Elliptic Problems (of Dirichlet Type) Let Q be a bounded open set in [Rn with regular boundary r. 22 We consider the following problem: Find u solution of: def P(X, D)u == ( - l)la l I Daaap(x)DP u = f lal m, IPI m (1.27) You = ul r = go au Y1 U = - = gl an r am - 1 U Ym-1 U = a m-l = gm-l n r 22 That is to say, r is a variety of dimension n - 1, infinitely differentiable, with Q locally on one side of r. 
392 Chapter VII. Linear Variational Problems. Regularity where n is the outward unit normal vector to r, withf E L 2(Q) a given function together with the functions gk E Hm - k -! (r). We make the following hypotheses: i P(x, D) is a differential operator of order 2m uniformly strongly (1.28) elliptic, that is to say such that Re (p.(x, ))  CI12m, V  E [Rn, where p.(x, ) is the characteristic polynomial of the principal part p. of P (see Chap. V,  1); (1.29) { aap E o(Q) for all a, /3 such that lal == 1/31 == m, and aap bounded and measurable for all (a, /3) satisfying lal + 1/31  2m. The problem (1.27) can be resolved in the following manner: 1 st stage. Generalizing the method of  1.2.1 of solving the problem in u: { LJu == f YoU == g, to begin with we use a lifting of the boundary conditions: as a result of Theorem 5 of Chap. IV, 94, there exists v E Hm(Q) such that Yk v == gk for k == 0 to m - 1. 2nd stage. We make the change u == v + w of the unknown function. The problem (1.27) is then equivalent to the following problem in w: { P(x, D)w == h (1.30) Yk w == 0, k == 0, . . . , m - 1 with h == f - P(x, D)v E H-m(Q). 3rd stage. By Proposition 2, the sesquilinear form a(u, v) defined on H(Q) x H(Q) by a(u, v) = L r aap(x)Dau DPii dx lalm, IPIm JQ (1.31 ) is H(Q)-coercive relatively to L 2(Q). There therefore exists a real Ao such that for all real A  Ao, the operator P(x, D) + AI is an isomorphism of H(Q) onto H-m(Q). This allows to assert the existence and uniqueness of the solution w of the problem: (1.32) { (P(x, D) + AI)w == h Yk W == 0, k == 0, . . . , m - 1. If it is possible to take A == 0 (i.e. if Ao  0) in (1.32), that is to say if the sesquilinear form a(u, v) is H(Q)-elliptic (and not only H(Q)-coercive relatively to L 2(Q)), we have therefore solved the problem (1.30), hence (1.27). 
2. Examples of Second Order Elliptic Problems 393 In the opposite case, the problem (1.30) (or directly (1.27)) will be able to be solved by the Fredholm alternative. Remark 14. Influence of the size of the domain. From Remark 11, we know that if (J) is an open part of an open set Q in which (1.28) and (1.29) are satisfied, and if (J) is sufficiently small, then a(u, v) is H((J))-el1iptic: the problem (1.27) then has one, and only one, solution u E H m ( (J)). A very simple method of under- standing better this phenomenon is the following: Consider solving the problem { ( - A)m u + Qu == f in the ball Bt; of centre 0 and radius 8, small, with YjU == 0 on aBt;, 0  j  m - 1, where Q == L qa(x)D a , qa being given regular functions (for simplicity). lal  2m - 1 If we introduce w by w(x) == U(8X), W is defined in the ball B 1 with centre 0 and radius 1, and has to satisfy in B 1 : ( - L1)m w + 8Q,W = 8 2m 1 (; ) 'Y j w == 0 on a B 1, 0  j  m - 1, where Q,w = Lqa (; }2m-l- 1 a l Daw. It is then clear that, by perturbation, we have existence and uniqueness for 8 sufficiently small. o 9 2 . Examples of Second Order Elliptic Problems 1. Generalities In 2.1, 2.2 and 2.3, Q will be a "regular"23 bounded open set of [Rn with boundary r, and P a second order elliptic operator of the form (2.1 ) P(x, D) == - L Djaij(x)D i + Lbi(x)D i + c(x); i, j i V will denote a closed sub-space of H 1 (Q) containing H5(Q), H will be the space L 2 (Q). Ifwe suppose that the coefficients aij, bi' c are in L OO(Q), we see that we can define 23 That is to say, with Lipschitzian boundary, with Q locally on one side of r. 
394 Chapter VII. Linear Variational Problems. Regularity Pu for all u E H 1 (Q), and that the sesquilinear form ao(u, v) defined by (2.2) ao(u, v) == I(aij D i u, D j v) + I(b i Diu, v) + (cu, v), i, j i will be continuous on V x V. Under the hypotheses of Proposition 1 of 9 1 , ao(u, v) is V-coercive relatively to L 2(Q). Therefore there exists Ao such that for all A  Ao, the sesquilinear form a).(u, v) defined by (2.3) a).(u, v) == ao(u, v) + A(u, v) is V-elliptic. We shall consider other types of sesquilinear forms than the sesquilinear form a). (u, v). F or example, let T E 2(H i (Q), H- i / 2 (r)) be a continuous linear operator from Hi (Q) into H -t(r); we could consider on V x V the continuous sesquilinear form a(u, v) defined by a(u, v) == ao(u, v) + < Tu, Yov> where the bracket denotes the duality between H-t(r) and Ht(r). Or again: Let W be a closed sub-space of Ht(r) and let b(u, v) be a continuous sesquilinear form on W x W. On the closed space V of H i (Q) defined by V == Yo l(W) we can consider the continuous sesquilinear form a(u, v) defined by a( u, v) == a o (u, v) + b (1' 0 u, l' 0 v). 2. Examples of Variational Problems 2.1. Mixed Problem We shall consider the spaces V constructed in the following way. Let r 1 be a closed subset of r; for all v E H i (Q) we can speak of the restriction of YoV to r i -this is an element of Ht(r i ). We take V == {v E H1(Q); YoVIT! == O}. The sub-space W of Ht(r) defined by W == {w E H 1 / 2 (r); wiT! == O} is a closed sub-space of Ht(r). We see that then V == Yol(W) is a closed sub-space of Hi (Q) containing H (Q). 
2. Examples of Second Order Elliptic Problems 395 Let r 2 be the complement of r 1 in r and let gEL 2(r 2) be given. Likewise, let f E L 2(Q) be given; we put, for all v E V, (2.4) L(v) = r fv dx + f giJ dr 2. JQ r 2 From the continuity of the trace map: 'Yo: H 1 (Q)  Ht(r) we deduce that the anti linear form L is continuous on V. We then consider the variational problem (2.5) ao(u, v) == L(v), Vv E V; under suitable assumptions on the coefficients aij, b i and c, the sesquilinear form ao(u, v) is V-elliptic, and this problem admits a unique solution u E V. The difficulty is to interpret the solution of the problem thus obtained. By choosing v == qJ E E.0(Q) in (2.5), we obtain immediately (2.6) P(X, D)u == f in E.0'(Q). In addition, we know, from the definition of V, that (2.7) u == 0 on r 1 . I t remains to interpret the boundary condition on r 2. Let P 2 (X, D) == - L Djaij(x) D i ij be the principal part of P. We put H(P, Q) == {v E H 1 (Q); P 2 v E L 2 (Q)}; in a similar way to the proof of Lemma 1 of  1.2.2, we can show that the trace mappIng au 'Yp: u  _ a Vp r a where _ a denotes the boundary operator defined by Vp (2.8) a au _ a u == I a ij _ a cos(n, xJ, v p ij X j and n denotes the unit vector in the direction of the outward normal to r, can be extended to a continuous mapping 'YP : H(P, Q)  H- 1/2(r). In a similar way to the Corollary 1 of  1.2.2, we have the generalized Green's 
396 Chapter VII. Linear Variational Problems. Regularity formula ( :v: ' v) t - (Pu)vdx + ao(u, V),24 for all u E H(P, Q) and v E H 1 (Q). For u a solution of (2.6) (with f E L 2(Q)), we have t Puvdx = t fVdX . As u E H(P, Q) from (2.6) the generalized Green's formula (2.9) and (2.4) and (2.5) imply (2.9) ( :v u p ' v) = fr2 gvdr 2 , which gives (since v is "free" on r 2)25 (2.10) Vv E V, au - == g on r 2. av p Conversely, if u is a sufficiently regular solution of the mixed problem (2.11 ) P(x, D)u == f in Q u == 0 on r 1 au _ a == g on r 2, v p then we can verify easily that u is the solution of the variational problem (2.5). Remark 1. (1) If u is regular, e.g. u E H 2 (Q), then a Ou E L 2(r) and (2.10) can be written V p f2 ( ::p - g)vdr 2 , \Iv E V and the equality (2.11) is therefore valid in L 2(r 2) and hence almost everywhere on r 2 (in effect for all regular qJ with compact support in r 2' there exists v E V admitting qJ for trace, and on the other hand the regular functions with compact support in r 2 are dense in L 2(r 2). (2) In the general case, we can show that (2.11) holds in (H66 2 (r 2 ))' (anti)dual of 24 With the antiduality Ht(r), H-t(r). 25 See Remark 1. 
2. Examples of Second Order Elliptic Problems 397 the space H52(r 2), itself defined in the followIng way: def H52(r2)={VEL2(r2);3wEH1(Q),w == 0 on r1w == v on r 2 }26 (it differs from the space H 1 / 2 (r 2) which can be defined as the space of the restrictions to r 2 of the functions of H 1 / 2 (r)). The result (2.11) then follows from a trace theorem which states that the mapping 1'0,2 : v E V  YO,2 V == Yovlrz is continuous and surjective from V onto H52(r 2) (see the bibliography given in Lions- M agenes [1], Vol. III, p. 307 and Baiocchi-Capelo [1]). The antilinear form l(v) = ( ::p ' v) is then continuous on V, and hence on H52(r 2) and therefore clearly defines an element of (H52(r 2))' the antidual of H52(r 2). au The formula (2.10) then shows that _ a == g In (H52(r 2))' (in effect V p L 2(r 2) c (H52(r 2))'). o Remark 2. The limiting case r 1 0 reduces to the Neumann problem, while the limiting case r 2 == 0 reduces to the Dirichlet problem. o 2.2. Non-Local Boundary Conditions Throughout, let Q be a regular bounded open set in [Rn, with boundary r. Since H 1 / 2 (r) c L 2(r) c L l(r), we can, for all u E H1(Q), define Sr Yo u dy; the linear form u -4 fQ youdy is thus a continuous form on H 1 (Q). We consider the space V defined by V def {u E H 1 (Q); L youdy = o} This is a closed sub-space of H 1 (Q) and it quite obviously contains H5(Q). For a 26 This space is provided with the topology of the quotient norm II v II HM1(r 2) =- inf Ilwll v . weV wlr 2 = v & wlrl = 0 
398 Chapter VII. Linear Variational Problems. Regularity given f E L 2 (Q), we can consider the variational problem ao(u, v) = (f, v), Vv E V, (ao(u, v) defined by (2.2) with the hypotheses of Proposition 1 of  1) and interpret the corresponding boundary value problem. For that we make use of the generalized Green's formula (2.9) and we find the following interpretation: Pu = f in E.0'(Q) ou . ) _ 0 = constant (not gIven V p t youdy = 0, o . where - IS defined by (2.8). oVp in H -1/2(r) 3. Problems Relative to Integro-Differential Forms on Q x r Throughout this section V is a closed sub-space of H 1 (Q) containing H(Q), but we shall make use of a different sesquilinear form from the se$quilinear form a o ( u, v) defined in (2.2). 3.1. Problem of the Oblique Derivative Let Q be a bounded open set of [Rn, with regular boundary r 27 , let T be a differential operator of order one, tangent to r, defined in the following manner: In each system of local coordinates s = {s l' . . . , Sn _ 1} on r, the operator Tis expressed by n-l ocp Tcp = il IXi(S) OSi ' IX i E 1(r). We can then verify that the mapping u  Tyou of E.0(Q) into 1 (r) extends to a continuous mapping of H 1 (Q) into H -t(r). For example, when Q c [R2 and du (Tyou)(s) = a(s) ds where s is the curvilinear abscissa, and a is a regular function; this result follows from the continuity in the sense of distributions d ds : H 1 / 2 (r)  H- 1 / 2 (r). 27 That is to say, here, of class rei (at least), with Q locally on one side of F. 
2. Examples of Second Order Elliptic Problems 399 We can then consider on H 1 (Q) x H 1 (Q) the sesquilinear form b(u, v) defined by (2.12) b(u, v) == ao(u, v) + < Tyou, Yo v) where the bracket denotes the anti-duality between H -t(r) and Ht(r) and ao(u, v) is the sesquilinear form defined by (2.2) with the hypotheses of Prop- osition 1 of  1. We show that the sesquilinear form b(u, v) is H 1 (Q)-coercive relatively to L 2(Q). For that, we first of aU show that there exists a constant Co such that IRe < Tyo v, Yo v)1  Co II Yo v II 12(r). This is the essential point. This follows from the fact that the expression: i acp 1 f a Re (X i(S)- a ijJdr=- 2 Re (X i(S)- a Icpl2dr Q Si r Si (after integrations by parts) does not contain derivatives of cp. Then we show that, for all s > 0, there exists C( s) > 0 such that (2.13) IIYovIIL 2 (r)  sllvll 1 + C(s)lIvll o on using for example standard theorems on the trace and the fact that the injection of HS(Q) into H S ' (Q), S > Sf, is compact (Q is bounded and regular (see Chap. IV, 6)). We then make use of Proposition 1 of  1; there exist constants (X > 0 and A E [R such that, for all u E H 1 (Q) Re ao(u, u)  Clilull - Allull6. Taking s == ta/co in (2.13), we see that Re b(u, u)   Ilull - (A + C( 2:J ) Ilu116, from which the desired result follows. It is then possible, by addition of a positive constant to the coefficient c of (2.2), to obtain a H 1 (Q)-elliptic sesquilinear form b(u, v). The variational boundary value problem relative to the sesquilinear form b(u, v) is then interpreted (with the help of a Green's formula which can be justified) as f Pu = f in Q, 1 ( ap + T)U = 0 in H- 1 / 2 (r). This is a problem of the regular oblique derivative (at every point of r, the differential operator  + T is transversal to r). av p 
400 Chapter VII. Linear Variational Problems. Regularity 3.2. Robin's Problem This problem can be considered as a particular case of the above. Throughout let Q be a regular bounded open set in [Rn with boundary r and let h E L OO(r) be a function defined on r. We consider the sesquilinear form b(u, v) defined on H 1 (Q) x H 1 (Q) by b (u, v) == ao (u, v) + < hy 0 u, Yo v > where ao(u, v) is defined throughout by (2.2) with the hypotheses of Proposition 1,  1, and where the supplementary term is written def f (hyo u, Yo v> = r h(s)u(s)v(s) dy (in a classical sense, since H 1 / 2 (r) c L 2(r)). The V-coercivity with respect to H of the sesquilinear form b(u, v) (with V == H 1 (Q)) poses no problem, if (2.13) is used. The variational problem thus posed (by taking a coefficient c in (2.2) sufficiently large) can be interpreted formally as: (2.14) { Pu == f in Q, ( Op + h)U 1r = O. This problem is called Robin's problem. 4. Transmission Problem Let P(x, D) be a second order elliptic operator of the form P(x, D) == - L DjaijD i , x E [Rn 28 ij We shall suppose that the regular bounded open set Q in [Rn is made up of two sub-domains Ql and Q2 separated by an interface (a regular hyper surface) L, with the result that Q == Ql U L U Q2. Let r be the boundary of Q, r 1 the boundary of Ql and r z that of Q2. We have, in addition, r 1 n r 2 == f. 29 28 We could, more generally, consider an operator of type (2.1) 29 We can have especially the following two figures: Fig. 1 Fig. 2 
2. Examples of Second Order Elliptic Problems 401 Let at be the restricion of aij to Qk' k = 1,2; we suppose that the coefficients at are continuous on Qk' for k = 1,2, but that they can be discontinuous across E. We put, for k = 1, 2 adu, v) = r (  atDiUDjV ) dX J Qk lJ with the result that (2.15) a(u, v) = t (t: a ij DiU DjV) dx = a 1 (u, v) + a 2 (u, v). Let f E L 2(Q) and V = Hb(Q); we call U E V the solution of the variational problem (2.16) a(u, v) = (f, v), \:Iv E V (the existence of u is assured by suitable ellipticity hypotheses). Then let U I and U 2 be the restrictions of u to QI and Q2; we shall show that U I and U2 are solutions of a transmission problem, i.e. of a problem posed on Ql for u 1 , coupled with a problem posed on Q2 for u 2 . Let Pk(X, D) = - L Djat(x) D i ij be the restriction of the operator P(x, D) to Qk' By choosing v ({Jk E f0(Qk)' k = 1, 2 in (2.16), we deduce immediately that (2.1 7) PkU = h in '(Qk)' k = 1, 2 where h = flQk is the restriction off to Qk' k = 1, 2. On the other hand, since U E H(Q) we have the boundary conditions Uk = 0 on r n r k , k = 1, 2. It remains to interpret the "transmission" conditions on E expressIng the coupling between U 1 and U 2 . First of all, from the continuity of the trace on E, we must have (2.18) (i) U I = U 2 on E. (2.18) (ii) Finally, we shall prove that oU I oU 2 - + - = 0 on E, oV P1 oVpz where O au k , k = 1, 2 is defined in a manner similar to au , the normal vector nk v Pk oV p being chosen exterior to Qk (in particular on E, we have n i + n2 = 0). For that, 
402 Chapter VII. Linear Variational Problems. Regularity we proceed in a "formal" manner by supposing that the Green's formula (2.19) f 8U iJdr k = r - (Pku)iJdx + ak(u, v), "Iv E (Q). r k Pk J Qk is applicable. Now, from (2.17) we have L r (PkUk)iJdx = r fiJdx, k J JQ from which, on making use of the Green's formula (2.19) and the variational equation (2.16), we derive (2.20) L r iJdrk = f (  +  ) VdL == 0, Vv E V, k Jr k aV Pk L aV P1 aV P2 where we have used the fact that v is null on r. Since v is "arbitrary" on L, we deduce the transmission condition (2.18) (ii). Remark 3. If we wish to establish rigorously the transmission condition (2.18) (ii) and know in which space it is valid, we must proceed in a manner similar to that of Lemma 1,  1.2.2 in the case of Fig. 1 which we shall consider here (the case of Fig. 2 presents no particular difficulty). Let U E (Q); for W E Hb(Q), we put f ( aUi aU2 ) l(w) == _ a + _ a wdL L v P1 v P2 == L f :Uk wdr k = L r - (PkUk)wdx + a(u, w) k Tk V Pk k JQk where we have applied Green's formula (2.19) which is permissible since U is very regular. We deduce that l(w) :S; CllwIIH'(Q) (IIUIIH'(Q) + f II PkUk II L 2 (Q)). Let Z12 be the space of functions U E Hb(Q) such that PkU k E L 2 (Qk)' k == 1,2 where Uk == UIQk is the restriction of U to Qk; we see that for U E Z12 the anti- linear form I is continuous on Hb(Q), and hence on Hb2(L)30 from the trace theorem quoted in Remark 1. We deduce that I E (H52(L))', and that the def 30 With H62(1:) == {v E L 2(1:), 3w E H6(Q) such that WIE = v}, this space being provided with the quotient norm II v II = inf II w II Hl(Q)' WE H5(Q) wlI = v 
g2. Examples of Second Order Elliptic Problems 403 mappIng YE : U E f!C(Q) n H(Q)  I E (H} (17))' can be extended to a continuous linear mapping of Z12 into (H62(17))'. In other words the functions of Z 12 admits a trace OU OU2 ( 1/2 ( )) ' _ 0 + _ 0 E Hoo 17 . v P1 vP 2 In addition, we have the generalized Green's formula \ OUl OU 2 ) " i - _ 0 + _ 0 ' v =  - (Pkuk)vdx + a(u, v) v P1 v P2 k Qk for U E Z 12 and v E H(Q), where the bracket is interpreted as the antiduality between (H62(17))' and Hoo(17). The rigorous analogue of (2.20) is thus \ OUl + OU2 v) _ _ 0 1 Vv E Ho(Q) oVP 1 oVP 2 ' , from which OU 1 OU 2 2 - + - = 0 In (H6 (17))' OVPl oVP 2 from the surjectivity of the mapping v H viE of H(Q) into H62(17). This shows (with Remark 1) once more: (1) that the formal interpretations can be justified, (2) that the variational formulations avoid many complications. D Conversely, let U 1 and U 2 be solutions, supposed to be sufficiently regular 31 of the transmission problem: i) Pi U 1 = 11 In Ql ii) P 2 U 2 = 12 In Q2 (2.21 ) iii) U 1 =0 on r 1 nr iv) U 2 =0 on r 2 n r (2.22) i) U 1 = U 2 on 17 00) au! au 0 11 -+-= on 17, OVPl OV P2 31 We recall that the study of transmission problems of the type (2.21), (2.22) with the study of their solution (hence of U 1 and u 2 ) has been done in Chap. II, g8. 
404 Chapter VII. Linear Variational Problems. Regularity then the function U whose restriction to Qk is Uk for k = 1, 2 satisfies (1) U E H (Q), by generalization of Proposition 2, 4 of Chap. IV, (2) for v E H (Q) '\' f au k _ a(u, v) = I ak(u k , v) =  _ a v + (f, v) k k r k v Pk from Green's formula (2.19). Finally, the transmission condition shows that a (u, v) = (I, v), Vv E H(Q). Remark 4. We can thus verify that P 1 u 1 = f1 In Q 1 P 2 U 2 = f2 In Q2 plus the transmission conditions (2.22) between U 1 and U 2 on r, imply that Pu =f in '(Q). D Remark 5. Other boundary conditions. Naturally, we should have been able to envisage other boundary conditions on r than conditions of Dirichlet type. The method is the same, and the transmission conditions unchanged. D Remark 6. We can also remark that the transmission conditions show that if the coefficients are discontinuous across L:, the normal derivative of u will involve a measure on L: and hence that u rt H 2 (Q). D Remark 7. It is interesting to state exactly the conditions (2.22) in a particular case which occurs often in practice: the case in which { a 1 <5 ij , x E Q1 aij(x) =  Q a 2 U ij, x E 2 (2.23) with a 1 and a2 E C and Real and Rea2 > 0, a 1 =1= a 2 . Equation (2.22) then becomes a condition on the normal derivatives of U 1 and U 2 on L:: (2.24 ) au 1 aI- a n E aU2 = a2- an E where n is the normal to L: directed from Q1 towards Q2. D 
2. Examples of Second Order Elliptic Problems 405 5. Miscellaneous Remarks Remark 8. We have seen (Remark 10 of  1) that a given elliptic differential operator can correspond to several sesquilinear forms. We shall return, by an example, borrowed from Lions-Magenes [1J, Chap. II, on this question. Let J be the aplacian in the plane: Let Q c [R2 be a regular bounded set and let C E reOO(Q) be a real function. To study the Neumann problem in Q, we have used in  1.2.2, the sesquilinear form a(u, v) on H 1 (Q) defined by r ( au au au au ) a(u, v) = JQ aX ax + oyoy dxdy and the Green's formula r - r au_ - JQ Lluvdxdy = a(u, v) - Jr an vdr. We can also associate with the sesquilinear form b(u, v) on H 1 (Q) defined by i ( au av au av au av b(u v) == -- + -- + c-- - , Q ax ax ayay ax ay ac au _ ac au _ ) + --v - --v dxdy ayax ax ay , the corresponding Green's formula being - fQ Lluvdxdy = b(u, v) - fr G + C : )VdY au av c-- ayax where a / ay denotes the derivative along the element of arc r. In other words, the boundary value problem, corresponding to the given sesquilinear form b(u, v) in H 1 (Q) == V, that is to say the "Neumann problem" for this form, can be interpreted by ! -Ju ==1 au au an + C oy = 0 on r; this is a problem of the regular oblique derivative for the Laplacian (see Sect. 3.1). Thus, with the same elliptic operator P, we can associate two different sesqui- linear forms (see also Remark 10 of  1) and hence two different boundary value pro blems. In Q D Remark 9. The hypotheses "Q a bounded open set, situated locally on the one side of its boundary r, a variety of class re oo " are often made for simplicity. The hypotheses are too restrictive in a certain number of situations (transmission problems, see 
406 Chapter VII. Linear Variational Problems. Regularity Sect. 4, open sets with "corners" etc.). As long as we are neither interested in regularity properties of the solutions (see 93) nor in a precise characterisation of the spaces of possible boundary values for inhomogeneous problems, we can enlarge the differentiability conditions on the boundary of Q; we can suppose it to be piecewise-ct 1 or Lipschitzian, for example, (without touching the other hypotheses: Q bounded and on one side only of its boundary). One of the crucial points is the possibility of extending the functions of Hm(Q) to elements of Hm([Rn), which is possible under sufficiently general conditions (see Chap. IV, 4 and also see Agmon [lJ for example). An important situation involving domains not situated locally on only side of the boundary is that of cracks (we can refer to the collective work I.I.M ecanique de la rupture", (Ecoles CEA-INRIA-EDF, 1982-83)). Thus let (Fig. 3) Q be an open set of [R2 whose boundary oQ is composed of the I.l. ex terior boundary" r and of the "internal boundary" ff which models a crack 32 ; in the neighbourhood of .:F, the domain Q is locally on both sides of ff. We look for a function u which is a 'I. su itable" solution of - L1 u = f In u = 0 on Q, oQ = r u Y , where f E L 2 (Q) is given. This problem admits a unique solution (the general theory, valid with an arbitrary open set, is of course valid!) in the space H  (Q). But it is not true that u is in H 2 (Q) (which would be the case if there were no crack). However, we know exactly how we must I.l.correct" u to obtain a function in H 2 (Q). Denoting by Q the open set Q u _ff (i.e. without crack; see Fig. 4) and by w an arbitrary open set, contained in Q, and containing the origin (chosen to be the internal extremity of ff). Fig. 3 Fig. 4 32 Refer to the work cited for the physical justification - very partial- of this model. 
2. Examples of Second Order Elliptic Problems 407 We introduce Sdx) = 1m (Xh + iX2)1/2 = r l/2 sin  if Xl + lX2 re w . Then there exists a constant c such that u - cS 1 E H 2 (w) (with the same constant c whatever w is). This type of result (see P. Grisvard [1 J, [2J which contains more general results) is very useful in numerical applications (to the "base functions" serving for approximation, for example for the finite element method, we add S 1). o 6. Application: Stationary Multigroup Equation for the Diffusion of Neutrons We propose to solve the following problem: Find (ug)g = 1 to G the solution of the system of equations G f i) - div(d g grad u g ) + h.f: l (JghUh = /g in Q, 9 t ii) u g = 0 on r, 9 = 1 to G, where Q c [R3 is a bounded domain with boundary r. This system of equations models the diffusion of neutrons in a nuclear reactor (see Chap. lA, 5) with a given source of neutrons, represented here by (1;)g = 1 to G. This inhomogeneous problem is interesting 34 only in the case where the medium under consideration is sub-critical (see Chap. lA, 5 - for this notion). The (real) coefficients d g , and (J" gh are supposed to satisfy the following conditions: 1 to G 33 , (2.25) (2.26) (i) d g & (J"gh E L 00 (Q) for all g & h (ii) there exists l5 > 0 such that 1 to G, dg(x)  l5 a.e. x E Q, g == 1 to G. 33 This notation is to be compared with that of the analogous equation IA (5.36). To simplify the writing, we add here the diagonal matrix ( + L:i) and the matrices ( - L:f -+ g) and (- KgVg'L:) which furnish the matrix (jgg' denoted here by (jgh' The notation is equally modified for Dg, sg, f/>g; they become d g , /g, u q respectively. 34 The physical problem most frequently studied is that of a critical medium without source, this leads to equations of the type (2.25), but with /g = 0 (g = 1 to G). This is a spectral problem (see Chap. VIII and XII). 
408 Chapter VII. Linear Variational Problems. Regularity We shall see that this problem (2.25) is an example of a variational problem, choosing the Hilbertian framework (nevertheless see Remark 10 below on this subject) below: Let us put v = (H(Q))G, H = (L 2 (Q))G,35 these spaces being provided respectively with the norms: ( G ) 1/2 lIullv = g11IugIl11(il) , ( G ) 1/2 II U II H = g 1 II u g III2(.Q) . Introducing, on V, the bilinear form: def r ( G G ) a(u, v) = In gl dggraduggradv g + g.l (JghUhVg dx where u = {u 9 } 9 = 1 to G V = {v 9 } 9 = 1 to G E V. The problem (2.25) is then equivalent to the variational problem:find u E V such that (2.27) a(u, v) = L(v), Vv = {V 9 }9=1 to G E V, def G f with: L(v) = gl fVgdx,/g E L 2 (Q)(or, even/g E H- 1 (Q)), for g = 1 to G. Under the hypotheses (2.26), the bilinear form a( u, v) is continuous on V x V; let us study the coercivity of a( u, v). Let M be a positive real number such that: II d g IILcc  M and II O"gh IILcc  M for g and h = 1 to G. We verify immediately that the bilinear form a( u, v) is V-coercive relatively to H: (2.28) a(u, u) + /lo II u II  £5 II u II, Vu E V with /lo = M G 2 + £5. If (2.28) is verified with /lo = 0, the medium is said to be "sub-critical"36, and from the Lax-Milgram, the variational problem (2.27) admits a unique solution in V. If, on the contrary, (2.28) can only be satisfied with /lo > 0, the problem (2.27) does not necessarily have a solution (the solution of (2.25) or of (2.26) could be obtained by the Fredholm alternative). 35 The spaces here are taken to be real. 36 We shall see in the sequel that this notion coincides with the definition that we give then of sub- criticality in a general framework. (Recall that these notions have been introduced intuitively in Chap. lA, 5). 
2. Examples of Second Order Elliptic Problems 409 Remark 10. (1) The interpretation of Remark 9 of  1 is here the following: if the diameter of the domain {J is sufficiently small (the properties of the materials, hence the coefficients d g and ()" gh being fixed), then the bilinear form is V-coercive, and the medium is "sub-critical". (2) In general ()" gh =1= ()" hg' and the bilinear form a (u, v) is not symmetric. (3) We have solved the problem (2.29) for a "sub-critical" medium by making use of the pair of spaces V == (Hb(Q))G and H == (L 2 (Q))G. In fact the natural space in which to work on the physical problem considered is the space (L 1 (Q))G (rather than (L 2 (Q))G because f L ug(x)dx, which rep- ng resents the total flux of neutrons (for u g  0, V g), has to be finite. The domain Q being bounded, the space (L 2 (Q))G is contained in (L 1 (Q))G (with continuous injection and density), with the result that the solution of the problem (2.27) in (L 2 (Q))G (for f == {!;} 9 = 1 to G E (L 2 (Q) )G) belongs also to (L 1 (Q))G. Note however that we have not obtained the solution of the problem for the given functions h ELl (Q), h rt L 2 (Q), g == 1 to G. A natural approach to the problem (2.25) in the monokinetic case (G == 1), in the space L 1 (Q) is the following: We consider A a 2nd order elliptic operator with regular coefficients, and in an open set Q with regular bound ary 37. [All that we shall say is valid for a 2nd order system or an elliptic system of arbitrary orderJ. Consider then the problem { Au == f in Q (2.29) U == 0 on r. We show that iff E W -1, P(Q) (1 < p < 00), then there exists a unique solution U E Wb' P (Q) of (2.29). This is the result established in this chapter if p == 2 and which can be extended, by means of the theory of singular integrals, to the case p =1= 2, 1 < p < 00. o We can deduce from that the results relative to the case f EL l (Q). In effect (2.30) L 1 (Q) C W- 1 ,p (Q) . f 1 I . 1 1 ( 1 1 1 W,P (Q) c LOO(Q) 1.e. -; - - < 0 - + - == p n p' p 1} 37 With Q c [Rn, n arbitrary, for greater generality. 
410 Chapter VII. Linear Variational Problems. Regularity In fact we then have W1,p' (Q) c rcO(Q) and hence 9Jl(D) c W-1,P(D) if 1 -  -  < 0 38 P n (2.31 ) n I.e. p < (p arbitrary but finite if n == 1). n - 1 Hence: (2.32) Iff ELl (Q) (and even iff E m(Q), for example if f = b(x - a) = Dirac mass at the point a E Q), then U E W6,P(D) with p < n 1 (p arbitrary but finite if n n- 1 ). (4) It remains to determine the regularity of the solution thus found, which shall be done in 3, and to verify that if the source f = {h} 9 = 1 to G is positive, then the solution {u g } 9 E G (representing the flux of neutrons) is itself39. (5) Many other types of boundary conditions are possible (see Chap. lA, 5): existence and uniqueness results can be stated in a similar fashion for the corresponding boundary value problems. D Remark 11. Global balance of neutrons. Let us suppose that f == { h } 9 = 1 to G E (L 2 (Q) )G. We take the scalar product of (2.25) (i) with the element v = {V g }g= 1 to G, such that v 9 = 1 in Q (characteristic function of Q). We then obtain by integration by parts (2.33) gtlfrdg  dr + g,tlf/J9hUhdX - gtffdX. This relation can be considered as the global balance of the neutrons: -the 1st term ( - f f dg OUg dr ) is the flux of neutrons across the 9 = 1 r on surface r; - the 2nd term ( - f f (J gh U h dx ) represents the internal production of the h, 9 = 1 Q neutrons (or their internal absorption); -finally, the last term is due to the neutron sources in the medium. We recall 38 Where 9Jl(Q) denotes the space of bounded (Radon) measures on Q. 39 That necessitates supplementary hypotheses on the coefficients (Jgh and on Q, notably: Vg, h, 9 i= h, (J gh is non positive on Q (see Stewart [1]) and will be a consequence of the Krein-Rutman theorem. 
2. Examples of Second Order Elliptic Problems 411 that the total flux of neutrons is here given by the expression L f ugdx. 9 = 1 to G il It is possible to make local balances of neutrons by integrating (2.25) (i) over (open) domains Q' contained in Q. This will be done, for example, in Chaps. XV and XVIII. D 7. Application: Statical Problems of Elasticity 7.1. Introduction We shall study here the statical problems of elasticity (problems of the type III of Chap. lA, g2). In a continuous medium occupying the domain 40 Q c [R3 subjected to given volume forcesf(x) and surface forces F(x) on a part r F of the surface r or Q, we plan to determine the displacement field u(x) (vector in [R3 with components U i , i == 1, 2, 3), and the stress tensor O"(x) (symmetric 3 x 3 tensor with com- ponents O"ij, 1  i, j  3). They satisfy the equations (2.34) a - L -;:-O"ij == 1;, x E Q, 1  i  3 j uX j (2.35) 0" ij == L a ijkh Bkh (u), 1  i, j  3 kh (2.36) where B( u) is the linearised strain tensor associated with U by the formula 1 ( au. au. ) Bij(U) = 2 ax: + ax ' 1  i,j  3. The stress tensor satisfies on r F the boundary conditions (2.37) "O"..n. == F. on r F  lJ J l j where n == (n j) is the unit vector of the exterior normal to r. On the com- plementary part r u == r\r F , we suppose that the continuous medium is fixed, so we have the boundary conditions (2.38) u i == 0 on r u 41. 40 We shall suppose in what follows that Q is a regular bounded open set (see Theorem 1) situated locally on one side of r. 41 We shall suppose in the sequel that r u is a closed part of r. 
412 Chapter VII. Linear Variational Problems. Regularity 7.2. Variational Formulation Let H be the sub-space in (L 2 (Q))9 of symmetric 3 x 3 tensors with square- integrable components (this space is isomorphic to (L 2 (Q))6 since a symmetric 3 x 3 tensor has only six independent components). We introduce into H the scalar product (0", ,) = fn 0" ., dx where (J. L = L (JijLij is the Euclidean scalar product in [R9. Then, let V={vE(H 1 (Q))3;V=O onr u } be the space of admissible displacements (of finite energy), and L( v) = r f. v dx + r F. v dT JQ Jr F (in this case, we take the spaces to be real, since all the physical quantities to be found are real). IffE (L 2 (Q))3 and F E (L 2 (r F ))3 we shall suppose hereafter, LEV'. We then have the following result, called the "The theorem of virtual power". Lemma 1. Let (J E H be a stress tensor; then the following conditions are equivalent: (i) (J satisfies the equation (2.34) and the boundary condition (2.37) together with (2.38); (ii) (J satisfies the variational equation ((J, c;(v)) == L(v), \Iv E V. 42 Proof. This result is an immediate consequence of the Green's formula: i f i o(J.. (J.c;(v)dx == (Jijnjvidr -   lJ vidx. il r lJ il lJ U X j The implication (i)  (ii) presents no difficulty if we suppose that (J is suf- ficiently regular for the integrals on the right side to exist. The converse (ii)  (i) involves the same difficulties as the interpretation of the mixed problem (see Remark 1, Sect. 2.1). Using the same technique we find that if (ii) holds, then (2.34) is satisfied in '(Q) and (2.37) in ((H62(rF))3)'. D 42 Where for v E V, we denote by e( v) the strain field associated with v, defined by (2.36). 
2. Examples of Second Order Elliptic Problems 413 Remark 12. The physical interpretation of this result is the following: In the stationary case the total virtual power of all the forces both internal - i (Y. .8. . ( v ) dx 1J 1J il and external r fvdx + r Fvdx Jil Jr F is zero - this being true for each virtual displacement defined by a field v E V (see 9 3 . 3 of the Appendix "Mechanics" of Chap. IA). Here, we have taken advantage of the conservation laws for (2.34), (2.37) and (2.38) (see Chap. I, relations (2.14), (2.15) and (2.16)) and we deduce from them the "Theorem of virtual power"; but we have seen in 3 of the Appendix "Mechanics" of Chap. IA that we can adopt a different point of view which would establish the above physical interpretation in a principle called "the principle of virtual power" and from which we should then deduce the relations (2.34) and (2.37) when we have imposed the conditions (2.38) on the choice of the set of the "virtual displacements". D Lemma 1 shows that the desired displacement field U E V has to be the solution of the variational problem (2.39) a(u, v) == L(v), \Iv E V where a( u, v) is the bilinear form on V x V defined by (2.40) a(u, v) = (0", s(v)) = r L aijkhBkh(U)Bij(v)dx. J il i, j, k, h From the hypotheses made on the coefficients a ijkh (see (2.8) to (2.11), Chap. lA, 9 2 ), the bilinear form a( u, v) is symmetric and positive. We shall see 43 that under the assumption of the existence of a > 0 such that (2.41 ) L aijkhXijXkh  aX. X 44 ijkh for all X E ;, that is to say for X a 3 x 3 symmetric tensor, the bilinear form a( u, v) is in fact V-elliptic and therefore there follows the existence and uniqueness of the displacement u in the problem (2.39) and hence (2.34), (2.37), (2.38) by the Lax-Milgram theorem. 43 See the corollary below. 44 The hypothesis (2.41) implies the strict convexity of the functional of potential energy introduced later in Sect. 7.4 and hence the uniqueness of its minimum and so of the solution. To deform a material satisfying (2.41), we must hence provide a strictly positive energy. 
414 Chapter VII. Linear Variational Problems. Regularity Remark 13. (1) The coefficients of elasticity a ijkh can depend on x in the case of a hetero- geneous medium. (2) We recall that in the case of a homogeneous, isotropic material, the coef- ficients a ijkh can be expressed in a simple way as functions of Lame's constants A and /1 (see Chap. lA, 2.3) and that the inequalities /1 > 0 and k  3A + 2/1 > 0 seen in 2.3 of Chap. IA are sufficient to ensure the existence of rx such that (2.41) is valid. D 7.3. Korn's Inequality Theorem 1 (Korn's inequality). If Q is a bounded open set with regular bound ary 45, there exists a constant C > 0 depending on Q such that (2.42) L (s( v) . s( v) + v. v) dx  q v II L where II . 111 denotes the norm in (H 1 (Q))3. \Iv E (H 1 (Q))3 Remark 14. By definition of II . 111, we have (2.43) II viii = L(vv.vv + v.v)dx where Vv(x) is the 3 x 3 tensor with components (ovi/ox j ), 1  i,j  3, which is not, in general, symmetric. From the properties of the decomposition of a tensor into a symmetric part and an antisymmetric part, we have 1 Vv  8(V) + "lcurl v, and 1 Vv. Vv  8(V). 8(V) +"l curl v. curl V. 46 This shows that Korn's inequality (2.42) is not obvious: in effect it proves that if 45 For example, piecewise ri 2 . 46 In this formula, the term curl v. curl v denotes the scalar product in [R3 of curl v with itself; here curl v is considered as a vector, and not as an antisymmetric tensor. 
2. Examples of Second Order Elliptic Problems 415 8(V) is a symmetric 3 x 3 tensor such that 8(V) E (L 2 (Q))9 and v E (L 2 (Q))3 then (i) Vv == gradvE(L 2 (Q))9 (ii) v E (Hi (Q))3 (iii) curl v E (L 2 (Q))3 (that is to say the "control" of the symmetric part of the gradient of a function v implies that of the antisymmetric part of the gradient of this function v and hence the control of the whole gradient of v). o Remark 15. The result of Theorem 1 is quickly established in the particular case in which ru == r, and thus: V == (H6(Q))3. Applying Green's formula, we have  f OVi oV j d  (f oVi d f 02Vi )  -- x ==  n.v.- r - v.dx ij QOX j OX i ij r l J OX j [JOXiOX j J  (f ( OVi OVi ) i OVi OV j ) ==  niVj- - nj-V j dr + --dx ij r OX j OX i Q OX i OX j = L l diVV 12 dX. where we have used the fact that v E (H6(Q))3 is null on r. As s(v).s(v) = L ( OVi + OVj ) 2 = L [( OVi ) 2 + ( OViOV j )] l ' J .4 ox. ox. 2 l ' J ' ox. ox. ox. J l J J l we have, for v E (H6(Q))3: LS(V).S(V)dX = L Vv. Vvdx + LldivVl2dX from which the result follows. o Remark 16. It is interesting to remark that f s(v). s(v)dx =  f I curl vl 2 dx + r Idiv vl 2 dx, Q 2 Q JQ \Iv E (H6(Q))3, so L (Vv. Vv)dx = LICUrlVl2dX + LldiVV12dX, v E (H6(D»3 
416 Chapter VII. Linear Variational Problems. Regularity which gives the norm of the gradient of each function v E (Ha(Q))3 in terms of the norm of the curl and the divergence of this function. D The proof of Korn's inequality in the general case rests on Lemma 2. If Q is a bounded open set with sufficiently regular boundary47 we have IIpllL 2 (Q)  C(IIpllk- 1 (Q) + IIVpll(H- 1 (Q))3}. In other words ifp E H- 1 (Q) has its derivatives in H- 1 (Q), then p E L 2 (Q). Proof of Theorem 1. Everything is based on the identity V 2 V i v v v v v = -;- 8ik ( v) + -;- 8 ij ( v) - -;- 8 jk ( V ) X j X k uX j uX k uX i which shows that if eij(v) E L 2 (D) ( so ak eik(v) E H- 1 (D) since the derivative is continuous from L 2 (Q) into H- 1 (Q) ) , then a 2 v; E H- 1 (D). We can then vxjvx k VV i . vV i 2 .. apply Lemma 2 to p = -;-; thIS shows that -;- E L (Q), whIch IS none other uX j uX j than Korn's inequality. D With the view of establishing the V-ellipticity of the bilinear form, defined in (2.40), we establish the following result: Theorem 2. Under the hypotheses of Theorem 1, if r u has non-zero measure, then there exists a positive constant C such that (2.44) in e( v) . e( v) dx  CII V lIl(Q))" \Iv E V. Proof We prove first of all that Ile(v)IIH = (Le(v).e(V)dX Y/2 is a norm on V, that is to say that if v E V and II 8 ( v) II H = 0  v = o. But II 8(V) IIH = 0  8(V) = 0 a.e. in Q; 47 See Duvaut-Lions [1], p. 112, for the case in which Q has a regular boundary, and Bolley-Camus [1] for the case in which Q is polyhedral. 
2. Examples of Second Order Elliptic Problems 417 now 8( v) = 0 implies that v is of the form 48 (2.45) v(x) = a + b x x with a, b E [R3 gIven. In other words v is an infinitesimal rigid body dis- placement. But r u being of non-zero measure, the boundary condition v = 0 on r u implies that a = b = 0, in other words that v = o. We then reason by reductio ad absurdam: if (2.44) were not true, there would exist a sequence {v n } E V such that II V n "(L 2 (Q))3 = 1 and " 8( v n ) "H  O. This sequence being bounded in (H 1 (Q))3 it follows, from Theorem 1, that we can extract a weakly convergent sub-sequence {v n '} converging to v E V in (Hi (Q))3, from which it follows that "8(V) 11 2  lim inf 118(V n ,) 11 2 = 0 n' - 00 from which, after the preceding, we have v = O. But the injection H 1 (Q)  L 2 (Q) being compact, we deduce also that the sequence {v n '} converges strongly to v in (L 2 Q)3, from which we have II v II (L2(Q))3 = 1 hence, aeon tradiction. D Corollary 1. Under the hypothesis (2.41) and the hypotheses of Theorem 2, the bilinear form a( u, v) defined in (2.40) is V-elliptic. Remark 17. Let R be the set of infinitesimal rigid body displacements of the form (2.45). If r u = 0, the preceding proof would show that the bilinear form a(u, v) is V-elliptic on the quotient space V = (H 1 (Q))3jR from which we deduce in this case, the existence of the displacement u to within an infinitesimal rigid body displacement, provided the linear form L( v) defined by L(v) = LfoVdX + LFoVdr fE L 2 (D)3, FE L2(T)3 is likewise defined on the quotient space, that is to say if L vanishes on R, in other words if the general resultant of the external forces, and their resultant moment are zero. D 48 Where b x x denotes the vector product of the vector b E [R3 by the vector x E [R3. 
418 Chapter VII. Linear Variational Problems. Regularity Remark 18. The method used to prove Theorem 2 could be applied also to the Neumann problem for the Laplacian: -L1u==f on Q au an == g on r. We could show therefore that this problem admits a solution u defined within a constant in H 1 (Q), iff Lf dX + frgdr = O. D 7.4. Application to Problem (2.39) The above Corollary 1 enables us, by use of the Lax-Milgram, to conclude the existence (and the uniqueness - to within infinitesimal rigid body displacements in the case r v == 0) of the displacement u, solution of the variational problem (2.39). The bilinear form a( u, v) being symmetric, we know (see Remark 3,  1) that u is also solution of the optimisation problem: find u E V satisfying J ( u) == inf J ( v ) V E V where J: V  IR is the quadratic functional defined by 1 J(v) == -a(v, v) - L(v). 2 This "minimum principle" has a physical interpretation, the functional J being interpreted as a potential energy (see Mandel [1J), and the displacement field solution u hence appearing as the element from the set of kinematically admiss- ible displacement fields (i.e. those satisfying the boundary conditions) which gives the minimal potential energy. The space (H 1 (Q))3 also appears as the space of displacements having a finite energy, and this explains why it is the "good" space in which to work. 7.5. Inhomogeneous Problem We finish this Sect. 7 by saying some words on the inhomogeneous problem, that is to say, solving (2.34) to (2.37) but replacing the boundary condition (2.38) by (2.38)' u. == U. on r v l l , i == 1 to 3, 
92. Examples of Second Order Elliptic Problems 419 where Vi E H 1 / 2 (r) is given, and which corresponds physically to the case in which we apply an imposed displacement on the part r u of the boundary. Then let Uo E (H 1 (Q))3 such that Uo == V on r; we put w == u - Uo with the result that w satisfies the homogeneous boundary conditions. We find then that w E V is the solution of the variational problem a(w, v) == L(v) - a(u o , v) which, by the Lax-Milgram theorem, therefore admits a solution, since L(v) - a(u o , v) is clearly a continuous linear form on V. We deduce the existence and uniqueness of the solution u of the inhomogeneous problem (2.34) to (2.37) and (2.38)'. It is interesting to note that if we put Ol1 ad == {v E (H 1 (Q))3; V == V on ru}, the displacement u of the inhomogeneous problem is then the solution of the optimisation problem (with constraints): find u E Ol1 ad such that . 1 J(u) == Inf J(v) (where J(v) == -a(v, v) - L(v)) v E °71 ad 2 which is interpreted again as a minimum energy principle. Remark 19. Elasticity operator. With the bilinear form a( u, v) defined on V x Vby (2.40), we can associate, as in 1 of this Chap. VII, and in Chap. VI, 3: (1) considering a(u, v) as a continuous bilinear form.on V x V, an operator d continuous from V into V', the dual of V by a (u, v) == < d u, v >,49 V V E V, U E V. As a result of Theorem 2 of this 250 and of the Lax-Milgram theorem, d is an isomorphism of V onto V', if r u has non-zero measure. But we must note that if r F == r\ru also has non-zero measure, the space V is different from the space (H6 (Q))3 and V' is not a space of distributions; (2) the continuous linear mapping, denoted by P, of V into (H - 1 (Q ))3 defined by (see 1, Sect. 3) ( PU)i = - L -!- ( L a ijkh 8kh ( U ) ) , j = 1 a 3 u X j k, h = 1 a 3 l 1 to 3 49 Here the bracket <, > denotes the duality V, V'. 50 We suppose again that the open set Q is bounded. 
420 Chapter VII. Linear Variational Problems. Regularity and thus such that: a(u, cp) = <Pu, <p >, u E V, & cp E Cgfi(Q))3 or cp E (H6(Q))3 where <, > denotes the duality between ' (Q) and (Q) or between (H- 1 (Q))3 and (H6(Q))3; (3) considering now the real Hilbert space (L 2(Q))3, and a(u, v) as an un- bounded bilinear form in (L 2 (Q))3, the operator A, unbounded in (L 2 (Q))3, defined by one of the following equivalent ways: (i) D(A) = {u E V; du E (1 2 (Q))3} (ii) D(A) = {u E V; Pu E (L 2 (Q))3, a(u, v) = (Pu, v) Vv E V} (iii) D(A) = {u E V; such that the map u  a(u, v) is continuous on V with the topology of (L 2 (Q))3} and Au = Pu, u E D(A). It follows from Theorem 2 of this 92 and of Chap. VI that A is a closed operator, with domain dense in (L 2 (Q))3, and that A is an isomorphism of D(A) onto (L 2 (Q))3. Moreover, by the symmetry of a(u, v): tA = A. 51 D 8. Statical Problems of the Flexure of Plates We have discussed in Chap. IA (92.5.7) the equations of the flexure of plates in the context of the Love-Kirchhoff theory. The median section of the plate at rest being represented by an open set Q c 1R2, we look for a real function w(x), x E Q, representing the transverse displacement of the plate and a field of symmetric tensors M, of order 2 (the flexure tensor) such that [ a2w ] 52 (2.46) Map = - G h vLlw Dap + (1 - v) oX a oX p on Q (IX, P = 1, 2) where G h = Eh 3 /12(1 - v 2 ) is a coefficient depending on the Young's modulus E, and on the Poisson's ratio v of the material forming the plate, and h the thickness of the plate. 51 Complexifying the space (L 2 (.Q))3 (i.e. considering complex functions), we should therefore obtain a self-adjoint operator A. 52 The coefficients given in this expression (and in many works; see, e.g. Duvaut-Lions [IJ) are different from those given in Chap. IA (2.78). Thjs arises from slightly different mechanical situations (v, E are related to A, fJ by IA (2.27) and (2.28)). 
2. Examples of Second Order Elliptic Problems 421 LetfE L 2 (Q) be the transverse pressure supported by the plate; we then have the equilibrium equation (see Chap. lA, 92)  0 2 MaP (2.46)'  a a = -f on Q. ap Xa x p Finally, the boundary conditions can be of several types. We shall suppose that the plate is clamped on a part r u of its boundary and is free on the other part r F . 53 We thus have the boundary conditions ow w = 0 - = 0 on r u , an (2.47) (2.48) (i) M n = 0 and (ii) oM, + Q = 0 on r F aT where M n = LMapnanp ap (n - (na) is the unit vector of the outward-drawn normal to r), Mt = L Mapna Lp ap L = (La) is the unit vector to the tangent to r (making an angle + tn with the outward-drawn normal), Q = LMap,pna. ap We introduce the space H = (L 2(Q)) of the fields of symmetric 2 x 2 tensors, square integrable on Q, and the space v = {v E H 2 (Q); ov v = 0, an = 0 on ru} of the fields of kinematically admissible transverse displacements. We then establish a "Theorem of virtual work". 53 In the sequel, we shall suppose that Q is a regular bounded open set (with a piecewise-2 boundary) of [R2, situated locally on one side of its boundary r, and that r u is a closed part of r. 
422 Chapter VII. Linear Variational Problems. Regularity Lemma 3. Let M E H; then the following two conditions are equivalent: (i) M satisfies the equilibrium equation (2.46)', and the boundary conditions (2.47), (2.48) (ii) M satisfies the variational equation (2.49) (M, D 2 v) = - (f, v), \Iv E V where in the second member of (2.49) (. , .) denotes the scalar product in L 2 (Q) and where in the first member of (2.49), (. , .) denotes the scalar product in H defined in terms of the Euclidean scalar product in 1R4, M . N = L MaP NaP, by ap (M,N) = t M.Ndx; finally D 2 v is the 2 x 2 tensor formed from the second derivatives of v: a 2 v (D 2 v)aP = a a ' a, [3 = 1,2. Xa x p Proof. The result is a consequence of Green's formula i a2v i ( av ) i 02 M MaP a a dx = - MaP,pna v - M"pnp  dr + ap vdx Q Xa x p r vX a Q oXtY. oX p which implies (by summation over a and [3) 2 i " 0 2 MaP i f ( av av ) (M,D v) = 1...; 0 a vdx - Qvdr + Mn + Mt dr Q ap Xa x p r r vn vI: f "a 2 M ap i ov f ( aMt ) = L... a a vdx + Mndr - --;- + Q vdr Q ap Xa xp r vn r vI: where we have used an integration by parts on r. We see immediately that (i) => (ii). Conversely if (2.49) holds for all v E V, we have (2.46)' immediately, in the sense of distributions, in Q. We then deduce with the help of Green's formula that r av r ( a Mt ) Jr u M n an dr - Jr u ----a;- + Q v dr = 0, \Iv E V; as v and avian can be chosen arbitrarily on r F , we deduce (by a purely formal argument) that M n = 0 on r u 
2. Examples of Second Order Elliptic Problems 423 and oM t  + Q = 0 on r F (the rigorous argument goes through by Sobolev space methods and is techni- cally a little complicated). D Conclusion. We then deduce immediately from this result that the displace- ment w, solution of (2.46)' with (2.46), (2.47) and (2.48) is the solution of the variational problem (2.50) a( w, v) = (f, v), \Iv E V, where (2.50)' a( w, v) = G h f} v Aw Av + (1 - v )D 2 w D 2 v] dx. We verify (inequality of Poincare type) that the bilinear form a(u, v) is V-elliptic on condition that the measure of r u is non-zero. In this case, by the Lax-Milgram theorem, we can assert the existence and uniqueness of the solution w of the problem (2.50). In the contrary case (signifying that the plate is free on the whole of its boundary) we can prove the existence of a displacement w to within an affine function (the affin functions are interpreted as the rigid transverse displace- ments of the plate) on condition that the mean value off is zero, and that its first order moments are zero, which can be interpreted as a transverse loading on the plate whose general resultant and whose resultant moment are both zero. D Remark 20. (1) The results on local regularity of 3 (see also Chap. V, 2) show that if f E L 2 (Q), then U E Hc(Q). Even in the purely clamped case (r u = rand r F = 0), the global regularity resultf E L 2 (Q) => u E H4( Q) is only true under the hypotheses of regularity on the boundary r. In particular if r is a convex polygon, we only have u E H 3 +e(Q), with 8 > 0 sufficiently small (see Kondratiev [1]). If r possesses re-entrant angles, we have regularity results only to within singular terms (see 2.5 and Grisvard [1] [2]). (2) We could have envisaged the case of a plate simply resting for which the boundary conditions are { Mn = 0 on r w = 0 on r (see Chap. lA, 2.5. 7). 
424 Chapter VII. Linear Variational Problems. Regularity We come back again to the variational problem (2.50), but choosing V = H 2 (Q) n H(Q) (instead of V = H6(Q) as in the purely clamped case). In this connection, we point out "Babuska's paradox" (see Necas [1]): let (QN )N> 0 be a family of polygons tending to an open set Q when N  00 (supposing, for example, that QN is a polygon with N sides). Then, let UN be the solution of the problem obtained by replacing Q by QN (in other words substituting a plate with polygonal boundary for one with curved boundary); the sequence UN does not converge to U as N  00 (for any "reasonable" topology). This will pose some problems for the approximation of solutions by the method of conformal finite elements (see Chap. XII). o Remark 21. Operator for the flexure of (bounded) plates. As for the preceding example, we can associate with the bilinear form a( u, v) defined on the space V = {v E H 2 ( Q); v = :: = 0 on r u } by (2.50') a(w, v) = G h t (vAwAv + (1 - v)D 2 wD 2 v)dx continuous mappings - again denoted by .91, P - defined respectively on V into V', and on V into H- 2 (Q), by: a(w, v) = <dw, v), Vw E V, V E V (with <, > denoting the duality between V and V') a(w, v) = <Pw, v), Vw E V, V E (Q), or v E H6(Q) (with <,) denoting the duality between (Q) and' (Q) or between H6(Q) and H- 2 (Q)). Thus P is expressed by Pw = -G h L i7 i7 [ VAW£5",p + (1 - v) i7 2 w ] a, p Xa Xp iJx a iJx p [ iJ4w ] =-G h vL1 2 w+(1-v)L iJ 2 c 2 ; a, p Xa Xp if r u has non-zero measure, then (by the Lax-Milgram theorem) .91 is an isomorphism of V onto V' (but V' is not a distribution space if r F has non-zero meas ure ). o 
3. Regularity of the Solutions of Variational Problems 425 3. Regularity of the Solutions of Variational Problems 1. Introduction We have seen that the variational problem: Problem PD(Q,f) Find u E H(Q) such that L Vu. Vvdx = <f, v), \Iv E H6(Q) where f E H- 1 (Q) is given, admits a unique solution, u E H(Q), satisfying - L1u = f in gfi' (Q). We are interested in the possible regularity of this solution, in particular to examine the connection with the idea of a "classical" solution whose existence for the Dirichlet problem has been established in Chap. 11 54 . In the case in whichf E H- 1 (Q), there is, in general, no supplementary property of u related to the connection with H(Q), since - L1 is an isomorphism of H(Q) onto H- 1 (Q). We are then led to consider the case wherefE L 2 (Q) or elsefE Hk(Q) with k  0, and we shall see that under suitable hypotheses for the boundary of the open set Q, the solution u of the problem PD(Q,f) then belongs to Hk+2(Q). Owing to the injection theorems of Sobolev (established in Chap. IV, 3) we thus deduce, according to the values of the integer k, the order of differentiability of the solution' of the problem. These results will allow us also to give the regularity properties of the Green's function. For a more general variational problem, associated with a boundary value problem on an open set Q c n, three factors are involved in determining the regularity of the solution: (1) the regularity of the coefficients of the elliptic operator, (2) the geometry of the open set Q and in particular the regularity of its boundary, (3) the type of boundary conditions imposed. As far as the regularity result itself is concerned it may be placed in one of two subdivisions of partial results. 54 Note also that the study of the regularity of the solution of this problem has been made in Chap. II, 96. 
426 Chapter VII. Linear Variational Problems. Regularity 1. A result of "regularity in the interior" applicable to the interior of the open set Q and which depends only on the regularity of the coefficients of the elliptic operator. In the case of the Dirichlet problem or, more generally, of a problem involving an elliptic operator of order two, the result is stated in the following way: fE Hfoc(Q) => u E Hfo2(Q) where we recall that Hfoc(Q) = {f E !0' (Q); fep E Hk(Q), Vep E !0(Q)}; this type of result compares with the properties of the hypoelliptic operators with constant coefficients established in Chap. V, 4. 2. A result of global regularity, that is to say up to the boundary of the open set, being stated (for the Dirichlet problem or more generally for a problem involving an elliptic operator of order two): fE Hk(Q) => U E Hk+2(Q). F or this type of results, the last two factors listed above (geometry of the open set and type of boundary conditions) are involved in a crucial manner. By way of an example, let Q c 1R2 be a 'bounded polygon, there exist functions U E H(Q) such that - L1 u E rc 00 (Q), but U  HI + : (Q) where OJ is the measure of the largest of the internal angles of the polygon. In particular, if the polygon is not convex, the solution of the Dirichlet problem cannot be in H 2 (Q) even if the second member f is very regular. 2. Interior Regularity We shall now prove the following interior regularity theorem: Theorem 1. Let Q be an open set of IR n , and def P(x, D) == L aa(x)D a lal  2m an operator associated with a uniformly strongly elliptic sesquilinear form of order 2m with coefficients aa(x) E rcoo(Q), lal  2m. Then u E !0'(Q) and Pu E Hfoc(Q) imply: U E Hfo 2m(Q) for all k E lL. Thanks to Sobolev's injection theorems (Chap. IV, 3) we deduce in particular that the operator P is hypo-elliptic (see Chap. V, 2 or 4), that is to say that Pu E rcoo.(Q) implies that U E rc oo (Q). The interior regularity result depends upon an a priori inequality we shall establish first of all. 
93. Regularity of the Solutions of Variational Problems 427 Proposition 1. Let Q be an open set in [Rn, OJ a bounded open set such that w c Q and def P(x, D) = L aa(x)D a letl  2m a strongly elliptic operator 55 in Q with infinitely differentiable coefficients. For every integer s, positive or zero, there exists a constant C > 0 such that, for all u E H(OJ) we have (3.1) II u II S  C ( II Pu II S - 2m + II u II s - 1 ) where II . II s denotes the norm of H S (OJ ). Proof The proof is carried out in three stages (see Lions-Magenes [1], Chap. II, 3). First of all, using the Fourier transform, we shall establish the result in the case of a homogeneous operator with constant coefficients. We then extend this result to an operator with variable coefficients by noticing that locally this is a perturbation of an operator with constant coefficients. It will then be easy to treat the general case of a non-homogeneous operator with variable coefficients. 1 st stage (P homogeneous with constant coefficients). The idea is to employ the Fourier transform; u E H (OJ) can be extended to a distribution, again denoted by u, of H( [Rn) (sub-space of the distributions of H S ([Rn) with support in the compact set OJ). We have (3.2) () = (2in)2m ( L (aaa)u() ) letl = 2m /'--.. (the denoting the Fourier transforms) using the fact that P is homogeneous with constant coefficients. We shall now make use of: "- the ellipticity of P: there exists c > 0 such that (3.3) L aaa  cl12m, letl = 2m -the standard inequality (1 + I  \)S  2 k (1 + I  1 2 )s - k (1 + I  12k) valid for all  E [Rn, all s E [R and all kEN, - the inequality, obvious for all k > 1: II u II S - k  II u II S - l' leading, after integration, to the inequality (3.4) Ilull;  2 2m c- 1 1IPull;_2m + 2 k llull;_1, 55 See Chap. V, 92 Definition 2, formula (2.6). 
428 Chapter VII. Linear Variational Problems. Regularity from which, in an obvious way, we obtain the sought inequality in this case by taking C = 2 m [sup(c-1, 1)]1/2. 2nd stage (P homogeneous with variable coefficients). We shall denote by Pxo the homogeneous operator with constant coefficients Pxo(D) = P(x o , D). The strong ellipticity of P in Q is translated into the existence for all x E Q of constants c(x) such that L aa(x)a  c(x)1  1 2m . lal = 2m We then remark that OJ being compact, we can take c(x) independent of x in w; we shall denote this constant by c. The result of the first stage then allows to assert that there exists a constant Co, independent of Xo E cO, such that for all u E H(w) and Xo E OJ, we have (3.5) (3.6) Iluli s  Co(IIPxoulls-2m + Ilull s - 1 ). The problem is thus to evaluate (3.7) II Pu - P xou lis - 2m = L (aa(x) - aa(xo))Da u "Ial = 2m s - 2m For that, we shall make use of the following lemma: Lemma 1. Let Q be a bounded open set of n, k and 1 two positive integers, Xo E Q and cp a rc 00 -function in Q such that cp (x o ) = O. Then for all 8 > 0, there exists b > 0 such that (3.8) II cpDa u Ilk -1 < 811 U Ilk for all a c Nn such that lal = 1 and all u E Hk(Bt5(xo)) (Bt5(xo) denotes the ball contained in Q of centre Xo and radius b); in addition b depends only on sand land the pper bounds of cp and its derivatives of order less than or equal to sup (l, k, 1) on Q. The proof of this lemma is simple enough, we can deduce it, for example, from Lemma 3.2 of Chap. II of Lions-Magenes [1]. To evaluate (3.7), we shall majorize the terms of the form (3.9) II (aa(x) - aa(xo))Dau lis - 2m where u E H(w) and lal = 2m. The preceding lemma will ,give such an eval- uation when the support of u is small. We know that thus there exists b a > 0, which we can take to be independent of Xo in the compact set OJ from the last assertion of the lemma, such that (3.10) 1 II(a",(x) - a,.(xo))D"'ull s -2m ::::; 2n 2m C o lIuli s 
3. Regularity of the Solutions of Variational Problems 429 for all U E H(Bf> (x o )), where Co is given by (3.6). By taking Cl b == inf lal = 2m b a and remarking that there are at most n 2m terms (3.9), we see, by applying (3.6) that II U lis  Co (II p xoU lis - 2m + II U lis - I)  Co (II PU lis - 2m + II PU - P xoU lis - 2m + II u lis - I) 1  C o (IIPull s -2m + 2C o Iluli s + Ilulls-d from which it follows that (3.11 ) Iluli s  2C o (IIPulls-2m + Ilulis-l) for all U E H(B(xo)). It now suffices to remark that the compact set w can be covered by a finite number of balls of radius b. We take a differentiable partition of unit sub- ordinate to this covering: (t/1j)Ijq say. Using (3.11) we have: 'lulls  L Iit/ljulis  2C o L (IiPt/ljulls-2m + IIt/1 j ulls-l). I jq 1 jq Now (3.12) II pt/1 j u Ils- 2m  II t/1jPU IIs- 2m + II Pt/1 j u - t/1jPU IIs- 2m. Starting from (3.12), we obtain, by obvious inequalities (3.13) IIPt/1 j ulls-2m  C 2 1lpulls-2m + C 3 11ulls- I as (Pt/1 j - t/1 jP) is an operator of order 2m - 1 with rc OO coefficients in Q, and hence acts continuously from H-1 (co) into Iis-2m(co). We deduce (3.1) in an obvious way. 3rd stage (General case). P can be written in the form P(x, D) == pO (x, D) + pI (x, D) where po is the principal part of P po (x, D) == L aa(x)D a lal = 2m and where pI is an operator of order less than or equal to 2m - 1. Obviously we do not change pI U by multiplying the coefficients of pI by a function of E0(Q) taking the value 1 on w, in other words we can suppose that the coefficients of pI are in E0(Q). We then have (3.14) II PI U lis-2m  C 4 11 U lis-I. 
430 Chapter VII. Linear Variational Problems. Regularity Now, from the second stage, there exists a constant C s > 0 such that II u lis  C s ( II pOU lis-2m + II u Ils-1)  Cs(II(P - p 1 )ull s - 2m + Ilull s - 1 )  C S ( II Pu lis-2m + (1 + C 4 ) II u Ils-1) on using (3.14). Hence we have established the proposition. o From Proposition 1, we shall now extract the interior regularity result stated above (Theorem 1), but first we shall introduce a method of proof which will often serve us well in the future. It is a question of the method of differential quotients (Nirenberg [lJ). First of all let us give some notation. Let x E lR n ; we shall denote by LX the translation by the vector x, and hence, by LX u, the image of the distribution u by LX. Now let (e j )l jn be the canonical base of lR n and h E lR; we shall denote by L1 J the differential quotient of the distribution . 1 L1  u == h (The j U - U). (3.15) More generally, let a E f\jn be a multi-index and H a set of lal real numbers H == (h jk) 1 j  n ; 1 kr:xj we denote (3.16) IHI L Ihjkl 1jn 1  k  aj and (3.17) L1u == ( n n L1 jk ) U. 1 jn 1 kaj In the sequel we shall make use of the following three lemmas. Lemma 2. Let s be an arbitrary integer S6 ; let a be a multi-index and u a distribution in HS(lR n ); Dau E HS(lR n ) iff L1u remains bounded in HS(lR n ) when IHi  o. Proof We shall consider only the case lal == 1, in other words take Aa _ Aj LJh - LJh. 56 The result remains valid, with the same proof, if s is an arbitrary real number. 
93. Regularity of the Solutions of Variational Problems 431 Since U E HS(rR n ), we can take the Fourier transform of iJi U: 57  . sinn.h iJu() = 2iemjh h J u(). Let us suppose that Dju E H S ; using the fact that sin njh h  nljl for all h and all  j we find that II L1  u II;  f (1 + I 12)' 12injfi() 1 2 d II Dj u II; which is the desired result in this case. If Dju  H S ,. then lim r (1 + 112)' 12inju(W d + 00. A -+ 00 J II  A In other words, for all B > 0, there exists A such that r (1 + 112),12nju(W d ;::, B. JIIA sin n  . h . Now on every compact set, h J tends unIformly to nj when h  O. Hence for every 8 > 0, there exists h, 0 < h < 8, such that II L1  II; ;;:: r (1 + I W)' 2 sin n  j h U () 2 d  ;::, B . JIIA h 2 Hence, we have the second part of the lemma. o Lemma 3. For all S E Z and all cp E !/(rR n ), the mapping u  [iJ, CPJu = iJ(cpu) - cp.iJu from HS(rR n ) into Hs--Ial + 1 (rR n ) is bounded. 58 This lemma can be proved without difficulty. 5 7 We take here: u() = i e - 21ti(x . ) u(x) dx (as for (3.2)) IR n 't heJ U = u(x + he j ). 58 This result is valid for all S E . 
432 Chapter VII. Linear Variational Problems. Regularity Lemma 4. Let P(x, D) == L aa(x)D a be a differential operator with coefficients lal i in ?(lR n ). Then the commutator [LiB' P] is a bounded mapping of HS(lR n ) into H S - 1 - l a l +l(lR")for all S.E Z; the norm of this operator is independent of H. 58 This is almost a direct consequence of the two preceding lemmas and of the fact that, for all {3 E f\jn and every distribution J(DPu) == DP(Ju). Before proving Theorem 1, we shall first of all give the proof in the following particular case: Lemma 5. The hypotheses being those of Theorem 1, let us suppose that, in addition, u E Hfo 2m - 1 (.0); then u E Hfo 2m(Q). Proof. It must be shown that, for all qJ E (Q), qJU E Hk+2m(Q), knowing that qJU E H k + 2 m-l (.0) and that qJPu E Hk(Q). We see immediately that then P(qJu) E Hk(Q). In effect, [P, qJ] is a differential operator of order 2m - 1 at most, and P( qJu) == qJPu + [P, qJ] u. Let us suppose, first of all, that k + 2m - 1  O. Let j E {I, . . . , n} and h > 0, the distributions JqJU59 (notation (3.15)) all belong to H+2m-l(w) for a fixed bounded set w such that OJ C .0, when h is sufficiently small. We can therefore apply (3.1) 11L1qJullk+2m-l  C(IIP(L1qJu)llk-l + 11L1qJullk+2m-2) from which, using Lemma 4, we have (3.18) 11L1qJullk+2m-l  C'(II L1P(<pu)llk-l + IlqJullk+2m-l + 11L1<pullk+2m-2). By making use of Lemma 2, we see then that the second member of (3.18) is bounded, from which we have the conclusion sought from this same Lemma 2 since then DjqJu E Hk+2m-l(Q) for allj E {I,. . . , n}. If k + 2m - 1 < 0, we are led to the preceding case by the following method. By identifying qJU by its extension by 0 outside of .0, we can consider its Fourier transform q;u and define a distribution v on .0 as the restriction to .0 of the distribution W on lR n whose Fourier transform is ./""'-... "" qJU W = (1 + 112)1 ' 59 We denote LJ (qJu) by LJ qJU. 
3. Regularity of the Solutions of Variational Problems 433 In other words, v is a solution in Q of (1 - L1)1 V = qJU. It is immediate that v E H k + 2m - 1 +21(Q) and that, from the preceding P(1 - L1)IV E Hk(Q). We take I such that k + 2m - 1 + 21  0; we are led to the preceding case for the elliptic operator P(1 - L1)1 of order 2(m + I) and the distribution v. From this we easily deduce the stated result. D Proof of Theorem 1. Throughout let qJ E (Q), and qJPu E Hk(Q), qJU has compact support, so there exist S E Z such that qJU E HS([fn), each distribution being locally of finite order. If S  k + 2m - 1, on account of the preceding there is nothing to prove. If S < k + 2m - 1, qJPu E H s - 2m + 1 (Q) since k > S - 2m + 1; as qJU E HS(Q), Lemma 5 allow'S us to assert that qJU E HS+1(Q), from which, by a finite number of steps of this type, we are led to the case s  k + 2m - 1. D 3. Global Regularity of the Solutions of Dirichlet and Neumann Problems for Elliptic Operators of Order Two The global regularity results are, in general, technically complicated to establish. We shall develop ony a simple case but one which allows us to show the general ideas clearly. We shall give other results in the following section. We denote by Q a regular bounded open set of n (with oo boundary) and by r its bound ary 60. N ow let  be the half-space n = { X E [Rn. X > O } + , n and Br+ = {xE;lxl < r} the open half-ball with centre the origin and radius r contained in . We know that for all Xo E r, there exists a neighbourhood V of Xo in n and a diffeo- morphism lJ such that { lJ( V n Q) = Br+ lJ(x o ) = O. We shall make use of this property in the sequel. _ Let P(x, D) be a differential operator of order 2 with oo (Q)-coefficients P(x, D) = - L Djaij(x) D i + L bj(x)D j + c(x). lin ldn ljn 60 With Q situated locally on one side of r. 
434 Chapter VII. Linear Variational Problems. Regularity We shall suppose P to be uniformly strongly elliptic in Q. We associate with it the integro-differential form on Q defined by (2.2). However, we shall now denote this form by a(u, v); V will denote either the space H5(Q) or the space HI (Q); the sesquilinear form a(u, v) is V-coercive relatively to L 2 (Q). By the possible addition of a positive constant to the function c(x), we can suppose that the sesquilinear form a(u, v) is V-elliptic. Finally, for allf E L 2(Q), we denote by u the solution of the variational problem { Pu == f u E D(A) where D(A) is the sub-set of the u E V such that the mapping v I---+- a(u, v) is continuous on V provided with the norm of L 2 (Q). The preceding hypotheses and notation will, without repetition, remain valid throughout this section. We shall now state the principal result. Theorem 2. Under the preceding hypotheses, iff E H k (Q), then u E Hk+ 2 (Q), Vk E N. Before proving this result, we shall give some corollaries. The first among these concerns the inhomogeneous Dirichlet problem: iffE H-I(Q) and if g E Ht(r), we know that there exists a unique solution v E HI(Q) of Pv == f, Yov == g, V E HI(Q). (3.19) 3 Corollary 1. Under the hypotheses of this section, iff E Hk(Q) and g E Hk+z(r), then the solution v of (3.19) belongs to H k + 2 (Q), for all kEN. 3 Proof The trace mapping Yo being surjective from Hk+ 2(Q) into Hk+z(r), (see Chap. IV, 4, Theorem 5) and admitting a continuous lifting, we know that we can find W E Hk+ 2(Q) such that Yo W == g. Putting u == v - w we have Pu E Hk(Q). In effect, P operates continuously from Hk+2(Q) into Hk(Q), so Pw E Hk(Q); on the other hand u E H5(Q) (characterisation of H6(Q) in HI(Q) by the nullity of the trace on r), u belongs therefore to Hk+2(Q) from Theorem 2, giving the required result. D - - Corollary 2. Under the hypotheses of this section, iff E CfjOO (Q), then u E CfjOO (Q). This is an immediate consequence of 3 and 4 of Chap. IV. D 
3. Regularity of the Solutions of Variational Problems 435 Before coming to the proof properly so-called of Theorem 2, we shall give a preliminary result on estimates in the half-balls playing for Theorem 2, the role Lemma 1 played for Proposition 1. The proof is quite technical; we shall give only the main steps. Lemma 6. Under the hypotheses of this section (u E D(A) and Pu == f), let us suppose that Q == Br+ andfE Hk(B r +), k  0; then for all p < r, u E H k + 2 (B;) and there exists a constant C > 0 (depending only on p and k) such that (3.20) II u IIH k + 2 (B;)  C( IlfIIHk(B r +) + II U IIHI(Pr+)). Remark 1. Br+ is not a regular open set, but this hypothesis does not enter the proof. D Proof. It is made in several stages each of which we shall incorporate into a lemma. First of all we need a variant of the method of differential quotients. Lemma 7. Let u E Hk(B r +), be null in the neighbourhood of Ixl == r; let a E f\jn with an == 0, then Da u E H k (Br+) iff L1 u is bounded in H k (Br+) when h --+ O. This lemma can be obtained by a suitable extension of u to [Rn and an application of Lemma 2. We can also state in this case a lemma analogous to Lemma 3. By the method of differential quotients, we then obtain: Lemma 8. Under the hypotheses of Lemma 6, let p < r, I  k + 1 and Y E f\jn such that lyl == I and Yn == 0, then DY u E H 1 (B;) and there exists a constant C (depending only on p and I) such that (3.21) II DY u II H I (B;)  C ( II fll Hk(Br+) + II u II HI (Br+ )). Besides the method of differential quotients, the proof of this lemma uses induction on I (for I == 0 the result is obvious) and the coercivity of the sesquilinear form a. It is now necessary to take into account the derivatives with respect to x n ; it is the subject of the following lemma. Lemma 9. Under the hypotheses of Lemma 6, let p < r, I  k + 2 and Y E f\jn with Y n I; then D Y u E L 2 (B;) and there exists a constant C > 0 such that (3.22) II DY u IIL 2 (B;)  C(llfIIHk(B r +) + II U IIHI(Br+)). The proof of this lemma uses the ellipticity of P in the following form: the coefficient an,n(x) never vanishes on Br+; we therefore have 1 D;u == - (f - Q(x, D)u) an, n where Q is an operator of order 2, containing no derivative with respect to X n of order strictly greater than 1. We can then argue by induction of I: the result is 
436 Chapter VII. Linear Variational Problems. Regularity true for I = 0 and for I = 1 from Lemma 8 (pay attention to the passage from HI to L 2 between (3.21) and (3.22)). Let us put for I  2, y' = (Y1'. . ., Yn-1, Yn - 2). We have DY u = DY'D;u = DY' [ (f - Q(X,D)U) ] . an,n Now Iy'l  k and DY' Q(x, D) only contains derivatives with respect to X n of order I - 1 at most and is of order k, from which (3.22) follows by use of the induction hypothesis. o Consequently, Lemma 6 becomes almost obvious from (3.22); it suffices to write the norm of u in Hk(B;) as a sum of norms in L 2(B;) of elementary derivatives of u, and apply (3.22) to each of them. We can find a complete proof of the lemma in Lions-Magenes [lJ, Chap. II, 4, in a much more general case, in Lions- Magenes [lJ, Chap. VII, in the case treated here, and in Agmon [lJ, Sect. 9, for the Dirichlet problem for an operator of order 2m. We can now return to the proof of Theorem 2. Let V o , VI' . . . , Vz and W o , W 1 , . . . ,  be two finite open coverings of Q stisfying: V o c Q, , i  1 is contained 'in a domain of a local map e such that e( n Q) = Br+, o = Va,  c , for all i  1, finally let (qJi) be a partition of unity subordinated to the covering (). Let io  1 be fixed; we shall show that qJio u E Hk+2(Q) sincef E Hk(Q). In order to simplify the notation we suppress the index io in the proof which follows. We introduce a sequence of open sets satisfying W = Uk C U k - 1 . . . c U o c V and we recall that the function qJU we are studying has its support in W n Q. A priori qJU E H 1 (V n Q), hence PqJu E L 2 (V n Q) since (3.23) PqJU = qJf + [P, qJJu and that [P, qJ J is of order 1. Lemma 6 then allows to assert that U E H 2 (U O n Q). The argument proceds in an obvious way: from (3.23) PqJu E H 1 (U o n Q), so, from Lemma 6, qJU E H 3 (U I n Q), from which it follows in k stages that qJU E H k + 2 (W n Q), which is the result we were looking for. The case of W o is dealt with by the use of Proposition 1 instead of Lemma 6. Theorem 2 is therefore completely proved. o The preceding results are valid for the Neumann problem and the Dirichlet problem for Laplace's equation or Poisson's equation; we therefore again 
3. Regularity of the Solutions of Variational Problems 437 encounter here the problems treated in Chap. II. We shall study a little later (see Sect. 4) the regularity properties of the Green's function. 4. Miscellaneous Results on Global Regularity In this section, we shall state, without proof, a more general result of global regularity than that of the preceding section. We shall follow closely Lions- Magenes [lJ and shall need a certain number of definitions which we shall now give with the hypotheses of this section. The open set Q will again be taken to be regular and bounded (with CfjOO boundary), and its boundary will always be denoted by r. 6i The operator: (3.24) P(x, D) == L aa(x)Da lal2m with principal part p.: p.(x, D) == L aa(x)Da, lal=2m will be supposed to hve coefficients in Cfjoo (Q) and to be properly elliptic in an open set containing Q, that is to say: Definition 1. The operator P given by (3.24) is properly elliptic in an open set U, if it is elliptic in U, and if for all x E U and all  E [Rn\ {O}, ' E [Rn\ {O} the polynomial in T (3.25) p.(x,  + T') == L aa(x)( + T')a lal=2m has m roots with (strictly) positive imaginary part. This is always true for an elliptic operator when n  3 and for an elliptic operator with real coefficients when n == 2, see Lions- Magenes [1 J, p. 122. The following hypotheses are related to conditions on the boundary and require some developments on the boundary operators. Such an operator B is of the form B(x, D) == L bp(x)DP IPI  J1 where b p E fJfi(r) {the multi-index P always denoting (Pi' P2' . . . , Pn) E f\jn). It operates on Cfjoo (Q) in the following fashion: Bg == L bp(x)yo(DP g) IPI  J1 61 We again suppose that Q is locally on one side of r. 
438 Chapter VII. Linear Variational Problems. Regularity where Yoh represents the restriction of h E reOO(Q) to r. We see that we can extend B with suitable theorems on the trace, to certain Sobolev spaces. We shall call the integer Jl, the order of B. Definition 2. Let (B j )l jr be a system of boundary operators of order f.lj respectively; we say that this system is normal on the open set r 1 of r if: (a) Jlj =I- Jli if j =I- i, (b) for all x E r 1 , and all  E [Rn\ {O} normal to r at x: L bjp(x)P =I- O. IPI=pj Example 1. Ifwe denote by alan the normal derivative to r, every system of the form ((alan)llj)l jr with Jli =I- Jlj if i =I- j (and with the following convention for Jl == 0: (a I an)O == id 0) is a system of boundary operators. This system is normal on r. o We see that this property is independent of the operator for which we wish to study a boundary value problem. It is the property which we shall define now which involves this operator. Definition 3. Let P(x, D) be a properly liptic operator of order 2m with coefficients re oo in an open set containing Q; let (Bj)Ojm-l be a system of m boundary operators; we say that it covers P on the open set r 1 of r if, for all x E r 1 , all  E [Rn\{o} tangent to r-at x, all' E [Rn\{o} normal to r at x, the polynomials L bjp(x)( + t')P in t, 0  j  m - 1, are linearly independ- IPI = Pj ent modulo the polynomial TI (t - tt (x, , ')), where the ti+ are the m lim roots with positive imaginary part of the equation P-(x,  + t') == O. This property is invariant under a diffeomorphism. We shall give an equivalent, but more manageable, form in the case of operators with constant coefficients and of a plane part of the boundary. For that, we introduce the following notation [R == {(y, t); Y E [Rn-l, t > O}; we shall write the operator with constant coefficients P(D) in the form P(D) == L aaD;' D:n, lal2m and the boundary operators in the form rx == (rx', rx n ),  P' Pn Bj(D) ==  bjpDy Dt , IPI Pj we denote by 11 the dual variable of y, t the dual variable of t. P == (P', Pn); 
93. Regularity of the Solutions of Variational Problems 439 Assume that P is properly elliptic, that is to say that p. (17, T) == L aa17 a 'T an lal=2m is non-null for (17, T) E [Rn\ {O} and, as a polynomial in T, has m roots with positive imaginary part for all 17 E [Rn-l \ {O}. We shall denote these roots by Tt (17), 1  i  m and we shall put M+ (17, T) == Il (T - Tt (17)) 1 im L c: (17)T m - k . Okm Then let Bj be a boundary operator; we shall denote by Bj(17, T) the remainder of the Euclidean division (as polynomial in T) of the polynomial Bj(17, T) Bj (17, T) == L b jp 17 P ' T Pn IPI = Pj by M+ (17, T); this is a polynomial in T of degree less than or equal to m - 1, with coefficients analytic in 17. We then have Lemma 10. Under the above hypotheses and with the above notation, if Bj(17, T) == L bjk(17)T m - k , Okm-l the system ofm boundary operators (Bj)Ojm-l covers P iff the matrix of the (bjk(17))Ojm-l is invertible for all 17 E [Rn-l \ {O}. Okm-l The verification of this lemma is easy. We shall now state the regularity result we had in view; the reader can find a proof in Lions-Magenes [1J (Chap. II, 5, Theorem 5.1). Proposition 2. Let Q be a regular bounded open set in [Rn, let P(x, D) be a properly elliptic operator of order 2m with coefficients which are Cfjoo in an open set containing Q, let (Bj)Ojm-l be a system ofm boundary operators on r, covering P, and such that 0 ' Jlj  2m - 1; let u be a solution in H 2m (Q) of the boundary value problem 62 { P(x, D)u == f in Q Bju == gj on r. If fE Hk(Q), gj E H k + 2m -/lj-t (r), 0  j  m - 1, then u E H 2m + k (Q), Vk E N. 63 62 For the existence and uniqueness of the solution of this problem by the Fredholm alternative, see Lions-Magenes [1J, Chap. II 95. 63 This regularity result can also be presented in the form: Let U E H 2m (Q) be such that P(x, D)u E Hk(Q), Bju E H k + 2 m-/tj-t(r), a  j  m - 1, then U E H 2m + k (Q). 
440 Chapter VII. Linear Variational Problems. Regularity Remark 2. This result is valid for boundary value problems formulated by other methods than the variational method. D Example 2. 1. If we take the system ((ajan)j)Ojm-l as the normal system of boundary operators, then for every properly elliptic operator, the hypotheses of Propo- sition 2 are satisfied; the preceding result is then applicable to the Dirichlet problem, for which m = 1 and Bou = u = (8j8n)Ou. 2. In the case of the Laplacian and a plane part of the boundary, we see (by using Lemma 10, for example) that the problems of Dirichlet, of Neumann, and of the regular oblique derivative are included in the preceding group. D The hypothesis on the regularity of the coefficients of the boundary operators can be insufficiently restrictive: for example, in the mixed probems (g2.2.1), the condition is not satisfied. In fact, the regularity result is not then valid. D Example 3. Application to the regularity of the solution of the stationary multigroup problem of the diffusion of neutrons (see Sect. 6 of 2). . The previous results enable us to assert that if the diffusion coefficients d g and the absorption coefficients (J gh are very regular and the boundary r of Q is very regular, then f = {fg} E (L 2 (Q))G implies that the solution u of the equation (2.25) of g2.6 satisfies the following regularity result: (3.26) U E (H 2 (Q))G. Then using the injection result of Sob ole v (see Adams [lJ), we deduce from the injection H 2 (Q) c O,A(Q), O<At true for a regular bounded set Q c [R3, that the solution of the problem is in fact continuous and Holderian of order A, that is to say lu(x) - u(y)1  Klx - yl\ - \;f x, Y E Q by definition of the space o, A(Q). The regularity result (3.26) is still valid if the coefficients d g and (Jgh satisfy (with r = aQ a variety of class rc 2 , Jl) only (with (2.26)) d g E l,Jl(Q) (Jgh E O'Jl(Q) for 0  J.1  1 (where l,Jl(Q) denotes the space of continuous functions whose derivatives are in rco,Jl(Q)) (see Stewart [lJ). It is also valid if the boundary r 
3. Regularity of the Solutions of Variational Problems 441 may have angular points with angles less than 11: (see Grisvard [1J or Kondratiev [1J). The interior regularity results (Sect. 1) show that if f is regular in the interior, then u is regular in the interior, in particular twice continuously differentiable, and hence the "classical" solution of equation (2.25) in the sense of Chap. II. We know (Remark 6 of 2.4) that if the coefficients d g are discontinuous, it is not possible for the solution to have H 2 -regularity. On the other hand, we can, for all that, show (Stewart [1J) that U E o').(Q) if the coefficients d g admit a finite number of sufficiently regular surfaces of discontinuity. D 5. Green's Functions 5.1. Case of the Laplacian in a Bounded Open Set Q with Dirichlet Condition We shall now return to the study of the Green's kernel begun in 1.3. We start with a particular case (that of the Dirichlet problem for the Laplacian in a regular bounded open set Q (with oo-boundary) in [Rn) and recover results of Chap. II. To simplify the writing, the spaces considered here will, in the first instance, be taken to be real. Hence, let us consider in a regular bounded set Q c [Rn the situation of  1.2.1. (3.27) v = H6(Q), H = L 2 (Q) a(u, v) = L (DiU, Div); lin taking account of the remarks made in 1.2.1, we shall study the reciprocal isomorphism G of (3.28) -L1: H6(Q)  H- 1 (Q), which extends in an obvious way that which was studied in 1.3. The Green's kernel Gx,y E /(QX x Qy) is defined by <Gu, v> = <Gx,y, u(x)V(y»64 for all u E (Q) and all v E (Q) (see 1.3). Before giving the properties of the Green's operator G and of the Green's kernel Gx,y we shall give some definitions concerning kernels. We recall that we call a distribution defined on Q x Q a kernel 65 , that is to say, a kernel is an element of /(Q x Q). 64 The function spaces considered here are taken to be real (as distinct from 1.3). 65 See Chap. V, 1. 
442 Chapter VII. Linear Variational Problems. Regularity Definition 4. Let Kx,y E '(Qx x Oy) be a kernel; we say that Kx,y is symmetric if <Kx,y, u(x )v(y) > = <Kx,y, v(x )u( y) > for all u and v of (Q). We say that Kx,y is very regular if: (i) Kx,y is a OO-function on Qx with values in '(Qy), (ii) Kx,y is a OO-function on Qy with values in '(Qx), (iii) the restriction of Kx, y to the complement of the diagonal D = {(x, y); X E Q, y E Q, x = y} of Qx x Qy is a rc oo -function. We then have: Proposition 3. The Green's kernel of the Dirichlet problem (1.1), g1 with (3.27), (3.28) is symmetric and very regular. Proof Let us define the mapping G: (Q) --+ '(Q) by (3.29) < Gu, v) = < G x, y, u (x) v ( y) > for u, v E (Q). We therefore have < G x, y, u (x) v ( y) > = < Gu, v > = < Gu, - L1 Gv > a(Gu, Gv) = L: / -;:,0 GU,:- GV ) 1  i  n \ uXi uXi a(Gv, GU) = <Gv, -L1GU) = <Gv, U> <Gx,y, v(x)u(y» from which we have the first assertion. The second part of the proposition is a general property consequent upon the hypo-ellipticity of the Laplacian (see Chap. V, g2). D To be able to identify the Green's function (which is now a function Gx,y regular for x i= y) with the Green's function G Q defined in Chap. II, g4, we need the following property. Proposition 4. G extends to an isomorphism of H - 1 (Q) + s' (Q) onto H(Q) + S'(Q).66 Proof It follows from the regularity results of Sect. 1, that G is an isomorphism of H- 1 (Q) n S(Q) onto H(Q) n S(Q) (the closed graph theorem ensuring 66 In a general way, if V and Ware topological vector spaces with V n W dense in V and in W, we denote by V' + W'the (strong) dual of the space V n W (with V n W given the coarsest topology making the natural injections V n W -+ V and V n W -+ W continuous). The space V' + W' is then made up of the linear forms v E V n W -+ L(v) + M(v) with LEV' and M E W'. 
3. Regularity of the Solutions of Variational Problems 443 the continuity). On the other hand, the calculation made in (3.29) shows that, on this space, G coincides with its transpose which, itself, is an isomorphism of H- 1 (Q) + @'(Q) onto H (Q) + @'(Q). D We shall denote by G(x, y) the value at the point (x, y) of the function G and by G(., y), Y E Q, (resp. G(x, .), X E Q) the mapping x  G(x, y) on Q\ {y} (resp. y  G(x, y) on Q\{x}). We then have Theorem 3. For all y E Q, G(.,y) is the unique solution in H(Q) + @'(Q), of t he equation (3.30) -L1 x G(.,y) = by Proof We first verify that - L1 is the reciprocal isomorphism of G between the spaces H.6(Q) + @'(Q) and H- 1 (Q) + @'(Q). We know that -L1 operates continuously from Hb(Q) + @'(Q) into H- 1 (Q) + @'(Q), and, on the other hand, that 2fi( Q) is dense in each of these two spaces and that finally for all u E 2fi(Q) u = -L1(Gu) = G( -L1u). Let us now calculate G(b y ) for y E Q. We have defined G on H- 1 (Q) + @'(Q) as the transpose of the restriction to H- 1 (Q) n @(Q) of the Green's kernel G; hence G(b y ) is a distribution on Q whose value on the function u E 2fi(Q) is given by < G( by) , u > = < by, Gu > (the second "scalar product" is that of the duality (@'(Q), @(Q))); thus <G(b y ), u> = (Gu)(y); the mapping K = Gu : y --+ < G ( by), u ) is therefore oo on Q, it defines a distribution on Q by the formula <Cu, v) = fQ(CU)(Y)V(Y)d Y , Vv E 2fi(Q) or again, by using the kernel G x, y <Gu, v) = <Gx,y, u(x)v(y». The kernel Gx,y can thus be obtained (as a distribution in x with value a distribution in y) by Gx,y = Gx(b y ) from which, in an obvious way, we have (3.30). The theorem is thus completely proved. D 
444 Chapter VII. Linear Variational Problems. Regularity In the set of the functions LP(Q), it follows from (2.32) (applied to the case in which A ::z: - L1) that: n G( . , y) E W ' P ( Q) if p < V Y E Q n - 1 ' . Note that from Proposition 5 below, that this comes down to saying that the elementary solution satisfies: E E W 1 ,p ( lRn ) n loe i.e. that if n  3, rn2 ' a: i ( rn2 ) are in U(B), where B is the ball of centre 0 and radius 1. This is satisfied if xir- n E LP(B) i.e. if(n - 1)p < n, and we recover the above condition on p (obvious variants if n = 1 or n = 2). o Proposition 5. The Green's kernel for the problem (1.1) g1 with (3.27), (3.28) is given in terms ofG!b the Green'sfunction of the open set Q (see g3.1 of Chap. II) by the formula G(x, y) = - G .Q(x, y). From the definition of G.Q (see Definition 8, g4 of Chap. II) G(x, y) = -G.Q(x, y) = vy(x) - En(x - y) where v y is for each y E Q the unique solution in H 1 (Q) of the Dirichlet problem { L1xVy = 0 (3.31 ) YoV y = Yo(En(x - y)), which from Corollary 2 (of Theorem 2) is OO(Q) 67 and En is the elementary solution of the Laplacian (see Chap. I I). Proof Let v y be the solution in H 1 (Q) of(3.31); we consider the distribution Ty on Q defined by Ty = +v y - En(. - y). We have (3.32) - L1x Ty = by. On the other hand, we can write, cp being a function of (Q) taking the value 1 on a neighbourhood of y Ty '= -En(. - y)cp + [v y - En(. - y)(1 - cp)] the first term of the sum is in $' (Q); the second is the sum of an element of H 1 (Q) and of a function of OO (Q), it therefore belongs to H 1 (Q), its trace on the boundary of Q is null, by definition of v y , this second term therefore belongs to 67 If r is of class rc oo . 
3. Regularity of the Solutions of Variational Problems 445 H(Q), from which (3.33) Ty E H6(Q) + /(Q). Now G(y,.) is the only distribution satisfying (3.32) and (3.33) by Theorem 3. D We recall that we have given in Chap. II, 4 several examples of Green's functions relative to the Laplacian with Dirichlet condition, and in particular -in a bounded interval Ja, b[ of  (Example 10, 4), -in a ball of n (n  2) (Example 11, formulae (4.24) and (4.25)), - in an unbounded interval J a, 00 [ of  (Example 12, 4), -in the exterior of a ball of n (n  2) (Example 17), -in the half-space of n (n  2) (Example 18) and (in  7 of Chap. II) in a half-ball of n (by the method of images). 5.2. Some Other Particular Examples After having studied the Green's function associated with the Dirichlet problem for the Laplacian in a regular bounded open set of n, we propose to examine some other simple cases 68 . (a) Green'sfunction associated with the Neumann problemfor the Laplacian, in a regular, bounded and connected open set in n. The determination of the Green's function N(x, y) in the case of the Neumann problem (written in a fashion similar to the Dirichlet problem) reduces to solving the problem: -L1xN(., y) = b(x - y) = by aN --;- (x, y) = 0, X E r un x for each y E Q. Now this problem does not admit of solution. In effect the Gauss condition (see for example Chap. II, formula (6.56)) of the problem - L1 u = J.1 in Q au an = v on r 68 We recall also that in Chap. II, 8.6, we have studied some other examples of Green's functions than those indicated previously (see the Examples 4, 5, 6, 7, 8, Chap. II) in particular the Green's function for the problems of Neumann and Robin for the Laplacian, these examples being considered below in a different light, resulting certainly in the same analytical expressions. 
446 Chapter VII. Linear Variational Problems. Regularity with given J1 and v (bounded Radon measures), the condition which can be written: L dfl + L dv = 0, is not verified for J1 = by, v = o. Another way of taking account of the non-existence of the Green's function for the Neumann problem, is to look for a function N(x, y) of the form N(x, y) = -En(x - y) + vy(x), the function v y having to satisfy -L1xVy = 0 oV y on x (n In Q )<Tn ax ( IX _ lYln_z ) oEn on x on r n  3. 69 Now v y being a harmonic function on Q must satisfy the Gauss condition f Vy (x)dr(x) = 0; r un x but (see Chap. II, g3), we know that f oEn - dr(x) = 1! r on x It is nevertheless possible to have a technique for solving the Neumann problem by using a function analogous to a Green's function. Let us consider problem PN ( Q, by _ 1 Q ) mes 5J - L1xH(x, y) 1 by(x) - mesQ In Q70 oH -;-(x,y) un x 0, X E r 71 , 69 With (J n the area of the unit sphere in [Rn. 70 With mes Q = volume of the bounded domain Q. 71 We recall that, in Chap. II, 8, we have also called the Green's function (of the Neumann problem for the Laplacian) the solution of this problem with the condition In H(x, y)dx = 0; this termin- ology is particularly justified by the properties of this solution. 
3. Regularity of the Solutions of Variational Problems 447 for each y E Q. Since the Gauss condition f ( <5y - 1 ) dx = 0 n mes Q is satisfied, this problem clearly admits a solution (determined to within an additive constant, with the hypothesis that the open set Q is connected). To verify it, we look for H of the form H(x, y) = -En(x - y) + wy(x) with En an elementary solution of the Laplacian, and w y a solution of: 1 L1 x w y (x) = In Q mesQ DW y = DEn (x _ y) on r, an x an x for each y E Q. We know the existence of the solution (to within an additive constant) of this problem (in H 1 (Q)). We consider at present the problem PN(Q,f): { - L1u = f in Q au _ a = 0 on r, u r with f E L 2(Q) a given function such that f n f dx = O. Using the formula (see Chap. II, gl, formula (1.9)): fn (uL1v - vL1u)dx = L( u : - v :: )dr, formally with v = H (., y) (naturally this can be justified), we obtain -u(y) + 1 f UdX + f H(x, y)f(x)dx = 0, mes Q n or, agaIn: u(x) - 1 f UdY = f H(y, x)f(y)dy. mes Q n Putting, then def 1 f f U(x) = !-t(x) - Q udy = H(y, x)f(y)dy, mes n we see that U is the solution of the Neumann problem PN(Q,f) satisfying f n U(x)dx = O. 
448 Chapter VII. Linear Variational Problems. Regularity ..........., -- Let us denote then by L 2(Q), V, D(A N ) the following sets U(Q) ={fE L 2 (Q), I/dX = O}, - .--..../ V == H 1 (Q) n L 2 (Q), D(AN) == D(AN) n U(Q) with D(A N ) == domain of the Laplacian in L 2(Q) with Neumann condition. The bilinear form a(u, v) == I grad u grad v dx, defined on the space V, being V- n  coercive (see Chap. IV, 7), the unbounded operator in L 2(Q), AN associated - ,,--../ with a(u, V)72 (hence defined by the variational class V, L 2 (Q), a(u, v)) is, as a result of the Lax-Milgram theorem, an isomorphism of D(A N ) onto j) (Q) with inverse H == AN 1 . As in the case of Proposition 3, we can show that the kernel H x, y of this operator is symmetric and very regular. In addition, the operator H extends to an isomorphism from the space (L 2 (Q) + $'(Q)f def {fE 'L 2 (Q) + $'(Q), In fdX = o} - def { I } onto the space (D(A N ) + $'(Q)) = U E D(A N ) + $'(Q), n udx = 0 . We shall again have H(f) == U solution of the problem PN(Q,f) for all f E (L 2 (Q) + $'(Q))-, and in particular H(b y ) == H(., y), which identifies the kernel Hx,y of H with the function H(x, y) solution of the problem PN ( Q, by - 1 ) , chosen in such a way that mesQ In H(x, y)dx = 0 \ Vy E Q. Thus H(x, y) can be considered as the Green'sfunction of the Neumann problem relative to the Laplacian. (b) Green'sfunction associated with the Robin problemfor the operator - L1 + A, A > 0 in a regular bounded open set Q. We give now an example of the use of the Green's function, in the Problem PR(Q,f): i) (- L1 + A) u == f in Q . . ) d ef au h a 11 Bu = _ a + u == 0 on r == Q n r 72 This operator is identical with the opposite of the Laplacian with Neumann boundary condition, in the space l2(Q). 
3. Regularity of the Solutions of Variational Problems 449 for Q a bounded open set in n, with regular (OO) boundary r = 8Q, with A > 0, h a given positive function, also supposed regular (OO (r)), and with a given j such that j E L 2 (Q). As a consequence of g2.3.2, we know that this problem P R(Q, j) with Robin boundary condition 73 can be put into variational form by putting: V = H 1 (Q), H == L 2 (Q) and a(u, v) = In(gradUgradV + Auv)dx + IrhUVdr; the bilinear form a(u, v) thus defined and under the hypotheses made, V- coercive; by the Lax-Milgram theorem, the operator A, unbounded in L 2(Q), associated with a(u, v) is an isomorphism of D(A) onto L 2 (Q). Let G == A -1 its inverse, (which is thus an isomorphism of L 2 (Q) onto D(A)) and Gx,y, the kernel of G, defined by def <Gx,y, u(x)v(y» == <Gu, v> Vu, V E !0(Q). We can again verify, as in Proposition 3, that Gx,y is symmetric and very regular.. We can show (see Lions [3J) that the mapping G = A - 1 can be extended to an isomorphism of L 2 (Q) + $'(Q) onto D(A) + $'(Q), and that G(b y ) == Gx(b y ), denoted by G(x, y) is the solution of the Robin problem: f ( - L1 x + A) G (x, y) = by in Q 8G l on x (x, y) + h(x)G(x, y) = 0 on r. Utilising Green's formula In[U(A - A)V - v(A - A)uJdx= Ir[U( :: + hV) - v( : + hU) Jdr formally74 with v = G(. ,y), we obtain the solution u of the problem PR(Q,j) in the form U(x) = In G(x,y)f(y)dy. Thus the function G(x, y) is clearly the Green's function of the problem PR(Q,j). 73 From which we derive the notation of the problem. 74 This can be justified by passage to the limit. 
450 Chapter VII. Linear Variativnal Problems. Regularity Remark 3. An interesting generalisation of the problem P R(Q, f) is: Problem PR'(Q,f, g): ( - L1 + 2) u = f in Q def au B == (Xu + f3;;- = g on r un r with 2 E , (X and f3 given (regular) functions on r such that (X + f3 = 1, and the givenfE L 2 (Q), g E Ht(r). Suppose that there exists a Green's function G defined on Q x Q\D (with D = {(x, x), X E Q}), such that f(-L1x +2)G(x,y) =l5(x-y) in Q l BG(x, y) = 0 on r for each y E Q. Then, using the following Green's formula I}U(,1 - A)V - v(,1 - A)U] dx = I r [ ( au + P  ) ( - v + : ) - ( - u +  ) ( av + P : ) ] dr, and replacing v formally by G( . , y) we obtain the solution of P R' (Q, f, g) in the form u(x) = In G(x, y)f(y)dy + Ir[ G(x, y) -  (x, y) ]g(y)dr(y). But the justification of these results would be much more delicate. o (c) Green's function associated with the biharmonic operator L1 2 , with Dirichlet conditions, in a regular bounded set Q c n. Let us now consider the variational framework: v = H6(Q), H = L 2 (Q), a( u, v) = I n ,1 u ,1 v dv; a is a V-coercive bilinear form, and defines an isomorphism: L1 2 = L1L1 of H6(Q) into H- 2(Q)75. Let G be the reciprocal isomorphism, and G x, y its kernel which is therefore defined by <Gu, v) = <Gx,y, u(x)v(y) Vu, V E fJfi(Q). 75 The situation considered here is a particular case (for v = 1) of that of Remark 21, 2. 
9 3 . Regularity of the Solutions of Variational Problems 451 By a method similar to that used in Propositions 3 and 4, we can show that (i) G x, y is symmetric and very regular, (ii) G can be extended to an isomorphism oj H- 2 (Q) + $'(Q) onto H6(Q) + $'(Q) and finally thatJor all y E Q, Gx,y also denoted by G(x,y) is the unique solution in H6(Q) + $'(Q) oj the equation L1 G(x, y) = by in Q; G(x, y) appears therefore as the Green's function of Problem P(Q,f, g, h): L1 2 u =J In Q ulr =g on r au = h - on r an r withJ: g, h given such thatJE H- 2 (Q), g E H 3 / 2 (r), h E H l / 2 (r). Using the Green's formula f.a (u Ll 2 v - VLl2 u) dx = f }(u Ll 2 v - Llu Llv) + (Llu Llv - v Ll2 u)] dx f ( a L1v ou av a L1u ) = u - - - L1v + L1u - - - v dr r an an an an (formally) with v = G(., y); the solution u of the Dirichlet problem P(Q,J: g, h) can then be written u(x) = + f.a G(x, y)f(y)dy + f g(y) _ a a LlyG(x, y)dr(y) - f h(y)L1yG(x, y)dr(y). r ny r In the case in which Q is the unit disk in (R2, we can give an explicit expression: I' - 171 2 -, 1 G(Xl' X2, Yl, Y2) = - 8n Log 11 - - - - (1111 2 - 1)(1(1 2 - 1) 17 - , 16rc x = (Xl' X2) E Q, y E (Yl, Y2) E Q, where we have put ,= Xl + iX2, 17 = Yl + iY2, ,= Xl - lX2, 17 = Yl - lY2. F or other examples involving Green's functions in a more or less explicit fashion see Chap. XI; see also Lavrentiev-Shabat [1], Rektorys [1] and Butkovsky [1]. 5.3. Green's Functions in a More General Setting We shall now consider more general situations. For that, we shall first fix the notation and give some preliminary properties. Let a(u, v) be a sesquilinear 
452 Chapter VII. Linear Variational Problems. Regularity form 76 on Hm(Q) (Q a regular bounded open set in (Rn) given by (3.34) a(u, v) == I (a(Xp D(Xu, DP v), lal m IPI m a(Xp E CC OO (Q). The adjoint form a*(v, v) defined by (3.35) a*(u, v) == a(v, u), Vu, V E Hm(Q), is given by a * (u, v) == I (DP u, a rxp D(X v ). lal m IPI m If, to the sesquilinear form a(u, v) there corresponds the differential operator A (3.36) A(x, D) == I (-1)IPI DP a(Xp D(X lal m IPI  m linked to a(u, v) by (1.12) there corresponds to the sesquilinear form a*(u, v) the formal adjoint differential operator of A: (3.36)' A * (x, D) == I ( _1)1(X1 D(X a(Xp DP lal m WI  m satisfying (A *(x, D )u, v) == (u, A(x, D)v), Vu, V E (Q). (a) Green's kernel in a variational setting. Now, let V be a closed sub-space of Hm(Q) with H'[;(Q) eVe Hm(Q). We suppose that a(u, v) is V-elliptic; the operator A, unbounded in L 2 (Q), defined by this variational scheme, and the domain D(A) defined by (1.13) is therefore (by the Lax-Milgram theorem) an isomorphism of D(A) onto L 2 (Q); it is obvious that a*(u, v) is also V-elliptic. We can prove (see Chap. VI, 3) that the operator A * in L 2 (Q) with domain D(A *) associated with the adjoint form a*(u, v), is the adjoint of A with domain D(A) and that A* is an isomorphism of D(A *) onto L 2 (Q). Let G x , y (resp. G;, y) the Green's kernel of the problem { Au == f,f given in L 2 (Q) ( res . { A*U == f, f given in L 2 (Q) ) u E D(A) P u E D(A*) and G == A -1 (resp. G* == A*-l) the mapping of L 2 (Q) into D(A) (resp. D(A*)) 76 Here, the spaces are taken to be complex. 
3. Regularity of the Solutions of Variational Problems 453 whose restriction to (Q) is defined by (3.37) < Gu, v> = < G x , Y' u(x )v(y) > (resp. < G*u, v> = < G;, Y' u(x )v(y) > ),77 Vu, V E (Q); we have the Proposition 6. The kernel G x, y possesses the following properties: (a) G x , y = G:, x, (b) G x, y is very regular. Proof The point (a) is established by a calculation identical with that used in the proof of Proposition 3. The point (b) follows from the fact that A and A * are strongly elliptic from  1.5, therefore hypo-elliptic, from which the desired property follows (see Chap. V, 2). Proposition 7. The operator G, defined by (3.37), in terms of the Green's kernel Gx,y can be extended to an isomorphism from L 2 (Q) + $'(Q) onto D(A) + $'(Q). We cannot employ the same method of proof as was used in Proposition 4 as, (Q) not necessarily being dense in V, we canr:ot use the transpose of a regularity result. We assume this result; for a proof see Lions [3J,  12. (b) Green's function for a problem not necessarily variational. We take up the operators A and A * given by (3.36) and (3.36)', but without assuming a priori the existence of a variational framework for these differential operators. We shall give a Green's formula permitting us to give a sense to the notion of "adjoint boundary value problem". The discussion which follows is taken from Lions-Magenes [lJ (Chap. II, 2) to which we refer for the proofs. Proposition 8. The differential operators A and A * defined by (3.36) and (3.36)' being supposed elliptic, let (Bj)o  j  m-l be a normal system (see Definition 2) of boundary operators of orders less than or equal to 2m - 1. We can choose (in a non-unique way) another system of boundary operators (Sj)o  j  m-l having the same properties as (B j ) such that the system of 2m boundary operators (Bo, . . . , B m - 1 , So, . . . , Sm-l) is normal. Then there exist 2m boundary operators C j , Tj such that the system (Co, . . . , C m - 1 , To, . . . , T m - 1 ) is normal, that for all j, Ojm-1 order B j + order Tj = order C j + order S j = m 77 In the member on the left, the <, > denotes the antiduality 92'(.0), 92(.0) and in the member on the right, the < , > denotes the duality 92' (.0 x .0), 92(.0 x .0). 
454 Chapter VII. Linear Variational Problems. Regularity and that (3.38) f Au.vdx - f u. A*v dx = L f Sju Cjv dy n n Ojm-l r .L f Bju Tj v dy, Vu, V E fYfi(Q). OJm-l r The C j and Tj are determined in a uniquefashion, once the choice of the Sj has been made. The Green's formula (3.38) lets us talk of adjoint problems in the sense of this formula: the operators A, A *, Bj being given, there are several formulae possible, depending on the choice made of the Sj. Definition 5. The system of operators (C j ) is said to be the adjoint of the system (B j) with respect to (3.38). The justification of this definition emerges from the following remark: If u E fYfi(Q), we have Bju = 0 for 0  j  m - 1 iff f.Q Au . v dx = f .Q u . A * v dx for all v E fYfi(Q) such that Cjv = 0 for 0  j  m - 1. Again we say that the boundary value problem { A*U =f in Q Cju = gj on r, 0  j  m - 1 is the adjoint with respect to the Green's formula (3.38) of the problem { AU=f in Q Bju = gj on r, 0  j  m - 1. (For example the Dirichlet problem for the Laplacian Ltu = f, with u I r = 0, is its own adjoint (for any Green's function whatsoever of the type (3.38)).) D (3.39) Suppose that the posed problem is interpreted as { Au _ I in Q Ig.i ve n in L 2 (Q), _ Bju - 0 on r, ] - 0, 1, 2, . . . , m 1 where the system of boundary operators (B j ) satisfies the conditions of Prop- osition 8. We shall explain what we are looking for when building a "Green's function" for this problem, without for the moment being pre-occupied with conditions of validity. We then write Green's formula (3.38) and give the name Green's function to a function 78 (x, y)  G(x, y), oo on Q x Q\D (where D is 78 If it exists. 
3. Regularity of the Solutions of Variational Problems 455 (3.40 ) - - - the diagonal of Q x Q: the set (x, y) E Q x Q such that x = y) satisfying { A*G(., y) = by C j G ( ., y) = 0, j = 0,,1, . . . , m - 1 for all y E Q. We see then that the solution u (if it exists) of the problem (3.39) satisfies (3.41 ) u(y) = fn G(x,y)f(x)dx. We can rigorously justify the above formal argument. Remark 4. If U is an open set containing Q, suppose that we have in U x U a kernel E(x, y) satisfying A E(x, y) = b(y - x) for all y and all x in U . We can verify (ellipticity of A *) that E is very regular. Let H be defined by H = E - G. We see (on examining the adjoint problem of (3.39) to justify the Green's formula) that and { A*H(.,y)=O CjH(., y) = -CjE(., y) { AH(X,.) = 0 B j H (x, .) = - B j E ( . , y) - - which shows, when Proposition 2 can be applied, that G E OO(Q x Q\D). The only interest of the remarks that we are about to make is an a posteriori justification of the calculations made to find, in concrete cases, the Green's functions of boundary value problems. D (c) Identification of Green's kernel and Green's function in a variational frame- work. Let us suppose afresh that the operator A occurring in (3.39) with differential expression (3.36), with the boundary conditions B j u = 0 on r, j = 0, . . . , m - 1, is defined by the variational scheme of (a), V, L 2 (Q) and the V-elliptic sesquilinear form a(u, v). (For the operators A and the boundary operators Bj having this property, see Lions-Magenes [lJ, Chap. II 9, p. 220). Thus A is an unbounded operator in L 2 (Q) whose adjoint A* (which is the unbounded operator in L 2 (Q) defined by the variational scheme V, L 2 (Q), and the sesquilinear form a* (u, v)) is, under sufficiently general conditions (see Lions-Magenes [lJ, Vol. 1, p. 196, identified with the realisation in L 2 (Q) of the differential operator A * (see (3.36)') with the boundary conditions Cju = 0 on r, 0  j  m - 1. 
456 Chapter VII. Linear Variational Problems. Regularity The problem (3.39) is then a problem of the form { Au = f u E D(A) where H = L 2 (Q), D(A) cHand A is a bijective operator. Let G = A -1: H --+ D(A), the inverse operator. We have defined the Green's kernel 79 G x, y E fYfi' (Q x Q) by < Gf, v) = < G x , y,f (x)v(y), Vf and v E fYfi(Q). In other words, since u = Gf, we have shown that < u, v) = < G x, y' f (x ) v ( y) ) , Vv E fYfi(Q) which is the analogue of (3.41). On the other hand Propositions 6 and 7 permit us to assert that G., y is, for all y E Q the unique solution in D(A*) + g'(Q) of A*G.,y = by which is the analogue of (3.40). Finally, Proposition 6 states precisely that G x, y is very regular and that G x, y = G(x, y).80 Review of Chapter VII As related to the methods developed in the earlier chapters, the variational method developed in 1, 2 has several significant advantages. On the mathematical plane, it allows us to establish existence and uniqueness for a large class of elliptic problems with discontinuous coefficients. With regard to the classical methods (for example the theory of the potential developed in Chap. II), it has the advantage of a very great generality resulting from the methods and from the fact that, in the first place, we have for objective only the existence (and uniqueness) of weak solutions. A great advantage of the variational method is that it gives rise easily to systematic methods of approximation as we shall see in Chap. XII. In what concerns the regularity of the solutions, supplementary methods and hypotheses are necessary. In the best of the cases (very regular coefficients and open sets Q), for an elliptic problem of order 2m, if the second member is in Hk(Q), the solution will be in H k + 2 m(Q). 79 In this case real. 80 Note that we have thus obtained the existence of the Green's function, solution of (3.40). 
Appendix. "Distributions" It is necessary in problems involving linear differential operators to make use of the theory of distributions "which re-establishes differentiation as a simple operation of analysis" 1. We have gathered together here several definitions and elementary properties of distribution with the aim of aiding the reader's memory. It makes no pretension therefore to being a treatise on the theory of distributions and the greater part of the results stated are given without proof. This appendix thus contains neither a historical survey nor an exhaustive bibliography on the subject. We refer the reader above all to Schwartz [1]. 1. Definition and Basic Properties of Distributions 1. The Space (Q) 1.1. Definition Let Q be an open set in n; we denote by (Q) the space of infinitely differentiable functions with compact support in Q and with values in C. (This space corresponds to the case k = + 00 of the definition of rct (Q), with the result that in the literature we occasionally encounter the notation rc (Q) for (Q)). It is obvious that (Q) is a vector space on the field C of complex numbers, but this vector space could be reduced to {O}. We shall thus show, in the first place, the existence of non-null functions belonging to , and then give a systematic procedure for constructing such functions showing that the space (Q) is sufficiently rich to serve as a space of "test functions". The existence of non-null functions of (Q) follows from the 1 Schwartz [1]. 
458 Appendix. "Distributions" Lemma 1. For all Xo E Q andfor all r > 0 such that the closed ball B(xo, r) with centre Xo and of radius r is contained in Q, there exists a function qJ E (Q) satisfying (1.1) supp qJ c B(xo, r), qJ  0, L cp(x)dx = 1. Proof. Let us introduce the function tjJ defined in Q by ( 1.2) tjJ(x) = [ - r 2 ] exp r 2 - Ix - x o l 2 o otherwise. if Ix - xol < r, (1.3) then the function qJ defined by cp(x) = r/J(x) I L r/J(x) dx answers the question. D We shall give a systematic proceClure for the construction of functions belonging to (Q). This procedure uses the concept of the convolution product of two functions, so we shall, to begin with, recall several properties of this convolution product. (1.4) 1.2. Elementary Properties of the Convolution Product of Two Functions To fix- our ideas (see Remark 1 below), suppose that u and v are two continuous functions on n of which at least one has compact support. We define their convolution product 2 u * v by: u * v(x) = r u(x - y)v(y)dy Jn which satisfies the following elementary properties: (1.5) u * v = v * u (1.6) supp(u * v) c suppu + suppv. = {x + y; x E suppu, y E suppv}. 2 We find sometimes in the literature that this is called the composition of u and v. 
 1. Definition and Basic Properties of Distributions 459 If two at least of the three functions u, v, ware of compact support: ( 1. 7) (u * v) * w = u * (v * w). Remark 1. Of course, it is not necessary to suppose that u and v are continuous to define their convolution product, nor that one of the two functions is of compact support. For example, if (1.8) 1 1 1 - = - + - - 1, r p q 1  p, q < + 00 U E LP, v E Lq then u * v E L r with and (1.9) II u * V IILr  II u IILP.II v IILq ( Young's inequality) with the result that, we have, in particular { i) p = q = 1, r = 1 ii) p; 1 < p _ < + 00, q = 1, r = p. (1.10) o The convolution product of two functions is particularly interesting because of its regularising properties and these are precisely the properties we shall system- atically exploit. For example: if u and v are of compact support then: (1.11) u * v is of compact support (from (1.6)), (1.12) if u is continuous then u * v is also, { if DkU exists and is continuous, then Dk(U * v) exists and is (1.13) continuous and we have Dk(U * v) = (Dku) * v. The above remarks lead to: 1.3. A Procedure for the Construction of Functions of (Q) For 8 > 0, we introduce the function Pe defined p.(x) = :n <po( : ) (1.14) where <Po is the function introduced in Lemma 1- and so defined by (1.2) and (1.3)-with Xo = 0, r = 1. We then have the 
460 Appendix. "Distributions" Lemma 2. Let u be a function locally integrable in Q and with compact support c Q. Then for 8 sufficiently small ( 1.15) belongs to f0(Q). Proof The proof is immediate. From (1.6), supp U e is contained in a compact neighbourhood of order 8 of the support of u. As supp u is compact and is included in Q, we have supp U e c Q for 8 sufficiently small. In addition Pe E f0(Q) and from (1.13): u e = Pe * u ( 1.16) for all a E N n Dau e = D a Pe * u. o Lemma 2 shows that the space £t1(Q) is sufficiently "rich". In fact, we shall see that this space is dense in the numerous common functional spaces constructed on Q. A very interesting application, generalising U rysohn's theorem, 3 is given by: Lemma 3. Let K be a compact set in lR n and 8 > 0 arbitrary. Then there exists a function qJe E f0, (f0 = f0(lR n ) following our conventions), having the properties 1 qJ e (x) = 1 for all x E K, qJ e  0 (1.1 7) supp qJe C Ke = U B(X,8). XEK Proof From Urysohn's theorem, there exists a continuous function u equal to 'Ion K e / 3 and such that supp u C K 2e / 3 . Then qJe = Pe/3 * u, (with Pe defined by (1.14)), answers the question. o 1.4. The Notion of Convergence in !'.d(Q) We can provide the space f0(Q) with a topology of a topological vector space (see Schwartz [1], Treves [2]), but this topology (inductive limit of Frechet spaces) is not also easy to manipulate as a topology of a Banach space. Fortunately, for practical purposes, the notion of convergence of sequences is sufficient for the majority of applications. Definition 1. Let {qJp }PEN be a sequence of functions of f0(Q). We say that such a sequence converges- in f0(Q) to qJ E f0(Q) if the following conditions are satisfied: (i) there exists a fixed compact set K in Q such that sUPPqJp c K for all pEN; 3 See, for example, Dieudonne [2]. 
 1. Definition and Basic Properties of Distributions 461 (ii) for all a E N n , the sequence {DaqJp }pEN of the lal-th derivatives of qJp converge uniformly on K to D a qJ. 1.5. Some Inclusion and Density Properties We shall now give some inclusion and density properties of the space f0(Q) in certain familiar functional spaces. For that, we begin by considering the case Q = IR n . Definition 2. We put k(lRn) = k.4 We say that a sequence {Up}PEN converges in k to u in k if for all a E N n , 0  lal  k the sequences {Da(u p - u) }pEN converge uniformly to zero on every compact set in IR n . o Proposition 1. The.space f0 is contained in and is dense in \ 0  k  + 00. In addition every sequence which is convergent in  converges in k (is contained in k with a finer topol ogy 5). Proof First of all, it is obvious that  is contained algebraically in k. The proof of the density of  in k is made in two stages by "truncation" and "regularisation" : 1st stage: "Truncation". We show that t = t(lRn) is dense in k. 2nd stage: "Regularisation". We show that we can approximate thefunctions of by the functions of f0(Q). 1st stage. We make use of Lemma 3. From this lemma we know that we can find a function qJ E  such that qJ = 1 on the closed ball B(O, 1). For p E N*, we then define qJ p by: cpp(x) = cp G )' Then qJp E  takes the value 1 on B(O, p). Under these conditions for every function u E \ the sequence of functions {Up}pEN defined by: up (x) = qJp(x)u(x) satisfies up E t for all p and it is immediately verified that Dau p converges uniformly to Dau on every compact set in IRn. Thus up  u in \ from which the result follows (1st stage). Rema 2. The multiplication of the function u by qJp ensures that up is equal to u on B(O, p) and is of compact support. We say that we have "truncated" the function u, from which we derive the name operation of truncation given to this first stage. o 4 More generally, the symbol [Rn will be omitted for the spaces of functions or distributions on [Rn (i.e. Q = [Rn). 5 We sometimes indicate this by  <:+ k. 
462 Appendix. "Distributions" 2nd stage. We now show that f!fi is dense in  by making use of the regularising properties of the convolution with a function of f!fi (hence the name of "regularisation" given to this stage). Let p E f!fi satisfy Lemma 1 with Xo = 0, and r = 1. For pEN *, we define pp by: (1.18) pp(x) = pn p(px) in order that we have r pp(x) dx = 1. Jn Now let u E  and let us define {up }pEN* by ( 1.19) Up = u * pp. It follows from the properties of the convolution product, outlined above, that we have: f up E f!}j 1 Da(u p - u)(x) = t. [Dau(x - y) - Dau(x)]pp(y) dy. To verify the uniform convergence of DCX(u p - u) to 0 on every compact set it is sufficient to verify this property for a = O. But u being continuous and of compact support, is uniformly continuous on n and the property to be verified follows immediately. Finally, taking account of the definitions of the convergence of sequences in f!fi and k, it is immediate that every sequence which converges in f!fi to a function of f!fi converges to that function in the sense of k, which completes the proof. (1.20) o Proposition 2. The space f!fi is contained in and is dense in LP, 1  p < + 00 and every convergent sequence in f!fi converges in LP. Proof Propositio 2 is a consequence of Proposition 1 and of the density of g in LP. It is not uninteresting also to proceed directly. The method is similar to that used in the proof of Proposition 1. 1st stage. Truncation. We multiply u E LP by the continuous function Xrn which is equal to 1 on the ball B(O, m) and to zero for Ixl  2m, where mEN. The function Urn = uXrn E LP. In addition, it is easily verified that Urn ---+ u in LP as m  00, 1  p < + 00. Hence the functions of LP with compact support are dense in LP. 2nd stage. Regularisation. From the result of the 1st stage, it is enough to approximate a function v E LP with S1;lPP v compact by functions of f!fi. For 
91. Definition and Basic Properties of Distributions 463 mEN, we then introduce Prn by (1.18) and V rn by (1.21 ) We have v rn = V*Prn. (1.22) f V m E f?fi l (v m - v)(x) = t. [vex - y) - V(X)]Pm(y)dy. We then show (left as an exercise to the reader) by using Holder's inequality and the continuity of the translation in LP (see also Chap. XVIIA)-which here replaces the property of uniform continuity used in the proof of Proposition 1 - that (1.23) r Ivm(x) - v(x)lPdx  0 when m  00. Jn Hence we have the result. Thus  is dense in LP, 1  p < + 00. It remains for us to show that convergence in  implies convergence in LP. Now, if Urn  u in , Urn  u uniformly on a compact set K in IRn (where supp Urn c K and supp u c K). This implies the convergence of Urn to u in the sense of LP. o For the space (Q), with Q c IRn, the results are similar. For example, by using the fact that (Q) is identifiable (by extending by 0 the functions to the exterior of Q) with the subspace of k consisting of functions ha ving their support in Q, we verify the Proposition 3. The space (Q) is contained in and is dense in (Q)for all k  0 and in LP(Q) for all p such that 1  p < 00. Every sequence which converges in (Q) converges in each of these spaces. We have a similar statement for the spaces k(Q). 2. The Space ' (Q) of Distributions on Q 2.1. Definition of Distributions and the Concept of Convergence in '( Q) Definition 3. A continuous linear form on Q is called a distribution on Q. We denote by ' (Q) the set of distributions on Q. If T E '(Q), we denote by < T, qJ> its value on the function qJ E (Q). To say that T is continuous on (Q) means that for every sequence {qJp }pEN in (Q) such that qJp  qJ in (Q) (1.24) when p  00 we have < T, qJ P >  < T, qJ > in C. 
464 Appendix. "Distributions" We then verify easily that ' (Q) is a complex vector space which (by definition) is none other than the topological dual of (Q). We can supply £0'(Q) with topologies (see Schwartz [1]), but we shall content ourselves, as for £0(Q), with the following notion of convergence of sequences which is sufficient for applications: Definition 4. A sequence of distributions {Tp }pE N converges to the distribution T if for every qJ E (Q) (1.25) < Tp - T, qJ >  0 in C when p  + 00. Remark 3. (1) The question there is one of weak convergence in '(Q); but we shall systematically omit the qualification "weak" when we are concerned with the convergence of a sequence of distributions. (2) We notice that we can prove the following property which follows from a theorem of Banach and Steinhaus (see Schwartz [1]): Proposition 4. If {T p } pEN is a sequence of distributions such that for all qJ E (Q), < Tp, qJ> converges in C when p  00, then the linear form qJ  lim < Tp, qJ > p-+ 00 is an element of £0'(Q). 2.2. First Examples of Distributions: Measures on Q Definition 5. We call every continuous linear form on the space g(Q) of continuous functions with compact support, a measure on Q. The space of measures on Q (the dual of g (Q)) is denoted by '(Q).6 Of course Definition 5 makes sense only if we have made the topology of g(Q) precise. Just as for £0(Q), we content ourselves - and this is sufficient for applications - with the following idea: Definition 6. A sequence {qJp }PE N of functions of g(Q) converges to a function qJ E g if (and only if): (1) there exists a fixed compact set K in Q such that sUPP qJp C K, \lp E N; (2) qJp converges uniformly to qJ on K. Denoting by </1, qJ> the value of a linear form /1 on qJ E g(Q), to say that /1 E ' (Q) reduces to verifying that for each sequence {qJp}PE N with qJp  qJ in g(Q) when p  + 00; we have: ( 1.26) < /1, qJ p >  < /1, qJ > in C. 6 We also use the notation m(Q). 
91. Definition and Basic Properties of Distributions 465 Observing that the convergence of {<Pp}pEN to <P in the sense of '@(Q) implies that of {<PP}PE N to <P in the sense ofg(Q) (this we denote by '@(Q) c; g(Q)), we see that if /1 E '(Q), </1, <Pp>  </1, <P> for every sequence {<Pp}pEN such that <Pp  <P in '@(Q) when p  00. This leads to the observation that /1 E '(Q) defines an element of £0'(Q). Thus ' (Q) c; £0' (Q), the notion of (weak) convergence in ' (Q) being: ( 1.27) { /1p E '(Q)  /1 in' (Q) if for every <P E g(Q), < /1 P' <P >  < /1, <P > in C. Remark 4. It is usual in the case of measures to use the notation /1( <p) in place of </1, <P> for the value of /1 on the "test function" <P E g(Q). We say also that /1( <p) is the integral of the function <p relative to the measure /1 and we write Jl( q» = f.a q>(x) dJl. Naturally we can define /1( <p) for functions other than functions <p E g(Q); for example for functions which are continuous and different from zero in the whole of Q, or even discontinuous functions. This can be done by extension methods, for example, using the density of g(Q) in various spaces. The measurable functions <p for which we can thus extend the definition of /1( <p) are called summable functions 7 for the measure /1. In particular, if the continuous function which is equal to 1 for all x E Q is summable for the measure /1, we say that /1 is summable on Q or summable 8 if Q = [Rn. We again denote by ( 1.28) Jl(f) = L f(x)dJl the integral of a function summable for the measure /1. The set of functions summable for /1 depends of course on /1, but by definition the functions <p E g(Q) are summable for every measure /1 E '(Q). Example 1. Densities with respect to Lebesgue measures. We recall that Lloc(Q) is the space (of classes) of functions, integrable on every compact set in Q. With f E Lloc(Q) given we can define a corresponding measure /1f by: Jl J(q» = f.a j(x)q>(x) dx. ( 1.29) 7 Or (more often) integrable. These functions are such that 1l{1<p1) < + ex). 8 We say also that the measure II is bounded. 
466 Appendix. "Distributions" The correspondence which associates f.1 f with (the class of)9 f is an injection Lc(Q)  f(Q) (and hence into f(Q)). The image of Lc(Q) in f(Q) under this correspondence is called the set of absolutely continuous measures in Q; f is called the density of f.1 f with respect to the Lebesgue measure (which we denote by df.1 f = f(x) dx) which plays a particular role in Q c [Rn. I t is usual to identify the measure f.1 f with its density f (in fact the class of f). We always write f.1 f = f and indiscriminately f.1 f (<p) or <f, <p > to interpret (1.29) and we say that such a measure is afunction. Afortiori, f.1f defining a distribution, we say that this distribution is the function f (in fact the class of the function). Remark 5. We have (1.30) LP(Q) c; Lc(Q) with 1  p  + 00. (1.31 ) Hence every element of LP(Q), 1  p  + 00, defines a distribution on Q (which is a measure on Q). Similarly, every function of class \ 0  k  + 00 defines a distribution (or a measure) on Q. Of course, there exist measures which are not functions. Example 2. The Dirac measure. Definition 7. Let Xo E Q. The mapping <p E g  <p(xo) defines a measure belonging to f(Q) denoted by b xo and called the Dirac measure at the point Xo. We verify immediately in fact that this mapping is linear, that if <Pp - <p in the sense of g(Q), <pp(xo)  <p(xo) in C, the uniform convergence implying the pointwise convergence. Remark 6. The measure b xo was introduced initially by Dirac to meet certain needs in quantum mechanics, as a measure of mass + 1, placed at the point Xo E [Rn and is improperly called "the Dirac delta function" by certain physicists, who manipulate it like a summable density of mass + 1, that is to say write i) 15 xo (cp) = cp(xo) = r cp(x)15(x - xo) dx Jn ii) 1 = r 15(x - xo)dx. Jn Such a "function" x  b(x - xo) has to be zero for x i= Xo and take the value + 00 at the point x = Xo. According to the classical theory of functions, its integral on [Rn shall be zero which is in contradiction with (1.31) (ii). The Dirac measure is thus a measure and it is even the simplest example of a measure which is not a function. 9 g belongs to the class off iff (x) = g(x) a.e. 
 1. Definition and Basic Properties of Distributions 467 (1.32) However we shall abuse the notation and write b(x - xo) for b xo and make use of the formulae (1.31) which ought to be understood in the sense of Remark 4. For Xo = 0, we denote b o by b c5 (cp) = cp(O) = r cp(x)c5(x) dx Jn r c5(x) dx = 1. Jn o Remark 7. Positive measures and positive distributions. Being given J1 E ' (resp. T E gj)'), we can define ji (resp. T) the complex conjugate of J1 (resp. of T') by: ji (<p) = J1( cp) (resp. < T, <p > = < T, cp> ) (cp denoting the complex conjugate function of <p). _ A measure J1 (resp. a distribution T) is real if ji = J1 (resp. T = T). A measure J1 (resp. a distribution T) is said to be positive, denoted by J1  0 (resp. T  0) if J1( <p)  0 for all <p  0, <p E g (resp. T( <p)  0 for all <p  0, <p E gj)(Q)). Of course, a positive measure defines a positive distribution. Conversely we can prove (see Schwartz [1], p. 29) the Proposition 5. A positive distribution is a positive measure. o When the positive measure J1 is summable on Q, we define the total mass of J1 on Q (or the mass of J1 if Q = [Rn) to be the value of J1(I) (1 being the function which is equal to 1 for all x E Q) which is by definition the integral S n dJ1. Thus b xo is a positive measure with mass equal to + 1 (b xo (1) is defined and has value + 1). It would be futile to introduce the space gj)' (Q) if all distributions were measures. Examples of distributions which are not measures will be given later. 2.3. Differentiation of Distributions. Examples The differentiation of distributions (which coincides with the usual differen- tiation of continuously differentiable functions) is the essential property which justifies the introduction of distributions. Definition 8. If T E gj)' (Q) the derivative D i T, i = 1, . . . , n, of T is the distri- bution on Q defined by: (1.33) < D i T, <p > = - < T, D i <p >, i = 1, 2, . . . n. If a E Nn (1.34) < DC/. T, <p > = ( - 1 )Ia l < T, DC/. <p > for all <p E gj)(Q). o 
468 Appendix. "Distributions" From the Definition 8 and the notion of convergence in '(Q) we deduce immediately the Proposition 6. If {T p }PE N is a sequence of distributions which converges to Tin ' (Q) when p  00, then we have: (1.35) for all (X E t\jn, DcxT p  DcxT in '(Q) when p  00. Proof Since for all cp E (Q), Dcxcp E (Q) for all (X E Nn, we have <D cx Tp, cp) = ( -l)lal < Tp, Dcxcp)  ( -l)lal < T, Dcxcp > = <D cx T, cp), when p  00. o Let us now give some examples: Examples 3. Let (1.36) { +l x>O Y:x  Y(x) = o x < 0 ( 1. 3 7) (the Heaviside function). This is a locally summable function which thus defines a distribution. derivative D Y in the sense of ' (lR) is given for cp E (lR) by: (+00 <DY, cp) = - Jo cp'(x)dx = cp(O). The We therefore have (1.38) D Y = b. The derivatives of order m > 1 are thus the m-th derivatives of the Dirac measure at the origin defined by: (1.39) <Dmb,cp) = (-l)m<b,Dmcp) = (-l)mDmcp(O). To define Dmb, it is therefore necessary to consider test functions which belong to the class m on lR. Hence Dmb for m  1 is not a measure and we have here the first examples of distributions which are not measures. In general, a derivative in the sense of ' (Q) of a measure on Q defines a distribution which is not a measure except in the case in which the measure is a sufficiently regular function. Example 4. Derivative of a regular function. Let f be of class  1 and denote by f' its usual derivative. Now let us calculate the derivative Df offin the sense of ' (lR). It satisfies: (1. 40) <Df, cp) = - ff(X)CP'(X)dX; 
1. Definition and Basic Properties of Distributions 469 (1.41 ) as a result of an integration by parts: <Df, <p) = It (x)<p(x)dx. Thus Dfis the distribution (Radon measure) defined by ff, the usual derivative off. Example 5. Derivative of the locally summable function x  Log lxi, the pseudo- function l/x or the principal value of l/x. Let us consider the measure defined by the locally summable function on IR defined by x  Log Ixl. Its derivative in the sense of distributions is defined by f +oo <D Log lxi, <p) = - <Log lxi, <p') = - -00 Loglxl<p'(x)dx = lim - r Loglxl<p'(x)dx e  0 J Ixl  [; . { f + 00 <p(x) - <p( -x) } hm + Log 8[<p(8) - <p( -8)] + dx eO e X lim f + 00 <p (x) - <p ( - x) dx eO e X since (<p(8) - <p( -8))Log8  0, as a result of the mean value theorem. We introduce the notation (1.42) lim f + 00 <p(x) - <p( - x) dx = Pf f + 00 <p(x) dx. eO e X -00 x The symbol Pf denotes the partie finie (the finite part) of the divergent integral f:oo <Px) dx a concept introduced by J. Hadamard for the needs of the theory of hyperbolic partial differential equations. Since we also have (1.43) 1 . f +oo<p(X) - <p( -x) d 1 . 1 <P(X) d 1m x = 1m - X e  0 e X e  0 Ixl  [; x = the Cauchy principal value of f oo <p(x) dx - 00 x 
470 Appendix. "Distributions" we sometimes write (vp for "valeur principale") f + 00 <p(x) f + 00 <p(x) Pf - = vp - dx; -00 x -00 x so we have f + 00 <p(x) f + 00 <p(x) <DLoglxl,<p> = Pf -dx = vp -dx. -00 x -00 x We verify immediately that the mapping <p  pf f +oo <p(x) dx is linear and - 00 x continuous on ([R) (but not on g([R)); it therefore defines a distribution (which is not a measure) which we denote by (1.44) PC  ( or vp  ) and which we call "Pseudo-function l/x" (or "principal value of l/x"). Example 6. Piecewise regular functions. Let {xv} V E Z be a sequence of distinct real numbers such that lim Xv = + 00, v + 00 lim Xv = - 00. v-oo ( 1.45) Suppose that we are given a functionf which possesses the following properties: (1) for all v E 7L,f is infinitely differentiable in the classical sense in the intervals ]xv, X v + 1 [, V E 7L. (2) f and its derivatives of order pEN in the usual sensef(p) have discontinuities of the 1st kind at the points xv, v E 7L (the set of which have zero Lebesgue meas ure in [R). Such a functionf is called a piecewise regular function. We observe thatf and all its derivatives in the classical sense f(p) are in the space Lloc( [R) with the result that f(p), p  010 define distributions (which are measures) which we identify withf(p), p  O. As a distribution,f admits an infinity of derivatives which we denote by DPf, pEN \ {O}. It is interesting to make precise the link which exists between DPf and the distribution defined by f(p) (identified with f (P»). Suppose that Xv is a point of discontinuity of the first kind off (P), p  0, the limit from the rightf(P)(x v + 0) and the limit from the leftf(P)(x v - 0) both exist. We introduce the notation: s P) is the "saltus" off (p) at Xv defined by sp) = f(p)(x v + 0) - f(P)(x v - 0), sO) = Sv (the saltus off at xv), 10 We use the notationf(O) = f 
 1. Definition and Basic Properties of Distributions 471 The distributional derivative Df off is defined by: (1.46) (Df,cp) = -(f,cp') = - r j(x)cp'(x)dx = - I f x v+l f (X)Q/(X)dX J  v Xv for all <p E ; an integration by parts shows that (Df, cp) = I cp(Xv)Sv + r f' (x) cp (x) dx v J (1.47) which is written (1.48) Df = ff + I Svb(x - Xv). v Thus the discontinuitie.s of f appear in the derivative in the form of Dirac measures. Proceeding step by step, we shall have (1.49) DPf = f{P) + I Sp-l) b xv + I sP- 2) bv + . . . + I svbT-l) v v v (where b b2) , . . . , b<j:-l) denote the derivatives of b x = b(x - xv)). v v v v Example 7. The monomial pseudo-functions Pfx, Pfx .11 Let A be a complex number. If Re A > -1, the function x A + = { XA X > 0 (1.50) 0 x < 0 is locally summable and defines the distribution: (1.51) (x, cp> = t+ oo xcp(x)dx. More generally, for arbitrary A E C, we can define Pfx by (1.52) (Pfx, cp> = Pf t+ 00 x cp(x) dx = finite part of the integral L+ 00 x cp(x) dx. ForRd> -1,Pf L+oo xcp(x)dx = L+oo xcp(x)dx and the symbol Pf is redundant. 11 Denoted x and x by Gelfand and Shilov [1]. Here we use the notation of Schwartz [1]. 
472 Appendix. "Distributions" For A i= -l, 1 E N -{O}, we have I + 00 { i + 00 eA. + 1 (1.53) Pf xAcp(x)dx = lim xAcp(x)dx + cp(O) 1 o eO e It. + 1 eA.+2 cp(k) (0) e).+k+l } + cp'(O) A + 2 + . . . + k! A + k + 1 where kEN is defined by: (1.54) Re A + k + 1 < 0, Re A + k + 2 > O. If A = -l, 1 E N \ {O}, in the formula (1..53) we have to replac the term eO /0 by log e. Thus in the case A = - 1, we have: (1.55) ( 1 ) f +oo cp(x) {f +oo cp(x) } Pf-, cp = Pf -dx = lim -dx + cp(O)Loge ; x + 0 X e--+O 0 X in the case A = -2 ( 1 ) f +oo cp(x) Pf 2 , cp = Pf -Zdx x+ 0 x . {f +oo cp(x) cp(O) } = hm -zdx - - + <(J'(O)Loge . e--+O 0 X e The function A  < Pfx, cp) is, for all cp E ([R) an analytic function of the complex variable A except for A = -l, 1 E N \ {O}. We shall verify that (1.56) (1.57) if A i= -l, 1 E N D{Pfx} = A PfX-l (D = d: ). In other words, the rule of differentiation is the classical one for the differen- tiation of a monomial. To verify it, it is sufficient to confirm that (1.57) is true (without the symbol Pf) when ReA> O. Then the functions A  <D{Pfx}, cp) and A  <APfx-l, cp), which are analytic for A i= -l, 1 E N \ {O} and which coincide for Re A > 0, will coincide throughout the whole domain of analyticity, from which (1.57) follows. For A = -l, we verify by a direct calculation that: (1.58) [ 1 ] { I } J b (I) D Pf- = (-l) Pf- + (-l)t_ Xl+ Xl++ 1 l! 
 1. Definition and Basic Properties of Distributions 473 where £5(l) is the I-th derivative of the Dirac delta function. Then when A = -I , 1 E N, we an define PfX=' by: ( 1 ) f o <p(x) Pf I' <p = Pf ---y- dx; x_ -00 x we have: (1.59) [ 1 ] { 1 } £5(l) D Pf - = (-I) Pf - - ( - 1) I -, X'_ X'_+ 1 I! . 1 wIth the result that we can define Pf l by: x (1. 60) 1 1 1 Pf - = Pf - + Pf- x' X'+' X'_ and we have: (1.61 ) D[ Pf l ] = (-I)Pf Xll ' If we wish to define Pf x , there is a place to choose suitably the determination of ( -1 )A. We therefore define: (1.62) A { o x>O x_ = ei1tAlxlA x < 0 (for A = -I, we thus have a good definition of 1/X', x < 0). Then: f o 1 +00 < Pf x, <p> = Pf e i1tA Ixl A <p(x) dx = Pf e i 1t'A Ixl A <p( - x) dx, -00 ° so: (1.63) < Pfx':., <p > = e i1t ). Pf f" Ixl). cp(x) dx = e i1t ). < Pf x'i-, cp > where cp(x) = <p( - x). We are thus able also to define Pfx A or Pflxl A by: (1.64) Pf x A = Pf x + Pf x where (1.65) Pflxl A = Pf x + e- i1tA Pf x. 
474 Appendix. "Distributions" We observe that we come to need to consider the family of pseudo-functions (1.66) 1 Y;. = r(2) Pfx-l, 2 #- -[ (with r denoting the gamma function) Y - I = b (l) if A = -1, 1 EN. For all cp E ([R), the mapping A  Y..l is (thanks to the factor 1jr(A) an entire analytic function of the complex variable A. We then have the differentiation formula: (1.67) DY..l=Y..l- l . We shall see other examples later. 2.4. Support of a Distribution. Distributions with Compact Support (1) Restriction of a distribution to an open set. Let us consider two open sets Q and Q' of [Rn with Q c Q', and let T E '(Q'). We can associate with T a distribution T Q called the restriction of T to Q, defined for all cp E (Q) by: (1.68) <TQ,cp) = <T,(fJ), where (p is the extension by 0 of cp to Q'. It is necessary to pay attention to the following two facts: (a) Contrarily to the case of c'ontinuous functions, if Q is dense in Q', two distinct distributions on Q', can have the same restriction to Q. For example: Let Xo E Q' and Q = Q' - {xo}, the null distribution and the Dirac measure b(x - xo) ha ve the same restriction to Q. (b) Not every distribution on Q is the restriction to Q of a distribution on Q'. For example, let Q' = [R and Q = [R\ {O}; the distribution defined on Q by the OO-function x  e- 1 / x is not the restriction to Q of any distribution on [R. We now give a criterion for "recognition" of a distribution which is often rather useful: Proposition 7. Let Q be an open set of [Rn and E a locally convex topological vector space of functions on Q satisfying: { (Q) is contained in and is dense in E, (1.69) every convergent sequence in  (Q) converges in E. Then the continuous linear forms on E are identified with distributions. The identification is made in the following manner: let T be a continuous linear form on E, then the restriction of T to (Q) is the distribution we are looking for. 
 1. Definition and Basic Properties of Distributions 475 (2) Support of a distribution. We can easily verify that if the restriction of a distribution to each of two open sets is null, then its restriction to their union is null. This allows us to define the "largest open set" in which a distribution T is null: it is the union of all the open sets such that the restriction is null. We then have the Definition 9. Let T E q)'(Q); the support of T, denoted by supp T is the complement of the largest open set OJ of Q such that the restriction of T to OJ is null. For example, the support of b xo ( = b(x - xo)) is the set consisting of the single point xo. The support of T E q)'(Q) is a closed set of Q (for example Q itself). For a certain number of questions, it is useful to know how to characterise the distributions with compact support in Q; this is what we shall now do. (3) Distributions with compact support. The space &'(Q). We denote by &(Q) (or OO(Q)) the space of infinitely differentiable functions on Q; &(Q) is a Frechet space when it is provided with the family of seminorms: Ilullm,K = sup IDiXU(x)1 lalm XEK where m ranges over Nand K over a denumerable family of compact sets in Q, increasing and "tending" to Q; a sequence {Up}pE N converges to U o E &(Q) if for every compact set K and all a EN, the sequence {DiXu p } pEN converges uniformly on K to DiXu O . We have the Proposition 8. Let! E q)'(Q) have compact support; there exists a unique con- tinuous linear form T on &(Q) whose restriction to q)(Q) is T. Thus, let T E q)'(Q) and let <p E &(Q); we take t/J E q)(Q) to have the value 1 on a neighbourhood of supp T; we put -- < T, <p > = < T, <pt/J >; we verify easily that the linear form T on & (Q), thus defined on & (Q) IS continuous and that it does not depend on the choice of t/J. D Conversely, we have -- Proposition 9. Let T be a continuous linear form on &(Q); its restriction T to  (Q) is a distribution with compact support. Proof From Proposition 7, we see that it suffices to verify the support pro- perty of T. Now every open set Q of [Rn can be written as the union of an 
476 Appendix. "Distributions" increasing sequence of compact sets: Q = U Kp. peN Let w p = Q\Kp (complement of Kp in Q); if supp T is not compact, the restriction of T to w p , for all p, is not null. We can therefore find qJp E  (w p ) such that (T, qJp> = 1. Now the sequence (qJp)pE N converges to 0 in If (Q), from which it follows, in an obvious fashion that we have a contradiction. D We denote by If'(Q) the set of distributions with compact support in Q. Proposition 10. The space If' (Q) is identified with the space of distributions of If' (rR n ) with compact support contained in Q. The verification of this proposition presents no problem. Proposition 11. If T is a distribution with compact support S T, then its deriva- tives DCI. T have compact supports included in S. Proof Let ill = Q \ ST the complement of ST in Q. We have by hypothesis ( T, qJ > = 0, v qJ E  (ill), from which we deduce that (DCl.T, qJ> = (_1)1Cl.1 (T, DCl.qJ> = 0, v qJ E  (w ). D 3. Some Elementary Operations on Distributions 3.1. Product by a Function of Class CCCX) Definition 10. Let t/f E If (Q); for all T E ' (Q) we define the distribution t/fT (called the product of T by t/f) by the formula: (1.70) (t/fT, qJ> = (T, t/fqJ > v qJ E f0 (Q). D (1.72) We verify immediately that (1.70) generalises the product of a function by a function of class  00 . Among the properties of this product, we can point out (1.71) Dj(t/fT) = (Djt/f)T + t/fDjT, j = 1,2, . . . , n, the formula for differentiating a product and more generally Leibnitz's formula D CI. ( t/f T) = L ( rJ. ) D CI. - P t/f D P T, rJ., f3 E I\J n, pCI. f3 
91. Definition and Basic Properties of Distributions 477 where P  li means Pi  lii for i E {1, 2,... ,n} and ( P li ) = ( P li 2 l ) ( P li2 2 ) . . . ( P lin n ) with ( li i ) = lii! . Pi Pi! (lii - P J ! Finall y (1.73) supp ( t/J T) = supp t/J n supp T. 3.2. Primitives of a Distribution on an Interval of IR Let / be an open interval of rR; first of all we show the Proposition 12. The equation (1.74) DT = 0 admits the constant functions as its only solutions. Proof The constant functions are obviously solutions of (1.74). Conversely, if DT = 0, we have: (1.75) <DT, qJ> = - < T, qJ' > = 0 for all qJ E  (/). We verify easily that a necessary condition for qJ E  (/) to be the derivative t/J' of a function t/J E  (/) is that (1.76) i qJ(t)dt = O. From (1.75), T a solution of (1.74) satisfies (1.77) < T, qJ> = 0 for all qJ E  (/) satisfying (1.76). We notice then that the space of functions of  (/) satisfying (1.76) is of codimension 1 in  (/) and that a supplement is generated by a function () E  (/) such that (1.78) i 8(t)dt= 1. In effect, every function qJ E (/) can be written in the form: (1.79) qJ = [i qJ(t)dt] 8 + IjJ t/J E (/) and satisfying (1.76). 
478 Appendix. "Distributions" With this notation we have (T, f{J) = [i f{J(t)dt] (T, 8) + (T, 1/1) = i (T, 8)f{J(t)dt, and T is identified with the constant < T, (). D From this result we deduce the Proposition 13. A distribution S E ' (/) admits an infinity of primitives (that is to say there is an infinity of distributions T E '(/) such that DT == S). Two such primitives differ by a constant. Proof We have to solve (1.80) DT == S , or agaIn <DT, qJ) <S, qJ) for all qJ E  (/), that is to say (1.81) < T, qJ') ' - <S, qJ) for all qJ E  (/); T is thus known on the subspace of the functions of  (/) satisfying (1.76). It suffices then to arbitrarily prescribe < T, () in the preceding proof, where () E  (/) and satisfies (1.78). The result follows. D Remark 8. Distributions whose derivatives are measures. It is interesting to characterise the distributions which are the primitives of a measure. We have the following proposition for a proof of which we refer to Schwartz [1]. Proposition 14. (1) For a distribution of '([R) to have a measure as its derivative, it is necessary and sufficient that it be of bounded variation on every finite ntervaI12. (2) For a distribution to have a positive measure as its derivative, it is necessary and sufficient that it be an increasing function ("in the large" sense). We shall make use of this result in the proof of Bochner's theorem (see 3). Further, we have the following theorem 12 We recall that the variation of a function f on a bounded interval is defined as the quantity: Var(f) = sup I If(xJ - f(x i - dl where the supremum is taken over all the sub-divisions i a = Xo < Xl < . . . < X N = b. 
g 1. Definition and Basic Properties of Distributions 479 Theorem 1. (1) If, almost everywhere, a continuous function f has a derivative g (in the usual sense) summable on every compact set of and iff is the indefinite integral of g, the distribution f has the distribution g for its derivative. (2) If a distribution has for its derivative a (locally summable)function g, it is itself an absolutely continuous function f, an indefinite integral of g. It has g(x) as its derivative (in the usual sense) almost everywhere and at every point x where g is continuous. It follows that if the derivative of order k + 2 of a distribution is a measure, the derivative of order k is a continuous function 13 . We then have the: Corollary to Theorem 1. A distribution all of whose derivatives are measures is a function of the class  00 in the usual sense. This corollary can be extended to the case of several variables, but we shall not study this case here; for the sequel we refer to Schwartz [1J. Linear Differential Equations with Constant Coefficients We have the Proposition 15. The solutions in ' (I) of the differential equation with constant coefficients (1. 82) T (m) + am _ 1 T (m - 1) + . . . + a o T = 0 are the usual  00 solutions of this equation. Proof. We use the fact that the mapping (1.83) u  eAXu is, for all A E C, a bijection of '(I) onto itself with inverse mapping u  e - AX u. If A is a solution of the equation A m + am _ 1 A m - 1 + . . . + a o = 0, the mapping (1.83) transforms the equation (1.82) to an equation of order m - 1 for u', from which the result follows by recurrence from Proposition 12 (the preceding method is also called the method of "variation of parameters"). o To complete this section we give the following additional result. 13 For other results of this type see g3.2 of Chap. II (vol. 1) 
480 Appendix. "Distributions" Proposition 16. The solutions of the equation (1.84) xT = 0 in ' (I) (I an interval of [R containing 0), are the distributions (1.85) T = c where c E C. Proof Every distribution of the form (1.85) is obviously a solution of (1.84). Conversely, suppose that T satisfies (1.84), we fix e E (1) such that e(O) = 1; then every function <p E (1) splits up in the unique fashion <p = <p(O)e + t/J where t/J E  (I) and satisfies t/J(O) = 0; we deduce that the function X defined by (1.86) t/J(x) X ---+ -, X: x o ---+ t/J' (0) x =1= 0 is also in  (I); we then have < T, <p) = <p (0) < T, e) + < T, t/J) = < T, e) < , <p) + < T, x X) = <T, e) <, <p) + <xT, X) = <T, e) <, <p) from which we deduce the solution we are seeking. o 3.3. Tensor Product of Two Distributions Let Q be an open set in [Rn, Q' an open set in [RP, f a function of Q and g a function on Q'; we denote by x the variable in [Rn, by y the variable in [RP. We give the name tensor product of f and g to the function defined on Q x Q' by f (8) g(x, y) = f(x)g(y). If now we have two distributions, 8 E '(Q) and T E '(Q'), we can state the following result, which serves as the definition of the tensor product of two distributions: Proposition 17. Let 8 E '(Q) and T E '(Q'); there exists a unique distribution 8 (8) T satisfying 8 (8) T E '(Q x Q') and such that (1.87) <8 (8) T, <p (8) t/J) = <8, <p) < T, t/J) for all cp E  (Q) and all t/J E  (Q'). 
91. Definition and Basic Properties of Distributions 481 Denoting by Sx (resp. Ty) "a distribution of the variable x" (resp. "of the variable y"), we then have (1.88) { for all qJ E 9fi(Q x Q') <S (8) T, qJ) == <Sx, < Ty, qJ(x, y) ) and also (1.89) <S Q9 T, qJ) == < Ty, <Sx, qJ(x, y) ). Further, we have the following properties of the tensor product: (1.90) supp S (8) T == supp S x supp T (thus, for example: (1.91) l5(X) (8) l5(y) == l5(x, y), and (1.92) D D: (S (8) T) == D S (8) D: 'T Remark 9. (1) Of course, the tensor product defined above coincides with the tensor product of two functions when Sand T are functions. (2) Not every distribution on Q x Q' is the tensor product of a distribution on Q and a distribution on Q'. o 3.4. Direct Image and Inverse Image of a Function and of a Distribution by a Function of Class ft 00 Let h be a mapping of class  00 from an open set Q in [Rn into an open set Q' in [RP. We seek to see if we are able, and how, to transform functions and distributions by h. Functions are transformed naturally by the inverse image. With every qJ E C (Q') we associafe the inverse image of qJ by h, h*qJ E C (Q) defined by (1.93) h* qJ == qJ 0 h for all qJ E S (Q'). If, in addition, h is a proper mapping, that is to say is such that the inverse image h -1 (K) of every compact set K in Q' is a compact set in Q, then h*qJ E  (Q) for all qJ E  (Q'). The direct image h* T of T E S' (Q) by h is the distribution with compact support on Q' defined by (1.94) for all qJ E S (Q'). <h* T, qJ) == < T, h* qJ) 
482 Appendix. "Distributions" If, in addition h is proper we see that (1.94) allows us to define h* T for all T E '(Q) (h*T then belongs to '(Q')) since, in this case, for all cp E '(Q), h*cp E  (Q). For certain quetions, we shall need to define inverse images 14 of distributions. In the case in which h is a diffeomorphism, we then define the inverse image h*T of T E '(Q') by (1.95) h*T = Idethl-l(h-l)*T where Idet hi is the absolute value of the Jacobian determinant of h. This extends to much more general conditions on h. 15 4. Some Examples 4.1. Primitives of the Dirac Measure We recall that the Dirac measure at the point Xo is defined by l5 xo == l5(x - xo): cp --+ cp(x o ). We shall denote by { 1 if x > Xo (1.96) Y XQ = Y(x - xo) = 0 if x < Xo the translate by amount Xo E IR of the usual Heaviside function defined in the examples of 1.2.3. We have, of course: (1.97) { l5xo == l5(x - xo) == D Y(x - xo) DP l5 xo = l5): cp --+ (- l)P cp(p)(x o ), V qJ E  (IR). Let us now consider the family of distributions {Vm}mEz defined by ( 1.98) (x - xo)lml V m - 1 == Y(x - xo) for' m \ Iml! V-I = Y(x - x o ), V o = l5(x - x o ), V m = l5(m)(x - xo) for m  O. -1 , 14 Especially when we are led to make changes of variables or to seek the transform of a distribution by the action of a symmetric group. The idea of an inverse image of a distribution extends in a natural way (by continuity) the concept of an inverse image offunctionsf E Lloc (Q') (defined as in (1.93)). We should notice that even when h* h* T and h*h* T have a meaning, h*h* T and h*h* T are in general different from T. With (1.95) we have in particular h*h* T = Idet hl- l(h - l)*(h)* T = Idet hl- 1 T. 15 See, for example, Schwartz [1], Chap. IX. 
91. Definition and Basic Properties of Distributions 483 It is clear that ( 1.99) dkU m _ dX k U m - k . 4.2. A Division Problem (Case n == 1) A problem analogous to that of looking for primitives is that of the division of a distribution by a function. This problem is important in applications; we shall here consider a very simple example and refer the reader to Schwartz [1] for a more thorough study. Statement of the problem. We wish to find T E ' (IR) such that (1.100) xT == 1. We remark, first of all, that if we are working in the open set Q == IR \ {O}, the solution in '(Q) is none other than the function l/x. It is thus only in the vicinity of 0 that T is unknown and we are not able to affirm its existence. We notice that if T 1 and T 2 are two solutions of (1.100), we have: x(T l - T 2 ) == 0 and from Proposition 16: (1.101) T 1 == T 2 + cl5, C E C. Hence if we know a particular solution of (1.100), we know all the solutions by ( 1.1 01). By the rules for the multiplication of a distribution by an infinitely differentiable function, we can remark that we have: \ 1 ) \ 1 ) f + 00 xcp f + 00 xPf-,cp == Pf-,xcp ==Pf -dx== cp(x)dx x x -00 x -00 for all cp E  (IR), with the result that x. Pf l/x == 1. Thus all the solutions of (1.100) are given by (1.102) (1.103) 1 T == Pf - + cl5, x c an arbitrary complex constant. In a general manner, for every integer I E f\J \ {O} the solution of: (1.104) x I T == 1 is given by (1.105) 1 T==Pf-+ Xl L c l5 (p) p , Opl-l (c p arbitrary constants). [In effect, if T' is another solution of (104), 
484 Appendix. "Distributions" xl(T - T') = 0 so supp (T - T') = {O} with the result that T - T' = I c p l5(P)]. Opl-I 4.3. Derivative of a Funclion of R ft Discontinuous on a Surface Let Q be a bounded open set in [Rn, r its boundary of class oo (a closed infinitely differentiable hypersurface) and Q locally on a sinle side of r. Let f be_ an infinitely differentiable function in the usual sense in Q (i.e. the resriction to Q of an infinitely differentiable function in [Rn) and null for x E [Rn \ Q. Thus f is a function having a discontinuity of the first kind along r. We denote  by DPf the usual derivatie of order Ipi = PI + . . . + Pn of f (which is only defined in Q and in [Rn\Q as a function of class OO) and which also has a discontinuity of the. first kind all along r. Taking as a basis in [Rn, (e I , . . . , en), I e j I = 1, j = 1, 2, . . . , n, and denoting by n the outward normal to r, we put: n j = (n, e j ) = n. e j , j = 1, 2, . . . , n. We denote by dr the surface measure on r. Then D j f is defined by <Djf,cp) f acp -<f,Djcp) = - f(x)- a dx n x j - f rf(x)qJ(x)n j dr + f n DJ(x)qJ(x) dx. ( 1.1 06) Therefore  (1.107) Djf = Djf + J.1,  so Djfis the sum of the distribution defined by Djf,j E [1, 2,  . . , n] and of the measure J.1 possessing the surface density - f(x)n j and satisfying supp J.1 - r. 11 a2 a 2 a 2 d . h L 1 . . ITl)n More genera y A = -z + -z + . . . + _ a 2' enotlng t e ap aCIan In lI\\ , ax 1 aX 2 X n we have (1.108) (L1f, qJ) = (I, L1qJ) = f n f (X)L1 qJ (X) dx and ( 1.1 09) f f acp f af (L1f,qJ) = nqJL1 f dx + r f an dr - r qJ on dr for all cP E [Rn, 
 1. Definition and Basic Properties of Distributions 485 with a a CP = grad cP . n = f a acp ni = the derivative following the normal n. n i = 1 Xi Equation (1.109) states that the Laplacian L1fin the sense of distributions is the  sum of the distribution defined by the usual Laplacian L1f and of a measure f.1 af with supp f.1 = r which is a density defined by -- and of a distribution an cP --+ f r f : dr carried by r. (1.111) 4.4. Distributions Defined by Inverse Image from Distributions on the Real Line 1. Inverse image of a distribution T on (an open set of) IR by a mapping p:Q c IR n  IR. Let p be a reoo-mapping of an open set Q E IR n onto an open set J of 1R,16 such that Igradpl(x) =/:. 0, Vx E Q. (In numerous application J = IR\K and Q = IRn\K' with K and K' finite sets of points of IR and IR n resp.). For every functionfE reO(J) (orfE Ltoc(J)), we put (1.110) p*f = fop E reo (Q) (or Ltoc (Q)). We then define a distribution on Q by: < p* f, cP > = r f{p{x)) cP (x) dx, Jn v cp E  ( Q ). (1.112) We can show (from the theory of integration 1 7_ see, for example, Chap. 6 of Bourbaki [1] - and from differential geometry18 and this corresponds to our making a change of variable in which p is taken as the independent variable) that for p E J, there exists on the surface Sp = {x E Q: p(x) = p} (a manifold of dimension n 1) a unique positive measure w p such that: <p*f,cp) = f d Pf(P) f cp(x)wp. J Sp Denoting by dS p the surface measure on S p' we verify easily that w p is of the 16 We can generalise these conditions (e.g. taking I, J to be rc oo-manifolds of dim. m. n). 17 We use the disintegration of the measure dx by the mapping p (see Bourbaki [1]). 18 We use the correspondence between measures and differential forms: denoting by dx = dX 1 A . . . A dXn a differential form corresponding to the Lebesgue measure on [Rn, we define a (unique) [differential form OJ p on the surface Sp = {x E Q: p(x) = p} by dp A OJ p = dx (with A denoting the exterior product, see Gelfand-Shilov [IJ): we obtain n op OJ p = Igradpl-2 I (-I)j-l-dx 1 A ... A dx j - 1 AdX j + 1 A ...Adx n . j = 1 OX j 
486 Appendix. "Distributions" form w p = (XdS p with the function (X defined by the (a priori formal) formula: dp dp dx = dpw p = dn (X dn dS p = dn (Xdx (where n is the unit normal oriented in such a way that the normal derivative of d ( d ) -1 P' d: ispositive,fromwhichll. = d: = Igradpl- 1 ,w p = Igradpl- 1 dS p ' It should be observed that w p depends essentially on the function p. For all cp E (Q) let us put (1.113) Mcp(p) = f cp(x)wp. sp The mapping M cp: p E J --+ M cp(p) (denoted also by p* cp or again pcp and called the direct image of cp by p) is such that M cp E (J), V cp E (Q). Finally we .can write (1.112) as (1.114) (p*f,cp) = L dpj(p)Mcp(p). This formula enables us to define the inverse distribution p* T for every distri- bution T on J by ( 1.115) < p* T, cp) = < T, M cp ), for all cp E (Q).19 If (fn) is a sequence of continuous functions on J, which converges to T in '(J) then, by (1.114), p* fn converges to p* T in '( Q). The mapping T E '(J) --+ p* T E '(Q) thus appears as the transpose of the mapping M = p*: cp E (Q) --+ Mcp E (J). ( 1.116) Remark 10. (i) If Ko is the support of T E '( J), then supp p* T = P -1 (Ko). (ii) We can immediately verify the differentiation formula  (p* T) = ap p*T' ax. ax. l l (where T' is the derivative in '( J) of the distribution T) which generalises the differentiation of composite mappings. Indeed, for allfE 1(J) we have a ( ap ) ax; (foP)(X)= ax; f'op(x) 19 Or again < p* T, q» = < T, p* q», V q> E (.Q). 
 1. Definition and Basic Properties of Distributions 487 and thus for all cp E '@(Q): i 0 f ap ( op I ) _ 0 fa p(x)cp(x)dx = f' a p(x)- o cp(x)dx = _ 0 p*(f),cp . [Rn Xi Xi Xi From this we deduce (1.116) (because of the density of  1 (J) in .@' (J)). o 2. Application. Dirac distribution concentrated on a surface S (a) Let Po E J. The inverse distribution of the Dirac distribution at Po is then (from (1.115)) given by (1.11 7) < p * J Po' cp > = < J Po ' M cp > = M cp ( Po). This distribution is also denoted by J(p(x) - Po) or even more improperly by J (S Po) (it is called the Dirac distribution of the surface S Po; its support is the surface S Po ). By comparison of (1.117) with (1.114) (or (1.115)) we see that a knowledge of the family of distributions (J (S p))p E J is essential for the determination of every inverse distribution p* T of T by p. We should notice that in a system of "local orthogonal" coordinates (u i , . . . , Un-i) of Spo (thus such that (u i , . . . , u n - i , n)20 form a system of local orthogonal coordinates of n in the vicinity of Spo) the expression (1.113) (or (1.117)) for J(Spo) is given by: (1.118) <lJS po ' qJ> = r I 1d I qJ(u 1 , . . . , U n -l, Po)du 1 . . . dU n - 1 J [Rn - 1 gra p ( or again < lJS po ' qJ> = r 1d I qJ(X', Po)dx' ) . J [Rn - 1 I gra p (b) Differentiation of piecewise continuous functions on Q whose discontinuities follow a surface Spo Let Qpo = {x E Q: p(x)  Po} and let XQpo be the characteristic function (in Q) of this set, and Y po the characteristic function in  of the set [Po, + 00 [. We then have, in an obvious way XQ = Y p a p = p*( Y p ). po 0 0 Then, applying the differentiation formula (1.116) we obtain o ( ) _ 0 p * ( I ) _ 0 p * ( J ) _ qp J ( ) _ a XQpo - _ 0 P Y po - _ 0 P Po - _ 0 Spo Xi Xi Xi Xi (since Yo is the Dirac distribution J p = J(p - Po) on ). Thus the Dirac 20 Where n is the normal to Spo; in this coordinate system Spo is identified with the plane X n = Po. 
488 Appendix. "Distributions" distribution of the surface S Po is (to within the factor ::j ) the derivative of the characteristic function XQ of Q p : Po 0 (1.119) a ap ap _ a (XQpo) = _ a £5(Spo) = _ a £5(p(x) - Po). Xi Xi Xi This formula is clearly in agreement with the formula (1.107), so that we can write here by introducing the function pS (defined in the vicinity of S Po) such that we have at the same time pS(x) = Po on Spo and since s = 1, Igrad pSI = 1, aps grad pS = n, normal to S Po' ax. = n i l (1.120) - a 8 (XQ ) = ni(j( pS(x) - Po) = ni(j(So)' x. Po l More generally, the'formula (1.107) can be written for a functionfcontinuous on Q p and on n\Q p , having a discontinuity [f]s 21 on crossing S p : o 0 Po 0   (1.121) Dif = Dif + [f]Sponi£5(So)' Certainly, this relation is directly linked with the Stokes formula as we have seen in Sect. 4.3. (For other generalisations of this formula see Schwartz [1]). (c) Successive derivatives of the Dirac distribution on S Po' We can also define for Po E J, the successive derivatives £5(k)(p(X) - Po) (again denoted by £5(k)(Spo)) of the Dirac distribution £5(p(x) - Po) = £5(Spo) for kEN, by the inverse image by p of the distributions £5(k)(p - Po) on J: (1.122) def £5(k)(p(X) - Po) = £5(k)(Spo) == p*(£5) = p*(£5(k)(p - Po)) or again with the aid of (1.115) d k (1.123) < (j(k) (Spo)' q> > = < (j(k) (p - Po), M q> > = ( -1)k dpk M q>(po). 22 3. Some Examples (a) Case in which p is the mapping. x = (x l' . . . , X n - l' X n ) = (X', X n ) E R ---+ X n E . 21 Here [f(x)] = Urn f(x') - lim f(x"), spo x' - X x" _ x \:Ix E Spo. p(X') > Po p(x") < Po 22 For other expressions for b(k)(Spo)' see Gelfand and Shilov [1]. 
91. Definition and Basic Properties of Distributions 489 For this simple example, we have J = , Q = n. The surfaces Sp are the planes X n = p. The mapping T E .@' ()  p* T E .@' (n) has for its image the set of distributions independent of the variables Xl' . . . , X n _ l' that is to say the set of distributions invariant under the translation La, a E ,- 1 (this is a closed vector subspace of.@' (n). We denote generally by T the element p* T E .@' (n) (as for functions). The Dirac distributions l5(Sp) and its successive derivatives l5(k)(Sp) are then given (with the aid of (1.118) and (1.123)) as ( 1.124) < (j ( x n - p), cp) = r cp ( Xl' . . . , X n - l' P ) dx l' . . . , dXn - 1 J n - 1 = r cp ( X', p) dx' J n - 1 i ak<p < l5(k) (Xn - p), <p > = ( - 1 )k _ a k (X', P ) dx', V <p E .@ ( n). n - 1 X n We give, by way of example, the distribution Pf(xn) on n, defined by < Pf(xn)' cp) = < Pf p, t,-, cp (x', p )dx'), V<p E .@( n). o (b) Case in which p is the mapping X E n  Ixi E +. In this example, we have J = ]0, +oo[ = +, Q = n\{o} - :. The sur- faces Spare the spheres j X I = p. The mapping T E .@'(  +)  p* T E .@'( ) has for image the set of distributions on  which are invariant under rotations. The Dirac distributions l5(Spo) concentrated on the spheres Ixi = Po(Po =1= 0) and l5(k)(Spo) are obtained simply by passing to spherical coordinates: Putting <p(x) = <p(lxl, 8), 8 E sn -1, V<p E .@( ), we have (1.125) < l5(lxl - Po), <p> = M <p(Po) = f <p(Po, 8)p-1 dB = f <p(x)dSpo sn-1 S PO (1.126) (we have here with the notation of (1.113), OJ p = dS p = pn - 1 de), and also < (j(k)(lxl - Po), cp) = (-l)k dkcp) (Po) f ak = (- l)k a k [<p pn - 1] I p = po d8 sn-1 p ( - l)k f ak = n - 1  [<p pn - 1] I p = po dS Po' Po s PO P 
490 Appendix. "Distributions" We observe that if we replace the mapping p:x  Ixl by the mapping p: x  Ix1 2 , (the sets J, Q are then unchanged), the Dirac distributions l5(p(x) - Po) = l5(lxI 2 - Po) (concentrated on the sphere Spo) and l5(k)(p(X) - Po) = l5(k)(lxI 2 - Po) are given by (1.127) < l5(lxI 2 - Po), qJ > - ) _ ) 1  1   = (Mcp (Po = 2$0 Mcp(y Po) = 2$0 <u(lxl - yPo),cp) = - 2 1 f qJ($o,8)p-2)/2d8, sn -1 and (1.128) < b(k)(lxI 2 - Po), cp) = < bo' M cp(p) = (-l)k dk M cp(Po) or, agaIn (1.128)' < l5(k) (I x 2 1 _ P6), qJ > = (_ 1 )k (   ) k ( M qJ ( p ) ) . 2p dp 2p p = Po Remark 11. We should remark that these formulae agree with the fact that the mapping ,0 can be written ,0 = Pop where P is the function P (A) = A 2 on  + . Indeed we have, by the transitivity of the composition { *T _ p*(P*T), VTE /(+) p * qJ - P * (p * qJ), V qJ E  I (  ). (1.129) We verify immediately that (1.130) 1 PA/(2) = JI rfJ(JI), 2 A v t/J E  I (  + ), so (1.131) P*Cp(Po) = 2 (p*cp)(jP;;), Po V qJ E  I (  ), in agreement with (1.127). (c) Case in which p is the mapping x = (x', x n ) E n - 1 X   x; - X'2 E . The case in which n = 4 is of particular interest for physics. (In this case we write X 4 = t, and x' = x, or we equally well use the dual variables of the Fourier transform). 
 1. Definition and Basic Properties of Distributions 491 We have J = \ {O} = *' Q = n\ {O} = . The surfaces Sp are the hyperboloids x; - X,2 = p, of one sheet if p < 0, of two sheets if p > 0, and the cone x 2 - X,2 = O. To determine the Dirac distributions £5(Sp), we calculate the integral r f( x;; - x,Z) cp (x', x n ) dx' dX n by changing the variables and splitting the Jn domain of integration into two parts * = {x = (x', x n ) E : p(x) = x; - X,2  O} (called "the light cone") and  \ * = {x E : p(x) < O}. On *' the mapping x = (x', x n )  (p(x), x') E + X [Rn-l\{O} is bijective, the abs olute val ue of the Jacobian of this mapping being 1 J I = 21 X n I = 2 J (p + X,2), (i= 0). On  \  *, there exist points at which the Jacobian of this transformation vanishes; also it is preferable to make the following change of variable: (1.132) x = (x', x n ) E  \*  (p, OJ, t) E; X sn-2 X  with x' = rOJ, r = Ix' I, t = X n , P = t 2 - r 2 so r = (t 2 _ p)1/2. With the notation introduced in (1.111), (1.114) and (1.117) we easily obtain: (i) if p < 0, V qJ E 92 (  ): (1.133) < £5(p(x) - p), qJ> = < £5(Sp), qJ> = M qJ(p) =  f qJ(rOJ,t)rn-2dOJdt(withr = (t 2 - p)1/2); 2 sn - 2 X  (ii) if p > 0, V qJ E 92 (  ): (1.134) < £5(p(x) - p), qJ> = < £5(Sp), qJ> = M qJ(p) r dx' = J.-1 [cp(x', - (p + X'Z)l/Z) + cp(x',(p + X'Z)l/Z)] 2(p + X,Z)l/Z o In physics, we usually have p = m 2 > O. The image set of 92' (*) by p*: {p* T E 92' (), V T E 92' (*)} is also the set of distributions on [R which are invariant under the group G of linear transformations in  leaving invariant the quadratic form p(x) = x; - X,2 (i.e. for n = 4, G is the Lorentz group). We observe that for p > 0, we can split the Dirac distribution £5(Sp) into (1.135) £5(Sp) = £5(S:) + £5(S;) where S: (resp. S;) denotes the positive sheet i.e. with X n > 0 (resp. negative, 
492 Appendix. "Distributions" x n < 0) of the hyperboloid Sp, with 36 ) () ( :!:: ) > - i ( ' ( , 2 ) 1/2 ) dx' (1.1 < S p , qJ - qJ x, + p + x 2( + ,2 ) 1/2 . IRn - 1 P X These distributions can also be obtained directly as the inverse image of () p by the mapping p + (resp. p _) defined by: p:!:: = pllR\+, with :!:: = {(x', x n ) E , Ix' I < + x n } (+ is called "the future cone" and  _ "the past cone"). (These distributions are invariant under the elements of G which conserve :!::, called (in the case n = 4) proper Lorentz groups). For further developments on this example, refer the reader to Methee [1] to Schwartz [1] and to Faraut [1]. 2. Convolution of Distributions In this section we shall generalise to distributions the definition of the convol- ution product of two continuous functions one of which has compact support, recalled in  1.1. We shall not be interested in the extension of the convolution product to functions of the class LP (n) nor to summable measures (which are used in particular in probability theory) other than as particular cases of distributions. Also, we shall not study the convolution product on groups other than n. [We follow very closely, 1.6 of Chap. 1 of Hormander [1] to which we refer the reader. ] The reader should also consult Schwartz [1] and Treves [2]. 1. Convolution of a Distribution on (Rft and a Function of  ((Rft) Let T E ' (n) and qJ E (n); we shall put, for all x E n: (2.1 ) ( T * qJ ) ( x) = < T ( y), qJ (x - y) >, the notation of the second member denoting the value of the distribution T on the function qJx E  ( n) defined, for all y E n by qJx ( y) = qJ (x - y). This definition obviously extends that given in 1.1.2. Proposition 1. Let T E ' ( n) and let qJ and t/J be long to  ( n ). (a) T * qJ E  00 (n) and D a ( T * qJ) = (Da T) * qJ = T * Da qJ. (b) supp( T * qJ) C supp T + supp qJ. (c) T * (qJ * t/J) == (T * qJ) * t/J = (T * t/J) * qJ. 
2. Convolution of Distributions 493 These properties are practically obvious and generalise the properties (1.5), (1.6) and (1.7) of  1.1 (see the works cited there for a proof). One of the interesting points about the convolution which we shall define rests in the possibility of "regularising" distributions, i.e. of approaching them as limits of regular functions. Proposition 2. Let Q be an open set of [Rn and T E [i)' (Q); then there exists a sequence { qJ p } pEN of functions in [i) (Q) converging, in [i)' ( Q), to T. Proof As in numerous proofs of density (see Proposition 2 of 1 for example) we can proceed by "truncation and regularisation". (a) (Truncation). Let (K p ) be a sequence of compact sets in Q whose union is Q; let {t/Jp} a sequence of functions of [i) (Q) satisfying, for x E Kp: t/Jp(x) = 1; (such functions exist: see Lemma 3,  1.1.3). The sequence {t/J p T} obviously converge to Tin [i)' ( Q). This allows us to assume in the remainder of the proof that T is of compact support in Q. (b) (Regularisation) Let T E g'(Q) and K = supp T, we consider a sequence ( 8 p) of functions of [i) ( [Rn) satisfying (2.2) supp8 p c B(O, lip) r Op(x)dx = 1 Jn (such a sequence exists, see the proof of Proposition 2, 1). We see that for p sufficiently large, T * 8 p E [i)(Q) (Proposition Ib), (2.1) and we verify without difficulty that T * 8 p converges to T in [i)' (Q). o Remark 1. We should be able in the same fashion as above, define the convolution of a distribution with compact support and arc OO-function (without condition on support). Let T E g' ( [Rn) and qJ E g( [Rn), we then write as in (2.1) (2.3) ( T * qJ) ( x) = < T (y), qJ (x - y) >, and we verify without difficulty the assertions of Proposition 1 in this case. As far as regularisation is concerned we bring forward the following: - if qJ is analytic on [Rn, then T * qJ is analytic, - if qJ is a polynomial, then T * qJ is a polynomial. We shall find a verification of these facts in Schwartz [1], Chap. 6. o Remark 2. The convolution product which we have defined has properties of continuity with respect to its factors. We can verify that: 
494 Appendix. "Distributions" - the mapping (2.4) TT*cp of / ([Rn) into $ ([Rn) is continuous (in the following sense: cp is fixed, if {  } is a convergent sequence in ' ([Rn), then {* cp} is a convergent sequence in $ ( [Rn ) ), - the mapping (2.5) cp  T * cp of ([Rn) into $( [Rn) is continuous. We can give results similar to those of Remark 1 for the convolution. The proofs will be found in Hormander [lJ, 1.6 or Schwartz [lJ, Treves [2J. o 2. Convolution of Two Distributions of Which One (at Least) is with Compact Support The definition is based on the following Proposition 3. Let U be a continuous linear mapping of  ( [Rn) into rc 00 ([Rn), which commutes with the translations; then there exists a unique distribution T E ' ([Rn) such that U(cp)=T*cp for all cp E  ([Rn). Proof Suppose that !h is the translation x  x + h; the image of cp, !hCP is then defined by: (2.6) ! h cp ( x) = cp (x - h), and the property in the statement is written U(!hCP) = !hU(CP) for all cP E  ([Rn). The mapping cP  (p defined by cP(x) = cp( -x) being obviously continuous in  ([Rlf) into itself, the linear form (p  (U cp) (0) is continuous by hypothesis and defines a distribution T E '([Rn). We therefore have ( U cp ) (0) = < T, cP > = (T * cp) (0). Applying then to the two members the translation operator! -h which com- 
2. Convolution of Distributions 495 mutes with U and the convolution, we obtain the desired result ( U qJ ) ( h) == (T * qJ) ( h ), Vh E [Rn. o Now let T E '([Rn) and S E ' ([Rn); the mapping qJT*(S*qJ) satisfies the hypothesis of the preceding proposition; there therefore exists a unique distribution W such that W * qJ == T * (S * qJ); we shall reach a similar conclusion by studying qJS*(T*qJ) (see Remark 1). We can then give the following definition. Definition 1. Suppose that T and S are two distributions on [Rn, one at least having compact support; we denote by T * S the convolution product of T (2.7) for all qJ E ([Rn). We verify that this definition generalises all those we have met with until now. The basic properties of the convolution are resumed in the following proposition: (T*S)*qJ == T*(S*qJ) Proposition 4. Suppose that U, Vand Ware three distributions of which two at least are of compact support. Then (a) U * V == V * U (b) supp( U * V) c supp U + supp V (c) D a ( U * IV) == DaU * V == U * D a V, for all a E N n . (d) (U * V) * W == U * ( V * W). A proof of these properties will be found in the works already cited. Examples (a) Let U E ' ([Rn) and let l5 (x - y) be the Dirac measure at the point y E [Rn, then (2.8) U *(l5(x - y)) == TyU x (where Ty is the translation: x  x + y in [Rn defined on U by < T y U, qJ > == < U, T y qJ >, T y qJ (x) == qJ (x + y)). (b) 'The property (d) of the preceding proposition (associativity) is valid only under the hypotheses of the proposition; we can find distributions U, V, W such that the two members of the stated equality are defined and different: 
496 Appendix. "Distributions" (2.9) for example, suppose that on  we have the distributions U = 1, V = £5'(x), W = Y(x), Y being Heaviside's function, then { U * V = 0 and ( U * V) * W = 0 V * W = £5 and U * ( V * W) = U. The associativity stated in part (d) of the preceding proposition is not verified, and this takes account of the fact that only one of the three distributions has compact support. 3. Distributions with Convolutive Supports In view of certain applications (see in particular the following paragraph) it is necessary to be able to define the convolution of two distributions of which neither has compact support. Definition 2. l,et U and V be two distributions on n; we say that their supports are convolutive if when x E supp U and y E supp V and x + y goes over a bounded set of n, then x and y go over both bounded sets. Examples. An arbitrary distribution and a distribution with compact support have their support convohitive. If U = V = Y(x), Heaviside's function, U and V have their support con- volutive. If U = Y(x) and V = Y( -x), U and V do not have their support convol- . utive. o We notice then that the convolution product of two distributions U and V, at least one of which has compact support,' satisfies: (2.10) < U * V, qJ> = < U(x), < V(y), qJ(x + y) > > for all qJ E (n). Now the second member of this formula retains its meaning when the supports of U and V are only convolutive; we take (2.10) as the definition of U * V when the supports of U and V are convolutive. We can then state the Proposition 5. Let U and V be two distributions on n with convolutive supports; then we have (a) U * V = V * U, (b) supp (U * V) c supp U + supp V, (c) D a ( U * V) = D a U * V = U * DaV, for all lJ. E N (d) If U, V, Ware three distributions whose supports satisfy: if x E supp U Y E supp V, Z E supp W, and x + y + Z goes over a bounded set of n, then x, y 
g2. Convolution of Distributions 497 and z go over bounded sets and (2.11 ) (U * V) * w == U * ( V * W) (all the distributions written have their supports convolutive). We can find a proof in Schwartz [1], Chap. 6. We shall see in 3 other generalizations of the convolution product, where the conditions on the support, essential here, are replaced by conditions "at infinity" (convolution of tempered distribution and of a distribution of rapid decay). 4. Convolution Algebras We know that we call an algebra on the field of the real numbers, or on that of the complex numbers, a set A endowed with three operations: sum, product by a scalar and product having the following properties: - A endowed with a sum and product by a scalar is a vector space; -the product is then a bilinear mapping of A x A into A; - the product is associa ti ve. We shall be interested in several examples of subspaces of'@' which are algebras for the operations: sum of two distributions, product of a distribution by a scalar, convolution of two distributions. Let .91 be such a space; it is necessarily a vector subspace of .@', and it must be verified in addition (and this will be sufficient) that: -the convolution of two distributions of .91 is defined and is an element of d, - the convolution is associative when we restrict it to the elements of d. The interest of such algebras is, in particular, the following: if bEd (d is thus an algebra with unit element), and if U E .91 admits an inverse U* E .91 for the convolution, that is to say (2.12) U * U* == U* * U == b, this inverse is unique, and we shall know how to solve all convolution equations (in V) of the form (2.13) V*U==w where WEd; in effect such an equation has, in .91, the unique solution V == W * U*. Among the convolution equations, we know that we can find all the partial differential equations with constant coefficients since (2.14) P(D)T == [P(D)b] * T for each distribution T on [Rn. 
498 Appendix. "Distributions" Example 1. Algebra C' of distributions with compact support. The distributions with compact support on IRn form, in an obvious fashion an algebra of con- volution (Proposition 4) which has for its unit element the Dirac distribution at the origin. This algebra is not very interesting for partial differential equations as we can show that the only invertible elements for the convolution in C' are of the form Ab (where A is a scalar); in fact it can be shown that if U E C' is not of the form Ab, the equation (in V) (2.15) V*U==b does not admit any solution with compact support. o Example 2. Algebra fl)'+ of the distributions on IR with support in [0, + 00 [. This is, in an obvious fashion, a commutative algebra containing the null element b. Its interest arises from possible applications to linear differential equations with constant coefficients. Let, in effect, W E fl)'+ ; the search for the U E fl)'+ satisfying (2.16) P(D)U == DmU + a m _ 1 D m - 1 U + ... + aoU == W reduces to the solution in fl)'+ of (2.1 7) [P(D)b] * U == W; it is enough, therefore, to know how to invert P(D)b in fl)'+. Let us suppose that we have (2.18) P (A) == (A - A 1 )m 1 . . . (A - Ak )m k where the Aj E C are distinct and the m j are strictly positive integers; we see that (2.19) P ( D ) b == (b' - A 1 b )m 1 * . . . * (b' - A k b )m k where (2.20) (b' - Ab)m def (b' - Ab) * . . . * (b' - Ab). It is enough, therefore, to know how to invert (b' - Ab); now (2.21) Y ( x ) e AX * ( b' - Ab) == b. To exploit more systematically this type of calculus, we notice that there exists an isomorphism between the algebra of rational fractions C {p} with one indeterminate and the subalgebra (for the convolution) of fl)'+ generated by b' and Y. This isomorphism is determined by { p  b' lip  Y. Taking this isomorphism into account we see that the calculation of the inverses in fl)'+ of the elements of this algebra is reduced to the analogous calculation in (2.22) 
g2. Convolution of Distributions 499 c {p}, an algebra which we know well and which we use as a tool to effect decomposition into simple elements; (the calculus so developed is sometimes called Heaviside's symbolic calculus). For example: We consider the integro-differential equation u' + 2u + f: u(t)dt = (j (2.23) in !?)'+. This equation can be written (2.24 ) ( b' + 2b + Y) * u == b. We must thus seek the inverse of the rational fraction (2.25) 1 p2 + 2p + 1 p+2+-== p p Now (2.26) P p2 + 2p + 1 1 P + 1 1 (p + 1 )2 so (2.27) (b' + 2b + Y)- 1 == Y(x)e- x - Y(x)xe- x ( We notice that the inverse of ((j' - A,(j r is Y( x) (:- : )! e Ax ). D Example 3. Algebra!?)'r of distributions on [Rn with support limited to the left relatively to a strictly convex cone r. Let r be a cone, closed in [Rn, with vertex 0; we shall suppose it to be strictly convex, that is to say convex and such that there exists a vector N such that <x, N> > 0 for all x E r\ {O}. (We say also that r is salient). We then consider distributions T on [Rn such that there exists a translation L Xo r of r such that supp T C Lxor; we shall say that T belongs to !?)'r. We easily verify that two distributions of !?)'r have their supports convolutive; !?)'r is then a convolution algebra with unit algebra. We can verify that there are no divisors of zero in !?)'r?3 We have seen in Chap. V that a class of differential operators admits elementary solutions (i.e. convolution inverses) in this algebra. In effect, let P be a hyperbolic 23 That is, if U, V E 'r and satisfies U * V = 0, then necessarily one (at least) of the two distributions U and V is null. That is a consequence of a theorem on the closed convex envelopes of the supports of convolution products. (see Lions [1 J, [2J). 
500 Appendix. "Distributions" operator in n with respect to the half-space {x, x. N  O} N E n; then there exists a strictly convex cone r (satisfying x. N > 0 for all x E r\ {O} and a (unique) elementary solution E of P with support in r. The calculus in the algebra 'r yields the solutions of Cauchy problems (with data on hyperplanes x . N == a). 3. Fourier Transforms 1. Fourier Transform of L I-Functions For u ELl, we may define the Fourier transform of u, denoted b u (or by ff u ) by the formula: (3.1) u(y) = .? u(y) = r e- ix.y u(x) dx Y E IR n . Jn The mapping u --+- u defined by (3.1) is obviously linear. From the inequality: lu(Y)1  r lu(x) I dx for all Y E IRn, Jn (3.2) we deduce: { if U ELl, U is a bounded continuous function on n with II u II 00  II u II 1 . In addition we have the Riemann-Lebesgue theorem: (3.3) u(y) --+- 0 in C when Iyl --+- 00. In effect, first of all if u E , we have, as a result of an integration by parts with respect to the variable X k /II. 1 I . u ( y) == -=- e - lX. Y Dk U ( X ) dx, l Y k n from which we deduce A 1 lu(y)1  I Yk I II Dku 111 -+ 0, when lyl -+ 00. Now, as  is dense in L l , if v ELl, for all B > 0, we can find u E  such that IIu - vlll  1£. Thus from v(y) == (  )(y) + u(y), (3.4) 
g3. Fourier Transforms 501 we deduce on using (3.4) 188 Iv(Y)1  Ilu - vll l + IYkl llDkulll < :2 + :2 = 8 for I y I sufficiently large, from which (3.3) follows. D Let us denote by Bo ( [Rn) = Bo the space of bounded continuous functions on [Rn which tend to zero at infinite. Provided with the norm II u /I 00 = sup lu(x)l, x E [Rn Bo is a Banach space. We can thus summa rise the properties (3.2) and (3.3) in Proposition 1. The mapping u  u defined by (3.1) is linear and continuous from L 1 into Bo. (3.5) Remark 1. We can also consider  defined by ff u (y) = r e ix . Y U (x) dx J [Rn y E [Rn which also possesses the property of the Proposition 1 and which we sometimes call the Fourier co-transform. D To make further precise the properties of the Fourier transform of L 1, let us denote by: La f the translate of the function f of amplitude a E [Rn defined by (3.6) La f(x) = f(x - a) for all x E [Rn, a E [Rn given. '" f the symmetriser of the function f defined by: (3.7) '" f(x) = f( -x) for all x E [R. Again for x E [Rn, a E N, we put xrJ. = Xl. X2 2 . . . xn; we then have the Proposition 2. For u, VEL 1, we have 24 '-....,/ (3.8) (i) if = (u) = #"(u); #"(u) = (u) = u, (u) = (u), (ii) r u(x)v(x)dx = r u(y)v(y)dy, J[Rn J[Rn ( ... ) { (y) = e-ia.yu(y), a E [Rn, y E [Rn llZ . A  ( e - la . X . u) = L _ a U. 24 In the sequel we denote indiscriminately by :F (u) and :F u the Fourier transform of u. 
502 Appendix. "Distributions" (iv) (a) If xau ELi, I al  k, then 11 E rri k and we have (3.9) D a (11) == (- i )lalW (b) IfuErrikwithDPuELlforall[3ENn,I[31  k, then y P I1 E L 00 and (3.10) i lPI yP u = J)P;; Remark 2. (1) The property (iii) expresses that the Fourier transform exchanges translation of amplitude a and multiplication by the exponential e- ia. y . (2) The property (iv) expresses that the Fourier transform exchanges differen- tiation of order I a I and multiplication by x a , or again, differentiability and decay at infinity. (3) It is interesting to be able to introduce a subspace of L 1 such that property (iv) of Proposition 2 be true for all kEN. This is the reason for the introduction of the space !/ ([Rn) == !/. D 2. The Space ff ( (Rn) We put !/ ( [Rn) == !/ == {u E rri 00; Va, [3 E N n, za DP U  0 as I x I  + 00 }; !/ is the space of functions of class rri 00 of rapid decay at infinity, which is not a normed space, but of which the topology can be defined by the (denumerable) sequence of semi-norms: U  sup I x a DP u(x)1 == daP(u) x E [Rn which yields a complete metrisable space. " d ap (u - v) Indeed d(u, v) =  aap 1 d ( _ )' u, V E 9', where the aap are chosen CX,fJE r\jn + ap U v to be such that L aap == 1, is a distance on g. a,p We should notice that if U E !/, then x a DP U E LP for all p  1 and for all a, [3, E N n . Further, !/ is dense in LP for all p with 1  p < 00 (on the contrary !/ is not dense in L 00 ). For U E g, we can thus define its Fourier transform by (3.1), as well as the Fourier transforms of x a DP u for all a, [3 E N n . Then we also have (3.11 ) 11 E g. Thus!/ is a dense subspace of L 1 which is stable by!F (which is not the case for 
g3. Fourier Transforms 503 .L 1 ) and, in addition, we have the inversion formula 25 11'. 1 r A' 1 - A u(x) = :F - (u)(x) = (2n)" J [Rn u(y )e'x,y dy = (2n)" :F (u )(x) for all x E n; (3.12) and the properties stated in Proposition 2 are still valid, the property (iv) being replaced by: { iY?i = (iyYx u (3.13) ________ DPCt = (- ix)Pu for all U E!/ and for all lJ., [3 E r\jn. In addition: { :F is an isomorphism of !/ onto itself with inverse :F - 1 (3.14) defined by (3.12) and we have: (3.15) 1 A A 26 (u, v) = (2n)" (u, v) (Parseval's formula) , 1 A II u II = (2n)"/2 II u II (Plancherel's formula). Remark 3. The formulae (3.15) show that the Fourier transform :F and its inverse :F - 1 are continuous maps of !/ onto itself when we provide !/ with the topology induced by that of L2; they also show that (to within a factor 1j(2n)n/2) the norm in L 2 of a function in !/ is conserved by the Fourier transform. Similarly, the scalar product of two functions is conserved (to with a factor 1j(2n)n). It is because of that that it is sometimes more convenient for reasons of symmetry to define the Fourier transform by the formula 1 i . :Fu (y) = e-lx'Yu ( x ) dx (2n)2 n ' (then we have :F - 1 = :F) or again by :F u(y) = r e- 2i1tx.y u(x) dx. Jn With one or the other of these last definitions, :F conserves exactly the scalar product and the L 2 -norm of the elements of !/. 25 We give below a demonstration of this fundamental property. 26 Where (u, v) = f u(x) v(x) dx and II u" = (u, U)1/2. n 
504 Appendix. "Distributions" Our choice of the definition of ff by (3.1) is motivated by the fact that it is that definition which is the most generally adopted by research workers in partial differential equations. We shall thus state the Theorem 1 2 7 . The Fourier transform ff (resp. # ) is (to within a constant factor) an isometry of !/ onto itself when !/ is given the topology induced from L 2. An example of a function of !/. F or a > 0, the function t/1 a defined (3.16) t/1a(X) = - alxI2 e is in !/. Its Fourier transform is given by ( 3.1 7) .A. ( n ) n/2  i Wa(Y) = - e- 4a and fa(Y) dy = (2n)n. a [Rn Let us verify (3.17). For that, it is enough to prove the result for n = Fubini's theorem: 1 for, from (3.18) f +oo f +oo fa(Y) = e- axi e- i X 1Yl dX 1 . . . e- ax ; e-iXnYndxn; -00 -00 let us suppose therefore that n = 1. The function t/1a is the unique solution of the differential equationf' + 2axf = 0 satisfying tf(X)dX =  . Using (3.13), we see that tfra is the unique sol- ution of the differential equation 2ag' + yg = 0 satisfying g(O) =  . Hence tfrAy) =  e-Y2/4a when n = 1, and we deduce (3.17). D Proposition 3. The space  is contained with continuous injection and density in the space !/. Proof We notice the preceding property by:   !/. To verify the density of  in !/, we realise a truncature operation by multiplying a function u E !/ by qJp E  such that for all p E N*, we have: p(x) = ( ; ) where qJ E  is such that qJ(x) = 1 for Ixi  1. 27 After having introduced the tool of tempered distributions (see No.4 below), we shall give a simple demonstration of this fundamental result. 
93. Fourier Transforms 505 We thus construct up E f!fi such that up (x) = qJp(x)u(x) Vp E N*, and we verify (exercise left to the reader!) that up --+ u for the metric on fI/ defined above. Finally on the notion of convergence in f!fi and fI/, it is easy to see that if a sequence converges in the sense of f!fi, it converges in the sense of fI/ from which the continuity of the injection of f!fi into fI/ follows. o Remark 4. The space f!fi is not stable under the Fourier transform: if u E f!fi (IRn), then U E fI/ (IRn) and we cannot expect also that U E f!fi! In a general way, the Fourier transform of a function with compact supp0rt is never of compact support, for it is analytic. Indeed if f(x) is (for example) in L I (IR) and with support in [XO, Xl]' then I X! J(y) = e- ixy f(x) dx Xo is continued analytically to an entire function, defined by (3.19) J (y + il1) = I x 1 e - ix(y + i'l> f (x) dx; Xo   hence y --+ f{y) only has compact support if f = 0 i.e. (after the Fourier inversion theorem) f ==- O. o It is interesting to characterise the Fourier transforms of the functions qJ E f!fi. Such a characterisation is given by the Theorem 2. (The Paley-Wiener theorem). Let a > O. The following two prop- ositions are equivalent: (1) if1 is the Fourier image of a function qJ E f!fi with support contained in the ball B(O, a) = {x E IRn; Ixl  a}. __ (2) if1 can be continued to a holomorphic function if1 in en and possessing the following properties: { f all kEN, there exists C k > 0 such that .1 if1(z)1  C k (1 + Iz12) -k/2 ealIrnzl, Z E en. (3.20) where n 11m Z/2 L 11m Zj12, Z = (Zl, . . . , zn), 1m Zj = imaginary part of Zj. j= I For the proof of this theorem, we refer to Schwartz [1]. 
506 Appendix. "Distributions" 3. Fourier Transform in L 2 The space !/ being dense in L 2 and :F and :F - 1 being continuous from !/ onto itself when this space is furnished with the topology of L 2 , we can extend :F and ff - 1 to L 2 in a unique manner. The Fourier transform of a function of L 2 will thus be a function of L 2 satisfying (3.15). We can also say that the Fourier transform in !/, which, is to within a constant factor, an isometry of !/ endowed with the topology induced by that of L 2, can be extended in a unique manner to an isometry (to within a constant) of L 2 which is called the Fourier transform in L 2. We again denote this transform by :F and write u = :F(u) for the transform ofu E L 2 . Although the integral r u(x)e -ix.y dx does not then have a meaning in so far as Jn the Lebesgue integral r lu(x)1 dx diverges for u E L 2 with u rt= L \ we again use Jn the notation u(y) = ff u(y) = r u(x)e- ix . y dx Jn for the Fourier transform of u E L 2 . Remark 5. This misuse of notation can be justified in the following way. Let us suppose for simplicity that n = 1. We can show (see Remark 9, (3.95)) that for u E L2(1R) we have: f +A U (y) = lim u (x) e - ix . y dx A-++OO -A f +OO The Fourier transform of u E U(IR) is thus given by an integral _ 00 taken in a generalised sense. Similarly, for the inversion formula we have (3.21 ) a.e. In y. (3.22) 1 f +A . u(x) = lim - u(y)e lX . y dy, A-++OO 2n -A a.e. In x. o 4. Fourier Transforms of Tempered Distributions To extend the Fourier transform to distributions, taking account of the fact, already indicated above, that !!fi is not stable under this transform, we are led to introduce the vector space of continuous linear forms on !/ which we denote by !/' (or more precisely by !/'(lRn)). 
93. Fourier Transforms 507 From Proposition 3, the injection of !!fi into !/ being continuous with dense image, the transpose of this injection is an injection with the result that the space !/' can be identified with a subspace of !!fi'. It follows that the elements of !/' are th us distributions. We call them tempered distributions. The space !/' will be furnished with the following notion of convergence: (3.23) { a sequence {T p }PEN of tempered distributions tends to T in !/' if, for all qJ E !/, < Tp, qJ > --+ < T, qJ > in C when p --+ + 00. The bracket <, > denotes here the duality between !/' and !/. It is, as for !!fi', a question of the notion of weak convergence, but as in the case of !!fi', we shall always omit this qualification in the sequel. We are then able to define the Fourier transform for tempered distributions by transposition: for each T E !/', the Fourier transform of T, denoted by T or ff T will be the tempered distribution defined by: .A. A (3.24) < T, qJ> = < T, qJ> for all qJ E !/. Remark 6. The elements of LP, 1  p  + 00 can be identified with tempered distributions (and in particular !/ c !/'). In effect, as for 1  p < + 00, !/ c;: LP with density, we deduce, by trans- position of the injection, that U' <:+ [7' with  +  = 1 so for 1 < pi :::;; + 00. P p' A direct study shows that the elements of L 1 also define tempered distributions. [We verify easily that the mapping of !/ --+ C defined for fELl by cp --+ r f(x)cp(x)dx is continuous on [7]. Then, using Fubini's theorem, we Jn easily establish that (3.24) yields the classical Fourier transform of functions of L 1 defined by (3.1) with the result that (3.24) very well provides an extension to tempered distributions of the classical definition of ff. o Remark 7. (1) As we have already indicated if qJ E !!fi, {p E !/ (but not to !!fi) with the result that (3.24) does not allow us to define the Fourier transform of an arbitrary distribution T. An arbitrary distribution T will not have a Fourier transform (at least as a distribution). (2) Numerous distributions, useful in applications, are tempered distributions. Thus the distributions with compact support are tempered (lif' c;: !/'). We have seen (see Remark 6) that the elements of LP, 1  p  + 00 identify with elements of !/'. The polynomials on [Rn identify with tempered distributions, in the same way as every functionf E Lfoc with slow growth at infinity, that is to say majorised, for Ixl sufficiently large, by a polynomial in x. 
508 Appendix. "Distributions" Furthermore, it is because of this "slow growth" at infinity that the distributions of !f/ have been qualified as "tempered". The locally summable function on IR: x  eX does not define a tempered distri- bution. We easily verify that the derivative of a tempered distribution is tempered in exactly the same way as its product by a polynomial. o We can characterise the tempered distributions in the following manner (see Schwartz [1]): Proposition 4. F or a distribution T to be tempered, it is necessary and sufficient that it be a derivative of a continuous function with slow growth in the usual sense, that is to say of afunction which is the product of P(x) = (1 + IxI 2 )k/2, kEN by a bounded continuous function on IRn. Thus T E !f/' is of the form: (3.25) T = DP(P(x)f(x)), f bounded and continuous on IRn. Proposition 4 thus justifies the name of tempered distributions (or of slow growth) as stated above. In particular it shows that: (1) in the case n = 1, eX does not Jdefine a tempered distribution, for none of its primitives is a function of slow growth (see previous Remark 7); 00 x m (2) in the same way the series L , is not convergent in !f/'(IR), since the partial o m. N m sums L  (which are in !f/'(IR)) are not bounded by a polynomial. om! A. 1 (3) We observe finally that the pseudo-functions Pfx+, Pf , are tempered x distributions. For other characterisations of tempered distributions, the reader should consult L. Schwartz [1]. We defin La T the translation of amplitude a and the transform by symmetry (as by (3.7)) T of the distribution T by { <:aT, qJ> _ = <L-aqJ> (3.26) _ < T, qJ > - < T, qJ >. We can then state: Proposition 5. (a) The Fourier transform:F defined by (3.24) and ff (defined in a similar way) are two isomorphisms of !f/' onto !f/'. (b) For all a E IR n , we have: { :F (La T . ) = e - ia . y. :F ( T) ( 3.27 )1 :F ( e - la . x. T) = 7: _ a . :F ( T). 
g3. Fourier Transforms 509 (c) For aU a, f3 E r\jn, we have { :F(Da T) = (iy)a :F(T) DP:F(T) = :F[( -ix)PT]. (3.27)2 Remark 8. The fundamental results expressed by Theorem 1 and Proposition 5 are all consequences of the formula (3.28) :F 1 = b(y) on , (where we take here as definition :Fu(y) = Je-2niXYu(x)dx). Indeed we deduce immediately that in n :F (e2niax) = b(y - a) = b(Y1 - a 1 , . . . , Yn - an). Then if qJ E !f/, we have (:F(e2niaX), qJ) = (e2niaX, ff qJ) = (b(y - a), qJ) qJ(a) so q>( a) = f e 2 1[iax dx f e - 21[ixy q>(y) d y which is the inversion formula from which all the results of Theorem 1 and Proposition 5 follow. Proof of (3.28). Let us put :F 1 = S. We observe that ff( :x 1 ) = ffO = 0 = 2niyff1 = 2niyS so yS = 0 and so (Proposition 16, 1.3) S = cb. Hence with (3.17) (:F1, e- ny2 ) = (cb,e- ny2 ) = c = (1, :F(e- ny2 ) = (1, e- nX2 ) '= f +oo e-1[X2 dx = 1, - 00 from which (3.28) follows. o 5. Fourier Transform of Distributions with Compact Support We have already observed that every distribution T E lC' is tempered. In this case, the obtaining of the Fourier transform is particularly simple: Proposition 6. Let T E lC', then Tis afunction of class C(;j00 (which can be extended to a holomorphic function in en) given by (3.29) T(Y) = (T, e- ix . y ) for all y E n, 
510 Appendix. "Distributions" and we have (3.30) D a T(Y) = < ( - ix)a T, e- ix . y > for all li E r\jn. In addition it is possible, by an extension of the Paley-Wiener theorem, to characterise the Fourier transforms of distributions with compact support. We refer- the reader to Schwartz [1] or Treves [2] for the proof of the Theorem 3 (The Paley-Wiener-Schwartz Theorem). For afunctionf defined in n, the following two propositions are equivalent: (1) f is the Fourier transform of a distribution T E C' with compact support in the ball B(O, a) = {x E n; Ixl  a };_ (2) f can be extended to a function f holomorphic in en and satisfying the following property: (3.31 ) { there exist mEN an: a constant C > 0 such that for  E en we have If()1 < C(l + 112)m/2 eaIIml. 6. Examples of the Calculation of Fourier Transforms We shall restrict ourselves to some simple examples. For other calculations see Gelfand and Shilov [1]. Example 1. Fourier transform of a function of L2(). Let us consider the functionf defined by f(x) = (i + x) -1; we havef E L 2() andf ft L 1 (). Hence .A. f + A. e - ixy f(y) = lim . dx A.-++OO -A. 1 + x We use the calculus of residues by integrating round the contour AC 1 BA for y > 0 and round the contour BC 2 AB for y < 0; we thus obtain J(y) = { 0 -2nie-y for y > 0, for y < 0; T} x =  + iT} y<O A - ),.\ \ , "- ..........- B 1+ ),. / - i , ,"'" --C 1  y>O Fig. 1 
3. Fourier Transforms 511 We verify Plancherel's theorem: I dx 1 1 + 00 1 A. IIfl1 2 = 2 = n = - 4n 2 e- 2Y dy = -llfI1 2 . [R,1 + x 2n 0 2n o Example 2. Fourier transform of the characteristic function [ - a, a] c lR. Xa is defined by (3.32) X (x) def { 1 x E [ - a, + a], a 0 X  [ - tl, + a] Xa E LP(lR) 1  p  +00. of the interval a > 0 An elementary calculation shows that (3.33) " ( ) = 2 SIn a y Xa Y . Y Since Xa defines a distribution of type C' (lR), it is left to the reader to verify the Paley-Wiener-Schwartz theorem in this case. Example 3. Fourier transforms of Dirac measures and their derivatives. The Dirac measure b at the origin in lR n is, as we have seen, a distribution with compact support reduced to the origin. Hence t is a function of class rc oo given by: (3.34) t(y) = < b, e - ix.y > = 1 for all y E lR n ; in addition we have: /"'.... A. (3.35) Dkb(Y) = (iy)k. b = (iy)k for all y E lR n , k E Nn; \ and we can again observe that (3.36) (b) = #(b) = 1, with the result that we recover the result (3.28) of the Remark 8 (for the Fourier transform used here): (3.38) (3.37) (1) = (2n)nb (indeed from (3.36), we deduce (1) = (g-(b)), but g- = (2n)n-1 with the result that (1) = (2n)n(-1(b)) from which (3.37) follows). Now as b(x - a) is the translate Lab of b, (with a E lR n ) we have with the notation b a = b(x - a) {  (b ) (y) =  (Lab) (y) = e - iay  ( b) (y) = e - iay (ela.x.b) = Laff(b) = 1; 
512 Appendix. "Distributions" similarly (see also Remark 8) { ff (b a) = < b a' e ix . Y) = e ia . Y = ff ( b - a) (3.39) ff(e ia . x ) = ff((ba)) = (211:)" ff ff-l(b a ) = (211:)"b a . o (3.40 ) Example 4. Fourier transform of the tempered distribution defined by the function x  cos ax, a E !R given. x  cos ax is a bounded function so (see Remark 6) defines a tempered distribution. From: cos ax = !(e iax + e - iaX), we deduce on using (3.39) for n = 1 i ff(cosax) = n(b a + b_ a ) OZ--l ( ) ba + b_ a !#' cos ax = 2 . The reader will verify that x  cos ax satisfies the Paley-Wiener-Schwartz theorem. o Example 5. Poisson's summation formula. Let us consider the distribution (3.41) +00 S = L b(x - n). '- 00 We verify without difficulty that this is a tempered distribution since +00 cp  <S,cp) - L cp(n) -00 is linear and continuous from Y to . Defining in this example ff cp by ff cp(y) = f e- 21tixy cp(x) dx = <,O(y) we shall verify that S is invariant under ff: (3.42) As < ff S, cp ) ff S = S. < S, ff qJ ), it is equivalent to showing that (3.43) +00 +00 L cp(n) = L <{J(n), -00 -00 't/qJEY. More generally, with cp E Y let us associate the function Pcp defined by (3.44 ) 00 Pcp (x) = L cp(x + n). -00 We verify without difficulty that (3.44) has a meaning and defines a function Pcp 
g3. Fourier Transforms 513 which is rc oo and of period 1: Pcp (x + 1) = Pcp (x) \Ix E . Hence But using (3.44) r 1 i n + 1 Jo Pcp(x)e - 21tikx dx = L n cp(x)e - 21tikx dx = <$(k) Pcp(x) = X (J: Pcp(x)e-21tikxdx )e21tikX. so (3.45) +00 Pcp(x) = L <{J(k)e21tikx. -00 Comparing (3.44) and (3.45), we thus have the identity +00 +00 L cp(x + n) = L <{J(k)e21tikx -00 -00 which reduces to (3.43) for x = o. o Other examples are given in 97 below. 7. Partial Fourier Transform Let Q be an open set of q; we denote by (x, ) the current point of n x Q. It is possible and useful to define the "partial Fourier transform with respect to the variable x" of a distribution T E qfi' (n X Q) provided that the distribution is "tempered with respect to the variable x". We restrict ourselves here to some situations which will prove useful to us. (1) Let T = Tl Q9 T 2 , Tl E g'(n), T 2 E qfi'(Q). Then the partial transform with respect to x of Tis: (3.46) ffx(T) = Tl Q9 T 2 where Tl = ff(T 1 ) E g'(n). For example if T = c5(x) Q9 c5(), c5(x) (resp. c5()) Dirac measure at the origin in n (resp. in q), then: ffx(T) = 1 Q9 c5() that is to say: < fFA T), cp> = r cp(y, 0) dy, for all cp E !?C(lRn x IRq). Jn In a similar manner, we can, in this case, define ff(T) = c5(x) Q9 1 
514 Appendix. "Distributions" by Let us define u() : x  u(x,) =  e (2 TC)n and calculate ffx(u()). Froln (3.16), (3.17) which we apply with a = l , we have: Ixl 2 ffx(e -4f) = 2n()ne-IYI2, <ff'(T),cp> = r cp(O,y)dy for all cpE'@(lRn X IRq). Jn (2) Let us suppose that for all  E Q, u() E [/' (). The mapping   u() is then a distribution-valued function of Q (i.e. a function with vector values). Then for all  E Q, we can calculate ffx(u( )); a frequent situation is that in which u() is a function of L 1 (n) or of L(n) (i.e. u(): x  u(x, ) is.in L 1 or L 2); then we have (3.47) (ff'x u)(y, ) = r e - ix.y u(x, ) dx. Jn For example, let Y() = { 1, ; > 0 the Heaviside function defined in IR = Q, 0,  < 0 Ixl 2 4 , X E n  > 0, so (3.48) ffx(u())(y) = Y()eIYI2, Y E n. o (3) Let T = Tl Q9 T 2 where Tl E [/'(n), T 2 E [/,(q). Then T E [/'(n X q) = [/'(n+q) has a Fourier transform which is the tensor product of partial Fourier transforms with respect to x and with respect to , i.e. (3.49) ..... ..... ..... T = Tl Q9 T 2 . (3.50) Application to the calculation of the Fourier transform of the distribution defined by the function x  eialxl2, x E n with a > o. As eialxl2 = e iaxi e iaxi . . . e iax ;, (3.49) shows that it is sufficient to calculate the Fourier transform of the distribution defined by the bounded function f such thatf(x) = e iax2 , x E . For all qJ E [/() we have: < f, cp> = t e iax2 cp(x) dx. (3.51 ) For all N E N, we introduce fN by fN (x) = { f O (x) if Ixl < N otherwise. 
3. Fourier Transforms 515 Then it is immediate that for all qJ E Y() <IN' qJ) = f +N eiax2cp(x)dx  r e iax2 cp(x)dx when N  +00. -N J In other words: (3.52) IN  I In Y'(), so from the continuity of !F: (3.53) .A. .A. IN = !F IN  I = !F I; now (3.54) IN(Y) = f +N eix[ax-y] dx -N has for limit when N  00 the semi-convergent integral (3.55) L+: eix[axY]dx = e- ; L+: eia[x ;J dx = e- ; L+: eiaed which is thus J(y). To calculate the semi-convergent integral f:: eia2 d, we note that it is equal to 2 L+ 00 eia2 d and use the contour shown in Fig. 2 (the function z  e iaz2 being holomorphic). By Cauchy's theorem (3.56) L e iaz2 dz = O. Now, it is easy to show that on the circular arc AB, the integral f eiaz2 dz  0 AB when R  + 00. On the ray OB, we have z = .j2 (1 + i),  v:rying from R 2 i----- I I I 1t/4 : 1 A Fig. 2 
516 Appendix. "Distributions" to O. Thus 1 R eia2 d - f eiaz2 dz + (1 + i) fi i R e-a2 d AB 2 0 V and (3.57) 1+ 00 eia2d = (1 + o f 1+ 00 e-a2d = (1 ;Jafi 1+ 00 e-2d. i + oo fie As 0 e-2 d = 2' then from (3.55)(3.57) we deduce that q;:: i y2 (3.58) J(y) = ; (1 + i)e - 4a Finally, we see that 28 (3.59) :F (eialxI2)(y) [ Jbc I n _ i1Y12 ( 1 + i) e 4a. 2fi In a similar manner we have ff-l(eialxI2) = 1 #(eialxI2) = ff(eialxI2) (2n)n (2n)n from which we deduce that [ 1 + 1 I n _ ilYl2 :F -1 (eialxI2) = e 4a 2 . Another demonstration of the formula (3.59). The following demonstration of (3.59) contains several ideas which prove to be useful in other circumstances. It takes up for its essential element the proof of (3.17) given previously. It is sufficient to consider the case n = 1. We observe that the function (3.60) . 2 . 2 a --+ "x --+ e laX " = e laX is holomorphic from 1m a > 0 --+ [/', continuous from 1m a  0 --+ [/'. We define (3.61 ) . 2 f(y, a) = :Fx(e laX ); the function a --+ f(., a) is holomorphic from 1m a > 0 to [/' and continuous 28 With, it will be recalled, a real and positive, a =1= O. 
3. Fourier Transforms 517 from 1m a  0 to /7'. We notice that (3.62) f(y, i) = ff(e- X2 ) = fie- y2 / 4 . We then calculate that ( d. 2 ) . 2 ffx dx e WX == iyf(y, a) = ff x (2iaxe WX ) of - 2a oy (y, a) so that (3.63) of iy oy (y, a) + 2a f(y, a) = 0 with the result that (3.64) f(y, a) = c(a)e-iy2/4a, with c(a), holomorphic for 1m a > 0, to be determined. We notice that of 2 . 2 0 2 oa (y, a) = ffAix e laX ) = -i oy2 f(y, a) so that of . 0 2 (3.65) oa (y, a) +  oy2 f(y, a) = O. Using (3.64) in (3.65) we obtain 1 (3.66) c'(a) + 2a c(a) = 0 so that (3.67) { c(a) = ka- 1 / 2 = klal-l/2e-i8/2, 0  f}  n, a = lale i8 . From (3.62) and (3.67), c(i) = fi = k C ;/ ) giving k =  1 ; i ; hence we have f(y, a) =  (1 + i)lal-l/2e-iO/2 e- iy2 / 4n (from which (3.59) is recovered for a > 0). (3.68) 
518 Appendix. "Distributions" 8. Fourier Transform and Automorphisms of (RII: Homogeneous Distributions 8.1. Fourier Transform and Automorphisms of IRn Let h: x  h(x) be an automorphism of n, that is to say a bijective linear mapping from n to n. We denote by th its transpose, i.e.: th(x).y = x.h(y) 't/x, Y E n. We can define, as in 91.3.4, for each tempered distribution T E /7' (n), its direct image h* T by h, by (3.69) < h* T, qJ> = < T, qJ 0 h> 't/ qJ E /7(n). Naturally h* T E /7'(n). We can also define for each tempered distribution T E /7' (n), its inverse image th* T by th, with the aid of the formula (1.95); directly, th* T is the distribution (also denoted, with the help of the variable y, as Tth(y») obtained by the change of variable y  th(y).29,30 We shall show the following result: The Fourier transform of the direct image by h, an automorphism of n, of every tempered distribution T on n is the reciprocal image by th of the Fourier transform of T, or again (3.70) . /""-.. (the sIgn /""-.. ..... h T-th*T * - denoting the Fourier transform g-). Proof We have in effect, by the definition, for every function qJ E /7(n): (3.71 ) /""-.. < h* T, qJ > = < h* T, <P > = < T, <p 0 h > . Now fp 0 hey) = r e-ix.h(Y)cp(x)dx = r e-ith(X).Ycp(x)dx Jn Jn and with the change of variable z = th(x) fp 0 hey) = r e-iz.Ycpeh-l(z))ldetthl-ldz = Ideehl-1W(y). Jn Taking back this result to (3.71), we have, for all qJ E /7(n): /""-..  ..... < h* T, qJ > = 1 det t h 1- 1 < T, qJ 0 t h - 1 > = 1 det t h 1- 1 < t h; 1 T, qJ > 29 Or, again, if he is a sequence of functions of Ltoc([I) converging to T E g'(n) in g'(n) (or in '(n)), th* T is the limit of the sequence of functions he 0 th E Ltoc()' 30 The use of th instead of h in this part is explained in the sequel (see (3.70)). 
3. Fourier Transforms 519 so with (1.95): /'.. .............. h*T= Idetthl-lth;lT= th*T. o 8.2. Homogeneous Distributions Let h;. be the homothety in n of ratio A (A > 0): h;.(x) = AX, with inverse h; 1 = h 1 /;.. A function f E Lfoc (n) is said to be homogeneous of degree m E 7L if (3.72) f(AX) = Am f(x), hence iff 0 h;. = Am f. We extend this concept in a natural fashion to distributions T E qfi' (n) (by continuity) with the aid of the inverse image of T by h;.: a distribution T is said to be homogeneous of degree m E 7L if (3.73) hi T = Am T or again, in an equivalent fashion, by (1.95), with the aid of the direct image of T by h; 1 : (3.74) h- 1 T - A m + n T 31 ;.* - which may be rewritten as (3.75) < h;*l T, qJ> = < T, qJ 0 h; 1 > = A m+n < T, qJ > 't/ qJ E £0(n). 32 We can characterise such distributions, as in the case of functions, by Euler's relation n aT L x j - a = mT. j= 1 X j The search for elementary solutions of differential operators can lead in certain cases to the search of the Fourier transform of homogeneous distributions. This search is facilitated by the following result: Proposition 7. Let T be a tempered distribution on n; T is homogeneous of degree m if and only if its Fourier transform t is homogeneous of degree - m - n. Proof. Let T E- y'(n) be homogeneous of degree m. Using (3.70) with h = h;. = th;., and (3.74) we obtain: ....... ....... hi T = h;.*T = h1/l*T = A-m-nT so T is clearly homogeneous of degree - m - n. The converse presents no difficulty. o 31 On remarking that det h;. = An. 32 We could also use (3.74) or (3.75) as the definition of homogeneity of degree m of a distribution, but the definitions (3.74) and (3.75) are a priori less natural than (3.73). 
520 Appendix. "Distributions" Example 6. The Dirac b-distribution is homogeneous of degree - n and its Fourier transform !F b = 1 is homogeneous of degree 0; we write in the usual fashion: 1 b(Ax) = An b(x) (A > 0). More generally, every partial derivative Dab of b(ex E r\jn) has for Fourier transform (iy)a (see (3.35)) which is homogeneous of degree lexl, so Dab IS homogeneous of degree - lexl - n. o We shall observe again that the concepts of image of distributions by homo- theties and of homogeneous distributions are important in physics for the changes of systems of units. o 9. Fourier Transform and Convolution. Spaces (9M(lR ft ) and (9(lRft)33 (3.76) First of all, let us consider two functions qJ, .p E f/. We can then define their convolution product by: cp * ljJ(y) = r cp(y - x) ljJ(x) dx, Jn the integral on the right being absolutely convergent. It is then possible to verify that qJ * IjI E f/, and if we calculate !F(qJ * 1jI), we obtain (3.77) !F (qJ * 1jI) = !F (qJ )!F (1jI). In other words, the Fourier transform in f/ transforms the convolution product into the ordinary product. To be able to extend (3.77) to the convolution product of two distributions, it is necessary to introduce new spaces-the spaces (9M(n) and (9(n). 9.1. The Space (9M(n) ( = (9M) If ex is a function belonging to lC (n), we say that ex is a multiplicator of f/' (n) (resp. f/(n)) if the mapping 34 (3.78) T  exT is continuous from f/'(n) (resp. f/(n)) into itself. We already know multiplicators of f/', namely the polynomials on n. It is clear from the definition of the product of a distribution by a function of the 33 We use also the notation m M or me (see Chap. II and V) 34 See definition 10 of  1. 
3. Fourier Transforms  521 space  (see 91.3.1) that each multiplicator of f/' is a multiplicator of f/ and conversely. Weare thus led to introduce a subspace (!) M of the space , which leaves stable by multiplication the space (whence the notation "M" for multiplicator). This space is the space of infinitely differentiable functions of slow growth in n as well as all their derivatives. The space (!) M is thus characterised by: { a E (!)M if and only if, a E  and for all k E n, Dka is (3.79) majorized by a polynomial (whose degree depends on k)35 F or example, the function x  e iax2 is in (!) M()' its derivative of order m being the product of e iax2 by polynomial h m of degree equal to m. SIn ax In the same way x  E (!)M() as well as x  cos ax. x We remark that, clearly (see (3.58)) the Fourier transform of x  e iax2 will again b . .. I h J.' h . J.' f SIn ax d e In (!)M' It IS no onger t e same lor t e FourIer translorms 0 x  an x x  cos ax (see (3.32), (3.33), (3.40)) 9.2. The Space (!) From (3.76), if a is given in f/, then a (defined by a(x) = a( - x) is also in f/ and the mapping qJ  a * qJ sends f/ into [/ and we can show that this mapping is continuous from f/ into f/. This allows us to define the convolution product of T E f/' by a E f/ by the formula (3.80) < a * T, qJ) = < T, a * qJ), which defines a * T as an element of f/,36. Put this way, we say that T is a convolutor of f/(n) if: { T E f/', the mapping qJ  qJ * T ( = T * qJ) is continuous (3.81 ) from f/ into itself. The set of convolutors of f/ is denoted by (!)c ("C" for convolution); it is a vector space which we call the space of distributions of rapid decay. We have the following characterisation of the elements of (!)c: 1 for T E (!) it is necessary and sufficient that for all m  0, T be (3.82) the finite sum of derivatives of continuous functions whose product by (1 + Ixl 2 )m j 2 are bounded on m. 35 We put D°(X = (X. 36 Notice that if (X or T has compact support, (3.80) reduces to the definition of 2. 
522 Appendix. "Distributions" For example the function x ---+- eialxI2 which we know already is in (!)M is also in (!) . Indeed, let us verify this for n = 1. We have seen that the derivative of order k of e iax2 is the product of e iax2 with a polynomial precisely of order k which we denote by h k . Then these polynomials h k , k  2m form a basis in the vector space of polynomials of degree less than or equal to 2m with the result that we can develop (1 + x2)m in terms of these polynomials. We then obtain: (3.83) (1 + x2)m = L /lkhk(X) k::::;2m from which we deduce that (3.84) 2 . 2 " . 2 "d k . 2 (1 + x )m e Wx = i....J /lk e laX hk(x) = i....J /lk d k e laX ; k ::::; 2m k ::::; 2m X on dividing both sides of (3.84) by (1 + x2)m, we see, thanks to integrations by parts, that e iax2 is a linear combination of derivatives of order less than or equal to 2m of functions majorized by Cj(1 + x 2 )m. The result follows from (3.82). o This example shows in addition that (!) M and (!) are. not in duality as f defined by . 2 - - . 2 f(x) = e WX and f defined by f(x) = e -laX are both in (!) M and (!) and tf(X) !(X) dx = + 00. The interest in the introduction of (!) resides in the fact that it is possible to define the convolution of T E (!) with S E !/'. We proceed in the following manner:. '" First of all if T E (!), we notice that T E (!) and so the mapping qJ ---+- T * qJ is continuous from y; into itself: in these conditions, we can define T * S E !/', T E (!), S E !/' by the formula (3.85) '" < T * S, qJ > = < S, T * qJ > for all qJ E !/. We should notice that if T and S are in lS', the convolution product defined by (3.85) is that of 2.2 and possesses especially the proper lies of Proposition 5 of 2 of this Chapter. It is easy to verify that (!) is a vector subspace of !/'. It is the space of the convolutors of !/'; if T E (!), then r, 7: rx T, DP T, Xk T are again in (!). The space (!) contains in particular the distributions with compact support, and the locally integrable functions with rapid decay at infinity. The reader will verify thus that the Fourier transforms of the functions of (!) M, . I 1 2 sIn ax . X ---+- ela X , X ---+- cos ax, x ---+- are In (D. x 
3. Fourier Transforms 523 This holds in general for the functions of (!) M and is made precise in the following proposition: Proposition 8. ( 1 ) The Fourier transform interchanges the spaces (!) M and (!) (% ( (!) M) = (!) and conversely). (2) If S E !/', T E (!), a E (!) M, { %(T * S) = %(T)%(S) (3.86) %(O(S) = (In)-n %(0() * %(S). 10. Fourier Transform of Tempered Measures A measure fl on IRn is with slow growth or tempered in the space ' (IRn) of measures if there exists mEN such that (3.88) r /dfl/ (3.87) JlRn (1 + Ixl 2 )m < + 00. If B(O, a) denotes the ball of centre 0 and of radius a > 0, the condition (3.87) reduces to saying that: there exists m' E I\J such that; !(a) = r Idjll = O(a m ')37 when a -4 + 00 J B(O, a) or agaIn (3.89) fl is the product of a polynomial by a summable measure on IRn. For example, the complex measure fl having for density with respect to the Lebesgue measure dx in IR the function f such that f(x) = eialxl\ satisfies Idjll = dx and is a measure with slow growth or tempered (it satisfies (3.87) with m = 1). In the same way, the Dirac measure b is of slow growth in '. We then state the following result, for the proof of which we refer the reader to Schwartz [1J, Chap. III. Proposition 9. If fl is a positive measure on IRn, then a necessary and sufficient condition for fl to belong to !/'(lRn) is that it be of slow growth in '. If T = fl is a measure with slow growth and if A is a bounded measurable set in IRn, then the part fl A of the measure fl carried by A is summable with the result that its Fourier transform % f.1A is a continuous bounded function which is expressed by a usual Fourier integral. 37 O(a m ), in Landau's notation, expresses that !(a)/a m is bounded as a -+ 00. 
524 Appendix. "Distributions" Let us suppose that {Ak}kEN be bounded measurable parts of lR n with Ak C Ak + l' lR n = U Ak kEN (for example, Ak = B(O, k), kEN). Then we can show that we have: (3.90) llAk  II in!/' with the result that (3.91 ) /lAk  /l in!/'; (3.92) we shall then write $'(Jl) = A = r e- ix . y dJl = lirn f e-ixy dll J n k -+ 00 Ak the limit being understood in the sense of !/'. Remark 9. We notice that this is the method of calculation that we have used for calculating the Fourier transform of the function x  e iax2 which can be interpreted as the "density" of a measure (see formulae (3.50) to (3.55)). In the case of this example, the integral f + 00 e iax2 e- ixy dx = r e- ixy dJl - 00 J is semi-convergent. But (3.92) permits the justification of writing in the form of integrals (divergent in the usual sense and not semi-convergent) formulae used classically in electricity, quantum mechanics, etc. . . . F or example, we shall write (for n = 1): (3.93) 2n£5 = %(1) = f +oo e-iXYdx = r+ oo cosxydx. - 00 Jo Of course, the integrals figuring in (3.93) are not even semi-convergent and (3.93) has to be understood in the sense of (3.92), that is to say: 1 2n £5 = lim f H e-ixYdx = lirn 2 r k cosxydx (3.94) k-+oo -k k-+oo Jo in !/' (lR). We remark again that each functionfE LP, p  1 defines a measure II having for density f, with slow growth. We verify in effect (3.87) with m > 0 thanks to Holder's inequality. We will thus 
3. Fourier Transforms 525 be able to obtain the Fourier transform off E LP, p  1, by: (3.95) !Ff= lim f e-ix.Yf(x)dx, a  + 00 B(O, a) the limit being taken in the sense of !/'. This justifies Remark 5; let us recall that it is just the method of calculation which we have used in Example 1 of 93.6 for the functionf E L2(1R) defined by f(x) = 1/(i + x). o 11. Distribution of Positive Type. Bochner's Theorem 11.1. Functions of Positive Type A functionf continuous on IR n is of positive type (we write f  0) if for all points X 1 'X 2 '... 'X k of IR n and complex numbers Zl, Z2,... ,Zk we have: (3.96) 'Lf(x j - X1)ZjZl  o. j,l Taking successively k = 1 and k = 2 in (.96), the properties of Hermitian forms show that we have: on the one handf( - x) = f(x) which we can write: (3.97) on the other hand "'V _ ,."., ,.,." f = f or f = f, f(x) = f( - x); (3.98) f(O)  0 and If(x)1  f(O); a function  0 is thus bounded on IR n . Using the properties of Riemann sums, we can show (see Schwartz [1]) that the condition (3.96) is equivalent to (3.99) t.+/ (x - )<p(x)<p() dx d  0 for all <p E !!fi. By the operation v and using the operation  defined above, we deduce from (3.99) the equivalent condition: t.+/(X + )<p(x)ciJ()dxd  0, (3.100) VqJE!!), (3.101) which can be interpreted again (making the change of variable t = x + ) as: <f, qJ * ip> = j . f(t)<p * ciJ(t) dt  0 n VqJ E !!). 11.2. Distributions of Positive Type We can then define distributions of positive type and write T  0 if we have: (3.102) < T, qJ * ip >  0, VqJ E !!). 
526 Appendix. "Distributions" The set of distributions of positive type form a closed set for c0E-vergence in '([Rn) and we see immediately that if T  0 it is the same for T (defined by - -.....,..,.."", (T, qJ) = (T, ijJ »), T, T. We have the Proposition 10 (See Schwartz [1], pp. 275-276). (1) Each distribution T of positive type possesses the symmetric Hermitian property T = Tor T = T, and is a bounded distribution on [Rn (hence in y').38 (2) A distribution of positive type, which in the neighbourhood of the origin, is a continuous function, is in [Rn a continuous function of positive type. A characterisation of the distributions of positive type is then given by the Theorem 4 (The Bochner-Schwartz theorem). In order that a distribution T be of positive type, it is necessary and sufficient that it be the Fourier transform of a measure II  0 with slow growth. Proof. (1) The condition is necessary. If T is a distribution  0, we have seen (Prop. 10) that it is in Y'; it is thus the image by the Fourier transform of a distribution U E Y. We have: (3.103) (T, qJ * ip) = (:FU, qJ * ip) = (U, :F(qJ * ip) = (U, c{J.tp) for all qJ E Y. Putting u = c{J E Y in (3.103), we thus have from (3.102): (3.104) 0  (T, qJ * ip) = (U, u. ii) . We shall deduce that we have: (3.105) ( U, tjJ)  0 for tjJ  0, tjJ E Y; the space  being dense in Y, it is enough to show it for tjJ E . Of course, each function tjJ E , tjJ  0 is not of the form u. ii = lul 2 for at the least for n > 1, ft is not differentiable in the neighbourhood of a point at which tjJ is null. But we must observe that tjJ is the limit in  of functions of the form u. ii. Let us introduce in effect (see Lemma 3,91) a E  with (3.106) a  0, a = 1 in the neighbourhood of supp tjJ 38 A distribution T is bounded if the set {'1:a T: a E [Rn} is bounded in P}'. This property is equivalent to saying that T is defined by a continuous linear form on the space of summable functions together will all their derivatives. 
3. Fourier Transforms 527 Then (3.107) t/J = lim cx 2 (t/J + 8) e-+ 0 e> 0 where the function of the second me mber of (3.107) is of the form u.u with (3.108) u = cx J t/J + 8 E . Then, from (3.104), we deduce (3.105). This proves that U is a distribution  0, so because of the Proposition 5 of 91, a measure II  O. As it is tempered, we see from Proposition 9 that it is of slow growth in '. (2) The condition is sufficient. Conversely, if T = g- (U) with U = II E ' and II  0, with slow growth then U = II satisfies (3.105) and it is immediate that T satisfies (3.104) so that T  O. Hence the theorem of Bochner-Schwartz. o Remark 10. The proof of the Bochner-Schwartz shows that, for the topology of !/, the v * v, V E , are dense among the functions with rapid decay of positive type (qJ E !/ and qJ  0). (3.109) T  0 is equivalent to < T, qJ >  0 for qJ E !/, qJ  0, we have thus the two analogous definitions ( being dense in !/) (3.110) { T  0 if T( qJ)  0 for qJ > 0, T  0 if T( qJ)  0 for qJ  0, qJE qJ E . o 12. Schwartz's Theorem of Kernels We shall in this section adopt the following notation: we consider IRP = X P , X E XP IRq = yq , Y E yq and we denote x = (XP), y = (yq), We consider also the product space XP x yq  = '(XP), ; = ,(yq). 
528 Appendix. "Distributions" and we write (XP x yq) = xy '(XP x yq) = Y. Let 2(x, ;) be the space of continuous linear mappings from x into y which is a complete space 39 . We shall establish the Theorem 5 (Schwartz's theorem of kernels). The space of continuous mappings from x into ;, coincides algebraically and topologically with the space y of distributions on X p x yq.40 Remark 11. The theorem of the kernels means: let A be a continuous linear operator from x into ;; then we can associate with it a distribution of y (which we call a kernel distribution, or kernel.) Ax, y = A(x, y) (denoting distributions as functions) and a unique element such that (3.111) A cp(y) = (A(x, y), cp(x) XP VcpEx. Proof It is carried out in three stages.' 1st stage. Operator associated with T E y. Let T be a given distribution in y. We can associate with it an operator AT: x  ; in the following manner: If cp E  x' (3.112) tjJ H (T, cp (8) tjJ) = (T(x, y), cp(x)tjJ(y) )x p x yq is a continuous linear form on y; hence it can be written (3.113) < T(x, y), cp (x) fjJ(y) ) X p x yq = < A T ( cp), tjJ), AT(cp) E ;. It is easy to verify that cp HAT (cp) is continuous from x into ; (i.e. that AT E 2(x, ;)). In addition, the set of linear combinations of functions of the type cp (8) tjJ: (x, y) H cp (x) fjJ (y ) 39 See Garding-Lions [2J from which we draw the inspiration for the proof here of the theorem of the kernels. 40 The space y is provided with the topology of the space of distributions. We furnish .!.e( £0 x , ;) with the topology corresponding to the following notion: a sequence;;' E .!.e(x, ;) converges to f E .!.e (x, ) if </,,( cp), f/J> --+ <f( cp), f/J> uniformly for cp and f/J in the bounded sets of  x and of Y' 
3. Fourier Transforms 529 being dense in !!)Xy, the mapping T  AT is injective with the result that we can identify !!)y with a sub-space of 2 (!!)x, !!);). In the 2nd stage, we shall compare on !!)y the natural topology and the topology induced by 2(!!)x, !!);). 2nd stage. We shall show the Proposition 11. The natural topology of !!)y and the topology induced by :£ (!!)x, !!);) coincide. Proof (i) First of all let us show: (3.114) the mapping T  AT is continuous from !!)y into 2( !!)x, !!);). Indeed, if Tn  0 in !!)y and if Bx is a bounded set in !!)x, we have to prove that (3.115) A Tn ( qJ)  0 in!!); uniformly for qJ E Bx i.e. < ATn (qJ), t/J >  0 uniformly for qJ E Bx, t/J E B; where B is a bounded set of !!)y. But from the definition of AT: < ATn (qJ), t/1 > = < 1;" qJ (8) t/J> for all n and when qJ E Bx, t/J E B;, qJ (8) t/J remains in a bounded set of !!)Xy, from which (3.115) results and (3.114) is demonstrated. (ii) Let us suppose now that (3.116) ATn  0 in 2(!!)x, !!)). We deduce that, being given Bx, B; bounded sets of !!)x and !!)y respectively, we have (3.117) < 1;" qJ (8) t/1 >  0 uniformly for qJ E Bx and t/J E B;. To achieve the demonstration of Proposition 11, we have to show that (3.117) implies (3.118) <,X>  0 uniformly for X E Bxy where Bxy is a bounded set of !!)Xy. Now (3.118) follows from Lemma 1. Let B be a bounded set of!!)xy. Then we canjind B 1 and B 2 , bounded sets respectively of!!)x and !!)y such that for all X E B we can write (3.119) X ( x, y) = L)w i qJ i ( X ) t/J i (y ) i with (3.120) L IAil < M, i qJi E B 1 , t/Ji E B 2 Vi. 
530 Appendix. "Distributions" Proof The functions of B have their supports in a finite "cube",] - a, a[ p +q say. Let us put a = a + 1, Q = ] - a, + a[p+q. F or X E B, let X be the extension of X by periodicity of period 2a in each variable. We can then develop X in a Fourier series of the form with x(x, y) = L Ck,1 exp(ink.x/a) exp(inl.y/a) k, I k = [k 1 , . . . , k p ], 1 = [11,...,1q] k. x = k 1 Xl + k 2 X 2 + . . . + kp. x p l.y = 1 1 Y1 + 1 2 Y2 + . .. + 1q.Yq. We verify by integrating by parts in the formulae giving C k , I that these are of rapid decay (that is to say, for each polynomial P, there exists a constant Mp such that I P(k, 1) C k , II  Mp uniformly for X E B). Let 8 E (Q) be such that (3.121) { 8(x, y) = 8 1 (x) 8 2 (y) 8(x, y) = 1 for (x, y) E ] - a, + a[p+q; then x = Ox =  C k ,{ 0 1 exp C: k.x )Oz exp C: 1.y ) J. which can be written in the form (3.122) -  Ck,1 ( 1/3 [ in J)( 1/3 [ in J) x(x, Y) - h I C k ,llz / 3 0Il Ck,ll exp --;; k. x Oz/Cd exp --;; 1. Y observing now that 1/3 [ in ] _ 8 1 1 C k , II exp -;;: k. x - ({Jk, I E a bounded set of x 1/3 [ In J _ 8 2 1 C k,11 exp --;;l.y - t/lk,1 E a bounded set of y we obtain Lemma 1. o 
3. Fourier Transforms 531 Proposition 11 is thus completely demonstrated. o 3rd stage. Completion of the proof of the theorem of kernels. We shall establish Lemma 2. (1) Let A E 2(!0 x , Sy) C 2(!0 x , !0;). Then there exists T E !0y such that A = AT; (2) !0y is dense in 2 (!0 x , !0;). Proof (i) If qJ E !0 x , qJ H A(qJ)(Y) is a continuous linear form on !0 x so can be written A(q»(y) = <Sy(x), q>(x)xp Sy(x) E !0. The function y H Sy(x) is infinitely differentiable from yq  !0. If we put Sy(x) = T(x, y), then it is clear that A = AT from which the point (i) of Lemma 2 follows. (ii) Let A E 2(!0 x , !0;) and let Pn, Pn E !0 y for all n, be such that Pn  J. If qJ E !0 x , let us put (3.123) An(q» = Pn * A(q». We define in this way An E 2(!0 x , Sy). From the point (i) of the Lemma, there exists  with An = ATn. As Pn  J, (3.123) implies that An  A so ATn  A in 2(!0 x , !0;). Lemma 2 follows as a consequence. o As !0y is a subspace of 2(!0 x , !0;) for the induced topology and is complete, it is closed in 2(!0 x , !0;). Since, from Lemma 2, !0y is dense in 2 (!0 x , !0;) we deduce that !0y = 2('@x' !0;) Schwartz's theorem of kernels is thus proved. o Remark 12. If Q is an open set of IRP, (!) an open set of IRq, the same proof as given above shows that (3.124) !0'(Q x (!)) = 2(!0(Q), !0'((!))). o 
532 Appendix. "Distributions" 13. Some Distributions and Their Fourier Transforms The table below lists the Fourier transforms of some common functions and distributions on IR. Fourier Transforms Function or distribution u ffu = u(y) = L e-;XYu(x)dx exp( - alxl) (a > 0) n t a- 1 exp( - y2/4a 2 ), (a > 0) 2a(a2 + y2)-1 exp( - a 2 x 2 ) { I when Ixl < L o when Ixl < L (L > 0) 2y-l sin Ly { 1 when S - L < Ixl < S + L o otherwise with S  L  0 4y-l cos Sy sin Ly { e o xp (iyox) when Ixl < L when Ixl > L 2(yo - y)-l sin L(yo - y) { ( 0 1 - Ixl/L) when Ixl < L when Ixl > L L(Ly /2) - 2 sin 2 (Ly /2) exp(iyox) cos(yox) sin(yox) (cos YoX)2 (sin YoX)2 2nb(y - Yo) n[b(y - Yo) + b(y + Yo)] in[b(y + Yo) - b(y - Yo)] n[tb(y + 2yo) + b(y) + tb(y - 2yo)] - n[tb(y + 2yo) - b(y) + tb(y - 2yo)] 1 exp( - ixoY) b(x) b(x - xo) +00 +00 I b(x - nxo) (2n/xo) I b(y - (2nn/xo)) n=-oo n=-oo { + 1 when x > 0 - 1 when x < 0 { I when x > 0 o when x < 0 - 2ivp(l/y) nb(y) - ivp(1 /y) 
Bibliography Adams, R. A. [1] Sobolov Spaces. Academic Press, New York 1975 Agmon, S. [1] Lectures on Elliptic Boundary Value Problems. Van Nostrand, New York 1965 Agmon, S., Douglis, A., Nirenberg, L. [1] Estimates near the Boundary for Solutions of Elliptic Partial Differential Equations Satisfying General Boundary Conditions. I. Comm. Pure Appl. Math., 12 (1959) 623-727 II. Comm. Pure Appl. Math. 17 (1964) 35-92 Akhiezer, N. I., Glazman, I. M. [1] Theory of Linear Operators in Hilbert Space. Vol. 1 & 2. Pitman, London 1981 Aubin, 1. P. [1] Cours d'analyse fonctionnelle. Universite Paris IX Auslander, L., Tolimieri, R. [1] Is Computing with the Finite Fourier Transform Pure or Applied Mathematics? Bull. Amer. Math. Soc. 1 (1979) 847-897 Baiocchi, C., Capelo, A. [1] Variational and Quasivariational Inequalities: Applications to Free-Boundary Value Problems. Wiley, New York 1984 Berezanskii, 1. M. [1] Expansions in Eigenfunctions of Selfadjoint Operators. Amer. Math. Soc. Translat. of Math. Monographs No. 17 (1968) Bochner, S. [1] Lectures on Fourier Integrals, (with supplement) Princeton Univ. Press, 1959 Bolley, P., Camus, J. [1] Regularite pour une classe de problemes aux limites elliptiques degeneres variationnels. C.R. Acad. Sci. Paris, Ser. A 282 (1976) 45-47 Bourbaki, N. [1] Elements de mathematique: Integration. Hermann, Paris 1968 [2] Topological Vector Spaces. Springer-Verlag, Berlin, Heidelberg 1987 Brezis, H. [1] Functional Analysis. Springer- Verlag, New York 1988 Butkovskiy, A. G. [1] Green's Functions and Transfer Function Handbook. Halsted Press,-New York 1982 Calan, C. de [1] Theorie perturbative des champs. Centre de physique theorique, Ecole Poly technique, Paris 1982 Canuto, C., Quarteroni, A. [1] Propfietes d'approximation dans les espaces de Sobolev de systemes de polynomes orthogon- aux, C.R. Acad. Sci. Paris, ser. A, 290 (1980) 925-928 Cartan, H. [1] Differential Calculus. Kershaw Publishing Co., London 1971 
534 Bibliography [2] Elementary Theory of Analytic Functions of One or Several Complex Variables, Addison-Wesley, Reading, Mass. 1972 Chazarain, J., Piriou, A. [1] Introduction a la theorie des equations aux derivees partielles linea ires. Gauthier-Villars, Paris 1981 Choquet, G. [1] Cours d'analyse. Masson, Paris 1973 Colombo, S. [1] Les transformations de Mellin et de Hankel. Editions du CNRS, Paris 1959 Davies, B. [1] Integral Transforms and their Applications. Springer-Verlag, New Y<Ork 1978 Deny, J., Lions, J-L. [1] Les espaces du type de Beppo-Levi. Ann. Inst. Fourier 5 (1953-.!54) 305-370 Dieudonne, J. [1] Infinitesimal Calculus. Kershaw Publishing Co., London 1973 [2] Treatise on Analysis. V 01. II Academic Press, New York 1970 Vol. VII Academic Press, New York 1979 Vol. VIII Academic Press, New York 1981 Dunford, N., Schwartz, J. T. [1] Linear Operators. Vol. 1 General Theory, Wiley, New York 1958 Vol. 2 Spectral Theory, Wiley, New York 1964 Vol. 3 Spectral Operators, Wiley, New York 1971 Duvaut, G., Lions, J-L. [1] Inequalities in Mechanics and Physics, Springer-Verlag, Berlin, Heidelberg 1976 Enflo, P. [1] A Counterexample to the Approximation Problem in Banach Spaces. Acta Math. 130 (1973) 309-317 Faraut, J. [1] Distributions spheriques et equations differentielles singulieres. Seminaire Goulaouic- Schwartz de l'Ecole Poly technique, 1976-1977 (Expose nO IV) Gagliardo, E. [1] Ulteriori proprieta di alcune classi di funzioni in piu variabili. Richerce di Mat. 8 (1959) 24-51 Garding, L., Lions, J - L. [1] Functional Analysis. Varenna 1958 [2] Functional Analysis. Supp1. AI, vol. XIV, Ser. X, del Nouvo Cimento (1959) Gelfand, I. M., Shilov, G. E. [1] Generalized Functions. Vol. 1 Properties and Operations, Academic Press, New York 1964 Gilbarg, D., Trudinger, N. S. [1] Elliptic Partial Differential Equations of the Second Order. 2nd ed. Springer- Verlag, New York 1983 Gottlieb, D., Orszag, S. A. [1] Numerical Analysis of Spectral Methods. SIAM Philadelphia 1977 Grisvard, P. [1] Behaviour of the Solutions of an Elliptic Boundary Problem in a Polygonal or Polyhedral Domain. Numerical Solutions of Partial Differential Equations III, Ed.: Hubbard, B. Academic Press, New York 1976; 207-274 [2] Comportement de la solution d'un probleme aux limites elliptique dans un domaine plan avec fissure. In: Mecanique de la rupture, Ecoles CEA-INRIA-EDF, October 1982; 183-192 [3] Boundary Value Problems in Nonsmooth Domains. Halstead Press, New York 1986 Hormander, L. [1] Linear Partial Differential Operators. Springer-Verlag, Berlin, Heidelberg 1963 
Bibliography 535 [2] Linear Differential Operators. Congres International des mathematiciens (Nice, 1970), Gauthier- Villars, Paris 1971 [3] The Analysis of Linear Partial Differential Operators. Vol. I and II. Springer- Verlag, Berlin, Heidelberg 1983 Horvath, J. [1] Topological Vector Spaces and Distributions. Addison-Wesley, Reading, Mass. 1966 Kato, T. [1] Perturbation Theory for Linear Operators. 2nd ed., Springer-Verlag, Berlin, Heidelberg 1984 Knuth, D. E. [1] The Art of Computer Programming, Vol. 2. Semi-numerical Algorithms. Addison- Wesley, Reading, Mass. 1979 Kondratiev, V. A. [1] Boundary Problems for Elliptic Equations in Domains with Conical or Angular Points. In: Trans. Moscow Math. Soc. 16 (1967) English translation, A.M.S. (1968), pp. 227-313 Kreiss, H. 0., Oliger, J. [1] Stability of the Fourier Method, SIAM J. Numer. Anal. 16 (1979) 421-433 Kufner, A. [1] Weighted Sobolev Spaces. Teubner, Leipzig 1980 [2] The Analysis of Linear Differential Operators. Vol. I & II, Springer- Verlag, Berlin 1983 Laurent, P. 1. [1] Approximation et optimisation. Hermann, Paris 1972 Lavrentiev, M., Shabat, B. [1] Methodes de la theorie desfonctions d'une variable complexe, MIR, Moscow 1972 Lindenstrauss, J., Tzafriri, L. [1] On the Complemented Subspaces Problem. Israel J. Math. 9 (1971) 263-269 Lions, J. L. [1] Supports de produits de convolution. I. C.R. Acad. Sci. Paris 232 (1951) 1530-1532 II. C.R. Acad. Sci. Paris 232 (1951) 1622-1624 [2] Supports dans la transformation de Laplace. J. Analyse Math. 2 (1953) 369-380 [3] Lectures on Elliptic Partial Differential Equations. Tata Inst., Bombay 1957 [4] Operateurs de transmutation singuliers et equations d'Euler-Poisson-Darboux generalisees. Rend. Sem. Mat. Fis. Milano. XXVIII (1959) 3-16 [5] Problemes aux limites dans les equations aux derivees partie lies. Les Presses de l'Univ. de Montreal 1967 [6] Quelques methodes de resolution des problemes aux limites non linea ires. Dunod, Paris 1969 Lions, J-L., Magenes, E. [1] Non-Homogeneous Boundary Value Problems and Applications. Vol. 1 Springer-Verlag, Berlin 1972 Vol. 2 Springer- Verlag, Berlin 1972 Vol. 3 Springer- Verlag, Berlin 1973 Lions, J-L., Peetre, J. [1] Sur une classe d'espaces d'interpolation. Inst. Hautes Etudes Scientifique Pub. Math. 19 (1964) 5-68 Mandel, J. [1] Cours de mecanique des milieux continus. Tome 1: M ecanique des fiuides; Tome 2: M ecanique des solides. Gauthier-Villars, Paris 1966 Martineau, A. [1] Sur Ie theoreme du graphe ferme. C.R. Acad. Sci. 263A, Paris (1966) 870-871 Methee, P. D. [1] Sur les distributions invariantes dans Ie groupe des rotations de Lorentz. Comm. Math. Helv. 28 (1954) 225-269 
536 Bibliography Miller, W. [IJ Symmetry Groups and their Applications. Acad. Press, New York 1972 Miranda, C. [IJ Partial Differential Equations of Elliptic Type. Springer-Verlag, Berlin, Heidelberg 1970 Mizohata, S. [IJ The Theory of Partial Differential Equations. Cambridge Univ. Press 1973 Naimark, M. A. [IJ Linear Differential Operators, Vol. 1 & 2, Harrap, London 1968 Necas, J. [IJ Les methodes directes en theorie des equations elliptiques. Masson, Paris 1967 Nirenberg, L. [1 J Remarks on Strongly Elliptic Partial Differential Equations. Comm. Pure Appl. Math. 8 (1955) 648-674 [2J An Extended Interpolation Inequality. Ann. Scuola Norm. Sup. Pisa Cl. Sci. 20 (1966) 733-737 [3J Lectures on Linear Partial Differential Equations. Regional Conference Series in Mathematics, No. 17, A.M.S., Providence R.I. O'Leary, D. P., Widlund, O. [IJ Capacitance Matrix Methods for the Helmholtz Equation on General Three-Dimensional Regions. Math. Comput. 33 (1979) 849-879 Papoulis, A. [1 J The Fourier Integral and its Applications. McGra w- Hill, New York 1962 Proskurowski, W., Widlund, o. [IJ On the Numerical Solution of Helmholtz's Equation by the Capacitance Matrix Method., Math. Comput. 30 (1976) 433-468 Peetre, J. [IJ Rectification a l'artic1e "Une caracterisation abstraite des operateurs differentiels". Math. Scand. 8 (1960) 116-120 Raviart, P. A., Thomas, J. M. [IJ Introduction a l'analyse numerique des equations aux derivees partielles. Masson, Paris 1983 Rektorys, K. [IJ Variational Methods in Science and Engineering. Reidel, Dotdrecht 1977 Riemann, B. [1 J Uber die Anzahl der Primzahlen unter einer gegebenen Grosse. Gesammelte M athe- matische Werke. Teubner, Leipzig (1876) 136-144 Rockafellar, R. T. [IJ Convex Analysis. Princeton Univ. Press, 1970 Roubine, E. [IJ Complements d'electromagnetisme. Ecole superieure d'electricite, Paris 1980 Saks, S., Zygmund, A. [IJ Analytic Functions, P. W. N., Warsaw 1952 Schwartz, L. [1 J Theorie des distributions. Hermann, Paris 1966 [2J Cours de l'ecole poly technique, Paris [3J Transformation de Laplace des distributions. Communication of the Mathematical Seminar of the University of Lund. Supplementary volume (1952), dedicated to Marcel Riesz Seeley, R. T. [IJ Extensions of COO-functions. Proc. Amer. Math. Soc. 15 (1964) 625-626 Sneddon,!. N. [IJ The Use of Integral Transforms. McGraw-Hill, New York 1974 Sobolev, S. L. [IJ Applications of Functional Analysis in Mathematical Physics. AMS, Providence, R.I. 1963 
Bibliography 537 Stampacchia, G. [IJ Equations elliptiques du second ordre a coefficients discontinus. Les Presses de l'Univ. M on treal, 1966 Stewart, H. B. [IJ Spectral Theory of Heterogeneous Diffusion Systems, J. Math. Anal. Appl. 54 (1976) 59-78 Ti tchmarsh, E. C. [IJ Introduction to the Theory of the Fourier Integral, Oxford Univ. Press 1937 Treves, F. [IJ Linear Partial Differential Equations with Constant Coefficients. Gordon & Breach, New York 1966 [2J Topological Vector Spaces, Distributions and Kernels, Academic Press New York 1967 [3J Basic Linear Partial Differential Equations. Academic Press, New York 1975 Velo, G., Wightman, A. S. [IJ Rigorous Atomic and Molecular Physics. Plenum, New York 1981 Watson, G. N. [IJ A Treatise on the Theory of Bessel Functions. Cambridge Univ. Press 1958 Weil, A. [IJ L'integration dans les groupes topologiques et ses applications. Hermann, Paris 1953 Whittaker, E. T., Watson, G. N. [IJ A Course of Modern Analysis. Cambridge Univ. Press 1947 Widder, D. V. [IJ The Laplace Transform. Princeton Univ. Press 1946 Winograd, S. [IJ Some Bilinear Forms whose Multiplicative Complexity Depends on the Field of Constants. Math. Systems Theory 10 (1977) 169-180 Wolf, K. B. [IJ Integral Transforms in Science and Engineering. Plenum, New York 1979 Y osida, K. [IJ Functional Analysis, 6th ed., Springer-Verlag, Berlin, Heidelberg 1980 Zygmund, A. [IJ Trigonometrical Series. 2nd ed., Cambridge Univ. Press 1968 
Table of Notations The Theorems, Propositions, Lemmas, Definitions, Remarks, Examples and Formula are numbered by Sections. Figures and footnotes are numbered by Chapters. Generalities def   => ij, ) o V 3 iff I.e. o equality by definition arrows of mapping (see A below) implication sign Kronecker delta (0 if i # j, 1 if i = j, i and j EN) denotes the end of a passage forming a logical sequence (remark, proof, . . . ) if necessary for the clarity of the text for all there exist if and only if that is the empty set A. Notations Relating to Sets and Mappings Let E, F, G be three sets and let A be subset of E. We denote by: {xEE;P} E x F En E\A f.E  F } . X  f( X ) the subset of E consisting of the elements possessing the property P the cartesian property of E and F the n-th power of E (n a positive integer) the complement of A in E the mapping of E into F which is such that to the element X E E there corresponds f (x) E F 1 or: x E E  f( x) E F 1 We often writefx instead off(x) whenfis a linear operator. 
Table of Notations I E or I fog fn Imf == f(E) f- 1 flA f( . , t) { ak} or (a k ) (a ij ) 539 the identity mapping in E the composite mapping off:F  G and g:E  F 2 i.e x E E  (fo g)(x) = f(g(x)) E G the composite mapping of order n of f: E  E (n a positive integer): fn == f 0 f 0 . . . f (n factors) the image off: E  F in F 3 the inverse of f if f is injective 4; f- 1: f( E)  E => f- 1 (y) == X E E iff f(x) == y E f(E); thus f- 1 0 f == I E and f 0 f- 1 == I f(E) 5 the restriction off: E  F to the subset A of E the function defined by x H f( x, t) for fixed t the sequence a 1 , a 2 , . . . , a k , . . . the matrix whose ij-th element is a ij B. Notations Relating to Topology Q B (x, a) FE F cc E usually denotes an open set in a topological space the open ball with centre x and radius a > 0 the set F is contained in E with continuous injection the set F is contained in E with compact injection Let A be a proper subset of a topological space E. We denote by: A o A 8A or r supA infA diam A d (A, B) supf(x) XEB (resp. inf f(x)) XEB lim Un (or lim infu n ) the closure of A the interior of A the boundary of A the supremum or upper bound of A in [R the infimum or lower bound of A in [R the diameter of A c [Rn the distance between A, B c [Rn for sup A (resp. inf A) wherefis a mapping of set B into [R, and where A == f(B) c [R 6 the lower limit lim inf Un == sup ( inf Urn) nEN m;::;n 2 We often writefg instead offo 9 whenfand 9 are linear operators. 3 The mapping f is said to be surjective, or a surjection, if f( E) = F (i.e. Vy E F, 3x E E such that f(x) = y. 4 The mappingfis said to be injective, or an injection iff(x) = f(x')  x = x'. 5 If the mappingf is both surjective and injective it is called bijective or a bijection. 6 We also use the notation maxf(x) to indicate that sup f(x) is attained at a point of B. xeB 
540 lim un (or lim sup Un) lim sup f(x) Ixl -+- 00 lim inf f(x) Ixl -+- 00 lim f(x) x-+ +0 lim f( X ) ) x-+ -0 f( X + ) (resp. f(x -1)) O(x) o(x) f I"V g for X  0 Table of Notations the upper limit lim sup Un = inf ( sup Urn) nEI\J m;::;n the upper limit of the real function f when I x I  CIJ limsupf(x) = inf sup f(x) Ix I -+ 00 r ;::; 0 Ixl > r the lower limit of the real function f when I x I  CIJ liminff(x) = sup inf f(x) Ixl -+- 00 r ;::; 0 Ixl > r the limit off: [R  E when x tends to zero through positive (resp. negative) values the limit to the right ( resp. left) of fat x E  a function satisfying IO(x)/xl  k, a positive constant. a function satisfying lo(x)/xl  0 as x  0 functions f and g are real and such that f(x) = g(x) {I + B(X)} where B(X)  0 as x  0 c. Notations Relating to Numbers N, 7L,  and e denote the sets of natural numbers, (positive and negative) integers, real numbers and complex numbers. def A * = A \ {O} with A a subset of n or en containing 0 {x E ; a  x  b} {x E ; a < x < b} {x E ; a < x  b} {x E ; a  x < b} Ifl) def ] 0 [ Ifl) def ] _ CIJ , 0 [ , + == , + CIJ ,_ + def [0, + CIJ [ - def  ==  u { - 00, + CIJ } C 1 Sub-sets of IRII and CII A* [a, b] = ] a, b [ = ] a, b] = [a, b [ = +,-,+ -  n x  (resp ) T or Sl sn - 1 the n-th power of , when the generic element is denoted by x the half-space {x = (Xl' . . . , x n ) E n; X n > 0 (resp. X n  O)} the one-dimensional torus or the unit circle in 2 the unit sphere in n 
Table of Notations 541 C 2 Numbers Let n E N *, a == (a l' . . . , an) E N n, X == (x l' . . . , x n ) E [Rn, Z E C def lal the modulus of a, lal == a 1 + . . . + an X CX x X Y x.y Ix I Re(z) or Rez Im(z) or Imz arg(z) or arg z z Izl an or Qn k n [m] X CX == X CX1 X CXn 1 ... n vector product of x and y belonging to [R3 'scalar product of X, y with X and y belon ging to [Rn h 1 . f def J ( . t e euc Idean norm 0 x: I x I == (x. x) sometImes denoted by r in the text) the real part of z the imaginary part of z the argument of z the complex conjugate of z the modulus of z the surface area of the unit sphere in [Rn defined by k 2 == 2n, and k n == (2 - n)a n for n > 2 the integral part of m E [R D. Notations Relating to Functions and Distributions D 1 Principal notations Y (or Y(x)) C> (or C>(x)) C>a (or C>a(x)) vp Pf the Heaviside function on [Rx the Dirac distribution, sometimes denoted by C>O or C>o(x) the Dirac distribution concentrated at a E [Rn def ba(X) == C>(x - a) Cauchy principal value (of an integral) finite part (of an integral) Let f and g be two (possibly vector) functions or distributions on Q c [Rn; we denote by: ff f or J fff the Fourier transform off; iff: [Rn  C is integrable on [Rn, we have J(y) deC r e  ixy f(x) dx Jn the Fourier co-transform off: ff f(x) deC r e ixy f(y) dy J n if f is integrable on [Rn. 
542 Table of Notations ff - If the inverse Fourier transform off: 1 def 1 . ff - f(x) = (2n)- n e1XYf(y) dy [Rn i.e. ff - 1 = (2n)- n #. f * g the convolution product of f and g. If f and g are continuous with compact support in n, we have (1* g)(x) deC r f( - y)g(y)dy J [Rn f@g the tensor product off and g. Iff and g are two real functions, we have def (f @ g) (X, y) = f(x)g(y) suppf sing suppf sing sUPPaf fO fO the support off' the singular support off the singular analytic support off positive function or distribution function or distribution of positive type D 2 Differential calculus and linear differential operators Let f be a (possibly vector) function on  or Q c n with values in a Banach space X. We denote by: D f( a) the derivative of f:   X at the point a E  7 the derivative of order n off at a E . Notice that f(O)(a) = f(a), f(I)(a) = ff (a), f(2)(a) = f"(a) The Frechet derivative of f: n  X at the point a E n: ff ( a ) f(n)(a) f( a + h) = f( a) + D f( a ) . h + I h I 0 ( h ) D k f( a) The Frechet derivative of order k off: n  X at the point a E n . 8f 8f 7 In mechanics, we also use f for - and f i for -. 8t 8x i 
Table of Notations of Dkfor _ 0 X k (or 0kf) olalf Daf or _ 0 x a (or oaf) of and of OZ oz gradf or Vf (or Df) divf or v. f curl v or V x v (or rot v) 543 the partial derivative of f: n  X with respect to the variable x k 7, 8 the partial derivative of order I a I of f: n  X def D a f == (D 1 )a 1 . ( D 2 )a 2 . . . (D n )a n f, where a = (al,az,...,a n ) f being a function of x and y (both real) of def  ( of _ i of ) . of def  ( of + i Of ) oz 2 ox oy' oi 2 ox oy . def ( Of Of ) the gradIent ofj:Q -+ IR or X: Vj = ax! "'" ax. the divergence off: Q  n or en, of of V .f=- o +...+- 0 Xl x n the curl of v: 3  3: curl v def ( OV3 _ OVZ OV 1 _ OV 3 OV 2 _ OVI ) OXZ OX 3 'OX 3 OX 1 'OX I oX 2 More generally, iff: Q  n V x f = ( oj; _ ofj ) ox.ox."- 1 J 1 1, J - ,..., n the Laplacian off: Q  n: 0 2 f 0 2 f L1f = _ 0 2 + + _ 0 2 Xl X n 02f Of or Onf the d'Alembertian of f:Q  : Of = 0 z - L1f, with here Q c  X t t J f the Jacobian off: n  n: Jf(x) def Idet(ojJ;(x))1 Let v be the external normal to 0 Q with j-th direction cosine cos (v, Xj)9; we L1f or L1n f 8 In the case in whichfis a function of the space-variable x E [Rn and of the time-variable t E [R, with values in X, we also denote the partial derivative off with respect to t by a f ; we consider thenfto be at a function of time t  f(', t) with values in a space of functions (or of distributions) of x E [Rn into X. 9 We also often use the notation n to denote the normal to aQ. 
544 Table of Notations denote by: floo or fir or Yo f of ov or ovf (or Y1f) Ynf of OVA the trace of order zero of: Q  [R (or C or X) the trace of order 1 off: Q  [R (i.e. the partial derivative with respect to V)10 an! the trace of order n off:Q  IR, Ynf = _ 0 v n n with f: [R x [Rt  [R and A an operator: n A = L 0 i ( aij (x, t ) OJ) + L ai (x, t) a i + -a o (x, t) i. j = 1 we have D a of def n _ 0 L aij(x,t)Ojf.cos(V,Xi) V A i. j = 1 the differential operator Dj = 0 0 and x j D a - D a l D an wl . th - ( ) - 1'.' n a - a 1, . . . , an p(y) the differential operator P(D) = I aa Da where (aa) is a locally finite family aE Nn the characteristic polynomial associated with P(D): P(D) p(y) = L aa(iy)a=p(iy)11 a E Nn P*(D) P.(D) the formal adjoint of P(D) the principal part of P (D) E. Notations Relating to Spaces of Continuous Functions and of Distributions E 1 Spaces of continuous functions (a) Functions with real or complex values. Suppose that Q is an open set of [Rn, K a compact set of [Rn, /tad the seminorm 12 of of 10 We also use - and - where v+ and v- are respectively the external and internal normals ov + ov - to oQ. 11 We also find in publications the alternative notations: 1 0 D =  -; in this case the characteristic polynomial is p(y) = p (y); 1 ox o D = i -; in this case the characteristic polynomial is p(y) = P( - y). ox 12 See 1 of Chap. VI. 
Table of Notations 545 defined by f sup I DCXf{x) I, (a E n and d c [Rn) and k E  we denote generally by: b{[Rn) or 8O{[Rn) the space of bounded continuous functions on [Rn. It is a Banach space for the norm /to, [Rn o ([Rn) or 80 0 ([Rn) the space of continuous functions on [Rn tending to zero at infinity. It is a Banach space for the norm /to, [Rn 80k ([Rn) the space of functions of class k on [Rn which, together with all derivatives of orders  k, tend to zero at infinity. It is a Banach space for the norm f  sup (/ta, [Rn (f)). lal  k (K) or O{K) the space of functions continuous on K. It is a Banach space for the norm /to, K k{ Q) the space of functions of class k on Q, provided with the seminorms {/tCX, K, lal  k, K c Q} (Q) the space of functions of class k{Q) with compact support contained in Q  0, cx ( Q), 0 < a  1 the space of Holder functions of order a on Q, i.e. the space of functions f continuous on Q such that sup If(x) - f(:) 1 < + 00. x, Y E Q I x - y I k,CX{Q) tne space of functions f E f£k (Q) such that Dj f E o, CX{Q), Vj, Ij I  k. the space of the functions  00 on Q, provided with the semI-norms oo{Q) or G{Q) Y ( [Rn ) {/tCXK, a E  n, K c Q} the space of rapidly decreasing functions of G ([Rn) such that all the derivatives satisfy: IxlkIDCXf{x)1 -+ 0 as Ixl -+ 00, V k E , Va E n; this space can be pro- vided with the semi-norms {f sup{lxlkDCXf{x)),k E ,a E n} XEn (Q) the space of slowly increasing functions of G{[Rn), together with all their derivatives (space of multipli- cators of Y) the space of those functions of G (Q) which have compact support in Q. The sequence (A) of (Q) (9M{[Rn) 
546 Table of Notations tends to zero in !!2 (Q) if U supp fk C K is compact in Q and kEN Ita, K(fk) -+ 0, Va E n the space of functions on [a, b J which can be extended to functions of !!2([R). The sequence (A) of !!2([a, bJ) tends to zero in !!2[a, bJ if /ta,[a,bJ(A) -+ 0, Va E n when k -+ 00 the space of the functions of !!2 (Q) with support in K, provided with the semi-norms {fta, K, a E } 13 If Q = [R or [Rn = [R we denote by f!lj, f!ljo, . . . , !!2,!!2 K the sets f!lj ([R), f!lj 0 ( [R), . . . ,  ( [R), !!2 K ( [R ) (b) Functions with values in a Banach space X, with norm II . II. The preceding definitions can be generalized to functions with values in X. The semi-norm ft,d then becomes !!2 ( [a, b J ) !!2 K (Q) f sup II Daf(x) II. XEd We denote the corresponding spaces by f!lj ([Rn, X), f!ljo ([Rn, X), f!ljk ([Rn, X),  (K, X) or o (K, X) etc. . . . In the same way if Q = [R or [Rn = [R, we denote f!lj ( [R, X), f!ljo ( [R, X) etc. . . . by f!lj (X), f!ljo ( X), etc. . . . E 2 Spaces of integrable functions Let Q be an open set in [Rn, and s a real number greater than or equal to 1. (a) Functions with real or complex values L S (Q) the space of classes of measurable functions on Q such that x  If(x)IS is integrable on Q. This is a Banach space for the norm ff-+ [fn1f(X)ISdX TIS L 00 (Q) the space of classes of measurable functions on Q such that x  I f(x)1 is essentially bounded. This is a Banach space for the norm f sup ess f(x) XEQ Lfoc (Q) the space of classes of measurable functions on Q such that x  I f( x) IS is locally integrable 13 We also use the notation f?fi (Q) to denote the set of the restrictions to Q of the functions of f?fi ([R_). 
Table of Notations 547 L(Q) the space with weight p: Q   +, locally integrable, of the classes of measurable functions such that x  p(x)1 f(x)IS is integrable on Q. It is a Banach space for the norm fH [fn'f(X)!S p(x)dx TIS the space of the classes of square integrable functions for the measure J1 If Q = ] a, b [, these spaces are denoted by LS(a, b), L 00 (a, b), . . . , L; (a, b). We also use the notation L S , L 00, . . . , L; when there is no risk of confusion about the domain Q. (b) Functions with vector values in a Banach space. The preceding definitions can easily be generalised to functions with values in X. We denote the corresponding spaces by: L; (  ) LS(Q, X), Loo(Q, X) etc. If X is a Hilbert space, so is the space L 2 (Q, X). E3 Spaces of distributions (a) Distributions with real or complex values. f!t)' (Q) the space of distributions on Q, i.e. the set of continu- ous linear forms on f!t) ( Q): def f!t)' ( Q) = 2 ( f!t) ( Q),  or C) G' (n) the space of distributions with compact support on n, i.e. the set of continuous linear forms on G( n): def G'(n) = 2(G(n),  or C) ?' ( n) the space of tempered distributions on n, i.e. the set of continuous linear forms on ? (n): ?'(n) = 2(?(n),  or C) the space of tempered distributions on n which, with all their derivatives, are rapidly decreasing (or the space of convolutors of ? (n)). If Q =  or n =  we denote these spaces by f!t)', G', ?' and (!). (b) Distributions with vector values in X, a Banach space The above definitions can be generalised and the corresponding spaces in trod uced: (!)(n) f!t)'(Q, X), G'(n, X), ?'(n, X) and (!) (n, X). 
548 Table of Notations E4 Sobolev spaces Let Q be an open set in n and that m E , 1  p < 00, S E . We denote by: def Hm(Q) Hm(Q) = {fE L 2 (Q); DafE L 2 (Q), Va E n, lal  m}. This is a Hilbert space with sca lar product (f,g) def L f Daf(x).Dag(x)dx lal  m Q Ho(Q) H-m(Q) The closure of '@(Q) in Hm(Q) The dual space of Ho (Q): H -rn(Q) def !f(Ho(Q), [R). This is a Hilbert space with the norm H S ( [Rn) I <F,f>1 F H ;E Ilfll . Hs(n) def {fE ?'([Rn); (1 + IYI2)S I 2 fffE L2([R)} If S == m and Q == [Rn, H m ( Q) and H S ([Rn) coincide. H foe clef {f E .@' ( Q); V qJ E .@ ( Q), f. qJ E H S ( Q ) } def Wm,P(Q) = {fE LP(Q);Va E n, lal  m=>DafELP(Q)} This a Banach space for the norm H foe ( Q ) w m , P(Q) [ J IIP f 1--+ L II Dafil P lal  m W(f,P(Q) the closure of '@(Q) in wm, P(Q) If Q == [R or n == [R we denote these spaces also by H m , Ho, etc. . . . F. Notations Relating to Linear Operators Generalities on the spaces relative to the operators considered In a general way, we denote by 00 the sum of two parts A and B of a vector space the direct sum of two vector spaces F and G the tensor product of two vector spaces F and G the quotient space of F by G if G c F the orthogonal complement of H in a prehilbert space the exterior Hilbert sum of a sequence (Hn) of Hilbert spaces A+B FG F@G FIG H.1 Ef) Hn n = 1 
Table of Notations 549 Let X be a Banach space with dual X' (see Sect. F 2 ) we denote by: dim X the algebraic dimension of X IIxllx or II xii the norm of x E X or Ixlx or Ix I II x' II' < x, x' > I x or id x or I the norm of x' E X' the bracket of duality between x E X and x' E X,14 the identity operator in X F 1 Linear operators Let X, Y be Banach spaces and A:X  Y a linear mapping; we adopt the following notations (when the entities exist) ker A(or N(A)) 1m A (or R(A)) D(A) G(A) G S ( A ) a( A) or S ( A ) p(A) A -1 A* tA A R (A, A) or R (A) VA a(A) {3(A) rang A or rg ( A ) detA tr A or Tr A r(A) II A II A 2 ::) A 1 the kernel of A the image of A the domain of A def the graph of A: G(A) == {(x,y) EX x Y;xED(A),Y == Ax} the inverse graph of A: GS(A) def { (y, x); (x, y) E G(A)} the spectrum of A the resolvent set of A: p(A) == C\a(A) the in verse of A the adjoint of A the transpose of A the closure of A the resolvent operator of A: R (A, A) def (A - AI) - 1 for A E P (A) 15 the Cayley transform of A:V A == (A + iI)(A-iI)-1 h 11 . . d def t e nu Ity In ex of A == dim N(A) h d fi .. def tee clency Index of A == codim R(A) the rank of A the determinant of A the trace of A the spectral radius of A the norm of A the operator A 2 is an extension of the operator A 1 14 If H is a Hilbert space we denote the scalar product of x and y in H by (x, Y)H or (x, y) or « x, y)). 15 We sometimes adopt the definitions R (A, A) = (AI - A) - 1. 
550 Table of Notations F 2 Spaces of linear operators Let X and Y be two topological vector spaces, real or complex. We denote by: the set of continuous (or bounded) linear mappings of X with values in Y If X and Yare both Banach spaces, then so is f£(X, Y), provided with the norm: II A 112(x, Y) = sup II Ax II Ilxllx = 1 def f£(X) == f£(X, X) the dual space of X, i.e. f£ (X, [R or C), the set of continuous linear forms on X 16 the bidual space of X, i.e. the dual of X' for the topology of the norm IIxll" = sup l<x',x)l. x' E x' Ilx'll = 1 Let H be a complete separable complex Hilbert space; we denote by f£ 1 ( H), the trace class, i.e. the set of nuclear operators in f£ ( H). This is a Banach space with norm 8  tr 181. 17 f£(X, Y) f£(X) X' - l X" 16 We also frequently denote by X' the antidual of X, i.e. the set of continuous antilinear forms on X. def * 1 17 We shall see in Chaps. VI and VIII that ISI == (8 8 )2, 8 E f£ (H). 
Index Babuska's paradox 424 backward Cauchy problem 204,223 Baire's lemma 276 Baire's property 276 ball 271 Banach algebra 318n Banach space 272 Banach's approximation problem 330 Banach-Steinhaus theorem 275 Bessel functions 42 -integral representation 43 -recurrence relations 43 Bessel-Parseval inequality 302 Bessel's equation 42 bicharacteristic 159 bicharacteristic curve 161 bicharacteristic solution 160 bidual 285 biharmonic equation 55 biharmonic operator 450 bijection 539n bilinear form 292 Bochner-Schwartz theorem 526 bounded operator 274 Cauchy problems 164, 204, 209, 214, 217 Cauchy-Kowalewsky theorem 164 Cauchy-Riemann operator 176, 188 Cauchy-Schwartz ine1iuality 121, 293 Cauchy sequence 272 Cayley transform 362 Cesaro mean 17 Cesaro summability 15 characteristics 157 -multiple 159 -simple 159 characteristic cone 158 characteristic hyperplane 157 characteristic polynomial 171 characteristic vector 157 Chinese remainder theorem 71 closable operator 335 closed graph theorem 280 closed image theorem 348 closed operator 335, 361 closure of an operator 335 codimension 307 coercive sesquilinear form 368 collocation method 80 compact operator 123 compactness (weak) 289 comparison of operators 241 comparison principle 261 complete family 300 complete metric space 272 composition of operators 307, 316 cone -future 492 -past 492 constant force operator 245 continuity 332 continuous operator 274 contraction 297n convergence -strong 287, 317 -weak 287, 318 ( abstract variational problem 375 accretive boundary value problem 453 accretive extension 372 accretive operator 372 adjoint operator 353, 361 adjoint sesquilinear form 353 algebra 497 - Banach 318n -normed 318n -of convolutions 497 -of distributions 498 analytic semi-groups 225 analyticity of solutions 234 antidual of a topological space 281 anti linear form 281 automorphisms of [Rn 518 
552 convergence (cont.) - weak -star 290 convergence of operators 313 convex set 285 convexity inequalities 133 convolution of distributions 492 convolution of functions 458 convolution algebra 497 convolutive support 496 convolutor 521 Cooley- Tukey transform 64 cracks 406 cyclic convolution 69 -of order 2 72 -of order 3 73 -of order 6 75 d'Alembertian 180, 230 decomposable operator 190 degenerate operator 322 density 273, 461 denumerable total family 300 diagonalisation of operators 15, 25, 29, 51 differentiability 332 differential quotients, method of 430 differentiation in LP 338 diffusion of neutrons 440 Dirichlet problems 17, 20, 32, 53, 112, 379 Dirichlet's lemma 6n Dirac distribution 466 -on a surface 487 Dirac measure 466 -primitives of 482 Dirac's periodic distribution 9 direct Cauchy problem 205, 223 discrete Fourier transform 59 discretisation of Poisson problem 84 displacement operators 319 dissipativity 251 distance 272 distributions -bounded 526n -convolution of 492, 494 -differentiation of 467, 488 -homogeneous 519 -invariant under rotation 489 -invariant under translation 489 - normal space of 97 -of positive type 525 -of rapid decay 521 -of slow growth 507 - periodic 7 Index -positive 467 -restriction of 474 -tempered 507 - tensor product of 480 -with compact support 475 - with convolutive support 496 division problem 483 domain of a linear operator 305 domain of dependence 210 dual of a topological space 281 elasticity operator 419 elastostatics 411 elementary solution of an operator 149, 183 elliptic problems of 2nd order 393 energy space 269 equivalent norms 280 equivalent semi-norms 271 estimates of analyticity 237 Euler's relation (identity) 160, 519 extension - by reflexion 117 -canonical 116 -m-extension 114 -of a linear operator 282, 307 famil y -complete 300 -orthonormal 299 - total 300 fast Fourier transform -applications 78 -in two dimensions 77 -of Cooley and Tukey 64 -of Good and Winograd 6 fast solvers for Laplacian 84 finite part 469 five-point scheme 85 flexure of plates 420 force of an operator 242 form -antilinear (or semi-linear) 281 -bilinear 292 -linear 281 -sesquilinear 292, 351, 367 forward Cauchy problem 204 Fourier coefficient 5, 10, 301 Fourier cotransform 501 Fourier inversion formula 503 Fourier transforms 356, 500 -calculation of 510 -fast 61, 64, 66 
Index Fourier transforms (cont.) -in L 1 500 -in L 2 506 -in [Rn 56 -of distributions with compact support 509 -of tempered distributions 506 -of measures 523 -partial 513 Frechet space 272 Fredholm alternative 378 Fredholm operator 348 function of positive type 525 function of rapid decay 502 function of slow growth 523 fundamental solution 183n future cone 492 Galerkin method 374 Garding's inequality 389 Gevrey class 240 global regularity 426, 437 Good-Winograd transform 66 graph of an operator 306 Green-Ostrogradski formula 379 Green's function 441, 445 Green's kernel 441 Hahn-Banach theorem 282, 374 Hamiltonian system 160, 192 Hankel-Bochner transformation 56n Hankel transform 2, 40, 47 -table of 57 Harnack estimate 267 Harnack's inequality 250 heat operator 192 Hermitian form 357 Hermitian operator 353 Hermitian scalar product 291 Hermitian symmetry 353 Hilbert-Schmidt operator 357 Hilbert space 291 -separable 300 Holmgren's theorem 166 holomorphic semi-groups 225 holomorphy 305 homeomorphism 321n hyperbolic operator 190 -of order 2 198 hypoellipticity 177, 230 Image -of a distribution 481 553 -of a linear operator 305 inclusion properties 461 inductive limit in Frechet spaces 273 indices -of deficiency 307 -of nullity 307 inequalities in Sobolev spaces 125 inequality -of Cauchy-Schwarz 293 -of Garding 389 -of Korn 414 -of Poincare 126 -of Young 459 infinite matrices 311, 323, 338, 352 injection 539n injection of Sobolev 139 injective mapping 539n integral operators 312,319,324,352,360 integro-differential forms 398 interpolate 60 invariance under translation 272 inverse image 481, 485 invertibility of operators 319 involutive mapping 323 isometric operator 355 isometry 41, 355 kernel of a linear operator 305 Korn's inequality 414 Landau's notation 523n Laplace-Beltrami operator 147, 156 Laplace operator 173 Laplacian 173 Lax-Milgram theorem 368, 376 Leibniz's formula 152,476 lifting operator 110 linear differential equations 479 linear differential operators 148 linear form 281 local operator 149 local regularity of solutions 230 locally finite operator 149 Lorentz group 491 Love-Kirchhoff theory 420 m-extension 114 matrices, infinite 311, 323, 338, 352 maximum principles 251, 259, 265 measure -absolutely continuous 466 -density of 466 
554 Index one-parameter semi-groups 226 open mapping theorem 278 operator in Banach or Hilbert spaces -adjoint 353, 361 -bounded 274, 325 -compact 327 -continuous 274 -degenerate 322 - Fredholm 348 - Hermitian, positive definite 353 - Hilbert-Schmidt 357 - isometric 355 -nilpotent 319n -nuclear 331 -orthogonal projection 354 - partially isometric 356 -projection 325, 355 -quasi-nilpotent 319 - restriction 307 -symmetric 361 -transpose 322, 343 -unitary 355 operators, linear differential -antisymmetric 155 -characteristic polynomial of 171 -decomposable 190 -elli ptic 176 -formal series associated with 150 -hyperbolic 190 -hypo-analytic 176, 235 -hypo-elliptic 177, 221 -locally finite 149 -of constant force 245 -order of 154 - parabolic 202 -partial order of 157 -principal part of 154 -semi-elliptic 247 -strictly hyperbolic 196 -strongly elliptic 175n, 224 -symmetric 155 -8-strongly elliptic 175 -transpose -153 -ultra-hyperbolic 201 -weakly parabolic 222 -with analytic coefficients 163 -with constant coefficients 170 operators (not necessarily bounded) -accretive 372 -bounded 274 -closable 335 -closed 335, 361 -compact 123 -continuous 274 -essentially self-adjoint 336 -invertibility 319 -lifting 110 -maximal symmetric 336 -mutually transposed, 309 - normal 366 -sectorial 373 -self-adjoint 362 -transpose 309, 343 -with closed image 346 order of a linear differential operator 154 orthogonal decomposition 119 orthogonal elements 294 orthogonal projection of an operator 298, 355 orthogonal projector 298, 355 orthonormal base 300 orthonormal family 299 measure (cont.) -of mass + 1 466 -of slow growth 523 -positive 467 -summable 465 -tempered 523 Meixner's condition 37 Mellin transform 2, 24 - table of 40 mixed problem 394 monomial pseudo-functions 471 multiplicator 520 mutually transposed operators 309 Neumann problems 380, 448 Neumann series 320 neutrons, global balance of 410 Newtonian potentials 185n Nikodym open sets 129 non-local boundary conditions 397 norm 270 normal space of distributions 97 normed algebra 318n normed vector space 272 nuclear operator 331 nullity 307 Paley-Wiener theorem 505 Paley- Wiener-Sch wartz theorem 510 parabolic boundary 252 parabolic comparison principle 257 parabolic maximum principle 252 
Index parabolic operator 202 parallelogram identity 293 Parseval's relation 53, 301, 357, 503 partial Fourier transform 513 periodic distribution 7 periodic transform 9n Plancherel's formula 503 Poincare's constant 127 Poincare's inequality 126, 379 Poisson's integral formula 18 Poisson's summation formula 512 positive definite form 357 pre-Hilbert space 293 primitives of a distribution 477 primitives of the Dirac measure 482 principal part of a linear differential oper- ator 154 principle of the strong maximum 251 principle of the strong parabolic maXI- mum 265 principle of the weak maximum 259 principle of uniform boundedness 275 projection 296, 328 projection operator 355 projector 355 propagation of singularities 210 proper mapping 481 properly elliptic operator 437 pseudo functions Pf 471 quadrature formula 61 quasi-nilpotent operator 319 quotient operator 308 quotient space 308 radial function 56 rank of an opera tor 330 reduced minimal modulus 342 reflexivity 286, 302 regularisation 81, 104,461 regularity of solutions of variational problems -global 426,437 -interior 426 relatively compact part 327n Rellich's theorem 123 resolvent operator 320 resolvent set 320 restriction of a distribution 474 restriction of an operator 307 Riemann-Lebesgue theorem 500n Riesz-Frechet theorem 302n 555 Riesz representation theorem 302 Robin's problem 400, 448 salient cone 189 saltus 470 scalar product 291 -hermitian 291 -on a Hilbert space 293 Schauder spaces 3 Schauder's theorem 329 Scheme with five points 88 Scheme with three points 84 Schmidt's orthonormalisation method 300 Schrodinger operator 180, 192 Schwartz's proposition 526 Schwartz's theorem of kernels 528 sectorial operators 373 self-adjoint operator 351 n semi-ball 271 semi-groups 225 semi-norm 270 sesquilinear form 292, 351, 367 -adjoint 353 -non-degenerate 292 -non-negative 293 -positive definite 292 -uniformly strongly elliptic 389 So bolev spaces 92, 138 -density theorems 102 -trace theorems 113, 140 -with weights 141 Sobolev's embedding theorem 100 Sobolev's inclusion 131 Sobolev's injections 139 space - Banach 272 -complete metric 272 -energy 269 - Frechet 272 -Hilbert 291 -locally convex topological 271 - metric 272 -metrisable 272n -normed vector 272 -of bounded operators 310 -of distributions 463 -of tempered distributions 507 -of test functions 457 - of traces 22 -pre-Hilbert 291 -reflexive normed 286 -Schauder 3 
556 space (cont.) -separable 283, 300 -separated topological 271 -Sobolev 92 spectral radius 319 spectrum of an operator 84, 320 speed of propagation 215 statical problems of elasticity 411 Stirling's formula 234n strong covergence 287 strong maximum principle 263 strong parabolic maximum 265 sub-critical medium 405 subharmonic function 251 sum of operators 307 summability 15 support of a distribution 474 support cone of an elementary solution 189 surjection 539 surjective map 539 Table -of Fourier transforms of distributions 532 -of Hankel transforms 57 -of Mellin transforms 40 Tchebycheff polynomials 81 tempered distributions 184, 507 tensor product of two distributions 480 tensor product of two functions 480 test functions 457 3-point scheme 84 / topological supplement 326 topological vector space 270 to tal family 300 'totally bounded set 328n trace map 108 traces of Sobolev spaces 113 transform -bilateral Laplace 25 -discrete Fourier 59 -fast Fourier 64 - Fourier 2 Index - Hankel 2, 40 - La place 2 - Mellin 2, 24 transformation -Cooley- Tukey 64 -Good-Winograd 66 - to the left 306 - to the right 306 transmission conditions 401 transmission problem 400 transpose -of a linear differential operator 153 -of an operator in Banach or Hilbert space 343 truncation 103, 461 uniform boundedness 275 uniform derivative 332 unitary operator 355 variation of a function 478n varia tion of parameters 479 vector space 270 virtual power 412 von Neumann's theorems 362 wave front set 211n wave operator 180 weak compactness 289 weak convergence 287 weak derivative 332 weak differentiability 304 weak holomorphy 305, 333 weak maximum principle 259 weak-star convergence 290 weak topology 304 weakly parabolic operator 221 weight function 242 weighted Sobolev space 141 well-posed Cauchy problem 214, 217 Young's inequality 459 
Contents of Volumes 1, 3 - 6 Volume 1 Physical Origins and Potential Theory Chapter I. Physical Examples Introduction Part A. The Physical Models  1. Classical Fluids and the N avier-Stokes System  2. Linear Elasticity  3. Linear Viscoelasticity  4. Electromagnetism and Maxwell's Equations  5. Neutronics. Transport and Diffusion Equations  6. Quantum Physics Appendix. "Mechanics" - Basic Ideas Concerning the Problems of Mechanics  I. Indicial Calculus. Elementary Techniques of the Tensor Calculus  2. Notation, Nomenclature and Conventions in Mechanics  3. Ideas About the Principle of Virtual Power Part B. First Examination of the Mathematical Models  1. The Principal Types of Linear Partial Differential Equations Introduced in Chapter I.A  2. Global Constraints Imposed on the Solution of a Problem: Inclusion in a Function Space; Boundary Conditions; Initial Conditions Chapter II. The Laplace Operator Introd uction  1. The Laplace Operator  2. Harmonic Functions  3. Newtonian Potentials  4. Classical Theory of Dirichlet's Problem  5. Capacities  6. Regularity  7. Other Methods of Solution of Dirichlet's Problem  8. Elliptic Equations of the Second Order Bibliography Table of Notations Index 
558 Contents of Volumes 1, 3-6 Volume 3 Spectral Theory and Applications Chapter VIII. Spectral Theory Introduction  1. Elements of Spectral Theory in a Banach Space. Dunford Integral and Functional Calculus  2. Spectral Decomposition of Self-Adjoint and Compact Normal Operators in a Separable Hilbert Space and Applications  3. Spectral Decomposition of a Bounded or Unbounded Self-Adjoint Operator  4. Hilbert Sum and Integral Associated with the Spectral Decomposition of a Self-Adjoint Operator A in a Separable Hilbert Space H Appendix. "Krein-Rutman Theorem" Chapter IX. Examples in Electromagnetism and Quantum Physics Introduction Part A. Examples in Electromagnetism  1. Basic Tools: Gradient, Divergence and Curl Operators  2. Static Electromagnetism  3. The Spectral Problem in a Bounded Open Domain (Cavity) with Perfect Conductor Boundary Conditions  4. Spectral Problems in a Wave Guide (Cylinder) Part B. Examples in Quantum Physics Introduction on the Observables of Quantum Physics  1. Operators Corresponding to the Position, Impulse and Kinetic Moment Observables  2. Hamiltonian Operators in Quantum Physics Appendix. Some Spectral Notions Bibliography Table of Notations Index Volume 4 Integral Equations and Numerical Methods Chapter X. Mixed Problems and the Tricomi Equation Introduction  1. Description and Formulation of the Problem  2. Methods for Solving Problems of Mixed Type Chapter XI. Integral Equations Introduction Part A. Solution Methods Using Analytical Functions and Sectionally Analytic Functions Introduction  I. The Wiener-HopfMethod  2. Sectionally Analytic Functions  3. The Hilbert Problem  4. Application to Some Problems in Physics 
Contents of Volumes 1,3-6 559 Part. B. Integral Equations Associated with Elliptic Boundary Value Problems in Domains in]R3  1. Study of Certain Weighted Sobolev Spaces  2. Integral Equations Associated with the Boundary Value Problems of Electrostatics  3. Integral Equations Associated with the Helmholtz Equation  4. Integral Equations Associated with the Problems of Linear Elasticity  5. Integral Equations Associated with the Stokes System Chapter XII. Numerical Methods for Stationary Problems Introduction  1. Principal Aspects of the Finite Element Method Applied to the Problem of Linear Elasticity  2. Domains with Curved Boundaries  3. A Non Conforming Method of Finite Elements  4. Application to the Problems of Plates and Shells  5. Approximation of Eigenvalues and Eigenvectors  6. An Example of the Approximate Calculation for a Problem of the Eigenvalues of a non Self-adjoint Operator Chapter Xill. Approximation of Integral Equations by Finite Elements. Error Analysis Introduction  1. The Case of a Polyhedral Surface  2. The Case of a Regular Closed Surface Appendix. "Singular Integrals" Introduction  1. Operator, Convolution Operator, Integral Operator  2. Hilbert Transform  3. Generalities on Singular Integral Operators  4. Operators with Symbols, Operators on L2  5. The Calderon-Zygmund Spaces Theorem  6. Marcinkiewicz Spaces Bibliography Table of Notations Index Volume 5 Evolution Problems I Chapter XIV. Evolution Problems: Cauchy Problems in]RD Introd uction  1. The Ordinary Cauchy Problem in Finite Dimensions  2. Diffusion Equations  3. Wave Equations  4. The Cauchy Problem for the Schrodinger Equation, Introduction  5. The Cauchy Problem for Evolution Equations Related to the Convolution Product  6. An Abstract Cauchy Problem. Ovsjannikov's Theorem 
560 Contents of Volumes 1, 3 - 6 Chapter XV. Evolution Problems: The Method of Diagonalisation Introduction  1. The Fourier Method or the Method of Diagonalisation  2. Variations. The Method of Diagonalisation with an Operator Having Continuous Spectrum  3. Examples of Application: The Diffusion Equation  4. The Wave Equation: Mathematical Examples and Examples of Application  5. The Schrodinger Equation  6. Application with an Operator Having a Continuous Spectrum: Example Appendix. Return to the Problem of Vibrating Strings Chapter XVI. Evolution Problems: The Method of Laplace Transformation Introd uction  1. Laplace Transform of Distributions  2. Laplace Transform of Vector Distributions  3. Applications to First Order Evolution Problems  4. Evolution Problems of Second Order in t  5. Applications Chapter XVII. Evolution Problems: The Method of Semigroups Introd uction Part A. Study of Semigroups  1. Definitions and Properties of Semigroups Acting in a Banach Space  2. The Infinitesimal Generator of a Semigroup  3. The Hille- Y osida Theorem  4. The Case of Class ([0 and Stone's Theorem  5. Differentiable Semigroups  6. Holomorphic Semigroups  7. Compact Semigroups Part B. Cauchy Problems and Semigroups  1. Cauchy Problems  2. Asymptotic Behaviour of Solutions as t  + 00, Conservation and Dissipation in Evol ution Equations  3. Semigroups and Diffusion Problems  4. Groups and Evolution Equations  5. Evolution Operators in Quantum Physics. The Liouville-von Neumann Equation  6. Trotter's Approximation Theorem Chapter XVIII. Evolution Problems: Variational Methods Introduction. Orientation  1. Some Elements of Functional Analysis  2. Galerkin Approximation of a Hilbert Space  3. Evolution Problem of First Order in t  4. Problems of First Order in t (Examples)  5. Evolution Problems of Second Order in t  6. Problems of Second Order in t. Examples  7. Other Types of Equation Bibliography List of Equations Table of Notations Index 
Contents of Volumes 1, 3 - 6 561 Volume 6 Evolution Problems II: The Navier-Stokes and Transport Equations in Numerical Methods Chapter XIX. The Linearised Navier-Stokes Equations Introd uction  1. The Stationary N avier-Stokes Equations: The Linear Case  2. The Evolutionary N avier-Stokes Equations: The Linear Case  3. Additional Results and Review Chapter XX. Numerical Methods for Evolution Problems  1. General Points  2. Problems of First Order in Time  3. Problems of Second Order in Time  4. The Advection Equation  5. Symmetric Friedrichs Systems  6. The Transport Equation  7. Numerical Solution of the Stokes Problem Chapter XXI. Transport  1. Introduction. Presentation of Physical Problems  2. Existence and Uniqueness of Solutions of the Transport Equation Appendix of  2. Boundary Conditions in Transport Problems. Reflection Conditions  3. Spectral Theory and Asymptotic Behaviour of the Solutions of Evolution Problems Appendix of  3. The Conservative Milne Problem  4. Explicit Examples  5. Approximation of the Neutron Transport Equation by the Diffusion Equation Appendix Bibliography List of Equations Table of Notations Index