/
Similar
Text
.
11
i 1
Eberhard Zeidler
Nonlinear
Functional Analysis
and its Applications
II/A: Linear Monotone Operators
Translated by the Author and by Leo F. Boront
With 45 Illustrations
Springer
Eberhard Zeidler
Sektion Mathematik
Karl...Marx..Platz
7010 Leipzig
German Democratic Republic
Leo F. Boront
Department of Mathematics
University of Idaho
Moscow, ID 83843
U.S.A.
Mathematics Subject Clas.qficalion (19RO 4611
LiInry cI ConptII Catalo&ialu. NJlicatioD Data
(.... for \tOI. 2 1*. A-B)
Zeidler. Eberhard.
. 2. ptJ. A-B: TP.,laed by the IUIhor and
Leo L. Boron.
\\II. 3: T......1fed by Leo L. Bomn.
Vol. 4; Tn.-liled by Juerpe QuandI.
IftChIdeI bibliographies and indexes.
C (MIlenll : 1. Fixed point IhcoIanI - 2. pt A. U1aI'
monoconc openIOis . PI. B Noalinear operators - (eec.1
- 4. AppiicatioGs 10 mattaem.l icaI pbyIia.
1. NonIiftell' functional....ysis. I. 1itle.
2I.S.7ASI3 I98S 515.7 83-20455
Previous edition, V{)rlrII'" iht, ,,'C''''''rwarr F...Ic'i()nc"tJnoIiJ. Voll. I-III. puhlished by 888
B. Ci. Tcuhner Vcrlal'lKhan. 1010 Leipzia. Stemwarlen'trasse H. DcUIM:he DcmokraliKhe
Repuhlik.
C> 1990 by S p.. -Y«IIa New "-t 11Ic.
All ti8h. rcrvod. This work may not be translaled or copied in whole 0' in pari wiahout the
wrtuen permi55ion of tbe publisher fSprin,cr-Verlag. 175 Fifth Avcnue. New York. New York
10010. U.S.A. CkCCpt for briei' excerpts in connection with reviews or RChnlarly analysis. Use in
connection with any form of information storage and retrieval. etronic adaptation. computer
5O(twarc.or hy imilar or diuimil.r methodology now known or herurterdcc1oped is (orbidden.
Reprinted by Wor1d Puhl'-hinl Corporation. BetJnl. 1992
for distributIOn and sa in The People IS Pe?ublic of Chma only
ISBN 7 - 5062 -1308-7
ISBN O-JR7..96A02-4 Springer. VerJla New York Berlin Heidelberg
ISBN )-f)..96802.4 Sprin,er-VerJal RerJin HcideJher, New York
Preface to Part 11/ A
A theory is the more impressive.
the simpler are its premises.
the more distinct are the thin.. it connects,
and the broader is ils fanF of applicability.
Alben Einstein
This is the second or 8 five-yolume exposition of the main principles of
nonlinear functional analysis and its numerous applications to the natural
sciences and mathematical economics. The presentation is seU-contained and
accessible to a broader audience of mathematicians. natural scientists. and
engineers. The basic content can be undentood even by those readers who
have little or no knowledse or linear functional analysis. The material of the
five volumes is organil.ed as rollows:
Part I: Fixed-point theorems.
Part II: Monotone orerators.
Part III: Variational methods and optimization..
Parts IV IV: Applications to mathematical physics.
The main goals of the work are discussed in detail in the Preface or Part I.
A Table of Contents of Parts I through V can be Cound on page 87 J of
Part I. The emphasis or the treatment is baed on the rollowing considera-
tions:
(a) Which are the basic, guiding concepts, and what relationship exists be-
tween them?
(b) What is the relationship between these ideas and the known results of
classical analysis and or linear functional analysis?
(c) What are some typical applications?
. .
VII
. . .
VIII
Prcrace 10 Part III A
The present Parlll is divided into two subvolumes:
Part II/A: Linear monotone operators.
Part 11/8: Nonlinear monotone operators.
These two subvolumes form a unit. They consist of the following sections;
introductIon to the subject;
hnear Jnonotonc problems;
gncralization to nonlinear stationary problems;
generalization to nonlinear nonstationary problems;
general theory of discretization methods.
The numerous applications concern differential equations and integral equa-
tions.. as well as numerical methods to their solution.
The Appendix, the Bibliography, and the Jndex material 10 Parts IliA and
11/8 can be found at the end or Part II/B.
l"he modern theory of linear partial differential equations of elliptic, para-
bolic, or hyperboJic type is based on the so-called H ilberl spa(:e n.ethod.f\. In
this connection, boundary value problems and initial value problems arc
transformed Into operator equations in Hilbert space. The solutions of these
opera lor equations correspond to generalized solutions oftbe original classical
problcm. Here, the generalized solutions live in so-called SObO/('I' :iPUL'('S.
Roughly speak ing, Sobolev spaces consist of functions which have sufficiently
reasonable generalized derivatives.
l'he theory of ,nol1olone operators generaliles the Hilbert space methods to
nonlineClr problems. We want to emphasize that the Hilbert space methods
and the theory of monotone operators are connected with the main stream,"
of mathematics. They are closely related to Hilbertts rigorous justification of
the Dirichlet prin£';ple and to the 19th and 20th problems or Hilbert which. he
formulated In his famous Paris lecture in 1900. The relevant historical back-
ground will be discussed in Chapter 18. Fron) the physical pornt of view, the
Hilbert pace methods and the more general theory of monotone operators
are based on the fundamental concept of enerYJ. Roughly speaking. Sobolcv
spact:s can be regarded as spaces of functions which correspond to physicaJ
states of finite energy. We will show that, in our century. the notion of
monotone operators played, both implicitly and explicitly, a fundamental role
in the development or the calculus or variations, in the theory of lInear and
nonlinear partial dilTerential equalions and in numerical analysis.
In order to help the reader understand the basic ideas, the first chapters of
this volume serve as an elementary introduction to the modern functional
analytic theory of linear partial differential equations. In particular, Chapter
1 M contaln an elegant functional analytic justification of the Dirichlet prin-
ciple.. based on a generalization of the classiQtI Pythagorean theorem to
Hilbert spaces. An introduction to the theory of S()btJlev spaces can be found
in Chapter 21. Experience shows that students frequently have trouble with
the tcchnicalitie!-; of Sobolev spaces. In Chapter 21 t for the benefit of the reader 1
Prerace 10 Part iliA
.
IX
we choose an approach to Sobolcv spaces which is as elementary as possible.
To this end we first prove all the embedding theorems in an extremely simple
manner in R 1 before passing to nil. For the convenience or the reader. the
basic properties of the Lebesgue integral are summarized in the Appendix
to Part II/B. In this connection, we choose the simplest definition of the
Lebesgue integral. In contrast to other definitions of the Lebesgue integral,
our definition also applies immediately to runctions with values in B-spaces.
Such functions are nceded in connection with evolution equations. Moreover,
in Chapter 18 we discuss a number of important principles which arc frequently
used in modern analysis. for example, the smoothing principle via mollifiers,
the localization principle via partition of unity, the extension principle, and
the completion principle.
The ba.ic ideas and basic principles of the theory or nonlinear monotone
operators are discussed in detail in a special section at the beginning or Part
II/B. Any reader who wishes to learn about nonlinear monotone operators as
quickly as possible may immediately begin reading Part II/B.
A reference of the form Al (20) and A 2 (20) is to formula (20) in the Appendix
of Parll and II/B. respectively; while (18.20) refers to formula (20) in Chapter
18. Omission of a chapter number means that the formula is in the current
chapter. The References to the Literature at the end of each chapter are of the
following form: Krasnoselskii (1956, M, B, H), etc. The name and the year
relate to the Bibliography at the end of Part II/B. The letters stand for the
folio wi ng:
M: monograph:
L; lect u re notes;
s: survey article;
P: proceedings;
8: exlensivc bibliography in the work cited;
H: comments on the historical development or the subject contained in the
work cited.
.
A List of Symbols may be found at the end of Part II/B. We have tried
to use generally accepted symbols. A few peculiarities, introduced to avoid
confusion, are described in the remarks introducing the List of Symbols.
Basic materia) on linear functional analysis may be found in the Appendix
to Part I.
The theory of monotone operators is related to the simple fact that the
deri,'ut;l'e I' of a convex real function f is a monotone function. However. it is
quite remarkable that the idea of the monotone operator allows many diver..
sified applications. For example. there are applkations to the rollowing topics:
(i) variational problems and variational inequalities;
(ii) nonlinear elliptic, parabolic, and hyperbolic partial differential
equations:
(iii) nonlinear integral equations;
x
Prcracc to Part IliA
(ivJ nonlinear semigroups;
(v) nonlinear eigenvalue problems;
(vi) nonlinear Fredholm alternatives;
(vii) mapping degree for noncompact operators;
(viii) numerical methods such as the Ritz method (e.g.. the method of finite
elements), the Galerkin method. the proje<:tion-iteration method. the
difference method, and the Kanov method for conservation laws and
'Jariational inequaiitieL
Concerning time-dependent problems we emphasize both the Galerkin
InethtHl and the method of semlgroupJ. We also discuss in detail the facl that
the theory of monotone operatoR generalizes both the theory of bou'zded and
unbounded linear operators. To this end we develop, in Chapters 18 and 22
through 24, the theory or linear partial differential ultions based on bounded
linear operators, and in Chapter 19. we study in detail the elegant method of
the Friedrichs extension for unbounded linear operators and its applicHtions
to variational problems and 10 linear and semi linear elliptic. parabolic, and
hyperbolic equations. as well as applications to th..: semiIinear Schrodinser
equation. As we shall show in Parts IV and V. unbounded linear operdtors
playa decisive role in quantum mechanics and quantum field theory (elemen-
tary particle physics). In contrast to this. for example, bounded linear operators
are related to elasticity and hydrodynamics.
At the center or the theory of monotone operators there stands the notion
or the nJuxi,nal monotone operator. which generalizes both the theory of
bounded and unbounded linear monotone operators. The theory of maximal
monotone operators will be studied in detail in Chapter 32.
A number of diagrams contained in the tcxt should help the reader to
discover interrelationships between different topics. In particular, we recom-
mend Figure 27.1 in Section 27.5 or Part II/B where the reader may Dnd
interrelationships between many important operator properties in nonlinear
functional analysis. A list of all these schematic overviews can be found at the
end of Part II/B. A list of the basic theorems and of the basic definitions can
aJso be found there.
In Part I we studied equations involving compact operators. The decisive
advantage of the theory of monotone operators is that it is also applicable to
nun('olnpu(" operators. AloftJ with abstract existence theorems we also stress
the methods of numerical functional analysis. Chapters 20 through 22 (resp.
Ch8ptC 33 through 35) may serve as an introduction to linear (resp. non-
linear) numerical functional analysis. For example, in terms of numerical
functional analysis, monotone operators allow us to justify the rollowing
fundamentKI principle: Consistency and stability imply convergencc. In this
connection. the scneral notion of A-proper maps is crucial.
The connection between the theory of monotone operators and general
variational methods will be studied in detail in Part III. In Parts IV and V we
will consider applications or the theory or monotone operators to interesting
Pre(acc to Pari II, A
.
XI
problems in malhemuli<:al phy..;cs. For example, the theory of monotone
operators plays an important role in elasticity, hydrodynamics (the Navier-
Stokes equations), gas dynamics (subsonic flow), and semiconductor physics.
I hope that the reader will enjoy discovering a number or interesting
interrelationships in mathematics.
Leipig
Spring 1989
Eberhard Zeidler
Contents (Part II/A)
Preface to Part 11/ A VII
INTRODUCTION TO THE SUBJECT I
CHAfIEIUB
Variational Problems. the Ritz Method. and
the Idea of Orthogonality 15
18.1. The Space C(G) and the Variational Lemma 17
fi 18.2. Integration by Parts 19
fiI8.3. The First Boundary Value Problem and the Ritz Method 21
18.4. The Second and Third Boundary Value Problems and
th R it7 Mthnd 28
18.5. Eigenvalue Problems and the Ritz Method 32
18.6. The Holder Inequality and its Applications 35
18.7. The History of the Dirichlet Principle and Monotone Operators 40
18.8. The Main Theorem on Quadratic Minimum Problems 56
18.9. The Inequality of Poincare- Friedrichs 59
fiI8.10. The Functional Analytic Justification of the Dirichlet Principle 60
18.1 I. The Perpendicular Principle, the Riesz Theorem, and
the Main Theorem on Linear Monotone Operators 64
fi 18.12. The Extension Principle and the Completion Principle 70
18. t 3. Proper Subregions 71
fiI8.t4. The Smoothing Princil'le 72
18. I S. The Idea of the Reaularity of Generalized Solutions and
the Lemma of Weyl 78
18.16. The Localization Principle 79
18.17. Convex Variational Problems. Elliptic Differential Equations.
and Monotonicity 81
. . .
XIII
XIV
Conlenls(Parlll/A)
18.18. The General Euler- Lagrange Equations 85
18.19. The Historical Development of the 19th and 20th Problems of
Hilbert and Monotone Operators 86
18.20. Sufficient Conditions for Local and Global Minima and
Locally Monotone Operators 93
CHAPTER 19
The Galerkin Method for Differential and Integral Equations,
the Friedrichs Extension. and the Idea of Self-Adiointness 101
19.1. Elliptic Differential Equations and the Galerkin Method 108
19.2. Parabolic Differential Equations and the Galerkin Method 111
19.J. Hyperbolic Differential Equations and the Galerkin Method 113
&194. 'nteRral Equations and the Galerkin Method 115
19.5. ('omplete Orthonormal Systems and Abstract Fourier Series 116
19.6. Eigenvalues of Compact Symmetric Operators
(Ifilbert Schmidt Theory) 1 19
19.7. Proof of Theorem 19.8 121
19.8. Self-Adjoint Operators 124
19.9 The Friedrichs Extension of Symmetric Operators 126
&19.10. Proof of Theorem 19.C 129
*19.11. Application to the Poisson Equation 132
19.12. Application to the Eigenvalue Problem for the Laplace Equation 134
19.13. The IncQuality of Poincare and the Compactness
Th,.orm or Rellich I J5
19.14. Functions of Self-Adjoint Operators 138
19.15. Application to the Heat Equation 141
19.16. Application to the Wave Equation 143
19.17. Semigroups and Propagators, and Their Physical Relevance 145
19.18. Main Theorem on Abstract Linear Parabolic Equations 153
19.19. Proof of Theorem 19.D 155
19.20. Monotone Operators and the Main Theorem on
Linear Nonexpansive Semigroups 159
19 21. The Main Theorem on One-Parameter Unitary Groups 160
&19.22. Proof of Theorem 19.E 162
19.23. Abstract Semilinear Hyperbolic Equations 164
19.24. Application to Semilinear Wave Equations 166
19.25. The Semilinear Schrodinger Equation 167
19.26. Abstract Semilinear Parabolic Equations. Fractional Powers of
Operators. and Abstract Sobolev Spaces 168
19.27. Application to Semilinear Parabolic Equations 171
19.28. Proof of Theorem 19.1 171
&19.29. Five General Uniqueness Principles and Monotone Operators 174
19.30. A General Existence Principle and Linear Monotone Operators 175
CHAPTER 20
Difference Methods and Stability 192
20.1. Consistency. Sta!-:ility, and Convergence 195
20.2. Approximation of Differential Quotients 199
Content5 (Part iliA)
xv
20.3. Application to Boundary Value Problems for
Ordinary Differential Equations
20.4. Application to Parabolic Differential Equations
20.S. Application to Elliptic Differential Equations
20.6. The Equivalence Between Stability and Convergence
20.7. The Equivalence Theorem of Lax for Evolution Equations
200
203
208
210
211
ONE PROBLEMS 225
CH.AfiEIUJ
Auxiliary Tools and the Converaence of the Galerkin
Method for Linear Operator Equations 229
21. t. Generalized Derivatives 231
21.2. Sobolev Spaces 235
21.3. The Sobolev Embedding Theorems 237
21.4. Proof of the Sobolev Embedding Theorems 241
21.S. Duality in B-Spaces 25 I
21.6. Duality in H-Spaces 253
21.7. The Idea of Weak Convergence 255
21.8. The Idea or Weak. Convergence 260
21.9. Linear Operators 261
21.10. Bilinear Forms 262
21.11. Application to Embedding! 265
21.12. Proiection Operators 265
21.13. Bases and Galerkin Schemes 271
21.14. Application to Finite Elements 273
21.1 S. Riesz-Schauder Theory and Abstract Fredholm Alternatives 27S
21.16. The Main Theorem on the Approximation-Solvability of Linear
Operator Equations, and the Convergence of the Galerkin Method 279
21.17. Interpolation Inequalities and a Convergence Trick 283
21.18. Application to the Refined Banach Fixed-Point Theorem and
the Converaence of Iteration Methods 285
fi21.19. The Gagliardo-Nirenberg Inequalities 286
21.20. The Strategy or the Fourier Transform for Sobolev Spaces 290
21.21. Banach Algebras and Sobolev Spaces 292
121.22. Moser-Type Calculus Inequalities 294
21.23. Weakly Sequentially Continuous Nonlinear Operators on
Sobolev Spaces 296
CHAPTER 22
Hilbert Space Methods and Linear Elliptic Differential Equations 314
22.1. Main Theorem on Ouadratic Minimum Problems and the
Ritz Method 320
22.2. Application to Boundary Value Problems 325
22.3. The Method of Orthogonal Projection, Duality, and a posterior;
Error Estimates r 335
22.4. Al!Plication to Boundary Value Problems 337
X\'I
2 5.
2 .
22. 7.
22.N.
22.9.
2 . U)
12 U
1 11.
12. Ll
2 IA
2. L.5.
:!2 L6..
12. lL
s ..,., L.8
s-_. .
22.19.
:2 2 20
2.21.
s" "
..._._... .
Contents (Part II A.
Main Theorem on Linear Strongly Monotone Operators and
the Galerkin Method
Application to Boundary Value Problems
C"ompact Perturbations of Strongly Monotone Operators,
Fredholm Alternatives. and...1he Galerkin Method
Application to Integral Equations
Application to Bilinear Forms
Application to Boundary Value Problems
EiJZen\ aluc Prohlems and the Ritz Method
Application to Bilinear Forms
Al"phcation to Boundary - EiRenvalue Problems
(j,\rdln Forme;;
The (Jardlng I nequality for Elliptic Equations
The Main Theorems on Garding Forms
Application to Stronly Elliptic Differential Equations of Order 2",
Difference Approximations
Interior Regularity of Generalized Solutions
Proof of Theorem 22.' f
Regulant} of Generalized Solutions up to the Boundary
Proof of Theorem 22.1
CHA£IER..2J
Hilbert Space Methods and Linear Parabolic Differential Equations
23.1.
*23.2.
23.3.
23 4
23 5.
23.6.
s, 7
" - .' . .
23 8
&23.9.
Particularities in the Treatment of Parabolic E q uations
The Lebesgue Space IJ p (O. T; X) of Vector-Valued Functions
T h e Dual Space to t p (O, T; X)
Evolution Tri p les
Generalized Derivatives
The Soholev Space W"I (0, T; V, H)
Main Theorem on First-Order Linear Evolution Equations and
the Galerkin Method
Application to Parabolic Differential Equations
Proof of the Main Theorem
(HAPTER 24
Hilbert Space Methods and Linear Hyperbolic
Differential Equations
24 I. Main Theorem on Second-Order Linear Evolution Equations
and the {.alerkin Method
24.2. Application to Hyperbolic Differential Equations
24 3. Proof of the Main Theorem
11 9
345
147
349
350
351
352
357
361
364
366
369
371
374
376
1 7N
383
lR4
402
402
406
410
416
417
422
42 1
426
430
452
453
456
459
Contents (Part II/B)
Preface to Part II/B VII
GENERALIZATION TO NONLINEAR
STATIONARY PROBLEMS 469
Basic Ideas of the Theory of Monotone Operators 471
CHAPTER 25
Lipschitz Continuous, Strongly Monotone Operators, the
Projection-Iteration Method, and Monotone Potential Operators 49S
CHAPTER 26
Monotone Operators and Quasi-Linear Elliptic
Differential Equations 55J
CHAPTER 27
Pseudomonotone Operators and Quasi-Linear Elliptic
Differential Equations S80
CHAPTER 28
Monotone Operators and Hammerstein Integral Equations 615
CHAPTER 29
Noncoercive Equations, Nonlinear Fredholm Alternatives,
Locally Monotone Operators, Stability, and Bifurcation 639
. .
XVII
XVIII Contents (Part II/B)
GENERALIZATION TO NONLINEAR
NONSTATIONARY PROBLEMS 765
CHAPTER 30
First-Order Evolution Equations and the Galerkin Method 767
CHAPTER 31
Maximal Accretive Operators, Nonlinear Nonexpansive Semigroups,
and First-()rder Evolution Equations R 17
CHAPTER 32
Maximal Monotone Mappings 840
CHAPTER 33
Second-Order Evolution Equations and the Galerkin Method 919
GENERAL THEORY OF DISCRETIZATION METHODS 959
(HAPTER 34
Inner Approximation Schemes, A-Proper Operators, and
the Galerkin Method 963
CHAPTER 35
External Approximation Schemes. A-Proper Operators, and
the Difference Method 978
("HAPTER 36
Mapping Degree for A-Proper Operators 997
Appendix 1009
References 1119
List of Symbols 1163
list of Theorems J 174
List of the Most Important Definitions 1179
List of Schematic Overviews 1182
List of Important Principles 1183
Index 1189
INTRODUCTION TO THE SUBJECT
Each progress in mathematics is based on the discovery of stronger tools and
easier methods, which at the time makes it easier to understand earlier methods.
By making these stronger tools and easier methods his own, it is possible for
the individual researcher to orientate himself in the different branches of
mathematics.
The organic unity of mathematics is inherent in the nature of this science, for
mathematics is the foundation of all exact knowledge of natural phenomena.
David Hilbert (1900)
(Paris lecture)
For me, as a young man, Hilbert became the kind of mathematician which I
admired, a man with an enormous power of abstract thought, combined with a
fully developed sense for the physical reality.
Norbert Wiener (1894-1964)
Hilbert always emphasized that mathematics is a unity, that its different parts
are in permanent interaction with each other and with the natural sciences, and
that this interaction not only provides the key to an understanding of the nature
of mathematics. but also the best cure against a splitting into different and
unrelated parts-a danger which, in our time of huge qualitative growth and
alarming specialization of mathematical research, must always be kept in mind.
Pavel Sergeevi Aleksandrov (t 97 t)
In the modern theory of partial differential equations, generalized solutions
playa fundamental role. We want to explain brieOy the mathematical and
physical background of the notion of a generalized solution. To this end we
consider the first boundary value problem for the Poisson equation
- u = f on G, ( la)
u=g on oG. (I b)
I
")
-
Introduction 10 the Subject
G
Figure 18.1
Here.. G is a bounded region in R J , and iG denotes the boundary of G (Fig.
18. I ).
Let = (, '1.. ) and
u = u.. + U "" + u...
"'lot -."
The functions.r and 9 are given. We seek the function u. Problem (I) is closely
related to the variational problem
r (UZ + u; + II! - 2fu)dx = min!.
JG
II = g on l'G.
(2)
In addition, let us consider the potential
II(X) = r - fey) dy.
4n J G Ix - yl
Up to a ultiplicativc constant, II represents the potential of a mass distribu-
tion on G which has density .r.
The following observations are important.
(i) Classical solrltions. For sufficiently smooth functions f and g and suffi-
ciently smooth boundary ?G, the original problem (1) has a classical solution.
More precisely, if
(3)
r E C 2 (G).. g E C2'(iG), cG E C 2 . 2 , 0 < 2 < I,
then (I) has a unique classical solution u, where u E C 2 . 2 {G).
(ii) Generalized solutions. Counterexamples show that equation (1 a) need
not have a classical solution u in case f: G .... R is continuous. This surprising
fact depends essentially on the properties of the potential u in (3). If the density
.f: G IR is sufficiently smooth, say f is continuously differentiable, then the
potential u in (3) is a classical solution of (I a). However, if f: G R is only
continuous, then u in (3) need not have second derivatives. Hence, u is not a
classical solution of ( I a).
Consequently, if f: G IR is continuous, then we have the following situa-
tion. The function II in (3) has a well-defined physical meaning, namely, u
represents the potential of a mass distribution with continuous density f. But
Introduction to the Subject
3
u is not necessarily a solution of the differential equation (I a). Therefore it is
reasonable to introduce the notion of a generalized solution of (I a). To this
end we multiply (la) by cp e Co(G) and, using integration b}' parts, we obtain
- L uArp d:< = L frp d:<
for all cp e Co (G)
(4)
(cr. Section 18.2). By definition. a function u is called a generalized solution of
equation (I a) iff relation (4) holds. It can be shown that. in this sen, the
potential II in (3) is a generalized solution of (la) if f is continuous on G.
Obviously. classical solutions of (1 a) are also generalized solutions of (I a).
(iii) Variational problems, generalized solutions. and the idea of completion.
Let us now consider the variational problem (2), which corresponds to the
famous Dirichlet principle. In Chapter 18 we will show that each sufficiently
smooth solution u of (2) is also a classical solution of the original problem (I).
But the point is that the minimum problem (2) need not have a solution u
which has classical first derivatives. This fact caused enormous trouble in the
mathematics of the nineteenth century. related to the justification of the
Dirichlet principle. This will be discussed in Section 18.10.
I n order to understand the typical difficulties. let us first consider the
following simple minimum problem
F(x) = min!,
. e [a. b].
(5)
where F: [a, b] -. R is a continuous function on the compact interval [a. b].
The famous Weierstrass theorem tells us that problem (5) has always a
solution (Fig. 18.2). Suppose now that the unique solution . = 2 of (5) is an
irrational number, and suppose that we consider the modified problem
f(x) = min!,
x e Q n [a, b],
(5.)
where 0 denotes the set of rational numbers. Then problem (5.) has no
solution. Consequently, mathematicians, who do not know irrational num-
bers, cannot prove the Weierstrass existence theorem for (5). We now want to
show that the variational problem (2) corresponds to a similar situation. For
simplicity. let g = O. We set
F(u) = L (u + u + u! - 2fu)d:<
F
a
(J
I
b
Figure 18.2
4
Introduction to the Subject
and
M = {u E C 1 (G): u = 0 on cG}.
Then, problem (2) reads as follows:
(D*) f.(u) = min!, u e M.
This classical variational problem need not have a solution. Roughly speak-
ing, problem (D*) corresponds to (5*). However, as we shall show in Section
18.10. if we introduce the Sobolel' space W 2 1 (G), then we obtain the modified
problem
(D) Ftu) = min!. u E W 2 1 (G).
which has a unique solution u if JGf 2 dx < OC,,'.
By definition. this solution u is called a generalized solution of the original
boundary value problem (I) with g = O. In addition, u is also a generalized
solution of the Poisson equation (Ia) in the sense or(4). Roughly speaking, we
obtain the following:
The introduction of Sobolev spaces corresponds to the introduction of real
numher..'; t;a irrational numbers in classical mathematics.
To explain this. we start with the abstract problem
( P*)
Ffu) = min!,
U EM.
I n order to solve (P*) we consider the modified problem
(P)
F(u) = mint
U E N.
where N is a completion of M, i.e., the set N is obtained from the set M by
adding appropriate "ideal" elements. For example. in the case of problem (5*),
"'e complete the set of rational numbers to the set of real numbers by adding
irrational numbers. In the case of problem (D*), we complete the set
M = {u E C t (G): u = 0 on cG}
to the Sobolev space N = W 2 1 (G) by adding appropriate functions which have
generalized first-order derivatives. The Sobolev space W 2 1 (G) is a Hilbert space
equipped with the scalar product
(ulr) = f (Ull1 + U ., t' ., + u"l',,)dx.
G
The precise definition of Wi (G) will be given in Section 18.10. Roughly
speaking, the Sobolev space ;J,2' (G) is the smallest Hilbert space with the scalar
product (ull') which contains the set of functions CO'. Note that C<f(G) c M.
(iv) Approximation methods. In order to obtain approximate solutions for
the original boundary value problem (I). one frequently uses the so-called Rit:
method. The basic idea of this method is the following. Instead of solving the
Basic St ra tegy
5
variational problem
(D) L (u + u + uf- 2fu)dx = min!, u E WZI(G),
we consider the approximate problem
(D,,) L (ul + u + uf - 2fu)dx = min!, u EX",
where X" is a finite-dimensional subspace of W 2 1 (G), i.e., X" consists of all
functions u of the form
u = C I WI + c 2 "'2 + ... + c" "'"
with "i = 0 on cG for all j, where the so-called basic functions "'1' ..., w.. are
fixed. This way the approximate problem (D,,) is reduced to the determination
of the unknown real coefficients c l' .. . , ell. Let u and UtI denote the solutions
of (D) and (D,,), respectively. Generally, it is not possible to prove the conver-
gence of this Ritz method in the sense of pointwise convergence:
lim u,,(x) = u(x) for all x E G.
,,-x,
However, in Chapter 22 we will prove that the sequence (u,,) converges to u
in the Sobolev space Wl (G), i.e.,
lim (u" - ulu.. - u) = O.
"... x
Explicitly, this means that
. f ( au" CU ) 2 ( au" OU ) 2 ( cu" OU ) 2
! J G 1f - c + e" - 0" + e{ - o{ dx = O.
Consequently, Sobolev spaces also play an important role in modern
numerical analysis. For example, it is not necessary to use smooth basis
functions "'.. ..., ,v" in (D,,) above; it is sufficient to use p;ece,,';se-sn.ooth
functions. This is the basic idea of the important method of finite elements,
where \\'1' .. ., "'" are piecewise polynomial functions.
Basic Strategy
The basic strategy of the modern theory of partial differential equations is the
following:
(S I) We prove the existence of generalized solut;ons.
(S2) We sho,,' that the generalized solutions are even classical solutions if the
data of the problem are sufficiently smooth (method of regularization).
In (S t) we use general results from functional analysis. The analytic sub-
stance of the existence proofs in (5 I) is concentrated in the so-called Sobolet'
6
Introduction to the Subject
e",h('dt/iny III('ore"I.... \\'hich correspond to inequalities for integrals. The proofs
of the Sobolev embedding theorems are based on the Holder inequality. The
Sobolev embedding theorems generalize the famous classical inequalities
of Poincare and Friedrichs. In (S2) we need more sophisticated analytical
methods \vhich are based on the specific properties of the problems under
considera t ion.
Along with the Ritz method and the Galerkin method" difference methods
represent a universally applicable method for the numerical solution of partial
differential equations. This will be considered in Chapter 20. In Chapters 34
and 35. "'e shall construct a general theory of discretization methods for
nonlinear prohlems.
Figure 1 .J sho"'s important interrelationships which will be studied in
("haptcrs 18 and 19. There are two options for giving an introduftory le('ture
011 H ilherl spaC(l "1£11 hods and their applications to integral equations and
Idea of self-adjointness
('hapter 19)
.dea of orthogonality
('haptcr 1 t
!
parallelogram Identlt
IIhe Pythagorean theorem)
ij
quadnuic minimum problem,
U
I perpendicular principle I
ij
theorem of R IC'\/
Friedrichs' - - - - - -.. ltilbert Schmidt
extension theory
(eigenvalue problems)
function
/ calculus
semlgroups
1
elliptic. parabolic.
and hyperbolic differential
equations
(linear and semilinear)
quadratic variational ·
problems
Figure 18.3
Hilbert Spaces
7
partial differential equations:
(i) Chapters 18 and 19;
(ii) Chapters 18,21. and 22.
Approach (ii) is simpler than (i), since we only work with bounded operators.
However, approach (i), based on unbounded operators, gives more insight.
Note that unbounded operators are indispensable in quantum theory. If there
is enough time in an introductory lecture, then one can add Chapters 23 and
24 to (ii) concerning existence theorems for parabolic and hyperbolic equa-
tions via the Galerkin method.
In Chapters 2S through 36, we generalize the basic principles for linear
operators considered in Chapters 18 through 24 to nonlinear operators.
An introductor.\' lecture on nonlinear monotone operators can be based on
Chapters 2S through 27.
Hilbert Spaces
For the convenience of the reader, we recall some basic facts on Hilbert spaces.
Scalar product. Let X be a linear space over K = R, C (cr. A I (22) in the
Appendix of Part I). By definition, a scalar product on X is a function
(u, (.)..... (ul t') from X x X into K which has the following three properties for
all u, l\ w e X and all ;., P e K:
(i) (ul ;.1" + pK') = ;(ul v) + p(ul K').
(ii) (ult") = (vlu).
(iii) (ulu) > 0 iff u #: o.
Here, the bar denotes the conjugate complex number. From (i) and (ii) it
follows that
().v + p"'lu) = l(vlu) + ji(wlu).
Let u, t' EX. Then u is called orthogonal to t. iff
(ulv) = O.
Pre-Hilbert spaces. By definition, a pre-Hilbert space X is a linear space
together with a scalar product. We set
II u II = (u I u) 1/2 .
The convergence
u ll -+ U
as n -+ 00
is defined by
lIu" - ull -+ 0
as n -+ 00.
A sequence (u ll ) is called a Cauchy sequence iff, for each I; > 0, there exists an
8
Introduction to the Subject
110(1:) with
1111" - u'" II < I: for all n, m > no(I:).
Hi/her' spaces. By definition, a Hilbert space is a pre-Hilbert space with the
additional property that each Cauchy sequence is convergent.
Hilbert spaces are briefly called H-spaces.
The Scl,,\'ar: inequality and its consequences. The most important In-
equality in an H-space X is the so-called Schwarz inequality
l{ulr)1 < lIullll('11 for all u. rEX.
This follows from
o < (II - i.I' I u - ; 1') = II U 11 2 - i. ( u It.) - I ( t' III) + I ;.1 2 Ill' 11 2
with i. = 1III\l2/(ulr) in case (ult') o.
It follows from the Schwarz inequality that
1111 + ,." 2 = (II + I' I u + l') = II U 11 2 + (II I (') + (v I u) + Ill' 11 2
< (ilull + 111'(1)2.
This yields the triangle inequality
II II + r II S !lull + 111'11
for all u, I' E X.
Moreover, the generalized triangle inequality
1111111 - III'''' < lIu + I'll < lIuli + 111'11
follo\\'s from
for all u, I' E X
lIull - II f II = II (II - I') + ('" - 111'11 < II u - (' II.
If II" -+ U as " -+ x, then
lIu,,:1 -+ lIuli
as n -+ y,
since III II" II - II "111 < II II" - u II -. o. Finally.
II" -+ U
and
('" .... ('
as n.... x
Implies
(11,,11',,) -+ (ul(')
as n -+ x.
This follows from
Itu,,'!',,) - (ulr)1 = 1(1I"lr" - (,) + (u" - ull')1
< I (u" 1('" - t.) I + I (II" - u I v) < II U" 1111 t." - ('II + II U" - u 1111 t'li.
Equiralent scalar products. Let Y be an H-space over II< with the scalar
product (.1 .), and let ('1 .). be another scalar product on X. By definition,
t.1 .) and ( '1 .). are called equivalent iff the corresponding norms
lIuli = (UIU)1/2 and lIuli. = (ulu)2
Hilbert Spaces
9
are equivalent, i.e., there exist positive constants c and d such that
cllull s lIuli. s dllull
for all u eX.
In this case. X is also an H-space with respect to (.1 · )..
In the following let X and Y be H-spaces over k = R. C. and let M be a
subset of X.
Bounded sets. The set M is called bounded iff there is a real number r such
that
lIuli S r
for all u e M.
Open and closed sets. The set M is called open iff for each u E M there is a
number r > 0 such that the ball {t' e X: IIv - ull < r} belongs to M.
The set M is called closed iff for each sequence (u,,) in M,
u" -+ u as n -+ 00 implies u e M.
The closure M of M contains exactly all the elements 14 of X with the
property that there is a sequence (u,,) in M such that u.. -+ u as n -+ 00.
Dense sets. The set M is called dense iff M = X. Obviously. M is dense iff.
for each u e X and each £ > 0. there is an element v in M with IIv - ull < t.
An H-space X is called separable iff X contains an at most countable dense
set.
Compact sets. The set M is called relatively compact iff each sequence in
M has a convergent subsequence.
The set M is called compact iff it is relatively compact and closed, i.e., each
sequence (14,,) in M has a convergent subsequence 14,,- -+ U as n -+ 00 where
ue M.
Linearity. The set M is called a linear subspace of X iff
u,('EM and .{JEk imply 2U+{Jt'EM.
The operator A: D(A) X -+ Y is called linear iff D(A) is a linear subspace of
X and for all u. t' e D(A) and «. {J e k.
A(ocu + fJv) = GlAu + fJAt'.
The linear operator A: D(A) X -+ Y is called bounded iff there is a real
number c such that
"Au" S c 111411 for all 14 e D(A).
If the linear operator A: X -+ Y is bounded. then we define
IIAII = sup IIAuli.
.-a s I
This implies
IIAull s IIAliliuli
for all u eX.
In this connection note the linearity of A.
10
Introduction to the Subject
A tlti!i1Jearity. The operator A: X -+ Y is called antilinear iff. for all II, (' E X
and , II E (K.
A(:xu + fir) = Au + PitU.
Here, the bar denotes the conjugate complex number. If X and Yare real
H-spaces, then antilinear and linear operators coincide.
Continuity. The operator A: X -+ Y is called continuous iff. as n -+ x,
II n -... II
implies
Au" -+ Au.
We \\'ant to prove the following:
A Ii"ear operator /t: X -+ r is hounded ill it ;s continuous.
First suppose that A is bounded. Then it follows from II" -+ II as II -+ that
d .4 "" - .4 III! = II A ( u" - II) II < it A 1111 u II - ull
and hence .4u n -+ Au as " -+ x, i.e.. .4 is continuous.
Conversely. suppose that A is continuous. If A is not bounded. then there
is a sequence (u,,) such that
IIUn'l = I
and
IIAu,,1I > 11
for all IJ.
Sct f" = "" iI.4I1nlll. Then.as1l-'" x,
'Ir,,11 = 1/IIAu"II"2 -... 0
and II A 1',,1. = II .411 n 1 1 --+ x.. This contradicts the continuity of A.
("oulpactness. The operator A: X --+ Y is called compact iff it is continuous
and it maps bounded sets into relatively compact sets. Thus, if A is compact,
then each bounded sequence (u,,) contains a subsequence (u,,-) such that (Au",)
is convergent.
Linc",. IUIICI;OIlClls. A linear continuous map J': X -+ K is called a linear
continuous functional on X. The set of all the linear continuous functionals
on .Y is denoted by X*. Consequently. we have lEX. iff I: X -+ IK is linear
and there is a real number c such that
I l(u)1 < c lIu;1
for all II E X.
We set
11.1 II = sup 1.((u)l.
II I S I
Instead of ((II) we also write <I,ll).
In the Appendix of Part I. the interested reader williind the relationship
between the notions introduced above and general notions in topology.
The notion of an abstract Hilbert space was first introduced by von Neumann
1929). In his basic papers around 1906, Hilbert used the space 1 2 consisting
Hilbert Spaces
I 1
of all sequences :< = (,) with , e K for all i and
at
L 1,12 < 00.
i=1
Here the scalar product is given by
x
(:<Iy) = L !,'1i.
iel
The space 1 2 generalizes the classical Euclidean space R" to infinite dimensions.
A Look at the History of Hilbert Spaces
The following quotations should help the reader to understand the birth of
the notion of Hilbert space.
Further considerations of the subject led me to the conclusion that the systematic
construction of a general theory of integral equations is of the utmost importance
for the complete field of analysis and, in panicular, for the expansion offunctions
in infinite series, linear differential equations, analytic functions, potential theory,
and the calculus of variations.
David Hilbert (1912)
(From the Preface to his monograph on linear integral equations.)
In order to understand the great achievement of Hilbert (1862-1943) in the field
of analysis, it is necessary to first comment on the state of analysis at the end of
the nineteenth century. After Weierstrass (1815-1897) had made sure of the
foundations of the complex function theory, and it had reached an impressive
level. research switched to boundar}' t'alue problems, which first arose in physics.
The ",.ork of Riemann (1826- t 866) on complex function theory, however, had
shown that boundary value problems have great importance for pure mathe-
matics as well. Two problems had to be solved:
(i) the problem of the existence of a potential function for given boundary
values: and
(ii) the problem of eigenoscillations of elastic bodies. for example, string and
membrane.
The state of the theory was bad at the end of the nineteenth century. Riemann
had believed that by using the Dirichlet principle, one could deal with these prob-
lems in a simple and uniform way. After Weierstrass' substantial criricism of
the Dirichlet principle in t 870, special methods had to be developed for these
problems. These methods, by C. Neumann, Schwarz, and Poincare. were very
elaborate and still have great aesthetic appeal today; but because of their variety
they were confusing, although at the end of the nineteenth century, Poincare
( 1854- 1912), in particular, endeavoured with great astuteness to standardize
the theory. There was, however. a lack of "simple basic facts" from which one
could easily get complete results without sophisticated investigations of limiting
processes.
Hilbert first looked for these "simple basic facts" in the calculus of variations.
He considered so-called regular variational problems which satisfy the Legendre
condition. In 1900 he had an immediate and great success: he succeeded in
just il\'ing the Dirichlet principle.
While Hilbert used variational methods, the Swedish mathematician Fredholm
12
Introduction to the Subject
(1866 19:!7t approached the same goal by developing Poincare.s work by using
linear integral equations. In the winter semester 1900/1901 Holmgren. who had
come from Upsala (Sweden) to study under Hilbert in Gottingen, held a lecture
in Hllberfs seminar on Fredholm's work on linear integral equations which had
been published the previous year. This was a decisive day in Hilbert's life. He
took up Fredholm's new discovery with great zeal, and combined it with his
variational methode;. In this way he succeeded in creating a uniform theory
y.'hich solved problems (i) and (ii) above.
In 1904 H ilberfs first note on the uF oundations of a General Theory of linear
Integral Equations" was published in the Gt)tringer Nachrichren. These results
y.'ere based on lectures which Hilbert had held from the summer of 1901
on\\'ards. Fredholm had proved the existence of solutions for linear integral
equations of the second kind. His result was sufficient to solve the boundary
\'alue prohlems of potential theory. But Fredholm's theory did not include the
eigenscillations and the expansion of arbitrary functions with respect to eigen-
functions Only .Iilhert solved this problem by using finite-dimensional approxi-
mations and a passage to the limit. In this way he obtained a generalization of
the classical principal-axis transformation for symmetric matrices to infinite-
dimenc;ional matrices. The symmetry of the matrices corresponds to the symme-
try of the kernels of 1I1tegrai equations, and it shows that the kernels appearing
In oscillation problems are indeed symmetrical.
From our point of "ley.' today, Hilbert's paper of 1904 appears clumsy.
compared to the elegance of Erhard Schmidfs method (1907a, b) which he
developed in his dissertation written while a student of Hilbert in Gottingen.
But the tirc;t step had been made In the same year. 1904, Hilbert. in his second
note. \\'a ahle to apply his theory to general Sturm -liouville problems. His
third note In 1905 contained a very important result. Of the great problems
whIch Rlernann had posed with the comple function theory, there was still one
left open the proof of the existence of differential equations with a prescribed
monodromy group Iilbert solved this problem by reducing it to the determina-
tion of t'o functions which are holomorphic in both the interior and exterior
of it closed curve. and whose real and imaginary parts satisfy appropriate linear
relatlon on the curve (the Hilbert Riemann problem I). The solution to this
problem IS a classic example for the axiomatics of limiting processes demanded
hy 'ilbcrt No concrete limiting processes are used, but everything results from
the existence of the Green function for the interior and the exterior of the closed
cur\e. and from the Fredholm alternative which says that either the homoge-
neous Integral equation has a nontrivial solution or the inhomogeneous integral
equation has a solution
Jfilbert soon noticed that limits are set to the method of integral equations.
In order tn overcome these limits he created, in his fourth and fifth notes in 1906,
the general theory of quadratic forms of an infinite number of variables. Hilbert
helie\ed that with this theory he had provided analysis with a general basis
which corresponded to an a.'(iomatics of limiting processe.. The further develop-
ment of mathematice; has proved him to be right.
Otto Blumenthal (1932)
The above quotation constitutes an extract from Hilbert's biography, which
can be found in Volume 3 of his Col/ectec/ Works. We recommend that the
I Toda. Riemann Hilbert problems playa fundamental role In the theory of solitons. See
I addec\ and TakhtadJan (19R7)
Hilbert Spaces
13
reader has a look at these Collected Works. Every single one of Hilbert's
mathematical works and essays is a masterpiece.
In the fall of 1926. the young John von Neumann (1903 - 1957) arrived at
Gottingen to take up his duties as Hilbert's assistant. These were the hectic years
during which quantum mechanics was developing at breakneck speed, with a
new idea popping up every few weeks from all over the horizon. The theoretical
physicists who were developing the new theory were groping for adequate
mathematical tools, trying in succession infinite matrices without any considera-
tions of convergence, differential operators. "continuous matrices" (whatever
that might mean), etc. As late as 1924, most physicists did not even know that
a finite matrix was! It finally dawned upon them that their "observables" had
properties which made them look like Hermitian operators in Hilbert space, and
that by an extraordinary coincidence, the "spectrum" of Hilbert (a name which
he had apparently chosen from a superficial analogy) was to be the central
conception in the explanation of the "spectra" of atoms. It was therefore natural
that they should enlist Hilbert's help in trying to put some mathematical sense
in their computations. With the assistance of Nordheim and von Neumann,
Hilben first tried integral operators in Lz' but that needed the use of the Dirac
ub-function," a concept which was for the mathematicians of that time self-
contradictory. Von Neumann therefore resolved to try another approach.
Jean Dieudonne ( 1981 )
Stimulated by an interest in quantum mechanics, John von Neumann began the
work in operator theory which he was to continue as long as he lived. Most of
the ideas essential for an abstract theory had already been developed by the
Hungarian mathematician Frigyes Riesz (1880-1956), who had established the
spectral theory for bounded symmetric operators in a form very much like that
now regarded as standard. Von Neumann saw the need to extend Riesz's
treatment to unbounded operators and found a clue to doing this in Carleman's
highly original book on integral operators with singular kernels. The result was
a paper von Neumann submitted for publication to the Mathematische Zeit-
schrift but later withdrew. The reason of this withdrawal was that in 1928 Erhard
Schmidt (1876-1959) and I, independently, saw the role which could played in
the theory by the concept of the adjoint operator, and the importance which
should be attached to self-adjoint operators. When von Neumann learned from
Professor Schmidt of this observation. he was at once able to rewrite his paper
in a much more satisfactory and complete form, giving a full spectral theory for
all closed symmetric operators as well as for the self-adjoint operators. This he
did by abandoning Carleman's method. which he had been able to apply only
by use of a transfinite induction, and introducing the Cayley transform, which
served to reduce the theory of unbounded symmetric operators to that of
bounded isometric operators. Incidentally, for permission to withdraw the paper
without penalty. when it was already in page proof, the publisher exacted from
Professor von Neumann a promise to write a book on quantum mechanics. The
book soon appeared and has become one of the classics of modern physics,
particularly valued for its analysis of quantum statistics (Mathematische Grund-
lagen de, Quantenmechanilc. Springer-Verlag, 1932).
Marshall Harvey Stone (1970)
A great master of mathematics passed away when David Hilbert died in
Gottingen on February 14, 1943. at the age of eighty-one. In retrospect, it seems
to us that the era of mathematics upon which he impressed the seal of his spirit,
and which is now sinking below the horizon. achieved a more perfect balance
14
Introduction to the Subject
than has prevailed before or since. between the mastering of single concrete
problems and the form.ltion of general abstract concepts. Hilbert's own work
contributed not a little to bringing about this happy equilibrium. and the
direction in which we have since proceeded can in many instances be traced
back to this impulse. No mathematician of equal stature has risen from our
generation. . . .
Hilbert was singularl)' free from national and racial prejudices: in all public
questions. be they political. social. or spiritual. he stood forever on the side of
freedom. frequently in isolated opposition against the compact majority of his
environment. He kept his head clear and was not afraid to swim against the
current. even amidst the violent passions aroused by the First World War
(1914 1918) that swept so many other scientists ofT their feet. It was not mere
chance. y..hen the Nazis '''purged'' the German universities in 1933 and their hand
fell most heavily on the Hilbert school. that Hilbert's most intimate collaborators
left Germany either voluntarily or under the pressure of Nazi persecution. He
himself was too old. and stayed behind: but the years after 1933 became for him
years of ever-deepening tragic loneliness.
Hermann Weyl (1944)
CHAPTER 18
Variational Problems, the Ritz Method,
and the Idea of Orthogonality
When we do scientific work. we must often step down from our high horse of
grand principles. and dig in the dirt with our noses. When \\'e achieve our
purpose. we cover the tracks of our efforts in order to appear as gods of clear
thought.
Albert Einstein
In Sections 18.1 through 18.5 of this chapter we consider a number of important
concrete examples in order to explain the following:
(i) The connection between variational problems and boundary value prob-
lems for elliptic partial differential equations.
(ii) The equivalence between classical and generalized solutions in case the
solutions are sufficiently smooth.
(iii) The Ritz method for the construction of approximate solutions.
This should help the reader to recognize the simple basic ideas behind the
general theory. In this connection, a fundamental role is played by:
(a) the formula of integration by parts; and
(b) the variational lemma.
A general convergence proof for the Ritz method will be given in Section 22.1.
In the middle of this chapter there stands an elegant functional analytic
justification of the famous Dirichlet principle. Figure 18.4 shows the logical
structure of our existence proof.
(i) In Section 18.8 we use the parallelogram identity in order to prove very
simply a general existence theorem for quadratic minimum problems. The
parallelogram identity is a generalization of the classical Pythagorean
theorem.
15
16
J K Variational Problems. the Ritz Method. and the Idea of Orthogonality
parallelogram identity
(the Pythagorean theorem in
Hilbert spaces)
I
perpendicular
principle
existence principle
for quadratic minimum problems
I
+ the Holder inequality
+ the Poincare- Friedrichs inequality
theorem of R iesz
+ Sobolev spaces
main theorem on
linear monotone
operators
Dirichlet principle
(generalized solutions of the
Dirichlet problem
in Sobolev spaces)
I
regularization of
the generalized solutions
(Lemma of Weyl)
Figure 18.4
(ii) The functional analytic justification of the Dirichlet principle is a special
case of the general existence theorem in (i). To this end, we need the
following three analytical tools: the Holder inequality. the Poincare-
Friedrichs inequality. and Sobolev spaces (Sections 18.9 and 18.10).
(iii) In Section 18.11 we show that the following three principles are mutually
equivalent: the existence principle for quadratic minimum problems, the
perpendicular principle, and the Riesz theorem.
The perpendicular principle says that there exists a perpendicular from each
point of an H-space to a closed linear subspace. Consequently, we obtain the
following:
Tile functional anal)'ti(" justification of ,he Dirichlet principle is based on tlte
idea of orthogonality.
Therefore, the Dirichlet principle is closely related to the classical
Pythagorean theorem.
There are ideas in mathematics which remain eternally young and which
lose nothing of their intellectual freshness, even after thousands of years.
18.1. The Space CtG) and the Variational Lemma
17
Mathematicians of the Pythagorean school in ancient Greece attributed the
Pythagorean theorem to the master of their school, Pythagoras of Samos
(circa S60 B.C. -480 B.C.). It is said that Pythagoras sacrified one hundred
oxen to the gods in gratitude. In fact, this theorem was already known in
Babylon at the time of King Hammurabi (circa 1728 B.C.-1686 B.C.). Pre-
sumably. however, it was a mathelTlatician of the Pythagorean school who
first proved the Pythagorean theorem. This theorem appears as Proposition
47 in Book I of Euclid's Elements (300 B.C.).
The theory of Hilbert spaces is the abstract and very efficient formulation
of the idea of orthogonality. It seems that this idea has deep roots in our real
world. In Part V we shall show that Hilbert spaces represent the right mathe-
matical tool in order to describe the strange nature of quantum phenomena.
In nature we observe a duality between waves and particles. which is typical
for quantum processes. For example. light possesses such a dual structure. In
order to formulate this duality in terms of mathematics. one uses the theory
of Hilbert spaces. The main idea is that the quantization of particle theories
and wave theories lead to equivalent theories in H-spaces.
In Sections 18.7 and 18.19 we discuss briefly the historical development of
the Dirichlet principle, and of the 19th and 20th problems of Hilbert and their
relationship to the modern theory of monotone operators.
One cannot comprehend what it is one possesses if one has not understood what
one.s predecessors possessed.
Johann Wolfgang von Goethe (1749- 1832)
Mathematics has the advantage over many other fields of knowledge. such as
history. for example. that one can sometimes distinguish between true.' and
.false." Because of this mathematics pays dearly for this advantage in that it is
the most remote of all human things. One best notices the mathematician.s
longing for humanity and history in the introductions to mathematical writings.
Wilhelm Blaschke ( 1942)
18.1. The Space C o ( G) and the Variational Lemma
We first list a number of function spaces.
Definition 18.1. Let G be a nonempty open set in R N , N 1.
(a) Ci(G) is the set of all real functions u: G -+ R that have continuous partial
derivatives of orders m = O. I, . . . . k on G. We understand the derivative
of order m = 0 to be the function itself.
(b) CIt(G) is the set of all u E CIt(G) for which all partial derivatives of orders
m = O. .. . . k can be extended continuously to G.
(c) We denote briefly by C(G) (resp. C(G) the continuous real functions on
G (resp. G) instead of CO(G) (resp. CO(G)).
(d) If u e Ci(G) (rc:sp. u e C'(G» for all k = O. 19 .... then we write u e CX(G)
(resp. u e c r (G)).
IN
I" Vanallonal Problems. the Ritz Method, and the Idea or Orthogonality
,
I
I
G
- - -..
fa) ItE(',:(R)
(h)
Figure 18.5
(e) (... ((i) is the set of all functions II E C" (G) which vanish identically outside
a compact subset K of G that depends on II (Fig. 18.5).
The follo\\'ing important result will be used extraordinarily often.
Proposition 18.2 (Variational Lemma). Let G he a '10ne,p'ply open set ;11 {R'''.
.". > I. Let II E L 2 (G and suppose fhClt
f lll'dX = 0
(,
fc)r all l' E C(G).
(6
1'lren "'(' ohtail1
II(=O
alll,ost er'ery,,'here 0" G.
The notion "almost everywhere'" will be used very frequently in this \'olume.
We recall this in the Appendix, A 2 (2).
PR(XU-. We set ..\' = L 2 (G) and use the well-known fact that Co (G) is dense in
.\'. Hence there exists a sequence (un) in C (G) with
"" -+ II in..\'
as II -+ X.
From (6) \\'e obtain f 11111" = 0 for all II. letting II -+ X this implies (u Ill) = 0,
and hence II = O.
The density of C (G) in 1'( will be proved in Section 18.14 as a special
application of smoothing operators. 0
If " E C"(G) in Proposition 18.2, then obviously we obtain u(x) = 0 for all
E G.
In Sections 18.2 through 18.5 we shall make the following assumption:
(i is a hounded reJ;(Jn ;n IR' ,,'illl N > I. 1"he boundary i...
piece,,';se s,nooth. i.e.. to he precise. i'G E Co. 1 holds. 1 (7)
We gave the exact definition of h(G E Co. I ". in Section 6.2. Intuitively. we
think of this to mean all reasonable regions where the boundary can also have
I Throu1Z hout thl \ olume. condition (7) means that G is a bounded open Inter\'al If
\ - I
I R.2. Integration by Parts
19
aGeCO I
aGCU I
/
(8)
(b)
Figure 18.6
a finite number of vertices and edges with nonzero angles (Fig. 18.6(a)). Zero
angles. as in Figure 18.6(b), are not allowed. Recall from Part I that a region
is a nonempty open connected set.
let x = (I" . . . .,. ) E H'" and
D /"
i = (- Ci'
We denote the surface integral by
S = f gdO.
iG
Here. for N = 2. S is equal to the contour integral over the boundary curve
cG.. where this curve is oriented in the mathematically positive sense. For
N = I and G = ]a. h[ we have S = gl:. i.e.. S = g(b) - g(a). The Laplacian
L1 is defined by
....
II = L Dlu.
i= I
18.2. Integration by Parts
The integration by parts formula is the key to the modern theory of partial
differential equations.
The classical integration by parts formula
f..b U'f dx = mo': - f..b UfO' dx (8a)
is valid in R I , where - x < a < b < x'. The natural generalization to func-
tions II: G c R N ..... R reads as follows for i = I, .. . . N:
f (DfU)l' d.,<: = f u(ni dO - f lIDfl' dx. (8b)
G G G
20
I X. Variational Problems. the Rilz Method. and the Idea of Orthogonality
"
a(i
Figure 18.7
Here Il = (11 I" . . . n,.) denotes the outer unit normal to the boundary (G (I:ig.
I R. 7). At the houndary points, in which no outer normal is defined. i.e.. in
vert ices a nd edges, let n; = 0, ; = I. . . .. N . We denote the su rface d ifferen t ia I
by dO. For N = I. (8a) and (8b) coincide.
Proposition 18.3 (Integration by Parts). Equation (8b) is ralid .for aI/II. rEel (G)
;n case G sat ;.f;es 1l.SIl"lpt ;011 (7).
If II or r belongs to C (G). then these functions vanish on i'G; therefore. the
boundar} integral drops out in (8b).
Corollar' 18.4. fOor (8b) to hold. it $(f.r;ce." tllat II. l" tire eleIJ.ellts ,,/ Soholel"
-"relct)s. to he ;"trotJllcetl lat('r. To he precise.
II E . I ( G)..
nUI.c;t Irold.
r E W q I ( G). I < p < x.
p-I + q I = I
A proof of the "'ell-known Proposition 18.3 can be found. for example. in
Nec,ls (1967. M). p. 121. Corollary 18.4 follows by means of a passage to the
limit. since CI(G) is dense in Wpl(G) and I(G).
Definition 18.5. We understand the (outer) normal derivative of II at x E tG
to be
tu(x) \'
- = L II;D;II(X).
tn ;=1
(9)
Oor N = I and G = ](1. h[ we have
(II(a) ,
-:;- -- = - u (a}.
('"
(u(h) ,
- . = II (h).
i'11
If we write the one-dimensional integration by parts formula (8a) in the form
f: (url' d.y; = 1lt'1.
then this is precisely the fundamental theorem of calculus, which expresses the
18.3. The First Boundary Value Problem and the Ritz Method
21
relationship between tangent and area discovered by Newton and Leibniz.
The famous Gauss theorem
f D,(uv)dx = f uvn,dO
G i'G
generalizes this relationship to higher dimensions. But this is precisely the
integration by parts formula (8b). We shall see later on that the integration
by parts formula is the key to generalized derivatives, Sobolev spaces, distribu-
tions, and generalized solutions of partial dilTerential equations. Thus, this
formula is a cornerstone of modern analysis.
18.3. The First Boundary Value Problem
and the Ritz Method
18.3a. Equivalent Problems
We want to investigate the connection between the following problems for
the unknown function u. Let M = {v E C 1 (G): v = 0 on cG}.
(A) Variational problem
L (It (D l u)2 - uf )do'( = min!, u E C'(G),
u = g on eG.
(B) Generalized boundary t'alue problem
f ( t DluD I (, - fV ) dX = 0
G i-I
( 10)
for all
VE M,
( II )
u = 9 on eG.
(C) Boundary value problem
-u =f on G,
( 12)
u = g on aG.
Equation (12) is called the Euler equation to the original variational problem
(10).
Proposition 18.6. Let f E C(G) and 9 E C(oG). Assume that the region G satisfies
(7). Then the following ;S I:alid:
(a) For u E C 2 (G), the variational problem (10), the generalized boundary value
problem (II), and the boundary value problem (12) are mutually equivalent.
.,.,
-..
I X Vanallonal Problems. the Ritz Method. and the Idea of Orthogonahty
(j
a(;
Figure 18 H
(h) f'or II E (., (G), I he rar;al ;ollcll prohleIJ1 ( 10) and the Jel1erali:ed hOIl"c/clr.\"
raille prohle,,, ( II tare eCI"iralent.
I n order to associate a physical picture \\'ith this problem. let G be a region
in 2. Then \\'e can interpret u(x) as the vertical displacement of a membrane
at the point x under the influence of an external force .r (e.g." the force of
gravity). The boundary condition "" = 9 on tG'" means that II is fixed at the
boundary. Figure 18.8 sho\\'s the case y = O. Problem (10) corresponds to the
principle of minimal potential energy.. i.e.. the membrane realizes the state of
minimal potential energy for fixed boundary values.
The following argument (I) is typi('al for the calculus of variations. In Section
18.18 \\'e shall sho\\' that the follo\\'ing implications (10) => (II) => ( 12) can be
generalized in a straightfor\vard manner to general variational problems.
PR( )()J- . We set
f (II) = f ( .r ID,U)2 - ./11 ) dx.
(; -. I
The decisive ,rie" of t he calculus of variations consists of reducing variational
problems to extremal problems for real functions. To this end, we define the
real function
<p(r) = F(1l + If)
for all I E IR
and fixed r E .\1" i.e..
q>IO = 1. G i ID i lll + tl'I1 2 - .fIll + 11'1)dX.
Ad(a). Let II E ("2(G).
(I) (10) => ( II). Suppose that" is a solution of the original variational problem
( I 0). i.e..
f.(II) = min!.
II = y on i'G"
II Eel ( G ).
( I O. )
Recall that J\I = :,. E C t (G): r = 0 on ('G:. Then for all rEM. I E IR'I we
have
" + I r = y on ('G.
II + U- E C1(G).
18.3. The First Boundary Value Problem and the Ritz Method
23
i.e. the functions u + It' are allo\\'ed in the competition in (10.). Hence
the real function <p has a minimum at I = 0: thus we obtain the key
condition
qJ' (0) = o.
This implies
r ( .f D/uDlt' - ft' ) dX = 0 for all l' E M. (II.)
JG .=1
which is precisely the generalized boundary value problem (II).
(II) (11) => (12). By integration by parts, it follows from ( 11.) that
t (- Au - f)t'd:v: = 0 for all t' e M.
in particular for all l" e CO=(G). Application of the variational lemma
(Proposition 18.2) yields
- L\u - f = 0 on G.
Since II = g on cG, we obtain the boundary value problem (12).
(III) (12) => ( 11 ). We multiply (12) by t' E M and integrate over G. Then inte-
gration by parts yields (11).
(IV) (11) (10). Since cp is a quadratic function, the following is valid:
<p has a minimum at I = 0 iff <p/(0) = O.
Recall that <p depends on v. Hence the original variational problem (10)
is equivalent to
cp'(O) = 0
for all t' E M.
But this is identical to (11).
Ad(b). In the proof of(10)(II) we have only used u e C1(G). 0
In addition. we define
bJrF(u; v) = cp(t)(O)
and call IcF(u; t') the kth variation of the variational integral F at the point u
in the direction of ('. In particular, we obtain
t5F(u: v) = r ( f D1uDIv - ft. ) dx.
JG '=1
t5 2 F(u; v) = r t (D 1 t,)2 dx.
J G i-I
Thus the generalized boundary value problem (II) is equivalent to the van-
ishing of the first variation, i.e.,
bF(u; t') = 0
for all t' e M.
24
18 Variational Problems. the Ritz Method. and the Idea of Orthogonality
18.3b. The Ritz Method
We now explain, with the Ritz method for the approximate solution of (10)
and hence of (12). a basic general approximation method for the solution of
variational problems. The basic idea can be formulated brieny as follows:
(R) f'or a girl"1 rarialio'1al prohle," one rarie.4t only o('er finile-dimen."ional
suhset. tltat satis.{y tile side conditions.
The advantage of this method consists in that one can reduce variational
problems in function spaces to variational problems for real functions with
finitely many variables. As a simple example, we first consider the minimum
problem
f( t :: ) - ., (13
I""'N - min. )
for the real function ,r: 1R'\' -+ IR. An approximate solution can be obtained by
varying over only special coordinates. for instance, one can choose I' . . . . ,
to be free and set . + I = ... = .,.. = O. The variational problem (10) transpires.
in contrast to (13), in an infinite-dimensional function space. The Ritz method
for (10) is based on the following two conditions, in accordance with (R):
(i) One varies only over all real linear combinations of finitely many fixed
functions.
(ii) All these linear combinations fulfill the boundary condition u = g on cG..
which corresponds to the side condition in (R).
As a formal simplification.. we first assume that g = 0 and discuss at the end
of this section a general method for reducing problems with inhomogeneous
side conditions to problems with homogeneous side conditions, by means of
a simple subtraction trick. In order to formulate (i), we choose fixed functions
"',. . . ., "'" and make the trial
"
Il" = L c,,, \\',
'=1
for the approximate solution u", with the unknown real foefficients C I ", .....
c"", In order to satisfy condition (ii) with 9 == 0, we require
\\', = 0 on cG
Then the boundary condition
for all k.
u" = 0 on cG
is automatically fullfilled for all II". According to (i), we now replace the
original variational problem (10),
i (2 ! t (D,U)2 - Uf ) dx = min!,
G i=1
U = 0 on cG..
18.3. The First Boundary Value Problem and the Ritz Method
2S
by the following approximate problem:
L G / (D/U,,)2 - uf )dX = mint, e" e R", (14)
where c" = (c l ",..., ClIft). Thus, in contrast to the original problem, in (14) we
vary over all functions u" with u" = 0 on aGe Yet we add a purely technical
condition. So that the integral in (14) will exist, we require that all the basic
functions "', , . . . , "'" are continuous and piecewise continuously differentiable
on G.
Corollary 18.7. If u" ;s a solution of (14), then
f ( f D/u"D/w j - fW j ) dx = 0,
G 1=1
j = I, . . . , n.
(1 S)
Obviously, this relation is equivalent to the so-called Ritz equations:
fCII" f f D/WilD/Wjdx= f fWJdx, j=1,...,n. (16)
i-I G I-I G
This is a linear system of equations for determining the coefficients C I ", ...,
ClIft of U". A comparison with (II) yields the following crucial observation:
The Ritz equations (15) are obtained formally from the generalized problem
(t 1) by replacing u by U" and requiring that (11) holds for all v E span { ""1' . . . , "'" }.
PROOF. We write (14) as
f(c II) = min!,
For a solution e", we obtain
C" E R".
iJf(e,,) = 0,
CCj"
j = I, .. . , n.
This immediately implies (16) and hence (15).
o
A concrete example for the Ritz method will be considered in Problem 18.1.
18.3c. Discussion
Our considerations above give rise to the following questions:
(i) In what sense do equations (10), (II), and (12) have a solution u?
(ii) In which sense does (u,,) converge to u? How are WI' W2. ... to be chosen
so that convergence occurs?
(iii) How can the error estimates be obtained?
26
I R V Jriatlonal Problems. the Ritz Method. and the Idea of Orthogonality
We give the answers to thcse qucstions in Sections 22.1 through 22.3. Here we
content ourselves with a brief explanation. Let g == O.
Ad(i). In the general case. the original variational problem (10) possesses
110 solution II E C I (G). but a unique solution II in the
Sobolev space W 2 1 (G).
I n order for the integral in ( 10) to exist. it suffices that II has generalized first
derivatives DiU that are square integrable. These functions are in W 2 1 (G). One
can motivate the choice of W 2 ' (G) by the boundary condition
II = 0 on (G.
In this connection. note the following:
(a) the inclusion JI (G) C W 2 1 fG) is valid: and
(b) the functions from AI = {liE ('I (G): u = 0 on (G: are dense in f21(G).
In fact the functions in the Sobolev space t'21 (G) have even boundary values
in a certain generalized sense, i.e..
II E V2t (G)
implies
II = 0 on i"G
in a certain generalized sense (see Section 21.3).
fOurthermore. we mention the fact that we can allow .r E L 2 (G) in (10). i.e..
Je, ,. 2 clx < x. Thus. the function (may have discontinuities.
The final generalized boundary value problem ( 11 ) reads as follows:
fOor /;r(,11 I E 1.- 2 (G. Cl '/;411CI ion II E W 2 1 (G) is sought so t lIat (II ) is .'lliid .for
all,. E "'21 (Gt. i.e..
(c. )
f ( ,t DjllDjl' - I r ) dx = 0
(i . :: I
for all r E W 2 1 (G).
This problem is called the generalized problem to the classical problem (12),
I.c..
( (-. )
- Il = r on G.
II = 0 on iG.
According to Proposition 18.6. each classical solution of (C) is also a gener-
alized solution. i.e.. it is a solution of (C.). However. the converse is not true.
Only with smooth data f and ?G. can one verify that the generalized solutions
of (C.) are also classical solutions of (C).
F-"rom the physical poilu of vie,v, the generalized solutions are very natural.
namely:
We cleter"Jine SO/ut;OlIS U .(or \\,lJich tlte energy integral (10) of the prin(.;ple
01 mininlal potential energy exists at least in a generalized sense.
In this connection.. note the following. In the original variational problem
( 10) and in the generaliled boundary value problem (II). we only need .(irst-
18.3. The First Boundary Value Problem and the Ritz Method
27
order derivatives. Thus. it is natural to work with the Sobolve space W 2 1 (G)
in (C*). which contains functions having generalized first-order derivatives.
In contrast to this. the classical boundary value problem (C) contains second-
order derivatives.
Ad(ii). In Theorem 22.A we shall see that {\\'1' \\'2'...} must form a basis in
W 2 1 (G) or. more generally, a so-called Galerkin scheme. By A 2 (S7), this occurs,
e.g., with regard to the functions {gPl' gP2' gPJ' ...}, where the Pi are poly-
nomials and g, with certain regularity properties, vanishes on the boundary
eG. Moreover, by A 2 (S9), one can use piecewise linear functions. i.e., so-called
finite elements. Trigonometric polynomials can also serve this purpose (cf.
Problem 18.1).
Ad(iii). In order to obtain error estimates, one makes use of duality theory,
i.e.. one considers a dual maximum problem besides the initial minimum
problem. We discuss a simple special case in Section 22.4. The general duality
theory will be treated in Part III, Chapter 50, within the context of convex
analysis.
Remark 18.8 (Reduction Principle). In conclusion, we describe a simple
method by means of which one can reduce inhomogeneous boundary value
conditions to homogeneous ones. To this end. we consider the inhomo-
geneous boundary value problem
- u = f on G.
u = 9 on cG.
We choose a function g: G --+ R with
g = 9 on eG.
If we now set l = II - g, then we obtain the homogeneous boundary value
problem
- t' = f + g on G,
t' = 0 on cG.
Analogously, the variational problem
L G It (D l u)2 - uf ) d.'t = min!, u = 9 on eG,
is reduced to
L ( I (D,(r + g»2 - (t' + g)f )dX = min!, t' = 0 on ('G.
In order to transform an inhomogeneous differential equation into a homo-
geneous one, we proceed similarly. For example, we consider the inhomoge-
lIeOIlS equation
- u = f on G,
u = 9 on eG.
28
18 Va national Problems, the Ritz Method, and the Idea of Orthogonahty
If one knows a special solution " with
-L1ii = f on G,
then we set r = II - II.. and hence we obtain the homogeneous equation
- L1r' = 0 on G,
r = y - u on ('G.
18.4. The Second and Third Boundary Value
Problems and the Ritz Method
I n this section we first of all encounter the task of explaining the concept of
nalural houlldary conditiolls. These are boundary conditions that do not appear
at all in the original variational problem, and first arise as necessary solvability
conditions. Moreover.. we shall discuss a method which allows us to force
uniqueness for problems that are not uniquely solvable, by going over to a
subspace (cf. Corollary 18.10). We consider the following problems:
fA) 'llr;at;oll£ll prohleIJI
L G , (Dill)! - llf)dX + s'G G 1121t - llg )dO = min!.
II Eel ( G).
( 17)
(B) Ge'lerll!i:ed hou1Jdary I'a/ue prohle,n
L C D,IlD j l" - (I') tlx + f,(; (lw - g)t'dx = 0 for all t' E C. (G). (18)
(C) BOIII,clary raille prohlenJ \\'illl IIalural hou,1dary condition
- L\u =.f on G.
('U
- + "" = g on c'G.
('n
( 19)
"-or II = 0 resp. II 0), (19) is called the second (resp. third) boundary value
problem for the Poisson equation.
Proposition 18.9. Let .r E C(G) and y, h E C(iG). Suppose G satisfie.'i (7). The"
the .follo"';ng are t'alid:
(a) (or II E C 2 (G).. prohleIJls (17), (18), and (19) are mutually equit:'alent;
(b) for If E C1(G), problems (17) and (18) are equ;vale1Jt;
fc) f II E C 1 (G) i... a solution of (17) M!ith h = 0, then the soll'ability condition
f .r dx + f g dO = 0
G i'G
(20)
is redid.
18.4. The Second and Third Boundary Value Problems and the Ritz Method
29
In order to associate a physical picture with these assertions, we again, as
in Figure 18.8 above, interpret u(x) as the vertical displacement of a membrane
at the point x over the region G in R 2 under the influence of the external force
f (e.g., the force of gravity) and the external boundary force g. The membrane
is to be elastically joined to the boundary oG. Problem (17) contains the
requirement for a minimal potential energy. The term 2- 1 Ji'G u 2 h dO describes
the potential energy of the elastic coupling at oG. Whereas no boundary
condition appears in (17), there occurs in (19) the so-called natural boundary
condition
ou
- + hu = g on oG
an
as a necessary solvability condition. This natural boundary condition is a
necessary condition for the boundary forces g and the behavior of the displace-
ment u on the boundary under equilibrium, whereas the solvability condition
(20) represents an equilibrium condition for the external forces f and g.
In contrast to (19), the first boundary condition II = 9 on cG in (12) prescribes
the displacement of the membrane at the boundary points.
PROOF. We use the same strategy as in the proof of Proposition 18.6. To this
end, we set
F(u) = t (It (D l u)2 - lu )d. + t GU 2 h - ug ) dO.
Moreover, we define the real function
cp(t) = F(u + tv)
for all t E R
and fixed l' eel (G).
Ad(a). Let u E C 2 (G).
(I) (17) (18). If u is a solution of (17), then cp has a minimum at t = 0, i.e.,
q>'(O) = o.
This implies (18).
(II) (18) (19). Integrating (18) by parts yields
f (-u - f)vdx + f ( hU - 9 + oU ) VdO = 0 for all v E C1(G).
G
I t first of all follows from this that
t(-U-f)VdX=O forall veCo(G),
therefore - 4u - f == 0 on G according to Proposition 18.2. Second, it
follows from
f ( hU - 9 + Ou ) v dO == 0
fG en
for all v e C 1 (G)
30
I K Varlallonal Problems, the Rltl Melhod, and the Idea of Orlhogonalit},
that IJII - Y + i"I1/(" = 0 on l'G.. since the boundary values of the func-
tions r E C' (G) are dense in L 2 (l'G).
(III) (19) =>(18). Multiplication of -U =.r by I' E CI(G) and integration by
parts yield ( 18).
(IV) (18)=>(17). One deduces as in the proof of Proposition 18.6(IV).
Ad( b). For ( 17) <:> ( 18) only II E C 1 (G) was used.
Ad(c). Choose r = 1 in (18). 0
One clearly recognizes in the proof that the natural boundary condition
aries by means of integration by parts.
The second boundary value problem (19) with II = 0 is 1I0t uniquely solvable
since II = I is a trivial solution. It is well known from the classical theory that
the IlIriqlleIJess is forced by means of the normalization condition
f U dx = O. (21 )
(i
We shall see in Section 22.2f that (21) also suffices for the uniqueness of the
corresponding generalized problem. As motivation for Definition 22.13 we
prove the following.
Corollary 18.10. L,ef II = 0 a"d suppose that the sO/l'abilit y c01ldit ;011 (20) ;s
."Cltisfiecl. Then lire eqllil"llience statements in Pr()po.ition 18.9(a), (b) remain
correct ;11 ('a.e olle pla('e. the additional normalization condition (21) on II a"d
r ;11 (17), (18). and (19).
PR()t)r. Let .\1 = : n' E C' (G): Je; \\' d:< = O}. I t suffices to prove that for II = 0
condition ( 18) is equivalent to
r ( t D,IID. '" - rl\' ) c/x + f (hll - g)\V dx = 0 for all ". E M. (IS.)
J G p: I , ,'G
Each f' E (., (G can be represented as r = c + \\. with \\' E AI and a constant
c. Because of (21) and" = O.
r ( t D;IID j c - IC ) dx + f (lIU - g)c dO = 0
J (; , -= I ,"'J(j
holds trivially. Therefore.. (18) follows from (IS.). The converse is obvious. 0
18.4a. Generalized Problems in Sobolev Spaces
Motivated by (IS.).. in Section 22.2f we will formulate the final generalized
problem to the second houJldar.r I'a/ue prohlem (19) with It == 0 as follows. Let
.
M. = {w E WZ1(G): fG wdx = o}.
18.4. The Second and Third Boundary Value Problems and the Ritz Method
31
For given 9 e L 2 (cG) and f e L 2 (G) with (20), we are looking for a function
ue M I
such that (IS.) holds for all \\' E MI'
The choice of the Sobolev space W 2 1 (G) results from the fact that first-order
partial derivatives appear in (IS.).
Motivated by (IS), in Section 22.2g we will formulate the final generalized
problem to the third boundary value problem (19) as follows.
For git'ell g e L 2 (rG) and h e C(cG) ,,'ith h ho > 0 on cG and 1 e L 2 (G)
,,'ith (20), "Ie are looking for a function
u e W 2 '(G)
such that (IS) holds for all '" e W z ' (G).
The condition "h ho > 0 on cG" ensures that the elastic potential energy
! J;IG hu 2 dO is positive.
18.4b. The Ritz Method
In order to formulate the Ritz method for the original variational problem
(17), we proceed from the trial
"
II" = L c." Wi'
'=-1
(22)
where the basis functions "'1' ..., WIt are given. We determine the unknown
real coefficients e I'" ..., e"" in a way parallel to (17) from the variational
problem
f ( - 2 1 f (D,U,,)2 - U"/ ) dx + r (iu:h - u"g)dO = min!, e" E R", (23)
JG i-I JCG
where Crt = (e l ",..., e",,). We assume that all w" ..., ,v" are continuous and
piecewise differentiable on G. In case h == 0, we force the normalization
condition
t u" dx = 0
by means of the requirement JG "', dx = 0 for k = 1, . . . , n.
Corollary 18.11. If u" is a solution 01 (23), then
r ( f D,u"D,w J - IW J ) dx + f (hu.. - g),,'J dO = 0,
JG i-I cG
j = I,..., n. (24)
Because of (22), this is a linear system of equations for determining c I '" . . . , ClIft'
32
I H Va national Problems. the Ritz Method. and the Idea of Orthogonality
The proof runs analogous to that of Corollary 18.7. Equation (24) results
formally from the generalized boundary value problem (18) if one replaces u
there by "n and requires that (18) is valid for allt, E span {"'... . .. "',,}.
Assertions concerning existence. uniqueness. and convergence of the Ritz
method can be found in Sections 22.2f and 22.2g.
18.5. Eigen\'alue Problems and the Ritz Method
In this section it will be shown that eigenvalue problems result from appropri-
ate variational problems with side conditions. The method of Lagrange multi-
pliers is typical for the investigation of variational problems with side condi-
tions. We shall occupy ourselves therewith in detail in Part III. Here we use
a direct method analogous to Sections 18.3 and 18.4. We consider the following
problems. Let AI = : II E C. (G): u = 0 on l'G}.
(A) Variatiollal problenl M'ith side conditions
t i (DjU)2 dx = min!. tI E M.
f U2 dx = I,
(i
[ Uti. dx = O. r = 1..... m - 1.
Je;
(25a)
(25b)
(25c)
This problem is to be understood as follows: We determine u 1 from (25a) and
(25b). Then 1'2 results from (25a)-(25c) with m = 2. and so forth.
(8) Gel1erali:ed bouIJdary eigent'alue problem
f ( ,f DjllMDjl' - JlMUM I ' ) dx = 0 for all l' E M,
G 1=1 (26)
U m = 0 on eG.
(C) Boundary e;genl'alue problenJ
- u'" = JJ",Il", on G,
u'" = 0 on eG.
(27)
(D) Ritz I1lethod
f ( .t DjwDj\\i - Jl""') ) dx = 0, j = I,.... n. (28)
G I-I
The basis functions \\'1' . . . 'I "'n are given. We seek JJ E R and the real coefficients
(' .' . . . , c" of
n
\V = L Clt "'It.
_-I
18.5. Eigenvalue Problems and the Ritz Method
33
Problem (28) is an eigenvalue problem of the type
Ac = JJBc,
(28* )
where A and B are symmetric n x n matrices and the column matrix c contains
the elements c. to c".
Parallel to Sections 18.3 and 18.4 one obtains (28) from the generalized
boundary eigenvalue problem (26) if one only simply requires that (26) holds
for all functions v in span {\v.,. . . , w,,}. So that w in (28) satisfies the boundary
condition
w = 0 on cG
we require that
"', = 0 on oG
for k = I, .. . , n.
So that the integral in (28) is meaningful, we require, !ay, that all w. are
continuous and piecewise continuously differentiable on G. A precise analysis
in Section 22.11 shows that the eigensolutions and eigenvalues of(28) approxi-
mate the eigenfunctions u'" and eigenvalues /.l", of (27) in an appropriate sense.
A numerical example for the Ritz method will be considered in Problem
18.1 b.
Proposition 18.12. Slippose that the region G satisfies (7). Then the following
are talid:
(a) Etery solution u'" e C 2 (G) of (25) also satisfies (26) and (27) for m = I, 2,
. .. ",;th suitable JJ", E IR.
(b) Etery solution u'" eel (G) of (25) also satisfies (26).
(c) For u'" E C 2 (G), the boundary eigenvalue problem (27) and the generalized
problem (26) are equivalent.
PROOF. Ad(a), (c). Let u'" e C 2 (G). We set
r t (D,U)2 d.'(
F(u) = J G 'at .
L u 2 dx
(I) (25) (26). First let m = I. Recall that
M = {u e C1(G): u = 0 on cJG}.
If u I is a solution of (25), then it is also a solution of
F(u) = min!,
We define the real function
cp(t) == F(u. + tv) for all t E R
and fixed t' e M. Since U 1 + tv e M for all t, the real function cp has a
u eM, u o.
(29)
34
I Vanatlonal Problcm4\. the Ritz Method. and the Idea of Orthogonality
minimum at I = O. i.e..
«/,'(0) = o.
This implies
1 4 ( t D," I D. r - Ii." .1' ) c/x = 0
. (i , ': I
for all (' E I.
"'ith I'I = f"(u.). after a simple calculation. This is (26) with In = I.
No\\" let In = 2. We set
.\1. = {" E AI: 1; flU I dx = o}.
Let 112 be a solution of (25). Then II 2 is also a solution of
f.(II) = min!.
II EM. . II f= O.
We set
"'(I) = F(1I 2 + t"')
for dll t E (R
and fixed "" E ,\1." Since 11 2 + ,n' E .\1. for aliI.. the real function'" has a
minimum at t = O. i.e..
""(0) = O.
This yields
L ( D j "2 D j ". - I l 2"2 u) (Ix = 0 for all '" E MI' (301
Furthermore.. it follows from (26) with ,,, = I. l' = 112. that
L; (DjIl2Djlll - III 1(2 1 '1 ) dx = O. (31)
Each r E .\1 can be represented as
r = CII. + "'.
n' E J\".. (' E R.
From J(; "1"2 c/x = 0 it easily follows that (30) is also valid for all \" E .\'1.
T his is (26) \\' i t h ", = 2.
One deduces the result for 1)1 > 2 analogously.
II) (26t (27). Integrating (26) by parts yields
f - l\u", - JI",II", )1' cI. = 0 for all l' EM.
(;
Proposition 18.2 yields -l\u", - JI",I1", = 0 on G, because Co (G) c: M.
(III) (27):::. (26). Multiplication of - [\lIm = J'm"", by r E J\1 and integration by
parts yields (26).
Ad(b). In the proof of(aJ we used only that II E C.(G). 0
18.6. The Ho1der Inequality and its Applications
35
The final generalized problem in Section 22.13 reads as follows.
We are looking for a function Ulft e W 2 1 (G) and a real number #lift such tllat
relation (26) Iioids for all t' e W 2 1 (G).
As in Section 18.3, this is motivated by the fact that the set M is dense in
the Sobolev space W 2 1 (G). The existence of solutions of the generalized prob-
lem and the convergence of the Ritz method will be proved in Section 22.13.
18.6. The Holder Inequality and its Applications
The most important inequality in modern analysis is the following Holder
; lIeqlla lit.\':
L IfYldx S; (t IfI PdX )' p (t IYI'dX)"
(32)
with
1 1
- + - = 1.
P q
1 < p < x.
(33*)
If we set
Ufll p = (t IfI PdX )' p.
then we may write (32) in the form
tfYdX S; tlfgldXS; UfllplIgU"
(33)
Proposition 18.13 (The Holder Inequality). Let G be a nonenlpty nleasllrable set
;n At" ",;tl, N > I (e.g.. G ;.'i ope" or closed). Then ;t follo,,'.'i .fro,,,
fe L,(G).
g e Lq(G)
"';111 (33.) ,hat (33) holds.
If fe L,(G) and g e Ly(G), tllell (33) holds \\'ir/l p = I and q = x'.
For the convenience of the reader. the basic material of modern analysis is
summarized in the Appendix (e.g., measurable sets and measurable func-
tions. the Lebesgue integral. the Lebesgue spaces L,( G). 1 < P < 'X. etc.). For
example. we have
fe L,(G),
iff .f: G -+ R is measurable and
I sp< 'X.J
IIfll, < x.
36
18 Variational Problems, the Ritz Method. and the Idea of Orthogonality
f-'urthermorc. we have .re L-r(G) iff I: G -. R is measurable and there exists a
real number C such that
I.{(x)l < C
for almost all x e G.
Thcn. II {I: I is equal to the infimum of all possible C. We write
lI.fll x = ess sup 1/(.\')1.
.lEG
For I < p < x.. L p ( G) is a real B-space with the norm n." p' provided
we identify any two functions which differ only on a set of N-dimensional
Lebesgue measurc zero. That is, we identify .r with g iff f(x) = g(x) for almost
all x E G.
Note that only pathological sets and functions are lIot measurable. Never-
t he less. in each case. one has to check that the functions and sets under
consideration are measurable. To this end one can use the tools summarized
in A 2( 1)- A 2( 12). Furthermore, note that in each case, one has to check the
existence of the integrals under consideration. In this connection. the most
important tool is the majorant criterion of Lebesgue A 2 ( 11).
Roughly speaking. Proposition 18.13 tells us the following:
{ the riglu-hantl side ;11 (33) exists. then the remaining integrals also exist al1d
the ;IJeqflalities (33) hold.
The Holder inequality was first proved by Otto Holder (t 859-1937) in 1889
for series. The extension to integrals is due to Fryges Riesz. The following
proof of Proposition 18.13 is based on the Young inequalit.\,:
" l' b"
Clh < + -. for all a, heR. (34)
p q
and all p. q with (33.).
In Part III we shall show that (34) is the germ cell of the general duality
theory.
PRCX)F 01-" PROPCJSITIC)N 18.13.
(I) Proof of (34). If p = q = 2. then (34) follows from (a - h)2 > O. In the
general case, we consider the function
a P h 4
F(a) =. + - - - ab
p q
for fixed b > O. We have
f-(O) > O.
/;(h 4 'P) = O.
lim F(a) = + x .
CI-+X
"Ience f. has a minimum on R.. Thus, there exists a number a o > 0 such
that
F(a) > f'(a o )
for all a E R + .
18.6. The Holder Inequality and its Applications
31
From F'(ao) = 0 it follows that ao = b"", and hence F(ao) = o. This is
(34).
(II) Proof of (33) for 1 < p < 00. We may assume that IIfll" = IIgli. = I.
Otherwise we use a division. Integration of
I/gl S III' + Igl'
p q
over G yields JG Ifgl dx 1. This is (33).
(III) In the case p = 00, we obtain
t Ig d.v: s t I/lIgl d.v: s III II x t Igi d.v:.
o
We now consider some important applications of the Holder inequality
which will be used frequently.
18.6a. The I-Trick
The I-trick means that we apply the Holder inequality (33) with 9 = I. This
is only possible for bounded measurable sets G. Otherwise IIgli. = 00. This
way we obtain
t I/ldx S (measG)I"(t I/IPdx Y'p
for all f e L,,(G) with I < P < .
(35)
EXAMPLE 18.14. Let G be a nonempty bounded measurable set in R N , N I.
Then the embedding
L,(G) s Ls(G),
ISssrsoo
is continuous, i.e., we have
IIhlls const Uhll,
for all h E L,( G).
(36)
PROOF.
(I) Let 1 S s < r < 00 and let h E L,(G). Then (36) follows from (35) with
f = Ihls and p = rise
(II) Let I S s < r = and h E Lao(G). Then Ih(x)1 S C for almost all x E G.
Hence
(t Ihlsdx Y's S CmeasG.
This implies (35), since IIhll2> is the infimum of all C.
o
EXAMPLE 18.15. Let G and H be two nonempty bounded measurable sets
in R N with N I and G s H. Suppose that f E L,(H) with I < p S 00.
JX
I \',anatlondl Problems. the Rllz Method. and the Idea of ()rthogonalit
-"hen
,.
J 1'1 dx < (meas G)I elll.fll / . pclI .'
(,
\\'here'l I ...... P I :..= 1 for 1 < P < _ and 'I = I for r = x.
Thi'\ follo\,." from (J}
Remark 18.16. If 1 < P < -, and I E L"fll). then
I f I d \" -+ () as mcas G o.
I (,
\\'here If and (i arc given as in Example I .15. This follows from the continuity
of the embedding Lp(llt c 141 (H) and the absolute continuity of the integral
(cf. A:!(20H. In the case I < P < x. Example 18.15 contains precise information
about this limiting process.
18.6b. lhe Sum Trick
The prototype of the sum trick is the inequality
2tlh < I:" + I: - I h:!
for all u. hER +
(37t
- -
and aliI: > O. which follo\\'s from (,: l:d - h, \, r.t > O. This inequality tells us
that it is possible to estimate a product against a sunl of squares. where one
of the summands can be made arbitrarily small. From the Young inequality
f 34) there follo\vs a generalization of (37). namely.
1."(11' h q
ah < +
- I' ql;q
and alii: > () Jere. I < p < and p - t + C/ - t = I.
for all Cl. hEIR..
(38)
EXA1PII: 18.17 Let G be a nonempty measurable set in (R". N > 1. Suppose
that I E 14,,(G) and / E L«I((j) with I < p < x. and q-I + p-I = I. Then
i r" i 1 i
I f /I ,,"( < . - I f I p d."( + - . q I g I q d."(
(, p (, ql. (;
holds for alii; > o. In particular.. one can choose p = q = 2.
This follows from the Holder inequality and from (38).
f39)
18.6c. The Nemyckii Trick
For the investigation of nonlinear differential and integral equations. the
Nemyckii operator plays a fundamental role. This operator will be considered
18.6. The Holder I nequality and its Applications
39
in Section 26.3. In this connection, the following result is crucial. Our goal is
to show that the estimate
I \\'(x)1 I u(x)1 + I r(x)I'" on G.
1 S p.. q < .x, 't
(40)
implies
11"'11, s const( lIull, + IIt'II").
EXAMPLE 18.18. Suppose that:
(i) G is a nonempty measurable set in R N , N > I.
(ii) II e L,(G) and l' e L,(G).
(iii) The function ,,': G -+ R is measurable (e.g., continuous) and (40) holds.
Then we have ". E L,(G).. and (41) holds.
(41 )
PRC)()F. Let r > O. We use the inequality
(a + b)r < C(a r + b r )
for all a. h e R.'t
(41 * )
where the constant C depends on r. From (40) we obtain
1\\'1' < C(lul' + It'I').
Integration over G yields
t Iwl 4 dx S c(t lul 4 dx + t 1l'IPdX).
Now the assertion (41) follows from (41*) with r = Ijq.
o
There are two completely different methods for the investigation of nonlinear
differential and integral equations:
(i) One uses L,tG)-functions.
(ii) One uses C'(G)-functions or C t .2(G}-functions.
Note the following peculiarities. In the case (i). Example 18.18 above is
responsible for the fact that one needs restrictive gro".th conditions for the
nonlinearities. But one has the advantage that the L,(G)-spaces are re.f1exil'e
in case 1 < p < 'X. In the case (ii), no growth conditions are needed, but the
C'-spaces and C'.-spaces are not reflexive. Unfortunately.. many existence
theorems are only valid in renexive B-spaces. The point is that in renexive
B-spaces. each bounded sequence has a weakly convergent subsequence.
In the Appendix, A 2 (29). we will consider the Holder inequality and the
Young inequality for n factors. If we combine these two inequalities. then we
obtain the following Holder- Young inequalit)':
r Ii Ii dx
JG 1=1
" ( i ) I "
S n l.hl" dx
,a 1 G
" I i
< L - l.hl'" dx.
i-I P, G
(42)
40
I R Variational Problems, the Ritz Method, and the Idea of Orthogonality
This inequality holds for all
E L,,(G).
, = 1. . . . , II.
where the real numbers I < PI' . ... p" < x' satisfy the relation
t l =1.
i-I Pi
and G is a nonempty measurable subset of R N , N > 1.
18.7. The History of the Dirichlet Principle
and Monotone Operators
In order to determine integrals of algebraic functions and their inverse func-
tions (elliptic and Abelian functions), one can apply a principle which was used
by Dirichlet in his lectures. This principle allows us to solve boundary problems
for the Laplace equation.
Bernhard Riemann (1857)
By the Dirh'lr/t'l pr;,,(';plt' we understand a method for solving boundary value
problems via minimum problems for variational integrals. This principle goes
back to Gauss (1839), Lord Kelvin. and Dirichlet. Weierstrass (1870) was the
first to underline the shortcomings of this principle. He showed that there are
variltional problems which do not have any solution. In 1900 I showed that it
is possible to justify rigorously the Dirichlet principle.
David Hilbert (1901)
Riemann f 1826 - 1866) \\'as a man of outstanding intuition. By his ingenuity he
outshone all his contemporaries. Whereever his interest was awoken. he began
anew, without allowing himself to be misled by tradition and without recognizing
any compulsion of systematology.
Weierstrass (1815-1897) was first and foremost a logician. He proceeded
slowly, systematically, gradually. Whereever he worked. he strove after the final
form.
Certainly mathematics itself pronounces the verdict, even if it does so without
rigorous proof. It will, however. remain for ever the secret of vigorous produc-
tivity to find new questions and to discover new theorems which lead to valuable
results and new interrelations. Without new ideas and without the formulation
of new objectives, mathematics would soon amount to nothing more than the
rigidity of its proofs, and would begin to stagnate, its fuel running out.
So, in a certain sense, mathematics has been furthered most by those who
distinguished themselves more through their intuition than through strong
deduction.
Felix Klein (1926)
The "'ell-known hOll mot that uEurol>ean philosophy". is only a footnote 10 Plato
is perhaps true. but I would venture a much truer one: modern mathematics is
only a footnote to Riemann.
Krys7tof Maurin f 1982)
18.7. The History of the Dirichlet Principle and Monotone Operators
41
18.7a. The Calculus of Variations
As the Dirichlet principle is very closely connected with the calculus of
variations, we will next take a short glance at the history of this fundamental
mathematical discipline.
In Parts III through V we will see that many basic equations of physics
result from an important variational principle, namely from the principle of
critical or stationary action, which is often incorrectly called the principle of
least action. The general theory of relativity and the gauge field theories in
modern elementary particle physics belong here.
It seems that the local symmetries of variational integrals, which lead to the
gauge field theories, provide the key to understanding the four fundamental
interactions in nature: the strong, weak, electromagnetic, and gravitational
interactions. This will be discussed in detail in Part V.
About three hundred years ago, at the beginning of a fascinating develop-
ment in mathematics and physics, there was the following prize competition.
Johann Bernoulli, professor of mathematics, greets the most sophisticated
mathematicians in the world. 1 Experience shows that noble intellectuals are
driven to work for the pursuit of knowledge by nothing more than being
confronted with difficult and useful problems.
Six month ago, in the June edition of the Leipzig Acta Euriditorium, I presented
such a problem. The allotted six-month deadline has now gone by, but no trace
of a solution has appeared. Only the famous Leibniz informed me that he had
unraveled the knot of this brilliant and outstanding problem, and he kindly
asked me to extend the deadline until next Easter. I agreed to this honourable
request. . . . I will repeat the problem here once more.
Two points. at different distances from the ground and not in a vertical line.
should be connected by such a curve so that a body under the influence of
gravitational forces passes in the shortest possible time from the upper to the
lower point.
Johann Bernoulli (January, 1697)
This paper sol\'es my brother's problem, to whom I will set other problems in
return.
Jacob Bernoulli (May, 1697)
How does one apply the methods of maxima and minima in the determination
of unknown curves (solution to variational problems)?
Leonhard Euler ( 1744)
The famous Euler succeeded in tracing back to a general method all investiga-
tions on variational problems. But however sophisticated and fruitful his method
may be, one has to admit that it is not simple. Here one now finds a method
which only uses simple principles of calculus.
Joseph Louis Lagrange (1762)
I leibniz (1646-1716). Johann Bernoulli (1667-1748), Jacob Bernoulli (1654-1705) (brother or
Johann Bernoulli), Daniel Bernoulli (1700-1782) (son of Johann Bernoulli), Euler (1707-1783),
Lagrange (1736-1813), Legendre (1752 -1833), Gauss (1777 -18SSt, Jacobi (1804-185 I), Dirichlet
(180S-18S9). Riemann (1826-1866). Weierstrass (181 S-1897), and Hilbert (1862-1943).
42
I Vanatlonal Problems. the Ritl Method. and the Idea of Orthogonality
The necessary conditions for the solutions of variational problems. found
by Euler and Lagrange, will be considered in Section 18.18. We have already
met the prototype in Proposition 18.6.
Euler lIsed a difference method. Lagrange"s simple method. based on inte-
gration by parts. is still used today. For example. we used Lagrange"s method
to prove Proposition 18.6. In contrast to Euler. Lagrange could also deal with
variational problems for multi-dimensional variational integrals. which lead
to partial differential equations.
In 1110'( of the prohlcm of the calculus of variations it follo\\'s from the nature
of the problem as to \\'hether or not a minimum or maximum is possible. There
arc. ho\\'e\er. case In "' hich this distinction is not so easy. I therefore thought
that the formulation of a criterion. which allows for a distinction between
ma,ima and Inlnlma. \\'ould be useful.
Adrien-Marie Legendre f 1786)
Ttti !ul.alled Legendre condition _ill he Cfll1!1idered in Settion I R.17. It
d_..t. uh .. fl...., ",."Ui" tUf nu,.Uti. ilt" Iflln'fttMr t...MlHd'lI twh,.1J
that he had found a umcient condition. As Jacobi. however" pointed out.
Legendre.s observations contained a serious mistake.
I ha \ c ucceeded In closing an essential gap in the calculus of variations. lJntl1
no", If" general sufficient condition was known by which to recognize whether
it ma'lmum or minimum existed. For example. the integral. considered in the
principle of least action. can never have a ma,imum. as Lagrange mistakenly
believed: in no way. ho\\'ever. will it always have a minimum.
Carl Gustav Jacob Jacobi (1837)
I n Section 29.12d Vie will consider a general functional analytic formula-
tion of the classical Jacobi theory based on so-called accessory variational
problem,;.
The decisl\c idea conist of studying a linear eigenvalue problem" which
is closel) connected \\'ith the quadratic form of the second variation. Therefore.
our results on linear eigenvalue problems in ("hapter 22 are of great interest
for general variational problems.
.. rum thl ..Indcpendence theorem" arise not onl) the known sufficient condi-
tions of Jacobi and Weierstrass for the appearance of minima. but also all the
essential conclusions of the Hamilton Jacobi theory.
David Hilbert ( 1906)
Hilbert refer here to his invariant path integral. which we will consider in
Part III \\'ithin the frame\\'ork of field theory in the calculus of variations (see
Proposition 40.17). There \\'e will also elucidate the connection with the
famous Weierstrass £-function.
The Hamilton Jacobi theory describes mathematically the fundamental
propagation of action in nature and the structure of optimal processes in
18.7. The History of the Dirichlet Principle and Monotone Operators
43
nature. technology. and economics. This will be studied in detail in Parts III
through v.
18.7b. The Beginnings of Potential Theory
In 1182 Laplace remarked. in a study on the shape of the planets,. that the
potential
t i p(:<}
u(y) = -- dx
4n G Ix - yl
of a mass distribution of density p on the region G satisfies the following partial
differential equation
(43)
L11l = 0 otside G,
(44)
which is called the Laplace equation.
Interestingly. he first found this equation in the complex form of spherical
coordinates. In 1181 he discovered the much simpler expression for u in
Cartesian coordinates, namely,
&u = u + u.,,, + u,.
where x = (..". ,). In 1813 Poisson found that for a ball G the following
equation is valid in the case of a constant density p:
-&u=p onG.
This equation is called the Poi.son equation.
(45)
Gauss proved the validity of this equation for more general cases in 1839
in his fundamental paper: "General Theorems about the Forces of Attraction
and Repulsion which are Inversely Proportional to the Square of the Di-
tance.... Such forces are. for example, Newton's gravitational force and the
electrostatical Coulomb force. Gauss also used such forces in order to explain
magnetic phenomena. The above-mentioned work of Gauss is the beginning
of the mathematical foundations of potential theory. In this work, for example,
Gauss proved the fundamental mean value property of potential functions
(solutions of the Laplace equation &u = 0). He had already realized in 1813
that volume integrals can be transformed into surface integrals (Gauss' inte-
gral theorem). The formula of integration by parts. which is at the forefront
of the modern theory of partial differential equations. is equivalent to the
integral theorem of Gauss. We have already discussed the fundamental impor-
tance of this theorem for topology (mapping degree, cohomology) in the
introduction to Chapter 12.
In 1801 Gauss became Professor of Astronomy and Director of the Ob-
servatory in Gottingen. where he worked until his death in 1855.
I Laplace (1749-1827». Poisson (1781-1840). Gauss fl777-18SS). and Green (1793-1841).
44
I H Varaational Problems. the Ritl Method. and the Idea or Orthogonality
For valuable contributions to potential theory we also have George Green
to thank. who. in 1829. discovered the Green integral formulas and who
introduced the important Green function. which allows us to represent the
solutions of elliptic boundary value problems by integral expressions.
18.7c. Riemann and the Dirichlet Principle
From 1846 until 1847 Riemann. studied under Gauss in Gottingen. In the
years 1847 1849 he was a student in Berlin and attended lectures by Dirichlet, I
Jacobi. Eisenstein. and Steiner. He learnt about the Dirichlet principle in
Dirichlcfs Icctures on potential theory. During this time, as a student in Berlin,
the idea matured within Riemann to give a foundation to the theory of
complex analytic functions by means of the theory of partial differential
equations for real functions. To explain this let
.f(:) = " + ir
he an analytic function. Then the real functions" and r satisfy the so-called
("au('II.r Rienrallll di{ferelltial equations
Il = r".
",,= -I'.
where = == + ;'1.
By differentiation. \\'e obtain the Laplace equation
,, + II"" = O.
(46)
and r + r"" = O. i.e.. All = Ar = O. The corresponding variational problem
to (46) is given by
f 2 2 d - .,
(u + u,,) x - min..
G
(47)
II = y on tG.
In this way Riemann was able to give a physical interpretation, in terms of
potential theory. to the mathematical problems of the theory of analytic
functions. During his lifetime Riemann, like his teacher Gauss. tried to find
Jeep COllnf!ctions between mathematics and physics.
In 1849 Riemann returned to Gauss in Gottingen, and in 1851 handed over
his dissertation which was much praised by Gauss. In this fundamental paper
of a mere fourty pages. Riemann laid the foundations for the complex function
theory and proved his famous theorem on conformal mappings. In this paper,
and in his fundamental paper on Abelian integrals on Riemannian surfaces
written in 1857. he placed a general existence principle at the head of the
theory. which he called the Dirichlet principle. In today's language the principle
, Riemann (lH6 IH66,. Dlrachlet 0805 18591. Jacobi (1804 1851). Eisenstein (1823- IR52t.
Steiner t 1796 -I H63). Weierstrass t t 815-1897).
18.7. The History or the Dirichlet Principle and Monotone Operators
45
is as follows:
(D)
For giren sufficient I}' smooth functions 1,' and g, the minimum problem
t (u( - ('.,)2 + (u( + ('.,)2 dx = min!.
u = g on cG
ah,'aJs has a solution in the set of piecewise continuously differentiable functions.
Here, G is a bounded region in 1R2.
In the special case to = 0, problem (D) corresponds to (47). Like Dirichlet,
Riemann made no attempt to prove this existence principle. Both considered
it to be obvious that problem (D) possesses a solution. In fact, the time was
not yet ripe for an existence proof for (D).
18.7d. The Theory of Abelian Integrals
The theory of Abelian integrals and their inverse functions was at the center
of Riemann's and Weierstrass' research on complex function theory, which
was carried out totally independently from one another.
In his obituary for Weierstrass, Hilbert writes:
Karl Weierstrass studied law in Bonn. In 1839 he decided to study mathe-
matics and therefore went to Munster, where he attended a mathematical lecture
by Gudermann on elliptic functions. That was the only mathematical lecture
which he ever heard in his life. After that Weierstrass worked from 1840 until
1855 as a schoolteacher. totally isolated. in small towns. During this time he
published a series of fundamental papers on the most difficult problems of
complex function theory.
In his academic inaugural speech in t 857 as a newly elected member of the
Berlin Academy. he described how the theory of elliptic functions had tremen-
dously attracted him. even before his studies, and that he regarded it as one of
the main tasks of mathematics to create a general theory on periodic functions
of 5e\'eral complex variables (Abelian functions).
To prepare himself for this difficult task, he first studied the available tools
and occupied himself with less difficult problems. As a result of these studies
he published. during the years 1841-1843. papers on the theory of power series
and on the definition of analytic functions by means of algebraic differential
equations.
In 1854 Weierstrass first succeeded in finding the solution to the famous
"inverse problem" posed by Jacobi in 1832. The generalization of this result to
general Abelian integrals, which was later obtained by Riemann and then by
Weierstrass, is one of the greatest achievements in analysis.
Whereas Riemann based his theory of Abelian functions on the imperfect
nonrigorous Dirichlet principle. Weierstrass' theory is based on purely algebraic
arguments.
46
'X Vandllonal Problems. the Ritz Method, and the Idea of Orthogonaht
We would brlcny like to explain the mathematical background to the
reader. By an Abelian integral we understand an integral of the form
A = f RI:. 1\'1:))d:.
"'here \\ = "'t:) IS an algebraic function and R = R(:. "') is a rational function.
i.e,. the function '" = ".(:) satisfies an algebraic equation of the form
ll"{:),,,n + ... + Cl l f:)\" + a o (:) = O.
"'here tI u . . . .. a" arc polynomials with respect to :. For example. let Pn be a
polynomial of degree II and let
".2 = Pn(:)'
Then. '" = '\ Pn t :). If II = 3.4 (resp. II = 5.6). then tht: integral J R(:. "'(:Ht/:
is called elliptic t resp. hyperelliptic). I n the special case \\,2 = I - :2. we obtain
the Abelian integral
I = f: ,
d:
I - :2
Here. the Inverse function is given by : = sin t.
In 1832 Jacobi formulated the following hypothesis:
CO'I.id('r lire "\'o f,llrctio,rs 1l = r'(l,. h) and r = I'(a. h) I;I"ell hy tire ,(ol/()\\'illO
\\'st£'II' of "yperelliplic inte/rals:
I II d: I ' d:
"II \.-PI:) + "" rp6(:- ) = a.
I " :d: I I :d:
", \. ptol-) + I, \.' ;;6(:) = h.
"flrell IIr£' ,,, 0 fllll( I;OllS II + I' alld Ul' art' ullil'a/f.'lIt and they po,sess ,(ollr diflerent
periods.
Riemann and Weierstrass proved a generalization of this assertion for
systems of II equations with n general Abelian integrals. Whereas Weierstrass
used algebraic methods.. Riemann worked with the topological idea of the
so-called Rie,nanniclll slIr.(ace. To explain this. we shall consider the elliptic
integral
f RI:. wi:)) d:
with ,,'f:) = \. p(:). \\'here p is a polynomial of degree 11 = 3.4 with pair\\'ise
different zeros. Riemann"s idea was to consider this integral on the Riemannian
surface 1. of ". = ,,'f:). Here T is homeomorphic to a two-dimensional torus.
The point is that on the torus. two different paths exist. C 1 and C 2 ' as pictured
18.7. The History of the Dirichlet Principle and Monotone Operators
47
T
Figure 18.9
in Figure 18.9. Set
W, = r R(z, W(Z)) dz,
JC i
Each path on T can be deformed into
nC. + mC 2 ,
where nand m are integers. It follows from the Cauchy theorem that the
deformation of paths on T does not change the corresponding integrals.
Consequently, we obtain, without any calculations. that the elliptic integral
J R(z, ,,'(z))d: has the two additive periods WI and W2. Therefore, the corre-
sponding inverse functions, i.e., elliptic functions. have two periods.
Riemann generalized this argument to general Abelian integrals. He showed
that the corresponding Riemannian surfaces are homeomorphic to spheres
with p handles, and hence the corresponding Abelian integrals have 2p additive
periods (Fig. 18.10(a)). The number p is called the genus of the Riemannian
surface. This is the only topological invariant of Riemannian surfaces which
correspond to algebraic functions. In the special case of a two-dimensional
torus we have p = 1 (Fig. 18.IO(b)).
Ingeniously, Riemann recognized that there are close and deep connections
between analysis, topology, and algebra. With this recognition he decisively
innuenced mathematics in the twentieth century.
At the end of the last century Hilbert discovered that a new powerful
instrument for number theory can be obtained by combining the idea of the
Riemannian surface with the Galois theory. Thus he gave number theory a
i = I, 2.
@) >
p=2
(a)
p=1
(b)
Figure 18.10
48
I H Varaatlonal Problems. the Ritl Method. and the Idea of Orthogonality
decisive new impulse. This is discussed in detail in the famous book by Felix
Klein (1926): The History of Mathematics in the Nineteenth Century.
It is quite interesting that today Riemannian surfaces playa fundamental
role in the so-called string theory of physicists (see Section 76.7) and in the
theory of solitons (see Problem 30.7).
18.7e. The Criticism of the Dirichlet Principle by Weierstrass
In studying Riemann's work, Weierstrass found the Dirichlet principle un-
satisfactory. In 1870 he constructed a counterexample which showed that the
existence of solutions to variational problems is no trivial matter. He considered
the follo\\'ing minimum problem:
Je() d( f I fxf'fxH 2 dx = min!,
-I (48)
.r ( - 1 ) = a,
with a # h. The sequence of functions
fll('\:) = a_ + b _ _ - b . arctan nx ,
2 2 arctan n
.(( I) = h
n = I, 2, . . .
satisfies the boundary conditions h( - I) = a and f,,( I) = b, and we obtain
J(/,,) -. 0
as 11 -. T.;.
Consequently, the infimum of the integral in (48) is equal to zero. However,
there is no CI-function which solves the minimum problem (48). In fact, if f
is such a function, then it follows from (48) that xf'(x) = 0 on ] - I, 1 [. This
implies _f = constant. But this contradicts the boundary condition I( - 1) i=
{( 1 ).
Because of this. Weierstrass' essential result is as follows:
(a) Each rariat;onal prohle,n \\'hich i.') bounded belo"' possesses an infimum and
"etlce n,i"i,nal seqlle"('e. eX;.t.
(b) Ho"'erer, those minimal sequences do not necessaril}' conl'erge to a sol,,';on
oj' the l'tlr;at iOPlal prohlenl.
As the Dirichlet principle could not at first be justified, one looked for other
methods to save Riemann's theory. To this end, Schwarz (1843-1921), C.
Neumann (1832 - J 925). and Poincare (1854- 1912) developed special methods
to solve the Laplace equation without using the Dirichlet principle.
18.7f. Hilbert's Justification of the Dirichlet Principle
In his Paris lecture in 1900 Hilbert formulated his famous twenty-three
problems, which have substantially innuenced mathematics in this century.
18.7. The History of the Dirichlet Principle and Monotone Operators
49
On this subject we recommend two anthologies by Aleksandrov (1971) and
Browder (1976). In connection with the twentieth problem Hilbert said:
The sophisticated methods ofSchwa C. Neumann, and Poincare essentially
solved the boundary value problem for the Laplace equation. However, these
methods cannot be directly extended to more general cases. . .. I am convinced
that it will be possible to get these existence proofs by a general basic idea.
towards the Dirichlet principle points. Perhaps it will then also be possible to
answer the question or whether or not every regular variational problem pos-
sesses a solution if. with regard to boundary conditions. certain assumptions are
fulfilled and if. when necessary. one sensibly generalizes the concept of solution.
At this time Hilbert had already found a possible means of saving the
Dirichlet principle. He first presented a rough idea in a lecture in 1900, and
then in greater detail in a later paper in 1901 (cf. Hilbert (1932), Vol. 3, pp.
10-37). Hilbert's idea was as follows:
(i) one constructs a minimal sequence; and
(ii) one shows that this minimal sequence contains a converging subsequence
whose limit provides a solution to the original variational problem.
This is a typical compactness argument. In the second volume of the standard
work of mathematical physics by Courant and Hilbert (1937), one finds the
idea (i). (ii) worked out.
In today's terminology one works in Courant and Hilbert (1937) with dense
subsets of the Sobolev spaces
Wl(G) and W 2 '(G),
where these subsets consist of piecewise smooth functions. The decisive com-
pactness argument is based on a compactness theorem of Rellich (1930). which,
in today's terminology, implies that the embedding
W 2 1 (G) c: L 2 (G)
is compact.
In their fundamental paper on difference methods. Courant. Friedrichs, and
Lewy (1928) had already emphasized the advantages of using generalized
solutions of partial differential equations. The concept of generalized solutions
is also stressed in Courant and Hilbert (1937).
In Part III we will formulate two general functional analytic existence
principles for extremal problems, namely:
(a) a compactness principle which generalizes (i) and (ii) above (Chapter 38);
and
(b) a convexity principle which is based on the Hahn- Banach theorem, i.e.,
on the separation of convex sets in B-spaces by hyperplanes (Chapter 39).
Our functional analytic justification of the Dirichlet principle in Section
18.10 is closely related to (b), i.e., in order to give a very elementary existence
proof. we avoid compactness arguments.
50
1 M Variational Prohlems. the Rltl Method. and the Idea of Orthogonaht),
The Ritz method was developed by Walter Ritz in his post-doctoral thesis.
\\'hich he wrote in 1908 in Gottingen. Ritz applied his method, for example.
to obtain numerical results for the clamped plate.
A few more words about Hilbert. The great mathematical tradition which
Gauss founded in Gottingen was continued there by Riemann. Klein. and
Hilbert. On Felix Klein's initiative, Hilbert, at the age of thirty-three, was
appointed to a professorship in Gottingen. where he remained until his
death. In the 1920's Gottingen was the leading center in mathematics and
quantum physics. The r-"ascist seizure of power in t 933 also had the worst
possible consequences for Gottingen. The world famous mathematicians
Emmy Noether. Emil Artin. Paul Bernays. and Richard Courant were driven
a\\'ay. as \vell as the Nobel Prize winner for physics. Max Born. The number
theory specialist Edmund Landau and the Nobel Prize winner for physics
James Franck lost their positions. Hilbert's close personal friend Otto
Blumenthal emigrated to the Netherlands and was murdered in the concentra-
tion camp Thercslcnstadt in 1944. When the Nazi minister for education asked
Hilbert if mathematics in Gottingen was nourishing, Hilbert replied:
"There is no mathematics in Gottingcn anymore."
Hilbert died at the age of ninety-one, in t 943. Scarcely a dozen friends
accompanied the most important mathematician of this century to his final
resting place. The words on his gravestone comprise his epistemological credo:
We must kno\\'.
We will know.
18.7g. The Work of Tonelli
The ideas of Hilbert for justifying the Dirichlet principle were substantially
developed hy the Italian mathematician Leonida Tonelli (t 885 - 1946). The
general compactness principle (a) mentioned in Section 18.7f above is based
on the work of Tonelli.
In 1884 Volterra had proposed the concept of functionals in the real field. and
he considered the integrals of the calculus of variations as typical examples of
functaonals. The Ideas of Volterra I \\'ere actively cultivated by Arzela in Bologna
(ltalyt. It \\'as Arzela who tried unsuccessfully to obtain a direct proof of the
eistence o( the minimum for the Dirichlet integral. Hilbert in 1900 obtained
such a direct proof. but his proof relied on .pe(';f;c properties or harmonic
functions. The ideas were follo\\'ed with enthusiasm by Beppo Levi. Lebesgue.
and Fubini.
In 1905 Balre had introduced his concept of semicontinuity for real (unc-
tions sho\\'ing that. in a compact domain. lower semicontinuous functions had
I VolterrdflR60 1 940t. Ar/ela(1R47 1912). Ac;coh(1RR7 1957). Beppo le\'i(1R75-1961). Fuhlni
flH79 1943t. Lchcue (l87 -1941). and Bairc (1874-1932).
18.1. The Hitory of the Dirichlet Principle and Monotone Operators
51
an absolute minimum. and upper semicontinuous functions had an absolute
maximum.
Tonelli became acutely aware that a theory was needed for the direct existence
proof of maxima and minima of general problems of the calculus of variations.
Tonelli realized that in the Ascoli-Arzela compactness theorem and Baire semi-
cont;'lu;ty transferred from real functions to the functionals of the calculus of
variations. he had the perfect tool for a "direct method in the calculus of
variations" based on semicontinuity for an existence theory of maxima and
minima. By this it is meant. as it is meant today. that to prove the existence of
the absolute minimum of a functional F in a class C:
(i) one has to pro\e first that the functional u...... F(u) has a finite infimum on
C (i.e.. F is bounded below on C);
(ii) that the functional F is lower semicontinuous with respect to some type of
convergence; and
(iii) that there is some minimizing sequence which converges in the stated type
of convergence toward an element Uo in C. and which then realizes the
absolute minimum of the functional F in C.
The type of convergence is not assigned, but at the time. and for many more
years. the natural convergence was the uniform convergence of the trajectories.
Indeed. when very young. Tonelli proved in 1911 the existence of the absolute
minimum for general positive regular integrals of the calculus of variations over
parametric continuous rectifiable curves in A Z . The regularity was only a strong
form of co'n'e.it)'. actually a strong form of the Legendre and Weierstrass
conditions. and this assumption implies the lo,,'er sem;continllity of the integrals
with respect to uniform convergence. The realization was reached that the
classical Legendre and Weierstrass conditions are nol so much conditions for a
minimum or a maximum, but conditions for the lower or upper semicontinuity
of the integral. For parametric integrals the conditions. required by Tonelli at
the time. are expressed today by requesting only continuity and convexity.
In 1914-1 S Tonelli proved the existence of the absolute minimum for ordinary
curves in HZ (i.e.. nonparametric). namely, absolute continuous functions. and
under conditions at the time. which are expressed today by requesting only
continuity. convexity. and coercivity.
After World War I. Tonelli collected the new ideas. and the impressive amount
of work he had accumulated in so few years. into a complete theory. which
appeared as his main opus. the Fondamenti di Calcolo delle Varia:;on;. a complex
of 1150 pages. two volumes. in 1921 - 23.
Lamberdo Cesari ( 1987)
18.7h. Further Development
Since the 1930's one has systematically worked on the construction of a
general theory of elliptic differential equations. The Sobolet. .fipaces go back to
Friedrichs (J 934), who worked with the space W 2 1 (G) and other energetic
spaces in connection with elliptic equations, and to Sobolev (J 936). (1937).
(t 938), who studied systematically the spaces W,"'(G) together with his associate
Kondracov and who applied his results to hyperbolic equations.
Friedrichs recognized the fundamental importance of estimates of the form
tC)
(Aulu) > c lIull 2
for all u e D(A)
52
I H Variational Problems. the Ritl Method. and the Idea of Orthogonality
for the functional analytic existence theory for elliptic differential equations.
In the case (' > 0, condition (C) means the strong monotonic;ty of the linear
symmetric operator A: D(A) c: X -. X on the real H-space X. The theory of
the Friedrichs extellsioll will be studied in detail in Chapter t 9.
The most general formulation of the notion of a generalized solution of
partial differential equations can be obtained from within the framework of
the theory of distrihllt;()lJs. which was created by Laurent Schwartz (1950).
I n the 1950's and 1960's the theory of linear elliptic differential equations
\\'as formally completcd. By using interpolation theory, one was able to ex-
plicitly characterize the function spaces. which have the crucial property that
linear elliptic differential operators represent IJon,eomorpIJisins between those
spaces. .or exam pie, let us consider the boundary value problem
- u = r on G.
f49)
" = {IOn tG.
,,'here G is a bounded region in R J with a smooth boundary tG. If we write
(49) in the form
L 1# = ( f g),
then the operator
L: Yl (G) -.. L 2 (G) x Y/ 2({G)
is a linear homeomorphism. In particular. this means that for the givcn
functions
, E L 2 (G)
and
g E 'i\ 2(l'G).
the boundary value problem (49) has a unique solution u in the Sobolev space
W/(G). Roughly speaking. the functions in the space W 2 2(G) have square-
integrable generalized second derivatives. Note that this final result is based
on the use of Sobolev spaces Y2'"(tG) of the fractionat order m = 3/2. The
precise definitions \vill bc given later on. A dctailed study of interpolation
theory and its applications to partial differential equations can be found in
the monographs by lions and Magenes f 1968), and Triebel f 1978).
The de\'elopment of the theory of partial differential equations in this
century can be characterized by the following headline:
t PI) f.ro", ;lIte/'.(l1 operators to di.llerential operat{)r..
We want to explain this. The theory of integral equations. formulated by
Fredholm and then by Hilbert around 1900, had its origin in potential theory.
The basic idea was as follows:
(i) In order to solve the boundary value problem (49) in R 3 with r = O. we
look for the function II in the form of a douhle layer potential
II(X) = f D(x. y)l'( y) dO
,. (i
18.7. The History or the Dirichlet Principle and Monotone Operators
53
with charge density p and the kernel
I c I
D(x'Y)=- 4 --;- 1 I '
n (In, Y - x
where c/cn, denotes the outer normal derivative on the boundary cG with
respect to )' (cr. Chapter 83 on electromagnetism).
(ii) To determine the unknown charge density p, we use the jump conditions
for the double layer potential at boundary points. This way the boundary
value problem (49) is transformed into the following integral equation for
p.
· g(:c) = tp(:c) + f D(x, },)p(}') dO.
t'G
Right up to the 1930's one frequently tried to solve differential equations
(D)
Au =f
by a passage to the integral representation
(I) u = A -If,
where the kernel of the integral operator A -I corresponds to the Green
function K of (D), i.e., equation (I) has the form
u(:c) = t K (x, y)f(y) d)'.
For example, if equation (D) corresponds to the boundary value problem (49)
with g = 0, then
I
K(:c, y) = 4 I I + H(:c, .\,).
ny-x
(K)
where, for each }' e G, the function x H(x. }') is uniquely determined by the
following boundary value problem:
AxK(x, )') = 0 for all x e G with . }',
K(x,y) = 0 for all x e eG.
Note that problem (K) is independent of the function f
This method led to many difficulties in more complicated elliptic problems
than (49). Therefore. one began. in the 1930's. to study differential operators
A: D(A) X -+ X
as independent objects in Hilbert spaces. Kurt Otto Friedrichs (1934) was a
pioneer in this field.
The theory of pseudodifferential operators and Fourier integral operators
emerged in the 1960's and 1970's; it combines the theory of linear integral
operators and differential operators. In this connection. one uses the Fourier
transform and the theory of distributions. We refer to the four-volume exposi-
tion of Hormander (1983).
54
'R Vanallonal Problems. the Ritz Method. and the Idea of Orthogonality
18. 7i. Monotone Operators
An operator A: D(A) c X -. X on the real H-space X is called nJO'lotone iff
(M)
(All - Arlll - r) 0
for all II, r E D(A).
Furthermore, the operator A is called strictly monotone iff
tAu - Al'lu - I') > 0
for all u, [' E D(A) with U:F r.
Finally.. the operator A is called strongly monoto'Je iff there is a constant c > 0
such that
(.4 II -- .4 r III - r) > c 1111 - (' II 2
for all II.. r E D(A).
If the operator A is linear.. then the monotonicity condition (M) above is
equivalent to the simpler positivity condition
tAu Ill) > 0
for all U E D(A).
Let X be a real B-space. Then the operator
A: D(A) 5; X --+ X.
is called monotone iff
< All - A l\ U - r) > 0
forall U,l'ED(A).
Recall that < r. u) means the value I(u) of the linear functional f E X. at the
point II. Furthermore, A is called strictly monotone iff
<Au - /1I".1l - r) > 0
for all II.. r E D(A) with u:;: r,
and A is called strongly monotone iff there is a c > 0 such that
(A u - A r, II - r) > c 1111 - l'1I 2 for all u, l' E D(,4).
If X is a complex H-space (resp. complex B-space), then, in the monotonicity
definitions above, we have to replace
f 'I' )
with
Rc( '1' )
tresp, (,..) with Re('.' »). For example, the operator A: D(A) c X -+ X on
the complex H-space X is called monotone iff
Re(Au - Al'U - (.) > 0 for all u, I' e D(A).
Similarly, the operator A: D(A) c: X -+ X. on the complex B-space X is called
monotone iff
Re(Au - Av,u - 1') > 0
for all u, l' E D(A).
Even in the 1930's. the notion of the monotone operator was implicitly
hanging over the theory of linear elliptic differential equations. One also
encountered this notion in the 1950's when investigating linear parabolic and
hyperholic differential equations by means of the theory of semigroups.
18.7. The History of the Dirichlet Principle and Monotone Operators
55
A further source for the theory of monotone operators is the classical calculus
of variations. In Section 18.11 we shall show that the classical Legendre
condition for minima is closely related to the convexity of the variational
problem and hence to the monotonicity of the corresponding Euler- Lagrange
equations. In order to explain the abstract basic idea, let
F: X ..... R
be a functional on the real 8-space X. Then the minimum problem
(P) F(u) = min!, u e X,
corresponds to a classical variational problem. If u is a solution of (P), then u
satisfies the operator equation
(E)
F'(u) = 0,
which corresponds to the classical Euler- Lagrange equation. The point is that
the convexity of the functional F implies the monotonicit}' of the operator
F': X ..... X..
However, note the following very important fact:
The theory of monotone operators is also applicable to equations which are
not related to a t'ariational problem.
I n the 1960's one recognized that the concept of monotone operators allows
us to deal uniformly with many apparently different problems in a very elegant
manner. A list of such problems is given in the Preface.
In connection with the general principle (PI) in Section t 8.7h above, we
mention the following principle:
(P2) A priori estimates and compactness yield existence.
This principle was used by the Russian mathematician Bernstein I at the
beginning of this century and by Leray and Schauder in the 1930's, in order
to obtain existence proofs for linear and nonlinear elliptic differential equa-
tions. This was discussed in the introduction to Chapter 6. The Leray-
Schauder principle (Theorem 6.A) represents the most fruitful functional
analytic formulation of (P2).
The theory of monotone operators allows us to free ourselves from the
restrictive compactness assumption in (P2). It turns out that (P2) can be
replaced by the following principle:
(P3) Coercivity and monotonicity yield existence.
In this connection the necessary a priori estimates follow easily from the
coercivity condition.
1 Bernstein (1880-1968). Schauder C 1896- 1943). and Leray Cborn 1906).
56
18. Vanational Problems, the Ritl Method, and the Idea of Orthogonahty
The interrelations between the calculus of variations, the development of
the 19th and 20th problem of Hilbert, and the theory of monotone operators
will be discussed in Section 18.19.
18.8. The Main Theorem on Quadratic
Minimum Problems
That's what it was really all about.
Faust
We consider the minimum problem
! a ( U. II) - b ( u) = m in!. " E U 0 + X 0 . ( 50)
In Section 18.10 below we shall show that the Dirichlet problem is a special
case of (50). We also consider the equation
a(II. (') = h(r)
for all (' e Xo.
( 50. )
Here we seek II E "0 + Xo. We make the following assumptions:
(H 1) Xo is a closed linear subspacc of the real H-spacc X.
(H2) The map a: X x X -. R is bilinear and symmetric. i.e., for all u, ", \\' E X
and 2. pER.
a(u + (It'. K') = 2a(u, K') + fJa(v, w),
a(u,l') = a(t', u).
(H3) There is a constant d > 0 such that
la(II. 1')1 tlllulllll'1i
(H4) There is a constant c > 0 such that
Cllllll 2 a(u.II)
for all u, I' EX.
forall ueXo.
(H5) The map h: X -. R is linear and continuous, i.e., for all u, l' e X and ,
fJ e R,
h(214 + pt') = h(u) + flb(t,),
Ib(l')f constllvll.
Theorem 18.A (Main Theorem on Quadratic Minimum Problems). Assume
(H I) through (H5). Let "0 e X he given. Then the original problem (50) has a
Ilnique solut ion. M oreOl'er. problems (50) and (50.) are equivalent.
The key to the proof of Theorem 18.A is the so-called parallelogram identity
II x + .r 11 2 + II x - )'11 2 = 211 X 11 2 + 211 J' 11 2 (51 )
18.8. The Main Theorem on Quadratic Minimum Problems
57
.\' - .f " + .t
o
.f
(a)
(b)
Figure t 8. t I
for all x, y e Y, where Y is a real or complex H-space. This follows from
IIx :t Yill = IIx 11 2 + (xIY) + (ylx) + IIYII 2 .
Figure 18.11 (a) shows the geometrical meaning of this identity. By Figure
18.11 (b) it is obvious that (5 I) generalizes the Pythagorean theorem.
PROOF OF THEOREM IS.A.
(I) Equivalent equation. We show that (SO) is equivalent to (SO.). To this
end. we set
F(u) = !a(u, u) - b(u)
for all u EX.
Moreover. for fixed u e X and fixed v E Xo. we set
(,O(t) = F(u + tv)
for all t e R.
Then
cp(t) = !t 2 a(v, v) + t(a(u, v) - b(v» + !a(u, u) - b(u).
Note that a(t'.l') > 0 for all v E Xo with v #= O. Thus, the original problem
F(u) = min!, u e Uo + Xo
has a solution u iff cp has a minimum at t = 0 for all v E X 0' i.e..
o = cp'(O) = a(u. v) - b(v) for all v E Xo.
(II) Uniqueness. Let u and ii be solutions of the original problem F(u) =
min!, u e U o + Xo. By (I),
a(u, v) = b(v) for all v E X o ,
a(u, v) = b( v).
Letting v = u - U, we get
ellu - ull 2 S a(u - u,u - u) = O.
Hence u = u.
58
I H V dnatlonal Problems. the Ritz Method. and the Idea or Orthogonality
(III) Energetic space. We set
tlllr)f. = afll. r),
I'" IF. = (u III ) 2
for all u, I' E X o.
By (H3) and (H4).
(' H II ji 1 < II II 1\ i < till II 11 2
forall lIeXoo
(52)
Ry definition. the ('II('rget;c space X f. is equal to the set X 0 equipped
with the so-called energetic scalar product ('1' )!" Since "Yo is a real
H-space with respect to the original scalar product ('1'), it follows from
(52 that .\'f. is a real H-space with respect to (". )[,
In fact. let fll,,) be a Cauchy sequence in X E . By (52), (II,,) is also a
("auchy sequence in X o. Since X 0 is an H-space, there is a II E Xc such
that illl" - (II! -+ 0 as It x.. By (52). we obtain 1111" - 1I11F. -+ 0 as n --+ x.
Thus. X f. is an J-I-space.
(IV) Existence proof. Letting '" = II - "0' the original problem (50) is
equivalent to the problem
G( ",} = min!.
,,'eXo.
(53)
where
Gf"' = !(,,'I "')f. - hi (,,')
for all ". E "'Y o .
and hi (n') = h(,,') - a(lI o . "'), By (52). there is a constant B such that
I h If'" ) I \I h 1111 "'II + d 111'0 It II"' II
< B II \"It I. for all '" e X 0 .
(I V -I) Minimal seq uence ("',,). Set
:r = i 11 f G ( '" ),
". E "n
Then there is a sequence ("',,) in Xo such that
Gf"',,) -+
as n -+ x.
"'or all n' e "\'0'
G (",) > II "'II i. - B II n'lI [.
This implies G(,,' -+ + x as Iln'II[.... x. Hence > - 7'_.
(IV-2) (-'auchy sequence, Adding -hit".,,) - h.(\\'",) to the parallelogram
identity (51) in X 1:'
/"'"Ili. + 11"'",11;' = ! II"'" - "'",IIi + ill"'" + "'",lli.
we obtain the key relation
( ", + \" )
(; (1\'") + G (1\'",) = ill\\'" - 1\'", II i + 2G -" -2 -- '"
> ! p "'" - "'"," + 2.
18.9. The Inequality of Poincare-Friedrichs
59
Since G(,,',) -. 2 as k -+ x, ("',,) is a Cauchy sequence in the H-space
X [. Thus. there is a ". e X E such that
",' -. ". in X
" E
as n -+ x.
This implies
G( "',,) -+ G( "')
as n -+ 'X.
Hence G(\,') = 2, i.e., \\' is a solution of (53).
o
18.9. The Inequality of Poincare- Friedrichs
In order to check assumption (H4) above in connection with the justification
of the Dirichlet principle in the next section, we will use the following inequality
of Poincare- Friedrichs:
C f u2dx s f t (D j U)2dx
(i G i-I
where C > o.
rorall ueCo(G),
(54)
Proposition 18.19. Let G be a bounded region in AN, N > I. Then (54) holds..
"'here tlte constant C depends on G.
PR()()F. We choose N = 2. The general case proceeds analogously. Let u e
Co (G). As in Figure 18.12. consider a rectangle R = [a.b] x [c.d] with G c
int R. Note that u vanishes outside G. Then
u(.'O = 1" u,,(.}') d}' for all (.'O e R.
The Holder inequality with I-trick yields
lu(. '1)1 2 (1" d Y ) (1" u:(. J)d}')
(d - c) 1 4 u:(. }')d}'.
rJ t
:t
L____
l ) f R
a
1
b
.
Figure t 8.12
60
I R Vanatlonal Problems. the Ritz Method. and the Idea of Orthogonality
By integration over R..
t 11 2 dx (d - c)2 t u dx.
o
Inequalities of the type (54) were considered by Poincare (1890) in a famous
paper on eigenvalue problems for the Laplace equation. Friedrichs (1934)
recognized that such inequalities are the ke)' to a functional analytic existence
theory for linear elliptic differential equations.
18.10. The Functional Analytic Justification
of the Dirichlet Principle
18.10a. The Dirichlet Problem
Let G be a bounded region in H 2 . We consider the Dirichlet problem
I f 2 2 d - .,
2 (i (u + II,,) x - min.,
II = Y on iG.
If we regard u(.. '1) as the vertical displacement of a membrane at the point
(.. '0.. then (55) corresponds to the principle of minimal potential energy.
Below \\'C will use the so-called energetic scalar product
f II If) f: = f ( u to + "" l'., ) d x .
G
Note that 2 -I (ulll)! corresponds to thc potential energy of the membrane.
By Proposition 18.6.. the corresponding Euler equation for (55) reads as
follows:
(55)
II = 0 on G..
(56)
u = g on iG.
J-.urthermore.. the generalized problem for (56) reads as follows:
f (II-:r + 11,,('.,) dx = 0 for all I' E C; (G),
(;
u = g on (''JG.
If II is sufficiently smooth.. then (55), (56).. and (57) are mutually equivalent by
Proposition' 8.6. Formally.. one obtains (57) by muhiplication 0((56) by t" and
subsequent integration by parts. Moreover. integration by parts in (57) yields
(57)
18.10. The Functional Analytic Justification of the Dirichlet Principle
61
the so-called weak problem:
L uAvdx = 0 for all v E Ccf(G),
u = 9 on aGo
Relation (58) says that the differential equation u = 0 holds on G in the sense
of the theory of distributions (cf. A 2 (62». Note that in (58) we do not assume
that u is differentiable.
(58)
18.10b. Generalized Derivatives
The point of departure for the definition of generalized derivatives is the
classical integration by parts formula:
L IID,vdx = - L (D,u)vdx for all v E C:(G), (59)
where U E C I (G). Here. D 1 = c/o and D 2 = 0/0", The simple trick is to set
w = D j u.
Then we obtain
L uDjvdx = - L wvdx for all v E C:(G). (60)
The point is that this formula also remains valid for certain nonsmooth
functions U and w.
Definition 18.20. Let u, W e L 2 (G) and suppose that (60) holds. Then K' is called
a generalized derivatit'e of u on G of type D i .
As in the classical case we write w = Diu.
Proposition 18.21. The generalized derivative K' = DiU is uniquel}' determined
up to the ('alues on a set of measure zero.
PROOF. Suppose that (60) holds for "'t W E Lz(G). This yields
L (w - w)vdx = 0 for all v E C:(G).
and the variational lemma (Proposition 18.2) implies that w(x) = w(x) for
almost all x e G. 0
18.1Oc. The Sobolev Space W 2 1 (G)
Definition 18.22. The space Wzi (G) consists exactly of all the functions u e
L 2 (G) which have generalized derivatives D,u e L 2 (G) for all i. Furthermore,
62
18 Varaational Problems. the Ritz Method. and the Idea of Orthogonality
we set
(ul")1.2 = fG (III' + Ilr + 1I,,1',,)d.'(
and lIull1.2 = (ulu):'2 2 .
(61 )
Proposition 18.23. 1.he space W 2 1 (G) together with (.1 .)1. 2 becomes a real
H-space in case "'e identify t,,'o functions whose l'alues differ only on a set of
Ineasure :ero.
PR(X)F. (-'rom (ulu)1.2 = 0 it follows that u = O. Hence (., . )1.2 is a scalar
product. In order to prove the completeness of W 2 1 (G), let (U,.) be a Cauchy
. '.1/1 G .
sequence In ,., 2 ( ), I.e.,
1111,. - 11,"111.2 < c
for all n, m > no(I:).
(62)
Hence (II,,) and (Dill,,) are Cauchy sequences in L 2 (G). This implies
II" -+ II in L 2 (Gt
From
and
Diu" -+ "'; in L 2 (G)
as n -+ x.
J II,.D;I" dx = - J (D;u,,)r dx
(i Ci
forall ("EC(G)
we obtain
L IlD j ,'dx = - L wj,'dx rorall "E C;(G).
Hence "'; = 0;11. This implies U E W 2 1 (G) and
lim 1/ U II - U III. 2 = o.
II-X
In this connection. note that
1111,. - III: f. 2 = lIu" - ull 2 + L IID;u" - D;uU 2 ,
;
where 11'11 denotes the norm on L 2 (G).
o
Definition 18.14. Let J{l2 1 (G) denote the closure of Co (G) in Wi (G).
Consequently, we have II E W 2 1 (G) iff u E W 2 1 (G) and there exists a sequence
(u,,) in C; (G) with
1I1l" - U 111. 2 -+ 0
as n -+ x,
i.e., U" -+ u in L 2 (G) and Dill" -+ D;u in L 2 (G) as n......x for all I. This IS
equivalent to the following two conditions:
(i) u E L 2 (G):
(ii) there exists a sequence (u,,) in C'(G), which is a Cauchy sequence with
respect to 11'111. 2' and we have ",. -+ u in L 2 (G) as n OCJ.
18.10. The Functional Analytic Justification or the Dirichlet Principle
63
Hence it is meaningful to say that the functions u in Wl<G) satisfy
the boundary condition
u = 0 on oG
in a generalized sense. This will be formulated more precisely in Theorem
21.A.
The space W 2 1 (G) is an H -space. since it is a closed linear subspace of
the H-space W 2 1 (G).
18.1Od. The Existence Theorem
We set
a(u, (') = t (u(' + u.,v,,) dx.
The following problem
a(u, u) = min!. u - 9 e W 2 ' (G) (63)
is called the generalized Dirichlet problem.
Obviously, problem (63) generalizes the classical Dirichlet problem (55),
since the condition u - g E W 2 1 (G) can be regarded as a generalization of the
classical boundary condition u - 9 = 0 on eG.
Theorem 18.B. For each given function 9 e W 2 1 (G), problem (63) has a unique
solution u E W 2 1 (G).
Corollary 18.15. For all l' e Co (G),
a(u, v) = o.
t uA('dx = O.
(64)
(65)
PROOF. We want to apply Theorem IS.A. To this end, we set
X = W 2 1 (G), Xo = W 2 1 (G). U o = g.
(I) By the Holder inequality, for all u, t. EX.
la(u,l')1 t r ID,ullD,vl d."C
1= I J G
it (t (D,U)2 dX) 1/2 (t (D,l,)2 dX) 1/2
S "u III. 2 It t' III. 2 ·
64
I R Variational Problems, the Ritz Method. and the Idea or Orthogonality
(II) By the Poincare- Friedrichs inequality, for all t' E Co (G),
C f (.2 dx f L (Oi t ,)2 dx.
G G ;
Adding C JG L; (0;1.)2 dx, we obtain
c f ( 1'2 + L (0;t')2 ) dx < f L (D;l")2 dx
GiG i
for all(' E C <> (G).. where c = C;'( I + C), i.e...
eliI'll i. 2 < ah., 1') for all l' E c; (G).
f III) Density argument. Let" E X o. Then there is a sequence (v,,) in C; (G)
such that 1111 - 1',,11 I. 2 -+ 0 as n -+ oc. Hence
f" -. II in L 2( (i) and D;l'" -+ D;u in L 2 (G) as n -+ .
I-'rom cllr"IIf.2 < a(l'", I',,) we get, as n -+ 00, the key inequality
('lIull:.2 s a(u, u) for all u E Xo.
Now Theorem 18.A yields Theorem 18.8 and Corollary 18.25. 0
In the proof of Theorem IS.A we introduced the energetic scalar product
(ull')f = a(u, t.) = f D;UOil,dx
G I
for all u, f E "'2 1 (G). The corresponding energetic space X I; is equal to the set
W 2 1 (G) equipped with ('1 . )E' Recall that X E is an H-space.
18.10e. Regularity of the Generalized Solution
Corollary 18.26. A.(rer ("hanging tile ('alues of Ille solution u in Theorem 18.8 on
a set of mea,ure zero, if necessary, u belongs to C r (G) and \\'e have u = 0 on
G in the classical sense.
This follows from a more general result which we will prove in Section 18.15
(Lemma of Weyl). The smoothness of u on G, i.e., the smoothness up to the
boundary, will be considered in Section 22.21.
18.11. The Perpendicular Principle,
the Riesz Theorem, and the Main Theorem
on Linear Monotone Operators
The following results will be used freq uently in this volume. Our goal is to
underline the importance of the idea of orthogonality.
18.11. The Perpendicular Principle. the Riesz Theorem. and the Main Theorem 6S
18.11 a. Orthogonal Projection
We consider the minimum problem
II u - vII = min!,
ve M,
(66)
and make the following assumptions:
(H) Let M be a closed linear subspace of the H-space X over k = R, C, and
let u e X be given.
Figure 18.13 shows the geometrical meaning or (66), i.e., we seek the root v
of a perpendicular from the point u to the plane M. Let M 1 denote the
orthogonal complement to M, that is, by definition,
M 1 = {weX:(wlv)=OrorallveM}.
Theorem tB.C (Perpendicular Principle). Assume (H). Then the minimum prob-
lem (66) has a unique solution v, and u - ve M 1 .
Coroiliry 18.17 (Decomposition Theorem). If (H) holds, then there exists a
unique decomposition of u of the form
u=v+w, veM, weM 1 .
PROOF OF THEOREM 18.C. Letting w = u - v, problem (66) is equivalent to
(wlw) = min!, w e u + M. (66.)
(I) Let K = R. By Theorem 18.A, there exists a unique solution w or (66.),
and (wi v) = 0 for all v e M.
(II) Let k = C. Problem (66*) is identical to
a(w, w) = min!,
w e u + M,
where a(v, w) = Re(vlw) for all v, we X. We regard X as a real H-space
with respect to the scalar product (u, v)..-. Re(ulv). By Theorem 18.A, there
exists a unique solution w of (66*). This implies that
(wlw) s (w - lvlw - ;..v)
= (wlw) - )..(wlv) - I(vlw) + 1)..1 2 II vII 2
/ 0
u
M
Figure t 8.13
66
I K Vanatlonal Problems. the Ritl Method. and the Idea of Orthogonality
fOf all rEAl and i. E C. Letting i. = (rl"')/III'1I 2 . we obtain 1t"'lr)1 2 O. i.e..
(\\'Ir) = 0
for all l' e 1"".
o
PR()()f: (}I C()R(U.LAR Y 18.27. The existence of such a decomposition follo\\'s
ffom Theorem 18.C. To prove the uniqueness of the decomposition" let II =
j' + \f. r E .\1. ". E .\1 1 be a second decomposition of II. Then
o = (r - f) + (". - ".).
r - l' E .\1. '" - '" E J\ll. Hence tr - f'l.. - f-') = O. i.e.. l' = .;.
o
18.11 b. The Riesz Theorem
The geometrical meaning of the following theorem of Riesz is that in an
H-space. there is a one-to-one correspondence between nonzero linear con-
tinuous functionals and closed linear subspaces of codimension one.
Theorem 18.0 (RiesL (1934U. Let f: ..\' -+ IK he tllinear ('olui'UIOIIS Jitn('tio'ltll
O'J tlte H"'SPCl('f! ..\' orer = IR'I C. Tlle'J there exists Cl IIniqllely cleter"ri'Jecl
ele",ent r E "'\" SII( II t ha,
r( II) = (1'111)
.lor ,III II EX.
f67)
Corollar) 18.28. COllrer!ely. .I'or eaclt elenJelJt t. E X. equation (67) tlf!.Iines a
linear cont ;nIlOIlS .rullet ;olJal .l on ,,\, ,,'it II lI.r II = II (,'11.
In the case of the special space X = L 2 (G)" this fundamental theorem of
linear functional analysis was proved independently by the Hungarian
mathematician Fryges Riesz (1880 - 1956) and the French mathematician Rene
Maurice Ffechet (1878 1956) in the same issue of Volume 144 of COIJrptes
Rendlls (1907). In 1934. Riesz proved this theorem for abstract H-spaces.
PR(X)F ()f- TIU-()Rf- t 18. D,
(I) LI nlqueness. It follo\\'s from
(rill) = O'lu)
for all II E X
that (,. - rill) = 0 and hence (I' - i'll' - i') = 0 . i.e.. r = f .
t II) Existence.
(II-I) Let [K = iRe By Theorem 18.A" the minimum problem
}(rlr') - .f(l') = min!" l' E X"
has it unique solution r. and (1'111) = f(ll) for all II E X. This IS the
assert ion.
,11...2) Let IK = C. Suppose that .r =F- O. The null space
N=:ueX:f(II)=O}
18.11. The Perpendicular Principle, the Riesz Theorem, and the Main Theorem 61
is a closed linear subspace of X. By the perpendicular principle (Corol-
lary 18.21), there exists an element U o E N 1 with U o #: 0, since N X.
We may assume that I(uo) = 1. For each u E X,
I(u - I(u)u o ) = 0,
i.e.. II - f(u)uo e N. Hence we obtain the orthogonal decomposition
(D) II = \\' + /(U)I'o, \\' eN, 1'0 E N 1.
Scalar multiplication by Uo yields
(uolu) = 1(14)(u o ll' o ).
Letting t' = uo/llu o ll 2 , we obtain (61).
o
Relation (D) of this proof shows that the null space N of f is a closed plane
in X with dim N 1 = I, i.e.,
codim N = 1.
PROOF OF COROLLARY 18.28. The Schwarz inequality yields
Ih'lu)1 S IIt'lIlIull for all u E X.
Hence lI.fll S IIt'll. Because I(t") = 11('11 2 , we obtain 11.(11 = IIl'lI.
o
18.lle. The Duality Map
Let X. be the dual space of X, i.e., X. consists of all linear continuous
functionals I: X -.. K, where X is an H-space over I< = R, C. We define the
so-called duality map
J:X-..X.
by J(r) = .r with
1(11) = (l.lu)
for all II EX.
By Section 18.11 b, the map J is bijective and isometric, i.e.,
IIJ(r)1I = IIt'li for all rEX.
Moreover, J is antilinear, i.e., we have
J(21' + p,..,) = 2J(t.) + pJ(,...)
for all t\ ". E X and 2, p E K. Here, the bar denotes the conjugate complex
number.
Let X. be the space of all antilinear continuous functionals g: X -.. K. Set
1(t") = I(t'). If f e X., then 1 e x.. Conversely, if g e X., then g e X.. We
define the so-called antidllalit y map
J: X -+ X.
68
I H Vanatlonal Problems, the Ritz Method" and the Idea of Orthogonality
by )(1') = y with
Y(II) = (IIII')
for all U EX.
Obviously. J(r) = J(r). The map J is linear, continuous, and isometric.
In the case of a real H-space X.. we have X. = X. and J = J. i.e., the duality
map J is linear.
18.11 d. Equivalent Existence Principles
Tire .1()II()u'i"J three existellce principles are "ultllally equil'alellt:
fi) The eiste"ce principle for quadratic IP,initnuIP' prohlen,s (Theore,,. 18.A).
(ii) Tire perpendicular principle (Tlleoren, 18.C).
(iii) The Riesz t"eorel1' (T"eoren, 18.0).
These three principles represent the idea of orthogonality in H-spaces.
Above we proved that
(i) (ii) (iii).
Thus. it remains to show that (iii) (i).
To this end.. we consider the minimum problem
a(u.. u) - h(u) = min!.. u e "0 + X 0'
as in Section 18.8. By the proof of Theorem 18.A, this problem is equivalent
to the problem
( I' If) I:. - hi (I') = m in!. 1" E X f...
where hi: X f: --+ IR is linear and continuous. By the Riesz theorem there is an
element U' E X £ such that
hi (r) = (" 'Il' ) f: for all rEX £.
Thus we obtain the equivalent problem
!( r - \"Ir" - ".)£ = min!,
which has the unique solution r = "'.
rEX f..
18.11 c. Application to the Main Theorem on
Linear Monotone Operators
Theorem 18.E. Let B: X --+ X be a linear continuous operator on tile H-space
X or'er K = R. C. Suppose that there is a c > 0 such that
(C")
I( BIll u ) I > c II U 11 2
forall ueX.
Then, lor each gil'etJ f EX. tile operator equation
Bu = J, U E X
htls a unique soillt ;0'1.
18.11. The Perpendicular Principle. the Riesz Theorem. and the Main Theorem 69
For example, condition (C) is satisfied iC the operator B is strongly monotone,
i.e., there is a c > 0 such that
Re(Bulu) cllull 2 for all u e X.
The following proof will be based on the perpendicular principle.
Theorem t8.E was proved by Lax and Milgram (1954). It is called the
Lax-Milgram lemma. Applications to elliptic boundary value problems will
be considered in Section 22.6.
PROOf.
(I) Uniqueness. Suppose that BU I = BU2 = f. Set u = U l - U2' Then Bu =
o implies cllull 2 S I(Bulu)1 = 0, and hence u = O.
(II) Existence.
(II-I) Closedness of the range R(B) of B. By the Schwarz inequality, for all
u eX,
cllull 2 s I(Bulu)1 s II Bull Jlull.
Thus, we obtain the crucial a priori estimate
cRul S RBul for all U E X.
This implies the closedness of R(B). In fact, let
Bu" .... v
as n.... oc>.
Thus, (Bu,,) is a Cauchy sequence. Now it Collows Crom
lIu" - u",11 S c- I I1Bu" - Bu",11
that (u,,) is also a Cauchy sequence. Hence there is a U such that
u" .... u
as n.... oc.
The continuity of B implies that Bu = v, i.e., v e R(B). Consequently,
R(B) is closed.
(11-2) We show that R(B)l = to}. To this end, let v e R(B)l. Then
(Bulv) =0 Corall ueX.
Choosing u == v, we obtain ell vII 2 I(Bvl v)1 == 0. i.e., v = o.
(11-3) By the perpendicular principle (Corollary 18.27), the closedness of R(B)
and R(B)l = to} imply R(B) = X. 0
Theorem I8.F (Main Theorem on Linear Monotone Operators). Let A: X ....
X. be a linear continuous operator on the real H-space X. Suppose that A is
strongly monotone, i.e., there is a c > 0 such that
(Au, u) c lIull 2 for all u e X.
Then, for each given b e X., the operator equation
Au = b, u e X
has a unique solution.
70
I N Variational Prohlems. the Rltl Method. and the Idea of Orthogonality
PR()OF. Using the Riesz theorem. we will reduce Theorem 18.F to Theorem
I R. E. Recall that A". denotes the space of all linear continuous functionals on
X. and (All.. I') denotes the value of the functional Au E X. at the point r.
(It Operator B. r:ix II E X. By the Riesl theorem (Theorem 18.D). there is
an element \t' E .t such that
(.411. r) = (,,'h')
r 0 r a II r E .\'.
and Ii \"1' = :1.4111. We set BI, = \\'. Then
II BIIII = II A II II < ",4" II lilt .
No"' consider B" for all II EX. Then the operator B: X --. X is linear and
continuous. Moreover. we have
( 8" III) = (A II. II) > c II U III
forall ueX.
Note that (811111) is real. since X is a real H-space.
(II) Again by the Ries7 theorem. there exists an element .r E X such that
<h. I') = ( f'r)
for all rEX.
(III) Equivalent operator equation. The original operator equation Au = b..
II E o't. is equivalent to
<Au.r) = <h. I')
for all t. EX.
that is.
( Bill r) = ( fir)
for all rEX.
Thi is equivalent to the operator equation
Bu = f:
IIEX.
which has a unique solution according to Theorem 18. E.
o
18.12. The Extension Principle and
the Completion Principle
The clasical Idca of approximation corresponds to dense sets in functional
analyc;ls.
I n order to simplify proofs. the following extension principle will be used
frequently. We make the following assumptions:
(H I) Let <it and Y be B-spaces over It( = R. C. The linear operator A: D(A) c
X -+ Y satisfies the inequality
hAul! s Cllu!!
for all u E D(A).
f68)
where (.. is a constant. i.e.. A is bou'1ded.
(H2) The set D( A) is a linear subspace of X which is den.e in X. i.e.. D( A) = '(.
18.11 Proper Subregions
71
Explicitly. the density of D(A) in X means that. for each u e X and each
I: > 0, there exists a point v e D(A) with
II u - t' II < E.
Proposition 18.19. Suppose that (H I). (H2) hold. Then:
(a) The operator A can be uniquely extended to a linear continuous operator
A: X... Y with (68) for all u e X.
(b) If A: D(A) s; X .-. Y is compact. then so is A: X ... Y.
PROOF. Ad(a).
(I) Existence. Let u e X and u f D(A). Then there exists a sequence (II..) in
D(A) with u" -+ u as n ... 00. In particular. (u,,) is a Cauchy sequence. By
(68 ).
II Au" - Au",11 s Cllu" - u",lI.
Hence (Au..) is also a Cauchy sequence, i.e.. (Au,,) converges. We define
Au = lim Au".
(69)
"-X
We have to show that this definition of Au is independent of the choice
of (u,,). To this end. let (v,,) be another sequence in D(A} with v".-. u as
n ... . Then
ItAu.. - Av,,11 s C lIu" - 1.,'..11 ... 0
as n... OC;.
Hence At'.. ... Au as n ... 00.
A passage to the limit shows that (68) holds for all u e X.
(II) Uniqueness. Each linear continuous extension A: X ... Y of A: D(A)... Y
satisfies (69). Hence the extension is unique.
Ad(b). Let (u,,) be a bounded sequence in X. Since D(A) = X. there exists
a bounded sequence «(',,) in D(A) with u" - t'" ... 0 as n -+ 00. Since A: D(A)... Y
is compact. there exists a convergent subsequence (At',,-). Hence (Au,,-) is
convergent. 0
In the context of Sobolev embedding theorems. the advantage of the ex-
tension principle is that it is sufficient to prove inequalities of type (68) for
smooth functions.
The completion principle is closely related to the extension principle. This
principle ensures that pre-Hilbert spaces (resp. normed spaces) can be com-
pleted to Hilbert spaces (resp. Banach spaces). This will be considered in
Problem 18.3.
18.13. Proper Subregions
Recall that A s; B and A c B means inclusion and proper inclusion, respec-
tively.
72
I R Variational Problems. the Ritz Method. and the Idea of Orthogonality
G
C H
Figure 18.14
Definition 18.30. Let II and G be regions in fR'''. Then H is called a proper
subregion of G jff
He G.
We write II c c G.
If G is bounded, then this definition means intuitively that a boundary strip
of G is outside H (Fig. 18.14). In the same way, the symbol H c c G will also
be used for arbitrary sets in R N . In particular.. if G is open and H is compact,
then H c G implies H c c G.
Definition 18.31. Let .f: DCf) c IR ft . --+ R be a function. Then the closure of the
set
{ x: .f(:<) # O}
in R'''' is called the support, supp .r. of r.
I n other "'ords, supp./- is the smallest closed set S in A'IV with .r = 0 out-
side s.
Definition 18.32. Let G be a region in IR'IV. A function u: G -. IR belongs to the
space Lp,'oc(G) iff
U E L,(H)
holds for all compact subsets H of G.
Obviously.. we have
Lpt G) c L,.,oc( G).
and the integral J (j ur d.'( exists for all
II E Lp,'oc(G).. I s p Cfv,
I p x,
and all t' E C(G).
18.14. The Smoothing Principle
It is typical for modern analysis to work with nonsmooth functions (e.g.,
functions in Lebesgue spaces or Sobolev spaces). In order to simplify many
investigations. nonsmooth functions u are frequently approximated by smooth
18.14. The Smoothing Principle
73
(2)
Figure 18.15
functions lie. where we set
U c = ScU.
(70)
In this section we want to study the properties of the so-called smoothing
operator St. As a special result, we shall obtain that Co(G) is dense in L,(G)
for I S P < x. Let £ > o.
The point of departure is the integral formula
uc(x) = f qJc(x - y)u( y) dy. (71)
JR
where cp((x) = t-.'Vcp(x/£). and cp has the following properties:
(i) cp e Co (R N ) and fP(x) = 0 for I.tl > 1.
(ii) cp 0 on R'v.
(iii) J R cp(x) dx = 1.
I n particular, we obtain
cp,(x) = 0
for Ixl t.
This implies:
The values of ut(x) depend only on the values of U on a ball
around x of radius t. (72)
If dist(x, sUPP u) > £, then u(.) = 0 (Fig. 18. t 5). (73)
A special choice is given by
( ) = { c ex P [( I x I 2 _ I) -1 ]
cpx 0
for Ixl s I,
for 1.1 > 1,
where the positive constant c is chosen in such a way that the normalization
condition (iii) above is satisfied. Fig. 18.16 shows cp, in case N = 1.
Proposition 18.33 (Smoothing Operator). Let G be a nonempty open set in R N
\\'ith N 1 and let
U E L,(G).
I S P < 'X.J.
74
18 Vanatlonal Problems. the Ritz Method. and the Idea of Orthogonality
.
I Y ,
t
-l
E.
Figure 18.16
Ye set u(x) = 0 olltside G. Tllell:
(a) S(II E CY(fR") and IIS(ulip < Itullp./llr all t; > O.
(b) S,II u in lI'(G) as E; + O.
(c) If sUPP II i.ti a cOI"pact subset o.f G, tltell
S,u E C (G)
for all 0 < t; < 1: 0 ,
",here 1:0 is the distallce het"'een tile boundary of G and supp u.
(d) Let the (unction II he cont;nuolI. on an open neighborhood of the contpact
.llhset ( (f (i. Theil "'f obtain the u'tiform convergence
S,II ::: II on (oo
as E; + O.
(e) Let I p < x. 7"lIell statenlent (a) renlains true. Moreor-er,
(S(II)(X) u(x)
as r. + 0
for almost all x E G.
PR(X)f. Ad(a). Let U E L p ((;). I p < x. Since tp is C'" , the function U r in (71)
has derivatives of arbitrary order. In this connection note that the function u
is integrable on bounded open subsets of G, and hence we have majorants at
hand (cr. A 2(25)). This implies II, E ex (AN).
We set J = J\. 'rom (71) we get
1I.lx) = f cp(:)II(X - r.z) dz.
If p == x, then
1111. I! " < IIlIlIox f cp(z)dz = 111111",..
Now let I < p < x and p -I + q -I = I. Use <p u = tp t/ 4 ( <p Il,u). By the Holder
Inequality.
11I.(x)jP f III(x - E:zWcp(z)dz,
noting that (J tp(z) dZ)"'4 = I. By the Fubini theorem A 2 (23).
flll'X - /;zl!"cp(z)dzdx = f cp(Z)(flll(X - r.ZWdX)d:
= f cp(:) lIulldz = lIull.
Hence 1111,11" < 111111 P'
18.14. The Smoothing Principle
75
Ad(b). Let B = {}' EA....: 1.\'1 < I}. It follows from (71) that
uA) = f, u(x - Ey)cp(Yldy
and hence
II,(X) - u(x) = f. (u(x - ty) - u(x)lcp( y) dy. (74)
The Holder inequality yields
IU,(xl - u(x)IP d f. lu(x - r.y) - u(x)IP dy.
where d is a constant. By the Fubini theorem A 2 (23).
t ju,(x) - u(xW d. d f. (t lu(x - cy) - lI(xll P 'Ix) dy.
The right-hand side goes to zero as £ -+ + 0 according to the p-mean continuity
of the function U E Lp(G) (cf. A 2 (32)).
Ad(c). This follows from (73).
Ad(d). This follo\\.s from (74) and the uniform continuity of u on an open
neighborhood of C.
Adfe). The validity of (a) for p = x has been proved in fa) abo\'e. Let
Br = :.\' E R : I.r - x I < E::. Then
f ( X - ,, )
lu,(x) - u(xll = t - \' B. cp - (lI(yl - u(x)) Ily
< E-'''C r lu(yl - u(xll dy -+ 0
JB,
as E; --+ 0 for almost all x E G. by the Lebesgue theorem A 2 (25e). 0
Corollary 18.34. Let 1 < P < x and let G be a noneIJ'pt)' open set in R". N > 1.
Then Co (G) is dense in L,,(G).
PROOF.
(I) Let G be bounded and H c: c: G. Let II E L,,(G). We set
{ U(X) on H.
r(x) =
o on G - H.
Then
f IU-ttl"dx= f lul"dx.
G G-H
By the absolute continuity of integrals A 2 (20). the right integral is arbi-
trarily small provided the measure of G - H is sufficiently small. Thus.
76
I Vanatlonal Problems. the Ritz Method. and the Idea of Orthogonality
"'C can choose the set H in such a way that
II II - r II p < t.
Ry Proposition 18.33( b) (c) there is a \\' E C (G) with II r - "." p < I:. Hence
t1" - "'II r < 2,:.
(II) Let G be unbounded. Then for each ,; > 0, there exists a sufficiently large
open ball B such that
f lulP tl.'( < 1:,
(i-II
\vhere II = G n B. Now we can use the same argument as in (I). 0
We no"' consider the derivatives of U , . In order to collect all the important
properties of the smoothing operator in one place, we also investigate the case
that II has generalized derivatives. The definition of D'u can be found in
Section 21. I.
Proposition 18.35 (Derivatives). LeI G he a 110l1enJply bounded opel1 set ;11 R''',
N > I. Let II e LI'.loc( G), I p < x. Suppose that Ihe gel1eralized der;t'at;t'e
DII exists ".;,11 DII e LI'.'oc(G). Tllen:
(a) f.or all I; > 0 alld all '( E G "';111 dist(.'(, (G) > f... ,,'e hat'e
D'(SfU)(X) = (SfD2 11 ){X).
fht If H c c (i alld H i, "Ieasuruhle tllell
lim ilD 2 (S,u) - D2Ulll.pfH. = o.
, -.0
(75)
(76)
(c) Let (' he a ('o"ll'act suhset 0.1 G. If Dll is ('onli'UIoUS 011 G, then "'e hat'e the
lilt; (or"J COIIl'eryence
D 2 tS, u) ::: DlI 011 C
as I: --+ + o.
f d) J f G = IR ", t 11£1'1 f 7 5 IIold$ .ft)r all '( E IR'" and all t > O.
PR(X)J-. Ad(a). Note that
( ('
.. lp,( - y) = - lpc(x - y).
( '( cy
By differentiation and integration by parts, it follows from (71) that
WIS,II)(X) = L (-I)'J'D;tp.(x - Y)II(y)dy
= L tp.(x - y)D 2 u(y)dy.
This is (75), Note that, by the definition of generalized derivatives, the integra-
tion by parts formula remains true for generalized derivatives.
Ad(b), (c), (d). These follow from Proposition J 8.33. 0
18.14. The Smoothing Principle
77
Proposition 18.36 (Generalized Variational Lemma). Let G be a nonempt y open
set in R N \vith N 1 and let u e LI.1oc(G). Then it follows from
L uvdx = 0 for all v e C(G) (77)
that u(.) = 0 for almost all x e G.
Corollary 18.37. If (77) holds for all v e CO=(G) with JG I' dx = 0, then u(x) =
constant for almost all x e G.
PROOF. Since each function v e Co(G) vanishes outside a bounded set, we can
assume that G is bounded.
(I) Let H c c G. We set
( { sgn I u(x)I on H,
",' x) = 0
on G - H.
We shall show that
L uwdx = O. (78)
This implies JH lul dx = 0, i.e., u(x) = 0 for almost all x e H. Since H is
arbitrary, we obtain u(x) = 0 for almost all x e G.
(II) We prove (78). Let WIt = 5 1 ,,,\\' for all sufficiently large n no. From (71)
it follows that
sup I w,,(x)1 S sup I w(x)1 S I.
.x.G .x.G
(79)
By Proposition 18.33, we have WIt e q(G) for large nand \v" -+ \\' in L 2 (G)
as n -+ 00. Thus, there exists a subsequence, again denoted by (w,,), such
that w,.(x) -+ \\'(x) as n -+ for almost all x e G (cr. A 2 (36»). By (77),
f uwdx = lim f uw"d. = o.
G ,,- G
In this connection, use (79) and the principle of majorized convergence
A 2 (19). 0
PROOF OF COROLLARY 18.37. Fix h e Co(G) with JG hdx = I. For ,,' e Co(G),
we set
v(x) = w(x) - h(x) t w dy.
Then JG vdx = O. By assumption, JG uvdx = 0, i.e.,
t (u - C)wdx = 0 for all we q(G),
where C = JGuhdx. Proposition 18.36 yields u = c.
o
78
I X. Variational Problems. the Rltl Method. and the Idea of ()rthogonality
18.15. The Idea of the Regularity of Generalized
Solutions and the Lemma of Weyl
We consider the Laplace equation
Il = 0 on G.
(80)
Multiplication of (80) by r E C (G) and integration by parts yield
f 11£\1' dx = 0 for all I' E CO (G). (81)
Each function II e LI. ,Ot ( G) with (81) is called a ,,'eak solution of (80). Obviously..
each classical solution II e C 2 (G) of(80) is also a weak solution. The following
famous so-called Lemma of Weyl tells us that the converse is true. This is the
prototype of regularization theorems. More general results in this direction
will be considered in Chapter 22.
Theorem 18.G (Weyl (1940)). LeI G he " hou'J(/ed reyio'l in n..... N > I. Suppose
tllat II E LI.1o\:(G) is a \\'eak solution 0.( tile Laplace eqllation (80).
Then. a.fter cha"gi'lg the ('allies of U 011 a set oJ' "Ieasure zero.. iJ' 'Jecessary, II
is 'I classical snilltion of (80) ,,'ith u E C-r (G).
PR()()f-, We shall show that the weak solution u has the mean value property..
I.C '.
II(X) = _1- i u( r) dr
JI ..
8,.'.t)
(82)
for all x e G and all sufficiently small r > 0.. where B,(x) is a ball around x of
radius r. and V is the volume of B,(x). By a well-known classical result.. it
follows from (82) that u is a classical solution of (80) and U E cx. (G) (cf. Courant
and Hilbert (t 953.. M ).. Vol. 2).
Step I: The special case II e C(G).. i.e... suppose first that u is continuous.
Let N > 2. The simpler case N = I proceeds analogously.
(I) Choose a fixed ball Bft(x). For 0 < E; < R.. set
{ constant
cp(r) = 0
for r t..
for r > R..
and extend lfJ to a function in C7 (R.). Furthermore.. let
('Lr) = fP( Ix - .\'1)..
\\'(r) = r l -" f u dO.
tBr.xt
(83)
18.16. The Localization Principle
79
(II) We show that ", = constant on ]0, R[. Let
"'(r) = :r (r N - 1 tp'(r)).
Bya suitable choice of cp above, we may obtain each function'" E C(£, R)
which satisfies the normalization condition
f' "'(r) dr = O.
Using polar coordinates, it follows from (81) and (83) that
f' w(r) "'(r) dr = O.
By Corollary 18.37, \\' = constant on [£, R]. This is true for each suffi-
ciently small £ > o. Thus, there exists a constant C(x) depending on the
point x such that
\\'(r) = (7N-l C(x)
for all r E ]0, R[,
where (7N-1 denotes the surface measure of the (N - 1 )-dimensional unit
sphere. In (83.) below, note that V = (7N_lr N /N.
(III) We now prove (82). Multiplication of w by r N - 1 in (83) and subsequent
integration over [0, r] yield
C(x) = r lI(y)dy. (83-)
J 'r(x)
Letting r -+ + 0, we obtain C(x) = u(..).
Step 2: The general case u E LI.1oc(G).
We use the same argument as in Step 1. In this connection note the following
peculiarities. By the Fubini theorem A 2 (23), the integral corresponding to w(r)
in (83) is defined for almost all r. If we choose a fixed r, then the absolute
continuity of integrals A 2 (20) implies that x C(x) is continuous in (83.).
Finally, by the Lebesgue theorem A 2 (25e), for almost all., the right-hand side
of (83*) goes to u(x) as r -+ O. 0
18.16. The Localization Principle
In modern analysis. global results are frequently proved by localization. To
this end. one uses the following proposition.
Proposition 18.38 (Partition of Unity). Let C be a compact set in RN, N I.
Suppose that C is covered by a system {V,} of finitely many nonempty open sets
80
18 Variational Problems. the Rill Method, and the Idea of Orthogonality
V" Then there are .functions !Pi: R N -+ IR K'ith the folloK'ing t"'O properties:
(i) cP; E C; (Vi) .for all i,
(ii) L; cp;(x) = I for all x E C.
Motivated by (ii), {<Pi} is called a partition of unity. For given u: R N -+ R.
the funct ion
u. = In.U
I 'f',
is called a localization of u. By (i). U; vanishes outside a compact subset of Vi'
and by (ii),
II = L u i .
i
PR()()f". The main idea of proof is contained in (85) below. We may suppose
that the sets V; are bounded.
(I) We show that if is open with C C Vi' then there exists a function
t/Ji E C<f( Vi) with
1/1; = I on, 0 < I/Ii < 1 on Vi'
To this end, we choose an open set W with Jti C eWe c V; (Fig. 18.17).
Let
{ Ion W,
X( x) =
. 0 otherwise.
By Proposition 18.33, for sufficiently small r., the function 1/1; = SEX has
t he desired properties.
(II) For sufficiently small > 0, the sets
(t5) = {x E Vi: dist(x, (1 V; ) > t5}
cover C. Otherwise. there exists a sequence (XII) in C with
XII f U (n-I)
for all n.
(84)
v,
Figure 18.17
18.17. Convex Variational Problems. Elliptic Differential Equations. and Monotonicity 8 t
The set C is compact. Hence there exists a subsequence, again denoted
by (XII)' such that XII ..... .'t as n ..... 00, and X e C. Thus, there is a j with
:< e U i , i.e., .'t e J'j(n- 1 ) for sufficiently large n. This contradicts (84).
(III) In (I) we choose = (<<S) for sufficiently small fixed «S > O. Then there
exists an open set Z with
c c Z c U .
i
By (I), Li "', > I on Z. As in (I), we construct a function'" e Co(Z) with
t/J = I on C. Then the desired functions Cf'i are given by
cp = { 'JIIMr., "', )-1 on Z, (85)
I 0 otherwise.
o
18.17. Convex Variational Problems,
Elliptic Differential Equations,
and Monotonicity
In this section we want to explain a number of important interrelations
contained in Figure t 8.18. In particular, we want to show that convex varia-
tional problems lead to elliptic differential equations. The definition of regularly
elliptic equations will be given below. According to Hilbert, a variational
problem is called regular iff the corresponding Euler- Lagrange equation is
regularly elliptic. Then the Legendre condition is fulfilled.
Note that there are elliptic differential equations which are not Euler-
Lagrange equations to variational problems. In Chapters 26 and 27 we shall
use the theory of monotone and pseudomonotone operators in order to treat
elliptic equations which are not necessarily related to variational problems.
This is one of the decisive adt'antages of the theory of monotone operators.
regular variational problem
(convexity)
Euler- Lagrange equation Legendre condition
(necessary conditioll (necessary condition
for a minimum) for a minimum)
1
regularly elliptic
differential equation
(monotonicity)
Figure 18.18
82
I H Varaallonal Problems. the Ritz Method. and the Idea of Orthogon,.hly
In order not to obscure the simple basic ideas by many indices, we begin
with the case of R 2 .
18.17a. Variational Problems in 2
We consider the variational problem
f L(.'(. II, p, q) (Ix = min!, II E C 2 (G),
(i
II = Y on l'G.
Here. let x = (. '1). P = u. q = u.,. furthcrmore, let
J' = (x, II. p.. q)
and P(x) = tx, ut X), p( x). q(x)). We make the following assumptions:
(HI) G is a bounded region in (R2.
(H2) The Lagrange function L: G x IR J -+ I( is C 2 .
Important necessary conditions for (86) are given by the following t:uler
Layran/f! equal ;on
(86)
,
( ('
...., L ,, (P( x}) + L q (P{x)) - Lu(Ptx)) = 0 on G (87)
( . (' '1
and by the Leyellclre ('olJd;t;on
L""f p(x) )(12 + 2L pq ( P(x))ah + Lqq( P(x))h 2 > 0 on G (88)
for all real numbers a and h. Note the interesting fact that (88) contains only
second partial derivatives of L with respect to p and q.
Proposition 18.39. (,' "tier the assu"lpl ions (H I ) and (H 2}, if II is a SO/lit ion or
(86). tile" (87) and (88) are sat;s.(ied.
Remark 18.40 (('onvexity). Suppose that L in (86) is convex with respect to
(p. q). Then the Legendre condition is fulfilled automatically. To show this let
rjJ(l) = L(x.lI. p + tel, q + th), t E R.
Then the real function t/J is convex. i.e.,
fj!"(l) > 0
for all I E R.
This implies
L",,( P)a 2 + 2L pq ( P)ah + Lqq{P)h 2 > 0
for all PEG x IR J and all real II and h.
Remark 18.41 (Regularly Elliptic Equations). The partial differential equation
("': j'.
A(P(x)) + B(P(x)) + C(P(x)) = 0
i"" ( '1
(89)
18.17. Convex Variational Problems. Elliptic Differential Equations. and Monotonicity 83
is called regularly elliplic iff
A,(P}a 2 + (A.(P) + B,{P)ab + B.(P)b 2 > 0
holds for all real a and b with a 2 + b 2 :F 0 and all PEG X R 3 . Note that the
Euler- Lagrange equation (87) is a special case of (89) with
A = L,. B = L4' C = - Llf.
The variational problem (86) is called regular ifTthe Euler- Lagrange equation
(81) is regularly elliptic. Then the Legendre condition (88) is fulfilled automati-
cally. Explicity. the regularity of (86) means that
L,,(P} L".,(P) 0 0
>, L,,(P} >
L,.(P) L..(P)
The regularity of the variational problem implies .p"(I) > 0 for aliI in Remark
t 8.40. i.e.. the Lagrange function L is strictly convex with respect to p and q.
Equation (89) is quasi-linear, i.e., it is linear with respect to the highest
derivatives of u. In fact, (89) can be written in the form
for all PEG X R3.
Apu + (A q + B,)u, + B.u" + lower order derivatives = O.
The regular ellipticity of (89) means that the corresponding quadratic form is
positive definite.
PROOF OF PROPOSITION 18.39. Let u be a solution of the minimum problem (86).
(I) Euler- Lagrange equation. We use the same simple idea as in Section 18.3.
To this end. we set
K' = U + I h.
h E Co(G),
I e R.
Since" = 0 on cG, we get K' = g on cG. Thus, the real function
rp(t) = L L(x, "', "'<' "'.,) dx
has a minimum at t = 0, i.e.. cp'(O) = O. Hence
rp'(O) = t (L"h + Lph( + LqI'.,)dx = 0,
where Lu stands for Lu(P(x)}, etc. Integration by parts yields
t [L" - (L p )( - (L,).,]hdx = 0 for all h E Co (G).
By the variational lemma (Proposition 18.2), we get [. . .] = O. i.e., (87) is
satisfied.
(II) Legendre condition. Since cp has a minimum at I = O. q>"(0) > O. i.e.,
rp"(O) = L (Lpph: + 2LPlIh(h., + L"h + F)dx 0 (90)
84
18 Variational Problems. the Ritz Method. and the Idea of Orthogonality
with the "unimportant" term
F = L,,"hl + 2Luphh + 2L.,qhh".
Relation (90) holds for all h e Co"(G). Our goal is to obtain the Legendre
condition from (90) by special choices of h.
First. let the point Xo be fixed in the region G. and let B denote the unit
ball. Je'or each given g e cg (8) and sufficiently small p > 0, the function
( X - x o )
h(x) = PO -p-
lives in the space C (G). Introducing the new variable J' = (x - xo)/p and
letting p 0, it follows immediately from (90) that
is (Lppgf + 2Lpqgg" + Lqqg )dy > 0 (90*)
for all g E Co (8). Here. L pp denotes Lpp at the point Xo. Observe that
x = Xo + P.\'.
We now choose y = III and g = 92' where
Y 1 (\") = t/J(x) cos t(a + h,,).
II 2(X) = t/1(x) sin l(a + hlO,
for fixed tI. hER. 1 > 0, and t/I E Co (B). Adding the two expressions
corresponding to (90.), we get
t 2 ( L pp u 2 + 2L pq ah + Lpph2) is 1/12 dx + (...) > O.
for all t/1 E C (8). \vhere (...) is uniformly bounded with respect to all
t > o. Dividing by I > 0 and letting t --. + x" we obtain the desired
Legendre condition
r 2 L - 2
Ll'pU + 2 pqab + Lpph ? 0
for all a, hER.
o
18.17b. Variational Problems in (RN for
Several Unknown Functions
The preceding results can be generalized immediately to the following varia-
tional problem:
f L(x, u, p)dx = min!, U e C 2 (G)K.
G
II = g on c'G.
(91 )
Here let
.. .. )
x = ( I ' . . . . ... .
" = ("I . . . . . Ill( ).
Furthermore, we set
pt = Dill"..
18.18. The General Euler-Lagrange Equations
85
and denote by p the tuple of all the pre The Euler-Lagrange equations are
given by
C
-L-Jr - Lu. = 0
c, ,,- Ir ·
and the Legendre condition is given by
LJJ':Pja,ajc.c". > 0 (93)
for all a E R N and C E R«. In this connection, the argument of L in (92) and
(93) is (x, u(x), p(x)). We sum over two equal indices where i, j = I, .... N
and k. In = I, ..., K. Condition (93) is also called the Legendre-Hadamard
condition.
k = t, ..., K,
(92)
Proposition 18.41. Let G be a bounded region in R N . N > I. and suppose tlJal
ti,e Lagrange function
L: G x R K + KN ...... R
is C 2 . Tllen each solution u of (91) satisfies (92) and (93).
The proof is the same as the proof of Proposition 18.39.
18.17c. Regularly Elliptic Systems
The system
C . .
C, A, (x, u(x). p(x» + C (x, u(x), p(x» = 0,
is called regularly elliptic iff
cA: Ie III
'" (., u, p)a, a j > 0 for all a:l: 0 (95)
l'Pj
and for all possible arguments. i.e.,. e G, u e R K , p, a E R«N. We sum over two
equal indices where i, j = t, ..., Nand k, m = 1. .... K. The variational
problem (91) is called regular iff the Euler- Lagrange equations (92) are
regularly elliptic. Then the Legendre condition (93) is fulfilled automatically.
k = t.....K
(94)
18.18. The General Euler- Lagrange Equations
We consider the variational problem
t L(.,u,Du)dx = min!, u e C 2 "(G)«,
D2 1 , = Y2 on oG for all «: 121 S m - 1.
(96)
86
I Vanatlonal Problems. the Ritz Method, and the Idea of Orthogonality
In this connection we set U = (II,. .... UK). The Lagrange function L depends
on II and all partial derivatives of u up to order nt. Furthermore. D 2 u denotes
an arbitrary partial derivative of order II. Let Du = (D2 U ). SI,ls",.
Theorem 18.H. Let G he a hounded regioll ;11 R'Y ".ith N > I, and suppose tlrClt
1 he Laf/rCllJge fllnct ion I..: G x RI: -+.'1 -+ IR is C 2 . Then each solut;on 1I of tire
rariatiolJClI prohlenJ (96) is also a solution o..f the Euler- Lagrange equations:
[ iL ]
L (- I )I"D' ' D - ,-- (x. u(x). DII(x)) = O.
12{ $m ( II,
k = I..... K
(97)
lor all x E (i.
:\lor('orer, II ;$ a/so a solutiott (f the generali:ed £lIler- Lagrattge eqllations:
i (L
L D ; (..II(x). Du(x))D 2 h,(x)dx = 0
I'IS'" (i ( U,
lor a/l II, e Co (G) alJd all k. = I,..... K.
The proof.. which proceeds analogously to the proof of Proposition 18.39.
\\'ill be given in Problem 18.2.
The calculus of variations was founded by Euler (1744) who obtained (97)
for N = I by means of a difference method. Lagrange (1762) invented those
elegant methods which we use today. In contrast to Euler. Lagrange obtained
(97) for f\' > I. For brevity in the following. the Euler- Lagrange equations
\vill be simply called the Euler equations.
18.19. The Historical Development of the 19th
and 20th Problems of Hilbert
and Monotone Operators
18.19a. The 20th Problem
The solution of the 20th Hilbert problem was closed in some sense by the study
of monotone operators.
Jindfic Necas (1982)
We consider the regular variational problem
r L(x, 11(.\'), Du(.\')) dx = min!.
J (i
II = Y on rG.
Here. Du denotes the tupel of the first-order partial derivatives of the real
(98)
18.19. The Historical Development
87
function u. In the 20th problem of his Paris lecture, Hilbert asked the following
question.
Is it true that (98) has a solution u in case we generalize the notion of solution
in an appropriate sense?
Today we know a positive answer. There exists a solution u in the Sobolev
space Wi (G), I < p < 00. in the case where the following assumptions are
satisfied:
(i) G is a bounded region in R N , N I.
(ii) The functions Land 9 are sufficiently smooth, and L is convex with
respect to Du. More precisely, let 9 e W,I(G) and L: G x A N + 1 ..... A is Ca.
(iii) There are positive constants a, b, c such that L satisfies the groK,th
condition
IL(x,u.Du)1 S a(1 + lul' + IDul')
and the coerciveness condition
(99)
L(x,u,Ou) > blOul P - c.
(100)
This result and its generalizations will be considered in Chapter 42. The
theory of monotone operators can be regarded as a natural development of
the 20th problem. Roughly speaking, the main idea is the following:
(i) If the function L is convex with respect to u and Ou, then the integral in
(98) corresponds to a convex functional J = J(u) where the derivative J'
is a monotone operator. The Euler equation corresponds to the operator
equation )'(u) = O.
(ii) If L is only convex with respect to OUt then the operator J' is pseudomono-
tone.
(Hi) The existence theorems for monotone and pseudomonotone operator
equations allow us to prove the existence of solutions for nonlinear elliptic
differential equations which are not necessarily the Euler equations to
variational problems.
These results are also valid for problems which contain derivatives up to
order n. In this case the function L in (i) must be convex with respect to u and
all derivatives. In (ii) we only need the convexity of L with respect to the nth
derivatives.
Note that the coerciveness condition (I (0) is not satisfied for the problem
of minimal surfaces, i.e., for
L = J I + u1 + u.
In this case one has to use special sophisticated geometrical methods. In
particular, if the region G is not convex, then there need not exist solutions of
(98). This will be considered in Chapter 52.
Vilik (196 t) was the first to apply monotonicity methods to quasi-linear
elliptic differential equations.
88
18. Variational Problems. the Ritz Method. and the Idea of Orthogonality
18.19b. The 19th Problem
In the 19th problem Hilbert asked the following question: Is it true that the
analyticity of the functon L implies the analyticity of all the solutions of the
regular variational problem (98)? A generalization of this problem can be
formulated as follows. Consider the regular elliptic second-order system
...
(' , "
..- Ai (x. Il(X). DIl(x) + C (x. u(x). Du(x») = 0,
l'<;;
k = I, ....1(, (101)
on the bounded region G in R N where u E IRK. We now ask the following
question:
(Q) Are the solutions u (r (101) sufficie'ltlJ' smooth if the functions A, C Ic lire
su.fficiently sInooth?
For N = 2. a positive answer to this question was given by Morrey (1940).
For N > 2 and K = I (the case of one equation), a positive answer was given
by De Giorgi (1957) and Nash (1958). By generalizing the classical Harnack
inequality. Moser (1961) simplified the proofs of De Giorgi and Nash.
In very rough terms. the main idea of the regularity proofs for (101) is the
following. We consider (101) with K = I. For the sake of simplicity, we assume
that CIc(.) does not depend on Du. Put
ru
r = ---
i , 1ft
for fixed m.
Then differentiation of (101) with respect to '" yields the key equation
, ( a ij ; ) =.f on G.
(<;i (j
where aij and .r depend on u.
Step I: Existence of generalized second-order derivatives. Let u E W 2 1 (G) be
a generalized solution of (101). i.e., the function u has first-order generalized
derivatives. We show that u E W 2 2 (G), i.e., u has second-order generalized
derivatives. Hence l' E W 2 1 (G). Furthermore, we show that v satisfies (101.).
(101.)
Step 2: Boundedness of the first-order derivatives. We show that u E Wxl (G),
I.e.,
sup I Du(x)1 < x.
".G
Step 3: Reduction to the regularity theory for linear elliptic equations. For
.fixed u, equation (101.) represents a linear second-order elliptic equation for
I" where we assume that there is a c > 0 such that
aij(x)b;h j c Ihl 2 for all x E G, b ERN.
Essentially, this condition follows from the regular ellipticity of( 101). We now
18.19. The Historical Development
89
use the following two basic results for linear elliptic equations:
(i) Theorem of De Giorgi and Nash. Let G be a bounded region in R N , N > I,
and let a,l,f E L:x;(G) for all i,j. Then, each generalized solution v of equation
(101.) with
t' E Wl(G)
;s locally Holder continuous in G, i.e., there exists an E ]0, 1 [ such that
v E Chx:(G).
This statement remains true if f E L,( G), where p > N /2.
(ii) The smoother the coefficients a'l and the right member f are, the smoother
is the solution v of (101.).
This way we obtain the regularity of the generalized solution U of(101) step
by step. Note that (i) is a very strong result since the functions a'l and fare
"bad," i.e., they are only measurable and bounded. For example, it follows
from (i) that u E C l .« and hence a'l' f E C« for all i,j. Then the regularity theory
for linear elliptic equations tells us that the weak solution t' E Wl of (10 I.)
belongs to C t .« and hence u e C 2 ... This process can be continued if A:, C' in
( 101 ) are sufficiently smooth.
In this connection, note the following. By delinition, the function v is a
generalized solution of equation (101.) iff
J Ct, ccp J
- G a'J oJ C, dx = G!cpdx
for all cp e C o ( G). This identity follows formally from (10 I.) by multiplying
( 10 I.) with cp and integrating by parts.
Similarly, the function u is a generalized solution of equation (101) iff
J t ccp(x) J .
A, (.t, u(.t). Du(x» .. dx = C (x, u(x), Du(x)cp(.t) dx
G ( C;, G
for all cp E Co(G) and k = I,..., K.
The regularity theory for linear elliptic equations can be found in Ladyzen-
skaja and Uralceva (1973. M) and in Gilbarg and Trudinger (1983, M). We
also recommend Giaquinta (1983, M) where the following techniques are used:
the Caccioppoli inequality, Campanato spaces, and the freezing of coefficients
(cf. also Problem 18.7).
We now consider the case K > I where (101) represents a regularly elliptic
system. Unfortunately, the answer to Hilbert's question (Q) above is negative.
From the physical point of view this is not completely surprising since the
deformation of elastic bodies can be described by nonlinear elliptic systems
where rupture may occur. Innuenced by the counterexamples of De Giorgi
(1968) and Giusti and Miranda (1968), Neeas (1977) constructed an elliptic
system of the form (101) with analytic functions A', C' which has a generalized
solution being merely Lipschitz continuous. For systems one has two options:
90
I K Variallonal Problems. Ihe Rilz Mcthod. and the Idea of Orthogonality
(a) One proves partial relularity.. i.e... one shows that the generalized solution
is regular up to a set of small Hausdorff dimension. Therefore.. the set of
singularities is "thin.'"
(b) One proves the full regularity for special systems of so-called Liouville
type.
In this connection, we recommend Giaquinta (1983.. M).. Necas (1983.. M),
and Kohn (1984, S). In Problem 72.11 we shall discuss an important partial
regularity theorem for the Navier - Stokes equations which describe viscous
nows. This theorem allows the interpretation that the possible domain of
irregular behavior (turbulence) is "thin."
As an introduction to the regularity theory for nonlinear problems we
recommend Necas (1982. S), (1983. M), Giaquinta (1983.. M), and Giaquinta
and Hildebrandt ( 1989, M). Furthermore. we recommend the standard works
by Ladyienskaja and Uralceva (1964.. M), (1973.. M), Morrey (1966, M).. and
Gilbarg and Trudinger (1983, M).
18.19c. Two Basic Strategies in Regularity Theory
The main idea of regularity theory is to use the integral identity for the
generalized solution II in order to obtain integral inequalities for appropriate
quantities. From these inequalities one obtains information about II. There
are the following important possibilities:
fat Integral inequalities for difference quotients ensure the existence of gener-
alized derivatives (cf. Section 22.18).
fb) Integral inequalities for "mean values" yield the Holder continuity of
functions (see the Morrey lemma and the theory of Campanato spaces
below).
(c) Let (i be a nonempty measurable set in R'\! with N I and meas(G) < 'X,.
Let 1 PI < P 2 < . . . such t ha t p" -+ X' as n -+ 'YJ and
I
II E n lp"(G)
"I
with
sup flu II," < Xl.
"
Then II E L,(G) and
lIull.. = Ii m /I u " p" .
".. T
Statement (c) can be used to prove the boundedness of the function II via
L,-estimates for II. A simple proof of the well-known result (c) can be found
In K ufner.. John. and Fucik (1977, M).. p. 85.
Fir.f\t ...trCltegy: The difference quotient method.
This is related to (a) above. In Section 22.18fT we will use this method in
order to obtain general regularity results for linear elliptic equations of order
2nl. The same method can be used in Step I from Section 18.19b. The idea is
to use the integral identity for the generalized solution and the ellipticity
18.19. The Historical Development
91
condition in order to obtain integral inequalities for appropriate difference
quotients. These estimates yield the existence of additional derivatives.
Second strateg}': Campanato spaces.
This is related to (b) above. The main idea is contained in the following
lemma.
Fundamental Lemma 18.43. Let G be a nonempty open set in R N , N 1. Set
B,tr) = {. e R N : I. - yl < r} and define the mean value ,,'ith respect to the ball
B,{y) by
14,,= 18 ) 1 u{x)dx.
· me as ,(}' B r (,.)
Let 0 < 2 S 1, I S P < OC, and R > 0 be given. Then the inequality
1 lu(x) - u,.,I' dx const rN+p«
Br('.
for all r < min(R, dist(y, cG)) implies u e Croc(G), i.e.. u is IDea"}' Holder con-
tinuous on G.
PROOF. Cf. Giaquinta (1983. M), p. 72.
o
We want to show that this result generalizes the famous classical Morrey
lemma.
Corollary 18.44. Let u E W,I(G). Then the inequality
1 IDul'dx constrN-'+Jn
Br().)
for all r < dist(},. cG) and y e G implies u e Cfoc(G).
PROOF. Set 8 = 8 1 (y). By A 2 (S3a), the norm
(is IDt'IPdx + is vd:( P)'i P
is an equivalent norm on W,I(B) (inequality of Poincare). Let \\' E W p l (B). We
set t' = \,. - "',. I' Then J B v dx = 0, and hence
Sa I viP dx S canst Sa IDrl P d.x.
Let I' e Wi (G). Using the transformation x rx we get
r lu(x) - u,.rI P dx S canst r P r IDul P dx.
J Br('. J Brb.
The assertion now follows from Lemma 18.43. 0
92
I R Vanational Problems. the Ritz Method. and the Idea of Orthogonality
Finally, we want to formulate a global variant of the Fundamental Lemma
18.43.
Definition 18.45 (Campanato Space y,.A(G)). Let I < p < x, and 0 ;. < x.
Let G be a bounded region in RH, N > I. We set G,(y) = {x e G: Ix - YI < r}
and
[1l],.A = sup r-A. f lu(x) - u).,I"dx,
, E G.'> 0 G r .,..
where
I f
II, , = - - - - u(x)d:(.
. meas G,( Y) Grf)"
The CampalJaro space .,sp,.A(G) consists of all II E L,(G) with [U],.A < (y.
.!I'I'. A( G) is a B-space with the norm II u" l.pfG' + [u],. A'
Let X and }' be B-spaces over 6(. Then X and Yare called equil'alelu
iff there exists a linear homeomorphism h: X -+ Y. In particular, there are
constants c. d > 0 such that
I;h(u)h). < cllullx
and
IIlIlIx dllh(u)U r
for all II E X. lJsually. one identifies h(u> with II.
If X and Yare function spaces, then. by definition, the equivalence between
X and Y includes the fact that u and h(l4) denote the same function.
The two B-spaces X and Yare called normisomorphic iff they are equivalent
and we have
II II( u) II), = 1/ U II.\'
for all u E X.
Proposition 18.46. LeI !\' < i.. N + p and 2 = (). - N)/p. Let G he a hounded
region in A'\' ,,'ith i'aG E ('0.1, N > 1. Then the Campanato space ,.A(G) ;s
equifale,u 10 lite Holder .pace C 2 (G).
PR()()F. cr. Giaq uinta (1983, M), p. 70.
o
In particular. this import'!.nt result tells us that it follows from U E L,(G)
and [u]P.A < x that U E C 2 (G), i.e., u is Holder continuous and
ilulle'fl;) < const(lIuIlLp(G) + [11],..\).
Definition 18.47 (Morrey Spaces LP'.\(G)). We use the same notation as in
Definition 18.45. Set
[II]:.A = sup r" A (f '''1' dx ) I'P,
)'E G.r>O Grf)"
The Morrey space LP.A(G) consists of all u E L,,(G) with [U]:.A < oc...
L". A( G) is a B-space with the norm [u ]:..t.
Problems
93
Proposition 18.48. For each I S P < 00, the Morrey space L,.N(G) is equit'alent
10 L(G) where N = dim G.
Let aG e Co. I. Then. for each I S P < 00 and 0 S i. < N.. the Morrey space
L,.A(G) is equivalent to the Campanato space fi',.A(G).
PROOf. Cf. Neeas (1983, M), p. 35.
o
In particular.. it follows from u E L,(G) and [U]:.N < 00 with I S P < 00
that u E Lx(G), i.e., u is bounded.
18.20. Sufficient Conditions for Local and Global
Minima and Locally Monotone Operators
The Euler- Lagrange equations from Section 18.18 are necessary conditions
for local minima. In order to obtain sufficient conditions for minima, one has
the following four important possibilities:
(i) Monotone operators in the convex case. In Chapter 2S we show that in
the case of the convex minimum problem
(M) F(u) = min!. u E X,
on the real B-space X. the Euler equation
(E) F'(u) = 0, U E X,
is a necessary and sufficient condition for global minima of (M). Here. the
operator F': X X. is monotone.
If F' is strictly monotone, then the unique solution u of (E) is the unique
minimal point of (M).
(ii) Definiteness of the second variation, accessory quadratic problems. eigen-
value criteria and locally regularly monotone operators. This will be studied in
detail in Chapter 29.
In this connection, we will use the theory of eigenvalue problems for bilinear
symmetric functionals in H-spaces and the theory of strongly elliptic differ-
ential equations. These tools will be investigated in Chapter 22.
(iii) Field theory. See Chapter 40.
(iv) Duality theory. See Chapter 52.
PROBLEMS
18.1. E.'(amples for the Ritz mrhod.
18.1 a. Solve the boundary value problem
u"(x) + cos x = 0,
o < x < ft,
u(O) = u(n) = 0
with the aid of the Ritz method.
94
I H. Variational Prohlems. the Ritz Method. and the Idea of Orthogonality
Solution: As basIs functions we choose
"., = sin kx.
= I. 2. . . . .
These functions satisfy the boundary condition. The corresponding variational
problem reads as follows:
i ff (2 I U'2 - II COS x) Jx = min!..
()
u(O) = lI(n) = o.
The Rill method Yields an approximate solution
"
II" = L (, ".,..
, -I
where the unkno\\'n coefficients c" are determined by the minimum problem
flc) dd So" 12 111/ - lI.cosx)dx = min!.
From
tF
= 0..
t'c,
k ::: 1...... n
\\'C obta.n (,. Explicitly.
2" sin2r
"2" = L - 2 - .
n r -I r(4r - I)
As ,. '. this series converges uniformly on [0. n] to the exact solution
IIf \ t = cos - 2n 1 - I.
18.1 b Solve approximately the boundary eigenvalue problem
-II"(X) = Jill(X).. - I < .\ < L
u( - I) = u( I ) = 0
with the normalization condition
f I 11 2 (/ = I
- 1
by means of the Ritl method. This example was considered by Ritz (1909) in
his basic paper.
Solution: In order to compute approximately the even eigenfunctions.. we
proceed from thc trial
,. = (I - X 2 ) h.o + C I X 2 + . .. + c".' 2.. ).
which satisfies the boundary condition Let
Flc) = fll 11,'2 - 1!I,2)dx.
Then the Ritz equation (28) reads as follows:
tf(c)
- - = 0
i("" ·
k = O. . . . .. ".
Problems
95
(I) Let n = I. We obtain the eigenvalue problem
(35 - 14,.)Co + (7 - 21l)c , = 0,
(21 - 61l)co + (33 - 21l)c, = o.
The determinant of this system must vanish. i.e.,
11 2 - 28,. + 63 = o.
The solutions are
I. = 2.467 440,
11 = 25.6.
The exact solutions of the problem are the eigenfunctions
u = cos rex/2, sin rex, cos 3rex/2, . . .
with the eigenvalues
p = n Z i4 :z: 2.467 401.
p = 7(z,
,. = 97(z /4 = 22.207. . . .
Thus, our approximate eigenvalue p = 2.46 744 is already a good approxi-
mation of the lowest eigenvalue.
(lit let n = 2. Then we obtain
11 = 2.467 401,
and t' = (I - .2)( I - 0.233 430.2 + 0.018 962..) which is a very good
approximation of the lowest eigensolution.
The odd eigenfunctions are obtained through the trial
\\' = (I - xZ)(co' + CIX J +... + C"XZ"+I).
18.2. Proof of Theore,n 18.H.
Solution: We set
". = u + t h,
Then the real function
h e Co (G),
t e R.
tp(r) = t L(.. w. Dw)d.
has a minimum at t = O. i.e.,
tp'(O) = f L L"... D"". d." = o.
J G 2..
Integration by parts yields
t [( -I "D"L"...] h. d. = 0
for all h. e Co(G). The variational lemma (Proposition 18.2) implies [...] = o.
18.3. The completion principle for B-spaces. Let D be a normed space over K = R. C,
i.e., 0 is a linear space over f( and there exists a function u lIull on D with the
following properties:
(i) lIuli 0 for all u e D, and lIuli = 0 iff u = 0:
96
I It Variational Problems, the Ritz Method. and the Idea or Orthogonality
(ii) 1111 + I'll S; 111411 + IIvll for all 14, Ii E D;
(iii) 1121411 = 121111411 for all 14 E D, 2 E K.
The B-space X over K is called a completion of D iff D is dense in X and the
"t -norm and the D-norm coincide on D.
Two B-spaces X and Y over k are called normisomorphic iff there exists a
linear bijective map j: X ..... Y with
II j(u)" = Itull
forall UEX.
18.3a. UII;queness (r completions. Show that two completions X and Y of Dare
normisomorphic.
Solution: Let D X and D £ Y. We define the operator j: D s; X -+ Y by
j(u) = u. Then we have II j(u)" = null on D. By the extension principle in Section
18.12, there exists a unique extension j: X ..... Y with II j(u)U = lIun on X.
In order to prove that j(X) = let (u..) be a sequence in D with 14" -+ I' in Y
as n .... rx. Then (14,,) is a Cauchy sequence in X and Y. Thus, we obtain u" -+ 14
in X as n ..... 'X. i.e., j(u) = v.
18.3b. Existence of a completion. Show that there exists a B-space X which is a
completion of D.
Solution: Two Cauchy sequences (14,,) and (VII) in D are called equivalent iff
II U II - t'" II ... 0 as n..... x.
Let X be the set of the corresponding equivalence classes u = [(u..)]. We define
operations by
[(14,,)] + [(r..)] = [(14.. + VII)]'
2[(U,,)] = [(2U..)].
It is easy to show that these operations are independent of the choice of the
representatives. This way X becomes a linear space.
Furthermore. we define
111411 = lim It 14,,11.
( 102)
" ... r.
Since IlIu,,1/ - Ilu", II I < lIu" - u",lI. this limit exists. Moreover. it follows from
III 14..11 - 11('..111 s II 14 11 - ('..If
that the limit in (102) is independent of the choice of the representative (14..) of
u. It follows easily from (102) that X is a normed space.
Let M' E D. Then the constant sequence (w) is a Cauchy sequence and hence
[h")] lies in X. We identify K' with the equivalence class [(w)]. This way D
becomes a subset of X, i.e., D s; X.
Each Cauchy sequence (u II ) in D converges to 14 = [(14 11 )] in X. This follows
from U - U'" = [(14" - 14",)] for fixed m. Hence D is dense in X.
Finally, we show that X is a B-space. To this end let (w II ) be a Cauchy sequence
in X. We choose a sequence (u,,) in D with 1114.. - w..11 < I/n for all n. Hence (14,,)
is a Cauchy sequence and u.. ..... 14 as n ..... 00. This implies w.. ..... u as n ..... 00.
18.3c. Example. If D is equal to the set of rational numbers, then the completion of D
is X = A. Our method of proof in Problem 18.3b corresponds exactly to
the well-known construction of real numbers due to Cantor (1845-1918). In
Section 19.10 we shall use this completion procedure in order to construct the
Problems
97
Friedrichs extension. This way we shall obtain existence theorems for partial
difTerential equations.
t 8.4. The completion principle for H-spaces. Let D be a pre-Hilbert space over K = R,
C. Then the Hilbert space X over K is called a completion of D ifT D is dense in
X and the X -scalar product and the D-scalar product coincide on D.
Two H-spaces X and Y over K are called H-isomorphic ifTthere exists a linear
bijective map j: X -+ Y with
(j(u)lj(v)) = (ulv)
for all u, veX.
Such operators j are called unitary operators.
18.4a. Existence and uniqueness of completions. Show that there exists an H -space X
which is a completion of D and that two such completions of Dare H-isomorph.
Solution: Let k = C. For scalar products, one has the identity
4(uIL') = lIu + vll 2 - lIu - vtl 2 - illu + it'1I2 + illu - il," 2 . (103)
Hence each normisomorphism between H-spaces is also an H-isomorphism.
According to Problem t 8.3b 9 let X be a B-space which is a completion of D.
Then, by (103), X is also an H-space 9 where the scalar product on X is given
by the limit
(ulv) = lim (u,,1 v,,).
,,-x
Here, (u,,) and (v,,) are sequences in D with u" -+ u and t." .... v as n ..... x>. The
existence of this limit follows from (103).
If K = R, then use (103) with i = o.
18.4b. Example. Let G be a bounded region in R N , N 1. We equip the real linear
space CO(G) with the scalar product
(ulvh.2.0 = r f D,uD,vdx.
JG lei
Show that the Sobolev space W 2 1 (G) is the completion of Co(G) with respect
to ('I'h.2.o.
Solution: By Section 18. to, the space W 2 ' (G) is the closure of CO( G) with
respect to (.I.)..z. Hence W 2 1 (G) is the completion of CO(G) with respect to
('1. ),. 2. By the inequality of Poincare- Friedrichs (54), we have
C(ulu)'.2 S (ulu)"2.0 S (ulu)1.2
for all u e W 2 1 (G), i.e., the convergences with respect to (.1.).. 2 and ('1.).. 2.0
coincide on W 2 1 (G).
18.S. A densit}' criterion in H-spaces. Let S be a set in an H-space X over K. Show
that if the set S is dense in X 9 then
(ulv) = 0
for all v e S
implies
u = o.
(104)
Conversely, it follows from (104) that span S is dense in X.
Solution: Let S be dense in X. Then (ulv) = 0 for all v e S implies (ulv) = 0
for all veX, i.e., in particular, (ulu) 8E O. Hence u == O.
98
18. Variational Problems. the Ritz Method, and the Idea of Orthogonality
Conversely. suppose that (104) holds. If span S is not dense. then the closure
of span S in X is a proper linear subspace of X. By the perpendicular principle
in Section 18.11, there exists a u 0 in X which is orthogonal to span S. This
contradicts (104).
18 6. Dells;ly a"d d,It";IY. Let X and Y be B-spaces over k = R. C, such that the
embedding
x£;t"
f 105)
is continuous. and .,\. is dense in Y. Show the following:
(I) The embedding Y. £; X. is continuous.
(il) If..¥ is renexive, then y* is dense in X..
Solution: Ad(i). It follows from (105) that
I X /I r < co n s t II x II \'
Let .r E }' - be given. Then, for all x EX.
I ( .r * , x » I < iI." */1 r. /I x II )
for all x EX.
f I 06 )
< cons' Ii ."*1 1 ),.11 x II x.
Let y-: ..\" IK denote the restriction of the functional y-: Y --+ &( to the subset
.X of Y. By f 106). we get .r* E X*, where
< r -. x> \' = < y-. x» for all x E ..\".
and
'j'-lix. < constay-I!). for all Y- E Y-.
f 107)
( 1 08 )
We \\'ant to sho\\' that .v. = 0 implies y* = O. In fact, let .,,* = O. Since X is
detJ...e in Y. it follows from ( 107) that < y*, x» = 0 for all x E y, and hence y* = O.
Set jt,,*) = .f., Then, it follows from our considerations above that the
operator
j: Y- -+ ..¥-
is ilJjeclir'e and continuous. Therefore, the functional J'* E Y- can be identified
with the functional .r. E X.. In this sense. we have Y. X.. Moreover, it
follows from (107) and ( 108) that
< .r - , x).\, = (.r - , . »
for a II -" * E Y * , X E ..\'.
( I 09 )
and
II.\'_,I,\,. < const II y. " t.
for all y. E Y..
Ad(ii). If the assertion is not true, then the closure of Y. in the 8-space x-
is a proper closed linear subspace of X.. By the Hahn- Banach rheorem, there
exists a functional.f E (X.). such that
f(x*) = 0
for all x* E r.
( 110)
and .f O. Since X is re.fle.;r'e, there exists an x E X such that
((x.) = (x. . x)\,
for all . EX..
By (109) and (110).
(X., '>r = 0
for all x. E Y-.
References to the Literature
99
Since X s; Y, and hence x E y, we get x = O. This implies f =- 0, which contra-
dicts f :;: O.
18.7. Elementar}' approach to the regularity theory for linear and nonlinear elliptic
partial differential equations of second order. Such an approach, including the
theorem of De Giorgi and Nash, can be found in Giusti (1984, M), Appendix
C (approach via the Harnack inequality), and in Kinderlehrer and Stampacchia
(1980, M), Appendix to Chapter 2 (the De Giorgi technique).
More general results can be found in Gitbarg and Trudinger (1983, M)
(Chapter 8), Ladyfenskaja and Uralceva (1973, M), (1986, S), and Krylov (1985,
M). An elegant modern approach to the regularity theory can be found in
Giaquinta (1983. M) and in Giaquinta and Hildebrandt (1989, M).
References to the Literature
Classical works: Euler (1744) and Lagrange (1762) (foundation of the calculus of
variations), Laplace (1782) (Laplace equation Gauss (1839) (potential theory Riemann
(1851), (1857) (foundation of complex function theory), Weierstrass (1870) (criticism of
the Dirichlet principle), Hilbert (1901) (justification of the Dirichlet principle), Fried-
richs(1934), Sobolev (1936), Courant and Hilbert (1937. M). Schwartz (1950) (distribu-
tions).
Collected works: Euler (1911), V ols. 1- 72, Lagrange (1867), V 015. 1-14, Laplace
(1878), Vols. 1-14, Gauss (1863), Vols. 1-12, Riemann (1892), Vols. 1-2, Weierstrass
(1894), V ols. 1-7, Poincare (1916), V ols. 1 -II, Hilbert (1932), V ols. 1 - 3.
History of mathematics of the nineteenth century: Klein (1926, M).
History of functional analysis: Dieudonne (1981, M).
History of calculus from Euler to Riemann and Weierstrass: Bottazzini (1986, M).
Hilberfs problems. their solution, and their influence on the mathematics of the
twentieth century: Aleksandrov (1971, S), Browder (1976, S).
Biographies of Hilbert: Blumenthal (1932), Reid (1970).
History of potential theory and of the theory of partial differential equations:
Burkhardt and Meyer (1900, S), Sommerfeld (1900, S), and Lichtenstein (1921, S) (three
articles in the encyclopedia of mathenlatics), Dieudonne (1981, M).
Collection of important classical papers in analysis: BirkhofT (1973).
Potential theory, integral equations, and boundary value problems: Courant and
Hilbert (1953, M), Gunter (1957, M).
Classical mathematical physics: Courant and Hilbert (1953, M), V ols. I, 2.
Modern mathematical physics: Reed and Simon (1971, M), Vols. 1-4.
Interpolation theory and linear partial differential equations: Lions and Magenes
(1968, M), Vols. 1-3, Triebel (1978, M), (1983, M).
General theory of linear differential operators, pseudodifferential operators, and
Fourier integral operators: Hormander ( 1983, M), V ols. 1 - 4.
Variational methods: Courant and Hilbert (t 953, M) (recommended as an introduc-
tion), Morrey (1966, M) (standard work), Klotzler (1971, M), Velte (1976, M) (ele-
mentary introduction), Blanchard and Bruning (1982, M), Giaquinta and Hildebrandt
(1989, M) (modern standard work).
Survey on minimal surfaces: Osserman (1986, M).
Minimal surfaces, modern geometric measure theory, and functions of bounded
variations: Giusti (1984, M).
Harmonic mappings: Jost (1984, L), (1985, L), (1988. L), Hildebrandt (1985. L),
Hildebrandt and Leis (1989, S).
100
J 8 Variational Problems. the Ritz Method. and the Idea of Orthogonaluy
Essays on mathematics and optimal form: liildebrandt and Tromba (1985, M).
Regularity theory: Netas (1982. S (1983, M) and Giaquinta (1983, M) (introduction).
ladyienskaja and Uralceva (1964, M), (1973, M), Morrey (1966, M), Friedman (1969.
M) (linear equations), Gilbarg and Trudinger (1983, M), Koshelev (1985, L), Giaquinta
and Hildebrandt (1989, M) (cf. also Problem 18.7).
Classical papers on the Ritz method: Ritz (1909), Courant (1943).
Numerical realization of the Ritz method: Velte (1976, M), Stoer and Bulirsch (1976,
M) (elementary introduction), Michlin (1962, M), (1969, M), (1985, M), Rektorys (1977,
M), Glo\\'inski (1984. M), Hackbusch (1986, M).
The Ritl method and the method of finite elements: Ciarlet (1977, M). Cf. also the
References to the literature to the Appendix.
Multigrid methods: Hackbusch (1985, M).
Software system ELlPACK for solving elliptic equations on computers: Rice and
Boisvert (1984. M) (Ritz method and finite elements. finite differences, SFT. multigrid
met hods. eCI.).
Sofl\\'are system for general classes of mathematical problems: I MSl (1987).
Jfandbook of numerical analysis: Ciarlet and Lions (1988, M), V ols. 1 ff (finite
element method, difference method, etc.).
Supercomputing: M urman (1985. P), lichnewsky and Saguez (1987. S). Martin
( 1988. S).
Numerical recipies-the art of scientific computing: Press (1986, M).
CHAPTER 19
The Galerkin Method for Differential
and Integral Equations,
the Friedrichs Extension,
and the Idea of Self-Adjointness
In mathematics, it's the idea that counts.
Sofia Vassilevna Kovalevskaja ( 1850-189 t)
At the very beginning the given elliptic differential operator is only defined on
a space of twice continuously differentiable functions. Then this operator is
extented to an abstractly defined operator by using a formal closure. The main
task is to show that the extended operator is self-adjoint. In this case it is possible
to apply the methods of John von Neumann (1929).
Kurt Otto Friedrichs (1934)
In 1971, I read the beautiful paper by Fujita and Kato (1964) on the quasi-
linear Navier-Stokes equations and was delighted to find that, properly viewed,
it looked like an ordinary differential equation, and the analysis proceeded in
ways familiar for ordinary differential equations. This is perhaps no surprise to
people in partial differential equations, but my training was in ordinary and
functional differential equations, and my attempts to read partial differential
equations usually became bogged down in technalities.
Many partial differential equation problems can be written as ordinary dif.
ferential equations in Banach spaces, involving unbounded operators. Rewritten
as Volterra integral equations via semigroups, unbounded operators no longer
appear in parabolic problems and the analysis is entirely analogous to the case
of ordinary differential equations. One works exclusively with generalized (mild)
solutions, which often turn out to be classical solutions. The major technical
differences with the case of ordinary differential equations are that we work in
''''0 spaces (or more) and the Gronwall inequality must be modified.
Dan Henry (1981)
The Basic Ideas of the Galerkin Method
In the preceding chapter we have seen how generalized problems and the Ritz
method result, in a natural way, in a connection with variational problems.
101
102
19. The Galerk.n Method ror Differential and Integral Equations
In this chapter we shall show how this discussion can be carried over to a
large class of problems which are not related to variational problems. In this
connection. we make use of the following heuristic method:
(i) Multiplication of the differential equation for u by the functions r E K
and integration over G with subsequent integration by parts yield the
gellerali:ed prohlem.
For integral equations the integration by parts naturally drops out.
(ii) In this connection. it is decisive that the function class K is so chosen that
a sufficiently smooth solution fl of the generalized problem is also a
solution of the original classical problem in the case where the data are
sufficiently smooth (boundary. coefficients. boundary and initial values.
etc.),
(iii) Restriction of II and r in the generalized problem to appropriate finite-
dimensional subspaces yields the Galerkin method as a generalization of
the Ritz method.
It is important for the reader that he learns, by a study of the given examples.
to carryover a given problem: first. formalJy to a generalized form and then
to obtain therefrom the Galerkin method in a simple way. In this connection.
one must always pay attention, especially to (ii).
For variational problems, the Ritz method and the Galerkin method coin-
cide. The generalized problems to be defined later in Chapters 22 through 24
result from the generalized problems given in this chapter, if one, roughly
speaking. replaces the classical derivatives by generalized derivatives. For
the practical realization of the Galerkin method one needs the basis functions
"'.. .... "',.. We give a number of possibilities in the Appendix, A 2 (56) - (60)
(polynomials and so-called finite elements. i.e.. piecewise polynomial functions).
Table 19.1 shows those problems that the Galerkin method leads to. The
Galerkin method distinguishes itself by its great universalit..v. It can be applied
to linear as well as to nonlinear problems. However, this method is not only
of great practical significance but also possesses, for nonlinear problems
within the context of the theory of monotone operators. central theoretical
meaning for obtaining existence statements, as we shall explain later in detail.
I n Section 19.4 we justify the designation projection method for the Galerk in
method.
Table 19. J
()riginal problem
Elliptic differential
equation and integral
equation
Differential equation
linear
Approximate problem
System of equations linear
in R'"
nonlinear
nonlinear
parabolic
hyperbolic
System or ordinary
differential equations
first order
second order
19. The Galerkin Method for Differential and Integral Equations
103
The Basic Ideas of the Friedrichs Extension
In the middle of this chapter. there stand existence theorems based on the
notion of self-adjoint operators and the method of the Friedrichs extension.
Figures 19.1 and 19.2 show important interrelations and applications to be
considered in this chapter. We want to show that:
The Friedrichs e.'(tension and the more general energetic extension of sym-
metric operators are the functional analytic core of mathematical physics and
the germ cell of the theory of monotone operators.
Note that such an elegant functional analytic formulation of mathematical
physics is only possible because we use the notion of generalized solutions.
We first discuss stationary problems which are related to elliptic differential
equations and variational problems. Instead of the original operator equation
(A)
Au = I,
where the linear operator A: D(A) X ... X is symmetric and strongly mono-
tone on the real H-space X, we consider the following two operator equations
{B)
AFu = f
and
(C)
A£u = f
with A c: A F c: A£, i.e., A F is an extension of A and, in turn, A£ is an extension
of A F . Here the Friedrichs extension
A F : D(A F ) X .... X
is linear bijective, self-adjoint. and generally unbounded. whereas the energetic
extension
AI:: X £ .... X
is a linear, strongly monotone homeomorphism on the so-called energetic
space X 1:' In particular, AI: is a bounded operator in contrast to A. To be
precise, we have the following situation:
(a) The energetic space X I: is the completion of the set D(A) with respect to
the so-called energetic scalar product defined by
(u I v)1: = (Auf (')
for all u, v e D(A).
(b) The operator AI: is the duality map of X I: which is closely related to the
theorem of Riesz by Section 18.11.
(c) The Friedrichs extension A F of the original operator A is defined through
the relation
AFlf = Ailf
for all f eX.
104
19. The Galerkin Method for Differential and Integral Equations
parallelogram identity
(the Pythagorean theorem)
!
quadratic minimum problems
!
perpendicular principle
1
I theorem of Riesl I
1
energetic +- energetic extension
spaces of symmetric operators
!
Sobolc\'
spaces
· strongly monotone
operators
I extension of Friedrichs I
. maximal monotone
operators
1
semlgroups
!
parabolic and
hyperbolic differential
equations"
Schrodinger equation
(linear and semilinear)
elliptic boundary
and eigenvalue
problems
variational
problems
Figure 19.1
Note that the range X: of the operator AI; leaves the original space X.
51 nce
D(A) c D(A f ) C XI; £; X £; XI.
In applications to elasticity. the elastic energy of the body is given by
Eel.'" = 2- I (ulu)£.
where II corresponds to the deformation. This motivates the designation
....energetic scalar product." Now to the point.
In contrast to tire original equation (A) above. equations (B) and (C) have
a solution u lor each g;len I e X and I e X:. respectively.
These solutions are called generalized solutions to the original equation (A).
Each solution of (A) is also a solution of (B) and (C). However, the converse
is not always true.
For given I E X. equation (B) is equivalent to the variational problem
(V)
!(III U)E - (II u)x = min!.
U E X E'
19. The Galerkin Method for Differential and Integral Equations
105
self-adjoint operator
bounded
I
unbounded
I
compact
(Hilbert-Schmidt theory)
ex tension
of Friedrichs
1
eigenvalue problems
1
complete orthonormal systems
I
differential
equations in
H-spaces
!
semigroups
1
parabolic and
hyperbolic differential
equations,
Schrodinger equation
(linear and semilinear)
Figure 19.2
which corresponds to the principle of minimal potential energy in elasticity.
In this connection, f is related to the outer forces.
In the special case of the Dirichlet problem, equation (A) corresponds to
the classical problem
(D)
- Au = f on G,
II = 0 on oG,
with D(A) = Co(G) and X = L 2 (G). In Section 19.11, we will obtain that
X E = Wl(G), D(A,) = W 2 2 (G)nX f .
The variational problem (V) above corresponds to
t (.t(D'U)2 - fu )dX = min!, u e X E .
which generalizes the classical variational problem to (D).
Therefore, we have the following two options:
(a) We consider equation (B) and work with unbounded operators in an
H-space X. This will be done in this chapter.
(b) We consider equation (C) and work with bounded operators between the
t\\'O H-spaces X I; and Xl. This will be done in Chapter 22.
106
19 The Galerkin Method for Differential and I nregral Equations
Later on. the approaches (a) and (b) will be generalized to nonlinear maxi-
mal monotone operators and to nonlinear strongly monotone operators..
respectively.
The theory of nlollotolle optJrators can be regarded as a generalizatiolJ of the
'riedrichs extension and the energetic extension to linear 1Ionsymlnetr;c opera-
tors and to nonlinear operators. The theory of linear monotone operators has
the following t\\'O important sources:
(i) A genera' existence principle for the equation All = b.. where A is a linear
continuous strongly monotone operator on an H-space (Theorem 18.F).
(ii) The generation of linear nonexpansive semigroups by maximal dissipative
operators B. i.e... - B is maximal monotone (Theorem 19.E).
In (I) and (ii) \\'e use linear monotone operators which are not necessarily
symmetric. Theorem 19.E shows that the theory of semigroups leads in a
natural way to the notion of monotone operators. The importance of the
theory of semigroups for the mathematical description of evolution processes
in nature \\'ill he investigated in detail in Section 19.17.
We no\\' discuss brieny I'lJIIstationclry problelns which are related to para-
bolic and hyperbolic differential equations. The classical ordinary differential
equation
X'(l) = Bx(t)
has the solution
f I )
.(l) = eB'x(O)..
and
X"(l) + 8 2 x(t) = 0
has the solution
f II )
(t) = (costB)x(O) + B-1(sin Bt)x'(O).
I n Sections 19.15 and 19.16. we show that the same formulas (I) and (II)
describe the solutions of the heat equation and the wave equation. respec-
tively. To this end.. we only need an appropriate definition of /(8), where .r is
a real function and B is a self-adjoint operator. This shows the heal't..\' and
ele/a"('e of functional analysis.
In particular.. if the operator B has a complete orthonormal system of
eigenvectors (II;) with the corresponding eigenvalues (i.i)' then
BII = L ;.;( II; III )14;
i
for all U e D( B)..
and we simply define
f(B)u = L f(;..)(u.lu)u i
;
for all II for which this series converges.
19. The Galerkin Method for Differential and Integral Equations
107
The Basic Ideas for Semilinear Problems
Along with linear equations we also consider semilinear equations in this
chapter. The basic idea is to transform the semilinear operator equation
(E)
Au = f(u)
into the equivalent equation
(E.)
u=A-1/(u)
and to solve (E.) by using the Banach fixed-point theorem. Here, A is a linear
operator and f is a nonlinear operator. In the case of the operator differential
equation
(D) u' = Bu + f(u)
for u = u(t), which corresponds to parabolic and hyperbolic partial differential
equations. we pass to the integral equation
(D.) u(t) = e'Bu(O) + f e"-SIBf(u(s)) ds
and solve (D*) by using the Banach fixed-point theorem. In order to define
the operator e's.. we need the theory of sem;groups. The solutions of (0.) are
called generalized or mild sol utions of (D).
In connection with (E*) and (0.) we encounter the following typical
difficulty:
In order to handle nonlinear problems, it ;s necessary to choose the appropriate
spaces.
We want to discuss this.
(a) We first consider (D.). In order to be able to apply the Banach fixed-point
theorem to (0.).. we need a space in which e,a and f behave in such a
way that the right-hand side of (D.) has the contractivity property. In
Section 19.26 we shall use spaces defined by fractional powers B of B,
which are closely related to Sobolev spaces and to interpolation theory.
(b) We now consider (E*). Let G be a bounded region in R J and let X = L 2 (G).
Set
f(u) = u J .
Then f is not an operator from X to X, since u e L 2 (G) does not always imply
u J e L 2 (G). However, the operator
f: W 2 1 (G) -+ L 2 (G)
is locally Lipschitz continuous. This follows from the continuous embedding
108
19. The Galerkin Method for Differential and Integral Equations
W 2 I fG) £; L 6 (G) which is a special case of the general Sobolev embedding
theorems. This property of f will be used in Section 19.24 in order to solve
the semilinear wave equation
A 2 · J
II" - uU + m u = -)..u
which is related to mesons of mass m.
Generally. one uses the Sobolev embedding theorems. the Gagliardo-
Nirenberg inequalities. and the abstract methods of interpolation theory in
order to investigate the properties of nonlinear operators. As for (D.), it is
frequently necessary to use more than one space.
As we shall see in this section, Sobolev spaces are closely related to:
(i) the energetic spaces of Friedrichs; and to
(ii) the spaces defined via fractional powers of operators.
In the following Sections 19. t through 19.4 we use cont'ention (7) in Chapter
18. Furthermore. we restrict ourselves to homogeneous boundary conditions.
Inhomogeneous boundary conditions can be reduced to homogeneous bound-
ary conditions by using Remark 18.8.
19.1. Elliptic Differential Equations and
the Galerkin Method
EXAMPLE 19.1. We consider the boundary value problem
Lu =.f on G.
u = 0 on cG.
( I )
with
N
Lu = -6u + L aiDiu + aou
i1
and a i = ai(x). Multiplication by t' E C(G) and integration yield
L (LII - f)vd:c = 0 for all v E CO'(G). (2)
After integration by parts we obtain the generalized problem corresponding
to ( I ):
r ( .f DjllDjt' + (ajDjll)r- + ao"t' - ft. ) dx = 0
JG 1=1
U = 0 on eG.
for all v E C( G),
(3)
The designation "generalized problem" is justified by the fact that, for
19.1. Elliptic Differential Equations and the Galerkin Method
109
u e C 2 ( G) and Qo, ..., aN' f e C(G), equation (3) implies (2) after integration
by parts, and, in turn, equation (2) implies (I) by Proposition 18.2.
Remark 19.1. The choice of the generalized problem is not unique. It would
suffice that one requires (3) for all ve M with M = {v e Ct(G): v = 0 on aG}.
The final generalized problem in Section 22.6, for which we prove existence
statements there, reads as follows:
We seek a function u e l(G) so that (3) holds for all v e 21(G).
Note that C:(G) as well as M are dense in W 2 t (G).
Remark 19.3. In the general case, problem (I) does not arise from a variational
problem. In Example 41.7 we shall show that for this it is necessary and
sufficient that the integral on the left-hand side of (3) is symmetric in u and v
for f == O. This means that at = ... = aN == O.
In order to construct an approximate solution of (1) with the aid of the
Galerkin method, we choose functions WI' . . ., W. with
Wi = 0 on oG
for all k
and make the trial
..
u.. = L Cb W ..
.=t
It is thus guaranteed that u.. = 0 on aG. If we now replace u in (3) by u.. and
require that (3) holds only for all
v e span { WI' . . . , w..},
then we obtain the so-called Galerkin equations
r ( t D,u/lD,w j + (a,D,u/I)w j + aOu/lwj - !Wj ) dX = 0, j = 1,..., n. (3.)
JG i-I
This is a linear system of equations for the real numbers C I.., ..., c.... For
ao = ... = aN == 0, equation (3*) is identical to the Ritz equations (16) in
Corollary 18.7.
The following example concerns natural boundary conditions.
EXAMPLE 19.4. We study the second boundary value problem
Lu = f on G,
au = 0 on oG,
on
(4)
whe re L has the same meaning as in Example 19.1. Multiplication by v E
CaJ(G) and integration by parts yield the generalized problem corresponding
to (4):
r ( t D,uD,v + (a,D,u)v + aouv - !V ) dX = 0 for all ve COO(G). (5)
J G ,-.
110
19 The Galerkin Method for Differential and Integral Equations
f-or smooth functions we obtain backwards equation (4) from (5), For
example. let U E C 2 (G) and a o , ,. '. aN' f E C(G), Then (5) implies
f (Lu - .r)r c/x + f () dO = 0 for all (' E C 1 (G), (6)
(i i'JG ( n
after integration by parts. This yields (4) as in the proof of Proposition 18.9.
In contrast to Example 19.1. here we multiply by I' E CX(G) and not by
r E C (G). In the latter case we would not obtain the equivalence of (4)
and (5) for smooth functions. It is remarkable that the boundary condition
u('":U/(111 = 0 on i'G" in (4) docs nol appear in the generalized problem (5). It is
a so-called natural hou"dar." condition.
In order to formulate the Galerkin method for (4)" we choose functions
"'.. . ... "'" and proceed from the trial
"
1'" = L ('." U'A:.
'=1
If we replace II in (5) by II" and require that (5) holds for all
r E span { "'. . . . . " "'" }"
then we obtain for U" the linear system of equations (3-) to determine the real
numbers ('.". .... ClIft. The only difference from Example 19.1 now consists of
the fact that we place no boundary condition on \\'"
As our next example. we consider a nonlinear problem.
EXAMPLF. 19.5. Suppose. for given p > 2. the boundary value problem is
.'i
- L D;UDiul"- 2D.I') + aou =.r on G.
i=1
(7)
II = 0 on iG.
Multiplication by ,. E Co (G) and integration by parts yield the generalized
problenl corresponding to (7):
f ( f I Oi"l p- 2 Oi"Oj (' + QolII' - ft' ) dx = 0
G ,:.
for all I E C o '( G).
(8)
II = 0 on ('G.
For" E C 2 (G). equation (7) easily follows from (8) analogously to Example
19.1. To formulate the Galerkin method we choose functions "'I" . . .. "'II with
"', = 0 on?G
and proceed from the trial
for k = I, . .. . "
"
II" = L e". "',.
'=1
Then we have II" = 0 on tG. If. as usual. we replace u in (8) by II" and require
19.2. Parabolic Differential Equations and the Galerkin Method
1 t I
that (8) holds for all v E span {"'.,. . . , ",..}, then we obtain the Galerkin equa-
tions for j = 1. . . . , n:
f ( t ID,Il.I P - 2 D,Il.D,w j + Qoll.W j - !Wj ) dX = O. (8.)
G ,-.
This is a nonlinear system of equations for determining the real numbers C III ,
.. . . C.... of the form
9 J( C.... . . · , c....) = O. j = I,.... n.
We occupy ourselves with the interative solution of such equations in Sections
26.4 and 26.5. We investigate a difference method for (8) in Chapter 35.
19.2. Parabolic Differential Equations and
the Galerkin Method
EXAMPLE 19.6. Let QT = G x ]0. T[. We consider the parabolic initial-
boundary value problem
u,(x, t) - Au(x. t) = f(x, t) on QT' (9a)
u(x, t) = 0 on cG x [0, T] (boundary condition), (9b)
u(x.O) = uo(x) on G (initial condition) (9c)
(Fig. 19.3). If we interpret u(x, t) as the temperature at the point x E G at time
t, then (9c) means that the temperature at the initial time t = 0 is prescribed.
Condition (9b) means that the temperature on the boundary cG is equal to
zero for all times t E [0. T]. The function f describes an external heat source.
We are given f and Uo. We seek u. A physical motivation for (9) can be found
in Section 69.2.
Multiplication of(9a) by v E CO(G) and subsequent integration over G yield
L (II, - All - f)vdx = 0 for all ve CO(GI. (10)
t
QT
....- - - - --..
G
Figure 19.3
112
19. The Galerkin Method for Differential and Integral Equations
Integration by parts yields the generalized problem corresponding to (9):
d - f lI(X.I)I'(X) dx + f ( t DjuD j (' - f t ' ) dx = 0 for all (: E Co (G),
t G G i=1
u(x, t) = 0 on cG x [0, T],
lI(X, 0) = Ilo(.) on G.
( II a)
(lIb)
( II c)
We note that the function l' depends only on the space variable x, but not on
time t.
Conversely. for sufficiently smooth" and J, from the generalized problem
( II at one first obtains equation (10) by means of integration by parts and (10)
implies the original classical problem (9), according to Proposition 18.2.
In order to construct an approximate solution U II with the aid of the Galerkin
nlethod. we make the trial
"
lI,,(x, t) = L c,,,(t)w,,(x).
,:: 1
In contrast to Section 19.1, the unknown real coefficients Cle" now depend on
time I. If we replace u in the generalized problem (1Ia) by u" and require that
(I J a) holds for alll e span {"'. .. . . . K',,}. then we obtain the Galerkin equations
for j = 1. . . . . n:
f. c,,(t) f "."jdx + f. c... f t D,w.DjwJdx = f fKJd.. (12a)
'-=1 G '=1 G ;=1 G
This is a linear system of first-order ordinary differential equations for the real
functions (' I", . . . . c "'" We must still add an initial condition
C j.. (0) = 2 i" , j = I,..., n.
This must correspond approximately to the initial condition
u(x,O) = uo(x) on G
of the original problem. For this we choose a sequence (u"o) with
( 12b)
"
u"o(x) = L ,,, M'.(X)
A:=I
so that (11..0) converges to Uo as n -+ 00.. and indeed does so in- the norm of
L 2 (G). i.e..
t (u"o - uo)2 dx -+ 0
Then the equation
as n -. OCt
u,,(X,O) = u"o(x) on G
corresponds to (12b). An especially favorable choice of u"o results from the
t 9.3. Hyperbolic DilTerential Equations and the Galerkin Method
113
requirement
t (uo(x) - U.. o (X»2 dx = min!. (13)
which corresponds to the least squares method. If we write (13) in the form
g(<< 1,., · · · ,«,.11) = min!,
then from the necessary extremal condition
og == 0, . t
J = ,... , n,
o'J.J"
we obtain a linear system of equations to determine 2.", ..., 2"". In order to
satisfy the boundary condition (11 b) also, i.e.,
u..(x, t) = 0 on oG x [0, T],
we choose w, so that
w,(x) = 0 on cG
for k = 1, . .. , n.
Remark 19.7. If one replaces u in (9) by a nonlinear differential operator,
then there arises in place of the Galerkin equation (12a) a nonlinear system of
ordinary differential equations, but for this one finds in the standard literature
well worked-out approximation methods for numerical computations.
The exact treatment of Example 19.6 will be considered in Section 23.8.
19.3. Hyperbolic Differential Equations and
the Galerkin Method
EXAMPLE 19.8. Parallel to Example 19.6 we consider the hyperbolic initial-
boundary value problem
u,,(x, t) - Au(x, t) = f(x, t) on QT' (14a)
u(x, t) :aI 0 on oG x [0, T] (boundary condition), (14b)
u(x,O) = uo(x) on G (initial conditions), (14c)
u,(x, O) = u. (x) on G.
The functions f, U o , U 1 are given. We seek u.
If we interpret u(x, t) in the case where G c R 2 as the vertical displacement
of a membrane at the point x e G at time t (Fig. 19.4), then (14c) means the
prescription of the displacement U(x, 0) and the displacement velocity u,(x,O)
at the initial time t == O. Condition (14b) means that the membrane remains
tightly coupled with the boundary for all times t e [0. T].
Multiplication of equation (14a) by v e C:(G) and integration by parts yield
114
19. The Galerkin Method for Differential and Integral Equations
" lI(f, I)
G
Figure 19.4
the /elJerali:ed prublenJ corresponding to (14):
Jl f f ( N )
J "" 2 U(X,I )r(x) dx + .L O;IlD;f - ft' dx = 0
t (i (j , :: 1
U(.X. t) = 0 on cG x [0, T].
u(x.O) = "o(.X) on G.
1I,(x,O) = II 1 (x) on G.
for all l' E C(G).
( 15a)
( 15b)
( I 5c)
One again obtains (14). for sufficiently smooth u and f. backward from (15),
by means of integration by parts and an application of Proposition 18.2.
I n order to construct an approximate solution with the aid of the GalerkilJ
,nethod. we again proceed from the trial
n
11,,(X. t) = L ("n(l)"',(.x)"
';1
If we replace II in the generalized problem (15a) by U,. and require that (15a)
holds for all r E span {,\'.'".". ,,",,}. then we obtain the Galerkin equations for
j = 1. "" . ""
II f " f ... f
L c.',,(t) "", \\j (Ix + L C,,,(lt L Oi\,'.Oi"1 dx = f"j dx. (16at
. r I (i . :1 I G i r..q G
This is a linear system of second-order ordinary differential equations for the
real functions ("n' .... C""" To this we add the initial conditions
('j"(O) = ':Xj'" (';,,(0) = Pj", j = I. . . . . ,1.
These are to realize (15c) approximately. To this end. we set
( 16b)
"
finO = L Ic" "'Ic'
'=1
"
II" I = L Pie" "'Ie .
Ice I
Then ( 16b) means that
II,.(X.O) = U,.o(x).
eu
--,.(x.O) = U"I(")'
( t
19.4. Integral Equations and the Oalerkin Method
11 S
As in Example 19.6, we choose (u"o) and (U"I) so that
""0 -. "0 and U"I -. "I in L 2 (G) as n -. 00.
In particular. one can choose u"o and U"1 as solutions of the corresponding
minimum problem (13). In order to fulfill the boundary condition (I Sb), i.e.,
",,(x, t) = 0 on aG x [0, T],
we choose Wi so that
w,(x) = 0 on cG for k = 1, .. . , n.
The exact treatment of this problem follows in Section 24.2.
19.4. Integral Equations and the Galerkin Method
EXAMPLE 19.9. We consider the integral equation
u(x) - t k(x, y)u(y) d}' = f(x) for all x e G.
In this connection, let k E L 2 (G x G) and f e L 2 (G), i.e., the integrals
f k(., y)2 d. dy and f f(X)2 d.
GxG G
exist. Multiplication of (17) by v E CO(G) and integration over G yield the
following problem that is equivalent to (17):
f uvd.'( - f k(x,y)u(y)v(x)dydx = f fvdx
JG JGxG JG
( 17)
for all v E Co(G).
( 18)
Here we take the Fubini theorem A 2 (23) into account. We designate (18) as
the generalized problem corresponding to ( 17).
In order to construct an approximate solution with the aid of the Galerkin
method we make the trial
"
"" = r. Ct" Wt.
.-1
If we replace U in the generalized problem (18) by II" and require that (18) is
valid only for all v E span {WI'. . . , w,,}, then we obtain the Galerkin equations
for j = I. . . . , n:
f u" w} dx - f k(x, y)u.(y)w}(x) d}' dx = f f¥tJ d.'(. (IS.)
JG JGxG G
This is a linear system of equations for the real numbers C I ", ..., C"".
In conclusion, having this example at hand, we shall explain why the
Galerkin method is also designated as a projection method. To this end we set
116
19. The Galerkln Method for Differential and Integral Equations
x = L 2 (G) and write (17) in the form
u - Ku = f,
Let "'I' ..., "'" E X and let
14 e X.
( 19)
X" = span {,,'.,..., "',,}.
Then, by Section 21.12 below, there exists an orthogonal projection operator
P,,: X -+ "\',,
onto X". The operator PIt is self-adjoint. i.e.,
( PIt U f (") = (14 I PIt I' )
for all u, reX.
and P"II = II holds for all 14 e X". Instead of the original equation (19), we
consider the approximate equation
P"(Il,, - K u,,) = P",f,
We shall show that (19a) is equivalent to
(II" -- Ku,,'''}) = (/1"j), II" EX", j = J,.... II. (19b)
These are exactly the Galerkin equations (18*) above. In fact, first of all (19a)
is equivalent to
f PIt (II" - K u" )I f) = (P" .rlt' )
This is equivalent to
(u" - Ku"IP"l') = tfIP"I')
and thus equivalent to
fll" - KII"'r) = (fft')
which is equivalent to ( 19b).
u" EX".
( 19a)
for all l' EX.
forall t'EX,
for all rEX",
19.5. Complete Orthonormal Systems and
Abstract Fourier Series
The following results on orthonormal systems are generalizations of the
classical Fourier expansion and other expansions with respect to important
special functions of mathematical physics. The following propositions will be
used frequently in connection with eigenfunction expansions, functional cal-
culus, convergence of the Ritz and Galerkin methods, etc. We make the
following assumption:
(H) Let X be an H-space over K = IR. C, and let {u,,} be an at most countable
orthonormal system in X, i.e.,
( 11. 1 14' ) = b .
. ') .')
for all i, j.
19.5. Complete Orthonormal Systems and Abstract Fourier Series
117
We consider the partial sum of the Fourier series
III
S'" = L (u..lu)u...
..-I
Our goal is to study the convergence of the Fourier series
:10
U = L (u..lu)u...
..=1
(20)
Definition 19.10. Assume (H). The countable orthonormal system {u..} is called
complete in X iff (20) holds for all u e X. The finite orthonormal system {u..}
is called complete in X iff it is a basis in X.
We want to motivate (20). In order to find a good approximation of u by
a linear combination Cl UJ + ... + c"'u"', we set
... 2
f(c 1 ,...,c...)= u- L C"U"
..=1
and consider the minimum problem
f(c 1 ,..., C",) = min!,
C 1 , . . . , c... e K,
(21 )
according to the least squares method of Gauss.
Proposition 19.11. The unique solution of(21) is given by the Fourier coefficients
c" = (u..lu).
PROOF. A short computation shows that
III '"
f(c) = lIull 2 - L l(u..lu)1 2 + L I(u..lu) - C..,2.
..-1 ..-I
o
In particular, this implies
lIu - s...11 2 f(c)
and the Bessel equation
for all c e K'"
(22)
...
lIu - s...11 2 = lIull 2 - L l(u"lu)1 2
..-I
for all u eX.
(23)
Hence we obtain the important Bessel inequality
...
L l(u"lu)1 2 S 111411 2
,,-I
for all u EX.
(24)
Proposition 19.11 (Convergence Criterion). Assume (H). Then the series
L C.."..,
..
c.. e I<
is convergent iff L.. I C ..1 2 is convergent.
118
19 The Galerkin Method for Differential and Integral Equations
PRCX)F. This follows from
, m...1t 1 2 ",+It
; L e"u" I = L le,,1 2 .
":a,,, n=",
o
Thus.. by (24}.. the Fourier series on the right-hand side of (20) ;s ah,'ays
cotJrergent. The following theorem contains conditions which ensure that the
limit of the Fourier series is equal to u.
Theorem 19.A fCompleteness Criterion). Assunre (H). Then the following four
condit iotls are "Illtually equir-'alenl:
(i) 7'he orthollorlnal systenl {II,,:- is conlplete.
fii) Tire linear Irull 0.( II,,} is dense in X.
{iii} F or aI/II EX.. there holds the Par.fiel'al eqllation
1111" 2 = L 1(11" III )1 2 .
"
(iv) For II gir'en dellse set D ;11 X it .lolloM's.. .{ro," l' E D and (u,,1 l') = 0 for all n.
that,. = o.
PR()()F. (i) <=> (ii) <=> Oii). This follows from (22) and (23).
(i) (iv). Use (iii).
(iv)::>(ii). Let S = span{II t ..u 2 .....}. Denote by Sl the orthogonal comple-
ment to S. By (iv).. S 1 (\ D = {O}. Since D is dense in X. the set Sl (\ D is dense
in Sl. Hence Sl = {O}. Now it follows from X = S Sl that S = X. i.e.. S is
dense in x. 0
As a typical example we consider the classical Fourier expansion. To this
end, let X = L 2 (O. 2n) with the scalar product
(ult.) = So2" UI'dx.
Moreover.. we set U o = L Vi 2n and
I
112. = ---;:- cos kx..
,11[
1 . k
1I 2t ... I = r sin x.
y'n
k = I. 2. . . . .
Then
(u", I u,,) = t5",,,
for all m. n.
i.e... {"m is an orthonormal system in X.
Let Cp[O.. 2n] denote the set of all the continuous functions II: [0.. 2n]
!R with ufO) = u(2n). We equip C p [0.2n] with the usual norm lIull =
max OSX $ 2ft lu(x)l. By the classical theorem of Fejer. for each u E Cp[O. 2n].. we
have the uniform convergence
So + ... + s", - 1 ....
-- ----- - u on [O,2n].
m
19.6. Eigenvalues or Compact Symmetric Operators (Hilbert-Schmidt Theory) 119
where
III
SIll = L (u"lu)u".
,,-0
Hence the linear hull of {u",} is dense in C,,[O, 2n]. Since C,,[O, 2n] is dense in
X = L 2 (0.2n). the linear hull of {u",} is also dense in X. By Theorem 19.A,
{"",} forms a complete orthonormal system in X. i.e.,
f 2.
Urn Iu - s",1 2 dx = 0
'" .. « 0
This means that, for each u e L 2 (0, 2n), the classical Fourier series converges
to u in the sense of the space L 2 (0, 2n).
for all u e L 2 (O, 2n).
19.6. Eigenvalues of Compact Symmetric Operators
(Hilbert-Schmidt Theory)
We consider the eigenvalue problem
Bu = iu,
u EX. ) E K.
(25)
We want to prove the existence of eigensolutions {u". ).,,} and the convergence
of the Fourier series
at
U = L (ulllu)u"
,,-I
for all u eX.
(26)
Definition 19.13. A linear operator B: D(B) c X -+ X on the H-space X is
called symmetric iff D(B) = X and
(Bult) = (ul Bv)
for all u, v E D(B).
The operator B is called skew-symmetric iff D(B) = X and
(Bulv) = -(uIBv)
for all u, t' E D( B).
We assume:
(H) The operator B: X -+ X is linear, symmetric, and compact on the H-
space X over K = R, C. Let B :F o.
Theorem 19.8. Suppose that (H) holds with dim X = 00 and Bu = 0 implies
u = 0, i.e., zero is not an eigenvalue of B. Then:
(a) The operator B has a countable system {u". ).,,} of eigenso/utions containing
all the eigenr,alues of B.
(b) All the eigent.alues ;" are real and of finite multiplicity, and A" -+ 0 as n -+ 00.
(c) The e;gent'ectors {u,,} form a complete orthonormal system in X, i.e., (26)
holds.
120
19. The Galerkin Method for Differential and Integral Equations
This theorem will be proved in the next section together with the following
more general result.
Proposition 19.14. Suppose that (H) holds. Then:
(a) The operator B has an at most countable system {u... ;....} of eigensolutions
cO'llaining all the nonzero eigent'alue. of B.
(b) All the eigellt'alues i.,. are real. There exists at least one nOllzero eigentalue
o.f B alld all the nonzero eigenl'alues of B have finite multiplicity.
(c) The eige,u'ector. Il..} .form an orthonormal system in X and
Bu = L i..,.(u..lu)u,. for all II E X. (27)
"
If" i. m # O. the" all the ".. corresponding to ;"m form a basis in the eigenspace
to i. m .
(d) If Bu = 0 implies u = 0, then (26) holds.
Corollary 19.15. f (H) holds. "'here X is separable, then the operator B has an
at mo.t countahle complete orthonormal system of eigent'ectors in X.
PRoof- ()f COROLLARY 19.15. We will use the fact that, in each separable
nonzero H-space. there exists an at most countable complete orthonormal
system. This will be proved in Problem 19.16.
Let N = {u E X: Bu = O} and suppose that N #: {O}. Then the linear sub.
space N is closed. and hence N is an H-space. We consider the orthogonal
decomposition
X=N(f)NJ..
Let {I'm} be a complete orthonormal system in N. Then Bv", = 0 for all m, i.e.,
I'm is an eigenvector with the corresponding eigenvalue ).. = O. Furthermore.
B(NJ.) c N 1. In fact. if II E N 1 , then
(Bull') = (uIBt,) = 0 for all r E N,
i.e." Bu E N 1. By Proposition 19.14(d). B has a complete orthonormal system
{II,,} of eigenvectors in the H-space N 1. Thus, all the U"9 v'" form a complete
orthonormal system of eigenvectors of the operator 8 in X. 0
I n order to obtain another important consequence of Proposition 19.14, we
consider the equation
;'u - Bu = f,
U E X.
(28)
Proposition 19.16 (The Fredholm Alternative). Let ;.. #: 0 and assume (H). Then:
(a) If;..;s not an eigenvalue of B, then for each f EX, equation (28) has a unique
solut ion.
19.7. Proof of Theorem 19.8
121
(b) If; is an eigenvalue of B, then equation (28) has a solution iff
<flu) = 0
for all eigenvectors u of B corresponding to )...
PROOF. Let Bu" = )."u". Then
(unIBu) = (Bunlu) = A"(u,,Iu).
From (28) we obtain the key formula
(; - A")(u,,lu) = (unlf).
(29)
Ad(a). Let ; #= A" for all n. Let u be a solution of (28). From (27) and
(29) we obtain
1 ( )
U = 1 f + l _ l" (u"l/)u" ·
Conversely, from Lit l(u n l/)1 2 II f 11 2 and lIt -+ 0 as n -+ 00 we obtain
(30)
;" 2 f 2
). _ )." l(u,,1 )1 < 00.
Hence the right-hand side of (30) converges and represents a solution of (28).
Ad(b). Let l = l", for fixed m. Let u be a solution of (28). It follows from
(29) that (u"lf) = 0 for all u" corresponding to )",.
Conversely, suppose that (u"lf) = 0 for all u" corresponding to ;",. Then (30)
represents a solution of (28) in case we do not sum over all those n which
satisfy A" = )"",. 0
(31 )
19.7. Proof of Theorem 19.B
The basic idea is to use a variational principle in order to obtain the largest
eigenvalue of B.
Lemma 19.17. Let B: D(B) X .... X be a linear symmetric operator on the
H-space X over K = R, C. Then:
(a) (Bulu) is real for all u e D(B) and all the eigenvalues of B are real.
(b) Eigenvectors of B with different eigenvalues are orthogonal.
(c) If {u,,} is a complete orthonormal system of eigenvectors of S, then the
sequence {),,} of the corresponding eigenvalues contains all the eigenvalues
of B.
(d) If D(B) = X and B is continuous, where X :I: {OJ, then
sup 1(8ulu)1 = 11811.
'."jl-t
(32)
122
19 The Galerkin Method for Differential and Integral Equations
The proof will be given in Problem 19.3. We now consider Proposition 19.14
in the case where dim X = ,x. The simpler case dim X < oc can be treated
analogously.
Step I: Variational problem. We consider the maximum problem
I(Blllu)1 = max!,
"ull = I.
(33)
By (32)" the maximal value is equal to IIBII. Recall B #: O. Let (t'lI) be a sequence
with I; f" H = I for all 11 and
If 81',,11',,)1 --+ it 811
as n --+ x.
By passing to a subsequence, if necessary, we obtain
( Br'" II',,) --+ i..
as 11 --+ 'X.
Here.
1i.11 = IIBII
and II BI'" If < I i.. I for all II. Hence
UBI'" - i..I'"n 2 = IIBt',,/l2 - 2i. 1 (Bl'"lt',,) + ;.:-+0
as '1 -+ OC.
The operator 8 is compact. Thus there exists a subsequence, again denoted
by (8r',,). such that (8r',,) converges. Since 81'" - )'1 ('II --+ 0 and ;'1 :#= 0, the
sequence (1',,) also converges to a certain u I' This implies
B II. = i.. Il, .
"11.11 = I.
and ". is a solution of f 33).
Step 2: Induction. Let
Y = {liE X: (ulu l ) = O}.
Then Y is a linear closed subspace of X. The key to our induction argument
is the relation
B( Y) £; Y,
i.e.. Y is in\ariant Ytith respect to B. In fact, it follows from u E Y that
(Blltu.) = (uIBu.) = ;..(ulu.) = 0
and hence Bu E r. We assume that B 0 on Y. Otherwise, we finish our
procedure.
We now apply Step I to the restricted operator B: Y --+ Y. This way we
obtain
BII2 = A 2 U 2 '
U2 E y,
where 1;'21 is the norm of B on Y. This implies
\:81'11 s 1;2111t'1i
and 1;21 < liB!; = 1;..1.
for all t' e Y
(34)
19.7. Proof of Theorem 19.8
123
We now set Z = {U e Y: (UIU2) = O} and continue this procedure. This way
we obtain the eigensolutions
Bu" = ).."u", PI = I, 2, ...,
I )...1 I).. 2' ... > 0,
and {u,,} is an orthonormal system.
Step 3: We show that ;.." ..-. 0 as n ..-. 00. Otherwise, the sequence (A.;. U,,) is
bounded. The operator B is compact. Since B(l';'u,,) = u", the sequence (u,,)
contains a convergent subsequence. But this is impossible, since (u"lu",) = 0
and
lIu" - U'" 11 2 = lIu" 11 2 + lIu", 11 2 = 2
Step 4: We show that
for n #: m.
Bu = L )..,,(u,,1 u)u".
,,=.
(35)
To this end, let
'"
w'" = u - L (u"lu)u".
,,-I
We define V = {u eX: (ulu i ) = 0,; = l,...,m}. By Step 2, B(V) C V and
II Bvll S IA",+alllvll
for all t" e V.
Obviously, "'", e V, and hence
II Bw", II S I;..", + .111 w'" II.
The Bessel equation (23) yields
IIw",1I s lIuli.
This implies Bw", ..-. 0 as m ..-. oc, and hence we obtain (35).
By Proposition 19.12, each Fourier series is convergent. Hence
\V'" ..-. w
as m..-. 00
and B,,' = o. If Bw = 0 implies w = 0, then "'", -+ 0 as m -. , i.e., the ortho-
normal system {I'",} is complete.
Step 5: We show that the sequence ()",,) contains all the eigenvalues ).. :F- 0
of B. In fact, it follows from
Bu = )..u, u 0, ). 0,
and ).. A." for all n, that (u,,1 u) = 0 for all n, according to Lemma 19.17(b).
By (35), Bu = o. This is a contradiction.
An analogous argument shows that all the eigenvectors corresponding to
i.", :#: 0 form a basis in the eigenspace to the eigenvalue A.",. Since ).'" -. 0 as
m -+ 00, each A.", :#: 0 has finite multiplicity.
The proof of Proposition 19.14 and Theorem 19.B is complete.
124
19, The Galerkin Method for Differential and Integral Equations
19.8. Self-Adjoint Operators
While working with operators A: D(A) £ X ...... X, which are not defined on the
total space X, observe carefully the specific form of the domain of definition
D(A) of A.
Definition 19.18. Let A: D(A) s; X -+ X be a linear operator on the
H-space X with D (A) = X. The adjoint operator A-': D(A-') s X -+ X
is defined by
(llIAI) = (A.'ull').
To be precise, we first define U E D(A.'} iff U E X and there exists an f E X
with
( U I A I') = (f I t')
for all l' E D(A),
(36)
and, for II E D(A* ), we then set A.'II = f
The operator A is called self-adjoint iff A = A-', and A is called
ske'-adjo;nt iff A = - A-'.
The definition of A*' makes sense. In fact, if (36) also holds for 1. then
( r - .flt,) = 0 for all l' E D(A). Since D(A) is dense in X, we obtain f = 1
In the following we agree to speak about the existence of the adjoint
operator A *' only in the case where A is densely defined. We use the
notation A.' in order to emphasize the difference between the adjoint
operator A. on X and the dual operator A* on the dual space X-.
However, for simplicity of notation, and if it is stated expressively, the
adjoint operator to A is denoted by A-. In particular, this convention will be
used throughout this chapter.
We write
A c B
iff the operator B is an extension of the operator A, i.e., D(A) c: D(B) and
Au = Bu on D(A). Using Definition 19.13, it follows easily from (36) that the
following hold:
(i) A ;s sy,nnJetr;c iff A s; A..
(ii) A is ske\\'-symmetr;c iff A s; - A *.
This immediately implies the following:
(iii) If A is self-adjoint, i.e., A = A., then A ;s symmetric.
(iv) If A ;s skew-adjoint, i.e., A = - A., then A is skew-symmetric.
Generally, the converse of (iii) and (iv) is not true. However, it follows
from (i) that each symmetric operator A: D(A) c: X -+ X with D(A) = X
is self-adjoint. In this special case the operator A is also continuous, by
Problem 19.5e. But note that, in the general case D(A) X, the self.
19.8. Self-Adjoint Operators
125
adjointness of A is a much stronger property than the symmetry of A.
This difference plays a fundamental role in the theory of partial differen-
tial equations. In particular, self-adjoint operators are not necessarily
continuous. However, we have the following weaker result:
(v) Self-adjoint and skew-adjoint operators are graph closed.
This follows from the fact that adjoint operators are always graph closed, by
Problem 19.5a. Finally, we mention the following result:
(vi) If A: X -+ X is a linear continuous operator, then D(A.) = X and
A.: X -+ X is also linear and continuous.
To prove this we use the Schwarz inequality
(S)
l(uIAv)' s !lull nAvll s lIuUIIAU IIv"
for all u, veX.
Thus, for each fixed u E X, the mapping v......(uIAv) is a linear continuous
functional on X. By the Riesz theorem in Section 18.11, there exists an f e X
with
(uIAv) = (flv) for all v E X
and IIfll s nAliliull, by (5). This implies A.u = f and IIA.ull s; nAliliuli for
all u E X.
Let X and Y be two H-spaces over K = R, C. Then the operator U: X -+ Y
is called unitary (or an H-isomorphism) iff U is linear, bijective, and
(Uul Uv) = (ulv) for all u, VEX.
For a linear operator U: X -+ Y, the following two conditions are equivalent:
(i) U;s unitary.
(ii) U is a normisomorphism, i.e., U is surjective and II Vult = lIuli for all
u EX.
The proof was given in Problem 18.4a.
For a linear operator U: X -+ X, the following two conditions are equivalent:
(a) U is unitary.
(b) U.U = UU. = I.
To prove this first suppose that (a) holds. Then (Uul Uv) = (ulv) implies
(U.Uulv) = (ulv) for all u, veX,
i.e., U.U = I. Furthermore, we have UU.u = UU.UU- 1 u = u,i.e., UU. = I.
Conversely, suppose that (b) holds. From U U. == I it follows that D( U.) ==
X and U is surjective. Moreover, U.V = I implies that U is injective (i.e.,
Vu == 0 implies u == 0) and that (Uul Uv) = (ulv).
126
19. The Galerk.n Method for Differential and Integral Equations
19.9. The Friedrichs Extension of
Symmetric Operators
Let A: D( A) c X -+ X be a linear symmetric operator. We consider the
equation
Au = f. u E D(A). (37a)
If this equation corresponds to a classical boundary value problem for a linear
elliptic equation. then. as a rule, problem (37a) is not solvable for each .r E X.
Therefore. along with (37a). we study the following two generalized problems:
A ,-1' = r. II E D ( A F ). ( 37 b )
filiAl') = C/lt') for fixed II E X E and all l' E D(A). (37c)
Here A c A F . Indeed. we shall show that (37b) and (37c) are equivalent and
that. for each f EX. both equations have a unique solution u. This solution
can also be obtained by the following variational problem:
2 - I ( u I u) f - Cf I u) = m in!. u E X E. ( 38)
We make the following assumption:
(H) The linear operator A: D(A) c X X is symmetric on the real H-space
X with D( A) = X and A is strongly motJotone, i.e.,
(Alii") > c lIull 2
for all II E D(A) and fixed c > o.
(39)
Our goal is to construct the extensions
A c: A F c: Al'
where A F is called the f"riedrichs extension and A[ is called the energetic
extension of A. Note that the range X; of the operator A E : XI:'-' X; leaves
the original space X. The so-called energetic .'ipace X E is the completion
of D(A) with respect to the so-called energetic scalar product (ulv)[ = (Ault').
This space will be studied in greater detail in Section 19.10.
Theorem 19.C (Friedrichs (1934). Suppose that (H) holds. Then:
(a) There exists a self-adjoint e:<ten.ion A F : D(A F ) c: X X of the opera-
tor A \\,;th D(A F ) c X E C X and
(AFulu) > f lIull 2 .for all u E D(A F ).
(b) Tile in,'erse operator A i I: X X exists and is linear, continuous, and
self-adjoint. Consequently. .for each f e X. equation (37b) has a unique
solution.
(c) The operator Ail: X -.. X E is linear and continuous.
(d) The embeddings X E C X s; X; are continuous.
(e) The operator A F has the extension
A E : X E -+ Xl,
19.9. The Friedrichs Extension of Symmetric Operators
127
where AE; ;s the duality map of X E ' i.e., A£ a linear homeomorphism with
(AE;u, u) = lIulil
for all u e XE;.
(40)
Moreot'er,
AF'f == A£'f for all f e X.
(f) If the embedding XE; c X is compact, then the operator A f ': X... X is
compact.
Corollary 19.19. For git'en / e X, problems (37b), (37c), and (38) are mutually
equit,alent. Each solution of the original problem (37a) is also a solution of (37b),
(37c), and (38).
The proof of Theorem 19.C will be given in the next section.
PROOF OF COROLLARY 19.19. From A C A, we obtain (37a) => (37b). More-
over, it follows from
(AFulv) = (uIAFv) = (utAv) for all ve D(A)
that (37b) => (37c). In the next section we shall prove that (37c) => (37b) and
(37b) <:> (38). 0
We now consider the eigenvalue problem
Au = Il U + f,
u e D(A), Il e R.
(41 a)
along with the two generalized problems
A,u = IlU + f. u e D(A,), (4Ib)
(u I A v) = JJ (u I l') + (f I v) for Ii xed u e Xl and all v e D (A). (41 c)
By Corollary 19.19, problems (41 b) and (41c) are equivalent, and each solution
of (41 a) is also a solution of (41 b) and (4Ic).
Corollary 19.%0 (Eigenvalue Problem). Suppose that (H) holds with dim X =
x), and that the embedding XE; X is compact. Then:
(a) The operator A,: D(A F ) e X... X has a countable system of eigensolutions
{u". J,l,,} which contains all the eigenvalues of A F .
(b) All the eigenvalues of A F have finite multiplicity. Furthermore, we have
Il" c for all nand IlI1 -+ + 00 as n ... ex:..
(c) The eigenvectors {u,,} form a complete orthonormal system in x.
PROOF. If AFu = pu with u #= 0, then (AFulu) cllull 2 implies p c.
We set B = A F I . By Theorem 19.C, the operator B: X ... X is compact and
Bu = 0 implies u = O. In the case where Il #:. 0, equation (41 b) is equivalent to
equation
).u = Bu + ABf
with ). = JJ- 1 . Theorem 19.B yields the assertion.
(42)
o
128
19 The Galerkin Method ror DilTerential and Integral Equations
Corollary 19.21 (The Fredholm Alternative). Suppose that (H) holds K'ith
dim X = ,and that the embedding X£ c X is compact. Let JJ #: o. Then:
(a) r is not an eigenvalue of AF' then, for each f EX, equation (41 b) has a
unique solution.
(b) If I' ;s an eigenvalue of A F , then.. for given f E X, equation (41 b) ha..-; a
solut ion iff
(flu) = 0
holds for all e;gent'ectors u of A F corresponding to JJ.
PROOF. This follows immediately from (42) and Proposition 19.16. In (b),
note that A,..u = u implies u = pBu and
(8flu) = (fl8u) = -I(flu).
Hence ().Bflu) = 0 iff (flu) = O.
(43)
o
Concrete examples will be considered in Sections 19.11 and 19.12.
Discussion
Roughly speaking, we have the following situation.
(i) If the original equation (37a) above corresponds to a boundary value
problem for a linear elliptic differential equation, then the variational problem
(38) corresponds to the classical variational problem related to the boundary
value problem (37a). The inverse operator Ail corresponds to an integral
operator whose kernel is an appropriate Green"s function.
(ii) The point is that the classical problem (37a) has only solutions for
sufficiently smooth right-hand sides f The energetic space XI: is constructed
in such a way that the variational problem (38) has a solution in X E for each
fe X.
(iii) In classical mathematical physics, one studied eigenvalue problems
for differential operators by reducing them to integral equations. This cor-
responds to the passage from
AFu = JJU + f
to
A -I A -I f,
u= F u+ F ,
which we have used in the proof of Corollaries 19.20 and 19.21.
(iv) The energetic space X E corresponds to Sobolev spaces, and the compact
embedding X£ c X corresponds to Sobolev embedding theorems which have
their roots in the compactness theorem of Rellich (1930).
(v) In Sections 19.15 and 19.16, we shall show that the Friedrichs extension
is also important for solving linear parabolic and hyperbolic equations.
19.10. Proof of Theorem 19.C
129
(vi) In terms of the theory of monotone operators, the energetic extension
A£: X£..... Xl is a strongly monotone operator. In Section 31.4 we shall show
that the Friedrichs extension A,: D(A,) s; X..... X from Theorem 19.C is a
maximal monotone ex tension of A in X. The theory of monotone operators can
be regarded as a generalization of the Friedrichs extension to linear operators
(without any symmetry properties) and to nonlinear operators.
(vii) The embeddings X£ X Xl are the prototype of evolution triples
which will playa fundamental role in the investigation of nonlinear evolution
equations.
(viii) Ifwe treat elliptic equations with the aid of the unbounded Friedrichs
extension AF' then we work in terms of the theory of unbounded operators.
This will be done in this chapter. In Chapter 22, we will use the bounded
energetic extension A£ in order to study elliptic differential equations. This
approach is based on the theory of bounded operators. However, note that in
quantum theory, the use of unbounded operators is indispensable.
19.10. Proof of Theorem 19.C
The basic idea of the proof is the following:
(i) The energetic space X£ is the completion of D(A) with respect to the
energetic scalar product (UIV)E = (Aulv). It is important that Xl can be
identified with a subset of X.
(ii) The energetic extension A £: X £ ..... Xl is the duality map of X £t i.e.,
(A£u, v) == (ulv)£ for all u, veX £.
(iii) The Friedrichs extension is constructed by D(A,) = A£I(X) and A,u =
A£u on D(A F ), i.e., A,I = A E I on X.
Having this in mind, the proof proceeds straightforwardly by using the
Riesz theorem (Theorem t 8.D).
Step I: The energetic space Xl' We define the energetic scalar product by
(ulv)£ = (Aulv) for all u, v e D(A), (44)
and we set lIull£ = (ulu)1'2. From (39) we obtain the first key formula:
II II II d II u II I: fo r all u e D (A), (45)
where d ::: 1/.JC, By definition, X£ is the completion of D(A) with respect to
('1' )£. In particular, D(A) is dense in X£ (cf. Problem 18.4).
Step 2: There exists a linear continuous injective map j: X£..... X. Thus if we
identify u with j(u), then X£ s;; X.
To begin with we construct j: D(A) X£ .-. X by j(u) = u. According to
130
19 The Galerkin Method ror Differential and Integral Equations
(45), and the extension principle in Section 18.12, there exists a uniquely
determined continuous extension j: X E -+ X. Passing to the limit in (44), we get
(u Il') f = (A u I j( r) ) for a II II E D ( A ). rEX f . ( 46 )
Therefore, j(l') = 0 implies I' = O. i.e.. j is injective,
Identifying j(l') with l' in (46), we obtain the second keY.fornJula
(1IIr)£ = (Aulr)
forall lIeD(A), l'eX E ,
(47)
Passing to the limit in (45), we get
",," < dllull[
for all II e Xl'
(48t
Step 3: We show that there exists a linear continuous injective map
II: X --+ Xl. Thus if we identify II with h(u), then X c: Xi,
For fixed .f e X, we set
f*(r) = (ltr)
. .
for all r e X£.
Then .f. E .t;. since
I.l.(l')j < " fll 111'11 < dllflllh'fl E for all v E X E ,
We set II( f) = .r.. Because lI.f." < d" f It, the map h: X -+ Xl is continuous.
Since ,,\, f. is dense in X, it follows from f* = 0 that .r = 0, i.e., II is injective.
In the sense of the identification f = f*. we have
< f l') , = ( r, L') for a II .r EX, l' E X f:' ( 49)
Step 4: ("onstruction of the energetic extension A f: X £ --+ X;,
Let A I. be the duality map of X 1:- Recall the following construction of A F.
from Section 18.11. Let II E X f' We set
b(l') = (UIV)E for all r e X E , (50)
According to the theorem of Riesz (Theorem 18.0) we obtain b e Xl and
II h \I = ff u II f' We defi ne
AE" = b,
Again by the theorem of Riesz, the operator A E : X E --+ Xl is bijective. From
IIAF.ull = HuH,; for all u E X f
it follows that A E is isometric and hence A,; is a linear homeomorphism.
From (48) and (50) it follows that
(Afu, 1') = (ulv)£ for a II u,veXF.'
(A£u,u) = lIull > cllutl 2 .
Let j' EX. By (49) and (51), the equation
Afu = f
(51 )
(52)
19.10. Proof of Theorem 19.C
131
is equivalent to
(ul v)£ = (II v)
for all v E X£.
(52* )
The operator A£: X£ -. X; is an extension of A: D(A) c X -. X. In fact, it
follows from Au = I that
(ult')£ = (Ault') = (II v) for all (' E X£
and hence AEu = f
Step 5: Construction of the Friedrichs extension A F : D(A F ) S; X .... X. We
set D(A F ) = A£I(X) and AFu = AEu on D(A F ). The last argument of Step 4
shows that A S; A F c AE'
Step 6: The inverse operator Ail: X .... X. We set B = Ail. The operator B
is the restriction of Ail: X; -+ X E to X. Since the embeddings X£ S; X c Xi
are continuous, the operators B: X -+ X E and B: X -+ X are continuous. If the
embedding X E X is compact, then B: X .... X is compact.
We will show that B: X .... X is symmetric. Let I.. 9 E X. We set u = BI and
I' = 8g. By (52*),
(BII Bg)E = (/18g).
Hence (/IBg) = (gI8/), i.e., 8 is symmetric. Since D(B) = X, the operator
B: X -+ X is self-adjoint.
As an inverse operator of the self-adjoint operator B, the operator A F is also
self-adjoint (cf. Problem 19.5d). By (52) and (52*),
(AFulu) = (ulu)E > cllull 2 for all Ii e D(A F ).
Step 7: For each b E X;, the variational problem
2- I (ulu)E - b(u) = min!.. u E Xl' (53)
has a unique solution given by u = Ailb. In fact, if we set a = Ai1b, then
b(t,) = (alv)E for all f E X E ..
by construction of Al. Hence (53) is equivalent to
2- I (u - alu - a)E = min!, u E XE'
with the unique solution u = Q.
In particular, if we choose I E X and b(u) = tflu) for all U E XE' then
II = A£lb = Ai l /.
Consequently, problems (38) and (37b) are equivalent.
Step 8: Let / E X be given and let u be a solution of (31c), i.e.,
(uIAv) = (fit') for fixed u e X E and all t- E D(A).
By (41), (ull)£ = (flv) and hence u = A"E1I = Ail/, i.e., u is a solution of(31b).
The proof of Theorem 19.C and Corollary 19.19 is complete.
132
19. The Galerkin Method for Differential and Integral Equations
19.11. Application to the Poisson Equation
We consider the classical boundary value problem
- u =.r on G.
u = 0 on l'G..
where G is a bounded region in R' with N > I. Generally, this problem has
no classical solution if.r and the boundary rG are not sufficiently smooth. In
order to apply the results of Section 19.9 to this problem, we set
(54)
x = L 2 fG),
D(A) = Co(G)
and
Au = -u.
Then (54) corresponds to the operator equation
All = I,
U E D(A),
.r E X.
(55)
Integration by parts yields
(Alii r) = r f D.uDj(' dx
J G i= I
for all u, l" e D(A).
Hence A is symmetric. The inequality of Poincare-Friedrichs from Section
18.9 yields the key estin,ate
(Aulu) > c"ull
for all u E D(A) and fixed c > o.
According to Section 19.10, the energetic scalar product is given by
(ull')! = (Alllr) for all u, [' E D(A).
The energetic space X E is the completion of Co-(G) with respect to ('1' )E' By
Problem J 8.4, the Sobolev space W 2 ' (G) is the completion of Co(G) with
respect to ( . , . )E' Hence
X E = J;21 (G).
By Theorem 19.C. the embeddings
X f c X c X;
are continuous. I n Section 19.13 we shall prove the following:
The embedding X E c X ;s compact.
By Theorem 19.C, the operator A has the energetic extension
At;: X£ ..... X;.
which is a linear homeomorphism, namely. A E is equal to the duality map of
X£o For each.r E X;, the equation
(56)
(57)
A£u = J,
., E XI:
has a unique solution. For given arbitrary .r EX, exactly all the solutions II of
19.11. Application to the Poisson Equation
133
the latter equation form the set D(A F ). The Friedrichs extension A F of the
operator A is the restriction of the energetic extension A E to the set D(A F ).
For given fixed f e X, the equation
AFu = f
is equivalent to each of the following two problems:
(uIAv) = (flv) for fixed u E X£ and all t' E D(A) (58)
and
!(ulu)£ - (flu) = min!,
Explicitly, problem (58) reads as follows:
J f DluD,vdx = J fvdx
G'-1 G
U E Xl.
(59)
(58.)
for fixed U E Wl(G) and all v E Co(G). Furthermore, the equivalent variational
problem (59) reads as follows:
t G I (D , u)2 - fu ) dx = min!,
U E W 2 1 (G).
(59.)
Note that, by Section 18.3, (59.) is the variational problem which corresponds
to the original problem (54), and (58.) is the generalized problem for (54).
Summarizing we obtain the following from Theorem 19.C and Corollary
19.19.
Proposition 19.21. Problems (58.) and (59*) are equivalent and, for each given
f E L 2 (G), they have a unique solution u E Wl(G) which can be regarded as a
generalized solution of the original classical boundary value problem (54).
Frequently, one writes A£ = - &. Then, by (56), the embeddings
l(G) S L 2 (G) l(G).
are continuous and
- L1: WI (G) .... W 2 1 (6)-
is a linear homeomorphism, namely, the duality map of l(G). In the litera-
ture, one also uses the notation
J}1(G) = 11(G),
The Friedrichs extension
HO(G) = L 2 (G),
H- 1 (G) = W 2 1 (G)-.
-: D(A F ) .... L 2 (G)
is bijective and has a linear symmetric compact inverse operator
B: L 2 (G) ..... L 2 (G),
134
19. The Galerkin Melhod ror Differential and Integral Equations
i.e., B = A,.. I. The equation
" = Bf
corresponds to the classical formula
II(:C) = r K (:c. y)f(y)dy. (60)
JG
\\'here K fx. .r) = K Cr.. y) for all x" J' E G.. and K is the classical Green
lill'('I;OIl to (54). i.e... (60) is the classical integra) representation for the
solutions u of the original classical problem (54).
In Section 22.21 we shall show the following: If iG Eel and f E: L 2 (G)"
then the solution II of the generalized problem (58*) belongs to the
Sobolev space "22(G). Moreover. we shall obtain an explicit characteri-
zation of the domain of definition D(A F ) of the Friedrichs extension 4F
of .4. namely.
D(A,.) = W 2 1 (G) (\ Wl(G).
19.12. Application to the Eigenvalue Problem
for the Laplace Equation
We now consider the classical boundary eigenvalue problem
-t1II=JlU+J' onG"
II = 0 on i'G.
(61 )
Here G is a bounded region in A'.' with N > I and Il is a real number. We will
use the same notation as in Section 19. J I.
Instead of (61). we consider the operator equation
A II = JI II + I II E D ( A )
together with the two equivalent generalized problems
A F" = IllI + .f.
" E D ( A ,.. )
and
( III A r) = Il ( II I r) + (f I.' )
for fixed " E X £ and all r E D(A}.
Explicitly. that means
r t DjIlDjt'd:c = Jl rill' dx + r ft. d:c (62)
JG ,-1 JG JG
for fixed II E H-'2 1 (G) and all., E (.(G). Note that we obtain (62) from (61) by
multiplication by r' and subsequent integration by parts.
Proposition 19.23 (Eigenvalue Problem). Let .r = O. Then equation (62) has (l
countahle syste", (r eigensolut;ons {"".. Jl,,} \\,hic'h contain,,; aI/the eigenr'ailles.
19. t 3. The Inequality of Poincare and the Compactness Theorem of Rellich
135
We have
Pre > 0 for all nand Pre -+ 00 as n -+ 00.
The eigenfunctions u. are in W 2 1 (G) and form a complete orthonormal system
in L 2 (G).
PROOf. This follows from (57) and Corollary 19.20.
o
Proposition 19.14 (The Fredholm Alternative). Let P :I: O. Then:
(a) If Il is not an eigenvalue of the homogeneous equation (62) with f = 0, then,
for each f e L 2 (G), equation (62) has a unique solution u e W 2 1 (G).
(b) If P is an eigenvalue of the homogeneous equation (62), then, for fixed
f e L 2 (G), equation (62) has a solution u e W 2 1 (G) iff
L fwdx = 0
for all eigenvectors w corresponding to p.
PROOf. This follows from (57) and Corollary 19.21.
o
19.13. The Inequality of Poincare and the
Compactness Theorem of Rellich
Proposition 19.1S (Rellich (1930». If G is a bounded region in R N , N I. then
the embedding Wl(G) s; L 2 (G) is compact.
This proposition is equivalent to the following statement: Each bounded
sequence (u,,) in Wl(G) has a subsequence (u".) which converges in L 2 (G) to a
certain function u, i.e., from
f [ u + f (D'U ll )2 J dx < constant
J G '-I
it follows that
for all n,
(63)
t (u II ' - U)2 dx -+ 0 as" -+ x>. (64)
Our proof of Proposition 19.25 will be based on the following special
inequality of Poincare:
f u 2 dx 2R 2 N f f (D,U)2dx + (2R)-N f udx 2. (65)
Je Je l .. Jc
Lemma 19.16. Let C be a closed cube in RN, N I, ,,'ith edge length 2R. then
(65) holds for all u e CI(C).
136
19. The Galerkin Method for Differential and Integral Equations
The simple proof, based on the Holder- Young inequality, will be given in
Problem 19.11.
PR()()F ()f PROPOSITI()N 19.25. To simplify the notation, let N = 2 and x =
('1 ,,). For general N, the proof proceeds analogously.
Since C(G) is dense in W 2 1 (G), it is sufficient to prove that the embedding
operator
j: C(G) s; wi (G) -+ L 2 (G)
is compact according to the extension principle in Proposition 18.29. let
B = {u E C (G): 1111111. 2 I}. Recall
lIulll.2 = (L (u 2 + ul + u)dx Y/2,
Since j is linear, it suffices to show that j(B) is relatively compact in L 2 (G). By
AI (21), we have to prove that for each £ > 0, the set j(B) has a finite E:-net, i.e.,
there are elements u. ' . . . , U, E B with
m}n II j(u) - j(u j )1I 2 == mjin L (u - U;)2 dx < I:
(66)
for allu E B.
(I) Boundary strip. For each > 0, there exists a proper subregion H cc G
such that
i 112 dx < {)
G-II
for all U E B
(61)
(Fig. 19.5). This will be proved in (III) below.
(II) Inequality of Poincare on H. We choose closed cubes C., ..., C J of edge
length 2R, which cover H with C. c: G for all k. Let D = Ui Ci' By the
special inequality of Poincare (65) with N = 2.
fH u 2 dx < fo u 2 dx
< 4R 2 f (ul + u)dx + (2Rr 2 L r udx 2 (68)
D i J
G
aG
(3)
(b)
Figure 19.5
19.13. The Inequality of Poincare and the Compactness Theorem of Rellich
131
for all 14 e B. We set
F ( 14) = (y 1 , · · . , Y I)'
Y. = r 14 dx.
JC te
By (68), for all u e 8,
In u 2 dx S 4R 2 11ullt2 + (2R)-2IF(uW.
where IF(u)1 2 = r.IY.1 2 . The Holder inequality yields
IY.1 2 S canst r 14 2 dx.
JC te
(69)
Thus, the set F(8) is bounded in R' and hence relatively compact.
Consequently, for each " > 0, the set F(B) has a finite II-net, i.e., there
exist elements u 1 , . . . , u, e B with
min I F(u) - F(u,)1 < "
i
for all U E B.
From (67) and (69) we obtain the key formula
r (14 - u,)2dx = r (14 - u,)2dx + r (14 - u,)2dx
JG JG-H JH
4 + 4R 2 1114 - u"lf.z + (2R)-2IF(u) - F(u,)1 2
for all 14 e 8. In this connection, note that (14 - 14,)2 S 214 2 + 2uf. Now if
we choose , R, and " sufficiently small, then we obtain the desired
estimate (66).
(III) We prove (I) under the assumption cG e CO. 1. To this end we choose a
local (,')-coordinate system as in Figure 19.5(b). More precisely, we
assume that the boundary oG has a local representation of the form
, = g(p), P E J,
where g: J .... R is a co. I-function and J = ] - 2, (X[ with ex > O. Let
0_:11 {(p,C): peJ,g(p) - fJ < J < g(p)}.
For small fJ > 0, the boundary strip Q_ belongs to G according to the
definition of "oG e Co. I" in Section 6.2. Let U E B. For all points (p, C)
and (p, z) in n "
u(P. {) = f u,(P. t) dt + u(P. z).
From (a + b)2 S 2a 2 + 2b 2 and the Holder inequality with I-trick it
Collows that
i ff,,)
u(, C)' s 2fJ u(, t)2 dt + 2u(p. Z)2.
1(11)-_
138
19. The Galerkin Method for D1fferential and Integral Equations
Integration over: yields
PII( 11.0 2 < r q ,", (2p2 U ;(/l. t)2 + 2u(/l. t)2) dt.
J g'"»-II
and integration over Or yields
{I r u 2 Jx f. r (2P 2 uf + 2u 2 )dx
J n, J n,
< cconst lIulii.2.
Since a finite number of local boundary strips covers a boundary strip
of G. we obtain (67) in the case where f. is sufficiently small.
(IV) Th assertion (I) remains true if the assumption cG E co. I drops out. In
this connection, note that the functions u in B vanish outside a boundary
strip" and hence we can pass to a smaller region with a co. I-boundary
in the case where G does not have a co. I-boundary. 0
Let 'II}.I fG) denote the closure of C. (G) in Jil2. (G). Then the same proof as
abo\'e shows that the embedding
11 I (G) C L 2 (G)
is copact in the case where i'G E co. I. We have only to replace C (G) by
Ct (G). In Section 21.4 we shall prove that
11 2 1 (G) = Ji/ 2 ' (G)
provided tG E Co. I. However. this fact will not be used before Chapter 21.
19.14. Functions of Self-Adjoint Operators
Our goal is to construct a simple functional calculus for an important class
of self-adjoint operators. Applications to the heat equation and the wave
equation "'ill be considered in the following two sections. Our basic tool will
he the (,()lJl'er/elJct' criterion for abstract Fourier series (Proposition 19.12). We
make the following assumption:
(H) The linear operator A: D(A) c X -+ X is self-adjoint on the separable
H-space X over K = R, C, and A possesses a complete orthonormal
system of eigenvectors u,,} in X, i.e., Au" = l"u" for all n.
Proposition 19.27. Assu",e (H). 1'hen
A II = L i." ( u" III ) u"
"
jt}r all U E D(A).
(70)
19. t 4. Functions of Self-Adjoint Operators
139
Furtltermore, the follo'ing three conditions are mutually equil.alent:
(i) U e D(A).
Oi) The right-hand side of (70) converges.
(iii) L" Il..1 2 1(u..lu)(2 < x.
PROOF. For all t' E X.
v = L (u,,1 v)u...
..
(71 )
since {u,,} is complete. If u e D(A). then
(u..IAu) = (Au..lu) = ;...(u..lu).
Hence (71) implies (70).
By Proposition 19.12, it follows that (H) and (Hi) are equivalent.
We now construct a linear operator B: D(B) c: X .... X by
Bu = L ;.,,(u..1 u)u..
"
for all u E X with (iii). From
(Bult") = L ;...(ulu")(u,,ll') = (uIBt.)
..
it follows that B is symmetric. By our construction. B is an extension of A.
Since A is self-adjoint. there does not exist a proper symmetric extension (cf.
Problem 19.5c). Hence A = B. 0
Let the function f: R -+ IK be given. We define the operator
I(A): D(f(A» c: X -+ X
by the natural formula
f(A)u = L f(;...) (11,,1 u)u...
..
(72)
Moreover, by definition, u belongs to D(f(A» iff the right-hand side in (72)
converges. By Proposition 19.12, u E D(f(A» iff
L If(A,,)1 2 I(u,,1 u)1 2 < oc. (73)
..
In particular, we have that f(A)u.. = f(i...)u" for all n.
Proposition 19.28. Assume (H). Then the operator f(A) is self-adjoint for each
real .runcI ion f: R -+ IR.
PROOF. Let B = f(A) and let B* denote the adjoint operator. As in the proof
of Proposition 19.27, we obtain that B is symmetric, i.e., B s; B*. In order to
show that B* c B, let u E B*. Since
(u..1 B*u) = (Bu,,1 u) = f(;.,,)(u,,1 u).
140
19. The Galerkin Method for Differential and Integral Equations
we obtain
B*1l = L (u,,1 B*u)u" = L f(A")(u"lu)u,,,
" "
i.e.. II E D( B).
o
With a view to applications in the next two sections, we now consider
functions
.((A, I) = L f(l", 1)(u"lu)u"
"
depending on a parameter t, which will play the role of time. We set
.(t) = f(A, I)U
for fixed II. In order to obtain the continuity and differentiability of .(.), we
need the following two majorant conditions:
L Ic"(u,,tu)1 2 < x,
"
e" = sup If(;.", t)l,
, e J
(74)
Lid" (u" , u)l2 < ,x.
"
tl" = sup Ih(;.". 1)1.
,. J
(75)
Proposition 19.29. Suppose thaI (H) holds and let the functio'J f: A x J --t IR be
girten. "there J is a real inlerl'al. Then:
(a) (t J'(i.. I) is cOllI;nuous on J for each e R, then t t-+ x(t) ;s continuous
Oil J prorided (74) holds.
(b) (t........ .((i., I) ;. co"t;Iluou./J' differelltiable Oil J for each i. e R, then I........ X(I)
;S cOlllinuolisly differentiable on J provided (74) and (75) hold. If ",.e set
II(i.. t) = f,(;'. I). thell
."('(1) = h(A.I)u for all t e J.
PRCX)F. Ad(a). We set .(,,(1) = 1(;.", t) and a" = (u"lu). Then
Ii .(.4. t)u - ((.4. s)1l11 2 = L Uh(l) - !,,(s»a,,1 2
"
L I{f,,(I) - f,,(s)a,,1 2 + 4 L Ic"a,,1 2
"SN ,,>N
< f. + €,
in the case where N is sufficiently large and It - sl is sufficiently small.
Ad(b). Use the mean value theorem
III(t) - f,,(s) _ I:(s) = 1:(." + .9(, - s)) - I:(s),
t - s
o < :) < 1,
and an analogous argument as in (a).
o
This functional calculus is the prototype of the general functional calculus
for self-adjoint operators which we summarize in the Appendix, A 2 (96).
19.1 S. Application to the Heat Equation
141
19.15. Application to the Heat Equation
We consider the initial value problem
u'(t) + A,u(t) = 0,
u(O) = w.
We make the following assumptions:
(HI) The operator A: D(A) X -t X is symmetric. and strongly monotone
on the real, separable H-space X.
(H2) Let A, denote the Friedrichs extension of A with the energetic space Xl'
and suppose that the embedding X £ s; X is compact.
o S t < 00,
(76)
Proposition 19.30. Assume (HI), (H2). Then for each we D(A,), equation (76)
has a unique C 1 -solution 14: R+ -t X. This solution is given by
u(t) = e-.A,w for all t O. (71)
Corollary 19.31. If we X, then the function 14 in (17) is continuous on R+. We
call this function 14(.) a generalized (mild) solution of (76). The family {e-,A,}
forms a semigroup, with generator - AF' i.e., the operator e-. A ,: X -t X is linear
and continuous for all t 0,
e-('+S)A, = e-,A'e- sA ,
for all t, S 0,
(18)
and the limit
. ( e- ItA , - I )
hm h w
,, +0
exists exactly for all K' e D(A,) and is equal to - A,w.
This corollary is a special case of a much more general result which will
be considered in Section 19.18.
EXAMPLE 19.32. Let G be a bounded region in AN, N I. We consider the
initial-boundary value problem
14, - u = 0
on G x R+.
u(x, t) = 0 on iJG x R+, (19)
14 (x, 0) = w(x) on G.
We set X = L 2 (G) and Au = -u with D(A) = cg;(G). Then the conditions
(H I), (H2) are fulfilled, by Section 19.11.
I By Definition 19.13. this implies D(A) X. i.e., D(A) is dense in X.
142
19. The Galerkin Method ror Differential and Integral Equations
We regard both the solutions of (76) and the generalized (mild) solutions of
(76) as generalized solutions of the classical problem (79). By Corollary 19.31.
for each initial value function K' e L 2 (G), we obtain a uniquely determined
mild solution.
PROOF ()J-" PROPOSITI()N 19.30 and COROllAR Y J 9.3 J .
(I) Uniqueness. Let 11: IR. -. X be a CJ-solution of (76) with u(O) = o. Then
it follows from
:, (u(l)1 u(l)) = 2(1"(')111(1)) = - 2( A FU(I)III(t)) < 0
that 11ft) = o.
(II) Existence. We set B = A F . By Corollary 19.20. the operator B has a
complete orthonormal system (u") of eigenvectors with
Bu" = ;,," u"
and
i." > c > 0
for all n.
We no\\' use the simple functional calculus from Section 19.14. Then
B,,' = L ;,,"(u"' ,,')u",
"
and \\' E D(B) iffL" li."(u,,I"')1 2 < 'XJ. By definition,
e- rB ", == L e- rAn(u"tw)u",
"
and ". E D(e- rB ) iff L" le-rA"(u,,( "')1 2 < OC °
For all '" E X and t > 0,
IIe-'B"'1I 2 = L le-';O"(u"lw)1 2
"
S L I ( u" I ",) 1 2 = II"'" 2 .
"
Hence D(e- IB ) = X and lIe- I8 II S 1. From
- ( r +' ) A -rA -.sA
e =e e
we obtain the semigroup property (78).
We now set
U(t) = e- rB ",.
For n' E X and all t > O. we have the majorant condition
L le-'J."(u"t"')1 2 < L l(u"lw)1 2 < X-J.
" "
By Proposition 19.29. the function t t-+ u(t) is continuous on IR+, For \\' E D(8)
and all t > 0, we have the majorant condition
L ri."e-'A"(u"I"')1 2 < L IA"(u,,Iw)1 2 < x.
" "
19.16. Application to the Wave Equation
143
By Proposition 19.29, there exists the derivative
u'(t) = - r l"e-'.t"(u,,1 w)u"
"
for all t 0, and u' is continuous on R+. Because of e-"u" :III e-'.t"u" and the
self-adjointness of the operators Band e-'., we obtain
u'(t) = - L )""(u"le-'.w)u,, = - Be-"J w = - Bu(t)
"
for all t 0, i.e., u( · ) is a solution of (76).
Let us define the operator C: D(C) X -+ X by
( -Ita I )
CW = lim e h - w,
,.. +0
where w belongs to D(C) ifT this limit exists. We want to show that C - - B.
I n fact, differentiation of the relation
(e-,aul v ) = (ule-,a v ) for all u, v E D(C)
with respect to t at t = ° yields (Cui v) == (uICv) for all u, ve D(C), i.e., C is
symmetric. Since .,'(0) - - Bw. the operator C is a symmetric extension of the
self-adjoint operator - B, and hence C - - B (cf. Problem 19.5c). 0
19 .16. Application to the Wave Equation
We consider the initial value problem
U"(I) + AFu(l) = 0, -00 < 1 < 00,
u(O) = v, u'(O) = w,
and make the following assumptions:
(HI) The operator A: D(A) c X -+ X is symmetric 1 and strongly monotone
on the real, separable H-space X.
(H2) Let A F denote the Friedrichs extension of A with the energetic space X£,
and suppose that the embedding Xl c X is compact.
In the trivial special case X = Rand A F > 0, equation (80) describes the
motion of a harmonic oscillator with the classical solution
(80)
u(t) = (cos tB)v + B- 1 (sin tB)w, (81)
where 8 = A/2. The energy of the harmonic oscillator is given by
E(t) = i(1 u'(t)1 2 + 1 8u(t)1 2 ).
· By Definition 19.13. this implies D(A) S X. i.e.. D(A) is dense in X.
144
19. The Galerkin Method ror DifTerential and Integral Equations
By definition, a classical solution of (80) is a C 2 .function u: R --. X such that
(80) holds and tBu(t) is C 1 from R into X.
Below we shall prove that D(A F ) c D(B). This implies that (AFulu) =
(B 2 Illu) = (Bul Bu) for all u E D(A F ), by Section 19.14. According to our defini-
tion above. a classical solution of (80) satisfies u(t) e D(A F ) for all t, and hence
u(t) e D( B) for all t.
Generalizing the classical expression for the energy of the harmonic oscilla-
tor above. we define the energy of a classical solution of (80) at time t by
E ( t) = ! ( II u ' ( t ) II i + II 8 u ( r ) II ).
Since IIBu(t)lli = (Afll(t)lU(t)), we obtain
E(t) = ![lIu'(t)il + (Afu(t)lu(t»].
(81* )
Proposition 19.33. ASSllme (H I). (H2). Then for each v E D(A F ) and W E D(A¥2),
equation (80) has exactly one classical solution. This solution is git'en by (81)
\\'ith E(t) = COlJstant.
EXAMPLE 19.34. We consider the initial-boundary value problem for the
classical wa ve equation
u" - &u = 0 on G x A,
u(x, t) = 0 on aG x R,
(82)
u(x.O) = (,'(x) on G,
u,(x, O) = "'(x) on G.
where G is a bounded region in R N , N > I. We set X = L 2 (G) and Au = - u
with D(A) = C(G). Then, the conditions (H I), (H2) are fulfilled, by Section
19.11.
We regard the solutions of (80) as generalized solutions of the classical
problem (82). By (81.), for sufficiently smooth functions u, we obtain the
energy
E(t) = L (u;- - (Au)u)dx,
where II = 0 on c'G. Integration by parts yields the well-known classical
formula
E(r) = L (u;- + I (D j U)2 )dX.
The latter expression makes sense for a broad class of functions u.
PROOF ()F PROPOSITION t 9.33.
(I) Uniqueness. Let u be a classical solution of (80) with v = w = O. As in
19.17. Semigroups and Propagators, and Their Physical Relevance 145
the proof of Proposition 19.30, we obtain
Ai-w = L A:(u"lw)u" for each (I > 0,
"
where we D(A}) iff L,,),,.I(u,,'W),2 < 00. Hence D(A,) D(A2), since
;." -+ +cx; as n ..... 00 in the case where dim X = 00. The operator B = A2
is self-adjoint and hence graph closed; therefore
(8u(r»' = Bu'(r)
(cf. Problem 19.4d). Differentiation of the energy function yields
£'(t) = (u"(I)lu'(t» + (Bu'(t)IBu(t»
= -(A,u(r)lu'(t» + (u'(t)IA,u(r» = 0,
and E(O) = O. This implies E(t) == 0, and hence U(I) == o.
(II) Existence. Using the same method as in the proof of Proposition 19.30,
one easily verifies that (81) is a classical solution of (80). The conditions
v e D(A,) and w e D(A}'2) guarantee the convergence of simple majorants
obtained in a natural way. 0
In Section 19.23 we shall construct a one-parameter unitary group {5(t)}
which yields generalized (mild) solutions of equation (80) of the form
(u(t), U'(I» = S(r)(v, w),
where v e Xl and w e X are given. In Example 19.34 above, this means
l' e Wl(G) and we L 2 (G).
19.17. Semigroups and Propagators, and
Their Physical Relevance
The notion of a semigroup is the most important notion for describing
time-dependent processes in nature in terms of functional analysis. The key
relation. are
S(t + s) = S(t)5(s)
S(O) = I,
forall t,seR.,
(83)
(84)
and (85) below.
Definition 19.35. A semigroup {S(t)} on a B-space X consists of a family
of operators S(t): X ..... X for all r e R+ with (83) and (84). The generator
B: D(B) X ..... X of the semigroup {S(t)} is defined by
B . . S(t)w - w
". = 1m. (85)
,"'+0 r
where w belongs to D(B) iff the limit in (85) exists.
146
19. The Galerkin Method for Differential and Integral Equations
A one-paranJelt'r group {S(I)} on the B-space X consists of a family of
operators S(I): X X for all 1 E R with (83) for all I. S E Rand (84).
Sometimes generators are also called infinitesimal generators.
STANI)ARI) EXAMPLI: 19.36. We consider the differential equation
11'(1) = Bu(t) on R.,
11(0) = "'.
Let B: X -+ X be a linear continuous operator on the B-space X over IK. We
set S(I) = e,R. where
(86)
XI" 8"
e' B = L - ,- .
,,o n.
(87)
This series converges on the 8-space L(X. X) for all 1 E R. i.e.. this series
converges with respect to the operator norm. Then {S(I)} is a one-parameter
group and the unique solution of (86) is given by
11ft) = 5(1)"..
The operator B is the generator of : S(I) l. since 11'(0) = 811(0) = Bu'.
(88)
In Section 19.15 we have already encountered a semigroup in connection
\vith the heat equation. In this case, the generator of the semigroup was an
unbounded operator.
19.17a. Classification of Semigroups
Let .</ = : S( I) be a semigroup on the 8-spacc X.
(i) .(/' is called strongly conl;nll01l5 iff 1 S(t),,' is continuous on IR. for all
". EX. i.e..
lim S(t)". = S(s)\\'
for all 5 E IR -+ .
, . \
At ... = 0 \\'e have to take the limit from the right.
Sometimes strongly continuous semigroups are also briefly called continuous.
(ii) fl' is called ulJilor',l/y c0111;n1l01l5 iff all the operators S(t): X X are
linear and continuous. and t S(I) is continuous on R. with respect to the
operator norm. i.e..
lim IIS(I) - S(s)1I = 0
for all 5 E IR + .
, -,
At ,.. = 0 \\'e have to take the limit from the right.
(iii) .</' is called lIo'leXpanS;('e iff all the operators S(t): X -+..\' are non-
expansive and
lim S(I)'" = '"
,- .0
19.17. Semigroups and Propagators. and Their Physical Relevance
147
/
/
;'"
".,
",
// )0
o 0::: ......
"-
'"
..........
'"
........
........
"
1:
..
Figure 19.6
holds for all ". e X. In Problem 19.9 we will show that each nonexpansive
semigroup is strongly continuous.
(iv) ..e'/' is called a linear semigroup iff all the operators 5(1): X ..... X are linear
and continuous.
If one replaces R+ with R. then one obtains the analogous notions for
one-parameter groups. By a one-parameter unitary group we understand a
strongly continuous one-parameter group {5(1)} of linear unitary operators
5(1): X -+ X on the H-space X.
(v) Let X be a complex B-space and let 1: be an open sector with the sector
angle as in Figure 19.6, i.e.,
1: = {z e C: - < arg z < ,z #: O}.
By an analytic semigroup {5(t) }. we understand an open sector 1: and a family
of linear continuous operators 5(1): X ..... X for all te1: with 5(0) = I and the
following three properties:
(a) ,...... 5(1) is an analytic map from 1: into L(X, X);
(b) S(t + s) = 5(t)5(5) for all t, 5 e 1:;
(c) lim 5(1)\\' = ". as I -+ 0 in 1: for all w eX.
Therefore, an analytic semigroup is the analytic continuation of a strongly
continuous semigroup on R+ to an open sector 1:. Note that, by definition,
analytic semigroups are linear semigroups.
By a bounded analytic semigroup {5(t)} we understand an analytic semi-
group as defined above which has the additional property that, for each
{I e ]0,2[,
sup 115(1)11 < oc,
's I,
where 1:, = {z e C: -p < argz < p,z #: O}.
EXAMPLE 19.37. (Fundamental Situations).
(i) The one-parameter group {e'8} in Standard Example 19.36 is uniformly
continuous and analytic on C.
(ii) If B: D(B) s; X -+ X is a linear self-adjoint operator on the H-space X
with (Bulu) SOon D(B), then B is the generator of a linear nonexpansive
148
19. The Galerkin Method for Differential and Integral Equations
semigroup. This is a special case of Theorem 19.E in Section 19.20. In terms
of the general functional calculus for self-adjoint operators this semigroup is
given by {e,8:.
Such semigroups describe, in particular, heat conduction and diffusion
processes.
(iii) If H: D(H) e X.... X is a linear self-adjoint operator on the complex
H-space .,\'. then - iH generates a one-parameter unitary group. This will be
proved in Section 19.21. I n terms of the functional calculus. this group is given
by e-i'"}. Such groups describe the dynamics of quantum systems as will be
shown in Part V. The operator H corresponds to the energy of the quantum
system and is called the Hamiltonian of the system.
(iv) If C: D(C") c X -+ X is a skew-adjoint operator on the real H-space X.
then C generates a one-parameter unitary group. This will be shown in Section
19.21. Such groups describe. for example. the dynamics of wave processes.
19.17b. Inhomogeneous Linear Evolution Equations and
Semilinear Evolution Equations
We consider the inhomogeneous equation
u'(t) = 811(t) + (I) on R.,
u(O) = "'.
(89)
In the classical case. u is a real function and B is a real number. Then
lagrange's method of the variation of the constant yields the solution
1I(t) = S(t)", + t SIt - s)/(sl ds (901
with Sf') = e,B. In the general case of an unbounded operator B, the situation
is more complex. Formal differentiation of (90) yields (89). Unfortunately, this
formal procedure can only be justified if f is sufficiently regular. For example.
if.f is merely continuous, then u(. ) in (90) need not be a solution of (89). This
is a Iypical difficulty of the theory of semigroups. However, it is quite natural
to call the solution u(. ) of (90) a mild (or generalized) solution of(89). This way
the senJi/inear problem
.,'(t) = 8u(l) + 1ft, u(t)) on IR..
(91 )
u(O) = ,'..
can be reduced to the Volterra integral equation
u(n = S(t)", + t SIt - slf(s. u(s))ds. (921
The solutions of the .(ixed-point problem (92) are called mild solutions of (91).
If f is sufficiently regular, then the mild solutions are also classical solutions
of (91 ).
19.17. Semigroups and Propagators. and Their Physical Relevance
149
19.17c. Nonautonomous Equations and Propagators
We consider the initial value problem
u'(t) = B(t)u(t) on [s, 00 [,
u(s) = w.
Here the point is that the operator B depends on time t. Such equations
correspond to processes in nature which are not homogeneous in time (e.g.,
time-dependent outer forces or the fundamental scattering processes of ele-
mentary particles in accelerators). Suppose that for each fixed time s, equation
(93) has a unique solution u(.). We define the so-called propagator {P(t, s)} by
P(t, s)w = u(t) for all t > s. (94)
(93)
The assumed uniqueness of the solution implies
P(t, r)(P(r, s)w) = P(t, s)w.
If B is independent of time I, and if B generates the semigroup {S(t)}, then
u(t) = S(t - s)w, i.e.,
P(t, s) = 5(t - s).
This shows that propagators represent generalizations of semi groups. Formal
differentiation shows that the so-called series of Dyson
pet,s) = 1 + f f ' f '. ... f '''-' B(t.}...B(t")dt,,...dt.
fiSt I S S J
yields a formal solution of(93) by means of (94). As we shall see in Part V, this
is the key formula of quantum field theory.
Furthermore, formal differentiation of
u( t) = pet, s)w + J: pet, r)f(r, u(r» dr
with respect to I yields
(95)
u'(t) = B(t)u(t) + I(t, u(t)), u(s) = w. (96)
The solutions of the fixed-point problem (95) are called mild solutions of (96).
19.17d. Quasi-Linear Equations
We consider the quasi-linear equation
u' = C(u, t)u + /(t, u), u(s) = w (97)
for the unknown function u = u(t). Such equations correspond to physical
processes with interactions. Comparing (97) with (96), equation (97) can be
150
19 The Galerkin Melhod for Differential and Inlegral Equallons
reduced to a jlxed-point problem for u of the form (95) by letting
8(t) = C(u(t), t).
Note that B depends on u, and hence P in (95) also depends on u.
19.17e. Abstract Semigroups and Abstract Propagators,
and Their Physical Interpretation
Definition J 9.38. Let X be an arbitrary set. By a propagator {P(t, s)} on X.. we
understand a family of operators P(t.. .): X .... X for all real t > with the
following two properties:
P(t.. t) = I
P(t.. s) = P(t.. r)P(r, s)
for all t..
for all t > r > s.
(98)
(99)
By a se"rigrollp S(t) on X (resp. one-parameter group), we understand a
family of operators S(t): X .... X for all real t > 0 (resp. all real t) with 5(0) = I
and
S(t + s) = S(I)S(S)
for all I.. S > 0
( 1(0)
(resp. for all realt. s).
Let us discuss the following four fundamental principles for the mathe-
matical description of time-dependent processes in nature:
(I) Propagators describe ('allsal processe.'t.
(ii) One-parclnreter groups describe causal processes \\'hich are homogeneous ;11
1 inJe anel relersible.
(iii) Proper senJigroups describe causal processes "'hich are homoge"eous in ti,ne
c",eI irrel'ersible (e.g., the gro",th in biology, heat conduction, or diffus;o1J).
(iv) Tire cle.cr;plio" o.f t;,ne-dependent processes by propagators or semigroups
;. ,nore lJatural than tlte description b)' differential equations.
In order to have an intuitive picture at hand, let X be a manifold (e.g. the
surface of the earth). Let {S(t)} be a semigroup on X (or a one-parameter
group). Then
11ft) = S(t)\\.
can be regarded as the motion of a nuid particle on X. Here, u(t) is the
position of the particle at time t and u(O) = \\'. Therefore. semigroups and
one-parameter groups are also called semij1oM's and flows, respectively. More-
over, semigroups are also called abstract dynamical systems.
Propagators and Causality
We regard the elements of the set X as the possible states of a physical system.
Suppose that the system is in the state \\' at time s. Then
11(1) = P(t. s)\\'
(101)
19.17. Semigroups and Propaptors. and Their Physical Relevance
151
describes the time-evolution of the system, i.e., u(t) is the state of the system
at time t. The causality of the process is reflected by the propagator property
(99). In fact, relation (99) tells us that
u(t) = P(t, r)u(r)
for all r with s S r S t, i.e., the tinal state u(t) is uniquely determined by each
of the intermediate states u(r).
If {S(t)} is a semigroup, then we obtain a propagator by letting
P(t, s) = S(t - s) for all t S. (102)
In fact, the propagator property (99) follows from the semigroup property
(tOO). Conversely, if the process is homogeneous in time, i.e.,
P(t + a,s + a) = P(t,s)
for all real a, then we obtain a semigroup {S(t)} by letting
S(t) = P(t, 0).
I n this case relation (102.) implies (102).
(102.)
Semigroups and Irreversibility
Let {S(t)} be a semigroup and let
u(t) = S(t)w
(103)
be the time-evolution of the initial state w at time t = O. Then, in general, u(t)
is only detined for t 0 and the inverse operator S(t)-1 does not exist, i.e., the
past state w is not uniquely determined by the future state u(t). The situation
changes if {S(t)} is a one-parameter group. In this case, it follows from the
group property (100) that
S(t)S( - t) = S( - t)S(t) = /,
i.e., S(t): X -+ X is surjective and injective and hence bijective with the inverse
operator S(t)-1 = S( - t). This means
u( - t) = S( - t)w = S(t)-I w.
In particular, we obtain that if the process t..... U(I) is possible, then the inverse
process t..... u( - t), which corresponds to a time reflection, is also possible.
Such processes are called reversible. Many processes do not have this prop-
erty, i.e., they are irreversible. For example, the growth of a human being is
irreversible. In Part IV we shall show that the second law of thermodynamics
is responsible for the fact that most processes in nature are irreversible.
Standard Example 19.36 shows that linear bounded operators B: X -+ X
on a B-space X generate one-parameter groups. Therefore, irreversible
processes in nature cannot be described by differential equations u' = Bu
where B is bounded. This underlines the imporlance of unbounded operators.
152
19 The Galerkin Method for Differential and Integral Equations
One-Parameter Groups and the Representation
Theory of Groups
Let X be a set and let B(X, X) denote the set of all bijective mappings
F: X -+ X. Then B(X, X) forms a group. By a representation of a given group
G in B(X, X), we understand a mapping H: G ..... B(X, X) with
H(yh) = H(g)H(h) for all g, h E G,
i.e., a representation is a homomorphism from G into B(X, X).
In Part V we shall show that the representation theory of groups plays an
important role in modern physics. From the abstract mathematical point of
view, modern physics is based on the following three fundamental principles:
(a) Elementary particles correspond to irreducible representations of groups.
(b) Conservation laws correspond to global symmetries (group invariance) of
processes.
(c) The four fundamental interactions (i.e., gravitative, weak, electromagnetic,
and strong interaction) correspond to the curvature of manifolds (fiber
bundles with structural groups), and the interaction is related to local
symmetries (gauge transformations).
For example, if processes are invariant with respect to time translation (resp.
spatial translation or rotation), then there exists a mathematical conservation
quantity which corresponds to energy (resp. momentum or angular momen-
tum). In this connection, invariance with respect to time translation means
that if the process t..-. 14(t) is possible, then all the processes t.-. U(I + a) are
also possible for arbitrary a e R. This result explains mathematically why
conservation of energy, momentum, and angular momentum exist in all fields
of physics (e.g., mechanics, electromagnetism, quantum theory, etc.).
Semigroups and Differential Equations
Suppose that a physical process is described by a classical partial differential
equation of the structure
11'(1) = Bou(t), u(O) = \\', (104)
(e.g., the heat equation). In order to treat (104) by means offunctional analysis,
we have to extend Bo to an operator B. Then, instead of (104), we consider the
operator equation
U'(I) = Bu(t),
u(O) = }\',
(104. )
(e.g., B is the Friedrichs extension of Bo). From the point of view of the theory
of semigroups, this extension is necessary since Bo is generally not the genera-
tor of a semigroup. Let B: D(B) S;; X -+ X be the generator of the semigroup
{S(t)}. In Section 19.18 we shall show that the solution of (104.) is given by
u(t) = S(t)w (104..)
19.18. Main Theorem on Abstract linear Parabolic Equations
IS3
for all initial values ". e D(B). However, the operator S(t) is defined on the
entire space X. Thus, the function u = u(t) in (104..) is defined for each initial
value w e X. We call this function u in (104**) a generalized (mild) solution
of the original problem (104).
Note that the derivation of the classical partial differential equations of
mathematical physics is based on the (unrealistic) assumption that the situa-
tions are sufficiently smooth. Therefore, it is quite natural to regard the semi-
grollps as primary and the differential equations (104) and (104.) as secondary.
The complexity of the general theory of nonlinear semigroups reflects the
complexity of time-dependent processes in nature. For example. one observes:
(i) explosions (e.g., supernovae and quasars),
(ii) shock waves. and
(iii) turbulence.
Mathematically. this corresponds to blowing-up effects. instabilities and dis-
continuities. and strange attractors.
19.18. Main Theorem on Abstract Linear
Parabolic Equations
We consider the initial value problem
u'(t) = Bu(t) + I(t) on ]t o , T[,
(105)
U(I O ) = "',
together with the formula
u(r) = S(t - to)w + l ' S(t - s)f(s)ds. (106)
'0
We make the following assumptions:
(H t) Let {S(t)} be a strongly continuous linear semigroup on the B-space X
over K = R, C with the generator B, i.e., {S(t)} is a semi group of linear
continuous operators S(t): X ... X for all t 0, and t S(t)w is continu-
ous on R+ for all WE X.
(H2) The function f: [10. T[ -.. X is continuous.
Definition 19.39. Let the function u: [tOt T[ ..... X be given.
(i) u is called a classical solution of (I OS) iff u is C I on ]t 0' T[ and is continuous
on [to. T[. and (t OS) holds.
(ii) u in (106) is called a mild solution of the original problem (I OS).
Theorem 19.D. Assume (H I), (H2). Then:
(a) There exists ar most one classical solution of (IOS), and each classical
solution is also a mild solution.
154
19 The Galerkin Method for Differential and Integrat Equations
(b) III is C 1 a"d n' E D( B). 1 hen I he mild solul ion ( 106) ;s also a classical solut ion
o.f ( 105).
/" part;c,dar. .for each \\' E D(B). the homogeneous probleln (105) "'ith
r = 0 lias Cl IllJiqlle classical solution given hy u(t) = S(t - to)"'.
(c) 1,- the operator B: X X is linear and continuous. then. for each ", E X alld
eacll C0l1t;1"'OIIS .f. the mild solution (106) ;s also a classical solution of (105).
This theorem shows clearly the importance of semigroups for the solution
of the initial value problem (105).
Corollary 19.40. A ssun1e (H I). Then:
(i) 1.',ere exist constal11S C > I and a > 0 such that
"S(t)1I Cell' for all t > O. ( 107)
(ii) Tile gt'nerator B: D(B) c X X of the semigroup {S(I)} ;s a linear graph
closed operator and D( B) ;s dense ill X.
(iii) Tile selnigroup is ,,,,iquely determined by its generator.
The proofs will be given in the next section.
In order to be able to apply Theorem 19.D to many concrete problems. we
need to know properties of B which ensure assumption (H I).
ST ADARI) EXAMPLF. 19.41. Let B: D(B) c X -+ X be a linear densely defined
operator on the H-space X over G< = R. C. Then assumption (H 1) above is
satisfied if one of the following two conditions is valid:
(i) B is nlaxitJlClI dis'iipalit"e. i.e..
Re(Bulll) S 0
for all u. t' E D(B)
and R(I - I1B) = X for some Ji > O.
(ii) - B is sectorial (cf. Definition 19.52).
We prove (i) in Section 19.20 (Theorem 19.E). The proof of (ii) can be found
in f-.riedman (1969 9 M).
A process in physics is called dissipative iff the entropy is increasing (e.g..
heat conduction or diffusion). Mathematically, such processes are frequently
described by semigroups with maximal dissipative operators. This motivates
the notion "dissipative operator." Maximal dissipative operators generate
nonexpansive semigroups. Such semigroups are very important for the follow-
ing reason. We replace "9 w. / in (106) with u i . \\'i. h. respectively. Then
II S(.) II 1 for all s > 0 implies
1111. (I) - 11 2 (1)11 11"'1 - "'211 + (t - ' 0 ) max 11/. (s) - 12(s)lI.
'0 s s s'
19.19. Proof of Theorem 19.D
155
for all t to, i.e., if "'1 and II are close to W2 and 12' respectively, then the
solution u I is close to u 2.
EXAMPLE 19.42. Let B: D(B) X .... X be a densely defined linear operator on
the H-space X over K = R, C. Then assumption (H I) above is satisfied if one
of the following four conditions is met:
(i) B is self-adjoint and (Bul u) S 0 for all u e D(B).
(ii) - B is the Friedrichs extension of a symmetric strongly monotone
operator.
(iii) B = - iH, where H is self-adjoint and the H-space X is complex.
(iv) B is skew-adjoint.
In all these cases, B is maximal dissipative.
PROOF. Ad(i). All i. > 0 belong to the resolvent set of B (cf. Problem 19.7).
l-fence R(j.1 - B) = X, i.e.. B is maximal dissipative.
Ad(ii). This is a special case of (i).
Ad(iii). All). e C with 1m A :;: 0 belong to the resolvent set of the self-adjoint
operator H (cr. Problem 19.7). Thus, all real ). :F 0 belong to the resolvent set
of B = - iH. The operator B is skew-adjoint. Hence Re(Bulu) = 0, i.e., B is
maximal dissipative. Note that (Bulu) = (ul-Bu).
Ad(iv). If X is complex, then iB is self-adjoint, i.e.. (iv) is a special case of
(iii). Note that (iB)* = - iB* = iB.
If X is real, then we extend B to the complexification of X. Here B remains
skew-adjoint (cf. Problem 19.6). 0
EXAMPLE 19.43. If assumption (H 1) above is satisfied for the operator B, then
it is also satisfied for all the operators B + ;./ with ;. E R.
In particular, assumption (HI) holds for B in the case where B - AI is
maximal dissipative for some ;. E R.
PROOF. If B generates the semigroup {S(t)}. then B + ).1 generates the semi-
group {e A ' S(t)}. 0
19.19. Proof of Theorem 19.D
After a time translation, if necessary, we can assume that to = o. In the
following we will frequently make use of the following two simple results:
(i) Let X be a B-space. If(};.) is a sequence of continuous functions fIt: [a, b] ....
X on the compact interval [a, b) and (};.) converges uniformly on [a,b]
to the function f: [a, b] .... X, then f is continuous and
S: fftds -+ S: f ds
as n -+ 00.
156
19 The Galerkin Method ror Differential and Integral Equations
(ii) If .r: [a. b] X is continuous, then
d I'
dt Jof(s)ds = f(t)
for all 1 E [a,b].
The proofs are the same as in the classical case X = R.
Step I: We show that
sup 1/ 5(1)11 < x
o, T
for all T > O.
( 1 08 )
By the semigroup property (83), it suffices to show that (108) is valid for a
fixed T. If this is not true, then there exists a sequence 'ft --+ + 0 as n --+ 'XI
with IIS(I,,)II n for all n. Since 5(1,,)u -+ u as n -+ 00 for each u e X, we
obtain sup" 115(t,,)11 < x by the uniform boundedness theorem A I (35). This is a
contradiction.
Step 2: We prove (107). By (108), there is a constant C with 115(1)" < C on
[0, T]. Since S(O) = J, C > 1. Let a = T- 1 In C. Each 1 > 0 can be represented
as 1 = nT + s with n = 0, I, . . . and 0 S s < T. Hence
/I S ( t ) (I = II S ( ,t T) S ( s) II S II 5 ( T) II " II S (s) II
< C"" I < CC'IT = Ce a ,.
Step 3: Let \\' E D(B) and 1 o. We show that
d
dl S(I)\\' = S(I)8\\' = 85(t)\\..
( 109)
This is the key relation of the theory of semigroups which generalizes the
classical form ula
d
- e,B = Be,B
dt
for real functions and for linear continuous operators B: X -+ X on a B-space
X. In particular. it follows from (109) that S(t)", E D(B) for all \\' E D(B) and
all t > o.
(I) Letting II -+ + O.
S(h) - I S(h) - I
't S(t)", = 5(1) h w -+ S(I)Bw.
By definition of the generator B, we obtain that 5(1)\\' E D(B) and
BS(t)", = S(t)Bw as well as
d+
dt- S(t)w = S(t)Bw.
Note that S(t + II) = S(h)S(t).
19.19. Proof of Theorem 19.D
157
(II) Let t > O. In order to show that
d-
dtS(t)w = S(t)Bw,
we use the identity
S(t)w - S(t - h)w _ S(t)Bw
h
== S(t - h) ( S(h): - w - Bw ) + S(t - h)Bw - S(t)Bw.
By (108) and by the definition oCthe generator B, the right-hand side goes
to zero as h -+ +0.
Step 4: Let t O. Integration of (109) yields
S(t)w - w == t S(s)Bwds == f BS(s)wds (110)
for all we D(B). Furthermore, for all w e X,
S(t)w - w == B t S(s)w ds. (III)
This follows from the identity
S(h) - I f' I r'
h Jo S(s)wds == h Jo (S(s + h)w - S(s)w)ds
== 1'+11 S(s)w ds - f: S(s)w ds.
By letting h -+ +0, the right-hand side goes to S(t)w - S(O)w and S(O) = 1.
Step S: D(B) is dense in X. Let w e X and t > O. Then
der I J '
w, = - S(s)wds -+ w
t 0
as t -+ + 0,
and w, e D(B) by (111).
Step 6: B is graph closed. Let WIt ..... wand Bw" ..... z as n -+ 00. By (110),
S(t)w lI - W II == t S(s)Bw lI ds.
Letting n -+ 00 and noting (108) we get
S(t)w - w == f S(s)z ds.
Hence (d/dt)S(t)w = S(t)z, i.e., Bw = z by letting t = o.
158
19. The Galerk in Method for Differential and Integral Equations
Step 7: Let of: [0, T[ -+ X be continuous and let u be a classical solution of
(105). We set
r'(s) = S(t - s)u(s),
Differentiation yields
f'(S) = - BS(t - s)u(s) + S(t - s)u'(s)
= - BS(t - s)u(s) + S(t - s)(Bu(s) + f(s))
= S(t - s)f(s).
O<s<t<T.
Note that the product rule is valid because of (108). Integration yields
f ,-t
I'(t - r.) - ('(1;) = . S(t - s)f(s) ds.
Letting f. -+ + 0,
t'(t) - r(O) = I SIt - s)f(s) ds.
Hence
u(t) = SIt)", + I S(t - s)f(s)ds.
( 112)
i.e., u is a mild solution of ( 105).
Consequently, the classical solution u of (105) is unique.
Step 8: Let B be the generator of a linear strongly continuous semigroup
: T(t)}. The preceding arguments show that, for each ,\0' E D(B), the function
u(t) = T(t)\\. is a classical solution of the initial value problem (105) with
10 = 0 and r = O. Hence T(t)\\-' = S(t),,' for all t > 0 and all K' E D(8). Since
D( B) is dense in ,,\'. this implies T(t) = S(t) for all , > O. i.e.. the generator B
uniquely determines the corresponding semigroup.
Step 9: let f: [0, "1"[ -+ X be CI. We show that u in (106) is a classical
solution of (105) with 10 = O.
By (109) it suffices to show that the function
g(t) = f S(t - ...)f(s)ds
is a classical solution of (105) for to = 0 and \\' = 0, i.e., ufO) = O. Let 0 S I < T.
(-"rom the identity
dcr t + I!! -=- g(t) = [' S(s / ( _ ) - f t_ -:-! ds
h Jo h
I f '+,.
+ Ii I Sls)f(t + h - s)ds
19.20. Monotone Operators and the Main Theorem
159
and (108) we obtain that the derivative g'(t) exists and that
g'(t) = t S(s)f'(t - s)ds + S(t)f(O).
Furthermore, it follows from
I (f '+" f ' )
A = h 0 S(t + h - s)f(s)ds - 0 S(t - s)f(s)ds
that
S(h) - I r ' I J '+"
h Jo S(t - s)f(s)ds = A - h I S(t + h - s)f(s)ds.
Letting h ..... + 0,
Bg(t) = B t S(t - s)f(s)ds = g'(t) - f(t).
Step 10: let B: X ..... X be linear and continuous and let f: [0, T[ -+ X be
continuous. By Standard Example 19.36, the operator B generates the semi-
group {S(t)} with S(t) = e'Se Here the series
x I"
e 'S = L -8"
".0 n!
converges absolutely with respect to the operator norm for all t e R. As in the
classical case X = R, we obtain that (d/dt)e's = Be 's with respect to the
operator norm and that u in (106) is a classical solution of (105).
The proof of Theorem 19.D and Corollary 19.40 is complete.
19.20. Monotone Operators and the Main Theorem
on Linear Nonexpansive Semigroups
Recall that a linear nonexpansive semigroup on an H-space X is a semigroup
{S(t)} of linear continuous operators S(t): X -+ X with IIS(t)1I s 1 for all t 0
and t...... S(t)w is continuous on R+ for each w EX. If {S(t)} is an arbitrary
linear strongly continuous semigroup on X, then by Corollary 19.40, there
are constants C > I and a so that {C- t e-. r S(t)} is a linear nonexpansive
semigroup. Thus, in order to characterize the class of all linear strongly
continuous semigroups. it is sufficient to characterize the class of all linear
nonexpansive semigroups.
Definition 19.44. Let B: D(B) c X -+ X be a linear operator on the H-space
X over K = R, C.
(i) B is called dissipative iff
Re(Bulu) S 0
for all u E D( B).
160
19 The Galerkin Method for Differential and Integral Equations
(ii) B is called 11IClximal dissipative iff it is dissipative and R(I - JjB) = X for
some Jl > O.
(iii) B is called accretil'e ifT(1 + JJB) is injective and (I + JJB)-' is nonexpansive
for all JI > O.
(iv) B is called nJQximal accretit'e iff it is accretive and R(J + JJB) = X for all
JI > O.
Theorem 19.[ (Hille (1948). Yosida (1948)). Let B: D(B) £; X -+ X he a linear
operator on the H-space X o('er K = R, C \vith D(B) = X. Then the follo,,';ng
three conditions are 11Jlltuall.r equivalent:
(i) B is the generator (r a linear nonexpans;r'e semigroup.
(ii) B is maximal dissipat;('e.
(iii) - B is max;nlal accretire.
The proof will be given in Section 19.22. Let I( = IR. Then the linear
operator B: D(B) c X -+ X is dissipative iff
(811 - Brill - r) < 0
for all u, v e D(B)t
i.e.. - B is monotone. In Section 31.2 we shall prove that B is maximal
dissipative iff - B is maximal monotone. This shows that there is a very close
("onlJect;on between the theory of monotone operators and the theory of se",;-
groups. Later on we will generalize this connection to nonlinear operators.
Corollary 19.45. The operator B is dissipat;t'e iff - B ;s accretire.
PROC)F. For alii' E D( B) and JI > O.
11(1 - JIB)l'1I 2 = IIt'1I 2 - 2 Re(Bt'lv) + Jl211Bv1l 2 .
By definition. - B is accretive iff 11(/ - JlB)vIl 2 > 1/ vll 2 for all t' e D(B). JJ > O.
o
19.21. The Main Theorem on One-Parameter
Unitary Groups
Recall that a one-parameter unitary group {5(l)} on the H-space X is a family
of unitary operators S(t): X -+ X for all t e R so that
S(t + s) = 5(t)5(s) for all tt seAt (113)
and t........ S(l)'" is continuous on R for each K' e X.
Theorem 19.F. Let B: D(B) c X -+ X be a linear operator on the H-space X
Ol'er I( = Rt C. Then the following three assertions are mutually equ;t'alent:
(i) 8 is the generator (f a one-parameter unitary group.
19.21. The Main Theorem on One-Parameter Unitary Groups
161
(ii) 8 and - 8 are ma:(imal dissipative and 0(8) = X.
(iii) B is skew-adjoint.
Corollary 19.46 (Theorem of Stone (1932». The linear operator B: D(B) c
X --. X on the complex H-space X is the generator of a one-parameter unitary
group iff B = - iH, where H is self-adjoint.
PROOF OF THEOREM 19.F. By Corollary 19.40, generators are always densely
defined.
(i) (ii). Let B be the generator of the one-parameter unitary group {S(t)}.
We set S:t(t) = S( + t) for all t O. Then {S.(t)} and {S_(t)} is a nonexpansive
semigroup with the generator 8 and - B, respectively. Now (ii) follows from
Theorem 19.E.
(ii) => (i). By Theorem 19.E, the operators 8 and - 8 generate a non-
expansive semigroup {S.(t)} and {S_(t)}, respectively. We set
S(t) = { S.(t) for t 0,
S_(-t) for tSO.
For all we 0(8) and t 0,
d
dt (S+(t)S_(t)w - S_(t)S+(t)w)
= BS_(t)w - S+(t)8w + 8S.(t)w - S_(t)8w = O.
Hence
S+(t)S_(t) = S_(t)S.(t) = 1 for all t o.
The semigroup property of {S+(t)} implies (113). From IIS:t(t)1I S 1 and I S
IIS+(t)IIIIS_(t)1I it follows that IISf(t)1I = I, i.e., {S(t)} is a one-parameter unitary
group. Obviously, 8 is the generator of {S(t)}.
(iii) (ii). Cf. Example 19.42.
(i) (iii). Suppose that B generates the unitary group {S(t)}. Differentiation
of
(S(t)vl S(t)w) == (vi w) for all v, w e D(B)
yields (Bvlw) + (vlBw) = 0, i.e., B is skew-symmetric; therefore, - B c: B.
where B* denotes the adjoint operator.
We show that 8- S; - 8. Let z e D(8-). According to (i) => (ii), the operator
:f: B is maximal dissipative and hence + B is maximal accretive by Theorem
19.E. In particular, R(I :t: 1J8) = X for fixed /J > O. Thus there exists awe
0(8) such that
(I - 1J8)w = (I + JJ8-)z.
Since - 8w == 8. w, (I + JJB*)(z - w) = O. Thus, for all y e D(B),
«I + /JB)ylz - w) == O.
Hence z - \\' = O.
o
162
19 The Galerkin Method for Differential and Integral Equations
PR()()F ()F COROLLARY 19.46. From (iB). = - ;B. it follows that B is ske\\'-
adjoint iff; B is self-adjoint. 0
19.22. Proof of Theorem 19.E
Step I: If B generates a linear nonexpansive semigroup {S(t)}. then - B is
maximal accretive.
To prove this let i. > O. Then: e - AI S(t)} is a nonexpansive semigroup whose
generator is B - il. By (110) and (111).
e - A'S(II\\, - It. = t e- AS S(s)( B - i.l)\\' ds for all \\' E D(B),
e-;"S(t)1" - I' = (8 - ;'1) t e-ASS(s)rds for all I' E X.
The operator B - i./ is graph closed, by Corollary 19.40. Thus. letting ,
+ 'Y_ . we find that
'" = I' t' - A' S(s)(i'/ - B) '" ds for all '" E D(B),
I" = (i./ - B) L' e- AS S(S)I' ds for all I' E X
(cf. Problem 19.4dJ.
Consequently. the operator i./ - B is injective and surjective. and
,,( i.1 - 8) - J I" II < /II'" L' e - As ds = i. - J II I" II
for all r E ...\'. i.e.. - B is maximal accretive.
Stt'P 2: If - B is maximal accretive, then B generates a linear nonexpansive
scmlgroup.
In order to prove this, we use the idea of the Yosida approxinztlt;olt B,.. We
set R" = (I - JIB) -1. By assumption. the operator
R,,: X D(B)
is nonexpansive and bijective. We set
BIJ = - I (R IJ - I)
for all JI > O.
Then BIJ: X X is linear and continuous. Hence BIJ generates the semigroup
S,.(t) = £1'8".
Formally. 8" 8 as JI + O. Our goal is to prove that the following limit
ex ists:
S dcC I . S
(t) ". = 1m" (t) '"
,,-o
for a II '" EX, t > O.
(114)
19.22. Proof of Theorem 19.E
163
(I) Since R;t B,. ,,, = B,,',
B" '" = R,. B,,'
Because (I - J,lB)R,. ". = "',
B" '" = BR" ",
for all W E D(B).
for a II ,,, eX.
Since II R,.II S 1,
II B,. "'11 S II B\\'II
for all \\. e D(B).
It follows from
IIR,,'\' - "'II = IIIlB"wll S IJIIBwl1
that R" '" -+ w as IJ -+ + 0 for each w e D(B). Since D(B) is dense in X and
II R" II < I, we obtain R" '" -+ ". as J,l -+ + 0 for each \\. eX. Hence
B" ". = R#j B'" -+ B'" as Il -+ + 0
for each \\' E D(B).
(II) Let (X = J,l-I. For allt > 0,
IIS,,(I)II = Ue- 2 'e 2R . r ll S e-'e'2R"., S I,
i.e., {S,,(I)} is nonexpansive.
( III) For all I > 0 and all positive ;", IJ, we get
I ' d
5.1(1) - 5,,(1) = 0 ds (5,,(1 - s)5 A (s» ds
= I 5,,(1 - s)5 A (s)(B A - B,,)ds.
and hence, for all '" e D( B),
II SA (I)'" - S,,(I)"'II S I liB A'" - B,,"'II S 2tllB"'II.
Consequently, for each '" e D(B), the sequence (S,,(I)"') converges as
IJ -+ + 0 uniformly on compact t-intervals. Since nS,,(t)1I 1 and D(B) is
dense in X, the same is valid for each '''I EX.
(IV) We now define the operator S(t): X -+ X through the limit (114). Recall
that {S,,(I)} is a nonexpansive semigroup on X for each JI > O. Since
IIS,,(t)"'1I s II "'II for all '" EX, J,l > 0, t > 0,
and since (S,,(I)"') is uniformly convergent on compact I-intervals as
J.l -+ + 0, the limit function I....... S(I)'" is continuous on R... for each ,,, eX,
and {S(t)} is a nonexpansive semigroup on X.
(V) Let C be the generator of {S(I)}. Let ". e D(B). As J.l -+ + 0, the sequence
(5"(I)B,, "') converges to (S(I)B"') uniformly on compact t-intervals. Thus,
letting JI -+ + 0, it follows from
5,,(I)W - W = t 5"(s)B,, W ds.
164
19. The Galerkin Method for Differential and Integral Equations
by (110), that
S(t)w - '" = t S(s)Bwds.
Differentiation at t = 0 shows that ". E D( C) and C'\' = B,,', i.e., B s C.
Since - B is maximal accretive, I - B is surjective. By Step t, I - C is
injective. Hence B = C.
Step J: If - B is maximal accretive, then B is maximal dissipative. This
follows from Corollary 19.45.
Step 4: If B is maximal dissipative, then - B is maximal accretive.
By Corollary 19.45, it suffices to prove that R(I - pB) = X for all J1 > o.
By assumption, there is a Po > 0 such that I - PoB: X ...... X is bijective
and
II (I - I' 0 B) - I II I.
" rrdbhuft j If. ,h..,. ".'"Ii chi 'ft.r... df't"MI"r (I aM I4I1J -, an N tdr Mil ,
with III - J10I < I. A limple continuation argument shows that (I - JI8)-1
exists on X for aliI' > o.
The proof of Theorem 19.E is complete.
19.23. Abstract Semilinear Hyperbolic Equations
We consider the initial value problem
U"(I) + AU(I) = I(U(I)), 0 < 1 < oc,
u(O) = Uo, u'(O) = roo
We make the following assumptions:
(H 1) The linear operator A: D(A) c X --. X is self-adjoint and strongly
monotone on the H-space X over I( = R. C. Let X E be the energetic
space of A with the norm II.IIE'I i.e., X E is the completion of D(A) with
respect to the energetic scalar product (ult')£ = (Alllr").
(H2) The operator I: X E -+ X is locally Lipschitz continuous, i.e., for each
R > 0 there is an L such that
( 115)
II/(u) - 1((')11 < Lilli - vilE
for a II U, l' E .,\, E wit h "u II E. II t"l1 E < R.
We set r' = u'. Instead of (115) we consider the first-order system
(:) = ( _0 A ) (::) + (;).
( 116)
19.23. Abstract Semilinear Hyperbolic Equations
165
Moreover, we set z = (u, v) and write (116) in the form
z'(t) = Cz(t) + F(z(t»,
z(o) = zoo
Let Z = X£ X X and D(C) = D(A) X Xl.
o < t < 00,
( II 7)
Proposition 19.47. Assume (HI). Then the operator C: D(C) s; Z.... Z is skew-
adjoint and generates a one-parameter unitary group {S(t)}.
Corollary 19.48. Assume (H I) and let f = 0, i.e., F == o. Then, for each Zo E
D(C), equation (111) has a unique classical solution, in the sense of Definition
19.39, given by
u(t) = S(t)zo.
For each Zo e Z, the function u is called a mild solution of equation (117)
with F == O.
PROOf. We set B = A 1 / 2. By Problem 19.8b, (ulii)£ = (BuIBii) for all u, ii E Xl.
Hence, for all z, Z E z.
(zlz)z = (ul u )£ + (vlv) = (BuIBii) + (vlv),
where (.1.) is the scalar product on X. Note that Cz = (v, -Au). For all z,
Z E D(C),
(zICZ)z = (BulBV) - (vIAii) = (Aulv) - (vi Aii),
(Czlz)z = (BvIBu) - (Aul v ) = (vi Au) - (Aulv).
Hence (zICZ)z = -(Czlz)z, i.e., C is skew-symmetric.
Furthermore, R(l + C) = Z. In fact, the equation z = z + Cz is equivalent
to
ii = u + v,
v = v + Au,
and the latter equation has the solution u = (I + A)-I (ii =+= v), v = + (ii - u).
Consequently, both the operators C and - C are maximal dissipative.
Theorem 19.F yields the assertion.
In this connection note the following important fact. Since (Aulu) 0 for
all u E D(A), the resolvent (I + A)-t: X .... X exists, by Problem 19.7b. 0
Corollary 19.48 follows from Theorem 19.D.
In order to solve the original problem (115) with f 0 we consider the
integral equation
z(t) = 5(t)zo + t S(t - s)F(z(s»ds. (118)
The solutions of (118) are called mild solutions of (117) and of (115). Note that
each sufficiently regular solution of (115) is a solution of (1 t 7) and hence a
166
19. 1 he Galcrkin Method for Differential and Integral Equations
mild solution of f 117), by Theorem 19.D. We set Y = c( [ - T, T], Z) with the
usual norm
11=11)' = max 1I=(t)lIz.
-T'T
Theorem 19.G. Suppose Illal (H I). (H2) hold. Tile 11 , for each Zo = (u o ' ('0) in Z.
there are IUllnhers T > 0 and r > 0 so that equatio" (115) has exactly one mild
solutio" = E Y \\,;tll 11= - =011 r S r.
PR(X)J-. Let 11'11 denote t he norm on Y and let
AI = := E Y: /1= - =011 < r}.
We write f 118) in form of the following .fIxed-poi", equation:
.. - I,..
... - 1'\.. .
= EM.
(118.)
Noting (H2) and IIS(t)!lz < I we obtain that
ilK: - 1\:11 max (' IIF(z(s)) - F(z(s))lIzds
T!;IS T Jo
< TL(r)lI= - =11 for all z,: E M.
This implies
II /( = -- = 0 II < If K: - K = 0 II + II K : 0 - = 0 II
< TL(r)lI= - =011 + IIK=o - =otl
for all Z E M.
Hence we can choose rand T in such a way that K(M) c: "', and K: M -+ M
is k-contractive. Consequently, (118.) has a unique solution by the Banach
fixed-point theorem (Theorem I.A). 0
19.24. Application to Semilinear Wave Equations
We consider the initial value problem
u" - II + n,2u = - i..u.\ on G x ]0. T[.
II(X.O) = uo(.x) on G,
II,(X, O) = l'o(.) on G,
II(X, I) = 0 on aG x ]0, T[.
Here G is a bounded region in R N , N > I. In the case where ;.. = O. this is the
so-called Klein - Gordon equation. Problem (119) describes the classical field
fI of a meson of mass m. The nonlinear term - ).u 3 . with the so-called coupling
constant ;. > O. describes the self-interaction of the field. This is only a classical
model of a quantum field. since quantum fields correspond mathematically to
( 119)
19.2S. The Semilinear SchrOdinler Equation
167
distributions with values in an H-space (cr. Part V). In order to formulate (119)
as an operator equation of the form
u H + Au = )/(u) on ]0, T[,
u(O) = Uo, u'(O) = Vo
with I(u) = - u 3 , we set X = L 2 (G), D(A o ) = Co(G), and
Aou = (- + m 2 )u.
( 120)
For all u E D(A o ),
(Aoulu) = (-L\ulu) + m 2 (ulu) m 2 (ulu).
let A: D(A) X -+ X denote the Friedrichs extension of Ao. Then the energe-
tic spaces of Ao and - are the same up to an equivalent nor'11, i.e.. the
energetic space of A is equal to X E = Wl(G). In Problem 19.IOb it will be
proved that the operator I: X[ -+ X is locally Lipschitz continuous in the case
where N = 3. Hence we obtain the following result.
EXAMPLE 19.49. Let m O.
(a) Let ; = O. Then, by Corollary 19.48, for each U o e D(A) and too e Xl'
equation (120) has a unique classical solution. For each "0 E X [, Vo eX,
there exists a mild solution of ( t 20).
(b) Let ; > 0 and N = 3. Then Theorem t 9.G applies to (120), i.e., for each
"0 E X E and V o E X there exists a unique local mild solution of (120).
The energy of (I t 9) is defined by
E(t) = 2- 1 f [ ul + m 2 u 2 + f (D , u)2 + r l ;'U 4 J dX'
G '-I
In the case where i. = m = 0, this corresponds to the energy of the wave
equation in Section 19.16. If u is a sufficiently smooth solution of (119), then
integration by parts yields
£'(t) = L u,[u" - Au + m 2 u + ).u J ] dx = O.
i.e.. E(t) = constant. This describes the conservation of energy.
19.25. The Semilinear Schrodinger Equation
We consider the Schrodinger equation
U'(I) = - iHu(t) + F(u(I» on] - T. T[,
u(O) = Uo.
(121)
168
19 The Galcrkin Method for Differential and Integral Equations
We make the following assumptions:
(H I) The linear operator H: D(H) c X -+ X is self-adjoint on the complex
H-space X.
(H2) The operator f.: X -+ X is locally Lipschitz continuous.
As we shall show in Part V, equation (121) with F = 0 is the basic equation
of quantum mechanics. The operator H corresponds to the energy of the
quantum system.
Proposition 19.50. Suppose that (HI) holds. Then -iH is the generator o.t a
one-para"Jeter fUJitary group {S(t)}.
Corollary 19.51. Suppose that (H I) holds and let F = O. For each 110 E D(H),
eqllation (121) has a 14nique classical solution given by
u(t) = S(t)uo.
For each "0 EX. this is called a mild solution of (12 t).
PROOF. Cf. Theorems 19.D and 19.F.
o
I nstead of ( 121). we now consider the integral equation
U(I) = S(t)u o + t S(t - s)F(u(s)) ds. (121*)
The solutions of ( 121*) are called mild solutions of (121). Let Y = C( [ - T, T], X).
Theorem 19.". Suppose that (H I). (12) hold. Then, for each "0 E X. there exist
1JunJhers T > 0 and r > 0 so that equation (121) has exactly one mild solutio,.
".;th 1111 - 1101/ t' < r.
PR()()f. Use exactly the same argument as in the proof of Theorem 19.G. 0
19.26. Abstract Semilinear Parabolic Equations,
Fractional Powers of Operators, and
Abstract Sobolev Spaceg
We consider the scmilinear equation
u'(1) + Au(t) = /(t. u(t)) on ]t o . to + T[.
u(t o ) = "'.
( 122)
19.26. Abstract Semilinear Parabolic Equations
169
and we make the following assumptions:
(H 1) The linear operator A: D(A) s; X -+ X is self-adjoint on the H-space X
over K = R, C, and A is semibouned, i.e., there is a real constant c such
that
(Aulu) c(ulu)
for all u e D(A).
In the following, let lIuli denote the norm on X.
(H2) We choose a fixed real number a with a + c > o.
Then the operator 8 dcr A + al is strongly monotone. Hence, in the sense
of the general functional calculus for self-adjoint operators. the operator
8 2 : X -+ X is self-adjoint for each a > O. We equip X« = D(B2} with the graph
norm
II u II: dcr II u II + II u II.
Then X" becomes a B-space over K (cf. Problem 19.4a).ln Lemma 19.55 below
we show that
lIull dllull
for all u e D(Ir) and fixed d > o.
Hence
II u II. dcr II Ir u II
is an equivalent norm on X«. For a = 0, X 2 = X. The space X 2 is called an
abstract'Sobolev space.
(H3) There is an a e [0, I[ such that
II/(t, u) - I(s, 1')11 L( I t - sl' + II u - 1,'11.)
for all (t, u), (s, l') in a neighborhood of the point (to, ",) in R x X 2 . Here
L > 0 and 0 < fJ 1 are constants and the initial value w e X. is given.
The decisive adt'antage of the spaces X 2 is that we can treat more general
nonlinearities than in X. In applications, the functions in X:r are smoother than
those in X. A concrete simple example will be considered in the next section.
By Examples 19.42 and 19.43. the operator - A generates a linear strongly
continuous semigroup {S(t)}. Instead of the original problem (122) we con-
sider the integral equation
u(') = S(, - 'o)w + [' S(, - s)/(s. u(s)) ds. (123)
J,o
We set Y = C([t o , to + T], Xs) with the norm
lIuli r = max lIull 2 .
'0 , S '0 + T
The solutions of (123) are called mild solutions of (122).
Theorem 19.1. Suppose that (H I) through (H3) hold. Then there are positive
numbers T and r such that the original problem (122) has exactly one mild
solution u e Y K'ith II u - ""II r r.
170
19 The Galerkin Method for Differential and Integral Equations
"
..
/'"'
/
,/'
/'
A
::./
/"
O(
c "
""-
""-
"
'-....
""-
""-
y
J
.
Figure 19.7
Definition 19.52. The operator A: D(A) c X -+ X on the complex B-space X
is called s('(.toricll iff the following are true:
(i) A is linear. graph closed. and D(A- ) = X.
(ii) There are numbers C E R. M > I. and j' E ]0. n/2[ such that the open sector
I: = :i_ E C: I < larg(i- - (")I < n.i_ :f:. c:
is a subset of the rcsolvcnt sct of .4 and
1(i_1 - A) III < MI; - cl"'
for all i_ E 1:
(cf. Fig. 19.7).
Corollary 19.53. Theore"r 19.1 re"rai,rs true ij' ,,'e replace (HI) hy ,lrt' "lore
general as.u"lp';()n tlra' ,Ire operator A is sectorial.
The proof of Theorem 19.1 will be given in Section 19.28. This proof is based
on properties of the fractional powers lJ2 which we shall prove in Lemma
19.55. The proof of Corollary 19.53 proceeds completely analogously to the
proof of Theorem 19.1 by noting the following. If A is sectorial. then - A
generates an analytic semigroup and it is possible to define fractional powers
B2 which have the same properties as in Lemma 19.55 below. The definition
of B2 in the general case and its properties can be found in A 2 ( 110).
Remark 19.54 (Regularization). Using the properties of fractional powers. one
can show that the mild solutions above are just classical solutions of (122).
The proof for sectorial operators can be found in Henry (1981. L). The tech-
nique of proof is the same as in the theory of singular integral equations.
Theorem 19.1 and Corollary 19.53 allow important applications to general
classes of parabolic equations and parabolic systems. since such problems are
related to sectorial operators. This will be discussed in A 2 ( IIOc).
19.28. Proof of Theorem 19.1
171
19.27. Application to Semilinear Parabolic Equations
We consider the initial-boundary value problem
II, - L1u = ).uU x on G x ]0. T[,
Il(.t, t) = 0 on cG x [0, T[,
1I(.t, O) = ",(x) on G,
with a constant ;.. Here G is a bounded open interval in R, i.e., .1u = U xx . Let
X = L 2 (G) and let A be the Friedrichs extension of - .1 with D( - .1) = Co(G).
By Problem 19.8b,
X 1'2 = XE. = wi (G),
i.e., X I 2 is the energetic space of - A. We set f(u) = ;.uu x . By Problem 19.IOd,
the mapping
(t 24)
f:X£-+X
is locally Lipschitz continuous.
Therefore, we can apply Theorem 19.1 to (124). This way, for each \\' E
W 2 1 (G), we obtain a mild solution u: [0, T] -+ W 2 1 (G) for (124) on a sufficiently
small time interval, and this solution is unique in the sense of Theorem 19.1.
19.28. Proof of Theorem 19.1
We set to = O. Furthermore. we make the following additional assumption:
(H2.) We have c > 0 and a = 0, i.e., B = A and lIuli z = II ASuli.
We free ourselves from (H2.) in Step 3 below.
Step I: Properties of fractional powers.
I n what follows we will use the general functional calculus for self-adjoint
operators summarized in the Appendix. A 2 (96). The reader should convince
himself that the following proof becomes completely elementary if, in addition,
\ve assume that the operator A possesses a complete orthonormal system of
eigenvectors. In this case we can use the special functional calculus for self-
adjoint operators introduced in Section 19.14. Finally, the reader should also
convince himself that the following proof can be immediately translated to a
proof of Corollary t 9.53, if we assume that the operator A is sectorial. In this
case we have to use the calculus of fractional powers for sectorial operators
which we summarize in the Appendix, A 2 (110).
Lemma 19.55. T/lere ;s a constant C > 0 depending on 2 e [0, I [ such tlJat the
.Iollo\\'ing are true:
172
19. The Galerkln Method for Differential and Integral Equations
(i) Hull s; C /I A 2 ull for all u E D(A 2 }, i.e., the embedding X tl X is continuous.
(ii) lI(e-fA - l)ull < CttlllA:lull for all U E D(A), t > O.
(iii) IIAe-f'''1I < C,-2 and e-1AA tI c A 2 e- 1A for all t > O.
PR()()F. Let J = [c, x [ and t O. Then
A2" = L ;'2dE A u,
where u E D(A2)iffJJ ;.2dIlEAUIl2 < . Furthermore.
e-,A" = L e- A , dEAu.
where u E D(e-fA)iffJJe-2ArdIlE.\uIl2 < oc. In particular.
"e -, A ,," 2 < Lie - A' 1 2 d II E AU 11 2
< L dilE A uII 2 = lIuII 2 ,
I t follows analogously to Section 19.15 that {e -I A} is a semigroup generated
by - A (cr. Problem 19.13).
Ad(i). For II E D(A 2 ),
IIA 2 ull 2 = L i. 22 d1lE A u1l 2 > L c 2I1 dIEA,,"2
= c 2 :1l1ull 2 .
Ad(ii). The real function x.......(e- X - I)/x:l is bounded on ]0. oc[. Thus. for
alill E D(A 2 },
lI(e',A -1}uIl 2 = Lie-fA - 11 2 dllEAUll 2
L 0211 A 211dllE A ull 2 = Ct 211 11A"u1l 2 ,
Ad(iii). Let t/J(;..) = ;.. 2 e -fA for ). > O. The real function x....... x(Je- X is bounded
on IR.. Thus. for all U E X. t > O.
1It/t(A)uIl 2 = L 1t/t()')1 2 dIlE A uIl 2
< L 0-211d1lE A u1l 2 = Ct- 211 11u1l 2 ,
By A 2(96v), t/J( A) = A 2 e- fA and e- 1A A 2 t/1(A).
o
19.28. Proof of Theorem 19.1
173
Step 2: The fixed-point theorem of Banach.
Let ". E D(A ar ) and M = {u E Y: lIu - wllr} s r. We write the integral
equation
u(t) = e-uw + t e-I'-SlAf(s,u(s»ds
in the form of the fixed-point equation
u = K u,
ue M,
( 125)
.
I.e.,
(KuHt) = e-'A w + t e-If-SIAf(s,u(s»ds.
By the Lipschitz condition (H3),
II(KuHr) - (Kv)(r)lls S L f IIA"e-cr-sIAII dsllu - vllr,
and, for all t e 1:
II(K u)(t) - "'liar S II(K U)(I) - K wiL. + II K w - wll ar
S IIA"(e-'A - I)wll + t II A"e- If - SIA II ds(Lllu - wllr + F),
where 11/(1, w)1I s F on [0, T].
By Lemma 19.55, there exist numbers r > 0 and T > 0 such that K (M) M
and the operator K: M -+ M is k-contractive. In this connection, note that
J ' IIAse-I,-sIAIi ds S J ' C ds = Ct l -" .
o 0 (I - sr I - a
Furthermore, note that w E D(A ar ) and hence
Aar(e- rA - l)w = e- rA Aclw - A ar ", .... 0
as t -+ + 0, since the semigroup {e- rA } is strongly continuous.
Now the Banach fixed-point theorem (Theorem I.A) yields the existence of
a unique solution of (125).
Step 3: We set u(t) = e llr V(I). Then the original equation
u'(t) + Au(t) = /(1, u(t»
is transformed into
v'(t) + (A + al)v(t) = e-.r/(t, ear v(t».
Condition (H2.) is satisfied for the latter equation. Hence, without loss of
generality, we can assume that (H2.) holds.
The proof of Theorem 19.1 is complete.
174
19. The Galerkin Method for Differential and Integral Equations
19.29. Five General Uniqueness Principles and
Monotone Operators
We consider the equation
f E)
.4u = f..
IIEX.
Theorem 19.J. Let A: D(A) X -+ X be a linear operator on the H-space X
ol'er K = R, C. Then, for each f EX, equation (E) has at most one solution iI'
case ol,e (f the jollo,,'il'g fit'e conditions is satisfied:
(i) Strict monotonicity (positivity):
Re(Alllu) > 0
(ii) A priori estimate (stability):
for all II E D( A) K,;th II O.
(' II u II < 1/ Au 1\
(iii) Contractivity of .4 - I:
II Au - II /I < II 1111
.for all II E D(A) and fixed c > o.
for all II E D( A) ,,'itlt II:F O.
(iv) Monotone type: With respect to an order cone on X,
All < AI'
implies
u S r.
(v) Duality: There exists aI' operat'}r B: D(B) c X -+ X K,;th R(B) = X and
( A III t') = (III BI')
.for all u E D(A), (' E D(B).
PR(X)f-". We have to show that Au = 0 implies II = O. This is obviously satisfied
in (i)" (ii)" and (iii).
In (iv), .411 = 0 implies that All 0 and Au > O. Hence u < 0 and II > 0, i.e.,
" = o.
In (v)" All = 0 implies that (ul\\') = 0 for all \\' in the dense set R(B). Hence
u = o. 0
This theorem summarizes important uniqueness principles which allow
many diversified applications to concrete problems.
The following result tells us in which sense condition (v) is necessary and
sufficient for uniqueness. Here A. denotes the adjoint operator.
Corollary 19.56. LeI A: D(A) c X -+ X be a linear operator on the H-space X
orer K. ",here D( A) and D( A. ) are dense in X. Then the following t,,'o statement s
are equiralent:
(i) For each fE X, equation (E) has at most one solution.
(ii) R(A*);s dense in X.
PR()()f". (i) (ii). If R(A*) is not dense in X, then there exists a u 0 in R(A *} l
by the perpendicular principle in Section 18.11. Since D(A) is dense in X" we
19.30. A General Existence Principle and Linear Monotone Operators
175
may assume that u e D(A). Hence
(Au I v) == (uIA.v) = 0
for all v e D(A.).
Since D(A.) is dense in X, we obtain Au = O. This contradicts u :;: O.
(ii) (i). Use Theorem 19.J(v) with B = A.. 0
In the following we generalize Theorem 19.J to nonlinear operators.
Proposition 19.!7 (Strict Monotonicity and Uniqueness). Equation (E) has at
most one solution in the case where one of the following two conditions is valid:
(i) Let X be an H-space over K. The operator A: D(A) c: X X is strictly
monotone, i.e.,
Re(Au - Avlu - v) > 0
for all u, v e D(A) with u:;: v.
(ii) LeI X be a B-space over K. The operator A: D(A) c: X X. is stricti}'
monotone, i.e.,
Re(Au - Av,u - v) > 0
for all u, v e D(A) witll u:;: t'.
PROOF. Au = Av implies u = v.
o
Proposition 19.58. Let A: D(A) X Y be an operator, where X and Yare
B-spaces. Then, for each feY, equation (E) has at most one solution in the case
"'here one of the following three conditions is satisfied:
(i) Stability:
cllu - vII s IIAu - Avll
for all u, ve D(A) and fixed c > o.
(ii) Contractivity of A-I:
II(Au - u) - (Av - v)1I < lIu - vII
for all u, t' e D(A) "'ith u:;: t'.
(Hi) Monotone type: With respect to order cones on X and Y,
Au S Av
implies
u S v.
PROOF. Let Au = Av. Then u = v by (i) or (ii).
In (iii), Au = Av implies Au S Av and Au Av. Hence u S v and u t', i.e.,
u = v. 0
19.30. A General Existence Principle and
Linear Monotone Operators
The equation
(E) Au = f, u E X
has a solution for each feY iff R(A) = Y, i.e., A is surjective.
176
19. The Galerkin Melhod for Differenlial and Integral Equations
Theorem 19.K. Let A: D(A) c X -.. Y be a linear operator \\,here X and Yare
B-spaces orer IK = IR. C. Tllen. \\'e hare R(A) = Y iff the follo,,'ing t,,'o cOlldi-
lions are satis.fled:
(i) R( A) is closed.
(ii) R(A)l = rO:.
Corollary 19.59 (.4 prior; Estimates and Existence). Let A: D(A) c X -.. Y be
a linear graph closed operator "'here X and Y are B-space over K. Suppose that:
(i) A priori estimate. There is a c > 0 such that
c 1!1I11 S; I' Aull
for all u E D(A).
(ii) R(A)l = :0:.
Then. .for each fEY. equation (E) "as a ullique solution.
Let Y be a B-space (resp. H-space). Then, by definition, R(A) 1 = {O} means
that
(l'.. Au) = 0 for all U E D(A) implies l' = 0
(resp. (rIAII) = 0 for all II E D(A) implies l' = 0). If D( A) = Y, then there exists
the dual operator A* (resp. adjoint operator A*). In this case we have
R(A)l = {O} iff N(A*) = {O}.
PR()()f 01- THEOREM 19.K. Obviously. R(A) = Y implies (i), (ii). Conversely.
suppose that (i) and (ii) hold and R(A) =F Y. Then R(A) is a linear closed
subspace of I'.
(I) Let Y be an H-space. By the perpendicular principle in Section 18.11. there
is a l =F 0 in R(A)l. This contradicts R(A)l = {O}.
(II) Let Y be a B-space. Now we use the separation theorems for convex sets
in Section 39.1 which follow from the Hahn- Banach theorem. To this
end we choose an element Y E Y with}' f R(A). Then y O. The compact
set y} and the closed convex set R(A) can be strictly separated by a closed
hyperplane. i.e., there exists arE Y. with (t', y) > 0 and (1" Au) 0
for all U E D(A). Hence (v, Au) = 0 for all u E D(A). This contradicts
R(A)l = {O}. 0
This proof makes use of the following general principle:
Passing .(roln H-spaces to 8-spaces, replace the perpendicular principle by tlte
separation of convex sets (the Hahn-Banach theorem).
Recall that we have shown in Section 18.11 that the following three principles
are mutually equivalent in H-spaces: the perpendicular principle, the Riesz
19.30. A General Existence Principle and Linear Monotone Operators
177
theorem, and the existence principle for quadratic minimum problems. This
underlines the importance of the Hahn-Banach theorem as a general
e.'(;stence principle in functional analysis.
PROOF OF COROLLARY 19.59. The a priori estimate yields the closedness of the
range R(A). Hence Corollary 19.59 is a special case of Theorem 19.K. In fact,
let (j;.) be a sequence in R(A) with j;. -+ f as n -+ 00. Then j;. = Au" and
c lIu" - u",11 s II Au" - AU",II for all n, m.
Hence UtI -+ U as n -+ oc. Since A is graph closed, Au = f.
o
Remark 19.60 (Application to Linear Monotone Operators). We want to show
that Theorem 19.K generalizes our results on linear monotone operators in
Section 18.11 e. Let X be an H -space over k = R, C.
(i) We first consider the equation
Bu = /.
ue X,
( 126)
where the operator B: X -+ X is linear, continuous, and strongly monotone,
i.e., there is a c > 0 such that
Re(Bulu) cllull 2 for all u e X.
This implies c lIull 2 s I(Bulu)1 s IIBuliliuli and hence
c II u II s II Bu II for all u eX.
Moreover, R(B)l = {O}. Indeed, (vIBu) = 0 for all u e X implies
c 111'11 2 S Re(Bvl v) = 0,
i.e., (' = o.
By Corollary 19.59, equation (126) has a unique solution for each f eX.
(ii) Next we consider the equation
Au = b, u e X, (126*)
where the operator A: X -+ X. is linear, continuous, and strongly monotone,
i.e., there is a c > 0 such that
Re(Au,u)clluIl2 forall ueX.
This implies c IIull 2 S I (Au, u) I s IIAuliliuli and hence
c lIuli s II Aull for all u eX.
Furthermore, R(A)l = {O}. Indeed, (v, Au) = 0 for all u e X and fixed I' e
X.. implies
c II vII 2 S Re(Av, v) = Re(v, Av) = 0,
i.e., t' = O. In this connection, note that each H-space X is reflexive. Thus, each
178
19. The Galerkin Method for Differential and Integral Equations
element l' E IY.. can be regarded as an element in X with
<,.. \,.) = (\r.l')
for all l' E X, '" EX..
In particular. (.41'.1') = (I'. Al') (cf. Sections 21.5 and 21.6).
By Corollary 19.59. equation (126.) has a 1I1,ique solutioll for each hE X*.
PR()BLEfS
19.1. .\1; '\:ed hOlilldary ralEle prohlenl. Formulate the generalized problem for
- Il = f on G.
II = y on t, G.
( 127t
('"
--:-. + clfl = II on ("-2 G.
('"
and find the corresponding Galerkin equations. In this connection, let t l G and
t]. G be disjoint sufficiently regular subsets of cG with iG = t'. G U (")2 G.
Solution: The generalized problem reads as follows:
f. ( .t D;uD,1' - li' ) dX + f (all -1I)rdO = 0 (128)
{, I' i:G
for allt E .\1 and
u = y on t, G.
" here
\f = {I' E CltG): (' = 0 on (I G:..
f I) t 127) => f 12K). From (J 27). by multiplication by t' and subsequent integra-
tion by parts. it follows thai
f ( ... ) f iu
LD,IlD;r-!r'dx- ;-1'(10=0.
(i i-I .-Gln
( 129)
This yields (128). since (' = 0 on t. G.
U It (128) (127). Let II E ('2(G) be a solution of (128). Then integration by
parts yields
r (- L\1I - J ) r (/. + f ( CiU - " + ) I' (10 = 0
... (i f"G l n
for aliI' E .\f. This implies (127).
We suppose that we can extend the boundary function y 10 the set G.
Then the Galerkin equations, which follow from (128), read as follows:
f. ( .t D;"D,I'j - .lii ) d."( + f (CUI - II)!') dO = O. j = I..... II.
G 1-:.1 ....lei
where u = L; I C}I'} + g and t"} E AI for all j.
19.2. Variational prohle,n. Show that (127) is the Euler equation to a variational
problem.
Problems
J79
Solution: We consider
r ( ! t (D,U)2 - fU ) dX + f (lall2 - hll)dO = min!.
JG 2 ,-, i*zG
II =- 9 on c, G.
Let u e C 2 (G) be a solution of(130). Denote the left-hand side of(130) by J(u)
and set cp(t) = J(u + tt') where t' eM. Then cp'(O) = O. This is precisely (128),
whih implies (127).
( 130)
19.3. S,'mmetric operators. Let B: D(B) s; X X be a linear symmetric operator
on the H-space X over k == R, C.
19.3a. Show that all eigenvalues of B are real.
Solution: From BII = i.1I and (1IIu) == 1 it follows that
i. = (III BII) = (8ulll) = I..
19.3b. Show that two eigenvectors of B with different eigenvalues are orthogonal.
Solution: From BII = ).11 and Bt, = /Jt' with i. #: Il it follows that
(A - /J)(ult.) == (Bult') - (1I18t,) == 0,
and hence (ult') = O.
19.3c. Let {u,,} be a complete orthonormal system of eigenvectors of B. Show that
the corresponding system {)..,,} of eigenvalues contains all the eigenvalues
of B.
Solution: Let BII = i.1I with II :#: 0 and i. :#: i.... for all n. By (26).
II = L (11"111)11,,.
..
By Problem 19.3b, (11,,111) = 0 for all n. Hence II = O. This is a contradiction.
19.3d. Show that if D(B) = X and B is continuous. then
fJ de( sup I( Bill 11)1 == "BII.
III -,
Here assume X to}.
Solution: From I(Blllu)1 S II BI1 111111 2 it follows that fJ S II BII. Conversely, we
show that II BII S fJ. To this end, we set
t' + = ).11 :t: l- I BII.. i. > O.
It follows from (B 2 11111) = (Bill 811) that
II BII1I 2 = 4 - I [( Bt, + It' +) - (Bt' _I (' _ )]
S 4 - I P( II t' + II 2 + II t' _ 1! 2 )
=: 2 - I fJ (A 2 111111 2 + ;. - 2 II BII1I 2 ).
Letting l 2 = "BIlII and 111111 = I, we find that 11811n 2 s JlII Bun. Hence II BIt S p.
19.4. Graph closed operators. The operator A: D(A) S X Y, where X and Yare
B-spaces, is called graph closed iff the graph
G(A) = {(II, Au): u e D(A)}
180
19. The Galerkin Method for Differential and Integral Equations
IS closed in X x r. I.e.. for each sequence (u,,) in D(A), it follows from
II" ..... II in X as n --+ x,
Au" ..... 1" in Y
as n --+ x'.
that II E O(A) and Au = f. This is an important generalization of the notion of
continuity
Let A: O( A) c .\" -+ X be a linear operator. We define the graph llorm
II . = I U i + II A 1111
for all u E D(A).
The hnear operator B: D( B) c X -+ X with A c B is called the closure of A iff
G(A) = G(B).
We "rite B = A. If the closure A exists, then it is unique. The linear operator
.-1 is called t'$.'it'luiall.\' self-adjoint iff the closure A is self-adjoint. This is a
fundamental notion in quantum theory.
19.4a. Show that the linear operator A: O(A) c X -+ X on the 8-space X is graph
closed iff D( A) is a B-space with respect to the graph norm.
Solution: Let A be graph closed and let (u..) be a sequence in O(A) \\'hich is
a Cauchy sequence \\'ith respect to the graph norm il.II.... This implies the
convergence II" ..... II as II -+ X and Au" -+ r as II -+ x. Hence All = f. i.e., D( A)
IS a B-space "'ith respect to 11.11..
The converse follows analogously.
19.4b. Let.\" bean H-space. We set
(1111'). = (ulr') + (AulA!')
for all u. r E O( A).
Show that the linear operator A: D( A) s;;; X -+ X is graph closed iff O( A) is
an H-space with respect to the graph sfalar product (.1. )...
Solution: This follows as in Problem 19.4a.
19.4<:. Let A: 0(.4 t c .\" .X a linear operator on the B-space .,\". Show that if .4 is
injective and graph closed. then A -I is also graph clof.ed.
Solution: G( 4) = G( A I).
19.4d. Our goals are the following two important formulas:
(I ( dIIU) )
- (AIIU)) = A - - .
dt dt
A r u(t)d, = r Au(" dt. (1321
let A: O( A) £; .\' ..... X be a linear graph clo$ed operator on the B-space X.
Show that:
(B If the functions u: U(n c R ..... D(A) and S"-' AII(S) are differentiable at r.
then ( 131 ) holds.
(ii) If the functions II: [a, h] ..... D(A) and Au('): [a. h] -+ X are continuous on
the compact interval [a. b]. then (132) holds.
Solution: Let "',,(t) = (u(t + h) - u(t))jh. Then
(131t
A 14(1 + II) - Au(t)
Au. II (t) = ---------.
h
Problems
181
Letting h -+ 0, we have "',.(1) ..... U'(I) and the right-hand side goes to (Au(')'.
Since A is graph closed, we obtain (131).
Formula (132) follows analogously by using the fact that the integral is the
limit of sums.
19.5. Adjoint operators. Let X be an H-space over I( = R, C. We consider two
linear densely defined operators A: D(A) c X -+ X and S: D(S) e X..... X.
Let A * and 8* denote the corresponding adjoint operators.
19.5a. Show that A* is graph closed.
Solution: Let A*u" -t t: and u" -t U as n ..... . It follows from
(K'I A * u,,) = (A K'lu II ) for all \\' E D(A)
that ("'It') = (AM'lu) for all ",. e D(A), and hence A*u = 1'.
19.5b. If A S, then S* s; A *. Furthermore, A is symmetric iff A s; A * and A is
skew-symmetric iff A - A *.
Solution: Use the corresponding definitions in Section t 9.8.
In particular, we obtain that if A is symmetric, then so is A**. In fact,
A s; A* implies A** s; A*, and hence A** (A**)*.
19.5c. Show that if A is self-adjoint, then A has no proper symmetric extension, i.e.,
A is maximal symmetric.
Solution: let A S where S is symmetric. i.e.. S s; S*. From A = A * we
obtain that
A s; S s B* s; A* = A,
and hence A = S.
Analogously, we obtain that each skew-adjoint operator is maximal skew-
symmetric.
19.5d. Int'erse operator. let A: D(A) s; X ..... X be symmetric with R(A) = X. Then
(A -1)* = (A.) -I ,
where all the appearing inverse and adjoint operators exist.
In particular, if A is self-adjoint. then so is A -I.
Solution: The operator A is injective. In fact. Au = 0 implies
(u I At') = (A u 1(') = 0
for all t' E D(A).
Since R(A) is dense. u = o.
The same argument shows that A. is injective. Since R(A) = X. the adjoint
operator B = (A- t ). exists. We have
(ult') = (A-I Ault') = (AuIBt') for all u E D(A). (' E D(S).
(:1\\') = (AA- I zlw) = (A-I zIA.,,') for all '" E D(A.), Z E D(A- I ).
Hence A* St' = vand SA.,,-' = K', i.e., S = (A*)-I.
t 9.5e. If A: X ..... X is linear and symmetric. then A is continuous.
Solution: Since A = A*. the operator A is graph closed. By the closed graph
theorem A I (3ge), A is continuous.
19.5f. Continuous linear operators. let A, 8: X ..... X be continuous linear operators
on the H-space X over k = A, C. Show that:
182
19. The Galerkin Method for Differential and Integral Equations
tit (A.). = A.
fii) A.(i.ll + Jlt') = A.u + jlA.v for all u, t' e X and all ie, JJ E IK.
(iii) (A B). = B.A..
fiv) IA.!I = I'A I.
Solution: Adh) tiii). Use the definition of A. and 8..
Ad(iv). From (Aulr) = (uIA.I') it follows that
.IAull = sup UAulr'U < lIu!! IIA.II.
I' I
and hence IIA,' S I,A.i,. By (i), ".4.11 < !I A/!.
19.5g.. Posit ire sell-adjoint nperators. Let 1..,(X. X) denote the set of all linear con-
tinuous self-adjoint operators A: X -+ X on the H-space X over k. For A.
8 E l..,( \'. .\'), we define
AB
iff
( A III u) < (8u I u)
for all II E .X.
For A, B. (" E l..,f X. X). prove the following:
fit If A > 0. then A 1 / 2 O.
tii) It follows from
A"
and C 0, A(' C A, 8(' c: C8 that AC < BC.
(Iii) It follows from
al < A < hi
and Cl, /> > 0 that A -, belongs to t.(X. X) and
/>-11 < A -I < a-II.
flint: C.r. Riesl and Nagy (1956. M), Section 104. The statement (Iii) will be
used in t he proof of Theorem 26. B.
19.6. ConJple xi(;('(J(;oPl. Let X be a real H-space. By definition, X ( is the set of all
"complex" elements
II + ;1'
with
u, rEX.
We set
(2 + ;!lHIl + il') dd (711 - Ill') + ;({1u + r)
for all . PER
and
(u + i"lx + (r) dd ;(111.\') - i(rlx) + (ul.) + (rl.r).
This definition IS motivated by a formal multiplication. A linear operator
A: D ( A) c ,,\, -+ X is ex tended to X f by
A,. (II + il') dd Au + iAr.
,
19.6a. Show that X, becomes a complex H-space which is an extension of X.
Solution: Elementary computation.
19.6b. Sho\\' that (At ). = (A.)r .
Solution: By definition.
("'IA(:) = HAc )*"'1:)
\\'ith ". = II + ;r and: = x + iy. Now set x = 0 or J = O.
Problems
183
This result implies the following: If A is self-adjoint or skew-adjoint, then
so is Ac.
19.7. Spectrum of self-adjoint operators. Let A: D(A) s;;; X .... X be a self-adjoint
operator on the real or complex H-space.
19.7a. Show that all ;_ e C with 1m A :I: 0 belong to the resolvent set of A.
Note that this statement refers to the complexification of A in case X is real.
Solution: We use the functional calculus for self-adjoint operators sum-
marized in A 2 (96). Let X be complex. We fix a i. e C with 1m A#:O and set
';(11) = ( - 1')-1 for all I' e R. By definition,
I/1(A)u = f: I/1(p)dE"u,
where ue D(.;(A))ifTfTI"'(1')12dIlE,.uIl2 < IX. Since t/J is bounded on R, we
obtain D(t/1(A)) = X and
1II/1(A )u1l 2 = r: I 1/1 ( p)1 2 d" E" ull 2
S c f: dllE"ul 2 ... Cllull 2 ,
i.e., t/1fA): X -+ X is bounded. By A 2 (96v), it follows from (i. - 1')t/1(I') = I and
It/J()I s constant for all I' e R
that (AI - A)t/1(A) = I and t/1(A)(AI - A) s; I. Hence
t/1(A) = ()J - A)-I,
i.e., i. belongs to the resolvent set of A.
19.7b. Suppose that there is ace R such that
fAulu) c(ulu)
for all u e D(A).
Show that all real ;. < c belong to the resolvent set of A.
Sol ution:
(I) We first show that EJ. = 0 for all A S c. If EJ.u = u, then
c lIull 2 s (Aulu) = f« p d II E,.u1l 2
= f x pdllE"E1ull2 = f pdllE"ull 2 S Allull 2 ,
-:(. JI-aJ.J.J
by A 2 (93). That means EJ. == 0 for i. < c. Since the spectral family {E..} is
continuous from the left, E J. = 0 for A. = c.
(II) We fix ;. < c and set "'(1') = (;. - 1')-1. Then
"'(A)u = f. "'(Il)dE"u.
(t. x (
We now use arguments similar to Problem 19.7a.
19.8. The Friedrichs extension.
184
19. The Galerkin Method ror Differential and Integral Equations
19.8a. Complex H-.pafes. Let A: D(A) s; X .... X be a linear symmetric and strongly
positive operator on the complex H-space X. i.e..
(Aulut c !IUd 2
for all II E D(A) and fixed c > o.
(133)
Sho\\' that A can be extended to a self-adjoint operator A F : D(A F ) £ ,,\,
.\' such that
(AFulu) cllull 2 for all u E D(A F ).
Solution: The proof runs analogously to Section 19.10 with the following
modifications. Let X f be the energetic space of A constructed as in Section
J 9 10 According to Section 18.11, let X; denote the antidual space to Xl
and let AI:: Xl .... X; be the corresponding antiduality map.
Replace h: X .... Xl in Section 19.10 by the map : X .... X;. i.e.. h(u) = h(u),
where the bar denotes the conjugate complex number. This yields the con-
tinuity of the embeddings
XEXX:.
Moreover, replace the duality map A E : X E .... X; by A E : X E .... X;. Then the
Friedrichs extension is given by D(A F ) = Af-I(X) and AFu = Afu on D(A F ).
19.8b. Structure 0.( the energetic space. Let A: D( A) S; X .... X be given as in Problem
19.8a where X is an H-space over k = R. C. Let A, denote the Friedrichs
extension of A and let X£ be the energetic space of A. Show that
X,. = D(A 2).
and /lull£ = II A}.2 U Il for all u E X f..
Solution: Set J = [c. XJ[. By(133)and Problem 19.7b,
Afu = L 'I dE"u.
For 0 < I, we obtain
A}u = L Jl2 dE.u.
where u e D(AJ) 1fT $J 2. d II E,.ull 2 < . let B - A2. Then D(A,) c: D(8).
Since A S; A F' we obtain
Iluul = (Aulu) = (Afulu) = L Jl d Ii E"ull 2
= II Bull 2 for all u e D(A).
( 134)
(I) We show that Xl s;; D(B). Let u E Xl' By construction of Xl in Section
19.10, there follows the existence of a sequence (u..) in D(A) with
u.. .... u in X
as n -.. 00
and (Au" - Au",lu" - u"').... 0 as n, m.... oc. By (134), (Bu,,) converges in
X. The operator B is self-adjoint and hence graph closed: therefore.
u E 0(8).
(II) We show that O(B) = Xl' By Section 19.10, D(A F ) S;; X£. According to
A 2 (96vi). the set D(A F ) is dense in D(8) equipped with the graph norm
II Bull. Thus. it follows from (I) that D(B) == X [.
Problems
185
19.9. Nonexpansit'e semigroups. Show that each nonlinear nonexpansive semi-
group {S(t)} is strongly continuous.
Solution: let t, h > o. Then
IIS(t + h)v - S(I)t'li == IIS(t)S(h)v - S(t)t'li
s IfS(h)t, - t'li ..... 0 as h..... + 0,
and
IIS(, - h)r - S(t)I'1i == IIS(, - h)v - S(I - h)S(h)vll
Slit' - S(h)t,U ..... 0 as h..... + O.
19.10. Simple t'ariants of the Sobolet' embedding theorems and the Lipschitz continuity
of nonlinear operators. Let G be a bounded region in AN, N I. Let n." p and
".n denote the norm on L,(G) and W21(G respectively, i.e.,
lIull, = (L lul' d:C)"'.
lIuli :c: ( r t ID/uI 2 dX ) 1/2 .
JG '::1
In the following all the constants are denoted bye. Since CO(G) is dense in
W 2 1 (G). in the following proofs we may restrict ourselves to functions from
Co (G) (d. Proposition 18.29).
19.IOa. Let N = 3. Show that the embedding W 2 1 (G) S; L 6 (G) is continuous.
Solution: let x - (,,,, C) and let II e Co(G). Integration of (u)( -= 4u 3 11(
yields
lu(;)14 = fr 4U3U(d S 4 fr IU3Udd.
Replacing by " and " we obtain
lu(:<)1 6 S 2 3 (f: lu 3 ud d)"2 (r lu 3 u.1 d,,)"2 (r: lu 3 utl d' )"2 .
We integrate over G and apply the Holder inequality based on + 1 = t.
Then
lIull: S crlull8"lIull,
and hence
rlult 6 S ellull
for all u e Ccf(G).
( I 35)
19.10b. Let N == 3. We set
f(u) ::It 11 3 .
Show that the operator f: W 2 1 (G) ..... L 2 (G) is locally Lipschitz continuous.
Solution: Let u, t' e Co(G) with lIull, IIvll s r. By the Holder inequality
based on J + i == I,
A r !'/(u) - I(t') II 2 = rI(u - t')(u 2 + uv + v 2 )11 2
S lIu - t'II 6 11u 2 + uv + vZlIl
S 1111 - VIl6(lIull + IIu1l 6 111:11 6 + IIvll)z.
By (I 35), A Sell u - t II.
186 19. The Galerkin Method for Differential and Integral Equations
19.IOc. Let .\' = I. Show that the embedding ""2 1 (G) s; C(G) is continuous.
Solution: Let G = ]a. h[ and let u E Co (G). Then
IIh I = L' "l,,1 d.I'.
The Holder Inequality with I-trick yields
max lu(\')1 < C (f 1II'lld:< ) 12 = ('111111.
Qr<h G
19.IOd. Let J" = I. We set
r(lI) = uu'.
Show that the operator j': W 2 1 (G) ..... L 2 fG) is locally Lipschitz continuous.
Solution: Let u. I' E Co (G) with Iud. 111'11 < r. By Problem 19.IOc.
1(11) - If' )1 1 2 = 11(11' - ,,')u + ,.'(u - 1")/1 2
< II(u' - ,.')UI12 + 'Ir'(u - 1')1'2
S maxlll(x)ll:u' - 1"11 2 + maxlufx) - r(x)lIlI"112
cllul11lu - "Ii + cl!u - I'll IIr'lI < 2crllu - I'll.
19.11. Prool 01 L(',,,,,,a /9.Jf».
Solution: We consider the square
C = (. to E R2: - R < ." < R}
"'ith R = . Let II E ('fC). and let = f.,,) and J' = f.IJ). Then
r r
11(>:1 - III.\'I = J. 1I(t.fI)d, + J, 1I1,'lcI,.
8) the inequality (u + h)2 2fu l + hl) and by the Holder inequality with
I-trick.
(u(.\t - uf.rU 2 = .,'X)2 + U(y)2 - 2u(x)u(y)
f l2
< 2 [ II H ./0 1 + u" ( . n 2 ] d I.
I 2
Integration o\'cr C x C' with repect to (x. y) yields
2 i 11 2 d.>: < 2 i III: + II;} dx + 2 (f "dx r .
This is the desired result.
For general R we use the transformation x........ 2Rx. The proof proceeds
analogously in H'''.
19.12
FUlJcti(JlJal calculu.-' alJd one-pClranJeler unitary groups. Let H: D(H) 1'( --+ ,x
be a self-adjoint operator on the complex H-space X. Show that {e- itH } is a
one-parameter unitary group with the generator - iH.
Solution: We use the functional calculus summarized in A 2 (96). Set J = J .
(I) We have
/I" = f I' dE..",
Problems
187
where u e D(H) iff f p2 d II E" ull 2 < oc. By definition,
e-j'Hu.. f e-j'''dE"u.
where u e D(e- i ,H)ifTJle- u "1 2 dIlE"uIl 2 < 00. Hence D(e-i,H) = X and
lie -j,H 1.111 2 = f le- j'''1 2 d II E"uil 2 = f d II f" 1.1 Ii 2 = II 1.111 2
for all u e X. Since e-i'''e'''' = I and le-""I = I. we obtain
t e R,
( 136)
e IrHei,H = ei,He-.,H = I
and (e- IrH ). = ei,H by A 2 (96v). Thus, the operator e- irH : X -. X is uni-
tary. From e-U"e- is ,. = e-I('+,)" it follows that
e-UHe-lsH = e-1U+.)H
for all t, s e R. by A 2 (96v).
(II) let U E D(H). Then J 1'2 d II E"un 2 < x. Formal differentiation of (136)
yields
!!..-e-i'H u = -i f l'e-ir"dE u.
dt "
For all t and peR, Ipe- i '''1 2 S p2. Hence, by the majorant criterion
A 2(97), the derivative (137) exists at each point t e R.
(III) We set t/1(p) == pe- u " for allp e R and fixed t e R. Since 1t/1(ll)1 2 = 1'2 for
all Jl e R, we have D(t/1(H)) = D(H). From
le-""I S ItJI(p)1 + I, Ipl s 1t/1(p)1 for all t. I' e R,
( 137)
and from A 2 (96v) we obtain
t/1(H) = He-I,H = e- IrH H.
Hence
!!..-e-i,H u = - iHe -UH u
dt
(IV) Let C be the generator of {e- I ' H }. Differentiation of
(e-.,Hule-i,H t ,) = (ull")
for all u e D(H), t e R.
( 138)
at , = 0 yields
(Cult') + (uICt') = 0
for all u, v e D(C).
i.e., C is skew-symmetric. By (138), - iH s C. The operator - iH is
skew-adjoint. Hence C = - iH, by Problem 19.5c.
19.13. Functional calculus and semigroups. Let A: D(A) S X ... X be a linear self-
adjoint operator on the H-space X over K = R, C with
( A u I u) c II U 11 2
for all u E D(A)
and fixed real c. Show that {e-'A} forms a linear strongly continuous semi-
group which is nonexpansive in the case where c o.
188
19 The Galerkin Method for Differential and Integral Equations
Solution: Let J = [c. x;[ and t > O. By Problem 19.7b, we have
Au = L p dE"u.
\\ here II e D( A) iff JJ Il2 d If E" u IIZ < x. By definition,
e 'Au= f e-'''dE u
" ,
J
( 139)
\\'here II E Dle-'A)iffJ J e- z '''dIIE,.uIl 2 < x. From
e 2,,, s; e 2 ,1e!
for all Jl E J
it follows that D(e -'A) = X and
e '04 11 11 2 = L e - 2'''d II E"u 11 2 S e 2 "<'11 u 11 2 .
Let hIli) = IU' 'I' for all JJ E J. By definition.
hlAlu = L hlll)dE"u.
According to A 2 (96v). it follo\\'s from
le-'''I < Ille-'''1 + constant
for all Jl E J
that
I. ( A) = A e ' A
and
h(A) ;2 e .,A A.
We set
,,(1) = e 'A".
for fixed ,,' E D(A) and all I O.
Formal differentiation of (139) yields
It'll) = - L hI pi dE" w.
Since
h( JI)2 e letJ 112
for all Jl E J. , E [0. T].
the majorant criterion A 2 (97) shows that u'(1) exists on R+ for all WE D(A).
Hence
1I'(t) = - Ae lAW.
( 140)
From e-'''e- s " = e u.,)" and A 2 (96v). we obtain that
-,A -JA - C t+J ) '"
e e = e
for all t, S O.
Let B be the generator of {e-""}. The operator e-'A is self-adjoint, i.e..
(e .'Aul t ') = (ule-tAt') for all II, t: E X.
Differentiation at , = 0 yields
(Bult:) = (uIBv)
for all u. f' E D( B).
By (140), A c B. Thus, the operator 8 is a symmetric extension of the self-
adjoint operator A. Hence A = B, by Problem 19.5c.
Problems
189
19.14. Limits of continuous functions and equicontinuity. The following results will
be used frequently. Let (f,,) be a sequence of functions
f,,: M .-. t:
where M is a metric space and Y is a B-space. We ask the following questions:
(i) When is the limit function of (j;.) continuous?
(ii) When is the convergence of (f,,) uniform?
(iii) Suppose we know that the limit
(I) lim J;.(x)
"..cx
exists for all x in a dense subset of M. When does the limit (I) exist for all
xe M?
The answers are based on the notion of equicontinuit y.
t 9.14a. Definition. The sequence (};.) is equicontinuous at the point x iff for each I: > 0,
there exists a 6(£) > 0 such that
d(x, }') < b(£) implies 1If,,(x) - f,,(y)1I < I: for all n.
The point is that b does not depend on n. We call (f,,) equicontinuous on
M iff (f,,) is equicontinuous at each point x e M.
19. t 4b. Equicontinuit y and continuous limit functions. Suppose that the limit
f(x) = lim f,,(x)
,,-
exists for all x e M and that (f,,) is equicontinuous at Xo. Then the limit
function f is continuous at Xo.
19.14c. Dense convergence. Suppose that the limit
f(x) =r lim );.(x)
,,-cr
exists for all x in a dense subset of M and that (};.) is equicontinuous on
M. Then the limit exists for all x e M and the limit function f: M .-. Y is
continuous.
19.14d. Uniform convergence. Suppose that all the functions fIt: M -. Yare contin-
uous and that we have the uniform convergence
f,. '4 f on M
as n -.. x,
i.e., for each £ > 0, there exists an n o (£) such that
tI);.(x) - f(x)1I < I:
for all n no and all x eM.
The point is that no does not depend on x.
Then the limit function f: M -. Y is continuous.
19.14e. First criterion for uniform convergence. Suppose that all the functions f,,:
M .... Yare continuous on the compact metric space M and that (f,,) is equi-
continuous on M. Then the simple convergence
f,,(x) .... f(x)
as n.... <X)
for all x e M
190
19 The Galerkln Method for Differential and Integral Equations
Implies the uniform convergence
f" ::: f on AI
as n x.
and the limit function (: J\I -. Y is continuous.
19 14f. Seco"d criterion lur ulliforn. conrergence (theorem of Dini). Suppose that all
the functions I,,: 1\1 ..... Rare ,nollotolle increasing (resp. monotone decreasing)
on the ('0'''1'(1('( metric space jU. Then the simple convergence
f ,,( x) (ex)
as " x..
for all x E ,\1
Imphe<; the uniform convergence
, =: r on ,\1
as ,,-+ 'Y...
In the case where the limit function {: M --t R is continuous.
Hint: Use the corresponding definitions and the fact that Cauchy
sequences in B-spaces are convergent. cr. Dieudonne (1968, M), Vol. I.
Chapter 7.
19.15. (.on"n"tCltil'ity o{ iterated /in,its Oil metric spaces. We are interested in the
relation
lam lim I( x. y) = lim lim .((:<.}') = lim If., y) ( 141 )
, .Q , ." ) --4" t .d cx.).) .Co,".
for the mapping
I : ..\' x )' -. z.
where .X. r. and Z are metric spaces fe.g" X and Y is a neighborhood of Cl
and h in a B-space, respectively, and Z is a B-space). In this connection. the
notion of uniforlt' convergence is crucial.
19.15a. l! "ifor", co,u'eryellce. By definition, the limit
F(x) = limf(x.y)
)' -"
IS uniform with respect to all x E M iff for each r. > 0 there is a <5(r.) > 0 such
that
cI(y.h) < ()(r.) implies cl(/(x.y),F(x)) < t
The point is that i) does not depend on x.
19.15b. ,. irst (riteriolJ. Suppose that:
(i) The inner limits lim, -bff x. y) and lim .1-41 f(:<. y) exist.
(it) The inner limit lim,_b.f(, y) is uniform with respect to all x in a neighbor-
hood Uta) of a.
(iil) Z is a complete metric space (e.g.. a B-space).
Then relation ( 141 ) holds where all the appearing limits exist.
for all .\ EM.
19.15c. SecolJd criterion. Suppose that:
(i) The inner limits lim, .,.f(x,}') and lim,x -.of(x. y) exist.
Oi) The limit lim cx .,. -. CtI.b) f( x. )') exists.
Then relation (141 ) holds where all the appearing limits exist.
19.15d. 7'l!ird criterion. Suppose that:
(i) The inner limit lim). -b f(:<. .r) exists.
(ii) The limit timex. ". -ed,bt.(('}') exists.
References to the Literature
191
Then
lim lim f(x, y) = lim f(x, )').
-. ,-b (.,)-(..bt
( 142)
Hint: Use the corresponding definitions and the fact that, in a complete
metric space, each Cauchy sequence is convergent. cr. Fichtenholz (1972. M),
Sections 168 and 506.
19.16. The Schmidt orthogonalization method. Let X be a separable H-space, X :#:
to}. Show that there exists an at most countable complete orthonormal
system in X.
Solution: By assumption. there exists an at most countable set {t'",} which
is dense in X. We may assume that VI :#; O. Set
u I = v. / II t'l II.
Suppose we have u.' ..., u.. constructed such that {u t, . . . , u,,} forms an
orthonormal system. Then let
"
u,,+. = (',,+1 - L (u,lt',,+1 )u,
'=1
and
u..+ 1 = u..+./Uii,,+tll
in the case where u". 1 :#: O. Thus, (u.lu,,+I) = 0 for all k = I,..., n. If U"+I = 0,
then we use V,,+2' etc. This way we obtain an orthonormal system {u...}.
By induction, it follows that all the v... are finite linear combinations of the
UtI' Hence the linear hull of {u",} is dense in X. By Theorem 19.A. {u",} is
complete.
References to the Literature
Introduction: Riesz and Nagy (1956, M).
Semigroups and partial difTerential equations: Pazy (1983, M) (standard work),
Friedman (1969, M), Reed and Simon (1971, M), Vols. 1,2, Showalter (1977, M), Henry
(1981, L), V. Wahl (1985, M), Benilan, Crandall, and Pazy (1989, M).
Functional calculus: Riesz and Nagy (1956. M), Reed and Simon (1971, M). Vol. I,
Triebel (1972. M).
Nonlinear hyperbolic differential equations: Christodoulou and Klainerman
(t 990, M).
Linear partial difTerential equations and their applications in mathematical physics:
Dautray and Lions (1984. M), Vols. 1-3.
Nonlinear partial differential equations and their applicaions in mathematical
ph ysics: Chern (1984. S).
CHAPTER 20
Difference Methods and Stability
Consistency and stability imply convergence.
The golden rule of numerical analysis.
Difference methods are a proper tool in order to compute approximately
the solutions of ordinary and partial differential equations and to prove the
existence of generalized solutions.
Richard Courant. Kurt Friedrichs. and Hans Lewy (1928)
Our main concern here is with stability of difference methods rather than with
accuracy.
John von Neumann and Robert Richtmyer (1947)
Well-posedness of the original differential equation problem and consistency
imply the equivalence between stability and convergence of difference methods.
Peter Lax (1953)
Do not use unstable difference methods on computers! The rapidity of con-
vergence depends on the order of consistency. i.e., on the accuracy of the
approximation of the difTerential quotients by difTerence quotients.
F olclore
The advances that are now being made in various areas of computer science are
forcing us to re-evaluate the way in which we do numerical computations.
The realization that a different view is necessary has already created a new
catch phrase. scientific computation, which is becoming recognized as a separate
discipline transcending the old computational mathematics.
Mathematicians can playa significant role in this emerging discipline. if they
are able to provide new paradigms for computation. but in order to do so much
of what has been done up to now will have to be reconsidered. There are some
signs that such a fundamental rethink is beginning. The work of Traub (1980)
and Smale (1985) introduces ideas from information theory and computational
complexity in an effort to understand more clearly what we can expect from
numerical algorithms. . . . The full realization of the potential of computational
mathematics requires the development of new and sophisticated analytical tools.
Peter Linz ( 1988)
192
20. Difference Methods and Stability
193
A universally applicable method for the approximate treatment of differential
equations are the difference methods. Here the basic idea consists of replacing
differential quotients by difference quotients. The aim of this chapter is to
explain the fundamental connection between stability, consistency, and
con,)ergence.
A principal difficulty in difference methods consists of the fact that one has
various possibilities for approximating differential quotients by difference
quotients, and this reduces to finding especially favorable numerical approxi-
mations. For the engineer and practitioner it is, in this connection, not so much
the rapidity of convergence of the method that stands in the foreground but
rather its stability, since unstable methods collapse or yield only very inexact
results on computers. By stability, we understand roughly the situation that
in the solution of the difference equation. errors can not grow indefinitely but
rather remain uniformly bounded, independently of the width of the grid. We
elucidate this using the simple first-order differential equation
y' = Ay
(I)
with the solution
y(x) = y(O)e AJr .
First, we consider the stability of the differential equation. If y and z are two
solutions of (1) then, for the difference
(x) = y(x) - z(x),
there holds
I £\(x)1 = eAJrI £\(0)1.
(2)
So that, for x > 0, the initial error £\(0) does not become larger, the stability
condition
) sO
must hold.
In order to formulate the difference method for (I) we set
Y" = y(ph) for p = 0, 1, . II and fixed h > 0
and denote an approximate value for y" by v". Then for (1) there first results
the following two possibilities:
h- 1 (v p + 1 - v,,) = AV", p = 0, I, ...,
h- 1 (v,,+1 - v,,) = A. V" + 1 , P = 0, I,... .
(i) Explicit method. From (3) we obtain
V,,+1 = (1 + )..h)v", p = 0, I, ... .
If w" is another solution of this equation and we set
(3)
(4)
, a: v, - w",
194
20. Difference Methods and Stability
then 4,+ 1 = (1 + ;..1,)4, holds and thus we get the key relation
1&'+11 = 2,+11&01. p = O. I. ....
(5)
with
2 = 11 + lh ,.
In order that the initial error Ao does not increase in the course of the
computation. we must have the stabilit}, condition
2 < I.
For fixed i.. < O. this is a conditiolJ for the grid step h.
(ii) I"'plicir nlethod. From (4) we obtain
1'+1 =(1 - i..h)-(,+l) t oo.
This implies (5) with
2 = 11 - i.1I 1- I .
In contrast to (3) there now arises. for ;. < O. no restriction on the grid step It.
i.e.. we obtain 2 < I for all ;.. < 0 and h > o. Therefore. we shall prefer (4) to
(3). Method (3) is called an explicit method in contrast to the more favorable
inrplic;r method (4).
(iii) Mea" t O a/lie Inetllod. A still more favorable method results by using the
mean value of (3) and (4), i.e.,
11- I (t.,+1 - v,) = i.(t"+1 + t.,)/2. (6)
Now (',+1 = (I + i.lr/2H 1 - ;..11/2)-1 1 " holds. Therefore. we obtain (5) with
2 = II + i.h/2111 - lr/21-1.
For ;. < O. Ir > O. this value of is 2 smaller than in (3) and (4). Consequently,
it follows from (5) that an initial error in (6) dies olT more rapidly than in (3)
or (4). For parabolic differential equations, (6) leads to the favorable Crank-
Nicolson method in Section 20.4.
If the stability condition 2 < t is violated in (5), i.e. 2 > I, then a round-ofT
error for the initial condition will increase morc and more at each step, and
will completely falsify the result after a sufficiently large number of steps p.
Therefore, slabil;t y plays a dec;sit'e role in the numerical realization of differ-
ence methods on computers. Whereas with an explicit method, as in (3), one
obtains the solution of a difference equation directly; with an implicit method.
as in (4), one has to solve an equation or a system of equations. However. in
contrast to this greater demand on computer time with an implicit method.
there is the decisive advantage that in general the stable grid step domain for
h is greater in the case of an implicit method than with an explicit method.
Therefore. as a rule, one will prefer implicit methods to explicit methods. In
particular, one is interested in difference methods for which all grid steps are
stable. In Example 20.5 we explain this situation on the basis of a numerical
example.
20.1. Consistency. Stability. and Converaence
195
20.1. Consistency, Stability, and Convergence
It is very remarkable that the proofs of convergence for difference methods
can all be obtained from one unified principle which reads roughly as follows:
(C) Consistency and stabilit}' imply conl'ergence.
In the following we explain this principle on the basis of an abstract model.
and apply this model to concrete problems in Example 20. t and the following
sections. In Sections 20.6 and 20.7 we consider a certain converse of (C).
This principle reads as follows:
With consistent methods. for ,,'ell-posed problems. stability and convergence are
equit'alent.
In this connection, consistency always means that the approximate equation
converges in an appropriate sense to the original equation. A problem is said
to be ,,'ell-posed if for fixed data it possesses exactly one solution and the
solutions depend continuously on the data in a certain way. We now study
an important general abstract model.
We consider the equation
\' e y:
. ,
and, for all h e ]0'''0]' the approximate equation
B"."" = fit. }'" e Y lt .
together with the following approximation scheme:
8
B)' = f.
(7)
(8)
B,.
Ylt · Zit
Our aim is to investigate the convergence of this approximation method. We
assume:
Y
P.j
. z
I Q.
(9)
(H t) Approxi,nat;on scheme. let '10 > 0 be fixed. For all h e ]0. h o ]. the spaces
Y, Z, Y,., and Zit are given normed spaces over k = R, C. and the
operators B. Bit. Pit. and Q" are given according to (9).
(H2) Existence and uniqueness. For given fixed fEZ, the original equation (7)
has exactly one solution y.
We are given a sequence (f,,), 0 < h < hOft such that the approximate
equation (8) has a unique solution y" for each f,. e Zit.
(H3) Consistency. There are positive constants 2, p, and r. independent of h.
such that
IIQItBy - B"p"YII S 'Jh r .
IIQltf - fltll fJh r
for all 'I e ]0. ho].
196 20. Difference Methods and Stability
(H4) Stability. There are positive constants )' and s. independent of h. such that
Ill' - "'11 r.. ,'II B,. v - B,. w II Z II ( to)
for all t", ". E Ylt and all h E ]0. ho].
Let s = I. If B,. is a linear operator, then (10) is equivalent to
'I I' 11t-" S; ," B" L' II z" for all l' E f" and all h E ]0. ho].
In turn, this is equivalent to the existence of the inverse operators 8,,-1
on ZIt with the property
II B,,-III i' for all h E ]0, ho].
Theorem 20.A (Convergence). From (H I) through (H4) "'e obtain the error
estimate
1I.r" - PItYIf" < (2 + P))'h' for all h E ]0, ho]. (11)
This implies the cont'ergence of the approximation method. i.e..
II y,. - PIt.."II)'" -. 0
as h -. + o.
From this it follows, in particular, that, for fixed s in (H4). the rapidity of
convergence with respect to h is determined by the consistency condition.
For all difference equations, the consistency condition results in an easy
way from the Taylor theorem (see Section 20.2).
The verification of the stability condition requires a more precise analysis
of the difference equations and can take on a knotty form. The stability
condition (10) means that small changes on the right-hand side in B"y" = fIt
also lead to small changes in the solution y", i.e., small errors relative to fit in
(8) can not lead to a catastrophic falsification of the solution.
PROOF. We set fIt = BItP It }, - Blty". It follows from (7) and (8) that
fit = B"P"y - Q"By + Q"By - B"y" = B"P"y - Q"By + Q"f - fIt.
The triangle inequality and the consistency condition give
IIt,,1I < "B"P"y - Q"Byli + IIQ"f - fit II < (2 + P)h F .
The stability condition yields
II y" - PItY II S )'" B">,,, - B" PItY II ;,( + fl)h'.
o
STANDARD EXAMPLE 20.1. Let ). < O. Parallel to (1) and (4). we consider the
differential equation
L dcf, ..
)' = y - ).y = g
y(O) = c
on [0.1],
( 12)
20. t. Consistency, Stability. and Convergence
197
along with the difference equation
L dcf h -1 ( ) -
,l' = V,+I - V, - J.V,+I = 9,+1 t
p = 0, 1, . . ., N - I,
( 13)
Vo = c,
where 9 eel [0, I] and c e R are given. Let y be the unique solution of (12).
Then y e C 2 [0, I].
We partition the interval [0, I] by means of the partition points 0 = xo <
XI < · .. < X N = I with x, = ph, and we set
y, = y(ph), 9, = 9(ph).
Let lIyll be the norm of}' in C 2 [O, I], and let 11911 be the norm of 9 in C 1 [0, I].
(i) Consistency. Our point of departure is the formula
IL,y - (Ly)(ph)1 s (1).1 + !)h lIyll,
p = O. I, . . ., N - I.
(14)
This formula results, by the Taylor theorem, from
h- 1 (y'+1 - y,) - y'(ph) = h}'''()/2,
).(y,+1 - y(ph) = lhy'(PJ),
where and" are suitable points. Moreover, we use the formula
19, + t - 9, ISh II 9 II , p = 0, I, . .. , N - 1,
which follows from 9,+1 - 9, = h9'().
(ii) The key equation (modified difference equation). We set
( 15)
e, = y, - v,.
This quantity is crucial for our convergence proof. We want to obtain a
difference equation for e,. From the original differential equation (12) and the
consistency condition (14) it follows that
L,y = 9, + k"
Yo = c,
p = 0, I, . .. , N - 1
(16)
with
Ik,I S (Ill + !)hIlYIi.
After subtracting (16) from the difference equation (13) we obtain the key
equation
L,e = m"
with m, = g, - 9,+1 + k,.
(Hi) Stability. We can easily solve (17) explicitly. Noting l < 0, we obtain
e,+1 = (I + 1)..lh)-I(e, + hm,), p = 0, ..., N - I.
eo = 0,
p = 0, I, . . . , N - I
( 17)
198
20 Difference Methods and Stability
Therefore, by induction. we get the crucial stability condition
IeI'I s ph(1 + 1).lh)-1 max I,npi
OpSN-1
( 18)
< max Inlpl,
Ops,Y-1
p = 0, . . . , N.
(iv) Co,u'('ren('(). Finally. (18) implies
max Iy(ph) - 1',,1 < h(lIgli + (1;.1 + })IIYII).
Op"
( 19)
Letting II -.... 0, we obtain the convergence of the difference method (13) for the
original differential equation (12) in the case where A. < o.
All convergence proofs for difference methods for linear ordinary and
partial differential equations follow the method of Standard Example 20.1
above. The has;c idea is the following:
(a) consider the difference
e = r - I'
" . p "
at the grid point p.. where }' denotes a solution of the original differential
equation and I' denotes a solution of the corresponding difference equation:
(b) construct a ,nodif;ed difference equation for e p as in (J 7); and
(c) obtain IIni/or1n estimates for f! p as in (18) (stability).
In conclusion.. we will yet show how the proof of convergence in Standard
Example 20.1 corresponds to the situation of our abstract model in Theorem
20.A. To this end. we write the original differential equation (12) in the form
By = .f.
Y E }'.
fe Z,
with
By = (Ly, y(O)). f = (g. c)
and Y = e 2 [O, I]. Z = ('1[0, I] x R.
Furthermore. let .",. = (l'0.1'"... .t.....). We write the difference equation (13)
in the form
Bit}'" = .f". -"" E Y". f" E Zit'
with .f = (II"..., g.... c) and Y" = ZIt = R N + 1 . We choose the maximum norm
1I.1I:t on Y" and ZIt.
Standard Example 20.1 corresponds precisely to Theorem 20.A with the
operators
Pit)' = (.\'(0). )'(h). . . . .. y( Nil)).
Q"f = .f".
The consistency condition (H3) above corresponds to (14) and (J 5) and the
stability condition (H4) corresponds to (18). The convergence statement (11)
of Theorem 20.A coincides with (19).
20.2. Approximation of Differential Quotients
199
20.2. Approximation of Differential Quotients
To verify the consistency condition one uses the Taylor formula in order to
approximate differential quotients by means of difference quotients. In this
connection, we also obtain in a simple way the order of approximation with
respect to h, which is decisive for the rapidity of convergence of the difference
method.
We define the difference operator
v f - f(x. + h) - f(:()
It. (x) - h ·
and the discrete Laplace operator
It = V -It V,.,
h 0,
h > o.
A short calculation shows that
r - f(x + h) - 2f(x) + f(x -_ h)
Ie. (x) - h 2 ·
The following formulas are typical and important examples for approximating
differential quotients by difference quotients via the Taylor formula:
Vltf(x) = JO,(x} + hf"()/2,
lef(x) = !"(x) + hf(3.()/3,
ltf(x) = !"(x) + h 2 f(4)()/12,
e [x, x + h],
e [x - h, x + h],
e [x - h, x + h].
(20)
(21 )
(22)
Observe that formula (22) corresponds to a higher accuracy of approximation
with respect to h than (21).
In particular, it follows from the Taylor formula
f(x + h) = f(..) + hf'(x) + h 2 f"(x)/2 + h J f(J)( oJ; }/6.
\\'ith :t e [x - h, x + h], that the difference quotient
l (V V ) _ f(x. + h) - f(x - h)
1 II + -II - 2h --
is a better approximation of the derivative f'(x) than Vie if f is C 3 . In fact, we
obtain that
h > O.
!(V II + V _Ie) = f'(x) + h 2 f(J)()/6, e [x - h, x + h],
if f e C 3 [a, b] and x, x + h e [a, b]. This approximation is of order h 2 whereas
the approximation (20) is only of order h. In this connection, note that
!(f(J)(+) + f(3)(_)) = f(3)()
for a suitable E [. - h, x + h]. This follows from the intermediate value
theorem for continuous functions.
200
20. Difference Methods and Stability
Table 20.1
Formula
Assumption on r
Assumption on x. II (h 0)
(20)
Ie C 2 [a.h]
.. x + II e [a. b]
(21) (22)
Ie CJ[a.b] Ie C.[a.h]
x. . + h e [a. b]
Generally. one has the following principle:
1.lre Sl1lootlJer the liu,ct;ons are, the better ;s the approximation of differelttial
c/uot;eIJts h.r .Il;tahle difference quotients.
By Theorem 20.A. the rapidity of the convergence of difference methods
depends on the degree of consistency, i.e... on the accuracy of the approxi-
mation of differential quotients by difTerel1ce quotients. Thus, we obtain:
Tire SIPlootlrer ,he so/utiolJS of dfferelJt;a/ equations are, the higher;s ,he rapidity
(r cOIJrergence o.f suitable difference methods.
This is an important general principle of numerical mathematics which is
also valid for other approximation methods, for example, for the Ritz and
Galerkin methods (see Chapters 21 and 22).
Proposition 20.2. Let ell. b] be a compact ;nterr,al. Then the approximation
.Iorlnu/as (20) through (22) are ['laUd in the case K'here the assumptions of Tahle
20.1 are .(ulfilled.
PR()()F. For instance. in order to obtain (22), we proceed from the Taylor
formula with a remainder of fourth order
.r(.'( + Ir) = .f'(x) + '!f'(x) + h 2 !"(x)/2 + h J f(Jt(:<)/6 + h4!(4.(! )/24.
It follows from this, by addition, that
6" ((x) = !"(x) + 1r4(!(.)(+) + f(.)(_»)/24, :! E [x,x + h].
The intermediate value theorem for continuous functions yields
I c 4 t ( +) + J' ( 4 ) ( ) = r ( 4 ) ( )
with a suitable E [x - II, x + II].
o
20.3. Application to Boundary Value Problems for
Ordinary Differential Equations
We consider the boundary value problem
- y" + QY = f on ]0, 1[.
y(O) = y( I) = O.
(23)
20.3. Application to Boundary Value Problems ror Ordinary Differential Equations 20 I
Let a O. In order to formulate the difference method, we partition the
interval [0, 1] by means of the partition points 0 < h < 2h < ... < Nh = 1.
We set
J'" = y(ph), I" = I(ph)
and denote an approximate value of y" by v". Motivated by (21), the difference
equation for (23) reads as follows:
-h- 2 (v'+1 - 2v" + V,-I) + av, = f"
p = 1,..., N - I,
(24)
to = VN = 0,
i.e., we replace y" by "y. In matrix notation, equation (24) means
2 + ah 2 -1 0 VI fl
-1 2 + ah 2 -1 1'2 f2
1
-
11 2
-1 2 + ah 2 -1
0 -1 2 + ah 2 VN-I fN-I
F or this we write
B,,(a)v,. = fit.
(25)
(25.)
The coefficient matrix B,.(a) of (25) is a symmetric tridiagonal matrix.
For arbitrary starting element y1°), the total step method for an iterative
solution of (25), according to Section t .8, reads as follows:
}'"+ It = A"y") + b..
n = 0, t, . . .
(26)
with
020
202
I
2 = 2 + ah2 '
h 2
b = 2 + ah 2 fit.
A" =
!X 0
020
Proposition 20.3.11 feel [0, 1], then the boundary value problem (23) possesses
exactly one solution y E C 3 [0, 1]. For each N = 2,3,..., the difference equation
(24) has exactly one solution v and
h
max Iy(ph) - v,,1 S 24 max ly(3)(x)l.
IS"SN-I OsxSI
(27)
r Ie C 2 [0, t], then ye C 4 [O, 1] and
h 2
max Iy(ph) - v,,1 S - max ly(4)(x)l.
1 S" S N -I 96 ° s x S 1
(28)
202
20. Difference Methods and Stability
This shows that the difference method converges as h -+ 0, and that the
rapidity of the convergence depends on the smoothness of the solution y of the
original boundary value problem (23).
Remark 20.4 (I terative Solution of the Difference Equation). By Problem 20.1.
the eigenvalues i.j.It and the eigenvectors ",'J.It of the matrix B,,(a) are given by
. '2 2 sin 2 { jnh 2)
t. J " = a + } n - . 1 _ ., 2 '
(In'I-)
Y J ." = (sinjnh. sin 2jnh.. . .. sin(N - 1 )jnh).
As h --+ O. they tend to the eigenvalues i.j = a + j2 7[2 and the eigenfunctions
y} = sin ix of the corresponding boundary eigenvalue problem
-y"+ay=i.y on]O.I[. .\'(0)=.\'(1)=0.
j= I.....N-I.
(29)
8)' Problem 20.1. we ohtain. for the eigenvalues of the matrix A".
2 ol.Jnl,
I' J -,. - .2 ;- a h J · J - I..... N .. I.
i.e.. the spe('lral rat/ius of A" is smaller than one. Consequently, the total step
method (26) ('O'H'erJes to the solution of the difference equation (24), by
Proposition I.! 5. However. note that by Table 1.3. the optimal relaxation
method con\erges more rapidly.
PRCX)f- ()f PROP()Slll()N 20.3. We assume that y E C 4 [O, I].
(I) C"onsistcncy. Let ('" = .r( ph) - l'p' It follows immediately from (22). (23).
and (24) that
- II f ('". 1 -- 2e p + e" _I ) + tie p = r""
p = 1. . . ." !'J - 1. ()O)
with
11 2
I r pi < - max ly.4)(x)l.
12oJC1
f II) Stability. Using the coefficient matrix 8,,(a) of (25), equation (30) means
8,.(a)e = t.
I.e." e = 8,,(tI)-1 t. Let B,,(a)-' = (cpq(a)). Then
\. - .
e p = L C I'q(a)r q .
q:.l
p= I,....N-I.
(31 )
B) Problem 20.1.
o cpq(a) < cpq(O)
for all CI 0 and all p. q. This implies the crucial stahility relat;on
max 'e pi P max 1 t,,1 (32)
'$.P$."i 1 1 C;pS."'-1
20.4. Application to Parabolic Differential Equations
203
with
.'''-1
fJ = max L C pq(O).
1 SPSN-I 4=1
{III) Convergence. It follows from (32) that
fJI12
max It,l _ 12 max l y ( 4 )(x)l.
I SPS.\'-I Os.xs I
This is assertion (28) in the case where we will yet show that fJ < i.
To this end we choose f(x) = I. Then the function
)' = 1x( I - x)
is a solution of the differential equation (23) with a = 0 and at the same
time of the difference equation (24), since by (21) the discretization error
depends only on the third derivative y(3) which vanishes here. Thus,
equation (30) holds with II' = I and
e I' = y(ph) = !ph( 1 - plJ.
therefore (31) also holds with a = O. This implies
(J = max !pll(1 - hp) max 1x(1 - x) = A.
IspSN-1 OsxSI
Analogously, one obtains assertion (27) from (31).
o
20.4. Application to Parabolic Differential Equations
In this section, by means of special difference methods for parabolic differen-
tial equations. we want to demonstrate the following two facts which are
typical for general difference methods:
(i) For a given differential equation one can arrive at different difference
methods by different approximations of the derivatives.
(ii) These different difference methods can exhibit completely different con-
vergence behavior for the same given problem.
We consider the initial- boundary value problem
II, - U xx = g on Q.
(33a)
u(.. 0) = cp(x) for 0 ." I.
ufO, t) = u( 1. t) = 0 for 0 I T.
where Q = {(x,t): 0 < x < 1.0 < t < T}.ln order to formulate the difference
method we choose the lattice points
(33b)
.'<1' = px, t. = q1
with p = 0.. . ., Nand q = 0...., M, so that N.'( = 1 and M t = T (Fig. 20.1).
204
20. Difference Methods and Stability
.
t
t q I
J---;
, I '-4
: =t L --_
T - ---. --. --1
+- t -<>- ·
'1/ +- - A - - -- - ... _ Q
I _-
--1"- . - .. - - -,. -. --
L-i- ---.
o r" I
(a) explicit n1ethod
("
(b) impl icit method
-.- 1- r--
I I
I
.- -
i I
1" t.----(>-
b ' 1- -+- -
, I
---1-l-
I
-j----
( I'
(c) Crank-Nicolson method
Figure 20.1
We set
u,.. = u(.p' I.)
and denote an approximate value for up.. by [',..' Furthermore, let
fl p . q = g(Xp,lq)"
(A) Explicit Illelhod
CP, = cp(x ,).
Vt.\rt",.q - 6 Ax l',.q = Y,.q'
(34a)
1',.0 = f/J"
r o . q = ('N.q = O.
(34b)
In (34a) let p = I.. . . ., N - I, q = 0, . . ., M - I, and in (34b) let p = 0, . .., Nit
q = 0, .. .. M. Equations (34a) (resp. (34b) result from (33a) (resp. (33b». By
Section 20.2, the difference equation (34a) means
l',. q + I - l',.. (' p + I. q - 2 (' ,., + t p - I. .
t11 - --- (t1X)2 = gP....
therefore. with p = 1/(X)2,
"".Q+l = (1 - 2p)t',.q + p(v,_l.q + 1',+1..) + g,..t. (35)
20.4. Application to Parabolic Differential Equations
205
In Figure 20.1 (a) we have indicated symbolically by small circles which lattice
points are involved in this equation, and by (34b) the values in the marked
boundary lattice points are known. Let
1 - 2p O.
Using induction on q = 0, I, ..., we obtain from (35) the decisive stability
estimate
Iv,.,1 S qt max Ig,.,1 + max IfP,1
,., ,
S Tmax Ig,.,1 + max IfP,1
,. . "
for all p, q.
(36)
(8) Implicit method
We replace the difference equation (34a) by
V -41 V ,., - 4 6X v,., = g,.,.
(31)
Here p = I, ..., N - I, q = 1, ..., M. The initial and boundary conditions
coincide with (34b). In contrast to the explicit method (34a), the forward
difference quotient V 41 is replaced by the backward difference quotient V -6"
Analogous to (35), from (37) we obtain
(I + 2p)v,., = V,.,_I + P(V,-I.f + v'+1.') + g,.,41. (38)
In Figure 20.1 (b) we have indicated symbolically by small circles which lattice
points are involved in this equation. Note that we have completely different
situations in Figure 20.1 (a) and (b).
We set = max,lt,.,I. It follows from (38) that
(I + 2p)V., S _I + 2pV., + 41 max Ig,.,I,
"..,
therefore, S _I + 41 max,.,lg,.,I. This yields the decisive stability estimate
Iv,.,1 S max IfP,1 + q4t max Ig".,1
" ,.,
S max IfP,1 + T max Ig".,1
, ".,
for all p. q.
(39)
The following observation is important. Equation (38) represents a linear
system of equations with as many equations as there are unknowns. By (39),
if all fP" and g"., are equal to zero, then so are all v,." i.e., the homogeneous
system (38) has only the trivial solution. Consequently, the inhomogeneous
system (38) has a unique solution.
This general argument underlines the importance of stability estimates for
the solvability of difference equations.
(C) Crank-Nicolson method
We now replace the difference equation (34a) by
V 41V,., - 2- 1 (4 6x V,.,+ I + 4 6x V".,) = 2- 1 (g"..,+1 + g,.,) (40)
206
20. Difference Methods and Stabihty
for" = I. . . . . tv - I. q = O. . . . . AI - I. The initial and boundary conditions
coincide with (34b). In the introduction to this chapter we discussed the good
behavior of method (6). In fact. (40) corresponds to (6). Thus we expect that
(40) has good behavior too.
Equation (40) means
-1'1'1'_1 q.l + (2 + 2p)"".q+1 -1'1'".l,q+.
= P"1'-l.q + (2 - 2I J )r".q + pr p "1.q + t(YP.q + YP.q+I)' (41)
In Figure 20.1 (c) we have indicated by small circles which lattice points are
involved in this equation.
We occupy ourselves. in Problem 20.2. with the unique solvability and the
iterative solution of f4 J ).
EXAtPLF 20.S. In order to demonstrate the different behavior of the explicit
method and the Crank Nicolson method we choose
{ 2X
q>(xl = 2( I - xl
for 0 < x < 1.
for ! < x < I.
By means of Fourier series "'e obtain. in this special case. the explicit solution
of the original problem (33) with g = O. namely.
8 I sin(nn '2) .
IIf.\:. I) = 2 L -2 - sin nnx exp( - n 2 7[2 t).
7[ 11=. "
Table 20.2 sho\\'s that for I' = I the explicit method fails. The reason lies in
the fact that in f 36) "'e have proved stahilit.r only for p !. whereas the
implicit method and the Crank Nicolson method are stable for all p > O. The
restrictive condition I' < in the explicit method has the decisive disadvant-
age that one can choose only very small time steps dr. Thus. this simple
Table 20.2
Explicit
Method x=10 I . Al = 1 0 .\
P = AI (t\,Xt 2 I' = 1 0 - I
Approximate 1'(0.5. ,) Relative
solution error
I = 0.000 1.0000 0
, = 0.005 0.8597 2.3,10- 2
I = 0.0 I 0.7867 1.6. 10- .!
I 0.02 0.6R91 1.2'10- 2
I = 0 10 0.3056 1 2. 10
Comment 100 times steps to
reach, = 0.10
Explicit
L1x = 10 I, t = 10- 2
I' = 1
1'(0.5, t)
1. ססoo
0.6
1.0
not usable
Crank Nicolson
Ax = 10-', I = 10 :!
p=1
1'(0.5.1) Relative
error
1 . ססoo 0
0.7691 -7'10- 3
0.6921 1.6 10- l
0.3069 1.6. 10 1
10 times steps to
reach t = 0.10
20.4. Application to Parabolic Differential Equations
207
Table 20.3
Method
Condition on
p = At/(Ax)2
Stability
Error estimate
(',., = u(.,.t,) - t',.,
Convergence for
A. -+ O. At ..... 0
lIeli = max ([ e..Ax)lf2.
. ,
Explicit
O<p < l
Implicit
p>O
Crank - Nicolson
p>O
Stable Stable
IItU s nett x s K((Ax)2 + At)
Stable
Convergence for
fixed p with
O<pS
Convergence for
fixed p with
p>O
;--
v' Ax lIell s lIell
s L((Ax)2 + (At)2)
Convergence for
fixed p > 0 or for
fixed (At)2/(Ax)2
lIeli x = max 'e ,..'
'.f
example already shows that the stability investigation is not only of theoretical
interest but also of great practical interest.
Proposition 10.6. If the original initial-boundar)' t'alue problem (33) has a
solution u e C(Q) (resp. u e C 5 (Q)), then all the assertions in Table 20.3 are
('alid for the e.'tplicit and the implicit method (resp. .for the Crank-Nicolson
method).
Here the constants K and L depend only on certain derivatives of u. They
are independent of l1.'t and l1t. By the maximum principle for parabolic
equations, u is the unique solution of(33) (cf. Problem 7.2d). Moreover, all the
difference equations are uniquely solvable.
PROOF FOR THE EXPLICIT METHOD. Let e".. = u(x", t,,) - vI'." and let lIuli be
the norm of u in C 4 (Q).
(I) Consistency. It follows from (20) and (22) that
V 4 ,e".. - l1 4x e"." = T"..
with It"..1 S (2- 1 l\t + 12-1(X)2)lIuli.
(II) Stability. By (36) we obtain
Ie "..1 S T max It "..1.
"."
This yields the convergence.
o
One argues in an analogous manner for the implicit method. We discuss
the proof for the Crank - Nicolson method in Problem 20.3.
208
20. Difference Methods and Stability
20.5. Application to Elliptic Differential Equations
let Q = {(x,y) E R2: 0 < x < a,O < )' < h}. We consider the boundary value
problem
u xx + u y ). = 9 on Q.
u = <p on cQ.
(42a)
(42b)
In order to formulate the difference method we choose the lattice points
x p = px, )', = q)'
with x = y = It and N x. = a, M)' = h (Fig. 20.2). We denote by v p ., an
approximate value for u(x p , y,). Furthermore. let
gp.q = g(x.p,y q ), <P p . q = <p(x p .)',).
As the difference equation we choose
&J:rp.q + .\,.rp.q = gp.q at all interior lattice points (x p . )'.) of Q, (43a)
r - cp at boundar y lattice P oints. (43b)
p.q - p.q
By Section 20.2, equation (43a) means
4- ' f r p +l.q + "".q+1 + (.p-I., + ('P,q-I) - I'p" = 4-1(X)2gp". (44)
In Figure 20.2 we indicate with little circles which lattice points are involved
in this equation. The values at all the boundary lattice points are known.
If gp,q = 0 for all p, q, then it follows from (44) that v p .. is the mean of four
lattice point values (cf. Figure 20.2 where ('p" corresponds to the full circle).
This immediately implies the following discrete maximum principle:
fM d ) I}" the .(ullct;on (p, q) t'p., is a solution of (43) with gP.' = 0 for all p, q,
tllen this function takes on its maximum and minimum on the boundary ('Q.
In order to solve (43). observe the following. The linear system of equations
(43) has as many equations as unknowns. The corresponding homogeneous
,
- --1=i
\' -- t-
o q
t--
.t r
Figure 20.2
M.S. Application to Elliptic Differential Equations
209
system (43) with 9,., = 0 and (/),., = 0 for all p, q has only the trivial solution
(".4 = 0 for all p, q, by (M.,). Thus, the inhomogeneous system (43) has a unique
solution.
We deal with the important iterative solution of (44) in Problem 20.4.
In connection with the discrete maximum principle (M.,), recall the classical
maximum principle:
(M) If the function u e C2(Q) " C(Q) is a solution of (42) with g == 0, then this
function takes on its maximum and minimum on the boundary cQ.
From (M) it follows immediately that the original problem (42) has at most
one solution u e C 2 (Q) n C(Q).
Proposition 20.7. If the original boundary value problem (42) has a solution
u e C 4 (Q), then
max lu(x" Y.) - (',.ql S Kh 2 .
p.,
(45)
The constant K depends only on Q and the fourth partial der;vatit'es of u.
Thus, the difference method (43) converges of order h 2 as h -+ O.
PROOF.
(I) Consistency. Let
e ,., = u(x" y,) - v,.,.
It follows from (22), (43), and (44) that
&1le ,., + &).e ,., = t ,.,
(46)
with
h Z a 4 u(p) c 4 u(P)
It,.,1 S _ 6 max .. + .. .
,.Q ax y
(II) Stability. We use a discrete maximum principle to show that
max le,.ql S 2- 1 2(a 2 + b Z ) (46*)
P.4
with
2 = max It,.,I.
p.,
In order to prove this let w(x, y) = 4 -I «(x 2 + y2) and wl'.' = w(x" }'q).
We have
W.x.x + W" = .
The function w has vanishing third derivatives. Therefore, by (21), we
also have
AxW,., + L1&,w p ., = tI.
210
20 Difference Methods and Stability
Letting =".q = e".q + \\',., we obtain
.!, = p., + .\)'= p.q = r ".q + 2 > O.
After multiplication by 1J2.. this is equivalent to the key formula
= 1'.' < 4 - I (= p + I . q + = p., + I + =1'- I ., + = p. q - I ).
By the same argument as in the proof of the discrete maximum principle
(M d ) above.. this implies that the function (p.. q) =,.q takes on its maxi-
mum at boundary lattice points. However.. on the boundary cQ. we have
t' p., = 0 and "'p., < 4 -I (Cl2 + h 2 );
therefore. =p.q < 4 1 1 (a 2 + h 2 ) holds for all p.. q. That is..
t!p., < 2- 1 (a2 + h 2 ) for all p.. q.
Similarly. If we define =p.q = e,.q - "'P.4' then we obtain
('".q > - 2- 1 (1I2 + h 2 ) for all p.. q.
Consequently. (46*) holds.
(III) Convergence. From (46*) one immediately obtains (45).
o
20.6. The Equivalence Between Stability
and Convergence
'Ne consider the operator equation
Au = h.
along "'ith the approximate equations
IIEX.
(47)
AllIIn = h"..
II II E of\'.. 11 = I.. 2.. . . . .
(48)
and we formulate the following conditions:
(H I) Well-posedness (r tlJe prohlel)1. Let X and Y be B-spaces over = iR.. C.
The operators
.4. All: A' -+ t'..
are linear. continuous, and bijective.
It follows from the open mapping theorem A I (36) that the inverse operators
II = I. 2. . . . ..
A-I. A;I: Y -+ X..
n = I, 2. . . . ..
are continuous. This implies the a priori estimates
II A u II > c 111111
"AIIlIli > (""lIuli
for all II EX..
for all u EX.. II = I.. 2. . . . ..
(49)
where c and all the ell are positive constants. Hence condition (H I) tells us
20.7. The Equivalence Theorem of Lax for Evolution Equations
211
that equation (47) (resp. (48)) has a unique solution for each bEY (resp. bIt E Y),
and this solution depends continuously on b (resp. bIt).
(H2) Consistency. We have
A"u ..... Au
b" ..... b
for all U E X as n..... 'X; ,
as n..... OC;.
(H3) Stabilil}'. There is a constant d such that
IIA;111 s d for all n.
(H4) Cont'ergence. From
bIt ..... b in Y
as n.....
it follows that:
II" ..... u in X
as PI..... .
Here U and II" denote the solution of (47) and t48), respectively.
Theorem 20.8 (Kantorovic (1948)). From K'ell-posedness (H 1) and consistency
(H2) it follo\\'s that:
stability (H3) <:> conlergence (H4).
PROOF. The key to the proof is the uniform boundedness theorem.
(I) (H3) => (H4). By (H2) and (H3),
lIu - u,," S dIlA,,(u - u,,)11 = dllA"u - h,,11
..... \I Au - b II as n..... 'Y.).
From Au = b it follows that u. ..... u as n ..... 'XJ.
(II) (H4) => (H3). We set b = bIt for all n. Then (H4) implies that
A;lb..... A-Ib
as n..... 'Xl for all bet:
The uniform boundedness principle A I (35) tells us that
sup II A'; III < oc.
"
o
20.7. The Equivalence Theorem of Lax for
Evolution Equations
We want to translate the fundamental equivalence principle of Theorem 20.8
to evolution equations. To this end, we consider the initial t'alue problem
U'(I) = AU(I), 0 S t S T < x;,
ufO) = Uo E D
(50)
212
20. Difference Methods and Stability
for fixed T > 0 together with the abstract difference method
('(h) = 8(11)110'
r(211) = B(h)t'(h) = B(h)2uo. ....
that is.
,. 1111) = B ( II )"1' 0 ,
n = I. 2. . . . . 0 < " < "0.
(51 )
We formulate the following conditions:
(H I) IVell-posed,less 0." tile prohleIJI. Let .(\' be a B-space and let
.4: D(A) c: X -+ X
be a linear operator where D c: D(A) and D is a dense subset of .t. For
each initial value U o E D. problem (50) has a unique solution II = Il(t).
and this solution depends continuously on the initial values. i.e.. there
is a constant c such that
sup IIII(t) - 11(1)11 £';lu(O) - u(O)1I
(), T
for all u(O). u(O) E D.
52)
F--or each II E ]0.11 0 ], the operator
8 ( It ) : .,t -.. ,,\,
is linear and continuous with sUPO<ltSll o 118(h)1I < x.
In order to define the notion of a generalized solution of problem (50) we
define the linear operator S(t) by
11ft) = S(t)uo.
where "0 E D and II = u(t) denote the unique solution of (50). It follows from
(52) that
sup II S ( 1 ) II 0 - S ( t }ilo ,I < c 1111 0 - f, 0 II for a II u o. 1'0 ED. ( 53)
os's T
By the extension principle in Section 18.12. the operator S(I): D -+ X can be
uniquely extended to a linear continuous operator
S(I): X -+ X
for all t E [0. T]
so that (53) remains valid for all "0' iio e "t. We call
11(1) = 5(1)110'
O < tsT.
a generali:ed solution of (50) if U o E X. In the case where Uo E D. the generalized
solution is also a classical solution. From (53) we get
sup IIS(I)II < c. (54)
os's T
(H2) Con"t;"ttency. There exists a dense subset DI of the B-space X so that
lim I ( h)lI(t) - . (_+! = 0
11-0 h
uniformly on [0. T]
for all generalized solutions u = II(t) of (50) with ufO) E D,.
20.7. The Equivalence Theorem of Lax for Evolution Equations
213
(H3) Stability. There exists a constant b such that
II B(h)" II S b
for all h E ]0, ho] and n e N with nh S T.
(H4) Convergence of the difference method. For all t E [0, T] and all initial
values "0 E X we have
lim II B(h le )"1c"0 - "(I) II = O.
i-X)
Here u denotes the generalized solution of (50) with u(O) = Uo. Moreover,
(hi) is an arbitrary zero sequence, and (n le ) is an arbitrary monotone
sequence of natural numbers with
lim n,h" = t.
,-<X
Recall that t'(nh) = B(h)lI uo denotes the corresponding solution of the
difference method.
Theorem 2O.C (Lax (19S3)). From wel'-posedness (HI) and consiSlenC)' (H2) It
follows ,hat:
stability (H3) <:> convergence (H4).
Corollary 20.8 (Rapidity of Convergence). If the order of consistency is h',
p > 0, then the rapidity of convergence is also h'.
M ore precisely, let u be a generalized solution of (50) with u(O) E DI . Suppose
that (H I), (H2), (H3) hold M'ith
su p B(h)u(t) - u(t + h) Ch'
h S
OSIST
for all h E ]0, ho] where C is a constant. Then we obtain
sup IIB(h)"u(O) - u(t)1I S Kh'
OSlsT
for all h E ]0, ho] and n E f\I with nh = t. Here K is a constant.
PROOF.
(I) We show that every generalized solution u = u(t) of (SO) is continuous
on [0, T]. Indeed, this is true if u(O) e D. In the case where u(O) EX, use
the density of D in X, condition (54), and the triangle inequality. More
precisely, we begin with the decomposition
S(t)u o - S(s)u o = S(t)v - S(s)v + S(t)(u o - v) - S(s)(u o - v).
Then for each £ > 0, we can choose a point v e D, sufficiently close to
"0' such that
IIS(I)Uo - S(s)uoll IIS(t)v - S(s)vll
+ 2 sup II S ( t) 1111 u 0 - l'lI < £
,
if It - sl < (£).
214
20. Difference Methods and Stability
(II) (H3) (H4). We set
d = B(hru(O) - u(t),
where nil < T and ,1#1 t as n . The key to the proof is the
decomposition
n-I
d = L 8(/1)' [B(I1)u( (n - I - k)lJ) - u( (n - k)h)] + ,,(nil) - 11(1).
1=()
II- J} Let ufO) E DI. By H2) and (H3).
I B(IJ)u(t) - ,,(t + h) I
It/II < sup II B(II)' II sup 1111 I h -- ---
1r.1e' I
+ II u(nh) - 11(1)11 -+ 0 as n..... x.
Note that ilia < T and nil t as n ..... -:1_. This is (H4).
The same argument yields Corollary 20.8.
(11-2) Let u(O) E X and let U(l) = S(I)U(O). We will use a similar approximation
argument as in (I). To this end, we begin with the decomposition
d = 8(11)"1' - S(I)r + B(IJ)n(u(O) - v) - S(t)(u(O) - r)).
For each I: > 0 we can choose a I' E D" sufficiently close to ufO), such
that
lid 11 < II B(lI)" I' - S (t)1' II + (S,. II 8(h)" II + II S(t) II) lIu(O) - I'll < f-
if II > lI o fr.). This is (H4).
(III) (H4) => (H3}. Suppose that (H3) is not true. Then there exist sequences
(II,) and (n,) with
lim II B(II.J"l&li = x
'--r
(55)
and n"lI, T for all k. By compactness we may assume that (h.J is
convergent. If (n,) is bounded, then we obtain
sup IIB(h,)nl&lI < sup IIB(h,)II"1& < .
, ,
by (H I). But this contradicts (55). Thus (n,) is unbounded. and hence
hie ...... 0 as k 'x}.
By the convergence condition (H4).
sup IIB(h,)"l&uoll < OC ' for all "0 E X.
,
The uniform boundedness principle A I (35) tells us that
sup II B(h, r k II < x:.
,
This contradicts (55).
o
Problems
215
PROBLEMS
20.1. Special tridiagonal matrices. let b E Rand M 2.
20.1 a. Show that the M x M ..matrix
b - 1 0
-1 b -I
Au(b) =
(56)
-I b -I
o -1 b
has the eigenvalues
A.} = b - 2 cos },
j= I,...,M,
and the eigenvectors
.v} = (sin 2). sin 22).. . . . sin M2)T,
j = I,....M,
where 2} = jrc/( M + I).
Solution: Explicit computation. Cf. Zurmuhl (t 964, M), p. 229.
20.tb. Let A = (a'j) and B = (b , )). We write
o S A S 8 iff 0 < a,) < b , } for all i, j.
Show that
o S AM(b)-1 < A M (2)-1
for all b 2.
Solution: Let h 2. We consider the M x M..matrix
0 b- ' 0 0
b- I 0 b- I
8(b) =
b- ' 0
0 0 b- ' 0
Obviously.
AM(b) = b(1 - 8(b)).
By Problem 20.1 a. the eigenvalues of the matrix A M(2) satisfy 0 < A} < 4 for all
j. Hence the eigenvalues of 8(2) = 1 - 2 -I A u(2) satisfy - 1 < p} < I for all j.
Thus. the Neumann series
(I - 8(2»-1 = 1 + 8(2) + 8(2)2 + ... 0
converges. Since 0 S 8(b) S 8(2), we also obtain the convergence of the series
(I - 8(b))-1 == J + 8(b) + 8(b)2 + ... s (I - 8(2))-1.
This implies
AM(b)-1 = b-I(I - 8(b»-1 s 2- 1 (/- 8(2»-1 = A M (2)-I.
20.2. Crank-Nicolson method and the iterative solution of the corresponding difference
equations. Parallel to Problem 1.6, investigate standard iterative methods for
solving the difference equations (41).
216
20. Difference Methods and Stability
Solution: For fixed q + I, equation (41) reads as follows:
1"'.4+1 = p(2 + 2p)-lh',_I..+1 + V,+I..+I) + h,.., p = I, .... N - I. (57)
Here b,.q depends only on (',., and g,.s. Therefore we can calculate L',., succes-
sively for the time lines q = 0, I, .. . . In fact, if (".f is known up to the qth time
line, then b,.. is known and the values of v on the (q + l)th time line can be
obtained from (57).
In (i) -(iii) below let n = 0, I, ... and v. o . = O.
(i) The total step method reads as follows:
I '.". I. - p(2 + 2p) -I (t ,.II. + V eil. ) + b
".4+1 - ,-1.4+1 ,+1..+1 P.4'
f58)
Here q + J is fixed and p = I..... N - I.
(ii) The oL'errelaxation method reads as follows:
..; = (o[p(2 + 2p)-I(t,"_"I"..1 + t'!1.4.1) + h"..] - ((0 - I)('+I'
(59)
\\'here 0 < (I) < 2.
Note that both the equations (58) and (59) are equil'alenr to the original
equation (57) if we simply replace both v. lI ) and ('fll+1) with t\
(iii) The single .'\tep method results from (59) for W = I which, in contrast to
(58), always makes use of the newest known iteration values.
If we write (58) in the form
"e ll + I. = A 1'(11. + b,
then A = - p(2 + 2pr 1 AN-I (0). By Problem 20.1, the eigenvalues of A are
equal to i.} = p( t + p)-I cosjn/ N,j = 1,.... N - t; therefore, we obtain for the
spectral radius r of the matrix A,
r = p(t + 1))-1 cos(n/N),
i.e.. r < I. Thus, the system (57) has exactly one solution and the iterative
method (58) converges by Proposition t.1 5.
According to Table 1.2 the corresponding spectral radius for the single step
method is r 2 and the optimal relaxation para mete r results for
UJ opt = 1/( I + Ji - r 2 )
with the spectral radius cOopt - I which is less than r 2 .
Therefore, the methods converge faster and faster in the sequence: total
step method, single step method, and optimal relaxation method. The corre-
sponding asymptot ic rates of convergence R'X, are eq ual to log lor -I . 210g lor -I ,
log I o(W OPI - I) -I (cf. Definition 1.11).
20.2a. An example for the rapidir)' of convergence. We consider Example 20.5 once
more with Ax = O. t and At = 0.01. let n be the number of iteration steps needed
to calculate 1'5.1 on the first time line with four place precision. Show that n = t t
(total step method), n = 7 (single step method). and n = 5 (optimal over-
relaxation).
20.3. Proof of convergence for the Crank-Nicolson method. let u E C(Q).
Solution:
(I) Consistency. We denote the difference equation (40) by
L&t' = g.
Problems
217
Let e,., == u(x" t,) - V,.,. As in Section 20.2, there results from the Taylor
theorem that
L 4 e,., t,.,
(60)
with
t,., = 2- 1 AI [urr(x" I,) - uJrJr,(x" I,) - II,(X" I,)] + O((AX)2 + (AI)2).
Since u satisfies the differential equation u, - U,x,x = II. the term with AI
drops out. That leads to an increase in the rapidity of convergence.
(II) Stability. Our goal is the inequality
lien S TlitU (61)
with lIell = max,(L,e:.,Ax)I/2 and an analogous expression for IItli.
Parallel to (4 t), there results from (60) the system of equations
(2 + 2p)e,.....1 - p(e,-l.t+1 + e,+I..+I)
=- (2 - 2p)e,., + p(e,-a.4 + e,+a.,) + AI(t,., + T,.,+I)
for q = I, ..., M - t and p == 1, ..., N - 1 with to., = eN., == O. If we
denote the column vector (ea."..., eN_a.,)T bye, and the identity matrix
by I, then we obtain
(21 + pT N - 1 )e'+1 == (21 - pT N - 1 )e, + AI(!, + t,+I)
with T-, == AN-I (2) (cf. (56»; therefore,
e'+1 = (21 + pT N - 1 )-1(21- pTN-1) JAx e,
+ AI(21 + pT N - 1 )-1 (t, + t,+I)'
By AI (59a), the norm of a symmetric matrix with respect to the Euclidean
norm 11'11 2 is equal to the spectral radius. According to Problem 20.1 and
the spectral mapping theorem A I (6Oc), the assertions in Table 20.4 are
valid.
(62)
Table 20.4
Matrix
Eigenvalues
(5 == 1..... N - 1)
Spectral radius
r == max.IA.1
TN-I
)". == 4 sin 2 (snI2N)
A:t = (21 + p TN -I ) A I A I A
Ai == 2 :t 4p sin 2 (sn/2N) II ).; A; /;,,:
rs! r< 1
It thus follows from (62) that
tte4+1111 S lIe,n1 + 2-'At(lIt,1I2 + lIt,+,1I2)'
For q = 0, we have e. == O. Induction with respect to q = 0, I, . .. yields
tl JAx e,lIz qAt max IIT,nz.
,
This implies (61).
218
20 Difference Methods and Stability
(III) C'on\ergence. From(61)and i,tH = O((X)2 + (1)2)thererollowtheerror
estimates in Table 20.3.
20.4 Itt'rcllire .'\olut;oll cf the dfference equations .for the Poisson equation. Investigate
(44). i e.. study the system
f I' 1.41 + r" .1.41 + ('P.. - I + t"p.. + I - 4t'p.. = (AX)2 g p.q (63)
for the rectangle in Figure 20.2. where p = I, .... N - I and q = I. .... AI - 1.
Solution: We write (63) in the form
r = At" + h.
\\ here
8 I
I 8
-1 =
0
o
-4 I 0
I -4
B= t64)
J
0 -4
I
I B
Here. I denotes the identity matrix. According to the row sum criterion in
Section 1.8. we obtain r(A) < for the spectral radius of the matrix A. Con-
sequently. the total step method
r'" + I) = A r'") + b.
II = 0, I, . . . ,
con\crgcs to the solution I' of (63) as II X for each given "COt. The same holds
true for the single step method. the optimal relaxation method, and the block
iteration method with the rapidity of convergence given in Tables 1.2 and 1.3.
For nonrectangular regions. there result tridiagonal block matrices that have
a more cC'mplicated structure in contrast to (64).
Toda one uses the highly effective mu/tigrid nlelhod,,, on computers (cf.
Problem 20.9).
We recommend Varga (1962. M), Chapter 6. Meis and Marcowitz (1978. M).
and Hackbusch (1985. M) (multigrid methods). A software system can be found
in Rice and Boisvert (1984, M).
20.5. Appli('al;oll to reactor t('clall%gy. The calculation of neutron nows in nuclear
reactors leads to partial differential equations with piecewise constant coeffi-
cients that correspond to the various reactor materials. The difference method
leads to systems of equations with 10 and more unknowns whose matrices,
however. contain many zeros (sparse matrices). In this connection. study Varga
(1962. M). p. 302 (optimal overrelaxation block iterations). Hageman and
Young f 1981. M) fsparse matrices). and Hackbusch (1985, M) (multigrid
methods).
The most challenging problem in technology is controlled nuclear fusion.
Here. the basic equations of magnetohydrodynamics and the stability of plasma
playa fundamental role. In this connection, numerical methods can be found
in Bauer. Betancourt. and Garabedian (1978. M),(1984, M),(1986), Teller(1981.
M). V ols. J. 2, and Betancourt (1988).
20.6. DiffertJII( t' nretlrods ;11 gas dYllam;cs. Shock waves represent a very interesting
physical and mathematical phenomenon. They occur in supersonic now of
gases and correspond to jumps of velocity. pressure. entropy. temperature, etc.
Problems
219
(see Part V). The sonic booms that are caused by supersonic aircraft belong to
these. Till now, exact proofs of existence and uniqueness for the full equations of
gas dynamics are lacking. One has, however, set up sophisticated difference
methods for numerical simulations.
I n this connection, study Smoller (1983, M) (the stochastic Glimm difference
method), Richtmyer and Morton (1967,M) (the Lax-Wendroff difference
method), and Rofdestvenskii and Janenko (1978, M) (comprehensive presenta-
tion). See also Chapter 86 from Part V.
Fundamental results on the design of shock-free airfoils based on numerical
methods can be found in Bauer, Garabedian, and Korn (1972/77, M), V ols. 1- 3.
20.7. The stabilit)' test of John t'on Neumann bj' means of finite Fourier series. In this
chapter we used a priori estimates and the spectral properties of matrices in
order to study the stability of difference methods. There is a more explicit
method for the investigation of stability which represents the solutions of the
difference equations by finite Fourier series.
In order to have something definite at hand, we consider the explicit method
(34) for the heat equation in Section 20.4. Let t,e. t and v Ut be two solutions of
the difference equation (34) that differ only by different initial values. Let the
boundary values and the inhomogeneous member g,.. be equal for both. We
set ". = .,CI) - t"(2). Then
V.1l "',.. - &x "',.. = 0
(65)
holds for p = 1, .. ., N - I and q = 0, . . ., M - I. We interpret "' as the error
in the solution t,C I) and ask ourselves when the initial error '''',.0 at time t = 0
remains bounded for all time steps. The basic idea consists of the trial
IV
"',., = L C,,:17:
.=0
(66)
with
_ b h '''4.
'-ft - ,
" - e l .. 4 .
.,,, - ,
and with the constants Crt to be determined. If Re 2. S 0, i.e.,
117,,1 s I
for all n,
(67)
then
1"',..1 S L I C"I
.
for all p. q,
i.e., the errors remain un iform I)' bounded. In the case of (67), the difference
method is said to be stable in the sense of John von Neumann.
20.7a. Application to the explicit method for the heat equation. Show that for equation
(6S) the coefficients Crt are uniquely determined from the initial errors w,.o and
that for
1/(x)2 S i
the method is stable in the sense of John von Neumann. This stability assertion
coincides with Table 20.3, which refers to our general notion of stability.
Solution: Let p = 1/(dx)z. The coefficient determinant for the determination
of Crt is a Vandermonde determinant and thus not equal to zero. From (65) and
(66) we obtain
"',.,+. - "',.. = p(w,-I.t - 2w,.. + w,+l.t)'
220
20. Difference Methods and Stability
therefore 'I" - I = p(; I - 2 + .), i.e.,
'I. = 1 - 4p sin l (nnAx/2).
Thus. if p s . then
111..1 s 1
for all n.
20.7b. Application to the implicit method for ,he heat equation. Show that the implicit
method (37) is stable in the sense of John von Neumann for all At/(Ax)l > O.
Solution: One obtains
'I.. = (I + 4p sin l (nnAx/2»-I.
20.7c. Application to the wat't equation. To solve the wave equation
u" - U.K.K = g,
(68)
we make use of the difference method
AAlv,., - 4t',., = g,."
I.e..
t'".,., - 2('"., + V".,_I = r 2 (v,+.., - 2v", - V,_I,,) + g,.,(At)l, (69)
with r = At/A.'(. In Figure 20.3 we have marked the points involved in (69) by
circles. As usual, (',., denotes an approximate value for u(x" t,), where ....:, = pA.
and I, = q41 with P = 0, :t I, + 2, ... and q = 0, I, 2, ... . In addition, we set
g,., = g(.'(,. I,).
More precisely, we want to solve the initial value problem for the wave
equation (68), i.e.. we have to add the following initial condition to (68):
u(.. 0) = a(:() and u,(....:.O) = h(x) for all x e A,
where a and b are given functions.
This means that the values (',., are prescribed for the time lines q = 0 and
q = I. namely. we set
l'".o = a(....:,,) and t,. I = v,.o + b(....:,)At for all p.
From the difference equation (69) we obtain successively the values v,., for the
time lines q = 2. 3. . . . .
,
'".,
.t
XI'
Figure 20.3
Problems
221
Show that for r S I the difference method (69) is stable in the sense of John
von Neumann.
Hint: One obtains
,, - 2",,( 1 - 2r 2 sin(nnAx» + 1 = o.
cr. Smith (1965, M), Example 3.2. This result permits an interpretation that is
generally important for hyperbolic differential equations. The theory or char-
acteristics for the wave equation shows that the value II(." t f . I ) of a solution
u of (68) depends only on the values of u and u, at all points (x, 'f) with
x e [x, - &t,x, + At]
(Fig. 20.3). Because of the stability condition r S I, therefore &t S &x, we obtain
the following:
The dependence region of the hyperbolic differential equation must be smaller
than that of the corresponding difference equation.
This is a natural general principle since the solution of the difference equation
can, roughly speaking, not get away with less information than the solution of
the differential equation.
A complete investigation of the wave equation can be found in Forsythe and
Wasow (1960, M) and in Isaacson and Keller (1966, M). In this connection, the
wave equation is handled with the Fourier method, and the corresponding
difference equation completely parallel thereto with the discrete Fourier
method (66).
A more exact study of the John von Neumann stability test can be found in
Richtmyer and Morton (1967. M).
20.8. Slabil;t}, intestigations for ordinar}' differential equations. In this connection.
study Albrecht (1979, M). Special caution is advisable in dealing with so-called
stiff s}'stems for which the eigenvalues of the matrix exhibit extreme differences.
Such stiff systems appear frequently in microelectronics, chemical reaction
kinetics, and so forth. There, special careful stability investigations are required.
Cf. concerning these problems also Gear (1971, M), Willoughby (1974, P), Stoer
and Bulirsch (1976, M), Shampine and Gear (1979, S Miranker (1981, M
and Dekker and Verwer (1984, M).
20.9. Multigrid methods. The goal of these methods is to obtain very effective and
very fast difference methods for solving linear and nonlinear elliptic differential
equations. The basic idea is the following:
(i) Error smoothing by relaxation (see Problem 1.6).
(ii) Because of (i), corrections for the approximate solutions can be calculated
on coarser grids. This idea can be used, recursively, employing coarser and
coarser grids.
(iii) Combination with nested iteration, i.e., use of coarser grids to get good
initial approximations on tiner grids.
This procedure yields asymptotically optimal methods, i.e., the computa-
tional effort for getting the discretization accuracy is proportional to the num-
ber of the unknowns of the difference equation.
Study the survey articles Stuben and Trottenberg (1982) and Brandt (1982)
as well as the monograph by Hackbusch (198S).
222 20. Difference Methods and Stability
20 10. 7 he deject forreft ion method. The basic idea of t his very effective modern
numerical method is the following. Suppose we want to solve the equation
Au = 0
(I) Coarse discretilation. Compute a rough approximate solution s by means
of a low-order discretization method: A I S = o.
Hi) Fine discretization. Use a high-order discretization method in order to
compute the defect d of s: d = A 2 s.
(lin lJse d and a low-order discretization in order to compute the correction f
b) n1ea ns of A I C = d.
(i\) Compute the corrected approximate solution .'i + f.
We recon1mend the survey by Bohmer and Stetter (1984}. The multigrid
method In Problem 20.9 is closely related to this method.
References to the Literature
C.lassical \\'orks: Courant. Friedrichs, and Lewy (1928) (fundamental paper}, \'.
Neumann and Richtmyer (1947), Lax (1953), Lax and Richtmyer (1956).
Introduction: Smith (1965, M), Stoer and Bulirsch (1976, M), Meis and Marcowitz
(1978, L).
Handbook of numerical analysis: Ciarlet and Lions (1988, M), V ols. I ff (finite
element method. difference method, etc.).
(:Iassical monographs: Forsythe and Wasow (1960}, Isaacson and Keller (1966).
Richtmyer and Morton (1967), Birkhoff and lynch (1984).
Monographs: Collat1 (1960), Varga (1962), Ames (1977), Ansorge (1978), Albrecht
(1979), Botha (1983). Marcuk and Shaidurov (1983), Shokin (1983), Samarskii (1984),
Tornig (1985), Reinhardt (1985), Hackbusch (1985), (1986).
Sparse matrix technology: Hageman and Young (1981, M}, Coleman (1984, M),
Varga (1984, S}, Duff(1986. M). Pissanetzky (1986, M).
Difference method on irregular networks: Heinrich (1987, M).
Survey of the important modern defect correction method: Bohmer and Stetter
( 1984).
Multigrid methods and fast solvers: Stliben and Trottenberg (1982, S) (introduction),
Brandt (1982, S), Iackbusch (1985, M) (comprehensive representation).
Multistep multiderlvative methods: Gekeler (1984, L).
Functional analysIs. discrete Sobolev spaces, and difference methods: Teman
(1970.M}, (1977.M), Aubin (1972,M), ladyzenskaja (1973,M), Wloka (1982.M).
Rhcinhardt f 1985. M).
Shock waves in gas dynamics and difference methods: Smoller (1983 M) (recom-
mended as an introduction), Lax and Wendroff (1964), Glimm (1965) (funda-
mental paper on stochastic difference schemes), Richtmyer and Morton (1967, M)
Rozdestvenskii and Janenko (1978, M), DiPerna (1983).
Design of shock-free transonic airfoils and numerical methods: Bauer. Garabedian,
and Korn (1972/77, M). Vols. 1-3, Jameson (1988).
Controlled nuclear fusion, stability of plasma, and numerical methods: Bauer.
Betancourt, and Garabedian (1978, M), (1984, M). (1986), Teller (1981, M), V ols. I, 2,
Betancourt f 1988).
Ordinary differential equations and difference methods: Stoer and Bulirsch (1976, M)
(introduction), Stetter (1973, M) (general theory), Albrecht (1979, M).
Stiff systems of ordinary differential equations: Stoer and Bulirsch (1976, M) (intro-
duction). Shampine and Gear (1979,S) (a user.s guide), Gear (1971, M), Willoughby
References to the literature
223
(1974, P) (numerous scientific applications), Albrecht (t 979, M), Miranker (1981, M),
Dekker and Verwer (1984. M).
Software: Rice (1983. M Rice and Boisvert (1984, M) (solving elliptic problems
using ELLP ACK), BirkhofT and Schoenstadt (1984, P) (fast elliptic problem solvers),
Hackbusch (1985, M).
Software system for general iteration methods: ITP ACK (d. Birkhoff (1989)).
Software system for solving partial differential equations and other general classes
of mathematical problems IMSL (1987).
Pursue the journal ACM Trans. Math. Softv.'are.
Simulation: Bratley (1987, M).
Efficiency of algorithms of analysis: Smale ( 1985, S).
Computational complexity: Traub and Wozniakowski (1980. M), Calude (1984, M).
The art of computer programming: Knuth (1968, M). Vols. 1-3, Knuth (1982, M),
(1989, M) (foundations of computer science).
Handbook of artificial intelligence: Cohen and Feigenbaum (1981, M), V 015. t - 3.
Supercomputer methods: Murman (1985, P), Lichnewsky and Saguez (1987, S),
Martin (1988. S).
...
LINEAR MONOTONE PROBLEMS
That it is not enough to look for the most general solutions of partial differential
equations, but rather that the question of the fulfillment of special boundary
conditions and initial conditions had to be closely studied, was convincingly
shown in 1822 by the appearance of Fourier's "Theorie de la chaleur" (heat
theory), and this concept was, for example, explicitly expressed by Riemann
around 1860.
The validity of theorems on eigenfunctions can be made plausible by the
following observation made by Daniel Bernoulli (1700-1782). A mechanical
system of n degrees of freedom possesses exact n eigenoscillations. A membrane
is, however, a system with an infinite number or degrees of freedom. This system
will, therefore, have an infinite number of eigenoscillations.
Arnold Sommerfeld (1900)
This book originated as a course of lectures that I have delivered since 1949
at the Department of Mathematics and Physics of Leningrad University in the
areas of mathematical physics and partial differential equations.
The content of these lectures varied as my own undentanding of the
subject developed. However, the pivotal idea, which determined the style of the
lectures, was clearly formulated from the very beginning. This idea consists of
replacing classical formulations of boundary value problems by generalized
formulations... .
The idea of introducing generalized solutions first started to penetrate mathe-
matical physics in the 1920's. It came from two sources. The first source was
that of two-dimensional variational problems. Investigation of these necessi-
tated the extension of classes of functions among which a minimum is sought,
and admission for consideration not only of continuously differentiable func-
tions but also of continuous functions possessing so-called generalized deriva-
tives (Tonelli classes) . . . .
The second source of generalized solutions was that of nonstationary prob-
lems, first the wave equation Un = C Z Au and then the equations of hydro-
dynamics. Discontinuous solutions were introduced for both a long time ago:
for the former, plane and spherical waves with a strong discontinuity on the front
moving at velocity c; and for the latter, solutions describing shock waves. In the
225
226
Linear Monotone Problems
1920's. researchers were trying to understand which discontinuous solutions
should be considered as "admissible" . . . .
The 1930's led to a further development of the above-mentioned trends. In
K. O. Friedrich's papers 1934), (1939), devoted to finding the minimum of the
quadratic functional
)(u) = f ( . t a'JD,uDJu + au 2 -- 2 f U ) JX
(; .. I I
under the boundary condition II = 0 on tG and the ellipticity condition.
, ,
: : > . , : 2
'- a'J"JIi - ( '- i ·
.j-t FI
c > O.
the considerations "ere carried out in the class of functions which subsequentl)'
was designated by JV 2 1 (Gt. ..
In 1936. S. L. Sobolev obtained generalized solutions of the initial value
problem for second-order hyperbolic equations with sufficiently smooth coeffi-
cients and homogeneous initial conditions but rather "poor" free terms. . . . In
the late 1930's, Sobolev and his associate, V. I. Kondraov, obtained the most
complete results, for that time. for the spaces W,"'tG). These were preceded by
results obtained by F. Rellich (t930) on the compactness of embedding W 2 1 G'
in 1..2(G)... .
Also. in the 1930's. J Leray "'ent beyond the scope of classical solutions for
boundary value problems. The paper by leray (1934) concerns nonstationary
solutions of the equations of hydrodynamics for viscous. incompressible nuids.
The same applies to N. M. Gunter's work on the first boundary value problem
for the equation L\u = I and on the initial- boundary value problem for the
equation of the oscillation of a nonhomogeneous spring. .. .
I n the late 1940's. I proposed that generalized solutions of boundary \ alue
and ,nitial - boundary value problems for various types of equations elliptic,
parabolic. and hyperbolic) should be determined with the help of integral ic/en-
1;1;£1$ replacing the equation and, sometimes, a part of the initial and boundary
conditions. Furthermore. the importance of the fact that for each one can
introduce various classes of generalized solutions. defined by that functional
space ",'. to which the generalized solution which is sought should belong, was
alo duly noted. . .. The choice of the space W is up to the researcher and could
be limited only by smoothness of data. The only requirement which must be
atitied is that of the admissibility of the introduced extension. i.e.. the require..
ment that the uniqueness theorem be preserved in the class W provided that this
uniqueness theorem is in agreement with the "spirit" of the problem and occurs
in the class of classical solutions. . . .
Thus. the definition of a generalized solution of a problem was separated from
any method of obtaining it (as it was earlier in the papers of K. O. Friedrichs
and S. L. Sobolev) and. even more. from any analytic representation of it (as in
the work of J. Leray and N. M. Gunter). . . .
At first. I established the existence of admissible generaliled solutions using
the method of finite differences. ... The entire problem under discussion was
realized in the book by Ladyzenskaja (1953). which is mainly devoted to initial
boundary value problems for hyperbolic equations that were most difficult and
were the least investigated at that time.
Olga Aleksandrovna Ladyzenskaja 1973)
I n recent years. progress in the theory of partial differential equations has been
tremendous, often in unexpected directions, while also solving classical problems
in more general settings. New fields have been added, like the study of variational
linear Monotone Problems
227
inequalities. of solitons, of wave front sets, of pseudodifferential operators. of
differential forms on manifolds. etc. Much of the progress has been made by
the use of functional analysis. However, in the process, much of the original
simplicity of the theory has been lost. This is perhaps connected with the
emphasis on solt.ing problems, which often requires piling up mountains of a
priori inequalities and the skillful juggling of function spaces to make ends meet.
It is good to remember that mathematics is not only concerned with solving
problems, but also with studying the structure and behavior of objects that it
creates. One of the best examples is the classical theory offunctions of a complex
variable. It. incidentally, does solve problems as in the Riemann mapping
theorem. But much of its beauty lies in statements that can hardly be considered
as "solving" anything. like the calculus of residues, or Picard.s theorem, or
Cauchy.s formula
(C)
1 f f(K')
I(z) = _ 2 . d"..
n, c K' - z
The only "problem" solved by (C) is the improper one of determing I from its
values on the curve C. which generally has no solution. Formula (C) is not only
strikingly beautiful but also extremely useful. It shows immediately that each
analytic function f can be differentiated infinitely often and can be represented
by convergent power series.
Fritz John (1984)
In the following Chapters 21 through 24 we consider operator equations
and evolution equations with linear monotone operators. In Chapters 25
through 36 this linear theory will be generalized to nonlinear problems.
CHAPTER 21
Auxiliary Tools and the Convergence
of the Galerkin Method for
Linear Operator Equations
I am not able to learn any mathematics unless I can see some problem I am
going to solve with mathematics. and I don.t understand how anyone can teach
mathematics without having a battery of problems that the student is going to
be inspired to want to solve and then see that he or she can use the tools for
solving them. Steve Weinberg (1986)
(Winner of the Nobel Prize in physics in 1979)
In this chapter we consider the following auxiliary tools. which will be used
very frequently in the remaining chapters of Parts 11- V:
(i) generalized derivatives and Sobolev spaces;
(ii) the Sobolev embedding theorems;
(iii) weak and weak. convergence;
(iv) relation between linear operators and bilinear forms;
(v) projection operators, bases, and Galerkin schemes with applications to
finite elements and polynomial bases.
In order to display the basic ideas of the crucial Sobolev embedding
theorems as clearly as possible, we first prove the Sobolev embedding
theorems in RI and after that in R N . This approach is well-situated for
an introductory lecture. Indeed, we will consider two different important
approaches to Sobolev spaces:
Elementary inequalities for integrals based on the Holder inequality and
the Young inequality (Section 21.4);
Fourier transform (Section 21.20).
Furthermore, we consider the following:
(a) Abstract Fredholm alternatit'es and the Riesz-Schauder theory for com-
pact perturbations of invertible operators.
229
230
21 Auihary Tools and the Con\'crgcnce or the Galerkin Mcthod
(b) The main theorem on the approximation-solvability of linear operator
equations and the convergence of the Galerkin method.
(c) Interpolation inequalities. a general convergence trick. and the conver-
gence of refined iteration methods.
(d) The fundamental Gagliardo - Nirenberg interpolation inequalities for
Sobolev spaces. the Moser-type calculus inequalities. and the weak
sequential continuity of nonlinear dilTerential operators in Sobolev spaces.
(c) Banach algebras and Sobolev spaces.
The general theorem in (b) applies to
k-contractive perturbations of the identity. and
compact perturbations of linear strongly monotone operators.
These t\\'O classes of operators allow many applications because they are
related to
the Banach fixed-point theorem.
the Riesz Schauder theory. and
the theory of monotone operators.
Thus. the convergence of the Galerkin method can be established for broad
classes of operators.
More precisely. the main theorem on the approximation-solvability of
linear operator equations in Section 21.16 shows that
1111 - II" II s const dist ( u. X,,).
where II is th solution of the original equation Au = b, and u" is the solution
of the corresponding Galerkin equation in the finite-dimensional space X".
Hence we obtain the following fundamental result:
Tile rapit/it.r of fOllrergellce or tlte Galerkin method depet1ds on dist(u. X,,).
()bserve the following crucial fact. The methods of approximation theory
allow us to estimate dist(u, X,,). In particular, we will consider such estimates
for polynomials (the theorem of Jackson) in Section 21.13. and for finite
clements in Section 21.14.
The point is that the estimates for dist(u. XII) do not depend on the specific
form of II. but only on the smoothne.s clas. of u.
Roughly speaking, we obtain the following: The convergence of the
Galerkin method is faster, the smaller dist(u, XII) is. Moreover, dist(u, XII) is
smaller, the smoother the solution II is. This implies that:
The conl'erge'Jce of the Galerkin method is faster, the smoother the solution
u ;..
Clearly. this result is fundamental for numerical analysis. In Chapter 20 we
have discovered a similar principle, namely: The convergence of an appropriate
21.1. Gcncralilcd f>Crjvativcs
231
difference method is faster. the smoother the solution of the corresponding
differential equation is.
Concerning (e) above, we will prove a relined version of the Banach fixed..
point theorem in Section 21.] 8 via tuples of s.spac:es
XyZ
and via interpolation inequalities. This allows important applications to
mathematical physics (quasi..lincar symmetric hyperbolic equations in Chap-
ter 83). The use of sel'eral 8-spaces in connection with interpolation inequal-
ities represents an important moder,. strategy in the theory of nonlinear partial
differential equations.
The reader who wants Co learn, as quickly as possible, applications of the
tools from this section to partial differential equations may pass to Chapter
22 after reading Sections 21.1 through 21.3 (So bole v spaces) and Section 21.10
(bilinear forms).
21.1. Generalized Derivatives
The simple poinl of departure for the definition of generalized derivatives is
the classical integration by parts fi,,,"ula
t uD-l.dx = (-.)"'1 So (o-,,)vdx (1)
for fixed u e CClI(G) and all v e q{G). Here, G is a nonempty open set in R N
with N I and D« denotes a partial derivative of order 1«1. In the special case
N = t with G = ]a.h[, formula 'I) corresponds to
f." uv(-, Jx = ( - I r L' ,j-)rulx
for u e c-(a, b) and all v e Co(a, b).
Definition 21.1. Let x = (I'. .., N) and = iJloi. By a multiindex tx =
(er 1 . · · · t N)' we understand a tupel of nonneplive int.rs «I t .. . , (J.N. We set
ItZl = tli + .. · + tiN and
IYu - 0':- ... na. u
I "t
.
I.e.,
at-l ll
D-u =
aii · .. aTt.
For « ::: O. we set DOu = II.
Formula (I) is obtained by a repeated application of the integration by parts
formula (18.8). Note that ve C:(G) vanishes on a boundary strip of G. Thus,
in contrast to (18.8), no boundary integrals appear.
232
21. Auxiliary Tools and the Convergence of the Galerkin Method
Now to the point. In formula (I) we set
\V = Du.
Hence we obtain the key formula
t UD2(' dx = ( - 1)1«1 t WI' dx for all l' e Co(G). (2)
The idea of the following definition is that relation (2) remains true for certain
nonsmooth functions 14 and M'.
Definition 21.2. Let G be a nonempty open set in R.... with N > I and let u..
\\' E L 1.loc( G). Then, \\' is called a generalized derivative of u of type DfJ iff (2)
holds. We write \v = Du.
This is one of the most important definitions of modern analysis.
Proposition 21.3. If II E C"'( G), then all tire classical partial derit'atives of u lip
to order m are a/so generalized derivatil'es,
Elich generalized der;lat;t'e \\' = D:lu is unique up 10 a change of ,he ("alues
of '" on a set (r ",eas"re zero.
PR(X)F. The first statement follows from (I) and (2).
Suppose that ". = D 2 u and w = D«u. By (2),
f (\\' - \v)vdx = 0 for all v E CO=(G),
(i
By Proposition 18.36.. this implies \\'(x) = M'(X) for almost all x E G. 0
The following examples describe typical generalized derivatives. Roughly
speaking.. we shall obtain the follo\\'ing:
Piece,,'ise continuous deri,)atit'es are generalized derivatives,
This principle is important for the construction of finite elements (piecewise
polynomial functions), which playa fundamental role in modem numerical
analysis as basis functions for the Ritz and Galerkin methods.
EXAMPLE 21.4. Let u(x) = Ixl. Define
I if x > 0,
\,,( x) = - 1 if x < O.
if x = O.
where 2 is a fixed. but otherwise arbitrary.. real number. Then the function \\'
is the generalized derivative of u on each open interval ]a, b[.
21.1. Generalized Derivatives
233
Note that w(x} = u'(x) if x :F 0, i.e., w(x} coincides with the classical deriva-
tive of u. At the point x = 0, where the classical derivative of u does not exist,
the value of w can be chosen arbitrarily.
PROOF. Suppose first that 0 e ]a, b[, where -00 < a < b < 00. For all
l' e C(a, b), integration by parts yields
f: uv'dx = J.0 uv'dx + f: uv'd:(
= uv ° + uv" - f o u't,dx - r" u'vdx
G 0 G Jo
= -1" wvdx.
Note that t'(a) = v(b) = 0 since v e CO(a, b).
In the case where a = -00 or b = +00, integrate over a sufficiently large
bounded interval J with the property that v vanishes at the boundary points
of J. 0
STANDARD EXAMPLE 21.5. Let the function u: [a,b].... R be continuous and
piecewise continuously differentiable on the compact interval [a, b].
Let C be the set of points ., where the classical derivative u'(x) exists. Define
( { u'(X) if . e C,
\\' x) =
arbitrary otherwise.
Then the function w is the generalized derivative of u on ]a, b[.
More precisely, we assume that the function u has the following properties:
(i) u is continuous on [a, b].
(ii) There exist finitely many points a, with a = ao < a I < · .. < a" = b such
that u is continuously differentiable on the open subintervals ]a"a'+1 [
and the derivative u' can be extended continuously to the closed sub-
intervals [ai' a,+ I] for all ; (cr. Fig. 21.1).
PROOF. Divide the interval [a, b] into the subintervals [ai' a;+ I] and use the
same argument as in the proof of Example 21.4. 0
u
QI
Q2
I
b
Figure 21.1
234
11 Auxiliary Tools and the ('on\'ergence or the Galerkin Method
- G:
G,
Figure 21 2
EXAMPLF 21.6. Let G be an open triangle in H 2 . Divide G into three open
subtriangles G... G 2 . G) as pictured in Figure 21.2. Suppose that the function
II: G -+ iR has the following properties:
(i) "is continuous on G.
(ii) 1# has continuous partial derivatives D. u and D 2 11 on G., G 2 , G J , which
can be continuously extended to G.. G 2 , G J , respectively.
Define
{ Djl,(X) if x E G i for some i..
"'.( x) =
} . arbitrary otherwise.
Then the function "j is a generalized partial derivative of II on G.
PRC)()f. For all r E C (G).. integration by parts yields
f IIDjl' (Ix = r f IID j l'dx
c; . <i,
= L f . IIrnj ds - L f rDjll dx
. t G, . G,
= - fG ,oDju d.\:o
Note that the boundary integrals drop out since they have pairwise different
signs because of the opposite directions of the outer unit normal n = (n... n 2 ).
In this connection. observe the continuity of u. 0
EXAPI.E 2 J. 7. Obviously.. the preceding example can be generalized imme-
diately to much more general situations. Roughly speaking, we need the
following:
(i) (j is a bou nded region in R....
(ii) I' E Ct(G) for fixed k = O. I. ... .
(iii) All the partial derivatives D 2 u with 121 = k + I are piecewise continuous.
Here.. we assume that the surfaces of discontinuity are sufficiently regular.
i.e.. they belong to Co. · .
21.2. Sobolev Spaces
235
Then the functions Du with 121 = k + 1 are generalized (k + 1 )-th partial
derivatives of u on G. In this connection, the values of D 2 u can be chosen
arbitrarily on the surfaces of discontinuity.
Rem.rk 21.8 (Distributions). Let u(x) = Ixi and consider the generalized deri-
vative "0 = II' as in Example 21.4. Then there does not exist any function v
which is the generalized derivative of . on R. However, we obtain that
w' = 2b,
where b denotes the so-called Dirac b-distribution at x = O. This will be
explained in the Appendix in A 2 (66b). If G is a nonempty open set in IAN, then
each Cunction u e LI.1oc(G) has derivatives of arbitrary order in the sense of
the theory of distributions (cf. A 2 (65), (66)).
In what follows we do not need the theory of distributions since we only
work with generalized derivatives which can be represented by functions.
21.2. Sobolev Spaces
Let 1 S P < x. We set
lIuli p = (L lu(x)IP dx riP
(3)
and
lIuli MoP = ( L f I Dalu(.'()I P dX ) ' p. (4)
o Sl21 s. G
Here. the sum is taken over all the partial derivatives of u up to order m. Recall
that DOu = u. Moreover, we set
(ult'h = L ul"d.
and
(ul r).. 2 = L f D2uDaI v d:-<.
o s 1.1 S ... G
(5)
Note that
( ) IIp
lIull.. p = L II D2u ll: .
Os 1 2 1 s.
(lllv).02 = L (DaluIDt')2'
o s 1 2 1 s .
For It. = 0, we obtain that Ifulloop = flullp and (ulv)002 = (ulv)2'
Definition 21.9. Let G be a nonempty open set in R N with N 1. Let I S P <
'X and m = I. 2. .. . .
236
21. Auxiliary Tools and lbe Convergence of'tbe GaJcrkin Method
(i) The Lebesgue .pace L,(G) is the set of all measurable functions u: G -. R
with" "", < 00.
(ii) The Sob(Jlev space W;(G) is the set of all functions U E L,(G), which have
generalized derivatives up to order m such that
0-" E L,,(G) for all «: 1«1 S m.
For m == O we set W,O(G) = L,,(G).
The spaces W:(G) will be defined at the end of this section.
Propositioa 21.10. Let J p < 00 and m = 0, It _ .. .
(a) The .el W".(G) together with ,he nor", U-8.." hecomn a real B-space
provided we identify any two !wu:tian., which differ unly on a set of N-
dimensional LebeSfl"e measure zero.
(b) For p ::: 2. Wf'(G) is an H-space with the scalar product (.,. )..2-
Pa(X)F. Ad(a). We use the well-known fact that L,,(G) becomes a B-space with
the norm II. n" provided two functions are identified whose values dilTer only
on a set of N-dimensional Lebesgue measure zero.
It follows from the Minkowski inequality
C I, + ",1' )'" Ct 1(,1' )'" + Ct 1",1' )'"
for all real i' "i that
Nil + I7R.., S HuH.... + nul..,.
i.e.. II-II.." is a norm on W,.(G In order to prove the completeness of W,.(G
let (u.) be a Cauchy sequence in W,,"(G). Then (u.) and ("'''.) are Cauchy
sequences in L,,(G) for 1«1 s m, and hence
... II in L,,(G) as n <x).
0-"_ .... w. in L,,( G) as n -. 00.
In this connection, note that ID-"I" S luB..". By Example 18.IS, we obtain
that
... -+.. in L 1 (H) as n -. 00,
0-... ... w. in La (H) as n -+ CO.
ror each compact subset H of G.
Thus, it rollows from
L u.l>-udx -= (-It" So (1)-,,.)114,, for aU 11 E CO(G)
rhar
So "I>-udx = (-It" t w.odx
for all U E C:(G).
21.3. The Sobolev Embedding Theorems
231
This implies W tI = Dtl u and War e L,(G); therefore,
u" ..... u in W p '"( G) as n.... 00.
Consequently, each Cauchy sequence in Wp'"(G) is convergent, i.e., W,'"(G)
is a B-space.
Ad(b). This follows from the well-known fact that L 2 (G) is an H-space with
the scalar product ( '1' )2' 0
Definition 21.11. Wp'"(G) denotes the closure of Co(G) in Wp'"(G).
Explicitly, this means the following. Let u e W,Ift(G). Then. u belongs to
Wp"'(G) ifTthere exists a sequence (u,,) in CO'(G) with lIu" - ull",., ..... 0 as n ..... (£.
Proposition ZI.I%. W,"(G) is a B-space with the norm fI.II.... p and W 2 "'(G) ;s an
H-space ,,'ith the scalar product ('1' )...2'
PROOF. Note that Co(G) is a linear subspace of W,"(G). Hence W,"'(G) is a
closed linear subspace of Wp"'(G). 0
Let G be a nonempty open set in AN, N I. The space Lx(G) has been
introduced in Section 18.6. We set
II u II 1ft. = L II [)tI u n .
Itli ...
By definition, the Sobolev space W:(G) consists of all functions u e Lx(G)
which have generalized derivatives up to order m such that
DaR u e Lx(G) for all : II S m.
Proposition 21.10 remains true for the space W,"'( G) with p = 00.
21.3. The Sobolev Embedding Theorems
Definition 21.13. Let X and Y be B-spaces over I( = R. C with X c: Y. The
embedding operator j: X ..... Y is defined by j(u) = u for all u e X.
(i) The embedding X S; Y is called continuous iff j is continuous, i.e.,
lIulf, S const lIulix for all u e X. (6)
(ii) The embedding X S; Y is called compact iffj is compact, i.e.. (6) holds and
each bounded sequence (u ll ) in X has a subsequence (u ll ') which is conver-
gent in 1':
More generally, we speak of an embedding if there are two B-spaces X and
Y over I< and an injective linear operator j: X ..... Y. Since j is injective, we can
identify u with j(u). In this sense, we write X s; Y.
238
,
21. Auxiliary 10015 and Ihe ("onversence of ahe Galerkin Method
Rccalilhat two norms n.1t I and N .11 2 on the B-space X are called equivalent
iff there exist positive constants (' and d with
null. s cllull2
and
0llD 1 s dllull.
for all u eX.
The importance of this notion results from the fact that many properties are
invariant under a passage to equivalent norms, e.g., this is true ror "conver-
gence. 8-space. continuous operator, compact operator. bounded set, etc."
We set
lIull1.2 :0: (t (u 2 + (ur )dx)'I2,
"ull..1.0 = ( f t (OJII)2 dX ) I12 .
JG 1-1
(7)
Proposition 11.14. Ltl G IN.' a hounded region in RN with N I. Then:
(a) Ti,t' "(Jrm. n-II.. 2 and II. n 1.2.0 tire equivalent on J41 2 1 (G).
(b) 1'he (Imbedding WZI(G) L 2 (G) is omfHJCt.
(c) The emht'dtl;ng.
Lz(G) W Z I (G) W 2 1(G) 2 W Z 3(G) ;2 .. ·
are c-t)mpafl.
It follows from (a) that t G) is an H-space with respect to the so-called
energetic scalar product .
. .
(ulv).%.o - f f D,uO,vdx. .
<6 '-1
UUftT.2 a (L ,tJD.)2dX + fIG UdXrr12,
lIuHT = ( f (0,U)2 dx + r U2dO ) II2.
J(i I Jna
In A I we set $,1<; ,,1 dO = U(G)2 + u(h)J, where G - ]a, h(.
We now set
(8)
Theorem 21.A. Let G be a bowed region In AN with N I and sufficiently
smntJth boundary, t I.e., aG E CO.,. Theil:
(8) Density. c«'(G) is dense In W 2 1 (G).
(b) Compact embeddina. TIw ""bedtling Wz-(G) L 2 (G);, CDlllptJct.
(c) Equivalent norms. &Ie" of IIw two IIOI1fU ill (8) iI till quiMlIt IIOrIfl on
W 2 t (G).
I Ie N :1:0 I. then Gis. bounded open in1erY11l... in PropoU tion 21.14. .
21.3. The Sobolev Embedding Theorems
239
(d) Regularity. For m - j > N/2, the embedding
W 2 '"(G) c CJ(G)
is continuous, i.e., each function u e W2"(G) belongs to Cj(G) after changing
tI,e t"alues of u on a set of N-dimensional Lebesgue measure zero, if
necessary.
(e) Generalized boundary function. There exists exactly one linear continuous
operator
B: WZ1(G) -+ L 2 (cG)
,vitll the property that, for each u e C 1 (G), the function
Bu: eG -+ R
is the classical boundary function to u, i.e., Bu is the restriction of u: G -+ R
to the boundary eG.
In the case where u e Wi (G), the function Bu E L 2 (cG) is called the
generalized boundary function to u. The values of Bu are uniquely deter-
mined on fG up to changing the t'alues on a set of surface measure zero.
If u e Wz'(G), then Bu = 0 in Lz(cG), i.e.,
Bu(.) = 0 for almost all . e eG. (9)
1 n the following, instead of (9) we will simply write:
u = 0 on eG.
( 10)
Corollary 21.IS. Let G be a bounded region in R N 'ith N > 1 and cG e co. I.
Then:
(a) CX(G) is dense in Wz"'(G) for m = 0, 1, 2, ... .
(b) The embeddings
L 2 (G) ;2 W 2 1 (G) ;2 Wf(G) ;2 W 2 J (G) ;2 ...
are compact.
(c) If liE W2"(G) with m > 1, then B(D2 U ) = 0 in Lz(cG) for 121 S m - 1, i.e.,
D 2 u = 0 on cG for all : II m - I (11)
in the sense of (10).
The proofs of Proposition 21.14, Theorem 21.A, and Corollary 21.15 will
be given in the next section. These embedding theorems are the prototypes of
more general theorems, which will be summarized in A 2 (41) fT.
Remark 21.16 (The Importance of the Sobolev Embedding Theorems for
Boundary Value Problems). In Chapter 22 we shall apply the Sobolev embed-
ding theorems to boundary value problems for elliptic differential equations.
I n this connection, note the following.
240
21. Auxiliary Tools and the Convergence of the Galerkin Method
(i) F.or solving the .(irst boundary value problem for linear second-
order elliptic equations, we need the equivalence of the norms 11'111,2 and
1/ '111.2.0 on W 2 1 (G). This corresponds to the classical inequality of Poincare-
Friedrichs.
(ii) For solving the second (resp. third) boundary value problem, we need
the eq uivalence of the norms 11.11 1 . 2 and 11'11 t, 2 (resp. 11'11 1 . 2 and 11'11 t,*2) on the
Sobolev space W 2 1 (G). This also corresponds to classical inequalities due to
Poincare and J-"riedrichs.
In order to prove the equivalence of norms on Sobolev spaces, one uses
frequently the compactness of certain embeddings. In particular, the proof of
Theorem 21.A(c) will be based on the compactness of the embedding W 2 1 (G) C
L l( G).
(iii) For solving eiyenl'aille problems, one needs the compactness of the
embedding W 2 1 (G) c L 2 (G).
(iv) The reglliarity statement (d) of Theorem 21.A will be used in order to
prove the regularity of generalized solutions u. The basic idea is to show that
II belongs to the Soholev space W 2 '"( G) for sufficiently large m, i.e., u has
/enerllli:ed derivatives up to order m. Then, for j < m - N /2, we obtain II E
Cj(G), i.e., u has classical derivatives up to order j. In particular, in R 1 we have
that
W 2 "'(G) c: Cm-I(G),
and in 1R2 and J we have that
"'2'" ( G) C m - 2 ( G ),
nJ = I. 2. . . . .
nl = 2, 3. . . . .
(v) The boundary operator B is crucial for the formulation of boundary
('o,ldit;ons in the generalized sense. For example, let u E W 2 1 (G) and g E W 2 1 (G).
Then the boundary condition
II = g on rG
is to be understood in the sense
Bu = By in L 2 (cG),
i.e.. Bu(x) = Bg(x) for almost all x E eG.
The construction of B proceeds as follows. First. we prove the key inequality:
!lIlIt L ;l4"G) const 11"111.2
for all U E C1(G).
( 12)
This will be done in the following section, That means
f U2 dO const f ( 142 + .f (D 1 u)2 ) dx.
;'G G' ::a I
Now. let Bu denote the restriction of u: G -+ R to the boundary oG in the case
where u E C I (G). Then, (12) can be written as
II Bu 111.. 1 4;'(;) < const II u II.. 2
for all u E C1(G).
(12.)
21.4. Proof of the Sobolev Embedding Theorems
241
Since C 1 ( G ) is dense in Wl(G), the linear operator
B: C 1 (G) Wl(G) -+ L 2 (cG)
can be uniquely extended to a linear continuous operator
B: W 2 1 (G) -+ L 2 (oG)
such that (12*) remains true for all u e W 2 1 (G). This follows from the extension
principle in Section 18.12.
Explicitly, we obtain Bu in the following way. Let u e Wl(G). We choose a
sequence (u,,) in C 1 (G) such that
lIu" - uili. 2 -+ 0 as n -+ 00.
Then II Bu" - Bu II Lz(i'G) -+ 0, i.e.,
J (u" - Bu)z dO -+ 0 as n -+ 00.
G
In particular, if u e Wl (G), then Co(G) is dense in WZI (G), i.e., we can choose
(u,,) from Co(G). Since u" = 0 on oG, we obtain Bu = 0 in L 2 (oG), i.e., Bu(x) = 0
for almost all x e oG.
If u E Wz"'(G), then DSu e Wl(G) for 121 S m - t. Hence B(Du) = 0 in
L 2 (cG), i.e., we obtain (II).
Relation (11) is important for the first boundary value problem for linear
strongly elliptic equations of order 2m.
Corollary 2t.IS(b) is an easy consequence of Theorem 21.A(b). To show
this, let (u,,) be a bounded sequence in Wz'"(G). Then (DCl u ,,) is a bounded
sequence in Wl(G) provided 121 < m - t. Since the embedding W z l(G) S;
L 2 (G) is compact, there exists a subsequence, again denoted by (u,,), so that
u" -+ u in L 2 (G) as n -+ 00,
Du" -+ "'« in L 2 (G) as n -+ 00
for 121 S m - I. Hence W s = DClu. This implies
u" u in W 2 ",-1 (G)
as n -+ 'X),
i.e., the embedding W 2 "'(G) S; W 2 ",-1 (G) is compact.
In the same way, Proposition 2t.14(c) follows from Proposition 21.14(b).
21.4. Proof of the Sobolev Embedding Theorems
In order to make the basic ideas of the Sobolev embedding theorems as
transparent as possible, we begin with extremely simple proofs in RI. Further-
more, we shall use the following strategy of proof:
242
21. Auxitiary Tools and lhe Convcrgence or the Galerkin Method
(i) Let ,,*;(G) denote the closure of I(G) in Wp"'(G
(ii) We first prove the embedding theorems for 1f';"'(G).
(iii) Finally, we show that CI(G) is dense in W:z"(G), i.e., if';-(G) = W 2 -CG).
provided G is a bounded region in R N , N I, with iJG E CO. I .
This approah has been chosen for didactic reasons. In an introductory
lee-lure. it suffices to work with (he spaces 11'.(G) instead or W".(G). This
simplirlCS the approach coo!tiderably, because ('(G) is autolJlatically dense
in 11 .(G).. 1 < p < OCI.
Note that 1#';'(G) = W:'(G) holds since we assume that l)G e eft... Fr
regions (j with palhologi(.al boundaries. we have lI';'(G) c W;I(G), i.e., C'(G)
is nol dcn in W;'(G).
However, in the case of the spaces Wp.(G), Ihc structure of the boundary fiG
docs nOI play any role. This depends on the raet that. by construclion. Co'(GJ
is dense in W,,"'(G) and each function u e C'(G) vanishes on a boundary strip.
I n the following, we shall see that the Sobolev embedding theorems result
from simple integral inequalilie.-;, which are based Jn the Holder inequality
and the main formula of calculus:
u(.'(J - u(xo) = r x u'(tJdt.
J '-'J
An aJ1crnativc approach to the Sobolcv spaces W 1 -(G) via Fourier transform
will be considcrcd in Section 21.20.
21.4a. Proof of Proposition 21.14 in R I
Let G - ]a.h[ with -tX; < Q < h < 00. In the following all positive constants
arc dcnoled by the same symbol (t. Lei f = J:. We use the following norms
Uu Ii 1 = (f 142 Jx ) 112 . 8t,II(. == max lu(.'()I.
..I.
. UIII. 2 = (f (1,2 + U'2) dx Y'2 . lIuK 1.2.0 == (J !A'Z d.,< Y/2 .
./C'p I: The embedding ZI(G) S; C.(G) is -ontinuous.
Let u e Cc'(G). Since U(l') ::: O. we obtain the Icey fi,rmula
u(x) = fC u'(I)dt. (3)
By the Holder inequality with I-Irick in Section 18.6.
lIuN(. S f lu'141 S (h - Q)I/Z (f 11'2 tit ) 1/2
= (h - a) \11 nun 1.2.0 (h - a)111 HuN I. 2. (14)
21.4. Proof of the So bole v Embedding Theorems
243
This inequality shows that the embedding operator
j: Co (G) W 2 1 (G) -. C(G)
is continuous. The set q'(G) is dense in W 2 1 (G). By the extension principle in
Section 18.12, there exists a unique linear continuous extension
j: W 2 1 (G) -. C(G).
The typical argument (A). Let u E W 2 1(G). Then there exists a sequence (u,,)
in C(G) with
u,,-.u in W 2 1 (G)
as n -. 00.
Hence
j(u,,) -. j(u) in C(G)
as n -. OC:.
Since j(u,,) = u" and u" -. u in L 2 (G) as n -. oc, we get u = j(u) in L 2 (G), and
hence
u(.) = j(u)(x) for almost all x E G.
Thus, the operator j: W 2 1(G) -. C(G) is injective. If we identify u E W 2 I (G) with
j(lI) E C(G). then we can write W 2 1 (G) C(G) and
II u "c sell u III. 2 for all u E W 2 1 ( G).
Step 2: The inequality of Poincare-Friedrichs.
For all II E Co(G),
lIullt2 = f u 2 dt + lIullt2.0 s (b - a)lIull + lIullr2.0
s cllull.2.0'
by (14). A passage to the limit shows that lIull.2 S cllullt.2.0 holds for all
I' E W 2 1 (G). Hence n.ul. 2 and n'III.2.0 are equivalent norms on W 2 1 (G).
Step 3: The embedding W 2 1 (G) C 1 / 2 (G) is continuous.
For all u E Co(G), it follows from (13) and the Holder inequality that
J ,.
lub') - u(.)1 = JC u'dt
s (f: dr Y/2 (f: U'2 dt) 1/2 S Iy - .I"2I1ull,. 2
for all x, }' E G. For the Holder constant H(u), we thus obtain H(u) S lIu1l1.2'
Hence
lIullc' = lIulic + H(u) S cllull1.2'
As in Step I, this implies the continuity of the embedding W 2 1 (G) C I/2 (G).
Step 4: The embedding CII2(G) c C(G) is compact.
Let (u,,) be a bounded sequence in C 1 !2(G), i.e., lIu"lI c '1 < c for all n. Then,
244
21. Auxiliary Tools and the Convergence 0{ the Oalerkin Method
(or all x, ye G and all
lu.(x) - u.(y)' S clx - 11 112 , lu.(x)1 S c.
Hence (u.) is equicontinuous and uniformly bounded. By tho Arzem-Ascoli
theorem A I (24g), (u.) has a subsequence which is convergent in C(G).
Step 5: The embedding C(G) c: Lz(G) is continuous.
This follows from
"".1 = (f u 2 dl)"2 S (6 - a) 112 1111" for all u e C(G).
It follows from Steps 3-5 that the embeddinp
W 1 1 (G) s; C(G) and Wl(G) L 2 (G)
are compact.
21.4b. Proof of Theorem 21.A in AI
Let u e CI(G). By the mean value theorem,
(b-ar t J: utlt=u(x o ) for some xoe[a.b].
Instead of (13) we now use the key fiJrmula:
u(x} == I x u' tit + (6 - ar l f it dt. (IS)
"0 ..
Slp I: The embedding ZI(G) s; C(G) is continuous.
For all u e c«'(G), by the Holder inequality. it rollows from (IS) that
IluRc S; c(f U 12 dtY/2 + c(J U 2 tlt)'12
S 2clluU 1.1.
Since C((G) is dense in 1r 2 t (G), the assertion follows as in Step I of Section
21.4a_
Sfep 2: The embedding 1fi'(G) el/JtG) is continuous.
If U E CJ'( G), then (I S) yields
u(y) - u(x)" f: u' dt.
Now, t he assertion foUows as in Step 3 of Section 21.4a.
Slep 3: Generalied boundary values.
For all U E C.a'(G),
u(a)2 + u(h):Z S 2null. $ 8c 2 Uullf.2.
(16)
21.4. Proof of the Sobolev Embedding Theorems
245
By Step I, if u E 11 2 1 (G), then there exists a unique continuous function
j(u) e C(G)
with u(.) = j(u)(x) for almost all x e G. The generalized boundary values of u
are given by
j(u)(a) and j(u)(b).
For brevity, we write u(a) = j(u)(a) and u(b) = j(u)(b).
Note the following. Since the function u e 11'2 1 (G) can be changed on a set
of measure zero, we obtain the uniquely determined generalized boundary
values of u after changing u on a subset of G of measure zero, if necessary.
Since eX (G) is dense in 11 2 1 (G), a passage to the limit shows that (16) remains
true for all u e 11;1 (G), where u(a) and u(b) are to be understood as generalized
boundary values.
We now set
lIu1l1.2 = (f u'2dx + f udx 2Y/2.
lIullT..2 = (f U'2 dx + u(a)2 + U(b)2 Y/2 .
Step 4: lI.n T. 2 is equivalent to 11.11.. 2 on 11';1 (G).
Let U E CX,(G). From (A + 8)2 S 2A 2 + 28 2 and (15) it follows that
U(.'()2 S 2(f u' dtY + 2(b - a)-2 f udt 2.
and hence
fU 2 dtSC f U '2 d t + C fUdt 2,
f U dt 2 S c f u 2 dt.
This implies
cllull1.2 S lIullT.2 S dllull1.2. (17)
A passage to the limit shows that this inequality remains true for all u e
11;1 (G).
Step 5: II.IIT. is equivalent to 11.11..2 on 1I 2 1 (G).
Let u e CCC(G). From
U(x) = f: u' dt + u(a)
it follows that
IIull e S c(f u'2 d t Y/2 + lu(a)l.
246
I Auxiliary Tools and the ('onvergence of the Galcrkln Method
and hence
f 11 2 dr < c (f 11'2 dr + lI(a)2).
Together "'ith (16) this implies
c /I II II.. 2 < II II " ..2 d" u II.. 2 .
A passage to the limit shows that this inequality holds for all" E 11;1 (G).
21.4c. Proof of Theorem 21.A in (RN
Note that Proposition 21.14 follows from the inequality of Poincare
f-"riedrichs in Section 18.9 and the compactness theorem of Rellich in Section
19. 13.
In order to prove Theorem 21.A. let G be a bounded region in R.... with
N > 2 and (-:G E Co. I .
Step J: The embedding 11 ;'1 (G) c: L 2 (G) is compact.
This has been proved in Section 19.13.
Step 2: The generalized boundary operator.
For aJlII E Ca(G). we show that
f . ,,2 clO < C f [ "2 + .f (Djll)2 J dx.
f (; G J:: I
( 18)
This implies Theorem 21.A(e) according to Remark 21.16.
To simplify notation.. we consider the case N = 2. The general case proceeds
analogously. In a neighborhood ora boundary point x E cG.. we choose a local
(.. ,,)-Coordinate system. where the boundary has the local representation
'1 = "( ). - < < .
with the ("0. I-function h. i.e.. b is Lipschitz continuous (.....ig. 21.3).
Then there exists a p > 0 such that all the points (. PI) with
- < < .
h() - fJ < '1 h()
,,
/
I
aG
-'---.
Figure 21.3
21.4. Proof of the Sobolev Embedding Theorems
247
belong to G. Let u e CI(G). Then
f b( t
u(, b(» = I u"(,,O d" + u(, t),
where b() - fJ S t S b(). It follows from the inequality (A + 8)2 S 2A 2 +
28 2 and from the Holder inequality with I-trick that
i bC(t
u(, b())2 S 2p u..(, ,,)2 d" + 2u(, t)2.
bCt-'
Integration with respect to t yields
pu(, b())2 S 2 fbC(' [P2 u,,(. '0 2 + u(, ,,)2] d".
J bCt-'
Finally, integration over the interval [ - 2,] yields the key estimate:
f2 pu(.b())2d S 2 is (P 2 u: + u 2 )dx. (19)
where S denotes a local boundary strip.
First suppose that b(') is CI. Then the differential of arc length is given by
ds = (1 + b '2 )1/2 d.
Addition of the local inequalities (t 9) yields the assertion (18).
If b(.) is merely Lipschitz continuous, then the derivative b'() exists for
almost all all and b' is bounded. Thus, we also obtain (18).
Step 3: Equivalent norms on 11;1 (G).
In order to get insight, we will prove a more general result than Theorem
2I.A(c). To this end, we set
lIull = (f f (D J U)2 dx + p(U)2 ) 1/2,
G)-I
where p: 1t 2 1 (G) -+ R is a seminorm with the following two properties:
(i) There is a positive constant d with
p(u) S d II U III. 2 for all u e 1f 2 1 (G).
(ii) If u == constant, then p(u) == 0 implies u = O.
We shall show that:
lI.n is an equivalent norm on 1f;1 (G). (20)
Recall that a seminorm p has the same properties as a norm with the only
exception that p(u) == 0 does not necessarily imply u == O. By (i), it is sufficient
to prove that:
null 1.2 s c lIuli
for all u e 11;1 (G).
(20. )
248
2 J Auxiliary T 0015 and the Convergence of the Galerkin Method
I n the two special cases
p(u) = L u dx and p(u) = (f.,G u 2 dO ) I 2 .
we obtain Theorem 21.A(c) from (20).
The proof of (20.) will be based on the compactness of the embedding
11 ;'1 (G) C L 2 (G). Suppose that (20*) is not true. Then there exists a sequence
(II,,) in 11;'1 (G) such that
1111"".. 2 = I
and
I > " II U" II
for all n.
Since (u") is bounded in the H-space 11;'1 (G). there exists a subsequence. again
denoted by (II"). such that
II,,U in 1I 2 1 (G)
by Theorem 21.D. This implies
"" -. II in L 2 (G)
as n -. oc.
as n..... oc,
because of the compactness of the embedding 11 2 1 (G) C L 2 (G) (cr. Proposi-
tion 21.35(d)).
It follows from Hu,,11 -. 0 as n -. 00 that
Dju" -. () in L 2 (G) as n..... X.- for allj,
p(u,,) -. 0 as'J -. x.
Hence Djll = 0 on G for all j. By Problem 21.3a, u = const. Since p: 11;1 (G) -.
IR is convex and continuous. by (i) and Ip(v) - p(w)1 S p(t - w), it follows from
U" U in 11;1 (G) that
p(u) lim p(u,,).
"-7.'
by Proposition 25.20. and hence p(u) = O. because p(') is nonnegative. By (ii).
II = O. From Djll" ..... 0 and u" -+ 0 in L 2 (G) we obtain that
lIu"lI.. 2 -. 0 as n -+ x,
contradicting 111,,, 11.. 2 = I for all n.
Step 4: The embedding 11 ;1c(G) c C(G) is continuous in the case where
k > N/2.
This implies Theorem 21.A(d). In fact, if U E 11 1c+1 (G), then Dju E 1I 2 1c {G)
and hence DJu E C(G) for all j; therefore II E C I (G), etc.
As we shall show. the crucial condition uk > N /2'" results from
f dx 2
J. IxI 2IN -') < oc" ( I)
where B is a ball in R N .
Let C be a closed spherical cone with vertex at the boundary point x E c'G
and C c G (Fig. 21.4). Moreover, let R > 0 and n > 0 be the radius and the
21.4. Proof of the Sobolev Embedding Theorems
249
x
Figure 21.4
solid angle of C, respectively. We want to prove the key inequality:
lu(x)1 2 c(R,CW L f IO.uI2 dx for all U E c«>«h (22)
I s l Si Jc
Since cG E Co. 1 , there exist numbers R > 0 and 0 > 0 so that, at each bound-
ary point . E cG, there exists such a cone C with radius R and solid angle O.
Thus, from (22) we obtain
max lu(x)1 S c(R, 0) lIull..2 for all u E CX(G).
%.
This implies the continuity of the embedding 1I2i(G) s; C(G), by using exactly
the same argument (A) as in Step 1 of Section 21.4a.
We now prove (22). To simplify notation, let N == 3, k == 2, and x == o. We
choose a ex -function h: R -+ R with
h(t) = { I for t S!,
o for t > 1,
and we set
cp(r) = h(r/ R).
Then cp(O) = 1 and cp'(O) = cp(R) == cp'(R) = O. Integration by parts yields
U(O) = - tit (fPu).dr= tit r- l (fPu)rrr 2 dr.
Noting dx = r 2 dr dw, integration over the solid angle w yields
Ou(O) = L r-l(fPu)rr dx .
By the Holder inequality,
IOu(OW L r- 2 dx L (fPu);dx.
This yields (22).
If N > 2, then we use (21) and the formula
ufO) = (-It s: r'-N OU) r N - 1 dr.
250
21 Auxdiar) Tools and the Convergence of the Galerkin Method
21.4d. Density of Smooth Functions in Sobolev Spaces
We prove that
11 ;1 ( G) = W 2 1 ( G) ( 23)
provided (G E C.. i.e., we prove that CT (G) is dense in W 2 1 (G). The boundary
property ('(i E C was introduced in Section 6.2. In particular. the condition
i'G E C is fulfilled in the case where?G E Co. J. We first consider the case N = I
and G = ]a,h[ with -x < a < h < 'X. Let II E Wi (G).
St('P I: Localization of II.
As in Figure 21.5(a). let {V j L:r.. 2..\ be a covering of G, where the sets V j are
nonempty and open. and the boundary points are covered by VI and V J . Let
: t/l i : be a partition of unity with respect to { V j }. i.e..
t/l i E C (ll}) and L I/!.(x) = I. 0 < I/!j(x) < I.
k
for all x E G and all j. We set
II j = IIt/1j'
By the product rule for generalized derivatives (cf. Problem 21.3d). we get
, , .1. .1.'
u j = II 'Pj + uY'j'
.cncc II} E "21 (G, for all}.
Step 2: Translation of the localizations u j at the boundary points.
""or sufficiently small k > 0, the function
l' (x) = U t (. + k)
has the following two properties:
(i) l' E Y21 (H) for a sufficiently small neighborhood H of G.
(ii) (' III in W 2 t (G) as k -+ O.
Property (i) follo\\'s from
,,'(x) = u (x + k) for all x Ell.
In fact. for all cp E C; (H), we have
r lI;fx + kltpfxlt/x = f u(x)cp(x - k)dx
Ju G
= - f II, (x)cp'(x - k) dx = - r l4.(x + k)cp'(x) dx.
G JH
In this connection, note that x.-. cp(x - k) belongs to Co (G) for sufficiently
small neighborhoods H of G.
Property (ii) follows from
f (II, (.x + k t - II 1 (.x))2 + (u; (x + k) - U (x))2 d X 0 as k 0,
G
according to the mean continuity of L1-functions (cr. A 2 (32)).
21.S. Duality in a-spaces
251
U..
I .
I
. ) I ( .
a b
) I ( ) / (
/
VI U,
(a)
Figure 21.5
aG
(b)
Step 3: Smoothing of t'.
We set
v, = SeVe
By Proposition 18.35, it follows from v E W 2 1 (H) and G c H that L'r E C7.(R)
for small E: > 0 and
t'r -+ t' in Wl (G)
as E: -+ + o.
Using an analogous construction for the other localizations uj" we find that
C(G) is dense in W 2 1 (G) ifG c R I .
The general case G c R N can be treated completely similarly by using local
coordinate systems for the local representation of the boundary cG. Then the
translation in Step 2 corresponds to a translation with respect to the local
coordinate system. For example, if N = 2, then we use v(,'1) = Uj(''1 + k)
(cr. Fig. 21.5(b)).
In the same way we prove the density of CX'(G) in Wp'(G) with I < p < oc,
k = 1.. 2. ..., in the case where G is a bounded region in R N with N > 1 and
("':G E C. Hence
11 1c(G) = Wplc(G). (24)
This completes the proof of all the assertions stated in Section 21.3. 0
21.5. Duality in B-Spaces
We summarize some well-known facts about B-spaces. Let X be a B-space
over K = R, C. By definition, a linear continuous functional on X is a linear
continuous map I: X -+ 1(. We set
<I, u) = f(u)
for all u e X
and
IIfll = sup I <f, u) I.
1 1 11 S I
Hence
1<I,u)1 s 1I/IIIIuli
for all U EX.
Let X. denote the set of all linear continuous functionals on X. Then X.,
252
21. Auxiliary Tools and the Convergence of the Galerkin Method
together with the norm Uln, becomes a B-space over 1(, which i called the
dUdl 'pa,'e to X. We set X.. = (Y.)..
Let U EX. If we set
V(f) = (f.u)
for all .re X,
then U: X. -.. IK is a linear continuous functional. i.e., U e X.. and II U If =
1-IIU. Lettingj(u) = U.. we obtain a linear isometric map
j: X X...
Since j is injective.. we can identify X with the subsetj(X)of X**. In this sense.
we \\'ritc X X... The B...space X is caUed reflexive iff j is surjective. Then j
is a normisomorphism from X onto X.., and we can write X = X**. More-
over, if we identify u with U = j(" then we can write
<.f. u) == < u,l> for all f EX. , U eX.
Let X and Y be two B-spaces over K and let
.4: D(A) s; X -+ Y
be a linear operator.. Recall that this means that D(A) is a linear subspace of
X and
A(au + (Jv) = «Au + {lAv (or all u, v E D(Ah CI, fJ e k.
Suppose that D(A) is dense in X. Then the dlUll operator
A.: D(A.}!; Y. -. X.
is defined by the relation
(A.v, u> == (p, Au)
for all ve D(A*}. U E D(A
(25)
'.0 be precise, by definition, p belongs to D(A*} iff there exists awe X. with
(w, u> = <vA,,)
for all u e D(A).
Then we set A.v.: w.
Since D(A) is dense in Xt the element w is uniquely determined by v. For
aU u, () e D(A) and all fJ e K,
A*(tlu + /Iv) = mA*u + fJA.v. (26)
The operator A * is graph closed.
if A: X -. Y is linear and continuous, then the dual operator A-: Y. -+ X.
is also linear and continuous, and we have
IA*I IAN-
If, in addition, X and Y are reRexiy then
A*. = A
in rhe sense of x*" = X and Y.. == t
(27)
21.6. Duality in H .Spaccs
253
.
Tile lin,Jar cotttinuous operator A: X -+ Y is compact iff A.: y* -+ X. is
co"rpa(" (theorem of Schauder (1930)).
The most important theorem on dual operaton in B-spaces is the following
closed range theorem of Banach.
Let
! N(A) = {t' E X*; (v,u) = 0 for all II e N(A)},
N(A*) I = {v E Y: (u.v) = 0 for all u e N(A*)}.
Theorem 21.8 (Banach 1932). Let A: D(A)!;; X Y be a linear. densely defined,
grapl. cl(}o('d op(rator, where X and Y are B-spaces over 1<. Then the follo'ing
four condit ;()ns art nnltually equivalent:
0) R(A) is (Olosed in Y.
(H) R(A*) i. closed in X.,
(iii) R(A, = NA*) i .
(iv) R(A.) = 1 N(A).
This t heorenl is a generalization or the existence principle in Section t 9.30.
In particular.. if the range R(A) of A is closed. then we have R(A) = N(A*).l,
i.e... r he equation
Au =: b,
U E A
(28)
has a solution iff
(w,b) = 0
for all M,' with A. w = o. This is an abstract Fre4holm "lttrnative. As in SectiOn
19.30. the closednes or R(A) follows rrom the a priori estimate: .
HAun > <'Ilull for all u e D(A) and fixed c > o.
Note that A is graph closed.
The proofs of all the statements above can be found in Y osida (1965, M)
and Kala (J 966, M).
21.6. Duality in "-Spaces
Let A: X X be a linear continuous operator on the real H-space X. By
Section 21.5" the dUlll operator A *: X. .... X. satisfies
(A. f), u) = (v, Au)
for all v e D(A.), .. E D(A).
By Section 19.8, the adjoint operator A..: X -+ X satisfies
(A.'!'fu) = h'tAu)
for all 11 e D(A*.). U E D(A).
254
1. Auxiliary Tools and the ('onvergence of the Galerkin Method
We want to show that there is a close connection between X and X. as well
as A * and A * . This is gi\en by the following commutative diagram:
.\ A. It
Jj I J 1 (29)
A.
X* X*.
J-"urthcrmore" we need the following two formulas:
(JII.I'> = (1111") for all u" t' E X"
(A* Ju, r) = (A* ull') for all 14" rEX.
(30)
(31 )
Recall that L(X. r) denotes the set of all linear continuous operators A:
X }'.
Proposition 21.17. Let X he CI real H-spa(.e. Theil:
(a) /--or each II EX. there exists a unique linear cOlltinuous functional Ju OIl ..
\\'ith (30). The operator J: X -+ X* ;s linear.. bijectite, and IIJull = lIuli .for
aIlIlEX.
(b) II A E L(X. X)" thell (31) holds and A*' = J- 1 A*J.
The operator J is called the duality 'nap of the H-space X. Duality maps
of 8-spaces will be considered in Chapter 32. Such duality maps can be
multivalued.
PR(X)F. Ad(a). This is another formulation of the Riesz theorem in Section
18.11.
Ad(b). J-"or all II" rEX.
< ;t · J II. ,. > = (J II, A r > = (II I A I.) = (A *' 1111') = (J A *' II" I. >.
Hence A. J = J A * .
o
Identification Principle 21.18 (X* = X). Let X be a real H-spClce.
(i) Sillce tile dUellit y map J: X .-. X * is a norm;somorphism, "'e can ident if)' J u
\\'ith u. /,, this sense" "'e ,,,'rite X. = X. Then, we have
(U,l') = (ulv)
for all u, l. EX,
(32)
alld J he('onles tire identity operator on X.
(ii) For A E L(X.. X), \\le ohtain A* = A.... and eacll operator A E L(X, X*) can
he identified ,,,'itll an operator A E L(X, X).
In particular" we obtain that each H-space is reflexive.
21.7. The Idea of Weak Convergence
255
21.7. The Idea of Weak Convergence
For the development of Functional Analysis, the most important concepts
introduced by Hilbert (t 906a) were what he calls "continuity" and "complete
continuity," which correspond to what will later be called the "strong" and
"weak" topologies on Hilbert space.
Jean Dieudonne (t 98 t )
Theorem 21.C (Riesz (t 918)). A B-space is finite-dimensional iff the closed unit
hall is compact.
The proof will be given in Problem 21.2. This famous result tells us that in
infinite-dimensional B-spaces there exist bounded sequences (u II ), which have
no convergent subsequence. This lack of compactness in infinite-dimensional
B-spaces is responsible for many difficulties in the calculus of variations and
the theory of partial differential equations. In order to overcome this difficulty,
Hilbert (1906a) introduced the concept of weak convergence in H-space.
Definition 21.19. A sequence (u ll ) in the B-space X is called weakly cont'ergent,
I.e.,
UIIU
as n -+ 00,
iff
<I, u,,) -+ <I, u)
as n -+ 00 for all Ie X*.
The following result is crucial for proving existence theorems in the calculus
of variations and the theory of monotone operators.
Theorem 21.0 (Eberlein (1947), Smuljan (1940»). Each bounded sequence in a
reflexit'e B-space has a "'eakly convergent subsequence.
PROOf.
(I) Let X be a separable H-space and let (u,,) be a bounded sequence in X.
We choose a countable set {VA;}' which is dense in X, and we use the
diagonal procedure:
( u 1 11 t' 1 ), (u 1 21 t' I ), (u 1 31 VI)' . · . -+ ai'
( u 2 I 1 v 2), (u 2 21 v 2)' (u 2 31 v 2)' . · · -+ a 2' ... .
To be precise, since l(u,,1 v l )1 s lIu" IIlIv, II, there exists a subsequence of(u,,),
denoted by (u ll1 ), such that (u,,,lv,) -+ a l as n -+ 00. Furthermore, there
exists a subsequence (U211) of (u l ,,) such that (u211Iv2) -+ a2 as n -+ 00,
etc.
The "diagonal sequence" (w,,) with WIt == u"" has the property that
(\\',,1 VA;) -+ aA; as n -+ 00 for all k.
256
21. Auxiliary Tools and the Convergen(C of the Galerkin Method
Moreover, there exist numben a(v) such that
(will v) -+ a(v)
This follows from
I(w.. - w.lv)1 = I(w. - w.lv - VIJ + (w" - w.IVt)t
S Iw. - ",.lllv - Vt + I(w.. - w",lvt)1 < I;
for suitable Vi and all II, m n o (£). Note that (w,,) is bounded and {v t } is
denseinX.
Obviously, the map v...... a(p) is linear and, by (33
la(v)1 S IsB sup I w.D for all v EX.
.
as n -+ 00 for all veX.
(33)
By the Riesz tlaeorem, there exists awe X with a(v) = (wi v) for all" e X.
From l33) it follows that
ww
.
as ".-. OCJ.
(II) The proof for the general case can be found in Yosida (1965 t M),
Chapter 5. 0
Definilion 21.20. A B-space X is called ....'rit.tly ('.UIJtiex iff
It.. + (I - t)(}11 < 1
provjded lr'U = IIrJU = I u v, and 0 < , < I.
A B-space X is called loeaU, uniformly conl'ex iff for each c, 0 < e 2, and
for cach u e 4'(' lIuli = 1, there exists a cS(t.u) > 0 such that the following holds
for all u, v EX:
flu - vU > I;, lIuli "vi == I implies 112- 1 (u + t')U s J - b(t, u).
A B..pace X is called uniformly conve. iff it is locally uniformly convex and
() can be chosen to be independent of u.
We have the following implications;
X is uniformly COItWX => X is locally uniformly (.onvx
=> X;s strictly convex.
Each uniformly convex B-space is reRexive (cr. Section 10.1).
In every renexive B-space X, an equivalent norm can introduced so that
hoth X and X* arc locally uniformly convex (er. Troyanski (J 971 )).lhis result
simplifies some proots in the theory of monotone operators.
The importance of these notions with respect (0 weak convergence will
follow from Proposition 21.23{dJ below. Intuitively. in a uniformly convex
B..c;pace, the unit sphere is round (d. Fig. 10.1).
211 The Idca of Weak ('oaverrencc
257
EXAiPI.I 21.21. Each H-space is uniformly convex and hence locally uni-
formly convex.
PR()()f. 'fhis follows from the parallelogram identity
n l(u - ,')U 2 + lI!h' + v)U 2 = t( lIuU 2 + HvIl 1 ).
o
EXAMI)ll: 21.22. Let 1 < P < <XJ" and let G be a nonempty open set in R N ,
N > I. Then, for ull It 9 E l._p(G), we have the generalized parallelogram
inequality:
II!C( -- g)II" + IIl(f + y)U' < 1<11/111' + UgJlI' (34a)
provided 2 5 P < 00, and
tlltr -- g)U' + lIi{f + )1I4 < [!UIJ'IIP + Itgll')l'-l, (34b)
provided 1 < P < 2. Here, II' U denotes the norm in Lp(G), and p-I + q-I = I.
This inequality is due to Clarkson (1936). Hence the Lebesgue space L,,(G)
is ullifornrl j' conzJex for all I < I' < 00.
More generally, we: have the following result. Let Y be a 8-space over
K = R, C. for fixed p with 1 < p < 00, we set
Hull = ( f 11".11, ) 11'.
k L.I
where .\1 = I 2, ... or M = 00. Moreover, define
x={uelj Y:llu11<tX+
If we now equip X with the norm II. H, thn X is a uniformly convex B..space
over K iff Y is uniformly convex.
In particular, this implies that the B-spaces R N and eN, N > I, are uniformly
convex with resJX-t to the norm
( N ) ',p
IlxU p == L ){tl" ,
i-I
where .t :: (1'. .. , N ).
Moreover, it follows from the uniform convexity of the space Lp(G). 1 <
p < 'X" that the Sobolev spaces W,"'(G) are uniformly convex for I < P < 00
and m = O. 1, . . . . In this connection, note that the Sobolev space W p "'( G) is
norm..isomorphic to a closed linear subspace of the product space l..p(G)M by
means of the mapping u H (DClUIS,".
'rhe proofs can be found in Kothe (1969, M), Vol. I.
---.
Further important material on slrictly convex and local1y uniformly convex
B-paces can be found in the Appendix to Part III.
25H
21. Auxili:uy 'Iunls eIIullhe ('unverlfcnc..-c u(the (i:.lcrkin Mr,II41d
Rt,-c,.lIlhtt "" -+ " ..s " ,.(J in a h-Spc1CC , means "".. un -. () &IS ,. ... '1....
This usual "nvcrgcl'L-c ill X is sntnctilncs culled the: siron!! convcr#cncc
in ..:,)nl rn1 10 the wc.lk conver!!cncc. "("he fullowin1& results will he used
frc\lucntly.
Proposilion 21.13 (I)rt'pcrli\.-s or We,. k ("unvcrgcnc;c). I (', f,.,.) I,.. " !;f'II,,('IIC..' if'
,I,c' II.",,(.(, ..t (,,-{., K R. (. """(''':
(n) .,."c' .", rt'''11 (."",'('r"*,,,,(O(, "" -+" "... II.... . 1.) i"'plit' ,1,(# "'('Ilk C .,JI.rf'r"c'IIC"('
'f." ... " "," II -. 'X 10
(b, ,. din\.\' < ,"I.'.. ".,.. ,,... ""'1.k t.""f1('r"t""(I" .... -" "s n -. ,'f' ilt.pl;('s "It'
lrtl"1 ftJlUt"'"c'I'C.(' If" -+ " d.4( I' -. ,.., -. .,.
let If ".. - "(II" II .y.:. 11,(", (II.) is 1)''''lt/.tl "",I
J',I) -: lin1 11.,.11
(5)
.. ..,
tll,C'flrc"" t" IJ,JI,elfh ,,,,,I SI c,j"I'dll.'t t 1927)).
td) ,,. X is lo(Oc"')" ,,,'If'rml.,. ("'JlII-.".. "1"" fl. - ,.. II ,lIld "I'.. H ... n., n "S .. .. 'I.)
i",,,I;.. II" -. " .(I. " .... ., .
fc) ,. "fI-" (,. II..... ",.11,('" "'c'rt' ('.;I. cl .c"q"t'''(,(' (I..) i" ,Itt' ,.I,)sc',1 (tf)l"'I.,'.
",,11 fr : II.: slIc.l. ,11", ". II "." " . .. ,.,- ("lCt"rc"" ,/. Aftl:"r ( I ()J.1)).
trt I,."... "II "S,....... "' ,Iutll I' t;('. i" ,1,(' ..1".-'c',1 C',""'c't ,,,,,, ,,. :",,:. ·
Ig) ,,. (II.) ; /J""",/c',1 ill.\' ,,,,J f" 1I,."c' f'.'. i.", II ., F X "n,1 (, ,Ic"'."t' .,,(t' I) iI' ,Y.
...",.1, ,1,,11
<.I: II,,) 4 (.(.11>
" " ... ''1.' .I(w ,,11 .f 1>..
,I,e'" ",. - " "S II '1..-.
(h) If . is r('ll('.i.'t' dlt" ,h(. r('(.' (.ql,('uc"t' «.'.c'..») ("("u'('r,('s ./f,r ctl,.. EX..
II,c',' , IIt'rc' t'X;../... " ., e X ."c.,' , l'cll ".. - "" " cl.' I' .-. ,.,. I
(i) ("") is " Iu,,,"e/t.tl S('CI"t'''Coc' ill ,I't' 'f/l('.\:i,'(' 8-spl'(0(' X. Ilrc',. (II.) ,,(,... "
".(tclk I." ('o''''c'rU('I,' sr,hsc.'qllc"II."('. ·
,,: ill tIJ,/il i,,,,. (."(1/, "'('e,kl." (."'"'.-'1/(.''' ..."hs('CII'("'c.(' t" (II.) I,els I "t' san,('
/i,,,;1 II. "'('11 '." _. " el." " -+ ,,/ '.
tjt " li,ll"at's .(ro",
If. - "", ;11 X
"., " -. fl...
./ -..r ;".v ·
,",,, <./. I'. > .... <.I: I') "." " -+ "".
(k t ,,. ",- ;.-. r({Ic'.;"(', 11.t'1t i, .ff,II"".." Ii-,.,,,
" " -+ t I..
N. -. If iI' l
.f. .r ill.\' *
,h", <./.rf.>..... (.1:"),,...11-'4(,1.
". n..... (J.
U!f n - fl.'.
I).UN)I.. Ad..). rr ",. '''.,. then (./:"..) -+ <I.") for all .fE X... since .r is
c(),UinutJtls.
21.7. The Idea of Weak Convergence
259
Ad(b). Choosing a basis in X, we can identify X with 1<". Hence the weak
convergence means convergence of the components. This implies the strong
convergence.
Ad(c). This is a special case of the general Banach-Steinhaus theorem
A I (35). The proof can be found in Y osida (t 965, M), Chapter 5.
Ad(d). For u = 0 the assertion is obvious. Let u :;: O. We set
t' = u/llull
and
t'.. = u../IIII..11
for sufficiently large n. Hence II tIIII = II vII = 1 and
t'.. V
as n oc.
By (c), it follows from ('.. + r 2t' as n XJ that
2 = "2l' II S Ii m II v.. + l' II s Ii milt'.. + t'li
..-x
,,-x
slim IIt',,1I + IIl'lI = 2.
..-x
Hence II ('.. + t1I 2 as " . By Definition 21.20,
11('.. - t'li 0
as II 'XJ.
This implies u.. u as n x, since lIu..1I lIuli.
Ad(e). The proof, which is based on the Hahn-Banach theorem, can be
found in Y osida (t 965, M), Chapter 5.
Ad(r). This follows from (e).
Ad(g). Let g E X*. Use
I(g,u,,) - (g,u)1 = I(g - I,u..) + (/,u.. -I') + (I - g.II)1
s IIg - III( lIu..!! + lIull) + 1 <I. u.. - u) 1 < £
for suitable .f E D and all " no(e).
Ad(h). By the Banach-Steinhaus theorem A I (35), the sequence (u,,) is
bounded. Let
a( to) = lim < t'. u" )
for all t' E X*.
..-x
From la(r)l s IIt1I sup" Hu..11 it follows that a E X**. Since X is renexive, there
exists a u E X with a(t') = <t\ u) for alll' E X*.
Ad(i). This follows from Theorem 21.D and the convergence principle
(Proposition 10.13(4).
Ad(j). Since (u..) is bounded by (c), we get
1<1:..11,,) - <1.u)1 = 1<);. - .f,u,,) + <I.u.. - u)1
II);, - III sup" lIu..1I + 1<1. u.. - u)1 0 as n.... x,.
Ad(k). Use (j) and X** = X. 0
260
2 J Auxiliary Tools and the Convergence of the Galerkin Method
21.8. The Idea of Weak. Convergence
The crucial convergence theorem of Eberlein and Smuljan (Theorem 21.0) is
only valid in renexive B-spaces. More precisely, a B-space is renexive iff every
bounded sequence has a weakly convergent subsequence. In non reflexive
B-spaces one can frequently replace weak convergence by weak. convergence.
Definition 21.24. Let X be a B-space. A sequence (f,,) in X* is called ,,'eakl.\'.
cOlu'ergelll, i.e..
.f" .f
as '1 -+ X,
iff
< J. u > < .f. u>
as n -+ Y..) for all u E X.
Theorem 21.":. Let X be a separable B-..pace. Then, each bounded sequence (fit)
;11 X* has a ,,'eakly. conl'ergent sub.-;equence.
PROOF. Let : r,} be a countable dense subset of X. As in (33), we obtain a
subsequence ("'It) of (f,,) with
< "'", l' > -+ a( 1')
as n..... x for all VEX.
From l<tl,l'>1 < IIt'l/ sUPIt 11"',,11 it follows that a EX..
o
EXAMPLE 21.25. Let G be a nonempty bounded open set in n N , N > I. Let
X = L.(G).
Then. X. = Lx (G). We have
f,,f inX.
as n -+
iff
r f"udx - r fi,dx as n - oc for all U E X.
JG JG
Since C(G) is dense in L.(G), the space X is separable. However, X* is not
separable and both X and X. are not reflexive.
Let (f,,) be a bounded sequence in X*, i.e., the functions fIt: G -+ IR are
measurable and there exists a constant C such that
Ifll(x)l S C for all n and almost all x E G.
By Theorem 21.E, there exists a subsequence (fit') of (fll) with fit' J.
as n -+ 'X...
The proofs of the other statements can be found in Kufner. John, and Fucik
(1977, M), Chapter 2.
Proposition 21.26 (Properties of Weak. Convergence). Let X he a B-space orer
k. and let (fit) be a sequence in the dual space X*. Then:
21.9. Linear Operators
261
.
(a) If fff f in X. as n ..... 00, then fff f" as n ..... 00.
(b) If f,, f as n 00, then (};.) is bounded in X. and
IIfll S lim II};.II
.
(theorem of Banach-Steinhaus (1927)).
(c) If (f,,) is bounded in X. and if there exists an f e X. and a dense subset
D of X with
<I", u) <f, u) as n 00 for all u E D,
.
then};. f as n -+ 00.
(d) If the real sequences ( <f", u) ) are convergent for all u eX, then there exists
.
an fe X* such that f,, f as n -+ 00.
(e) It follows from
u" -+ u in X
as n ex:. ,
/,,, ...... f X.
- in
as n -+ 00,
that <fIt, U,,) ..... <f, u) as n 00.
.
(f) If X is reflexive then f" / as n..... 00 is equivalent to f" f as
n ..... 00.
PROOf. This follows analogously to the proof of Proposition 21.23 above. 0
21.9. Linear Operators
Let A: X ..... Y be a linear operator, where X and Yare B-spaces over 0(. Recall
that the following two conditions are equivalent:
(i) A is bounded, i.e., there exists a constant d > 0 such that
IIAull s dllull
for all u eX.
(ii) A is continuous, i.e., U II ..... U as n -+ 00 implies Au" Au as n ..... 00.
Proposition 21.27. Let X and Y be B-spaces over 1(. If A: X ..... Y is linear and
continuous, then A is weakly sequentially continuous, i.e.,
u u
"
as n 00
implies
Au" Au
as n..... 00.
PROOF. Let u" u. From
(v, Au,,) = (A*v, u..),
(v, Au) = (A. v, u)
it follows that
< v, Au,,) ..... < v, Au)
as n ..... 00 for all v e Y.,
i.e., Au" Au as n ..... 00.
o
262
21, Auxiliary Tools and the ConverttCl1Cc or the (jalcrkin Method
Proposition 21.28. LeI A: X .... X. be a lin('ar denlicn'lIin"ous of)('rat()r 01' Ilat'
IJ...ptl('(' X i.t'..
II" ..... U
tl. n -+ 'Y.J
;'"plie .,
Au" -a. A II
a. n -+ 00.
J.,,(,1t A ;. ('on' ;nu()II..
PR(X}':. If A is not continuou5. then there exist an c > 0 and a qucncc (lf ll )
with II" -. 0 as n -.. 00 and n Au.1I r. for &111 n. Let a" = ft"..H - &/2 and ",. = ('.U,..
Then. f',. ..... 0 and
II A 1'.11 tl".:..... + r"SJ
as n tX).
Otherwi. it follows from r'.. -.0 that A v.. O. i.e.. (Av.) is bounded. This is
" con1r:tdiction. 0
Proposition 21.29. I ('I A: X .... Y he a litltar 0p('rat or. wh('r, X anti Y (Ire!
B.pll("('S I),'('r 1\. Then:
(a) I)' A ;. ('o"JptlCI.. Ilu'" A i." ,'rollgly c,,,Uj,rll"u.'t, j.t'..
".. II
a. n -.. ,y)
;".pli('.
All" -t All
us n -+ C1J.
(b) COlu'('rsely, iJ' A i." slrullgl.v ('()"';n"()". (,,,d X i.-. reflexive. tMn A i." (t(Jmpa(OI.
I)Rf)()f. Ad(a). lei "II u. Then (u..) is bounded. Since A is compuct, there
exists u suhsequence (",..) such that Au., .-. was n -.. <YJ. By Proposition 21.27,
'" = Ar,. .rhe convergence principle (I)roposilion 10.13(1)) yields the conver-
gence or the entire sctluence, i.e.. AI'.. -+ ". as II (X>.
Ad (b). Let (II..) he bounded. Since X i reRexivc. there exists a ubsequcnce
with .',," U us II ..... 'Xl. Hence Au.- -.. Au as n -+ 00. i.e.. A is compact. 0
21.10. Bilinear Farms
We want to invetigate the relation between linear continuous operators
A: X -. X. on real B-spaces X and bilinear forms a: X x X -+ R. The key
formula is
(Au.!') = a(u. v)
for all u. I.t EX.
(36)
DelinifiOll 21..10. Let X be a real B-spacc. By a bilinear form on X, we
understand a map a: X )( X -+ A with
a(II, (IV + pw) = (I(I(u. v) + fla(u, w).
a(<<v + Ilw. u) == «6(". u) + fltJ(w, II),
for all II. ", M t E X and all tI, "e R. I n the following. c and d are positive
constants and u and " arc arbitrary clements in X.
21.10. Bilinear Forms
263
(i) a is called bounded iff la(u, v)1 s dllull IIt'li.
(ii) a is called positive iff a(u. u) O.
(iii) a is called strictly positit'e iff a(u, u) > 0 for all u o.
(i \') a is called strongly positive iff a( u, u) c II U 11 2 .
(\') a is called symmetric iff a(u, v) == a(v, u).
(vi) a is called compact iff, as n .... 00,
U II u and VII v implies a(u ll . t'lI) -+ a(u, v).
Let X be a complex B-space. Then the bilinear form a: X x X .... C is called
positive iff
Re a(u, u) > 0
for all u EX.
Similarly. a is called strongly positive (resp. strictly positive) iff
Re a(u, u) > cllull 2 for all u E X and fixed c > 0
(resp. Re a(u, u) > 0 for all u E X with u 0).
Proposition 21.31. Let X be a real B-space. Then:
(a) There exists a one-to-one correspondence bet"'een linear continuou.'" opera-
tors A: X -+ X. and bilinear forms a: X x X -+ R, which is git'en by for-
mula (36) abot'e.
(b) If X is rej1exive, then A is compact iff a is compact.
PROOF. Ad(a). Let A: X .... X. be linear and continuous. We set a(u, ['} =
(Au, v). Then
la(u, v)1 IIAullllt'li IIAlillullllt'li.
i.e., a: X x X -+ R is bilinear and bounded.
Conversely, if a: X x X -+ IR is bilinear and bounded, then, for fixed u EX.
the map t' a(u, v) is linear and continuous on X. Thus, there exists an element
in X., denoted by Au, such that
(Au, v) = a(u, t') for all (' EX.
From I(Au,l')1 s dllullllt'li it follows that IIAull s dlluJi. Hence the operator
A: X .... X. is linear and continuous.
Ad(b). Let A be compact. By Proposition 21.29(a), UII u implies AU II ....
Au. Hence UII u and VII [' implies (Au ll , VII) .... (Au, v), i.e.. a(u ll , ['11)-+
a( II, (,).
Conversely, let a be compact. Then the map vt-+a(u,v) is linear and con-
tinuous on X. Thus, for each u E X, there exists an element in X., denoted by
All, such that
(AU,t,) = a(u,v)
for all t' E X.
The operator A: X ..... X. is linear. Let U II --'-' U as n .... x . We want to show that
264
:! I. Auxiliary Tools and the C"on..ergence of the Galerkin Method
Au" -+ All. i.e..
sup I (All" - AU.l:) 1-+ 0
I II = I
if X =F :0:. Othcr\\'isc. there is an I; > 0 and there are subsequences. again
denoted by (u,,) and (1',,). such that
as n -+ x.
I < A II" - A II. r" > I > I:
and
II l"" II = 1
for all n.
Since X is reflexive. there exists a subsequence. again denoted by ((',,).
such that '-" - I' as II -+ x. This yields the contradiction (Au" - Au. r,,) =
a( II" - II. 1",,) -+ 0 as II -+ x.
By Proposition 21.29. A is compact. 0
Oefinition 21.32. Let A: X -+ X. be a linear continuous operator on the real
B-space X.
ti) A is called posit;r'f! iff the corresponding bilinear form a is positive. i.e..
(Au. II) 0 for all II e X.
(ii) A is called strictly positive. strongly positive, or symmetric iff a has the
corresponding property.
Corollary 21.33. l...et X he a real H -space. Then:
(a) There exists a Ol1e-to-one correspondence bet\\'een linear continuous opera-
tors A: X -+ X and bilinear forms a: X x X -+ R.. ,,'hich is git'en b.\' the
lo",ulll
(.411Ir) = a(lI,I')
.for a II u. rEX.
(37)
(b) A ;$ COI"PClCt iff (l is cOI"pact.
PR(X)f". This follows from Proposition 21.31 and from X = X. by the Identi-
fication Principle 21.18. 0
Definition 21.34. Let A: X -+ X be a linear continuous operator on the real
H-space X.
(i) A is called positire iff Cl in (37) is positive.. i.e... (Allfu) > 0 for all u E X.
Oi) A is called strictly positive, strongly positive, or symmetric iff a in (37) has
the corresponding property.
Let A: ...\' --. X. be a linear continuous operator on the real B-space X. or
let A: ...\' -+ X be a linear continuous operator on the real H-space X. By
Section 18.7i. we ha ve:
(a) A i. positire iff A ;. ,nolJotone:
(b) A is strictly positil'e iff A is strictly monotone:
(c) A ;s strongly positive iff A ;s strongly monotone.
2 t .12. Projection Operators
265
21.11. Application to Embeddings
Recall Definition 21.13 on embeddings.
Proposition 11.35. Let X, Y, Z be B-spaces over K. Then:
(a) If the embeddings X s; Y and Y s; Z are continuous, then so is X z.
If, in addition, one of the embeddings X s;; Y or Y s; Z is compact, then
so is X s; z.
(b) If the embedding X s; Y is continuous with the embedding operator j: X -+
Y, and if X is dense in Y, then the embedding Y. S; X. is also continuous
with the embedding operator i: Y- -+ X., where i = j-.
If, in addition, the embedding X S; Y is compact, then so is Y. S; X..
(c) If the embedding X Y is continuous, then, as n -+ 00,
U" -+ II in X implies u" -+ u in Y
and
u,,u in X
implies
u" u in Y.
(d) If the embedding X S; Y is compact, then, as n -+ oc,
u" u in X implies u" -+ u in Y.
(e) If the embedding X S; Y is continuous and X is dense in Y, then Y. is dense
in X. prot"ided the B-space X is reflexive.
PROOF. Suppose that the embedding X S; Y is continuous with the embedding
operator j: X -+ Then
(V,j(u» = (t', u)
for all v e Y., u e X.
The embedding Y. S; X. is to be understood in the natural sense that each
linear continuous functional v: Y.... K is also a linear continuous functional
on X (cr. Problem 18.6). Let
i: Y. -+ X.
be the corresponding embedding operator. Then
(i(v), u) = (v, u) for all v e Y., u e X.
Hence (i(v), u) = (t"j(u» for all v e Y. and u e X, i.e., i = j..
The remaining assertions follow from our results on linear operators in
Sections 21.5 and 21.9, and from Problem 18.6(ii). 0
21.12. Projection Operators
Projection operators are useful in order to describe the Galerkin method
in terms of operators. Furthermore, the functional analytic formulation or
266
21. Auxiliary Tools and the Convergence of the Galerkin Method
--- - L
p
II
\
'"
- --- L
,.
)..
M/
,\1
\
\
fa)
(h)
Figure 21.6
interpolation methods leads to projection operators (e.g.. the Lagrange inter-
polation). As an important application we will consider the method of finite
elements.
Definition 21.36. let X be a linear space (e.g. a B-space), and let Land M be
linear subspaces of X. We write
x = L (f) AI
iff each element II E ,,'( allows a unique decomposition of the form:
" = r + "',
r e L, \\' eM.
(38)
We call X = L $ M a (Iirefl slim (Fig. 21.6).
If X is an H-space and if (38) holds with
("",.) = 0 for all " e L. \\' e M.
then X = L (t) i\1 is called an orllrogollalllirecl sum (Fig. 21.6(b)).
The following proposition shows that there is a one-to-one correspondence
between direct sums and linear operators P with
p2 = P.
The simple geometrical interpretation of P will be given in Example 21.39
below.
Proposition 21.37.
fa) LeI X = L (!) J\I \,'illl (38). If "'£I set
Pu = .. for all u e X, (39)
tire" \\'£' obtain (I - P)u = '" }'or all u EX. and the operator P: X -+ X ;s
Iil1e"r and p2 = P.
2' , t 2 Projection Opcrat()l'S
267
A,f OreOI)(!r, M'e hal'('
f' ELi IT P r' = v
and
w e M jff (I - P)w = w. (40)
{b) C'OlUi(',.'ie!y, let P: X -+ X he a linear operator with p2 = P. Then we ohta;n
x = P(X) $ (I - P)(X).
",'II(re (38) 11l)/d." Ml;lh I' = p" anti \\' == (I - P)II fnr all u eX.
P,U)().'. Ad(a), From " = v + 0 we obtain PI? ::: v for all veL. Conversely, if
PI' = ". then I' E L, since P(X) c: L. This implies
p 2 u = PI' = ,] = I),, for all II e X,
i.e., p2 = P. From M' = 0 + w we obtain Pw -= 0 for all WE M, and hence
(I - P)M' = w. Converse]y. iffl - P),,' == "', then Pw :z: O. and hence w e M. by
(38) with" = ,\I.
Ad(b). Let p2 = I'. Obviously,
" = PII + {I -- P)II
for all U EX.
To show that this dccumposition i unique. suppt)SC that
u = t + M t ,
It E I)(X), w e (I - P)(X).
This implic't I' = I'!( for some . e X, and hence
J)" := p2 X = P.'t = v.
Mnrcnvcr we ohtain that \t' = (I - I')}' for some y € .%', und hence
(I - P))\. = (I - 1})2 y = (I - 2P + p2)y == (1 - P)" = "to
F'rum 1'1' :=:: IJ and I'M' == 0 we get
Pu = I)p + Pw = t'.
and hcnc n' := 2/.' ,- = (I - P)II.
o
I )efini.ion 21.38. I_ct , he 1 n-spClcc. Oy a proj('(" ion opera' or on X we
undcrlc'nd a linc:!r ('OIl';IJUt)U operator P: X -+ X with p2 = I), "I"hcn P(X)
is c,.lIcd thc corresponding projection space.
'rhc dircct sum .r = W (f) l is called ,t 'op,,/og;t,tll direct sum iff the carrc-
spondin operator I' in t39) is it projection operator on X.
Lei X be un H-SpICC, By un orthtJI/(}"I.II pr()j,'ctinn (,pl'rator on X we
understand a projection operator I): X -- X with
(/)uf{ I - I'):) = ()
r 0 r a II ". = EX.
(41 )
If I': X -+ X is it projection operator (rc!\p. orthogonal projection operatur),
then J - I): X -. X is ctlso a rrojcci ion orcrcltor (resp. ort hogonal projcction
npcriltf'r). 'rhis follows rronl (I - p)2 = I - 21' ;. p2 = I - P ctnd (41).
268
21. Auiliary Tools and lhe Convergence of'the Galerkin Method
finite element
__L
u
. ,
UI
I _---1
il2 b
I
a
I
b
lip
(a)
(b)
(c)
....iSu re 21.7
If X = L Q) M is a topological direct sum. lhen the linear subspaces Land
M are closed. This follows immediately from (40) and lhe continuity of P.
EXAMPLf 21.39 (Geometrical Interpretation). In R 2 and R J , projection opera.
tors and orthogonal projection operators correspond to parallel projection
and orthogonal projection, respecti vely (Fig. 21.6).
EXAMPLE 21.40 (Linear Interpolation and Finite Elements). Let X = C[atb]
with -fXJ < a < b < 00 and let a = 00 < ". < Q2 < ... < a. = b. For u e X,
we set
w=Pu,
where w is the uniquely determined piecew;. linear function which satisfies
w(a,) = u(a i ) for all i (Fig. 21.7(b».
Then P: X ... X is a projection operdtor.
All the piecewise linear Cunctions w: [at b] -+ R with
w(a,) = I for fixed ;
and '(Uj) = 0 for all j #- i are called j,nite elenlellis (Fig. 21.7(c}). The set of
these finilc elements forms a basis in the projection space P(X).
EXAMPLE 21.41 (I...agrange Interpolation). We consider the same situation as
in Example 21.40. But now we set
w = Pu,
where w is the uniquely determined polynomial of degree" which satisfies
w( 0;) == u(a,) for all ;. Here, II J.
Explicitly, we define
9o(x) = .. - a. )(x =- a2 l..:. .(x _ a.)
(ao - tit )(0 0 - il2)... (a o - all)
and analogously g I . . . · t 9.. Then
w = u(uo)Yo + ... + u(a.)y.,
and P: X ..... X is a projection operator. In fact, the uniqutness of the inter-
polcalion polynomial implies P{Pu) = PUt i.e., pZ = P.
21.12. Project ion ()pcra(nrs
269
EXAMPI.F 21.42 (Orthogonal Projection onto Finite-Dimensional Subspaccs
of l-t-Spaccs). Let L be a finite-dimensional subspace of the H-space X. We
choose an ort honorn1al basis t u I' .. . .. J',,} in LI and we set
n
Pu = L (u.lu}uj.
i r I
Then P: X -+ X is an orthogonal projection operator onto L.
EXAMI'I.E 21.4J t Projection onto Finite-Oilllcnsion,al Suhspaces of B-Spaccs).
Lel L be a finite-dimensional subspaL'C of the R-space )f. We choose a bais
U I"" . .Il,,} in L.. and we choose linC4\r continuou flinctionnls v I' ..., t.. on
, wit h
(Vi. II}) = f,fJ for (i = I.. ..... n. (42)
Such functjon(,I ..Iways do exist. In fact, since U I' . . ., u" are linearly indepen-
dent. we rind linear funclionals ('... . . ... V n on II with (42) by solving a linear
system of cq untions. Because dim L < 'XI, those Cunctionals are continuous on
L tnd hy the Hahn Banach theorem AI (34). they can be extcndClI to linear
cc.)nlinuous functionals on X.
If we set
II
Pu = L (r';, U)U it
i-I
then J): )(' .... . is a rrojcction operator onto L.
I)roposilion 21.44 (Ort hogonal Projection). Let I., be a closed linear "'lIhpa('r
oj' ,Ill' II-.puce X.. and let L L denote ,he orth(){/()nal compltmenl to L. 7'hfn:
(a) 1°/1t! ()rat(Jr P: X ..... X is an or",oYtJlwl projef,;on operator iff it ;. a
,('lfadj()int prajtt(" ion (}perat or.
(h) We hill'" III!) ortho{Jo"al dir(",.t ,..um .. = L E9 L 1.
(c) r"('re (':(;.-.,.-. (n(a('llj' one ortl,ogo,,(11 proj('(";('" operator P: X .-. X onto L.
In ,hi." ('tI.'k!" I - P: X .... X io" the ()rlhogonal proje(:tiotl operator ollin l I ,
fOllr, hl'rnJOr('..
Jlu -- lulI = inf lIu - vII
11 f= I.
for all u e X.
(d) r P: ,%. -+ X i... an orlh()g()nul projection t1fJ(#rator, Ih" P(X) is u ('losed
linear .-'Uh."pllce of ,Y, anti ,,'e ohtain the orthogonal direct sum
x = P(X) (f) (I - P)(X).
(e) f;arl, orl 1t"lInnal direct sunt ;. II/SO tllopological dir()ct .n'.
PR()()F. Ad(a). The condition (Pul{1 - P}z) = 0, i.e.,
(PIII(J -- P)z) = 0 and «(I - P)zIPu) =: 0 for all u Z E X
is cqui\'nlcnt to (l'ulz) = (ut pz) for all u. Z E X provided p2 = P.
270
:! I AU1(iliary Tools and the C"onvergencc of the Galcrkin Method
Ad(b), (C), (d). By Corollary 18.27, for each II EX.. we have the unique
decomposition II = r + "', I' E L, "' ELl, Now use Theorem 18.e and
Proposition 21.37.
Ad(e). Let X = A 8 be an orthogonal direct sum, where A and 8 are
linear subspaces of . Let P: X -+ A" be the orthogonal projection operator
onto the closure A of .4. From
U = I' + \''.
rEA.. \\' E B,
and A c A, as well as 8 £; (A)l, it follows that r = Pu, Hence A = A and
8=.4 1 . 0
EXA!\tPI..: 21.45 (('"om pact Operators). Let .4: X ...... X be a linear continuous
operator on the H-space X over IK with the adjoint operator A., Then:
,a) A is compact iff there exists a sequence (Aft) of linear continuous operators
A,,: X --+ X \vith dim R(Aft) < 'X; for all" and
lim IIA - Aftl! = O.
(43)
ft-41
(b) .4 is compact iff A * is compact.
PR(X)F. Ad(a). Suppose that (43) holds. Since dim R(A,,) < x.. the operator .4"
is compact. Let M be a bounded set in X. Since
II A II - 4" II II < I:
for all u EM, ,, > "o(e)..
and since the set A"( A-l) is relatively compact.. it follows, by the c-net criterion
A I (21 b).. that A (M) is relatively compact.. i.e... A is compact.
Conversely.. suppose that A is compact and let 8 denote the closed unit ball
in X. Since the set A (B) is relatively compact, there exists a finite-dimensional
subspace L" with
sup dist( Au, Lft) < II -1,
"E B
(44)
by the I;-net criterion A I (21 b). Let Pft: X -+ Lft be the orthogonal projection
operator onto Ln. We set Aft = PftA. By Example 21.42..
An" = L (A*II.lu)u;
;
for a II II E X
and
A: r = L (u; It') A * u i
i
for all r E X..
since (A: r! u) = (r I ,4 n u) for all u, I' EX. By (44),
sup "A 11 - p" A u II < ,,- 1 .
ME 8
This yields (43).
Ad(b). If A is compact.. then
IIA* - A:II = IIA - A"II-+O
as "-+.X .
21.13. Bases and Galerkin Schemes
271
Since dim R(A:) < 00, the operator A: is compact. Hence A* is compact
by (a).
If A. is compact, then so is A, since A = (A.).. 0
21.13. Bases and Galerkin Schemes
The following will be important for the proof of the convergence of Galerkin
methods. Recall that dist(u, f) = inf t ,. r 1114 - vU.
Definition 21.46. Let X be a B-space. By a Galerkin scheme in X. we understand
a sequence (Y,,) of finite-dimensional nonzero subspaces Y" of X with
lim dist(u, f,,) = 0
for all U EX.
" .. 2:
By a basis of X, we understand an at most countable sequence (w j ) of elements
"i eX, where finitely many WI' ..., "'" are always linearly independent and
X = u X"
"
with X" = span {WI' .. . , "',,}.
In Section 21.16 we shall show that the rapidity of convergence of the
Galerkin method depends on dist(u, fIt). Therefore, in what follows we are
interested in estimating dist(u, fIt). Such estimates will be given in (45) for
polynomials and in (46), (47) for finite elements.
EXAMPLE 21.47. Let {U"U2'...} be a complete orthonormal system in the
separable H-space X. Then (uJ) is a basis in X. If we set
"
P"u = L (uJlu)u).
J='
then PIt: X .... X is an orthogonal projection operator onto X" = span x
{u I'" .,14,,}. and (X,,) is a Galerkin scheme in X, since
dist(u, XII) = 1114 - P"ull -+ 0
as n.... 00 for all u E X.
By the Schmidt orthogonalization method. in each separable H-space X :I:
to} one can construct explicitly a complete orthonormal system (cr. Problem
19.16).
EXAMPLE 21.48 (Polynomial Bases). We set
W. (x) = x. -I , k = 1, 2, . . .
and X" = span {Wi'.... w,,}, i.e., X" consists of all real polynomials of degree
272
21. Auxiliary Tou1, and the Convergcnce of Ihe G.llcrkin Method
< n - I. Then (,,) is a hasis and (X,,) is a Galerkin scheme in the spaces
X = C[a, b], C"[a, h] Lp(a, b). W;I(tl 11),
Here.. n, = 1, 2, . . ., I P < r1J, and - f"/:' < a < b < 00. This foUow from the
fact thallhe set of polynomials is dense in all those space.
Let X C[a, h]. The famous classical approximatioll ,I'{'ore", (r J(lc'k.f\on
(1912) says that
distKhl. X".. d ::5; ; 11'''''11 x
n
(45)
for all U E C 1 [CI, h] with k :;;: I, ..., II, The constant C depends only on tl. h,
and k (cf. Cheney (1967. M), p. 141). lfcrc,
dist x tU4 X.. + 1 ) == ;nr Uu - vU x.
I' x,. . t
Formula (45) tens us that
lim disl(u, X.+ 1 ) :: 0,
n «.
and the convergence is faster, the smoother u is.
Proposition 11.49 (Existence of 8ases and (,alerkin Schemes). 14(!1 X b( a
s{'pclrahle B-.pace. Then:
(a) X "as a hil...;....
(b) If (W,.) is a hasi.4\ in X, "It'l (X..) "';1 h X" = span { w 1. . . .. , w" 1 is a Gtllerkin
S('heme in X,
(c) If (Y It ) ;s a GtlltrkilJ cl,e'n(' in X.. ,Ire" we can (:ollslrIlC" a ha."i. ;If X 1,J'
",elln.'i ,. ( Y,,).
PR()()F. Ad(a). let X :F to}. Then there exists a countable dense set t"j} in X
with fI, O. We set M'J = ",. If u 1 tI span { w J }.. then let W2 u].. etc.
Ad(h). This follows from Definition 21.46.
Ad(c). Note that U" Y. is dense in X and use (a). 0
The following approximation formulu (46) below plays a fundClmentul role
in the theory of finile elemenrs, a we will show in the next section.
EXAMP1..1: 21.50. Let G be a bounded region in R N with N > I and (1G e CO. I .
Let In = I. 2, .. . . We set
'"'", = (J L rDu,2 dX ) t/2.
(; If- 1ft
let Pm-I denote the set of all real polynomials on G of degree < In - I. Then
there exists it constant C > 0 such that
dist(l4" Pm..)) < C, "I",
for aU u e W 2 "'(G),
(46)
2 t .14. Application to Finite Elements
273
.
I.e.,
inf IIu - PIl",.2 s CI ul",
'.","-1
for all u e W;'(G).
PROOF. Let X == W 2 "'(G) and dim P"'_I == J. By (42), there exist linear continu-
ous functionals fl' ..., fJ on X such that, for p e P",-I'
Jj(p) = 0 for all j implies p = O. (46*)
As in Step 3 of Section 21.4c, the compactness of the embedding W 2 "'(G)
W2"-I(G) implies that
( ) In
lIuli = lul; + IJj(u)1 2
is an equivalent norm on X. By (46*), for given u EX, there exists a unique
p e P"'_I with
jj(p) = Jj(u),
j= I,....J.
I n this connection, note that in terms of a basis this is a system of J real
equations with J unknowns. Hence
lIu - PIl",.2 s Cllu - pll = Clul",. 0
21.14. Application to Finite Elements
Finite elements play a fundamental role in modern numerical analysis. OUf
goal is to explain the simple basic ideas. In particular. we want to show that
finite elements form Galerkin schemes. Moreover, we want to obtain infor-
mation about the accuracy of approximation in terms of the mesh size h.
Estimates of the type (47) below are very important in order to obtain
information about the rapidity of convergence of the Galerkin method, as will
be shown in Section 21.16. We consider the situation of Example 21.40 and
we make the following assumption:
(H) Let -00 < a = ao < at < az < ... < a" = b < oc and let
h = max, (a'+1 - a,).
A finite element is a piecewise linear function f: [a, b] .... R with f(a,) = I
for fixed i and f(a J ) = 0 for allj #: 1 (Fig. 21.7(c».
The corresponding finite element space F consists of the real linear hull of
all the finite elements. i.e., F consists of all real linear combinations of finite
elements (Fig. 21.7(b)). By Example 21.5, F is a finite-dimensional subspace of
the Sobolev space X == W 2 1 (a, b).
Proposition 21.51. Suppose that (H) holds. Then:
(a) There e.ists a constant D > 0 such that
distx(al, F) S h. D lIu1l2.2 for all u e Wl(d, b). (47)
Here, distx(u, F) = inf&'.Fllu - vllx.
274
21. Auxiliary Tools and the Convergence of lhe Galt..rkin MC1hod
(b) If wt fl)n.f;;tJer a .wquenct of ptlrlit;on.ft nf tl,(' interval [at I)], rh"11 lilt)
("orresplJnding sequenc:e (lf) o( /i,,;, e elenu.'nt .'ipt,('e... .forms a Galerkin .cl'ente
in X == W 2 1 (a, h) in ,he ('att ,,'h('r(' II" 0 a, n .... 00. i.e., the mtt,,,I, .4tiz() gt)es
't) zero.
Note thut (b) i an immediate consequence or (a) since W 2 2 (a./J} is dense in
X. The proof o((a) wiU be bused on the following abstract result of Proposition
21.52 below. We set
I ul. = (f' I "(II!)(xW dx )'11 . I u I: = (fo' I rill!) 1 2 d.,,)' 12 .
Furlhermore, we consider the affine transformation
. ..... (Xx + II
and define the operator
(Au)(x) = u(riX + fJ).
Our aSlImptions are the following.
(HI) Let Ie = 1.2,... and m == 0, I.,.. with k > m. The opera lor n: W2'(c t d)-.
K'2 M (t,d) is linear and conlinuou ror all (tt d: -00 < c < d < OCJ.
(H2) nil = u for aU polynomials of degree .s; k - I.
(HJ) n is ;.mncly invariant. i.e., An = nA for all A.
Proposition 21.51. Suppose ,hal (H f) ",rollg" (H3) h()/d. Tht" ,/.£'r(' ."(/t... "
("("''Qn' C JfWnJ;ng on k and", su("h t I,at
III - nul. C(d - <,'-"1,,1. lor all "e W:(,t/).
PIt(X )1:.
(I) First lel c == 0, II = 1. and denote Ihe norm on Wi'(O, 1) by 111/11:. 2 , Ir
-I be the se( ofalJ recl' polynomials of degree k -- I. Let I' e "'2'(0, ').
By (H2),
1J - nt) = (v - p) - n(., - p)
(or all p e P'-I'
Hence
II' - Ilvl: s I" - ""11:. 2 s 0/ - nu II.' - pll:t2'
By fxampJc 21 . SO,
I,' - n.,t: s 81 - nil inr IIJ - plI2 Clvl:. (48)
1',1\ I
(II) Affine transrormation from [0, 1] onto [c,t/). Let
( XC )
(AI') (.) = v -. - . - - - -- t
,I - (. d - ('
The substitution rule for intcgrl1l hows that
I A "1... = (J - cr "+1/21"1:.
x e [r.d].
21.15 Ric1 l haudcr rhenry nnd Ahlr:.CI Fredholm Alternatives
275
Lcttin " -= .4" and noting (113), we ohtain from (48) that
f" -- nlll.ftl - (.)"'-1/2 C'ull;(d - ct- 1/2 .
o
PR()()I ()) PRf)pc)SrIU)N 21.51. let U E Wf(a. hJ. We will use Proposition 21.52
wit h ,. =- (I.. (/ -= ili + I ' k = 2. m =- 0" ,.
'fhc embedding ",',/(a, h) (..( Q," J is continuous. Thus it Itlakes s.:nse to
define the operatof
11: ""ifa i , a, I) -+ W:(a,. a.. I)
in the following way: nu i the polynomiaJ of degree one which satisfies
fnu)(x) = u(:c)
for x = tilt (1'+1_
Obvjous)v. if u is a polynolnial of degree one, then nu = u. By Proposition
21.52.. for an II E K/?(ClI. (Ii + I ).
lu - nul.. s Cf".+. - tI,)2--luI2.
Hence
III - nul s C 2 h 2 (1. ".'uli.
Summa(ion over; and nJ = 0, , yield5 the desired estimate
If" - nlltfJ(..". C 2 (h 2 + 1,4)lun/f..b)
for all U E Wlf(l. h). Thi in1plics (47). Note that nu e f".
o
Finite clcmcnt in R N can he treated analogously. This will be considered
in the Appendix, A 2(58).
21.15_ R iesz-Schauder Theory and
Abstract f"redholm Alternatives
We cOMider the operator equation
Bil + Cu = b,
and the corrcponding adjoint equation
8*1' + C.v = 0,
ueX,
(49)
" E X,
(49*)
where B* and ('. denote adjoint operaton. We make the following
assumptions:
(H I) The operator B: X ..... X is linear, continuous, and bijective on the H..
space X over I( = R, C (e.g., B =-= I).
(H2) The operator C: X -.. X is linear and compact.
Theorm 21.F CRiesz (1918), Schauder (1930»). Suppo.,t that (H') and (H2) hold.
TheIl:
... -! :.t
276
21. AuxIliary Tools and the ('onvergence of the (Jalerkin Method
(a) Equation (49) has a solution iff (rib) = 0 J'or all solutions v of (49*).
(b) EquatioIJ (49) \\'itlt h = 0 and equation (49*) Ilave the .ame .finite nunlher oj
linearly independent solutions.
(c) If Bu + Cu = 0 implies u = 0, then, .It)r each b EX, equation (49) has (l
unique so/ut;o1l and the inl'erse operator (B + C)-I: X -+ X is continuous.
Statement (c) shows that
uniqr,eness i,nplies existence.
Moreover. statement (b) includes the fact that (49) with b = 0 has the unique
solution I' = 0 iff (49*) has the unique solution r = O.
Theorem 21.F is a special case of the general theory of linear Fredholm
operators in Section 8.4. In fact. since B is bijective, we have dim N(B) =
codim R(B) = O. Hence B is a Fredholm operator of index zero. Then the
compact perturbation B + C of B is a Fredholm operator of index zero. too.
This yields Theorem 21. F.
In the following we want to give a simple direct proof, which can be used
in an introductory lecture. In this connection, we shall use the open mapping
theorem A I (36). which says that each linear continuous bijective operator
1..: X --.. Y has a continuous inverse operator provided X and Yare B-spaces
over k.
PR(X)F. We set
S = B + C.
N = N(S).
R = R(S).
Obviously. the null space N of the operator S is closed. Below we shall show
that the range R of S is also closed. Therefore. we have the orthogonal direct
sums:
x = N(f)N 1 = RRl.
i.e.. codim R = dim R 1 (cr. A I (22k)). Recall that
ind S = dim N - codim R.
Step I: dim N < x. Let (un) be a bounded sequence in N.. i.e.. B"n + e"n = 0
for all II, Since C is compact. there is a subsequence (un') such that Cll n ' .... r as
II -+ 'Y_. Hence IIn' -+ - B- I r. By Theorem 21.C. dim N < x.
Step 2: dim N(S*) < 'Yj. Note that S* = B* + C*. Since C is compact. so is
C*, by Example 21.45. Furthermore. B*-I = (B- I )*. The same argument as
in Step 1 yields dim N(S.) < .
Step 3: We have the direct orthogonal sum X = R N(S*). In fact. it
follows from
(Sui t') = h'l S*l') for all I', (' E X
that N(S*) c R 1 and R 1 c: N(S*).
Using (S*)* = S. we obtain the orthogonal direct sum
.,\, = R(S*) Ef) N(S).
f50)
21.1 S. Riesl-Schauder Theory and Abstract Fredholm Alternatives
277
Step 4: R(S) is closed. By the proof of Corollary 19.59 and (50), it is sufficient
to prove the a priori estimate
IISull cllull for all u E R(S*) and fixed c > O. (51)
If (S I) is not true, then there exists a sequence (u,,) in R(S*) with lIu,,1I = I for
all nand SUIt -+ 0 as n -+ 'X). The same argument as in Step I shows the
existence of a subsequence with U,,' -+ w as n -+ 00. Hence
K' E N(S),
This contradicts (SO).
Step 5: The fundamental in variance of the index against small perturbations.
We show that if the operator D: X -+ X is linear and compact and if liD - CII
is sufficiently small, then
ind S = ind 1: where S = B + C and T = B + D.
w E R(S*),
II ". II = I.
The decisive trick consists in the construction of the operator
t( u, t') = Tu + I' for all u E N 1 , t' E R 1 .
Then t: N 1 x R 1 -+ X is linear and continuous. If T = S, then $ is bijective.
This is the ke) observation. In fact, it follows from S(u, t') = 0 that
Su + t' = 0, u E N 1 , I' E R 1 ,
and hence v = 0, Su = O. This implies u = O.
If II t - II is sufficiently small, then it follows from Problem 1.7 that t is
also bijective. Thus. if liD - CII is sufficiently small, t is bijectit'e. Hence we
obtain the two key relations:
dim T(N 1 ) = codim R 1 , N(T) c N,
and hence
codim T(N 1 ) = codim R.
In this connection, note that Tu = 0, U E N 1, implies t(u, 0) = 0, and hence
u = O. Therefore, N (T) e N.
We now choose a linear subspace M of N such that
N=N(T)M.
Then X = (N (T) E9 M) E9 N 1. Hence T is injective on M E9 N 1. Thus, we get
R(T) = T(M)E9 T(N 1 } and dim T(M) = dim M,
I.e.,
X = (T(M) EB T(N 1 » E9 R(T)l.
This immediately implies
codim R = codim T(N 1 ) = codim R(T) + dim M.
dim N - dim N(T) + dim M.
Hence ind S - ind T.
278
21. Auxiliary Tools and the Convergence of the Galerkin Method
Slep 6: ind(B + C") = ind B = O. In fact, the function I B + IC is con-
tinuous from [0.1] to L(X. X). By Step 5, ind(B + tC) = constant for all t.
Step 7: By Steps 3 and 4.
x = R(S) (f) N(S.).
This yields Theorem 2I.F(a).
From ind S = 0 it follows that dim N(S) = codim R(S) = dim N(S*). This is
Theorem 21. f'(b).
Finally, if dim N(S) = 0, then codim R(S) = 0, i.e.. S: X --. X is bijective and
S I: .,. -+ X is continuous, by the open mapping theorem A I (36). This yields
Theorem 2I.F(c). 0
Corollary 21.53. Suppose tlrat (H I) and (H2) hold. Let JJ e IK. Then:
,a) I,. JI i... nol a clraracI erisl ic nllmber of Ille pair (8, C). i.e., BII + pC II = 0
;nlpli('." = O. '''ell ,laere exi.ftlS ti,e conlilluous ina'trst operator (8 + JlC)-I:
X .\'.
(b) TI, .", ,1/ "'I"II'Uc.,,,,"11 ;,. n,,"b", u.f (II. tJ h", ItU j;;1I1I1I U,..t 4 'IHtului iun ""in,.
PRexn.. Ad(a). Replace C with JlC and use Theorem 2 t .F(c).
Ad(b).
(I) Let B = I. Suppose that there is a sequence (JJ,,) of characteristic numbers
wit h Jln ...... JI as " ...... x." Jl E K. and Jl" :;: j.l", if n :;: ,,1. Let (I + J1" C)u n = 0
"'it h ".. :f. 0 for all II. Note that JJn -# 0 and set i.. n = p; I. Then u 1 . . . . . "m
are linearly independent for all m. To show this suppose that III' . .., Un
are linearly independent and
II.. ... = a IU. + ... + ".. u...
Application of the operator C to this equation yields
(i' n + 1 - i... )alli l + ... + (i" n + 1 - i..n)anu.. = 0
and hence (Ii = 0 for all i. This contradicts U". I i= O.
By the Schmidt orthogonalization method in Problem t 9. t 6, we con-
struct elements r i such that (t'ilvj) = ij for all i.j and
i
1". = L h ilc II,
AI
for suitable b iA . If; > j. then
Cr i - CrJ = - i.;r; + =.
: E spa n { II I . . . . . "; - I :.
H f } Th .. I .
ence = E span t r I ' . . . . r. -I J' IS Imp les
IICr. - Crjll2 = 1;..;1 2 + 11:11 2 > 1;.d 2 .
Thus, (Cr..) does not contain a convergent subsequence. since inf;li"il > O.
This contradicts the compactness of the operator C.
2 t .16. The Main Theorem on Approximation-Solvability
279
(II) In the general case. apply (I) to the operator
B + pC ::: B(l + pB- 1 C).
o
Applications of the results in this section to differential and integral equa-
tions will be considered in Chapter 22. ·
21.16. The Main Theorem on the Approximation-
Solvability of Linear Operator Equations, and
the Convergence of the Galerkin Method
We consider the operator equation
Au = b,
together with the Galerkin equations
UE X,
(52)
PilAu" = Plllb.
U" E X"' n = I. 2. . . . .
(53)
We make the following assumptions:
(H I) X is a real separable infinite-dimensional H-space, and (w,,) is a basis in
x.
(H2) Let XII = span { K'I'...' w,,}. and let PIt: X -+ X.. be the orthogonal projec-
tion operator from X onto X".
Since Pili is self-adjoint. the Galerkin equation (53) is equivalent to
(Au,,1 WJ) = (bl "'J)' u" EX". j = I..... n. (53.)
The following notion is crucial for numerical analysis.
Definition 21.54. For given be X. equation (52) is called uniquely approx;ma-
tion-solt'able iff the following hold:
(i) The original equation (52) has a unique solution u.
(ii) There exists a number no such that, for all n no. the Galerkin equation
(53) has a unique solution u...
(iii) The Galerkin method converges, i.e., U III ..... U as n ..... 00.
Theorem ZI.G. Assume (HI) and (H2). Then, for each be X, equation (52) is
uniquely approximation-solvable in the case where the linear operator A: X ..... X
has one of the following four properties:
(a) A = I + K. where K: X ..... X is linear and k-contractive. i.e., II K II < I.
(b) A = I + C. where C: X ..... X is linear and compact, and Au = 0 implies
14 = o.
280
21. Auxiliary Tools and the Convergence of the Galerkin Method
(c) A: X -+ X ;s linear, continuous, and strongly monotone, i.e.,
(Aulu) > c lIull 2
for all u E X and fixed c > o.
(d) A = B + (", ",here B: X -+ X is linear, continuous, and strongly monotone,
c: X -+ X ;s linear and compact. and Au = 0 implies u = O.
In cases (a) and (c), no = I. In case (b), no is independent of b.
Corollary 21.55./n cases (a), (b), (c) abot'e. \ve hat'e the error estimates (54a. b, c).
respectit'ely, i.e.. for all n > no.
lIu - u"" < (I - II K II ) - I d is t ( u , XII ),
II u - u" II const dist ( u, X,,),
C!lll -u,,11 < IIAu" - bll.
( 54a)
(54b)
( 54c)
This shows that the rapidity of convergence of the Galerkin method
depends on dist(u, XII) in cases (a) and (b). Estimates for dist(u, X,,) were
obtained in Example 21.48 (polynomials) and Proposition 21.51 (finite ele-
ments). Roughly speaking, one has the following important principle:
The cOPlvergence of the Galerkin method is faster, the smoother the solution
u is.
Theorem 21.G contains important classes of operators and it allows many
applications. Note that (a)-(d) in Theorem 21.G correspond to the Banach
fixed-point principle, the Riesz-Schauder theory, and the theory of monotone
operators.
I n Chapter 34 we shall consider generalizations of Theorem 21.G to non-
linear operators. There we shall recognize the fundamental connection
between existence. stability, consistency, A-proper maps, and the unique
approximation-solvability of operator equations.
Lemma 21.. Assume (H 1) and (H2). Then, for all n and all u e X:
( a) II PIt II = I.
(b) Pull --+ u a. n -+ (f..).
(c) supu AI lIu - P"ull -+ 0 as PI -+ oc provided the nonempty set M is compact.
PROOF. Ad(a). Note that u = P"u + (/ - P,,)u and
II u 11 2 = II PIt U 11 2 + II (I - P,,) U 11 2 .
Ad(b). This follows from
lIu - P"ull = inf lIu - vII = dist(u,X,,),
t'E x"
and dist(u, X,,) -+ 0 as n -+ x, by (H I).
21.16. The Main Theorem on Approximation-Solvability
281
Ad(c). By (a), UP"u - P"vll S lIu - I'll for all u, veX, i.e., (P,,) is equi-
continuous on X. From Problem 19.14e it follows that the convergence in (b)
is uniform on M. 0
PROOF OF THEOREM 21.G. Ad(a). Note that p"u.. = u" for u" e X... Since
IIP..KII S UK" < I, equations (52) and (53), i.e.,
u + K u = b, u e X
and
u" + P"Ku" = P..b, 14.. e X.. (55)
have unique solutions by the Banach fixed-point principle (Theorem 1.A).
Furthermore,
ao CX)
11(1 + p..K)-11i = L {p"Kr S L IIKII'"
",-0 .-0
= (I - It K II> -I .
By (55),
(I + P..K)(u - u..) = u - p..u.
Hence
II u - u" II (I - II K II ) -I II u - PIt U II.
This yields u" -+ u as n -+ 00 and the estimate (54a).
Ad(b). By Theorem 21.F, the original equation
(I + C)u = b, u e X,
has a unique solution and (I + C)-I: X -+ X is continuous. Since P"u" = u" for
tI.. eX... the Galerkin equation is equivalent to
(I + p..C)u.. = P"b , u.. e X. (56)
Let 8 = {ue X: 111411 s I} and M = C(B).
Since the operator C is compact, the set M is compact. By Lemma 21.S6(c),
11(1 - P,,)CII -+ 0 as" -+ 00. Thus, according to Problem 1.7, there exists an "0
such that the operator
(I + p..C) = (I + C) + (P..C - C)
has a continuous inverse operator on X for all" "0 and sup" 11(1 + P"C)-III <
oc. Consequently, (56) has a unique solution. The error estimates and u.. -+ u
as " -+ 00 are obtained as in (a).
Ad(c). For all u e X"' we have p..u = u and hence
(P" A u 114) = (A u I PIt u) c II U 11 2 .
(57)
Thus, the operator P..A: X.. -+ X.. is strongly monotone. By Theorem 18.E, the
two operator equations
Au = b,
u eX,
(58)
(59)
P"Au" = P"b,
u..e X..,
ha ve unique solutions.
282
21 Auxlhary Tools and the Convergence or the Galerkin Method
If " > j. then it follows from (59) that
(Au"I"}) = (hl"j).
(Au..lu,,) = (hlu..).
(60)
(61 )
By (57)_
C lIu,,1I 2 < (hlu..) < IIhllllu"lI.
This yields the II prior; eSI;IJ,ate:
cllu"" s Ilh\!,
i.e.. (II,,) is bounded.
Let (u",) be a weakly convergent subsequence with
II,.. l'
as '1 -+ X.
By (60).
(.4 II" I"') -+ (h I \\. )
as n -+ oc,,' for all \\' E U X...
..
Since U" X.. is dense in X and (All,,) is bounded. we obtain
All", h
as n -+ x,
by Proposition 21.23(g). Since A: X -+ X is linear and continuous.
Au ,Ar
"
as '1 -. -y,
by Proposition 21.27. Hence Al = h. i.e., r = u. Since the weak limit (' is the
same for all weakly convergent subsequences of (u,,), we get
u,, u
as II -+ x,,
according to Proposition 21.23(i). I t follows from
(' Ilu" - ull 2 < (Au" - Autu" - II)
= (h I u,,) - (A 11" Ill) - (A u I rl" - u) -+ 0
a s II -+ 'X.-'
that
u" -+ u
as n -+ :x.
"'inally, the error estimates follow from
('1,11" - ull 2 < "Au" - Aullllu" - ull
and Au = h.
Ad(d). By Theorem IS.E. the operator B: X -+ X is bijective. Thus, it fol-
lows from Theorem 21.F that the equation
Bu + Cu = h.
U E X,
has a unique solution and (B + C)-I: X -+ X is continuous. Let A,u = Bil +
Cu - Ih. It follows from A,1l = 0 that lIuli < 11(8 + C)-llIltlllbli. Now, the
assertion is a special case of Theorem 29.8, by using X = X. in the sense of
21.17. Interpolation Inequalities and a Convergence Trick
283
the Identification Principle 21.18. Note that A, satisfies condition (S)o, by
Examples 27.2 and 27.3. 0
21.17. Interpolation Inequalities and
a Convergence Trick
We make the following assumption:
(H) The embeddings
Xyz
are continuous, where X, and Z are B-spaces over K = R, C. There
exist constants C > 0 and 0 < 8 < I such that
lIuU r S cnulll-'lIull
for all u EX.
(62)
Inequality (62) is called an interpolation inequalit}'. Such inequalities playa
fundamental role in the modern theory of nonlinear partial differential equa-
tions. Motivated by Standard Example 21.58 below, we call X a "very nice u
space and Z a "poor" space. In applications, the functions in X are smoother
than those in Y and, in turn, the functions in Yare smoother than those in Z.
The following sim pie result is crucial.
Proposition 21.57 (Convergence Trick). Suppose that (H) holds. Then:
(a) If(u,,) is a bounded sequence in the "very nice" space X, and (u ll ) is convergent
in the "poor" space Z, then (u ll ) is also convergent in the "nice 99 space Y, i.e.,
more precisely,
U" -+ u in Z
as n oc
implies u" u in Y as n -+ 00.
(b) If the embedding X Z is compact, then so is X Y.
PROOF. Ad(a). It follows from
nUll - u..fl r S Cllu ll - u..II-'lIuII - u",l1
s D lIu" - u",l1
that (u,,) is convergent in Y, i.e., U" -+ t' in Y as n -+ 00. Hence U II -+ V in Z as
n x), since the embedding Y c Z is continuous. Therefore, v = u.
Ad(b). If(u,,) is bounded in X, then there exists a subsequence with U,,' -+ u
in Z as n 00. By (a), "II' -+ u in Y as n 00. 0
STANDARD EXAMPLE 21.58 (Holder Spaces). Let G be a bounded region in R N ,
N I, and let 0 < fJ < (I < I. We set
X = C 2 (G), Y = C'(G), Z = C(G).
Then the interpolation inequality (62) holds with 1 - (J = PI7-.
284
2 J, Auxiliary Tools and the Convergence of the Galerkin Method
A bounded sequence (un) in X is equicontinuous by the Holder condition.
Thus, (u,,) is relatively compact in Z by the Arzehi-Ascoli theorem AI (24g).
Hence the embedding X Z is compact. From Proposition 2 t .57 we obtain
that the embedding X £; Y is also compact.
All the results above remain true for (% = 1, i.e., X = Co. 1 (G).
PR(X)F. We show that (62) holds. By definition.. the Holder constant H 2 (u) is
the smallest constant such that
114(:<) - u(y)l S H:r(u)lx - }'12
The norms on X, Y.. and Z are given by
for all x, y e G.
Ilull z = max lu(x)l,
XEG
lIullx = l1ullz + Hca(u),
,lull y = lIuH z + H,(u).
Let u e X. From
114(:<) - 14(.\')1 = lu(x) - u(y)1 1 - 8 ,u(x) - u(y)1 8
H 2 (U)1-'lx - yl'28I1ull
we obtain H,(u) 2 9 H 2 (U)1 -9 Uull. Hence
lIull t' < (I + 2 8 ) II 14 111- 911 14 II .
o
ST ANI)ARD EXAMPLE 21.59 (Lebesgue Spaces). Let G be a bounded measurable
set in R N , N > I. and let I < r < p < q < 00. Then there exists a 8: 0 < 0 < 1
with
t
p
1-0 0
- - -- + - .
q r
We set
.r = Lq(G), Y = Lr(G),
Then the interpolation inequality (62) holds.
Z = L,,(G).
PR(X>.... By the Holder inequality,
Ulul l -'l u l 8 l1, lIull-8I1u".
o
STANDARD EXAMPLE 21.60 (Sobolev Spaces). Let G = R N , N I, or let G be
a bounded region in R N with iJG E Co. 1 . Let t S P S 00, and let k, m be integers
with 0 < k < m. For G = R N , let p < 00. If we set
X = W,'"(G), Y = W,i(G), Z = L,(G),
then the interpolation inequality (62) holds with 8 = 1 - kim, Le.,
111411 C 1 1 1411"'''' 1 11411 1 -'''", for all 14 e W"'(G). (63)
'.p 1ft." ,
21.18. Application to the Refined Banach Fixed. Point Theorem
285
In particular, we have
I I u " S C II U 11 1 / 2 II U 11 112
I., 2."
The constant C depends on G, k, m, p.
Cor all u e W,2(G).
PROOF. This is a special case of the Gagliardo- Nirenberg inequalities which
will be proved in Section 21.19. In Cact, it Collows from (14) that
} lilY u II, s const II u II .-':.''''II u II -.' - r.,,,, ·
ISI=r -,
Noting lIull, S lIull",., and summing over r = 0, ..., k, we obtain (63). 0
STANDARD EXAMPLE 21.61 (C"'-Spaces). Let G be a bounded region in R N with
cG e CO. 1 and N I, and let k, m be integers with 0 S k < m. If we set
X = C"'(G), Y = C'(G), Z = C(G),
then the interpolation inequality (62) holds with 8 = 1 - kim, i.e.,
lIullClrce» s Cllull':c)lIullcA:'" for all u e C"'(G). (64)
PROOF. Use (63) with p = OC;. 0
STANDARD EXAMPLE 21.62 (C-Spaces). Let G == R N , N I, or let G be a
bounded region in R N with aG E Co. I. If we set
X = Wf(G), Y = C(G), Z = L 2 (G),
then the embedding X S;; Y is continuous and (62) holds for 8 == it i.e.,
lIutlccCt S CltuIlH2I1ull/2 for all u e Wf(G). (65)
Here, C(G) denotes the set of all continuous functions u: G -+ R such that
lIullccC:. = sUPJr.lu(x)1 < 00.
The proof will be given in Section 2 t .20 via F ourler transform.
21.18. Application to the Refined Banach
Fixed-Point Theorem and the Convergence
of Iteration Methods
We consider the operator equation
u == Au,
ue
(66)
together with the iteration method
u... I = Au",
n = 0, I, . . . .
(61)
286
21. Auxiliary Tools and the Convergence or the Galerkin Method
We make the following assumptions:
(H 1) Interpoiatio'l inequality. The embeddings
X c y c: Z
are continuous, where X, Y, and Z are B-spaces over K = R, C. There
are constants C > 0 and 0 < 8 < J such that
Jlullr S Cllun1-8t1ull
for all U EX.
(H2) Boundedne.s with respect to the u very nice" space X. We have
sup" II u"" x < ,'X,..
(H 3) k-C ontract;l,it y with respect to the "poor" space Z. For all 11,
It u" + I - u" II z < k II u" - U" -I It z + a".
where 0 < k < I. a" > 0, and Lila" < 'XJ.
(H4) Continuity with respect to the "nice" space Y. The operator A: y.-. Z is
continuous.
Theorem 11.H. Suppose that (H I) through (H4) hold. Then (u,,) converges in Y
to a solution of the original equation (66).
PROOF. From (H3) it follows that
lI u 2 - u.lll < k lIu. - uollz + a.,
""3 - u 2 11 z < k 2 nUl - uollz + ka l + a2.
IIU 4 - uJll z k J lIu. - uollz + k 2 a. + ka2 + aJ' etc.
Hence
L lIu,,+. - 1I"lI z < oc.
"
Thus, (II,,) is a Cauchy sequence in Z, i.e.,
u" -+ u in Z as n -. OCJ.
By (H 1) and (H2),
u" -+ u in Y
as n -+ 'XI.
Letting n -+ OCt it follows from (67) and (H4) that Au = u.
o
Theorem 21.H can be used in order to give existence proofs for systems of
conservation laws (see Section 33.6).
21.19. The Gagliardo-Nirenberg Inequalities
The following interpolation inequalities represent an important tool for the
investigation of nonlinear partial differential equations. Our lirst goal is the
21.19. The Gasliardo-Nirenberslnequalities
287
following inequality:
IID J ull2r S ClluIl12,lIull/2,
j= t,...,N,
(68)
where
p-l + q-l = r-1,
I S p, q, r S 00,
(69)
with the convention oc -1 = O. In particular, this means p = r if q = 00. Note
that x = (1'. .., N) and DJ = a/oJ. The following simple result is basic for
the general approach.
Lemma 21.63. Let N I. Under the assumption (69), we have
IID J ull2r S CIiDlull/2I1ull/2 for all u e C(RN), (70)
",here j = I..... N. The norm refers to R N , and C is a constant.
PROOF.
(I) First let p, r < 00 and I S q S 00. We set D = DJ. Integrating the identity
(Du)2r = D(uDu(Du)2r-2) + (I - 2r)uD 2 u(Du)2r-2
over R't', we get
f (Du)2r dx = (I - 2r) f uD 2 u(Du)2r-2 dx.
By the Holder inequality for three factors,
f (Du)2r dx S (I + 2r)lIullfIlD2ulI,,(f (DU)2 r dx )' -I/r.
This yields (70).
(II) Let p = r = OCt A more careful elementary argument shows that the
constant C in (70) is independent of r. p, q e R (cf. Friedman (1969, M).
(9.14)). Letting r, p -. 00 and q -. 00 in (70). we obtain the assertion. In
this connection, note that
lim IIflis = 1I/1I2j
S-ot
if f e Lx,(K) and meas K < 00. Moreover. note that the function u e
CO(R N ) vanishes outside a compact set K. 0
Proposition 21.64. Let G = R N , N I, or let G be a bounded region in R N with
cG e co. I. Under the assumption (69), inequality (68) holds for each
u e W,2(G) L.(G).
The constant C in (68) depends on G, p, q.
PROOF.
(I) Let G = R N . We set D = DJ. By Lemma 21.63,
II Du U 2r sell u II }2, II u II /2
(71 )
288
21 Auxiliary Tools and the Convergence or the Galerkin Method
for all II E C;(IRJ\'). By using a density argument. we want to show that
(71 ) holds for more general functions u.
(I-I) Let p. r < oc and q < Xl. Let U e W,2(R N ) n L,(R N ). The set C (R N ) is
dense in W,2(IR N ). To be precise. Problem 21.6 tells us that there is a
sequence (u,,) in C(RN) such that
Ilu" - uIl 2 "......0 as n.-. oc,
and since U E Lq(R N ).. we also obtain that
II U" II q < const II u II q for all n.
By (7 I )..
II Du" - Du", 11 2 r Sell U" - Ulft II II u" - Ulft II ;/2
for all n. m, and hence
Du" ..... v in L 2r (R N )
as '1..... X.
This implies (' = Du by Problem 21.4. Again by (71 ),
" DI',," 2r const II u" II r. 2 , II u II /2 for all n.
Letting n .-. . we obtain (73).
(1-2) Let q < X" and hence r < XJ. In this case we choose a sequence (u,,) in
CO'(R N ) such that
II"" - ull q -+ 0 as n...... oc and lIu,,1I 2 ., constllull2.p for all n.
(1-3) Let p = q = 'X, and hence r = 'X. Let u E W;(R N ). Since Ccf(R N ) is not
dense ill L-x (R N ), we need another argument as above. By Problem 2 t .6.
there exists a sequence (u,,) in C(RN) such that
lIu,,1I 2 .':1: < constlluII2.
for all n
(72)
and
Du,,(x) -+ Du(x) as n --+ oc
for almost all x E R H .
From this convergence and sup" II Du" II 00 < 00. it follows that
II Du II S lim II Du" II 00'
"-00
according to Problem 2 t.1 O. By (71).
II Du" II <X sell u" tI /. II u" II 2 S const II u II 2:c II u II 2 .
and hence II Du II x const II u II 2cx> II u II 2 .
(1-4) Summarizing we obtain that the inequality
IIDull2r constllull2,lIull'2 (73)
holds for each
U E W,2(R N ) n L,(R N ).
(II) Extension trick for G #: R H . Let
v E W,2(G) n L.(G).
(73.)
21.19. The Gagliardo- Nirenberg Inequalities
289
By the universal extension theorem from Problem 21.8b, there exists a
function u: R N -+ R with (73.) such that
u(x) = v(x) on G
and
lI u Il2." S const II vII 2.", lIull, S const IIvll"
where the norm for u and v refers to R N and G, respectively. The constants
are independent of u and v. From (73) we get
IIDvll 2r S const IIvll/.lIvll;/2,
since II Dvll2r S II Dull2r' This is the original inequality (68). 0
Remark 21.6S (A General Strategy). The proof of Proposition 21.64 is based
on a general strategy via density and extension arguments. Roughly speaking,
by using this strategy, it is enough to prove inequalities for functions U E
CO(R N ) in order to obtain inequalities for Sobolev spaces with respect to both
R N and bounded subregions G of R N .
The following results should explain the usefulness of this strategy.
EXAMPLE 21.66. Let G be given as in Proposition 21.64. Let I s p s 00, and
let k, m be integers with 0 S k < m. Then
L IIDGlull" s const null:: lIull -le/". (74)
1«1- Ie
for all U e W,,"'(G). The constant depends on 0, p, k, m.
PROOF. Let u e CO(R N ). We set
f(s) = L II DClUIl".
1«1- s
By Lemma 21.63 with p = q,
f(s) S constf(s + 1)1/2f(s - 1)1/2, S = .,..., m - 1.
By induction, this implies
f(k) S constf(m)".f(O)I-Ic/''.
This is (74), since f(m) s lIun...".
Now proceed as in the proof of Proposition 21.64. 0
EXAMPLE 21.67. Let G be given as in Proposition 21.64, and let k, m be integers
with 0 S k < m. Then
L lIo-uIl2""i S const lIull lIull -Ie,,,, (75)
1«1- Ie
for all u e W 2 "'(G) " L(G). The constant depends on G, k, m.
290
21. Auxiliary Tools and the Conver8ence of the Galerkin Method
PROOF. Set
I(s) = L IID2 u Il 2 ...,.
1«1- J
and conclude as in the proof of Example 2 t .66.
o
More general Gagliardo- Nirenberg interpolation inequalities are sum-
marized in the Appendix A 2 (54).
21.20. The Strategy of the Fourier Transform
for Sobolev Spaces
Let Ii denote the Fourier transform of u, i.e.,
II(X) = (2n)-N/2 f ei(Jrb'>u(y)dy.
JR
Important classical properties of the Fourier transform are summarized in the
Appendix A 2 (74). We set
111111:.2 = (fR' (I + 1}'1 2 "')!ti(y)1 2 d},y12.
Our first goal is the following crucial inequality:
111'":. 2 < 111111.... 2 S ('UuU:. 2 for all u e Ct(R'). (76)
Lemma 21.68. LeI N > I, nl = O. " ... . Then Illere is a conSlanl c depending
on Nand nl surh II.al (76) holds.
PROOF. Let N = I. The general case N > I proceeds analogously. Obviously,
there is a c > 0 such that
I + I '12 + ... + lyl2111
I . · I + IYI2'" < (.'
for all )' e fR.
Let u E C (IR). The Fourier transform
u(x) = (2n)-1/2 fR eiJrYu(y)dy
(77)
yields
D"u(.'() = (2n)-1/2 JR eiJrY(iy)'u(y)dy.
By the Parseval identity A 2 (74b),
fR IO'u(x)1 2 dx = fR l(iytu(y)l2 d)'.
Summing over k = 0, . .., m, we obtain lIuU;.2' and hence (76).
o
21.20. The Strategy of the Fourier Transform for Sobolev Spaces
291
Remark 21.69 (A General Strategy). If we combine Lemma 21.68 with the
extension strategy of Remark 21.65, then it is possible to prove quickly a
number of important properties of Sob ole v spaces. The following proposition
should illustrate this.
Proposition 21.70. LeI G = R N , N > I. or let G be a bounded region in R N ",;th
cG e Co. I. Suppose thaI m > N /2. Then:
fa) The embedding W 2 "'(G) c C(G) is continuous, i.e.,
II u II cu,) S const II u II 1ft. 2
(b) For all u e Wf(G),
for all u e W 2 "'( G).
(78)
lIullc.c) S const lIull,22I1ull }12, (79)
More precisely, the functions u in (a) and (b) belong to C(G) after cl,anging
their l'alues on a set of N-dimensional Lebesgue measure zero, if necessar}'.
PROOF. Ad(a).
(I) Let G = R'IV. We will critically use that
r dy < 00 if m > N /2.
JR 1 + 1)'1 2 '"
Let u E Co(R N ). In the following all the integrals are taken over R N . The
Fourier transform
u(x) = (2nfNIZ f e,(xl'>Ii(y)dy
yields
f f lu(Y)I(1 + LrI2"') '/2 dy
(2n) N1Z I u(x)1 S I li(y)1 dy = (I + 1)'1 2 "')1/2 ·
By the Holder inequality and Lemma 21.68,
(2n) NI2 Iu(x)1 S (f hi(y)lz(I + IYIZ"')dy Y'Z (f I +d'IZ"' Y'Z
const II u II:. 2 const II u II M. 2 .
This implies (78) for all u e CO(R N ).
Since Co (R N ) is dense in W 2 "'(R N }, the assertion follows as in the proof
of Theorem 21.A(d).
(II) Let G #: R N . By the extension trick in the proof of Proposition 21.64,
inequality (78) holds for all u e W 2 "'(G). Now use the fact that C'"(G) is
dense in W 2 "'( G), by Theorem 21.A.
Ad(b). By Remark 21.65, it is sufficient to prove (79) for all u e C:(R N ). To
this end, we use critically that:
r dy I r d." C
JR\" I + s 2 rYI 2N == S JRIW I + Ixl 2N = S
292
21. Auxihary Tools and the Convergence of the Galerkin Method
for all s > O. Let u E Co (R.a.'). The Fourier transform yields
(2nl''VI II (x)l2 < (f lai(Y)1 dYY
= (f lu(.r)I(1 + slL\'1 2N )I/2 ) 2
(I + s 2 IYI2N)t/2
S f ' U I2 (1 + s2JY12N)dy f dy .
1 + s21Y12'v
By Lemma 21.68.
IIIIIIf.(R\) S const(s-I /lull + SItU".2)
noting that J IIil 2 dy = J IUJ2 dx = tlull. Letting s = lIuIl2/lluIlN.2' we obtain
(79) for all u e Ct(R N ). 0
21.21. Banach Algebras and Sobolev Spaces
Definition 21.71. Let X be a function space. Then. X is called a generalized
Banach algehra iff X is a B-space such that u, v E X implies Ul' e X and
1I1l11I s; Cllulllll'lI.
(80)
where C denotes a constant. If C = 1. then we speak of a Banach algebra.
If we set 111111. = CIIIIII. then
II In 'II. 1I1l1I.1I t' II. for all II, rEX.
If X is a generalized Banach algebra. then U E X implies u' E X and
lIu'li < C'lIull' for all k = 1.2. ... . (81)
The following result demonstrates the usefulness of Banach algebras for
nonlinear problems.
Proposition 21.72. Let ,,\, be a generalized Banach algehra. Let
.{(II) = ao + a.u + a 2 11 2 + ...
be an analytic function from] - r, r[ to R. ,..,here 0 < r < x. Set B = {II EX:
\lull < rIC}. "'here C > 0 corresponds to (80). Then the operator F: B -+ X gil'en
by
u f(u)
(82)
is analytic. For each t' E Band k = 1.2, .... the kth F-derit'atil'e F tA )(l') at I' is
git'en by
("1..... h,,) It')(v)1I1 ... h,
.for all 11. . . . . , h. EX.
21.21. Banach Algebras and Sobolev Spaces
PROOF. For all u e B,
00
1I/(u)1I s L la,IC'lIull' < 00.
'-0
Now use (16) in Chapter 8.
293
o
STANDARD EXAMPLE 21.73 (C-Spaces and Holder Spaces). Let G be a bounded
region in R N , N 1. Then all the spaces
C'(G),
c..ea(G),
k = 0, I. 2, . . . ,Os « Sit
form generalized Banach algebras.
PROOF.
(I) Let G = ]0, 1 [, and let u, v E C[O, I]. Then
lIuvllc = max lu(x)v(x)1 s lIullcllvllc.
O%I
(II) If u, v e C 1 [0, 1], then (uv)' = u'v + uv', and hence
lI(uv),lIc S lIu'lIcllvllc + Hullcllv'lIc.
This implies
tluvllc' = lIuvllc + lI(uv),lI c s lIullcdlvllc'.
(III) Let u, v e CC[O, I], 0 < < I. Then
lu(x) - u(y)1 S Hea(u)lx - yf.
for all x, y e [0, I],
and hence
fu(x)v(.) - u(y)v(}')f = fu(x)(v(x) - v(y» + v(y)(u(x) - u(y»f
s (liullcH(v) + IIvllcH.(u»lx - )'Iea.
This implies Hea(uv) S lIullcH«(v) + IIvllcHea(u), and hence
lIu('II. = lIuvll c + H.(uv) s lIuIL.llvlI..
The general case proceeds analogously.
o
STANDARD EXAMPLE 21.74 (Sobolev Spaces). Let G = R N or let G be a bounded
region in R N with oG e Co. I and N I. Then the Sobolev space
X = W,"'( G), 1 S P < 00, m = I, 2, .. . ,
forms a generalized Banach algebra in the case where
mp > N.
W:(G) is a generalized Banach algebra for m = 0, I, ... .
(83)
294
21 Auxihar Tools and the Convergence of the Galerkin Method
.. or example, the spaces
i/21 (G). G £ R I ,
and
Wl(G), G C R 2 , IR J ,
form generalized Banach algebras.
PR()()F. If P = "'1'_, then the assertion follows from IIU1'1I2: S 111111:( 111'U x and
IIIIU m . x = L IID 2 ull:r.
\21 ' '"
The proof for p < Y... wit! be given in Problem 21.5, by using the continuity
of the embedding X c C(G). 0
Remark 21.75. If the crucial condition (83) is Iiolated. then the investigation
of nonlinear operators has to be based on the Sobolev embedding theorems
and the Gagliardo Nirenberg estimates. A typical example will be considered
in Section 21.23.
C()l)NTf:RI:XAfPlJ: 21.76 (Lebesgue Spaces). The function u(x) = II has the
property
U E L 2 (0. I)
and
u 2 f L 2 (O, I).
Thus, L 2 (O, 1) is 1101 a generalized Banach algebra. Consequently, the operator
II ........ ,,2
is 1I0t explained on the entire space L 2 (O, I).
21.22. Moser-Type Calculus Inequalities
Our goal is the following fundamental inequality:
IID2f1Dl't.1I2 const(lIuU m . 2 II vII x + IIt'II"'.2I1ull"1:)' (84)
for allu, l' E W 2 '"(G) fl L J' (G), where
nl = 121 + 1 PI > 0,
and for all U,l' E W 2 "'(G) if m > N /2. More generally, we consider the inequality
, ,
IID:lIU 1 ... D2 rll ,1I 2 < const L lIu;lIm.2 n lIujll tX , (85)
i1 j=l.ji
where "' = III + ... + I,t.
Proposition 21.77 (Moser (1966». Let G = R N or let G be a hounded regio'J in
R N \\'ith oG E Co. I and N > I. Then, inequality (85) holds for all
"I' ..., Il, E W 2 "'(G) " Lx(G), m = O. I, ... .
21.22. Moser-Type Calculus Inequalities
295
Coroll.ry 21.78. If m > N /2, then, for all
U I' . . . , U r E W 2 '" ( G),
inequalit y (85) holds and
IID2 I us ...D'ru r Il2 S constllu.II....2...lIu r ll....2. (85.)
This corollary follows from the continuity of the embedding W 2 "'(G) C(G).
PROOF.
(I) Let r = 2, i.e., we first prove (84). If II = 0, then
II D u D' V 11 2 = II u D' V 11 2 < II u II x II D' v It 2 ·
Thus, we may assume that 121, 1 PI I. Set
p = m/lal,
q = m/IPI,
p-' + q-' = 1.
By the Holder inequality,
II DlluD' vn 2 S II D Glu ll2p liD' VIl2,.
By the Gagliardo-Nirenberg inequality (75),
II Dill u1l2, S const II u II .P211 II II',
IID'vIl2, S constllvn2"vll".
By the Young inequality
I a I' I b I'
ab S - + - for all a, b E R,
P q
we obtain the assertion (84).
(II) In the general case, use the Holder inequality and the Young inequality
for r factors (cr. A 2 (29), (30). 0
let U = (u"..., u,) and v = (t'"..., v,). The following simple inequalities are
frequently used:
lIa(u),,' 11 2 S C( R) II ,,'11 2 ,
lI(a(u) - a(v»wIl2 s C(R)lIu - vlllIwll2'
(86)
(87)
where lIuUx = L'=-, IIUill and lIullx" Itvtlx s R. Here, R is an arbitrary rea)
number.
If G is bounded, then, in particular, we may set w = 1. In this case,
II "'11 2 < x.
Proposition 21.79 (Killing of Nonlinearities). Let G = RN or let G be a bounded
region in R N , N I. Let a: R i ..... R be locally Lipschitz continuous (e.g., a is
296
21, Auxiliary Tools and the Convergence of the Galerkin Method
C 1 ). Theil, tile ilJequalities (86) and (87) hold for all
\\'eL 2 (G) and Ui.liEL%(G), ;= I....,k.
For f86) it ;s sufficie,u that a( · ) is cO,lIinuous.
PR()()F. If lIull , 1It'1I"6 < R. then the function x t-+ a(u{x)) is bounded on G, and
hence
Ie; (aw)2 dx consl IG w 2 dx.
Moreover, it follows from
la(u(x)) - a(l'(x)1 < L(R)lu(x) - v(x}1
for all x E G
that
f (a(lI) - a(v))2 ",,2 dx S L 211 II - t'li f w 2 dx.
JG JG
o
Important inequalities for the nonlinear terms of partial differential equa-
tions can be obtained by combining (85), (85*) with (86). (87).
EXAfPLf. 21.80. Let G = A'IV or let G be a bounded region in R N with aG E Co. I
and N > 1. Suppose that the function a: Ai -+ IR is ct. Let
m > N /2
and let I 11 + ... + 12,1 < In. Set u :=. (u,..... u i ). l" = (Vi....' I'i), and !lull",.2 =
Li II u;ll m. 2' Then:
, ,
U a ( u) n D:r J \\j 11 2 < C ( R) n II W j II "'. 2 .
j = I j= I
(88)
, ,
lI(a(u) - aft')) n D 2 JKjll2 S C(R)lIu - 11I",.2 n II "j II "'.2. (89)
j=1 j='
for all Ui' r i . \\j E W 2 m (G) with null",.. "1'11",.2 s R.
These inequalities are very convenient in applications.
PR()()F. Since the embedding W 2 "'(G) C(G) is continuous. we have
lI.rll_ constl1fU",.2
Now use (85*) and (86), (87).
for all f e W 2 "'( G).
o
More general results can be found in Chapter 83.
21.23. Weakly Sequentially Continuous Nonlinear
Operators on Sobolev Spaces
The operator A: X -+ Y between the B-spaces X and Y is called weakly
sequentially continuous iff, as n -+ 00.
II" II implies Au" Au.
21.23. Weakly Sequentially Continuous Nonlinear Operators
297
Such operators play an important role in the modern theory of nonlinear
partial differential equations.
Proposition 21.81. Let X, Y, Z be B-spaces over K = Rt C. Then:
(a) If A: X -+ Y is linear and continuous, then A is also weakl}' sequentially
continuous.
(b) If A: Y -+ Z is continuous and the embedding X S; Y is compact, then
A: X -+ Z is strongly continuous, and hence also weakly sequentially
continuous.
PROOF. Ad(a). Cf. Proposition 21.27.
Ad(b). If u" u in X, then u" -+ u in Y, and hence Au" -+ Au in Z. 0
Lemma 21.82. Let G = R N or let G be a nonempt)' bounded open set in R N ,
N I. Let (v,,"',,) be a bounded sequence in W,"'(G), where m = 0, 1, 2, ...,
1 < p < OCt
In the case ",here G = R N , suppose that, for each open ball B in R N ,
('" -+ (' in C(B) as n -+ YJ,
"'" w in L,(B) as n -+ oc,
"'here t., M': G -+ R are fixed functions. In the case "'here G R N , suppose that
(90) holds for B = G. Then
(90)
t' \\' t'W in W"' ( G )
" " p
as n -+ OCt
PROOF. The B-space W,"'(G) is reflexive. If we assume that
V "'. z in W"' ( G )
" " p
as n -+ X),
(91)
then
Sa cpv"w"dx - Sa cpzdx as n -:XJ for all cp E CO'(B),
since u J IJ cpu dx is a linear continuous functional on W,,'"( G), by the Holder
inequality. From
cpt'" -+ cpt' in L,,(B) and W,,-Ia W in L,(B). p-t + q-t = I.
it follows, by Proposition 21.23(j), that
fa fPl'" WIt dx - fa cpt'W dx as n - ce,
and hence J 8 cp(VM' - z) dx = 0 for all cp e C:(B) and all B. Since Co (B) is dense
in L,,( B), we obtain that
z = vw almost everywhere on G.
If (91) holds for a subsequence (v,,- "'",), then we obtain again that z == vw. This
yields the assertion, by the convergence principle (Prop. 21.23(i»). 0
298
21. Auxiliary Tools and the Convergence of the Galerkin Method
Lemma 21.83. Let K be a 'lonempty compact set in R N , N > I. and let a: R ..... IR
he ('O,Jt;'IUOUS. (f \"e set
Au = a(u),
rl,ell ,lie operator A: C( K) .... C( K) ;s continuous.
PR(X)F. Let II" -+ " in Cf K) as n -+ yw. Since (u ll ) is bounded in C(K) and a(')
is uniformly continuous on bounded sets, we get
"All" - Aul! = max la(u,,(x)) - a(u(x))I..... 0
'( €
as n -+ x,.
o
We now study the 'ypical nonlinear operator
Au = a(u)D2 u ,
121 < m.
(92)
Proposition 21.84. Let G = R t , or let G be a bounded region in R N "pith cG E co. I
and N > I. Set
X = W 2 "'+"(G). Y = W 2 1t (G), m > N /2, k = 0, I. ... .
Let a: IR -+ R he C. +'. Tllen rile operator
A:X-+Y.
(//'"ed hy (92). is hounded, continuous, Lipschitz continuous on balls, and \veakly
seque,JI ially ("ont;11110IlS.
PR()()F.
(I) We show that A is bounded and Lipschitz continuous on balls.
(I-I) Let k = O. and let lIu1l",.2' 111'11",.2 < R. By Example 21.80.
UAuU 2 S C(R)lIull"'.2
and
UAIl - .41'11 2 = U(a(u) - a(t'))Du + a(l,)(D:l u - D2 l 'U1 2
< K ( R) II u - l'lI 1ft. 2 ·
(1-2) Lct k = I. We will use the equivalent norm IIt'IIT.2 = IIvll 2 + L II D J v ll 2
on Y = W 2 '(G). Note that
N
It A II II 1. 2 = II Au 11 2 + r " DJ Au" 2
J=1
and
DJAu = a'(u)DjuDQ/u + a(u)DjD:l u .
Thus. the case k = I can be reduced to the known case k = O.
(1-3) For k > 2. we use a similar argument.
(II) We show that A: X ..... Y is weakly sequentially continuous. To this end.
let 8 = G if G is bounded, and let B be an open ball if G = R N . Since the
Problems
299
embedding W 2 "'(B) s; C(B) is compact. it follows from
u,,u in X = W 2 "'.'(G) as n -. x
that II" -. u in C(B). and hence
a(u,,) -. a(u) in C(B)
D2U,,Du in L 2 (B)
By Lemma 21.82.
All.. Au in Y = W 2 '(G) as n 'XJ.
Note that (Au,,) is bounded in Y. since A: X Y is bounded and (u,,) is
bounded in X. 0
as n -. ,X.
as n -. x,.
PROBLEMS
21.1. Almost orthogonal elements. let M be a closed linear subpace of the 8-space
X with AI #: X. Show that. for each £ e ]0. I [. there exists an element u, with
II u, II = I
and
dist(u,. Af) > 1 - t.
Solution: Let (' e X - M. Then d = disth.. At) > O. since M is closed. We
choose an element m( e AI with" r - me ll < d( I - £)-1, Then. u, = (l' - m,)/
Ir - m,1I is the desired element. In fact. for all me Af.
Ilu t - mi: = lit' - m,lI-I'lt' - me - i;r - m,Uml ,
Ii I' - m, II - I d I - £,
21.2. Proo.( 0.( Theorem 21.C.
Solution: Let dim X = oc. We choose M' I with IIM'I" = I. Let M =
span: "'I }. By Problem 21.1 there exists a M'2 with I- M. 2 ;; = I and II "'2 - \\'1 II
. This way y,'e obtain a sequence (M',,) with 'I',',,': = I for all nand
JI "'" - '''",! ! for all m > n. Obviously. ("',,) has no convergent subsequence.
i,e.. the unit ball in X is not compact.
If dim X < x. then it follows from the classical Bolzano- Weierstrass
theorem that the closed unit ball in X is compact.
21.3. Propert;eot; of generali:ed der;('at;(Oes. We want to show that generalized
derivatives have properties similar to those of classical derivatives, I n Prob-
lems 21.3b- 21.3h below. let G be a nonempty open set in R..... N I. All
derivatives are to be understood in the generalized sense.
21.3a. Functions ",;th ranishing generali:ed der;('atit'es. let G be a region in R''V with
IV > I and let U e W I I (G). Show that
OJ U = 0 on G for j = I..... N
implies u(x) = constant for almost all .t e G,
Solution: let H c c G. and let H be bounded, By Proposition 18.35.
DJ(S,u) = Sr(Oju) = 0 on H
300
21. Auxiliary Tools and the Convergence or the Galerkin Method
for all r. E ]0. £0[' Hence Scu = constant. By Proposition 18.33,
ScU -. 14 in L, (H)
as t -. + O.
Since the constant functions form a finite-dimensional subspace in L, (H), we
obtain u = constant on H. Now, note that H can be chosen arbitrarily.
Analogously, we obtain that it follows from u E W,'(G) and
D2 U = 0 on G
for all (X: 121 < k
that u(.) = (polynomial of degree < k - I) for almost all x E G.
21.3b. Yeak limits. Let % be a fixed multi-index. Show that it follows from
u" ..-.. u in L I (G)
D 2 u" -1ao r in L afG)
as n..... 'X) ,
as n..... X',
that D:J u = (' on G.
Solution: Letting n -+ Y..,. it follows from
L 11ft D> If' dx = ( - I >I L 10> 11ft I If' dx
for all f/J e Co (G)
that
r II D> If' d x = (- n'; r "If' d.'(
JG JG
for all cp e C';: (G).
Consequently, the generalized derivative 02U exists on G and D 2 u = t' on G.
21 3c. rOhllraCleri:ClI;on of generalized deril'al;I'es. Let u, I' E t I.loc(G), and let 2 be a
fixed multi-index. Show that the following two statements are equivalent:
0) The generalized derivative OSu exists on G and [)«u = t. on G.
(ii) There exists a sequence (u,,) in C (G) such that
U,,-+U inL.(H)
0" u" ..... r in L" II)
for all compact subsets H of G.
Solution: (i) => (ii). Letting
as n -. oc,
as n..... 'X.J,
U" = s."u
for n = I. 2, . . . .
statement (ii) follows immediately from Proposition 18.3S(b).
(ii) => (i). Use the same argument as in Problem 21.3b.
21.3d. The product rule for generalized derivatives. For fixed; = I,..., N, show that
Di(ul") exists on G such that
Di(UI) = (Diuh' + uD;t' on G.
provided the following assumptions are satisfied:
(i) 14 E tp.I«(G) and v e L.,'oc(G), where 1 < p < 00 and p-a + q-' = I.
(ii) The generalized derivatives DiU and Div exist on G such that DiU e
tp,loc(G) and D,(' E L..1oc(G).
Problems
301
Solution: We set u" = S.,..U and v" = S I'"V for all n. By Propositions 18.33
and 18.35, we get II", l'.. e COO(G) for all n, as well as
u.. -+ U and D,u. -+ 0,11 in L,(H) as n 00,
V" -+ t' and D,v" -+ D,v in L.(H) as n -+ 00.
for all compact subsets H of G. Letting n -+ , we obtain from
L u"v"D,tp dx = - L (v"D,u" + u"D,v,,)tp dx (or all tp e CO'(G).
that
L ul'O,tpdx = - L (vO,u + uO,v)tpdx (or all tp e CO'(G).
21.3e. The chain rule for generalized derivatil'es. For fixed i = I, .. .. N. show that
the generalized derivative Di(f 0 u) exists on G such that
D,(/o u) = (f' 0 u)Diu on G.
provided the following assumptions are satisfied:
(i) u e LI..oc(G).
(ii) The generalized derivative D,u exists on G and DIu e L',loc(G).
(iii) fee. (R) and sup.. AI/'(x)1 < .
Solution: Let H be a compact subset of G. We choose a sequence (u..) in
ex (G) such that
u" ..... u
and
D,II" -+ D,II in L.(H)
as n -+ oc,
by Problem 21.3c. Then
r I/(u,,) - l(u)I dx s sup II'(x)1 r lu" - ul dx --+ 0
JH .A JH
as n -+ x.,
and
A" L II'(u")O,u,, - l'(u)O,ul dx
s sup II'(x)l f lO,u" - O,ul dx + r II'(u,,) - l'(u)llO,ul dx.
Jr. A JH JH
Since ". -+ II in L,(H) as n -+ oc, there exists a subsequence, again denoted
by (u,,), such that II,,(.) -+ II(X) as n ..... 00 for almost all .t e H, and hence
/'(u,,(X)) ..... f'(u(x))
as n -+ X
for almost all x e H.
This implies A. -+ 0 as n -+ 'X, by the principle of majorized convergence
Az( 19). Hence
/ou,,-+fou inL.(H)
as n..... 00,
Di(f 0 u ll ) -+ (f' 0 u)D,u in La (H)
By Problem 21.3c, D,(f 0 II) = (f' 0 11)0,11.
as n -+ 'YJ.
302
21 Auxihary Tools and the ("onvergence of the Galerkin Method
21.3f. Special generalized deri('atit'es. Let II E L, .Ioc( G). and suppose that. for fixed
; = I..... N. the generalized derivative DiU exists on G. where D;u E LI.1oc(G).
We set
II + (x) = max(u(.x),O).
u - () = min(u(x).O),
and luf( = lu( )I. Show that the generalized derivatives D;u and D;lul eXist
on G. and we have the following relations:
(D,u+1( '( I = {,U(.'(I
tD,u - H.) = { o
D;u(x)
{ + D,u(.)
D,lul)(.) = 0
if u(.) > O.
if u(x) < 0:
if u(xl > O.
if u(x) < 0:
if u() O.
if u(:<) = o.
Solution: .. or r. > O. define
{ ( U 2 + ,; 2 '2 - r.
.(u) = 0
if u > O.
if II < o.
By Problem 21.3e. for all <p E C (G).
f 1,(u)D;<I' d. = - f UD;U(U2 + (,2) -, 2 <I' clx,
c; CI .
where G. = : E G: Il(.) > O}. Letting t -+ 0, we get
f u. D,<p dx = - f (DiU)</, dx.
G G
This implies the claimed relation for Diu.. The other results follow from
II = - ( - u) + and II' I = u + - U - .
21.Jg. A ,"pE'(';ell prop(Jr,y t1 the Dirichlet integral Let II E "t(G). and let,. = 1111.
Show that r E t'21 (G) and
f 10('1 2 Jx < f I DI,,2 dx.
(i G
where IDul 2 = [_ I (D;u)2.
Solution: This follows immediately from Problem 21.3f.
21.3h. Iterated generali:ed deril'atives. Let 2. fJ be multi.indices. Show that the
generalized derivative D'u exists on G such that
DJ+1>1l = DJ(D'u) on G.
provided the following hold:
Ii) The generaliled derivative D'u of u exists. where u. D'u E LI.1oe:(G).
(iit The generalized derivative D2(D'u) of D'u exists on G. where D'(D'u) e
I. I loe: t G).
Problems
303
Solution: Let t' =: 0'&1. For all qJ E q(G),
t uDs+'rpdx = t uD'(D-rp)dx
= (_1)'11 t vlYrpdx
= (_1)1-1+111 t (D"v)rpdx.
21.4. A convergence theorem. Let G be a nonempty open subset of R N . N 1. let
m = 1.2. .... I < p < x. and p-I + q-I = I. Show that
&I.. &I in W,,"'(G)
as n -+ X
implies
D-&l" 0-&1 in L,,(G)
as n -+ X for all 2: 121 S m.
Solution: Let X = W,"'(G). For fixed t. e L,(G), define
b(u) = t (D-u)l,'dx for all u e X.
By the Holder inequality,
Ib(&I)1 s 1I&lII"lIvn, s lIullxllvll..
Hence b e X.. Consequently, as" -+ X, u......u in X implies b(u.) -+ b(u) and
hence 1)«&1" -A OS&I in L,(G) (d. A 2 (34b)).
2 I .5. Sobolev spaces and Banach algebras. Let G = AN or let G be a bounded region
in AN such that cG e CO. I and N I. Show that the Sobolev space
X = W:(G), k = I, 2, ..., I S P < 'XJ,
forms a generalized Banach algebra in the case where pk > N.
This means that &I, veX implies &IV e X and
n<'U const 111.1" nt,n
for all &I, veX.
Use the embedding theorems from A 2 (45). In particular, use the continuity
of the embedding W:(G) c C(G) for pk > N.
Solution: Let &I, t. eX. The product rule shows that the derivative
l)2(uv),
121 S k,
consists of finitely many summands of the form
D'uD'v,
We have to show that
IPI = J, h.1 < k - j, j = 0, . . . , k.
"D' 1.10" vir, const II 1.111..,,11 t'tt..".
(I) Let j ::II O. Then the continuous embedding W:(G) S; C(G) implies D'&I e
C(G) and
1ID'&lII !!! sue 1D'&I(x)l S constllull,.".
XIG
304
21 Auxiliary Tools and the Convergence of the Galerkin Method
Hence
;,D'uD't'li, s; constllD'uUT IID)tlI,
< const IluI/ 1 ."lIt"I..,.
The case J = k proceeds analogously.
(II) Let I < j < k - I. The continuous embeddings
W:(G) c (G). W:(G) c -J(G),
with r = kj, r -I + s -I = I, and the Holder inequality imply
L ID'uD't'IPd:< (L ID'U1rPJ:<Y" (L ID7VIJPd:<r'
Hence
IIDIlD'I'I!" < IID'ulI"IID/('il sp
< const lIull..,III'II..,.
21.6. A del1s;ty theorem. Let m = 0, I. ..., I < P < x), N > 1. Show that the set
C (R''') i!\ dense in W,"'(R N ). More precisely. the following is true.
(i) If II E W,"'(R'-Y). then there exists a sequence (u,,) in C(RN) such that
I'U - ull -.. 0
" m."
as n -. x.
hi) If, in addition. I' E W:(R N ) for fixed k = 0, I, . . ., I < q < 'X), then
1111,,11... S const IIull... for all n,
and
DJu,,(x) --. DJu(x)
in the case where II < k.
Solution:
(I) If we set
as 11 --. x., for almost all x E R N
('" = 5. "u,
11 = I, 2. . . . .
then 1'.. E CT tR"-) for all n. and (i), (ii) are valid for (I',,). according to the
properties of the smoothing operator S, in Section 18.14.
(II) Let I' E C 7 (R')" W"'(R N } ('\ W:(R N ). We choose a ex -function
cp: R" -.. R such that 0 < cp(x) < I on R N
and
{ I if I. I < I.
cp(x) = 0
if I x I 2.
If we set
"',,(x) = t'(x)cp(./n),
then K'" belongs to Co (R''') for all nand
:I "'.. - t7 II III.' -. 0
as 11 -. x.
"w"I1... const !lvll...
for all n.
This follows from
IDJ"',,(xH s const L ID"to(xU
Y':I
for all x e AN,
Problems
305
and hence
r ID-w R - De,,'I'dx = r IDsw R - l)Svl'dx
JR- JI"
S const r L ID7 r ,(X)/' dx -+ 0 as n -+ 00.
J Ixl II 171 $1%1
(III) Putting (I) and (II) together. we obtain the assertion. Note that cp(x/n) -+
I as n .... oc.
21.7. An e:<tension principle for smooth functions.
21.7a. Main idea. Let f: [0. a[ .... R be C. with m 0 and 0 < a soc. We set
f(x) if . O.
g(.) ==
",+1
r cJf( - ./j) if x < O.
J=I
where the numbers c J satisfy the linear system
..+ 1
L cJ(-I/j}' = J.
)-1
k == 0, . . . . m.
(93)
Show that g: ] - a, a[ .... R is a C"'-extension of f.
Solution: The determinant of (93) is a Vandermonde determinant. and
hence it is different from zero. By (93), gf.t(O) = f(.)(O) for k = 0, I, ..., m.
21.7b. Let IS p S . m == 0, I, .... N I. Let G be a bounded region in A.... with
cG e C"'. Choose an open set Go such that G c: Go (e.g.. Go == AN).
Show that given any function u e c-(G), there exists a function ve C.(G o )
such that
t' = u on G,
SUpp to c: Go.. and
!It'II,,,.,.G o s const l!ull..,.G'
The constant depends on G, Go. m, p.
Hint: Consider Figure 21.8(a). Use a partition of unity. Map neighbor-
hoods of boundary points locally to a half-space as indicated in Figure 21.8.
Extend the corresponding functions from the half-space to the entire space
A.... via Problem 21.7a. Transform the extended function back to Go. Cr.
Friedman (1969. M), p. 10.
----
"" --,
,/' "
I \
\ J
" /
/----_/
Go
'1
1p
>
H.
H_
(8)
(b)
Figure 21.8
306
21. Auxiliary T 0015 and the Convergence of the Galerkin Method
21.8. E "(te'lsion opercllor for Sobolet' spaces.
21.8a. S",ool/, hou"daries. Let m = O. I. .... I < p < OC:. and N > I. Let G be an
open half-space or let G be a bounded region in R N with cG e C"'. Show that
there exists a linear continuous operator
E: Vp"'( G) -+ t...( n.") (94
such that the function Eu: R" -. n IS an extension of u: G -+ R.
Hint: Construct the function Ell via Problem 21.7b. and use the density of
C"'( G) In W p "'( G).
21.8b.. L "lI;rersal t''e"s;()" operator. let "1 = O. I. . . .. I p < x, and N > I. Let
G be an open half-space or let G be a bounded region in R N with cG E Co. I .
Show that the assertion of Problem 21.8a remains true, where the construc-
tion of the operator E is indepe"dent of ,n and p. Moreover. the function Eu
is C7 on R\ - G.
I int: c'r. Stein (1970. M). Chapter 6. and Burenkov (1976). (1985). The
proof is based on a modification of the method in Problem 21.7a.
21.9 All i",portcult ('quirLllenl l10rnl Otl L 2 (G). Let G be a bounded region in (R" with
;\ I and i'G E CO. I. We set
B = : u E Wli (G): "u II .. 2 S I:
and
'II I I 2 = sup r G u (' d.'( .
.B J4
II D,ull-1.2 = sup I G 110.1" dx
.£8 J4
(95)
as "ell as
...
II u II! = iI u II - I. 2 + L II D, 1111 - 1. 2 .
I:: 1
lJ"e the Fourier transformation from A 2 (74) in order to show that I u:11 is
an equivalent norm on L 2 (G). i.e.. there exist positive constants c and d \\'ith
c II II 11 2 < 11 u II d II U 11 2
for all u E L 2 (G).
In Chapter 65 we shall use this inequality in order to give a simple proof for
the fundamental Korn inequality in elasticity theory.
Solution: The estimate Ilull! d 111411 2 is obvious. Thus. we only need to
prove that
C " 1411 2 < II u II !
forall ueL 2 (G).
(96)
The key to the proof is the simple identity
I .. 11,;12
I' .; 1.\12 + . i -+- I.rP = I.
ft) Proof of (96) for G = iR.... Let J = JR" Define
lIull1.2 = (f (I + I.rl 2 IIul 2 d.r)' 2 .
(97)
Problems
307
where Ii is the Fourier transform of u and )' == ('I...... 'IN)' Note that
W 1 1 (R'''') = W 1 1 (R''''). By A 1 (74). we have:
ue JYzl(RN) iff ue Lz(RN) and IIullT,l < 'Y:).
Moreover. 11'11 T. z is an equivalent norm on W Z 1 (R N ). Thus. it is sufficient
to prove (96) with B in (95) replaced by the ball
B = {u e W 2 1(R N ): lIullT.2 S I}.
Using the Fourier transform, we obtain that
IID , ull-l.l = sup f UD;t'd.t = sup f U"i d)'
1".. 1". a
= su p f a", (I + I )'IZ)lf2 d,'
I'. . (I + I)' 1 2 ) I.' 2 . .
= su f u", \f d '
'w , 1 (I + J'IZ II'z )
= (f ,",21,,;12 dy ) I/2.
I + 1..\"1 2
Similarly, we get
f (f luI2 d)' ) 1/2
/lull- u = ut'dx = I + b'lz .
By (97),
/lu/lI.Z + IID,UIlI.z = f'U,Zd}' = lIullj.
This yields (96).
(II) Proof of (96) for G = half-space H.... Let
Ht = {(,.,,) eRN: 'I O}.
where = (" I" . . '''N -I ) and 'I ::z ",V (cf. Fig. 21.8 above). We set
"'(.'r) = t.(. 'II - 3t.(. - 'II + 2t{. - i)
and B o ={t'eq(R, v ):Ut'UI.2.A"'S I}. Then, for allt'eBo. we get
".( ,. 0) == O. a nd hence
". e W 2 1 (R +) and sup /I "'111. Z.H. < X.
I' . at)
Now let u e Lz(H...). We extend the function u to AN by letting
u(. 'I) = - 3u(" -,,) + 4u(. - 2,,). " < O.
We want to prove that "uUt.A- S Cffullf.H.. i.e.,
N ( ... )
lIull-1.2.A" + L !!DiuU-I.Z.A- S C IIU"-I.Z.H. + L \lD.ull-I.Z.H. .
1=1 1:1
(98)
Then, by (I), we obtain (96) for G = H....
308
21. Auxiliary Tools and the Convergence of the Galerkin Method
First. let i = I,.... N - I. Then, a simple integral substitution yields
Dju II _ !.2.R' = sup r uDjl: d.'( + r UDiL'd.'(
I' . '0 J H . J H
= sup r uDjwdx S constllull- I ,2,H.'
.. · 10 J H .
The other estimates in (98) are obtained analogously.
(III) Proof of (96) for bounded regions G with rG E Co.,. We use a covering
of G and a corresponding partition of unity {cp;}. Let II E L 2 (G). We set
IIi = ucp;. If supp cp. c G, then we use (I). Furthermore. if supp CPi ('t cG
0. then we use a local rectification of the corresponding boundary part
as indicated in Figure 21.8 above. and we apply (11). The simple technical
details can be found in Necas (1967. M).
21.10. 1"'0 properties of the space t 7 (G).
21.10a. let G be a nonempty open set in R,Il. N t. let II" E L1[ (G) for all n such that
u"C.) .-. IIC.) I' n.... x.' for almost.1I ., e G
..Hd tUft. !i 14"". , Ittllw thll' U . 1..,.'0' ..,."
lIu 17 < lim 1111..11 r.
.. ..... 'I:
Solution: Set X = l...(G) and
.IN(I') = fG ('II N dx, 1ft') = t t'u dx for all (' eX,
Then 1.fE X. and !I.III = Ilu..!l r . alII = lIuli r . Moreover.
f"tr)....ffl") as n.... x forall ('EC(G).
.
Since (. tG) IS dense in ,,\" and sup" IIf..11 < x. we obtain that J' as
11 -. x and hence I I III < lim ...... r 11/..11, by Proposition 21.26.
21.10b. Let G be a nonempty measurable set in jR...., N I with meas G < x. Show
that
lim lIullp = :Iull
P'" r
forall ue L(G).
Hint: Cf. Kurner (1977, M). p. 84.
21.11. Properties (r integral operat()r.. We consider the integral operator
1 K u)lx) = r klx, J)ul J') dJ'. 1991
Ju
Let ,\f be a nonempty measurable set in R\ N > 1. and let k E L 2 (M x ,\f"
i.e.. the function k: M x M R is measurable and
r k(.'(, .\,)2 d.'( d.\' <. oc. II (0)
Ju.",
21.11 a. Show that the operator K: t 2 ( M) .... L 2 (M) is linear and continuous \\'ith
II K 11 2 S f k(x. y)2 d:< dYe
" /II ,\f
Problems
309
Solution: By the Fubini theorem A:z(23), it follows from (tOO) that
f", k(X.y)2dy < QO
for almost all x e M. Set w = KII, where II e L:z(M). By the Holder inequality,
W(X)2 f", k(x. y)2 dy f", U(}.)2 dy
for almost all x e M. Hence Ku e L 2 (M), and
IIKulI = L W(X)2 dx (f",x", k(..}.)2dXdY) "ull.
2t.llb. Show that the adjoint operator K*: L 2 (M).... L 2 (M) corresponds to the
adjoint kernel k*(x, )') = k(y, x).
Solution: Set
(Ll')(.) = f", k*(x. }')v(}.) d}'.
Then, for all u, v e L 2 (M),
till Lv) - f v(y)k(,., x)u(x)dx dy = (Kulv).
AI.AI
Hence L = K..
21.llc. Show that K: L 2 (M) -+ L 2 (M) is compact.
Solution: The linear hull of the set of all the functions
c(x, )') = a(x)b(y), a, b e L 2 (M),
is dense in L 2 (M x M). This follows from the density of CO:(R 2N ) in L 2 (R 2N ),
and from the Weierstrass approximation theorem A, (23e). Thus, for each
I; > O. there are functions ai' b i e L 2 (M) such that
f [ k(X. y) - t Q/(X)b/(y) ] 2 dx dy < t. (101)
AI.", i-I
Set
(C,u)(x) = r t Q,(x)b,(y)u(y) d}'
J AI 1-'
for all u e L 2 (M). Then. C,: L 2 (M) ... L 2 (M) is linear and continuous. From
C.U - t Q, ( r b,( }')u(y)d Y )
I-I J AI
it follows that dim R(C,) < 00. By (101),
ilK - C,1I 2 < t.
Now, the compactness of K follows from Example 21.4S.
21.12. An existence principle. Let X and Y be B-spaces over k. and let A: X -+ Y be
a linear and continuous operator. Suppose that the equation
All == b,
ue X,
( 102)
310
21. Auxlhary Tools and the Con\'ergence of the Galerkin Method
has approximate solutions. i.e.. there are constants C > 0 and q E [0. I [ such
that. for each hEY. there exists a u(h) E X with
'.4u(h) - hI! < ql1hll.
'114(b)1I < C dh::.
Show that. for each hEY. equation (102) has a solution 14 with Ilu!1 <
C(t - q) IlIh " .
Solution: Let hi = h. By assumption. there exists au, with
'1 A II, - hi I: < q II hi II.
Set h: = h, - Au,. Then we find a 1'2 with
I. AU2 - h 2 11 q IIh 2 11 s q2!1hall,
:Iulll C Iih,lI.
1[11 2 11 < Cllh 2 11 < Cqllb,I'.
ThiS \\.a) we get t\\.o sequences (u,,) and (h,,) with
h". I = hIt - Au".
,'h,,1! < q"-I IIhll.
1I1I,,:j < Cq"-'lIhll.
Hence
hIt . 1 = h - A (u, + . .. + u,,).
Since L" "U,,:I < C( I - q) I Ilh II. the series
( 103)
2
II = L Il"
,,=,
is convergent with Ilull < C( I - qr IlIbll. Letting n .... ":1:.# in (103), we obtain
o = b - Au.
21 13. TIle approx;"aal;olJ theorem of Kan,oror;c. We consider the operator equation
II + Lu = h.
U E X.
(104)
together with the corresponding approximate equation
UtI + L"u" = PIth.
UtI EX". n = I. 2. ... .
( 105)
For all II. we make the following assumptions:
(it X" is a linear subspace of the B-space X with dim X" = n. and PIt: X X"
is a projection operator onto X".
hi) The operators t: X ..... X and L,,: X" ..... X" are linear and continuous. and
I + l: X -.. X is bijective.
(iii) There exists a constant d" with
disH Lu. X,,) d" II u II
for all II EX.
(ivt Set Crt = ',P"L - L"II..... and suppose that. as n..... Y:..
(." O.
B P"II d" 0,
1\ P,," dist(b. X,,) ..... O.
for all hEX.
Show that equation (104) is uniquely approximation-solvable. More-
over. one has the following error estimates:
ilu" - ull const(c" + IIP"IHd" + disUh.X,,)).
( 106)
nu" - ull const(c" + ItP,,1I dist(u. X,,)).
Problems
311
The condition Crt .... 0 as n .... 00 describes the consistency of this approxi-
mation method.
Hint: Use Problem 21.12. cr. Kantorovi and Akilov ( 1964, M), Chapter
14.
In applications, L" can be a discretization of the operator L (e.g., L is an
integral operator and L" is a sum operator obtained by a quadrature ror-
mula). The terms on the right-hand side of (106) can be estimated by using
results of approximation theory.
21.14. Collocation method as a special projection method. We consider the boundary
value problem
u"(.t) + q(x)u(x) = f(.t),
u( - 1) = u( 1) = 0,
where f and q are continuous on [ - I, I]. The idea of the so-called colloca-
tion method is to make the trial
-1<.t<l,
(107)
U,,(x) = (I - x 2 )(co + c,x +... + C,,_I.t,,-I) (J08)
and to use the partition - I < a I < a2 < ... < a" < I. The unknown coeffi-
cients '0' ..., ',,_I are determined by the II equations
u;(a;) + q(a,)u..(a,) = f(a, i = 1,..., II.
21.14a. Show that this method is a special projection method.
Solution: We set
( 1(9)
x = {u e C 2 [ - I, I]: u( + I) = O}.
Y=C[-I,I].
Let X" be the set of all the polynomials of the form (108). Finally, let
Q,,: Y.... Y be the projection operator which assigns to the function f the
unique interpolation polynomial Q"f of degree S II - 1 with the property
(Q"fHa;) == f(a;),
i = I..... II.
Then, (109) is equivalent to the operator equation
All" + Q,,(qu,,) = Qrtf..
lI"eX".
( II 0)
where the operator A: X ...... Y is given by Au = u". The compact inverse
operator A -': Y .... X has the form
(A-I/)(.) = f l G(x.)')f()')d)'.
-I
where G denotes the continuous Green function to the following boundary
value problem:
II" = f on ]-1.1[,
From (110) we obtain
II" + A-1Q,,(qu,,) = A-1Q,,/, II" eX".
The original equation (107) can be written as
u + A -I (qu) = A -I I, II eX.
u( - 1) == u( I) = O.
(III)
( III.)
312
21. Auxiliary Tools and the Convergence of the Galcrkin Method
21.14b.. Let a" . . . , a" be the so-called tebyv interpolation points, i.e.,
2; + I
a, .. = cos ---1t
I 2n'
i = 0, 1. . . .. n - I.
Then
IIQ"II s Alnn + B
for all n.
where A and B are positive constants (cf. Natanson (1955, M), p. 390). Let J:
q E C. [ -I. 1]. Show that
In n
'Iu - u"ll x S const-
n
for all n,
i.e.. the collocation method converges to the solution u of the original prob-
lem (t 07) as n -+ 'X>. The convergence of the collocation method is faster, the
smoother the functions f and q are.
Hint: Use a modification of Problem 21.13, and use the approximation
theorem of Jackson for polynomials from Example 21.48. Cf. Kantorovic and
Akilov (1964, M), Chapter 14.
21.15. General projection methods ;" nonseparable B-spaces. We consider the opera-
tor equation
Au = b,
ueX, beY,
( 112)
together with the general projection method
P" AQ"u = P"h,
where P,,: D(P,,) Y -. Y and Q,,: D(Q,,) X -+ X are (not necessarily con-
tinuous) projections, i.e., P,,2 = P" and Q = Q" with finite-dimensional pro-
jection spaces. If the spaces X and Y are not separable, then the projection
method does not converge for each beY. Then it is important to describe
the set of all b for which convergence occurs. In this connection, a general
theory may be found in Pro6dorf and Silbermann (1977, L).
For example, if (112) represents a singular integral equation, then X and
Y must be Holder spaces in the smooth case. But note that Holder spaces
are not separable.
21.16. Generalized Schwar: inequalit},. Let a: X x X -+ R be a bilinear positive
symmetric functional on the real linear space X. Show that a satisfies the
inequality
a(u.I,)2 S a(u, u)a(t\ I')
for all u, I' E X.
Solution: Use the same argument as in the proof of the Schwarz inequality
in the Introduction on page 8. Replace the scalar product (ulv) with a(u, v).
References to the Literature
Classical works on Sobolev spaces: Rellich (1930), Friedrichs (1934), Sobolev (1936),
(1950,M).
Standard work on Sobolev spaces: Adams (1975, M).
References to the Literature
313
Sobolev spaces: N«as (1967, M) and Kufner, John, and Fueik (1977, M) (introduc-
tion), Friedman (1969, M), Wloka (1982, M) (Sobolev spaces and Fourier transforma-
tion), Triebel (1983, M), Mazja (1985, M).
Inequalities of Gagliardo-Nirenberg: Gagliardo (19S9), Nirenberg ( 1966), Friedman
(1969, M), Adams (1975, M).
Moser-type calculus: Moser (1966), Klainerman and Majda (1981 Majda (1984).
Extension operators for Sobolev spaces: Stein (1970, M), Burenkov (1976), (198S),
Wloka (1982, M).
Approximative solvability of linear operator equations: Kantorovi (1948) (classical
work), Kantorovi and Akilov (1964, M), Anselone (1971, M), Krasnoselskii (1973. M),
Vainikko (1976, L), ProBdorf and Silbermann (1977, L), Fenyo and Stolle (1982, M).
Vol. 4.
Finite elements: Ciarlet (1977, M).
Introduction to approximation theory: Natanson (19SS, M), Cheney (1967, M),
Laurent (1972, M), Braess (1986, M), Ditzian and Totik (1987, M).
Weak and weak. convergence: Dunford and Schwartz (1958, M), Vol. I, Yosida
(196S, M).
(Cr. also the References to the Literature in the Appendix.)
CHAPTER 22
Hilbert Space Methods and Linear
Elliptic Differential Equations
The role of functional analysis has been decisive exactly in connection with
classical problems. Almost all problems are on the applications. where func-
tional analysis enables one to focus on a specific set of concrete analytical tasks
and organize material in a clear and transparent form. so that you know what
the difficulties are.
Concrete and functional analysis exist today in an inextricable symbiosis.
When someone writes down a system of axioms. no one is going to take them
seriously unless they arise from some intuitive body of concrete subject matter
that you would really want to study. and about which you really want to find
out somet hing.
Felix E. Browder (1975t
The perfection of mathematical beauty is such that whatsoever is most beauti-
ful and regular is also found to be most useful and excellent.
D'Arcy W. Thompson (1917)
Generalized boundary value problems for linear and quasi-linear elliptic
differential equations lead to problems of the following form:
(Po) a(u.l') = h(l') for all I' E X.
For given hE...\' *, we seek U EX. Here. X is a real B-space. I n this connection
we want to explain the following two principles (1 a) and (1 b) below which will
be used frequently. Our goal is the following equivalent operator equation:
(P...) Au = h, I' E X.
with the linear or nonlinear operator A: X --+ X*.
The Equivalent Operator Equation
Suppose that the map (I: X x X -+ R is linear with respect to the second
argument" and suppose that, for each U EX, there exists a constant C(u) such
314
22. Hilbert Space Methods and Linear Elliptic Differential Equations
315
that
, a ( u, (') I c ( u) It l'lI
for all u, 1- e X.
Then, for each u, there exists a linear continuous functional on X denoted by
.4u such that
(Au, v) = a(u, l')
for all t' EX.
(I a)
This way we obtain the operator A: X -+ X.. The original problem (P G ) is
equivalent to
(Au, (-) = (b, v)
for all (' eX,
i.e., problem (P.) is equivalent to (P A ).
Let a: X x X -+ R be bilinear and bounded. Then the operator A: X -+ X.
is linear and continuous. Moreover, a is positive (resp. strictly positive,
strongly positive, symmetric) iff A has the same property, by Definition 21.32.
Note that for the linear operator A: X -+ X. the following hold:
() A is monotone iff A is positive, i.e., (Au. u) 0 for all u EX.
(/l) A is str;ctly monotone iff A is strictly positive. i.e., (Au, u) > 0 for all
nonzero u eX.
(j') A is strongly monotone iff A is strongly positive, i.e., there exists a c > 0
such that (Au,u) cllull 2 for all u eX.
If, in addition, X is renexive, then:
a is compact iff A is compact
iff A is strongly continuous.
This follows from Sections 21.9 and 21.10.
Functions as Functionals on Sobolev Spaces
The right-hand side of equation (p.), i.e., the functional b e X. corresponds
frequently to the right-hand side of elliptic differential equations. In this
connection, we will use the following result.
Let Ie L.(G) with I < q < 00, where G is a bounded region in R N , N I.
We set
bM = t I(x)t-(x) dx for all f E W,"'(G),
where m = O. I.... and p-t + q-1 = I. Recall that Wp'"(G) = Lp(G) for m = O.
Then, b ;s a linear continuous functional on the Sobolev space Wp"'(G).
This follows immediately from the Holder inequality:
I b(t') I s (t III' dx ) II, (t lvi' dx ) II'
IIfll.llvll",.p for all v e W,"'(G). (I b)
316
22. Hilbert Space Methods and Linear Elliptic Differential Equations
In addition, we get
lib II < IIfll q ,
in particular, lib I! = IIfll q in the case where m = o.
The Main Results
Suppose that a: X x X -. R is bilinear and bounded on the real H-space X..
and suppose that b: X -. R is linear and continuous. We want to solve prob-
lem (Pea).
The two key estimates are given by
(G s )
a(u,u) cllulli
for all u EX,
and
(G)
a(1l, u) > c II u \I i- - C If u II i
for all u EX.
Here, c and C are positive constants, and Z is a real H-space with the compact
embedding X c: Z. Condition (G s ) means that a is strongly positive. Further-
more, (G) is called an abstract Gdrding inequality, and (G s ) is called a strict
Garding inequality.
(i) Sym"letry a'ld quadratic variational problems. If a is symmetric and
strongly positive.. then the original problem (Pa) is equivalent to the
quadratic variational problem
(V) afu, u) - h(u) = min!, u EX,
which has a unique solution (Section 22.1). This generalizes the classical
Dirichlet principle.
(ii) N ons}'mmetric strongly positil'e problems. If a is strongly positive. then
(Pa) has a unique solution (Section 22.5). This corresponds to the main
theorem on linear monotone operators.
(iii) Gdrding"s inequality and Fredholm alternatives. If a satisfies (G), but not
(G), then (Po) need not have a unique solution. In this case, we obtain a
Fredholm alternative (Section 22.7). More precisely. the original problem
fPo) has a solution iff
b(,v) = 0
for all solutions \v e X of the dual problem
(P:) a(v, w) = 0 for all VEX.
The corresponding linear operator A: X -+ X., involved in the equivalent
operator equation (P A ), has the following properties.
Case (i): A is strongly monotone and symmetric.
Case Oi): A is strongly monotone.
Case (iii): A is a compact perturbation of a strongly monotone operator,
and hence A is a Fredholm operator of index zero.
22. Hilbert Space Methods and Linear Elliptic Differential Equations
317
Application to Boundary Value Problems
In order to explain Cases (i)-(iii) above with a simple, but typical example, let
us consider the following boundary value problem
-£\u + u( + fJu" + i'U = f on G,
u = 0 on oG
together with the dual problem
- £\w - "'( - fJw" + i'W = 0 on G,
'" = 0 on aG.
Here, G is a bounded region in R2, and ex, p, ( are real numbers. Then, for
example, we have the following situation:
Case (i): ex = p = 0, i' > o.
Case (ii): 2 2 + p2 #: 0; lal, IPI, Ii'I are sufficiently small.
Case (Hi): ,p, i' = arbitrary.
In the latter case, the principal elliptic part - £\u corresponds to a strongly
monotone operator, whereas the lower order terms correspond to a compact
perturbation of that operator. Let / e L 2 (G).
In Cases (i) and (ii), the boundary value problem (8) has a unique general-
ized solution u e J41 2 1 (G).
In Case (iii), problem (8) has a generalized solution u e W 2 1 (G) iff
t/WdX = 0
for all generalized solutions M' e J41 2 1 (G) of the dual problem (8.).
Here, we set
a(u. (') = fG (u(v( + u"v" + o!u(v + pu"v + yuv)dx.
b(t.) = t Iv dx.
and the generalized problem corresponds to (P.) with X = J41 2 1(G).
(8)
(8.)
Eigenvalue Problems
Along with the problem (p.), we will also consider eigenvalue problems of the
Corm
(P,,) Aa(u, v) = b(u, v) for all veX,
where X is a real H-space. We seek a nonzero u e X and a real number l
which satisfy (P,,). We assume that a, b: X -+ R are bilinear, symmetric, and
bounded. Further assume that a is strongly positive and that b is compact and
strictly positive.
318
22 Hilbert Space Methods and linear Elliptic Differential Equations
Then we will show that problem (P b ) has an infinite number of eigenvalues
in the case where dim X = XI,
The idea of proof is to reduce (Ph) to the equivalent operator equation
( Ps)
i.1l = Bu..
where B: X -+ X is symmetric.. contpact.. and strictly positive, Thus.. it is pos-
sible to apply the Hilbert-Schmidt theory to (Pit) and hence to (Ph)' The
operator B is defined through the relation
h(u.. to) = (Bult..)£..
forall u..reX,
where (u I r)£ = £1(u.. r) is the so-called energetic scalar product on X, which is
equivalent to the original scalar product (Ull') on X.
In connection with (P b ) and (Ps). we will also study the Courant maximum-.
minimum principle, In Part III we will consider the Ljusternik-Schnirelman
theory for nonlinear eigenvalue problems which is a generalization of that
principle.
As a typical example let us consider the following boundary- eigenvalue
problem:
-;..1u = u on G..
II = 0 on iG,
where G is a bounded region in 1R2, Then the corresponding generalized
problem is given by (P b ) with X = W 2 1 (G) and
a(u. r) = fG (u [': + 11,,(',,) dx.
b(lI. r) = [ ur dx,
JG
In this chapter, in addition to the problems mentioned above, we also
investigate the following topics:
(a) Convergence of the Ritz method and the Galerkin method,
(b) Acceleration of the convergence of the Ritz method by means of the duality
trick,
(c) Duality and t,,'o-sided a posteriori error estimates for the Ritz method.
(d) The method of finite elements,
(e) Fredholm alternatives for integral equations.
(f) Regularity of generalized solutions.
In connection with (f), we will obtain the following general principle for
linear strongly elliptic differential equations:
The soilitions are smoother, the smoother the data are, and, similarly.. the
cont'ergence of the Ritz and Galerkin methods is faster, the smoother the
solutions are,
The logical structure of this chapter is pictured in Figure 22.1.
319
22. Hilbert Space Methods and Linear Elliptic Differential Equalions
.-..
...0'
u- rJ
"0 ci >
:t (0 .-
.c ec;
/ (5 E
I .c
.i j -;
c.::
+
c
o
.-
... -
:s "O
iT .s -: :;
... .c co
0
... 1-.. &
. 8. 1:: 'n
g '0
j :t -S
+ U
.-
.
i
U)
E
..
U
c
.-
-
:s
..
J2
C
o
.-
i
S-
-
N
N
f
co
ti:
U
C
o
o
u C
> 0
.- e
U .
_en
. &. ;g eo
C C C
.C Qb.g e
Q.c...= 0
fjgd:;;i
c en C ...
u...:.= c
... 0.-.- g.
.!! .D :. 0
.c
U
en
C
en.g
e ..
U to
_ :s
-gi
... -
Q.
U .=
:s c
-; f
>
' :.a
u
-0 .-
C
:J .9-
.81i
...
cSa
i-
(
'..I
1\.1
.i i j
c1_
'- +
o
E
o
u
.c
..
.
00
-
I
ci
u
-
e
f
B
.c
..
.= c
.8
.- 0
E 00
-=
.9.
.!! u
- .c
e ..
z.--
e
E
.c
u .-
- E
c-
.ts u
C .=
. .C tU
Q.19
a-:C
U ... 0
.. 0 ...
. ___ C.
.c
U
00
-
ci
o
320
22. Hilbert Space Methods and Linear Elliptic Differential Equations
22.1. Main Theorem on Quadratic Minimum
Problems and the Ritz Method
We consider the variational problem
ia(u. u) - b(u) = min!,
together with both the variational equation
U E X,
(2)
a(U.l\) = h(v)
and the operator equation
for fixed U E X and all v E X,
(3)
Au = b.
U E X.
(4)
As we shall see below, problems (2), (3), and (4) are mutually equivalent, where
(Al,. 1') = a(u. v) for all II, VEX. (5)
We call (3) and (4) the Euler equation to (2). Set
F(w) = !a(w, "') - b(\v).
By definition, u is a strict minimum of (2) iff F(w) > F(u) for all t E X with
\\' :I: II.
Along with the variational problem (2) we consider the approximate Ritz
problem
!a(u", u,,) - b(u,,) = min!.
together with the Ritz equation
U" EX".
(6)
a( u". (') = b( to)
for fixed u" EX" and all l' E X"'
which can be written in the following form:
a( II", "'k") = b( W k ,,),
k = 1. .... n'.
(7)
Here, we set
II'
U" = L Ck" ""'k,,.
'=1
The elements "'k" are given. Then problem (7) represents a linear system of
equations for the unknown real coefficients C III' .... CII-". As we shall see below,
problem (6) is equivalent to (7). We make the following assumptions:
(H 1) Let X be a real H-space. The map a: X x X -+ R is bilinear, symmetric.
bounded, and strongly positive, i.e., there exist positive constants c and
d such that
la(u,v)1 dllullllvll,
a(u, II) cllull 2 for all u, VEX.
(H2) The functional b: X -+ R is linear and continuous, i.e., b E X..
(H3) Suppose that X is separable and infinite-dimensional, and let (X,,) be a
Galerkin scheme in X, where X" = span {Will'." , w lI .,,}.
22.1. Main Theorem on Quadratic Minimum Problems and the Ritz Method 321
Theorem 12.A (Quadratic Minimum Problems). Assume (H 1) and (H2). Then:
(a) Existence and uniqueness. The original quadratic minimum problem (2) has
a unique solut;on u.
This solution corresponds to a strict minimum.
(b) Equivalence. Problem (2) is equivalent to the Euler equation (3).
(c) A priori estimate. The solution u of the original problem (2) satisfies the
inequalit }'
lIull s c-1Ilbll.
(8)
(d) Convergence of the Ritz method. In addition, assume (H3). Then, for each
n e N, the Ritz equation (7) has a unique solution u., and, as n -+ 00, the
sequence (u,,) converges in X to the unique solution u of the original minimum
problem (2).
The Ritz equation (7) is equivalent to the Ritz problem (6).
This theorem, together with the corollaries below, allows many important
applications to elliptic boundary value problems (cr. Section 22.2) and to the
theory or elasticity (cr. Part IV). In terms of elasticity, problem (2) allows the
following physical interpretation:
u = displacement of the elastic body,
!a(u, u) = elastic potential energy of the body,
b(u) = work of the outer forces.
Then the original minimum problem (2) corresponds to the so-called principle
of minimal potential energy.
By definition, the energetic space X E is equal to the set X equipped with the
so-called energetic scalar product
(ul V)E = a(u, v)
for all u, veX.
This notation is motivated by the interpretation in terms of elasticity.
Corollary 22.1 (Equivalent Operator Equation).
(i) Assume (H I) and (H2). Then the original quadratic minimum problem (2) is
equivalent to the operator equation (4). The operator A: X -+ X. given by
(5) has the following properties:
(P) A ;s linear, continuous, symmetric, and strongly monotone.
(ii) Conversely, let A: X -+ X. be an operator on the real H-space X with (P).
Then, for each be X., the operator equation Au =- b has a unique solution
u e X, and this equation ;s equivalent to the quadratic minimum problem (2).
PROOF OF THEOREM 22.A. Ad(a), (b). This is a special case of Theorem 18.A in
Section 18.8.
322
2:! Halbert Space Methods and Linear Elliptic Differential Equations
Ad(c). It follows from a(II, t) = h(t) for alii' E X that
c liUl1 2 < a(u, u) = b(u) < IIbllllull.
and hence (" II II II < II h II.
Ad(d).
(I) Existence and uniqueness of the solution of the Ritz equation. Replacing
X by X". it follows from Theorem 22.A(a), (b) that the Ritz problem (6)
has a unique solution, and that (6) is equivalent to the Ritz equation (7).
(II) Convergence of the Ritz method. The key to the proof is the orthogonality
rela t ion ( 10) below.
Subtracting the Ritz equation (7) from the Euler equation (3). we obtain
a(u - II". r) = 0 for all (' EX". (9)
Letting (' = II". we get a(1I - "".11,,) = 0 and hence
a( II - II". II - u,,) = a(u - " 11 , U - t')
for all t' EX".
(10)
This yields
C 1111 - 11".1 2 < a(u - II". U - U,,)
= a(u - u".u - l') < dUu - 11,,111111 - ('II
for all (' E 4f\' n. Hence
d -I (" 1111 - u,,11 < inf lIu - t'l! = distx(u. XII). (11)
lEA'"
Since (X,,) is a Galerkin scheme in X. dist",(u, X,,) 0 as 11 x. Hence
II" U as 11 -+ x. 0
Corollary 22.1 follows immediately from (I a).
22,1 a. Error Estimates
Recall that f'(".) = 2 -1 a(,,'. ",) - b(\\').
Corollary 22.2. Assume (H 1) trough (H3). Let u and U II he the solution 0.( tile
or;g;'lal m;ni,nunl problenJ (2) and the Ritz equation (7), respect;('ely. Let '" E X
be gil"en (e.g.. ". = u,,). Then:
(a) Upper bounds for the minimal value. There holds F(u) F(\\').
(b) Error estimates for u via lower bounds for F(u). If 2 ;s a lo\ver hound .for
the nlinimal talu(! F(u). i.e.. S F(u), then
c
- lIu - "'11 2 F(,,') - .
2
( 12)
(c) A posteriori error estimates for u. Suppose that there is a constal1t C > 0
22.1. Main Theorem on Quadratic Minimum Problems and the Ritz Method 323
suel, that
la(\\'t t') - b(t')1 s Cllt'li
for all t' e X.
Then
ellu - wI! C. (13)
(d) A priori error estimates for the Ritz method. For all n e f\i,
lIu - u..11 s c- 1 d.dist x (u, X..). (14)
Remark 2%.3 (Lower Bounds and Duality). According to (b) it is important to
know lower bounds for the minimal value. In this connection, one uses the
methods of general duality theory which will be considered in Part III. In
Section 22.3 we will investigate the method of orthogonal projection which
represents a special duality method.
It follows from (14) that the rapidity of convergence of the Ritz method
depends on the quantity distx(u, X..) of approximation theory. Thus, as in
Section 21.16, we obtain the following rough principle:
The cOllrergence of the Ritz method is .faster, the smoother the solution u is.
PROOF. Ad(a). Note that F(u) < F(n') for all "' E X.
Ad(b). A simple calculation shows that
F(u + r) = 2- 1 a(t\ r) + (a(u, r) - b(('» + 2- 1 a(u. II) - b(u).
By (3), a(lI, I') - b(f) = O. This implies
F(u + I') = 2- 1 a(t., 1') + F(u)..
and hence
F(u + l) - F(u) 2 -I e 111'11 2 for all I' eX.
Ad(c). By (3), a(u, u - ",) = b(u - \\'). Hence
ellu - \\'11 2 S a(u - \\',U - ".) = b(u - ".) - a(,,',u - ",) S Cllu - "'11.
Ad(d). This follows from (II). 0
22.1 b. The Abstract Green Operator
As a special case of the Euler equation (3), we consider the equation
a(u, t') = h(1') for fixed u e X and all I' e X.. (15)
where
b(l') = (fll')y
for all r' e Y..
and we assume:
(H) Assumption (H I) of Theorem 22.A is satisfied. where Y is a real H-space
with the continuous embedding X s; Y.
324
22 Hilbert Space Methods and Linear Elliptic Differential Equations
Let fEY be given. Then
Ib(t')i S Ufllrlll'lI r S constllfllrlll'lIx
( 16)
for all (' EX. Hence hEX.. By Theorem 22.A. equation (15) has a unique
solution 1l EX. and we set
1l = K.f.
The operator K is called the abstract Green operator or the solution operator
to (15). If (15) corresponds to an elliptic boundary value problem, then K
corresponds to an integral operator whose kernel is the classical Green
function.
Corollary 22.4. A SSIIIne ( H). Tlte'1:
(a) The operator K: Y Y ;s linear continuous, monotone, and symmetric.
(b) If the embedding X c Y is compact, then K: Y Y is compact.
PR()()F. By Theorem 22.A and ( 16),
II K.fll." < ,,-'lIbU < const IIfll r
for all fEY.
Hence K: Y --. X is continuous. The continuity (resp. compactness) of the
embedding X £ Y implies the continuity (resp. compactness) of K: Y -+ Y.
The symmetry of K: Y Y follows from
(J'I Kg))' = a(K.r. Kg) = a(Kg, KJO) = (gl Kf)r for all f. g E Y.
Hence (Kfl.fh. = a(Kf. Kf) > 0, i.e., K is monotone. 0
22.1c. The Duality Trick and Refined Error
Estimates for the Ritz Method
Let 11 E X be the unique solution of problem ( 15), i.e.,
a (II, I') = b ( I' ) for a II l' EX.
( 17)
with b(r) = Cf!r)y, and let u" E X" be the solution of the corresponding Ritz
equation
a(u", t') = b(t,)
for all l' E XII'
( 18)
We make the following assumptions.
(A I) Assume (H I) and (H3) of Theorem 22.A concerning X and a(', ').
(A2) Let X. Y, and Z be real H-spaces with the continuous embedding X c Y.
(A3) A prior; estimate. There is a constant C > 0 such that U E Z and
lIuU z Cllfllr
for all fEY and for all the corresponding solutions U E X of the original
problem (17).
22.2. Application to Boundary Value Problems
325
(A4) Approximation property. For all z e X () Z and n = 1, 2, ...,
distx(z, X,,) S cx"lIzllz,
where cx" is a constant.
Corollary 22-S. Assume (A I )-(A4). Then we obtain the following error estimates
for the Ritz method:
lIu - u"lI x S cx,,(c- 1 dC) 11/11 r lor all fe Y.
In the space Y, we get the stronger error estimates
lIu - u"lI r S :(c-ldzCz)lI/lIr
for all fe 1':
where, n = I, 2, . . . .
This result is a special case of Corollary 22.21 whose proof is based on a
duality trick.
As we shall see in the next section, this result is responsible for the high
speed of the convergence of the method of finite elements in the space 1': For
example, in the case of the first boundary value problem
-u=f onG, u=O oncG,
where G is a bounded region in R 2 with aG e Coo, we may choose
Z = W Z 2(G), X = Jtl(G), Y = Lz(G).
Then, for each feY, we obtain that the solution u belongs to X f\ Z. More-
over, we obtain that cx" is proportional to the maximal diameter of the finite
elements (triangles) in X". Note that the functions in Z are smoother than those
in X.
Now it follows from Corollary 22.5 that the method of finite elements
converges in Y like cx .... 0 as n .... 00, i.e., we have quadratic convergence in Y
with respect to the maximal diameters of the triangles corresponding to the
triangulation of the region G.
22.2. Application to Boundary Value Problems
In this section we use the following convention.
G is a bounded region in R N with N I, x = (I"." N)'
and D, = olo,. (19)
22.2a. The First Boundary Value Problem
We consider the homogeneous boundary value problem
- L\u = I on G,
u = 0 on aG.
(20)
326
22 Hilbert Space Methods and linear Elliptic Differential Equations
We set
f 4"
a(u" v) = D i uD i l' dx,
G i= I
b( t.) = t .fi' dx.
Definition 22.6. Assume (19). Let X = W 2 1 (G). Then the generalized problenl
to (20) reads as follows. For given fe L 2 (G), we seek u e X with
a(u. r) = h(l') for all (. eX. (21 )
This definition is motivated by Section 18.3. Formally, we obtain (21) from
(20) by multiplying (20) with r e Co'(G) and by using subsequent integration
by parts. Note that the functions u in W 2 1 (G) satisfy the boundary condition
U II = 0 on ?G" in the generalized sense. According to Section 22.1, problem
(21) corresponds to the variational problem
a(u, 14) - b(u) = min!. II EX. (22)
This is exactly the Dirichlet problem
t ((DjU)2 - ')dX = min!. U E W 2 1(G).
Proposition 22.7. All rile assertions of SecticJlI 22.1 hold .for the tn'o problems
(21) and (22).
III part iCll/ar. Jc)r eClch f E L 2 (G), the generali:ed .first boundary t'a/ue problem
(21) ha... a IIII;que soilition u EX. \\'h;ch is also the unique solution of the
('ar;at;oltal proble,n (22).
Corollary 22.8. LeI Y = L 2 (G). Denote by KJ' the 5011llion II of (21). Then tlJe
operator K: Y -... Y i. Iinec!r'I monotone, compact, and symmetric.
Classically, the solutions of (20) are given by
u(x) = t k(x, y)f{y) dy.
where k('. .) is the symmetric Green function. The operator K generalizes this
integral operator. i.e.. u = 1\(.
PR()()F ()f- PRC)P{)SITI()N 22.7. By ( I b), b EX..
On "\,, we have the norm !lull = lIu1l1. 2 . Moreover" a(u" u) = nUIl.2,O.
Obviously. the map a: X x X .-. R is bilinear and symmetric. By the Holder
22.2. Applicalion 10 Boundary Value Problems
327
inequality,
la(u,t')1 S L f 1 D,u D,t'l dx
; G
S (t ID,uI 2 dx ) 1/2 (t ID/('1 2 dX) 1/2
sNlluflllvl1 forall u,veX.
Hence, a is bounded. By Section 21.3, the norm 11-11 1 . 2 is equ;t'alent to the norm
It '11,.2.0 on X. This implies
a(u,u) > cllull 2 for all u e X,
i.e., a is strongly positive.
Now, Theorem 22.A shows that (21) and (22) have unique solutions. 0
By Section 21.3, the embedding X Y is compact. Thus. Corollary 22.8
follows from Corollary 22.4.
In what follows we use the results of Section 22.1 in order to obtain
additional information on the first boundary value problem (20). Recall that
N = dimG.
22.2b. Regularity of the Generalized Solutions
Let u e W 2 ' (G) be the generalized solution of(20) for f e L 2 ( G). In Section 22.19
we will prove the following crucial result.
(i) Sobolet spaces. Let k = 0, I.... . Suppose that the boundary cG is smooth,
i.e., cG e CX. Then.
Ie Wl(G) implies u e Wl+ 2 (G).
Furthermore, there is a constant D, such that
lIull'+2.2 S D,lIfU,.2 for all fe W 2 '(G).
In particular, Ie L 2 (G) implies u e WI(G).
It follows from the Sobolev embedding theorems in Section 2 J.3 that
u e W 2 '+ 2(G) implies u e C"'(G)
in the case where k + 2 > m + N12. If k > N12.. then m 2.. and u is a
classical solution of the first boundary value problem (20).
In Chapter 6 we used the following sharp Schauder estimates.
(ii) Holder spaces. Suppose that cG e C"'+2." where m = 0.. I, . . . and 0 < P < I.
Then,
Ie C""'(G)
implies
u e C",.2"(G)..
and
II u IIc....) 'fG) S const 1I/lIc... 'fii,.
328
22 Halbert Space Methods and linear Elliptic Differential Equations
Below we will show that these results are important for estimating the
rapidity of convergence of the Ritz method.
22.2c. The Inequality of Poincare-Friedrichs and
a posteriori Error Estimates
The following error estimates depend on the constant Co of the Poincare-
Friedrichs inequality
Co f 112 dx < f .f (D j U)2 dx for all U E W 2 1 (G). (23)
G G ,-:J
This implies
c f ( U2 + .f (D i u)2 ) dx < f .f (D j U)2 dx (24)
G .=1 G .=1
for all U E Jtt'i(G), where c = co/(I + co), i.e., a(u,u) cllulli.2' Moreover, the
defect of a function ". with respect to the equation u + f = 0 on G is defined
to be
Of = (t (t\w + f) 2 dx)"2
Corollary 22.9. Suppose that G satisfies (19) "'ith oG E co. I , and let u E W 2 1 (G)
be a generalized solution of the first boundary value problem (20). Choose an
arbitrary junction \\.' E C 2 (G) with \\-' = 0 on aG. Then we obtain the lollo\,,;ng
error estimates:
Ii II - \\t' 11 2 < 'J.;' co'
II U - "'II.. 2 < 'J.jc.
(25)
Recall that 1Il''' = J G l"2dx and IIl'lIi.2 = JG(l2 + Li(D i l1)2)dx. By Section
22.13, the largest possible value of Co is equal to the smallest eigenvalue Ii of
the problem:
- l' = JJI' on G,
l" = 0 on oG.
Sharp estimates for Co can be found in Michlin (1981, M).
PROOF.
(I) Integration by parts yields
la(w, I') - b(v)1 = L (t\w + f)vdx
S (X II V 11 2 S 'J. II (,' II.. 2 for a II v E W 2 1 ( G). ( 26 )
By t I 3), c \i U - \"111. 2 < 2.
(II) It follows from (23) that
coli u - w II a (u - "', u - \").
22.2. Application to Boundary Value Problems
329
Since u is a generalized solution of (20), a(u, U - w) = b(u - K'). By (26),
collu - wll s Ib(u - w) - a(w,u - w)1 S txllu - w1l 2 . 0
22.2d. The Convergence of the Ritz Method
We make the following assumption.
(H) The set G satisfies (19), and (X,,) is a Galerkin scheme in the Sobolev space
X = WltG). Let X" = span{w.",...,w",,,}.
According to Section 22.1, the Ritz equation, corresponding to the first
boundary value problem (20), is given by
a(u", v) c: b(v) for all (' eX". (27)
We seek u" e X"' i.e.,
U II = CI"K'." + ... + C"'"W II .,,.
Equation (27) is equivalent to the system
a(u ll , "',,,) = b(K',"),
k = I,..., n'.
Explicitly, that means
f i D,u"D,w""d.1C == f fw""dx. k == ..,... n', (28)
JG 1-. JG
This is a linear system for determing the unknown real coefficients c t", . . ., Cft'II'
The following statement follows from Section 22.1 a.
Corollary 22.10. Assume (H). Let u e X be the generalized solution of the first
boundar}' talue problem (20) for given f e L 2 (G). Then, for each n e N, the Ritz
equation (28) has a unique solution u", and the Ritz method cont'erges in X, i.e.,
lim lIu - U"III. 2 = o.
"-X
The rapidity of convergence is given b)'
lIu - u"I1.. 2 S c- I N distx(u, X..), (29)
"'here N = dim G and the constant c has been introduced in Section 22.2c.
EXAMPLE 22.11. Let N == I. We consider the first boundary value problem
- u = f on G, U = 0 on aG
in the special case of the finite interval G = ]Y. [. Let / e C""'(G), where
m = 0, I, . . . and 0 < fJ < I. Let X" be the set of all the following polynomials
v(x) = (x - y)(x - )(ao + a. x + ... + a..x").
Note that v satisfies the boundary condition.
330
1 Hilbert Space Methods and Linear Elliptic Differential Equations
Then the rapidity of convergence of the corresponding Ritz method (28) is
gi ven by
cons t .
till - "" II.. 2 < m + Ii 11.1 lie'" '(()'
n
It = I. 2. . . . .
Hence. the convergence is faster, the smoother f is.
In this connection. \\'e have
"',," ( x) = (x - .)( x - <5) x" - I .
k = 1.....11 + I.
PR()()F. By Section 22.2b.
I'uqc,.,. 'tG) < const lI.fll e '" I(G)'
Moreover. approximation theory (cf. A 2 (57)) tells us that
. con st
dlst.\'(II. ,,\',,) < -;;;+I +i 'Iull c '".: 'tGt.
n
o
22.2e. The Method of Finite Elements
Generally. the Ritl method \\'ith piecewie polynomial basis functions is called
the finite-clement method. This method has the following nice properties.
fa The finite clements can be easily implemented on computers.
(b) The coefficient matrix of the Ritz equation is a sparse matrix. i.e.. most
entries are equal to zero.
(c) One may take the specifics of the problem into account. For example. one
can use an extraordinarily fine triangulation in subregions, where a subtle
behavior of the solution is expected (e.g., special parts of airplanes).
Today the method of finite clements is one of the most important numerical
methods for solving problems in natural sciences and engineering.
To explain the basic idea of this method, we consider the first boundary
value problem
-- 11 = I on G.
u = 0 on iG.
(30)
where G is a bounded region in 1R2. Let fl;') be a sequence of triangulations
of G which has the follo\\'ing two properties:
(a) It" -+ 0 as n -+ x. where II" denotes the maximal diameter of the triangles
of T,..
(b) inf" <5" > 0.. where (j" denotes the minimal angle of the triangles of 1 (cf.
Fig. 22.2).
We consider the following special finite elements with respect to T,.:
o
\\"t"(x) = I
o
for all the node points x E c'G..
for a fixed node point x E G..
for the remaining node points.
22.2. Application to Boundary Value Problems
331
/G
(a)
(b)
Figure 22.2
Further the function ",,, is extended to G via linear interpolat;o'J. We set
X" = span {"'I"..... "'".,,}. Then the space X" consists exactly of all piecewise
linear functions with respect to the triangulation T", which vanish at the
boundary.
Let f E L 2 (G). Moreover. let u be the generalized solution of (30), and let
II" be the solution of the Ritz equation (28) with respect to the finite element
space X".
Suppose that cG e ex. We now use the following two important estimates.
(a) Regular;t.'". By Section 22.2b. u belongs to the Sobolev space Wl(G) and
111111 2 . 2 S const Itfllz.
(b) Approxi"rat;on theory. Let X = WzJ (G). We prove in Appendix A 2 (59) that
distx(u, X,,) S const. h"IJuliz. 2.
Coroll.ry 22.12. We haL'e tIre follo,,,'ing error etimQtes:
Itu - u,,1I1. 2 < const. hIt II/If Z,
"u - II" 11 2 S const . h: IIf1l2.
n = I, 2. . . . .
(31 )
(32)
Consequently, the method of finite elements for (30) converges linearly in
the space W 2 1 (G), and it converges quadrat;call}' in the space L 2 (G).
PROOF. Ad(31). This follows from Corollary 22.10 and from (a) and (b) above.
Ad(32). Set Z = W 2 2 (G). X = WZ1(G), and Y = Lz(G). Then the assertion
follows from the refined error estimates obtained in Corollary 22.5 and from
(a) and (b) above. 0
22.2f. The Second Boundary Value Problem
We consider the second boundary value problem
- u = f on G.
OU
- = 9 on eG.
en
(33)
332
22 Hilbert Space Methods and linear Elliptic DilTerential Equations
We set
f N
a(u, f') = L D,uD,v dx.
G ;=1
b(t,) = f .Ii'dx + f gt' dO.
G G
Problem (33) is not uniquely solvable since u = 1 is a solution of the corre-
sponding homogeneous problem. In order to get a unique solution, we add
the normalization condition
L II dx = O.
(33.)
Definition 22.13. Let X = {u E W 2 1 (G): JGudx = O}. where G is a bounded
region in fR'\'. N > I. with ?G e CO. I. Then the generalized problem to (33) reads
as follows. For given Ie L 2 (G) and g e L 2 (cG) with
b( I) = f .f dx + f 9 dO = 0,
G rG
(34)
we seek u e X such that
a(u, 1:) = b(t,)
for all v EX.
(35)
This problem is motivated by Section 18.4. Formally, we obtain (35) from
(33) by multiplying the first equation of (33) with v e CJ(G) and by using
subsequent integration by parts. Letting v = I, equation (35) yields b( 1) = O.
This is (34).
The corresponding variational problem is given by
a(u. (4) - b(u) = min!,
u e X.
(36)
Proposition 12.14. All the assertions of Section 22.1 are l'alid for the t,,'o
problem... (35) and (36).
III particillar, the generalized second boundary value problem (35) has a unique
solution U EX. ,,'hich is also the unique solution of the variational problem (36).
PROOF. Since W 2 1 (G) C L1(G), X is a closed linear subspace of W 2 1 (G). Thus,
X is a 8-space with the norm II u II J . 2'
(I) We show that he X*. By Section 21.3, each function VE Wl(G) has
generalized boundary values on cG belonging to the space L 2 (cG).
Furthermore..
LCi ("2 dO const II t'li t 2
for all I.' E W 2 1 (G).
22.2. A ppJication to Boundary Value Problems
333
By the Holder inequality, for all v E W 2 t (G),
Ib(v)1 S i fvdx + i gvdO
G fG
S 11/11211 vll 2 + const IIglI LJ(Gtll vII.. 2
S const II vII.. 2'
(II) We investigate a(', '). As in the proorror Proposition 22.7, we obtain that
a: X x X .... R is bilinear, bounded, and symmetric.
By Section 21.3, the norm
lIullt,2 = (L (D,U)2d. + L udx 2)"2
is equit'alent to the norm 11'111. 2 on W 2 1 ( G). For all u EX, J G U dx = O.
Hence
lIull!.2 2 = a(u,u) cllullf.2
i.e., a is strongly positive.
Theorem 22.A yields the existence of a unique solution u E X for (35) and
(36). 0
for all U EX,
22.2g. The Third Boundary Value Problem
We consider the third boundary value problem
- u = f on G,
OU
on + hu = g on aGo
(37)
Here, it is important that
h(x) ho > 0
for all x E cG and fixed hoe
(38)
We set
a(u, v) = i f D,uD,vdx + i huvdO,
G'-. G
b(v) = i fvdx + i gvdO.
G G
Definition 21.IS. Let X = Wl(G), where G is a bounded region in R N , N I,
with oG e Co. t. Let h e C(oG) with (38). Then the generalized problem to (37)
reads as follows. For given Ie L 2 (G) and 9 E L 2 (oG), we seek II E X such that
a(u, v) = b(v) for all VEX. (39)
334
22 Hilbert Space Methods and Linear Elliptic DilTerential Equations
This problem is motivated by Section 18.4. Formally, we obtain (39) from
(37) by multiplying the first equation of (37) with t' E C(G) and by using
subsequent integration by parts.
The corresponding variational problem is given by
!a(u, II) - h(u) = min!. u EX. (40)
Proposition 22.16. All th(' assertions <?( Sect;O'J 22.1 are I'alid .for the 1\\'0
problem!; (39) and (40).
lit particular, tile generalized third houndary problem (39) lias a unique
soillt ion " E X \\'hiclJ is a/so t he unique SO/lit ion of the ('ariat ional problem (40).
PR()()f- .
(I) As in the proof of Proposition 22.14, it follows that b EX..
(II) The map a: X x X -+ R is bounded because
f hUt"dO maxi hi f ut,dO
iG cG
< maxIIJllluIl 2 .i\GllvIl2.('\G S const lIull1. 2 11l'II 1.2-
Obviously. " is bilinear and symmetric.
By Section 21.3, the norm
ilull.*2 = ( [ (DjU)2 dx + f U2 dO ) ' 2
JG , cG
is equitale'u to the norm II. iI,. 2 on X. Because h > ho > 0 on ('G, we
obtain
a(II, u) > cllullf.2
i.e... a is strollgly positil'e.
Theorem 22.A yields the existence of a unique solution for (39) and (40).
o
for all u EX,
22.2h. The Mixed Boundary Value Problem
The following problem
-u==f onG,
II = g on I G.
CU
--- = h on r 2 G.
en
is called the mixed boundary value problem. Here, we assume that cG =
c.Gur 2 G and j\G",r 2 G = 0- In the special case where 02 G = 0 (resp.
(41 )
22.3. Orthogonal Projection, Duality, and a posteriori Error Estimates
335
C a G = 0) we get the first (resp. second) boundary value problem. Generalized
solutions of this problem will be investigated in Theorem 25.1 in connection
with linear and nonlinear conservation laws. Mixed boundary value problems
also play an important role in elasticity. This will be considered in Chapter 62.
The first (resp. second and mixed) boundary value problem is also called
the Dirichlet (resp. Neumann and Zaremba) problem.
22.3. The Method of Orthogonal Projection,
Duality, and a posteriori Error Estimates
for the Ritz Method
Our goal is to explain the basic idea of dualit}, theorJ in simple geometrical
terms. Our approach is based on the Pythagorean theorem in H-spaces
(Fig. 22.3).
We set
F(u) = IIg - ull 2 ,
F*(r:) = -lig - vll 2 + IIg1l 2 .
Along with the original minimum problem
F(u) = min!, u e X,
we consider the dual maximum problem
(42)
F.(v) = max!,
t. e 1':
(42.)
As we shall show below, problems (42) and (42.) have the same extremal value.
This is important for obtaining error estimates for the original problem (42).
The Ritz problems to (42) (resp. (42*» are given by
F(u,,) = min!,
u ll e X"
(43)
( resp.
F.(v lI ) = max!,
VII e 1';.).
(43.)
y
\'
x
u
Figure 22.3
336
22. Hilbert Space Methods and linear Elliptic Differential Equations
According to (7) the corresponding Ritz equations are obtained through
(fill - gtu.,,) = 0.
k= I,...,n'.
(44)
wit h II" = C I "II 1" + ... + C,,' II U"',, (re s p.
(I"" - g'L'.,,) = O.
k = 1, .... ,n',
(44.)
with r ll = d 1,,1'111 + ... + d", "l",.,,). Equation (44) (resp. (44.)) is a linear system
for the unknown real coefficients c." (resp. d",,). We make the following
assumptions.
(H I) The space X is a closed linear subspace of the real separable H-space Z.
(H2) The space Y is the orthogonal complement to X. i.e..
Y = {I' E Z: (ult') = 0 for aliI' EX}.
(H3) (XII) (resp. (f,,)) is a Galerkin scheme in X (resp. Y).
(H4) Let g E Z be given.
Theorem 22.8. Assll,ne (H I) through (H4). Then:
(a) Existence and uniqueness. The original problem (42) and the dual proble",
(42.) l1as the uniqlle solution u and t'. respecrit'ely. We have the extremal
relat ion
g = u + L',
(45)
i.e., u (re.4ip. r) is the orthogonal projection of g to X (resp. Y). Furthermore,
f.(u) = F*(t,).
(b) Convergence. For each n E N. the Ritz equations (44) and (44.) hal'e tile
unique solutio'is U" and VII' respectit'ely. and
lim 1111 - u"lI x = 0,
lim Ill' - l'lIlIr = o.
II-X
,,- D
(c) A posteriori error estimates. The minimal value F(u) of the original problenl
(42) satisfies
F.(t,,,) < F(u) F(Il,.).
and tile solution II 0.( (42) satisfies
II u - II" It i < F ( II,,) - F. ( l'" ).
(46)
(47)
A more general result will be proved in Problem 22.1.
In contrast to Section 22.1, the solution of the two Ritz problems (44) and
(44.) allows us to obtain t,,'o-sided error estimates for the minimal value F(u).
If we have computed the solutions U" and v"' say, by the finite-element method
on a computer, then we also obtain estimates for u. by (47). In elasticity, F(u)
is the elastic potential energy of the elastic body, and u is the displacement.
The Ritz equation (44.) is also called the Trefftz equation.
22.4. Application to Boundary Value Problems
337
PROOF. The existence and uniqueness results as well as the convergence of the
Ritz method follow from Theorem 22.A. In this connection note that
F(u) = (ulu) - 2(ulg) + (gig),
F.(v) = - (vi v) + 2(vlg).
By Theorem 18.C, we have the unique orthogonal decomposition
9 = u + v,
U E X, V E Y,
where u and v are solutions of (42) and (42*), respectively. Since
IIgll 1 = lIull 1 + IIvU 1 ,
we obtain that F(u) = F*(t'). It follows from s; Y and X" s; X that
F*(l',,) S F.(t')
This yields (46). By (12),
and
F(u) S F(u,,).
lIu - u,,11 1 S F(u,,) - F(u).
Since F(u) = F*(v) > F*(v,,), we get (47).
o
22.4. Application to Boundary Value Problems
We consider the first boundary value problem
-.1t' = 0 on G,
(48)
v = g on cG.
The key to our approach is the fact that
(ulv) = f t D,uD,vdx + f uvdO
JGI 8 1 G
is an equivalent scalar product on the Sobolev space W 2 1 fG), by Theorem 21.A.
We set
Z = {t l E Wz'(G): f..G vdO = o}.
Then Z is a real separable H-space equipped with the scalar product (.1.).
Moreover, we set
x = W 2 1 (G) and Z = X Ef) Y,
i.e., Y is the orthogonal complement to X.ln the following proof we will show
that
y = {v E Z" C:X>(G): t\v = 0 on G}.
This important result is due to Hermann Weyl (1940).
(49)
33R
22, Hilbert Srace Mlhods and Linear Eflirtic Differential Equa'ion
We make the rollowing assumptions.
(H I) G is a bounded resion in R N , N > 1,' itb (1G e Co. '.
(H2) We choose linearly independent functions ".., . . ., u.. E X and "I.., . . .,
v...... Y, i.c
Utlt = 0 on ('G,
Ie = It..., n'.
AI'.. -= 0 on G,
and we conidcr the Rilz tquat;oll
f L DiC,,_ - Il)DiUt.J.'C '"" O.
J{i i
with u" = ('."U l11 + .., + C"-..II,,-,, 4Ind the 1,efftz ('quat;t,,,
J. I: Di(r" - g) Dit' ltll dx = 0, k = I...., m',
(; ,
with "a = dl..f'." + ... + d"...,t,..". Equation (SO) (resp. (51)) is a linear
system for the unknown real coefficients (',,, and _"'ff' respectively. If
u,,, e ('I (i) for all k, then integrution by p.rts yields that the TrclTtz
equation (51) is equivalent to the equation
f (1 l', II
(t'" - g) - .. ,10 ().
;'(i ,1"
Ie == I,... m'.
k = I, ...,n',
( 5())
(S I)
k = 1" ...,m',
(5 I il)
Proposition 22.17 t Method of ()rth()onal-J'rojection or Weyl (194(»)), Asl""e
(I-II), tfI2). IA t ' ye Z he' g;.'('". ;,t...y E W 2 J ((j) dl,d J,1(;ytlO = n. r/'Iu'n:
(a) Existence and uniqllenes. L,'I u h(J Ilr, t,rllll)gonul pr(ie('li('n I" f.I ()nt" the
S"ho/('" ."/,(U'(' W 2 ' ((j). 1'ht'" f' = II - " i. "It' IIniqlle .f)/ul ion (r lIlt f;r.1
b"",.Jl1rr I'li/,,(' prohlen. (4M).
,Wore prt'C'is(-'y. we "(H'e " E W 2 1 (G) n C'.'(G). (lnd I" is a ('/a.'is;cal,'\olutif)lJ
(,. "u' dff/(',,('ntial equal;"" AI' = 0 011 G. f.,lr".ermnre. "'e yel v = , 1'" (1G
in ,ht' ."t'n.fi( I" gelJtralizl'd hOIlHJt,r.v ,'a/rles.
(b) A po.If!ri"ri error cs(imnlcs. Fflr eu('" n € N, the Ri,z equation (SO) and Iltf!
Trt'Ri: ('qutltion (51) hOI'(' Ullit/lff' soilltions ".. ""d r:., resp('cI;l)(,ly. For the
SOllll io" f' fir II,L' fir." houlld"r.v r)ulu(' prfl/Jlem {4R we oh't';" I"t f'rror
(!.t;"'f't(.. ..
1111- I' - II n UtlaO $ I"" - gUt2.n + IIrft - gll.2.() -. IIgU z .
Ily 11,(, i"('IIIIalit}' of pO;'I(arf ".r;tdr;ch.'t. Wt get the addilional t!rror
('.'; nrtll ():
Confl-" - u..n n,/- II -1l,,11.2.0.
PR(XJI-, We use Theorem 22.8 and the lemma ur Weyl (Theorem IR.G). We
first prove (491. Lei f' e Y. Then, (ult;) = () for all U EX. i.e.,
r I: DiuDj,'dx = 0 for all u e Wl(G).
J(i ·
22.S. Main Theorem on LinC8r Strollflly Monotone Operators
339
.
By Section 18.15. l' is , weak solution of the Laplace equation AI} =-= 0 on G.
By the lemma of Weyi. IJ e c-'(G). and v is a classica] solution of the Laplace
equation.
It follows from fJ -= U + I' and" e JP 2 '(G) that u = 0 on 11G and hence " = II
on t'(j
The remaining asscrtion follow from Theorem 22 B. 0
22.5. Main Theorem on Linear Strongly Monotone
Operators al1d the Galerkin Method
We consider the equation
0(", v) == h(v)
for all "e x.
(52)
We seek II e X. The corresponding dual problem is given by
(/(V.II*) = ht,,) for all " EX. (52.)
l.Jcrc. we seck .,. E X. If CI(', .) is symmetric. then (52) and (52.) coincidc..
However. in contrust to Section 22.1, we do not assume here that 0('. .) i!;
symmetric. In the IUJIl.'iymmelr;c casc, equation (52) does not correspond to a
variational prohlen1. The nonsymmctric case i related to b()undary vtluc
prohlems for nonsyn1mctric elliptic diffcrcnti11 cquntions. Along with the
nriginul prohlcm (52) we consider the operator equ;ttion
Au == h
,
u eX,
(5:\)
and the Galcrkin cquntion
o(u.. ,,) = hh')
f()r all 'J EX",
where we seck II" EX". ".roln our ns!\umrlions below it follows that the
(iulcrkin equal inn is cqui\'.lIcot to the following equation
(1(14", "',) == h(M',)..
k = I . . . . . n,
(54)
with fI.. == C I" \"1 + ... + ('"" "'.. Iquation (54) IS a linear sytem ror the un-
k no\\'n rC1l1 coefficients (',,,.
We make the following assun1ptions.
(III) tel X be a real parahlc H..pace with the ba5i {M',.".2....}. lt
..'(" = spa n : "'1 ' . .. , M',. } .
(, 12) The map a: X x X -+ R is hilincar, boundcd, and strongly p<)itivc. i.e..
there exist poitivc constant f and cI uch that
Ifl(II. ")1 S d 111111 11"11.
utr/. u) (" nun 2
for all lit I' e X.
(IIJ) The runetiona) h: X -. R is linear nnd c()ntintlou.
340
22 Hilbert Space Methods and Linear Elliptic Differential Equations
Theorem 22.C. Assu"le (H 1) through (H3). Then:
(a) Existence and uniqueness. The original problena (52) has a unique solution
II EX, and n'e hare the a priori estimate
tlull < c- 1 I1b:!.
'rite dual prohleIPI (52.) hao a IlIlique solution u. EX. and 1111.11 < c -111 h II.
tb) Convergence of the Galerkin method. Let dim X = 7.;-. For each II E N,
the Galerk;'1 equatioll (54) lias a unique solution II" E X"' and
II" -+ U ;11 X
as n -+ 'X..,.
(c) A prior; error estimates. tOor each n E N,
II II - 11,,11 < c -1 d . dist x( u, X,,).
(55)
(d) A posteriori error estimates. Slippose thaI, for fixed '" EX, tllere is Cl
con.la'lI C > 0 .Il('h Ihat
la(,,'. r) - h(l')I < eliI'll
.for all I' EX.
Tllell
C 1111 - "'II < C.
(56)
Assertion (a also holds true in the case where X is not separable.
Corollary 22.18 (Equivalent Operator Equation).
(i) As.Il'Jle (H I) tlJro"gh (H3). Then the origin," equation (52) is equil'alent to
tIre operator eq"at;o" (53). Tire operator A: X -+ X. gil'en hy
<All, r) = a(ll, r)
lias tlte .follo\\';lIg properties:
(P) A i. linear, cOllli'Juolls. and .trolJg/J nJOIJotone.
for all u, (' E X
(57)
(ii) COlIl'ersely. let A: X ..... X. he CIII operator on the real H-space X \\'itlt (P).
Then, .for each hEX., tlte equation Au = b Itas a unique solution II E X,
alJd tltis equell iotl is equit'alenl 10 (52).
PROOF, Ad(a). The equivalence between (52) and the operator equation Au = h
follows from ( I a).
The existence of a unique solution U e X of Au = b follows from the main
theorem on linear monotone operators (Theorem 18.F in Section 18.11).
The equivalence between (52) and Au = b implies the existence of a unique
solution of (52).
The a priori estimate c lIuli IIbll follows from
cllll1l 2 a(u,u) = b(u) < IIbn lIuli.
22.S. Main Theorem on Linear Stronsly Monotone Operators
341
Since the dual problem (52.) has the same structure as (52), equation (52.)
has also a unique solution.
Ad(b). We use the Identification Principle 21.18 and set X == X.. Then
the convergence of the Galerkin method follows from the main theorem
on uniquely approximation-solvable linear operator equations (Theorem
21.G(c)).
Ad(c), (d). The error estimates are obtained as in the proof of Corollary
22.2. 0
In the preceding proof of (a) we used the main theorem on linear strongly
monotone operators (Theorem 18. F) whose proof was based on the theorem
of Riesz and on the perpendicular principle, i.e., on the idea of orthogonality.
In the following we want to give an alternative simple existence proof based
on the Riesz theorem and on the Banach fixed-point principle. This argument
will be used in Chapter 25 in order to prove the main theorem on Lipschitz
continuous strongly monotone nonlinear operators and to prove the main
theorem on quadratic variational inequalities.
EXISTENCE PROOF VIA THE BANACH FIXED-POINT THEOREM. Let a: X x X ... K
be bilinear, bounded, and strongly positive on the H-space X over IK == lA, C,
I.e.,
la(u, v)1 s dllullllt'lI,
Rea(u,u)clluIl2 forall u,veX.
We want to prove that, for each given b e X., the equation
a(u, v) == b(v) for all veX
has a unique solution u eX.
(I) Equivalent operator equation via the Riesz theorem. It follows from
la(u, (')1 S dlluflllt'li for all u, v E X that
(58)
v'-' a(u. v)
is a linear continuous functional on X. By the Riesz theorem, there exists
a unique element in X denoted by Au such that
a(u, v) = (Aulv) for all u, f' e X.
Moreover, JlAull S dllull. For all u, v, w e X and 2, {J e IK, we obtain
(A(au + flw)lv) == aa(u, v) + (Ja(K', v)
= (tiAu + PAwl v).
This implies A(!Xu + fJw) == aAu + PAw, i.e., the operator A: X -+ X is
antilinear.
Again by the Riesz theorem, there is an f E X such that
b(v) = (flv) for all veX.
342
22. Hilbert Space Methods and Linear Elliptic Differential Equations
Hence the original equation (58) is equivalent to the antilinear operator
equation
Au = .f, II eX. ( 58. )
(II) The equivalent fixed-point problem. Equation (58.) is equivalent to the
fixed-point equation
II = II - I (A u - J'), u eX, (59)
for fixed t > O. This is the decisive Irick of the proof.
(III) The Banach fixed-point theorem. For all u e X,
1(/ - lA)ull 2 = 111111 2 + t 2 11AIl1l2 - 21 Re(Aulu)
< tltllI2( 1 + t 2 d 2 - 2ct).
We no\\' choose t > 0 in such a way that
k 2 = I + t 2 d 2 - 2ct < 1.
J."orexample.let t = cld 2 . Then k 2 = 1 - c 2 jd 2 < I. Note that 0 < c d.
Thus.. the operator u II - t(Au - .f) is a k-contraction on X. By the
Banach fixed-point theorem (Theorem 1.A), equation (59) has a unique
solution. 0
22.5a. The Abstract Green Operator
Definition 22.19. Let Y be a real B-space. The operator K: Y -+ Y. is called
anlJle-houuded iff K is linear and monotone. and there is a constant j' > 0 such
that
1 < K II, (' > - < K t', II> 1 2 i' < K u, u> < K r, f>
for a II u..reX.
Let Y be a real H-space. The operator K: Y -+ Y is called angle-bounded
iff K is linear and monotone.. and there is a constant }' 0 such that
t(Klllr) - (KI'lu)l2 < j'(Kulu)(Krll')
for all u, rEX.
I n particular, each linear. monotone. and symmetric operator K: Y -+ }" is
angle-bounded. As we will show in Chapter 28. angle-bounded operators play
an important role in the theory of nonlinear Hammerstein equations. In
particular. in connection with semilinear elliptic boundary value problems..
the following result will be used in Chapter 28.
We set
b(r) = Crlr)y
and consider again the equation
for all (' E X
U(II..l') = b(l')
for fixed u E X and all t' e X
(60)
22.S. Main 'rhcnrcll1 un tinardr Sirongly Monolune ()perutors
343
togc:thcr with its dual equation
U{LJ.ll*) ::; h(t,)
for fixed u* E X and all l' eX.
(60. )
We nlake the following assumptions_
(1-11 -rhe m"p tl: X x X -+ R is biJinar. bounded, and strongly positive on
the real H-sp,.cc X.
(.t2 I.c Y he a real H-space such thaI the clnbcdding X S; Y is continuous.
,s In Sectiun 22.1 b we obt.ain that h E .,r. for given.r e Y. By Thc()rem 22.C,
equation (60) (rcsp. (60.») h,as u unique solutiun u fresp. u.). We set
II := Kf
(rsp. u. = 'I ), clnd we call/\: Y -+ Y the clh.4)/rf,,'t Grl.'(!I, ()JH!rI('r or also the
solution operator to equation (6()).
Corollary 22.10. A ....'tUIPlt. (I I). (112). 'f/'t'II:
(a) 'flit' .oIUli()1I 0pl'rtltor 1\: Y -+ Y is Ii,Jtur. ('ollti'UII)US. mOll()IOlle. ,,,,,1 unillc-
hotu.tlt'tlo
(h) /J' ll't' ('",h4.,thlillfJ X Y is ,'O'''PC,('t. then K: Y -t Y ;, c:umpact.
(c) If X ; tiC/"t' ill Y. tlu'n 1\ is slr;(.,ly "'Clnotu"e.
(d) '1'1,(, "cljo;"t 0p('rClI(,r 1\.: Y -. Y t"o;"fiJ(,s with ,he ."tolut;()n Opt.rulor L oj.
tll". Jucd t:&/,u'tiolJ (60.).
I)I()()I'. ,\J(aJ. As in the prt)of ofCoroUary 22.4. we obtain that 1<.: Y -. Y is
lincCJr, c()Jltinuous. and InOJ1otoDC. II follows frolD
t"(U,lt)J S dltu" lIoH,
tl(". u) 11"11 2 for all u, I' e X
(hat
1( 1'1 J\lJ)y - (,II I\.f>r 'Z = Itl( K.f, K,/) - t,(Kg, K.f)1 1
s 4J 1 I1Kfn 2 I1KIJI, z s 4t/ 2 ('-za(Kf, Kf)a(Kg, K!/)
lie 4J2(.-1(j O I Kr)r(OfKII), for all h g E Y.
Hence K: Y -+ Y is angle-bounded.
Ad(b). ('onlparc the proof of Corollary 22.4.
Ad(c). l.ct h(.,) == tftf')r for aU veX. Ir X is dense in Y. thcn J' t- 0 implies
h 0 ,and hence K.f O since (60) hus a unique solution. Thus. Kf = 0 implies
.J" = O. Now suppose th.,t
(gIKII)r == O.
l"his ilnplics (1/1 K I/)y = u(Kg, Kg) 0 and hence Kg = O. Consequently,
!J =:. O. 'rhjs nleans that K is st riclly monotone.
344
22. Hilbert Space Methods and Linear Elliptic Differential Equations
Ad(d). For all J, Y E Y.
Cfl Kg»)' = a(Kg, Lf) = (gl Lf))'.
Hence K* = L.
o
22.5b. The Duality Trick and Refined Error Estimates
We want to improve substantially the a priori error estimates for the Galerkin
method in Theorem 22.C. To this end, we set
h(v) = (fir))'
for all (' EX.
and we consider the equation
a(u. (') = h(l')
for fixed u e X and all VEX
(61 )
together with its dual equation
a(l'. u*) = b(t.)
for fixed u* E X and all l' EX.
(61*)
In addition. we consider the Galerkin equation
a(u".to) = b(t')
for fixed u" e X" and all 1,' e X".
(62)
where n = 1. 2. . . . . We make the following assumptions.
(H t) X. Y. and Z are real H-spaces, where the embedding X S; Y is continu-
ous. Let X" be a finite-dimensional subspace of X.
(H2) The map a: X x X -+ R is bilinear, bounded, and strongly positive.
(H3) Approximation. There exists a constant " such that
distx(z.X,,) < rl"lIzll z
for all z e X n Z.
(H4) A priori estimates. There are constants C and C. such that
Uull z CII/II)',
lIu.ll z C.Ufll)'
for all fEY and all the corresponding solutions u (resp. u*) of equation
(61 ) (resp. (61.».
It follows from Theorem 22.C that for fixed Ie Y, the equations (61). (61.),
(62) have unique solutions u, u*, u", respectively.
Corollary 22.21. Assume (H I) through (H4). Then we obtain the following a
priori error estimates for the Galerkin method:
1111 - u"lI x "(C-I dC)lIflir
for all fe Y.
(63)
In the .pace y, \\'t' get the following stronger a priori error estimates:
lIu - u"lI r rl;(c- ' d 2 CC.)lIfll)'
for all Ie Y.
(64)
22.6. Application to Boundary Value Problems
345
The point is that the factor 2! appears in (64) in contrast to (63). This is
responsible for the quadratic convergence of the Galerkin method in Y.
PROOF. Let II-II denote the norm on X. By Section 22.5a,
u=K/ and u.=K.r
(I) Subtracting (62) from (61) we get the first key relation:
a(u - U"tv) = 0
for all t- EX".
Hence, for all v e XII'
ellu - u,,112 S a(u - u", U - u,,) = a(u - u", u - v)
S d lIu - u"lIlIu - vII.
This implies
ellu - u,,11 S d.distx(u, XII) S «"dllullz S tI"dCIi/lir.
This is (63).
(II) The duality trick. For all 9 e Yand all veX", we obtain the second key
rei a tion:
(u - u"lg)y = a(u - u", K.g) = a(u - u", K*g - v)
s dllu - u..IIIIK.g - vII.
Hence
(u - u"lg)r S d lIu - u"fI. distx(K.g, X,,)
S (tllld 2 c- I ell/II y )(tI" c.Ug II y)
for all / e }-: This implies (64).
o
The duality trick above is due to Aubin (t 967) and Nitsche (1970).
22.6. Application to Boundary Value Problems
We consider the first boundary value problem
N
-l\u + L aiD.u + aou = / on G,
1= I
u = 0 on aG
(65)
together with the dual problem
N
- l\u. - L Di(afu*) + aou* = / on G,
'=-1
u. = 0 on aG.
(65.)
346
22. Ifilbert Space Methods and linear Elliptic Differential Equations
We make the following assumptions.
-
(H) G is a bounded region in AN, N > I. and a j E C(G) for all j. Let
X = "",21 (G).
We set
a(u.l') = f ( .f DjuDjL' + (ajDju)t, + aout, ) dx,
G .=1
h(r') = f fi'dx.
(j
Definition 22.22. Assume (H). The generalized problenl to (65)(resp. (65.)) reads
as follows. For given .re L 2 (G) we seek u E X (resp. u. E X) such that
a(u. (') = b(t,) for all rEX (66)
( resp.
a(l'..ll*) = b(l')
for all I' EX).
( 66. )
Formally. \\'e obtain (66) from (65) by multiplying (65) with f e C(G) and
by using subsequent integration by parts. Similarly. we obtain (66.) from (65.).
Note that in contrast to the classical problem (65.). we do not need the
differentiability of a j in (66.).
Proposition 22.23. ASSll"te (H). Let Co he tlte constant ill tlte Poincare-
Friedrichs ;lIeqllality (23) and stlppose that
II + it < Co I( I + co),
(I. it > O.
\\'''ere min. t c G uotx) > - it "lid
ma luJ(x)1 (J.
xeG
j= 1.....N.
Tltell all t lie assert ions of Theoreln 22.C in Sect;o'J 22.5 are l,alid .ror the t,vo
generali:ed problems (66) a"d (66.). In particular, (66) and (66.) Itave unique
solut iOlts.
PR()OI- .
( I) By ( 1 b). bE..\'..
(II) We investigate a: X x X -+ R. By the Holder inequality, for all 11. rEX.
1(I(II,r)l < L (IDjUDj(:1 + PHD;U)t'l)dX + maxlaol L lurldx
< L II D i 1111 2 11 D i t'11 2 + P II D i u 11 2 11 t'1I 2
i
+ maxia o lllll1l 2 11rll 2 S; constllllil 1.21!t'U 1.2.
Hence, at . . . ) is bounded and bilinear.
22.7. Compact Perturbations of Strongly Monotone Operators
347
To prove the strong positivity of a(., .). we use the Poincare-Friedrichs
inequality. By (24), we obtain for the principal part of a(u. t):
f L(D/u)2dxcllullt2 forall ueX,
JG i
where c = co/(1 + co). Clearly,
t Dou 2 d:< -}' t u 2 dx -;'/lun2 forall ue X.
By the Holder inequality,
L f o/(D,u)udx sPilullt2 forall ueX.
i JG
Putting these three inequalities together, we get
a(u,u) (c -)' - P)llufl.2 for all u EX.
i.e., a( ., · ) is strongly positive.
Thus. all the assumptions of Theorem 22.C are fulfilled.
o
The results of the next section will be used in Section 22.10 in order to
consider the first boundary value problem (65) in the case where the coeffi-
cients la}1 are arbitrarily large. Then the unique solvability of (65) can be
violated and we have to apply Fredholm alternatives.
22.7. Compact Perturbations of Strongly Monotone
Operators, Fredholm Alternatives, and
the Galerkin Method
We consider the operator equation
Au + Cu = b,
together with the dual equation
A.u. + C.u. = 0,
UE X,
(67)
u. e X.
(67.)
The Galerkin equations to (67) have the following form:
(All.. + Cu.., to) = (b, t') for fixed u.. E X.. and all t' EX".
Explicitly, that means
(Au" + CUll' W}) = (b, "i), j = I,.... n. (68)
We seek the unknown real coefficients c J .. of the Galerkin approximation
II" = C I " ",' I + ... + C ".. "'" .
348
22. Hilbert Space Methods and Linear Elliptic Differential Equations
Theorem 22.0. Suppose that the operators A, C: X -. X. are linear alld con-
tinuous all the real H-space X. Furthermore, suppose that A ;s strongly mono-
tOile, and C is compact. Then:
(a) Uniqueness implies existence. If the original equation (67) ,\'itll h = 0 has
ollly the tril'ia/ solution 11=0, then for each bE X., equation (67) has a
tll.ique solution u. and
Ilull < constUhll.
(b) Fredholm alternative. for .fixed b E X. 'I equation (67) has a solution iff
(b,u.) = 0
for all .oliltiorts u. of (67.).
(c) Index equals zero. Equation (67) M'ith b = 0 and equation (61.) have the
.am(' finite nunlher of linearly independent solutions.
(d) Convergence of the Galerkin method. Suppose that {\\'" M'2" . .} is a
basis in the infinite-dimensional separable H-space X, and set X" =
span { "'I ' . . . , \\'" }. M oreot'er, suppose that for fixed b EX., equal ion (67)
lias a uniqtle solution u,
1"hen tl.ere is a number no(b) such that, for each n > no(b), the Galerkin
equation (68) lIas a unique solution, and
U" -+ u in X
as n -+ ''X.
This theorem tells us that A + C is a Fredholm operator of index zero.
Fredholm alternatives for nonlinear operators will be considered in Chapter
29.
PROOF. Using the Identification Principle 21.18, we set X = X.. Now, the
assertions follow immediately from Theorem 21.F (Riesz-Schauder theory)
and Theorem 21.G(d). Note that A: X -+ X. is bijective. by Theorem 22.C.
o
The following example shows that Theorem 22.0 generalizes a well-known
result for linear eq ua tions in R N .
EXAMPLE 22.24. Let B: HI\' -.. R N be a linear operator. i.e., B is an N x N-
matrix. and B. is the adjoint matrix. Then the equation
Bu = b, u E R N ,
has a solution iff (blu.) = 0 for all solutions u. of the dual equation B. u. = O.
PRC)()F. Use Theorem 22.0 with X = R N and A = /. C = B-1, Note that the
continuous operator C is compact since dim X < 00. 0
In what follows we will apply Theorem 22.0 to integral equations, bilinear
forms. and boundary value problems.
22.8. Application to Intearal Equations
349
22.8. Application to Integral Equations
We consider the Fredholm integral equation of the second kind
u(x) - L k(x, y)u(y) dy = b(x), x E G, (69)
together with the adjoint equation
u.(x) - L k(y, x)u.(y) dy = 0,
x e G.
( 69*)
The Galerkin equations for u" = C."W I + ... + C""W" are given by
i u"wJdx - i k(x, y)u,,(y)WJ(x) dy dx = r bwJdx, (70)
G GxG JG
where j = I, . . . , n.
Proposition 22.25. Suppose that:
(i) G is a nonempty measurable set in R N , N > I (e.g., G is open). Let X = L:z(G).
(ii) We set X" = span {WI'...' w,,}, where {WI' W z ,... } is a basis in X.
(iii) The function k: G x G ..... R is measurable (e.g., continuous) and square-
integrable, i.e., JG xG k(x, y)2 dx dy < 00.
Then:
(a) For fixed b e X, equation (69) has a solution u e X iff
L bu. dx = 0
for all solutions u* e X of the dual equation (69*).
(b) Equation (69) with b(x) == 0 and equation (69.) have the same finite number
of linearly independent solutions in X.
(c) Suppose that equation (69) with b(x) == 0 has only the trivial solution u = 0
in X. Then. for each b e X, equation (69) has a unique solution u in X.
Furthermore, for each fixed b e X, there is a number no(b) such that, for
each n no(b), the Galerkin equation (70) has a unique solution u" eX",
and
u" ..... u in X
as n..... 00.
PROOF. We set
(Ku)(x) = t k(x,y)u(y)dy, x E G.
By Problem 21.11, the operator K: X ..... X is linear and compact. The adjoint
350
22. Hilbert Space Methods and Linear El1iptic Differnlial uati()DI
operator K.: X -+ X is liven by
(K*u)(x) = t k(y. x)u(y) dy.
Then the assertions follow from Theorem 22.D by letting X = X.. A == 1, and
C = -K. 0
22.9. Application to Bilinear Forms
With a view to elliptic boundary value problems we consider the equation
a(u.v)+c(u,v)=b(p) for fixed ueX andalJ I)EX, (11)
together with the dual equation
a(v, u., + c(v, u.) := 0 for fixed u. E X and all veX. (71.)
Th Galerkin equations for u. = ,. ,."'. + ... + '."" WIt are given by
a(w., "J) + (u.t Wj) ;:; b( ), j == 1...., n.
(72)
Proposilioa 11.26. Supposs thai:
Ii) X;s a real separable H-slJQ(:e. Set X. = span{ WI'...' w,,}. where {WI. W2..... }
is tJ basis in X.
(ii) The maps u, c: X x X R are bilinear (,,,d bounded. MOreOI'4!', u is
strongly PO$ilive, c is COIfIptJ£f, and b e X*.
'fhen:
(a) Suppose that equal;oll (71) wit" b == 0 has Q"/}' ,he Ir;I);,,1 so/r";IJlI " = o.
The'J, for each b eX., t4111d;On (71) has (J unique ,wlutiolJ U, alad lIuU S
const Ubll.
Lei dim X = 00. Then, .fnr each b E X., there is a number lIo(b) .')1l(;1J thut,
fiJr each n > "o(b).,he Citderkln eqUdI;on (72) lias" unique solution U. EX..
and
II. -+ .. in X
as n -. 00.
b) Equation (71) has tI solution iff
b(u.) = 0
for all solutions u. of (71-).
(c) Equatiull (71) witla b = 0 alld tI,e dual equation (71.) hav the sanre Jlnite
number of linearly indendellt solutions.
PROOF. 8y (la), there exist linear continuous operator A. C: X -. X. such that
(Au, v> := a(u, vlt <C v) = c(u. v) for all u, IJ eX.
22.10. Appl;calion to Boundary Value Prohlems
351
Here, A is strongly monotone, and C is compact. Moreover, since X.. = X,
(II.. A* v) d(rl.. v), (14, C*v) = c(u,v) for all u, VEX.
Thus. (71) and (71*) is equivalent to (67) and (61.), respectively. Hence,
Thc()rem 22.0 yields the as.c;crtions. · 0
22.10. Application to Boundary Value Problems
We consider the first boundary value problem
N
.-Au + L a,Dju + ao., = f on G,
;; I
u == 0 on aG
(73)
together with the dual problem
N
- u* - L D;(a1u*) + uou. = 0 on G,
;=1
u. = 0 on (JG.
(73.)
We set
1'(14,1') = Ii i D j I4D,vd,'C,
('hi, v) = L j (tljDju)v + oou.tdx. b(v) = t (vdx,
.
Le., a( ... . ) and 'c( . . ) correspond to the principal part and to the lower order
terms in (7 3). respectively.
Definition 22.27. Let G be a bounded region in R N , N J. Set X = Wi(G).
Suppose th.t Uj E C( G) for all j. Then the geMrlJlized prohlem to (73) (resp.
(73*) reads as follows" For given f e L 2 (G). we seek II E X (resp. u. e X) such
t h I t
a(u, I') + ('(fl.. .,) = b'v)
for all veX
(74)
(resp.
Ilh', u*) + C(l, u*) = 0
ror all De X).
(74*)
By the proof of Proposition 22.7, the map a: X x X ..... R is bilinear
bounded. and strongly positive. Below we shall prove that c: X x X -+ R is
lincar bounded, and compact. This corresponds to the following general
principle for elliptic differential equations:
u)\\'er ()rdr t('rm. yield ("ompactnes$.
352
22 Hilbert Space Methods and Linear Elliptic DilTerential Equations
This observation also remains true for quasi-linear elliptic differential equa-
tions. This will be the key to the theory of pseudomonotone operators in
Chapter 27.
Proposition 22.28. All the assertions of Proposition 22.26 are true for the
generalized problen,s (74) and (74.). In particular, for f e L 2 (G). the original
problen, (73) has a generalized solution u e X iff
LfU*dX = 0
for all generalized Sohltions u. E X of the dual problem (73.).
PROOF. By (I b). bE X..
As in the proof of Proposition 22.23, we obtain that c: X x X R is linear
and bounded. To prove the compactness of c(.. .), assume
II" II
and
.. t. in X
"
as n -+ 00.
Since the embedding X c L 2 (G) is compact. we get
14" --t II
and
l'" .... l' in L 2 (G)
as n -+ oc.
This implies
a;l." -+ aiL' in L 2 (G)
as n -+ 0Cj.
By Problem 2 t .4. it follows from U" u in X that
Diu" D;u in L 2 (G)
as n -+ 00.
and hence
(D,u"la,v")2 (D,ula i v)2
as n -+ 00,
where (.1. )2 denotes the scalar product in L 2 (G). Therefore,
c(u", [',,) c(u. v)
as n -+ 00,
i.e.. c( ... . ) is compact.
Consequently. all the assumptions of Proposition 22.26 are satisfied. 0
22.11. Eigenvalue Problems and the Ritz Method
We want to study the eigenvalue problem
Bu = ).u.
(75)
Here, we seek i.. E Rand u E X with u #: O. Along with (75) we consider the
Ritz equation
(Bu I "'j") = )..(ul w j ,,),
. 1 '
J= .....n,
(76)
22.11. Eigenvalue Problems and the Ritz Method
353
for A. E Rand u = c I" WI" + · .. + c"',, W"',, with u O. The Ritz equation (76)
represents an eigenvalue problem for a symmetric matrix.
We make the following assumptions.
(HI) X is a real separable H-space, X :I: to}.
(H2) The operator B: X ..... X is linear, symmetric, compact. and strictly
monotone.
(H3) (X,,) is a Galerkin scheme in X. Let X" = span {w.",.. ., w".,,}.
22.11a. The Variational Approach
In order to solve the eigenvalue problem (75), we consider the variational
problem
max(Bulu) == ).,
(ulu) == I,
u eX.
(77a)
The solution is denoted by )"1 and u.. For m == 2, 3, ..., we now determine
successively (A"" u",) by solving the variational problems
max(Bulu) == A,
(u,lu) = 0,
(ulu) == 1,
u eX.
(77b)
i==I....,m-l.
In order to characterize directly the mth eigenvvalue A",. we will use the
Courant maximum-minimum principle
)"", == max min (Bulu).
M.!t... II' AI
(78)
Here, 9'.. is the class of all the sets S n L, where S is the surface of the unit
ball in X, and L is an arbitrary m-dimensionallinear subspace of X. This
principle was used by Fischer (1905). Weyl (1911), and Courant (1920).
In 1911, Hermann Weyl used this principle in order to solve the famous
problem on the asymptotic distribution of the eigenvalues of the Laplace
equation.
Theorem 22.E. Assume (H I). (H2). Then:
(i) Complete orthonormal system. The operator B has a complete ortho-
normal system of eigenvectors in X. The eigenvalues of 8 are positive, and
they have finite multiplicity. The eigenvectors related to two different
eigenvalues are orthogonal to each other.
(ii) Construction of the eigensolutions. For m = I, 2. ..., the variational
problems (77) hal'e successively the solutions ()""" u..), which are eigen-
solutions of B. The set {UI' "2'...} forms a complete orthonormal system
of eigenvectors to B. The set {A. I' A.2'...} contains all the eigenl'alues of B
counted according to their multiplicity. Furthermore,
II A. 2 ... > o.
If dim X == OCt then l.. ..... 0 as m ..... 00.
354
22 Hilbert Space Methods and Linear Elliptic Differential Equations
(iii) The Courant maximum-minimum principle. For nJ = 19 2, . .. , tlte eigelt.
lallie i.". is ohtained through (78). MoreoL'er, u'" is solution r(78).
The convergence of the Ritz method will be proved in Section 22.11 c.
The Ljusternik Schnirelman theory generalizes the Courant maximum-
minimum principle to nonlinear eigenvalue problems. This will be studied in
detail in Part III.
PR(X>f. AdO), OJ). If Bit = O. then (Bull') = 0 and hence II = O. because B is
strictly monotone. ConsequentlY9 assertions (i) and (ii) are special cases of the
Hilbert -Schmidt theory considered in Section t 9.6.
Adfiii). We set
.r:( = max min (Bulu).
.\, ( :/'"' u t= ."
let J\1 E Y;". Since dim 1 = nJ, there exists a II E M with
(Illll) = 1
and
(uilu) -= 0
for i = t,.... "' - 1.
where we set "0 = O. From (77) it follows that
(Blllu) < i.".'1
and hence .,:1 < i. m .
In order to obtain.tY > i''''9 set Mo = S n span{I'19OO' 'Ill",}. Then Mo E !I.
If II E Mo. then II = h,lI, + 00' + b",u", and
(Bul u) = i. 1 h; + ... + i.",h;.
> i.".(hf + . .. + h;.) = i.....
Noting (Bu",lll m ) = i''''9 we get minuEMo(Bulu) = ;,,"', and hence..tt/ > i"",. 0
In order to solve the variational problem (77a), we used a simple special
argument in Section t 9.7. which is only applicable to linear problems. In the
following. we give an alternative existence proof which can be generalized to
nonlinear problems. In particular. we will use the following argument in
Section 43.4 in order to prove the existence of a bifurcation point for nonlinear
operator equations.
EXISTENCE PROl)F FOR THE LARGEST EIGENVALUE VIA MAXIMUM TRICK. We
study the maximum problem
max(Bulu) = ;"9
(ulu) = 19
U E X.
(79)
Assume (H I). (H2). The trick is to consider first the modified maximum
problem
max (Bulu) = i.,
U K
(80)
where K = {u EX: 111111 < I}. and then to show that the solution u of (80)
satisfies 1I E l'K. i.e.. 1I is also a solution of (79).
22.11. Eigenvalue Problems and the Ritz Method
355
(I) Existence for (80). Set 2 = SUPII." (Bul"), Then there is a sequence (u,,) in
K such that
(Bu" I u..) ....
as n.... 00.
Since (u,,) is a bounded sequence in the H-space X, there exists a sub-
sequence, again denoted by (u..), such that
u" u
as n -+ 00.
Since B is compact. (Bu"lu,,) .... (Bulu) as n -+ 00, by Corollary 21.33. Hence
(Bulu) = , i.e., u is a solution of (80).
(II) We show that lIuli = J. Otherwise, we have lIuli < I. Then, for each t' eX,
the real function
cp(t) = (B(u + tv)lu + tt.), t E R,
has a local maximum at t = 0, i.e., cp'(O) = O. Hence
(BulL') = 0
for all t' e X.
This implies 2 = (Bulu) = O. However, since B is strictly monotone, we
obtain (Bvl v) > 0 for all t. #:. O. Hence > O. This is a contradiction. 0
22.11 b. A Comparison Principle for Eigenvalues
The following important results are easy consequences of the Courant
maximum-minimum principle.
Corollary 22.29. Suppose that the operators B, C: X -+ X are linear, symmetric,
compact, and strictly monotone on the real separable "-space X #:. {OJ. Then:
(a) The operator B (resp. C) has the eigent'alues
o < ... ;.2 )..
(resp.
o < ... 112 III ).
counted according to their multiplicity.
(b) If B S C, i.e., (Bulu) S (Culu) for all u E X, then ).'" S Il", for all m.
(c) I... - 1l...1 s II B - ell for all m.
PROOF. Ad(a). Cf. Theorem 22.E.
Ad(b). This follows immediately from the Courant maximum-minimum
principle (78).
Ad(c). By (78),
).... = max min (Bulu),
...!I'M liE M
P... = max min (Culu).
...!I'... IIEM
356
22. Hilbert Space Methods and near Elliptic DilJerential Equationl
Set B = C + D. Note that (Dulu) -IIDuliliuli > - nOn lIuU 2 , by the Schwarz
inequality. Hence
min (Bulu) nlin (Culu) + min (Dulu)
".", iteM uM
min (Culu) - UDII.
liE M
This impties
l. > I'", - II B - ell,
and hence IA", > Am -- ne - Bllt Le., 1,1.. - Ji.1 S liB -- CII.
o
22.11 c. The Convergence of the Ritz Method
ut P,,: X -of XII be the orthogonal projection operator from X onto }(". Then
the Ritz equation (76) is equivalent to the equation
p.. Bu = Au,
A e R, u eXit, n == I, 2, . . . .
(81)
Corollary 21.30. Au"le (H I) lhrol4f1h (.13) of 1.lreoreltl 22.E. ul dim X = 00.
Theil;
(a) Ritz equation. The Ritz eige"t'alue problem (81) IfiJS the e;ge'lvalues
AI. A 211 > . . . > All',. > 0
counted at.'cording to their multiplicity. Moreol)er:, equafion (81) Itas a com-
plelt# orth(Jt,orlnal system (if eiyetltJector, ;11 XII. For all '" It,
A..,. $ l.,
(82}
where! A.. is the mth eigenl.1fIlue of ,he original equation Bu = AU_
(ht Error estimates for the eigenvalues and convergence of the eigenvalues.
f'or {ll m < lit
IA.,. - A.I < 118 - P"HU,
(83)
,,'here liB - P"BII -+ 0 a. '1-+ 00. Hence tIle sequetlce (1...,.) tlJ' the approxi-
Intlte eigelJvalues cOllverge. tv AIN us n -+ 00.
If Xl C X 2 s; '.., then (A...) is monotonically incretuing as n... 00.
(cJ Error estimates for the eigenvectors. Let UIfIII be all eigenvector of the Ritz
prlJblem (81) ("tol're.'ifN)ntling 10 rhe eigenvalue A"",. Then there is an eigen-
veclur u uf ihe original problenl Bu -= A.",U su,-h that
Itu... - 14 C",1f B - P.BIj,
where the constallt C. i... il.dependent of n.
(84)
In the following proof we will essentially muke use of the Couran't maximum-
minimuln principle.
PJU)(}.... Ad{a). It follows from 1'11* = P" that (P"Bulr;) = (ButP"p) and hence
(PABulv' = (Bulv) for all u, v E XII_
22.12. Application to Bilinear Forms
357
Therefore. the operator P"B: XII ..... XII is linear, symmetric, compact, and
strictly monotone. We now apply Theorem 22.E to the operator P"B. In
particular, we obtain
)."'" = max min (P.Bulu)
M . oV.... M t;, x.. ... M
S max min (Bulu) = A.",.
M@oV... ...M
Moreover, if X I C X 2 s; "., then ().",,,) is monotonically increasing as n -+ 00.
Ad(b). By Lemma 21.56, liB - PIIBII -+ 0 as n -+ oc, since B is compact.
Relation (83) (resp. (84» follows from Corollary 22.29 (resp. Problem 22.5a).
o
Remark 22.31 (Upper Bounds for the Eigenvalues). By (82), we obtain a lower
bound for the eigenvalue )"", of the original problem Bu = ;'u. There exists a
number of methods in the literature which provide upper bounds for )."'.
For example, we mention the methods of Bazley-Fox, Krylov-Bogoljubov,
Knauer, Temple, Weinberger, and Weinstein-Aronszajn. Unfortunately,
there is no universal method equally well suited in any situation. For example,
the stability of mechanical systems in engineering depends substantially on
the behavior of appropriate eigenvalues. Moreover, the energies of quantum
systems (e.g., atoms, molecules, solid states, semiconductors) correspond to
eigenvalues of the.SchrOdinger equation. Therefore, engineers and physicists
are strongly interested in obtaining sharp two-sided error estimates for eigen-
values. In this connection we recommend Collatz (1963, M), Fichera (1965,
L), (1978, M), Gould (1966, M), Weinstein and Stenger (1972, M), Weinberger
(1974, L), Thirring (1983, M), Vol. 3 (quantum theory), Velte (1984, S) and
Kuttler and Sigilito (1987, M) (cf. also Problem 22.5b).
In the following we describe a basic idea for obtaining upper bounds for
;.."'. Suppose we construct a sequence (B) of operators B): X -+ X which are
linear, symmetric, and compact such that
B S .. · S BI S Bo.
Let ).) denote the mth eigenvalue of B). Then it follows from Corollary 22.29
and (82) that
)."'" S A.", s ),,
for all m,
and
Il", - AI S liB - Bjll
for all m and j.
22.12. Application to Bilinear Forms
With a view to elliptic boundary value problems, we now reformulate the
results of the preceding section in terms of bilinear forms. To this end, we
358
2 Hilbert Space Methods and linear Elliptic Differential Equations
consider the equation
;.a(ll" l') = b(u, r)
for all rEX.
(85)
We seek i. E R and II E X with u :F O.
The corresponding Ritz equation reads 8S follows:
i.el(".,,),,) = h(u'\\Jn)"
. 1 '
.I = "... . 11 .
(86)
Here. we seek i. E IR and II = (' I" ''"I" + ... + C"-,, "'"" with u ¥- O. This is an
eigenvalue problem for a symmetric matrix.
We make the following assumptions.
(H 1) X is a real separable H-space with the scalar product ('1' ).t. where
X # :O.
(H2) The bilinear forms a" h: X x X -+ R are bounded and sy,nmetric. More-
over. a('.') is strongly positive. and b(.",) is compact and strictly
positive.
(H3) (XII) is a Galerkin scheme in X and dim X = oc. Let X" = span {"'JII'
. . . . ''"" " : .
By (I a). problem (85) is equivalent to the operator equation
i.Au = B, ll.
II E X"
(87)
where
a(".,') = (Au. r)
and
h(u, l.) = < B. u, l')
for all". rEX. Here. the operators A" B. : X -+ X. are linear, continuous, and
symmetric. Moreover, A is strongly monotone, and B 1 is compact and strictly
monotonc.
In order to reduce problems (85) and (87) to the situation considered in
Section 22.11" we set
(Ill (') = " ( II" r)
for all u" rEX.
Then. ( -I') is an equivalent scalar product on X called the energetic scalar
product. By (1 a) and the Identification Principle 21. t 8" the original problem
(85) and the operator equation (87) are equivalent to the equation
iJI = 811..
II E X.
(88)
where
h(u, r) = (Bulv)
for all u" l' EX.
The operator B: X -+ X is linear, symmetric, compact, and strictly monotone.
('onsequently" all the results of Section 22.11 can be applied to equation (88)
and hence to the equivalent problems (85) and (87). However.. in order to
obtain the classical formulations for elliptic boundary value problems in the
next section. we replace the variational problems and the Courant maximum--
minimum principle by equivalent expressions. To this end, we first consider
the minimum problem
min a(u" u) = A. -I, h(u, u) = 1" u E X" (89a)
22.12. Application to Bilinear Forms
359
with the solution u. . For m = 2, 3, ..., we then consider successively the
minimum problems
mina(u,u) = ).-1,
b(u,u) = I,
U EX,
(89b)
b(u f , u) = 0,
i= l,oo.,m-1.
The Courant maximum-minimum principle is replaced by the following
minimum -maximum principle:
).;1 = min max a(u, u). (90)
MeY... lIeM
Set S = {u EX: b(rl, u) = I}. Then .!l'", denotes the class of all the sets S " L,
where L is an arbitrary m-dimensionallinear subspace of X.
Proposition 22.32. Assume (H I) through (H3). Then:
(i) Complete orthonormal system. For m = 1,2,..., the variational problems
(89) possess srlccessit'ely the solutions ( A."" u",). which are eigensolutions of
the original problem (85). This way, we obtain all the eigenvalues
).1 A2 > .00 > 0
of (85) counted according to their multiplicity, and ).'" -+ 0 as m -+ oc. Each
eigent'alue ).'" has finite multiplicity.
The eigenvectors U l' U 2' ... form a complete orthogonal system in X
with respect to the energetic scalar product (ulv) = a(u, v). Furthermore,
b(u f , u j ) = lJ ij for all i. j.
and for each u eX, the series
u = L b(u, u,)u,
,:: .
converges in X.
(ii) The Courant minimum-maximum principle. For m = I, 2. ..., the mth
eigent'alue A. m can be characterized b}' (90).
(iii) Convergence of the Ritz method. The Ritz equation (86) has the eigen-
t'alues
).." > ).211 ... )."." > 0
counted according to their multiplicity. For each m,
Ii m )."'" = )."',
" .. 00
and l",,, S )..'" for all m S n. If XIS; X 2 c ..., then
A...", S A",. 1ft +. S A..... '" + 2 S . .. S A.", for a II m.
For fixed m and n = m, m + I, ..., let (u MII , A.",,,) be an eigensolution of
the Ritz equation (86) normalized by lIu",,,1I = 1. Then there exist eigen-
vectors VI' V2' ... of the original problem (85), corresponding to the
360
22. Hilbert Space Methods and linear Elliptic DilTerential Equations
eigellt'alue ;.."', srlch that
lim tI u"'" - t'" U x = O.
II-'X
PR()()F. This follows from Theorem 22.E in Section 22.11 and from Corollary
22.30. Note the following. By Theorem 22.E,
A,(u,lu j ) = ij for all i,j.
Since h(u;..ll j ) = ;.;ahl i , u j ) = ;"i(uduj), we obtain the additional orthogonality
relation
b(u" u j ) = hi) for all i, j.
Moreover, by Theorem 22.E, the eigensolutions (u"" Am) are obtained from the
maximum problem
h(u,u) .
max - - - = i...
uo (ulu)
(u;lu) = 0,
i= 1,...,m-1.
Since b(u" II) = ;.;(II;1u) by (85), and since ;..; '# 0, we obtain the equivalent
minimum problem (89).
By Theorem 22.E, for each u E X, there exists a convergent series
x'
U = L jllj'
j==1
From b(II.,Uj) = l5, j we get. = b(u,u.).
o
In order to obtain strong error estimates for the Ritz method, we make the
following assumptions.
(A I) X, y, and Z are real H-spaces. The embedding X S; Y is continuous.
(A2) Assume (H I) through (H3) above. In particular, suppose that there are
positive constants c and d such that
c II u II i < a( u, u),
I a (II.. to) I d II u II x II I' If x
In addition.. suppose that
b(u, to) = (ult')).
for all u, v EX.
for a II ll, t EX.
(A3) Approx;mat;ol1. For each n E N, there is a constant 2" such that
distx(z, X,,) S ex"lIzll z
for all z E X (\ Z.
(A4) A prior; estilnale. There is a constant C such that u E Z and
lIuliz < CII/llr
for all .I e Y and all solutions u E X of the equation
a(u, t') = (fh')r
for all v E Y.
22.13. Application to Boundary - Eigenvalue Problems
361
Corollary 22.33 (Strong Error Estimates for the Ritz Method). Assume (A I)
through (A4) and use the notation of Proposition 22.32(iii). Then, for all m S n,
M'e obtain the error estimates
I A",,, - )..'" I s (c -I d 2 C 2 ),
II u"'" - v,,11 r S : D""
,,-here the constant D", depends on m, but not on n.
Since : appears in the error estimates, we speak of quadratic convergence
of the Ritz method.
Applications of this result to the finite-element method will be considered
in Section 22.13.
PROOF.
(I) Solution operators. Let K: y.-. Y be the solution operator of the equation
in (A4), i.e., we set u = Kf Further let K,,: Y ....... Y be the solution operator
of the corresponding Ritz equation
a(u, t') = (flt')r for fixed u E X" and all l' eX".
Then the proof of Corollary 22.20 shows that the operators K, K,,: Y....... Y
are linear, symmetric, compact, and monotone. By Corollary 22.21,
IIKf - K..fllr S 2c-ld2C2I1fllr for all fe Y,
and hence ilK - K..ll r S 2c-ld2C2.
(II) Equivalent operator equation. The original equation (85) (resp. the Ritz
equation (86)) is equivalent to the operator equation
;.u = K u, U EX, ). E R
(resp.
lu = K"u,
u eX, A e R).
By the comparison principle in Section 22. J J b,
I;.",.. - ;.'" I SilK - K "II y.
By Problem 22.5a,
II u"'" - 1.',,11 r S C", II K - K" II r.
o
22.13. Application to Boundary-Eigenvalue
Problems
We consider the boundary-eigenvalue problem
- AU = u + f on G,
u = 0 on cG,
(91 )
362
Hilbert Space Methods and Linear Elliptic Differential Equations
where i. is a real number. The case r = 0 corresponds to an eigenvalue
problem. However. we also want to study the case where f :;: 0 in order
to explain the interplay between eigenvalue problems and Fredholm alter-
natives. We set
tI(lI. r) = r t DjuDjt'clx.
J (i i = 1
hili. r) = r Ill' clx.
JG
h.lr) = r ,ft. d.\:.
J (i
Definition 22.34. Let G be a bounded region in jR:\'. N > I. Set }( = Y21 (G). The
/ellerClli :ed prohl(1)' to (91 ) reads as follows. For given .r e L 2 (G). we seek u e X
such that
i.a(II.. r) = h(lI.. r) + hi (I')
for all rEX.
(92)
In the case "'here" = O. we seek ;. e IR and u e X with II =#; o.
Formally. we obtain t92t from t9J) by multiplying (91) with (' e C((i) and
h u.Ui. ."b"'1"t1' it1&.it"tlft tt It.fii,
Proposition 22.35. The yenerali:ecl prohle", (92) has the Jill/ou'ing properties:
(it Eigen\'alue problem. I r = O. i.e.. hi = O. then all tire assertions or
Proposition 22.32 are ralit/.for (92).
III particular. ther£' exist i,!finitely ,nal'Y e;genralues ;.. Each eige,u\ulllf?
Ira... a finite 1)ut!tiplicity. r "'e COII'U the eigell(\alues accordi"y to their
"ut!tiplicitr. tlren
i., > i. 2 > ... > o.
and i. m -+ 0 a "1 -. Y...
There exists a ('olJlplete orthonor"lal systen, (r e;genrectors : u...} ;11 tire
."pelce L 2 (G). and.for eacl, U E X (resp. II E L 2 (G)). Ille series
,
1I = L A:"t.
t-I
\"itlr , = Ie; UII, dx. C01u'('rges in X (resp. L 2 (G)).
(ii) lJnique solvability. ,f i. ;s not an eigenralue. thell. .It)r eClch fe L 2 (G).
equat ion (92) has a unique so/ut;on.
Oii) Fredholm alternative. Let ;. he an eige'Jralue. Then equat;o'J (92) has a
sol"t ion iff
Lfi1dX=O
lor ,,/I eiyenr(!ctors u corresponding to i..
PR()()I.. By ( I b). h. EX..
22.13. Application to Boundary- Eigenvalue Problems
363
(I) The proof of Proposition 22.7 shows that a: X x X -+ R is bilinear,
bounded, symmetric, and strongly positive. Furthermore, by the proof of
Proposition 22.28, the bilinear map b: X x X -+ R is compact. Therefore,
all the assumptions of Proposition 22.32 are fulfilled.
Note the following. By Proposition 22.32. {II",} is an orthonormal
system with respect to b('" .). i.e.. {u",} is an orthonormal system in L 2 (G).
Let (' eX. Then it follows from
b(uI' t') = t ult'dx = 0 for all i
that a(II.. t') = (u.1 t.) = 0 for all i. Since {u",} is complete in X, we get 1,' = O.
Moreover. X is dense in L 2 (G). Thus, {u",} is also complete in L 2 (G), by
Theorem 19.A(iv).
(II) The Fredholm alternatives follow from Proposition 22.26. 0
Remark 22.36 (The Rapidity of Convergence of the Method of Finite Elements
for Eigenvalue Problems). In order to explain the basic idea in a simple way.
\ve consider the boundary-eigenvalue problem
-II=U onG.
fP)
II = 0 on cG,
where G is a bounded region in R 2 with cG e C-r. Then the corresponding
generalized problem has eigensolutions (u" i.) in W 2 ' (G) x IR with the eigen-
values
;- I > ;.2 > ... > O.
counted according to their multiplicity. Here. i.", -+ 0 as m -+ oc.
We now proceed as in Section 22.2e. Consider a sequence of triangulations
of G. (T,.). where h.. denotes the maximal diameter of the triangles correspond-
ing to the triangulation 7;.. Furthermore, consider the special finite elements
"'.u. as defined in Section 22.2e. i.e., the functions \"'t.. are piecewise linear. Then
the Ritz equations are given by
(R) i. t (D1uIID1"'h + DzulIDzwh)dx = t ullw""dx
for k = I, ..... n'. We seek u.. = c,.."',.. + ... + C..',."',.',., i,e., we seek the real
coefficients C 1 ..,..., e...... Note that (R) is an eigenvalue problem for a symmetric
matrix which has the eigenvalues
;-1,. > i.. 2,. > ... i.,.,.. > 0,
counted according to their multiplicity. Set
Z = "'2 2 (G). X = W 2 1 (G).
Y = L 2 (G).
Using the estimates given in Section 22.2e, we obtain the following error
estimates from Corollary 22.33.
364
22. Hilbert Space Methods and linear Elliptic Differential Equations
(i) Eigenralues. For all ,n n.
I).",,, - Ami s; const. h,
where the constant is independent of m and n.
(ii) Eigenl'ectors. Let (u m ". )"m) be an eigensolution of the Ritz equation (R)
normalized by 1I1I",,,1I x = 1. Then there is a generalized eigensolution
(I'", i. m ) of the original problem (P) such that
IIII",n - l'nll2 = (L (u",n - I'n)z dx Y'Z const · h
for all nJ ,1. Here, the constant is independent of n.
This shows the quadratic convergence of the method of finite elements
with respect to h".
22.14. Gcirding Forms
Definition 22.37. Let X and Z be H-spaces over K = lA, C with the continuous
embedding X £ Z. Then g: X x X -+ K is called a Gdrd.ng form iff g is bilinear
and bounded. and there is a constant c > 0 and a real constant C such that
Rey(u.u) > cflulli - Cllull for all u e X. (93)
This inequality is called a Garding inequality. If C = 0, then we speak of a
strict Garding inequality. Then, g is called a strict GArding form, i.e., g is
strongly positive.
The Garding form g is called regular iff the embedding X c Z is compact.
Inequalities of type (93) for strongly elliptic operators of order 2m were first
proved by Garding (1953) in a fundamental paper (cr. Section 22.15). In this
chapter and the following two chapters we shall show the following:
Gdrdiny .forms playa .fundamental role in the modern theory of linear partial
differential equations of elliptic, parabolic, and hyperbolic type.
The importance of Garding forms for the calculus or variations will be
explained in Section 29.12.
The following simple result is the key to the theory of Garding forms and
their applications to boundary value problems.
Lemma 22.38. Let g: X x X .... K be a regular Gdrding form. Then there exists
a decomposition of the form
g = a + b,
",here a, h: X x X ...... IK are bilinear and bounded, a(. t .) is strongly positive,
and b( .. . ) is compact.
f g is symmetric, then so are a and b.
22.14. (lanling Forms
365
Consequently, regular Girding forms are compact per(urbation of strongl'!
positi vc hilinear form).
PRC)()F. \Vc set
a(u, v) = g(u, t') + C(Utt)l
and h(II. (J) = - C(1I1 ')l' By (93).
Rca(u,u) > cJlull for aU u e X.
Since the emhedding X c Z is compact. it follows from
II,. u and I'. I' in X as n -+ 00
that II" -+ II and VII ...... v in Z as n -. 00 and hence
(u"lvlI)z -+ {ulv}z
Thu, h X )( X -+ k is compact.
as " -+ 00.
o
Proposition 22.39 (Hcstcnes (1951 »). lf't '( he 1I real H-space. Then:
(i) S"ppt,se ,lrallllf hilinear j'orm,"
Q. h: X )( X -+ R
are hOllntlf.Jd and .ynlm('tri('t wht're. in addi';("1, a ;. strollgly po.ft;lir'(! 'Jlld b
i... (,()"'PUft.
,r tI + h is stricti)' positive, tllen a + h i tr()ngly po.ftitivt,
(ii) SU1'PO,C;t) l!Jllt {J: X x X -. R ;. a strictly positive, symmttric. reglllcI,
Gt;rJiny ,fornr. 1"en. g i.-' sirongly positivc.
The proof of (i) will be given in Problem 22.4. Statement (ii) follows immedi-
ately from Lemma 22.38.
In Section 22.16. Proposition 2239 will he applied to stability theory. In
Section 29,12, this proposition will playa fundamental role in order to obtajn
sufficient criteria for the existence of minima via accessory quadratic eigen.
value problems (the functional analytic generalization of the classical Jacobi
theory).
The following proposition is the key to our proof or the Girding inequality
for trongly elliptic differential equations in the next section.
PropMicion 22.40 (The Abstract Ehrling Inequality). Let X. y, and Z he thre('
B.,IH'(,(" (1"(" B< = IR" C with
x y c Z.
Suppo.-;e II,at the emh(.dding X Y is compart, and that the emhedding Y Z
i.'\ COli' ;nu()u..
The'" .for ell(.1J I; > 0, lhfrt j. a cIJn./QIJt e(l;) > 0 Juch lhat
'11 U n y < ell u n x + c( r.) II II If Z
for tll' u eX.
(94)
366
2 Hilbert Space Methods and linear Elliptic Differential Equations
EXAMPLE 22.41. Let G be a bounded region in R N , N > I. Then the inequality
(94) holds in the case where
X = W 2 '" ( G ). Y = W 2 '" - 1 ( G). Z = L 2 ( G ), nJ = I, 2, . . . .
Inequalities of this type are due to Ehrling (1954).
PR(X>I' ()I: PR()P()SITU)N 22.40. Suppose that (94) is not true for a fixed f. > O.
Then there is a sequence (II,,) such that
1111,,11.. > E lIu,,1t x + n lIu"lI z
for all II.
(95)
Hence II" ¥- O. After division. if necessary, we may assume that lIu" It x = I for
all n. Since the embedding X s; Y is compact, there is a subsequence, again
denoted by (rl,,). such that rl" -+ rl in Y as n -+ 'Xl. This implies r,,, -+ U in Z as
II ..... x. By (95), IIrl"lI.. > f. and hence II ¥- O. Again by (95), U" ..... 0 in Z as" ..... x,
i.e.. " = O. This is a contradiction. 0
22.15. The Garding Inequality for Elliptic Equations
We want to study boundary value problems for linear differential operators
of order 21n. i.e.. we consider
Lu = L ( - I )121 D2(a2DII (I).
i 2 1'11I1 S '"
We make the following assumptions.
(H) Let G be a bounded region in IR'\'. N > I. Let m= 1,2, .... All the
coe!licient functions a 2Jj : G ..... IR are measurable and bounded (e.g., a 211 E
C( G)).
(96)
Recall that ."( = (I' . . . .. ....), D; = i'/ i;, and
D « - D 21 D 22 . . . D 2."I
- 1 2 N ..
where = (x,'......'V)and II = I + ... + N'
Definition 22.42. Assume (H). The differential operator L in (96) is called
regularly strongly elliptic iff there is a number Co > 0 such that
L a2,(.'()D2D > Co L ID212 (97)
121.1111- '" 1 2 1 = '"
for all x E G and D2.. D/l E R.
The differential operator L is Cc lied strongly elliptic iff all the functions
a«/l: G -. IR are continuous for lex I, 1111 = m and there is a Co > 0 such that
L arJ(x)drJd > coldl 2 '" (97*)
tal. 1111 = 1ft
for all x E G.. d E (R.....
22.1 S. The GArding Inequality for Elliptic Equations
367
Moreover, L is called symmetric iff a",(.) = a':I(x) for all indices 2 and p,
and all x e G.
The dual differential operator L. to L is defined to be
L · u = > ( - 1)1«1 DfI(a'flD' u).
1.I)j '"
Notice the following peculiarities. In (96), Dfl denotes a differential operator,
but in (97), D" denotes a real number, where 2 is an index. Roughly speaking,
condition (97) means that the principal part of L is positive definite. If L is
symmetric. then L. = L.
In (97.), we have d = (d 1..... d N ) and
d 2 - d Cl' d 21 · · . d fl4f
- I 2 1-1 .
Since
L Id I 1 2C11 .. .ld N I 2 cr, = Idl 2M
121. M
for all d eRN,
condition (97) implies (97.). Therefore, we obtain the following:
Regularly strongly elliptic operators are also strongly elliptic in the case where
a2 ;s continuous and bounded on G for all !x, P "'ith 121. 'PI = m.
In the following sections we want to prove:
(i) Fredholm alternatives for regularly strongly elliptic or strongly elliptic
differential operators L, and
(ii) the existence of infinitely many eigensolutions in the case where L is
symmetric.
Here. the key is the Girding inequality below.
EXAMPLE 22.43. The second-order differential operator
N
Lu = L -Dlil + aolD;u + a OO
i-I
is regularly strongly elliptic. The dual operator is given by
tv
L.u = L -Dlu - D,(aoiu) + aoo,
i=1
and L is symmetric in the case where Do,(X) = 0 for; = I...., N.
EXAMPLE 22.44. The fourth-order differential operator
Lu = 2 U
is regularly strongly elliptic and symmetric.
368
22 Hilbert Space Methods and Linear Elliptic Differential Equations
PR()()F. Explicitly. we obtain that
LIl = ( D;2 ) 2 U = L asD2D'u.
I II = 2
where Cl:rll = I if = P. and a(J11 = 0 if p. Note that D2 = DiD) for II = 2.
o
Proposition 11.45 (The Garding Inequality). Assume (H) above, and assume that
tile differelltial operator L is regularly strongly elliptic or strongly elliptic. Then..
tltere are real 'lumbers c > 0 and C such that
r L a2D2uD'u dx > c II u II;', 2 - c lIuli
J (i 1 2 j.11l1 s m
(98)
lor all 11 E t'2'"( G).
PR()()F. We first assume that L is regularly strongly elliptic.
(I) The principal part. Let u E W 2 '"(G). The strong ellipticity condition (97)
implies
r L u..,D 2 uD-udx Co r L (D 2 u)2 dx
J c; 12\.(111='" J G 1 2 1='"
= c o llull;'.2.0'
By A 2 (53). the norms 11'11",.2.0 and 11'11",.2 are equivalent on W 2 "'(G).. i.e..
there is a c > 0 such that
collull;'.2.0 > 2cllull;.2'
II, The lower order terms. Again let u E W 2 "'(G). Note that supxG la 2 6(x)1 <
x for all x, fl. By the Holder inequality. there is a K > 0 such that
r L alJ,OSuDu dx S K lIull;'-1.2'
J G 12(.11 S ",-I
By the Ehrling inequality in Example 22.41 t
lIull"'-1.2 £lIull"',2 + c(£)lIuIl2'
Choosing ,: > 0 sufficiently small and using (a + b)2 2a 2 + 2b 2 ..
we obtain
K "U II ;. - 1. 2 < c II u II ;, 2 + C II u II i,
Putting these estimates together, we get (98).
The proof for strongly elliptic differential operators will be given in Problem
22.7. There. we will also prove the Girding inequality for strongly elliptic
systems. 0
22.16. The Main Theorems on GArding Forms
369
22.16. The Main Theorems on Garding Forms
The following results will be applied to strongly elliptic differential equations
of order 2m in the next section.
22.16a. Fredholm Alternative
We consider the equation
g(u, v) = b(t.)
for fixed u E X and all l E X,
(99)
together with the dual problem
g(l:, u.) = 0 for fixed u. E X and all l E X. (99.)
Theorem 22.F. Let g: X x X -+ R be a regular Gdrding form on the real
H-space X, and let b e X.. Then:
(a) Tire orig;nal equation (99) has a solution u iff
b(u.) = 0
for all 501llt;ons u. of the dual equation (99.).
(b) If the homogeneous equation (99) \\'ith b = 0 has only the trivial solution
II = 0, then, for each b E X., the inhomogeneous equation (99) has a unique
solut;on.
PROOF. By Lemma 22.38,
g(u, l') = a(u, l') + c(u, l') for all u, t' e X,
where a, c: X -+ R are bilinear and bounded. Moreover, a is strongly positive,
and c is compact. Now, the assertions follow from Proposition 22.26. 0
Note that if 9 satisfies a strict Girding inequality, then the case (b) is at hand.
22.16b. Eigenvalue Problem
We now study the eigenvalue problem
g(u, t') = p(ul v)y
We seek pER and u e X with u o.
for all L' eX.
( 1 (0)
Theorem ZZ.G. Let g: X x X ... R be a symmetric regular Gdrding form on the
real separable infinite-dimensional H-space X, i.e.,
g(u,u) cllull - Cllull
for all u EX.,
370
22 Ifilbert Space Methods and linear Elliplic Differenlial Equations
"'here r is (l real H-spClce ,,"ith the ('ompact embedding X c Y. Here.. c > 0 and
( E IR. Tllen:
(a) Prob/eIJI ( 100) lie's ;I!(illitely ,nany eigenr'a/ues Ii. A /I the eigenr'alues of ( 100)
IrCll"e (illite "udtiplicity. If "'e count tile eigenvalues according to tlleir
"lItit iplicit y. t II ell
- C < J i < J l <...
1 - 2 - "
tIlltl Jim -+ + X CIS"I -. X,
(b) Tile sn,allest e;yelJralue JJ I call he characterized by tlte nl;n;nlunl prohlenl
Jil = ming(lI.u).
null.. = 1.
fleX.
(101)
(c) The IJlth eiJelll'alue Jl", can be characterized hy the m;n;,,,uI"- nJax;nJU'"
pr;tlciple
Ji", = min max g(lI" II).
,W E !f", .. t= ,W
,,'Ilere S = : II EX: Ilull) = I}" al1d !I'm denotes the class of aI/the sets S fl L"
\"here L is '"' arbitrary "1-dinle'J.i(Jnallillear subspace of x.
22.16c. Stability
(:orollary 21.46. Suppose Illat Ill> 0 ill 1"heorenl 22.G. Theil, for eClch h E X*.
the rClriational prohle,n
g ( II" u) - b ( II) = m in!.
U E X,
( 102)
lie's a ,,"it/lle ,olllti()n II. ,,'h;ell eorrepollds to a strict ,nininlunl. M oreorer. u is
tile unique so/ut ;0" 0." the Euler equat ;0'1
/(1l. r) = h(r)
lor fixeci II E X alld all (' EX.
103)
(111,1 II clepends on h ill a stahle I1ICl1ll1er. i.e..
1111 I - U 2 II)' Jl,1 II h I - h 211 )'. .
,,'Ilere tire .'\o/ul ion "i correspollcls 10 hJ e Y.. N ole tllat Y * c X *.
Moreol'er there is (l COllstatlt (" > 0 SU(>" tllat
( 104)
y(u. u) > c Ilul! i
.for all II E X.
( 1 OS)
Remark 22.47 (Interpretation in Terms of Elasticity). In elasticity theory we
ha ve:
u = displacement of the elastic body.
/(U. u) = elastic potential energy of the body.
h(lI) = work of the outer forces.
The variational problem (102) corresponds to the principle of minimal poten-
tial energy. The solution II represents an equilibrium state of the body which
corresponds to the outcr force h. Relation (104) tells us that all the equilibrium
22.17. Application to Strongly Elliptic Differential Equations of Order 2m
371
states u are stable, i.e., small changes of the outer forces, lib, - b 2 11 y-, cause
only small changes of the displacements, lIu 1 - u 2 l1r. In addition, we obtain
from (104) the following fundamental principle:
The equilibrium states of the body are more stable, the larger the smallest
eigenvalue PI is.
Therefore, engineers are very interested in constructing such systems (e.g.,
buildings, bridges, etc.) for which III is large. Relation (105) says that the
displacement u is small in the case where the elastic energy is small. In fact,
the condition /JI > 0 is equivalent to (105) (strong positivity of the elastic
potential energy).
PROOF OF THEOREM 22.G. We consider the GArding inequality
g(u,u» cllulli-Cllull forall ueX.
We set
a(u, v) = g(u, t') + C(UIV)f'
b(u, v) = (UIV)f.
Then the original problem (I (0) is equ;t'alent to the equation
a(u,l') = (Jl + C)b(u,t') for all t' E X. (106)
The point is that the GArding inequality implies the strong positivity or a( · , . ),
I.e.,
a(u, u) C lIull 2
for all u EX.
The compact embedding X s; Y implies the compactness of b: X x X R (cr.
the proof of Lemma 22.38).
We now apply Proposition 22.32 to (106). 0
PROOF OF COROLLARY 22.46. If /JI > 0, then 9 is strictly positive by (101), and
hence g is strongly positive by the theorem of Hestenes (Proposition 22.39).
Theorem 22.A yields the assertions. In this connection, note the following.
If u is a solution of ( 103), then it follows from (10 I) that
IlIUull S g(u,u) = b(u) S Ubllr-nully
and hence /JIll u II r S II b II y-. This is ( 104). 0
22.17. Application to Strongly Elliptic Differential
Equations of Order 2m
We consider the boundary value problem
Lu - J'U = f on G,
DfJ u = 0 on aG for all (X: '(XI S m - I.
(101)
372
22. Hilbert Space Methods and Linear Elliptic Differential Equations
together with the dual problem
L*u-llu=O onG.
(107*)
D'u = 0 on tG
for all : 121 S nl - I.
Here.. I' E and
Lu = L ( - 1 t l21 D(a2Jt DP u)..
II.IIII s 1ft
L.u = L ( - 1 )121 D'(a p2 Dl'u).
12 1.1111 Sift
In order to study these problems in detail. we make the following
assumptions.
(H) Let G be a bounded region in IR", N > I.. and let ", = 1.2.. ... . All the
coetpcient functions a:J": G -+ IR are measurable and bounded (e.g., u:J1f e
C(G)). The differential operator L is regularly strongly elliptic or strongly
elliptic.. and .re L 2 (G). Let X = W 2 "'(G) and Y = L 2 (G).
We define
glll.r) = r L a%6D%{'D6udx.
J G 1 2 1.1111 '"
h(lI.t,) = r !It'd.':. bdt') = r frdx,
JG JG
By Proposition 22.45.. we obtain the Garding inequality
(I u.. u) - JI h (II.. u) > (" II u II i - (C + JI) lIuli
for all u E X and JI E iRe Recall that the embedding X s; Y is compact.
Definition 22.48. The /elleralized prohlenl to (107) (resp. (107.)) is given by
o(u.. r) - Ilh(u.. r) = hi (r) for fixed u E X and all rEX ( 108)
( resp.
g( r.. u) - I,h{t'. II) = 0
for fixed ueX and all ('eX). (IOS*)
Formally.. equation (108) is obtained from (107) by multiplying the first
equation of( 107) with I' E Co (G) and by using subsequent integration by parts.
Analogously, we obtain (I OS.) from (107.). Note that u E W2"(G) implies that
the boundary condition in (107) is satisfied in the sense of generalized bound-
ary values.
I n the following, solutions of (107) and (107.) are to be understood in the
sense of generalized solutions. We say that I' is an eigenvalue of (107) iff
equation (107), with f = O. has a solution U E X with u :F O.
Proposition 22.49 (J--redholm Alternatives). A.SU"1e (H). Theil:
(a) Suppose I IIal J.1 ;S 1101 all e;yent'ulue or ( 107) (e.g., Jl < - C). Tile". .It)r each
.r E L 2 (G).. equal ;011 ( 107) has a Ilniqlle solul ion u EX.
22.17. Application to Stronaly Elliptic: Oiffemttial Equations or Order 2m
373
Jf we set II = Kf, then ,he solution operQtor K: Lz(G) -. L 2 (G) of (107)
is linear and compact. The adjoint operator K-: L 2 (G).... L 2 (G) is the
solution operato,. 10 the dual problem (107.). i.e.. K ;s symmetric if L ;s
symmetric.. .
If p - C. then K: L 2 (G) -+ L 2 (G) is angle-bounded.
(b) Suppose tha, 11 ;. an eigenvalue of (I07 Then Jl is also an eigenvalue of
(107*). Both eigenvalues have rise same finite multiplicity.
For given f E L2(G eqlUJtioll (107) Ittu a solution iff
tfudx =0
for all eigen.WJlulions u e X of (107.).
(c) Tilt eigenvalue... Jl of (107) laa", 110 finite «cumulation poinl.
PROOF. This follows from Theorem 22.F. Corollary 22.20, and Corollary 21.S3.
o
Proposition 22.50 (Eigenvalue Problem). Assume (H) and assume that L is
symmetric. TI,en:
(a) The £';genvalue problem (I07 wltll / = 0.1uu infinitely many eigenvalues
p. All the eigenvalues have finite multiplicity. If we count the e;g('nvalues
according to their multiplicity, 'un
- C < PIS P2 S ...
and JJIc ...... + fXJ a. k ..... 00.
There exi.ts a complete orthollOmlQl system of eigentJeClors {Uk} in the
space L 1 (G}.
(b) The smallest eigenvalue PI can be cluuacterized by
p. z:: min g(u."). fo ,,2 dx = 1, u e X.
(c) The kth eigelivalue Pi ran be cluJr«terized by
Pi:a min max g(utll),
lIe., .. AI
where S = {II EX: JG .,1 4x :: I}, and ;s the class of all the sets S n L.
where L is an arbitrary k-dimensiOlUJI subspGCe of x.
Corollary 22.51 (Stability). Suppo# that we have PI > 0 in Proposition 22.50.
Then for each Ie L2(G Problem (107). w;tla JA :R 0, Iuu a unique solution u e Xt
and
L u 2 dx 1'.2 !o/2 d X .
(109)
374
22. Hilbert Space Methods and Linear Elliptic Differential EquatIons
J\1oreorer. u is lite unique solution o.f tile L'ar;ational problem
!g(l', u) - t (u dx = min!.
U E X.
'''Iel., ('orresp()n{I. to II strict nl;n;,nu"J.
PR(Xn. This is a consequence of Theorem 22.G. In order to prove the com-
pleteness of {II. . apply the same argument as in the proof of Proposition 22.35
to the equation
J(u.l') + Ch(u.r) = (Jl + C)b(u.r) for fixed II E X and all rEX.
which is equivalent to the original pioblem ( 108).
Relation f 109) follows from the following estimates:
JI! L ,,2 ax < g(".II) = t .(1. dx < (fa.[2 dx )"2 (fa 11 2 dX)' 2 · 0
22.18. Difference Approximations
The following results will be used to prove the regularity of generalized
solutions in the next sections. Let x ERN. Denote bye; = (0. .. . , O. 1,0,. . . ) the
unit vector in the ith direction. Define the translation operator
("II)(X) = u(x + lie;)
and the difference operator
VII = "[;"t U .
Hence
" u(x + Ite;) - u(x)
(Vi II)(X) = - -- - -h-------.
Proposition 22.52. Let (j be a hOllnded region in R N . N > I. and let H be a proper
suhregion 0." G. i.e.. H c c G. Fllrthermore, let ho = dist(H, (lG) and let 1 S
P < x. m = I. 2, . . . .
The'l. for all II E R N \\'ith Ihl < hOt the following are true:
fa) 7.lIe cla.sical operator. T/' and vt can be Iln;quely extended to COllt;nllOll5
linear operators .from Lp(G) to Lp(H).
(b) For all U E W,,'"( G).
IIvtIlIlMOf91 'eHt cllllll"o-eCia.
p r
lim U Vu - Dill II "0'" 'eHa = O.
11-0 "
( 110)
(III)
22.18. Difference Approximations
375
(c) Let I < P < 00. If " E L,(H) and
flvtuIlL.(Ht S a for all h E ]0, hi [,
,,'here hi > 0 and a 0 are fixed real numbers, then the generalized der;va-
tit'e D;II exists on Hand D;u e L,(H) with
II Diu II L,,(Ht S a.
(d) Let I < p < 00. If u E W,"'-I(H) and
IIvtull "''''-'(Ht S a for all h E ]0, hi [,
"
",here hi > 0 and a > 0 are fixed real numbers, then u E W,,"'(G) and
""II"''''(H. a.
,
Statements (c) and (d) remain true if we replace V: by VI-It.
Statement (c) above justifies the following principle:
A priori estimates for difference quotients yield the e:<istence of generalized
deril"atil!es.
This fundamental principle will be used critically in our regularity proofs in
the next sections.
PROOF. Ad(a). For all u E Co(G),
IH lu(x + he,W dx s t lu(xW dx.
By the extension principle in Section 18.12, we obtain the continuous linear
operator 1;': L,(G) -+ L,(H).
Ad(b). Suppose that cG E Co. 1. This assumption can always be satisfied by
passing to a smaller region G. Since C(G) is dense in W,,"'(G), it suffices to
prove (b) for u E eX (G). To simplify notation, let N = I and m = I.
Let u E CX(G). Then
V"u(x) = t l u'(x + th)dt.
By the Holder inequality with I-trick,
IV"u(x)IP S t l lu'(x + th)IP dt.
Hence
IH IV"ul' d. s t l (IH lu'(. + th)IP dx ) dt
s t lu'(x)IP dx.
376
22. Hilbert Space Methods and Linear Elliptic Differential Equations
This yields (I 10). Moreover. we obtain that
f IV"u - u'IP dx S f f I (u'(x + th) - u'(x))dt P dx
j" JH Jo
< L' (fll lu'(x + th) - U'(XWdX)dt-+O
according to A 2(32). This yields (I I I).
Ad(c). Since the B-space L,(H) is reflexive. there exists a subsequence such
that
as h..... 0"
VII g in Lp(H) as h..... O.
A simple transformation yields
f <pW" dx = - f uV,-"<p dx for all <p E C(H)
" .J II
and small 11. Letting h -+ 0, we get
f" rpg dx = - f" uD,<p dx for all <p E C (H),
and hence Dill = g on H. Finally, we obtain
If D i U "'.."cII. H!11 II vt u II '''pUI)'
11-0
Ad(d) This follows from (c) by induction.
o
22.19. Interior Regularity of Generalized Solutions
Our goal is to justify the following fundamental regularity principle:
The generalized solutions of linear elliptic equations are smoother, the
.m()other tire data are.
By data we understand coefficients, right-hand terms, boundary values"
and boundaries. We shall prove two important results on interior regularity
(Theorem 22.H) and on regularity up to the boundary (Theorem 22.1).
We consider the differential equation
L ( - I )Iarl f)«(acaIDI u) = f on G (t 12)
1 2 1.161 s '"
together with the boundary condition
Du = 0 on aG for all a: '!XI m - I.
(113)
22.19. Interior Relularity of Generalized Solutions
377
where m = I, 2, ... . Moreover, we set
a(u, II) = f > a.,D.vD'u dx,
J G I-'.I)i 1ft
b(t') = LfVdX.
We make the following assumptions:
(H I) Let G be a bounded region in R N , N > I.
(H2) All the coefficients a", belong to C<r(G), and a(', .) is strongly positive,
i.e., there is a constant d > 0 such that
a(u, u) d "ull;. 2 for all u e 2"'(G).
(H3) Let H be a proper subregion of G, i.e., H c c G.
By a generalized solution of ( 112) we understand a function u e W z "'( G) such
that
a(u, t) = b(t,)
for all v e C o ( G).
( 114)
Theorem 22." (Main Theorem of Regularity Theory). Assume (H I) through
(H3). Let
fe W:(G),
k = 0, I, . . .
be given. Then, each generalized solution u e W 2 "'(G) of (112) belongs to the
Sobolel space W 2 2 "'+"(H), and \\'e have
lIullwl"'."'H» c(lIullwr'(G» + IIfllwr(G»)' (liS)
,,-here the constant c > 0 depends on G, H, k, and m.
The same remains true if we replace the strong positivity of a( " · ) in (H2) by
tire strong ellipticity of (112).
Note that this is a very natural result. I n fact, since the differential operator
in (112) is of order 2m. we expect that the solution u of (1 J 2) has derivatives
up to order 2m + k in case f has derivatives up to order k.
The proof of Theorem 22.H will be given in the next section.
Corollary 2ZeS3 (Boundary Value Problem). Assume (H I) through (H3). Let
fe Wl(G), k = 0, I,...
be given. Then, the generalized boundary value problem (114) corresponding
to (112), (J J 3) has a unique solution u e W 2 "'(G). In addition, u belongs to the
Sobole,: space W 2 2"'+"(H), i.e.,
u e W 2 "'(G) '"' Wi"'+'(H)
378
:!2 Hilbert Space Methods and linear Elliptic Differential Equations
,,,,d
Hull,,}",...,,) + IIllI/N'r-«G) < cllfUwrtG)'
"'here llle cOlIsta,1t C > 0 depellds on G, H.. k, and ,n.
In particular, if .re c' (G).. then 14 E Wl'"'(H) for all k. Since H is an
arbitrary proper subregion of G.. this implies U E C:J"(G), according to the
Sobolev embedding theorems.
By Section 22.21, Corollary 22.53 remains true for H = G in the case where
the boundary of the region G is smooth, i.e., cG E C%J.
PR(X)F. Let X = W2'"G). By Theorem 22.C, problem (114) has a unique solu-
tion U E X and
dull.\ < const IIhIl A ,. < const lI.fIl L2CCi ).
Now use Theorem 22. H.
o
EXAtPlF. 22.54. The first boundary value problem for the Poisson equation,
611 =,. on G. II = 0 on cG.
corresponds to (112). (113) with ,JJ = I. Thus, for each given f'E W;(G).. k = 0,
I". ., we obtain a unique generalized solution U E W 2 1 (G). In addition. we have
liE Wl+'(H)
for each proper subregion H of G.
In Section 22.21 we will show that II E wl+'(G) in the case where the
boundary of G is smooth. i.e.. i 3 G E CI .
22.20. Proof of Theorem 22.H
We will essentially make use of IJegatil'£' norm.. To this end, let
W2-( G),
k = 0, 1. . . . ,
denote the set of all linear functionals h: C; (G) -+ R with the property
Ih(<p)I < B II <P 11".2 for all <P E Co (G) and fixed B > o.
The smallest possible number B is denoted by IIbll- i . 2. Hence
Ih(cp)l < IIhf! -i. 211<p1I.. 2
forall cpEC(G).
STANI)ARI) EXAtPt.E 22.55. Let G be a nonempty bounded open set in IR'\',
N > I. Define
h{fP) = t agD;'fPdx
for all <p E C (G).
:!2.20. Proof of Theorem 22.H
379
where a E Cr(G), 9 E W;(G), k, r = O. I. ..., and
I," s k + r.
Then bE W 2 - It (G) and IIblf-.. z S constllgfl,.2'
PROOF. Integration by parts yields
b(q» = t (-IY.'om(ag)D'q>dx.
where 121 s rand IPI s k. By the product rule for D"(ag) and the Holder
inequality.
Ib(cp)1 s const IIglir. zllcpll.. z
for all cp e C(G).
o
22.20a. The Fundamental Regularity Lemma
Lemma 12.56. Assume (H 1) through (H3) from Section 22.19. Let b E W z -'" + I (G)
be git"en. Then each generalized solution u E Wz"'(G) of (I 14) belongs 10 the
Sobolet' space W 2 "'. I (H), and \\'e have
lIullwr.'(H) S c(lIullwj'(G) + IIbll-",+l.z),
"'here the constant c > 0 depends on G, H, and m.
Before proving this, we want to show that Lemma 22.56 immediately
implies Theorem 22.H.
PROOF OF THEOREM 22.H. The simple idea is to set
\\' = Dju
and to consider the corresponding differential equation for the derivative w.
Classically, differentiation of the original equation (112) yields
> (-1 stD"(a,,_D'''') = g on G, (116)
12J.,ji s...
where
g = DJ! - > (-ly"ID"(D J a«,D'u).
1«1.lji s...
We want to translate this procedure to generalized problems. Note that
problem (116.) below is precisely the generalized problem to (116).
We will improve the regularity of u step by step.
Case 1: Let a(', · ) be strongly positive on W2'( G).
(I) Let f e L 2 (G), and let u E Wz"'(G) be a solution of the original problem
(114), that is,
(E)
a(u, v) = b(v)
for all t' E C<f(G),
380
22. Hilbert Space Methods and linear Elliptic Differential Equations
where
hh') = 1 It'dx..
G
By Example 22.55..
h e W 2 - '" · I ( G)
By Lemma 22.56"
u e IV 2 "'. I ( H)
a(u,t') = 1 ) a«,DvD6udx.
G 1«1.,)i!; '"
and
IIbll-"'+1.2 S constllfll2'
and
II u II '" + 1. 2 S c( II u II "'.2 + II f 11 2 ).
In the following we may set G = H, by making the region G a little
smaller.
(II) Let Ie L 2 (G)" In > 2. and let u e W{'+I(G) be a solution of (E). We set
t' = Djf/J and \\. = Dju. Then, using integration by parts, it follows from
(E) that
a(,,', (p) = hi (cp)
for all cp e C( G),
( 116.)
where
h 1 (<p) = _ 1 jDjcpdx. - 1 L Dja_D«cpD'udx.
G G 1 2 1.161 s '"
By Example 22,55.
hi e '2- '" + 1 (G) and II b 1 11- m + 1. 2 const( 11/11 2 + "U 11...+ 1. 2)'
Note that D'II E W 2 1 (G) for I PI s m.
Applying Lemma 22.56 to (116.), we obtain WE W 2 "'+1(H), and hence
U E W 2 "'+ 2(H).
(III) Let fe W 2 1 (G), In > 1, and let u e Wr+ 1 (G) be a solution of (E). Then we
can use the same argument as in (II) in order to show that u E W 2 ",+2(H),
since again hi e Ji'2- 1II + 1 (G), and
IIh.II-...+.. 2 const(Ufll,.2 + I/UIl"'+1,2)'
(IV) From this we obtain Theorem 22.H by induction.
Cas£' 2: Let the differential equation (112) be strongly elliptic,
Then problem (E) can be written as
(E*) a l (u. to) = h. (l') for all v e C(G),
where
adU.l') = a(u.l') + C L ul'd_. b.(v) = bM + C L uvdx,
By the Garding inequality in Section 22.15, a I (', .) is strongly positive on
W 2 '"(G) if C > 0 is sufficiently large. Now apply the arguments from Case 1 to
( E.).
The proof of Theorem 22.H is complete. 0
22.20. Proof of Theorem 22.H
381
22.20b. Proof of Lemma 22.56
The basic idea of the proof is the following.
(i) We use the strong pos;t;..';ty of the bilinear form a( ., .) in order to obtain
estimates for the difference quotients V"(Du), lal S m.
(ii) By the limiting process h -. 0, we get the existence of the derivatives
DJ(D"u) via Section 22.18.
In order to make the simple idea of the proof as transparent as possible, we
restrict ourselves to a special case. However, note that the proof of the general
case proceeds completely analogously. We consider the differential equation
- (Au')' = f on G,
( II 7)
where G is a bounded open interval in R I , i.e., we consider the special case
m = 1 and N = I. Let A e C(G).
In the following, all constants are denoted by c.
An elementary computation yields the following two key formulas:
V"(vw) = wVltv + T"vV"w, (118)
t vV- lI wdx = - t wVllvdx. (119)
In this connection we assume that the support of v or w is a compact subset
of the region G and that Ihl > 0 is sufficiently small. Instead of V", V-It, Tit, we
write brieny V, V-, T, respectively. Moreover, we set J = JG. Note that
a(u, v) = f Au'v' dx.
Step I: Discrete integration by parts of the bilinear form a( ., · ).
We choose a region n with H c c Q c c G and 00 E Co. I. Furthermore,
we choose a test function'" e Cg>(Q) with
'" = 1 on H.
For all cp e COO<Q). we obtain that
a(V-(t/lu), qJ) = f AV-(t/lu)' qJ' dx
= - f (t/lu)'V(AqJ')dx = - a(u, V(qJt/ln + R.
Step 2: Estimates for the remainder R.
The explicit form of R Collows from (118). However, we only need the
structure of R. It is important that the term
At/Ju'V cp'
382
2 Hilbert Space Methods and Linear Elliptic Differential Equations
does not occur in R. More precisely, R contains only terms of the structure
f I'D'uVtpdx. f pD'uDlltpdx.
where II, 1#1 < 1. In addition, there appear similar terms, which contain the
operator T But the translation operator T does not change the following
arguments. In this connection, note that <P E Co'(O) and that Ihl is sufficiently
small.
let 11'11 and 11'11 2 denote the norm on W 2 1 (G) and L 2 (G), respectively. By
Proposition 22.52.
II V fP!l2 < £"lIfPlI.
The Holder inequality yields
I R I ('U" 1111 cp 1\ .
Step 3: Estimates for the principal part a(u, V(cp.p)).
Since II is a generalized solution, we obtain that
( 120)
a(lI, V(cp.p)) = b(V(cp.p)).
Let Ibl denote the norm of hE W 2 - m + I (G) with m = I. From
Ih(V(cp.p»)1 s IhIIlV(cp.p) 11 2
< c I bill fP.p II c at bill cp II
it follows that
la(u, V(cp.p))1 S c,lblllcplj.
By ( , 20) and ( 121 ). we get
a (V - ( .p u), cp) < c ( II II II + I b I) 1\ cp II
for all <P E C (0), and hence for all cp E W 2 1 (Q).
Step 4: Strong positivity of a( " . ).
We choose <p = V-(.pu). From a(cp,cp) > dllcpll2 it follows that
nV-(t/Ju)1\ < c( lIuli + Ihl).
(121)
Since .p = I on H, we obtain the key i,tequalit y
IIV-"ull "'2 1 (,,) < c( lIuli + Ibt),
for all sufficiently small 1"1 > O. Recall that V- = V-It.
( I 22)
Step 5: The limiting process II -+ o.
From (122) and Proposition 22.52 it follows that u E Wl(H) and
lIuIlK}(H) c( lIuli + Ibl).
The proof of Lemma 22.56 is complete.
( 123)
o
22.21. Regularity of Generalized Solutions up to the Boundary
383
22.21. Regularity of Generalized Solutions
up to the Boundary
Theorem 22.1. Assume (H I) through (H3) from Section 22.19. In addition,
assume that the boundary of the region G is smooth, i.e., oG e C. Let
Ie W 2 '(G),
k = 0, I, . . . ,
be given. Then the generalized boundary value problem (114) corresponding to
the classical first boundary value problem (112), (113) of order 2m has a unique
solution u e W 2 "'(G). In addition, u belongs to the Sobolev space W 2 2 "'.'(G) and
II u II wlm..CG) c 11/11 WrCG)' (124)
where the constant c > 0 depends on G, k, and m.
EXAMPLE 22.57. We consider the first boundary value problem
-L1u = f on G, u = 0 on iJG, (125)
where G is a bounded region in R N , N I, with aG E C. Let fe L 2 (G) be
given. Then the corresponding generalized problem
f t DJuDJqJdx = f fqJdx ror all qJ e C:(G) (126)
JG jel JG
has exactly one solution u e Wl(G), and u also belongs to the Sobolev space
Wf(G).
Using this result, it is possible to give a precise characterization of the
Friedrichs extension of the negative Laplace operator. To this end, let X =
L 2 (G) and let A: D(A) X -+ X be the Friedrichs extension of the operator
-L1: C<f(G) -+ X. Then
D(A) = W 2 1 (G) n W 2 2 (G).
Moreover, we have
Au = -u
for all u e D(A).
Here, the second derivatives are to be understood in the generalized sense.
This is a very natural result.
PROOF. Let u e D(A). We set I = Au. Let X E be the energetic space of A. By
Section 19.11, X E = wi (G) and D(A) X E . The self-adjointness of A yields
(Aulq» = (uIAq» for all q> e CO'(G).
Thus, u is a solution of(126). By Theorem 22.1, u e W 2 2 (G).
Conversely, let u e W 2 1 (G) ("\ Wl(G). We set f = - u. Integration by parts
yields (126), i.e.,
(uIAf/) = (IIf/)
for all cp e C(G).
Since A is self-adjoint, we obtain Au = f
o
384
22. Hilbert Space Methods and linear Elliptic Differential Equations
22.22. Proof of Theorem 22.1
The strategy of proof is the following.
(i) The regularity of the tangential derivatives at the boundary is proved
similarly to the proof of Theorem 22.H.
(ii) The regularity of the normal derivatives at the boundary follows from the
differential equation.
We first restrict ourselves to the Poisson equation in R 2 , i.e., we consider
- 1I - U"" =.f on G, U = 0 on aGo (127)
For given .r E L 2 (G), let 11 E W 2 1 (G) be a generalized solution of ( 127).
Step I: Interior regularity.
From Theorem 22.H it follows that U E W 2 2 (H) for all proper subregions
H c c G.
Step 2: Local regularity up to the boundary in the case of straight-line
boundary parts.
Suppose that a part of the boundary cG corresponds to a part of the -axis.
Choose a half-ball as pictured in Figure 22.4(a). The same method of proof as
in Section 22.20 yields the existence of the generalized tangential derivatives
(u), (u,,)
and those derivatives belong to L 2 (H). Note that the application of the
difference operators V:t lt in the -direction does not cause any difficulties.
In order to get the existence of the remaining derivative u"", we use the
differential equation (127). More precisely, we have to use the generalized
equation
L (ufP + u"fP,,)dx = LffP dX for all fP E Co-(H).
I ntegration by parts yields
In ufP""dx = - In (f + u)fPdx.
Hence we obtain that
II"" = - f - u,
in the generalized sense. This implies u"" E L 2 (H). Therefore, U E W 2 2 (H).
Step 3: Local regularity up to the boundary in the case of curved boundary
parts.
We consider a boundary part as pictured in Figure 22.5. We choose a local
(It, ')-coordinate system, and we assume that the boundary part is given by
22.22. Proof of Theorem 22.1
385
'J
G
H Hb
(a) (b)
Figure 22.4
the equation' = g(p). The transformation
. = p. ". = , - g(p)
( 128)
corresponds to a local C«J-diffeomorphism. Note that the Jacobian is different
from zero. By (128), we reduce our problem to Step 2. More precisely, by (128),
the original differential equation (127) is transformed locally into a new
differential equation having C7J- coe fficients with respect to the new variables
.t ".. An explicit calculation shows that the corresponding transformed
bilinear form remains strongly positive. Therefore, we can use the same
argument as in Step 2.
Step 4: Globalization.
We choose a sufficiently fine finite covering {V J } of G, where all the sets
are open. Let {Cf)J} be a partition of unity subordinated to {V J }. In particular,
we have Cf)J e CO>(). We set
u J = cpJu.
By our previous local regularity results in Steps 1- 3, we can choose V J in such
a way that U e W 2 2 (G '" ). This implies
uJ E W 2 2 (G).
It follows from U = L uJ on G that U E W 2 2 (G).
'J*
===:>
Jot
t*
Figure 22.S
386
22 Hilbert Space Methods and linear Elliptic Differential Equations
This proves Theorem 22.1 in the special case of the Poisson equation.
In the general case of equations of order 2m, one can use similar arguments.
In Step 2 above one has to extend the function u from the half-ball H to the
ball H" in Figure 22.4. according to Problem 21.7. The details can be found
in Friedman (1969. M), p. 61.
PR{)BlEMS
22.1. General clludit.r principle for quadratic nlinimum prohlems. We consider the
quadratic minimum problem
(P) F(II) dd a(u.lI) - 2h(u) = min!. II E U o + X.
together with the variational equation
(V)
a{lI. w) = h(\\')
for all n' EX.
The dual problem reads as follows:
(p.) F.(r) dd - a(f, r) + 2a(ll a , t') - 2h(uo) = max!.
In preparation we formulate the ke}' relation
1111 - ('II i = F ( u) - F. (L')
t' E ('0 + Y.
( 129)
for all u E Uo + .X and aliI' E l'o + Y. We make the following assumptions.
(H I) X is a closed linear subspace of the real H-space Z.
(H2) The map a: Z x Z --. R is bilinear, bounded. symmetric, and positive.
(H3) a: X x X R is strongly positive.
(H4) The functional h: Z -+ R is linear and continuous.
(HS) We are given U a E Z.
(H6) We set
Y = {u E Z: a(u, ".) = 0 for all '" EX},
i.e.. Y consists of an the solutions u E Z of the homogeneous variational
equation (V). We choose a fixed solution t'o E Z of (V). Furthermore,
we set
(ulr).. = a(u,v)
for all u, (' E Z.
Show:
(i) The original problem (P) has a unique solution ii. This solution ii satisfies
the variational equation (V).
(ii) Relation (129) holds.
(iii) The dual problem (P*) also has the solution u. and the extremal values
of (P) and (p.) are the same, i.e., F(ii) = F.(u).
(iv) The Rilz - Tre.lJtz method. Let XII and Y" be a finite-dimensional subspace
of X and Y. respectively. Let u (resp. 1') be a solution of the Ritz problem
F(u) = min!.
U E Un + XII
( I 30)
(resp. Trefftz problem
f'*(I:) = max!.
I' e: 1'0 + Y,,).
( 1 30.)
Problems
387
Then we obtain the fundamental error estimates
f*(v) S f*(ii) = F(ii) < F(lI)
and
llii - I'lii s F(u) - F*(1').
(v) The h}'perc;rcle method. For all U E U o + X and all (' E 1'0 + Y.
,,2-'(u + r) - "liE = 2- l llu - ('!I£.
(131)
This relation tells us that the solution ii of the original problem (P)
lies on a hypercircle with center 2- 1 (u + t') and radius 2 -'lIu - I'U E . let
(U,I') be a solution of the minimum problem
IIu - t'lI == min!. U E U o + X"' I' E roo + 1-:..
Then equation (131) yields an error estimate for ii. This is the so-called
hypercircle method of Prager and Synge (cr. Synge (1957, M)), which is
frequently used in elasticity.
Solution: Ad(i). Cf. Theorem IS.A.
Ad(ii). By construction of ('0 and y, we get
a(l', u - u o ) = b(u - u o )
and hence
for all u E Uo + X. I' E "0 + y,
flU - I'I! = a(u,u) - 2a(u,t') + a(r,r)
= F(u) - F*(r) + 2a(u o - u. r) - 2b(u o - u)
= F(u) - F*(t,).
Ad(iii). By ( 129).
f*(I') S F(ii) for all f E 1'0 + t:
It follows from the variational equation (V) that
alii, M') = b(M')
By the construction of 1'0'
a(l'o. ".) = b(M')
for a II '" EX.
for all ". E X.
Hence a(fi - ('0' M') = 0 for all ". EX, i.e., ii E l'o + r. letting u = I' = ii in
( 129). we obtain F(u) == F.(u).
Ad(iv). This follows from X" X and 1': by (129).
Ad(v). Let u E U o + X and I' E ('0 + Y. From
a( ,.. ;) = 0
for all }' E Y. x E .\'.
we get a(;; - I" ii - u) = O. This implies
lIu + I' - 2iilli = lI(u - ii) + (r - ii)lli
= Il(u - ii) - (I" - u)ili = 1/11 - I'lii.
22.1 a. Application to the first boundary I'alue prohlem. Apply the preceding results to:
- Au = f on G,
u == U o on eG.
( 132)
388
22 Halbert Space Methods and Linear Elliptic Differential Equations
Here, G is a bounded region in R N , N > I. We are given .rE 1"2(G) and
"0 E W 2 1 (G).
Solution' We set
f .'V
a(fI. t') = L DiuDiv dx,
G i e I
bel') = Lfi'dX.
Moreo\cr. let
Z = W 2 1 (G),
x = W 2 1 (G).
In this case. ('0 is a generalized solution of the equation
-t'o = f on G,
and the functions r In Yare generalized solutions of the homogeneous
equation
- t\r = 0 on G.
By the lemma of Weyl (Theorem IS.G),
r = {t" E C"I (G): t" = 0 on G}.
Thus. the functions in X and Y satisfy the homogeneous boundary condition
u = 0 on cG and the homogeneous equation Av = 0 on G, respectively.
Now, we can apply the results (i) through (v) above to the first boundary
value problem ( 132).
22.1 b. Application to the method of orthogonal projection. Show that problems (P)
and (P*) above generalize the method of orthogonal projection in Section
22.3.
Solution: Choose X and Z as above. We set
a(u, to) = (ulr)z,
b = 0,
and we are given U o E Z. let ('0 = o. Then Y = Xl. i.e., Z = X $ Y. Thus, we
obtain
F(u) = IIu l 1 2 ,
F*(t') = -lIt'1I 2 + 2(uolr).
Hence the original problem (P) (resp. the dual problem (P*)) above are
identical with
tjU o - uli 2 = min!. UE X
( resp.
-llu o - rl1 2 + lIu o ll 2 = max!,
(' E X.1).
This corresponds to the method of orthogonal projection in Section 22.3.
22.2. Weak .-.equential lower semiconlinuil}' of posili('e quadratic funt'tionals. Let
a: X x X -.. R be bilinear, bounded, symmetric, and positive on the real
H-space X. Show that
U" --a.. U in,,
as n -. x
implies
a( u, u) lim a( u", u,,).
(133)
,,-.-
Problems
389
Solution: For fixed U EX. we set
b(v) = a(u. v)
for all (' e X.
Then be X., since Ib(v}l s dnullllt'li. Thus. UII u as n .... oc: implies b(u,,) -+
b(u). i.e..
lim a(u", u) = a(u, u).
II-X
It follows from a(u - u", U - UtI) 0 that
a(u, u) S a(u", u,,) + 2a(u, u) - 2a(u ll . u).
Letting n -+ x. we get (133).
22.3. Alttrnatit'e proof of the main theorem on quadratic minimum problems. We
consider the problem
F(u) de( ia(u, u) - b(u) = min!.
U EX.
( 134)
Suppose that the map a: X x X .... R is bilinear. bounded, symmetric. and
strongly positive on the real H-space X. and let be X.. By means of weak
compactness (Theorem 21.D), show that (134) has a solution.
Solution: Since a(.. . ) is strongly positive, there is a c > 0 such that a(u. u)
cJjul/ 2 for all u E X. We set
dd inf F(u).
..x
Thus, there exists a sequence (u,,) with
F(u ll ) -+
as n -+ x..
It follows from
!cllu,,11 2 - flbll Du,,11 s F(u,,) S constant
for all n
that (u,,) is bounded. By Theorem 21.D, there is a subsequence, again denoted
by (u,, such that
u" -... u
as n.... 00.
Hence b(u ll ) -+ b(u) as n .... 00. By Problem 22.2.
F(u) S lim F(u,,) = 2.
II-cr
This implies F(u) == a, i.e., u is a solution of (134).
22.4. The theorem of Hestenes. Let a. b: X )( X -+ R be bilinear. bounded, and
symmetric on the real H-space x. In addition, let a be strongly positive, and
let b be compact.
Show that if a + b is strictly positive. then a + b is strongly positive.
Solution: If a + b is not strongly positive. then there exists a sequence (u ll )
with lIu,,1I :III: I for all nand
a(u". u,,) + b(u". u,,) ... 0
as n.... oc.
( 135)
Since (u,,) is bounded, there exists a subsequence. again denoted by (u II ), such
that
u,,u in X
as n -+ 00,
390
22. Hilbert Space Methods and Linear Elliptic Differential Equations
by Theorem 21.0. The compactness of b implies
h(u". u,,) -. h(u. II)
as n...... x'.
Since a( . . . ) is strongly positive. there is a e > 0 such that a(lI. ,,) > ell ull 2 for
a 11 II E ,X. By ( I -' 5 ).
c + b( II, .4) < O.
By Problem 22.2.
a(u. u) + h(U.II) < O.
Since a + h is strictly positive, II = O. Hence c < O. This contradicts f > O.
22.5. Eiqenra/ue pr()hlenJ. Let AI and N be linear subspaces of a B-space X. We set
= sup{dist(u, N): U E M, '1111f = I}.
/I = sup{disUu. M): u E N. lIuli = 1)
and ii( ,\1. 1\') = max {. II} (Fig. 22.6).
M
.0
------- N
6(M,N) = n1axio.(J}
Figure 22.6
22.5a.. E.'itimate.. for eiyenrailles and eigen,:e(.tors. Suppose that
lim I' A - A,,!! = 0,
" -. -r
"'here the operators A, A,,: X -+ X are linear, strictly monotone, symmetric.
and compact on the real H..space X. Let
A.I > ;'2 ... > 0
( resp.
i. ,,, > ;"l" > . .. > 0)
be the eigenvcctors of A (resp. A,,) counted according to their multiplicity.
The eigenvector space to ;.. (resp. A....) is denoted by X. (resp. X...).
(i) By Corollary 22.29.
I A. - ;.." I It A - A "II for all k. n.
(Ii) Show that for each k. there is a constant C. such that
(X., X.,,) C.IIA - A"I!
for all n.
Hint: (Of. ()sborn (1975). This paper also contains similar results for
compact operators in B-spaces.
Problems
391
22.Sb.. r,,'0-5ided error e$timares for e;gent'a/ue5 (the ,nethod of Knauer). We consider
the eigenvalue problem
Lu = ;.u.
u e D(L).
( 136)
We want to obtain the two-sided error estimate
Il. S A. S A S q
( 137)
for the k th eigenvalue ;.. of (136). The key to (137) is the estimate ( 138) below.
We make the following assumptions.
(H I) The operator L: D(L) X -t X is linear. symmetric. and strictly mono-
tone on the H-space X.
(H2) There exists a sequence of eigenvalues of L with
O<;..S;.2 S ....
lim A. = x
i-7
and an associated sequence (u,,) of eigenvectors which form a complete
orthonormal system in X.
(H 3) Let X" == span {('. .. . . . t',,} be an n-dimensional linear subspace of D(L).
The eigenvalues of the Ritz method
(Lullj) = p(Ullj),
u eX,.. j = I..... n.
are denoted by 0 < P. S P2 S ... S PIt'
(H4) Suppose that we know a number q > 0 such that
(Lulu)
(ulu) q
for all nonzero u eX': r\ D( L).
( 138)
where X; denotes the orthogonal complement to X" in X.
let I'. denote the lowest eigenvalue of the matrix
A 0,.
o p,.'"
'. ',. q
where'J = (ULu J tt 2 - pJ)112.
Show that (137) holds.
Hint: cr. Knauer (1971) and Velte (1984, S). Further important methods
for obtaining two-sided error estimates for eigenvalues can be found in
Remark 22.31.
22.Sc. An example. Apply (137) to the following boundary-eigenvalue problem:
((I + .)u")" = ;.u.
o < x < n.
( I 39a)
( I 39b)
ufO) = u(n) = u"(O) = u"(n) = O.
Solution: Set X = L 2 (O. n) and Lu = (( I + x)u")" with
D(L) = {u e C 4 [O. n]: u satisfies (139b)}.
392
22. Hilbert Sracc Methods and Linear Uirlic Differential Equatinn
Along with (139 we consider
1I e4t = AU,
o < x < 1f,
( 140)
u(O) = u(n) = u"(O) :: uN(Jt) == O.
This auxiliary prohlem has the eigcnsolutions
v.'.t) = sin nx,
AU = ft4,
n = 1. 2 . . . .
Set t l U = U'4 with D(L I) :::: D(L), and let X" = 5pan {"I' .. . t 1'" J. Since A', is
the luwt!;t eigenvalue of 141 on X ' we obtain
(I"., ,! AU) J
(utu) ..
For 111111 E Df l integration hy rarl yields
(I.ulu) == t (t + X}U"l Jx f: U"2 Jx
== s: uC'ud. = (1"1 14M.
This implies the key estim.,te
ror al1 non1.ero U E X; f\ 1>(1..).
(/.u(ut (/"lulu A, (ul")
for itlJ U E X. I () D(I..). i.e.. q = In + 1 )4.
Let 1. he the kth eigen\'aluc off 139). tJ!ing the nine hasi functions I',.. . '..
"... we obtuin from (137) the following crror estimates:
2.42SR7 < A I < 2.42602
38.07887 s A 2 S 3M.OS 746
191.4660 $ lJ S 191.5835
6OJ.208R A. S 603.9993
1468.176 s l S 1411626.
22.6.. 1"h,# II"mt'ricol .lah;I;,y 'if ,,,,, Ritz mf',I"Hl. Examples show tht'l the Ritz
method may behave badly if the sY51em or basi function5 is nOl 5trongly
minimal. We want h.1 explain thi d''''lIf'rllUs effect. Our main reult will he
thut it is important '0 U5e stronldy minimal ystems.
We consider the minimum problem
(,(u. u) - /J(ut ::= min!.
u eX.
and make the rollowing a'iumptions.
(111) Let X be a rcut separable fl-space. The map (I: X x X R is bilinear"-
bounded. !\ymmetric. and strungly posilive. tet heX..
(112) let X. == span {(".... .".1. and suppose that X = u. X..
We consider the Rill equation
d(.\. ",,) = "('1.
k = I..... II.
('41'
Problems
393
where
"
"" = L C"",l.",.
",-I
Then the Ritz equation (141) means that
"
L 0(1'., v",)c"", = b(l'.),
"'= I
k = I, . . . , n.
( 142)
We write this linear system in the following matrix form:
A"e" = bIt'
( 143)
Together with this equation, we consider the perturbed equation
Aile" == '6".
We introduce the energetic norm, l1ullE = a(u, U)I/2, and we set
II BII = max 11\",1
, s.."'s"
( 143.)
for the real n x n-matrix 8.
The Ritz method is called numericall.v stable iff there are positive constants
2, p, }' such that
lIu" - ii,,11 E S 211 A" - A"II + P lib" - b"l1 for all n
and all real n x n-matrices A" with If A" - ,.1,,11 x S i". This relation tells us that,
uniformly for all n. sufficiently small perturbations of the Ritz equation lead
to small perturbations of the Ritz solutions u".
The system of basis elements {l.,. V2' . . .} is called strongly minimal in the
energetic space iff
inf ;!.") > 0,
"
where A.(,'" denotes the lowest eigenvalue of the matrix A" = (a(l'., v",H..m E I. ....".
For example, if (1'.) is a complete orthonormal system with respect to the
energetic scalar product (ult:)£ = a(u, v) on X, then A" equals the identity, and
hence (l..) is strongly minimal in the energetic space, since ).(.") = 1 for all n.
Show that the Ritz method is numerically stable iff (t'.) is strongly minimal
in the energetic space.
Hint: Cf. Michlin (1969, M), Chapter 2, Section 2. Chapter 5 of this book
also contains similar results for eigenvalue problems.
22.7. The Gdrding inequalit}, for strongly elliptic systems via Fourier transform. In
order to make the proof for the general case below as transparent as possible.
we first consider a very simple situation.
22.7a. A t}'pical special case. We consider the following differential operator in
divergence form:
Lu == -(au')' + bu' + duo
Let G be a nonempty bounded open set in R I. We set
g(u. t'),.. t (au'v' + bu' v + dud d.",
( 144)
394
Hilbert Space Methods and Linear Elliptic DifTerential Equation"
If the coefficients (1. h. d are sufficiently smooth. then integration by parts
Yields
(/1/1,1'1 = f (L/I)I'd'(
JG
for all II. r E Co (G).
Obviously. the more general differential operator
I.. ,,, = - (J, II" + h, II' + d, II
can be "'rinen in the divergence form (144) with
(I = a,.
h = a; + h,.
d=d,.
In the case ",here the denvath'e {l eXists.
We assume that the functions a. h. d: G ..... R are measurable and bounded.
Furthermorc. suppose that there is a number Co > 0 such that
a( t > Cn
for all x E G.
(145)
Recall that
/llIf.! 0 = f ,,'2 d.'(,
J (i
" f., '2 d
d' I i 2 = J G II - + II .x.
11/1 I j = f /12 d'(.
Jc;
Sho"' that there arc real constants (' > 0 and C such that
n(u.ut c'lIlIli.1 - (" 1I'1 for all U E JV 2 1 (Gt. (146t
Solution: Since the norms 11'11 1 . 2 . () and . :1 1 . 2 are equivalent on J'11 (G). and
since ("(; (Gt is dense in "'2 1 (Gt. It is sufficient to prove that
( .. I '
(/( II. III C ," i, 1 () - d II I 2
for all II E Co (,).
(147)
UI The I.-trick for the lo"'er order terms. By the Holder inequality. we
obtain
1. /1'/1 d.'( < (1. ,,'2 d.'( Y 2 (t ,,2 dx y 2.
USing the Inequality 12:x#1 f.2 2 + f. 2/1 2 for all 2. II E iRe f. > O. we
!!ct
f . hu'lle/x SUPlhl ( r. f 1I'2Jx + 1:'2 f U2dX ) .
(, G (i G
Since \\e can choose I: > 0 arbitrarily small, it is sufficient for the
proof of (147) to shoy.' that
f dU'1 dx > (' f .,'2 d'( - (" f 1,2 dx.
(, G G
( 148)
for all U E C (j) and appropriate constants c > 0 and C. This
follo\\s Immediately from (145). However. we will give another proof
of t 148) \\'hich can also be applied to strongly elliptic ystems below.
T () this end. we asc;ume that the function a is continuous on G.
Problems
395
(II) The Fourier transform. Let F(u) denote the Fourier transform of II
(cr. A 2 (74)). For all II, t' E CO: (G) we have
t ut'dx = t F(u) F(I') dx (149)
and
F(u') = ;.F(u).
(III) Proofof(148) for constant coefficients. Let .J bea fixed point in G. For
all u E Co (G). we obtain that
t a('J)u'2 dx = t a(xJ)F(u') F(u') d.
f coIF(u)l2 .2 d. = f cou'2 d..
G
(IV) Localization of the principal part and freezing of the coefficient of the
principal part. We want to describe an important technique which
reduces the case of variable coefficients to the case (III) of constant
coefficients. The idea is to use the values a(x)) at a finite number of
suitable points .). This procedure is called "freezing of the coefficints."
(IV-I) Basic estimate. Let U be a nonempty open subset of G and suppose
that
la(.) - a()')1 < "
for all .x. Y E U.
We fix cp E CO=(U) and Y E U. Then. for all u e Co(G), we obtain that
t (a(.) - a(y))(ufj))'2 dx s" t (Ufj))'2 d.
< ". canst fG (u' 2 + u 2 ) dx.
noting that (ucpr = u' cp + ucp' and 212PI < 2 2 + p2.
(IV -2) Partition of unity. Let" > 0 be given. Since a is continuous on G. there
exists a finite number of nonempty open sets U I . .... U A such that
G c UJVjand
la(x) - a(y)1 < " for all ..}' E V J G and all j.
By Section t 8.16. there exists a partition of unity subordinated to {V J }.
i.e.. there is a family {'PJ} offunctions Cl'J e C (),j = I,.... k. such that
L cpJ(.t)2 = I for all x e G. (150)
)
where we set CPj('x) = 0 for x f U J . Let u E Co (G) be given. We localize
the function u by setting
u) = Cl'Ju.
Then u) E Co(U J ). and from (ISO) we obtain the key relation:
r au; 2 = r. a(/Jl u' 2 + A = au' 2 + A. ( I 5 I )
J J
396
22 Hilbert Space Methods and Linear Elliptic Differential Equations
where A contains lower order terms, i.e., A contains only terms of the
form uu' and u 2 .
(lV-3) In order to prove (148) it is now sufficient to show that
J del f 4. a(.tjlu/ d.t c f u' 2 d.t - C f u 2 d.t (152)
JG j JG JG
in the case where we choose" > 0 sufficiently small and we fix the
points .t j such that .Y') E V J for all j. I n fact, by (151), we can estimate the
expression A according to (I) for small I: > 0, and we can use the
argument (IV-I) for small" > O.
(VI-4) Proof of (152). By (III) and (151).
J > c o f LU?dX=co f U'2d:<+ f Bd:<.
JG J G G
where B denotes lower order terms. Using (I) with respect to B. for
small f. > 0, we obtain (152).
22.7b. Strollgly elliptic sy.tems. We consider the system
L.u = r. on G.
J lj
j= I.....M.
(153)
D' Il) = 0 on ('G
for all i': 1;'1 < m - I,
\\' here
"
LJu = L L (- I rID:I(aJ"JD')Ui'
. -I :s1.." '"
and \\e set
g(u.l') = fG a.,iJ D "l' j D'u i d."<.
where we sum over i.} = I..... M and . p with II.IPI m. For u. I' E Co (G)"'.
integration by parts yields
g(u. r) = f t (L j u)l'j d.t.
G j= I
The system ( 153) is called slrongl.' elliptic iff all the functions a'ij with i.j = I.
.. ,\1 and 121. IIlI = m are continuous on G and there is a Co > 0 such that
.W
aJI,j(.t)d2dl"';"j > coldI 2 "'1"'12.
I. J .... I
( 154)
for all.t E G. d EA.... ". E R"'. where d' = d'd2l... d,--. We assume:
(HI) G is a nonempty bounded open set in R N , where N, M. m 1.
(H 2) All the functions a"i): G -+ R are measurable and bounded for i. j = I.
. .. M and 2. fJ with 121, 1/11 S m.
(H3) The system (153) is strongly elliptic.
Show that g('. .) satisfies the Gdrding inequality, i.e.. there are real numbers
c > 0 and C such that
g(U.ll) > ("llull.2 - Citulli
for all u E W 2 "'( G)M.
(155)
Problems
397
Here. u e W;'( G)M means U i e W 2 "'( G) for all it and
111111;.2 = L 1111.11;.2'
.
IIIIII = L 1I1I,lIi.
,
Recall that
lIu, II. 2 = r L I D"u,12 dx.
J G '21 $ '"
lIu,II.2.0 = r L ID"u,12 dx.
J G '"'' '"
Solution: Since the two norms 11'11",.2 and n. 11",. 2.0 are equivalent on W 2 "'(G),
and since (O(G) is dense in W 2 "'(G), it is sufficient to show that
g(II, II) c lIun!.. 2.0 - C null i for all u e C:(G)"'. (156)
As in the proof of Proposition 22.45, the IOK'er order terms of 9 with 121 +
IIlI < 2m can be estimated via Ehrling's inequality and the t-trick.
Thus it is sufficient to prove (156) for the principal part. i.e.. we may assume
that asl'} -= 0 for II + IIlI < 2m. The proof now proceeds completely analo-
gously to the proof of Problem 22.7a. In this connection, observe the following.
It follows from (154) that
M
Re r L a.,.}(x)d"d'w,w} coldI 2 "'l w I 2 .
. · } :1/ I fat .111 = 1ft
for all complex tupels K' e RM, since
Re(K'.KJ) = Re K'i Re w} + 1m K', 1m wi'
and a".)(x). d are real. Hence, for constant a"'f} and u e Co (G)"', we obtain
the following key inequality via the Fourier transform:
r a."joau,D'u j dx = r Re a,,'jx. x, F(u,) F(u j ) d:c
JG JA
:2: Co r IxI 2 -F(u,) F(u,) dx = Co r ID.u,12 dx,
JA JG
where we sum over i, j = I, ..., M and 2. fJ with ," IPI = m. Note that
the left-hand integral is real, since a . . . , u, v are real, and note that Re J . .. =
I Re .. . .
22.8. Existence theory for strongly elliptic systems. By definition, the generalized
problem to the boundary value problem (153) reads as follows.
Let X = W 2 "'(G)M. We are given J; e L 2 (G) for all i, and we are looking for
a function u e X such that
g(u, v) = b(v)
for all veX.
( I 57)
Assume (H I) to (H3) from Problem 22.7b. Then, all the abstract results from
Section 22.16 can be applied to equation (I S7). In particular, equation (1 S7)
has a solution iff
b(u.) = 0
398
." Hilbert Space Me(hod and Linear Elliptic Differential Equations
for all solutions u. E .X of the dual equation
g(r.II.) = 0
for a II r E ..\'.
( I 57. t
The nunlber of linearly independent solutions u. of (157.) is finite.
If (157) has at most one solution. then it possesses a unique solution.
22.9. 1'he 'Plt'tlJod 0' I,o,,,oye,,;:at;()lr cllrel delicate ('"ollrergellce processes (or fil"c-
rio"a'." and operators. We consider the operator equation
(E, )
trll, = f.
\\ hlch depends on the small parameter r. > O. We are looking for an asymp-
totic expansion of the solution of the form
II, = 1'0 + r.f, + r. 2 ("2 + ....
and \\'e are looking for an operator equation
(En)
Lol'o = r.
which yields the approximation 1'0 of U(' In engineering, for example. equation
(E,) describes processes in material of the small period f. > 0 (e.g., heat
conduction in granulated material). The limiting process f. -+ 0 corresponds
to a Ito"roge,ri:ation of the material (Fig. 22.7). The idea of engineers is to use
equation (Eo) as an approximation for (E l ). In this connection, note that the
parameters in fE,) and (Eo) correspond to the material constants of the
periodic (granulated) and the homogenized material. respectively (e.g.. the
heat conductivity).
To explain the typical difficulty let us consider the folloYt'ing simple
problem.
(P, )
- (a , ( x )Il;(.))' = J (x).
O<x<1.
11,(0) = u,( I ) = O.
\\'here we set
Clr(X) = a(x't).
and the function (It has the period f. > O. More precisely. assume that the
continuous function a: R ..... R has period one and inf.rf R a(x) > O. The mean
\alue of a is denoted by
M(al = t' a(x)d:<.
E {
[-.0
Figure 22.7
Problems
399
let f e C[O, I]. One can show that
at(:() M(a) in L 1 (O, 1) as £.... o.
Furthermore, the generalized solution U E W 2 ' (0, I) of (P,) satisfies the esti-
mate HUe!!.. 2 S const for all small I; > O. Hence, there is a subsequence such
that
U&u in W 2 1 (O, I)
as t.... o.
We now come to the point. Intuitively, we would expect that the limit function
u satisfies the equation
(Pot
- (bu')' = I, 0 < x < I.
u(Ot = u( I) = 0
with b = M(a). However. this is ".rong. In fact. we have to set
b = 1/ M (a - , ).
This has been discovered by engineers. Similar difficulties arise for variational
problems.
As an introduction to the method of homogenization, we recommend
Bensoussan, Lions, and Papanicolaou (1978. M). From the physical point of
view. the homogenization of material leads to new physical effects. From the
mathematical point of view, the method of homogenization corresponds to
delicate conl'ergence processes for operators and functionals in connection
with differential equations and variational problems, respectively fe.g., the
G-convergence). A general theory for both linear and nonlinear problems can
be found in Attouch (1984, M).
22.10... The end of a long struggle-the complete description of the Laplacian and of
general elliptic eqllations in terms of fun('tional anal.\'sis, We consider the first
boundary value problem:
- 6u = / on G.
u = 9 on eG.
( I 58a)
( I 58b)
let G be a bounded region in R''', N I, and let fG e C 7 . let m =- 0, I, .. .,
and I < p < oc. We set
Au = (/. g).
where f and 9 are given by (158). The following fundamental result finishes
the long struggle of mathematicians against the Dirichlet problem.
The operator
A: W,,"'+2(G).... W,,"(G) )( W"",+2-1"(fG)
is a linear homeomorphism.
The definition of the boundary spaces W:(cG) of fractional order k will be
given in the Appendix A 2(5 I). This very natural result allows the following
interpretation.
(i) If II e JV,'" + 2(G), then u satisfies (158) with
f e W,,"(G), g e W,,"+ 2 -1"(cG). (159)
400
22 Hilbert Space Methods and Linear Elliptic Differential Equations
In this connection, note that u has at least generalized second-order
derivatives. Thus, equations (158a) and (158b) are to be understood in
the sense of generalized derivatives and generalized boundary values,
respectively.
hit Conversely, for each given J" and g with (159), the Dirichlet problem 158)
has a unique solution u E W p "'+ 2 (G), and there is a constant (" > 0 depend-
ing on G, m, and p such that
IlulI.... 2.p < c( IIfll",.p + ligll... 2- IIP.P)'
Hint: For p = 2. see Lions and Magenes(1968, M), Vol. 1.
For p > I, see Triebel (1978, M), (1983, M). The proof relies on inter-
polation theory.
M ore general results in terms of Sobolev spaces can be found in Problem
6.8.
In Triebel (1983, M), Section 4.3, one finds the following fundamental result
via interpolation theory:
Broad ClclSS(JS of elliptic equations "'ith fairly general boundar}' conditio1ls
repreellt isonJorphic mappings "'irh respecr to appropriate s;,..-spaces and
F;.q-spaces.
These two important classes of function spaces coincide, for special values
s, p, q, with Sobolev spaces or Holder spaces (ef. A 2( t 16b)).
A detailed study of these spaces can be found in Triebel (1983, M). There
it is also shown that the B;...-spaces and F;..-spaces comprehend the following
important special cases: Holder-Zygmund spaces, Sobolev-Slobodeckii
spaces. Besov spaces, Lebesgue spaces, Bessel potential spaces, and Hardy
spaces. I n some sense, one can say that the B;..-spaces and F;..-spaces are
related to elliptic equations in a natural manner.
References to the Literature
Classical papers: Poincare 1890), (1916, M) (collected works), Hilbert (1900a), (t 9 t 2,
M),(1932, M)(collected works), Weyl(191 1),(1940),Courant(1920), Friedrichs (1934),
(1944), (1953), Courant and Hilbert (1937, M), Sobolev (1937), (1938), (1950, M), Viik
(1949), ( 1951), (1952), Garding (1953), Browder (1954) (strongly elliptic systems), Lax
and Milgram (1954). Nirenberg (1955), (1959). Agmon, Douglis, and Nirenberg (1959)
(fundamental a priori estimates), Hormander (1955), (1964. M), (1983. M) (general
theory of linear partial differential operators), Lions and Magenes (1960), (1968, M)
(interpolation theory and linear elliptic equations).
Modern standard work on linear partial differential equations: Dautray and Lions
(1984, M), V ols. I - 3 (theory, applications, and numerical methods).
Handbook of numerical analysis: Ciarlet and Lions (1988, M), V ols. I ff (finite
element method, difference method, etc.).
Collection of problems in the equations of mathematical physics with solutions:
Vladimirov (1986a, M).
The partial differential equations of theoretical physics and their explicit solutions:
Morse and Feshbach (1953, M), Frank and v. Mises (1961, M), Babi (1967, M), Myint
and Debnath (1987, M).
Introduction to the theory of linear partial differential equations: Courant and
References to the Literature
401
Hilbert (19S3, M) (classical standard work), Tychonov and Samarskii (19S9, M),
Garabedian (1964, M), Babi (1967, M) (handbook without proofs), John (1982, M).
Introduction to the modern functional analytic approach to linear elliptic diITer-
ential equations: Friedman (1969. M) (especially recommended), Bers, John, and
Schechter (1964, M)(basic ideas), Agmon (196S, L), Fichera (1965, L), Netas (1967, M),
Wloka (1982, M), Ladyfenskaja (1985, M).
Elementary expositions: Velte (1976, M), Rektorys (1977, M), Showalter (1977, M).
Further important monographs: Morrey (1966), Maurin (1967), Simader (1972),
Triebel (1972), Ladyfenskaja and Uralceva (1973), Treves (197S), Schechter (1977),
Gilbarg and Trudinger (1983), Grisvard (1985) (nonsmooth domains).
Survey: Miranda (1970, M).
PseudodiITerential operators: Honnander (1983, M), V ols. 1-4 (standard work),
Taylor (1981, M), Treves (1982, M), Vols. 1,2, Rempel and Schulze (1982, M) (modern
index theory (or elliptic boundary value problems). Chazarain and Pirion (1982, M)
(introduction).
Interpolation theory and linear elliptic differential equations: Lions and Magenes
(1968, M), Vol. I (Lz-theory), Triebel (1978, M), (1983, M) (general L,-theory).
Ritz method: Cf. the References to the Literature for Chapter 18.
Difference method: Cf. the References to the Literature for Chapter 20.
Finite elements: Ciarlet (1977, M), Babulka and Szabo (1988, M), Ciarlet and Lions
(1988. M), Vol. I. Cf. also the References to the Literature for the Appendix.
Boundary element method and its applications in engineering: Feng Kang (1983, S)
(abstract approach), Bannerjee (1981, M), Crouch (1983, M), Brebbia (1984, M),
Wendland (1984, S), Tanaka (1986, M).
Multigrid methods: Hackbusch (198S, M).
Software system ELLPACK for solving elliptic diITerential equations: Rice and
Boisvert (t 984).
Software system QUADPACK for integration: Piessens (1983).
Software system PDE/PROTRAN for the finite element method: Sewell (1985).
Approximation methods for eigenvalues: Gould (1966, M) (introduction), Collatz
(1963, M) (many applications in engineering), Collatz (1964, M) (Newton method),
Michlin (1969, M)(stability of the Ritz method Krasnoselskii (1973. M) and Vainikko
(1977) (Galerkin method), Bramble and Osborn (1973) and Osborn (197S) (non-
symmetric compact operators), Kolomy (1978) (iterative methods).
Two-sided error estimates for eigenvalues: Velte (1984) (survey), Collatz (1963, M),
(1964, M) (method of Temple), Fichera (196S, L) (methods of Rayleigh-Ritz, Bazley-
Fox, and Weinstein-Aronszajn), Fichera (1978, M) (method of orthogonal invariants),
Knauer (1971), Weinstein and Stenger (1972, M), Weinberger (1974, M), Kuttler and
Sigilito (1987, M) (estimating eigenvalues with a posteriori/a priori inequalities).
Eigenvalues and spectra of molecules: Reed and Simon (1971. M), Vol. 4, Thirring
(1983, M), Vol. 3.
Eigenvalue criteria for minima in the calculus of variations: Klotzler (1971, M).
Eigenvalue distributions of compact operators: Konig (1985, M), Pietsch (1987, M).
PseudodiITerential operators and eigenvalue problems: Shubin (1986, M).
Random eigenvalue problems: v. Scheidt and Puckert (1983. M).
Homogenization: Bensoussan, Lions, Papanicolau (1978, M), Oleinik (1979. S),
Attouch (t 984, L), Ericksen (t 986, P).
G-convergence and weak variational conversence and their applications to differen-
tial equations: De Giorgi (1983, S), Attouch (1984, L) (linear and nonlinear problems).
Supercomputing: Munnan (198S, S), Lichnewsky and Saguez (1987, S). Martin
(1988. S).
CHAPTER 23
Hilbert Space Methods and Linear
Parabolic Differential Equations
While studying parabolic equations it is useful to introduce and make use of
several function spaces.
Olga Aleksandrovna Ladyzenskaja ( 1985)
23.1. Particularities in the Treatment of
Parabolic Equations
In order to obtain existence theorems for linear parabolic differential equ".
tions one has two different possibilities:
(i) the theory of semigroups.. and
(ii) the Galerkin method.
In Chapter 19 we considered (i). In this chapter we want to study (ii). Let
QT = G x ]0.. T[. In order to explain the basic problems and methods we
consider the classical initial- boundary value problem for the heat equation:
II, - u = .r
u=o
on QTt
on cG x [0, T],
( I )
u(x.O) = uo(x) on G.
Here, the functions .r and 110 are given and II is to be found.
The generalized prohle," corresponding to this is obtained formally by
multiplying the differential equation by t E C (G) and subsequent integration
402
23. J. Particularities in the Treatment of Parabolic Equations
403
by parts; therefore, for all v e Co(G),
d d f u(x, t)v(x)dx + f t D,u(x, t)D,v(x)dx
1 G G '-I
= t I(x, t)v(x)dx for all t e ]0, T[, (2)
u(x,O) = uo(x) for all x e G,
where x = (I"'" N) and D i = a/Ci. We set
V = W 2 1 (G), H = L 2 (G)
and refer to four particularities which arise in the functional analytic treat-
ment of parabolic equations.
(I) The first important particularity of (2) consists in that we deal with the
space variable x and the time variable 1 in differenl )\'ays.
(a) For fixed time t, we think of the function
x..... u(x, I)
of the space variable x as an element of the Sobolev space We briefly
denote the element of V that arises this way by u(t); therefore,
u(t) E JI:
(b) If we now vary the time t in the interval [0, TJ, then we obtain a function
1 U(I).
There thus arises from the real function (x, I)..... u(x, I) the function t t-+ U(I)
with values in the B-space
We now write equation (2) in the form
d
dt (u(I)1 V)H + a(u(t), l') = (!(I)ll')H on ]0, T[
for all v e V,
(3)
U(O) = Uo e H.
We are looking for the function 1 t-+ u(t) with U(I) e V for all 1 e [0, T]. Here,
for all w, t' e Jt: we set
a(w, v) = f t D,w(x)D,t'(x)dx,
JG I-I
(f(t) 1 V)H = t I(x, t)v(x) dx.
(II) Equation (3) shows the second particularity that the generalized problem
requires, in a natural way, the use of two spaces Hand JI: Here, H is obtained
404
23 Hilbert Space Methods and Linear Parabolic Differential Equations
in connection with the time derivative, and V results from the elliptic term
-l\u in (1) and the boundary condition u = ° on oG x [0, T]. In the present
case we have V c Hand V is dense in H. This leads us to the concept of the
elolllt;on triple
"Y c H c Y.'"
in Section 23.4.
(III) The third particularity consists in that we think of the time derivative
dldt in (3) as the generalized derit'ative on ]0, T[. For this reason, equation (3)
needs to be satisfied only for almost allt E ]0, T[.
(I V) The jourth particularity results from the choice of the space for the
solution t..-.. 14(1). For (3) we shall choose the Sobolev space
W 2 ' (0, T; V, H).
We give the precise definition of this space in Section 23.6. Here. we note only
the corresponding norm
1111 II = (L T IIIl(t)II:. dt Y'2 + (L T II 11'(011:. dt) 1'2 .
where 14' is the generalized derivative of u. The definition of u' is so made that:
, d
(II (1),I')J' = di (u(t)IV)H
for all t' e V.
In order to obtain an operator equation, we introduce the operator
A: V -. V. and the functional b(t) E V. by means of
(An', v>, = a(n', t'),
(b(I), l'>' = Cf(t)lr)H
for all lO E v:
Then from the basic equation (3) there arises
(u'(t) + AlI(t) - b(t), r)v = 0 on ]0, T[
and hence we get the desired operator equation:
for all l' e V
u'(t) + AU(I) = b(l)
for almost all t E ]0, T[.
(4)
u(O) = Uo E H.
Here, we are looking for the function
u E W 2 ' (0. T; Jt: H).
This is an abstract first-order differential equation that we call a first-order
evolution equation. The point is that
u(t) E V
and
u'(t) e V.,
i.e., because V c V., the time derivative u'(t) lies in a more comprehensive
23.1. Particularities in the Treatment or Parabolic Equations
405
space than u(t}. The existence theorems concerning abstract differential equa-
tions from Chapter 3 can therefore not be applied. To solve equation (4), we
will use the Galerkin method in Section 23.7.
In Section 23.4 we shall elucidate in what sense one is to understand
V c H c V., Analogously to the regularity theorems in Sobolev spaces of
real functions there results in Section 23,6 the continuity of the embedding
wi (0, T; V, H) c C( [0, T], H).
That is, u in equation (4) is continuous as the function u: [0, T] H. It is only
then that the initial condition ufO) = U o e H is meaningful.
We emphasize again that this approach works in the t\\'O spaces V and H.
For the investigation of (I) there is still another possibility which works only
in one space, namely, in H = L 2 (G). To this end, we write the original problem
( I) in the form
u'(t) + AFu(t) = f(t)
u(O} = uo'
The main difference is the following:
(Gr) We set
for all t e ]0, T[,
(4.)
Bu = -u,
D(B) = C:(G).
Then the operator A F : D(A F ) H -+ H in (4.) is the Friedrichs extension
of B. This operator is unbounded.
(fJ) In contrast to A F . the operator A: V -+ V. in (4) is continuous. More pre-
cisely, A is the so-called energetic extension of B, i.e., the space V = W 2 1 (G)
is the energetic space of B and A is an extension of A F . We have that
D(B) S; D(A F ) S; V c H, B C A F C A.
where D(B) = C:(G), V = W 2 1 (G), and H = L 2 (G).
Equation (4.) was used in Chapter 19 in the context of semigroups, The
difficulties which arise in (4) and (4.), in different ways, have their origin in
the fact that in equation (2) the first integral originating from u, contains no
space derivatives, and the second integral originating from the elliptic part
- L\ does contain space derivatives.
In equation (4), we need
bet) e V. for almost all t e ]0, T[,
whereas, in equation (4.), we need the stronger condition
f(t) e H for all t e [0, T].
Note that H V., Since equations (4) and (4.) correspond to the Galerkin
method and the method of semi groups, respectively, we obtain the following:
The Galerkin method allows us more general right-hand sides of evolution
equations than the method of semigroups.
406
23 Hilbert Space Methods and Linear Parabolic Differential Equations
In Chapter 3 we have considered, parallel to the classical calculus, the
Riemann integral and the derivative of functions t........ u(t) of the real variable
t with values in a B-space. For the investigation of the evolution equation (4).
we need the following more general concepts for functions of this sort, namely:
(a) measurabilitv.
(b) the Lebesgue integral and Lebesgue spaces,
(c) generalized derivatives and Sobolev spaces.
In Appendix A 2 ( 1)'1 measures, measurability, and the Lebesgue integral are
introduced in a unified way for real functions and functions with values in
B-spaces. The reader who is not familiar with these concepts should look at
A 2( I) through A 2( 13) before studying this chapter.
We provide the remaining tools (c) in the following sections. In principle,
all the proofs corresponding to (c) proceed parallel to the classical proofs for
real functions. We attach importance to presenting results in such a form, and
to clarifying them by means of examples so that later treatment of linear and
nonlinear evolution equations and evolution inequalities will be as trans-
parent as possible for the reader. In particular, we emphasize the clear working
out of a number of space identifications in the context of evolution triples
../ c: H c: II.:' In the proofs, e.g., in Example 23.4, one must always pay
careful attention to the fact that the Lebesgue integral is only defined for
measurable functions. For that reason, one must also always verify the IJJell-
sllrahility of the functions under consideration.
The main result of this chapter is Theorem 23.A in Section 23.7. For linear
evolution equations, this theorem contains:
existence and uniqueness of the solution.
convergence of the Galerkin method. and
continuous dependence of the solution on the data.
Generalization to nonlinear problems will be considered
in Chapter 30 (the Galcrkin method), and
in Chapter 31 (semigroups).
23.2. The Lebesgue Space Lp(O, T; X) of
Vector- Valued Functions
Definition 23.1. Let X be a B-space and 0 < T < x.
(a) The space C m ( [0. T], X) with In = 0, 1, ... consists of all continuous
functions I': [0. T] ..... X that have continuous derivatives up to order !'I
on [0. T] with the norm
m
def , ("
lIull = L max Ill1t(t)l.
i=OO,T
(5)
23.2. The Lebesgue Space L,(O. T: X) of Vector-Valued Functions
407
Here, only the right-hand and the left-hand derivatives need exist at the
boundary points t = 0 and t = T, respectively. In (5), ufO) means u.
We write C([O, T], X) instead of CO([O, T], X).
(b) The space L,,(O, T; X) with I S P < 00 consists of all measurable functions
u: ]0, T[ -+ X for which
( rT ) 1/"
lIuli p del Jo lIu(t)lI dt < 00
(6)
holds.
The space L(O, 1: X) will be introduced in Problem 23.12.
Proposition 23.2 (Properties of Lebesgue Spaces). Let m = 0, I, ... and
1 S P < 00. Let X and Y be B-spaces over K. Then:
(a) C"'([O, T], X) with the norm (5) is a B-space over K.
(b) L,(O, T; X) with the norm (6) is a B-space over K in the case ",here one
identifies functions that are equal almost et'erywhere on ]0, T[.
Moreot'er, the set of all step functions u: [0, 11 -+ X is dense in L,(O, T; X)
(cf. A 2 (4)).
(c) C([O, T], X) is dense in L,,(O, T; X), and the embedding
C([O, T], X) s; L,(O, T; X)
is continuous.
(d) The set of all polynomials w: [0, T] -+ X, i.e.,
K'(t) = ao + at t + ... + aptt"
,,'ith a, e X for all i and n = 0, I. ... is dense in C([O, T], X) and L,(O, T; X).
(e) If X ;s an H-space with scalar product (.,. )x, then L 2 (0, T; X) ;s also an
H-space with the scalar prodllct
(ulv) = foT (u(t)lv(t)>x dt.
(f) L,(O, T: X) ;s separable in the case where X is separable and I S P < 00.
(g) L,,(O. T: X) is uniformly convex in the case where X is uniformly convex and
J < p < 00.
(h) If the embedding X S; Yis continuous, then the embedding
Lr(O, T; X) S; L.(O, T; Y),
;s also continuous.
t S q S r S 00,
The proof will be given in Problem 23.2.
EXAMPLE 23.3. Let X be a B-space. If the function
u: ]0, T[ -+ X
408
23. Hilbert Space Methods and linear Parabolic Differential Equations
is continuous almost everywhere and bounded, i.e.,
sup lIu(t)1I < oc,
O<.<T
then U E Lp(O, T; X) for all 1 S P < x.
PR(X)F.
(I) Let p < x. By A 2 (8), the function t.-. u(t) is measurable. The measurabil-
ity of t flu(t)fI results from A 2 (9). Due to the assumption that this
function is bounded, condition (6) is valid.
(II) In the case where p = 00 use the definition of L%J(O, T; X) in Problem
23.12. 0
The following example is important for the functional analytic interpreta-
tion of the right side f of parabolic differential equations. Our goal is the next
two formulas:
<h(t), 1'»)' = f f(x, t)l'(X) dx
G
(hdl).r)v = L f(x.r)v(x)dx
For this purpose we make the following assumptions:
(H) Let G be a bounded region in R N , N > I. For 0 < T < 00, let QT =
G x ]0, T[, i.e.,
for all (' E Y,
(7a)
for all v E V.
(1b)
QT = {(x, t): x E G, t E ]0. T[}
(Fig. 23.1). Moreover, we set
Y = L,(G), V = W,"'(G)
with 1 < p < ,x, p -1 + q -1 = I, m = O. I, . . . .
I 1
Qr
..
Figure 23.1
23.2. The Lebesgue Space L,(O. T; X) of Vector- Valued Functions
409
EXAMPLE 23.4. Under the assumption (H), suppose the real function (x, t) t-+
I(x, t) belongs to L.(QT). Then the following hold:
(a) For all I E [0, T], there exists a functional bet) E y* such that (7a) is valid
for almost all I E [0, T] and I t-+ b(l) belongs to L,(O, T; Y.).
To be precise, for almost all I E ]0, T[, the function
x t-+ I(x, I)
belongs to the space L,(G), and this function can be identified with a
functional b(l) E Y. such that we have (7a) and
IIb(t)IIJ. = t I/(x, t)'f dx, (8a)
II b lll.(o. T:r., = f: (t I/(x, t)l f dx )
= r I/(x, t)!' dx dt. (8b)
JOr
(b) For all I E [0, T], there exists a functional b l (I) E V* such that (7b) is valid
for almost all I E [0, T] and I t-+ b l (I) belongs to L,(O, T; V*).
Here, the linear continuous functional b l (I): V -.. R is the restriction of
the linear continuous functional bet): Y -.. R to the space V.
Moreover, we get
IIbIl1.co. T:Y.' S r I/(x, t)l f dx dt.
JOr
Remark 23-S. Because of this result one frequently says briefly that every
function
Ie L.(QT)
belongs to the spaces L,(O, T; Y*) and L,(O, T; V*).
By Az(34), we may identify y* = Lp(G)* with L.(G). Pictorially, h(t o ) cor-
responds to the function I = I(x, I) on the cross section of the cylinder QT
with the plane 1 = '0 (Fig. 23.1).
PROOF. Ad(a). By assumption, f, as a function of (x, t), is an element of L,(Qr).
Together with the Fubini theorem Az(23) this means that
for I/(x, t)lf dx dt = f: (t I/(x, t)'f dx ) dt. (9)
Here, the inner integral on the right-hand side exists for all I E [0, T] - N
where N is a set with meas N = O. For lEN, we set
I(x,t) = 0
for all x e G.
By A 2 (34), the functions from L,(G) can be identified with the elements of Y*.
410
23 Hilbert Space Methods and Linear Parabolic Differential Equations
Thereby, the element b(t) E y* corresponding to x.--. f(x, t) is given precisely
by relation (7a).
Moreover, (8a) holds. Then the relation (8b) follows immediately from (8a)
and (9).
It remains to show the measurability of the function
h: [0, T] -+ Y*.
By the crucial Petlis Iheore,n A 2 ( 10) and the renexivity of the B-space Y, for
this it suffices to show the measurability of the real function
t ....... (h( I), l' > r on [0, T],
for each l' E ¥. Recall that Y = L,(G). Let I' E Y. By the Holder inequality, it
follows from (' E L,(QT) and f E Lq(Qr) that I,.r E LI (Qr). The Fubini theorem
yields the relation
t, rx)/x. t) dx dt = t T (L t'(x)f(x. t) dX) dt
and the measurability of the right-hand inner integral as a function of t. By
(7a), that is exactly the measurability of t (b(t), l')r.
Ad(b). For almost aliI E [0, T], the Holder inequality,
L Ih.t)l'(xldx (L ,/(x.O,qdxY''(L 1t'(XWdX)"P
< (L I/(x. tW dx y,q IIt'II. for all I' E V.
implies the existence of a functional hi (I) E V. which satisfies (7b). Moreover,
we get
II hI (1)11 t. < f I/(x, 1)14 dx
G
and hence hI E L 4 (O, T; J;'.), since
IT IIhd t )1I1..dt < LT (f, 1.f(x.t)lqdX)dt < x.
The existence of the left-hand integral follows from the majorant criterion
A 2 ( 17). In this connection, note that the measurability of t 1-+ hi (t) follows as
the measurability of 11-+ h(t) above. 0
23.3. The Dual Space to Lp(O, T; V)
We lirst justify the Holder inequalily that is basic for many applications:
f T (f T ) 1/4 (i T ) 1/,
o I(L'(t).U(t),"ldt < 0 IIt'(t)lIt.dt 0 lIu(t)lIf.dt .
(10)
2J.J. The Dual Space to L,,(O, T; V)
41 I
Proposition 23.6. Let V be a B-space. Then the Holder inequalit}, ( 10) holds for
all
u E Lp(O, T; V), v E Lq(O, T; V.)
,,'ith I < P < x, p-l + q-I = I. In particular, all the integrals in (10) do exist.
PROOF. Let J = ]0. T[. Since the functions u and I' are measurable on J, there
exist step functions "II: J -. V and VII: J -. V. such that
I',,(t) utI) and I',,(t) t'(t) as n -+ oc,
for almost aliI E J. This implies
«(".(t), u,,(t».. -. (l'(t), U(I»v as n -. XY,
for almost all t E J. Thus the real function I....... (t'(t). u(t»v is measurable on
J. Beca use of
1< toft), u(t»v I s II t'(t)1I J,.II u(t)1I J'
the assertion follows now from the classical Holder inequality in Section
18.6. 0
We now justify the important duality relalion:
I I
L,(O. T; V). = Lq(O, T; V.), - + -- = I.
p q
We set X = Lp(O. T; V). Then we have to show that Xt/J = Lq(O. T; V*). For
this. we consider the two key formulas:
(F, u> x = t T (t'(t), u(t) >.. dt for all u EX, (II)
IIflfx. = IIvIIL,(O. T:V..' (J 2)
Proposition 23.7. Let V be a reflexive and separable B-space and leI 1 < P < OC'.
p-l + q-I = I. Then the following hold:
(at To each function t' E Lq(O, T; V.) there corresponds a unique functional
VEX. with (II).
(b) Conversely, to each v E X* there corresponds exaclly one t' E Lq(O, T; V.)
\\'ith (1 1 ). Moreover, (12) holds.
(c) The B-space Lp(O, T; V) is reflexive and separable.
PROOF. Assertion (a) follows from the Holder inequality (t 0).
Assertion (b) is a standard result of the theory of function spaces. Proofs
may be found in Edwards (1965, M), 8.20.5, and in Gajewski, Groger, and
Zacharias (1974, M), Chapter 4.
The reflexivity of Y = L,,(O, T; V) in (c) is a direct consequence of (b). The
separability of Y follows from Proposition 23.2(f). 0
412
23. Hilbert Space Methods and linear Parabolic Differential Equations
Convention 23.8 (First Identification). Let X = L,,(O, T; V). By Proposition
23.7. the two real 8-spaces
X* = Lp(O, T; V)*
and
Lq(O, T; V *)
are normisomorphic, i.e., there exists a linear bijective isometric mapping j:
Lq(O, T; V*) -+ X* given by j(l') = v. Normisomorphic 8-spaces can be iden-
tified with each other. Therefore, we may identify X* with Lq(O, T; V*). Hence-
forth, we write
X. = Lq(O, T; V*).
The same way. we identify i1 = j(v) with l'. Therefore, in future, we write
equations (I t) and (12) in the following form:
(I" u) X = SoT (v(c), u(c».. dc,
( fT ) l/q
II vII x. = J 0 II t'(c)II. dc
( 13)
( 14)
for all II E X and {' E X*.
The following results will be used later on in an essential way in order to
prove the convergence of the Galerkin method for linear and nonlinear
evolution equations of first and second order.
Proposition 23.9 (Limit Relations for Integrals). Let V be a reflexive and
separable 8-space. Furthermore, let t < p, q < 00, P -I + q -I = 1, and 0 < t <
T < oc. The,t ""e have the following statements:
(a) If u E L,,(O, T; V), then
("..E U(S)dS).. = E (v.,u(s»..ds
for all t* E V*.
(b) If II E L,,(O. T; V*). then
(f II(S)ds, I).. = f (u(s), (,).. ds
(c) Front u" -+ u in L,,(O, T; V) as n 00 it follows that
for all l' E V.
E un(s)ds -+ E u(s)ds in V
as n 00.
(d) From
II" -+ u in L,,(O, T; V) as n -+ 00,
l',. l' in L.(O, T; V*) as n -+ oc,
23.3. The Dual Space to L,(O. T: V) 413
if follows that
f (v,,(s), U,,(s» " ds - t (v(s), u(s»" ds as n - 00.
(e) From
U" u in L,(O, T; V) as n --+ 00,
v" --+ V in L,(O, T; V.) as n --+ 00,
it follows that
t (v,,(s), u"(s»,, ds - f (v(s), u(s»" ds.
Assertions (a), (b), (c) are also valid for arbitrary B-spaces
Further results concerning the weak. convergence in LCX)(O, T; V*) can be
found in Problem 23.12.
PROOF.
(I) First we set t = T. Moreover, let X = L,(O, T; V). Then we have that
X. = Lq(O, T; V.).
Ad(a). Let U E X. This implies U E L 1 (0, T; V) by Proposition
23.2(h). According to the definition of the integral in A 2 (14), there exists
a sequence (u,,) of step functions such that, as n --+ 00, we have that
u,,(s) --+ u(s) in V almost everywhere on ]0, T[,
f: u,,(s)ds - faT u(s)ds in V.
Moreover, we get
f: lIu,,(s) - u",(s)1I ds < I:
for all n, m no(t). Let v. E V.. By definition, the integral of a step
function is a sum. Hence we obtain that
(v*, f: u,,(s)ds) = f: (v.,u,,(s»ds.
Now set /,.(5) =: (v., u,,(s» and f(s) = (v., u(s». Then (I,,) is a sequence
of real step functions with
/,.(s) --+ I(s)
as n --+ 00
for almost all s E ]0, T[.
Furthermore, we get
f: 1/,,(s) - 1",(5)1 ds sf: II v*1111 U,,(s) - u",(s) II ds < I:
414
23. Hilbert Space Methods and Linear Parabolic Differential Equations
for all II, nl > ,J 1 (r.). Consequently, the integral J /(s) ds exists. Now a
passage to the limit as n -+ 00 yields assertion (a).
Analogously, one obtains (b).
Ad(c). We set
A(u) = IT u(s)ds
The Holder inequality together with the I-trick yields
for all U EX.
I T (I T ) l'P
;IA(u)1I < 0 lIu(s)1I ds < c 0 lIu(s)II P ds ·
Hence the operator A: X V is linear and continuous. This implies (c)
for t = T.
Ad(d). From UII ...... II in X and I'll (' in X. as n ...... x it follows that
< t'II' Un > -+ < 1', U> X .
Note that such a relation holds for each reflexive B-space X. Hence
assertion (d) follows from
(t" u) x = IT «('(s), u(s) ds.
Analogously, one obtains assertion (e).
(II) Now let us consider the more general case 0 < t < T. The decisive trick
is to use the characteristic function
x(s) = g
for 0 S s t,
otherwise,
and to replace U and v with XU and X'" respectively. This way, we reduce
this general case (I I) to the more special case (I). Note that multiplication
by 1.. does not effect strong and weak convergence in X or X*. To see this
consider, for example, a sequence CUll) with
UIIU in X as n-.oo.
We want to show that
X1411 XU in X
as n -+ OCt
In fact. it follows from UII U in X as n ...... 00 that
(t', u,,) x = IT (v(s), U,,(s) ds -+ (v. u) x as n -+ 0:.
for all r E X*, where X* = Lq(O, T; V*). Since
(' E Lq(O. T; V*) implies Xl' E L.(O, T; V*),
23.3. The Dual Space 10 L,(O. T; V)
415
we obtain that, as n --. 00,
foT (v(s), x(s)u,,(S» ds = (xv, u,,>X - (Xt'. u)x = foT (v(s), x(s)u(s) ds.
Hence xu.. XU in X as n -+ 00.
o
In order to obtain a simple proof for the uniqueness of generalized deriva-
tives in Section 23.5, we now generalize the crucial variational lemma from
Section 18.2.
Proposition 23.10 (Variational Lemma). Let X be a B-space. Then, it follows
from u e L, (0, T: X) and
f: q>(t)u(t)dt = 0
for all cp e Co (0, T)
tlrat 14 = 0 in L 1(0, T; X), i.e.,
u(t) = 0
for almost all t e ]0, T[.
PROOF.
(I) First we consider the special case X = V., where V is a separable B-space.
For each ve V we have that
S: q>(s) (u(s), v) ds = 0
by Proposition 23.9(b). The classical variational lemma (Proposition 18.2)
implies that
for all cp e CO- (0, T),
(u(s), v) = 0
for all s e ]0, T[ - Z(I-'),
where Z(v) is a set of measure zero. Let {Vi} be a countable set which
is dense in Then the set Z = U, Z(v,) also has the measure zero and
we get
(u(s), t') = °
for all S E ]0, T[ - Z,
and all l' e Hence we obtain that
u(s) = 0
for all S E ]0, T[ - Z.
This is the assertion.
(II) The special case (I) is sufficient for the proof of Proposition 23.20 below.
The proof of Proposition 23.10 in the general case will be given in Problem
23.6. 0
416
23. Hilbert Space Methods and Linear Parabolic Differential Equations
23.4 Evolution Triples
Definition 23.11. We understand an evolution triple
"V s; H s; V."
( 15)
to be the following:
(i) V is a real, separable, and reflexive B-space.
(ii) H is a real, separable H-space.
(iii) The embedding V H is continuous, i.e.,
"t'li H S; const II v II v
and V is dense in H.
for all t' E V,
( 16)
With evolution triples we describe the fact that two spaces V and H appear
in evolution equations. We shall later apply the following example to concrete
parabolic and hyperbolic equations.
EXAMPLE 23.12. Let G be a bounded region of R N with N > I. We set
V = W,"'(G), H = L 2 (G),
with 2 p < and In I. Then U V s; H c: V." is an evolution triple.
We will now justify that in a suitable sense the inclusion
H c V.
is valid. The point of departure is the formula
(h, (')v = (hlv)H
for all v E V.
( 17a)
Proposition 23.13. Let" V H c V." be an evolution triple. Then the following
hold:
(a) To each h E H there corresponds, via (17a), a linear continuous functional
- -
h: V -+ R, i.e., h E V..
(b) The mapping h Ii from H into V. is linear, injective, and continuous.
PROOF. Let h E H. By (t 6),
l(hlv)HI S IIhllHllvlI H S constllhllHllvllv
holds for all (' E V. Therefore, there exists an Ii e V. with (17a) and
II h II v. S; const II h II H'
( 18a)
The mapping h It is obviously linear; it is continuous from H to V... by
(18a). However, it is also injective for it follows from Ii = 0 that
(hlv)n = 0 for all v e V.
The space V is dense in H and therefore h = O.
o
23.S. Generalized Derivatives
417
Convention 23.14 (Second Identification). By Proposition 23.13, we may iden-
tify h with h. In this sense,
H s; V*.
Henceforth, we shall write h instead of h . Then the following are valid:
(h, v)y = (hlv)H for all h e H, v e V, (17)
IIhll y . s constnhll H for all he H. (18)
In the following, the relation
V c H c V*,
for evolution triples is to be understood in the sense of Convention 23.14.
Observe that the embeddings V s; Hand H s; V* are continuous. In parti-
cular, we get
(v, w)y = (w, v)y for all v, w e
This follows from V s; Hand (11), since (wIV)H = (Vlw)H.
Formulas (17) and (18) will be used frequently.
Using Problem 18.6, Convention 23.14 can briefly be summarized as
follows:
(a) It follows from V c Hand from the density of V in H that H* V*.
(b) We identify H with H*, i.e., we have H* = H, and hence H s; V*.
(c) Since V is reflexive, the space H* is dense in V*, and hence H is dense in V*.
23.5. Generalized Derivatives
The following definition is basic for understanding evolution equations. We
proceed from the classical formula
foT (i"'(t)u(t) dt = ( - 1 r foT q>(t)UCII'(t) dt for all q> E C:(O, T) (19)
and choose as a starting point (or the definition of the nth generalized
derivative
w = .,(...
the formula
foT q>cII'(t)u(t)dt = (-Ir f: q>(t)w(t)dt
for all qJ e C:(O, T). (20)
Definition 23.15. Let ., e L 1 (0, T; Y) and w e L. (0, T; Z). Then, the function w
is called the nth generalized derivative of the function., on ]0, T[ iff (20) is
valid. We write w = .,.....
418
2.l Hilbert Space Methods and Linear Parabolic DitTerenlial Equalions
We call the readefs attention to the fact that the values of the functions
ll: ]0, T[ Y
and
\": ]0, T[ -+ Z
may lie in different spaces Y and Z. Then (20) means that the integrals
appearing in the right and left members belong to Y (\ Z. Since
Ilcp.II)(t)u(t)1I < constllu(t)lI,
it follo\\'s from II ELI (0. T: Y) and from the majorant criterion A 2 ( 17) that the
integral appearing in the left member of (20) exists for all qJ E CO' (0, T).
Analogously. one obtains the existence of the integral appearing in the right
member of (20).
EXAMPI.E 23.16. In order to consider the simplest situation with }" = Z. let
U E C"( [0, T], Y).
n > I.
Then the continuous IJth derivative u(nl: [0, T] --+ Y is also, at the same time.
the generalized nth derivative of u on ]0, 7"[.
PR()()F. Let rEel ([0. T]: Y) and cp E C (0, T). From (qH)' = CP'f + </JI" we
obtain the classical integration by parts formula
LT q'l'dt = - L1 qn"dt.
Repeated application of this formula yields (19).
o
Remark 23.17 (Generalized Derivatives and Distributions). We explain the
connection of Definition 23.15 with the general distribution theory which we
summarize in A 2 (62). Let U E L,(O, T; Y). Then one can assign to the function
u a distrihut;o'l U via
U(cp) = LT q>(t)u(t)dt for all q> e C;(O. n.
For each II. this distribution has an nth distributive derivative u(n) which
is defined by
U(")(cp) = (-I)"U(cp.")) for all tp E C;(O. T). (2Ia)
If (20) holds, then U. II . can be obviously represented in the form
Ulnl(q» = SoT q>(t)u.nl(t) dt for all q> e Co" (0. n. (21 b)
The advantage of the distribution conception consists in that each function
U E LI(O, T: Y)
possesses derivatives of elery order in the distributional sense. Definition 23.15
23.5. Generalized Derivatives
419
singles out the cases in which by (21 b) the nth distributional derivative of u
can be represented by a function w eLl (0, T; Z). In this case, we set u ell ) = '"
and we write brieOy
u eLl (0, T; Y),
U CII ) eLl (0. T; Z).
Proposition 23.18 (Uniqueness of Generalized Derivatives). Let X and Z be
B-spaces. Moreot'er, suppose that u E L 1 (0, T; Y) and t" ", e L, (0, T; Z). If
U(II. = I' and u ell . = K'
in the sense of generalized derilatives, then we obtain
t'(t) = ",(t)
a/most er'er)',,'here on ]0. T[,
i.e.. "'e hat'e I' = '" in L I (0, T; Z).
PROOF. The assertion follows from
f: q)(l' - w)dr = 0
by Proposition 23.10.
for all cp e C (0. T).
o
The following proposition shows that generalized derivatives are com-
patible with weak limits. This fact is important for proving the convergence
of the Galerkin method for evolution equations.
Proposition 23.19 (Generalized Derivatives and Weak Convergence). Let Y
and Z be B-spaces otter I( = R, C K'ith the continuous embedding Y S; Z. Then
it fo//o''s from
uft) = t', on ]0, T[
for all k and fixed n 1
and
u, -'-0 U in L,(O. T; Y)
ti t' in L.,(O, T; Z)
as k 'X) ,
as k -+ OC.
1 S p, q < x"
that
U Cft ) = t' on ]0, T[.
PROOF. Let A and B be B-spaces over 1(. If the embedding A c: B is continuous.
then, as k 00,
a,,a in A
implies
a,a in B.
This implies that, as k 'X).
u" u
and
v,v in L.(O. T;Z),
420
2.1. Hilbert Space McrhHd5 and I.;nc:ar ')anaholi, DifercnliaJ qualit)ns
by Proposition 23.2(h). Let qJ E Co'(O, T). from
i 1. f l'
o tpClti U . tit = ( - 1}1t J (I qJV.. dl
and k -+ ry_, we obtain that
fT tpcltludl = ( _ l' fT tpl1dt.
In Jo
by Proposition 23.9.
o
We now consider, with a view to evolution cquati()ns, the situation in
evolution triples.
.
Proposition 1.20 (Existence of .,(111). Lt', "V c H V... he tilt (11,()lu,;on 'r;pl('
(".d Ie' 1 p, q S; c'X>, 0 < T < 'X..i. rll(,PJ lhe follow;n" are 1'tJliJ.
(a) Uniquene5s. Ft)r u e LI'(O, T; V), "'e generalized J("ril'"'il't u ell ) i ""iqll(' as
an ("enlt'n, tr L.,(O, T: V.), II.at ;,.., , ...... I'."(t) can he modified ",r/,V OIl Cl suhset
tJj' ]0, 1[ of mttasurt zert'.
(b) Existence. utI' e Lp(O, T: V Tht'" Iller.! exlst. ,he yenerldizt,d der;r7("il'
I/ft) € L.(O, T; V.)
iff ,'rere is a Jun('Iion W E L.(O, T; V.) .C,. that
f: (u(/)lv)lItp'It'(I)JI . f -Ir L T (w(t),tJ)...tp(/)JI (22)
It), aliI) e JI and all tp € C:(O, 1").
Tilt'" 11("» :: U' and
d"
JI'; (lI(t)1 ")" = (U'ltl(/), l' >...
ht)'J... for all v E V and alnlf},,., cJlJ I e JOt 1"[. Here, J"/dt ll m('lln. the nlh
gt'n('ra!ized derir'util'e of real /11"(.' ions on JOt T[.
,
(23)
By the definition or gcner.dized derivatives or real functions, (22) and (23)
with w = u.") arc equivalent.
We shall later frequently use formulu (22) in connection with parabolic and
hyperbolic differential equations. Their advttntage consists in that one has to
investigate only the generalized dcriv..tives of real functions. Observe thut
Lp(O, 7.; V) £ l t (0. T; V)
and
L.(O, T: V.) s L, (0. .f; V.)
(or I < p.. q < 00 a!\ well a V s; V..
23.5. Generalized Derivatives 421
PROOF. Ad(a). Use Proposition 23.18.
Ad(b). Formula (20) is equivalent to
(J: fPcII)(I)II(I)dl, t)" = (-Ir (J: fP(I)W(I)dl, v)"'
for all v e V. Therefore, by Proposition 23.9, we get
J: (fPcII)(I)II(I),t')"dl = (-1)" J: (fP(I)W(I), v)"dl,
for all v E It: Now, (22) follows from (17). 0
EXAMPLE 23.21. Assume:
(i) "V s; H s; V." is an evolution triple and 0 < T < 00;
(ii) the function u: [0. T] ..... V is continuous and the derivative
, ) . u(t + h) - u(t)
u (I) = 1m h
""'0
exists for all t E [0, T] as a limiting value in the space H; and
(iii) the function u': [0, T] ..... H is continuous.
Then, u' is the generalized derivative of u on ]0, T[. Furthermore,
u E L,(O, T; V) and u' e L.(O, T; V*) for all p, q: 1 S p, q S 00.
The peculiarity here consists in that the values of u lie in the space V,
nonetheless the limit value u'(t) exists only in the space H.
PROOF. The mapping u: [0, T] ..... H is continuous by (ii) and (16); therefore
u Eel ([0, T); H). The assertion then follows from Example 23.16. 0
Corollary 13.21. Let H be a real H-space, and let u: [0, T] ..... H be Lipschitz
continuous, that is
lIu(t) - u(s)1I Lit - sl for all t, s E [0, T] and fixed L o.
Then the following holds:
(a) For almost all t E [0, T], the function u has a derivative,
' ( ) I . u(t + h) - u(t)
u t = 1m h
""'0
and
11(1) = 11(0) + J 1I'(s)ds
for all t E [0. T].
422
21 Hilbert Space Methods and linear Parabolic Differential Equations
(b) For all'10st all t E [0. T].
II u ' ( t) II L,
and u' is tire generalized der;vat ;t'e of u on ]0, T[.
PR()()F. Ad(a). Compare Problems 23.4 and 23.5c.
Ad(b). It follows immediately from the Lipschitz condition that lIu'(t)fI L
I ntegration of
(<pur = <p'u + <pu'
yields
t T Ip' u dt = - t T Ipu' dt
for all cP E Ct (0, T).
o
23.6. The Sobolev Space W p 1 (0, T; V, H)
The following proposition, in particular the integration by parts formula (25)
below. will playa central role in the treatment of evolution equations. Let
X = LI'(O. T: /). The basic definition is given by
I(O. T: V. H) = .:" E X: u' EX.}, 1 < p < x.
Recall that X. = Lq(O. T: V.), where p-I + q-I = 1.
Proposition 13.13. Let "V c H c V." be an et'olution triple, and let 1 < p < .x.
p -I + q-I = 1.0 < T < Y..;. Then the following Ilold:
(i) The space Y,! (0, T: /, H). Tile set 0.( allu E L,(O. T; V) that Itat'e general-
;:ed der;ratit'es
u' E Lq(O, T; V.)
.1'0'1115 a real B-space ,,'it II tile nor",
II u II M' = IIllll L reo. 1'. .') + II u ' II L CI (0. 1 ; v.) .
We denote tllis B-space hy W,I (0, T; V. H).
(ii) Regularization. Tile embedding
Wi (0. T; v, H) c C( [0, T], H)
;s continuous. More precisely, ,,'e hat'e the follo"'ing. (u e Wi (0. T; v, H).
,lien there exists a lltl;quely determined continuous .function u I: [0, T] --+ H
\,'hich coincides almost et'ery"'here on [0. T] with the initial jalnction II.
Henceforth. \\'e \\'rite 14 instead of u,. Further. in thi... sense,
max lIu(t)II H < constllullw"
O'ST "
(24)
23.7. Main Theorem on Fint-Order Linear Evolution Equations
423
(Hi) Dense subset. The set of all polynomials w: [0, T] -. It: that is
wet) = L t'a, with a, e Y for all i,
I
is dense in the spaces Wpl(O. T; V. H), Lf(O. T; V), and Lp(O. T; H).
(iv) Integration by parts. For all u, v E W, (0, T; It: H) and arbitrary t, s, 0 S
s S t S T, the following generalized integration by parts formula holds:
(u(t)IV(t))H - (u(s)IV(S»H = f (u'(t). v(t».. + (v'(t). u(t».. dt. (25)
Here, the values u(t), vet), u(s), v(s) are the t'alues of the continuous functions
u, v: [0, T] -. H in the sense of (ii).
The proof will be given in Problem 23.10. Observe the following important
fact. In applications the space V contains smoother functions than the space
H. For example, we will have V = W:l(G) and H == L 2 (G). In Proposition
23.23(ii) above, the functions from W,I (0, T; H) are not continuous with
respect to the space It: but only with respect to the worse space H.
23.7. Main Theorem on First-Order Linear
Evolution Equations and
the Galerkin Method
We study the initial value problem
d
dt (u(t)IV)H + a(u(t), v) = (b(t). v)...
(26a)
u(O) = U o E H, (26b)
u E W 2 1 (0, T; V, H), (26c)
where we assume that (26a) is valid for all v E V and almost all t E ]0, T[.
To be precise, we assume that there exists a subset Z of ]0, T[ of measure
zero such that equation (26a) holds for all v e V and all t e ]0, T[ - Z. Note
that Z is independent of v. Furthermore, dldt in (26a) means the generalized
derivative of real functions on ]0, T[, i.e., (26a) means that
- f: (u(t)1 V)Hq>'(t) dt + SoT a(u(t). v)q>(t)dt
= SoT (b(t), v)..q>(t)dt for all q> e C(O, T).
Let u e W 2 1 (0, T: V, H). The initial condition (26b) is meaningful because, by
424
23. Hilbert Space Methods and linear Parabolic Differential Equations
Proposition 23.23. after a modification on a subset of [0, T] of measure zero,
II is a uniquely determined continuous mapping of the form u: [0, T] --. H.
Condition (26b) is to be understood in this sense.
We make the following assumptions:
(H I) "V c H c '.'. is an evolution triple with dim V = 00, 0 < T < 00. The
spaces V and H are real H-spaces.
(H2) The mapping a: V x V --. R is bilinear, bounded, and strongly positive.
Moreover. we are given U o e Hand b e L 2 (O, T; V*).
(H3) {"'I' "'2". . } is a basis in V, and (u"o) is a sequence from H with
l'nO -+ Uo in H
as n...... ':XJ,
where
u"o e span { "'J . . . . , "'" }
for all n.
In order to formulate the Galerkin method, we set
"
u,,(t) = L c,,,(t)w,,
.=1
It
U"O = L ,,, "',.
.1
By definition. for almost all t E ]0, T[, the Galerkin equations read as follows:
"
L fn(t)("',lwj)H + c,,,(t)a("',, Wj) = (b(t), \\)v. (27a)
trl
Cj,,(O) = ':X j ", j = 1, . . . , n. (27b)
Here, (27a) arises formally from (26a) if one replaces 14 by 14" and t' by "j.
Equation (27b) results from u,,(O) = u"o.
Since \\'1' . . . . "'n are linearly independent, we get
det((w,I\Vj)H) O. k.j = 1, .... n,
for the Gram determinant. Therefore, equation (27a) can be solved for CII and
thus represents a system of linear differential equations. Recall that
lI u lllf'1 1 = 11 14 11,..)(0. T;V) + lIu'IILz(o. T:V e ),
Theorem 2J.A. If the assumptions (H t). (H2), (H3) hold. then:
(a) Existence and uniqueness. The original equation (26) has exactly one
so/ut ion u.
(b) Continuous dependence on the data. The map
(u o , b) u
;s linear and continuous from H x L 2 (0, T; V.) to Wl (0, T; V, H), i.e., there
is a constant D > 0 such that
lIullwl' S D(lIuoliH + IIbIlL2CO,T:..e.),
.(or all U o E Hand b e L 2 (0. T; V.).
(c) Convergence of the Galerkin method. For all n = t. 2, ..., the Galerkin
23.7. Main Theorem on first-Order Linear Evolution Equations
425
equation (27) has exactly one solution
u" e W 2 1 (0, T; V, H).
The sequence (u,,) converges as n 00 to the solution u of (26) in the
following sense:
u" u in L 2 (O, T; V),
max lIu,,(t) - u(t)II H O.
Os's T
(28a)
(28b)
Corollary 23.24. The original equation (26) is equivalent to the following opera-
tor equation:
u'(t) + Au(t) = b(t)
u(O) = Uo e H,
for almost all t e ]0, T[,
u e W 2 1 (0, T; V, H).
(29)
Here, the operator A: V V. is linear, continuous, and strongly monotone with
(Au, v)., = a(u, v)
for all u, t' e
The proofs of Theorem 23.A and Corollary 23.24 will be given in Section
23.9.
Remark 23.25 (G4rding's Inequality and a Transformation Trick). In assump-
tion (H2) of Theorem 23.A, we required the strong positivity of the bilinear
form a. We want to show that it is sufficient to postulate the validity of the
more general G4rding inequality
a(u,u) cllull - dllull for all u e V, (30)
where c > 0 and d 0 are constants. To this end, we use the transformation
u(t) = ecl,,(t).
From the original equation (26) we obtain the transformed equation
d
dt (W(t)ll')H + a I (w(t), v) = (b l (t), v) for all v E V,
"'(0) = U o , w e W 2 1 (0, T; v, H)
where al(" v) = a(w, v) + d('lv)H and bt(t) = e-fl'b(t). By (30),
al(u, u) cllull for all u e
(31 )
That is, the transformed bilinear form a l is strongly positive. Thus we can
apply Theorem 23.A to the transformed problem (31).
We now replace the original problem (26) with the more general problem
d
dt (u(t)1 V)H + a(u(t), v; t) = (b(t), t)." (32a)
426
23. Hilbert Space Methods and Linear Parabolic Differential Equations
14(0) = 1'0'
14 E W 2 1 (0, T: V, H).
(32b)
In contrast to (26). the bilinear form a depends on time t. Here, (32a) must hold
for all r E J/ and almost all t E ]0, T[. I n this case, we replace the assumption
(H2) with the following assumption.
(H2.) For all t E ]0. T[. the mapping
a('. .:t): V x V -+ IR
is bilinear. bounded. and satisfies an abstract Garding inequality, where
the constants are independent of time t. That is. there exist constants
c. c > 0 and (I > 0 such that
la(u,I';t)1 < CUI4I1vll l 'II..,
a(ll,u) > cpull - dllull,
for all II, r E JI and t E ]0, T[. Moreover, the function
I ....... a (II. I': I)
is measurable on ]0, T[ for all u, r E It'.
We are given 110 E Hand b E L 2 (0, T; V.).
Corollary 23.26. Suppose Ihat (H 1), (H2*), (H3) hold true. Then all the asser-
tions 0.( Theorem 23.A are ato t'alid for equation (32).
Alorenrer, (32) is equil'alent to the follon'ing operator equation:
11'(1) + A(I)u(t) = h(l) .for almost all t E ]0, T[,
u(O) = 1'0' u E W 2 1 (0, T: V, H).
(33)
,,'here Ihe operator A (I): Jl -... V. is de.(i'Jed b}'
< A (l ) II. I' > I = Cl (II. L': I)
for all II, I' e V.
The proof will be given in Section 23.9.
23.8. Application to Parabolic Differential Equations
Let QT = G x ]0. T[. We consider the following initial-boundary value
problem:
Ur(X. I) - 6u(x, t) = f(x, I) on QT'
u(x, t) = 0 on i'G x [0, T],
u(x,O) = uo(x) on G.
(34)
Definition 23.27. Let G be a bounded region in R N with N > 1. We set
V = "'2 1 (G). H = L 2 (G).
23.8. Application to Parabolic Differential Equations
427
The generalized problem corresponding to (34) reads as follows: Let f E L 2 (QT)
and Uo E H be given. We seek U E Wi (0, T: H) with u(O) = U o so that
d
dt (u(t)lt')H + a(u(t), r) = (b(t), t')v
holds for all rEV and almost all t e ]0, T[I In this connection, for all u, reV,
we set
a(u, t) = r f D,uD,t, dx,
JG i=1
(b(t). r)v = L f(.. t)t'(x)dx.
The last equation is justified by Example 23.41 Here, ." = (1'.'.' .\') and
D i = ('/Ci. Recall that
(ulr)H = L u(x)t'(x)dx.
The generalized problem and the corresponding Galerkin method (27) are
motivated by Section 19.2. As the basis elements {"I.. "'2' . . .} in V = W 2 ' (G)
one can choose polynomials or finite elements (see A 2 (56)-A 2 (60»).
Proposition 23.28. AI/the assertions of Theorem 23.A and Corol/ar}' 23.24 hold
.for ,l,e generalized problem corresponding to (34).
PROOF. By Example 23.4, b e L 2 (O, T; V*) and
IIbIlLz(OIT;V.. S nfIlLl«(lr.'
It results from the proof of Proposition 22.7 that a: V x V -+ R is bilinear.
bounded, and strongly positive. Thus. all the assumptions of Theorem 23.A
are fullfilled. 0
We now replace the classical heat equation (34) with the following more
general initial- boundary value problem:
N N
", - L D,(a,jDju) + L iDiU + pu = f on QT'
'.j=-I '-I
14(:(, r) = 0 on cG x [0. T],
u(x,O) = l'o(X) on G.
Here. the coefficients a'j' 2 " and p may depend on time t. Such equations are
related to general heat conduction processes and diffusion processes.
Our assumptions are the following.
(AI) Let G be a bounded region in R N with N 1. and let 0 < T < oc. We
set QT = G x ]0. T[.
(35)
428
23. Hilbert Space Methods and Linear Parabolic Differential Equations
(A2) All the functions aiJ' .' p: QT -+ IR are measurable and bounded, i.e., they
belong to Lx:(QT)' In particular, this condition is satisfied if all these
functions are continuous on QT'
(A3) Uniform ellipticity. There exists a constant it > 0 such that
N ,v
L aij(x, t)'i'J > )' L '1
i.)=1 i-I
for all (x, t) E QT and all (, I' . . . ';N) ERN.
Definition 23.29. The generalized problem to (35) is defined analogously to
Definition 23.27 with
f ( N )
a(u. ( t) = . aijD;uDJI' + .L 2 j l'D i u + put, dx.
G Iole I P=t
Formally, the generalized problem is obtained in the following way. Multiply
the first equation of (35) by v E Ct(G) and use integration by parts.
Proposition 23.30. Assume that (A 1 )-(A3) are satisfied. Then all the assertions
of Corollary 23.26 hold trlle for tlae generalized problem to (35).
In particular, .for git'en u E L 2 (G), f E L 2 (Qr), the generalized problem to (35)
has a unique solution u in the space W 2 1 (0, T; V, H) and
lIuli M'l const( lIuolILlcGt + IIfIlLl«QT»'
".here V = H/ 2 1 (G) and H = L 2 (G).
PROOF. We use Corollary 23.26.
(I) The Holder inequality yields
la(u,t';t)1 s Cllullvllvll v
Observe that
f i rDiu dx supl d (f (D j u)2 dX) 1/2 (f v 2 dX) 1.2
SUpl2dllullvllvll v (36)
for all u. v E V, t E ]0, T[.
and
f puvdx supIPlliullvlll'lIv'
(II) The Poincare-- Friedrichs inequality yields
f aiJD;IIDit'dx > }' f (DiU)2 dx
I. J I
> c "ull for all u e V and fixed c > o.
23.8. Application to Parabolic Differential Equations
429
Moreover, we have
lIuliH s lIull.,
for all u e Jt:
From (I) we obtain the decisive Gdrding inequality:
a(u,u,t) cllull - dllull for all u e t e ]0. T[,
where the constants c > 0 and d 0 are independent of time t.
(III) The function
t..... a( u, v; t)
is measurable on ]0, T[ for all u, v e V. This follows from a'l' ex" (J e
L(Qr)' by Example 23.4.
(IV) Let Ie L 2 (QT). From
(b(t),v).,= t/(X,t)V(X)dX forall veV,
it follows that IIbIlLz(o. r: v.. S II/II L1 (Q,., again by Example 23.4.
The assertion is now a consequence of Corollary 23.26. 0
Let
L(t)u(x, t) = (-I-iDCI(aCl'(x, t)D-u(x, t»,
1-1 .,)i s; '"
where the partial derivatives DCI, D' refer to the space variable x. We now study
the following initial-boundary value problem of order 2m:
u,(x, t) + L(t)u(x, t) = I(x, t) on QT'
DClu(x, t) = 0 on oG x [0, T] for all : lexl S m - I,
u(x, 0) = uo(x) on G. (37)
We assume:
(BI) Let G be a bounded region in R N , N I, and let 0 < T < 00. We set
Qr = G x ]0, T[. Moreover, let m = 1, 2, ... .
(B2) All the coefficients as,: QT R are measurable and bounded.
(83) Strong ellipticity. For each t e ]0, T[, the differential operator L(t) of
order 2m is regularly strong elliptic or strong elliptic in the sense of
Definition 22.42, where the ellipticity constant Co is independent of time t.
Definition 23.31. The generalized problem to (37) reads as follows. Let
V = W 2 "'(G), H = L 1 (G),
and let Ie L 1 (Qr) and Uo e H be given. We seek u e Wl(O, T; V, H) with
14(0) = 140 such that
d
dt (u(t)1 V)H + a(u(t), v; t) = (b(t), v)v
430
2l Hilbert Space Methods and Linear Parabohc Differential Equalions
holds for all (' E Jt' and almost all t E ]0. T[. Here, for all II, IJ E V, we set
alu. r; I) = r L a 2 t/(x. ,)D 2 t,DfJ u dx.
J G 1 2 1.1"15 '"
<hlt).r> = r f(x.l)r(x)dx.
J (i
Proposition 23.32. ASSlllne (B I) through (B3). ThetJ. all the assertions (?r
("orollary 23.26 hold trill' .for the generali:ed problem to (37). In particular, tile
gellerali:ed prohlel)l to (37) has a uniqlle solution u and
!lIlI'"'1' < const( huoll1.2fG' + Il.fIl1. 2 (Qr.)'
PR()()f. Observe the Garding inequality from Section 22.15 and use the same
argument as in the proof of Proposition 23.30. 0
Remark 23.33 (Generalizations). Using the Garding inequality for strongly
elliptic systems from Problem 22.7, one can immediately generalize Proposi-
tion 23.32 to parabolic systems.
Applications of the theory of analytic semigroups to parabolic equations
will be considered in the Appendix, A 2 ( 110). However, note that this approach
needs stronger assumptions with respect to a 2 tJ and f than in Proposition
23.31.
Combining Theorem 23.A with the Banach fixed-point theorem, one can
obtain existence theorems for semilinear parabolic equations. This method
will be used in Section 72.5 in order to obtain existence theorems for the
nonstationary Navier-Stokes equations for viscous nows.
23.9. Proof of the Main Theorem
The following proof of Theorem 23.A contains a number of methods of
deduction that are typical for existence and uniqueness proofs for time-
dependent differential equations by means of the Galerkin method. We
recommend a careful study of this proof inasmuch as it contains important
elements which facilitate the understanding of more complicated existence
proofs for nonlinear evolution equations which will be considered later on.
The key to the proof is:
(i) the integration by parts formula (25); and
(ii) the Gronwall lemma (Proposition 3.10), which yields the necessary a priori
estimates for the solutions of the Galerkin equations.
"or brevity we set
(u, I') = (u, v)..,
111111 = II u II., ,
(ulv) = (UIV)H'
lul = lIull".
23.9. Proof of the Main Theorem
431
Moreover, we set J = J and
W = Wi (0, T; V, H).
Observe that the embeddings V c: H S; V. are continuous, that is,
lul S const lIuli for all u E V,
lIuliv. s constlul for all u E H.
According to (17) we have
(ulv) = (u, t>
By assumption (H2),
a(u,u) cllulI
for all u e H, v e V.
for all U E V and fixed c > o.
(38)
We set
(Au, v) = a(u, v)
for all u, t' E
Then the operator A: V -+ V. is linear. continuous, and strongly monotone.
Slep I: An equivalent operator equation.
Our goal is to prove Corollary 23.24. i.e., we want to show that the original
equation (26) is equivalent to the operator equation (29). To this end, we use
the formula
d
dt (u(t)1 v) = (u'(t). v)
for all f e V,
(39)
and almost all , e ]0, T[. This follows from Proposition 23.20. Here, did,
denotes the generalized derivative on ]0, T[.
(1- 1) Suppose that U e Wand that
d
dl (u(t)1 v) + a(u(I), t') = (b(I), v)
r or all t' E
(40)
and almost all 1 E ]0. T[. This implies
(U'(I) + Au(t) - b(I), t') = 0 for all t' e V,
and almost all t E ]0, T[. Hence
U'(I) + AU(I) - b(t) = 0 for almost all t e ]0, T[. (41)
(1-2) Conversely, (41) and u E W imply (40).
Step 2: Uniqueness.
For j = I, 2, suppose that
U;(I) + Auj(u) = b(t) for almost all t e ]0, T[,
u)(O) = U o . u J E
The difference U = U. - "2 then satisfies the equation
U'(I) + Au(t) = 0 for almost all t E ]0, T[,
u(O) = 0, U E W.
432
23. Hilbert Space Methods and Linear Parabolic Differential Equations
We have to show that
u(t) = 0 for almost all t E ]0, T[. (42)
By the integration by parts formula (25) and by (38), we get
!III( nl 2 = f < 1I'(t), u(t» dt
= - f < Au(t). u(t) dt = - f a(u(t), u(t)) dt
< -c f lIu(t)1I 2 dt.
Note that u(O) = o. This implies
11 u( nl 2 + c f lIu(t)1I 2 d, < 0,
and hence (42).
Step 3: Existence proof via the Galerkin method.
For the sake of simplicity, we assume that the function
, t-+ (b(t), l')
is continuous on [0. T] for all t' E V. The more general case that b E
L 2 (O, T: V*) will be considered at the end of this section in connection with
the proof of Corollary 23.26.
(I) Solution of the Galerkin equations. The Galerkin equations (27) are
equivalent to
(u(t)'"j) + a(u,,(t), Wj) = (b(t), wi)'
u..(O) = u"o.
j = 1,..., n,
(43)
where
"
u,,(t) = L cJ,,(t)j,
Jst
and "i E V for allj. Equation (43) represents a linear system of ordinary
differential equations. According to Corollary 3.8, for each n = I, 2,
. . . , equation (43) has a unique classical solution on [0, T].
(II) A priori estimates for the Galerkin solutions. We show that
f II u"(')11 2 d, K (lu"(0)1 2 + f IIbll. d,) (44)
holds for all n, where the constant K is independent of n.
Multiplying (43) by c J and summing over j, we obtain that
(u(I)1 U,,(I» + a(u..(I), u,,( I» = < b(I), u,,(t) >.
23.9. Proof of the Main Theorem
433
Because of the well-known product formula
:t (u,,(t)1 u,,(t» = 2( u(t)1 u,,(t»,
we get
d 2
dt lu,,1 + 2a(u", u,,) = 2(b, u,,).
Integration yields
lu,,(TW - lu,,(0)1 2 + 2 J a(u" , u,,)dt = 2 J (b, u,,) dt.
We now use the strong positivity condition
a(u", II,,) ell u,,11 2
and the classical inequality
21xyl S c- I x 2 + cy2
for all x, y E R.
This implies
lu,,(T)l2 + 2c J lIu,,1I 2 dt S lu,,(0)1 2 + 2 J IIbll..-lIu,,1I dt
S lu,,(OW + c- I J IIbll- dt + c J II u" II 2 dt,
and hence (44).
(III) Weak convergence of the Galerkin method in L 2 (O, T; V). By (44), the
sequence (u,,) of the Galerkin solutions is bounded in the H-space
L 2 (O, T; V), since 11,,(0) -+ "0 in H as n -+ 00. Thus there exists a weakly
convergent subsequence (II".), i.e.,
II,,' II in L 2 (O, T; V) as n -+ 00. (45)
Below we shall show that from (45) we obtain:
11'(1) + AII(t) = b(t) for almost all 1 E ]0, T[, (46a)
u E (46b)
11(0) = "0. (46c)
By Step 2, equation (46) determines uniquely the solution II. Con-
sequently, all the weakly convergent subsequences of(u,,) have the same
limit II. Thus we obtain that
II" II in L 2 (0, T; V) as n -+ 00, (47)
according to Proposition 21.23(i).
In order to prove (46), we will use the following idea. By means of
434
23 Hilbert Spa(."e Methods and Linear Parabolic Differential Equations
generalized derivatives" we write the Galerkin equation as an integral
identity. It is important that this identity does not contain any deriva-
tives with respect to u". Applying the weak limit (45) to this identity"
we get (46). The key equation is given by
-(lI o lrltp(OI - f (u(t).t,)tp'(t)dt
+ f (Au(t). r)tp(t)dt = f (b(t). t')tp(t)dl, (48)
for all r E J/ and all real functions
qJ E C'[O" T]
with cp( T) = O.
(49)
(III-I) Proof of (48). Choose cp as in (49). Multiplying the Galerkin equation
(43) by cp and using integration by parts.. we obtain the integral identity:
- (u" ( 0 II "i I tp (0 I - f (u" (t I, "i) tp' ( I ) d t
+ f a(II", "i)tpdl = f (b, Wj)tpdl, (50)
for all n > j. Observe that
(u,,( t)l "j) = < u,,( t).. \\j).
Applying the weak limit (45) to equation (50), we get
- (uol "i Itp(OI - f (U(I), Wi) tp'(I) dl
+ f a(u, Wjltpdl = f (b, "i) tpdl, (51)
for all j.
In order to justify this limit we need the fact that the second and
third term in (50) represent linear continuous functionals on the space
X = L 2 (0. T: V)" with respect to u... Indeed, we have that
f (11,,( I), "i) tp' (I) dl < C I f 1111,,(1)1111 "ill I tp' (1)1 dt
< Czllwjll (f lIu,,(I)lI z dl )'IZ
= C 211 j 1111 u" II x' (52)
where C. and C 2 are constants, by the Holder inequality and IIvtlr. <
23.9. Proor or the Main Theorem
435
C III (' II for all v E Analogously,
J a(u,,(t), wj)(/)(t) dt s c, J II u,,( t)1I1I Wjlll (/)(1)1 dt
C 211 w) 1111 U II II x . (53)
In order to finish our argument let v e V. By assumption (H3), there
exists a sequence (VII) with
VII -+ V in V
as n -+ oc,
where each VII is a finite linear combination of certain basis elements
Mj. letting n -+ XJ we see that equation (51) is also valid if we replace
"i with v. This is the assertion (48).
To justify this limit we need the fact that the terms in (S I) represent
linear continuous runctionals on the space with respect to wi' This
follows from (52), (53) and
J (b. "i) dt s C (J IIb(r)II. dt Y/2 II "ill.
(111-2) Proof of (46a). From (48) we obtain that
( - J (/)'udt + J (/)(Au - b) dr. 1') = O.
for all cp e Co'(O, T) and all v E Hence
- J (/)'udr + J (/)(Au - b)dt = 0 for all (/) E Co'(O, T).
This shows that u( · ) has a generalized derit'ath'e on ]0. T[ with
u' + Au = b on ]0, T[.
(111-3) Proof of (46b). From u E L 2 (O, T; V) and UAt'1I J'. S C Uvll for all (' E Jt:
we obtain that
f II AU(I)II t. dl s C2 f II u(I)1I 2 dl,
i.e., Au E L 2 (0, T: V*). From b E L 2 (0, T: V*) we get u' E L 2 (0, T; V*).
Hence I' E
(111-4) Proof of (46c). Since U E Jot: we can apply the integration by parts
formula (25). This yields
(u(T)I(/)(T)t') - (u(O)I(/)(O)r') = f (U'(I). (/)(I)t') + (U(I), (/)'(r)v) dl,
for all cP E C. [0, T] and all t' E Jt: In particular, if cp(O) = I and CPt T) =
436
23. Hilbert Space Methods and linear Parabolic Differential Equations
O. then equation (48) along with u' + Au = b yields
(ll(O) - uol v) = 0
for all v E V.
Since ' is dense in H. we get u(O) = Uo.
The existence proof of Theorem 23.A is complete.
Step 4: Continuous dependence on the data.
From
II" Il in L 2 (O. T; V)
as n -. oc
it follows that
\I u 1I'..zeo. To .') lim II u.." L;z(O. T. V).
"-00
by Proposition 21.23(c). Thus. letting n -. 00, we obtain from the a priori
estimate (44) and from u,,(O) -+ U o as n -. 00 that
lIull 1.:(0. T: VI K 1:2 (I uol 2 + f IIb(t)II:. dt Y/2 .
By (111- 3 ).
II Au II Lz(O. T; J/.) sell II It '''leo. To V).
Finally. from Il' + Au = h we get
lIullw D(luol2 + f IIb(t)Uf.. dt Y f 2.
Step 5: Convergence of the Galerkin method in C([O. T]. H).
We show that
(54)
max I u,,(t) - u(t)l --+ 0
os's T
as n --+ 'YJ.
(55)
To this end. we will use the approximation argument (56a) and (56b) below.
We have U E Wand also u" E W for all n. The embedding W c C([O. T]. H)
implies
II. II" E C( [0. T]. H)
for all n.
(I) Approximation. Set H.. = span {,,,',..... Wit}' Hence Hit V £ H. By
Proposition 23.23. for each u E Wand E > 0 there exists a polynomial
p(t) = L tia i
i
with coefficients a; E V and
1111 - pil w = (f lIu(I) - p(t)U: dt y'2 + (f lIu'(t) - p'(t)II:. dt y 2 < f..
This inequality remains true if we change the coefficients ai of p a little
with respect to the norm of the space V. Since the set U" Hit is dense in
V. we obtain the stronger result that the set of all polynomials with
23.9. Proof of the Main Theorem
437
coefficients in U" H.. is dense in the space Thus there exists a sequence
(p..) of polynomials p..: [0, T] -. H.. with
p.. --. u in W
as ,. --. 00.
The continuity of the embedding W c C([O, T], H) implies
max lu(t) - p,,(t)1 S const lIu - p..llw --.0 as n --. 00. (56a)
O'T
Below we will show that u" u in L 2 (O, T; V) as ,. --. XJ implies that
max IU,,(t) - p,,(t)I--. 0 as,. .... 00. (56b)
O'S T
From (56a) and (56b) we obtain the desired relation (55).
(II) Proof of (56b) via integration by parts.
(II-I) From u..(O) ..... u(O) in H as n ..... 00 and (56a), it follows that
114..(0) - p..(O)1 -. 0
(11-2) From the Galerkin equations we get
(u, u.. - p,,) = (b - Au.., u" - p,,)
as ,. --. oc.
= (u' + A(u - u,,), u.. - P..).
The positivity of a(., .) yields
<A(u - u,,), u - u,,) > o.
(11-3) We set X = L 2 (0, T; V), and hence X. = L 2 (0, T; V.). Using (II-I) and
(11-2), we obtain from the integration by parts formula (25) and the
Holder inequality that
2- l lu..(t) - p..(t)1 2 - 2- 1 Iu..(0) - p..(0)1 2
= E (II - p, II" - p,,) ds
= f (II' + A(II - II,,) - p, (II" - II) + (II - p,,» ds
S f (II' - p,II" - p,,) + (A(II - 11,,),11 - p,,)ds
S lIu' - plIx.llu" - p..ll x + IIAu - Au"lIx.llu - p..ll x
s const II u - p..11 IF ..... 0 as n --. 00.
Note that the sequences (u,,), (p,,) are bounded in X, and (Au..) is bounded
in X.. This follows from the weak convergence of (u..) and (p,,) in X as
well as from (54).
Step 6: Strong convergence of the Galerkin method in the space X =
L 2 (O, T; V).
438
23 Hilbert Space Methods and linear Parabolic Differential Equations
(I} F'rom "" -"II in X as " -+ X it follows that
Au u in X.
"
as "-+ X.
since A: .,\, -+ }(. is linear and continuous by (54). This yields
f<AII".II)dt-f(AU.U)dt as n-.x.
and
f (h.u,,)dt - f (h.u)t/t
by Proposition 23.9. Letting" -+ x,, we: get
f <h - AII".II) + <h - Au. u,,> dt - (u,,( T)iu( T)) + (u,,(O)lu(O))
- 2 f (h - Au. u) Jt - lu( nl 2 + lu(0)1 2 = O. (57)
as "-+ X.
Observe II' = h - Au and the integration by parts formula (25).
(II) We show that u" -+ u in X as n -+ 'X. By (25), we obtain the following
two rela t ions:
2- l lu(T) - U,,(T)l2 - 2- l iu(O) - U,,(O)l2
= f (u'(t) - 1I(t). u(t) - u,,(t) JI
= f(b-AII-II.II-U,,)dt
and
f u" ( T) III ( T») - (u" ( 0)1 u (0)) = f (II ( t). u ( t) + (II' ( t). II" ( t ) ) J t
= f (u.u) + (b - Au.u,,)dt.
The Galerkin equation (43) yields
< II ( t). u" ( , ) + < A II" ( I). U" ( t ) = < h ( t), II" ( t ) ).
Now to the key of our proof. The strong positivity of a(., .) yields
£'iIu - 1I"lIi = (' f lIu(t) - 1I,,(t)1I 2 Jt
f < A f II - u,,). II - II,,) d t + 2 - I I II (T) - II" ( T) 1 2 .
Problems
439
Hence
c II u - u" II
s r1Iu(0) - u,,(0)1 2 + f (b - Au" - u. u - u,,) dt
= 2- l lu(0) - u,,(0)1 2 + f (b - Au" - u. u) dt
= r1Iu(0) - u,,(0)1 2 + f (b - Au".u)dt
+ f (b - Au. u,,) dt - (u,,(T)lu(T)) + (u,,(O)lu(O)) ..... 0
as n -+ 00,
according to (57). This implies U II -+ u in X as " -+ 00.
The proof of Theorem 23.A is complete.
PROOf OF COROLLARY 23.26. This proof proceeds analogously to the proof or
Theorem 23.A. Observe the following two pecularities.
(i) The bilinear functional (u, v)....... a(u, v; I) depends on time I, and hence the
operator A(t) also depends on t. But this does not change our estimates,
since all the constants are independent of t according to assumption (H2*).
(ii) The Galerkin equations represent a linear system of ordinary differential
equations, where the coefficients are measurable and bounded. In order to
solve this system, we transform it into a system of Volterra integral
equations, which has a unique continuous solution (see Section 3.3). By
the main theorem of calculus A 2 (2Sc), we may differentiate this integral
equation with respect to I almost everywhere on ]0, T[. This way we
obtain a solution of the Galerkin equations with bounded derivatives,
which exist almost everywhere on ]0, T[. 0
PROBLEMS
Without stating explicitly the opposite, we assume that
1 S p. q < 'X) and
Problems 23.1 to 23.10 form a logical chain.
23.1. Generalized approximation theorem of Weierstrass. Let X be a B-space. Show
that the set of all polynomials p: [0, T] .... X with coefficients in X is dense
in C( [0. T], X).
Solution: We translate the classical proof of Bernstein (1880-1968) for
X = R to our more general case. Let L = Li.o.
o < T < .
440
23. Hilbert Space Methods and Linear Parabolic Differential Equations
(1) Two identities. For "'(I) = (:)1.(1 - Ir-., we have
L l\(t) = I,
L h.(t)(nl - k)2 = nt(r - t),
for all , E Rand n = t, 2, . . . . To prove this, we begin with
II + sr = L () I. s"-.,
Differentiation with respect to t yields
nlfl + sr -I = L (:) kl.s" "
nln - 1)1 2 (1 + sr- 2 = L ()k(k - I)I.S"-',
Setting .'i = 1 - I we obtain (58), by summation.
(II) The Bernstein polynomials B". After a similarity transformation we
may assume that T = I. Let u e C([O, I], X) with the nonn lIuUc =
maxo, I lIu(r)II. We set
(58a)
(58b)
B"Ct) = L U () "'(t).
By (58),
lIu(l) - 8,,(1)1I < L U(I) - u () "'(1),
For f. > 0 we choose '"' > 0 such that
i U(I) - U () < f.
if
k
1 - - < lJ.
n
Obviously.
I' ( k ) , ( k ) 2 I
,I ult) - u - < 2 i: ull e I - - I b 2
I' n n I
From (58 we obtain the desired result:
II U(I) - B,,(IIII S L (f. + 211 ulle (I - r / c)2 ) "'(tl
I: + 2l1ull c l( I - 1)/6 2 n 2£,
if
k
t - - > lJ.
n
for all n "0(£) with suitable no(l:).
23.2. Proof of Proposition 23.2. The proof proceeds along the lines of the classical
proof for real functions, which may be found, for example, in Kufner. John.
and Fuik (1977 11 ). Also compare Gajewski, Groger, and Zacharias ( t 974,
M). Chapter 4. Let us consider some typical arguments.
Problems
441
.
.
I
T
(a)
(b)
Figure 23.2
(I) Density.
(I-I) Prove first that the set of an step functions is dense in L,(O, T; X). In
fact, it follows from the definition of the Lebesgue integral A 2 (14) that
for each U e L,(O, T; X), there exists a sequence (u.) of step functions
such that
U.(t) -. U(I)
Setting
almost everywhere on ]0, T[
as n -. .
{ U.(t) if lIu.(I)1I S 2I1u(I)II,
v (I) =
· 0 otherwise,
we get
f: IIu(t) - v.(t)ll'dt -+ 0 as n -+ 00,
by the principle of majorized convergence A 2 (19).
(1-2) A simple approximation argument of measure theory shows that all
those special step functions are also dense in L,,(O. T; X), which are
constant on intervals (Fig. 23.2(a». Each such special step function
can be arbitrarily closely approximated by continuous functions (Fig.
23.2(b). Hence C([O, T]. X) is dense in L,,(O, T; X).
(1-3) By Problem 23.1, the set of all polynomials p: [0, T] -. X, with coeffi-
cients in X, is also dense in the space L,,(O, T; X).
(II) Separability. Let Z = L,,(O, T; X) and suppose that X is separable.
Choose an at most countable set M, which is dense in X. Let P be the
set of all the polynomials p: [0, T] -. X with coefficients in M. A simple
approximation argument shows that P is dense in Z by (1-3). Hence Z
is separable.
(III) Embeddlngs. The continuity of the embedding
C( [0, T], X) S; L,,(O, T; X)
follows from
(I T ) 1/"
flu(I)II"dl S TI/" max lIu(I)II.
o oS'S T
The continuity of the embeddings in Proposition 23.2(h) is an imme-
442
2 Hilbert Space Methods and Linear Parabolic Differential Equations
diate consequence of
(f: IIU(I}IIJdtY' c(f: IIU(t) Uldt r'
c, (foT IIU(t)lIxdtr'.
.ere. use the Holder inequality and note that t q S r < x.
23.3. I)('nse ...lIhseIS of Ille .fipace r = C'"( [0. T]. X). m 1. let M be a dense set in
the 8-space X. Show that the set of all the polynomials p: [0. T] -+ X with
coefficients in ,\1 is dense in
Solution:
(It Let .\1 = X and m = 1. We set C = C([O, T]. X). By Problem 23.1. for
each u E Y. there exists a sequence of polynomials q..: [0, T] -+ X with
q" u' in C
as n....... "X:.
Integration yields
If we set
U(I} = u(O) + I u'(s)ds.
P.(I) = u(O) + I q.(s) ds.
then p == C/" and
IIU(I) - p,,(1)1I < TUu' - q,,!I(Oo
Hence we obtain P.. ....... II in C I ([0. T]. X) as n ....... k.
01) In the case where M = X and m 2. integrate the mth derivative. If
AI eX. then use a simple approximation argument with respect to the
coefficients of the polynomials.
23.4. Ahso/llle/y ("onlinuous functions. The function \\': [0. T] -+ X with values in
the B-space X is called absolute/)' continuous iff. for each I: > O. there exists
a J > 0 so that
L IIU(I,,) - u(.C;,,),1 < f.
"
holds for all finite systems of pairwise disjoint intervals ]t". s,,[ of total
length < J.
Show that each Lipschitz continuous function ".: [0. T] ....... X is absolutely
continuous.
Solution: Observe that (lu(l,,) - u(5,,)11 S Lll" - s"l.
23.5. Genera/i:ed main theorem of calculus. Let X be a B-space. For a given function
II e t I fO. T: X). we set
1'(1) = f u(.1 d..
23.5a. Show that r E C([O. T]. X) and that
(" = u on ]0. T[.
(59)
in the generali1ed sense. Furthermore. show that t' is absolutely continuous.
Problems
443
Solution: We will use a typical approximation argument. The set of all the
polynomials q: [0, T] -+ X is dense in L, (0, T; X). Hence there exists a se-
quence (q..) of such polynomials with
q.. -. u in L, (0, T; X)
as n.... 00.
We set
PR(t) = f qR(s)ds.
From
IIt.(t) - PR(t)1I s f: lIu(s) - qR(s)1I ds
it follows that
P.. -+... in C( [0, T], X)
as n -+ 'XI.
The continuity of the embedding C([O, T], X) c La (0, T; X) implies that
P.. -. v in L, (0, T: X)
as n -+ x.
By Proposition 3.7, we have p(t) = q,,(I) on [0, T], in the classical sense. By
Example 23.16.
f: rpqR dt = - f: rp'PR dt
for all qJ e CO=(O. T).
Letting n -+ oc we obtain that
f: rpudt = - f: rp't'dt.
according to Proposition 23.9. This yields to' :a U on ]0, T[.
The absolute continuity of t,. (ollows from the absolute continuity of the
integral (see A l (20)).
23.5b.. Consider the function t' in (59). where II e L, (0. T: X). Show that
v'(t) = u(t)
for almost all t e ]0. T[.
(60)
where the derivative exists in the classical sense.
Hint: This is a special case of A l (2S). Compare Dunford and Schwartz
(1958. M). Vol. I. 111.12.8. In particular, equation (60) holds true if u is
continuous at the point t.
23.x.. The generalized theorem of Lebesgue. Let ..': [0, T] -+ X be an absolutely
continuous function with values in the renexive B-space X. Then the classical
derivative I,'(t) exists at almost all points t e [0. T]. Moreover. we have
v' e L, (0. T; X) and
r(t) = t"(O) + f v'(s) ds for all t e [0. T).
Hint: cr. Brezis (1973, L), p. 145.
444
23. n.lhcrt Spat.'e tcthod 3tH! LU1CtU II,Uc'1liC Dlfferenhal EqUtttion5
2.1 6. (j"It.'rQJized IJarialitJntlJ Jen.nta Let X be a n-space. Show that if tI E t,{O, 7.; )
and
faT (/Iuds - 0
then u :z 0 in t 1[0, T; X), that i,
for all lfJ E C:(O, T).
U(I) == 0
for almost al1 I e ]0. 1'[.
Solution. Let 0 < f T and let I: > f) be sufficiently small We define a
function tp € Co"O(O. 7.) such that 0 q1 rand
<p(.f) = {
if se [£.,1 - 1:],
if s [0. ,].
Hence
f>dsl = II "d.f - f: (/Iudsl
c
D: I f" u(1 - fP)ds + f' "(I - ,,}ds'
Jo J,-t
1: RU(.f)!! ds + f-. ftu{s)n ds.
Letting l -+ 0 we obtain that
f' uds = 0 for all I E [0. T).
Jrl
by A %(20). The assertion follows now from Problem 23.Sb, by differentiation.
23.7. T, junction....
23.73. Show that the real function I e C:(O. T) allows a representation ,")( the form
,
1.=-..
f' e Co(O. T)
iff fJ X J, = O.
Solution: Set (s) = $: xdt.
23.1b. Choose a fixed function P E CO(Of 7.) with
f: pdt = I.
Show that each runction ." E CafO, T) allows a representation or the fonn
- p l T t/tdl + tp', tp e C:(O. T). (61)
Solution: This follows from Problem 23.78.
2.8. Charocterization of conSIGnt functions by gcnerali%td deriwz.ivrs. Let Y be a
B..space. If u e L,,(O. T y I s p < 00, and if
II' := 0 on]O, T[.
(62)
ProbJem.
44S
in the generalized sense, then u == conttan'. i.e.. there exists ace Y with
11(') WE ('
for almost all I e ]0, T[.
Convcrsely, each such constant function belongs trivially to L,.(O. T; Y),
and we have u' :::: 0 on ]0, T[.
Solulion: We set c a f: pw dt. From (62) it follows that
IT ,,'udt - 0 (orall tpEC:'(O. n
Using (61). we get
f: ,ud, - !oT '" J, l T pud,.
that is,
f: "'(lot - c)d' = 0
The assertion follows now from Problem 23.6.
for all '" e C:(O. T).
23.9. p-mean conti"";,,. Let X be a 8-space. fetor each u e L,.(O. T; X) with
I s p < 'X). we have
lim r t 01.1(' + It) - u(1)II'd, :: 0,
-o Jo
(63)
where we set u(s) = 0 (or s [0, 7.].
Solu tion: Dcfine the translation operator AA by
(A,u)(t) 1.(1 + II),
and denote the norm on L,,{O. T; X) by H. A. Obviously,
IIA,ulI - (IT 11.1(' + h)l!' tl,y'P
(f: IIU(fWd,Y" = lIuM
(I) Step functions. Let M be a measurable subset of [0, T] with the char-
acteristic function
(64t
ror all heR.
x(t) = {
ir t e M,
if t, AI.
Setting II = xx for fixed x EX, we get
(L T 1111(1 + It) - U(I)UP 111 )'" 21hl "xii,
sa 21111 r- Il -gull.
The linearity or Aa implies that, ror all 'tp functions u. we get
UA..u - "II 21lrrr -l/l'nuN
Cor an II E R.
(65)
446
21 Ililhert S Mtth()d and linear .Iarabnlic l>ilferentialluatiOOl
(II) Approxinuuion. Since lhe set or all the step functions is dense in
1...,(0. "f; Xh relation (6S) also holds true ror all functions U E L,(O. 7'; X).
This is (6:\).
23.10. .1",", Sobc,lev ,"pd(' W = W:(O, "f; II), I < p < 00. Choose 'I with II 1 +
p I -. I.
23. lOa. Show that W is a B.space.
Solulion: If Cu.) is a ('auchy scquc:ncc in then
u. ... ., in t,(O. 'f; V) as" -. 00.
u -1 II in L.(O, 'I'; V. ) as" ... tX).
with suitable points u and v. Observe that 1..,,,(0. T; V) a 00 1....(0. ,,.; Y.) are
B-SpaCC5. The oontinuity or the embedding V sa V. implies v ;: y'. accurdina
to Propoition 23.19. Hence
". -. II in W
as n CX)-
\
23. lOb. Show that CI([Ot "11. V) is dense in W. '
Solution: The proof' pr(1CCCds analos()uly to the correspondiol proof for
classical Sobolcv spaces in Section 21.4. ObMerve that. because of the p-mean
continuity in J)rublcm 23.'.1, lhe sm()()thing operator for II E W has the same
propcnics itS the clas.ictal smoothing operator from Sec.ion 18.14.
f ronl Problena 23.3 it follows adllitionally that the set of all the poly..
nomials p: [0, '''1 V with cocmc..cnt5 in V is dense in the space W.
23.IOc, Show Ihal the embedding W r:. ("(rO, 1']. V.) is continuous.
Solution: r>enote the norm un V. .,nd 1..,,(0. T; V.) by 11'11 and If'I,.
respectively. Let U E W .I'he idea of the proof is 10 consider the function
I!(t):.: (' u'(s)Js.
J..
l'hcn, I' ('( [0, 'r], V.). To see thi. observe Ih..t
..'(t) . .,(.)i f 1""(:)1 J:
and U the ahsolute continuity of the Intcgr,d, A 2(20).
My Prdhlen,s 21Sa and 23.H. we ohtain 1'4' == II'. and hcnte
,,,,) = 11(1) .. (.
for almost all, E JO. 1l,
(66J
where: (" E V., l'he Holder inequality yields
max IIIJ(I)II 1'11' III' I..
0-.1 e. J
uu = .,. I/'Nt'""
(67)
Frum (66) and (67) it rullow that
mal lu{'}11 J( II.,'U. + II' - liD,,)
o ". , T
J(lIu'li. + HIJH" + 11 11 1,) s Jd'''Uw.
23.IOd. Show thallhe embedding W ("([0. 1], II) is continuous.
Prtthlcna
447
Soh.. I i, tn;
t I) Intcgration by p..rts. I,et u. I' £- ('" ([0. 1'].11). Integrating
("(I)) "(1) r ::. {u' (1)1"(/)) + (U(I)}'" (t)).
we get tht int&:gratiun by parts rormula
("{I)I,'(t)) - (u(s)I"t.U "" f (U'(:)I,,(:» ... (U(:)I'I'(:))J:, (611)
fur ..n 0 s S I S 'I: It)' f'7).
(..II') ;. (u. ")
ror ull II. t) E y,
Fur II, I' e ('1([0, .rj, V), this i'l1plic:s
(u(IH,'(I)) .- (,,(.H..(sn == f <,,'(:),1 1 (=1> ... (II'(:),II(Z)J:. (69)
(or ull () ... < I 1:
tilt In order lu cancel ,he tern' CII(!\)III')) in {69t. we ch()l\ a test functi\\n
tp f: ('I(N) wuh .,(s) :r:c nand (j)(IJ:= t. Moreuver.lel Jq>f + 1.p'1 s I on
ai. Set t' -:. cp", E\luation (6'1 ilnpliL rh.at
(U(I)lU(I)) s; cuns' H"n:" for all u e ('-([0. 1], '). (1())
lu see th;1 obM:rvc thaI .,' cp'., + .pu' and use Ih I J()Ic.ler inu.tlity.
I "Iuat i.uI (70) yields
Mult"tIO,J).,,) S c()nIII""w {ural! ue (11([0.1], V). (71)
l"hc c. ('. ([ O. 'f], J/ i dc:nsc in 51'. By (71) and the extension principle
Srnn, Sect.,..u 114.12. ,hc cn1bcddinl operator
j: (.'((0, '1'], V) W ... (:([0, Of]. II)
ha .. uni'lUC conllnuous extension j: W .... (.([0. 'f]. II In Ih.s scnt
the s"hcddn'l
w ('([0. 'I'J.II)
. con I inutttll.
21Uk:. Shuw that the inlcgrtation hy parts fornlula (69) hold for all u. ,'e w.
Sulutinu: l Jsc tiM: density ur c' ([0. 1'], V) in Wand a linliting prua.-u in
(6() t.
"! .t t. &-\/('"k. ,.C"Ut('rfl("I('e. Let X he i& a-space. Recall that a scq\lenc 'x: rr)m .,,*
IS "IId weak Iy. convcrlcnt lu the ""int .. in ,t'. iff
<-:.) -. <....)
a ,,-. fl.)
for all x eX.
We write '(: - x..
Show Ihat if the cntbeddlng X Y il continuous, then a.,.. n -t (,1(;:
tIt -..... in X
inlplies
impl
.!(. -- x in Y.
y: -:-,. in X.,
.
J': .-" )1. ia Y.
t I erc, .\' and )' 4UC R-Spa5 over K.
Su)utlon: ()bS4:rvc: thut Y. A...
448
23. Hilben Space Melhods and Linear Parabolic DifTerential fquations
23.12. Tile !.pac Y = L,yo,(O. 7'; X) tJPld wtak* fonverOtnct'. Let X be a B-space. By
definition. the space Y consists of precisely aU the measurable functions
II: ]0, T[ -+ X,. wh;ch are essentially bounded, that is, there exist a number
8 such that
Mu(r)1I s B
for almost all t e ]0, T[.
Precisely all the numbers 8 with this property are called essential bounds of
II. Moreover, we set
lIull". = inf{ 8}.
where the infimum is taken over all the e.;enti81 bounds of u.
23.12a. Show that Y equipped with the norm ". n is a A-space if we identify any
two (unctions which are identical almost everywhere on)O. 1"[.
Solution: The union of a countable number of sets of measure lero i again
a t o( measure 7.ero. lei (u.) t1e a ('auchy sequence in f: i.e.,
It II,. - II", IJ < £
for all PJ. m no(r.
Then there exists a set Z or measure lero such that
tu.(t) - 1I...(rU < 1/k
for all t e ]0. T[ - Z.
and all n. m rt()( Ilk). Further there exists a number B with
I u..(t)1 < B
for all t E ]0, 1"[ - Z.
Ilere, Z is indc"endent of n, m and k. Thus lhe sequence (u..) convcrses
uniformly on ]0, T[ - Z to a function which belongs to the space Y. Observe
that the limit of a sequence or measurable functions is aguin .. measurable
function.
23.12b. An em"tJ.lding fhMrm. Show that Ihe embedding
Lcm(O. T; X) L,,(O pr; X)
is continunus for all I p S 00.
Solution: F"or all II E t,(O. T; X p < 00, we get
""M, = (Lr UU(I}I" tll) I', TI"lIulI....
23. J 2c. f'un(.'It1nal.. Let V be a B.pace and let v E L«,(O. T; V.). We set
if") = l' <'11 uw}.. tll
for an u e l"f( T: V).
Show that veL. (O T: 11)-.
Solution: From 1(,'.u)1 UtIli lIuli it follows that
IfJ(u)1 "vi.., f' "u(t). tit := !Iv 1100 6u/ll'
This is the assertion.
The following result 5hows that this way one obtains precisely all the linear
continuous functionals on t. (O 1'; JI).
Problelnl
449
2J.12d.. Dual;, y. Let V be a reOexive separable B-space. Then the space L I (0, T; V) is
separable and
L.(O. T; V)* c: L..,(O, T; V.).
More precisely, there exists a bijecti linear mappinl;; ...... u from L. (0, T; V).
onto LCIiI(O, T; V.) with
(ii.lI) = S: (to('). 11(').. J, for all II e LI(O, T; V)
and IIi II = lu"QO. Thus. the B-sp;ices 1...(0, T; V). and 14cof'(O, T; V.) are
nurm;somflrplt;( .
One identifies v with v and writes brieny
< ", II> ; L T <11(').11(1) >.. dl.
Hint: cr. Edwards (196S. M). 8.18.3.
2l12e. Weak. quemitJl CtHftpac'rnt'ss. Let the B-space V be given as in Problem
23.12d. By Theorem 21.E, each bounded sequen (v.) in L(O. T; Y.} has a
subsequence with
p.' ., in L...(O, rf; V.)
as II --t 00.
Ihat is, for i.1I11 E L1(Ot T: V
f:' ("_,(').11(') J, - L1' ("f I). II(t) dt
23.12f.. IJmi.s wilh r('.pt'(.' I() ;nlt.tllrdls. tet V be a B-space. Show that
D. v in 1.-.,.(0, T; V.. as II -. 00
as II 00.
(72)
implies
S: v.(,)Jt f: ,J(t)JI
Solution: "'or aU u e V, as n -+ 0C1, we set
(f: ".(I)Jr,lI) '" f: (1I.(r). II) Jt
-. L" (11('),11> d. = (1" t!(I)Jt, II ).
as n -.. 00.
This rollows from Problem 23.12c.
23.12g. (ie"ertJliztJ derilNJ';s. Lei V be a separable reflexive B-space. Show that
rrom
IJ ;:; ".. on ]0, 1'[
in the generalized sense. and from
II...!.. u in L 1,(0, 'I'; V.)
v. t' in '.,a,(O, T; V.)
for all II.
as n -. 00,
(73)
as n -+ 00,
4
23. HilhcrC Space Methods and Linear ParRbolic Differential Equations
it follows that
.1" = II un)O. T[.
Solution: let fP E C:'(O, T). By Problem 23.12d it follows that (13) remain5
valid ir we replace II.. II, V. t v with fIH'.. ((JUt .' V .. tp'". respectively. Lellin.
n -. Ii) we obtain rrom
f: ",u.d, - - J: .','.4' for.n tp E C'fO. T)
the equation
f' tputl, = _ fr tp'"dt for all tp e CO'(O. T
J 0 J ..
according to Problem 23.12f.
23.1 J. T"." '/t"I('rtli S(JmJ/t'ft c'",Mdtlillll ,htorent." ./fJr flJl,I",i",. ellU('tifJn.... tct X. Y. Z
be rc;., B.p3ces. We conidcr runcrions u: [0.. 7] ..... X. Let
W = {II: U e 1",,(0, T: X)t II' e L.(O. 7'; Z)} t
where u' denote.'i the generali7.ed derivative or II u(') on ]0. T[.
2J.13... (','''';IIlIiI )'. Shuw that
w s C([O. 1'], Z)
(74)
in the 'sc where tbe following cnndition5 are salilCd:
(i) The embedding X Z is continuous (c.g., X = Z).
(ii) () < l' < iYJ. I S 'It P S rtJ.
Solution: ()se thc5&1mc &lrf.!Umenll in ')robJem 2.1. JOe. ()hservc th:u ,, H'
imrli II. II' E L, (0. 1'; Z).
A usual.. relation f74) ;s 10 be understood in ,he lollowins sense. I' " €
then there exists it continuoUl function I': [0. 1'] ..... Z such th.t 1'(1) = ..(t ror
ahnC)1 allt e [0. 7-J.
2.ll Jh.. CfJmpt"."Jt... Show that the cmhcdding
W c. 1.,(0. .f: Y)
is competet in the (,1Ise where the rullowing (:nnditions arc mttislicd'
(it The cn1bcddin
XyZ
ilrc ce'nl;nuous. .tAd rhc cnlbcdding X y i comp:ecl.
(Ii) The spaccs X .Intl Itrc rencxivc.
(iii) 0 < l' < I),. I < p, 1/ < "',
Ilere. we equip 'he spu'e It' with the norm
1.,11... :: 1I"II",w. ",.." UII'H,o."o. 'r.lt.
Thi way W hccnmes a ."space.
I lint: ('r. I.inn (19()9. M ('hurter I. 5.2.
2114. 7'/r(t inl'.'('I;tJn un(1 ",(, .u"t tr R-s/'fl('(ls. l.et X. Y. and Z he II-!\pa...-es uver
k = M. ( "tach th.lt the embc:ddinl!
.\' Z
Ind
y l'
Refcren<.. to the Literature 451
are conrinuou, and X n Y;s dense in both X and t: Show thill:
(i) The set X n Y becomes a B-space oer K with the norm
max { II u n JC' Ii un)' ).
(ii) The t X + Y becomes a B-space over K with the norm
1Iwn = inf {Ilil. + Ivlr}t
w ....
where the infimum is taen over all the possible decompositions w =
" + f'. U EX.. v (" Y.
(iilJ (.-Y Y). = X. + Y*.
*(iv) (.X + Y). = X' " Y*.
}-lint Use the I I:thn Banach theorem. <''T. Gajewski, Groger.. and 7..acharias
(1974, Mt, p. 14.
'rhc t.Ucmcnts ahovc remain true if Z is a locaRy convex space.
References to the Literature
Calculus: ()unford and Schwartl (195K.. M). Vol. I.. Gajewski. Groger and 7.acharius
(1974, M). Rrc7is (1973, L).
Mndcrn stand;ud wurk on linear partial differential equations: Dautray and '..ions
tt9K4. Mt. Vol. I J (theory.aJ1plications. and numerical methods).
I-Iandhook or numerical mathemati(S: {:iarlet and Lions (1988. M), Vols. lIT (finite
clement method, dilTcrcn(, method..etc.).
Linear parabolic differential cquatioDI of mathematical physics: Babic (1967.. M, 8)
(general overview without proofs), ('ouranl and Hilbert (1953. Ma, Tychonov and
S.amarskii (1959. M). Garahedian (1964, M). Widder fJ97S, M). Carrier and Pearson
( J 976. M). John ( '982. M'.
I ntroduction to the functional analytic lrealmeat of JillCat parabolic equatjon:
Friedman t 1969. M)etheory ofscmigroups).lionsfl969., M)(Galeriin method), Triche1
( 1972, M)" Sh()wtI ter , 1977. M), Wloka (198 M Ladyienskaja ( 1985, M), Leis (19H6,
M).
Further important monographs: Friedman (1964. M) and Ladyt.enskaja (196', M)
(classical standurd works), Lions and Magcnes (1968. M) (interpolation theory and
paraholic equations). Vejvoda '1981, M). Faltorini (1983. M). Hormander (1983. M).
Difference mcrhod5: Smith (1965, M) (elementary introduClion Richtmyer and
Morton (1967. M Mcis and Marcowitl (1978. Mt. Marchuk and Sbaidurov (1983,
M t. ('iar1ct and Lion ( 19MB.. M), Vol. 2.
(ialerkin mClhod and finite clcmcn'5: ThomCe (1984, L) (recommended as an intro-
ductinn Glowin!\ki. Lion and TremolierC! (1976. Mt. Ciar1et (J 917. M). Fletcher
(1984, M (ia rlet and Lions (198K, M ), Vol. I.
Method of discreti7.ation in time: Reklorys (1982. M).
Multistep multiderivalive mcthool: Gekcler fl984. L).
Mullittrid methods: Hc'ckbuh 1985. M).
CHAPTER 24
Hilbert Space Methods and Linear
Hyperbolic Differential Equations
If you want to keep a matter secret, publish it.
Folclore
AI a talk which I gave at a celebration of the Iwenty..fifth anniversary or the
construction of \Ion Neumann.s computer in Princeton a few years ago, I Iud.
denly staned estimating silently in my mind how many theorems are published
yearly in mathematical journals. I made a quick mental calculation and came
up with a number like 100.000 per year. I mentioned this and my audience
gasped.
The next day, two of the younger mathematicians in the audience came to teU
me that, impressed by this enormous fiaUle t they undertook a more systematic
and detailed search in the Institute library.... Their estimate came to nearly
200 t OOO theorems a year. ·
Stanislaw Ulam ( t 976)
We may distinauish between a useful theorem, i.e., one for which an applica-
tion has been rou and a very userut theorem, i.e., one for which many
applications have bien found. . . .
The mathematical literature contains millions of theorems and very likely
most of them are useless.
Philip Davis and Reuben Hersch ( 198 J)
In the sense or"pubtish or perish'" many correct but mathematically uninterest-
ing papers are written. and have led to tbe Rood or mathematical publications
which makes it difficult for the mathematician to find the significant contribu-
tions for his work in the literature. By extrapolation, as I recently heard, it
rollows that in the year 2500 the length of bookrows in the libraries wilt increase
with overUght veJocity. I
Fric:drich HirJ.ebruch (1974)
1 Recently. it has been proved tballonl befoR the year 2500 the weipt of c he existing publica..
lions will be equal to the weight or tile earth. TocbIy we also have environmental pollulion caused
by publiCations. The f"oIlected m"b of David Hilbert (1932)consist of three voJumes olmoderate
Sil.c (without his book). Nowadays there exist maRY mathematicians and other scientists whose
452
24.'. Main Theorem on Second.Order Linrar Eyolution Equations
453
Jit 1956. after about one month of work, I w. ordered 10 write up my results
and as a consequence of which I swore to myself that I would choose a subject
for research where it would take at least five years before I had anything worth
writing about.
Kenneth G. Wilson (1982)
(Winner of the Nobel Prize in Physics in 1982 for his
fundamental work on critical phenomena in connection
with phase transitions, born in 1936)
If your work doesn't please the ex that is a dangerous sign_ If, il) addition,
it is praised by fools. then at should be forBOttcn.
Christian Fiirchtegott Gellert (1715-1769)
(Professor of Poetry and Humanities in Leipzig)
Even if we have, for many reasons, abandoned the eishteenth-century ideal of
universal knowledge. it is still appropriate to sometimes remember its advan-
tages. when we consider the typical academic life of today. In every developed
country there are hundreds of active mathematicians, out of which each one
masters only a very small (-orner of his science, which then. understandably,
appc&lr to him to outshine all othcn in importance. He publishe$ the fruits of
hi work in abrup single papers in several journals, often written in various
languages and scattered an over the world. The presentation or the material is
meant only for a few specialists and contains no hint of a connection with more
general questions. and is, thcrefore, ror colleagues of other interests. very difficult
to undcrs1and but for an even greater circle of mathematicians it is completely
Incdihlc.
Felix Klein (1926)
24.1. Main Theorem on Second-Order Linear
Evolution Equations and the Galerkin Method
We study the following initial value problem:
d 2
dt 2 1II(I)lv)H + a(u.').,,) = (b{t).v)v.
u(O) = "0 E .-: u'(O) = ". e H, (I b)
U E L 2 (O. T; V), u' e LzfO. T;m u" e L 2 (O, T; V*), (Ie)
where we assume that (I a) is valid for all v E Yand almost all t E ]0, T[.
To be precise. we assume that there exists a subset 2 or ]0. T[ of measure
zero such that (J a) i! valid for all v e V and all t e ]0, T[ - Z. Note that Z is
independent of r'. Furlhermore't d 2 /dt 2 in (Ia) means the generalized derivative
(I a)
publications in journal woutd fill many mole volumes. Allover the workt bureaucrats promote
thi tendency. The easiest way to estimate the yalue of a scientist is to count the number or his
puhhc3tt(Jn. Time . ripe to l'uJi the emergency brake. But this must be done by the scientists
themscl'tcs.
454
24. tiilbert SpIce Methods and Linear Hyperbola: DifTercnliJI Euation\
of real functions on ]0, T[, that is,
f: (u(t)lv)"fP-(t)dt + J: a(II(I).v)fP(t)dt
= f: (b(t). V),'fP(I)J, for all tp E ("I: CO. 'f).
We make the following assumptions:
(H I) .. V H V." is an evolution triple with dim V = oc. and 0 < 1" < (y.).
The spaces V and I' are real H.spaces.
let h e L 2 (O, T; H). Uo e V and "I e 11 be given.
(H2) The functional a: V x V.... R is bilinear, S}'mlnetr;(", bounded nnd
strongly positive.
(H3) WI' W1.... } is a basis in We are given sequences tullo) and (ifill) with
".0 "0 in V as n -. ,
".1 II, in H
as ',..... (f.J.
The UJlO and u". belooglo span (WI""" w.) for all n E N.
In order to formulate the Galerkin method. we set
.
U.(It = L C,.(I)",.
t&1
.
",.0 = L tlb W "
*=1
.
".. = L IJ,. W t .
, :z I
By definition. for almost all , e ]0, T[, the Galerkin equation." read as rollows:
..
L c;'(t)(w.lw J )" + c.u(t)u(w, , Mj) = (h(I), w j > v'
.al
(2a)
(j.(O) == ex., ((O) = /ljft.. j = I...., n. (2h}
Here, (2a) results formally from (I a), when one replaces u by (I" ..nd t' by ,,.
Equation (2b) results from the initial condition
".(0) = ".0, u(O) = U" I ·
Since WI' W Z , ... t w. are linearly independent, dct( (w.1 w j ),,) o. Therefore.
equation (2a) can be solved for the second derivatives file' and there arises a
linear system of second-order differential equations for c 1.., ..., (',.,."
Remark 14.1 (Justification We now show that the initial conditions (I b) arc
formulated in a meaningful way The triple .. V s; II s; V." is ilD evolution
triple. By Problem 24.1, H is dense in the H..space V.. Conscquen(IYt JI and
V. generate the evolution triple
"n C V. c H.."
24.1 Main 'rhcnrm on Secc)nd.( )rdcr Linear volulion F.quatinns
455
It f()lIow from the corresponding continuous embeddings that
1"2(0, T; II) c 1"2(0, T; V*) c: L 2 (09 T; H.).
Ry Prohlem 24.2, ,," = (II')'. Thus. condition (Ie) yields ,I' e L 2 (O. T; H) and
(lI'r E "2 (0, 7': II.); therefore
u' E W 2 1 (0.. T; I', V*)
hy the definition of this Soholc\I space in Scctio 23.6. The generali7.ed de.
riv"tivc Il' is determined only up to changt,-s on a set of measure zero on
[O T]. However. by Proposition 2J.23(ii) there exists 8. uniquely determined
rcprcsentttivc
u' E (([O. T) V*).
We use this representative in the initicll condition (Ib). Because of "1 E Hand
H V. the requirement u'tO) = ". makes sense..
Anulogously tu the ahovc from (Ie) there results
u e W21(O T; H
therefore.
u e C( [0, T]; H).
Becuuse of "0 E V,tnd J/ H the condition ufO) :; Uo ; mcaningful.
Theorem 24.A. U nJ(" ,h(' ",'i.""'p';()n... (H 1 (H2), (H3), the flJllo\\Jing hold:
(a) Existence and uniqueness. Th(! original prohlrm (I) has II unique ol",;(}" II.
(b ("ontinuous dependence on the datct. TIle mtlp
(u o , U I' h)....... (Uf)
i." (Ilin(!(lr ('ontinU(UfS nlap (r tIle form
v x H x L 2 (O, T; H) -+ Lz(O. T; V) x L 2 (O. T: H),
II't" i., ,11«"(' (/x;.I... (I C()II.ta'JI D > 0 .,,('I. 1M'
""U"1tO. 'r:t t + lIu'1I1.. 1 (0. T:Ht :S; D( IluoR., + lUaU" + Il b Il LJ (O.7:11))'
fct ("on\'crgcncc of the Galerkin method. For tIt.."" = I" 2...". .,he Gtllerkil1
equal iOll (:!) lid," a '1I1iqr,(J sol,,, if)1l U. ,hat .ulisJi.. (.()ndir ;011 ( I c). ""d
"" -- II ;" /"2(0 1 T: V) as n -+ OC-..
u _:. u' in J '2(0, 1; H) 4' n .... rYJ. .
(orollary 24.2. .4.-....."",e (.f ,) (H J). Tht'" the origilUJl pr()b/(lnr (I) ;S ('qtlivtllt',J'
If) 1111- ,/()/Io'M i"/ 0p«'rlilor ()(/Ut,';on:
""(lIt + Au(t) = h(t)
.(or almosf aI' t e ]0, Tr..
u(O) = U o ' u'(O) = ",,, (3)
u F t 1(0. 7"; V), u ' E L 2 (O. T: H), u" e L 2 (O, T; V.).
456
24. Hilbert SJ*Z Methods aad Linear Hyperbolic Differential Equation5
Hret the operator A: V -+ V. resul.s/rom
< Au, v)" = a(.., v)
for ail 14, v e V.
The operator A is linear, continuous, symmelric:, anti .trongly monototle.
Pk()OF ()I-' C()ROtLARY 24.2. The existence of A follows from Proposition 21.31.
By Proposition 23.20(b), equation (Ia) is equivalent to
(u"(t), v)., + <AIICIV>" = (h(t), v)Vt
for all L' E V and almost all t E JOt T[. This is equivalent to the operator
equation (3). 0
We deal with the proof of Theorem 24.A and of Coronary 24.3 below in
Section :!4.3.
We now consider the more gencr.d case that the bilinedr form a in (1)
dt:pends also l)R time f and satislies a Girding inequality, where the cor-
responding constants arc independent of t. To this end, we replace the assump-
tion (.-12) with (H2*).
(H2*) The map
, ....... a( U t .J; I)
i C I on [0, T) for all 14, v E v. "-or each I f:: [0, 'f], the 111ap a(... .; t):
V )/ . -4 1R i bilinear, symmetric.. and bounded. 'fherc arc constants
C'.. (' > U. and d > 0 such that I
td(U. " 1)( < ("UuUJ nvl",
cI(II," I) ? ('''"If - JUlin;,
for all II.. lJ E I { rO, J'].
('orollar,' 24.3. As.wn(1 (II I), (H2.J. (HJ, '/."l'II "lithe tlS.'\erlIOIlS fr '/"laeurenJ
:!4.:\ rellhlltl ,:allt/.
24.2. Applicati{)o to Hyperbolic
IJiffcrcnti,ll Equations
'.et Q i (;' It). .fi . We cunidcr the Initial boundary vcaluc probtcn\:
U,,'X../) AU(.t.l) 1ft, I) nn QT'
l4(X..I} = () on ((; x ]0, '1'[,
u( x, 0) = I'nf t lJn (i.
Iftfx,O) = III (.') un (j.
(4)
i '('linitj(Jn 24.". I.l (, he a hounded reguau in U., N , I. WI.:, ct
v = W 2 1 (i).
II :;.- II J (i ).
24.2. Applk1lti()n 10 Hyperbolic Difl'erentiul FAluslions
457
The uell(!rtdized problena corresponding to (4) reads as follows: Let f E L 2 (Q1'),
U O E V and U l E H be given. We seek a function u so that equation (1) holds
with
a(u, v) = r t DiuDjl,d:c,
J(i ::1
(b(I), v).. = r f(x,I)V(X)Jx,
JG
where x == (l t..., N) and D j = (J/(',. Recall that
(ulv)H = L u(x)v(x)clx.
The gcncrctlized problem and the corresponding Galcrkin method (2) are
motivated by Section 19.3. As basis elements {WI' w 2 ,...} in V;;;; W 2 1 {G), one
can choose the finite c=lcments or polynonlials given in A 2 (56} lhrough A 2 (60).
Proposition 24.5. f"or lhe generalized /,robleln ('orrl'..,[XJnding 10 (4), all the tJ.uer-
lions (r 'fhl'(,rem 24.A and Corollury 24.2 t,re saci.'!fled.. where IIbll'*Jto. fill. =
II J'lILl (Ur).
In purlicu/ar.. '''ere exists a unique sol,.t;on u oj' thl' yenerali=ed pr(Iblem If'
(4) .u(:h thaI
14 E Ll(O T: V),
M(Jreo(Jer, we obtain that
u' e L 2 (O, T; H)..
u" e L 2 tO, T; V.).
UUIl/.J(H.1'.J' + Uu/fl'.z(O.T;H) S; const(lIuollwl(G) + Ilu l l1'..zf(i) + II rH'l('hJ)'
PROO.... As in the proof of Proposition 23.28, it follows that all the assumptions
(11 t- (H3) of Theorem 24.A are satisfied. 0
Remark 24.6 (Energy). We equip the space J' = W 2 1 (G) with the equivalent
energetic scalar product (III V)E; = a(u, v), and we set
f;(/) = 2 . r f (D j U(X,I))2 + u,(x,I)2)dx.
J G ;-.
Then..
IIUIILo.;...) + /lu'lIt,o,T:II) = 2 f: E(I)dt.
In the special case N = 2, we may regard u(x, t) us the displacement of a
membrane at place x and time I (cr. Fig. 18.8). Then, equation (4) describes
the motion of the membrane under the influence of the '.outer (oree f."
Moreover, E(t) represents the tolal energy of the membrane at time t. From
Proposition 24.5 we obtain that
SoT E(I) dl const( 111.10 II l(G) + II ullll(G) + II flltJ(lr)'
45N
.!-*. tlilt.cU Sp.''-'C Mclhtld ond I.inc.., IIYI1Crhohc l)ifTc,cul...1 I:4ualinns
Thi shows cleilrly .ha ('ur .arprotlch c&tn be considered :as ,In effl"¥u('lic"
,"c" 'roll. )
Insh.".uJ or (4) we nuw consider Ihe nlure cncrill initial houndary value
rroblcn1 uf urtlcr 2", =
u" ( \. ,) + 1,,(, hl( ,. .1) r( Y. I) Olt Q T" /'
, /'
1)2 Ift\:. ,) = n on c"(; x 10. ""1 fur .,n fl1 1ft I.
(4*)
fit '", O} .- - lint '") ,'..1 (i.
11,( t. ()) =; U d.\") ()f1 (;,
,
w hcrl"
..
1.( I) U( '... ,):: L f -. 1 )Iilt 1>- h'cr(.' I) 1)# II( '... ,)).
1.11i'. fit
iA) I.l"t,n -= I.. 2, .... 'n"llc' (j he, ht)\1I1l1cd rcgit)n in R N , N I. AU the
(('cfTicicnl
I'
tI",. :'1 CI'a': Q" -f R
f I
care rncasllrilhlc &Ind hounded. For ca.ch t rO. 'I"I. I he diffcrcnli,.' t'rcr.lur
1,,(/) is rc(!uhtrly slr()n!t1y elliptic or 'rf)"'II.r ('/IiPI;(" of ()rtler 2,,, in (he sense or
I)cfinili,)n 12.42. :and I ,) is ....\"II'II,c'lr;(". i.c., (',,/1 Il piI ()n Qt. for ..II fl./I. Finilily.
f()r each \ e: (;. .11 the functiuns tHtl,.(\.') .Ire (" on r() "'].
l>efiai.htn 24.7. Assulnc fA),. ,and set
1/ "'2'"((;).
II :;: 12«;)'
'fhc Ilt"'L)rllli:c.,1 proh/(-n, 1(1 (4.) rcad U fnlh)ws. I£t .r e l.z(Q, ). "n E Y und
II t f " he given. We seck t functiun II slich (hut equation (I) holds. where
tI(lI. ,,;,) = f L ll#f/h.I)/,!D"IItIx.
J ft 1-1-""..
("'1), I')" = r fC.'C.I)"(.t)Jx.
J,;
fur all II.. ,. f" J". I ( 1 0 '1 'f I.
.
I)roposition 24.S. /1 ,fUJI(# (A). '/.'.('11,. ./c,r "1(' fl('fI('r,d.:(,.1 prfJ"'f'lJl 10 {4.). ..11 ,Iu'
(/'''c'rl if'''.'' 01 .,'I,,'ol.(',n 24.A 1.,,1,/ Ir,,(', W/I("(' Jib" I. 1 (U. T: If' ::' K." II "IC(I,..
I)IUMII-. (Isc Ihe (iiirdin inCttualiry from Seclion 22.1 S ctnd use Ixamplc 23.4
with respect 10 ". ."hen. the ar'ion fullows from ('orull.try 24.3. [J JI'
Rnnlrk 24.9 «;cncra Jil.cU ic.)ns). 1 Jsin the Oiirding inequality on t rongly
elliplic yslclns fr()ln I)rnblcm 22.7. nnc can i.nmcdiallcly aprfy Thcorcn1 24.A
24. \. 1"4"" of Ihe M..in 'rltCuFCln
459
tu sccnnJ.(')rdcr hypcrhnlic systc.ns. In this connect inn. ,Ipplications 10 cl,t()-
dyn:uni,-=s will he considcrd in Sction () 1.1 () of I)nrt I V.
.rhe in\ptu't:UIt class of synunctric hypcrbnlic first-order systems will he
studied in (hartcr XJ in connccliun with Mclxwcll's equittions in electro.
d yn:unics.
24.3. Pr()of of the M\in Theorem
-rhe fntlnwing prt)()r of Theorem 24.A is h.ascd on:
fi) 1 he intcgr:lt inn by 1':lrt r(}rn,ut:ts and
(ii) " ,.,;",; cslilnatcs for rhe St)IUlinn5 of the Gulcrkin equations by n1Citn or
the (,rnnwil1llcmnltl (IJror()sition 3.ln).
-rhe intcgriltiof1 hy parts rOflUuhlS arc obtained in a simple way for smooth
functiuns, Ilnwcvcr. note Ihul we .Ire dculing here with gcncrnlil.cd dcriv.ttivcs.
'rhcrrnrc. 1 he idcnt itic!\ in (i) require :. SpeC;iill justjlicl(ion which we take
care ()( by apprnxin1ating the vector-v.tlued functions and their gcncr&\lizcd
dcriv"1 ivcs hy )Iyn()mial. Ind follow 1 hi hy pnssing to the limit. In thi
C('.HlCclion. unc nlut alwuys Pity c.lrcful ilUcntion concerning which spu(.,s
1he arrrnximatiuns arc c:.rricd out in. ApparCnll) very short proofs ror
evolution J'rnhlms in Ihe li1cr:l1urc frequenlly huvc their origin in thc fact
thai only the rormttl ..rumcnts Ire given. ilnd their justification is Icft t()
the reader.
I nstcitd uf < .. · >. we write briefly < .. · ).
24.Ju. Pro()f ()r UI1iqucns
SUPI'K'SC Ihut ". &100 "1 are solutions of the original problem (I). Then the
diffcrclM.'C
" = 111 -"2
sc.risfic, cquati()n (I) with b(t) =. () and u(()) = ,,'(0) :::: O. By (3).
.." + Au = 0..
Rcmlrk 24.1 shows that u e W 1 1 (O. T: V, II). Integration by pnrl" (2J.25) yields
IIIItS)11 f, = 2 fS <r"(1 utI) > dl, (5)
Ju
for nit S E jO. .''[. J.or fixed S. we define
{ I for 0 I S S.
X{/) = 0
r()r S < I T.
460
24. HaJh..:rl Space Methuds ;and Uncar l'ypcrbolic Differential Equations
and
v(t) = - f Xs(s)u(s) ds + (', (6)
where the constant (' is chosen SO that v(T) = O. Then v(,) = 0 for t S. It
follows from u e L 2 (O. T; V) that XsU e L 2 (O, T; V). Problem 23.Sb yields
,
v = XsU.
Now, our uniqueness proof will be based on the following two formula:
(u'(T),v(T) - (,,'(O),v(O» = s: (u", v) + (u',v')dt (7)
f: (A.,'(t), v(,» dt O. (8)
The proofs of (7) and (8) will be given below.
It follows from (7) and u'(O) = veT) = 0, ,," + Au :::: 0, v' = Xsil that
t" - (Av'(l),o(l» + ("'(I), u(l» dl = 0,
B) (5) and (8), u(S) = O. Because S is arbitrary,
14(1) == O.
and
This shows the uniqueness of the solution.
I)roof of (7). As in Remark 24.1, we consider the evolulion triple
MH s; V. s H....
8ccause of the corresponding continuous embeddings, the embeddings
L 2 (O, T; II) c L 2 (O, T; V*) !; L 2 (O, T; H.)
are also continuous.
By (I cJ, u' e L 2 (O, T; H) and "" e L 2 (O, T; V*). According to Problem 24.3,
there exist polynomials P.= [0, T] -. H with
P. -+ u' in L 2 (Ot T; H) as n 00,
p.... u" in L 1 (O, T; V*) as n -+ 00.
Then the above-mentioned continuous embeddings also yield
P. -+ u' in L 2 (O, T; H) and p -+ UN in L 2 (O, T; H.) as n -+ 00 (9a)
as well as p" -+ u' in L 2 (O, T; V*) as n -+ 00. Now (9a) means thtt
p" -+ u' in W 1 1 (0, T; H, V*)
as 11.... 00.
By Proposition 23.23, the embedding W 2 1 (0, T; H, V.) £ C([O TJ, V*) is con-
tinuous. This yields further that
P. -+ u' in C«(O, T], V*)
as n -+ 00.
24..t I)ruuf of the Main Theorem
.
461
In particular, note that
P.. --+ u' and p -. II" in L 2 (O, T; V.) as n tYJ.. (9J
p,,(O) u'(O) and p.(T) -+ u'(T) in V. as" .... 00. (10)
Since U E L 2 (O" T; V), there exist polynomials q,,: [0, T] ..... V such that
q" -+ u in L2(O T; II)
Parallel to (6) we construct the function
V(l) = - f Xs(s)q.(.s) ds + t:.
as n -+ 00.
We then obtain v.. v in Cf[O, T], V) as n -+ (".1.1, by the Holder inequality.
Moreover, v :: - Xsq.. Thus, we get
l',,-+V and vt" inL 2 (O,T;V) as 1J-+r:t. I , (It)
lJII(O) v(O) and v,,(1.) -+ v(T) in V as n -+ . (12)
F ron) this we obtain that, as n (.f.J,
L' (p,v.) + <p.,v)dt - for (u",v) + (u',V')dl,
( 13)
<p,,(T vlI(T) - (p,,(O), v,,(O) <u'(T)v(T» - (u'(O), v(O).
The first integrdI relation of(13) follows from (9), (II) and Proposition 23.9.
According to the classical fonnula for integration by parls.. the left-hand sides
in (13) arc equal. This implies (7). .
l)ruur of (K): Clasical formulas yield
i s i s d
2<Av(I), ('.(I}) dt = d - (A"a(l). vlI(t)dl
00 1
= -- (A l',.(O). V,.(O) > o.
( 14)
To prove this.. observc that VII is a polynonlial and nOlc the ."y,nmetry and
positiveness of A: V ..... V. as well as V,.(S} = O. In fact,
d
I < AV..(I), 11,,(') = <AV(I)t 0.(1) + <.4vft(t "(l»
( I
= 2 < Av(l), (',,(t).
"'rom (II) and the continuity of A: V --t V. it follows that
v" -.. " in I, 2(0, 1'; V) as IZ -+ 00,
Av--..4v' in L 2 (O,T;V.J 'as "-'V,J.
We thus ohtain (8) by means of passing to the limit in (14) as n ..... CXJ.
4(.2
14. llilhrrl SI';tCC Mctht l d, and l..incar lIyperholic l>ifcrcnti(III;(jutiuns
24..3h. IJr()()r of Existence
We cll!rcc to lIse the tahhrcviation
( II I f') = (" I ")" . I "I = flllll".
(Ii, ,,) :: (,, ")JO, I"n :: ""Ih l .
and recall via Secti()n 23.4 the cuntinuily of the emhcdding. 1/ II <;; V. with
fll119) =- (Jr. ,,) for all ,. e II. It E V,
I ,.t '.( .111'11 for &111 ,. E V.
( 15a)
where (' i n constunl. II thus rollow that
I("'.,')} = 1(".lr)1 (2I1M'1I tlrn
r()r all w. " e
( I Sb)
Sh'I' 1: ,.,1 ,.rior; ctimaIC for the oluti()ns of the Galcrkin cquutit)ns by
mctns of the <.ronwalllcmrna.
Because t)f h € I.. !«()" "': II ,and
1 <b(l), ")1 1 = 1(/J(')I,.)lz (.fh(l)l l lI"1I2,
the rc.,' function ,.-. (h(t). ,,) belongs to. the space 1.1)(0. "f) for all .. E For
the sukc or implicity. we first assume thut
,...... <hI) ,') is continuous on [0. T J for all r, Ell.
Then the (inlcrkin cquution (2) posscs a unique classical olution "11 E
("1 [0, 7.]. It imm,,-dhatcly follows rrt)m (2a) ufK)n multiI1Jicati()n by (';" und
suhsequent summati()n over j that
,I
I (l1I('tIZ + atu.(I).II..(I))) == 2<b('),I1('»,
t I
(16)
In this connect inn. note the sJ'n"l(t'". of u. i.e.. ,'(W i . Wj) = a(w j . "i) for all j, k.
I""rl hcrn10rc,. hy (I Su we get
21<h(I).I'('»1 s 2fb(')II"")1
S (lh(I)1 1 + tu(')12).
Moreover. b}'ussumption ("2).
('11 "'" 2 S a( ,v, M'),
1"( M" (7)1 < J II ".11 II r II.
( 11)
(Ir aliI'. M' e V with pusit i ve c('nslants (". ,I. Thcref()re, integrat ion ()f ( 16) over
[(). I] yields
lu{lW + dlu.f')f (l1I(OW + dllll..(O)11 1 + f' Ih(,")1 2 ds)
J..
+ f ' 1"(s)12 + «"llII.(S) II 2 ds. (8)
u
4.3 Pn)f of the M.tln '1 hc,)rm
463
By the (JronwaJllcmma (I)ropoilinn 3.10), there fullows rrom (I H) that
lu(l)ll + cllu"ltUl l S t:unsUIII(OHl t l!u,,(O)lI l + [' Ih{.)ll,I.'I.
Ju
Clnd hence, Cl)f "U ", we obt,.in the decisive II prior; (1.'.1 i",Ule:
II u II ;..1(0:,.;11» + f 1114,.lIllO. 'I'i t') $ onst (I ,,(())ll + n u..«})l1 2 + H h II il.u. ,..,,.),
( 19)
by using integration over [0, '1'].
Sit'!» 2: Weak convergence of a subsc4ucncc uf the (iulcrkin scucnc.: (Ii.).
It follows from (19) that (u..) (rp. (u)) IS boundccJ in the II-sp.tCC 1..!fO, "[t: V)
trcsp. L1(O, T II)). 'rhus there exist subscqucncct again denuted by filII) and
('I such r hat
u" u in 1.. 2 (0, J'; V)
u - t' in L 2 (O. '/.; II)
fly I)r)pt)sic ion 23, t 9.
,
U = I;.
Noting that
as ,,-. .1:"
(20)
as "... w.
lJult S liln lIu,,1I
(n:s p . lIu'lI < IIUII)
in J.{(). 'J'; V) rcsp. 1.'2(U, 'f; II)), \\c uhh,in rrun1 (19) thai. as " -+ II.
1I1I'1I:. 1f u.I."..... (';luUt,tO.T.. < onst(luI12 f-lluoU z + Il b tl;'14u:ri''').
.I(./) 3: We show that u is a solution of the original equation t J).
J'roo{ tr ( ta). It f()Uows froln the Galerk in ctju,ation (2a), after n1ultiplicution
by ." and subsequent intcgrttiun by p,lrb twice, (hitt
"r i f
J fP"(u"I\Vj) + tit".. Uj)dl = (h. '''j) Jt
u u
where «P. ",., anll h lIcpcntJ on I. As " -. I..J, this yields for all j thitt
I ' I I'
(P" (Ill "j) + u(u, "j' dl ;: (h, 'V J ) (I,
It n
This limit will be justified below.
Since spun {"'J . "'2' . . . J is dense in II. equatiun (22) implies that
J I C1'
o tp"lulv) + ulu.,J)dt = Jo (h.v)dl
.. ..,
"fhis yields 141). that is,
d l
til z (u(I)lv) + ('(14(1), v) = (b(l), (')
for itll tp E (t.: (0. 'r. (21)
fur "II qJ t (.t (0, #1). (22)
for all ({J e Ct.; (0. J.),
l't:V.
(23)
for all I' e
464
24 1llhcrl Space Methods and Linear Hyperbolic Differential Rquations
according to the definition of generalized derivatives. By Corollary 24.2,
u" + Au = h.
In order to jutify the passage to the limit as n 00 in (21), we note that
UII- u in l 2 (0, T; V) as II -t 00. Thus it is sufficient to prove that all ex ptessions
in (2 J ) arc linear continuous runctional on X ::: Lz(O, T; V with respect to
"II. This follows from (1 S) and the Holder inequality. In fact. we get, for
examplc"
J l' i T
(P"(I)(u"U)lw J ) tit const lIu,,(I}1I "w)lt d, < constnu"UxU"'JIl.
c. 0
For the terms with a(. to), use (17). Equation (23) is obtained (rom (22) in the
same way.
PrOfl.r 0" (I c). We have to show that utI e Lz(Ot T; Y.)o It fol(ow from U E
12(O, #r V) and the boundedncss or A: V ..... V. that
Au E Lz{O. T; V.).
Because h E 1.'2(0.. 1" H), and because of the continuous embedding H c V.,
we u]so ha vc h E I J 2 CO, 'f; V.). f'inaUy, from
u H + Au = h
we get II" E: L 2 (O. T; V.).
Proof fr (I b). In order to show that
u(O) = 1'o,
u' (0) U I ,
we choo a function
1/1 E C l [0. 7 0 ]
with
"'(1") = 0,
;(0) = 1.
Anulof!uusly to (7) with .,(t) = "'(I)W J , there results from integration by parts
that
fJ'
< II(O), '" (0) w) = - J 0 <..: , !/I.j) + < .., .f Wj) J"
(14'(0). !/I(O)j> = - IT (u", "'WJ > + (u',!/I' w J ) dt.
The Galcrkin equation (2) and u" + Au = h yield
(u;. "'j) = < - Au" + h, t/Ii>t
< u", '" w J ) = < - Au + h, t/1 Mj > 0
It r()lIows rrom the convergence statement (20) that the right members of (24)
tcnd 10 one another as n .... 00. This passage to the limit can be justified as for
(21). Notc thaI, by (' 5 the rjht members of (24) are linear continuous
runctinnals on L 2 .(O. 1'; V) (rcspo l"2(O, T; H)) with respect to Uft (resp. u)o Thus,
(24)
Problems
465
we obtain from (24) that
(14;(0), ) -.. (11'(0), WJ) as n -. 00 for allj.
By assumption (H3 u(O) -+ u. holds in Has" .... 00, therefore also in V..
This yieids
(lit, ) = (u'(O),) for all j.
Since span { WI' Wz t . . . } is dense in V, we get
.., = u'(O).
Analogously, we obtain that
"0 = u(O
by considering the following integration by parts formula that Collows from
(23.25):
(u..(O),,,,(O)WJ) = - f: (II;. "'WJ) + (""Wi' u .) dt.
(u(O)I,,(O)) = - f: <"'.t/1) + <t/1'wJ. u }dt.
Step 4: Weak convergence of the total sequence (u.) of the Galerkin
approximations.
We saw in Step 2 that all subsequences of (u.) that converge weakly in
L1(O, T; V) possess as limit element the uniquely determined solution u of
equation (1). It then rollow according to Proposition 2113(i), that the total
sequence (II.) converges weakly to II in Ll(O, T; Y), and (u;) converges weakly
to u' in L 2 (O, T; H).
III
Step 5: The general case b e L 2 (O, T; H).
If t t-+ (b(t), v) is not assumed to be continuous on [0, T] as in Step I, then
one has to apply tbe Caratheodory theorem A:z(61) to solve the Galerkin
equations (cf. Problem 30.3). For this purpose, one writes equation (2) as a
first-order 5ystem of ordinary differential equations. Then. analogous to the
above, one can infer taking into account the lessened regularity of u.. Now
the components c.Jt.) of u. do not lie in C 2 [0. T] but rather in Wl(O, T).
Therefore, U. E L 2 (O, T; V) and u E L 2 (O, T; H).
The proof of Theorem 24.A is complete. 0
PRC)()f ()f C()R()LI..AItY 24.3. The proof proceeds analogously to the pr()()f of
Theorem 24.A (cr. Problem 24.4). 0
PR()BI.I:MS
24.1. E.,olution triple. Show that in an evolution triple .. V S II c V..." the space II is
dense: In V..
466
24. Hilhert Space Mcth()d and Linear Hyrcrbo1ic; Dill'crentiaJ quatiuns
Solution; This foJJows immediately from statement (c) in Section 23.4. A direct
rroor goes as follows.
I r 'I i not dense in V.. ,hen there exits a point ,,. e V. with
v. ii.
therefore, IJ. o. By the ffahn -.Banach theorem, one can construct a functional
Jo' E V.. with
(/:,h).,. = 0
for all " e 'I
and
(F.v.)r. = 1.
The B-space Y is ret1exive, therefore V :: V... Thus we get f' V and
(h, ,..) = (III F)H = 0
for all " Ell.
This irnplies F = () in contntdiction to (v., "').. =: J.
24.2. II,ral('J tlf.lri"t't"'('S (II')' = u lf . Show Ihat if u' and hl r exi5t as generalrl.ed
dcri..tivcs no lO. rr[, then Ii" it'n exists and u" = fu')'.
Solution: I t follows rrom
1 1 q/u'lI, = -. [1 1/1'"')' d,
(I In
for a II " f: C f ; (0, 'I' J
anti
f 1 ""II Jt = -- J 1 "'"' J,
J() u
for all E C n to. ',..
Ih.u
J 1 J. 1'
(pH U JI w:: 'Ph" r J,
o II
hy ning t/J = tp'. Thi meal1 u. = (II')'.
('unvcrscl). thc..elistcncc of (u')' ulong with (u')' == u H follows from the exis-
tence of I" and ,,".
ror all (/) e C'(O, T)
24.3. A C/('It;I)' tlw",tm. I,cl .. V II 1/... be un evolution triple. and let
"e 1.,,(0, 7": V)..
II' E t,,(O, .,.: II)
with I /' < I,' clnd 0 < 1" < f'¥J.
Shuw Ih.11 there cists a sequence (1',,) of functions rrom (.'1[0. 'f], V such
that
P.. .... " in I..,,(O .,.: V)
p -... u' in /',(0, 1" II)
85 ".... rt'J,
us 11 IIJ.
Since the set of polyn()miaI5 i dense in C1([O. 'f]. V), one may assume that all .
p,,: In. 'r] -. r arc f\olynomi..ls with cocffi",;ents in V.
Ili.lt: lJsc the n'oolhing opcralor in Section 18.14.
24.4. I'rof,r or ('ortJlltlr.\' 24.1
I lint: \ Jsc the same arguments a in the pruof of Theorem 24.A. ('r. Lion"
(I-}tdt. M). ('hapler 4. Section I.
Ref'ercnces 10 the literature
467
References to the Literature
Linear hyperbolic differential equations in mathematical physics: Babi (1967, M. B)
(geneT.. I !lurvey without rroofs). Courant and ftithert (1953 9 M), Tychono, and
Samarkii (1959, M), Gan&hedian (1964, M). Friedlander (1976 J M), John (1982. M).
Sakamoto (1982, M). Gunther f1988. M) (Huygens' principle)..
Introduction to the functional analytic treatment or hyperbolic dilrerential equa-
tions: lions (19M. M) (Galcrkin method). Triche I (1972. M Wloka (1982, M).
FaUorini (1983. M). ladYlkaja (1985. M), Leis (J9R6. M).
....urther important n10nographs: Sobolev (1950). LadYl.enskaja (1953). Lions and
Ma,cnes f 1 96R), Hc>rmander (1983). Smoller (1983).
Standard work on linear rartial differential equations: Oautray and Lions (1984.
M Vois. I - 3 (theory, applications. and numeriatl method).
Handbook of numerical analysis: Ciarfet and lioM (1988. M Vols. fff (finite
clemen. method" difference method, etc.).